Artificial Intelligence Innovation and Development Pilot Zones and Green Total Factor Productivity of the Logistics Industry: An Empirical Analysis Based on Double Machine Learning

Ma, Yonggang; Zang, Jiagen

doi:10.3390/su18063092

Open AccessArticle

Artificial Intelligence Innovation and Development Pilot Zones and Green Total Factor Productivity of the Logistics Industry: An Empirical Analysis Based on Double Machine Learning

by

Yonggang Ma

¹

and

Jiagen Zang

^2,*

¹

School of Business Administration, Lanzhou University of Finance and Economics, Lanzhou 730020, China

²

School of Information Engineering and Artificial Intelligence, Lanzhou University of Finance and Economics, Lanzhou 730020, China

^*

Author to whom correspondence should be addressed.

Sustainability 2026, 18(6), 3092; https://doi.org/10.3390/su18063092

Submission received: 11 February 2026 / Revised: 10 March 2026 / Accepted: 19 March 2026 / Published: 21 March 2026

(This article belongs to the Special Issue Logistics for a Resilient Future: Sustainable Approaches to Supply Chain Management)

Download

Browse Figures

Versions Notes

Abstract

Although digital economic development is often viewed as a catalyst for green transformation, the causal implications of policy-driven AI deployment for low-carbon logistics development remain unclear. To address this gap, this study leverages China’s National New Generation Artificial Intelligence Innovation Development Pilot Zones (AIIDPZs) as a quasi-natural experiment. Using panel data from 30 provincial regions from 2012 to 2022, this research employs a double machine learning framework to rigorously quantify the AIIDPZ policy’s causal effects on the logistics industry’s green total factor productivity (GTFP). We further examine underlying transmission mechanisms and spatial spillover effects. Results show that the AIIDPZ policy significantly enhances logistics GTFP, a finding robust to parallel trend tests, sample adjustments, and algorithm substitutions. Mechanism analysis reveals that the AIIDPZ policy promotes logistics GTFP by alleviating manufacturing agglomeration and collaborative agglomeration. This occurs mainly through the mitigation of environmental externalities and the easing of inter-sectoral resource competition. Heterogeneity analysis highlights substantial regional variation: the policy impact is strongest in East China, Central China, and Southwest China; positive but weaker in Northeast and Northwest China; and statistically insignificant in North and South China. Spatial econometric results confirm significant positive spillovers to neighboring regions. Temporally, the logistics industry’s GTFP shows a sustained upward trajectory, while spatially it follows a spatial pattern of “Eastern leadership, Central rise, and Western catch-up.” Robust empirical evidence is presented to evaluate the environmental outcomes of AI policy implementation, alongside policy-relevant insights for advancing coordinated and spatially differentiated regional development.

Keywords:

artificial intelligence policy; logistics industry; green total factor productivity; double machine learning; quasi-natural experiment

1. Introduction

The Fourth Industrial Revolution has emerged from the deep integration of intelligent technologies, data-driven systems, and ubiquitous connectivity, fundamentally altering global industrial structures and economic organization [1]. According to a 2023 report by the International Data Corporation (IDC) [2], the global AI market is projected to exceed $200 billion by 2025, with China’s market expected to reach $30 billion, solidifying its role as a major force in global AI development. In response, major economies have elevated AI to a core national strategic priority. Notable policy efforts include a U.S.-led national strategy for advancing AI research and development [3] and a coordinated AI policy framework implemented by the European Union [4], aimed at reinforcing technological leadership and long-term competitive advantages.

In an effort to embed artificial intelligence more deeply into real-sector activities, a national policy framework for AI innovation pilot zones was introduced in 2019 by China’s central science and technology authority, officially initiating the Artificial Intelligence Innovation and Development Pilot Zones (AIIDPZ) program [5]. By 2023, this initiative had expanded to 16 provinces, forming a nationwide pilot network. As a foundational sector of the national economy, the low-carbon transformation of the logistics industry plays a critical role in achieving China’s climate mitigation and carbon neutrality objectives [6]. Data show that China’s total social logistics costs accounted for 14.7% of its GDP in 2022 [7], while carbon emissions from transportation constituted approximately 10% of the national total [8]. Therefore, promoting energy conservation and emission reduction in logistics is a key step toward the “Dual Carbon” targets. In this context, Green Total Factor Productivity (GTFP) serves as a key indicator for assessing high-quality development in the logistics industry, comprehensively reflecting the efficiency with which economic output is balanced with environmental impacts [9]. The AIIDPZ policy can be viewed as an exogenous institutional shock that facilitates intelligent dispatching, smart warehousing, and digital supply chain integration, thereby reducing empty-load rates, improving energy efficiency, and lowering carbon emissions [10].

The digital economy is often characterized as a catalyst for green transformation. However, the causal impact of policy-driven AI deployment on the low-carbon development of the logistics industry remains ambiguous. Furthermore, its underlying mechanisms and spatial effects have been systematically underexplored. In this context, this study leverages China’s AIIDPZ policy as a quasi-natural experiment. Utilizing provincial panel data from 2012 to 2022, we employ a Double Machine Learning (DML) framework to examine the causal effects of this policy on the logistics industry’s GTFP. We analyze its heterogeneous impacts across different regions and investigate how AI enhances logistics GTFP by mitigating the degrees of manufacturing agglomeration and collaborative agglomeration, alleviating environmental externalities and resource competition. Furthermore, by integrating spatial econometric models, this study reveals the spatial spillover effects and spatiotemporal evolution characteristics of logistics GTFP. Collectively, our findings provide theoretical and empirical evidence for understanding how AI empowers the green development of the logistics sector.

2. Literature Review

The first category of literature focuses on the factors influencing logistics efficiency. Early studies primarily examined traditional drivers of the logistics industry’s Total Factor Productivity (TFP), including foreign direct investment [11], infrastructure investment [12], retail trade development [13], and levels of informatization [14]. With the rollout of the “Dual Carbon” objectives and the increasing focus on environmentally sustainable, low-emission development, this research focus has gradually shifted from conventional logistics industry’s TFP to GTFP, explicitly incorporating environmental constraints. For instance, Liu H. et al. identified a non-linear threshold effect between urbanization and the GTFP of the logistics industry [15]. Liu W demonstrated that digital technologies simultaneously enhance logistics operational efficiency and environmental performance [16]. Using the GDIM decomposition approach, Liang Y. et al. identified energy structure optimization and carbon regulation intensity as critical drivers of logistics industry’s GTFP [17]. Furthermore, research on digital infrastructure and automation provides important insights for understanding this issue. McElheran et al. indicate that the significant increase in U.S. productivity signals a transition of artificial intelligence from the “investment phase” to the “harvest phase.” Firms that adopt AI to augment—rather than replace—human labor achieve dual growth in output and employment, offering a viable path for technology-enabled economic development [18]. Meanwhile, Acemoglu and Restrepo highlight from an automation perspective that pursuing labor substitution alone, without enhancing human capabilities, may lead to resource misallocation and social welfare losses [19]. These insights are particularly relevant to the labor-intensive logistics industry, which is undergoing rapid intelligent transformation, and provide a theoretical foundation for understanding how AI policies influence its green transition. However, these analyses typically focus on single-factor mechanisms and have not yet systematically examined the overall impact of institutional policy shocks on the logistics industry’s GTFP.

The second category concentrates on analyzing the effectiveness of artificial intelligence innovation and development. Theoretically, AI is widely regarded as a General-Purpose Technology (GPT) with pervasive effects on productivity across sectors [20]. Empirical evidence has documented these effects at both the microeconomic and macroeconomic levels. At the micro level, AI adoption has been shown to facilitate high-quality firm development [21] and significantly enhance Total Factor Productivity [22]. At the macro level, AI development contributes to regional innovation capacity [23], reshapes urban industrial structure evolution [24], and supports the formation of new quality productivity in agriculture [25]. Furthermore, some studies have begun to explore the environmental and green effects of AI. For example, using a Difference-in-Differences approach, Zhang L and Zhou B found that AI development promotes green finance by improving energy efficiency [26]. Similarly, Dong X. et al. confirmed that AI significantly enhances both the quantity and quality of corporate green innovation [27]. Furthermore, advances in spatial policy evaluation methods offer important methodological insights for identifying AI policy effects. Anselin et al. systematically explored spatial econometric model specifications, demonstrating the advantages of instrumental variable or generalized method of moments (IV-GMM) over traditional maximum likelihood estimation in addressing spatial dependence [28]. Fratesi et al. emphasized the critical role of spatial spillovers in small-scale spatial analysis, highlighting that policy evaluation must account for interregional interdependencies [29]. While emerging studies provide preliminary evidence on the green effects of AI, they predominantly rely on conventional policy evaluation methods, offering limited control over policy endogeneity and high-dimensional confounders. Moreover, these studies have not yet extended their analysis to the logistics sector—a key carbon-emitting industry—and largely overlook spatial spillover effects.

Despite these insights, important gaps remain. First, while existing studies have extensively examined AI’s economic and innovative impacts or focused on individual drivers of logistics GTFP, little research systematically integrates dedicated AI policies—such as the AIIDPZ—with logistics GTFP within a unified causal inference framework. Consequently, whether and how AI policies promote low-carbon logistics development remains largely unidentified. Second, methodologically, most empirical studies rely on conventional approaches such as difference-in-differences or static panel models, which are inherently limited in addressing policy endogeneity (e.g., self-selection bias, omitted variables) and high-dimensional confounders [30], and they commonly overlook spatial spillover effects. Although spatial econometric methods have advanced, their integration with cutting-edge causal inference techniques to evaluate the green effects of AI policies remains scarce. Third, analytically, many evaluations remain a “black box,” offering limited insight into the transmission mechanisms, spatiotemporal dynamics, or regional heterogeneity of policy effects. How AI policies reshape industrial spatial structures—such as manufacturing agglomeration and collaborative agglomeration—to indirectly affect logistics GTFP, and whether these effects vary systematically across regions, are questions that have yet to be systematically answered.

This study addresses these gaps through three key innovations. First, it integrates AI policy and logistics GTFP into a unified causal framework, using China’s AIIDPZ policy as a quasi-natural experiment to rigorously identify the causal effects on the green transformation of the logistics industry. This extends the scope of AI policy evaluation and enriches the analysis of logistics GTFP determinants. Second, methodologically, it combines DML with spatial econometric modeling to effectively control for high-dimensional confounders and endogeneity while quantifying spatial spillovers and spatiotemporal patterns. DML’s orthogonalization and cross-fitting techniques alleviate model specification bias inherent in traditional methods, and the inclusion of spatial econometrics remedies the neglect of interregional interdependence—thereby achieving methodological integration and innovation. Third, in terms of analytical depth, grounded in Regional Innovation System theory, this study empirically tests the mechanism roles of manufacturing agglomeration and collaborative agglomeration, uncovering the indirect pathways through which AI policy influences logistics GTFP. Moreover, through heterogeneity analysis and spatiotemporal characterization, it systematically reveals how policy effects differ across regions and over time. Together, these elements form an integrated analytical framework—encompassing causal identification, mechanism testing, and spatial analysis—that systematically illuminates how AI policies can empower the green development of the logistics sector. The findings provide theoretical and empirical support for designing policies that harmonize economic growth with environmental sustainability.

3. Theoretical Analysis and Research Hypotheses

3.1. Overall Effects Analysis

This study develops an integrated theoretical framework grounded in Regional Innovation System theory and Endogenous Growth theory to elucidate the mechanisms through which the AIIDPZ policy affects the GTFP of the logistics industry. Regional Innovation System theory emphasizes that targeted institutional arrangements and policy designs can facilitate the diffusion and application of knowledge and technology among regional firms by fostering geographically proximate innovation networks [31,32]. As an institutional vehicle for building regional AI innovation ecosystems in China, the AIIDPZ systematically reduces barriers to local AI adoption and collaborative innovation costs through concentrated policy support, dedicated infrastructure, and real-world application environments. In the technology-intensive logistics industry, which exhibits pronounced network effects, the innovation ecosystem fostered by the AIIDPZ can accelerate the agglomeration, experimentation, and diffusion of AI solutions within the sector. This process promotes the green transformation of the logistics industry while strengthening the innovation base for sustained productivity improvement. Endogenous Growth theory posits that fundamental technological innovations—especially those that permeate and reshape production processes—constitute the primary drivers of sustained economic growth and efficiency gains [33]. Functioning as a general-purpose innovation, AI’s extensive integration within logistics has caused a “green technology shock” across the sector’s value chain. By incorporating energy-related inputs and environmental externalities as undesirable outputs, GTFP provides a robust measure of the logistics sector’s progress in green transformation while accounting for economic expansion. Existing studies indicate that AI technologies can generate substantial energy savings and emission reductions in logistics activities—such as warehousing, transportation, and distribution—through optimized decision-making and enhanced automation [34]. Accordingly, AI policy is expected to enhance the logistics industry’s GTFP primarily through direct technological empowerment. This leads to the research hypothesis:

H1.

The AIIDPZ policy significantly enhances the GTFP of the logistics industry.

3.2. Mechanism Analysis

According to agglomeration economics theory, industrial spatial structure affects economic efficiency primarily through two forms: manufacturing agglomeration and collaborative agglomeration. These two agglomeration patterns entail distinct economic and environmental consequences, whose combined effects constitute a critical mediating channel through which the AIIDPZ policy influences the GTFP of the logistics industry. However, the relationship between agglomeration and GTFP is not monotonic—it depends on the balance between positive externalities (knowledge spillovers, labor pooling, input sharing) and negative externalities (congestion, pollution, factor price inflation) [35,36]. The following sections will analyze these two pathways in detail.

First, the manufacturing agglomeration pathway. Agglomeration economics theory posits that the geographic concentration of industries can generate economies of scale through shared infrastructure, pooled labor markets, and access to intermediate inputs [37]. However, excessive agglomeration may also give rise to agglomeration diseconomies—including traffic congestion, escalating land prices, and intensified environmental pollution—that can offset efficiency gains [38]. From the perspective of green logistics, the high spatial concentration of manufacturing activities is likely to induce large-scale, high-frequency demand for the transportation of raw materials and finished goods. This process significantly increases regional road freight pressure, energy consumption, and associated carbon emissions [39]. When the expansion of manufacturing is dominated by energy-intensive and pollution-intensive activities, the resulting congestion effects and environmental negative externalities may outweigh the efficiency gains typically associated with industrial agglomeration, thereby constraining the green transformation of regional logistics systems. In this context, AI-related policies may play a mitigating role by steering industrial upgrading, optimizing the spatial distribution of manufacturing activities, or encouraging the development of greener production modes. On one hand, these policies promote industrial upgrading and the diffusion of green production technologies, encouraging energy-intensive manufacturing firms to relocate from congested urban centers to peripheral areas. On the other hand, by fostering AI-enabled service industries, these policies reallocate factor inputs—capital, talent, and land—from traditional manufacturing toward emerging sectors, thereby reducing excessive manufacturing concentration. This process alleviates the environmental externalities imposed on the logistics sector, creating favorable conditions for GTFP improvement. Consequently, the policy is expected to enhance logistics GTFP by mitigating the adverse effects associated with excessive manufacturing agglomeration. This leads to the following hypothesis:

H2.

The AIIDPZ policy can promote improvements in the GTFP of the logistics industry by mitigating the negative environmental externalities associated with excessive manufacturing agglomeration.

Second, the collaborative agglomeration pathway. From a theoretical perspective, collaborative agglomeration—the spatial co-location of AI-related industries and the logistics sector—can promote knowledge exchange, deepen the division of labor, and facilitate collaborative innovation through Porter-type externalities [40]. However, this positive effect is conditional on the presence of effective coordination mechanisms and complementary institutional arrangements. During the stages of policy rollout, in the absence of effective institutional coordination mechanisms, the two sectors may compete for critical resources such as land, credit, energy, and government subsidies. This competition may generate a “resource crowding-out effect,” thereby constraining cross-sectoral collaboration [41]. In particular, the AI industry—being a capital-intensive and technology-intensive emerging sector with high expected returns—may divert financial resources away from traditional productive sectors such as logistics. This diversion may crowd out the financial resources required for green equipment upgrading and technological transformation in the logistics industry [42]. Such competition for limited resources may erode the technological spillovers and collaborative advantages that spatial proximity would otherwise generate, thereby partially offsetting the direct positive effects of AI policies on green logistics development. Under these conditions, the effect of collaborative agglomeration on logistics GTFP may be negative, as coordination costs and resource competition outweigh knowledge spillover benefits. By dampening the disorderly co-agglomeration, AI-related policies help avoid resource crowding-out effects arising from congestion, competition, and coordination failures—thereby mitigating the negative externalities of agglomeration on collaborative clustering. This, in turn, strengthens intersectoral linkages and ultimately contributes to improvements in the logistics industry’s GTFP. Accordingly, the policy is expected to enhance logistics GTFP by alleviating the adverse effects associated with disorderly collaborative agglomeration. This leads to the following hypothesis:

H3.

The AIIDPZ policy can promote improvements in the GTFP of the logistics industry by alleviating the resource crowding-out effects arising from disorderly collaborative agglomeration.

The theoretical framework of this study is shown in Figure 1.

4. Methodology

4.1. Model

A rigorous causal assessment of the AIIDPZ policy’s impact on logistics GTFP must address selection bias arising from the non-random placement of pilot zones. These zones were designated by the central government in batches based on national strategic priorities in China—emphasizing regional AI innovation capacity, digital infrastructure, and technological ecosystems—rather than logistics-sector green productivity. Thus, conditional on observable characteristics, the policy can be regarded as plausibly exogenous to pre-existing GTFP trends. Furthermore, the designation of AIIDPZ pilot zones was determined by the central government through a top-down policy process primarily targeting the promotion of artificial intelligence innovation ecosystems and digital infrastructure development. The selection criteria focused on technological capability, industrial innovation capacity, and regional strategic positioning, rather than the environmental performance or green productivity of the logistics sector. Therefore, the policy assignment is unlikely to be systematically correlated with pre-existing trends in logistics GTFP, reinforcing the plausibility of the exogeneity assumption in our empirical framework.

However, identification faces two key challenges. First, pilot zone assignment correlates with factors such as economic development, technological endowments, industrial structure, and environmental regulation, leading to potential selection bias. Second, these confounders are high-dimensional and may exhibit nonlinear relationships with GTFP, rendering traditional parametric approaches potentially misspecified. Conventional methods—Difference-in-Differences (DID), synthetic control, and propensity score matching—face limitations in this context. DID relies on strict parallel trends under low-dimensional linear controls; synthetic control is sensitive to donor pool selection and data availability; and propensity score matching involves subjective covariate specification and cannot adequately address high-dimensional confounding [43].

To overcome these challenges, this study adopts the Double Machine Learning (DML) framework proposed by Chernozhukov V. et al. [44]. DML flexibly estimates high-dimensional nuisance functions using machine learning while preserving valid inference for the treatment effect. Orthogonalization removes the influence of control variables from both the outcome and treatment equations, reducing omitted variable bias. Cross-fitting mitigates overfitting and regularization bias by splitting the sample into training and estimation subsets. Importantly, DML does not rely on strict parallel trends; instead, it flexibly controls for time-varying confounders, making it well-suited for staggered treatment settings with heterogeneous effects across cohorts and time. This improves robustness within the conditional independence framework without artificially imposing exogeneity.

Building on Zhang T and Li J [45], this study adopts a double machine learning framework based on partial linear regression, specified as follows:

Y_{i t} = θ_{0} D_{i t} + g (X_{i t}) + U_{i t}

(1)

E (U_{i t} ∣ D_{i t}, X_{i t}) = 0

(2)

In Equations (1) and (2),

θ_{0}

represents the parameter of primary interest, capturing the average treatment effect of the AIIDPZ policy. The dependent variable

Y_{i t}

is defined as the GTFP of the logistics sector for province

i

in period

t

.

D_{i t}

serves as the key independent indicator, taking a value of one for provinces officially included in the AIIDPZ program by time

t

, and zero otherwise. The vector

X_{i t}

contains a set of high-dimensional covariates that jointly influence both policy exposure and GTFP in the logistics sector. The function

g (X_{i t})

captures the potentially nonlinear and unknown relationship between the high-dimensional covariates and the independent variable. To accommodate flexible machine-learning estimation, we follow the DML framework and approximate

g (X_{i t})

using a data-driven estimator

\hat{g} (X_{i t})

.

U_{i t}

is the random error term with a conditional mean of zero.

Direct estimation of Equations (1) and (2) may induce regularization bias when the double machine learning framework relies on regularized machine-learning algorithms to approximate the unknown function

\hat{g} (X_{i t})

. Although such regularization helps control estimator variance, it may introduce bias into the estimation of the treatment effect

{\hat{θ}}_{0}

, particularly in finite samples. To improve convergence rates and mitigate the bias of the treatment effect estimator in finite samples, auxiliary regression Equations (3) and (4) are specified as follows:

D_{i t} = m (X_{i t}) + V_{i t}

(3)

E (V_{i t} ∣ X_{i t}) = 0

(4)

Here,

m (X_{i t})

represents the conditional expectation of the treatment indicator

D_{i t}

given the high-dimensional covariate set

X_{i t}

, capturing the selection mechanism through which

X_{i t}

jointly affects policy assignment via

m (X_{i t})

and the outcome variable via

g (X_{i t})

. As with

g (X_{i t})

,

m (X_{i t})

is approximated in a nonparametric manner through data-driven learning algorithms, denoted by

\hat{m} (X_{i t})

.

V_{i t}

represents the stochastic error term with a conditional mean of zero.

The consistent estimate of the policy effect

θ_{0}

, as reported in Equation (5):

{\hat{θ}}_{0} = {(\frac{1}{n} \sum_{i = 1}^{I} \sum_{t = 1}^{T} {\hat{V}}_{i t} \cdot D_{i t})}^{- 1} \cdot \frac{1}{n} \sum_{i = 1}^{I} \sum_{t = 1}^{T} {\hat{V}}_{i t} \cdot (Y_{i t} - \hat{g} (X_{i t}))

(5)

Implementation details. The DML estimator is implemented using StataMP 18 (64-bit) ’s ddml command with a Python 3.8 backend. A 5-fold cross-fitting procedure is applied that respects the panel structure by randomly splitting provinces into folds, keeping all observations of a given province together. Province and year fixed effects are included as additional controls within the high-dimensional covariate set. The machine learners (e.g., Random Forest) automatically account for these dummies, residualizing the dependent variable and treatment indicator to isolate policy variation net of unobserved heterogeneity and common time shocks. Nuisance functions are estimated using Random Forest (RF) in the baseline, with LASSO, Gradient Boosting (GB), and Neural Networks (NNET) used for robustness checks. Hyperparameter tuning is performed within each training fold using the package’s default automatic tuning to prevent information leakage; the random seed is fixed at 42 for all stochastic processes to guarantee reproducibility. The number of folds is set to K = 5 in the main analysis, with K = 3 and K = 7 used in sensitivity checks.

4.2. Variables

Dependent variable. The GTFP index is adopted to characterize the overall performance of the logistics sector when resource use and environmental constraints are jointly considered [46,47]. Relative to conventional TFP, the GTFP of the logistics industry explicitly incorporates undesirable outputs, such as energy consumption and carbon emissions, thereby internalizing environmental externalities and offering a more comprehensive assessment of green transition performance. Drawing upon the environmental efficiency literature, this study employs the Super-efficiency SBM-GML index model with undesirable outputs to measure the logistics industry’s GTFP [48]. This model does not require a pre-specified production function and effectively addresses efficiency assessment involving multiple inputs and outputs, making it particularly suitable for analyzing the logistics sector under dual resource and environmental constraints. The measurement system is summarized in Table 1. Input indicators include fixed asset investment in logistics [49], labor input measured by the number of employees, and energy consumption [50]. Desired outputs include logistics value added and freight turnover, while undesirable outputs are represented by logistics-related carbon emissions. The carbon emissions of the logistics industry are estimated following the methodology established in Ren Z [51]. Energy consumption data for 16 fuel types (e.g., raw coal, cleaned coal, coke, crude oil, gasoline, kerosene, diesel, fuel oil, natural gas) are obtained from the regional energy balance tables in the China Energy Statistical Yearbook. CO₂ emission factors are calculated based on the IPCC (2006) Guidelines, using the formula: Emission factor = Net Calorific Value × Carbon Content per Unit Calorific Value × Carbon Oxidation Rate. The specific coefficients for each fuel type are consistent with IPCC default values and are adjusted to reflect China’s fuel characteristics. Emissions are calculated following the “terminal energy consumption” approach, excluding energy used as industrial raw materials (e.g., lubricants, asphalt) to avoid double-counting. The geographical scope covers 30 provinces in mainland China, and the time span is consistent with the panel data of this study (2012–2022).

Independent variable. The AIIDPZ policy was implemented in phases across provinces, with pilot provinces designated in multiple batches starting from 2019, as shown in Table 2. To capture this staggered adoption, AIIDPZ is defined as a binary variable: it takes a value of 1 if a province has been designated as a pilot in a given year, and 0 otherwise. Once a province becomes a pilot, it remains 1 in all subsequent years. This setup naturally accommodates variation in the timing of treatment [52].

Mechanism variables. Following Zhang H. et al., Zhang Z and Wu D [53,54], we employ manufacturing agglomeration (MA) and collaborative agglomeration (CA) as mechanism variables. Specifically, MA measures the concentration of manufacturing industries in a region using the location entropy index. CA evaluates the degree of industrial synergy by assessing the similarity in spatial distribution and the depth of agglomeration between AI-related industries and the logistics sector.

Control variables. To account for potential confounding factors that may simultaneously influence regional AI policy assignment and logistics GTFP, we select a set of appropriate control variables grounded in both theoretical reasoning and prior empirical literature on GTFP and logistics efficiency. Variable definitions are summarized in Table 3. Following Ling S. et al. [55], Shen N. et al. [56], and Jiang Y and Sun J [57], our controls capture four key dimensions: First, regional economic development and industrial structure (GDP per capita, industrial structure, industrial share) are included because more developed regions tend to have better digital infrastructure and innovation capacity, which may correlate with both pilot zone designation and logistics productivity. Second, government support and policy environment (R&D expenditure share, environmental protection spending, transport spending) are controlled for, as fiscal priorities directly shape the local innovation ecosystem and green transition incentives. Third, logistics infrastructure and factor endowments (logistics infrastructure, talent agglomeration) are essential determinants of the sector’s absorptive capacity for AI technologies. Fourth, openness to trade and investment (foreign trade volume, fixed asset investment) capture external knowledge spillovers and capital availability that can drive productivity improvements. This multi-dimensional approach ensures that our empirical strategy accounts for the main observable confounders identified in previous GTFP and logistics research. While DML mitigates bias from high-dimensional controls, the theoretical relevance of these variables—as validated by the cited studies—provides an additional layer of confidence in the conditional independence assumption underlying our causal identification.

4.3. Data

For this research, we collected a dataset that tracks 30 provinces over the years 2012 to 2022. China officially designated the first group of provinces as Artificial Intelligence Innovation Development Pilot Zones (AIIDPZ) in September 2019. The sample comprises 16 provinces designated as AIIDPZ, forming the treatment group, with the remaining provinces serving as the control group. Data used in this study were mainly obtained from the Statistical Yearbook of the National Bureau of Statistics of China and are summarized in Table 4, which presents descriptive statistics for the variables.

5. Results and Analysis

5.1. Benchmark Regression

Guided by the theoretical framework and prior model specifications, DML is employed for estimation, with 5-fold CV applied for robustness. Specifically, the model implements the main and auxiliary regressions using the random forest (RF) algorithm. The estimation results are reported in Table 5.

As shown in Column (1) of Table 5, the AIIDPZ coefficient is 0.106 and statistically significant at the 1% level. Since the dependent variable GTFP is measured as an index and the independent variable is binary, this coefficient implies that the implementation of the AIIDPZ policy leads to an average increase of approximately 10.6% in logistics GTFP compared to non-pilot provinces, providing strong empirical support for Hypothesis H1. Column (2), which includes quadratic terms for the control variables, indicates that the policy effect remains robust and statistically significant, further confirming the robustness of the benchmark estimates.

5.2. Robustness Tests

To ensure the credibility of our baseline findings, we conduct a series of robustness checks addressing potential concerns related to parallel trends, outliers, functional form, sample selection, cross-validation folds, algorithm choice, and omitted variables. These tests confirm that our main conclusion is robust to alternative model specifications and potential confounding factors.

5.2.1. Parallel Trend Test

Although DML provides flexibility regarding parallel trends, assessing pre-policy comparability remains important in policy evaluation. We therefore employ an event study approach using the period immediately before implementation as the baseline. As shown in Figure 2, all pre-policy coefficients are statistically insignificant, confirming that the parallel trends assumption holds. Post-policy coefficients are positively significant, indicating that the AIIDPZ policy enhances logistics GTFP. The insignificant effect in the first year after implementation suggests a short-term policy lag, likely reflecting the time required to establish supporting measures [58].

5.2.2. Delete Outliers

To mitigate the potential influence of extreme outliers and account for substantial heterogeneity across provinces or years, continuous variables were winsorized at the 1% and 5% quantiles to mitigate the impact of outliers, except for policy variables. As shown in Table 6 (Column 1), the coefficients of the AIIDPZ remain positive with 1% significance after trimming, confirming benchmark regression results are robust and not influenced by outliers.

5.2.3. Interactive Model

To address potential biases associated with functional form misspecification in the benchmark model, this study employs a more flexible interactive specification within the dual machine learning framework. This model allows for interactions between the policy variable and all high-dimensional control variables. Table 6 (Column 2) contains the results of the estimation procedure. Within the interactive model, the AIIDPZ variable retains a positive and statistically significant coefficient at the 1% level, confirming that its impact on the logistics industry’s GTFP is robust and not sensitive to the choice of functional form.

5.2.4. Sample Adjustments

Focusing on a symmetric three-year window around the policy implementation (2016–2022) helps reduce potential confounding from distant-period data. Column (3) of Table 6 demonstrates that the independent variable retains a positive effect with 5% significance, confirming that the policy’s impact is robust throughout the main implementation period.

5.2.5. Fold Adjustments

To assess the sensitivity of estimation results to different sample partitioning ratios, this study varied the cross-validation fold ratio from the baseline 1:4 (5 folds) to 1:2 (3 folds) and 1:6 (7 folds) for robustness checks. The results in Table 7 (Column 1) show that across different partitioning ratios, the coefficients remained positively significant at the 5% level or higher, exhibiting stability. Across alternative sample partitions, the estimation results exhibit consistency, providing further evidence that the study’s conclusions are robust.

5.2.6. Algorithm Substitutions

To assess robustness to algorithmic choice, we re-estimate the model using LASSO, gradient boosting (GB), and neural networks (NNET). As shown in Column (2) of Table 7, the AIIDPZ coefficient remains positive and statistically significant at least at the 5% level across all algorithms, confirming that our main finding is not driven by a specific modeling approach. NNET, as a deep learning algorithm, captures complex high-dimensional nonlinear relationships through the hierarchical stacking of neuron weights. Their more aggressive feature extraction and function approximation characteristics often lead to amplified coefficient values. In contrast, traditional machine learning algorithms such as RF, LASSO, and GB, based on decision tree ensembles or regularized linear estimation, have more conservative coefficient scales. Despite these numerical variations arising from model mechanics, the consistent sign and significance across all algorithms robustly support the core conclusion that the AIIDPZ policy significantly enhances logistics GTFP.

5.2.7. Additional Control Variables

To address potential omitted variable bias, we sequentially introduce three additional controls: transportation network density (road), measured as the total length of railway, highway, and waterway per km² of regional area; environmental regulation intensity (regul), calculated as the ratio of industrial pollution treatment investment to industrial value added; and energy consumption structure (ener), defined as the share of coal consumption in total regional energy consumption. Following a stepwise approach, we first include the linear term of road (Column 1), then add its quadratic term (Column 2). Next, we incorporate the linear terms of road and regul (Column 3), followed by their quadratic terms (Column 4). Finally, we add the linear terms of all three variables (Column 5) and subsequently their quadratic terms (Column 6). The original controls (Table 3) and their quadratic terms are included throughout. As reported in Table 8, the AIIDPZ coefficient remains positive and statistically significant at the 1% level across all specifications, with magnitudes ranging from 0.088 to 0.112—closely comparable to the baseline estimate of 0.106. The slight variations reflect the incremental inclusion of covariates rather than any substantive change in the policy effect, confirming that our main finding is robust to potential omitted variable bias.

5.3. Endogeneity Analysis

To further address potential endogeneity arising from omitted variables or reverse causality, we incorporate an instrumental variable (IV) strategy within the Double Machine Learning framework. Although DML mitigates bias from high-dimensional observable confounders, it does not fully eliminate the possibility that policy designation correlates with unobserved regional characteristics.

We construct an instrumental variable Z defined as the interaction between regional AI patent applications and the AIIDPZ policy dummy [59]. Regions with stronger AI patent activity possess greater technological capacity and institutional readiness to effectively implement the policy when it is introduced. Thus, the interaction captures heterogeneity in policy implementation intensity and is strongly correlated with the endogenous treatment variable, satisfying the relevance condition. At the same time, AI patent stock alone does not directly enhance logistics GTFP in the absence of the AIIDPZ policy. Without the institutional coordination, financial incentives, and regulatory support provided under the AIIDPZ framework, AI patent activity is unlikely to translate into measurable improvements in logistics green productivity. Moreover, we control for observable factors that could independently affect GTFP, reducing potential alternative transmission channels. Hence, the instrument influences logistics GTFP only through its effect on the policy variable, satisfying the exclusion restriction. The interaction term thus captures the “policy-enabled activation effect” of technological endowment on policy intensity.

Within the DML-IV framework, estimation proceeds via an orthogonalized two-stage procedure. In the first stage, machine learning algorithms flexibly estimate the relationship between the instrument and the endogenous policy variable. In the second stage, the predicted treatment component is used to estimate the causal effect on GTFP while implementing orthogonal score construction and cross-fitting to ensure valid inference under high-dimensional controls. As reported in Table 9, after accounting for endogeneity through the DML-IV approach, the AIIDPZ coefficient remains positive and statistically significant at the 5% level, consistent with baseline estimates. This consistency further supports the robustness and credibility of the estimated policy effect.

5.4. Heterogeneity Analysis

Regional imbalances in China’s economic development have led to substantial regional heterogeneity in the effectiveness of the AIIDPZ policy. We conduct grouped regression analyses across seven geographical divisions, with results summarized in Table 10. Regions can be classified into three categories: (1) Key beneficiary regions, comprising East China, Central China, and Southwest China, where the policy effects are significant at the 1–5% level. East China rapidly assimilates AI technologies through its well-established industrial chains and digital infrastructure [60]; Central China leverages “external technological empowerment” to circumvent traditional development pathways; Southwest China exploits the “East Data, West Computing” digital infrastructure advantage to effectively promote distinctive and cross-border logistics operations [61]. (2) Transformation-Catching-Up Regions, including Northeast and Northwest China, the policy coefficients reach statistical significance at the 10% threshold. The Northeast’s legacy industrial base leverages the policy to upgrade traditional industries, while the underdeveloped Northwest regions alleviate information constraints via knowledge spillovers and capital inflows. (3) Complex Scenario Regions, comprising North and South China, exhibit statistically insignificant policy effects. In North China, internal developmental disparities dilute the average effects, whereas in South China, the outward-oriented economic structure and spontaneous market-driven technological upgrades diminish the marginal effectiveness of the policies. Evidence suggests that the effectiveness of the AIIDPZ policy is strongly dependent on regional industrial foundations and digital readiness. Therefore, promoting a close and effective alignment between artificial intelligence and the real economy requires regionally tailored policies rather than a generalized, one-size-fits-all approach.

5.5. Analysis of the Influence Mechanisms

To systematically examine the underlying mechanisms by which the AIIDPZ policy affects the logistics industry’s GTFP, we empirically test two potential mediating mechanisms: manufacturing agglomeration and collaborative agglomeration. Following the mechanism analysis methods of Chen et al. [62], and Jiang T [63], the outcomes of the empirical analysis are illustrated in Table 11.

5.5.1. Manufacturing Agglomeration

Column (2) of Table 11 reports the effect of the AIIDPZ policy on manufacturing agglomeration. The coefficient is −0.074, statistically significant at the 1% level in the negative direction, indicating that the policy’s implementation significantly reduces the spatial concentration of manufacturing enterprises within the region [64]. This finding aligns with theoretical predictions: the AIIDPZ policy triggers spatial substitution and factor reallocation—encouraging energy-intensive manufacturing firms to relocate from congested areas and redirecting capital and talent toward AI-enabled service sectors. Theoretically, while manufacturing agglomeration can generate “agglomeration economies” that yield scale benefits (positive environmental externalities), it may also lead to “agglomeration diseconomies”—generating a large-scale, high-frequency logistics demand that further exacerbates regional traffic congestion, environmental pollution, and factor price inflation (negative environmental externalities) [65,66]. Further empirical evidence suggests that manufacturing agglomeration, particularly under energy-intensive industrial structures, significantly undermines the green efficiency of the logistics sector by intensifying regional transport congestion and energy consumption [67]. This study finds that the policy reduces the level of manufacturing agglomeration, thereby alleviating the congestion pressures and negative environmental externalities that hinder the green transition of the logistics sector, creating favorable conditions for logistics GTFP improvement. Column (1) confirms the positive direct effect of the policy on GTFP, thereby establishing a clear mediating pathway: the AIIDPZ policy can enhance logistics green total factor productivity by mitigating the negative environmental externalities associated with excessive manufacturing agglomeration, thus supporting Hypothesis H2.

5.5.2. Collaborative Agglomeration

Column (3) of Table 11 reports the effect of the AIIDPZ policy on collaborative agglomeration. The estimated coefficient for collaborative agglomeration is −0.049, negative and significant at the 10% level. These results indicate that the AIIDPZ policy reduces disorderly spatial co-location and integration between the AI industry and the logistics sector. This finding aligns with theoretical predictions: in the absence of effective coordination mechanisms and planning, rapid and disorderly co-agglomeration generates competition for land, energy, capital, and financial resources, leading to a “resource crowding-out effect” that outweighs short-term knowledge spillover benefits [68]. Further empirical evidence suggests that industrial co-agglomeration may trigger intersectoral competition for resources, giving rise to crowding-out effects that ultimately hinder overall green total factor productivity [69]. Our results show that by reducing this disorderly co-agglomeration, the AIIDPZ policy helps avoid resource competition that would otherwise divert financial resources away from logistics firms’ green technology upgrades. In other words, the policy mitigates the negative environmental externalities of agglomeration—resource crowding-out effects arising from congestion, competition, and coordination failures—rather than suppressing beneficial collaboration, thereby contributing to logistics GTFP improvement. This interpretation is consistent with the positive direct effect of the policy on GTFP (Column 1), confirming that the AIIDPZ policy promotes improvements in logistics GTFP by alleviating the resource crowding-out effects arising from collaborative agglomeration. Thus, Hypothesis H3 is supported.

5.6. Further Analysis

While the DML framework is employed to identify the causal impact of the AIIDPZ policy under high-dimensional confounding, it primarily captures the average treatment effect at the regional level. However, given the strong interregional linkages of the logistics industry and the diffusion characteristics of digital technologies, policy impacts may extend beyond the treated provinces through spatial spillovers. Therefore, we further incorporate spatial econometric models to examine whether the AIIDPZ policy generates spillover effects across neighboring regions. To formally examine spatial dependence and quantify interregional spillover effects, this study employs a Spatial Autoregressive Model (SAR). The SAR specification is selected because the logistics industry’s GTFP is likely to exhibit regional coordination and technology diffusion effects, efficiency improvements in one region can affect neighboring regions via demonstration and competitive effects. Prior to model estimation, an evaluation of potential spatial spillovers is conducted through the computation of Global Moran’s I statistics using provincial AIIDPZ policy dummy variables for the period 2019–2022.

The computation procedure is presented in Equation (6):

M o r a n ’ I = \frac{n}{\sum_{i = 1}^{n} \sum_{j = 1}^{n} w_{i j}} \times \frac{\sum_{i = 1}^{n} \sum_{j = 1}^{n} w_{i j} (x_{i} - \bar{x}) (x_{j} - \bar{x})}{\sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}}

(6)

Among these,

n

denotes the number of provinces,

x_{i}

and

x_{j}

are the observed values for provinces

i

and

j

, respectively,

\bar{x}

is the mean of the observed values, and

w_{i j}

is the spatial weight matrix between provinces

i

and

j

. The spatial autocorrelation test results are presented in Table 12.

The Moran’s I results reported in Table 12 confirm significant positive spatial autocorrelation during the early implementation period (2019–2021), suggesting that provinces designated as AIIDPZ tend to cluster geographically. This spatial clustering justifies the adoption of a spatial econometric framework. Analysis indicates that the spatial distribution of the AIIDPZ policy has evolved over time, with policy intensity concentrated in several core urban clusters during the initial rollout in 2019, notably the Beijing–Tianjin–Hebei region, the Yangtze River Delta, and the Greater Bay Area (Moran’s I = 0.1167, p < 0.01). The Moran’s I index decreased slightly to 0.0958 in 2020 and 0.0940 in 2021, while remaining statistically significant. By 2022, the Moran’s I index had fallen to 0.0460 and was no longer statistically significant, indicating a shift from a highly concentrated policy distribution in core regions to a more dispersed pattern across China. This trend reflects the strategic transition from localized pilot demonstrations to broader regional diffusion.

Following LeSage and Pace (2009), a SAR model as the baseline spatial specification is employed to account for such spatial dependence in the dependent variable [70], with the model specification presented in Equation (7):

Y_{i t} = ρ W \cdot Y_{j t} + β_{1} D_{i t} + β X_{i t} + ε_{i t}

(7)

Among these,

ρ

is the spatial autoregression coefficient, measuring the extent to which the logistics industry’s GTFP in neighboring regions influences that in the local region. The benchmark spatial weight matrix is constructed as an economic distance matrix, defined as the inverse of the absolute difference in per capita GDP between provinces. The matrix is row-standardized to ensure comparability across regions. To test robustness, an alternative economic–geographic nested spatial matrix is constructed by interacting geographic contiguity with economic similarity. This nested matrix captures both physical proximity and economic linkage intensity.

Unlike conventional regression models, coefficients in SAR models do not directly represent marginal effects. Therefore, following impact decomposition methodology, we calculate the direct effects (local impacts), indirect effects (spatial spillovers), and total effects (overall impacts). Table 13 presents the estimation results of the SAR model. Specifically, Column (1) reports the results based on the economic distance spatial weight matrix, while Column (2) presents the results using the economic–geographic nested spatial weight matrix for robustness analysis.

The decomposed indirect effects derived from the SAR model indicate that the AIIDPZ policy exerts a robustly positive and significant spatial spillover affecting GTFP in the logistics industry. The estimated direct effect is 0.050, the indirect effect is 0.098, and the total effect is 0.147, with all effects significant at the 10% level. Notably, the magnitude of the indirect effect exceeds that of the direct effect, suggesting that the policy’s influence operates primarily through interregional transmission mechanisms rather than purely localized improvements. This pattern reflects the networked nature of the logistics industry, where digital infrastructure upgrading, intelligent scheduling systems, and supply-chain integration in one province can reduce coordination costs and carbon intensity across adjacent regions. From an economic perspective, the findings imply that AIIDPZ functions as a platform-type policy instrument that generates increasing returns through spatial diffusion and technology spillovers. From a policy standpoint, the dominance of spillover effects underscores the necessity of strengthening cross-regional coordination, data interoperability, and green logistics corridors to fully internalize positive externalities. Therefore, optimizing the spatial deployment of AI pilot zones should move beyond isolated provincial designation toward a more connectivity-oriented strategy that maximizes national-level green productivity gains. This finding aligns with China’s ongoing strategy of promoting integrated regional development and digital infrastructure interconnection across provinces. Using the economic–geographic nested matrix, the spillover effects remain robust. The persistence of significant spillovers confirms that the findings are not sensitive to alternative spatial weight specifications.

The temporal and spatial evolution of GTFP is further examined in the context of the logistics sector under AIIDPZ influence. Over time, a stepwise increase in sectoral GTFP is observed, corresponding closely to the rollout of the AIIDPZ policy. The period from 2012 to 2016 represents an exploratory phase, with average efficiency stabilizing at low-to-medium levels. Following the promulgation of China’s New Generation Artificial Intelligence Development Plan in 2017, an acceleration phase began, during which the average efficiency increased from low-to-medium to medium-to-high levels by 2020. The pilot zone policy initiated in 2019 marked a pivotal turning point. The period from 2021 to 2022 represents a stabilization phase, with average efficiency remaining at high levels, indicating a shift from isolated AI-driven breakthroughs toward systematic integration with green logistics. Spatially, the efficiency distribution follows a gradient characterized by Eastern leadership, Central rise, and Western catch-up. Eastern coastal provinces maintain high efficiency levels, benefiting from advanced digital infrastructure. Central regions show steady efficiency gains, reflecting AI’s enabling effects on traditional manufacturing areas. Western regions continue to exhibit relatively low efficiency, although provinces such as Gansu demonstrate gradual recovery, demonstrating the inclusive benefits of policy implementation. Notably, the coefficient of variation of efficiency across provinces has declined annually, suggesting a gradual convergence of regional disparities despite fluctuations, as the diffusion of AI technology facilitates coordinated regional development. In summary, the policy-driven improvement in the logistics industry’s GTFP under the AIIDPZ initiative reflects a dynamic process that intensifies over time and progresses spatially from regional imbalance toward coordinated development. Beyond confirming the aggregate policy effect, the analysis uncovers substantial regional heterogeneity. Figure 3 visualizes the temporal and spatial evolution of GTFP within the logistics sector at the provincial level in China.

6. Conclusions and Policy Implications

6.1. Conclusions

This study exploits China’s New Generation Artificial Intelligence Innovation and Development Pilot Zones (AIIDPZ) policy as a quasi-natural experiment. Using provincial-level panel data from 2012 to 2022, and integrating the double machine learning framework with spatial econometric models, it systematically identifies the causal effects and the channels through which the AIIDPZ policy affects the logistics sector’s GTFP. The analysis is conducted within the conceptual frameworks of regional innovation systems and endogenous growth theory. The primary outcomes are reported below. First, the AIIDPZ policy significantly increases the logistics industry’s GTFP, and the result is consistently supported by extensive robustness checks. Second, the mechanism analysis indicates that the AIIDPZ policy promotes improvements in the logistics industry’s GTFP by mitigating the negative externalities associated with manufacturing agglomeration and collaborative agglomeration. Specifically, the policy alleviates the environmental externalities arising from excessive manufacturing concentration, thereby reducing congestion and pollution pressures that hinder green efficiency gains in logistics. Meanwhile, by easing the resource crowding-out effects embedded in collaborative agglomeration between the AI and logistics sectors, the policy ensures that technological spillovers are more effectively transformed into green productivity improvements. Third, heterogeneity analysis indicates substantial regional variation in policy effectiveness. The policy exerts the strongest positive impact in East China, Central China, and Southwest China; a moderate positive effect in Northeast and Northwest China; and no statistically significant effect in North and South China. These findings imply that the AIIDPZ policy’s effectiveness is contingent upon regional development levels and industrial structure characteristics. Fourth, the AIIDPZ policy contributes positively to neighboring regions through a significant spatial spillover mechanism. Beyond directly enhancing the logistics industry’s GTFP in local regions, the policy also generates substantial gains for neighboring provinces through spatial interactions. This contributes to a stepwise increase in the time evolution of logistics industry’s GTFP and establishes a spatial pattern characterized by “Eastern leadership, Central rise, and Western catch-up” across China. Overall, by employing a rigorous causal inference framework, this study systematically uncovers the multiple transmission channels, regional heterogeneity, and spatiotemporal dynamics through which AI policies affect green development in the logistics sector. The findings provide robust empirical evidence on the role of digital technologies in promoting industrial green transformation and offer a foundation for formulating differentiated and coordinated regional development policies.

Despite these contributions, several limitations warrant further research. First, due to data constraints, the analysis is conducted at the provincial level, potentially obscuring city- or firm-level heterogeneity. Future studies could employ more granular micro-level data to uncover firm-level behavioral mechanisms underlying policy-induced green productivity improvements. Second, while manufacturing agglomeration and collaborative agglomeration are identified as key transmission channels, other mechanisms—such as technology diffusion, green finance, and digital platform integration—may also mediate the relationship between AI policy and logistics GTFP. Exploring these additional pathways would provide a more comprehensive understanding of AI-driven industrial transformation. Third, as the findings are grounded in China’s institutional context, their external validity may depend on structural factors such as digital infrastructure, marketization level, and regional innovation capacity. Future cross-country comparisons could further assess the generalizability of AI-oriented industrial policies across different institutional settings.

6.2. Policy Implications

Given the empirical results and the overarching objective of achieving synergy between technological progress and environmental sustainability, relevant policy implications are articulated as follows.

First, design differentiated policy pathways tailored to regional development stages. Heterogeneity analysis shows that the AIIDPZ policy has the strongest positive effects in East China, Central China, and Southwest China; weaker effects in Northeast and Northwest China; and insignificant effects in North and South China. This implies that a uniform policy approach is ineffective. For eastern coastal regions, where digital infrastructure and logistics systems are mature, policies should focus on building an integrated “AI + Green Logistics” innovation ecosystem. This involves promoting smart ports, autonomous delivery networks, and scenario-based applications to generate scalable and replicable solutions. For western and northeastern regions with relatively weak industrial foundations, a phased strategy is recommended. In the short term, leverage basic AI dispatch tools to upgrade bulk commodity logistics and cross-border corridors (e.g., the China-Europe Railway Express), avoiding blind replication of coastal e-commerce models. In the medium term, develop specialized logistics niches—such as cold-chain logistics in the northwest or heavy equipment logistics in the northeast—by integrating AI-enabled route optimization and predictive maintenance. In the long term, through national initiatives like “East Data, West Computing,” transform these regions from resource suppliers into core nodes in the digital logistics network, hosting data-intensive applications and providing computational support for smart logistics systems nationwide.

Additionally, optimize industrial spatial layouts to foster green and collaborative agglomeration. Mechanism analysis indicates that the AIIDPZ policy enhances logistics GTFP by mitigating the negative environmental externalities of excessive manufacturing agglomeration and disorderly collaborative agglomeration. To operationalize these findings, policy design should guide the development of green and collaborative spatial layouts. In regions with high manufacturing concentration, policymakers should establish industrial entry assessment mechanisms based on logistics intensity and environmental impact to prevent excessive clustering of energy-intensive, high-emission industries. Governments should promote smart shared warehousing and multimodal transport hubs to relieve congestion pressures and reduce carbon emissions. For collaborative agglomeration, resource-sharing platforms between AI enterprises and logistics operators should be established to facilitate knowledge spillovers and mitigate early-stage competition for capital, talent, and infrastructure.

Finally, leverage spatial spillovers through cross-regional knowledge transfer and infrastructure integration. Spatial econometric results confirm significant positive spillovers to neighboring regions. To amplify these spillovers and promote balanced regional development, “AI + Logistics” cross-regional knowledge transfer platforms should be established to facilitate the sharing of best practices, green logistics technologies, and intelligent dispatching algorithms between eastern coastal provinces and inland regions. These platforms can take the form of joint innovation alliances, regular technology workshops, and open-source algorithm repositories. Furthermore, the AIIDPZ policy should be deeply integrated with national infrastructure strategies such as “East Data, West Computing” by prioritizing the deployment of AI-powered logistics data sub-centers in western computing hub nodes. These centers would host data-intensive computing tasks—such as predictive fleet management and real-time route optimization—generated by eastern logistics enterprises, transforming the western regions from mere data storage locations into intelligent computing service providers. Building on this, a “data computation in the west + logistics application in the west” closed-loop empowerment mechanism should be established, supporting local enterprises in developing customized applications like intelligent load planning and cross-border cold-chain logistics using nearby computing resources. This will enable central and western regions to transition from passive resource suppliers into active participants and beneficiaries of the digital logistics ecosystem.

In summary, effectively integrating artificial intelligence innovation with green logistics development necessitates a context-sensitive policy framework that balances technological advancement with environmental sustainability. These policy recommendations offer a structured approach for regions to design strategies tailored to their development conditions, collectively supporting long-term sustainable development objectives.

Author Contributions

Conceptualization, Y.M. and J.Z.; methodology, J.Z.; software, J.Z.; validation, Y.M.; formal analysis, Y.M. and J.Z.; resources, Y.M.; writing—original draft preparation, J.Z.; writing—review and editing, Y.M. and J.Z.; visualization, J.Z.; supervision, Y.M.; project administration, Y.M.; funding acquisition, Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

Natural Science Foundation of Gansu Province: 25JRRA231; Project of Soft Science in Gansu Province: 24JRZA173.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We would like to sincerely thank all the editorial staff and related personnel of the journal for their assistance and support throughout the review and publication process. We would also like to thank the reviewers for their professional and insightful comments on the manuscript. In addition, we are also grateful to Xufeng Cui for his valuable assistance and support during the manuscript writing process.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Rawat, K.S.; Yadav, M. Decoding the intellectual landscape and evolving research paradigm of artificial intelligence in environmental sustainability. IEEE Internet Things J. 2025, 12, 44343–44366. [Google Scholar] [CrossRef]
IDC. Worldwide AI and Generative AI Spending Guide. Available online: https://my.idc.com/getdoc.jsp?containerId=IDC_P33198 (accessed on 10 February 2026).
National Science and Technology Council. The National Artificial Intelligence Research and Development Strategic Plan: 2023 Update; National Science and Technology Council: Washington, DC, USA, 2023. [Google Scholar]
EC. Coordinated Plan on Artificial Intelligence 2021 Review. Available online: https://digital-strategy.ec.europa.eu/en/library/coordinated-plan-artificial-intelligence-2021-review (accessed on 10 February 2026).
MSTPRC. Notice of the Ministry of Science and Technology on Issuing the Work Guidelines for the Construction of National New-Generation Artificial Intelligence Innovation and Development Pilot Zones. Available online: https://www.gov.cn/zhengce/zhengceku/2019-12/03/content_5457884.htm (accessed on 10 February 2026).
Han, W.; Liu, J.; Yang, X.; Yang, Y. The path to green transformation of the logistics sector from the perspective of eco-efficiency. J. Clean. Prod. 2025, 525, 146547. [Google Scholar] [CrossRef]
Yao, H.; Yu, X.; Mao, H.; Zhang, H.; Thompson, R. Logistics hub policies and carbon emission efficiency: Insights into emission reduction in China. Environ. Dev. Sustain. 2025, 1–30. [Google Scholar] [CrossRef]
Wu, Y. Spatial-Temporal Characteristics and Spatial Convergence Analysis of Synergistic Pollution Reduction and Carbon Emission Reduction in China’s Transport Sector. Sustain. Dev. 2025, 15, 81. [Google Scholar] [CrossRef]
Sharma, H.; Padhi, B.; Sharif, A.; Bashir, M.F. Striving towards green total factor productivity: A bibliometric and systematic literature review for future research agenda. J. Environ. Manag. 2025, 377, 124639. [Google Scholar] [CrossRef]
Mohsen, B.M.; Mohsen, M. Reducing Emissions Through AI-Driven Multimodal Transport Optimization in IoT-Connected Environments. Int. J. Energy Res. 2025, 2025, 2399288. [Google Scholar] [CrossRef]
Zhu, Z. Spillover effects of outward foreign direct investment on total factor productivity of China’s logistics industry. J. Commer. Econ. 2022, 9, 100–103. [Google Scholar]
Chen, X.; Xu, W. Data factors, transportation infrastructure, and industrial structure upgrading: Evidence from a quantitative spatial general equilibrium model. Manag. World 2024, 40, 78–98. [Google Scholar]
Ye, C.; Feng, H. Spatiotemporal evolution and influencing factors of logistics efficiency in the Yangtze River Delta urban agglomeration. East China Econ. Manag. 2024, 38, 52–60. [Google Scholar]
Wang, C.; Ma, J.; Ma, Y.; Zhang, J. Has informatisation boosted green total factor productivity growth in China’s logistics sector? Manag. Rev. 2025, 37, 233–247. [Google Scholar]
Liu, H.; Chen, X.; Wang, X. A Study on the Threshold Effects of Urbanisation and Green Total Factor Productivity in Logistics. Rev. Econ. Manag. 2020, 36, 123–132. [Google Scholar]
Liu, W. The digital economy and environmental pollution: New evidence based on the support of logistics development. J. Clean. Prod. 2023, 427, 139210. [Google Scholar] [CrossRef]
Liang, Y.; Xu, C.; Gao, Z. Analysis of Spatiotemporal Evolution and Driving Factors of Carbon Emissions in the Logistics Sector Along China’s Belt and Road Initiative Routes. J. Environ. Eng. Technol. 2025, 15, 798–810. [Google Scholar]
McElheran, K.; Yang, M.-J.; Kroff, Z.; Brynjolfsson, E. The Rise of Industrial AI in America: Microfoundations of the Productivity J-Curve(s); US Census Bureau, Center for Economic Studies: Suitland, MD, USA, 2025. [Google Scholar]
Acemoglu, D.; Restrepo, P. Automation and rent dissipation: Implications for wages, inequality, and productivity. Q. J. Econ. 2026, qjag006. [Google Scholar] [CrossRef]
Brynjolfsson, E.; Mcafee, A. The business of artificial intelligence. Harv. Bus. Rev. 2017, 7, 1–2. [Google Scholar]
Lin, B.-Q.; Yang, Y.-X. Building efficiency: How the national AI innovation pilot zones enhance green energy utilization? Evidence from China. J. Environ. Manag. 2025, 387, 125945. [Google Scholar] [CrossRef]
Zhou, W.; Zhu, C. Artificial Intelligence Innovation and Development and Enterprise Total Factor Productivity: Evidence from National New Generation AI Innovation and Development Pilot Zones. Ind. Technol. Econ. 2025, 44, 79–88. [Google Scholar]
He, Y.; Yuan, J. The Development of Artificial Intelligence and the Enhancement of New Quality Productivity: Theoretical Mechanisms and Empirical Tests. Sci. Technol. Prog. Policy 2025, 42, 1–11. [Google Scholar]
Cao, Y.; Liu, J. Has the Development of Next-Generation Artificial Intelligence Promoted the Optimisation of Urban Industrial Structures?—A Dual Perspective Analysis of Efficiency Enhancement and Coordinated Development. Manag. Rev. 2025, 1–17. [Google Scholar] [CrossRef]
Xu, X.; Shen, S. Innovative Applications of Artificial Intelligence and the Development of New Quality Productivity in Agriculture: Empirical Evidence from National Pilot Zones for AI Innovation and Application. Inq. Into Econ. Issues 2025, 10, 57–72. [Google Scholar]
Zhang, L.; Zhou, B. The Impact of Artificial Intelligence on China’s Green Finance Development: Policy Effects from National New Generation AI Innovation Development Pilot Zones. China Bus. Mark. 2025, 39, 112–126. [Google Scholar]
Dong, X.; Zhou, N.; Zhao, X.; Yang, S. The impact of artificial intelligence on corporate green innovation: Can “increasing quantity” and “improving quality” go hand in hand? J. Environ. Manag. 2025, 376, 124439. [Google Scholar] [CrossRef] [PubMed]
Anselin, L.; Serenini, R.; Amaral, P. Alternative perspectives on spatial econometrics model specification search. Spat. Econ. Anal. 2025, 1–48. [Google Scholar] [CrossRef]
Fratesi, U.; Abreu, M.; Bond-Smith, S.; Corrado, L.; Ditzen, J.; Felsenstein, D.; Fuerst, F.; Ioramashvili, C.; Kopczewska, K.; Monastiriotis, V. Spillovers and small Spatial Scale Analyses: Contributions from Spatial Economics; Taylor & Francis: Abingdon, UK, 2025; Volume 20, pp. 1–7. [Google Scholar]
Athey, S.; Imbens, G.W. The state of applied econometrics: Causality and policy evaluation. J. Econ. Perspect. 2017, 31, 3–32. [Google Scholar] [CrossRef]
Asheim, B.T.; Gertler, M.S. The geography of innovation: Regional innovation systems. In The Oxford Handbook of Innovation; Oxford Academic: Oxford, UK, 2006. [Google Scholar]
Cooke, P. Regional innovation systems: Competitive regulation in the new Europe. Geoforum 1992, 23, 365–382. [Google Scholar] [CrossRef]
Aghion, P.; Howitt, P. A Model of Growth Through Creative Destruction; National Bureau of Economic Research: Cambridge, MA, USA, 1990. [Google Scholar]
Rolnick, D.; Donti, P.L.; Kaack, L.H.; Kochanski, K.; Lacoste, A.; Sankaran, K.; Ross, A.S.; Milojevic-Dupont, N.; Jaques, N.; Waldman-Brown, A. Tackling climate change with machine learning. ACM Comput. Surv. (CSUR) 2022, 55, 1–96. [Google Scholar] [CrossRef]
Glaeser, E.; Kallal, H.; Scheinkman, J.; Shleifer, A. Growth in cities. J. Political Econ. 1992, 100, 1126–1152. [Google Scholar] [CrossRef]
Fujita, M.; Thisse, J.-F. Economics of Agglomeration: Cities, Industrial Location, and Regional Growth; Cambridge University Press: Cambridge, UK; New York, NY, USA, 2002. [Google Scholar]
Krugman, P. Increasing returns and economic geography. J. Political Econ. 1991, 99, 483–499. [Google Scholar] [CrossRef]
Porter, M.E.; Linde, C. Toward a new conception of the environment-competitiveness relationship. J. Econ. Perspect. 1995, 9, 97–118. [Google Scholar] [CrossRef]
Jiao, K.; Xuan, J.; Du, Q.; Bao, Z.; Xie, B.; Wang, B.; Zhao, Y.; Fan, L.; Wang, H.; Hou, Z. Designing the next generation of proton-exchange membrane fuel cells. Nature 2021, 595, 361–369. [Google Scholar] [CrossRef]
Grant, R.M. Porter’s ‘competitive advantage of nations’: An assessment. Strateg. Manag. J. 1991, 12, 535–548. [Google Scholar] [CrossRef]
Yu, K.; Li, Z. Coupling coordination and spatial network characteristics of carbon emission efficiency and urban green innovation in the Yellow River Basin, China. Sci. Rep. 2024, 14, 27690. [Google Scholar] [CrossRef]
Wang, M.; Wang, Y.; Feng, C. Artificial intelligence, institutional environment, and corporate green transformation: Evidence from China’s resource-based sector. Int. Rev. Econ. Financ. 2025, 103, 104473. [Google Scholar] [CrossRef]
Athey, S.; Imbens, G.W. Machine learning methods that economists should know about. Annu. Rev. Econ. 2019, 11, 685–725. [Google Scholar] [CrossRef]
Chernozhukov, V.; Chetverikov, D.; Demirer, M.; Duflo, E.; Hansen, C.; Newey, W.; Robins, J. Double/Debiased Machine Learning for Treatment and Structural Parameters; Oxford University Press: Oxford, UK, 2018. [Google Scholar]
Zhang, T.; Li, J. Network Infrastructure, Inclusive Green Growth and Regional Disparities: Causal Inference Based on Dual Machine Learning. J. Quant. Technol. Econ. 2023, 40, 113–135. [Google Scholar]
Cao, X.; Deng, M.; Li, H. How does e-commerce city pilot improve green total factor productivity? Evidence from 230 cities in China. J. Environ. Manag. 2021, 289, 112520. [Google Scholar] [CrossRef]
Yang, W.; Liu, D.; Sui, X.; Li, F. Does logistics efficiency matter? Evidence from green economic efficiency side. Res. Int. Bus. Financ. 2022, 61, 101650. [Google Scholar]
Zhou, P.; Ang, B.W.; Poh, K.L. A survey of data envelopment analysis in energy and environmental studies. Eur. J. Oper. Res. 2008, 189, 1–18. [Google Scholar] [CrossRef]
Zhang, J.; Wu, G.; Zhang, J. Estimation of Interprovincial Physical Capital Stock in China: 1952–2000. Econ. Res. J. 2004, 10, 35–44. [Google Scholar]
Ren, J.; Gao, B.; Zhang, J.; Chen, C. Measuring the Energy and Carbon Emission Efficiency of Regional Transportation Systems in China: Chance-Constrained DEA Models. Math. Probl. Eng. 2020, 2020, 9740704. [Google Scholar] [CrossRef]
Ren, Z. Regional Variations in China’s Carbon Emissions and Research on Reduction Mechanisms. Ph.D. Thesis, Capital University of Economics and Business, Beijing, China, 2014. [Google Scholar]
Mijit, R.; Hu, Q.; Xu, J.; Ma, G. Greening through AI? The impact of Artificial Intelligence Innovation and Development Pilot Zones on green innovation in China. Energy Econ. 2025, 146, 108507. [Google Scholar] [CrossRef]
Zhang, H.; Han, A.; Yang, Q. Spatial Effects Analysis of Synergistic Agglomeration between China’s Manufacturing Sector and Productive Services. J. Quant. Technol. Econ. 2017, 34, 3–20. [Google Scholar]
Zhang, Z.; Wu, D. Industrial Spatial Agglomeration, Factor Mobility and Regional Balanced Development: An Analysis of Economic Disparities Among Cities Along the Yangtze River Economic Belt. Reform Econ. Syst. 2019, 4, 42–48. [Google Scholar]
Ling, S.; Jin, S.; Wang, H.; Zhang, Z.; Feng, Y. Transportation infrastructure upgrading and green development efficiency: Empirical analysis with double machine learning method. J. Environ. Manag. 2024, 358, 120922. [Google Scholar] [CrossRef]
Shen, N.; Zhang, G.; Zhou, J.; Zhang, L.; Wu, L.; Zhang, J.; Shang, X. Can big data policy drive urban carbon unlocking efficiency? A new approach based on double machine learning. J. Environ. Manag. 2024, 372, 123296. [Google Scholar] [CrossRef]
Jiang, Y.; Sun, J. Does smart city construction promote urban green development? Evidence from a double machine learning model. J. Environ. Manag. 2025, 373, 123701. [Google Scholar] [CrossRef] [PubMed]
Cui, Q.; Yu, H. The Development of Pilot Free Trade Zones and the Advancement of Enterprises’ New Quality Productive Forces. West. Froum 2024, 34, 1–16. [Google Scholar]
Miric, M.; Jia, N.; Huang, K.G. Using supervised machine learning for large-scale classification in management research: The case for identifying artificial intelligence patents. Strateg. Manag. J. 2023, 44, 491–519. [Google Scholar] [CrossRef]
Zhang, W.; Liu, H.; Yao, Y.; Fan, Z. A study measuring the degree of integration between the digital economy and logistics industry in China. PLoS ONE 2022, 17, e0274006. [Google Scholar] [CrossRef]
Zhang, Y.; Li, H.; Wang, S. Decarbonizing data centers through regional bits migration: A comprehensive assessment of China’s ‘eastern data, Western computing’ initiative and its global implications. Appl. Energy 2025, 392, 126020. [Google Scholar] [CrossRef]
Chen, Y.; Fan, Z.; Gu, X.; Zhou, L.-A. Arrival of young talent: The send-down movement and rural education in China. Am. Econ. Rev. 2020, 110, 3393–3430. [Google Scholar] [CrossRef]
Jiang, T. Mediation and Moderation Effects in Empirical Studies of Causal Inference. China Ind. Econ. 2022, 100–120. [Google Scholar] [CrossRef]
Richey, R.G., Jr.; Chowdhury, S.; Davis-Sramek, B.; Giannakis, M.; Dwivedi, Y.K. Artificial Intelligence in Logistics and Supply Chain Management: A Primer and Roadmap for Research; Wiley Online Library: Hoboken, NJ, USA, 2023; Volume 44, pp. 532–549. [Google Scholar]
Zhang, Z.; Chi, L.; Wu, J.; Zhu, M.; Shen, C. Industrial clustering and environmental pollution: Spatial-temporal dynamics and driving factors of China’s feed processing industry. J. Clean. Prod. 2025, 498, 145173. [Google Scholar] [CrossRef]
Lv, T.; Zhao, Q.; Fu, S.; Jin, G.; Zhang, X.; Hu, H.; Xu, G. Deciphering flows: Spatial correlation characteristics and factors influencing carbon emission intensity in the Yangtze River Delta. J. Clean. Prod. 2024, 483, 144290. [Google Scholar] [CrossRef]
Zhang, P.; Tu, X. Research on the Impact of Manufacturing Agglomeration on Improving Urban Carbon Total Factor Productivity. Urban Issues 2023, 37–45. [Google Scholar] [CrossRef]
Xie, X.; Guo, K.; Han, Z. Digital technology application and green technological innovation under economic policy uncertainty: Empirical evidence from China’s manufacturing industry. Int. Rev. Financ. Anal. 2025, 108, 104597. [Google Scholar] [CrossRef]
Li, Y.; Liu, S.; He, Z. Industrial collaborative agglomeration agglomeration, technological innovation and green total factor productivity. Syst. Eng. 2023, 41, 1–14. [Google Scholar]
LeSage, J.; Pace, R.K. Introduction to Spatial Econometrics; Chapman and Hall/CRC: Boca Raton, FL, USA, 2009. [Google Scholar]

Figure 1. Integrated analytical framework of this study.

Figure 2. Parallel trend test.

Figure 3. Spatial and temporal distribution of logistics industry’s GTFP.

Table 1. Indicator system.

Category		Indicator	Definition	Unit
Input Indicators		Fixed Asset Investment in the Logistics	Capital Investment in the Logistics	10⁸ CNY
		Number of Employees in the Logistics	Labor Input in the Logistics	10⁴ persons
		Energy Consumption of the Logistics	Material Input used in Logistics Transportation	10⁴ tce
Output Indicators	Desirable Outputs	Logistics Industry Value-Added	Value Output of the Logistics	10⁸ CNY
	Desirable Outputs	Freight Turnover	Quantity Output in Logistics	10⁸ ton·km
	Undesirable Output	Carbon Emissions from the Logistics	Environmental Pollutants Generated by Logistics Activities	10⁴ tons

Table 2. Designation years and pilot provinces of the AIIDPZ policy.

Designation Year	Provinces
2019	Beijing, Shanghai
2020	Tianjin, Zhejiang, Anhui, Shandong, Hubei, Guangdong, Chongqing, Sichuan, Shaanxi
2021	Jiangsu, Hunan
2022	Liaoning, Heilongjiang, Henan

Table 3. Control variables.

Control Variables	Symbol	Definition	Unit
Industrial Structure	Str	Ratio of tertiary to primary sector value-added	%
Industrial Share	Ind	Ratio of secondary to primary sector value-added	%
Sci-Tech Development	Sci	Share of local government R&D expenditure in regional GDP	%
Fixed Asset Investment	Inv	Gross fixed capital formation relative to regional GDP	%
Environmental Emphasis	Env	Local fiscal spending on environmental protection relative to GDP	%
Logistics Infrastructure	Len	Ln (total length of operational routes)	ln(km)
Economic Development	Ed	Ln (GDP per capita)	ln(CNY/person)
Openness to Trade	Ex	Ln (total domestic imports + exports)	ln(10³ USD)
Talent Agglomeration	Lab	Employees in IT and software services	10⁴ persons
Logistics & Transport Development	Trans	Local government transport spending relative to GDP	%

Table 4. Descriptive statistics.

Variable	Symbol	N	Mean	Std. Dev.	Min	Max
Dependent variable	GTFP	330	0.809	0.210	0.269	1.541
Independent variable	AIIDPZ	330	0.127	0.334	0.000	1.000
Control variables	Str	330	20.05	54.170	1.617	335.000
	Ind	330	0.400	0.079	0.160	0.587
	Sci	330	0.005	0.003	0.002	0.013
	Inv	330	0.639	0.292	0.096	1.724
	Env	330	0.008	0.005	0.002	0.043
	Len	330	9.789	0.830	7.569	12.270
	Ed	330	10.910	0.445	9.849	12.150
	Ex	330	17.750	1.620	12.650	21.100
	Lab	330	2.046	1.040	0.000	4.617
	Trans	330	0.018	0.016	0.004	0.116
Mechanism variables	MA	330	0.823	0.345	0.293	1.828
Mechanism variables	CA	330	2.607	0.454	1.804	3.975

Table 5. Benchmark regression results.

Variable	(1)	(2)
Variable	GTFP	GTFP
AIIDPZ	0.106 ***	0.106 ***
	(0.038)	(0.039)
Control	Y	Y
Control²	N	Y
Time FE	Y	Y
Province FE	Y	Y
Num	330	330
KF	5	5
DML	RF	RF

Notes: Robust standard errors are in parentheses. *** denotes significance at 1% levels. Y and N denote “Yes” and “No”, respectively. This notation applies to all subsequent tables.

Table 6. Robustness tests.

Variable	Delete Outliers		Interactive Model		Sample Adjustments
	(1)		(2)		(3)
	GTFP	GTFP	GTFP	GTFP	GTFP
AIIDPZ	0.100 ***	0.094 ***	0.208 ***	0.204 ***	0.090 **
	(0.039)	(0.036)	(0.020)	(0.016)	(0.040)
Control	Y	Y	Y	Y	Y
Control²	N	Y	N	Y	Y
Time FE	Y	Y	Y	Y	Y
Province FE	Y	Y	Y	Y	Y
Num	330	330	330	330	210
KF	5	5	5	5	5
DML	RF	RF	RF	RF	RF

Notes: Robust standard errors are in parentheses. **, and *** denote significance at 5%, and 1% levels. Y and N denote “Yes” and “No”, respectively.

Table 7. DML robustness tests.

Variable	Fold Adjustments				Algorithm Substitutions
	(1)				(2)
	GTFP		GTFP		GTFP		GTFP		GTFP
AIIDPZ	0.105 ***	0.105 ***	0.100 **	0.102 **	0.115 ***	0.093 ***	0.087 **	0.085 **	0.195 ***	1.032 ***
	(0.037)	(0.038)	(0.039)	(0.040)	(0.037)	(0.035)	(0.042)	(0.041)	(0.031)	(0.029)
Control	Y	Y	Y	Y	Y	Y	Y	Y	Y	Y
Control²	N	Y	N	Y	N	Y	N	Y	N	Y
Time FE	Y	Y	Y	Y	Y	Y	Y	Y	Y	Y
Province FE	Y	Y	Y	Y	Y	Y	Y	Y	Y	Y
Num	330	330	330	330	330	330	330	330	330	330
KF	3	3	7	7	5	5	5	5	5	5
DML	RF	RF	RF	RF	LASSO	LASSO	GB	GB	NNET	NNET

Notes: Robust standard errors are in parentheses. **, and *** denote significance at 5%, and 1% levels. Y and N denote “Yes” and “No”, respectively.

Table 8. Control variables robustness tests.

Variable	(1)	(2)	(3)	(4)	(5)	(6)
Variable	GTFP	GTFP	GTFP	GTFP	GTFP	GTFP
AIIDPZ	0.112 ***	0.110 ***	0.099 ***	0.098 ***	0.099 ***	0.088 ***
	(0.037)	(0.037)	(0.033)	(0.032)	(0.032)	(0.031)
Control	Y	Y	Y	Y	Y	Y
Control²	N	Y	N	Y	N	Y
Additional Controls	road	road, road²	road, regul	road, regul, road², regul²	road, regul, ener	road, regul, ener, road², regul², ener²
Time FE	Y	Y	Y	Y	Y	Y
Province FE	Y	Y	Y	Y	Y	Y
Num	330	330	330	330	330	330
KF	5	5	5	5	5	5
DML	RF	RF	RF	RF	RF	RF

Notes: Robust standard errors are in parentheses. *** denotes significance at 1% levels. Y denotes “Yes”.

Table 9. Endogeneity analysis.

Variable	IV
	Z
	(1)	(2)
	GTFP	GTFP
AIIDPZ	0.094 **	0.096 **
	(0.040)	(0.041)
Control	Y	Y
Control²	N	Y
Time FE	Y	Y
Province FE	Y	Y
Num	330	330
KF	5	5
DML	RF	RF

Notes: Robust standard errors are in parentheses. ** denotes significance at 5% levels. Y and N denote “Yes” and “No”, respectively.

Table 10. Heterogeneity analysis.

Variable	North China	East China	South China	Central China	Northwest China	Northeast China	Southwest China
	(1)	(2)	(3)	(4)	(5)	(6)	(7)
	GTFP	GTFP	GTFP	GTFP	GTFP	GTFP	GTFP
AIIDPZ	0.143	0.101 **	−0.219	0.155 ***	0.134 *	0.138 *	0.137 **
	(0.096)	(0.051)	(0.349)	(0.051)	(0.080)	(0.077)	(0.060)
Control	Y	Y	Y	Y	Y	Y	Y
Control²	Y	Y	Y	Y	Y	Y	Y
Time FE	Y	Y	Y	Y	Y	Y	Y
Province FE	Y	Y	Y	Y	Y	Y	Y
Num	55	77	33	33	55	33	44
KF	5	5	5	5	5	5	5
DML	RF	RF	RF	RF	RF	RF	RF

Notes: Robust standard errors are in parentheses. *, **, and *** denote significance at 10%, 5%, and 1% levels. Y denotes “Yes”.

Table 11. Analysis of the Influence Mechanism.

Variable	(1)	(2)	(3)
Variable	GTFP	MA	CA
AIIDPZ	0.106 ***	−0.074 ***	−0.049 *
	(0.039)	(0.026)	(0.028)
Control	Y	Y	Y
Control²	Y	Y	Y
Time FE	Y	Y	Y
Province FE	Y	Y	Y
Num	330	330	330
KF	5	5	5
DML	RF	RF	RF

Notes: Robust standard errors are in parentheses. *, and *** denote significance at 10%, and 1% levels. Y denotes “Yes”.

Table 12. Spatial autocorrelation test.

Year	I	E(I)	Sd(I)	Z	p
2019	0.1167	−0.0345	0.0389	3.8867	0.0001
2020	0.0958	−0.0345	0.0516	2.5258	0.0115
2021	0.0940	−0.0345	0.0518	2.4795	0.0132
2022	0.0460	−0.0345	0.0519	1.5515	0.1208

Table 13. Spatial autoregressive model results.

	(1)			(2)
Variable	Direct Effect	Indirect Effect	Total Effect	Direct Effect	Indirect Effect	Total Effect
Variable	GTFP	GTFP	GTFP	GTFP	GTFP	GTFP
AIIDPZ	0.050 *	0.098 *	0.147 *	0.052 *	0.057 *	0.109 *
	(0.027)	(0.056)	(0.080)	(0.030)	(0.034)	(0.063)
Control	Y	Y	Y	Y	Y	Y
FE	Y	Y	Y	Y	Y	Y
Num	330	330	330	330	330	330

Notes: Robust standard errors are in parentheses. * denotes significance at 10% levels. Y denotes “Yes”.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ma, Y.; Zang, J. Artificial Intelligence Innovation and Development Pilot Zones and Green Total Factor Productivity of the Logistics Industry: An Empirical Analysis Based on Double Machine Learning. Sustainability 2026, 18, 3092. https://doi.org/10.3390/su18063092

AMA Style

Ma Y, Zang J. Artificial Intelligence Innovation and Development Pilot Zones and Green Total Factor Productivity of the Logistics Industry: An Empirical Analysis Based on Double Machine Learning. Sustainability. 2026; 18(6):3092. https://doi.org/10.3390/su18063092

Chicago/Turabian Style

Ma, Yonggang, and Jiagen Zang. 2026. "Artificial Intelligence Innovation and Development Pilot Zones and Green Total Factor Productivity of the Logistics Industry: An Empirical Analysis Based on Double Machine Learning" Sustainability 18, no. 6: 3092. https://doi.org/10.3390/su18063092

APA Style

Ma, Y., & Zang, J. (2026). Artificial Intelligence Innovation and Development Pilot Zones and Green Total Factor Productivity of the Logistics Industry: An Empirical Analysis Based on Double Machine Learning. Sustainability, 18(6), 3092. https://doi.org/10.3390/su18063092

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Artificial Intelligence Innovation and Development Pilot Zones and Green Total Factor Productivity of the Logistics Industry: An Empirical Analysis Based on Double Machine Learning

Abstract

1. Introduction

2. Literature Review

3. Theoretical Analysis and Research Hypotheses

3.1. Overall Effects Analysis

3.2. Mechanism Analysis

4. Methodology

4.1. Model

4.2. Variables

4.3. Data

5. Results and Analysis

5.1. Benchmark Regression

5.2. Robustness Tests

5.2.1. Parallel Trend Test

5.2.2. Delete Outliers

5.2.3. Interactive Model

5.2.4. Sample Adjustments

5.2.5. Fold Adjustments

5.2.6. Algorithm Substitutions

5.2.7. Additional Control Variables

5.3. Endogeneity Analysis

5.4. Heterogeneity Analysis

5.5. Analysis of the Influence Mechanisms

5.5.1. Manufacturing Agglomeration

5.5.2. Collaborative Agglomeration

5.6. Further Analysis

6. Conclusions and Policy Implications

6.1. Conclusions

6.2. Policy Implications

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI