Structural Responses of Vegetation Resilience to Background-State and Temperature Asymmetry Across China: An Annual-Scale Causal Analysis

Wu, Shang; Du, Qingyun

doi:10.3390/f17040443

Open AccessArticle

Structural Responses of Vegetation Resilience to Background-State and Temperature Asymmetry Across China: An Annual-Scale Causal Analysis

by

Shang Wu

and

Qingyun Du

^*

School of Resource and Environmental Sciences, Wuhan University, Wuhan 430072, China

^*

Author to whom correspondence should be addressed.

Forests 2026, 17(4), 443; https://doi.org/10.3390/f17040443

Submission received: 26 February 2026 / Revised: 23 March 2026 / Accepted: 30 March 2026 / Published: 1 April 2026

(This article belongs to the Topic Artificial Intelligence, Remote Sensing and Digital Twin Driving Innovation in Sustainable Natural Resources and Ecology)

Download

Browse Figures

Versions Notes

Abstract

Vegetation resilience plays a key role in ecosystem stability as climate change and human disturbance intensify. We quantified resilience via AR(1) from kNDVI data over mainland China (2000–2024), and assessed its spatiotemporal patterns, long-term causal drivers (Causal Forest), and breakpoint-related mechanism shifts (non-stationary causal networks). Resilience varied strongly across space, with higher AR(1) values concentrated in northern transition belts and inland regions. Breakpoints clustered in 2010–2018 and showed broad synchronicity nationwide. Long-term effects were dominated by environmental background states: mean variables generally outweighed variability (CV) and memory terms, suggesting that persistent climate–environment conditions primarily shaped resilience gradients. Temperature emerged as the strongest national-scale control and acted asymmetrically across metrics—TMX strongly suppressed resilience, whereas TMN tended to enhance it—while precipitation and CO₂ gained importance regionally. Driver networks reorganized markedly across breakpoints, exhibiting high edge turnover and heterogeneous lag shifts—pointing to stage-dependent restructuring that goes beyond changes in driver strength. This framework links net effects with mechanism reorganization to help diagnose vegetation resilience under non-stationary conditions.

Keywords:

vegetation resilience; AR(1); kNDVI; Causal Forest; temperature asymmetry; background-state effect; CD-NOTS; BFAST

1. Introduction

Terrestrial ecosystems are increasingly exposed to multiple and interacting disturbances, and these pressures have substantially increased the uncertainty and risk associated with ecosystem states. Given that vegetation is a central regulator of land–atmosphere interactions, understanding its capacity to maintain functioning and recover from external perturbations is a key scientific challenge for elucidating ecosystem stability dynamics and informing appropriate mitigation strategies [1].

Since the concept of ecological resilience was introduced [2], its definition and quantitative assessment have remained central to ecological research [3]. From a methodological perspective, the theory of critical slowing down (CSD) provides a classical framework for indirectly quantifying resilience. As a system approaches a critical threshold, its recovery rate following disturbance declines, typically accompanied by statistical signals such as increasing autocorrelation and variance [4,5]. Accordingly, autocorrelation-based metrics—particularly the first-order autoregressive coefficient AR(1)—have been widely used to characterize changes in ecosystem resilience [3,6]. However, AR(1) captures a statistical signal of slowing recovery and therefore serves as a proxy for resilience rather than a direct observation of the ecosystem’s immediate recovery speed; its interpretation may also be influenced by the type of observational data, vegetation structure, and preprocessing procedures [6,7]. In remote-sensing studies of vegetation resilience based on time-series data, the choice of vegetation-state indicator can directly affect the stability and reliability of resilience estimates. Previous studies have shown that kNDVI is more closely related to vegetation productivity and performs better in terms of resistance to saturation, robustness to noise, and spatiotemporal stability; it has therefore been increasingly used to characterize vegetation ecosystem states [8,9]. More importantly, a growing number of recent studies have used kNDVI time series as the basis for deriving resilience metrics such as temporal autocorrelation, AR(1), and TAC to assess vegetation resilience across different regions and ecosystems. For example, Forzieri et al. used kNDVI-based anomaly series to evaluate changes in global forest resilience [10], Wang et al. combined kNDVI with AR(1) to examine the relationship between vegetation greening and resilience on the Loess Plateau [9], and Wu et al. used the lag-one autocorrelation of kNDVI to characterize grassland resilience on the Mongolian Plateau [11]. In addition, Smith and Boers included kNDVI in a global multi-dataset comparison and showed that it can provide reliable resilience estimates under appropriate biomass conditions [7]. Therefore, kNDVI was used in this study to characterize vegetation state and to derive resilience metrics accordingly.

Satellite observations provide extensive evidence that vegetation resilience has declined over broad areas, though this decline is far from spatially uniform, showing pronounced heterogeneity [10,12,13,14,15]. At the global scale, decreasing resilience has been repeatedly reported for forests and terrestrial ecosystems, whereas studies from China and other regional settings likewise reveal marked differences in resilience changes under contrasting drought types, ecological backgrounds, and event characteristics [16,17,18]. In addition, existing evidence suggests that both climatic mean states and climate variability jointly shape ecosystem recovery processes, with water availability, precipitation variability, warming conditions, and drought characteristics acting as major controls on resilience patterns and trends [13,14]. Meanwhile, rising atmospheric CO₂ and persistent warming may partially buffer unfavorable climatic influences in some regions, but such buffering effects are neither stable nor spatially universal, and they are often constrained by regional hydrothermal conditions and ecosystem background [19,20]. Vegetation responses to climatic stress are also strongly scale-dependent: different biomes show contrasting sensitivities to short- and long-term drought, seasonal processes, and interannual variability [21,22]. In particular, semi-arid regions and ecological transition zones tend to contribute disproportionately to interannual ecosystem variability and, therefore, often exhibit more evident shifts in resilience [17,18]. It should also be noted that different Earth observation-based resilience metrics do not always yield consistent conclusions, and that seasonal dynamics, long-term trends, and observational noise may all be embedded in vegetation signals, complicating the interpretation of CSD-based resilience indicators [23]. Taken together, current studies have provided a relatively clear picture of the broad patterns and spatial heterogeneity of vegetation resilience changes, but substantial uncertainty remains regarding the mechanisms underlying these patterns.

In response, increasing attention has turned to identifying the drivers of vegetation resilience. Recent studies have also emphasized that predictive or correlational relationships should not be directly equated with causal attribution [24,25,26,27]. Existing work at global, national, and regional scales suggests that vegetation resilience is jointly shaped by hydroclimatic conditions and variability, carbon-related background processes, and human influences, although the dominant drivers often differ across biomes, ecoregions, and drought contexts [13,16,17,18,28]. Methodologically, most of these studies still rely on correlation analysis, regression-based coefficients, residual trend analysis, explanatory or relative-importance metrics, or interpretable machine learning approaches such as random forests, BRT, and SHAP to identify dominant factors, compare their relative contributions, and characterize nonlinear relationships [17,18,28,29]. While these approaches have substantially advanced driver diagnosis, they still mainly operate at the level of associative, explanatory, or predictive attribution rather than strict causal attribution. Classical statistical and ecological studies have long distinguished description, prediction, explanation, and causal inference as different inferential goals; accordingly, strong explanatory power or predictive performance does not itself justify causal claims, especially in observational settings where confounding, unobserved confounding, shared background forcing, lagged responses, and scale mismatch are common [24,26,27]. For vegetation resilience, these challenges are particularly acute because multiple climatic, biogeochemical, and anthropogenic controls often covary in space and time. As a result, existing approaches are generally able to identify which factors are associated with resilience changes, but they are much less able to determine through which pathways these drivers act or how large their conditional net effects remain once correlated background conditions are considered. Thus, the central unresolved issue is not simply to continue ranking potential drivers but to move from driver identification toward a causal attribution framework that can distinguish driver identity, pathway interpretation, and effect magnitude [30,31].

Recent reviews emphasize that causal questions must be distinguished from correlations, predictions, and general mechanistic explanations, because these differ in their goals, identifying assumptions, and interpretations [27,32,33]. On this basis, the causal literature is commonly organized around two major branches. The first is causal discovery, which seeks to identify latent structural relationships, directions of influence, and causal networks from observational data. The second is causal inference or causal effect estimation, which focuses on quantifying the net effects of specific drivers and their heterogeneity under explicit identification assumptions [32,34]. Within causal discovery, methodological development has broadly proceeded along three major lines: constraint-based methods, which rely on conditional independence relations; score-based methods, which learn graph structures via optimization and scoring; and functional causal models, which identify directions through asymmetries in functional form and noise structure. Representative methods include PC, FCI, GES/FGES, LiNGAM, and ANM [34,35]. In parallel, causal effect estimation has evolved from earlier approaches such as matching, propensity score methods, inverse probability weighting, g-computation, and instrumental variables toward TMLE and, more recently, high-dimensional causal machine learning approaches such as DML and causal forests. This evolution reflects a broader shift from constructing more credible counterfactual comparisons to estimating average and heterogeneous effects more robustly under complex covariate structures [32,36].

As research has increasingly shifted from static to dynamic systems, causal analysis has further expanded into time-series and non-stationary settings, where the main challenges include temporal dependence, lagged pathways, contemporaneous effects, hidden confounding, and structural relations that may change over time [37]. In this line of work, one of the earliest and most influential approaches was Granger causality, followed by multivariate time-series causal discovery methods such as PCMCI, as well as CCM/cross-mapping for complex nonlinear dynamical systems. When attention shifted from dynamic linkage to possible regime-dependent change, non-stationary causal approaches such as CD-NOD and its later extensions were developed to address structural breaks and mechanism reorganization [34,37]. In ecology and environmental science, this methodological development has been accompanied by a parallel expansion in application: research has moved from experimental and quasi-experimental identification toward observational causal inference based on long-term monitoring and multi-source data, and further into Earth-system, biogeochemical, and remote-sensing attribution studies. In these settings, the key challenge is no longer simply whether variables are associated but, rather, how to make more credible attribution statements under multivariate coupling, measurement error, scale mismatch, and dynamic non-stationarity [26,27,33,37]. From this perspective, the central challenge for large-scale vegetation resilience studies is not merely to identify correlates of change but to simultaneously recover structural clues about potential dominant drivers, estimate their conditional net effects, and determine whether driver–vegetation relationships exhibit lagged propagation and stage-dependent reorganization. This provides the methodological basis for introducing different analytical tools in the subsequent sections.

Over the past 25 years, China has undergone pronounced climate change alongside rapid socioeconomic development. In this context, understanding the spatiotemporal dynamics and driving mechanisms of vegetation resilience is essential for clarifying how ecosystem stability has evolved and why it varies across regions. Previous studies have made substantial progress in characterizing resilience patterns, detecting early warning signals, assessing the impacts of extreme events, and examining multiple environmental drivers. Nevertheless, several key challenges remain in the context of large-scale resilience attribution. First, under high-dimensional covariate co-variation, the net effects of individual environmental drivers are still difficult to distinguish robustly. Second, the spatial heterogeneity of underlying mechanisms has not yet been systematically resolved. Third, under potential structural shifts and non-stationary conditions, it remains unclear whether driver–vegetation relationships undergo stage-dependent reorganization. At the same time, the rapid accumulation of publicly available remote-sensing products has created new opportunities for investigating vegetation resilience at large spatial extents and relatively fine spatial resolutions.

Considering these issues, this study conducts a nationwide analysis at a 1 km spatial resolution. Compared with coarser-resolution data, this scale represents differences in vegetation types more explicitly and reduces the influence of mixed pixels, thereby providing a stronger basis for detecting vegetation responses and spatial variability. However, finer-resolution data also introduce stronger spatial autocorrelation and higher observational noise, which place greater demands on methodological robustness.

Accordingly, this study focuses on three main scientific questions:

(1): What are the spatial patterns, long-term trends, and stage-dependent differences in vegetation resilience across China, and how does the relationship between vegetation resilience and vegetation state vary under different ecological contexts?
(2): Under high-dimensional covariate co-variation and spatial dependence, what are the magnitudes and directions of the net effects of major climatic and environmental drivers on the resilience proxy, and do these effects exhibit pronounced spatial heterogeneity and ecological-zone dependence?
(3): Under potential structural changes and non-stationary conditions, do driver–vegetation relationships show lagged propagation and stage-dependent reorganization, and if so, what are the key variables, dominant pathways, and spatial characteristics of these transitions?

To address these questions, we use the AR(1) coefficient as a statistical proxy for vegetation resilience, and then characterize its spatiotemporal dynamics at a fine spatial scale. We combine spatial causal screening, high-dimensional causal effect estimation, and breakpoint-constrained dynamic mechanism analysis to evaluate structural clues, conditional net effects, and stage-dependent reorganization among potential drivers.

2. Materials and Methods

2.1. Study Area

This study focuses on the major terrestrial ecosystems of mainland China. To enhance ecological interpretability and comparability at the national scale, vegetation communities were grouped into eight primary ecosystem types based on an established ecoregional classification scheme: Temperate Desert (TempDES), Temperate Grassland (TempGRS), Cold–Temperate Coniferous Forest (ColtemCNF), Temperate Mixed Forest (TemMIX), Qinghai–Tibet Plateau Alpine Vegetation (QTP), Warm–Temperate Deciduous Forest (WarDCD), Subtropical Evergreen Forest (SubtroDCD), and Tropical Monsoon Rainforest (TRF) (Figure 1). Hereafter, these eight terrestrial ecoregions are referred to by their abbreviations throughout the manuscript.

All subsequent analyses of vegetation resilience and its driving mechanisms were conducted on pixels representing stable natural vegetation during the study period, ensuring consistency of the land-cover background across time.

2.2. Data Sources and Overview

The present analysis integrates multiple categories of data, including vegetation indices, climatic and environmental variables, human pressure indicators, and static background factors. These datasets were compiled from publicly accessible remote-sensing products, reanalysis archives, and other authoritative sources spanning the period from 2000 to 2024. Key information on data origin, spatial resolution, and temporal coverage is provided in Table 1.

2.2.1. Vegetation Index Data

Vegetation dynamics were derived from the MODIS Vegetation Indices product MOD13A2 (Collection 6.1), which provides data at a spatial resolution of 1 km. The product includes 16-day composite NDVI observations. To construct a continuous monthly series, we processed the data in Google Earth Engine (GEE; https://earthengine.google.com/, accessed on 29 March 2026) and applied a maximum value compositing (MVC) strategy to minimize cloud-related contamination and other observational noise. Quality assurance information embedded in MOD13A2 (e.g., summary QA/VI quality flags) was used to filter out low-quality observations prior to compositing. NDVI was subsequently transformed to kNDVI using a nonlinear transformation to enhance sensitivity in high-biomass regions. The resulting monthly kNDVI series served as the primary input for constructing the vegetation resilience proxy and subsequent driver analyses.

2.2.2. Climate Drivers

Climatic drivers were used to represent water and energy conditions potentially affecting vegetation resilience, including precipitation (PRE), near-surface maximum and minimum air temperatures (TMX and TMN, respectively), vapor pressure deficit (VPD), soil moisture (SM), and surface downward shortwave radiation (SRAD). Monthly PRE, TMX, and TMN were obtained from the China Meteorological Forcing Dataset (TPDC), which provides spatially continuous gridded climate forcing data across China. VPD, SM, and SRAD were derived from the TerraClimate dataset, which combines climatically aided interpolation with a water-balance model to produce internally consistent climate and hydrological variables at the global scale [38].

These climate variables were treated as candidate drivers in subsequent analyses to characterize vegetation resilience responses under contrasting water and energy conditions.

2.2.3. Atmospheric CO₂

Atmospheric CO₂ data were obtained from the NOAA Carbon Tracker inversion product (CT series) to represent spatiotemporal variability in background CO₂ concentrations during the study period. Carbon Tracker provides gridded CO₂ mole fraction fields at a 3 h temporal resolution, derived using an atmospheric transport model constrained by multiple observational datasets [39]. To ensure consistency with the temporal resolution of vegetation indices and climate drivers, the 3-hourly CO₂ fields were aggregated to monthly means, which were then included as a candidate driver in subsequent causal analyses.

2.2.4. Land-Use Data

Land-use and land-cover (LULC) data were used to identify background surface changes during the study period and to construct a mask of stable natural vegetation. We employed the China Land Cover Dataset (CLCD), which is derived from multi-source Landsat imagery using a consistent classification framework and provides annual land-cover products at 30 m spatial resolution across China [40].

In this study, CLCD data were primarily used to detect pixels experiencing land-cover transitions during the study period. Pixels affected by substantial land-use changes were excluded from subsequent analyses to ensure that vegetation resilience was evaluated under relatively stable natural vegetation conditions.

2.2.5. Soil Data

Soil properties were used to characterize static background conditions and were included as potential control variables in subsequent analyses. We employed the Harmonized World Soil Database (HWSD)—developed jointly by the FAO, IIASA, and other institutions—which provides global gridded information on major physical and chemical properties of soil. From the HWSD, soil clay fraction and soil organic carbon (SOC) content were extracted as static covariates to represent differences in soil texture and nutrient conditions across the study area.

2.2.6. Human Footprint (HF) Dataset

The Human Footprint Index (HF) integrates multiple indicators of anthropogenic pressure, including population density, land use, transportation infrastructure, and nighttime lights, and has been widely used to quantify cumulative human disturbance at the global and regional scales [41]. HF was included as a baseline anthropogenic covariate to control for potential confounding effects of human activities on vegetation resilience.

2.2.7. Topographic Data

Topographic variables were used to represent terrain constraints on vegetation resilience. We employed the ASTER Global Digital Elevation Model (ASTER GDEM, Version 3), which provides global elevation data at a fine spatial resolution. Elevation and slope were derived from ASTER GDEM and included as topographic covariates in subsequent analyses.

2.2.8. Vegetation Type Data

Vegetation type data were used to characterize functional differences among vegetation communities across ecological contexts. We employed the vegetation type dataset of China provided by the Resource and Environment Science and Data Center (RESDC), which offers national-scale spatial information on vegetation types. Vegetation type was included as a static categorical variable to distinguish vegetation functional backgrounds in the analysis.

Before analysis, all datasets were harmonized in both space and time. For the kNDVI time series, short gaps were filled by linear interpolation after quality screening to ensure continuity for AR(1) estimation. In contrast, the other driver datasets were not gap-filled to preserve the original data values and avoid introducing artificial signals into subsequent driver analyses. Variables from different sources were reprojected and aligned to a common 1 km analysis grid. During grid matching, nearest-neighbor resampling was applied to preserve the original pixel values as much as possible and to avoid additional smoothing introduced by resampling; for categorical data such as land cover and vegetation type, this approach also helps prevent distortion of class boundaries and the generation of meaningless intermediate values. The datasets were then temporally standardized through monthly aggregation, annual summarization, or direct matching, according to the native temporal resolution of each product. In addition, annual land-cover information was used to exclude pixels affected by substantial land-use changes, so that the analysis was restricted to stable natural vegetation. Based on the harmonized datasets, the statistical features and resilience-related variables used in subsequent analyses were further derived (see Section 2.3).

2.3. Methods

To investigate the spatiotemporal dynamics of vegetation resilience and its driving mechanisms across China, this study developed a causal analytical framework that explicitly considers ecological non-stationarity and structural change. As the theoretical basis has been introduced in the Introduction, the focus here is on the operational definition of resilience used in this study. Specifically, kNDVI time series were used to characterize vegetation state, and AR(1) derived from these series was used as a proxy for vegetation resilience. In this study, AR(1) reflects temporal persistence and critical slowing down in system dynamics; therefore, we use it to capture resilience-related signals rather than to directly measure the immediate recovery speed of ecosystems.

On this basis, the subsequent analyses included three main components: First, AR(1) was derived from the kNDVI time series to characterize its spatial pattern and long-term change, and BFAST was further used to detect potential structural breakpoints in resilience dynamics. Second, GCCM was applied to screen potential drivers from high-dimensional covariates, and DML together with Causal Forest was used to estimate average causal effects and their spatial heterogeneity. Third, CD-NOTS was employed within the stages defined by the detected breakpoints to identify lagged causal structures and their reorganization.

Because the different components of the framework were designed to address distinct questions, a dual-timescale strategy was adopted. The resilience proxy itself was derived from monthly kNDVI time series, whereas the analyses of long-term drivers and net effects were conducted at the annual scale. Specifically, monthly climatic variables were first aggregated to annual means, in order to align them with the annual human footprint (HF) data, and three long-term descriptors were then derived from the resulting annual series: namely, the multi-year mean, the coefficient of variation (CV), and the lag-1 autocorrelation (AR(1)), representing background state, interannual variability, and temporal persistence, respectively. These annual-scale descriptors were used in GCCM and DML–Causal Forest analyses. In contrast, CD-NOTS was designed to identify lagged causal structure and its reorganization across breakpoint-defined stages, which required higher temporal resolution; therefore, it was applied to monthly deseasonalized anomaly series rather than to the annual descriptors.

2.3.1. kNDVI

Kernel NDVI (kNDVI) was adopted to represent pixel-level vegetation dynamics and to construct the time series used in subsequent resilience analyses, including indicators related to critical slowing down (CSD) [10]. Unlike conventional NDVI, kNDVI incorporates nonlinear information through kernel transformation, which constrains the value range and improves the stability of error propagation.

{k N D V I}_{t} = t a n h ({N D V I}_{t}^{2})

(1)

where

t a n h (\cdot)

denotes the hyperbolic tangent function.

2.3.2. CSD-Based Vegetation Resilience Indicators

To improve the robustness of autocorrelation-based resilience indicators (AR(1)) in capturing changes in recovery rates, and to minimize the influence of missing observations, seasonal cycles, and long-term trends [7], we applied a unified preprocessing procedure to pixel-level kNDVI time series. First, quality screening was performed by retaining only pixels with a valid-observation ratio above a predefined threshold (≥80%), in order to avoid unstable and biased autocorrelation estimates under excessive missingness. Second, missing values in the retained time series were filled using linear interpolation to obtain continuous series; this step was primarily applied to short gaps, whereas pixels with extensive missingness were excluded during screening. We then removed seasonal and trend components using STL decomposition (with a 12-month seasonal period) and obtained the residual series, denoted as

{\tilde{z}}_{t}

.

Based on

{\tilde{z}}_{t}

, the first-order autoregressive coefficient

ϕ

was estimated within a sliding window of length

W

. Two AR(1) metrics were derived: (1) a baseline AR(1) estimated from the full residual series over 2000–2024, and (2) a rolling AR(1) series

ϕ_{t}

estimated using a 5-year window (

W = 60

months) with a 1-month step. AR(1) was estimated via an OLS formulation of the first-order autoregressive model:

{\tilde{z}}_{t} = ϕ {\tilde{z}}_{t - 1} + ε_{t},

(2)

where

ε_{t}

is a stochastic disturbance term. Under the CSD framework, larger

ϕ_{t}

values generally indicate slower recovery and, thus, reduced resilience.

2.3.3. Trend Analysis

To examine temporal trends in resilience-related metrics over the study period, we applied the non-parametric Mann–Kendall trend test and used Kendall’s τ to quantify trend strength. The rank-based approach is robust to non-normality, outliers, and heteroscedasticity, which are common in ecological time series. Specifically, for each pixel (or ecoregion-averaged series), we computed Kendall’s τ between the time index t and the target metric, together with the associated p value, to determine the direction and strength of the trend. Positive (τ > 0) and negative (τ < 0) values indicate increasing and decreasing trends, respectively.

2.3.4. Breakpoint Detection

To identify potential structural changes in resilience time series (e.g., phase shifts, mechanism transitions, or regime shifts), we applied BFAST package in R (bfast, v1.5.7; R, v4.3.3). BFAST (Breaks For Additive Seasonal and Trend) represents a time series as an additive combination of a trend component, an optional seasonal component, and a remainder term, and it iteratively detects breakpoints in the trend (and, if specified, seasonal) component using statistical tests; it has been widely used for change detection in remote-sensing time series [42].

In this study, BFAST was applied to pixel-level (and ecoregion-averaged) resilience-related series, with the primary goal of detecting structural breaks in the trend component. Because the resilience metrics analyzed here are not assumed to exhibit a stable seasonal cycle, we disabled the seasonal component (season = none) and focused breakpoint detection on the trend term. We set the minimal segment size parameter to h = 0.15, ensuring that each segment contained at least 15% of the full record length, thereby reducing unstable estimates caused by overly short sub-series. BFAST outputs breakpoint locations as time indices. For resilience metrics derived from rolling windows, the timestamp of each metric value is defined as the window-end date; accordingly, breakpoint times were mapped onto the corresponding metric timestamps (i.e., window-end dates).

2.3.5. Construction of Long-Term Statistical Descriptors for Candidate Drivers

Because HF is available at an annual resolution, and because this study focuses on the effects of interannual variability on vegetation resilience, all climate variables were aggregated from monthly to annual timescales to ensure temporal consistency among drivers. Specifically, for each pixel, the 12 monthly values within each year were averaged to obtain an annual mean series for that variable. Based on these annual series, long-term statistical descriptors were further derived to characterize the background state, interannual variability, and temporal autocorrelation, or temporal memory, of each driver.

For each candidate driver, three long-term descriptors were extracted: The first was the multi-year mean (mean), which represents the long-term background state of the variable and was calculated as the arithmetic mean of the annual series over the whole study period. The second was the coefficient of variation (CV), which was used to characterize the intensity of interannual variability and was calculated as the standard deviation of the annual series divided by its multi-year mean. The third was AR(1), which was used to characterize the temporal persistence or memory of the interannual series. To avoid confusion with the AR(1)-based resilience proxy derived from the kNDVI time series, AR(1) here specifically refers to the first-order autoregressive coefficient of the annual driver series.

Let

X_{i, y, m}

denote the original value of a given driver at pixel

i

, in year

y

and month

m

; its annual mean series is defined as follows:

A_{i, y} = \frac{1}{12} \sum_{m = 1}^{12} X_{i, y, m}

(3)

Based on this annual series, the multi-year mean is defined as follows:

{m e a n}_{i} = \frac{1}{n} \sum_{y = 1}^{n} A_{i, y}

(4)

The coefficient of variation is defined as follows:

{C V}_{i} = \frac{S D (A_{i, 1}, A_{i, 2}, \dots, A_{i, n})}{{m e a n}_{i}}

(5)

The AR(1) of the annual driver series was estimated using the following first-order autoregressive model:

A_{i, y} = ϕ_{i} A_{i, y - 1} + ε_{i, y}

(6)

where

ϕ_{i}

denotes the first-order autoregressive coefficient of the annual series; that is,

{A R (1)}_{i} = ϕ_{i}

(7)

where

n

is the number of years in the study period. These three descriptors were subsequently used as inputs to the Causal Forest analysis to examine how interannual characteristics of different drivers influenced vegetation resilience.

2.3.6. GCCM for Spatial Causal Inference

To identify potential nonlinear and directional causal associations between the resilience metric and its candidate drivers, we employed Geographical Convergent Cross-Mapping (GCCM) [43]. GCCM extends the convergent cross-mapping idea from dynamical systems and state-space reconstruction to spatial cross-sections (or time-slice maps). Under the space-for-dynamics assumption, directional coupling is supported when the spatial information embedded in one variable can predict the other on a reconstructed manifold, and when the cross-mapping skill increases and saturates as the library size grows. We therefore interpret GCCM results as predictability-based causal evidence rather than definitive mechanistic proof.

In GCCM, a manifold

M_{X}

is reconstructed from the spatial field of a driver

X

[43]. For each spatial location

s

, we identified the

E + 1

nearest neighbors

{s_{i}}_{i = 1}^{E + 1}

on

M_{X}

(where

E

is the embedding dimension), assigned distance-based kernel weights

w_{s i}

(typically exponential and normalized), and formed the cross-mapped prediction of

Y

as

{\hat{Y}}_{s ∣ M_{X}} = \sum_{i = 1}^{E + 1} w_{s i} Y_{s_{i}}

(8)

Prediction skill was quantified based on the Pearson correlation

ρ

between predicted and observed values, which we used as a measure of predictability-based coupling strength [44]. Evidence for directional identifiability was evaluated from the convergence of

ρ (L)

with increasing library size

L

, and directionality was assessed by contrasting bidirectional cross-mapping (e.g.,

ρ (Y ∣ M_{X})

vs.

ρ (X ∣ M_{Y})

) in terms of both magnitude and convergence behavior; significant skill in both directions may indicate bidirectional coupling or asymmetric associations under common forcing.

We treated the resilience proxy AR(1) as the response

Y

and considered a set of candidate drivers

\{X_{k}\}

, including temperature (TMX, TMN), precipitation (PRE), atmospheric aridity (VPD), soil moisture (SM), shortwave radiation (SRAD), atmospheric CO₂ mole fraction (CO₂), and human footprint (HF). Static background variables (clay, soc, dem, slope) were included to characterize spatial heterogeneity and assist in interpretation. For each time slice

t

(aligned with the timestamp of the rolling-window AR(1), defined at the window-end date), we constructed synchronous spatial rasters

\{Y_{t}, X_{1, t}, \dots, X_{K, t}\}

after harmonizing projection, resolution, and grid alignment. Each candidate driver was represented by annual-scale descriptors, including the multi-year mean, coefficient of variation (CV), and AR(1) of the annual driver series; the calculation of these descriptors is detailed in Section 2.3.4.

To reduce spurious predictability arising from shared large-scale spatial gradients rather than genuine directional coupling, we detrended each variable by removing a fitted planar surface using OLS on spatial coordinates and used the residual fields as GCCM inputs. This step mitigates, but does not eliminate, the influence of common forcing and residual spatial autocorrelation [43]. Convergence was defined by a significant increase in

ρ (L)

with

L

, followed by saturation (e.g.,

ρ (L_{m a x}) - ρ (L_{m a x} - Δ L)

below a predefined threshold). We selected

E

from a candidate set by maximizing

ρ

. Library size was evaluated over an increasing sequence from 10 to 300 (step 20), using a consistent spatial sampling strategy for comparability across time slices. Statistical significance was assessed at

α = 0.01

[45]. GCCM analysis was implemented in Python using custom scripts following Gao et al. (2023) [43] for causal inference from spatial cross-sectional data, the analyses were performed in Python 3.8.20.

For each driver

X_{k}

and each time slice

t

, we output bidirectional

ρ

–library size curves and compared their convergence patterns across time slices to quantify whether driver–resilience coupling was stable or changing. These findings informed the selection of variables for DML-based effect estimation and were further integrated with the subsequent CD-NOTS results to jointly analyze stage-dependent changes in driver–resilience relationships.

2.3.7. Causal Effect Estimation with DML–Causal Forest

To robustly estimate the causal effects of multiple drivers on the resilience proxy AR(1) under high dimensionality, nonlinearity, and strong interactions, we used the Causal Forest DML estimator implemented in the Econ ML framework, which belongs to the class of Double/Debiased Machine Learning (DML) methods. The DML approach proceeds in two stages.

In the first stage, machine learning models are used to estimate nuisance functions: the outcome regression

m (x) = E [Y∣ X = x]

(9)

and the treatment regression

e (x) = E [T∣ X = x]

(10)

Based on these estimates, orthogonalized (residualized) variables are constructed as

{\tilde{y}}_{i} = y_{i} - \hat{m} (x_{i}), {\tilde{t}}_{i} = t_{i} - \hat{e} (x_{i}) .

(11)

In the second stage, causal effects are learned by fitting models to the orthogonalized residuals, exploiting Neyman orthogonality to reduce first-order sensitivity to nuisance estimation errors. Cross-fitting is applied to mitigate overfitting and “self-prediction” bias.

Causal Forest DML estimates heterogeneous treatment effects as a function of covariates. In general, the target estimand can be expressed as an effect contrast

τ (X, T_{0}, T_{1})

, representing the causal effect of changing the treatment from level

T_{0}

to

T_{1}

conditional on

X

[46]. In the case of continuous treatments, this estimand can be interpreted in terms of marginal effects, i.e., local causal responses to small changes in treatment intensity (for example, derivative-like effects

\partial E [Y ∣ d o (T = t), X = x] / \partial t

). The second-stage model uses causal forests (a form of generalized random forests) to non-parametrically learn

τ (\cdot)

, allowing treatment effects to vary flexibly across spatial and ecological contexts [47]. Because heterogeneous treatment effects are counterfactual quantities, they cannot be directly evaluated using standard supervised learning metrics. We therefore assessed model performance using the R-loss and the associated R-score. For a candidate effect function

\hat{τ} (\cdot)

, the R-loss is defined as follows:

L_{R} (\hat{τ}) = \frac{1}{n} \sum_{i = 1}^{n} {({\tilde{y}}_{i} - \hat{τ} (x_{i}) {\tilde{t}}_{i})}^{2}

(12)

Using the optimal constant-effect baseline

{\hat{τ}}_{0}

(the constant that minimizes

L_{R}

), we define an

R^{2}

-like measure, the R-score, as follows:

R - score = 1 - \frac{L_{R} (\hat{τ})}{L_{R} ({\hat{τ}}_{0})}

(13)

A higher R-score indicates that the heterogeneous effect model provides greater explanatory gain relative to the constant-effect baseline, whereas a negative R-score suggests worse out-of-sample performance, often signaling overfitting or insufficient overlap [47].

Model outputs include both the average treatment effect (ATE), defined as

E [τ (X)]

, and the pixel-level individual treatment effect (ITE),

τ (x_{i})

. Accordingly, ATE > 0 implies that the driver tends to increase AR(1) (slower recovery and reduced resilience), whereas ATE < 0 indicates a tendency to decrease AR(1) (faster recovery and enhanced resilience) [48].

We first defined a set of candidate driver categories, including climate variables (CO₂, TMX, TMN, precipitation, VPD, soil moisture, shortwave radiation) and a human activity indicator (human footprint). For each driver category, three long-term statistical descriptors were constructed: the multi-year mean (mean), the coefficient of variation (CV), and the temporal autocorrelation (AR(1)). This expansion captures differences in the average state, variability, and autocorrelation of each driver. In each iteration, one metric (e.g., CO₂_mean) was assigned as the treatment variable T, while all remaining driver metrics were included in the covariate matrix X. Static environmental baseline variables—topography (dem), soil properties (clay, SOC), and dominant vegetation type (vegetation type)—were consistently incorporated into X to reduce omitted-variable bias associated with spatial background heterogeneity and ecosystem differences.

Prior to estimation, continuous variables were standardized using z-scores. As a result, the reported ATEs and ITEs correspond to the effect of a one-standard-deviation increase in each treatment variable on the resilience proxy AR(1), allowing comparisons across different drivers and statistical measures. For implementation, 300,000 valid pixels were randomly sampled for causal effect estimation, and the data were split into training and test subsets at a 60%/40% ratio. All experiments were conducted with a fixed random seed of 42. In Causal Forest DML, the second-stage causal forest was fitted with 500 trees and a minimum leaf size of 100, whereas the first-stage nuisance random forest models were fitted with 200 trees and a minimum leaf size of 20. Average treatment effects and their uncertainty were evaluated using spatial block bootstrapping with 500 repetitions. All analyses were performed in Python 3.8.20 using econml 0.15.1.

We adopted a spatial block size of 200 km to reduce spatial autocorrelation bias in model training and validation. Preliminary variogram and Moran’s I diagnostics indicated that vegetation resilience and major climate drivers exhibit spatial dependence extending beyond local grid resolution but largely decay within 150–250 km. A 200 km block size therefore provides a conservative buffer to minimize information leakage across the training and testing sets while retaining sufficient sample size for model estimation.

To check the stability of the results, we carried out three additional tests: First, we re-estimated ATEs using independent subsamples to see whether the estimates changed under different data partitions. Second, we performed placebo tests by randomly permuting the treatment variables to disrupt the treatment–outcome link. Third, we controlled for multiple comparisons using the Benjamini–Hochberg FDR and examined how uncertainty varied under alternative block aggregation scales. Sensitivity analyses showed that the estimated effects were stable and not driven by random variation, indicating that the Causal Forest results are robust. Detailed results are provided in Supplementary Material S2.

Causal effects estimated with Causal Forest DML were interpreted as conditional causal effects given the observed covariates, rather than unconditional or assumption-free causal conclusions. In each estimation, one climate- or human-activity-related driver statistic (for example, CO₂_mean or HF_CV) was specified as the treatment variable, while the remaining driver statistics together with static background variables—such as topography, soil properties, and dominant vegetation type—were included as covariates to control for observable confounding as far as possible. Static background variables were included mainly to capture spatial heterogeneity and background environmental differences, and they were therefore treated as adjustment covariates rather than focal treatments; for this reason, their ITEs and ATEs were not estimated separately. The causal effect analysis in this study was focused on climate and human-activity driver statistics that carry interannual information. In this framework, effect identification relies on the following assumptions: First, conditional on the covariates included in the model, there is no systematic unobserved confounding between the treatment and the potential outcomes. Second, there is sufficient overlap across treatment levels within the observed covariate space, so that comparable counterfactual estimation is possible. Third, measurement errors in the variables are not strong enough to systematically distort the treatment–outcome relationship. Even so, the estimates may still be affected by unobserved disturbances, errors in remote-sensing or reanalysis data, or spatial interference among neighboring pixels. The results should therefore be interpreted as conditional causal evidence under observational constraints, rather than as definitive proof of the full underlying mechanism.

2.3.8. CD-NOTS for Non-Stationary Causal Networks

Remote-sensing ecosystem time series commonly exhibit long-term change, distribution shift, mechanism drift, and lagged ecological responses. Under such conditions, causal discovery methods that assume stationarity may confound time-varying mechanisms with stable dependence structure, leading to unstable skeleton estimation and unreliable edge orientation. To address this issue, we adopted CD-NOTS (Constraint-Based Causal Discovery from Non-Stationary Time Series), which extends the CD-NOD framework to non-stationary time series with lagged dependencies by explicitly incorporating a time-index node into constraint-based causal discovery. CD-NOTS follows a four-stage procedure: (1) adding a time-indexed node to represent non-stationarity, (2) learning an undirected causal skeleton using conditional independence (CI) tests, (3) orienting edges using temporal information and identifiable structures, and (4) further orienting the remaining edges using the principle of independent changes in causal modules [49].

In this study, CD-NOTS was applied to monthly deseasonalized anomaly series. Because the resilience proxy AR(1) was computed from rolling windows and, therefore, exhibited strong temporal smoothing and autocorrelation, it was not used directly for monthly lagged causal discovery. Instead, we used monthly deseasonalized kNDVI anomalies as the vegetation-state response variable (

R_{t}

) and monthly deseasonalized anomalies of candidate drivers (e.g., temperature, precipitation, VPD, soil moisture, shortwave radiation, and CO₂) as predictor variables. For each pixel, we constructed a multivariate time series and expanded it with a maximum lag of

L = 6

months. A time/background index node (

U_{t}

) was included in the CI-based discovery procedure to represent temporal nonstationarity (and, under the CD-NOTS assumptions, time-smooth background changes). The causal skeleton was obtained by iteratively pruning candidate links using CI tests while recording separating sets

S E P (A, B)

. We implemented linear partial correlation (ParCorr) as the CI test for pixel-wise computational scalability,

r_{A B ∣ S} = C o r r ({\hat{ε}}^{A}, {\hat{ε}}^{B})

(14)

with significance assessed using Fisher-z or t-approximation-based inference. After skeleton learning, edges were oriented using temporal precedence (arrow of time), separating-set rules (including collider/v-structure identification), and Meek rules; for unresolved cases (especially those involving

U_{t}

), CD-NOTS further uses the independent changes in causal modules principle to assist orientation [50].

CD-NOTS was run separately for the pre-breakpoint (pre) and post-breakpoint (post) segments. Pixel-wise outputs included (i) best-lag rasters (

b e s t l a g_{p r e / p o s t}

; multi-band, one band per variable), in which a value of

- 1

indicates that no significant driver–vegetation link was detected for that variable at that pixel; (ii) association-strength rasters (

b e s t c o e f_{p r e / p o s t}

), which store the conditional association statistic at the selected lag; and (iii) a quality-control mask (

s k i p_r e a s o n

) indicating whether a pixel was successfully processed. All downstream analyses were restricted to valid pixels (

s k i p_r e a s o n = 0

). The CD-NOTS analysis was implemented using custom scripts following Sadeghi et al. (2025) [50], with methodological reference to the publicly available CD-NOD framework for the nonstationary causal-discovery framework, the analyses were performed in Python 3.8.20.

It should be noted that the lagged causal structures identified by CD-NOTS are interpreted under the assumptions of constraint-based causal discovery in non-stationary settings. These include causal faithfulness, random sampling without selection bias, and pseudo-causal sufficiency, under which the effects of latent confounders can be represented by smooth functions of time-indexed variables. In addition, CD-NOTS assumes that the contemporaneous and lagged causal relations are repeatedly generated under a temporally consistent structure, so that non-stationary changes can be detected through distributional shifts across stages. Accordingly, the CD-NOTS results are interpreted here as data-driven evidence for stage-dependent lagged causal organization, rather than as definitive proof of the full underlying ecological mechanism.

3. Results

3.1. Spatial Pattern of Vegetation Resilience

From 2000 to 2024, both vegetation state and the resilience proxy showed pronounced spatial heterogeneity across China (Figure 2). The multi-year mean kNDVI generally decreased from southeast to northwest. Higher kNDVI values were mainly found in humid and semi-humid regions of southeastern China, especially in areas with dense forest cover and favorable hydrothermal conditions, where the values were mostly around 0.3–0.6. In contrast, lower kNDVI values were concentrated in the arid northwest and high-elevation regions, where they were generally within the range of 0.1–0.3. Overall, this pattern is broadly consistent with regional differences in hydrothermal conditions and vegetation growth background, and it provides the ecological context for interpreting the resilience pattern.

Compared with kNDVI, the spatial pattern of AR(1) (Figure 3) showed stronger local heterogeneity and more pronounced transition-zone features. Low AR(1) values still dominated across most of the country and were mainly associated with regions with better vegetation conditions and relatively stable ecosystems. Higher AR(1) values, in contrast, were more likely to occur in ecologically fragile areas, climatic transition zones, and regions that are more sensitive to environmental change, often appearing as patchy or belt-like clusters. Because AR(1) is used here as a proxy for vegetation resilience, higher AR(1) generally indicates stronger temporal persistence and more evident critical slowing down, thereby corresponding to lower resilience.

In general, higher kNDVI values were often associated with lower AR(1), but this relationship was not universal across China. Some areas with relatively high vegetation cover still showed elevated AR(1), indicating that greater greenness does not necessarily imply higher resilience. In other words, vegetation state and resilience were related, but they were not equivalent. This result suggests that, at the national scale, ecological stability cannot be inferred from vegetation greenness or productivity alone, and that further causal analysis is needed to identify the key drivers underlying spatial differences in resilience.

3.2. Nonlinear Changes in Vegetation Resilience

3.2.1. Spatiotemporal Distribution of BFAST Breakpoint Years

AR(1) is used as an indicator of vegetation resilience, and structural breakpoints can be interpreted as shifts in resilience states. Breakpoint years vary considerably across China (Figure 4), ranging from 2008 to 2021.

Breakpoint years are distributed across most regions, although some areas exhibit localized patch-like or banded clusters. Most breakpoints occurred between 2010 and 2018. In contrast, northeastern China and parts of the northern interior experienced relatively later transitions, mainly during 2018–2021. These differences indicate that resilience shifts were not fully synchronized nationwide; instead, the transition timing varied across regions while still showing partial temporal coherence.

Breakpoint intensity was quantified by aggregating breakpoint pixels within each of the eight terrestrial ecoregions and expressing them as a proportion of the total pixels in each region (Figure 5). This proportion reflects the relative concentration of structural shifts.

In all of the regions, breakpoint activity exhibited pronounced interannual clustering. A clear synchronous peak occurred in 2012, when several ecoregions recorded proportions exceeding 10%. Another active period was observed between 2015 and 2018, during which most regions showed sustained moderate-to-high breakpoint levels. In contrast, activity remained relatively weak during 2007–2009, and again in 2020–2021.

Temporal dynamics differed among the eight terrestrial ecoregions. Northern forest systems (ColtemCNF and TemMIX) displayed strong activity between 2012 and 2015, with elevated levels persisting locally until 2021. Transitional regions (WarDCD and TempGRS) peaked in 2012 and remained relatively active through 2014–2018. Breakpoints in high-altitude and arid zones (QTP and TempDES) were more concentrated during 2015–2018. In comparison, humid southern regions (SubtroDCD and TRF) exhibited relatively continuous breakpoint occurrences between 2010 and 2016, indicating repeated resilience adjustments over multiple years.

Overall, the breakpoints are not randomly distributed in time. Instead, they are concentrated within key temporal windows—particularly 2012 and 2015–2018—while exhibiting region-specific peak timing and persistence. These findings indicate that resilience shifts are characterized by cross-ecoregional synchrony superimposed on distinct regional response patterns.

3.2.2. Trend Patterns of Vegetation Resilience Before and After Breakpoints

At the national scale, the spatial distribution of Kendall’s τ for AR(1) differs markedly before and after the identified breakpoints (Figure 6). During the pre-break phase, negative τ values (τ < 0), indicating decreasing AR(1), dominate the mid-latitude transitional belt of northern China and several inland regions. In contrast, during the post-break phase, the proportion of significant pixels with positive τ (τ > 0) increases in multiple regions, particularly in eastern and parts of central China.

The spatial pattern derived from the full study period (2000–2024) appears comparatively weaker and more spatially fragmented. This suggests that the resilience dynamics followed stage-dependent evolutionary pathways across the breakpoint, rather than a single monotonic long-term trend.

At the ecoregional scale, the τ distributions reveal a clear northward shift from predominantly negative skewness to more positive skewness after the breakpoint, whereas arid regions remain largely negatively skewed. Before the identified structural breakpoints, northern ecoregions—including Cold–Temperate Coniferous Forest (ColtemCNF), Warm–Temperate Deciduous Forest (WarDCD), Temperate Grassland (TempGRS), and Temperate Desert (TempDES)—show τ distributions that are generally skewed toward negative values, with medians below zero in most cases. This pattern reflects a tendency for AR(1) to decline more frequently during this stage (i.e., resilience increasing). Humid southern ecoregions, i.e., Subtropical Evergreen Forest (SubtroDCD) and Tropical Monsoon Rainforest (TRF), display τ values clustered closer to zero, implying comparatively limited directional change over the same period. After the breakpoint, the τ distributions shift toward positive values in several forested and high-altitude ecoregions, particularly in WarDCD, parts of ColtemCNF, and the Qinghai–Tibet Plateau (QTP). This shift indicates an increased prevalence of rising AR(1) (i.e., resilience decreasing) after the structural transition. In contrast, Temperate Desert (TempDES) and portions of Temperate Grassland (TempGRS) retain predominantly negative or near-zero τ distributions, suggesting that arid and semi-arid systems did not synchronously transition toward increasing AR(1) after the breakpoint.

Across both pre-break and post-break phases, the relatively wide distribution ranges of τ indicate substantial internal spatial heterogeneity within each ecoregion. Over the full study period, 56.52% of statistically significant pixels exhibited τ > 0, indicating that resilience declined across the majority of regions during the past 25 years.

3.3. Causal Inference Based on GCCM

The results of GCCM (Figure 7) show that PRE, TMN, TMX, VPD, SOIL(SM), SRAD, and CO₂ exhibit relatively high convergent cross-mapping coefficients (ρ > 0.3) with statistical significance (p < 0.01), suggesting strong nonlinear coupling with vegetation resilience. In contrast, evaluation (DEM), CLAY, and SOC show moderate convergence strength (ρ between 0.2 and 0.3, p < 0.01), indicating intermediate levels of causal association.

For most climatic drivers, the X → Y mapping direction shows consistently higher ρ values than the reverse Y → X mapping, supporting a dominant directional influence from environmental drivers to AR(1) rather than strong feedback effects.

Slope (SLOPE) exhibits relatively weak X → Y convergence (ρ ≈ 0.12–0.15 across the three temporal snapshots), while the reverse mapping remains negligible. The weak and unstable convergence suggests only secondary or indirect influence, potentially reflecting spatial co-variation rather than a robust causal effect; therefore, SLOPE was excluded from subsequent analyses. The human footprint index (HF), used as a proxy for anthropogenic pressure, displays relatively low convergence values (ρ < 0.25) with noticeable fluctuation and no clear increasing trend along library size (L). The convergence weakens with increasing L, and partial intersections between the X → Y and Y → X curves are observed. The pattern indicates relatively weak coupling and may, in certain cases, reflect bidirectional effects. Part of this signal could stem from spatial heterogeneity as well as the discrete structure of the data. Even so, HF was kept in subsequent analyses to assess whether anthropogenic disturbance plays a measurable role in vegetation resilience dynamics. The spatial distribution maps of SLOPE and HF are provided in Supplementary Material S3.

We further examined the temporal evolution of ρ across three representative time slices between 2000 and 2024 (Table 2). The coupling strength exhibited clear differentiation among drivers. We can see that most variables showed a declining coupling strength over time, whereas only a small number exhibited slight increases.

The most pronounced declines occur in temperature-related variables. For example, TMX_mean decreases from 0.338 to 0.298, TMX_cv from 0.343 to 0.305, and TMX_AR(1) from 0.479 to 0.451. Similarly, VPD_AR(1) declines markedly, from 0.604 to 0.504.

Radiation- and CO₂-related variables also show moderate decreases, including SRAD_mean (0.581 to 0.568), SRAD_cv (0.595 to 0.584), SRAD_AR(1) (0.589 to 0.583), CO₂_mean (0.361 to 0.355), CO₂_cv (0.370 to 0.350), and CO₂_AR(1) (0.365 to 0.351). Soil-related variables such as SM_mean (0.570 to 0.556) and SM_cv (0.561 to 0.540) likewise exhibit gradual reductions.

In contrast, increasing trends are observed mainly in precipitation variability and certain soil attributes. Specifically, PRE_cv increases from 0.481 to 0.490 and PRE_AR(1) from 0.460 to 0.466, while soil texture and organic carbon variables (clay and SOC) show slight increases (from 0.251 to 0.261 and 0.281 to 0.287, respectively). Meanwhile, VPD_mean and VPD_cv display modest upward shifts (0.513 to 0.522 and 0.532 to 0.540, respectively).

In summary, the GCCM results suggest that, from 2000 to 2024, the inferred coupling strength between AR(1) and temperature-related variables—particularly TMX—and the memory component of VPD has weakened most clearly, whereas precipitation variability and soil-related coupling exhibit slight enhancements.

3.4. Causal Forest Analysis

3.4.1. Distribution of Estimated Average Treatment Effects (ATE) for Each Drive

The Causal Forest-estimated average treatment effects (ATEs) reveal a clear dominance of temperature-related variables, whereas other drivers exhibit smaller but often statistically significant effects.

Because increasing AR(1) corresponds to declining resilience under the critical slowing down framework, positive ATE values indicate resilience reduction (AR(1) increase), while negative ATE values indicate resilience enhancement (AR(1) decrease).

The average treatment effects show that temperature-related factors exert the strongest influences on vegetation resilience (Figure 8). TMX_mean has the largest resilience-reducing effect (ATE = +0.1111, 95% CI [0.0984, 0.1254], p < 0.001), whereas TMX_cv shows the largest absolute resilience-enhancing effect (ATE = −0.1712, 95% CI [−0.2363, −0.1125], p < 0.001). TMN_mean also significantly enhances resilience (ATE = −0.1251, 95% CI [−0.1335, −0.1161], p < 0.001). In comparison, moisture-, radiation-, CO₂-, and human-activity-related variables generally show weaker effects, although many remain statistically significant. For example, PRE_mean enhances resilience, whereas PRE_cv and PRE_AR(1) show small resilience-reducing effects. CO₂-related variables display mixed behavior, with CO₂_mean and CO₂_cv reducing resilience but CO₂_AR(1) enhancing it.

In summary, except for the TMX/TMN mean terms and TMX_cv, most ATE magnitudes are below 0.02 in absolute value, indicating that temperature exerts the dominant national-scale influence on resilience, whereas other drivers contribute secondary but statistically robust effects. A sensitivity experiment excluding the HF variable group yields nearly identical ATE distributions; these results are provided in Supplementary Material S1.

3.4.2. Pixel-Level Dominant Environmental Drivers

At the national scale, dominant drivers exhibited a clear temperature-dominated pattern (Figure 9a). TMX and TMN together dominated the vegetation resilience changes in most areas, accounting for 83.21% of pixels, followed by CO₂ (9.80%) and precipitation (6.45%). In contrast, radiation (0.43%), soil moisture and soil-related variables (0.10%), and VPD (0.001%) rarely constituted the dominant drivers.

Spatially, temperature dominance forms a broad and continuous national-scale background. CO₂ and precipitation dominance appear in more fragmented, patch-like distributions. CO₂ dominance is more concentrated along the southwestern plateau margins and mountainous regions, whereas precipitation dominance clusters in the northeastern and northern transitional zones.

The dominant drivers are primarily associated with mean-state information rather than variability or memory terms (Figure 9b). Nationally, mean-type variables account for 78.32% of dominant pixels, cv-type variables for 17.25%, and AR(1)-type variables for only 4.42%. Within climate drivers, dominance is overwhelmingly driven by mean and variability terms (climate_mean: 74.06%; climate_cv: 15.99%; climate_AR(1): 0.15%). In contrast, CO₂ dominance shows substantial participation of the memory component: CO₂_mean and CO₂_AR(1) each account for 4.27%, indicating that when CO₂ becomes dominant, its temporal autocorrelation structure frequently contributes alongside mean-state effects.

There were clear differences in dominant drivers among the eight terrestrial ecoregions, and these differences showed a broad north–south pattern (ColtemCNF → TemMIX → WarDCD → TempGRS → TempDES → QTP → SubtroDCD → TRF). Overall, temperature-related variables dominated in most ecoregions, but the leading temperature component varied across regions. In the northern and mid-latitude ecoregions, vegetation resilience was more often controlled by daytime maximum temperature. WarDCD was the clearest example, and ColtemCNF also showed strong TMX dominance. In contrast, TempGRS and TempDES were more strongly controlled by TMN, suggesting a greater role of minimum-temperature conditions in grassland and arid systems. TemMIX showed a different pattern; in this region, precipitation made a much larger contribution and, together with TMN, formed a more balanced temperature–moisture control. This suggests that ecosystems in this transitional zone are more sensitive to joint changes in water and heat conditions. On the Qinghai–Tibet Plateau, temperature still played the leading role, but the contribution of CO₂ was relatively high, indicating stronger CO₂ sensitivity in high-elevation ecosystems. TRF was the most distinct region. Although temperature-related variables remained important, the contribution of CO₂ was much higher than in other ecoregions and was close to that of temperature. Precipitation and soil moisture rarely acted as the dominant drivers there. The dominant variable types also differed strongly among ecoregions (Figure 9d). ColtemCNF was almost entirely dominated by mean-type variables, indicating that vegetation resilience in this region mainly responded to changes in average climate conditions. In contrast, the contribution of cv-type variables increased clearly in SubtroDCD and TRF. This suggests that climate variability itself more often becomes the dominant signal in humid southern ecosystems. AR(1)-type variables were most prominent in QTP and were also relatively important in TempDES and TRF. This pattern suggests that vegetation resilience in plateau and some arid systems is more strongly influenced by temporal memory effects and persistent processes. In summary, national-scale dominance is centered on temperature mean-state effects. Although CO₂ dominance accounts for roughly 10% nationally, it becomes substantially stronger in TRF and QTP and frequently involves temporal memory (AR(1)). Meanwhile, variability-type dominance increases in subtropical and tropical systems, highlighting differentiated dominant information structures between ecosystem types.

3.5. Causal Mechanism Reorganization on CD-NOTS

To characterize the breakpoint-related reorganization of driver–vegetation interactions, we analyzed CD-NOTS outputs in three dimensions: First, we examined dominant-driver reorganization, including changes in dominant-driver composition, transition matrices, and spatial hotspots of key transitions. Then, we quantified the structural reorganization of significant driver edges using edge-presence indicators derived from

b e s t l a g

, and we summarized keep/gain/loss/none categories and network connectivity (degree). Thirdly, we assessed response-timescale reorganization using lag shifts (

Δ l a g

) and their magnitude distributions for pixels with significant edges in both periods (keep pixels).

3.5.1. National-Scale Changes in Dominant Driver Composition and Transition Patterns

For each segment (pre- and post-breakpoint), we ran CD-NOTS independently and used the association-strength raster bestcoef (pre/post) to quantify the conditional association strength of each driver at its optimal lag (as indicated by bestlag_(pre/post)). At each valid pixel, the dominant driver was defined as the variable with the largest absolute conditional association magnitude among drivers showing a significant vegetation link; pixels without any significant driver–vegetation link were labeled as “none”. Unless otherwise stated, national-scale statistics were computed over all valid pixels (n = 4,654,452), with composition percentages additionally reported after normalization to pixels with definable dominant drivers in given segment; transition probabilities were computed on pixels with definable dominant drivers (definable pixels).

Before the breakpoint, the national-scale dominant-driver composition (excluding “none” and normalized over pixels with definable dominant drivers in the pre-breakpoint period; n = 3,907,436) exhibits a pattern of joint dominance by energy- and carbon–water-related controls (Figure 10c). SRAD (19.4%) is the most prevalent category, followed closely by CO₂ (19.2%) and precipitation (PRE; 18.8%), which together form the leading group. The remaining drivers contribute comparable shares, including VPD (10.8%), soil moisture (SM; 10.7%), maximum temperature (TMX; 10.5%), and minimum temperature (TMN; 10.4%).

After the breakpoint, the dominant-driver composition undergoes a clear reorganization (Figure 10c). In terms of coverage, the number of pixels with a definable dominant driver decreases from 3,907,436 (83.95%) in the pre-breakpoint period to 3,754,157 (80.66%) in the post-breakpoint period, and the “none” category increases accordingly from 16.05% to 19.34% (+3.29 percentage points). Considering definable pixels only (excluding “none”), the post-breakpoint composition shifts toward stronger water- and carbon-related dominance and weaker energy- and heat-related dominance: CO₂ increases from 19.2% to 22.4% (+3.2), PRE from 18.8% to 21.4% (+2.6), and VPD from 10.8% to 14.5% (+3.7). In contrast, SRAD declines from 19.4% to 15.3% (−4.1), and TMX decreases from 10.5% to 6.9% (−3.6), with smaller decreases for TMN (10.4% → 9.3%; −1.1) and SM (10.7% → 10.2%; −0.5).

The dominant mechanism displays a pronounced patchy–banded spatial pattern (Figure 10a,b). The agro-pastoral ecotone of northern China and the arid–semi-arid transition zone emerge as the most active hotspots of dominant-driver reorganization. In contrast, the southwestern plateau and adjacent mountainous regions exhibit more spatially coherent and contiguous dominance structures, primarily linked to SRAD, SM, and CO₂.

To quantify dominant-driver switching, we define the transition probability as

P_{i j} = \frac{C_{i j}}{N_{common}}

(15)

where

C_{i j}

is the count of pixels transitioning from category i (pre) to category j (post), and

N_{common}

is the number of pixels with definable dominant drivers in both stages. The transition matrix highlights pronounced reorganization of dominant drivers (Figure 11). Diagonal entries account for 25.83% of pixels, indicating that only about one-quarter retain the same dominant category across the breakpoint, whereas most pixels shift in dominant control.

Excluding self-transitions, SRAD → PRE (4.29%) is the largest single cross-category shift, meaning that a big number of pixels switch from radiation control to precipitation control. This aligns with the overall compositional change after the breakpoint—SRAD becomes less prevalent while PRE gains share. At the same time, PRE and VPD show strong two-way switching—PRE → VPD (3.61%) and VPD → PRE (2.58%)—reflecting alternating dominance between water-supply limitation and atmospheric evaporative demand across dry–wet transition zones. CO₂ and PRE also exchange frequently in both directions (CO₂ → PRE: 3.05%; PRE → CO₂: 2.63%), suggesting tight coupling—and mutual replacement—between hydroclimatic regulation and CO₂-related background effects across the breakpoint. In addition, TMN → CO₂ (2.75%) is another major pathway, indicating that, in some regions, CO₂-related control strengthens after the breakpoint and replaces minimum-temperature dominance in a non-trivial fraction of pixels.

To characterize the spatial concentration of driver switching, we applied a 2° × 2° grid-based hotspot counting and ranking analysis for the main transition types (Table 3). Hotspots of PRE → VPD are concentrated mainly within 112–118° E and 42–46° N. One representative grid cell at 116° E/44° N alone accounts for 8.76% of this transition, forming a continuous high-frequency belt from Inner Mongolia to the southern margin of northeast China. A secondary hotspot appears over the northern Loess Plateau (around 106° E, 36–38° N). The reverse shift, VPD → PRE, also clusters strongly along the same belt and in parts of the Loess Plateau, and the strong overlap between the two directions points to active back-and-forth switching between precipitation control and aridity control in this transition zone. Hotspots of SRAD → PRE show a clear two-center pattern: one in the high-latitude north (e.g., 116° E, 48–50° N) and another across several grids in the western plateau–arid region (e.g., 88–90° E, 28–30° N). This suggests that shifts from radiation dominance to precipitation dominance occur both along the northern transition belt and in parts of the plateau and nearby arid areas. In contrast, transitions into CO₂ dominance (e.g., TMX → CO₂, TMN → CO₂, SM → CO₂) are more scattered and multi-centered, spanning the Loess Plateau–Inner Mongolia belt, the northeastern Tibetan Plateau margin, the Yunnan–Guizhou Plateau and southwestern mountains, and parts of the eastern monsoon region. This implies that CO₂ dominance is expanding beyond a single climatic belt and manifests across multiple eco-climatic transition zones.

In summary, dominant drivers reorganize strongly across the breakpoint, with a stability (self-transition) of only 25.83%. Compositionally, SRAD and TMX lose importance after the breakpoint, while VPD, PRE, and CO₂ gain. Spatially, these changes are concentrated in major eco-climatic transition belts, especially Inner Mongolia–southern northeast China, the Loess Plateau, and the western plateau–arid regions (Figure 10 and Figure 11; Table 3).

3.5.2. Spatial Reorganization of Significant Driver Edges

We quantified driver-edge reorganization using the pixel-wise outputs of CD-NOTS run separately for the pre- and post-breakpoint segments. For each driver

d

, a significant edge (driver

\to

resilience) at a pixel was defined when CD-NOTS detected a significant link for that driver in the corresponding segment (bestlag_d

\neq

−1); otherwise, the edge was treated as absent. Analyses were restricted to the same valid-pixel domain as in Section 3.5.1 (skip_reason = 0,

N_{valid} = 4,654,452

). For each driver, we constructed binary edge-presence sets

E_{pre}^{d}

and

E_{post}^{d}

, and we classified the pixel-wise changes as keep (

E_{p r e} \cap E_{p o s t}

), gain (

E_{p o s t} ∖ E_{p r e}

), loss (

E_{p r e} ∖ E_{p o s t}

), and none (absent in both periods). The union (%) in Table 3 was computed as

∣ E_{p r e} \cup E_{p o s t} ∣ / N_{valid} \times 100

, while pct_keep/pct_gain/pct_loss are reported within the union set for each driver. Network connectivity (degree) was defined as the number of significant driver edges (across the seven drivers) connected to the resilience node at each pixel, and Δdegree was calculated as

{d e g r e e}_{post}

−

{d e g r e e}_{pre}

and then summarized by ecoregion.

At the national scale, significant driver-edge reorganization shows clear differences among drivers (Table 4). The share of pixels with at least one significant edge in either period (union) ranges from 30.87% to 45.96%. Precipitation (45.96%) and CO₂ (44.54%) have the broadest spatial coverage, followed by SRAD (42.14%), whereas TMX is much more limited (30.87%). In other words, precipitation- and carbon-related edges are detected across a wider portion of the study area, while temperature-related edges are confined to a smaller spatial domain.

Within the union set, the relative shares of keep, gain, and loss (Table 4) vary substantially across drivers. The keep proportion ranges from 10.40% to 29.47%, being the highest for CO₂ (29.47%) and precipitation (24.53%), intermediate for SRAD (20.56%), and lowest for TMX (10.40%), TMN (11.66%), and SM (11.93%). The lower keep values indicate weaker cross-breakpoint persistence and more pronounced reorganization in these drivers.

Comparing gain and loss, VPD, CO₂, and precipitation show an overall expansion tendency, whereas TMX, SRAD, TMN, and SM tend to contract. This pattern is better described as a redistribution within the driver network than as the uniform strengthening of any single driver. Expansion is concentrated in VPD-related, CO₂-related, and precipitation-related edges, whereas SRAD-related and TMX-related edges contract more strongly. At the same time, CO₂ and precipitation retain a relatively larger share of persistent edges across periods.

While the national statistics summarize the overall direction of change, Figure 12 shows where keep, gain, and loss are concentrated for each driver. For TMX, loss is the dominant class (53.57%) and keep is low (10.40%), indicating a broad post-breakpoint reduction in significant TMX-related edges. Spatially, loss is concentrated across northeast China and extends over large parts of eastern and southern China. TMN shows a similar contraction pattern. Loss (48.14%) is slightly higher than gain (40.20%), while keep remains low (11.66%). In the maps, gain and loss form alternating belt-like patterns across northern and northeastern China. Precipitation (PRE) has a relatively high keep proportion (24.53%), suggesting stronger cross-period persistence than most other drivers. A coherent retained cluster is visible in northeast China, whereas gain and loss are more scattered in other regions. VPD shows the clearest expansion signal. Gain reaches 46.38%, and the national increase in edge prevalence is the largest among all drivers. Newly emerged significant edges are mainly clustered in northeast China and the northern transition zone. Soil moisture (SM) tends to contract overall, with loss concentrated in southern and eastern China. Gain pixels are sparse and scattered, suggesting mostly localized adjustments rather than broad expansion. SRAD also shows marked contraction, with loss clearly exceeding gain and the strongest national decline in edge prevalence. Loss is concentrated in the southwestern mountains and along the eastern margin of the Tibetan Plateau. CO₂ combines relatively high keep (29.47%) with an overall expansion tendency. Gain pixels are concentrated in southeastern and southern China, while retained edges remain widely distributed.

As shown by the driver-specific keep/gain/loss patterns in Figure 12, these shifts are not spatially uniform, motivating a further comparison across the eight terrestrial ecoregions. To compare how these national-scale shifts vary among ecosystems, we examined edge reorganization and changes in network connectivity (Δdegree) across the eight terrestrial ecoregions. Here, degree was defined as the number of significant driver edges connected to the resilience node (across the seven candidate drivers) at each pixel, and Δdegree was calculated as the post-breakpoint degree minus the pre-breakpoint degree, and then summarized at the ecoregion level. Edge reorganization is evident in all ecoregions, but its magnitude differs; ColtemCNF and TemMIX show stronger restructuring, whereas TempGRS is comparatively weaker.

Network connectivity exhibits pronounced heterogeneity across ecoregions (Figure 13). After the breakpoint, connectivity declines in most regions, with the largest decreases observed in TRF (−0.322) and QTP (−0.307), indicating substantial network sparsification. SubtroDCD (−0.070) and TempDES (−0.052) show only modest reductions, whereas WarDCD is the only ecoregion displaying increased connectivity (+0.187), suggesting a trend toward network densification.

Notably, connectivity changes are not uniform across regions. In ColtemCNF and TemMIX, overall connectivity remains nearly stable (Δdegree close to zero), indicating that structural adjustments occurred without large-scale loss of connections. In contrast, TRF and QTP exhibit clear connectivity declines accompanied by pronounced sparsification.

In summary, the driver–response network undergoes substantial but uneven structural shifts following the breakpoint. Some regions maintain relatively stable connection patterns, while others experience marked reductions in connectivity. These contrasting responses indicate that ecoregions follow distinct structural pathways under breakpoint influence. The breakpoint is therefore associated not only with changes in dominant drivers but also with broader spatial differentiation in the structural configuration of driver–response networks across ecosystems.

3.5.3. Driver–Response Timescale Reorganization and Lag-Shift Patterns

Using the optimal lag order (best lag) determined separately for the pre- and post-breakpoint periods, lag responses were evaluated for valid pixels. Each driver was subsequently classified into one of six categories (Figure 14): stable decrease (significant in both periods with Δlag < 0), stable no change (Δlag = 0 in both periods), stable increase (Δlag > 0 in both periods), gained significance (not significant before but significant after the breakpoint), lost significance (significant before but not after), or never significant (non-significant in both periods).

The sum of the three stable classes represents pixels that remain statistically significant in both periods (hereafter referred to as the stable significant zone). Lag-direction comparisons (Δlag) are meaningful only within this intersection set. Gained and lost significance describe turnover in statistical detectability, whereas never significant represents the dominant background, where no significant causal edge is identified in either period.

At the national scale, never significant ranges from 54.04% to 69.13% across drivers (mean: 61.49%). TMX shows the highest proportion (69.13%), whereas PRE shows the lowest (54.04%). This indicates that, for most regions and most drivers, no statistically significant causal edge is detected in either period. Consequently, lag comparisons are restricted to a relatively limited subset of stable significant pixels (Figure 14a–g).

At the national scale, the proportions of gained significance and lost significance generally exceed that of the stable significant zone, indicating that breakpoint-related restructuring is primarily expressed as changes in statistical significance rather than lag adjustment. Gained significance ranges from 11.12% to 17.86% (mean: 14.95%), with the highest values for PRE (17.86%) and VPD (17.72%), and the lowest for TMX (11.12%). Lost significance ranges from 14.63% to 19.50% (mean: 16.40%), peaking in SRAD (19.50%) and reaching its minimum in VPD (14.63%). Most drivers exhibit lost ≥ gained (e.g., SRAD, TMN, TMX), whereas VPD and (to a lesser extent) PRE show a slight dominance of gained significance.

In contrast, the proportion of the stable significant zone is relatively small, ranging from 3.21% to 13.13%, with substantial inter-driver variability. CO₂ accounts for the largest area of stable significance (13.13%), followed by precipitation (11.27%), while SRAD occupies an intermediate position (8.66%). In contrast, TMX shows the smallest stable area (3.21%), with SM (3.97%) and TMN (4.03%) remaining comparatively limited.

Interpretation of lag direction (Δlag < 0, =0, or >0) applies only to pixels that are significant in both periods. Accordingly, conclusions regarding lag reconfiguration refer to a spatially restricted subset.

Spatial patterns further reveal three recurring structural configurations across drivers:

(1): Significance contraction pattern (e.g., TMX and SRAD).

For these drivers, lost pixels equal or exceed gained pixels, and stable significant areas remain limited. Breakpoint-related change is therefore expressed mainly as contraction of statistically detectable edges, rather than as adjustment of lag. In TMX, never-significant pixels dominate (69.13%), the proportion of lost pixels (16.54%) exceeds that of gained pixels (11.12%), and the stable zone is small (3.21%). SRAD shows a similar tendency, with the highest lost proportion among drivers (19.50%). In both cases, lag-direction analysis is restricted by the limited spatial extent of stable significant pixels.

(2): Pronounced lag heterogeneity (e.g., TMN and SM).

Although stable significant coverage is small (TMN: 4.03%; SM: 3.97%), lag adjustment within that subset is substantial. For TMN, negative (39.45%) and positive (40.38%) Δlag are nearly balanced, while unchanged lag accounts for 20.17%, indicating strong but bidirectional reorganization. SM follows a comparable pattern, with Δlag = 0 representing 25.20% and a high change_intensity (0.748). In both cases, the lag structure shifts considerably without a clear dominant direction.

(3): Expansion with relative stabilization (CO₂, PRE, and VPD).

These drivers combine relatively broad, stable significant zones with higher shares of unchanged lag. CO₂ exhibits the largest stable area (13.13%) and the highest proportion of Δlag = 0 within that area (53.94%), together with the lowest change_intensity (0.461). PRE shows a similar configuration (stable zone: 11.27%; Δlag = 0: 49.44%), with a slight tendency toward negative lag shifts. VPD displays moderate stable coverage (5.85%), but gained pixels exceed lost pixels, suggesting that change is expressed primarily through the expansion of statistical detectability rather than substantial lag adjustment.

Across drivers, breakpoint-related restructuring is more strongly reflected in shifts in statistical significance than in lag modification. Lag reorganization is concentrated in pixels that remain significant in both periods. CO₂ and precipitation show the greatest spatial persistence and relatively stable temporal alignment, whereas temperature- and soil-moisture-related drivers exhibit stronger but more spatially restricted lag restructuring.

4. Discussion

4.1. Spatial Vulnerability Patterns and Breakpoint Clustering of AR(1)-Based Resilience

The spatial pattern of AR(1), used here as a proxy for vegetation resilience, exhibited clear spatial differentiation. Lower AR(1) values were generally observed in humid southern regions and parts of southwestern China, whereas elevated values formed clustered and banded structures across the northern mid–high-latitude transition belt and several inland areas. Recent studies have increasingly used kNDVI-based time series to characterize vegetation resilience in China and surrounding dryland–forest transition systems. For example, research on the Loess Plateau showed that resilience derived from kNDVI and AR(1) did not increase consistently with vegetation greening, indicating that higher greenness does not necessarily imply stronger resilience [9]. Similar kNDVI-based resilience analyses on the Mongolian Plateau, in subtropical evergreen forests of southern China, and in southwest China have also reported pronounced spatial heterogeneity and region-dependent shifts in resilience dynamics [11,51,52]. These findings are broadly consistent with our results, in which vegetation state and resilience show related but non-equivalent spatial patterns, while breakpoint timing and post-break trajectories differ substantially among ecoregions.

The banded aggregation of elevated AR(1) values in northern China is best explained by the combined effect of hydroclimatic constraints and amplified interannual variability. Extensive research indicates that semi-arid and transitional ecosystems are highly sensitive to precipitation deficits and atmospheric drought, often exhibiting pronounced vegetation responses at relatively short drought timescales [21,53]. These regions also play a disproportionate role in regulating the interannual variability in the global terrestrial carbon sink, implying that their ecological dynamics are more tightly coupled with climate fluctuations [54,55]. Consistent with this interpretation, northern ecoregions such as ColtemCNF and TemMIX show not only prominent breakpoint peaks during 2012–2015 in the breakpoint heatmap (Figure 5) but also localized resurgence in later years (e.g., 2021). This pattern likely reflects that high-latitude forest and transition systems may experience delayed or multi-phase breakpoint clustering. Accordingly, the elevated AR(1) signals identified in the northern transition zone are most plausibly explained by semi-arid sensitivity combined with amplified interannual variability. In contrast, interpreting resilience signals in high-biomass regions such as TRF requires greater emphasis on relative changes and multi-source validation, including joint interpretation with subsequent analyses of mechanism–network reorganization.

Consistent with this interpretation, northern ecoregions such as ColtemCNF and TemMIX show not only prominent breakpoint peaks during 2012–2015 in the breakpoint heatmap (Figure 5) but also localized resurgence in later years (e.g., 2021). This pattern likely reflects that high-latitude forest and transition systems may experience delayed or multi-phase breakpoint clustering. Accordingly, the elevated AR(1) signals identified in the northern transition zone are most plausibly explained by semi-arid sensitivity combined with amplified interannual variability. In contrast, interpreting resilience signals in high-biomass regions such as TRF requires greater emphasis on relative changes and multi-source validation, including joint interpretation with subsequent analyses of mechanism–network reorganization.

This pattern aligns with the documented intensification of compound climate extremes in China in recent decades, including heatwave–drought interactions and abrupt dry–wet transitions. For example, comprehensive analyses of the strong 2015–2016 El Niño event report increased drought frequency in northeastern China (summer 2015), northwestern China (spring 2016), and across large areas during the winter of 2015 [56]. Meanwhile, the prolonged and severe drought in southwestern China during 2009–2010 had significant effects on vegetation indices and productivity [57].

4.2. Interdecadal Shifts in GCCM Coupling Signals

For screening plausible causal signals, GCCM indicates that multiple climatic and environmental variables exhibit relatively strong and statistically significant convergence in the driver → AR(1) direction. Precipitation, TMN, TMX, VPD, soil moisture, shortwave radiation, and CO₂ generally show ρ values > 0.3, with significant convergence as the library size increases. Topographic and soil variables (DEM, clay, and SOC) show moderate coupling strength. Although the human footprint (HF) yields lower ρ values and less consistent bidirectional patterns, it was retained as an indicator of anthropogenic pressure to facilitate comparison with subsequent effect-estimation results [43].

Temperature-related metrics exhibit the most pronounced declines, particularly the mean, coefficient of variation, and AR(1) of TMX. The AR(1) of VPD also decreases substantially. Radiation- and CO₂-related statistics show overall weakening. In comparison, precipitation-related variability increases slightly. Particularly, temperature-related coupling and atmospheric drought-related memory coupling appear to weaken, whereas variability-related water constraints and background soil controls become relatively more prominent. This pattern is consistent with previous evidence suggesting a broad-scale shift from energy limitation toward water limitation in many temperate and high-latitude ecosystems of the Northern Hemisphere [58,59], as well as with studies documenting inhibitory effects of increasing VPD on vegetation growth, photosynthesis, and interannual variability in terrestrial carbon sinks. However, GCCM-derived ρ reflects nonlinear coupling strength and directional clues in spatial cross-sections, and it should not be interpreted as direct proof of net causal effect magnitude under controlled confounding [60,61,62,63].

Comparison with the dominant-driver shifts derived from CD-NOTS (Figure 10c) suggests partial convergence across methods. Although the two approaches target different ecological dimensions, both are anchored in vegetation system dynamics. Specifically, CD-NOTS characterizes shifts in the dominant controls on vegetation state as reflected by kNDVI, whereas GCCM and the subsequent effect-estimation framework focus on changes in the coupling structure and estimated effects associated with vegetation resilience, for which AR(1) is used here as a proxy. In this context, the broad consistency in directional change before and after the breakpoint is still informative: TMX-, TMN-, and SRAD-related signals tend to weaken under both approaches, whereas increases in precipitation- and VPD-related dominance appear to be linked mainly to stronger variability or background-state coupling. These cross-method consistencies support the interpretation of stage-specific reorganization of vegetation–climate relationships, while also highlighting clear methodological boundaries [63,64].

4.3. DML–Causal Forest Estimates of Conditional Net Effects

4.3.1. Background Climatic Controls and Regional Divergence of Dominant Mechanisms

At the national scale, vegetation resilience responses are most strongly associated with the thermal background state. The multi-year mean of maximum temperature (TMX_mean) shows a significant positive effect on AR(1), is associated with higher AR(1) values and, under the AR(1)-based proxy framework, is consistent with slower system adjustment. In contrast, the multi-year mean of minimum temperature (TMN_mean) exhibits a negative effect on AR(1), is associated with lower AR(1) values, and may be consistent with faster adjustment under the resilience-proxy interpretation [10].

In contrast, higher minimum temperatures may reduce cold stress and extend the effective growing season in temperature-limited regions, thereby lowering temporal autocorrelation and accelerating recovery. This interpretation is consistent with studies showing that diurnal asymmetric warming can exert contrasting ecological effects, with daytime heat stress often constraining productivity, while moderate nighttime warming may relieve low-temperature limitations [65,66]. Previous global-scale analyses of forest resilience have emphasized the role of heat stress and evaporative demand in shaping resilience decline under ongoing warming [10]. Experimental and observational studies further highlight the importance of evaporative demand and atmospheric dryness in regulating vegetation productivity [67,68].

The strong negative national effect of the TMX variability metric (TMX_cv; −0.1712) warrants a more careful interpretation. Decomposition of weighted contributions across ecoregions shows that the Tropical Monsoon Rainforest (TRF) contributes −0.1520, accounting for approximately 88.8% of the total national negative effect, whereas other regions—such as TempGRS (−0.0109) and TemMIX (−0.0063)—contribute comparatively little. The national mean effect therefore reflects pronounced spatial heterogeneity rather than a uniformly distributed response. A more appropriate formulation is that the impact of the TMX variability indicator on AR(1) is strongly ecoregion-dependent, with the aggregate signal largely driven by TRF, where water-related variables are rarely identified as dominant drivers in the present effect-estimation framework. Instead, ecosystem dynamics are more sensitive to variability in radiation, cloud cover, and monsoon-driven alternations between overcast and high-radiation conditions. Under these conditions, the coefficient-of-variation metric of TMX may function partially as a proxy for high-frequency atmospheric forcing and radiative fluctuations. Stronger high-frequency external forcing can modify the statistical structure of vegetation residuals (after STL decomposition), potentially reducing temporal autocorrelation. In such cases, a reduced AR(1) may reflect changes in residual memory structure rather than a genuine acceleration of ecological recovery. Methodological considerations further support this cautious interpretation. Dense, high-biomass tropical forests are prone to optical index saturation and complex noise structures. AR(1)-based resilience proxies derived from remotely sensed vegetation indices may therefore be more sensitive to observational noise and external variability in these systems [10]. Consequently, the pronounced negative coefficient of TMX_cv in TRF may partially capture changes in residual time-memory properties induced by external high-frequency forcing, rather than a substantial enhancement of intrinsic ecosystem resilience.

Pixel-level dominant-driver attribution provides additional insight into spatial control patterns. At the national scale, temperature-related mechanisms account for the majority of dominant effects. Metrics linked to maximum and minimum temperature together represent 83.21% of the dominant contributions, followed by CO₂-related mechanisms (9.80%) and precipitation-related mechanisms (6.45%). Across all dominant metrics, background climatic states constitute the primary control dimension (78.32%), whereas variability regimes account for 17.25%, and autocorrelation structures contribute only 4.42%. Ecoregion-level decomposition further reveals regional pathway divergence underlying the national-scale patterns. WarDCD is characterized by an exceptionally high TMX dominance (74.51%), indicating that vegetation in this warm, dry region is more directly affected by excessive daytime temperature. In contrast, TemMIX shows a much larger precipitation contribution (28.85%), suggesting that vegetation dynamics there are more strongly regulated by moisture availability and hydroclimatic fluctuations. In QTP, the higher share of CO₂ dominance (12.98%) may be associated with the particular environmental sensitivity of alpine ecosystems, where gradual background changes can have amplified ecological effects. TRF shows the strongest CO₂ signal (31.22%), which may reflect the fact that in humid forest systems, where direct water limitation is weaker, long-term background forcing related to CO₂ becomes relatively more prominent than short-term hydrothermal constraints. These spatial contrasts align with two established lines of evidence: First, global greening attribution studies highlight the importance of CO₂ fertilization in enhancing long-term vegetation productivity [20]. Second, the spatial partitioning of limiting factors (energy vs. water limitation) implies that identical external forcing may manifest through distinct pathways across ecosystems [60,61,62,69].

4.3.2. Robustness of Causal Estimation

Independent subsample validation shows close agreement between training and test ATEs across treatments. Placebo permutation tests provide further evidence, as randomly shuffled treatments produce effect estimates centered near zero (Supplementary Material S2, Table S3). Spatial block bootstrap analyses across aggregation scales from 75 to 400 km reveal a clear hierarchy of robustness [70,71]. A substantial group of core variables—including thermal metrics (TMX-related metrics and TMN_mean), CO₂ metrics (mean, variability, and autocorrelation), VPD variability, precipitation mean and autocorrelation, soil moisture variability, radiation autocorrelation, and the variability/persistence components of HF—remain significant at both the 95% and 99% confidence levels across all tested block sizes. Their stability across scales suggests that these effects are not driven by localized clustering or residual spatial autocorrelation but instead reflect broadly distributed structural controls (Supplementary Material S2, Table S5).

Other variables exhibit progressive attenuation as spatial aggregation becomes coarser, pointing to regionally concentrated rather than nationwide influence. Precipitation variability (PRE_cv), for example, remains significant at both the 95% and 99% levels up to 250 km, weakens to 95%–only significance at 300 km, and becomes non-significant at 350–400 km. A similar pattern appears for the radiation mean (SRAD_mean), which loses significance under coarser aggregation. The soil moisture mean (SM_mean) represents the most marginal case: it is only weakly significant at the primary 200 km scale (95% but not 99%, p ≈ 0.047) and becomes non-significant at ≥250 km. These patterns indicate that attenuation under larger block sizes is concentrated in secondary hydroclimatic background metrics and reflects limited effect magnitude and spatial concentration, rather than instability of the overall causal framework (Supplementary Material S2, Table S5).

Multiple-testing correction using the Benjamini–Hochberg false discovery rate (BH-FDR) procedure [72] further confirmed that the principal effects remain statistically robust after controlling for multiplicity (Supplementary Material S2, Table S4).

In summary, these robustness analyses demonstrate that the identified causal effect structure is not an artifact of model specification, sample partitioning, or spatial dependence assumptions. Vegetation resilience dynamics, as proxied by AR(1), exhibit a stable climate-dominated control architecture across samples and spatial scales, with thermal and carbon-related drivers showing the strongest structural autocorrelation, consistent with recent global-scale resilience evidence [10].

4.3.3. Anthropogenic Effects and Metric Sensitivity

In contrast to the robustness assessment above, the evaluation of anthropogenic influence addresses a distinct mechanistic question. The GCCM framework did not detect significant convergent causal relationships between human footprint (HF) variables and AR(1). However, within the DML–Causal Forest framework—after adjusting for climatic and environmental covariates—certain HF metrics exhibit weak but statistically detectable net effects.

This apparent discrepancy arises from methodological differences in causal identification. GCCM (and related convergent cross-mapping approaches) relies on state-space reconstruction and convergence properties, detecting dynamic embedding-based causal dependence and, therefore, favoring strong dynamical coupling relationships [73]. In contrast, Causal Forest estimates conditional average treatment effects under controlled covariate adjustment, capturing conditional net effects after covariate adjustment. A variable may lack strong dynamic reconstructability while still exerting a weak but consistent conditional statistical effect.

Specifically, HF_mean remains non-significant at the national scale, whereas HF_cv shows a small resilience-enhancing effect and HF_AR(1) produces a slight resilience-reducing effect. Importantly, these effects are approximately two orders of magnitude smaller than those associated with dominant thermal drivers; thus, even when statistically detectable, the HF-related effects are structurally marginal relative to background climatic forcing.

This interpretation is also consistent with previous evidence showing that anthropogenic influences on vegetation dynamics can be substantial in some regions but are highly pathway- and region-dependent, with both positive and negative effects depending on land-use management, restoration, and disturbance regimes [74,75]. In summary, the differences between GCCM and Causal Forest are better viewed as complementary rather than contradictory.

4.4. CD-NOTS Evidence for Breakpoint-Associated Causal Network Reorganization

4.4.1. Dominant Drivers Switching Across Breakpoints

Within the set of pixels where dominant drivers are identifiable in both periods, the sum of diagonal entries in the transition matrix is 25.83%, exhibiting a change in dominant-driver category across the breakpoint.

Among off-diagonal transitions, SRAD → PRE(4.29%) represents the largest single pathway. Bidirectional exchanges between VPD and PRE (PRE → VPD: 3.61%; VPD→ PRE: 2.58%) suggest recurring shifts between precipitation supply and atmospheric demand constraints. Transitions between PRE and CO₂ are also notable (CO₂ → PRE: 3.05%; PRE→ CO₂: 2.63%), which may reflect redistribution between water limitation and background carbon forcing.

These transition patterns can be more plausibly interpreted as shifts in dominant constraint signals rather than simple strengthening or weakening of individual drivers. The relatively frequent PRE → SRAD transitions suggest reorganization between water- and energy-limited regimes [62]. In northern transitional zones, two-way substitution between VPD and PRE may indicate alternating dominance of supply- vs. demand-side water constraints—a pattern that is compatible with the documented role of VPD in regulating vegetation growth, photosynthesis, and carbon sink variability [61]. The spatial clustering of dominant transitions in transition belts and plateau margins may further reflect sensitivity to regional climate background and extreme-event exposure.

4.4.2. Edge Reorganization and Pathway Rewiring

Across drivers, edge reorganization is widespread, but the balance among keep, gain, and loss differs clearly. Within the union set (pixels with at least one significant edge in either period), CO₂ (29.47%) and precipitation (24.53%) show the highest keep proportions, indicating relatively stronger cross-breakpoint persistence. SRAD is intermediate (20.56%), whereas TMX (10.40%), TMN (11.66%), and SM (11.93%) show much lower keep values, suggesting that temperature- and soil-moisture-related edges are more frequently replaced after the breakpoint.

Differences in the gain–loss balance further indicate contrasting directions of reorganization among drivers. VPD shows the highest gain share (46.38%) and a larger gain than loss (38.31%), pointing to the expansion of statistically detectable VPD-related edges in many regions. CO₂ also shows relatively strong persistence together with substantial gain (36.97%) and lower loss (33.56%), while precipitation combines high keep (24.53%) with a near-balanced but slightly gain-dominant pattern (38.87% gain vs. 36.60% loss). In contrast, TMX is strongly loss-dominated (53.57% loss vs. 36.02% gain), and SRAD also shows a clear loss-dominant pattern (46.29% vs. 33.15%). TMN (48.14% vs. 40.20%) and SM (47.21% vs. 40.86%) follow the same general tendency.

These patterns support a post-breakpoint redistribution of inferred driver–response pathways rather than uniform strengthening of a single control factor. In particularly, water- and carbon- related pathways (PRE/VPD/CO₂) are more often retained or gained, whereas radiation- and TMX-related edges are more often lost or replaced. This interpretation is broadly consistent with evidence for increasing water limitation [69], and with studies showing that elevated VPD can suppress vegetation productivity and carbon uptake [76,77,78].

4.4.3. Reorganization of Driver–Response Timescales

CO₂-related and precipitation-related links show larger stable–significant domains and higher lag persistence, implying that some broad-scale driver–response linkages remain relatively temporally coherent even when the surrounding network structure is rewired.

This pattern is ecologically plausible because vegetation responses to climate forcing are well known to be timescale-dependent and biome-specific, rather than being governed by a single universal response window. Global evidence has shown that vegetation’s sensitivity to drought depends on characteristic drought timescales across biomes, with different systems responding preferentially to short vs. longer drought persistence [21]. In parallel, timescale dependence has also been emphasized in Earth-observation-based vegetation productivity relationships, indicating that apparent driver–response linkages may differ substantially between seasonal, interannual, and longer timescales and should not be assumed invariant [22].

The VPD-related and heat-related lag reorganization is also noteworthy in the context of warming-induced atmospheric demand. Previous global studies have shown that rising VPD can suppress vegetation growth and offset part of the greening signal or CO₂ fertilization gains [60]. More recently, evidence from compound hot–dry events indicates that elevated VPD can prolong ecosystem recovery time and, in some contexts, outweigh the role of low soil moisture in explaining recovery-time differences, especially in drylands [79]. These findings provide a useful ecological interpretation for why VPD/TMX pathways in our study often exhibit strong significance turnover and localized lag reconfiguration around breakpoints.

The lag-shift results should be interpreted cautiously. In this study, lag categories were derived from the lag among significant edges under the CD-NOTS framework. A transition from “stable no change” to “loss” (or “gain”) does not necessarily mean that the ecological process vanished (or newly appeared) in a strict mechanistic sense; it may also indicate that the process became weaker, redistributed across correlated pathways, or moved outside the tested lag range under a changed background state. This is precisely why the lag analysis is most informative when interpreted jointly with dominant-driver transitions and edge turnover, rather than in isolation. Together, these results suggest that breakpoint periods in China’s vegetation resilience may be characterized by multi-dimensional causal reorganization rather than a single monotonic intensification of climate stress.

4.5. Coupled Evidence Across GCCM, DML–Causal Forest, and CD-NOTS

Geographical CCM, DML–Causal Forest, and CD-NOTS address complementary causal targets under different data windows and assumptions. GCCM is applied to multiple temporal snapshots and provides predictability-based, directional coupling clues in spatial cross-sections, making it useful for screening plausible drivers and diagnosing stage-dependent shifts in coupling centrality. DML–Causal Forest, in contrast, estimates conditional net effects under explicit covariate adjustment over the full study period, allowing direct comparison of effect magnitudes across drivers and across statistical descriptors (mean/CV/AR(1)), while accounting for nonlinearities and treatment–effect heterogeneity. CD-NOTS further targets non-stationary mechanism changes by learning lagged causal structures separately before and after breakpoints, thereby quantifying dominant-driver switching, edge turnover, and lag-scale reorganization that cannot be captured by time-invariant effect summaries.

Despite their different estimands, several consistent patterns emerge when the three lines of evidence are interpreted jointly. First, temperature-related controls remain central at the national scale: both screening and effect estimation highlight the importance of thermal background states, while CD-NOTS indicates that temperature-related pathways are among those with strong post-breakpoint turnover, suggesting that thermal dominance can coexist with substantial structural rewiring. Second, multiple results converge on a breakpoint-related redistribution toward water- and carbon-related regulation. GCCM shows interdecadal weakening in temperature-related coupling and relatively stronger water-related variability signals, while CD-NOTS documents a post-breakpoint increase in precipitation/VPD/CO₂ dominance and widespread dominant-driver replacement. Third, transition belts emerge as mechanistically sensitive hotspots: dominant-driver switching and bidirectional substitutions between precipitation supply and atmospheric demand are spatially clustered in agro-pastoral and arid–semi-arid transition zones, consistent with the notion of limiting-factor switching under non-stationary climate backgrounds.

Importantly, discrepancies across methods are expected and diagnostically meaningful. High GCCM coupling does not necessarily imply a large controlled net effect, and a stable average net effect can coexist with major network rewiring if the dominant pathways or lag structures shift across regimes. Therefore, the most defensible interpretation is based on triangulation: GCCM provides directional screening and stage-dependent coupling context, DML–Causal Forest quantifies conditional net contributions and their heterogeneity, and CD-NOTS reveals how these mechanisms reorganize across breakpoint-defined regimes.

4.6. Limitations

AR(1) is a statistical proxy motivated by the critical slowing down framework, not a direct measurement of ecological recovery. Changes in AR(1) may track recovery-related dynamics, but they can also be influenced by preprocessing choices, external forcing, and the temporal scale of analysis. The conclusions in this study therefore concern shifts in resilience-proxy signals rather than definitive changes in ecological resilience itself.

Breakpoint detection and causal effect estimates are also sensitive to model specification and variable coverage. Rolling-window design, BFAST parameter settings, spatial blocking strategies, and the set of available drivers can affect both breakpoint timing and estimated effect magnitude. Unobserved disturbances and measurement error may still contribute to uncertainty.

In addition, GCCM, CD-NOTS, and DML–Causal Forest address different aspects of the system—nonlinear coupling, structural reorganization, and conditional net effects, respectively. Unlike conventional regression, which is mainly used to characterize associations, these methods are better suited to the exploration of potential causal mechanisms. As a result, agreement across methods supports the broader interpretation, whereas discrepancies are not unexpected and should be understood in light of their different inferential aims.

These considerations limit the scope of interpretation, but they do not change the main empirical pattern observed in this study: the resilience proxy exhibits stage-wise transitions and structured driver reorganization at the national scale. Further validation using multi-source observations and process-based models would improve confidence in the ecological interpretation.

5. Conclusions

At a 1 km resolution across China (2000–2024), this study combines vegetation-state dynamics, AR(1)-based resilience-proxy analysis, breakpoint detection, causal screening, counterfactual estimation, and non-stationary structural learning within a unified framework; several findings emerge:

(1): Spatial asymmetry and prevailing resilience decline

Vegetation greenness follows a broad southeast–northwest gradient, whereas AR(1) shows a different spatial organization, with higher temporal autocorrelation concentrated in northern transitional belts and parts of inland China. Although greenness and AR(1) generally conform to a “high greenness–low AR(1)” pattern, their relationship is clearly nonlinear, indicating asymmetric combinations of vegetation state and recovery dynamics under different hydrothermal backgrounds.

Nationwide trend analysis further shows that significant increases in AR(1) (i.e., weakening of the resilience proxy) occur more widely than decreases, suggesting that resilience decline has been more prevalent than resilience enhancement during the study period.

(2): Stage-wise transitions during 2010–2018

Breakpoint analysis identified 2010–2018 as the main transition window, with substantial differences in timing among the eight terrestrial ecoregions. Northern and transitional regions show clearer stage-like shifts, whereas humid southern regions tend to exhibit more gradual adjustment. Taken together, these patterns support a stage-wise transition process rather than a uniform long-term drift.

(3): Temperature-dominant attributes with asymmetric thermal effects

Results from causal screening and counterfactual estimation consistently point to temperature-related mean-state components—especially the mean of maximum temperature—as the strongest national-scale controls on the resilience proxy. At the interannual scale, mean-state thermal forcing has a larger influence than variability (CV) or autocorrelation (AR(1)) metrics.

At the same time, TMX and TMN do not act symmetrically: TMX is more often associated with resilience-proxy weakening, whereas TMN is linked to resilience enhancement. Precipitation and CO₂ contribute secondary but regionally differentiated effects. Pixel-level dominant-driver patterns also show marked ecoregional heterogeneity under an overall temperature-dominant background.

(4): Post-breakpoint reorganization dominated by significance turnover

Across breakpoints, driver–response structures undergo substantial reorganization. Replacement of dominant drivers is widespread, and edge turnover exceeds stable retention for most drivers. In most cases, change is expressed more strongly through gain/loss of statistically significant edges than through systematic shifts in lag direction. Lag reorganization is concentrated in a smaller subset of pixels that remain significant in both periods, with CO₂ and precipitation showing comparatively persistent temporal alignment.

Overall, the results suggest spatial resilience-proxy patterns, stage-wise transitions, driver attribution, and post-breakpoint structural reorganization within a single empirical framework. They also highlight resilience-sensitive regions—particularly northern transitional belts—and identify key periods of system adjustment.

Our finding that background-state variables dominate over variability is consistent with previous studies suggesting that ecosystem resilience is primarily constrained by long-term environmental conditions rather than short-term fluctuations.

From a methodological perspective, the combined use of GCCM (directional coupling), DML-Causal Forest (conditional net effects), and CD-NOTS (non-stationary structural learning) provides a reproducible workflow for remote-sensing-based resilience analysis. These methods address different inferential targets and are best interpreted as complementary.

These findings suggest that ecological monitoring and adaptive management should prioritize resilience-sensitive transition belts in northern China and other hotspot regions, with particular attention to persistent daytime heat stress, shifts between precipitation supply and atmospheric water-demand constraints, and breakpoint periods characterized by rapid driver reorganization.

Future work could strengthen the ecological interpretation by validating AR(1)-based inference against independent ecosystem functional indicators, explicitly linking transition windows to compound climate extremes, and moving from structural diagnostics toward pathway-level mechanistic testing.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/f17040443/s1.

Author Contributions

Conceptualization, Q.D.; methodology, Q.D. and S.W.; software, S.W.; validation, S.W.; formal analysis, S.W.; investigation, S.W.; resources, Q.D.; data curation, S.W.; writing—original draft preparation, S.W.; writing—review and editing, Q.D.; visualization, S.W.; supervision, Q.D.; project administration, Q.D.; funding acquisition, Q.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Collaborative Innovation Center of Geo-Information Technology in Zhongyuan (Henan Province) and the Key Laboratory of Spatiotemporal Perception and Intelligent Processing, Ministry of Natural Resources (Grant No. 231201), as well as the National Key Research and Development Program of China (Grant No. 2022YFC3005704).

Data Availability Statement

All datasets were obtained from publicly available remote-sensing products.

Acknowledgments

During the preparation of this work, a generative AI tool (ChatGPT-5) was used for language polishing, grammar correction, and readability improvement. The authors carefully reviewed and edited the output and take full responsibility for the content of the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest regarding this study.

Abbreviations

The following abbreviations are used in this manuscript:

TempDES	Temperate Desert
TempGRS	Temperate Grassland
ColtemCNF	Cold–Temperate Coniferous Forest
TemMIX	Temperate Mixed Forest
QTP	Qinghai–Tibet Plateau Alpine Vegetation
WarDCD	Warm–Temperate Deciduous Forest
SubtroDCD	Subtropical Evergreen Forest
TRF	Tropical Monsoon Rainforest
GCCM	Geographical Convergent Cross-Mapping
CD-NOTS	Constraint-Based Causal Discovery from Non-Stationary Time Series

References

Pimm, S.L. The Complexity and Stability of Ecosystems. Nature 1984, 307, 321–326. [Google Scholar] [CrossRef]
Holling, C.S. Resilience and Stability of Ecological Systems. Annu. Rev. Ecol. Syst. 1973, 4, 1–23. [Google Scholar] [CrossRef]
Yi, C.; Jackson, N. A Review of Measuring Ecosystem Resilience to Disturbance. Environ. Res. Lett. 2021, 16, 053008. [Google Scholar] [CrossRef]
Scheffer, M.; Bascompte, J.; Brock, W.A.; Brovkin, V.; Carpenter, S.R.; Dakos, V.; Held, H.; Van Nes, E.H.; Rietkerk, M.; Sugihara, G. Early-Warning Signals for Critical Transitions. Nature 2009, 461, 53–59. [Google Scholar] [CrossRef]
Dakos, V.; Kéfi, S. Ecological Resilience: What to Measure and How. Environ. Res. Lett. 2022, 17, 043003. [Google Scholar] [CrossRef]
Bathiany, S.; Bastiaansen, R.; Bastos, A.; Blaschke, L.; Lever, J.; Loriani, S.; De Keersmaecker, W.; Dorigo, W.; Milenković, M.; Senf, C.; et al. Ecosystem Resilience Monitoring and Early Warning Using Earth Observation Data: Challenges and Outlook. Surv. Geophys. 2025, 46, 265–301. [Google Scholar] [CrossRef]
Smith, T.; Boers, N. Reliability of Vegetation Resilience Estimates Depends on Biomass Density. Nat. Ecol. Evol. 2023, 7, 1799–1808. [Google Scholar] [CrossRef]
Camps-Valls, G.; Campos-Taberner, M.; Moreno-Martínez, Á.; Walther, S.; Duveiller, G.; Cescatti, A.; Mahecha, M.D.; Muñoz-Marí, J.; García-Haro, F.J.; Guanter, L.; et al. A Unified Vegetation Index for Quantifying the Terrestrial Biosphere. Sci. Adv. 2021, 7, eabc7447. [Google Scholar] [CrossRef]
Wang, Z.; Fu, B.; Wu, X.; Li, Y.; Feng, Y.; Wang, S.; Wei, F.; Zhang, L. Vegetation Resilience Does Not Increase Consistently with Greening in China’s Loess Plateau. Commun. Earth Environ. 2023, 4, 336. [Google Scholar] [CrossRef]
Forzieri, G.; Dakos, V.; McDowell, N.G.; Ramdane, A.; Cescatti, A. Emerging Signals of Declining Forest Resilience under Climate Change. Nature 2022, 608, 534–539. [Google Scholar] [CrossRef]
Wu, J.; Sun, Z.; Yao, Y.; Liu, Y. Trends of Grassland Resilience under Climate Change and Human Activities on the Mongolian Plateau. Remote Sens. 2023, 15, 2984. [Google Scholar] [CrossRef]
Smith, T.; Traxl, D.; Boers, N. Empirical Evidence for Recent Global Shifts in Vegetation Resilience. Nat. Clim. Change 2022, 12, 477–484. [Google Scholar] [CrossRef]
Smith, T.; Boers, N. Global Vegetation Resilience Linked to Water Availability and Variability. Nat. Commun. 2023, 14, 498. [Google Scholar] [CrossRef] [PubMed]
Yao, Y.; Liu, Y.; Fu, F.; Song, J.; Wang, Y.; Han, Y.; Wu, T.; Fu, B. Declined Terrestrial Ecosystem Resilience. Glob. Change Biol. 2024, 30, e17291. [Google Scholar] [CrossRef]
Boulton, C.A.; Lenton, T.M.; Boers, N. Pronounced Loss of Amazon Rainforest Resilience since the Early 2000s. Nat. Clim. Change 2022, 12, 271–278. [Google Scholar] [CrossRef]
Zhang, Y.; Liu, X.; Jiao, W.; Wu, X.; Zeng, X.; Zhao, L.; Wang, L.; Guo, J.; Xing, X.; Hong, Y. Spatial Heterogeneity of Vegetation Resilience Changes to Different Drought Types. Earth’s Future 2023, 11, e2022EF003108. [Google Scholar] [CrossRef]
Hu, Y.; Wei, F.; Fu, B.; Wang, S.; Zhang, W.; Zhang, Y. Changes and Influencing Factors of Ecosystem Resilience in China. Environ. Res. Lett. 2023, 18, 094012. [Google Scholar] [CrossRef]
Jiang, L.; Liu, B.; Guo, H.; Yuan, Y.; Liu, W.; Jiapaer, G. Assessing Vegetation Resilience and Vulnerability to Drought Events in Central Asia. J. Hydrol. 2024, 634, 131012. [Google Scholar] [CrossRef]
Kaufmann, R.K.; Paletta, L.F.; Tian, H.Q.; Myneni, R.B.; D’Arrigo, R.D. The Power of Monitoring Stations and a CO₂ Fertilization Effect: Evidence from Causal Relationships between NDVI and Carbon Dioxide. Earth Interact. 2008, 12, 1–23. [Google Scholar] [CrossRef]
Zhu, Z.; Piao, S.; Myneni, R.B.; Huang, M.; Zeng, Z.; Canadell, J.G.; Ciais, P.; Sitch, S.; Friedlingstein, P.; Arneth, A.; et al. Greening of the Earth and Its Drivers. Nat. Clim. Change 2016, 6, 791–795. [Google Scholar] [CrossRef]
Vicente-Serrano, S.M.; Gouveia, C.; Camarero, J.J.; Beguería, S.; Trigo, R.; López-Moreno, J.I.; Azorín-Molina, C.; Pasho, E.; Lorenzo-Lacruz, J.; Revuelto, J.; et al. Response of Vegetation to Drought Time-Scales across Global Land Biomes. Proc. Natl. Acad. Sci. USA 2013, 110, 52–57. [Google Scholar] [CrossRef]
Linscheid, N.; Mahecha, M.D.; Rammig, A.; Carvalhais, N.; Gans, F.; Nelson, J.A.; Walther, S.; Weber, U.; Reichstein, M. Time-Scale Dependent Relations Between Earth Observation Based Proxies of Vegetation Productivity. Geophys. Res. Lett. 2021, 48, e2021GL093285. [Google Scholar] [CrossRef]
Runge, K.; Tucker, M.; Crowther, T.W.; Fournier De Laurière, C.; Guirado, E.; Bialic-Murphy, L.; Berdugo, M. Monitoring Terrestrial Ecosystem Resilience Using Earth Observation Data: Identifying Consensus and Limitations Across Metrics. Glob. Change Biol. 2025, 31, e70115. [Google Scholar] [CrossRef] [PubMed]
Laubach, Z.M.; Murray, E.J.; Hoke, K.L.; Safran, R.J.; Perng, W. A Biologist’s Guide to Model Selection and Causal Inference. Proc. R. Soc. B 2021, 288, 20202815. [Google Scholar] [CrossRef] [PubMed]
Arif, S.; MacNeil, M.A. Predictive Models Aren’t for Causal Inference. Ecol. Lett. 2022, 25, 1741–1745. [Google Scholar] [CrossRef]
Schrodt, F.; Beck, M.; Estopinan, J.; Bowler, D.E.; Fontaine, C.; Gaüzère, P.; Goury, R.; Grenié, M.; Martins, I.S.; Morueta-Holme, N.; et al. Advancing Causal Inference in Ecology: Pathways for Biodiversity Change Detection and Attribution. Methods Ecol. Evol. 2025, 16, 2276–2304. [Google Scholar] [CrossRef]
Byrnes, J.E.K.; Dee, L.E. Causal Inference With Observational Data and Unobserved Confounding Variables. Ecol. Lett. 2025, 28, e70023. [Google Scholar] [CrossRef]
Yu, H.; Yang, Q.; Jiang, S.; Zhan, B.; Zhan, C. Detection and Attribution of Vegetation Dynamics in the Yellow River Basin Based on Long-Term Kernel NDVI Data. Remote Sens. 2024, 16, 1280. [Google Scholar] [CrossRef]
Wang, Q.; Huang, L.; Yang, M.; Wang, L.; Cao, W. Distinguishing Dominant Drivers on Long-Term Vegetation Dynamics across China Considering Time-Lag and Accumulation Effects Using Machine Learning Techniques. J. Environ. Manag. 2025, 391, 126477. [Google Scholar] [CrossRef]
Kimmel, K.; Dee, L.E.; Avolio, M.L.; Ferraro, P.J. Causal Assumptions and Causal Inference in Ecological Experiments. Trends Ecol. Evol. 2021, 36, 1141–1152. [Google Scholar] [CrossRef]
Giannarakis, G.; Sitokonstantinou, V.; Lorilla, R.S.; Kontoes, C. Towards Assessing Agricultural Land Suitability with Causal Machine Learning. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA, 19–20 June 2022; IEEE: New Orleans, LA, USA, 2022; pp. 1441–1451. [Google Scholar]
Nogueira, A.R.; Pugnana, A.; Ruggieri, S.; Pedreschi, D.; Gama, J. Methods and Tools for Causal Discovery and Causal Inference. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2022, 12, e1449. [Google Scholar] [CrossRef]
Siegel, K.; Dee, L.E. Foundations and Future Directions for Causal Inference in Ecological Research. Ecol. Lett. 2025, 28, e70053. [Google Scholar] [CrossRef] [PubMed]
Glymour, C.; Zhang, K.; Spirtes, P. Review of Causal Discovery Methods Based on Graphical Models. Front. Genet. 2019, 10, 524. [Google Scholar] [CrossRef] [PubMed]
Wang, L.; Huang, S.; Wang, S.; Liao, J.; Li, T.; Liu, L. A Survey of Causal Discovery Based on Functional Causal Model. Eng. Appl. Artif. Intell. 2024, 133, 108258. [Google Scholar] [CrossRef]
Arif, S. Estimating Causal Effects with Machine Learning: A Guide for Ecologists. Methods Ecol. Evol. 2025, 16, 2771–2783. [Google Scholar] [CrossRef]
Runge, J.; Gerhardus, A.; Varando, G.; Eyring, V. Gustau Camps-Valls Causal Inference for Time Series. Nat. Rev. Earth Environ. 2023, 4, 487–505. [Google Scholar] [CrossRef]
Abatzoglou, J.T.; Dobrowski, S.Z.; Parks, S.A.; Hegewisch, K.C. TerraClimate, a High-Resolution Global Dataset of Monthly Climate and Climatic Water Balance from 1958–2015. Sci. Data 2018, 5, 170191. [Google Scholar] [CrossRef]
Global Monitoring Laboratory (GML); Jacobson, A.R.; Schuldt, K.N.; Andrews, A.; Miller, J.B.; Oda, T.; Basu, S.; Mund, J.; Weir, B.; Ott, L.; et al. CarbonTracker CT-NRT.V2025-1; Global Monitoring Laboratory (GML): Boulder, CO, USA, 2025. [Google Scholar]
Yang, J.; Huang, X. 30 m Annual Land Cover and Its Dynamics in China from 1990 to 2019. Earth Syst. Sci. Data 2021, 13, 3907–3925. [Google Scholar] [CrossRef]
Mu, H.; Li, X.; Wen, Y.; Huang, J.; Du, P.; Su, W.; Miao, S.; Geng, M. An Annual Global Terrestrial Human Footprint Dataset from 2000 to 2018. Sci. Data 2022, 9, 8721751357. [Google Scholar] [CrossRef]
Verbesselt, J.; Hyndman, R.; Newnham, G.; Culvenor, D. Detecting Trend and Seasonal Changes in Satellite Image Time Series. Remote Sens. Environ. 2010, 114, 106–115. [Google Scholar] [CrossRef]
Gao, B.; Yang, J.; Chen, Z.; Sugihara, G.; Li, M.; Stein, A.; Kwan, M.-P.; Wang, J. Causal Inference from Cross-Sectional Earth System Data with Geographical Convergent Cross Mapping. Nat. Commun. 2023, 14, 5875. [Google Scholar] [CrossRef] [PubMed]
Deng, X.; Yu, W.; Shi, J.; Huang, Y.; Li, D.; He, X.; Zhou, W.; Xie, Z. Characteristics of Surface Urban Heat Islands in Global Cities of Different Scales: Trends and Drivers. Sustain. Cities Soc. 2024, 107, 105483. [Google Scholar] [CrossRef]
Han, Y.; Zhou, P.; Lv, Q.; Cui, R.; Meng, L. Synergistic Drivers and Threshold Effects of Vapor Pressure Deficit in China: An Integrated Framework of Causal Inference and Machine Learning. J. Environ. Manag. 2025, 395, 127739. [Google Scholar] [CrossRef] [PubMed]
Hirano, K.; Imbens, G.W. The Propensity Score with Continuous Treatments. In Wiley Series in Probability and Statistics; Gelman, A., Meng, X., Eds.; Wiley: Hoboken, NJ, USA, 2004; pp. 73–84. ISBN 978-0-470-09043-5. [Google Scholar]
Chernozhukov, V.; Chetverikov, D.; Demirer, M.; Duflo, E.; Hansen, C. Double/Debiased Machine Learning for Treatment and Structural Parameters. Econom. J. 2018, 21, C1–C68. [Google Scholar] [CrossRef]
Nie, X.; Wager, S. Quasi-Oracle Estimation of Heterogeneous Treatment Effects. Biometrika 2020, 108, 299–319. [Google Scholar] [CrossRef]
Zhang, K.; Huang, B.; Zhang, J.; Glymour, C.; Schölkopf, B. Causal Discovery from Nonstationary/Heterogeneous Data: Skeleton Estimation and Orientation Determination. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, Melbourne, Australia, 15–21 August 2017; International Joint Conferences on Artificial Intelligence Organization: Melbourne, Australia, 2017; pp. 1347–1353. [Google Scholar]
Sadeghi, A.; Gopal, A.; Fesanghary, M. Causal Discovery from Nonstationary Time Series. Int. J. Data Sci. Anal. 2025, 19, 33–59. [Google Scholar] [CrossRef]
Chen, H.; Liu, J.; He, W.; Xu, P.; Nguyen, N.T.; Lv, Y.; Huang, C. Shifted Vegetation Resilience from Loss to Gain Driven by Changes in Water Availability and Solar Radiation over the Last Two Decades in Southwest China. Agric. For. Meteorol. 2025, 368, 110543. [Google Scholar] [CrossRef]
Wu, J.; Zhao, Q.; Zhou, Y.; Ni, Y.; Fan, J. Temperature and Water Availability Drive Vegetation Resilience Dynamics in China: An Empirical Study from Causal Perspective. J. Geogr. Sci. 2025, 35, 2069–2090. [Google Scholar] [CrossRef]
Xu, H.; Wang, X.; Zhao, C.; Yang, X. Diverse Responses of Vegetation Growth to Meteorological Drought across Climate Zones and Land Biomes in Northern China from 1981 to 2014. Agric. For. Meteorol. 2018, 262, 1–13. [Google Scholar] [CrossRef]
Ahlström, A.; Raupach, M.R.; Schurgers, G.; Smith, B.; Arneth, A.; Jung, M.; Reichstein, M.; Canadell, J.G.; Friedlingstein, P.; Jain, A.K.; et al. The Dominant Role of Semi-Arid Ecosystems in the Trend and Variability of the Land CO₂ Sink. Science 2015, 348, 895–899. [Google Scholar] [CrossRef]
Wieder, W.R.; Butterfield, Z.; Lindsay, K.; Lombardozzi, D.L.; Keppel-Aleks, G. Interannual and Seasonal Drivers of Carbon Cycle Variability Represented by the Community Earth System Model (CESM2). Glob. Biogeochem. Cycles 2021, 35, e2021GB007034. [Google Scholar] [CrossRef] [PubMed]
Ma, F.; Ye, A.; You, J.; Duan, Q. 2015–2016 Floods and Droughts in China, and Its Response to the Strong El Niño. Sci. Total Environ. 2018, 627, 1473–1484. [Google Scholar] [CrossRef] [PubMed]
Wang, W.; Wang, W.; Li, J.; Wu, H.; Xu, C.; Liu, T. The Impact of Sustained Drought on Vegetation Ecosystem in Southwest China Based on Remote Sensing. Procedia Environ. Sci. 2010, 2, 1679–1691. [Google Scholar] [CrossRef]
Jiao, W.; Wang, L.; Smith, W.K.; Chang, Q.; Wang, H.; D’Odorico, P. Observed Increasing Water Constraint on Vegetation Growth over the Last Three Decades. Nat. Commun. 2021, 12, 3777. [Google Scholar] [CrossRef]
Denissen, J.M.C.; Teuling, A.J.; Pitman, A.J.; Koirala, S.; Migliavacca, M.; Li, W.; Reichstein, M.; Winkler, A.J.; Zhan, C.; Orth, R. Widespread Shift from Ecosystem Energy to Water Limitation with Climate Change. Nat. Clim. Change 2022, 12, 677–684. [Google Scholar] [CrossRef]
Yuan, W.; Zheng, Y.; Piao, S.; Ciais, P.; Lombardozzi, D.; Wang, Y.; Ryu, Y.; Chen, G.; Dong, W.; Hu, Z.; et al. Increased Atmospheric Vapor Pressure Deficit Reduces Global Vegetation Growth. Sci. Adv. 2019, 5, eaax1396. [Google Scholar] [CrossRef]
Fu, Z.; Ciais, P.; Prentice, I.C.; Gentine, P.; Makowski, D.; Bastos, A.; Luo, X.; Green, J.K.; Stoy, P.C.; Yang, H.; et al. Atmospheric Dryness Reduces Photosynthesis along a Large Range of Soil Water Deficits. Nat. Commun. 2022, 13, 989. [Google Scholar] [CrossRef]
He, B.; Chen, C.; Lin, S.; Yuan, W.; Chen, H.W.; Chen, D.; Zhang, Y.; Guo, L.; Zhao, X.; Liu, X.; et al. Worldwide Impacts of Atmospheric Vapor Pressure Deficit on the Interannual Variability of Terrestrial Carbon Sinks. Natl. Sci. Rev. 2022, 9, nwab150. [Google Scholar] [CrossRef]
Runge, J.; Bathiany, S.; Bollt, E.M.; Camps-Valls, G.; Coumou, D. Inferring Causation from Time Series in Earth System Sciences. Nat. Commun. 2019, 10, 2553. [Google Scholar] [CrossRef]
Mahecha, M.D.; Runge, J.; Nowack, P.; Kretschmer, M.; Flaxman, S.; Sejdinović, D. Detecting and Quantifying Causal Associations in Large Nonlinear Time Series Datasets. Sci. Adv. 2019, 5, eaau4996. [Google Scholar] [CrossRef]
Kahnert, M.; Nousiainen, T.; Veihelmann, B. Spherical and Spheroidal Model Particles as an Error Source in Aerosol Climate Forcing and Radiance Computations: A Case Study for Feldspar Aerosols. J. Geophys. Res. 2005, 110, 2004JD005558. [Google Scholar] [CrossRef]
Bodman, R.W.; Rayner, P.J.; Karoly, D.J. Uncertainty in Temperature Projections Reduced Using Carbon Cycle and Climate Observations. Nat. Clim. Change 2013, 3, 725–729. [Google Scholar] [CrossRef]
Zhao, M.; Running, S.W. Drought-Induced Reduction in Global Terrestrial Net Primary Production from 2000 Through 2009. Science 2010, 329, 940–943. [Google Scholar] [CrossRef] [PubMed]
Frölicher, T.L. Strong Warming at High Emissions. Nat. Clim. Change 2016, 6, 823–824. [Google Scholar] [CrossRef]
Jonard, F.; Feldman, A.F.; Short Gianotti, D.J.; Entekhabi, D. Observed Water and Light Limitation across Global Ecosystems. Biogeosciences 2022, 19, 5575–5590. [Google Scholar] [CrossRef]
Carlstein, E. The Use of Subseries Values for Estimating the Variance of a General Statistic from a Stationary Sequence. Ann. Statist. 1986, 14, 1171–1179. [Google Scholar] [CrossRef]
Politis, D.N.; Romano, J.P. The Stationary Bootstrap. J. Am. Stat. Assoc. 1994, 89, 1303–1313. [Google Scholar] [CrossRef]
Benjamini, Y.; Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B Stat. Methodol. 1995, 57, 289–300. [Google Scholar] [CrossRef]
Sugihara, G.; May, R.M.; Ye, H.; Hsieh, C.; Deyle, E.R. Detecting Causality in Complex Ecosystems. Science 2012, 338, 496–500. [Google Scholar] [CrossRef]
Venter, O.; Sanderson, E.W.; Magrach, A.; Allan, J.R.; Beher, J.; Jones, K.R.; Possingham, H.P.; Laurance, W.F.; Wood, P.; Fekete, B.M.; et al. Sixteen Years of Change in the Global Terrestrial Human Footprint and Implications for Biodiversity Conservation. Nat. Commun. 2016, 7, 12558. [Google Scholar] [CrossRef]
Chen, C.; Park, T.; Wang, X.; Piao, S.; Xu, B.; Chaturvedi, R.K.; Fuchs, R.; Brovkin, V.; Ciais, P.; Fensholt, R.; et al. China and India Lead in Greening of the World through Land-Use Management. Nat. Sustain. 2019, 2, 122–129. [Google Scholar] [CrossRef]
Novick, K.A.; Ficklin, D.L.; Stoy, P.C.; Williams, C.A.; Bohrer, G.; Oishi, A.C.; Papuga, S.A.; Blanken, P.D.; Noormets, A.; Sulman, B.N.; et al. The Increasing Importance of Atmospheric Demand for Ecosystem Water and Carbon Fluxes. Nat. Clim. Change 2016, 6, 1023–1027. [Google Scholar] [CrossRef]
Grossiord, C.; Buckley, T.N.; Cernusak, L.A.; Novick, K.A.; Poulter, B.; Siegwolf, R.T.W.; Sperry, J.S.; McDowell, N.G. Plant Responses to Rising Vapor Pressure Deficit. New Phytol. 2020, 226, 1550–1566. [Google Scholar] [CrossRef]
Song, J.; Zhou, S.; Yu, B.; Li, Y.; Liu, Y.; Yao, Y.; Wang, S.; Fu, B. Serious Underestimation of Reduced Carbon Uptake Due to Vegetation Compound Droughts. NPJ Clim. Atmos. Sci. 2024, 7, 23. [Google Scholar] [CrossRef]
Yao, Y.; Fu, B.; Liu, Y.; Zhang, Y.; Ding, J.; Li, Y.; Zhou, S.; Song, J.; Wang, S.; Li, C.; et al. Compound Hot–Dry Events Greatly Prolong the Recovery Time of Dryland Ecosystems. Natl. Sci. Rev. 2024, 11, nwae274. [Google Scholar] [CrossRef]

Figure 1. Location of the study area, analyzed pixels, and the eight terrestrial ecoregions in China.

Figure 2. Multi-year mean kNDVI (2000–2024).

Figure 3. Long-term resilience proxy AR(1) (2000–2024).

Figure 4. Spatial pattern of breakpoint years identified by BFAST across China (2000–2024).

Figure 5. Annual breakpoint proportions summarized by the eight terrestrial ecoregions, displayed as a heatmap.

Figure 6. Spatial patterns of resilience proxy AR(1) trends (Kendall’s τ). Note: Increasing AR(1) indicates declining resilience under the critical slowing down framework; positive τ values represent resilience weakening. (a–c) AR(1) trend distributions during the pre-break, post-break, and full study periods, respectively. (d–f) The distribution and boxplots of τ across the eight terrestrial ecoregions from 2000 to 2024. Only statistically significant pixels (p < 0.01) are shown.

Figure 7. Causal strength (ρ) and statistical significance of drivers across different temporal snapshots. Note: To identify potential causal drivers of vegetation resilience, we applied Geographical Convergent Cross-Mapping (GCCM) to examine the nonlinear causal relationships between AR(1) (resilience proxy, Y) and candidate driving variables (X). The X → Y direction reflects the influence of a potential driver on AR(1) (resilience proxy, Y).

Figure 8. Average treatment effects estimated using the Causal Forest model. Note: “MEAN”, “CV”, and “AR(1)” represent background climate conditions, climate variability, and temporal autocorrelation, respectively, and * and ** denote statistically significant causal effects at p < 0.05 and p < 0.01, respectively.

Figure 9. Pixel-level dominant causal drivers of vegetation resilience across China. (a) Map of variables with the largest individual causal effects. (b) Map of variable types with the largest individual causal effects. (c,d) Map of variables/variable types with the largest individual causal effect distributions across the eight terrestrial ecoregions.

Figure 10. Spatial distribution and composition of dominant drivers before and after the breakpoint. Note: (a,b) Dominant-driver maps, pre to post; (c) changes in dominant-driver proportions (definable pixels). Subfigures (a–c) share the same legend. The color meanings are identical across all three subfigures.

Figure 11. Global transition matrix of dominant drivers, expressed as proportions of all pixels with definable dominant drivers in both periods.

Figure 12. Spatial distribution of significant driver-edge changes before and after the breakpoint (drivers). Note: (a–g) Keep, gain, loss, and none maps for each driver; (h) summary of the national proportions of keep, gain, and loss within the union set.

Figure 13. Changes in network connectivity across ecoregions (Δdegree).

Figure 14. Spatial distribution of lag-direction comparisons before and after the breakpoint (drivers): (a–g) six-category changes in the optimal lag (bestlag) of each driver before and after the breakpoint; (h) the proportions of these six categories across all valid pixels at the national scale.

Table 1. Data descriptions and sources.

Variable	Unit	Resolution	Temporal Characteristics	Source
NDVI	-	1 km	Monthly (2000–2024)	Google Earth Engine/MODIS/Terra Vegetation Indices Monthly L3 Global 1 km (MOD13A2.061) (https://lpdaac.usgs.gov/products/mod13a3v061/, accessed on 29 March 2026)
Precipitation (PRE)	mm month⁻¹	~1 km (0.0083333°)	Monthly (2000–2024)	National Tibetan Plateau Data Center (TPDC): 1-km monthly precipitation dataset for China (https://data.tpdc.ac.cn/en/, accessed on 29 March 2026)
Minimum Temperature (TMN)	°C	~1 km (0.0083333°)	Monthly (2000–2024)	National Tibetan Plateau Data Center (TPDC): 1-km monthly minimum temperature dataset for China (https://data.tpdc.ac.cn/en/, accessed on 29 March 2026)
Maximum Temperature (TMX)	°C	~1 km (0.0083333°)	Monthly (2000–2024)	National Tibetan Plateau Data Center (TPDC): 1-km monthly maximum temperature dataset for China (https://data.tpdc.ac.cn/en/, accessed on 29 March 2026)
Vapor (VPD)	kPa	~4 km (1/24°)	Monthly (2000–2024)	TerraClimate (vpd) (https://www.climatologylab.org/terraclimate.html, accessed on 29 March 2026)
Soil Moisture (SM)	mm	~4 km (1/24°)	Monthly (2000–2024)	TerraClimate (soil) (https://www.climatologylab.org/terraclimate.html, accessed on 29 March 2026)
Downward Shortwave Radiation (SRAD)	W m⁻²	~4 km (1/24°)	Monthly (2000–2024)	TerraClimate (srad) (https://www.climatologylab.org/terraclimate.html, accessed on 29 March 2026)
Atmospheric CO₂ Mole Fraction (CO₂)	ppm	3° × 2° (global grid; 34 vertical levels)	Monthly (2000–2024)	NOAA Global Monitoring Laboratory (GML): CarbonTracker CT2022 (https://gml.noaa.gov/ccgg/carbontracker/CT2022/) + CarbonTracker Near-Real-Time CT-NRT.v2025-1 (https://gml.noaa.gov/ccgg/carbontracker/CT-NRT/, accessed on 29 March 2026)
Land Cover	Class code	30 m	Yearly (2000–2024)	China Land Cover Dataset (CLCD) (https://zenodo.org/records/4417810, accessed on 29 March 2026)
Soil Background (clay, SOC)	wt.%; wt.%	~1 km (30 arc-sec)	Static	Harmonized World Soil Database (HWSD) v1.2 (https://www.fao.org/soils-portal/data-hub/soil-maps-and-databases/harmonized-world-soil-database-v12/en/, accessed on 29 March 2026)
Topography (elevation, slope)	m; °	30 m	Static	ASTER Global Digital Elevation Model (GDEM) Version 3 (https://lpdaac.usgs.gov/products/astgtmv003/, accessed on 29 March 2026)
Human Footprint (HF)	Index score	~1 km (30 arc-sec)	Yearly (2000–2024)	Global Annual Human Footprint Dataset (https://www.nature.com/articles/s41597-022-01284-8, accessed on 29 March 2026)
Vegetation Type	Class code	1 km	Static	Resource and Environment Science and Data Center (RESDC): Vegetation Type Map of China (https://www.resdc.cn/data.aspx?DATAID=122, accessed on 29 March 2026)

Table 2. Temporal evolution of GCCM coupling strength (ρ) for candidate drivers from 2000 to 2024.

Variable	Descriptor	ρ (2000)	ρ (2011)	ρ (2024)	Significance	Change (2024 − 2000)
VPD	mean	0.513	0.49	0.522	sig	0.009
	cv	0.532	0.528	0.54	sig	0.008
	AR(1)	0.604	0.533	0.504	sig	−0.1
TMX	mean	0.338	0.325	0.298	sig	−0.04
	cv	0.343	0.334	0.305	sig	−0.038
	AR(1)	0.479	0.442	0.451	sig	−0.028
TMN	mean	0.429	0.402	0.427	sig	−0.002
	cv	0.406	0.375	0.405	sig	−0.001
	AR(1)	0.471	0.471	0.469	sig	−0.002
SRAD	mean	0.581	0.556	0.568	sig	−0.013
	cv	0.595	0.585	0.584	sig	−0.011
	AR(1)	0.589	0.59	0.583	sig	−0.006
SM	mean	0.57	0.551	0.556	sig	−0.014
	cv	0.561	0.521	0.54	sig	−0.021
	AR(1)	0.539	0.516	0.542	sig	0.003
PRE	mean	0.488	0.459	0.485	sig	−0.003
	cv	0.481	0.473	0.49	sig	0.009
	AR(1)	0.46	0.426	0.466	sig	0.006
CO₂	mean	0.361	0.317	0.355	sig	−0.007
	cv	0.37	0.311	0.35	sig	−0.02
	AR(1)	0.365	0.324	0.351	sig	−0.014
HF	mean	-	-	-	no sig	-
	cv	-	-	-	no sig	-
	AR(1)	-	-	-	no sig	-
DEM		0.321	0.305	0.325	sig	0.004
Slope		-	-	-	no sig	-
clay		0.251	0.258	0.261	sig	0.01
SOC		0.281	0.279	0.287	sig	0.006

Note: For climate and human-activity drivers, “mean”, “cv”, and “AR(1)” denote the multi-year mean, coefficient of variation, and first-order autoregressive coefficient of the annual series, respectively. “Change (2024 − 2000)” was calculated as ρ (2024) − ρ (2000); “sig” and “no sig” indicate significant and non-significant GCCM relationships, respectively. Only variables that show clear convergence and directional causal evidence with the resilience proxy AR(1) are displayed. HF and slope are marked as “-“ because no significant causal relationship was detected.

Table 3. Hotspot statistics of dominant-driver transitions at the 2° × 2° grid scale.

Transition	Rank	Hotspot_2deg	Share (%)
PRE → VPD	1	116° E–118° E, 44° N–46° N	8.76
PRE → VPD	2	112° E–114° E, 44° N–46° N	4.17
VPD → PRE	1	118° E–120° E, 44° N–46° N	4.8
VPD → PRE	2	104° E–106° E, 36° N–38° N	4.38
SRAD → PRE	1	116° E–118° E, 48° N–50° N	4.39
SRAD → PRE	2	88° E–90° E, 28° N–30° N	2.94
TMX → CO₂	1	108° E–110° E, 32° N–34° N	3.52
TMX → CO₂	2	106° E–108° E, 32° N–34° N	2.71
TMN → CO₂	1	116° E–118° E, 24° N–26° N	2.76
TMN → CO₂	2	110° E–112° E, 30° N–32° N	2.5
SM → CO₂	1	108° E–110° E, 38° N–40° N	3.31
SM → CO₂	2	102° E–104° E, 28° N–30° N	2.9

Note: Transition denotes the dominant-driver transition type from the pre- to the post-breakpoint period (e.g., PRE → VPD). Rank indicates the descending order of hotspot importance based on share_%. Hotspot_2deg represents the spatial extent of each hotspot grid cell (longitude × latitude, in degrees). Share_% denotes the proportion of pixels of a given transition that fall within the corresponding hotspot grid cell (i.e., the within-transition share). Count denotes the absolute number of pixels of the corresponding transition within the hotspot grid cell. All statistics were computed on pixels where dominant drivers were identifiable in both periods.

Table 4. National statistics of significant driver-edge changes across the breakpoint.

Statistic	TMX	TMN	PRE	VPD	SM	SRAD	CO₂
$U n i o n$ (%)	30.8747	34.5728	45.9568	38.2016	33.2934	42.1356	44.5396
$p c t_{k e e p}$ (%)	10.4044	11.6612	24.5327	15.312	11.9276	20.5603	29.4714
$p c t_{g a i n}$ (%)	36.021	40.2006	38.8663	46.3829	40.8575	33.1522	36.9682
$p c t_{l o s s}$ (%)	53.5746	48.1381	36.601	38.3052	47.2149	46.2875	33.5603

Note: Stable, gain, and loss proportions (within union set):

p c t_{k e e p} = ∣ E_{p r e} \cap E_{p o s t} ∣ / ∣ E_{p r e} \cup E_{p o s t} ∣ \times 100, p c t_{g a i n} = ∣ E_{p o s t} ∖ E_{p r e} ∣ / ∣ E_{p r e} \cup E_{p o s t} ∣ \times 100

,

p c t_{l o s s} = ∣ E_{p r e} ∖ E_{p o s t} ∣ / ∣ E_{p r e} \cup E_{p o s t} ∣ \times 100

,

U n i o n = ∣ E_{p r e} \cup E_{p o s t} ∣ / N_{valid} \times 100

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, S.; Du, Q. Structural Responses of Vegetation Resilience to Background-State and Temperature Asymmetry Across China: An Annual-Scale Causal Analysis. Forests 2026, 17, 443. https://doi.org/10.3390/f17040443

AMA Style

Wu S, Du Q. Structural Responses of Vegetation Resilience to Background-State and Temperature Asymmetry Across China: An Annual-Scale Causal Analysis. Forests. 2026; 17(4):443. https://doi.org/10.3390/f17040443

Chicago/Turabian Style

Wu, Shang, and Qingyun Du. 2026. "Structural Responses of Vegetation Resilience to Background-State and Temperature Asymmetry Across China: An Annual-Scale Causal Analysis" Forests 17, no. 4: 443. https://doi.org/10.3390/f17040443

APA Style

Wu, S., & Du, Q. (2026). Structural Responses of Vegetation Resilience to Background-State and Temperature Asymmetry Across China: An Annual-Scale Causal Analysis. Forests, 17(4), 443. https://doi.org/10.3390/f17040443

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Structural Responses of Vegetation Resilience to Background-State and Temperature Asymmetry Across China: An Annual-Scale Causal Analysis

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Area

2.2. Data Sources and Overview

2.2.1. Vegetation Index Data

2.2.2. Climate Drivers

2.2.3. Atmospheric CO2

2.2.4. Land-Use Data

2.2.5. Soil Data

2.2.6. Human Footprint (HF) Dataset

2.2.7. Topographic Data

2.2.8. Vegetation Type Data

2.3. Methods

2.3.1. kNDVI

2.3.2. CSD-Based Vegetation Resilience Indicators

2.3.3. Trend Analysis

2.3.4. Breakpoint Detection

2.3.5. Construction of Long-Term Statistical Descriptors for Candidate Drivers

2.3.6. GCCM for Spatial Causal Inference

2.3.7. Causal Effect Estimation with DML–Causal Forest

2.3.8. CD-NOTS for Non-Stationary Causal Networks

3. Results

3.1. Spatial Pattern of Vegetation Resilience

3.2. Nonlinear Changes in Vegetation Resilience

3.2.1. Spatiotemporal Distribution of BFAST Breakpoint Years

3.2.2. Trend Patterns of Vegetation Resilience Before and After Breakpoints

3.3. Causal Inference Based on GCCM

3.4. Causal Forest Analysis

3.4.1. Distribution of Estimated Average Treatment Effects (ATE) for Each Drive

3.4.2. Pixel-Level Dominant Environmental Drivers

3.5. Causal Mechanism Reorganization on CD-NOTS

3.5.1. National-Scale Changes in Dominant Driver Composition and Transition Patterns

3.5.2. Spatial Reorganization of Significant Driver Edges

3.5.3. Driver–Response Timescale Reorganization and Lag-Shift Patterns

4. Discussion

4.1. Spatial Vulnerability Patterns and Breakpoint Clustering of AR(1)-Based Resilience

4.2. Interdecadal Shifts in GCCM Coupling Signals

4.3. DML–Causal Forest Estimates of Conditional Net Effects

4.3.1. Background Climatic Controls and Regional Divergence of Dominant Mechanisms

4.3.2. Robustness of Causal Estimation

4.3.3. Anthropogenic Effects and Metric Sensitivity

4.4. CD-NOTS Evidence for Breakpoint-Associated Causal Network Reorganization

4.4.1. Dominant Drivers Switching Across Breakpoints

4.4.2. Edge Reorganization and Pathway Rewiring

4.4.3. Reorganization of Driver–Response Timescales

4.5. Coupled Evidence Across GCCM, DML–Causal Forest, and CD-NOTS

4.6. Limitations

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.2.3. Atmospheric CO₂