Next Article in Journal
MissPred: A Robust Two-Stage Radar Echo Extrapolation Algorithm for Incomplete Sequences
Previous Article in Journal
Hyperspectral Image Classification Based on Fractional Fourier Transform
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Increasing Ecosystem Fluxes Observed from Eddy Covariance and Solar-Induced Fluorescence Data

1
Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, School of Environmental Science and Engineering, Nanjing University of Information Science & Technology (NUIST), Nanjing 210044, China
2
College of Meteorology and Oceanography, National University of Defense Technology, Changsha 410073, China
3
High Impact Weather Key Laboratory of China Meteorological Administration (CMA), Changsha 410073, China
4
School of Earth Sciences and Engineering, Hohai University, Nanjing 211100, China
5
Frontiers Science Center for Critical Earth Material Cycling, International Institute for Earth System Science, Nanjing University, Nanjing 210023, China
6
Earth Systems Research Center, Institute for the Study of Earth, Oceans, and Space, University of New Hampshire, Durham, NH 03824, USA
7
School of Geography and Planning, Sun Yat-sen University, Guangzhou 510275, China
8
Key Laboratory of Agrometeorology of Jiangsu Province, School of Ecology and Applied Meteorology, NUIST, Nanjing 210044, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Remote Sens. 2025, 17(12), 2064; https://doi.org/10.3390/rs17122064
Submission received: 8 April 2025 / Revised: 6 June 2025 / Accepted: 12 June 2025 / Published: 15 June 2025

Abstract

Ecosystems modulate Earth’s climate through the exchange of carbon and water fluxes. However, long-term trends in these terrestrial fluxes remain unclear due to the lack of continuous measurements on the global scale. This study combined flux data from 197 eddy covariance sites with satellite-retrieved solar-induced chlorophyll fluorescence (SIF) to investigate spatiotemporal variations in gross primary productivity (GPP), evapotranspiration (ET), and their coupling via water use efficiency (WUE) from 2001 to 2020. We developed six global GPP and ET products at 0.05° spatial and 8-day temporal resolution, using two machine learning models and three SIF products, which integrate vegetation physiological parameters with data-driven approaches. These datasets provided mean estimates of 128 ± 2.3 Pg C yr−1 for GPP, 522 ± 58.2 mm yr−1 for ET, and 1.8 ± 0.21 g C kg−1 H2O yr−1 for WUE, with upward trends of 0.22 ± 0.04 Pg C yr−2 in GPP, 0.64 ± 0.14 mm yr−2 in ET, and 0.0019 ± 0.0005 g C kg−1 H2O yr−2 in WUE over the past two decades. These high-resolution datasets are valuable for exploring terrestrial carbon and water responses to climate change, as well as for benchmarking terrestrial biosphere models.

1. Introduction

Terrestrial ecosystems influence the land–atmosphere interactions of carbon and water fluxes through surface physical and biological processes [1,2,3]. Gross primary productivity (GPP) represents the total carbon gains by plant photosynthesis, while evapotranspiration (ET) accounts for the total water losses, including soil evaporation, canopy intercepted evaporation, and plant transpiration [4,5]. As the representation of terrestrial carbon and water interactions, water use efficiency (WUE) is calculated as the ratio of GPP and ET to indicate coupling relationships between carbon gains and water losses. The spatiotemporal variations in GPP, ET, and WUE substantially influence carbon and water cycles from regional to global scales [6]. Therefore, the accurate quantification of GPP, ET, and WUE globally is vital for advancing our understanding of terrestrial carbon and water dynamics and their interactions under the changing climate. Moreover, understanding long-term trends in WUE is also critical for predicting ecosystem responses to global change, improving Earth system models, and informing policies on climate mitigation, adaptation, and water sustainability.
Different approaches have been applied to quantify regional/global GPP and ET, including process-based models [7], semi-mechanistic models [8,9], and machine learning approaches [10,11,12]. Several machine learning (ML) methods for estimating GPP already exist, such as cost-effective and scalable ML frameworks for GPP modeling that combine remote sensing and flux tower data; hybrid models that merge machine learning techniques with process-based or light-use efficiency (LUE) models; and automated machine learning (AutoML) [13,14]. Each of these approaches has its advantages and disadvantages [2]. Process-based models, such as the Breathing Earth System Simulator [7] and Boreal Ecosystem Productivity Simulator [15], can simultaneously simulate regional GPP and ET based on land surface physical and chemical mechanisms [16]. However, their performance is highly dependent on parameterization schemes and model structures [17,18]. Semi-mechanistic models, such as the Light Use Efficiency (LUE) model and the Penman–Monteith equations, simulate regional GPP and ET based on satellite observations and empirical relationships [8]. However, establishing uniform constraint equations across different ecosystems is challenging, and these models are limited in their application to climate or vegetation change scenarios. Machine learning (ML) algorithms have also been applied to derive GPP and ET using ground- and satellite-based observations [19,20,21,22]. Validations have shown that these data-driven approaches often outperform traditional process-based and semi-mechanistic models [18,23]. However, most of these ML algorithms have not taken advantage of recent advances in remote sensing vegetation parameters, such as solar-induced chlorophyll fluorescence (SIF), which represents emitted electromagnetic radiation related to photosynthesis. Furthermore, only a few studies (e.g., FLUXCOM) have focused on the coupled relationships between carbon and water fluxes, such as water use efficiency (WUE), based on simultaneously derived GPP and ET.
SIF is a spectral signal (650–800 nm) emitted by the photosynthetic center of plants under solar illumination, reflecting the complex physiological functions of terrestrial plants [24]. SIF has been widely regarded as a direct indicator of plant photosynthesis [25,26,27,28] as it represents an energy flux emitted from plant chlorophyll molecules a few nanoseconds after light absorption [29]. Terrestrial GPP often shows strong correlations with SIF for almost all biomes in unstressed conditions [30], offering a more effective means to monitor photosynthesis compared to traditional vegetation indices like the Normalized Difference Vegetation Index and Enhanced Vegetation Index [28,29,31,32,33]. Given the physiological coupling between plant photosynthesis and transpiration, some studies have also shown that SIF could be used to estimate regional ET based on empirical SIF-ET relationships [34,35]. Zhou et al. [36] demonstrated that SIF-based ET models outperformed other satellite-based methods. However, the tight relationship between SIF and ET is influenced by environmental factors such as air temperature and vapor pressure deficit (VPD), which are linked to soil and vegetation water evaporation [37,38,39].
Considering the limitations of simultaneous carbon and water fluxes datasets, we developed global GPP, ET, and WUE datasets (referred to as ECSIF) by combining site-level Eddy Covariance measurements with satellite-retrieval SIF products from 2001 to 2020. We first explored the relationships between GPP (or ET) and three SIF products for different plant functional types (PFTs) at the global FLUXNET sites. We then constructed and validated sixty PFT-specific machine learning models for GPP and ET, respectively, using ground measurements, two ML algorithms, and three SIF products across ten PFTs. Thereafter, we generated six ECSIF datasets of global GPP, ET, and WUE at 8-day temporal resolution and 0.05° spatial resolutions, based on gridded SIF, meteorological, and land cover products. The main objectives of this study are to (a) validate the effectiveness of SIF products in deriving site-level GPP and ET, (b) generate SIF-based GPP, ET, and WUE products with high spatiotemporal resolution on the global scale, and (c) explore the spatiotemporal variations in GPP, ET and their coupling WUE over the past two decades.

2. Materials and Methods

2.1. Site-Level Observations

We selected 197 sites from the FLUXNET2015 dataset for model development and validation, each with at least 1-year consistent observations of carbon and water fluxes, as well as meteorological measurements after 2001 (Figure S1 and Table S1). These sites were categorized into ten PFTs according to the International Geosphere-Biosphere Program (IGBP) land cover classification scheme. At each site, meteorological variables including total shortwave radiation (SW, W m−2), surface air pressure (Ps, hPa), air temperature (Tair, K), total precipitation (Pre, mm), wind speed (Wind, m s−1), and vapor pressure difference (VPD, kPa) were observed at half-hour intervals. In addition, carbon and water fluxes, such as net ecosystem exchange (NEE) and latent heat (LH), were measured simultaneously. In this study, we used the carbon fluxes of GPP (μmol CO2 m−2 s−1) derived from the mean values of daytime (GPP_DT_VUT_MEAN) and nighttime (GPP_NT_VUT_MEAN) fluxes based on the NEE-partitioning methods. For water fluxes, ET (mm hr−1) was calculated from the observed LH using the following method [40]:
E T = L H a · ρ w a t e r
Here, a is the latent heat of the vaporization of water (2.45 KJ g−1), and ρ w a t e r is the density of water (1 g cm−3). All datasets were aggregated into 8-day timescales to build ML models.

2.2. Gridded Observations

For the gridded data, we selected meteorological data, which include SW, Ps, Tair, Pre, Wind, and VPD, from the Modern-Era Retrospective analysis for Research and Applications version 2 (MERRA-2) reanalysis. In addition, we calculated direct (Direct) and diffuse (Diff) shortwave radiation by combining the shortwave radiation from MERRA-2 with the satellite-based diffuse radiative fraction (DF) provided by Ryu et al. [41]:
D i r e c t = S W M E R R A - 2 × ( 1 D F )
D i f f = S W M E R R A - 2 × D F
We also calculated site-level direct and diffuse SW using the same methods. Atmospheric carbon dioxide (CO2) concentration data were obtained from measurements at the Mauna Loa Observatory. Global annual land cover data were obtained from the MCD12C1 product of the MODIS retrieval. All the gridded meteorological data were downscaled to a spatial resolution of 0.05° and an 8-day temporal resolution for the period of 2001–2020 on a global scale.
To evaluate the derived gridded GPP and ET datasets, we used the products from MODIS [42,43], PML-V2 [5], and X-BASE [44] as the basis for comparisons. Based on the radiation-use efficiency concept, MODIS GPP (MOD17A2H) was developed combining the Leaf Area Index (LAI) and Fraction of absorbed Photosynthetically Active Radiation (FPAR) with meteorological data. MODIS ET (MOD16A2) was estimated using an improved ET algorithm [45] based on the Penman–Monteith equation, which considered soil evaporation, canopy intercepted evaporation, and plant transpiration. PML-V2 GPP and ET were estimated using the PML-V2 model, which incorporates water–carbon coupled canopy conductance and CO2 fertilization and were driven by meteorological and remote sensing data. X-BASE GPP and ET were upscaled from eddy covariance flux data using a combination of several ML algorithms and meteorological forcing data [44].

2.3. SIF Products

We selected three recently developed SIF products, including CSIF [46], GOSIF [47] and RTSIF [30], as model inputs. Specifically, the CSIF product was obtained by combining neural network models with MODIS-based surface reflectance and Orbiting Carbon Observatory-2 (OCO-2) SIF observations [46]. GOSIF was developed using a data-driven (i.e., ML) approach combining discrete OCO-2 SIF soundings, remote sensing data from MODIS, and meteorological reanalysis over the period 2001–2023 [47]. RTSIF was constructed using the XGB algorithm based on TROPOMI SIF [30], the nadir bidirectional reflectance distribution function adjusted reflectance [48], land surface temperature, photosynthetically active radiation, and land cover. The CSIF datasets were aggregated into a high spatial resolution of 0.05° and 8-day temporal resolution to match the GOSIF and RTSIF datasets.

2.4. Machine Learning Algorithms

We used two ML algorithms, eXtreme Gradient Boosting (XGB) and Random Forest (RF), to develop data-driven models. The XGB algorithm is an updated version of the Gradient Boosted Decision Tree (GBDT) algorithm [49] and can build augmented trees to handle complex nonlinear relationships between inputs and outputs. In addition, the XGB algorithm adds regularization terms to the loss function, effectively preventing overfitting during the training process [50]. The RF is an integrative algorithm that randomly samples features during the training of each tree, rather than using all features. This randomness increases model diversity and reduces over-reliance on the same features for each tree [51]. Both the XGB and RF algorithms are based on decision tree models and can offer excellent accuracy and efficient operation on large datasets without overfitting [52,53,54]. Due to the feature selection process, decision trees are inherently resistant to the covariance between input variables. Specifically, when two input features are highly correlated, the decision tree model will select only one of them as a splitting feature during the decision process [55]. Recently, the XGB model has been used to identify the main climatic drivers of GPP variability at global FLUXNET sites and to quantify reasonable responses of GPP to various climatic factors such as VPD [56]. Similarly, RF models have been used to estimate regional GPP for rice–wheat rotation croplands, showing better performance than traditional methods [57,58].
We constructed site-level ML models for each PFT using available observations (Figure 1), including nine inputs (SIF and climatic drivers) and two outputs (GPP or ET). For each output (GPP or ET), a total of sixty models were built using one of the two ML algorithms (XGB or RF) by combining one out of three SIF products (CSIF, GOSIF, or RTSIF) with site-level observations across ten PFTs. Observations from all sites within each PFT were randomly divided into training (80%) and validation (20%) datasets to develop PFT-dependent models. During the training process, we optimized model hyperparameters using cross-validation and hyperparameter tuning to identify the optimal parameter combinations. Specifically, the hyperparameters were set to several candidate values, and different combinations were sequentially tested using five-fold cross-validation. The optimal combination of hyperparameters was then selected based on the results of the cross-validation. For validation, we used the remaining 20% of site-level observations to evaluate the performance of our data-driven models in deriving GPP and ET. Thereafter, based on year-to-year dominant PFTs from MODIS for each grid cell, we extended data-driven models from site level to global scales using gridded meteorology from the MERRA-2 reanalysis and the three SIF products by applying the corresponding PFT-specific ML models at individual grids. Thus, we generated six GPP, ET, and WUE products (2 ML algorithms × 3 SIF datasets) at a spatial resolution of 0.05° and 8-day temporal resolution during 2001–2020 on the global scale.

2.5. Interpretation of Factor Contributions in Machine Learning

We used the Shapley Additive Explanation (SHAP) method to interpret the output of machine learning models. SHAP values, based on cooperative game theory, quantify the contribution of each input variable to the predicted value [59]. The basic principle of SHAP is to calculate the marginal contribution of a feature, i.e., the effect of a feature on the prediction results when it is added to a combination of existing features. This process requires considering all possible combinations of features. For model f and feature set S with feature i, the Shapley value is calculated as follows:
ϕ i = S N \ i S ! N S 1 ! N ! f S i x S i f S x S
where N is the set of all features, |S| is the size of S, S N \ i represents all subsets that do not contain feature i, and xS represents the values of the input features in the set S. When calculating the Shapley value ϕ i for each feature, all possible permutations of the features are considered with S ! N S 1 ! N ! . For each permutation, the difference in model output before and after the addition of the particular feature, f S i x S i f S x S , is calculated. The marginal contributions of the particular feature across all permutations are averaged to obtain the Shapley value, representing the average contribution of the feature to the prediction results.
The SHAP model has been widely used to interpret the factor contributions of XGB and RF models [60,61,62,63,64]. Specifically, Zhou et al. [56] combined the SHAP method with XGB models to quantify the contributions of individual climatic factors and their interactions to GPP variability at different time scales and across various PFTs at the global FLUXNET sites. In this study, the mean absolute Shapley value of each feature was normalized to quantify the relative contributions of individual features to GPP and ET.

3. Results

3.1. Relationships Between SIF and Ecosystem Fluxes

We examined the relationships between 8-day SIF and GPP, as well as between SIF and ET, at global FLUXNET sites (Figure S2 and Table 1). All SIF products showed positive correlations with the observed GPP, with correlation coefficients (r) of 0.78, 0.7, and 0.69 (p < 0.05) for CSIF, GOSIF, and RTSIF, respectively. The slopes (g C m2 day−1/m W m−2 nm−1 sr−1) were 21.68 (23.02), 21.82 (26.63), and 12.35 (14.86) with (without) intercepts for the three SIF products (Figure S2a–c), respectively. Among various PFTs, the SIF-GPP relationships showed positive correlations from 0.59 to 0.91 for CSIF, 0.21 to 0.9 for GOSIF, and 0.19 to 0.9 for RTSIF products (Table 1), with the highest correlations observed in mixed forest (MF) and the lowest in evergreen broadleaf forest (EBF). For forests, excluding EBF due to its weaker relevance [28], the SIF-GPP correlations were higher (0.73 to 0.90) compared to non-forest ecosystems (0.61 to 0.79) for all SIF products. Additionally, the SIF-GPP slopes were higher for forests, ranging from 19.59 to 21.07 g C m2 day−1 per unit SIF, indicating the stronger reabsorption of SIF by more complex canopy structures [30,65].
In comparison, the SIF-ET relationships showed lower correlations, with an r of 0.59–0.67 (p < 0.05) across the three SIF products (Figure S2d–f). The linear slopes of SIF-ET (mm hr−1/m W m−2 nm−1 sr−1) were 0.26, 0.27, and 0.15 for CSIF, GOSIF, and RTSIF, respectively. Among the various PFTs, positive SIF-ET correlations were found, ranging from 0.51 to 0.75 for CSIF, 0.41 to 0.77 for GOSIF, and 0.4 to 0.75 for RTSIF. The lowest correlations were found for EBF and wetland (WET) sites (Table 1). The linear slopes of SIF-ET ranged from 0.21 to 0.51 for CSIF, 0.19 to 0.52 for GOSIF, and 0.12–0.25 for RTSIF across various PFTs, with the highest slopes observed in woody savanna (WSA, Table 1). Forest ecosystems showed higher SIF-ET correlations (0.63–0.76) and slopes (0.21–0.23) compared to non-forest ecosystems (r: 0.5–0.7; slope: 0.19–0.43), though these differences were less pronounced than those observed for the SIF-GPP relationships.

3.2. Site-Level Validations of ML Models Across Various PFTs

For each PFT, we constructed six data-driven models using 80% of the total data samples for all FLUXNET sites with the same PFT classification (see Section 2). We validated the derived GPP and ET averaged for the six (2 ML × 3 SIF) models using the remaining 20% of observations (Figure 2 and Figure 3). The GPP derived by six data-driven models showed a Nash–Sutcliffe Efficiency (NSE) of 0.82 ± 0.11 and relative mean biases (RMB) of 0.34 ± 1.91% against observations average for various PFTs (Figure 2). The highest NSE value of 0.91 was found for mixed forest (MF), while the lowest value of 0.54 was found at cropland (CRO), potentially due to missing human disturbances in ML models such as irrigation and fertilizer management [66,67]. The largest bias of 4.55% was found for savanna (SAV), and the lowest bias was −0.09% at EBF sites. For ET, the data-driven models also exhibited a good performance, with an average NSE of 0.8 ± 0.07 and an RMB of −0.33 ± 1.68% across all PFTs (Figure 3). Among different PFTs, the highest NSE value of 0.87 was found for EBF, while the lowest NSE of 0.69 was found at GRA sites. The largest bias was 3.14% at SAV sites, while the smallest bias was 0.33% for EBF.
Data-driven models showed slightly varied performances in estimating GPP and ET across two algorithms and three SIF products (Table 2).
For GPP, the highest NSE value was 0.83 ± 0.1 for the CSIF-RF model, while the lowest NSE value was 0.81 ± 0.12 for the GOSIF-XGB model. The lowest RMB was −0.14 ± 1.69% for the RTSIF-XGB model, and the highest bias was 0.69 ± 2.35% for the GOSIF-RF model. Across various PFTs, all six data-driven models showed the lowest NSE values of 0.51–0.57 for CRO sites and 0.69–0.8 for ENF sites. In contrast, high NSE values were found for MF (0.91–0.92) and SH (0.89–0.92) sites, with low biases of −0.34%–0.32% for EBF and −0.91%–0.34% for MF. For ET, the highest NSE was 0.82 ± 0.05 for the CSIF-RF model, while the lowest NSE value was 0.73 ± 0.2 for the RTSIF-XGB model. The lowest RMB was 0.04 ± 1.82% for the GOSIF-XGB model, and the highest RMB was −0.58 ± 1.68% for the RTSIF-RF model.
Based on the SHAP values, we further explored the contributions of SIF and other factors to the variability of site-level GPP and ET using either XGB (Figure 4) or RF (Figure S3) models. For both algorithms, SIF showed the highest contributions to GPP variability, averaging 24%–49% across most PFTs excluding EBF (16%), where the correlations between SIF and GPP (or ET) were usually low (Table 1). However, Tair acted as the dominant contributor, with 30%–31% for DBF, ENF, and WET. Diffuse radiation and VPD contributed to GPP variability by 2–12% and 5–11% across various PFTs, respectively, while other factors had no significant impacts. For ET variability, SIF showed lower mean absolute SHAP values, up to 10–42%, compared to GPP variability (Figure 4 and Figure S3). Following SIF contributions, VPD and Tair influenced ET variability by 10–38% and 6–16%, respectively, among various PFTs, indicating the significant impacts of water and heat conditions on soil evaporation and plant transpiration. Other factors, such as diffuse radiation and CO2, showed limited impacts on ET variability across the three SIF products and two algorithms. Overall, SIF showed tight relationships and dominant roles in GPP and ET variability at site-level scales. This indicates a high potential and advantage in deriving GPP and ET simultaneously by taking into account SIF and associated meteorology in the ML models.

3.3. Spatiotemporal Variations in ECSIF-Based GPP and ET

We further applied PFT-dependent models to construct the ECSIF_GPP and ECSIF_ET datasets on a global scale and compared the spatiotemporal variations in average GPP and ET from six ECSIF products with the ensemble of satellite-based products (Figure 5 and Figure 6). On the global scale, high GPP (>2000 gC m−2 yr−1) and ET (>1000 mm yr−1) values were found over the tropical regions such as the Amazon, Central Africa, and Southeastern Asia (Figure 5a,d). In contrast, low GPP (<1000 gC m−2 yr−1) and ET (500 < mm yr−1) values appeared at mid–high latitudes (e.g., boreal Asia), consistent with the observed spatial pattern (Figure 5b,e). Compared to the ensemble products, ECSIF_GPP showed a correlation of 0.93 and bias of 10.33% globally, with positive differences over the Amazon and Central Africa (Figure 5c). Similarly, ECSIF_ET exhibited a correlation of 0.94 and bias of 2.79% against observations (Figure 5f). The highest r for GPP (ET) was achieved up to 0.94 (0.92) with the CSIF-RF model (Table 3), and RF models showed better performance in both r and RMB than XGB-based models.
On the global scale, ECSIF products yielded a total GPP of 128 ± 2.3 Pg C yr−1 (mean ± standard deviation) and area-weighed ET of 522 ± 58.2 mm yr−1 during 2001–2020. These values are within the range of 106–145.8 Pg C yr−1 for GPP and 417–560 mm yr−1 for ET as estimated by different studies based on varied methods and data sources (Table 4). Among the six sets of ECSIF products, the highest (lowest) GPP estimates were 130.8 (124.8) Pg C yr−1 for the GOSIF-RF (CSIF-XGB) model (Table 3). The highest and lowest ET estimates were 582.4 mm yr−1 for CSIF-XGB and 460.4 mm yr−1 for CSIF-RF models, respectively. Regionally, the total GPP and area-weighed ET were 34.3 Pg C yr−1 and 931 mm yr−1 in South America, 34.4 Pg C yr−1 and 548 mm yr−1 in Asia, 25.7 Pg C yr−1 and 745 mm yr−1 in Africa, 17.4 Pg C yr−1 and 492 mm yr−1 in North America, 9.9 Pg C yr−1 and 465 mm yr−1 in Europe, and 6.1 Pg C yr−1 and 481 mm yr−1 in Oceania.
For trends, ECSIF_GPP showed annual trends of 0.22 ± 0.04 Pg C yr−2, mainly resulting from positive trends over eastern China, India, and Europe (Figure 6a). This positive trend was similar to the 0.35 ± 0.11 Pg C yr−2 observed in ensemble GPP products (Figure 6b), though the ECSIF_GPP slightly underestimated the positive GPP trends in boreal regions (Figure 6c). Meanwhile, ECSIF_ET exhibited a positive trend of 0.64 ± 0.14 mm yr−2, with significant spatial heterogeneity, primarily driven by positive changes over eastern China and negative trends over Central Africa (Figure 6d). This trend was lower than that of 1.73 ± 0.64 mm yr−2 in the ensemble ET products (Figure 6e). The largest differences were found over South America and Central Africa (Figure 6f), potentially due to limitations in training sites in the southern hemisphere. For different ML algorithms, ECSIF showed higher trends of 0.16–0.26 Pg C yr−2 for GPP and 0.71–0.87 mm yr−2 for ET with XGB, as compared to 0.19–0.24 Pg C yr−2 for GPP and 0.45–0.58 mm yr−2 for ET with the RF algorithms (Table 3).

3.4. Spatiotemporal Variations in Global WUE

We further explored spatiotemporal variations in WUE (=GPP/ET) for ECSIF products on the global scale (Figure 7). High WUE (>2 g C kg−1 H2O yr−1) was found mainly in forest regions, including tropical rainforest (e.g., Amazon and Central Africa) and temperate forest (e.g., Europe and Eastern US). The low WUE appeared in the dry (e.g., Middle East, Western U.S., and Australia) and/or cold regions (e.g., boreal North America and Eurasia) (Figure 7a). This pattern of WUE was similar to satellite-based observations (Figure 7b), though the ECSIF product estimated lower WUE at the high latitudes (Figure 7c). On the global scale, ECSIF products yielded an area-weighted WUE of 1.8 ± 0.207 g C kg−1 H2O yr−1 (mean ± standard deviation) during 2001–2020, close to 1.63 ± 0.118 g C kg−1 H2O yr−1 from ensemble products. Meanwhile, ECSIF products showed positive trends of 0.0019 ± 0.00048 g C kg−1 H2O yr−2 in WUE, with regional hotspots over Eastern China, India, Europe, Sahel, and northwestern US (Figure 7d), where cropland is usually located [78]. However, the ensemble satellite-based products revealed positive trends close to 0 (0± 0.00206 g C kg−1 H2O yr−2) for WUE (Figure 7e), which differed from the trends of 0.0008 g C kg−1 H2O yr−2 in PML-V2, −0.0028 g C kg−1 H2O yr−2 in MODIS, and 0.002 g C kg−1 H2O yr−2 in X-BASE. The trends in WUE for ECSIF and X-BASE demonstrate a high degree of spatial coherence, particularly in the Northern Hemisphere, South America, and Australia (Figure S4).
Semi-mechanistic products showed larger differences globally, likely due to the varying models used in the two products. MODIS WUE displayed significant decreasing trends in Africa, Australia, and the Middle East, primarily because the positive trends in ET exceeded the positive trends in GPP in these regions. This imbalance resulted in a negative trend in global WUE (Figure S5a–c). In contrast, PML WUE showed a broader decline across the tropics, driven by higher ET trends relative to GPP at low latitudes, along with severe GPP declines in parts of Africa and South America. These declines were offset by significant increases in WUE in much of the Northern Hemisphere, including Europe, Central Asia, and parts of North America, ultimately resulting in a slight positive trend in global WUE for PML (Figure S5d–f). The most pronounced differences were observed in tropical regions, attributed to the strong positive ET trends derived from ensemble products (Figure 6e). Therefore, the accurate estimate of GPP and ET is of great importance to better understand ecosystem carbon and water fluxes and their coupling.
For different PFTs, forests exhibited a larger WUE (2.73 g C kg−1 H2O yr−1) than non-forests (1.58 g C kg−1 H2O yr−1). The largest WUE of 3.14 g C kg−1 H2O yr−1 was observed in DBF, while the lowest was 1.27 g C kg−1 H2O yr−1 in GRA. Croplands showed an intermediate WUE of 2.19 g C kg−1 H2O yr−1. Similarly, the trend for forests (0.0018 g C kg−1 H2O yr−2) was higher than that for non-forests (0.0008 g C kg−1 H2O yr−2). Within forests, DNF, ENF, and MF all showed significant positive trends ranging from 0.0017 to 0.0048 g C kg−1 H2O yr−2. In contrast, DBF and EBF showed insignificant trends of 0.0002 g C kg−1 H2O yr−2 and −0.0004 g C kg−1 H2O yr−2, respectively. For non-forests, CRO and GRA showed positive trends of 0.0042 g C kg−1 H2O yr−2 and 0.0037 g C kg−1 H2O yr−2, respectively, while the others showed no significant trends (Table S2).

4. Discussion

Our results indicate that site-level SIF showed a strong agreement with flux tower GPP at global sites (Figure S2a–c). However, the SIF-ET correlations were consistently lower than the SIF-GPP relationships (Figure S2d–f). This difference was likely attributed to the closer physiological link between SIF and GPP compared to SIF and ET at the canopy/leaf scale [79,80]. Theoretically, changes in SIF directly reflect plant photosynthesis due to the immediate energy flux emitted during the light reaction [81,82]. Thus, increases (or decreases) in SIF correspond to positive (or negative) changes in plant photosynthesis at the canopy scale [83]. Linked by the electron transport rate (J), the SIF-GPP relationship can be broken into two components: SIF–J and J–GPP. The SIF-J relationship, governed by energy allocation in the light reaction, can be modeled using the Mechanistic Light Response (MLR) model [84]. Meanwhile, the J-GPP relationship is linked to carbon assimilation during the Calvin cycle and aligns with the Eco-Evolutionary Optimality (EEO) theory [85], which states that stomata maximize carbon gain while minimizing water loss [86]. This highlights SIF’s key role in providing physiological insights for estimating both GPP and ET. The SHAP analysis further revealed that SIF accounted for 24–49% of GPP variability across all PFTs (excluding EBF), a significantly higher contribution than any climatic factors (Figure 4 and Figure S3). The high contribution of SIF is likely because it is the only vegetation state variable included in the predictor set, capturing all effects associated with canopy structure and APARchl variability. In contrast, SIF had a weaker influence on terrestrial ET, primarily through its role in linking photosynthesis and stomatal transpiration [39]. Increases in SIF enhance transpirations via stomatal opening, which is driven by photosynthesis [35,87,88]. However, soil and canopy evaporation, which are substantial components of ET, are more strongly controlled by environmental factors like temperature and VPD rather than photosynthesis [89]. For example, Lu et al. [35] showed that correlations between ET and SIF could be influenced by VPD, air temperature, and radiation at the site level. Consistent with this, our SHAP results indicate that temperature, rather than SIF, was the dominant driver of ET variability across most PFTs (Figure 4 and Figure S3). These findings highlight the importance of incorporating both SIF and meteorological factors when estimating ET.
The derived ECSIF datasets presented spatial patterns and temporal trends in global GPP, ET, and WUE consistent with observations (Figure 5, Figure 6, Figure 7, Figures S4, S6 and S7). Previous studies reported global GPP estimates of 106–145.8 Pg C yr−1 and ET estimates of 417–574 mm yr−1 across various periods and resolutions. These values align with the 124.8–130.8 Pg C yr−1 for GPP and 460.4–583.8 mm yr−1 for ET obtained from six ECSIF datasets (Table 4). Meanwhile, ECSIF products showed 1.55–2.03 g C kg−1 H2O yr−1 of global WUE during 2001–2020, falling within the range of 1.53–1.96 g C kg−1 H2O yr−1 from previous estimates [22,44,90,91]. In terms of temporal trends, ECSIF products showed significant trends of 0.16–0.26 Pg C yr−2 for global GPP during 2001–2020. These values are close to the 0.2–0.46 Pg C yr−2 reported Zhang et al. [5], Running et al. [42], Guo et al. [58], and [44] for the similar periods. However, global trends in ET showed large differences among various studies [92]. ECSIF products showed positive trends of 0.45–0.87 mm yr−2 of global ET during 2001–2020, which were similar to 1.09 mm yr−2 caused by Nelson et al. [44] for the period 2001–2020. These positive values were much lower than the value of 2 mm yr−2 from PML-V2 datasets for similar periods [5]. Additionally, Jiang and Ryu [16] found negative global ET trends up to −0.65 mm yr−2 during 2001–2011, while other studies indicated no significant trends [76,93]. For WUE, ECSIF products showed positive trends of 0.0001–0.0024 g C kg−1 H2O yr−2 during 2001–2020, similar to 0.0025 g C kg−1 H2O yr−2 and 0.002 g C kg−1 H2O yr−2 as estimated by Xue et al. [90] and Nelson, et al. [44]. Furthermore, previous studies revealed substantial increases in WUE from regional [94,95] and global [96] scales, confirming the reasonableness of our derived positive trends of WUE. Meanwhile, we also compared the performances of GPP and ET among ECSIF and FLUXCOM X-BASE with global FLUXNET sites (Figure S6), and we found ECSIF showed NSE (bias) by 0.51 (−16.12%) and 0.62 (3.2%) with site-level GPP and ET, close to FLUXCOM X-BASE products. Overall, ECSIF products showed good performance in capturing the magnitudes and trends in GPP, ET, and WUE compared to ground and satellite-based observations and other studies. Considering uncertainties in various ECSIF products, we suggested that the average of six ECSIF products be used as benchmark GPP and ET with lowest uncertainties.
Previous studies have found that rising atmospheric CO2 concentrations could enhance plant photosynthesis, driving the trend in global GPP upward through the CO2 fertilization effect [97,98,99]. The coupling effects of leaf stomata and plant transpiration further amplify this process, as increased greening leads to higher global ET due to the expanded leaf area [92,100]. However, increases in plant photosynthesis typically outpace those in ET [101], resulting in an overall increase in WUE attributed to the CO2 fertilization effect. Meanwhile, elevated CO2 concentrations can partially inhibit leaf stomata, further enhancing WUE [95]. Variations in GPP were often influenced by surface radiation and temperature, particularly at the mid–high latitudes [98], while ET variations were mainly controlled by water exchange processes, such as precipitation [92,102]. Our results show that WUE trends were positively correlated with GPP enhancements driven by rising temperatures and negatively correlated with ET responses to precipitation variations (Figure 7d and Figure S8c,d). Similar findings have been reported by Xue et al. [90]. As a result, changes in WUE were determined by the competitions between regional GPP and ET, with GPP increases dominating WUE enhancements in most areas.
Nevertheless, there are some limitations and uncertainties in this study. First, the selection of SIF datasets may influence GPP and ET estimates of ECSIF products. For example, GOSIF-based datasets showed higher GPP and ET magnitudes and trends than CSIF- and RTSIF-based datasets (Table 3), due to differences among various SIF products. In addition, the time spans of SIF datasets limited the temporal extensions of ECSIF products, such as CSIF (2001–2020) and RTSIF (2001–2020). Based on that, we plan to update the annual ECSIF_GPP and ECSIF_ET datasets in the future, considering the continuously updated GOSIF dataset. Second, we selected only XGB and RF algorithms as the data-driven models to derive ECSIF products. Our analyses showed that the RF model caused larger biases than the XGB algorithm (Table 3). Other machine (or deep) learning algorithms may potentially perform better than the current ECSIF datasets. Third, the time spans and spatial unevenness of global FLUXNET sites may influence the accuracy of the upscaled GPP and ET estimates [21], as well as the effectiveness of ECSIF datasets. For example, the data used for training in this paper are only available up to 2014, and over 86% of the sites analyzed in this study were located in the Northern Hemisphere, with 64% in America and Europe. Therefore, more evenly distributed flux tower sites are needed to improve the accuracy of ML models in future studies. Fourth, uncertainties in input datasets (e.g., site-level GPP, CO2 products, and land use data) may introduce biases into the derived GPP and ET estimates from the XGB and RF models. We examined the effects of these datasets on global-scale GPP and ET (Table S3–S5). Using site-level GPP data derived from daytime and nighttime methods, we found similar magnitudes and trends for global GPP, ET and WUE as the original estimates (Table S3). In addition, we used heterogeneous CO2 datasets [103] to estimate global GPP, ET, and WUE, yielding values of 129 Pg C yr−1, 471 mm yr−1, and 1.95 g C kg−1 H2O yr−1, respectively, which are close to our estimates using unified CO2 datasets (Table S4). Finally, we assessed the impact of land use datasets on derived GPP, ET, and WUE, finding small biases of −1.28%, −1.61%, and −0.3%, respectively, compared to our estimates derived from dominant land cover datasets (Table S5).
Despite these limitations, our derived ECSIF products showed several advantages compared to previous GPP or/and ET products and provide a reliable tool to explore WUE: (1) Comprehensive considerations of plant physiology and environmental constraints. The ECSIF products fully accounted for the effects of plant physiology (e.g., SIF) and environmental changes (e.g., diffuse radiation), which were usually ignored in previous studies especially for ET products [104,105]. Zhou et al. [56] found that diffuse radiation dominated 30% of site-level GPP variability at the sub-daily scales. Wang et al. [106] also found that diffuse radiation significantly contributed to ET variability due to changes in transpiration. Our results confirmed that diffuse radiation can contribute significantly, both up to 8%to GPP and ET variability (Figure 4 and Figure S3), comparable to the contributions of VPD (8% and 10%). (2) Simultaneous derivation of GPP and ET. Previous terrestrial products typically included only GPP or ET [5], with very few studies capable of deriving both GPP and ET on a global scale. In this study, ECSIF products included six combined datasets of global GPP and ET using three SIF datasets and two ML algorithms to minimize the uncertainties from single methods or datasets. These datasets are expected to be useful for exploring water and carbon fluxes and their interactions such as WUE at regional and global scales in the future. (3) High spatiotemporal resolution. ECSIF estimated global GPP, ET, and WUE at a spatial resolution of ~5km and an 8-day temporal resolution. As a result, ECSIF products could serve as benchmarks for model validations and data explorations at a very fine resolution.

5. Conclusions

In this study, we developed the ECSIF-based product with six datasets of GPP, ET, and WUE at 0.05° spatial resolution and 8-day temporal resolution during 2001–2020. These datasets were based on three SIF products and two machine learning algorithms. First, we examined the relationships of SIF and GPP, as well as SIF and ET, at global FLUXNET sites. In general, positive correlations were found between SIF and these flux variables across most PFTs. We then created six ML models for each of ten PFTs, based on the cross-combination of two ML algorithms and three SIF datasets. Validations of these models showed that the derived GPP and ET values had high correlations and low biases when compared to site-level observations. Using SHAP methods, we confirmed that SIF played a dominant role in predicting site-level GPP and ET. Finally, we generated global ECSIF_GPP and ECSIF_ET datasets by applying the ML models to gridded measurements and found significant increases in global GPP, ET, and WUE during 2001–2020. The causes and consequences of these profound changes in global carbon and water fluxes deserve further investigations in future studies. Moreover, we consider using more spatially distributed site datasets with longer temporal coverage, SIF data observed from satellite directly, and different machine learning algorithms such as deep learning to derive more accurate, further spatiotemporally spanning, and higher-resolution global GPP and ET.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/rs17122064/s1. Figure S1: Global distribution of FLUXNET sites; Figure S2: The relationships between Solar-Induced chlorophyll Fluorescence (SIF) and (a–c) gross primary productivity (GPP) or (d–f) evapotranspiration (ET) at the global FLUXNET sites; Figure S3: The same with Figure 4 but for the XGB model; Figure S4: Long-term (a) mean and (d) trend of water use efficiency (WUE) derived with ECSIF and their comparisons with the (b,e) X-BASE products during 2001–2020; Figure S5: Long-term trend of GPP, ET and WUE from MODIS (a–c) and PML (d–f) during 2001–2020; Figure S6 Comparisons of GPP and ET for ECSIF and FLUXCOM X-BASE products with global FLUXNET sites; Figure S7: Long-term mean of GPP and ET derived with ECSIF and their comparisons with the X-BASE products during 2001–2020; Figure S8: Long-term trend of (a) GPP and (d) ET derived with ECSIF and their comparisons with the (b,e) X-BASE products during 2001–2020; Figure S9: Long-term trend of SIF, meteorology and carbon dioxide used as inputs during 2001–2020; Table S1: Summary of global FLUXNET sites; Table S2: WUE across different PFTs; Table S3: Differences using site-level GPP derived by various methods; Table S4: Differences in GPP, ET and WUE due to various CO2 products; Table S5: Differences in using dominate PFT and the percentage of PFT in 2001.

Author Contributions

Conceptualization, H.Z. and X.Y.; methodology, H.Z. and X.Y.; validation, J.Z. and X.L. (Xichuan Liu) and X.L. (Xing Li); formal analysis, J.Z., H.Z., Z.X., J.X., J.W. and F.Z.; data curation, J.Z., H.Z. and J.X.; writing—original draft preparation, J.Z. and H.Z.; writing—review and editing, H.Z., X.Y., J.X. and F.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Key Research and Development Program of China (Grant No. 2023YFF0805402) and the National Natural Science Foundation of China (Grant No. 42275128). Jingfeng Xiao was supported by University of New Hampshire (via bridge support) and Iola Hubbard Climate Change Endowment.

Data Availability Statement

SIF products include CSIF, GOSIF, and RTSIF: CSIF datasets are available via 10.17605/OSF.IO/8XQY6 (accessed on 17 May 2024), GOSIF datasets can be downloaded through http://globalecology.unh.edu (accessed on 17 May 2024), and RTSIF datasets are obtained publicly from https://doi.org/10.6084/m9.figshare.19336346.v2 (accessed on 17 May 2024). FLUXNET data are available from https://fluxnet.org/data/fluxnet2015-dataset/ (accessed on 10 May 2024). Gridded meteorological datasets are downloaded from https://gmao.gsfc.nasa.gov/reanalysis/MERRA-2/ (accessed on 10 May 2024). Diffuse and total photosynthetically active radiation are available through http://environment.snu.ac.kr/ (accessed on 20 December 2023). Carbon dioxide (CO2) is downloaded from ground-based measurements at the Mauna Loa Observatory (https://gml.noaa.gov/) (accessed on 20 June 2024). Annual global land cover data are from the MCD12C1 product of the MODIS retrieval (https://ladsweb.modaps.eosdis.nasa.gov/) (accessed on 10 May 2024). Global GPP and ET datasets of MODIS and PML_V2 are available through Google Earth Engine (GEE) systems. All ECSIF products at 0.5° are publicly available at https://figshare.com/articles/dataset/ECSIF_GPP_ET/28079052 (accessed on 11 June 2025), and ECSIF products at 0.05° are publicly available at https://figshare.com/articles/dataset/ECSIF_GPP_ET_0_05/28270599 (accessed on 11 June 2025) and https://figshare.com/articles/dataset/ECSIF_ET_0_05/28425025 (accessed on 11 June 2025).

Acknowledgments

The authors thank for helpful suggestions caused by anonymous reviewers and editors.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

  1. Zhang, Z.; Chen, J.M.; Guanter, L.; He, L.; Zhang, Y. From Canopy-Leaving to Total Canopy Far-Red Fluorescence Emission for Remote Sensing of Photosynthesis: First Results From TROPOMI. Geophys. Res. Lett. 2019, 46, 12030–12040. [Google Scholar] [CrossRef]
  2. Chen, A.; Mao, J.; Ricciuto, D.; Lu, D.; Xiao, J.; Li, X.; Thornton, P.E.; Knapp, A.K. Seasonal changes in GPP/SIF ratios and their climatic determinants across the Northern Hemisphere. Glob. Change Biol. 2021, 27, 5186–5197. [Google Scholar] [CrossRef] [PubMed]
  3. Yu, L.; Wen, J.; Chang, C.Y.; Frankenberg, C.; Sun, Y. High Resolution Global Contiguous SIF Estimates from OCO-2 SIF and MODIS, Version 2; ORNL DAAC: Oak Ridge, TN, USA, 2021. [CrossRef]
  4. Zhang, Q.; Liu, X.; Zhou, K.; Zhou, Y.; Gentine, P.; Pan, M.; Katul, G.G. Solar-induced chlorophyll fluorescence sheds light on global evapotranspiration. Remote Sens. Environ. 2024, 305, 114061. [Google Scholar] [CrossRef]
  5. Zhang, Y.; Kong, D.; Gan, R.; Chiew, F.H.S.; McVicar, T.R.; Zhang, Q.; Yang, Y. Coupled estimation of 500 m and 8-day resolution global evapotranspiration and gross primary production in 2002–2017. Remote Sens 2019, 222, 165–182. [Google Scholar] [CrossRef]
  6. Zhao, L.; Xia, J.; Xu, C.-y.; Wang, Z.; Sobkowiak, L.; Long, C. Evapotranspiration estimation methods in hydrological models. J. Geogr. Sci. 2013, 23, 359–369. [Google Scholar] [CrossRef]
  7. Ryu, Y.; Baldocchi, D.D.; Kobayashi, H.; van Ingen, C.; Li, J.; Black, T.A.; Beringer, J.; van Gorsel, E.; Knohl, A.; Law, B.E.; et al. Integration of MODIS land and atmosphere products with a coupled-process model to estimate gross primary productivity and evapotranspiration from 1 km to global scales. Glob. Biogeochem. Cycles 2011, 25, 4. [Google Scholar] [CrossRef]
  8. Yuan, W.; Liu, S.; Yu, G.; Bonnefond, J.-M.; Chen, J.; Davis, K.; Desai, A.R.; Goldstein, A.H.; Gianelle, D.; Rossi, F.; et al. Global estimates of evapotranspiration and gross primary production based on MODIS and global meteorology data. Remote Sens. Environ. 2010, 114, 1416–1431. [Google Scholar] [CrossRef]
  9. Zhao, M.; Heinsch, F.A.; Nemani, R.R.; Running, S.W. Improvements of the MODIS terrestrial gross and net primary production global data set. Remote Sens. Environ. 2005, 95, 164–176. [Google Scholar] [CrossRef]
  10. Jung, M.; Reichstein, M.; Bondeau, A. Towards global empirical upscaling of FLUXNET eddy covariance observations: Validation of a model tree ensemble approach using a biosphere model. Biogeosci. 2009, 6, 2001–2013. [Google Scholar] [CrossRef]
  11. Jung, M.; Reichstein, M.; Margolis, H.A.; Cescatti, A.; Richardson, A.D.; Arain, M.A.; Arneth, A.; Bernhofer, C.; Bonal, D.; Chen, J.; et al. Global patterns of land-atmosphere fluxes of carbon dioxide, latent heat, and sensible heat derived from eddy covariance, satellite, and meteorological observations. J. Geophys. Res. Biogeosci. 2011, 116, G3. [Google Scholar] [CrossRef]
  12. Xiao, J.; Zhuang, Q.; Law, B.E.; Chen, J.; Baldocchi, D.D.; Cook, D.R.; Oren, R.; Richardson, A.D.; Wharton, S.; Ma, S.; et al. A continuous measure of gross primary production for the conterminous United States derived from MODIS and AmeriFlux data. Remote Sens. Environ. 2010, 114, 576–591. [Google Scholar] [CrossRef]
  13. Yu, T.; Zhang, Q.; Sun, R. Comparison of Machine Learning Methods to Up-Scale Gross Primary Production. Remote Sens. 2021, 13, 2448. [Google Scholar] [CrossRef]
  14. Yu, R.; Yao, Y.; Tang, Q.; Shao, C.; Fisher, J.B.; Chen, J.; Jia, K.; Zhang, X.; Li, Y.; Shang, K.; et al. Coupling a light use efficiency model with a machine learning-based water constraint for predicting grassland gross primary production. Agric. For. Meteorol. 2023, 341, 109634. [Google Scholar] [CrossRef]
  15. Liu, J.; Chen, J.M.; Cihlar, J.; Park, W.M. A process-based boreal ecosystem productivity simulator using remote sensing inputs. Remote Sens. Environ. 1997, 62, 158–175. [Google Scholar] [CrossRef]
  16. Jiang, C.; Ryu, Y. Multi-scale evaluation of global gross primary productivity and evapotranspiration products derived from Breathing Earth System Simulator (BESS). Remote Sens. Environ. 2016, 186, 528–547. [Google Scholar] [CrossRef]
  17. Bonan, G.B.; Lombardozzi, D.L.; Wieder, W.R.; Oleson, K.W.; Lawrence, D.M.; Hoffman, F.M.; Collier, N. Model Structure and Climate Data Uncertainty in Historical Simulations of the Terrestrial Carbon Cycle (1850–2014). Glob. Biogeochem. Cycles 2019, 33, 1310–1326. [Google Scholar] [CrossRef]
  18. Minaya, V.; Corzo, G.A.; Solomatine, D.P.; Mynett, A.E. Data-driven techniques for modelling the gross primary production of the páramo vegetation using climate data: Application in the Ecuadorian Andean region. Ecol. Inform. 2018, 43, 222–230. [Google Scholar] [CrossRef]
  19. Tramontana, G.; Jung, M.; Schwalm, C.R.; Ichii, K.; Camps-Valls, G.; Ráduly, B.; Reichstein, M.; Arain, M.A.; Cescatti, A.; Kiely, G.; et al. Predicting carbon dioxide and energy fluxes across global FLUXNET sites with. regression algorithms. Biogeosciences 2016, 13, 4291–4313. [Google Scholar] [CrossRef]
  20. Jung, M.; Reichstein, M.; Ciais, P.; Seneviratne, S.I.; Sheffield, J.; Goulden, M.L.; Bonan, G.; Cescatti, A.; Chen, J.; de Jeu, R.; et al. Recent decline in the global land evapotranspiration trend due to limited moisture supply. Nature 2010, 467, 951–954. [Google Scholar] [CrossRef]
  21. Xiao, J.; Ollinger, S.V.; Frolking, S.; Hurtt, G.C.; Hollinger, D.Y.; Davis, K.J.; Pan, Y.; Zhang, X.; Deng, F.; Chen, J.; et al. Data-driven diagnostics of terrestrial carbon dynamics over North America. Agric. For. Meteorol. 2014, 197, 142–157. [Google Scholar] [CrossRef]
  22. Li, F.; Xiao, J.; Chen, J.; Ballantyne, A.; Jin, K.; Li, B.; Abraha, M.; John, R. Global water use efficiency saturation due to increased vapor pressure deficit. Science 2023, 381, 672–677. [Google Scholar] [CrossRef]
  23. Feng, Q.; Shen, J.; Yang, F.; Liang, S.; Liu, J.; Kuang, X.; Wang, D.; Zeng, Z. Long-term gridded land evapotranspiration reconstruction using Deep Forest with high generalizability. Sci. Data 2023, 10, 908. [Google Scholar] [CrossRef] [PubMed]
  24. Mohammed, G.H.; Colombo, R.; Middleton, E.M.; Rascher, U.; van der Tol, C.; Nedbal, L.; Goulas, Y.; Pérez-Priego, O.; Damm, A.; Meroni, M.; et al. Remote sensing of solar-induced chlorophyll fluorescence (SIF) in vegetation: 50 years of progress. Remote Sens. Environ. 2019, 231, 111177. [Google Scholar] [CrossRef]
  25. Joiner, J.; Guanter, L.; Lindstrot, R.; Voigt, M.; Vasilkov, A.P.; Middleton, E.M.; Huemmrich, K.F.; Yoshida, Y.; Frankenberg, C. Global monitoring of terrestrial chlorophyll fluorescence from moderate-spectral-resolution near-infrared satellite measurements: Methodology, simulations, and application to GOME-2. Atmos. Meas. Tech. 2013, 6, 2803–2823. [Google Scholar] [CrossRef]
  26. Frankenberg, C.; O’Dell, C.; Berry, J.; Guanter, L.; Joiner, J.; Köhler, P.; Pollock, R.; Taylor, T.E. Prospects for chlorophyll fluorescence remote sensing from the Orbiting Carbon Observatory-2. Remote Sens. Environ. 2014, 147, 1–12. [Google Scholar] [CrossRef]
  27. Walther, S.; Voigt, M.; Thum, T.; Gonsamo, A.; Zhang, Y.; Köhler, P.; Jung, M.; Varlagin, A.; Guanter, L. Satellite chlorophyll fluorescence measurements reveal large-scale decoupling of photosynthesis and greenness dynamics in boreal evergreen forests. Glob. Change Biol. 2016, 22, 2979–2996. [Google Scholar] [CrossRef]
  28. Li, X.; Xiao, J.; He, B.; Altaf Arain, M.; Beringer, J.; Desai, A.R.; Emmel, C.; Hollinger, D.Y.; Krasnova, A.; Mammarella, I.; et al. Solar-induced chlorophyll fluorescence is strongly correlated with terrestrial photosynthesis for a wide variety of biomes: First global analysis based on OCO-2 and flux tower observations. Glob Chang Biol 2018, 24, 3990–4008. [Google Scholar] [CrossRef]
  29. Baker, N.R. Chlorophyll fluorescence: A probe of photosynthesis in vivo. Annu. Rev. Plant Biol. 2008, 59, 89–113. [Google Scholar] [CrossRef]
  30. Chen, X.; Huang, Y.; Nie, C.; Zhang, S.; Wang, G.; Chen, S.; Chen, Z. A long-term reconstructed TROPOMI solar-induced fluorescence dataset using machine learning algorithms. Sci. Data. 2022, 9, 427. [Google Scholar] [CrossRef]
  31. Garbulsky, M.F.; Filella, I.; Verger, A.; Peñuelas, J. Photosynthetic light use efficiency from satellite sensors: From global to Mediterranean vegetation. Environ. Exp. Bot. 2014, 103, 3–11. [Google Scholar] [CrossRef]
  32. Guanter, L.; Frankenberg, C.; Dudhia, A.; Lewis, P.E.; Gómez-Dans, J.; Kuze, A.; Suto, H.; Grainger, R.G. Retrieval and global assessment of terrestrial chlorophyll fluorescence from GOSAT space measurements. Remote Sens. Environ. 2012, 121, 236–251. [Google Scholar] [CrossRef]
  33. Joiner, J.; Yoshida, Y.; Vasilkov, A.P.; Yoshida, Y.; Corp, L.A.; Middleton, E.M. First observations of global and seasonal terrestrial chlorophyll fluorescence from space. Biogeosciences 2011, 8, 637–651. [Google Scholar] [CrossRef]
  34. Alemohammad, S.H.; Fang, B.; Konings, A.G.; Aires, F.; Green, J.K.; Kolassa, J.; Miralles, D.; Prigent, C.; Gentine, P. Water, Energy, and Carbon with Artificial Neural Networks (WECANN): A statistically based estimate of global surface turbulent fluxes and gross primary productivity using solar-induced fluorescence. Biogeosciences 2017, 14, 4101–4124. [Google Scholar] [CrossRef]
  35. Lu, X.; Liu, Z.; An, S.; Miralles, D.G.; Maes, W.; Liu, Y.; Tang, J. Potential of solar-induced chlorophyll fluorescence to estimate transpiration in a temperate forest. Agric. For. Meteorol. 2018, 252, 75–87. [Google Scholar] [CrossRef]
  36. Zhou, K.; Zhang, Q.; Xiong, L.; Gentine, P. Estimating evapotranspiration using remotely sensed solar-induced fluorescence measurements. Agric. For. Meteorol. 2022, 314. [Google Scholar] [CrossRef]
  37. Tong, B.; Guo, J.; Xu, H.; Wang, Y.; Li, H.; Bian, L.; Zhang, J.; Zhou, S. Effects of soil moisture, net radiation, and atmospheric vapor pressure deficit on surface evaporation fraction at a semi-arid grass site. Sci. Total Environ. 2022, 849, 157890. [Google Scholar] [CrossRef]
  38. Flo, V.; Martínez-Vilalta, J.; Granda, V.; Mencuccini, M.; Poyatos, R. Vapour pressure deficit is the main driver of tree canopy conductance across biomes. Agric. For. Meteorol. 2022, 322, 109029. [Google Scholar] [CrossRef]
  39. Shan, N.; Zhang, Y.; Chen, J.M.; Ju, W.; Migliavacca, M.; Peñuelas, J.; Yang, X.; Zhang, Z.; Nelson, J.A.; Goulas, Y. A model for estimating transpiration from remotely sensed solar-induced chlorophyll fluorescence. Remote Sens. Environ. 2021, 252, 112134. [Google Scholar] [CrossRef]
  40. Zhang, Y.; Zhao, W.; He, J.; Zhang, K. Energy exchange and evapotranspiration over irrigated seed maize agroecosystems in a desert-oasis region, northwest China. Agric. For. Meteorol. 2016, 223, 48–59. [Google Scholar] [CrossRef]
  41. Ryu, Y.; Jiang, C.; Kobayashi, H.; Detto, M. MODIS-derived global land products of shortwave radiation and diffuse and total photosynthetically active radiation at 5km resolution from 2000. Remote Sens. Environ. 2018, 204, 812–825. [Google Scholar] [CrossRef]
  42. Running, S.; Mu, Q.; Zhao, M.; Moreno, A. MODIS/Terra Gross Primary Productivity 8-Day L4 Global 500m SIN Grid V061; NASA EOSDIS Land Processes Distributed Active Archive Center: Sioux Falls, SD, USA, 2021. [CrossRef]
  43. Running, S.; Mu, Q.; Zhao, M.; Moreno, A. MODIS/Terra Net Evapotranspiration Gap-Filled 8-Day L4 Global 500m SIN Grid V061; NASA EOSDIS Land Processes Distributed Active Archive Center: Sioux Falls, SD, USA, 2021. [CrossRef]
  44. Nelson, J.A.; Walther, S.; Gans, F.; Kraft, B.; Weber, U.; Novick, K.; Buchmann, N.; Migliavacca, M.; Wohlfahrt, G.; Šigut, L.; et al. X-BASE: The first terrestrial carbon and water flux products from an extended data-driven scaling framework, FLUXCOM-X. EGUsphere 2024, 2024, 1–51. [Google Scholar] [CrossRef]
  45. Mu, Q.; Zhao, M.; Running, S.W. Improvements to a MODIS global terrestrial evapotranspiration algorithm. Remote Sens. Environ. 2011, 115, 1781–1800. [Google Scholar] [CrossRef]
  46. Zhang, Y.; Joiner, J.; Alemohammad, S.H.; Zhou, S.; Gentine, P. A global spatially contiguous solar-induced fluorescence (CSIF) dataset using neural networks. Biogeosciences 2018, 15, 5779–5800. [Google Scholar] [CrossRef]
  47. Li, X.; Xiao, J.F. A Global, 0.05-Degree Product of Solar-Induced Chlorophyll Fluorescence Derived from OCO-2, MODIS, and Reanalysis Data. Remote Sens 2019, 11, 517. [Google Scholar] [CrossRef]
  48. Roesch, A.; Schaaf, C.; Gao, F. Use of Moderate-Resolution Imaging Spectroradiometer bidirectional reflectance distribution function products to enhance simulated surface albedos. J. Geophys. Res. Atmos. 2004, 109, D12. [Google Scholar] [CrossRef]
  49. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; pp. 785–794. [Google Scholar]
  50. AlThuwaynee, O.F.; Kim, S.-W.; Najemaden, M.A.; Aydda, A.; Balogun, A.-L.; Fayyadh, M.M.; Park, H.-J. Demystifying uncertainty in PM10 susceptibility mapping using variable drop-off in extreme-gradient boosting (XGB) and random forest (RF) algorithms. Environ. Sci. Pollut. Res. 2021, 28, 43544–43566. [Google Scholar] [CrossRef]
  51. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  52. Wu, J.; Li, Y.; Ma, Y. Comparison of XGBoost and the Neural Network model on the class-balanced datasets. In Proceedings of the 2021 IEEE 3rd International Conference on Frontiers Technology of Information and Computer (ICFTIC), Greenville, SC, USA, 12–14 November 2021; IEEE: New York, NY, USA, 2021; pp. 457–461. [Google Scholar]
  53. Chen, W.; Fu, K.; Zuo, J.; Zheng, X.; Huang, T.; Ren, W. Radar emitter classification for large data set based on weighted-xgboost. IET Radar Sonar Navig. 2017, 11, 1203–1207. [Google Scholar] [CrossRef]
  54. Genuer, R.; Poggi, J.-M.; Tuleau-Malot, C.; Villa-Vialaneix, N. Random Forests for Big Data. Big Data Res. 2017, 9, 28–46. [Google Scholar] [CrossRef]
  55. Zhou, H.; Yue, X.; Dai, H.; Geng, G.; Yuan, W.; Chen, J.; Shen, G.; Zhang, T.; Zhu, J.; Liao, H. Recovery of ecosystem productivity in China due to the Clean Air Action plan. Nat. Geosci. 2024, 17, 1233–1239. [Google Scholar] [CrossRef]
  56. Zhou, H.; Yue, X.; Wang, B.; Tian, C.; Lu, X.; Zhu, J.; Cao, Y. Distinguishing the main climatic drivers to the variability of gross primary productivity at global FLUXNET sites. Environ. Res. Lett. 2023, 18, 124007. [Google Scholar] [CrossRef]
  57. Duan, Z.; Yang, Y.; Zhou, S.; Gao, Z.; Zong, L.; Fan, S.; Yin, J. Estimating Gross Primary Productivity (GPP) over Rice–Wheat-Rotation Croplands by Using the Random Forest Model and Eddy Covariance Measurements: Upscaling and Comparison with the MODIS Product. Remote Sens. 2021, 13, 4229. [Google Scholar] [CrossRef]
  58. Guo, R.; Chen, T.; Chen, X.; Yuan, W.; Liu, S.; He, B.; Li, L.; Wang, S.; Hu, T.; Yan, Q.; et al. Estimating Global GPP From the Plant Functional Type Perspective Using a Machine Learning Approach. J. Geophys. Res. Biogeosci. 2023, 128, e2022JG007100. [Google Scholar] [CrossRef]
  59. Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Proceedings of the Advances in Neural Information Processing Systems 30 (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; NeurIPS: Denver, CO, USA, 2017; Volume 30. [Google Scholar]
  60. Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.-I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef]
  61. Nohara, Y.; Matsumoto, K.; Soejima, H.; Nakashima, N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput. Methods Programs Biomed. 2022, 214, 106584. [Google Scholar] [CrossRef]
  62. Mao, H.; Deng, X.; Jiang, H.; Shi, L.; Li, H.; Tuo, L.; Shi, D.; Guo, F. Driving safety assessment for ride-hailing drivers. Accid. Anal. Prev. 2021, 149, 105574. [Google Scholar] [CrossRef]
  63. Wang, H.; Yan, S.; Ciais, P.; Wigneron, J.-P.; Liu, L.; Li, Y.; Fu, Z.; Ma, H.; Liang, Z.; Wei, F.; et al. Exploring complex water stress–gross primary production relationships: Impact of climatic drivers, main effects, and interactive effects. Glob. Change Biol. 2022, 28, 4110–4123. [Google Scholar] [CrossRef]
  64. Cui, Z.; Li, M.; Huang, Y.; Wang, Y.; Chen, H. An interpretation framework for autonomous vehicles decision-making via SHAP and RF. In Proceedings of the 2022 6th CAA International Conference on Vehicular Control and Intelligence (CVCI), Nanjing, China, 28–30 October 2022; pp. 1–7. [Google Scholar]
  65. Gao, H.; Liu, S.; Lu, W.; Smith, A.R.; Valbuena, R.; Yan, W.; Wang, Z.; Xiao, L.; Peng, X.; Li, Q.; et al. Global Analysis of the Relationship between Reconstructed Solar-Induced Chlorophyll Fluorescence (SIF) and Gross Primary Production (GPP). Remote Sens. 2021, 13, 2824. [Google Scholar] [CrossRef]
  66. Xin, Q.; Gong, P.; Yu, C.; Yu, L.; Broich, M.; Suyker, A.E.; Myneni, R.B. A Production Efficiency Model-Based Method for Satellite Estimates of Corn and Soybean Yields in the Midwestern US. Remote Sens. 2013, 5, 5926–5943. [Google Scholar] [CrossRef]
  67. Chen, X.; Mo, X.; Hu, S.; Liu, S. Contributions of climate change and human activities to ET and GPP trends over North China Plain from 2000 to 2014. J. Geogr. Sci. 2017, 27, 661–680. [Google Scholar] [CrossRef]
  68. He, L.; Chen, J.M.; Gonsamo, A.; Luo, X.; Wang, R.; Liu, Y.; Liu, R. Changes in the Shadow: The Shifting Role of Shaded Leaves in Global Carbon and Water Cycles Under Climate Change. Geophys. Res. Lett. 2018, 45, 5052–5061. [Google Scholar] [CrossRef]
  69. Chen, J.M.; Mo, G.; Pisek, J.; Liu, J.; Deng, F.; Ishizawa, M.; Chan, D. Effects of foliage clumping on the estimation of global terrestrial gross primary productivity. Glob. Biogeochem. Cycles 2012, 26, 1. [Google Scholar] [CrossRef]
  70. Zhang, Y.; Xiao, X.; Wu, X.; Zhou, S.; Zhang, G.; Qin, Y.; Dong, J. A global moderate resolution dataset of gross primary production of vegetation for 2000–2016. Sci. Data 2017, 4, 170165. [Google Scholar] [CrossRef]
  71. Yu, T.; Sun, R.; Xiao, Z.; Zhang, Q.; Liu, G.; Cui, T.; Wang, J. Estimation of Global Vegetation Productivity from Global LAnd Surface Satellite Data. Remote Sens. 2018, 10, 327. [Google Scholar] [CrossRef]
  72. Badgley, G.; Anderegg, L.D.L.; Berry, J.A.; Field, C.B. Terrestrial gross primary production: Using NIRV to scale from site to globe. Glob. Change Biol. 2019, 25, 3731–3740. [Google Scholar] [CrossRef]
  73. Li, X.; Xiao, J. Mapping Photosynthesis Solely from Solar-Induced Chlorophyll Fluorescence: A Global, Fine-Resolution Dataset of Gross Primary Production Derived from OCO-2. Remote Sens. 2019, 11, 2563. [Google Scholar] [CrossRef]
  74. Zheng, Y.; Shen, R.; Wang, Y.; Li, X.; Liu, S.; Liang, S.; Chen, J.M.; Ju, W.; Zhang, L.; Yuan, W. Improved estimate of global gross primary production for reproducing its long-term variation, 1982–2017. Earth Syst. Sci. Data 2020, 12, 2725–2746. [Google Scholar] [CrossRef]
  75. Miralles, D.G.; Holmes, T.R.H.; De Jeu, R.A.M.; Gash, J.H.; Meesters, A.G.C.A.; Dolman, A.J. Global land-surface evaporation estimated from satellite-based observations. Hydrol. Earth Syst. Sci. 2011, 15, 453–469. [Google Scholar] [CrossRef]
  76. Zeng, Z.; Piao, S.; Lin, X.; Yin, G.; Peng, S.; Ciais, P.; Myneni, R.B. Global evapotranspiration over the past three decades: Estimation based on the water balance equation combined with empirical models. Environ. Res. Lett. 2012, 7, 14026. [Google Scholar] [CrossRef]
  77. Zhang, Y.; Peña-Arancibia, J.L.; McVicar, T.R.; Chiew, F.H.S.; Vaze, J.; Liu, C.; Lu, X.; Zheng, H.; Wang, Y.; Liu, Y.Y.; et al. Multi-decadal trends in global terrestrial evapotranspiration and its components. Sci. Rep. 2016, 6, 19124. [Google Scholar] [CrossRef]
  78. Potapov, P.; Turubanova, S.; Hansen, M.C.; Tyukavina, A.; Zalles, V.; Khan, A.; Song, X.-P.; Pickens, A.; Shen, Q.; Cortez, J. Global maps of cropland extent and change show accelerated cropland expansion in the twenty-first century. Nat. Food 2022, 3, 19–28. [Google Scholar] [CrossRef]
  79. Zhang, Y.; Guanter, L.; Berry, J.; Joanna, J.; Tol, C.; Huete, A.; Gitelson, A.; Voigt, M.; Köhler, P. Estimation of vegetation photosynthetic capacity from space-based measurements of chlorophyll fluorescence for terrestrial biosphere models. Glob. Change Biol. 2014, 20, 3727–3742. [Google Scholar] [CrossRef]
  80. Zhang, K.; Kimball, J.S.; Running, S.W. A review of remote sensing based actual evapotranspiration estimation. WIREs Water 2016, 3, 834–853. [Google Scholar] [CrossRef]
  81. Frankenberg, C.; Berry, J.A. Solar Induced Chlorophyll Fluorescence: Origins, Relation to Photosynthesis and Retrieval. Compr. Remote Sens. 2018, 3, 143–162. [Google Scholar]
  82. Genty, B.; Harbinson, J.; Baker, N.R. Relative quantum efficiencies of the two photosystems of leaves in photorespiratory and non-respiratory conditions. Biochemistry 1990, 28, 1–10. [Google Scholar]
  83. Yang, H.; Yang, X.; Zhang, Y.; Heskel, M.A.; Lu, X.; Munger, J.W.; Sun, S.; Tang, J. Chlorophyll fluorescence tracks seasonal variations of photosynthesis from leaf to canopy in a temperate forest. Glob. Change Biol. 2017, 23, 2874–2886. [Google Scholar] [CrossRef] [PubMed]
  84. Gu, L.; Han, J.; Wood, J.; Chang, C.; Sun, Y. Sun-induced Chl fluorescence and its importance for biophysical modeling of photosynthesis based on light reactions. New Phytol. 2019, 223, 1179–1191. [Google Scholar] [CrossRef] [PubMed]
  85. Harrison, S.P.; Cramer, W.; Franklin, O.; Prentice, I.C.; Wang, H.; Brännström, Å.; de Boer, H.; Dieckmann, U.; Joshi, J.; Keenan, T.F.; et al. Eco-evolutionary optimality as a means to improve vegetation and land-surface models. New Phytol. 2021, 231, 2125–2141. [Google Scholar] [CrossRef]
  86. Cowan, I.R.; Farquhar, G.D. Stomatal function in relation to leaf metabolism and environment. Symp. Soc. Exp. Biol. 1977, 31, 471–505. [Google Scholar]
  87. Shan, N.; Ju, W.; Migliavacca, M.; Martini, D.; Guanter, L.; Chen, J.; Goulas, Y.; Zhang, Y. Modeling canopy conductance and transpiration from solar-induced chlorophyll fluorescence. Agric. For. Meteorol. 2019, 268, 189–201. [Google Scholar] [CrossRef]
  88. Maes, W.H.; Pagán, B.R.; Martens, B.; Gentine, P.; Guanter, L.; Steppe, K.; Verhoest, N.E.C.; Dorigo, W.; Li, X.; Xiao, J.; et al. Sun-induced fluorescence closely linked to ecosystem transpiration as evidenced by satellite data and radiative transfer models. Remote Sens. Environ. 2020, 249, 112030. [Google Scholar] [CrossRef]
  89. Damm, A.; Roethlin, S.; Fritsche, L. Towards Advanced Retrievals of Plant Transpiration Using Sun-Induced Chlorophyll Fluorescence: First Considerations. In Proceedings of the IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 22–27 July 2018; IEEE: New York, NY, USA, 2018; pp. 5983–5986. [Google Scholar]
  90. Xue, B.-L.; Guo, Q.; Otto, A.; Xiao, J.; Tao, S.; Li, L. Global patterns, trends, and drivers of water use efficiency from 2000 to 2013. Ecosphere 2015, 6, 174. [Google Scholar] [CrossRef]
  91. Tang, X.; Li, H.; Desai, A.R.; Nagy, Z.; Luo, J.; Kolb, T.E.; Olioso, A.; Xu, X.; Yao, L.; Kutsch, W.; et al. How is water-use efficiency of terrestrial ecosystems distributed and changing on Earth? Sci. Rep. 2014, 4, 7483. [Google Scholar] [CrossRef] [PubMed]
  92. Yang, Y.; Roderick, M.L.; Guo, H.; Miralles, D.G.; Zhang, L.; Fatichi, S.; Luo, X.; Zhang, Y.; McVicar, T.R.; Tu, Z.; et al. Evapotranspiration on a greening Earth. Nat. Rev. Earth Environ. 2023, 4, 626–641. [Google Scholar] [CrossRef]
  93. Badgley, G.; Fisher, J.B.; Jiménez, C.; Tu, K.P.; Vinukollu, R. On Uncertainty in Global Terrestrial Evapotranspiration Estimates from Choice of Input Forcing Datasets. J. Hydrometeorol. 2015, 16, 1449–1455. [Google Scholar] [CrossRef]
  94. Guerrieri, R.; Belmecheri, S.; Ollinger, S.V.; Asbjornsen, H.; Jennings, K.; Xiao, J.; Stocker, B.D.; Martin, M.; Hollinger, D.Y.; Bracho-Garrillo, R.; et al. Disentangling the role of photosynthesis and stomatal conductance on rising forest water-use efficiency. Proc. Natl. Acad. Sci. USA 2019, 116, 16909–16914. [Google Scholar] [CrossRef]
  95. Keenan, T.F.; Hollinger, D.Y.; Bohrer, G.; Dragoni, D.; Munger, J.W.; Schmid, H.P.; Richardson, A.D. Increase in forest water-use efficiency as atmospheric carbon dioxide concentrations rise. Nature 2013, 499, 324–327. [Google Scholar] [CrossRef]
  96. Dekker, S.C.; Groenendijk, M.; Booth, B.B.B.; Huntingford, C.; Cox, P.M. Spatial and temporal variations in plant water-use efficiency inferred from tree-ring, eddy covariance and atmospheric observations. Earth Syst. Dynam. 2016, 7, 525–533. [Google Scholar] [CrossRef]
  97. Farquhar, G.D.; Sharkey, T.D. Stomatal Conductance and Photosynthesis. Annu. Rev. Plant Physiol. 1982, 33, 317–345. [Google Scholar] [CrossRef]
  98. Sun, Z.; Wang, X.; Yamamoto, H.; Tani, H.; Zhong, G.; Yin, S.; Guo, E. Spatial pattern of GPP variations in terrestrial ecosystems and its drivers: Climatic factors, CO2 concentration and land-cover change, 1982–2015. Ecol. Inform. 2018, 46, 156–165. [Google Scholar] [CrossRef]
  99. Piao, S.; Wang, X.; Park, T.; Chen, C.; Lian, X.; He, Y.; Bjerke, J.W.; Chen, A.; Ciais, P.; Tømmervik, H.; et al. Characteristics, drivers and feedbacks of global greening. Nat. Rev. Earth Environ. 2020, 1, 14–27. [Google Scholar] [CrossRef]
  100. Bernacchi, C.J.; VanLoocke, A. Terrestrial Ecosystems in a Changing Environment: A Dominant Role for Water. Annu. Rev. Plant Physiol. 2015, 66, 599–622. [Google Scholar] [CrossRef]
  101. Gentine, P.; Green, J.K.; Guérin, M.; Humphrey, V.; Seneviratne, S.I.; Zhang, Y.; Zhou, S. Coupling between the terrestrial carbon and water cycles—A review. Environ. Res. Lett. 2019, 14, 83003. [Google Scholar] [CrossRef]
  102. Xie, S.; Mo, X.; Hu, S.; Liu, S. Contributions of climate change, elevated atmospheric CO2 and human activities to ET and GPP trends in the Three-North Region of China. Agric. For. Meteorol. 2020, 295, 108183. [Google Scholar] [CrossRef]
  103. Hou, W.; Jin, J.; Yan, T.; Liu, Y. Global Atmospheric Carbon Dioxide Concentration Simulation Grid Dataset (1992–2020). Digit. J. Glob. Change Data Repos. 2021, 9, 83. [Google Scholar] [CrossRef]
  104. Wang, S.; Ibrom, A.; Bauer-Gottwein, P.; Garcia, M. Incorporating diffuse radiation into a light use efficiency and evapotranspiration model: An 11-year study in a high latitude deciduous forest. Agric. For. Meteorol. 2018, 248, 479–493. [Google Scholar] [CrossRef]
  105. Chakraborty, T.; Lee, X.; Lawrence, D.M. Diffuse Radiation Forcing Constraints on Gross Primary Productivity and Global Terrestrial Evapotranspiration. Earth’s Future 2022, 10, e2022EF002805. [Google Scholar] [CrossRef]
  106. Wang, B.; Yue, X.; Zhou, H.; Lu, X.; Zhu, J. Enhanced Ecosystem Water-Use Efficiency Under the More Diffuse Radiation Conditions. Glob. Biogeochem. Cycles 2023, 37, e2022GB007606. [Google Scholar] [CrossRef]
Figure 1. Flowchart of data-driven models for gross primary productivity (GPP) and evapotranspiration (ET) on the global scale. The SIF products include CSIF [46], GO-SIF [47], and RT-SIF [30]. The machine learning algorithms include eXtreme Gradient Boosting (XGB) and Random Forest (RF) models.
Figure 1. Flowchart of data-driven models for gross primary productivity (GPP) and evapotranspiration (ET) on the global scale. The SIF products include CSIF [46], GO-SIF [47], and RT-SIF [30]. The machine learning algorithms include eXtreme Gradient Boosting (XGB) and Random Forest (RF) models.
Remotesensing 17 02064 g001
Figure 2. The independent validations between derived and observed GPP across various plant functional types (PFTs). ML models are trained using 80% (randomly selected) of the total datasets and validated using the remaining 20% of the datasets at global FLUXNET sites. For each PFT, derived GPP are averaged for the ensemble prediction by six data-driven models (2 ML × 3 SIF datasets). The ten PFTs include ENF, evergreen needleleaf forest; EBF, evergreen broadleaf forest; DNF, deciduous needleleaf forests; DBF, deciduous broadleaf forests; MF, mixed forest; CSH, closed shrublands; OSH, open shrublands; WSA, woody savannas; SAV, savannas; GRA, grassland; WET, permanent wetland; CRO, cropland. Statistical metrics are shown in each panel, including the Nash–Sutcliffe Efficiency (NSE), relative mean biases (RMB), and number of samples (N). The black lines represent linear regressions and the red lines denote 1:1 ratio between observations and derivations. The colors of the scatters represent the kernel density.
Figure 2. The independent validations between derived and observed GPP across various plant functional types (PFTs). ML models are trained using 80% (randomly selected) of the total datasets and validated using the remaining 20% of the datasets at global FLUXNET sites. For each PFT, derived GPP are averaged for the ensemble prediction by six data-driven models (2 ML × 3 SIF datasets). The ten PFTs include ENF, evergreen needleleaf forest; EBF, evergreen broadleaf forest; DNF, deciduous needleleaf forests; DBF, deciduous broadleaf forests; MF, mixed forest; CSH, closed shrublands; OSH, open shrublands; WSA, woody savannas; SAV, savannas; GRA, grassland; WET, permanent wetland; CRO, cropland. Statistical metrics are shown in each panel, including the Nash–Sutcliffe Efficiency (NSE), relative mean biases (RMB), and number of samples (N). The black lines represent linear regressions and the red lines denote 1:1 ratio between observations and derivations. The colors of the scatters represent the kernel density.
Remotesensing 17 02064 g002
Figure 3. The same as Figure 2 but for ET.
Figure 3. The same as Figure 2 but for ET.
Remotesensing 17 02064 g003
Figure 4. Feature importance of different factors to (aj) GPP and (kt) ET across various PFTs for the RF model. The values of feature impacts are calculated as the mean absolutely SHAP (SHapley Additive exPlanation) value of each factor. The number of sites for each PFT is shown in the bracket. The errorbars represent one standard deviation of three SIF products.
Figure 4. Feature importance of different factors to (aj) GPP and (kt) ET across various PFTs for the RF model. The values of feature impacts are calculated as the mean absolutely SHAP (SHapley Additive exPlanation) value of each factor. The number of sites for each PFT is shown in the bracket. The errorbars represent one standard deviation of three SIF products.
Remotesensing 17 02064 g004
Figure 5. Long-term mean of GPP and ET derived with ECSIF and their comparisons with the ensemble products during 2001–2020. The ECSIF (a) GPP and (d) ET are calculated as the average results of six data-driven models (2 ML × 3 SIF datasets). The (b,e) ensemble products are the mean values of MODIS [42,43], PML [5], and X-BASE [44] products. The differences between these products are shown in (c,f) with r, RMB indicated on the panels.
Figure 5. Long-term mean of GPP and ET derived with ECSIF and their comparisons with the ensemble products during 2001–2020. The ECSIF (a) GPP and (d) ET are calculated as the average results of six data-driven models (2 ML × 3 SIF datasets). The (b,e) ensemble products are the mean values of MODIS [42,43], PML [5], and X-BASE [44] products. The differences between these products are shown in (c,f) with r, RMB indicated on the panels.
Remotesensing 17 02064 g005
Figure 6. Long-term trend of (a) GPP and (d) ET derived with ECSIF and their comparisons with the (b,e) ensemble products during 2001–2020. The differences between these datasets are shown in (c,f).
Figure 6. Long-term trend of (a) GPP and (d) ET derived with ECSIF and their comparisons with the (b,e) ensemble products during 2001–2020. The differences between these datasets are shown in (c,f).
Remotesensing 17 02064 g006
Figure 7. Long-term (a) mean and (d) trend in water use efficiency (WUE) derived with ECSIF and their comparisons with the (b,e) ensemble products during 2001–2020. Differences are shown in (c,f).
Figure 7. Long-term (a) mean and (d) trend in water use efficiency (WUE) derived with ECSIF and their comparisons with the (b,e) ensemble products during 2001–2020. Differences are shown in (c,f).
Remotesensing 17 02064 g007
Table 1. The relationships of GPP, ET, and SIF across PFTs at FLUXNET sites.
Table 1. The relationships of GPP, ET, and SIF across PFTs at FLUXNET sites.
PFTsCRO
N = 17
DBF
N = 23
EBF
N = 13
ENF
N = 49
GRA
N = 37
MF
N = 9
SH
N = 16
SAV
N = 9
WET
N = 18
WSA
N = 6
CSIFGPPr0.620.840.590.810.810.910.810.680.750.82
Slope119.1224.9415.3023.7422.9121.5820.2516.5617.9526.84
Slope218.6524.2222.9927.1122.8421.5520.9020.3018.7425.43
ETr0.580.750.710.690.710.750.700.590.510.69
Slope10.210.260.340.250.270.240.340.290.250.51
Slope20.290.290.380.350.340.270.460.440.360.58
GOSIFGPPr0.610.770.210.690.780.900.610.690.710.77
Slope120.7824.495.3822.1926.6523.2118.3820.5918.3426.96
Slope222.5127.5625.7530.4828.1525.9323.4524.3921.0027.19
ETr0.590.740.410.610.710.770.520.650.500.67
Slope10.240.280.190.250.330.270.300.380.270.52
Slope20.350.340.440.390.430.320.510.540.400.63
RTSIFGPPr0.610.770.190.680.770.900.610.710.700.60
Slope110.9513.793.2513.0014.9313.9711.3413.1510.5713.68
Slope211.5315.0316.9317.6715.0915.0813.2814.9011.5715.89
ETr0.560.730.400.600.670.750.510.640.490.48
Slope10.120.150.120.140.180.160.180.240.150.25
Slope20.180.180.290.230.230.190.300.330.220.37
Notes: N represents the number of sites; the slope1 (slope2) indicates the linear regressions coefficient of GPP and SIF or ET and SIF with (without) intercepts, respectively.
Table 2. The independent validation of derived GPP and ET across PFTs.
Table 2. The independent validation of derived GPP and ET across PFTs.
PFTsCRO
N = 959
DBF
N = 1214
EBF
N = 622
ENF
N = 2995
GRA
N = 1795
MF
N = 637
SH
N = 701
SAV
N = 427
WET
N = 613
WSA
N = 411
CSIFXGBGPPNSE0.520.900.860.710.860.910.920.810.810.85
RMB1.84%0.05%0.21%1.64%0.24%0.09%−0.08%3.49%−1.34%−3.02%
ETNSE0.750.860.860.800.810.830.720.690.860.88
RMB0.65%−1.38%0.25%−0.71%−1.49%0.99%0.12%4.40%−2.25%−1.82%
RFGPPNSE0.570.900.850.800.870.910.910.800.810.87
RMB2.47%0.01%0.32%1.00%0.34%0.34%0.43%5.06%−1.67%−2.23%
ETNSE0.740.870.890.750.810.820.790.780.880.86
RMB1.17%−2.20%0.36%−1.11%−1.45%1.13%0.26%1.67%−2.73%−2.71%
GSIFXGBGPPNSE0.510.900.860.700.850.910.900.820.800.85
RMB1.87%−0.73%0.07%1.78%−1.26%−0.13%1.24%4.04%−3.68%−1.65%
ETNSE0.740.850.860.750.810.850.660.690.850.87
RMB0.74%−1.01%0.43%−0.56%−1.25%0.86%0.79%4.44%−2.52%−1.50%
RFGPPNSE0.550.900.860.800.870.920.890.780.810.87
RMB2.25%−0.06%0.02%1.06%0.10%0.05%0.87%6.61%−2.76%−1.28%
ETNSE0.740.850.880.750.810.820.780.770.880.85
RMB0.79%−2.47%0.20%−1.02%−1.46%0.83%0.47%3.29%−2.99%−2.15%
RSIFXGBGPPNSE0.520.900.860.690.850.910.920.820.810.84
RMB1.54%−0.36%−0.34%1.62%−1.29%−0.91%0.29%2.96%−2.01%−2.87%
ETNSE0.740.850.860.760.10.840.690.700.860.87
RMB0.46%−1.83%0.39%−0.58%−0.94%0.80%−0.14%3.05%−2.8%−1.52%
RFGPPNSE0.560.890.860.800.860.920.900.800.820.86
RMB1.94%−0.19%0.26%1.50%−0.29%−0.16%0.89%5.14%−2.58%−2.24%
ETNSE0.740.850.880.750.800.820.780.780.880.85
RMB0.82%−2.58%0.36%−0.76%−1.17%0.89%0.52%1.98%−2.91%−2.93%
Table 3. Summary of GPP and ET from the ECSIF datasets.
Table 3. Summary of GPP and ET from the ECSIF datasets.
SIFsAlgorithmsVariablesGlobalrRMBTrends
CSIFXGBGPP124.8 ± 1.6 Pg C yr−10.928.02%0.25 Pg C yr−2
ET582.4 ± 5.0 mm yr−10.8815.88%0.72 mm yr−2
RFGPP128.1 ± 1.4 Pg C yr−10.9411.13%0.22 Pg C yr−2
ET460.4 ± 3.6 mm yr−10.95−9.04%0.52 mm yr−2
GSIFXGBGPP126.1 ± 1.7 Pg C yr−10.899.20%0.26 Pg C yr−2
ET583.8 ± 6.2 mm yr−10.9114.99%0.87 mm yr−2
RFGPP130.8 ± 1.5 Pg C yr−10.9412.48%0.24 Pg C yr−2
ET468.2 ± 4.0 mm yr−10.95−8.28%0.58 mm yr−2
RSIFXGBGPP125.3 ± 1.2 Pg C yr−10.898.66%0.16 Pg C yr−2
ET574.5 ± 5.4 mm yr−10.8912.06%0.71 mm yr−2
RFGPP130.1 ± 1.2 Pg C yr−10.9312.49%0.19 Pg C yr−2
ET463.4 ± 3.3 mm yr−10.94−8.89%0.45 mm yr−2
Table 4. Summary of global GPP and ET products.
Table 4. Summary of global GPP and ET products.
VariablesMethodsGlobalTemporal CoverSpatial ResolutionTemporal ResolutionReferences
GPP
and ET
BESS122 ± 25 Pg C yr−12000–20151 km8 dayJiang and Ryu [16]
501 mm yr−1
BEPS124 ± 4 Pg C yr−11982–20160.5° × 0.6°hourlyHe, et al. [68]
485 ± 8 mm yr−1
PML-V2145.8 Pg C yr−12002–2017500 m8 dayZhang et al. [5]
560 mm yr−1
X-BASE124.7 ± 2.1 Pg C yr−12001–20200.05° × 0.05°hourlyNelson et al. [44]
574 ± 7 mm yr−1
SIF-ML128 ± 2.3 Pg C yr−12001–20200.05° × 0.05°8 dayThis study
522 ± 58.2 mm yr−1
GPPEC-LUE111 ± 21 Pg C yr−12000–20030.5° × 0.6°8 dayYuan, et al. [8]
BEPS132 ± 22 Pg C yr−120031° × 1°hourlyChen, et al. [69]
LUE122–130 Pg C yr−12000–2016500 m8 dayZhang et al. [70]
LUE108–119 Pg C yr−12004–20121 km8 dayYu, et al. [71]
ML&BESS131–163 Pg C yr−12005–20150.5° × 0.5°monthlyBadgley, et al. [72]
GOSIF136 ± 9 Pg C yr−12000–20230.05° × 0.05°8 dayLi and Xiao [73]
EC-LUE106 ± 3 Pg C yr−11982–20170.05° × 0.05°5, 6, 8 dayZheng, et al. [74]
ML117 ± 1.5 Pg C yr−11999–20190.05° × 0.05°monthlyGuo et al. [58]
ETRS-PM417 ± 38 mm yr−12000–20030.5° × 0.6°8 dayYuan, et al. [8]
ML500 ± 23 mm yr−11982–20080.5° × 0.5°monthlyJung, et al. [20]
PT522 mm yr−11980–20160.25° × 0.25°dailyMiralles, et al. [75]
WB558–650 mm yr−11982–20090.5° × 0.5°annualZeng, et al. [76]
PML-V1538 ± 57 mm yr−11998–20120.5° × 0.5°monthlyZhang et al. [77]
Notes: the original units of km3 were converted to mm year−1 through dividing by the total land area excluding Antarctica and Greenland.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zheng, J.; Zhou, H.; Yue, X.; Liu, X.; Xia, Z.; Wang, J.; Xiao, J.; Li, X.; Zhang, F. Increasing Ecosystem Fluxes Observed from Eddy Covariance and Solar-Induced Fluorescence Data. Remote Sens. 2025, 17, 2064. https://doi.org/10.3390/rs17122064

AMA Style

Zheng J, Zhou H, Yue X, Liu X, Xia Z, Wang J, Xiao J, Li X, Zhang F. Increasing Ecosystem Fluxes Observed from Eddy Covariance and Solar-Induced Fluorescence Data. Remote Sensing. 2025; 17(12):2064. https://doi.org/10.3390/rs17122064

Chicago/Turabian Style

Zheng, Jiao, Hao Zhou, Xu Yue, Xichuan Liu, Zhuge Xia, Jun Wang, Jingfeng Xiao, Xing Li, and Fangmin Zhang. 2025. "Increasing Ecosystem Fluxes Observed from Eddy Covariance and Solar-Induced Fluorescence Data" Remote Sensing 17, no. 12: 2064. https://doi.org/10.3390/rs17122064

APA Style

Zheng, J., Zhou, H., Yue, X., Liu, X., Xia, Z., Wang, J., Xiao, J., Li, X., & Zhang, F. (2025). Increasing Ecosystem Fluxes Observed from Eddy Covariance and Solar-Induced Fluorescence Data. Remote Sensing, 17(12), 2064. https://doi.org/10.3390/rs17122064

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop