Effect of Assimilating SMAP Soil Moisture on CO 2 and CH 4 Fluxes through Direct Insertion in a Land Surface Model

: Soil moisture impacts the biosphere–atmosphere exchange of CO 2 and CH 4 and plays an important role in the terrestrial carbon cycle. A better representation of soil moisture would improve coupled carbon–water dynamics in terrestrial ecosystem models and could potentially improve model estimates of large-scale carbon ﬂuxes and climate feedbacks. Here, we investigate using soil moisture observations from the Soil Moisture Active Passive (SMAP) satellite mission to inform simulated carbon ﬂuxes in the global terrestrial ecosystem model LPJ-wsl. Results suggest that the direct insertion of SMAP reduces the bias in simulated soil moisture at in situ measurement sites by 40%, with a greater improvement at temperate sites. A wavelet analysis between the model and measurements from 26 FLUXNET sites suggests that the assimilated run modestly reduces the bias of simulated carbon ﬂuxes for boreal and subtropical sites at 1–2-month time scales. At regional scales, SMAP soil moisture can improve the estimated responses of CO 2 and CH 4 ﬂuxes to extreme events such as the 2018 European drought and the 2019 rainfall event in the Sudd (Southern Sudan) wetlands. The simulated improvements to land–surface carbon ﬂuxes using the direct insertion of SMAP are shown across a variety of timescales, which suggests the potential of SMAP soil moisture in improving the model representation of carbon–water coupling.


Introduction
Soil moisture plays a critical role in controlling interactions between the soil, vegetation, and atmosphere, and is one of the major drivers that affects the carbon fluxes of terrestrial ecosystems [1]. Because of the coupling between water and carbon fluxes at the leaf level, soil moisture is a major constraint for the assimilation of carbon by vegetation through photosynthesis [2]. Soil moisture is the dominant driver of dryness stress on ecosystem production across more than 70% of vegetated land areas [3]. The spatial and temporal variations in soil moisture strongly affect the terrestrial carbon uptake [4]. Meanwhile, soil moisture serves as a proxy for water table depth, a variable that regulates the soil redox state for microbes to favor methanogenesis, which is used to determine the time and location of inundated areas at a regional scale that can significantly affect the estimate of methane (CH 4 ) flux [5]. Therefore, it is essential to understand the spatio-temporal distribution of soil moisture and its role in influencing terrestrial carbon fluxes in a warming climate.
Understanding the influence of soil moisture on terrestrial carbon fluxes is challenging because of the large uncertainty in the spatio-temporal distributions of soil moisture, which The terrestrial ecosystem model LPJ-wsl is a process-based land surface model developed for simulating the fully coupled water and carbon budget of the terrestrial biosphere based on the development of the LPJ dynamic global vegetation model [33]. LPJ-wsl includes land surface processes, such as hydrology [39], the soil biogeochemical cycle, and vegetation dynamics that are represented by plant functional types (PFTs). The distribution of PFTs is simulated based on a set of bioclimatic limits and physiological parameters that govern the establishment of and the competition for resources. Using the modified-LPJ approach of [40] that integrates freeze/thaw cycles in the representation of soil hydrology, the soil column has a depth of 1000 cm, split into the top 200 cm, referred to as soil, and the bottom 800 cm, as the buffer for soil temperature calculation. The upper 200 cm of soil is represented by eight vertically discretized layers with thicknesses of 10,10,10,20,20,30,50, and 50 cm. The lower 800 cm buffer is split into four layers with thicknesses of 82, 135, 221, and 362 cm. The original two-bucket model of LPJ, with an upper layer of 0-50 cm and a lower layer of 50-200 cm, provides soil moisture that corresponds to the top four soil layers (10, 10, 10, and 20 cm) and the bottom four soil layers (20,30,50, and 50 cm), respectively, in the updated multiple-soil-layer scheme of LPJ-wsl. The hydrology scheme describes the diffusion of water according to the physical soil properties, which are prescribed using a look-up table of 13 soil types following the distribution of a global texture map from the Harmonized World Soil Database [41], with water-holding capacity estimated using a set of empirical equations [6]. The hydrology module also includes a dynamic snow module and a freeze/thaw module, which allow the model to simulate the permafrost processes [34]. The soil moisture at different vertical levels is computed at a daily time step as influenced by rain infiltration, percolation, evaporation, transpiration, and surface and sub-surface drainage.
Within the upper 0-50 cm of the soil, the daily change in water content ∆w i (mm) in each layer i is calculated as: where P is precipitation, M is meltwater from the snow layer, E s is bare soil evaporation, T up is transpiration from the upper soil layer, i.e., transpiration weighted by the root fraction in the upper 0-50 cm soil layer, R perc is percolation to the lower 50-200 cm soil layer, ∆z i is the depth of soil layer i, and ∆z up is the depth of the upper soil layer (50 cm). The water allocation is proportional to the depth of the soil layer. When the water content of any soil layer in the upper 0-50 cm of the soil exceeds the actual water holding capacity (i.e., water holding capacity after deducing ice content), the surplus water is added to the surface runoff (R surf ). The percolation to the lower (50-200 cm) portion of the soil is allowed when there are water inputs from precipitation and meltwater. The percolation (R perc ) is calculated as a function of soil texture (represented as empirical parameter k 1 and k 2 ) and layer thickness, and declines exponentially with soil moisture: where k is soil-type-dependent hydraulic conductivity and w up is the water content of the upper soil layer. The change in water content ∆w i in each of the bottom 4 layers is given by: where T low is transpiration from the 50-200 cm layer and ∆z low is the depth of the lower soil layers (150 cm). If the water content of any lower soil layers exceeds the actual water holding capacity, the water is added to subsurface runoff. The hydrologic processes affect the soil heat capacity and its thermal conductivity, which are related to the volumetric fractions of the soil physical components, such as the water and ice fractions, as well as whether the soil is mineral or peat. The representation of Hortonian runoff in the LPJ-wsl model considers the effects of a reduction in water infiltration capacity in frozen soils [34]. In general, the model includes the effect of soil moisture on the soil thermal regime and the feedback mechanisms of the soil thermal regime on the water cycle, which means that altering the status of soil moisture can affect the carbon cycle processes directly (through water availability constraints) and indirectly (through the modulation of soil temperature constraints by soil moisture).

LPJ-Wsl Carbon Processes
The net ecosystem exchange (defined positive for a net CO 2 source) in LPJ-wsl is calculated as the balance between carbon uptake from photosynthesis and the losses from ecosystem respiration and fire, excluding carbon fluxes from land use change.
The CH 4 flux calculation is based on a prognostic wetland model as presented in [42], which is a function of the two scaling factors for boreal (F B ) and tropical (F T ) wetlands, soil temperature in the upper 0-50 cm soil layer, the soil moisture-dependent fraction of heterotrophic respiration, and inundation extent as: where A(t,x) and Rh(t,x) represent wetland extent (Unit: fraction) and ecosystem respiration (Unit: g C m −2 mon −1 ) of grid cell x at time t, respectively. The Q10 factor σ(x) describes the temperature dependence of the ratio of C respired as CH 4 and is calculated as exp(T(x) − T max )/10, where T(x) is the soil temperature (Unit: K), T max is 303.35 K, and F T (0.084) and F B (0.016) are the tropical and boreal scaling factors, respectively, derived from fitting to match to the estimates from regional inventories for the Hudson Bay lowlands [43] and the Amazon Basin lowland [44]. Inundation is simulated by a TOPMODEL hydrological framework [34], which determines the area fraction with soil water saturation from the knowledge of the mean watershed water table depth and a probability density function of combined topographic and soil properties. LPJ-wsl has been applied in carbon cycle studies [45] and has been evaluated using global inventory data sets and satellite observations. The simulated dynamics of wetland area and CH 4 emissions have been evaluated against large-scale observations in previous studies [34,[46][47][48].

SMAP Soil Moisture
Since 31 March 2015, SMAP has been continuously observing L-band surface brightness temperature emission from the Earth with near-global revisit coverage every 2-3 days, repeating the same ground track every eight days, and with a spatial resolution of~40 km 2 .
Here, we use Version 6 of the SMAP Level 3 daily soil moisture retrievals on the 36 km EASE grid [49]. The SMAP soil moisture retrieval algorithm derives moisture conditions in the top 5 cm of the soil from the surface brightness temperature observations after correcting for the effects of vegetations and soil surface roughness and using the Mironov dielectric soil mixing model [50,51]. Soil moisture retrievals for times and locations with vegetation canopy water content in excess of 5 kg m −2 are considered unreliable. Soil mois- ture retrievals with quality flags indicating dense vegetation, snow, or freezing temperature were not used here. We used bilinear interpolation to re-project the daily SMAP data from the 36 km EASE grid to the 0.5-degree global grid of our LPJ-wsl simulation.
Traditionally, it is suggested that the microwave satellite soil moisture has varying penetrating depth depending on the soil moisture conditions, with less effective depth on frozen and saturated soils [52,53]. However, some studies suggest that the skin emission depth of L-band microwaves increase with dryness to 1 m in some cases, with 5 cm potentially being a lower bound under wet conditions [54]. Furthermore, since soil moisture is a storage variable with land surface memory [55], soil moisture variations at different soil layers have similar information content. Namely, in wetter ecosystems, SMAP soil moisture in the top 5 cm has similar information content to soil layers much lower (often to 50 cm) [56]. Consequently, SMAP soil moisture suggests the potentials to represent soil storage layer thicknesses of deeper than 5 cm [57]. Therefore, we expect SMAP's utility to extend beyond 5 cm based on these recent findings. Hereafter, we remain with the null hypothesis in the analysis that it represents 5 cm depths, but test alternative hypotheses of SMAP's representation of deeper soil layers.

Simulation Setups
There are several challenges in directly comparing satellite soil moisture observations and model estimates of soil moisture [7,58,59]. These are largely due to differences in the parameterizations of soil texture information, the difficulty in matching representative soil depths and spatio-temporal resolutions, and the large within-pixel variability in soil moisture. Here, LPJ-wsl soil moisture estimates from the 0-10 cm top layer are the closest model equivalent to the typical sensing depth of 5 cm associated with SMAP retrievals. Thus, we assume that the water content is evenly distributed within the 0-10 cm soil profile in LPJ-wsl and represents the 0-5 cm soil layer. In addition, SMAP measures soil moisture at the specific local overpass time [16], whereas LPJ-wsl simulates daily average soil moisture.
Given these systematic observing and modeling differences between SMAP and LPJwsl, we first compare the LPJ-wsl and SMAP soil moisture information via the development of a normalized Soil Wetness Index (SWI) [60]: where x i,t represents soil moisture at grid cell i at time t, and minx i and maxx i represent the mean annual minimum and maximum of soil moisture at grid cell i, respectively. We determined minx i and maxx i for SMAP using April 2015-December 2019 observations and for LPJ-wsl using results from the 'baseline' simulation for the same period when SMAP observations are available. In the LPJ-wsl 'assimilation' run, the simulated LPJ volumetric soil moisture at corresponding depth is replaced with soil moisture x + i,t calculated from SMAP SWI at the end of a daily step whenever a SMAP observation is available, such that the SWI for the updated LPJ-wsl soil moisture matches the SWI of the SMAP observation. Note that the SWI-based rescaling does not preserve the mean soil moisture, unlike the cumulative distribution function matching or standard normal deviate rescaling that is widely used in soil moisture data assimilation [26]. That is, soil moisture assimilation using SWI-based rescaling may result in a non-negligible difference in the distribution of soil moisture between the baseline and assimilation runs. As discussed in Section 2.2, SMAP L3 surface soil moisture may also contain information about soil moisture in deeper layers [61]. We designed experiments to test the sensitivity of assimilating SMAP soil moisture at different depths. Specifically, we conducted two sets of model simulations: (1) a 'baseline' LPJ-wsl simulation without SMAP assimilation and (2) three "assimilation" schemes that assimilate SMAP soil moisture into LPJ-wsl for different soil depths in the model: 0-10 cm (topsoil layer, denoted LPJ_10), 0-50 cm (top four soil layers, denoted LPJ_50), and 0-200 cm (all 8 soil layers, denoted LPJ_200). The soil moisture values for these soil depths (0-10 cm, 0-50 cm, and 0-200 cm, respectively) were replaced with volumetric soil moisture derived from SMAP SWI using Equation (5). The performance of the different assimilation schemes was tested against in situ measurements to find the best representative depth.
We used surface meteorological forcing data from the Climate Research Unit (CRU v4.05) [62] to run the LPJ-wsl model in this study. CRU forcing data is a global-scale, in situbased, monthly spatial-interpolated climate dataset for 1901-2019. A weather generator in LPJ is used to generate daily precipitation events based on a wet-day frequency input and a set of linear interpolations with monthly temperature and cloud cover as inputs to calculate daily temperature, photosynthetically active radiation flux, and potential evapotranspiration. The LPJ-wsl simulations include a spin-up phase, which equilibrates the prognostic variables, including vegetation state, soil carbon pools, and soil moisture content. The spin-up run is a 1000-year simulation that recycles the CRU forcing data from 1901-1930 for the spin-up period until the onset of transient simulation in 1901. During the spin-up run, the atmospheric CO 2 concentration is held constant at the pre-industrial level (286.00 ppm). For the transient simulation, the simulation is forced with varying annual atmospheric CO 2 concentrations using data from observations. For the baseline and assimilation runs, the model setup is the same until 31 March 2015 when SMAP is available.
We first compare our baseline to the assimilated version to assess the effect of different assimilating soil depth on simulated soil moisture values. Because the FLUXNET observations are not available for the study period of 2015-2019, we evaluated the simulated average seasonal cycle from the SMAP assimilation period against the observations. The different schemes were compared at regional scales to discuss the effect of the soil moisture assimilation on large-scale carbon fluxes.

Benchmark Datasets
Site-level observations of soil moisture and carbon (i.e., CO 2 and CH 4 ) fluxes from FLUXNET2015 and FLUXNET-CH4 (hereafter, FLUXNET) were used to evaluate the temporal patterns of simulated results. Given that there are systematic differences between the remotely sensed and ground data due to the spatial representativeness, different measurement depths [63], and uncertainty in the LPJ parameterizations, we evaluated soil moisture on the basis of SWI. We used the same minimum and maximum soil moisture values derived from the baseline run for the SWI calculation of the assimilated runs. The FLUXNET soil moisture used in this study are measurements for the top soil layer (0-15 cm) listed in Tables 1 and A1 [35,36,64]. Regional-level evaluations were also carried out by comparing with one independent satellite soil moisture product and two gridded carbon flux products that were developed based on upscaling eddy covariance measurements. We use the European Space Agency's Climate Change Initiative for Soil Moisture (ESA CCI SM) product, which harmonizes and merges soil moisture retrievals from multiple satellites into a single product at a 0.25 • regular grid [65]. The version applied in this study was the v4.07 combined product from 1 January 2015-31 December 2019 at daily time step. The CO 2 flux estimates were based on a net ecosystem exchange estimate from upscaled FLUXNET observations [66,67]. Figure 1 provides a map of the sites for the evaluation of soil moisture and carbon fluxes. The site selection for soil moisture evaluation followed the criteria to represent land cover spanning from evergreen needleleaf forest (ENF) to grasslands, including croplands; all sites were situated within North America. Therefore, we used 7 sites from FLUXNET for comparison with the LPJ-wsl simulated soil moisture and CO 2 fluxes. Because the upland sites do not have measurements of CH 4 fluxes, we used 26 wetland sites from FLUXNET-CH4 (Table A1; [36,64]) to assess the influence of assimilation on CO 2 and CH 4 fluxes at different time scales for different biome sub-types across the globe.

Statistical Analysis and Wavelets
A variety of metrics are applied in this study for evaluating the model performance, which include: -Root mean standard difference (RMSD) between two datasets; -Three components of RMSD: squared bias, difference in the magnitude of fluctuation between simulation and measurements (∆ ) 2 , and lack of correlation weighted by the standard deviations (LCS), which are used to evaluate the relative skill in reproducing the observations (see below for details); - The Pearson product-moment correlation coefficient r to assess the relative agreement of the temporal structures between two datasets. The spatial correlation is also calculated using the same formula but on a single vector of locations at specific time steps; -Taylor diagrams [68] are used to visually evaluate the relative skill among model-data fits by illustrating the linear correlation coefficient, RMSD, and standard deviation in a polar coordinate plot.
The square of RMSD can be decomposed into three informative components related to bias, the difference in standard deviation between the model and the observations, and the lack of correlation. Specifically, the RMSD can be written as follows [51,60]:

Statistical Analysis and Wavelets
A variety of metrics are applied in this study for evaluating the model performance, which include: -Root mean standard difference (RMSD) between two datasets; -Three components of RMSD: squared bias, difference in the magnitude of fluctuation between simulation and measurements (∆σ) 2 , and lack of correlation weighted by the standard deviations (LCS), which are used to evaluate the relative skill in reproducing the observations (see below for details); - The Pearson product-moment correlation coefficient r to assess the relative agreement of the temporal structures between two datasets. The spatial correlation is also calculated using the same formula but on a single vector of locations at specific time steps; -Taylor diagrams [68] are used to visually evaluate the relative skill among model-data fits by illustrating the linear correlation coefficient, RMSD, and standard deviation in a polar coordinate plot.
The square of RMSD can be decomposed into three informative components related to bias, the difference in standard deviation between the model and the observations, and the lack of correlation. Specifically, the RMSD can be written as follows [51,60]: where bias is the mean difference between the time series of the model (x i = 1..n ) and the observations (y i = 1..n ) for a given location i: and the square differences of the standard deviations of the model and observations (VAR) are expressed as: A large value of VAR indicates that the model fails to simulate the variability. Finally, LCS reflects the lack of correlation (r) between the observations and the model, weighted by the product of the standard deviations: To assess the model behavior of simulating soil moisture and carbon fluxes across space and time, we apply wavelet analysis, a model-data evaluation approach in the frequency domain that has been applied to quantify the variability of CO 2 and CH 4 fluxes across timescales [69][70][71]. To determine whether/how assimilating SMAP observations every 3-5 days improves the modeled variability at different time scales, we applied wavelet analysis to decompose the (normalized) data-model differences, which is useful to provide insights about data-model disagreements at different temporal scales. Since the modeled carbon fluxes are not directly comparable to the eddy covariance measurements owing to the spatial mismatch between modeled gridded fluxes and site-level observations, we evaluated the simulated CO 2 and CH 4 fluxes by calculating the normalized residual error (NRE) in between LPJ and the observed soil moisture as: where the subscripts denote site (s), model (m), and time (t), and the overbar denotes the average over the full length of the time series. For each LPJ-wsl simulation experiment, we calculate the spectral characteristics of NRE and compare the spectra to the appropriate null model (i.e., baseline), which is defined as a spectra account for the errors in the flux observations (see Appendix A.1 for more details about the calculation).

Results
In Sections 3.1 and 3.2, we first show evidence that assimilating SMAP soil moisture into LPJ improves LPJ's soil moisture representation. This ultimately leads to an improved LPJ representation of carbon fluxes, as shown in Section 3.3, especially in locations where LPJ's baseline soil moisture representation is not reflective of that observed.

An Analysis of Model Depth Selection
The comparison of site-level soil moisture for different assimilation schemes suggests that the majority of the assimilation experiments showed improved simulations over the baseline in comparison to the FLUXNET measurements, with the degree of improvement depending on the underlying agreement of SMAP with FLUXNET data. The model generally compares closest to FLUXENT when assimilating the SMAP soil moisture for the soil columns 0-50 cm and 0-200 cm, as indicated in Figure 2. The performance of the modeled results also largely depended on the agreement between SMAP and the observation at the evaluation sites. For the boreal site (US-Prr), which showed higher discrepancies between SMAP and FLUXNET, the assimilation runs tended to have little/no improvement against the in situ observations. The spread in performance within biome types was broad: two of the seven sites with the best performance were open shrubland and savanna, where the relatively low vegetation density and low soil organic carbon tended to have less influence on the accuracy of the SMAP soil moisture. Figure 3 more comprehensively shows the discrepancies between the assimilation schemes' soil moisture values (represented as SWI) and those of the in situ FLUXNET soil moisture values using the decomposed RMSD2 metrics in SWI. It ultimately confirms that several components of LPJ_200 soil moisture compare closely to those of the in situ measurements. In each LPJ-wsl simulation, the bias and LCS with FLUXNET site were considerably reduced after assimilating the SWI from the SMAP soil moisture, while the Var (∆σ) 2 was only slightly affected by the assimilation. This was mainly due to the assimilation of normalized observations, which preserves the variability but alters the seasonality of soil moisture. This indicates that, after the assimilation of SMAP soil moisture, the biases with site-level observations were removed, and the agreement was considerably improved. However, the ability to improve the performance in simulating the variability at the level of individual sites was limited. Since assimilating SMAP into the whole soil column (0-200 cm) in LPJ-wsl had a better performance than the other schemes, we use the LPJ_200 scheme for the remainder of this paper to interpret the effect of soil moisture assimilation on carbon fluxes.

Site-Level Comparison of Soil Moisture
The comparison of the simulated daily soil moisture with the FLUXNET in situ and SMAP satellite measurements suggested that, by construction, the assimilated results had a better agreement with the SMAP observations. The agreement of the density distribution of SMAP assimilated soil moisture vs. that of FLUXNET strongly depended on the accuracy of the SMAP retrievals in detecting surface soil moisture (Figure 4). At the US-Prr site, the assimilation estimates agreed less well with the in situ measurements than they did with the baseline estimates, owing to the mismatch between SMAP and the in situ data. This was mainly attributed to the effect of the soil freeze/thaw processes on the SMAP retrievals at high latitudes, as suggested by previous studies [72][73][74], which causes relatively poor performance in the validation. Other than US-Prr, for five of the seven sites (CA-TP1, US-MMS, US-Oho, US-Ton, US-Whs), the assimilated results had better agreement than the LPJ-wsl baseline simulation with FLUXNET in terms of the median value and distribution. For open shrubland and savanna sites (US-Whs and US-Ton), all simulations were comparable with each other, given the reduced influence of vegetation canopies on the SMAP retrievals and the relatively good performance of the LPJ-wsl baseline in dry and temperate regions. Taylor plots of monthly mean soil moisture in the 0-10 cm layer by assimilation schemes for all 7 sites. The cosine of the angle from the x axis is the correlation coefficient between simulated and observed soil moisture, which measures how well the simulated soil moisture captures the phasing and timing of the observed soil moisture from in situ sites. The black dots on the x axis represent observations. The radial distance from the origin is the standard deviation. The dotted and dashed grey lines represent correlation and RMSD normalized by standard deviation, respectively. The star symbol indicates a perfect model that has a zero RMSD and a correlation of 1.0.
simulation, the bias and LCS with FLUXNET site were considerably reduced after assimilating the SWI from the SMAP soil moisture, while the Var (∆ ) 2 was only slightly affected by the assimilation. This was mainly due to the assimilation of normalized observations, which preserves the variability but alters the seasonality of soil moisture. This indicates that, after the assimilation of SMAP soil moisture, the biases with site-level observations were removed, and the agreement was considerably improved. However, the ability to improve the performance in simulating the variability at the level of individual sites was limited. Since assimilating SMAP into the whole soil column (0-200 cm) in LPJ-wsl had a better performance than the other schemes, we use the LPJ_200 scheme for the remainder of this paper to interpret the effect of soil moisture assimilation on carbon fluxes.

Figure 3.
Boxplots showing the decomposed MSD for simulated monthly SWI from different assimilation schemes against the FLUXNET sites. The dots represent the outliers from the boreal site US-Prr, where SMAP has large discrepancies with the FLUXNET measurements.

Site-Level Comparison of Soil Moisture
The comparison of the simulated daily soil moisture with the FLUXNET in situ and SMAP satellite measurements suggested that, by construction, the assimilated results had a better agreement with the SMAP observations. The agreement of the density distribution of SMAP assimilated soil moisture vs. that of FLUXNET strongly depended on the accuracy of the SMAP retrievals in detecting surface soil moisture (Figure 4). At the US-Prr site, the assimilation estimates agreed less well with the in situ measurements than they did with the baseline estimates, owing to the mismatch between SMAP and the in situ data. This was mainly attributed to the effect of the soil freeze/thaw processes on the SMAP retrievals at high latitudes, as suggested by previous studies [72][73][74], which causes relatively poor performance in the validation. Other   The evaluation of the mean seasonal cycle of soil wetness in the simulations (versus that of FLUXNET measurements) suggested diverse patterns among sites and better agreement between the assimilation run and FLUXNET than was seen for the baseline ( Figure 5). The improvement was more significant for the biomes that were less influenced by the freeze/thaw cycle and dense vegetation canopies. For the high-latitude site (US-Prr), both simulations captured well the onset The evaluation of the mean seasonal cycle of soil wetness in the simulations (versus that of FLUXNET measurements) suggested diverse patterns among sites and better agreement between the assimilation run and FLUXNET than was seen for the baseline ( Figure 5). The improvement was more significant for the biomes that were less influenced by the freeze/thaw cycle and dense vegetation canopies. For the high-latitude site (US-Prr), both simulations captured well the onset and termination of the freeze/thaw cycle and produced a lower peak growing season compared to FLUXNET, with the assimilation run producing even lower soil moisture due to the weak seasonality captured by SMAP. For temperate forest sites (CA-TP1, US-MMS, and US-Oho), the assimilation run had a generally better agreement with FLUXNET, capturing the timing fairly well with improved peak and phase compared to the baseline run. The good agreements for US-Ton and US-Whs suggest there was less discrepancy between the model and the observations for the biomes that had open-canopy vegetations, where the assimilation run slightly improved the agreements with FLUXNET due to the higher accuracy of SMAP in these regions.

Effect of Assimilation on Carbon Fluxes
Generally, for sites that have considerable discrepancies in soil moisture, the SMAP assimilation improved the agreement of seasonality in the net ecosystem CO2 exchange with the FLUXNET observations ( Figure 6). The assimilation of SMAP slightly altered the simulated CO2 flux

Effect of Assimilation on Carbon Fluxes
Generally, for sites that have considerable discrepancies in soil moisture, the SMAP assimilation improved the agreement of seasonality in the net ecosystem CO 2 exchange with the FLUXNET observations ( Figure 6). The assimilation of SMAP slightly altered the simulated CO 2 flux during the growing season, but had a limited effect on the seasonal pattern, despite significant changes in soil moisture for a few sites (e.g., US-Prr and CA-CP1). For the sites where the probability density function of modeled soil moisture agreed relatively well with SMAP (Figure 4; S-Ton and US-Whs), the assimilation resulted in a higher carbon uptake during the growing season but had a slightly weaker correlation with FLUXNET observations, suggesting that factors other than soil moisture likely control CO 2 flux during the growing season.  To assess the effect of the assimilation of soil moisture on carbon fluxes at different time scales, we calculated the wavelet spectral of normalized residual errors between modeled carbon fluxes and FLUXNET measurement on the basis of site year (Figure 7). To improve the representativeness for biomes, 26 FLUXNET sites where SMAP observations were available and assimilated into LPJwsl were used to make the comparison. For each site year (2015-2019) from the LPJ-wsl simulation, the mean marginal distribution of the power spectrum showed the variations of model error with time scales. In this context, any time scales for which the model had lower power indicated improvements against the observations. This showed, for example, that the error in CO2 flux in the model was generally lowest at the monthly time scale (30 days) and higher at the shorter and longer time scales. This was reflected partly because of using the monthly climate datasets, as the submonthly simulations were based on a weather generator. To assess the effect of the assimilation of soil moisture on carbon fluxes at different time scales, we calculated the wavelet spectral of normalized residual errors between modeled carbon fluxes and FLUXNET measurement on the basis of site year (Figure 7). To improve the representativeness for biomes, 26 FLUXNET sites where SMAP observations were available and assimilated into LPJ-wsl were used to make the comparison. For each site year (2015-2019) from the LPJ-wsl simulation, the mean marginal distribution of the power spectrum showed the variations of model error with time scales. In this context, any time scales for which the model had lower power indicated improvements against the observations. This showed, for example, that the error in CO 2 flux in the model was generally lowest at the monthly time scale (30 days) and higher at the shorter and longer time scales. This was reflected partly because of using the monthly climate datasets, as the sub-monthly simulations were based on a weather generator. The comparison between the baseline and the assimilation run suggested improved agreement at time scales greater than 40 days for the boreal and tropical sites, with little/no improvement for the tundra and temperate sites. However, the assimilation run tended to have higher bias for time scales less than 30 days, which was partly due to the simple assimilation strategy that corrected the soil moisture every 3-5 days given the revisit time of SMAP. This assimilation strategy could possibly introduce unrealistic variability at these time scales, due to misaligned SMAP sampling meteorology with the weather generator in LPJ-wsl. Note that the site observations have large uncertainties in carbon fluxes with high year-to-year variations, especially for CH4 observations, which are sensitive to environmental conditions such as water table depth and microbial composition [36,64,71]. The lack of measurements limits the evaluation for the tropical sites since only two subtropical grassland sites are available while no representation of tropical floodplain is included. In addition, the sites with CH4 flux measurements tend to have high soil organic matter and saturated soil water; thus, the evaluation of assimilated CH4 is affected by the accuracy of SMAP retrieval [73,75].

Evaluation of the Assimilation at Global Scale
To assess the effect of soil moisture assimilation at a global scale, we calculated the correlation coefficients and RMSD of the simulated daily soil moisture from the baseline and assimilated the run against SMAP and against the satellite-based surface soil moisture retrievals from the ESA CCI SM dataset for 2015-2019 ( Figure 8). The results showed that LPJ-wsl baseline produced different temporal patterns than SMAP for the high latitudes and had a higher RMSD for most regions, except semi-arid regions, which is consistent with previous studies [74,76]. This was also the case for the metrics computed versus ESA CCI, which suggested a systematic difference between the satellite and model data, with the highest agreement for semi-arid regions. As expected, the correlation and RMSD were improved (by construction) between the assimilation run and SMAP. The comparison with ESA CCI suggested that the assimilation could significantly increase the correlation coefficient and lower the RMSD for the northern high latitudes. Given that ESA CCI is an independent product, The comparison between the baseline and the assimilation run suggested improved agreement at time scales greater than 40 days for the boreal and tropical sites, with little/no improvement for the tundra and temperate sites. However, the assimilation run tended to have higher bias for time scales less than 30 days, which was partly due to the simple assimilation strategy that corrected the soil moisture every 3-5 days given the revisit time of SMAP. This assimilation strategy could possibly introduce unrealistic variability at these time scales, due to misaligned SMAP sampling meteorology with the weather generator in LPJ-wsl. Note that the site observations have large uncertainties in carbon fluxes with high year-to-year variations, especially for CH 4 observations, which are sensitive to environmental conditions such as water table depth and microbial composition [36,64,71]. The lack of measurements limits the evaluation for the tropical sites since only two subtropical grassland sites are available while no representation of tropical floodplain is included. In addition, the sites with CH 4 flux measurements tend to have high soil organic matter and saturated soil water; thus, the evaluation of assimilated CH 4 is affected by the accuracy of SMAP retrieval [73,75].

Evaluation of the Assimilation at Global Scale
To assess the effect of soil moisture assimilation at a global scale, we calculated the correlation coefficients and RMSD of the simulated daily soil moisture from the baseline and assimilated the run against SMAP and against the satellite-based surface soil moisture retrievals from the ESA CCI SM dataset for 2015-2019 (Figure 8). The results showed that LPJ-wsl baseline produced different temporal patterns than SMAP for the high latitudes and had a higher RMSD for most regions, except semi-arid regions, which is consistent with previous studies [74,76]. This was also the case for the metrics computed versus ESA CCI, which suggested a systematic difference between the satellite and model data, with the highest agreement for semi-arid regions. As expected, the correlation and RMSD were improved (by construction) between the assimilation run and SMAP. The comparison with ESA CCI suggested that the assimilation could significantly increase the correlation coefficient and lower the RMSD for the northern high latitudes. Given that ESA CCI is an independent product, the reduced RMSD over large regions against the ESA CCI estimate indicates that the behavior of the model soil moisture is largely adjusted to be more consistent with satellite-based observations.   ) and RMSD (c,d) of simulated daily SWI between LPJ_200 and baseline when compared to two satellite soil moisture datasets (SMAP and ESA CCI) for 2015-2019. SMAP data were assimilated into LPJ_200. Blue color suggests improved agreement of LPJ-simulated SWI against the satellite datasets with the darker color representing higher agreements, while the red color represents reduced agreements.

Implications of Assimilated Results for Carbon Cycle Science
Even though, for many regions (e.g., tropical forests, high latitudes), SMAP observations are unavailable or have a very limited number of samples, it is still feasible to assess the influence of assimilating SMAP soil moisture at a regional scale. Below, we use two examples to demonstrate its potential influences on estimating carbon fluxes.

European Drought in 2018
Europe was stricken by an extreme summer heat wave and drought in 2018, which resulted in strong reductions in vegetation productivity and increased ecosystem respiration due to soil moisture deficits and, consequently, a reduction in the net CO2 uptake in ecosystems [77]. Previous studies [78][79][80] have shown that prognostic land surface models tend to underestimate the effect of droughts due to the limited representation of hydrologic processes, which affects the simulated carbon balance through soil moisture deficits, high water vapor pressure, and heat stress. Figure 9 shows that assimilating SMAP moisture into LPJ-wsl can introduce an improved seasonality in the net carbon uptake. Figure 9a shows that, despite both the baseline and assimilation runs showing good agreement, that there was a minimum in the soil moisture during the growing season, and assimilating SMAP reduced the amplitude of the seasonal cycle, with drier conditions during the non-growing seasons for 2015-2019. It should also be noted that, even though the baseline simulated the extreme drought condition in 2018 reasonably well given the precipitation input from the observation-based climate datasets, SMAP suggested a less distinctive drought feature with higher 2018 summer soil moisture compared to the baseline.  coefficient (a,b) and RMSD (c,d) of simulated daily SWI between LPJ_200 and baseline when compared to two satellite soil moisture datasets (SMAP and ESA CCI) for 2015-2019. SMAP data were assimilated into LPJ_200. Blue color suggests improved agreement of LPJ-simulated SWI against the satellite datasets with the darker color representing higher agreements, while the red color represents reduced agreements.

Implications of Assimilated Results for Carbon Cycle Science
Even though, for many regions (e.g., tropical forests, high latitudes), SMAP observations are unavailable or have a very limited number of samples, it is still feasible to assess the influence of assimilating SMAP soil moisture at a regional scale. Below, we use two examples to demonstrate its potential influences on estimating carbon fluxes.

European Drought in 2018
Europe was stricken by an extreme summer heat wave and drought in 2018, which resulted in strong reductions in vegetation productivity and increased ecosystem respiration due to soil moisture deficits and, consequently, a reduction in the net CO 2 uptake in ecosystems [77]. Previous studies [78][79][80] have shown that prognostic land surface models tend to underestimate the effect of droughts due to the limited representation of hydrologic processes, which affects the simulated carbon balance through soil moisture deficits, high water vapor pressure, and heat stress. Figure 9 shows that assimilating SMAP moisture into LPJ-wsl can introduce an improved seasonality in the net carbon uptake. Figure 9a shows that, despite both the baseline and assimilation runs showing good agreement, that there was a minimum in the soil moisture during the growing season, and assimilating SMAP reduced the amplitude of the seasonal cycle, with drier conditions during the non-growing seasons for 2015-2019.
It should also be noted that, even though the baseline simulated the extreme drought condition in 2018 reasonably well given the precipitation input from the observation-based climate datasets, SMAP suggested a less distinctive drought feature with higher 2018 summer soil moisture compared to the baseline. The seasonal cycle of net ecosystem productivity reveals important differences between the baseline and assimilation runs in estimating carbon fluxes (Figure 9b). The assimilation run shows that SMAP produced extremely low peak carbon uptake during the summer of 2018, while the baseline run suggests a moderate drought effect on net carbon uptake. Considering that SMAP introduces a wetter soil condition during the growing season than the baseline, this indicates the significant influence of soil moisture during the early growing season, when the drier soil conditions introduced by SMAP had a weak impact on vegetation growth during the early growing season but had a strong influence on evapotranspiration ( Figure A2). The less affected vegetation growth contributes to a high soil moisture depletion during the growing season, which amplifies the overall effect of summer droughts on net ecosystem productivity ( Figure A2). This resulted in a weak net annual carbon source of 0.07 Pg C yr −1 in 2018, in contrast to a net annual carbon uptake (sink) of −0.17 Pg C yr −1 from the baseline. In addition, the assimilation can produce a lower seasonality in net carbon uptake during the growing season, which is in better agreement with FLUXCOM.

Sudd Wetlands
Assimilated soil moisture has been shown to improve the spatio-temporal distribution of modeled CH4 fluxes over the South Sudan region, with a central focus on the Sudd wetlands. We chose the Sudd wetlands because of their important role in wetland CH4 emissions highlighted by recent studies [81][82][83], as well as the good accuracy and spatial coverage of SMAP soil moisture for the low-canopy-density region [74]. The map of difference in soil moisture between the assimilation run and baseline clearly shows that the assimilation run produced higher soil moisture around the Sudd wetland regions, causing higher estimates of wetland area due to higher water table depth through the TOPMODEL hydrologic module of LPJ-wsl. The higher wetland areas estimated by the assimilation run were consistent with the findings from [83], which suggests that wetland models largely underestimate the wetland area of the Sudd region.
The modeled CH4 fluxes over the Sudd wetlands from the assimilation run had better agreement in terms of their seasonal cycle and annual total with posterior estimates from the inversion models [81,83] based on satellite measurements (Figure 10). The assimilation run showed a dual peak of CH4 fluxes in the March-April-May and September-October-November seasons, which is consistent with the seasonality detected by inversion models based on GOSAT and TROPOMI measurements [81,82]. In contrast, the baseline generally produces a lower intra-seasonal variability than the assimilation run, with the highest emissions in the late growing season. In addition, the timing of a record high peak in SON 2019 is consistent with the posterior estimates The seasonal cycle of net ecosystem productivity reveals important differences between the baseline and assimilation runs in estimating carbon fluxes (Figure 9b). The assimilation run shows that SMAP produced extremely low peak carbon uptake during the summer of 2018, while the baseline run suggests a moderate drought effect on net carbon uptake. Considering that SMAP introduces a wetter soil condition during the growing season than the baseline, this indicates the significant influence of soil moisture during the early growing season, when the drier soil conditions introduced by SMAP had a weak impact on vegetation growth during the early growing season but had a strong influence on evapotranspiration ( Figure A2). The less affected vegetation growth contributes to a high soil moisture depletion during the growing season, which amplifies the overall effect of summer droughts on net ecosystem productivity ( Figure A2). This resulted in a weak net annual carbon source of 0.07 Pg C yr −1 in 2018, in contrast to a net annual carbon uptake (sink) of −0.17 Pg C yr −1 from the baseline. In addition, the assimilation can produce a lower seasonality in net carbon uptake during the growing season, which is in better agreement with FLUXCOM.

Sudd Wetlands
Assimilated soil moisture has been shown to improve the spatio-temporal distribution of modeled CH 4 fluxes over the South Sudan region, with a central focus on the Sudd wetlands. We chose the Sudd wetlands because of their important role in wetland CH 4 emissions highlighted by recent studies [81][82][83], as well as the good accuracy and spatial coverage of SMAP soil moisture for the low-canopy-density region [74]. The map of difference in soil moisture between the assimilation run and baseline clearly shows that the assimilation run produced higher soil moisture around the Sudd wetland regions, causing higher estimates of wetland area due to higher water table depth through the TOPMODEL hydrologic module of LPJ-wsl. The higher wetland areas estimated by the assimilation run were consistent with the findings from [83], which suggests that wetland models largely underestimate the wetland area of the Sudd region.
The modeled CH 4 fluxes over the Sudd wetlands from the assimilation run had better agreement in terms of their seasonal cycle and annual total with posterior estimates from the inversion models [81,83] based on satellite measurements ( Figure 10). The assimilation run showed a dual peak of CH 4 fluxes in the March-April-May and September-October-November seasons, which is consistent with the seasonality detected by inversion models based on GOSAT and TROPOMI measurements [81,82]. In contrast, the baseline generally produces a lower intra-seasonal variability than the assimilation run, with the highest emissions in the late growing season. In addition, the timing of a record high peak in SON 2019 is consistent with the posterior estimates based on TROPOMI measurements [82], which suggests a strong positive anomaly of CH 4

Discussion
Throughout this paper, we assessed the potential of the SMAP soil moisture product to be used to improve the LPJ-wsl simulation of carbon fluxes. Because systematic errors exist in both the SMAP product and the process-based models, the climatological rescaling of soil moisture between SMAP and LPJ-wsl is essential. The SWI conversion approach in this study has been proven to be useful at inducing the seasonality of soil moisture dynamics [84,85]. Given that it is difficult to adjust the model formulation, soil hydraulic characterization, and the degree of wetness relative to the range of moisture over which key processes (drainage, evapotranspiration) occur, the potential bias in the SWI conversion in this study and its effect on assimilation needs further investigation. A better rescaling approach for SMAP is needed to keep both sources relatively consistent, such as the linear cumulative density distribution (CDF) matching or the full CDF matching [26,60]. In addition, because the SMAP descending retrievals at local time (6 a.m.) were applied in this study, a proper conversion of SMAP soil moisture to daily averages may need to be developed to minimize the mismatch with the modeled daily average soil moisture.
The simulation experiments with different representative depths demonstrate the importance of assessing the suitable depth to be considered in the assimilation. Despite the fact that LPJ-wsl is discretized by an eight-layered hydrology scheme, there is no exact corresponding depth to match the thin surface layer detected by the SMAP instrument. In addition, the sensing depth of SMAP is affected by local environmental conditions and vegetation types [75]. Therefore, it is impossible to fully reconcile the discrepancies in modeled soil moisture with remotely sensed measurements, as there is a structural mismatch between satellite-based observations and terrestrial ecosystem models. In this study, we tested altering the soil moisture using a first-order method by replacing the soil moisture with observations for a specific depth. A better approach to improve the issue of representativeness is to use an exponential filter [86][87][88], which assumes that the soil moisture of deeper layers can be derived from an exponential relationship given the observed soil moisture in the surface layer.
The improvement of the assimilation over different regions largely depended on the agreement of SMAP with the in situ measurements. This is reflected in the comparison between the model and the data, which highlighted that the model behavior with respect to soil moisture after assimilation had high similarity in a spatio-temporal pattern with SMAP, despite the simulated

Discussion
Throughout this paper, we assessed the potential of the SMAP soil moisture product to be used to improve the LPJ-wsl simulation of carbon fluxes. Because systematic errors exist in both the SMAP product and the process-based models, the climatological rescaling of soil moisture between SMAP and LPJ-wsl is essential. The SWI conversion approach in this study has been proven to be useful at inducing the seasonality of soil moisture dynamics [84,85]. Given that it is difficult to adjust the model formulation, soil hydraulic characterization, and the degree of wetness relative to the range of moisture over which key processes (drainage, evapotranspiration) occur, the potential bias in the SWI conversion in this study and its effect on assimilation needs further investigation. A better rescaling approach for SMAP is needed to keep both sources relatively consistent, such as the linear cumulative density distribution (CDF) matching or the full CDF matching [26,60]. In addition, because the SMAP descending retrievals at local time (6 a.m.) were applied in this study, a proper conversion of SMAP soil moisture to daily averages may need to be developed to minimize the mismatch with the modeled daily average soil moisture.
The simulation experiments with different representative depths demonstrate the importance of assessing the suitable depth to be considered in the assimilation. Despite the fact that LPJ-wsl is discretized by an eight-layered hydrology scheme, there is no exact corresponding depth to match the thin surface layer detected by the SMAP instrument.
In addition, the sensing depth of SMAP is affected by local environmental conditions and vegetation types [75]. Therefore, it is impossible to fully reconcile the discrepancies in modeled soil moisture with remotely sensed measurements, as there is a structural mismatch between satellite-based observations and terrestrial ecosystem models. In this study, we tested altering the soil moisture using a first-order method by replacing the soil moisture with observations for a specific depth. A better approach to improve the issue of representativeness is to use an exponential filter [86][87][88], which assumes that the soil moisture of deeper layers can be derived from an exponential relationship given the observed soil moisture in the surface layer.
The improvement of the assimilation over different regions largely depended on the agreement of SMAP with the in situ measurements. This is reflected in the comparison between the model and the data, which highlighted that the model behavior with respect to soil moisture after assimilation had high similarity in a spatio-temporal pattern with SMAP, despite the simulated carbon fluxes that are still subject to the model structure, forcing errors, and parameter errors. This indicates that, globally, the assimilation does not necessarily yield better agreement with upscaled carbon flux products. This is possibly due to the limited accuracy of SMAP for regions with dense vegetation or high soil organic matter, given the influence of vegetation water content on the canopy in terms of the retrieval of soil moisture at the surface [89] and the underrepresentation of soil organic matter in radiative transfer modeling [90]. It is also worth noting that it is difficult to evaluate the spatial patterns of carbon fluxes with the upscaling products, which are also affected by the spatial distribution of sites, where regions with little/no representativeness tend to have higher bias in carbon fluxes, e.g., in the tropics [67].
Nevertheless, our result, showing that SMAP soil moisture can be used to represent deeper layers than 5 cm, was expected based on previous studies. Namely, under drier conditions, the emission depth of L-band microwaves greatly increases beyond 5 cm. Under wetter conditions, L-band soil moisture has a greater information content of deeper soil layers [56,57]. It is unexpected that SMAP would best represent moisture dynamics in the full 200 cm LPJ soil column. We speculate that this results from the fact that LPJ's simple representation of infiltration to deeper soil layers may not capture monthly and seasonal dynamics. SMAP may provide a first-order constraint on the deeper soil layer dynamics. It is possible that the best performance may come from an improved LPJ representation of deeper soil moisture dynamics between 50 and 200 cm, along with SMAP assimilated into the top 50 cm layer. Alternatively, future LPJ representations can consider assimilating GRACE terrestrial water storage dynamics for the deepest soil layers.
The evaluation of simulated carbon fluxes against FLUXNET is inherently limited by the sparse spatial observations and the nature of high site-to-site variability. Owing to the very sparse coverage of the FLUXNET sites (n = 26) applied in the wavelet analysis, there is still large uncertainty and potential bias in the analysis that needs to be further investigated. Because of the purpose of evaluating CO 2 and CH 4 fluxes simultaneously, the selected sites tended to have high soil moisture and soil organic matter. This influences the accuracy of SMAP retrievals as the dielectric mixing model in SMAP does not explicitly consider organic matter or the potential bias of SMAP in high soil wetness conditions [75,89]. In addition, errors in the surface meteorological forcing may explain the differences between the model estimate and the SMAP product, as most of the variability of soil moisture is driven by precipitation events [91]. Therefore, the evaluation metrics are affected by the forcing errors, especially since the gridded climate dataset (i.e., CRU) applied in LPJ-wsl simulation is different from the in situ meteorological conditions.
The promising but limited improvements in modeled carbon fluxes after the first-order assimilation in this study show that more sophisticated assimilation approaches are needed. A thorough examination of the bias in simulated soil moisture would help to isolate the bias induced by parameterization and unmodeled processes from the total systematic difference between the model and observations, which was treated as bias in this study. This is the prerequisite to employ more sophisticated assimilation approaches such as the ensemble Kalman filter [92]. To improve the simulated carbon fluxes at a sub-monthly scale, a few temporal gapfilling approaches (e.g., SWI smoothing filter) are useful to potentially improve the soil moisture at a sub-monthly scale. Lastly, given that soil moisture can influence multiple components of the carbon cycle processes in terrestrial ecosystem models, it is better to use multiple independent constraints in the data assimilation, as proven by previous studies [30,32]. A further principal component analysis is needed to examine the best combination of constraining parameters with SMAP to determine the multiple-constraint assimilation scheme either by a stepwise approach [93] or by a simultaneous approach [30].
The improved estimates of CO 2 and CH 4 fluxes after assimilation at regional scale suggest important implications of assimilating SMAP soil moisture into terrestrial ecosystem models for carbon cycle research. The assimilation of SMAP can help explain the response of terrestrial ecosystems to climate events. As the simulations for the 2018 European extreme drought suggested, the assimilated spatio-temporal patterns of soil moisture could result in strong reductions in vegetation productivity during the onset of drought events and cause a strong legacy effect in the subsequent vegetation mortality, yielding a significant impact on the net carbon uptake. Assimilating SMAP soil moisture also shows capacity in capturing pulses of wetland CH 4 emissions for regions that do not have reliable direct measurements. The spatio-temporal distribution of wetland CH 4 in Sudd wetlands corroborates the underestimated wetland CH 4 emission in that region detected by the TROPOMI measurements [82], as well as a few recent atmospheric inversion results [81,82]. Note that improving soil moisture has a limited effect. An example is that TROPOMI detected high emission in the December-January-February (DJF) season for the Sudd wetland, whereas for both the baseline and assimilation runs the inundated area was lowest. This can be explained by model structure: the model is mainly driven by precipitation, which is suggested to be lowest in the DJF season, while river height is at a high level. This is a feasible way to incorporate the advantages of SMAP to assess the effect of soil moisture on the terrestrial carbon cycle given its unique role in directly monitoring the surface condition continuously.

Conclusions
Even though there have been many studies evaluating the usefulness of the SMAP dataset for land surface modeling, this study provides the first experiment at a global scale that evaluates the effect of SMAP soil moisture on simulated CO 2 and CH 4 fluxes. Through a direct insertion method, we found that assimilating SMAP can produce a modest improvement in soil moisture against site-level observations with altered seasonal dynamics. The assimilated run showed a promising but limited improvement over site-level CO 2 and CH 4 fluxes at long time scales (>40 days) for boreal and tropical sites, with little/no improvement for the tundra and temperate sites. The assessment of the assimilating run for two climatic events at a regional scale suggested that SMAP can help the model to capture the observed seasonality of CO 2 and CH 4 fluxes. The work presented here confronts how well the model can capture the temporal pattern of soil moisture and prompts future analysis to utilize SMAP in simulating coupling between soil moisture and carbon fluxes. Acknowledgments: We thank Alka Singh for sharing the processed SMAP Level 3 soil moisture dataset. We gratefully acknowledge the eddy covariance data acquired and shared by the FLUXNET community.

Conflicts of Interest:
The authors declare no conflict of interest.

Appendix A
Appendix A.1. Wavelet Analysis Figure A1 show an example of how the marginal distribution of power spectra is calculated. First, we use the time series of model and observation for each site-year to calculate the normalized residual error. Second, we tested data-model agreement in the frequency domain for CO 2 and CH 4 fluxes by calculating the spectral properties of the daily times series of normalized residual error in CO 2 and CH 4 fluxes. At the last we calculate marginal distribution of spectra of the normalized residual errors grouped by site biomes.

Biome
Site ID