Next Article in Journal
Scale Effects and Time Variation of Trade-Offs and Synergies among Ecosystem Services in the Pearl River Delta, China
Next Article in Special Issue
Groundwater Management in an Uncommon and Artificial Aquifer Based on Kc Approach and MODIS ET Products for Irrigation Assessment in a Subtropical Island
Previous Article in Journal
A Continuous PDR and GNSS Fusing Algorithm for Smartphone Positioning
Previous Article in Special Issue
Different Vegetation Information Inputs Significantly Affect the Evapotranspiration Simulations of the PT-JPL Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Can Data Assimilation Improve Short-Term Prediction of Land Surface Variables?

1
State Key Laboratory of Earth Surface Processes and Resource Ecology, School of Natural Resource, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China
2
National Center for Atmospheric Research, Boulder, CO 80301, USA
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(20), 5172; https://doi.org/10.3390/rs14205172
Submission received: 13 September 2022 / Revised: 11 October 2022 / Accepted: 12 October 2022 / Published: 16 October 2022

Abstract

:
Data assimilation methods have been used to improve the performances of land surface models by integrating remote sensing and in situ measurements. However, the impact of data assimilation on improving the forecast of land surface variables has not been well studied, which is essential for weather and hydrology forecasting. In this study, a multi-pass land data assimilation scheme (MLDAS) based on the Noah-MP model was used to predict short-term land surface variables (e.g., sensible heat fluxes (H), latent heat fluxes (LE), and surface soil moisture (SM)) by jointly assimilating soil moisture, leaf area index (LAI) and solar-induced chlorophyll fluorescence (SIF). The test was conducted at the Mead site during the growing season (1 May to 30 September) in 2003, 2004, and 2005. Four assimilation-prediction scenarios (assimilating for 15 days, 45 days, 75 days, and 105 days from 1 May, then predicting one future month) are adapted to evaluate the influence of assimilation on subsequent prediction against Noah-MP open-loop simulation (OL). On average, MLDAS produces 28.65%, 27.79%, and 19.15% lower root square deviations (RMSD) for daily H, LE, and SM prediction compared to open-loop run, respectively. The influence of assimilation on prediction can reach around 60 days and 100 days for H (LE) and SM, respectively. Our findings indicate that data assimilation can improve the accuracy of land surface variables in a short-term prediction period.

1. Introduction

It is critical to monitor and predict land surface variables related to vegetation dynamics, water and carbon fluxes, which are essential for better understanding complex land–atmosphere interactions and climate change [1,2,3]. Process-based land surface models (LSMs) make it possible to simulate land surface variables and represent complex land surface processes [4]. A series of models have been developed to study land and lower atmosphere conditions. Among them, the Noah with multi-parameterization model (Noah-MP model) significantly improves the simulations of land surface vegetation and heat fluxes with its advantages of using multiple optional parameterization schemes to interpret the differences in modeling simulations [5].
While LSMs can provide temporally and spatially continuous simulations of land surface variables, there are still substantial uncertainties caused by initial conditions, forcing data, interactions between land surface and atmosphere, and model physics within LSMs, leading to significant differences between simulation and reality [6]. Data assimilation (DA) methods that can merge observational information into process-based LSMs can constrain parameters and optimize the simulations of land surface variables. Most data assimilation schemes are developed with variational-based or ensemble-based methods to improve the performance of LSMs [4,5,6,7,8]. Based on the assimilation theories, various observational variables such as land surface temperature (LST), leaf area index (LAI), and soil moisture (SM) observations have been assimilated into LSMs, improving the simulations of land surface variables from LSMs. For instance, assimilation of SM improves the estimation of evapotranspiration [9,10], and assimilation of LAI optimizes estimates of carbon fluxes of gross primary production (GPP) and net ecosystem exchange (NEE) [11,12,13].
It is notable that these traditional data assimilation frameworks are built with a single or dual observational variable assimilated into the process-based models [11,14,15]. Albergel et al. [16] found that root zone soil moisture (RZSM) can benefit from the assimilation of SM and LAI, which indicates that assimilation of multi-variables at once may improve the performance of land surface variable simulations. Recently, a multi-pass land data assimilation scheme (MLDAS) has been proposed, which can assimilate different observational variables into LSMs to optimize simulations of land surface variables all at once [17]. As different observed variables are jointly assimilated, MLDAS potentially improves the capability of LSM simulations.
Although some data assimilation schemes have been constructed within LSMs to better simulate land surface variables, few studies are focused on the potential of improvement of predictivity skills brought by data assimilation. It is believed that uncertainties of land surface forecast mainly arise from initial conditions (ICs), meteorological forcing, and errors in model structure, and initial conditions play an important role among them. Wood and Lettenmaier [7] evaluated the relative importance of initial conditions and meteorological forcing in hydrologic forecast and discovered that initial conditions dominated the forecast in some cases. Sawada et al. [18] assimilated microwave brightness temperature into a land surface model for better monitoring and forecast of agricultural drought. The results showed that data assimilation especially sequential data assimilation may provide more accurate initial conditions that lead to better LAI and SM forecasts, which were likely to benefit agricultural drought monitoring. Quantities of data assimilation experiments have proved that accurate initial conditions are crucial for prediction, but the observational data are assimilated into the LSM separately in those experiments. It is a topic of interest to test whether assimilating several land surface variables into LSM jointly would improve prediction of LSMs.
The objective of this study is to test whether Noah-MP LSM coupled with MLDAS (Noah-MP-MLDAS) can better predict land surface variables such as LAI, SM, sensible heat flux (H) and latent heat flux (LE) compared to Noah-MP open-loop simulations (OL) (simulations of Noah-MP without assimilation). Assimilating for 15 days, 45 days, 75 days, and 105 days from May 1st, four sets of data assimilation experiments (four scenarios) were designed and conducted at an AmeriFlux crop site, namely Mead (Mead, Nebraska, USA) during the growing season (1 May to 1 October) in 2003, 2004 and 2005, respectively. The SM, H, and LE assimilation outputs for the growing season served as benchmarks for evaluating the predictability of MLDAS and open-loop run (OL), the duration of impact on prediction in each scenario was further evaluated. This study demonstrates how data assimilation can improve the prediction capability of LSM and is expected to be useful for scientific research on LSM prediction and agricultural monitoring.

2. Materials and Methods

2.1. Data

The assimilation-prediction experiment was conducted using MLDAS at the AmeriFlux Mead (US-Ne1) site, which is located at the University of Nebraska Agricultural Research and Development Center. There are 3 sites (US-Ne1, US-Ne2, and US-Ne3) over Mead which are within 1.6 km of each other. Site 1 (US-Ne1), used in our study, is an irrigated site covered by corn [19].
Meteorological forcing data of the Mead site, including air temperature, pressure, humidity, precipitation, wind speed, incoming shortwave and longwave radiation used to drive Noah-MP, were downloaded from AmeriFlux (http://ameriflux.lbl.gov/, accessed on 7 October 2021). LAI and SM were also acquired from the AmeriFlux database. SM was measured at depths of 0.02, 0.05, and 0.1 m, which was averaged to be consistent with the topsoil layer of Noah-MP (0.1m) [17]. The remotely-sensed solar-induced chlorophyll fluorescence (SIF) products with the spatiotemporal resolution of 5 km and 4 days were available via https://osf.io/8xqy6/?view_only=4d14a64b7be644d-39564c82a371e1d20 (accessed on 1 November 2021) [20].

2.2. Noah-MP Model

Noah-MP is enhanced from Noah LSM [21] with expanded and multi-parameterization options [4,5]. A two-source energy balance module, short-term dynamic vegetation growth model, modified two-stream radiation transfer scheme, dynamic groundwater component, and multilayer snowpack are included in Noah-MP LSM. The multi-parameterization options make it possible for users to try combinations of parameters in different modules to optimize the model performance, such as leaf dynamics, surface-layer turbulence treatment, canopy stomatal resistance, runoff, and groundwater [5]. A crop module has also been added to Noah-MP LSM, improving the simulations of vegetation dynamics and turbulent fluxes for corn and soybean by using several agricultural management data, such as growing degree days (GDDs) and planting/harvesting dates [8,9,22].

2.3. Multi-Pass Land Data Assimilation Scheme

A multi-pass land data assimilation scheme (MLDAS) has been proposed based on the Noah-MP model [4,5] and ensemble Kalman filter (EnKF) method. Unlike other data assimilation (DA) methods, MLDAS can assimilate multisource remotely sensed or ground-measured observations in multiple passes to optimize parameters and state variables simultaneously. Currently, four passes have been built to assimilate LAI (pass I–pass II), SM (pass III), and SIF (pass IV) to update specific leaf area (SLA), leaf biomass (LFMASS), SM and maximum carboxylation rate at a reference temperature of 25 °C (Vcmax25) in MLDAS successfully [17]. Pass I to pass II are designed to update simulation on plant productivity, ecological behavior and plant growing rates, which are key factors for vegetation status and evapotranspiration. SLA is defined as ratio of LAI to LFMASS, so LAI can be assimilated in two passes to update these two parameters jointly. In pass I, LAI is assimilated to update the constant parameter SLA which is equivalent to BIO2LAI in Noah-MP. In pass II, state variable LFMASS is updated. SM is updated in pass III directly. Through assimilating SIF in pass IV, we can optimize Vcmax25 to better control carbon assimilation processes and represent plant productivity [23].
In our study, four passes were opened to jointly assimilate LAI, SM and SIF. LAI observations will be assimilated when available, and SM and SIF observations will be assimilated when available at daily and four-day intervals, respectively.

2.4. Experiments Setup

In this study, the MLDAS system was used to test the impact of data assimilation on land surface variable prediction. Using the same atmospheric forcing data, the MLDAS predictions (red arrows in Figure 1) were compared with simulations of Noah-MP OL (without assimilation, blue arrow in Figure 1) to evaluate the influence of assimilation on predictions. Four assimilation-prediction scenarios (assimilating for 15 days, 45 days, 75 days, and 105 days from 1 May, and then predicting until 1 October) were adapted during the growing season (1 May to 30 September) in 2003, 2004, and 2005, respectively (as shown in Figure 1). Furthermore, considering that the differences between OL and MLDAS are significant in the first month of prediction, the first 30 days of the prediction period of each scenario (15 May–15 June, 15 June–15 July, 15 July–15 August, and 15 August–15 September for scenario I, II, III, and IV, respectively) were used for statistical analysis. This part corresponds to the red-dashed lines in Figure 1. Notably, the main goal of this study was to assess to what extent data assimilation could influence the prediction of land surface variables, so we took the output of assimilation for the whole growing season (1 May to 30 September) as the “benchmarks” that contain information from both models and observations considering the uncertainties in raw observations. The “benchmarks” corresponds to the green arrow in Figure 1 and will be further compared with the prediction of MLDAS and OL.
Here, normalized information contribution (NICRMSD) was adapted to quantify the differences between MLDAS and OL, which was defined by Equation (1) [24]. RMSDAnalysis (RMSDModel) refers to RMSD between “benchmarks” and MLDAS (OL). A positive NICRMSD indicates the prediction of MLDAS is improved compared with OL. A negative NICRMSD means degradation. Besides, the relative difference (RD) between OL and MLDAS defined by Equation (2) was used to show how long the influence of assimilation exists during the prediction period. RD in the equation is relative difference, LSVModel and LSVAnalysis are simulations of land surface variables (LSV) from OL and MLDAS, respectively.
N I C R M S D = R M S D M o d e l R M S D A n a l y s i s R M S D M o d e l × 100
R D = | L S V M o d e l L S V A n a l y s i s | L S V M o d e l × 100

3. Results

3.1. Impacts of Assimilation on Land Surface Variables Prediction

The output time step of MLDAS is 30 min, so H, LE, and SM predictions were daytime-averaged (09:00–18:00, local time) for further estimates within all assimilation-prediction scenarios. As with each assimilation scenario, a short-term prediction (forecasting lead time of 30 days) for LAI, SM, H, and LE was conducted separately in MLDAS and OL. NICRMSD was used to show the advances in land surface variables prediction by MLDAS compared with Noah-MP OL.

3.1.1. Leaf Area Index

Figure 2 shows the prediction of LAI in MLDAS and OL at the Mead site from 2003 to 2005. Since LAI observations are limited, the ground-measured LAI was used as the references in this section, and NICRMSD of LAI was not calculated herein. As shown, MLDAS performs better than OL in most scenarios compared to ground-measured LAI. In general, LAI is tended to be overestimated, especially during DOY 180–220 in OL. MLDAS corrects the overestimation and produces predictions that can better fit LAI observations, suggesting that assimilation can bring better prediction skills to MLDAS. However, as Figure 2 shows, the prediction skill of MLDAS is worsened in scenario I assimilating for 15 days in 2004 and 2005. This is because only one set of LAI observations with relatively low values was assimilated into the Noah-MP model, bringing uncertainties to future prediction. In contrast, longer assimilation then prediction generally leads to better predictions of LAI peaks, which are especially better captured in scenario III from 2003 to 2005. It is also notable that the correction in MLDAS is most obvious in midsummer, when LAI is at peak. Correction brought by MLDAS is no longer evident compared with OL after midsummer (scenario IV).

3.1.2. Soil Moisture

The statistics of RMSDAnalysis, RMSDModel, and NICRMSD at the Mead site from 2003 to 2005 are shown in Table 1. As is shown, 9 out of 12 experiments experience improvement in the prediction from OL to MLDAS. The maximum relative difference is up to 60.41% in scenario IV in 2003. There are three sets of experiments that appear to be a degradation of predicting performance after assimilation. The NICRMSD is negative in scenarios I and II from 2003 to 2005, indicating that SM prediction improvements are unstable with a short data assimilation period (i.e., 15 or 45 days). The NICRMSD of scenarios III and IV generally increases in all 3 years, suggesting that forecast performance is better with the extended assimilation period.
The detailed impacts of data assimilation on SM prediction are shown in Figure 3. As is shown, Noah-MP OL tends to overestimate SM at the Mead site from 2003 to 2005. For assimilation-prediction scenarios I and II, the data assimilation performs similarly to the OL. However, with longer data assimilation periods (scenarios III and IV), data assimilation can significantly improve SM prediction compared to OL. For assimilation-prediction scenario IV (assimilation of 105 days), the prediction of SM can reach even the maximum and minimum benchmark values for the whole future month, indicating that the data assimilation significantly impacts the prediction of future SM.

3.1.3. Sensible Heat Flux

The performance of H from OL and MLDAS with a prediction lead time of 30 days is shown in Figure 4. Table 2 shows the NICRMSD of MLDAS and OL prediction in 2003 to 2005. Overall, RMSDAnalysis is decreased compared to RMSDModel in 10 of the 12 experiments. The result indicates that data assimilation can potentially improve the prediction skill of H in the Noah-MP model. Larger NICRMSD can be seen in scenario IV (assimilation of 105 days) with 57.07% in 2003, 47.80% in 2004, and 47.57% in 2005, indicating that a longer assimilation period results in better prediction performance for H. On the contrary, NICRMSD is negative in scenario I over 2003 and 2005, revealing that assimilating observations with large uncertainties into the model does not improve the prediction performance, but even degrades it.
Details on H prediction in different assimilation scenarios from 2003 to 2005 are given in Figure 4. Considering that there are subtler differences in H prediction between MLDAS and OL compared with SM, H prediction from four scenarios with lead times of 30 days are given to highlight the differences. In general, both MLDAS and OL can fit the benchmarks quite well. In Scenario I, the OL slightly outperformed the MLDAS in prediction, especially in 2003 and 2005. However, for scenarios II–IV, prediction in MLDAS can better capture the variations in benchmarks both in trend and magnitude. Among scenarios II–IV, scenario II (assimilation of 45 days) improves the prediction skill of MLDAS most significantly, which can also be seen from NICRMSD in Table 2. Note that H predictions are relatively low with some cases lower than −50 W/m2 in scenario II–IV. To confirm the accuracy of benchmarks and predictions, raw observations of H from AmeriFlux were checked. There are also several H observations smaller than 0 W/m2 at this site during the middle of summer, indicating that the benchmarks and predictions are reliable here. Details regarding the observations can be seen at: https://ameriflux.lbl.gov/data/download-data/ (accessed on 10 October 2022). There are probably two reasons for this. Firstly, it is probably because SM simulations are generally higher than the benchmarks (as shown in Figure 3), making H prediction lower than expected. Secondly, irrigation at this site in the middle of the growing season may also be a factor influencing simulation on H. Moreover, the H and LE are mainly affected by the atmospheric state variables (i.e., air temperature and specific humidity) in wet and/or densely vegetated sites [25,26], not land surface variables (e.g., LAI, soil moisture). As it is very humid for scenario III (from Mid-July to Mid-August), the differences between OL and MLDAS will be small.

3.1.4. Latent Heat Flux

Statistical evaluation metrics (i.e., RMSDAnalysis, RMSDModel, NICRMSD) of LE predictions are summarized in Table 3. In general, data assimilation significantly improves the prediction skill of LE, and 10 out of 12 experiments obtain positive NICRMSD. Similarly, the MLDAS prediction is slightly worse compared to OL in scenario I (assimilation for 15 days), with a negative NICRMSD for 2003 and 2005. The prediction performance in MLDAS is improved rapidly in scenarios II–IV, which is similar than that of H. As for the trend of NICRMSD, NICRMSD increased from 25.69% to 58.08% with a longer assimilation period in 2003. However, the highest NICRMSD falls in scenario II in 2004 and 2005 (57.34% and 61.56%, respectively), which is similar than the trend of NICRMSD in H.
Figure 5 shows the comparison of the LE prediction with a lead time of 30 days between MLDAS and OL. Prediction in MLDAS, especially in scenarios II and IV of each year, is closer to the benchmarks in most experiments. The most apparent improvement can be seen in scenario II, where OL tends to overestimate LE while MLDAS corrects it. Moreover, the differences between MLDAS prediction and OL prediction diminish with lead time in scenario II over 3 years, indicating that the prediction skill brought by data assimilation also decreases with forecast lead time.

3.2. Duration of Impact of Data Assimilation on Prediction

This work focuses not only on whether data assimilation improves the performance of prediction, but also on how long the effect exists. Previous results have demonstrated that MLDAS does have the potential to improve the prediction skill of Noah-MP compared to OL, and specific comparisons are summarized in Table 1, Table 2 and Table 3. Therefore, it is necessary to analyze the impact of assimilation on the duration of the prediction. Herein, SM and LE are chosen to demonstrate this issue.
The overall dynamic changes of RD over four scenarios from 2003 to 2005 are given in Figure 6 and Figure 7. The RD of SM prediction is given in Figure 6. As indicated, RD in each of the four scenarios decreases with the forecast lead time, suggesting that improved prediction skill brought by data assimilation degrades with time. RD in scenario IV is generally larger than other scenarios and the decreasing tendency is not as clear as other scenarios. Figure 7 shows how RD of LE prediction changes over forecast lead time. Generally, the trend that RD diminishes with lead time is clear in scenarios I and II over the 3 years. Minimum fluctuations of RD can be seen in scenario III (assimilation for 75 days), where RD is less than 10% and close to 0% over most of the lead time, and corresponding changes can also be seen in scenario III of Figure 6. No significant trend of decreasing RD was found in scenario IV, indicating that the impact of data assimilation in scenario IV may persist for a longer period.
Scenario I (assimilation for 15 days) of each year (2003, 2004, and 2005) is chosen to show how RD changes with forecast lead time so that the forecast lead time can be long enough for evaluation. Figure 8 shows the RD of SM over the 3 years. Overall, the influence of assimilation on prediction also diminishes with forecast lead days. The impact of assimilation lasted for around 80 days, 70 days, and 100 days in 2003, 2004, and 2005, respectively. Then the RD converges to around 0. Additionally, precipitation is also an essential factor that affects SM simulation. There is a good correspondence between precipitation events and fluctuation of RD between MLDAS and OL before convergence (as shown in the grey parts). As is shown, RD between MLDAS and OL becomes smaller when precipitation happens, and RD increases fast without precipitation (i.e., 20–40 days in 2005), confirming that accurate precipitation inputs are important for SM prediction.
Figure 9 shows detailed RD of LE prediction between MLDAS and OL. Similar to SM prediction, the overall tendency of RD diminishes with lead time and apparent fluctuation can be seen in the first 20 to 30 forecast lead days. The RD also converged to around 0 at around 60, 55, and 55 lead days in 2003, 2004, and 2005, respectively.

4. Discussion

4.1. Can MLDAS Improve the near Future Prediction Performance?

Most NICRMSD of H, LE, and SM are positive within four data assimilation scenarios, indicating that MLDAS outperforms OL during the prediction period. It is widely accepted that forecasting errors arise from the uncertainty of initial conditions, model structure and parameters, interactions between land surface and atmosphere, and atmospheric forcing. Previous studies have found that accurate initial conditions are essential for better predictions of land surface variables in LSM [7,27,28]. Wood and Lettenmaier [7] evaluated the relative importance of initial conditions and meteorological forcing in seasonal hydrological forecast and found that initial conditions were more important than meteorological forcing. Here, the initial conditions provided by process-based models (MLDAS or OL) are crucial for future prediction as they determine where the prediction begins. In particular, the forcing data used in our study are from AmeriFlux observations instead of forecasted atmospheric reanalysis, making it possible to highlight the influence of initial conditions on prediction. However, given that the forcing data used here are AmeriFlux observations with high accuracy, the impact of initial conditions may become overestimated and uncertainties in forcing data may not be well represented. Here, for a point-scale simulation driven by AmeriFlux observations, the prediction was likely to be more accurate if the initial condition was close to the “benchmark”. For instance, in Figure 3, the initial conditions of SM prediction are different between MLDAS and OL in assimilation-prediction scenario I (assimilation for 15 days) in 2004 (DOY 136, the edge of the grey region), with SM provided by MLDAS (red solid line) closer to the “benchmark”. Accordingly, NICRMSD is 50.51%, indicating that assimilation (assimilation for 15 days) can improve the performance of prediction, despite being rare in scenario I. Hence, initial conditions play an essential role in the performance of prediction [18].
Due to the climatic characteristics of the site, though overall prediction skill of SM, H, and LE is increased in MLDAS compared with that of OL, prediction performance in different assimilation scenario varies. For instance, as mentioned above, the prediction performance of scenario I is not as good as expected and even degrades in some experiments such as SM, H, and LE prediction in 2005. For the other three scenarios (especially scenario IV) MLDAS received better responses in most experiments. The seasonal transition of land surface variables such as LAI may influence the prediction skill in different assimilation scenarios. As in scenario I, initial conditions are in the spring/summer transition with LAI around 0, which can be seen in Figure 2. Such a low LAI is likely to bring high uncertainties to data assimilation, which lowers the accuracy and reliability of initial conditions and worsens the prediction skill of MLDAS. However, as in scenarios II to IV, the prediction experiments are conducted during the summer or summer/autumn transition with relatively higher LAI. Furthermore, assimilation of SIF in pass IV updates Vcmax25, which is by default a fixed parameter depending on vegetation type in Noah-MP, leading to better LAI simulation. Hence, LAI is assimilated effectively, and lower uncertainty is introduced so that initial conditions are more reliable, contributing to better prediction performance of MLDAS. Regional climate and corresponding phenology are crucial in the forecast framework dominated by initial conditions [7,18], so we suggest that initial conditions for prediction should be carefully checked, considering the uncertainties introduced from climate variations of assimilated land surface variables.

4.2. How Long Does the Influence of Assimilation Exist?

Overall, as Figure 6, Figure 7, Figure 8 and Figure 9 show, the RD between OL and MLDAS diminishes and converges to around 0 with forecast lead time, suggesting that assimilation exerts significant influence on prediction during the early prediction period. A similar pattern can be seen in other forecast work with LSM [6]. Relatively accurate initial conditions from data assimilation play an important role during the early prediction period, while the prediction skill brought by initial conditions decreases with lead time due to uncertainties in forcing data. As mentioned in Section 4.1, uncertainties in forcing data are not well considered here. However, the relatively ‘perfect’ forcing data makes it possible to emphasize the impact of initial conditions on prediction. It can be assumed that the influence of initial conditions on prediction decreases with lead time and disappears when RD converges to around 0.
As with fluxes prediction, the impact of assimilation can last for around 60 days. Compared with fluxes prediction, the impact of assimilation on SM prediction generally lasts longer, which can reach around 100 days. This can be explained through the fact that turbulent fluxes are not only controlled by assimilated SM, LAI, and SIF, but also by atmospheric conditions such as advection, condensation, and evaporation during the growing season [29]. Using the same set of AmeriFlux observations as forcing data, it can be considered that the influence of initial conditions for LE prediction is not as significant as that in SM. Furthermore, in the model, LE is updated indirectly by assimilating LAI, SM, and SIF, causing the impact of assimilation on turbulent fluxes prediction to last a shorter amount of time than that of SM. Several studies have also shown that initial conditions of SM significantly impact weather forecast skills in the short or medium range [30,31,32]. Using AmeriFlux observations as forcing data, uncertainties introduced from precipitation and radiation was not well represented in this study. Coupling WRF to test the forecast skill of MLDAS more comprehensively is needed in the future.

5. Conclusions

In this study, four data assimilation scenarios were adapted to evaluate whether the assimilation of SM, LAI, and SIF can jointly increase the prediction performance of MLDAS. Data assimilation experiments were conducted during the growing season over 3 years at an AmeriFlux site. With more accurate initial conditions provided by data assimilation, predictions from MLDAS outperform those from OL. On average, MLDAS produces 28.65%, 27.79%, and 19.15% lower root square deviations (RMSD) for daily H, LE, and SM prediction compared to OL. Moreover, the results also indicate that the impact of assimilation on SM prediction can last longer (around 100 days) than that of LE (around 60 days). Our work proves that data assimilation can potentially increase the short-term prediction skill of LSMs. Considering that our assimilation-prediction test was conducted at only one site, more sites with different climates and land cover conditions need to be included in our assimilation-prediction experiments to draw some more general conclusions in the future. Furthermore, Noah-MP should be further coupled with atmospheric models (e.g., WRF) to test whether data assimilation influences meteorological forcing, which can give feedback on real-time land surface variable forecasts.

Author Contributions

Conceptualization, T.X.; methodology, X.H.; software, Y.T. and X.H.; validation, T.X., F.C., S.L. and X.H.; formal analysis, Y.T.; writing—original draft preparation, Y.T.; writing—review and editing, all; visualization, Y.T.; supervision, T.X.; project administration, T.X.; funding acquisition, T.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the National Natural Science Foundation of China (42171315) and supported by the State Key Laboratory of Earth Surface Processes and Resource Ecology (2021-ZD-04).

Data Availability Statement

Meteorological forcing data of the Mead site including air temperature, pressure, humidity, precipitation, wind speed, incoming shortwave and longwave radiation were downloaded from AmeriFlux (http://ameriflux.lbl.gov/ accessed on 7 October 2021). LAI and SM are also acquired from the AmeriFlux database. The remotely sensed solar-induced chlorophyll fluorescence (SIF) products were downloaded from: https://osf.io/8xqy6/?view_only=4d14a64b7be644d-39564c82a371e1d20 (accessed on 1 November 2021).

Acknowledgments

We would like to thank the academic editor and three anonymous reviewers for their kind help and valuable comments that greatly improved this manuscript. We also would like to thank the high-performance computing support from the Center for Geodata and Analysis, Faculty of Geographical Science, Beijing Normal University.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. He, X.; Xu, T.; Xia, Y.; Bateni, S.M.; Guo, Z.; Liu, S.; Mao, K.; Zhang, Y.; Feng, H.; Zhao, J. A Bayesian Three-Cornered Hat (BTCH) method: Improving the terrestrial evapotranspiration estimation. Remote Sens. 2020, 12, 878. [Google Scholar] [CrossRef] [Green Version]
  2. Lokupitiya, E.; Denning, A.S.; Schaefer, K.; Ricciuto, D.; Anderson, R.; Arain, M.A.; Baker, I.; Barr, A.G.; Chen, G.; Chen, J.M.; et al. Carbon and energy fluxes in cropland ecosystems: A model-data comparison. Biogeochemistry 2016, 129, 53–76. [Google Scholar] [CrossRef] [Green Version]
  3. Xu, T.; Guo, Z.; Liu, S.; He, X.; Meng, Y.; Xu, Z.; Xia, Y.; Xiao, J.; Zhang, Y.; Ma, Y.; et al. Evaluating different machine learning methods for upscaling evapotranspiration from flux towers to the regional scale. J. Geophys. Res. Atmos. 2018, 123, 8674–8690. [Google Scholar] [CrossRef]
  4. Yang, Z.L.; Niu, G.Y.; Mitchell, K.E.; Chen, F.; Ek, M.B.; Barlage, M.; Longuevergne, L.; Manning, K.; Niyogi, D.; Tewari, M.; et al. The community noah land surface model with multiparameterization options (Noah-MP): 2. Evaluation over global river basins. J. Geophys. Res. Atmos. 2011, 116, D12110. [Google Scholar] [CrossRef]
  5. Niu, G.Y.; Yang, Z.L.; Mitchell, K.E.; Chen, F.; Ek, M.B.; Barlage, M.; Kumar, A.; Manning, K.; Niyogi, D.; Rosero, E.; et al. The community noah land surface model with multiparameterization options (Noah-MP): 1. Model description and evaluation with local-scale measurements. J. Geophys. Res. Atmos. 2011, 116, D12109. [Google Scholar] [CrossRef] [Green Version]
  6. Mucia, A.; Bonan, B.; Zheng, Y.; Albergel, C.; Calvet, J.C. From monitoring to forecasting land surface conditions using a land data assimilation system: Application over the contiguous United States. Remote Sens. 2020, 12, 2020. [Google Scholar] [CrossRef]
  7. Wood, A.W.; Lettenmaier, D.P. An ensemble approach for attribution of hydrologic prediction uncertainty. Geophys. Res. Lett. 2008, 35, L14401. [Google Scholar] [CrossRef] [Green Version]
  8. Zhang, Z.; Barlage, M.; Chen, F.; Li, Y.; Helgason, W.; Xu, X.; Liu, X.; Li, Z. joint modeling of crop and irrigation in the central United States using the Noah-MP land surface model. J. Adv. Model. Earth Syst. 2020, 12, e2020MS002159. [Google Scholar] [CrossRef]
  9. Xu, X.; Chen, F.; Barlage, M.; Gochis, D.; Miao, S.; Shen, S. Lessons learned from modeling irrigation from field to regional scales. J. Adv. Model. Earth Syst. 2019, 11, 2428–2448. [Google Scholar] [CrossRef] [Green Version]
  10. Xu, T.R.; Liu, S.M.; Xu, Z.W.; Liang, S.L.; Xu, L. A dual-pass data assimilation scheme for estimating surface fluxes with FY3A-VIRR land surface temperature. Sci. China Earth Sci. 2015, 58, 211–230. [Google Scholar] [CrossRef]
  11. Kumar, S.V.; Mocko, D.M.; Wang, S.; Peters-Lidard, C.D.; Borak, J. Assimilation of remotely sensed leaf area index into the Noah-Mp land surface model: Impacts on water and carbon fluxes and states over the continental United States. J. Hydrometeorol. 2019, 20, 1359–1377. [Google Scholar] [CrossRef]
  12. He, X.; Liu, S.; Xu, T.; Yu, K.; Gentine, P.; Zhang, Z.; Xu, Z.; Jiao, D.; Wu, D. Improving predictions of evapotranspiration by integrating multi-source observations and land surface model. Agric. Water Manag. 2022, 272, 107827. [Google Scholar] [CrossRef]
  13. Xu, T.; He, X.; Bateni, S.M.; Auligne, T.; Liu, S.; Xu, Z.; Zhou, J.; Mao, K. Mapping regional turbulent heat fluxes via variational assimilation of land surface temperature data from polar orbiting satellites. Remote Sens. Environ. 2019, 221, 444–461. [Google Scholar] [CrossRef]
  14. Pinnington, E.; Quaife, T.; Black, E. Impact of remotely sensed soil moisture and precipitation on soil moisture prediction in a data assimilation system with the JULES land surface model. Hydrol. Earth Syst. Sci. 2018, 22, 2575–2588. [Google Scholar] [CrossRef] [Green Version]
  15. Tian, S.; Tregoning, P.; Renzullo, L.J.; van Dijk, A.I.J.M.; Walker, J.P.; Pauwels, V.R.N.; Allgeyer, S. Improved water balance component estimates through joint assimilation of GRACE water storage and SMOS soil moisture retrievals. Water Resour. Res. 2017, 53, 1820–1840. [Google Scholar] [CrossRef]
  16. Albergel, C.; Munier, S.; Jennifer Leroux, D.; Dewaele, H.; Fairbairn, D.; Lavinia Barbu, A.; Gelati, E.; Dorigo, W.; Faroux, S.; Meurey, C.; et al. Sequential assimilation of satellite-derived vegetation and soil moisture products using SURFEX-v8.0: LDAS-Monde Assessment over the Euro-Mediterranean Area. Geosci. Model Dev. 2017, 10, 3889–3912. [Google Scholar] [CrossRef] [Green Version]
  17. Xu, T.; Chen, F.; He, X.; Barlage, M.; Zhang, Z.; Liu, S.; He, X. Improve the performance of the Noah-MP-pCro Model by jointly assimilating soil moisture and vegetation phenology data. J. Adv. Model. Earth Syst. 2021, 13, e2020MS002394. [Google Scholar] [CrossRef]
  18. Sawada, Y.; Koike, T.; Ikoma, E.; Kitsuregawa, M. Monitoring and predicting agricultural droughts for a water-limited subcontinental region by integrating a land surface model and microwave remote sensing. IEEE Trans. Geosci. Remote Sens. 2020, 58, 14–33. [Google Scholar] [CrossRef]
  19. Verma, S.B.; Dobermann, A.; Cassman, K.G.; Walters, D.T.; Knops, J.M.; Arkebauer, T.J.; Suyker, A.E.; Burba, G.G.; Amos, B.; Yang, H.; et al. Annual carbon dioxide exchange in irrigated and rainfed maize-based agroecosystems. Agric. Meteorol. 2005, 131, 77–96. [Google Scholar] [CrossRef] [Green Version]
  20. Zhang, Y.; Joiner, J.; Hamed Alemohammad, S.; Zhou, S.; Gentine, P. A global spatially contiguous solar-induced fluorescence (CSIF) dataset using neural networks. Biogeosciences 2018, 15, 5779–5800. [Google Scholar] [CrossRef]
  21. Chen, F.; Mitchell, K.; Schaake, J.; Xue, Y.; Pan, H.L.; Koren, V.; Duan, Q.Y.; Ek, M.; Betts, A. Modeling of land surface evaporation by four schemes and comparison with FIFE observations. J. Geophys. Res. Atmos. 1996, 101, 7251–7268. [Google Scholar] [CrossRef] [Green Version]
  22. Liu, X.; Chen, F.; Barlage, M.; Zhou, G.; Niyogi, D. Noah-MP-Crop: Introducing dynamic crop growth in the Noah-MP land surface model. J. Geophys. Res. Atmos. 2016, 121, 13953–13972. [Google Scholar] [CrossRef] [Green Version]
  23. Zhang, Y.; Guanter, L.; Berry, J.A.; Joiner, J.; van der Tol, C.; Huete, A.; Gitelson, A.; Voigt, M.; Köhler, P. Estimation of vegetation photosynthetic capacity from space-based measurements of chlorophyll fluorescence for terrestrial biosphere models. Glob. Chang. Biol. 2014, 20, 3727–3742. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Kumar, S.V.; Reichle, R.H.; Koster, R.D.; Crow, W.T.; Peters-Lidard, C.D. Role of subsurface physics in the assimilation of surface soil moisture observations. J. Hydrometeorol. 2009, 10, 1534–1547. [Google Scholar] [CrossRef]
  25. Shokri, N.; Lehmann, P.; Or, D. Characteristics of evaporation from partially wettable porous media. Water Resour. Res. 2009, 45, W02415. [Google Scholar] [CrossRef] [Green Version]
  26. Shokri, N.; Lehmann, P.; Vontobel, P.; Or, D. Drying front and water content dynamics during evaporation from sand delineated by neutron radiography. Water Resour. Res. 2008, 44, W06418. [Google Scholar] [CrossRef] [Green Version]
  27. Donegan, S.; Murphy, C.; Harrigan, S.; Broderick, C.; Foran Quinn, D.; Golian, S.; Knight, J.; Matthews, T.; Prudhomme, C.; Scaife, A.A.; et al. Conditioning ensemble streamflow prediction with the North Atlantic oscillation improves skill at longer lead times. Hydrol. Earth Syst. Sci. 2021, 25, 4159–4183. [Google Scholar] [CrossRef]
  28. Shukla, S.; Sheffield, J.; Wood, E.F.; Lettenmaier, D.P. On the sources of global land surface hydrologic predictability. Hydrol. Earth Syst. Sci. 2013, 17, 2781–2796. [Google Scholar] [CrossRef] [Green Version]
  29. Drusch, M.; Viterbo, P. Assimilation of screen-level variables in ECMWF’s integrated forecast system: A study on the impact on the forecast quality and analyzed soil moisture. Mon. Weather Rev. 2007, 135, 300–314. [Google Scholar] [CrossRef]
  30. De Rosnay, P.; Balsamo, G.; Albergel, C.; Muñoz-Sabater, J.; Isaksen, L. Initialisation of land surface variables for numerical weather prediction. Surv. Geophys. 2014, 35, 607–621. [Google Scholar] [CrossRef]
  31. Van den Hurk, B.; Ettema, J.; Viterbo, P. Analysis of soil moisture changes in europe during a single growing season in a new ECMWF soil moisture assimilation system. J. Hydrometeorol. 2008, 9, 116–131. [Google Scholar] [CrossRef]
  32. Weisheimer, A.; Doblas-Reyes, F.J.; Jung, T.; Palmer, T.N. On the predictability of the extreme summer 2003 over Europe. Geophys. Res. Lett. 2011, 38, L05704. [Google Scholar] [CrossRef]
Figure 1. Diagram for four assimilation scenarios. The green arrow represents benchmarks with data assimilation; the blue arrow represents Noah-MP Open-Loop run (without data assimilation); the red arrows represent the prediction of LAI, SM, H, or LE; and the dashed lines in red arrows are the first-month prediction used for statistical analysis.
Figure 1. Diagram for four assimilation scenarios. The green arrow represents benchmarks with data assimilation; the blue arrow represents Noah-MP Open-Loop run (without data assimilation); the red arrows represent the prediction of LAI, SM, H, or LE; and the dashed lines in red arrows are the first-month prediction used for statistical analysis.
Remotesensing 14 05172 g001
Figure 2. Leaf area index simulation and prediction from OL and MLDAS in 2003–2005. Results of four assimilation scenarios with different assimilation periods are given. Columns represent different assimilation scenarios in each year. The blue line in each subplot represents MLDAS simulation and prediction; the orange line in each subplot represents OL simulation; the red scatters represent LAI observations; and the shaded regions are the assimilation period, which starts from 1 May and lasts for 15 days, 45 days, 75 days, and 105 days for scenarios I, II, III, and IV, respectively.
Figure 2. Leaf area index simulation and prediction from OL and MLDAS in 2003–2005. Results of four assimilation scenarios with different assimilation periods are given. Columns represent different assimilation scenarios in each year. The blue line in each subplot represents MLDAS simulation and prediction; the orange line in each subplot represents OL simulation; the red scatters represent LAI observations; and the shaded regions are the assimilation period, which starts from 1 May and lasts for 15 days, 45 days, 75 days, and 105 days for scenarios I, II, III, and IV, respectively.
Remotesensing 14 05172 g002
Figure 3. Soil moisture simulation and prediction from OL and MLDAS in 2003–2005. Results of four assimilation scenarios with different assimilation periods are given. Columns represent different assimilation scenarios in each year. The red solid line in each subplot represents MLDAS simulation and prediction; the blue dashed line in each subplot represents OL simulation; the green hollow scatters represent SM benchmarks; the blue bars are precipitation; and the shaded regions are the assimilation period, which starts from 1 May and lasts for 15 days, 45 days, 75 days, and 105 days for scenario I, II, III, and IV, respectively.
Figure 3. Soil moisture simulation and prediction from OL and MLDAS in 2003–2005. Results of four assimilation scenarios with different assimilation periods are given. Columns represent different assimilation scenarios in each year. The red solid line in each subplot represents MLDAS simulation and prediction; the blue dashed line in each subplot represents OL simulation; the green hollow scatters represent SM benchmarks; the blue bars are precipitation; and the shaded regions are the assimilation period, which starts from 1 May and lasts for 15 days, 45 days, 75 days, and 105 days for scenario I, II, III, and IV, respectively.
Remotesensing 14 05172 g003
Figure 4. Sensible heat fluxes prediction from OL and MLDAS in 2003–2005. Predictions of four assimilation scenarios are given. Columns represent different assimilation scenarios in each year. The red solid line in each subplot represents MLDAS prediction in one future month; the blue dashed line in each subplot represents OL simulation; and the green hollow scatters represent H benchmarks.
Figure 4. Sensible heat fluxes prediction from OL and MLDAS in 2003–2005. Predictions of four assimilation scenarios are given. Columns represent different assimilation scenarios in each year. The red solid line in each subplot represents MLDAS prediction in one future month; the blue dashed line in each subplot represents OL simulation; and the green hollow scatters represent H benchmarks.
Remotesensing 14 05172 g004
Figure 5. Latent heat flux predictions from OL and MLDAS in 2003–2005. Predictions of four assimilation scenarios are given. Columns represent different assimilation scenarios in each year. The red solid line in each subplot represents MLDAS prediction in one future month; the blue dashed line in each subplot represents OL simulation; and the green hollow scatters represent LE benchmarks.
Figure 5. Latent heat flux predictions from OL and MLDAS in 2003–2005. Predictions of four assimilation scenarios are given. Columns represent different assimilation scenarios in each year. The red solid line in each subplot represents MLDAS prediction in one future month; the blue dashed line in each subplot represents OL simulation; and the green hollow scatters represent LE benchmarks.
Remotesensing 14 05172 g005
Figure 6. RD of soil moisture prediction in four scenarios over 2003–2005. The red solid lines, green dashed lines, blue dotted lines, and cyan dashed lines represent RD of scenario I, II, III, and IV, respectively. Blue bars are precipitation.
Figure 6. RD of soil moisture prediction in four scenarios over 2003–2005. The red solid lines, green dashed lines, blue dotted lines, and cyan dashed lines represent RD of scenario I, II, III, and IV, respectively. Blue bars are precipitation.
Remotesensing 14 05172 g006
Figure 7. Same as Figure 6, but for latent heat flux prediction in four scenarios over 2003–2005.
Figure 7. Same as Figure 6, but for latent heat flux prediction in four scenarios over 2003–2005.
Remotesensing 14 05172 g007
Figure 8. RD between OL and MLDAS of soil moisture prediction in scenario I in 2003, 2004, and 2005. The shaded area represents the duration of influence brought by assimilation on future predictions.
Figure 8. RD between OL and MLDAS of soil moisture prediction in scenario I in 2003, 2004, and 2005. The shaded area represents the duration of influence brought by assimilation on future predictions.
Remotesensing 14 05172 g008
Figure 9. RD between OL and MLDAS of latent heat flux prediction in scenario I in 2003, 2004, and 2005. The shaded area represents the duration of influence brought by assimilation on future predictions.
Figure 9. RD between OL and MLDAS of latent heat flux prediction in scenario I in 2003, 2004, and 2005. The shaded area represents the duration of influence brought by assimilation on future predictions.
Remotesensing 14 05172 g009
Table 1. NICRMSD of soil moisture in different assimilation scenarios (I–IV) in 2003, 2004, and 2005. Positive NICRMSD numbers are marked in bold.
Table 1. NICRMSD of soil moisture in different assimilation scenarios (I–IV) in 2003, 2004, and 2005. Positive NICRMSD numbers are marked in bold.
Year200320042005
RMSDModel
(m3/m3)
RMSDAnalysis
(m3/m3)
NICRMSD
(%)
RMSDModel
(m3/m3)
RMSDAnalysis
(m3/m3)
NICRMSD
(%)
RMSDModel
(m3/m3)
RMSDAnalysis
(m3/m3)
NICRMSD
(%)
Scenario I0.01020.0106−3.870.02330.011550.510.00840.0106−26.50
Scenario II0.01380.01361.230.01210.0136−12.080.04690.040214.26
Scenario III0.03860.024536.350.02240.020110.100.02510.022510.15
Scenario IV0.05400.021460.410.02440.016631.930.04110.022744.83
Table 2. NICRMSD of sensible heat fluxes in different assimilation scenarios (I–IV) in 2003, 2004, and 2005. Positive NICRMSD numbers are marked in bold.
Table 2. NICRMSD of sensible heat fluxes in different assimilation scenarios (I–IV) in 2003, 2004, and 2005. Positive NICRMSD numbers are marked in bold.
Year200320042005
RMSDModel
(W/m2)
RMSDAnalysis
(W/m2)
NICRMSD
(%)
RMSDModel
(W/m2)
RMSDAnalysis
(W/m2)
NICRMSD
(%)
RMSDModel
(W/m2)
RMSDAnalysis
(W/m2)
NICRMSD
(%)
Scenario I26.71629.411−10.0948.16846.8862.6636.92652.082−41.04
Scenario II32.04321.71832.2220.0317.82260.9563.83425.28460.39
Scenario III15.0217.55249.727.9615.21034.569.8535.74541.70
Scenario IV22.4539.64057.0712.4226.48447.8034.70718.19847.57
Table 3. NICRMSD of latent heat fluxes in different assimilation scenarios (I–IV) in 2003, 2004, and 2005. Positive NICRMSD numbers are marked in bold.
Table 3. NICRMSD of latent heat fluxes in different assimilation scenarios (I–IV) in 2003, 2004, and 2005. Positive NICRMSD numbers are marked in bold.
Year200320042005
RMSDModel
(W/m2)
RMSDAnalysis
(W/m2)
NICRMSD
(%)
RMSDModel
(W/m2)
RMSDAnalysis
(W/m2)
NICRMSD
(%)
RMSDModel
(W/m2)
RMSDAnalysis
(W/m2)
NICRMSD
(%)
Scenario I31.12133.785−8.5656.80854.2514.5044.71459.053−32.07
Scenario II36.85827.39025.6922.3099.51757.3469.30026.64261.56
Scenario III21.38211.16547.7911.7497.76333.9313.5458.00940.87
Scenario IV32.45913.60858.0817.4089.23546.9539.64721.52745.70
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tian, Y.; Xu, T.; Chen, F.; He, X.; Li, S. Can Data Assimilation Improve Short-Term Prediction of Land Surface Variables? Remote Sens. 2022, 14, 5172. https://doi.org/10.3390/rs14205172

AMA Style

Tian Y, Xu T, Chen F, He X, Li S. Can Data Assimilation Improve Short-Term Prediction of Land Surface Variables? Remote Sensing. 2022; 14(20):5172. https://doi.org/10.3390/rs14205172

Chicago/Turabian Style

Tian, Yingze, Tongren Xu, Fei Chen, Xinlei He, and Shi Li. 2022. "Can Data Assimilation Improve Short-Term Prediction of Land Surface Variables?" Remote Sensing 14, no. 20: 5172. https://doi.org/10.3390/rs14205172

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop