Assessing the Impact of LAI Data Assimilation on Simulations of the Soil Water Balance and Maize Development Using MOHID-Land

: Hydrological modeling at the catchment scale requires the upscaling of many input parameters for better characterizing landscape heterogeneity, including soil, land use and climate variability. In this sense, remote sensing is often considered as a practical solution. This study aimed to access the impact of assimilation of leaf area index (LAI) data derived from Landsat 8 imagery on MOHID-Land’s simulations of the soil water balance and maize state variables (LAI, canopy height, aboveground dry biomass and yield). Data assimilation impacts on ﬁnal model results were ﬁrst assessed by comparing distinct modeling approaches to measured data. Then, the uncertainty related to assimilated LAI values was quantiﬁed on ﬁnal model results using a Monte Carlo method. While LAI assimilation improved MOHID-Land’s estimates of the soil water balance and simulations of crop state variables during early stages, it was never sufﬁcient to overcome the absence of a local calibrated crop dataset. Final model estimates further showed great uncertainty for LAI assimilated values during earlier crop stages, decreasing then with season reaching its end. Thus, while model simulations can be improved using LAI data assimilation, additional data sources should be considered for complementing crop parameterization.


Introduction
In recent decades, modeling has become an essential part of the decision-making process for improving irrigation water use [1][2][3][4], optimizing fertilization practices [5,6], predicting crop yields [7,8] and coping with climate change [9,10] at the field and regional scales. However, modeling tools require first a considerable time investment in calibration to provide feasible results to their users. This is often accomplished at the plot scale, where most variables influencing crop development (soil properties, plant physiology, groundwater levels and weather conditions) can be more easily monitored. The problem often arises when upscaling to the field or regional scales due to the difficulties in portraying landscape heterogeneity, including soil, land use and climate variability.
Remote sensing technology offers today a potential solution for accurately and reliably describing the spatial distribution of soil properties and canopy state variables (leaf area index, canopy height, biomass) at the field and regional scales [11]. A vast number of new satellite sensors (Landsat 8, Sentinel-2, Spot-6, RapidEye, Huanjing-1) together with versatile, light-weighted and low-cost sensors mounted in farm tractors or unmanned aerial vehicles are now available for providing information with high spatial and temporal resolution to farmers and technicians. This information can potentially be also assimilated into field or regional scale models, overcoming many constraints in terms of input parameterization.
The purpose of assimilation is to optimize model input parameters by integrating, both in space and time, soil or canopy state variables derived from remote sensing methods [12]. Accurate and up-to-date information has been increasingly available at low cost, which has led to numerous research studies focusing on assimilation of remote sensing measurements [13][14][15][16][17][18][19]. Some of these have estimated leaf area index (LAI) using different remote sensing data sources, then assimilating those values by directly replacing the simulated LAI to improve model estimates of the aboveground dry biomass, yield and crop transpiration [14,17]. Other studies have used more advanced assimilation techniques, usually based on the Extended Kalman Filter [20] and Ensemble Kalman Filter [21] assimilation methods, for integrating remote sensing LAI into model simulations [15,16,19]. Overall, regardless of the technique used, most of those studies concluded that remote sensing provides useful measurements which can then be used for improving model simulations.
While a wide variety of models exist capable of simulating crop growth processes at the regional scale, thus portraying landscape heterogeneity at some extent [22][23][24], fully distributed process-based models such as MIKE SHE [25], SHETRAN [26] and MOHID-Land [27] are often considered ideal for further studying distributed state variables (the spatiotemporal variability of soil moisture) and flow paths (sediment and nutrient transport) [28]. These fully distributed process-based models consider interactions between multiple components of the soil-water-atmosphere continuum, with fundamental process being formulated at fine spatial (plot) and temporal scales, contributing to the overall dynamics at a higher organizational level, such as the watershed [28]. For the case of MOHID-Land, the model has been used for improving irrigation practices at the plot and field scales [3,29,30], understanding the contribution of flood events to the eutrophication of water reservoirs [31,32] and forecasting fresh water quantity and quality in coastal rivers [33]. Extensive calibration has been normally required for characterizing soil, groundwater, crop and river flow properties. Thus, data assimilation may have here a decisive contribution for more accurately describing the spatial and temporal variability of many of the required input parameters. However, the impact of data assimilation on final model outputs needs to be first assessed.
The main objective of this study was thus to understand the impact of LAI assimilation on MOHID-Land's estimates of the soil water balance and crop state variables (LAI, canopy height, aboveground dry biomass and yield). The hypothesis addressed were that (i) the MOHID-Land model could accurately estimate the soil water balance and aboveground biomass growth in a one-dimensional domain; (ii) LAI assimilation could improve simulations of crop development and (iii) the related uncertainties could be assessed. Results from this study will help to improve hydrological modeling at the field and regional scales by quantifying the uncertainty related to data assimilation using the MOHID-Land model.

Field Site Description and Data
Field data used in this study was collected at Herdade do Zambujeiro ( 46.63 W, 6 m a.s.l.) (Figure 1). The climate in the region is semi-arid to dry sub-humid, with hot dry summers and mild winters with irregular rainfall. The mean annual temperature is 16.8 • C, with the mean daily temperatures at the coolest (January) and warmest (August) months reaching 11.4 and 22.7 • C, respectively. The mean annual precipitation is 668 mm, mostly occurring between October and May. The soil was a Haplic Fluvisol [34], with the main soil physical and chemical properties presented in Table 1. The bottom layers exhibited higher Water 2018, 10, 1367 3 of 18 dry bulk density and lower measured saturated hydraulic conductivity values than the topsoil layers [3], evidencing some soil compaction due to tillage operations carried out throughout the years and the relatively high soil moisture that was constant along the seasons because of the shallow groundwater levels.
Water 2018, 10, x FOR PEER REVIEW 3 of 18 the relatively high soil moisture that was constant along the seasons because of the shallow groundwater levels. The MOHID-Land model was previously implemented in the area by Ramos et al. [3]. These authors evaluated the model's capacity in predicting soil water contents and fluxes and the evolution of different crop growth parameters, including the leaf area index (LAI), canopy height, aboveground dry biomass and yields during the 2014 and 2015 maize growing seasons. Details on the calibration/validation approach can be found in the cited reference. For that, the field was cropped with maize hybrid P1574 (FAO 600) with a density of approximately 89,000 plants ha −1 . Management practices, including fertilization and irrigation, were performed according to the standard practices in the region and were decided by the farmer. During 2014, maize was sown on May 24 and harvested on October 8; the net rainfall reached 163 mm, while the net irrigation amounted 365 mm ( Figure 2). During 2015, maize was sown on April 16 and harvested on September 20; the net rainfall reached only 12 mm, while the net irrigation summed 620 mm ( Figure 2). Irrigation was applied with the farmer's stationary sprinkler system. Groundwater depth (GWD) varied between approximately 1.5 m depth at the beginning of the growing season to 1.0 m depth during irrigation, further reaching 0.3 m depth during September 2014 after successive rain events ( Figure 2). Crop stages were set as in Table 2 based on field observations. One SM1 capacitance probe (Adcon Telemetry, Klosterneuburg, Austria) and one ECH2O-5 capacitance probe (Decagon Devices, Pullman, WA, USA) were installed at depths of 10, 30 and 50 cm to continuously measure soil water contents. One LEV1 level sensor (Adcon Telemetry, Klosterneuburg, Austria) was used to continuously monitor the groundwater level ( Figure 2). One RG1 (Adcon Telemetry, Klosterneuburg, Austria) and two QMR101 (Vaisala, Helsinki, Finland) rain gauges were used to measure the amount of water applied per irrigation event.
LAI, canopy height and the aboveground dry biomass were further monitored by harvesting 3 random plants in four locations distributed randomly throughout the field plot, every 15 days, between May and September, during the 2014 and 2015 maize growing seasons ( Table 3). The same crop parameters were measured at the end of each crop season, but by harvesting all plants in random areas of 1.5 m 2 (corresponding to approximately 12 plants). The length and width of crop leafs were measured in every harvested plant and then converted to LAI values as documented in Ramos et al. [4]. The aboveground dry biomass was determined by oven drying maize stems, leaves and grain at The MOHID-Land model was previously implemented in the area by Ramos et al. [3]. These authors evaluated the model's capacity in predicting soil water contents and fluxes and the evolution of different crop growth parameters, including the leaf area index (LAI), canopy height, aboveground dry biomass and yields during the 2014 and 2015 maize growing seasons. Details on the calibration/validation approach can be found in the cited reference. For that, the field was cropped with maize hybrid P1574 (FAO 600) with a density of approximately 89,000 plants ha −1 . Management practices, including fertilization and irrigation, were performed according to the standard practices in the region and were decided by the farmer. During 2014, maize was sown on May 24 and harvested on October 8; the net rainfall reached 163 mm, while the net irrigation amounted 365 mm ( Figure 2). During 2015, maize was sown on April 16 and harvested on September 20; the net rainfall reached only 12 mm, while the net irrigation summed 620 mm ( Figure 2). Irrigation was applied with the farmer's stationary sprinkler system. Groundwater depth (GWD) varied between approximately 1.5 m depth at the beginning of the growing season to 1.0 m depth during irrigation, further reaching 0.3 m depth during September 2014 after successive rain events ( Figure 2). Crop stages were set as in Table 2 based on field observations. One SM1 capacitance probe (Adcon Telemetry, Klosterneuburg, Austria) and one ECH2O-5 capacitance probe (Decagon Devices, Pullman, WA, USA) were installed at depths of 10, 30 and 50 cm to continuously measure soil water contents. One LEV1 level sensor (Adcon Telemetry, Klosterneuburg, Austria) was used to continuously monitor the groundwater level ( Figure 2). One RG1 (Adcon Telemetry, Klosterneuburg, Austria) and two QMR101 (Vaisala, Helsinki, Finland) rain gauges were used to measure the amount of water applied per irrigation event.
LAI, canopy height and the aboveground dry biomass were further monitored by harvesting 3 random plants in four locations distributed randomly throughout the field plot, every 15 days, between May and September, during the 2014 and 2015 maize growing seasons ( Table 3). The same crop parameters were measured at the end of each crop season, but by harvesting all plants in random areas of 1.5 m 2 (corresponding to approximately 12 plants). The length and width of crop leafs were measured in every harvested plant and then converted to LAI values as documented in Ramos et al. [4]. The aboveground dry biomass was determined by oven drying maize stems, leaves and grain at 70 • C to a constant weight. Maize yield was obtained from the grain's dry biomass measured at the end of each crop season.

Model Description
MOHID-Land is a distributed model capable of computing different physical and chemical processes in a three-dimensional domain using a finite-volume approach [27]. Variably-saturated water flow is described using the Richards equation, while the van Genuchten-Mualem functional relationships [35] are used for defining the unsaturated soil hydraulic properties, as follows: where S e is the effective saturation (L 3 L −3 ), θ r and θ s denote the residual and saturated water contents (L 3 L −3 ), respectively, K s is the saturated hydraulic conductivity (L T −1 ), α (L −1 ) and η (-) are empirical shape parameters, m = 1 − 1/η and is a pore connectivity/tortuosity parameter (-).
Crop evapotranspiration (ET c ) is determined from reference evapotranspiration (ET 0 ) values computed with the FAO Penman-Monteith method using the single crop coefficient (K c ) approach [36]. ET c is then partitioned into potential soil evaporation (E p ) and potential crop transpiration (T p ) as a function of LAI [37]: where λ is the extinction coefficient of radiation attenuation within the canopy (-).
Root water uptake is computed using the macroscopic approach introduced by Feddes et al. [38], meaning that T p is distributed over the root zone and may be diminished by the presence of depth-varying root zone stressors, namely water stress. Root water uptake reductions (i.e., actual transpiration, T a ) are then described using the piecewise linear model proposed by Feddes et al. [38]. In this model, root water uptake is at the potential rate when the pressure head (h) is between h 2 and h 3 , drops off linearly when h > h 2 or h < h 3 and becomes zero when h < h 4 or h > h 1 (subscripts 1 to 4 denote for different threshold pressure heads). E p values are limited by a pressure head threshold value to obtain the actual soil evaporation rate (E a ) [39].
MOHID-Land further includes a modified version of the EPIC model [40,41] for simulating crop growth. This model is based on the heat unit theory, which considers that all heat above the base temperature will accelerate crop growth and development: when T av > T base (5) where PHU is the total heat units required for plant maturity ( • C), HU is the number of heat units accumulated on day i ( • C), i = 1 corresponds to the sowing date (-), m is the number of days required for plant maturity (-), T av is the mean daily temperature ( • C) and T base is the minimum temperature for plant growth ( • C). Crop growth is modelled by simulating light interception, conversion of intercepted light into biomass and LAI development. Total biomass is calculated from the solar radiation intercepted by the crop leaf area using the Beer's law [42]: where ∆Bio act,i and ∆Bio i are the actual and potential increase in total plant biomass on day i (kg ha −1 ), RUE is the radiation-use efficiency of the plant ((kg ha −1 ) (MJ m −2 ) −1 ), PAR i is the daily incident photosynthetically active radiation (MJ m −2 ), λ is again the light extinction coefficient (-) and γ i is the daily plant growth factor (0-1) which accounts for water and temperature stresses [41]. Leaf area index is computed as a function of heat units, crop stress and the development stage [41]. During early stages (initial and plant development stages), LAI increment on a given day is a function of the fraction of the plant's maximum LAI (LAI max ) that needs to be reached during those stages (fr LAImax ) and crop stress.
where ∆LAI act,i and ∆LAI i are the actual and potential LAI increment added on day i (m 2 m −2 ), respectively and fr LAImax,i and fr LAImax,i−1 are the fraction of the plant's maximum LAI (LAI max , m 2 m −2 ) on day i and i − 1 (-), respectively. During the mid-season stage, LAI is assumed to be constant. During the late-season stage, LAI declines as a function of LAI max , heat units and crop stress. Root depth is also computed as a function of heat units [41], while root biomass is assumed to decrease from 0.4 of the total biomass at emergence to 0.2 at maturity [43]. Finally, yield is obtained from the product of the aboveground dry biomass and the actual harvest index [41]. A more detailed description of the MOHID-Land model governing equations can be found in Trancoso et al. [27] and Ramos et al. [3].

Model Setup and Data Assimilation
The assimilation of LAI data into MOHID-Land simulations has a direct influence on the water balance through the partition of ET c values into T p (Equation (3)) and E p (Equation (4)) and on the computation of the aboveground dry biomass (Equation (6)) Three distinct approaches were thus considered for better understanding the impact of LAI assimilation on model simulations: A-The model was run as in Ramos et al. [3], which simulations of soil water contents, LAI, h c , aboveground dry biomass and yields served as baseline for this study (Calibrated model). These authors followed a traditional calibration/validation approach, where a trial-and-error procedure was carried out to adjust soil hydraulic (Table 1) and crop parameters (Table 4) until deviations between the measured 2014 dataset and simulated values were minimized. The calibrated parameters were then validated using the 2015 dataset, with model simulations being compared to measured data. B-The model was run using LAI values extracted from satellite data as inputs (LAI assimilation). LAI values were derived from the normalized difference vegetation index (NDVI) using the relationship shown in Figure 3. This relationship was found by comparing NDVI values computed from Landsat 8 satellite images (band 4 and 5) with LAI values measured in the study site, at multiple locations and over the 2014 and 2015 growing seasons. The calibrated soil hydraulic parameters were here also adopted (Table 1). However, the default crop parameters from the MOHID-Land's database (Table 4) were considered instead so that model performance in the absence of a calibrated dataset could be assessed. C-The model was again run using LAI values extracted from satellite data as inputs. However, Ramos et al. [3] calibrated crop parameters (Table 4) were here considered (Calibration + LAI assimilation), as well as the calibrated soil hydraulic parameters (Table 1). computation of the aboveground dry biomass (Equation (6)) Three distinct approaches were thus considered for better understanding the impact of LAI assimilation on model simulations: A-The model was run as in Ramos et al. [3], which simulations of soil water contents, LAI, hc, aboveground dry biomass and yields served as baseline for this study (Calibrated model). These authors followed a traditional calibration/validation approach, where a trial-and-error procedure was carried out to adjust soil hydraulic (Table 1) and crop parameters (Table 4) until deviations between the measured 2014 dataset and simulated values were minimized. The calibrated parameters were then validated using the 2015 dataset, with model simulations being compared to measured data. B-The model was run using LAI values extracted from satellite data as inputs (LAI assimilation). LAI values were derived from the normalized difference vegetation index (NDVI) using the relationship shown in Figure 3. This relationship was found by comparing NDVI values computed from Landsat 8 satellite images (band 4 and 5) with LAI values measured in the study site, at multiple locations and over the 2014 and 2015 growing seasons. The calibrated soil hydraulic parameters were here also adopted (Table 1). However, the default crop parameters from the MOHID-Land's database (Table 4) were considered instead so that model performance in the absence of a calibrated dataset could be assessed. C-The model was again run using LAI values extracted from satellite data as inputs. However, Ramos et al. [3] calibrated crop parameters (Table 4) were here considered (Calibration + LAI assimilation), as well as the calibrated soil hydraulic parameters (Table 1).  Landsat 8 images were first corrected to convert the TOA (Top of Atmosphere) planetary reflectance using reflectance rescaling coefficients provided in the Landsat 8 OLI metadata file and to correct the reflectance value with the sun angle. Two images were available from sowing to harvest during the 2014 growing season, while eight images were used during the 2015 growing season (Table 5). These images were used to extract the NDVI values corresponding to the multiple locations where LAI field observations were carried out, in a total of 26 measurements (Figure 3). LAI and NDVI values ranged from 0.41-5.85 and 0.23-0.88, respectively, in line with Pôças et al. [44]. Data assimilation in MOHID-Land was carried out using the forcing method [12]. The approach is relatively straightforward, with the model simply replacing the predicted value by a new input when an image becomes available, updating then fr LAImax to account for what still needs to be reached during a specific crop stage and more interestingly, the water balance and the aboveground dry biomass estimates. From that date on and until another image becomes available model simulations follow the parameterization given in Table 4. Table 5 lists the assimilated LAI values and dates.
In all simulations (Approach A-C), the soil profile was specified with 2 m depth, divided into four soil layers according to observations (Table 1). The soil domain was represented using an Arakawa C-grid type [45], defined by one vertical column (one-dimensional domain) discretized into 100 grid cells with 1 m wide, 1 m long and 0.02 m thickness each (i.e., 1 × 1 × 0.02 m 3 ). The simulation periods covered from sowing to harvest. The upper boundary condition was determined by the actual evaporation and transpiration rates and the irrigation and precipitation fluxes ( Figure 2). Weather data used in this study was taken from a meteorological station located 950 m from the study site (38 • [31]. The K c value for the initial crop stage was then adjusted for the frequency of the wetting events (precipitation and irrigation) and average infiltration depths, while the K c values for mid-season and late season crop stages were adjusted for local climate conditions taking into consideration canopy height, wind speed and minimum relative humidity averages for the periods under consideration [36]. The following parameters of the Feddes et al. [38] model were used to compute T p reductions due to water stress: h 1 = −15, h 2 = −30, h 3 = −325 to −600, h 4 = −8000 cm [46]. The bottom boundary condition was specified using the observed GWD (Figure 2). The initial soil water content conditions were set to field capacity.

Statistical and Uncertainty Analysis
Model calibration and validation was performed by comparing field measured values of soil water contents, LAI, h c and aboveground dry biomass with the MOHID-Land simulations (Approaches A-C) using various quantitative measures of the uncertainty, such as, the coefficient of determination (R 2 ), the root mean square error (RMSE), the ratio of the RMSE to the standard deviation of observed data (NRMSE), the percent bias (PBIAS) and the model efficiency (EF). R 2 values close to 1 indicate that the model explains well the variance of observations. RMSE, NRMSE and PBIAS values close to zero indicate small errors of estimate and good model predictions [47][48][49]. Positive or negative PBIAS values refer to the occurrence of under-or over-estimation bias, respectively. Nash and Sutcliff [50] modelling efficiency EF values close to 1 indicate that the residuals variance is much smaller than the observed data variance, hence the model predictions are good; contrarily, when EF is very close to 0 or negative there is no gain in using the model.
Data assimilation is much dependent on the empirical relationship ( Figure 3) established to derive LAI values from the NDVI measurements [14,18]. As such, the uncertainty related to that conversion was quantified on final model estimates of T a , E a and aboveground dry biomass using a Monte Carlo method. This evaluation was performed on modeling Approach C (Calibration + LAI assimilation) as the objective here was to assess if remote sensing data assimilation could further correct for simulation errors that result from model parameter uncertainty. The 2015 dataset was also considered as more satellite images were available during this season. A randomly population of 10,000 LAI values was first created for each available image date following a normal distribution with mean equal to the estimated parameter given by the LAI-NDVI regression equation and range defined by the 95% confidence intervals (Figure 3, Table 5). The model was then run following Approach C settings until reaching the dates of each of the eight available images (8 × 1 simulation). Afterwards, the 10,000 LAI randomly generated values were assimilated by the model, which then proceeded with simulations until the end season following Approach C settings again (8 × 10,000 simulations). In the end, the uncertainty of final model estimates of T a , E a and aboveground dry biomass were assessed for each assimilation date (8 dates) from 10,000 simulations (80,000 simulations in total). The Monte Carlo simulations were performed with a Python script. Figure 4 shows the evolution of LAI estimated values using the calibrated model in Ramos et al. [3] (Approach A), direct LAI assimilation (Approach B) and the combination of the calibrated model and LAI assimilation (Approach C). Table 6 presents the statistical indicators used to evaluate the agreement between model simulations and measured values. Ramos et al. [3] showed that the MOHID-Land model could reasonably well simulate LAI evolution during the 2014 and 2015 growing seasons. In their study, the values of R 2 were very high (0.97), showing that the model could explain well the variability of the observed data. The errors of the estimates were quite small, resulting in RMSE values lower than 0.63 m 2 m −2 and NRMSE values lower than 0.16. The PBIAS values were lower than 6.40%, indicating some underestimation of the measured data. The modelling efficiency EF were also high (≥0.93), meaning that the residual variance was much smaller than the measured data variance.

LAI Evolution
Water 2018, 10, x FOR PEER REVIEW 9 of 18 deviation of observed data (NRMSE), the percent bias (PBIAS) and the model efficiency (EF). R 2 values close to 1 indicate that the model explains well the variance of observations. RMSE, NRMSE and PBIAS values close to zero indicate small errors of estimate and good model predictions [47][48][49]. Positive or negative PBIAS values refer to the occurrence of under-or over-estimation bias, respectively. Nash and Sutcliff [50] modelling efficiency EF values close to 1 indicate that the residuals variance is much smaller than the observed data variance, hence the model predictions are good; contrarily, when EF is very close to 0 or negative there is no gain in using the model. Data assimilation is much dependent on the empirical relationship ( Figure 3) established to derive LAI values from the NDVI measurements [14,18]. As such, the uncertainty related to that conversion was quantified on final model estimates of Ta, Ea and aboveground dry biomass using a Monte Carlo method. This evaluation was performed on modeling Approach C (Calibration + LAI assimilation) as the objective here was to assess if remote sensing data assimilation could further correct for simulation errors that result from model parameter uncertainty. The 2015 dataset was also considered as more satellite images were available during this season. A randomly population of 10,000 LAI values was first created for each available image date following a normal distribution with mean equal to the estimated parameter given by the LAI-NDVI regression equation and range defined by the 95% confidence intervals (Figure 3, Table 5). The model was then run following Approach C settings until reaching the dates of each of the eight available images (8 × 1 simulation). Afterwards, the 10,000 LAI randomly generated values were assimilated by the model, which then proceeded with simulations until the end season following Approach C settings again (8 × 10,000 simulations). In the end, the uncertainty of final model estimates of Ta, Ea and aboveground dry biomass were assessed for each assimilation date (8 dates) from 10,000 simulations (80,000 simulations in total). The Monte Carlo simulations were performed with a Python script. Figure 4 shows the evolution of LAI estimated values using the calibrated model in Ramos et al. [3] (Approach A), direct LAI assimilation (Approach B) and the combination of the calibrated model and LAI assimilation (Approach C). Table 6 presents the statistical indicators used to evaluate the agreement between model simulations and measured values. Ramos et al. [3] showed that the MOHID-Land model could reasonably well simulate LAI evolution during the 2014 and 2015 growing seasons. In their study, the values of R 2 were very high (0.97), showing that the model could explain well the variability of the observed data. The errors of the estimates were quite small, resulting in RMSE values lower than 0.63 m 2 m −2 and NRMSE values lower than 0.16. The PBIAS values were lower than 6.40%, indicating some underestimation of the measured data. The modelling efficiency EF were also high (≥0.93), meaning that the residual variance was much smaller than the measured data variance.  The direct assimilation of LAI values into model simulations (Approach B) produced worse statistical indicators than when using the calibrated model (Approach A), with the R 2 values decreasing down to 0.35 and the RMSE and NRMSE values increasing up to 2.16 m 2 m −2 and 0.53, respectively. The PBIAS showed contrasting results, while the EF values also decreased down to 0.24, indicating nonetheless that the model was still able to describe field measurements with relative success. The direct assimilation approach made that MOHID-Land's LAI simulated results were directly replaced by the remote sensing LAI values in the dates when satellite images were available. From that date on and until another image was available model simulations followed the default parameterization of the MOHID-Land crop database given in Table 4. As a result, assimilation of remote sensing LAI values using the forcing method available in the MOHID-Land model resulted in several unrealistic discontinuities in simulated LAI (Figure 4), a common feature when using this assimilation approach [14]. Also, LAI increased at a much faster pace during the initial and development crop stages, with maize also reaching senescence earlier. The difference of 200 • C in the total heat units required for plant maturity (PHU) considered between the default and calibrated crop parameters (Table 4) showed here to be critical for model performance. LAI assimilation was able to correct model simulations during the earlier crop stages but failed to counteract the end of the crop cycle as observed in the 2015 simulations ( Figure 4). Here, despite assimilating higher LAI values, the model was obviously never able to extend the crop lifecycle longer than the allowed by the default PHU parameter.

LAI Evolution
The previous results show the importance of considering Approach C, where data assimilation forced simulations of the local calibrated model. Contrarily to the expected, LAI assimilation did not further improve Approach A results. The R 2 values still decreased down to 0.63, while the RMSE and NRMSE values increased up to 1.73 m 2 m −2 and 0.43, respectively. The EF values also decreased down to 0.51. However, these statistics were better than those obtained using only direct LAI assimilation (Approach B), showing the importance of local model calibration. Model simulations fully covered maize's lifecycle this time since no constraints in the PHU existed. However, results were still dependent on the quality of the assimilated data, with LAI evolution at the end of the 2015 season suggesting that some filtering would be needed during the assimilation process (Figure 4).
Despite the lower statistical indicators found when compared to those using only the calibrated model (Approach A), LAI evolution was also considered to be well represented when LAI data assimilation was included in the MOHID-Land model simulations (Approaches B and C), particularly during the earlier crop stages. Results further suggested that a higher time resolution of assimilated data would improve the agreement between model simulations and measured data. Nonetheless, more important than accurately predicting LAI evolution was to understand how data assimilation impacted the soil water balance, aboveground dry biomass and yield estimates during the 2014 and 2015 growing seasons, as shown below.  (Table 6). During the 2015 growing season, the positive impact of LAI data assimilation on soil water content simulation was more modest with only the EF values showing a relative improvement from 0.11 to 0.28. No noticeable differences were found between Approach B and Approach C statistical indicators. All simulations shared the same soil hydraulic parameters (Table 1) to better assess the actual impact of LAI assimilation on soil water content simulations, explaining thus the similarity of model results.

Soil Water Balance
As LAI evolution was used in the partition of ET c values into T p (Equation (3)) and E p (Equation (4)) [37], these two soil water balance components showed the greatest variation when considering LAI data assimilation ( Table 7). The calibrated model (Approach A) produced estimates of T a , E a , capillary rise (CR) and deep percolation (DP), in line with other studies carried out in the region [51][52][53], some of which highlighting the importance of CR to the soil water balance in the Sorraia Valley region. Direct LAI assimilation produced always the lowest T p and T a values (T a /T p = 1), and, naturally, the highest E p and E a values during both seasons. The LAI data forcing on the calibrated model (Approach C) produced contrasting results when compared to Approach A, with T p values decreasing in 2014 when LAI evolution was underestimated (PBIAS = 24.39%) and increasing in 2015 when the opposite occurred (PBIAS = −10.34%). Accurate LAI predictions were thus essential for simulating crop transpiration and soil evaporation, even though other important soil water balance components such as deep percolation and capillary rise were not significantly affected by less accurate LAI predictions. As a result, LAI assimilation in MOHID-Land may thus have a direct influence on biomass development, while estimates of groundwater recharge or solute leaching from the root zone may be impacted less significantly. influence on biomass development, while estimates of groundwater recharge or solute leaching from the root zone may be impacted less significantly.

Crop Height
The direct assimilation of LAI (Approach B) showed the maize canopy growing faster than that measured in the field or simulated by Ramos et al. [3] (Approach A), similarly to LAI predictions ( Figure 6). Canopy height then assumed a default maximum value (h c,max ) of 2.5 m when the mid-season crop stage was reached (default value in Table 4), underestimating field values from that date onward and producing worse statistical indicators than those computed using the calibrated model ( Table 6). The main problem here was thus the lack of a local calibrated dataset with the impact of LAI assimilation on canopy height simulations being only marginal as shown by the good indicators again obtained in Approach C.

Dry Biomass and Yields
Simulations of the aboveground dry biomass were concordant with the estimated LAI values during the 2014 and 2015 growing seasons. In 2014, the underestimation of LAI values led also to lower aboveground dry biomass estimates following the direct LAI assimilation approach (Approach B), with these being further closer to field measurements than the earlier results from the calibrated model (Approach A). As a result, the RMSE values decreased from 5128.3 to 2183.7 kg ha -1 , the NRMSE values reduced from 0.39 to 0.24 and the EF values increased from 0.87 to 0.95 when direct LAI assimilation was considered. Contrarily, the overestimation of LAI values produced larger errors in 2015 when compared to field measurements, with the RMSE values increasing from 4616.8 to 6211.4 kg ha −1 , the NRMSE values increasing from 0.33 to 0.44 and the EF values decreasing from 0.89 to 0.79. The local calibrated crop dataset (Approach C) did not improve aboveground dry biomass estimates.

Dry Biomass and Yields
Simulations of the aboveground dry biomass were concordant with the estimated LAI values during the 2014 and 2015 growing seasons. In 2014, the underestimation of LAI values led also to lower aboveground dry biomass estimates following the direct LAI assimilation approach (Approach B), with these being further closer to field measurements than the earlier results from the calibrated model (Approach A  (Table 8). Yield predictions were in line with the same under-and overestimation tendencies observed in simulations of the aboveground dry biomass. Yield estimated from the three modeling approaches were relatively close (14,670-15,518 kg ha −1 ) during the 2014 season, with Approach C producing the best estimates. On the other hand, during the 2015 season, yield estimates from that same approach produced the worse results (23,016 kg ha −1 ), with all models diverging substantially from the measured value. While many studies throughout the literature reported that precise knowledge of light interception and hence LAI, was critical for predicting biomass and yield accurately [12,17,54], results presented here were more in line with Linker and Ioslovich [19], who found that assimilation of easy-to-obtain canopy cover measurements did not always improve the predictions of biomass. They explained that by model choice, which in their case was a purely water-driven model in which solar radiation and light interception were not considered explicitly, likely resulting in underestimating the overall impact of canopy cover on crop development. A similar reasoning can be considered here. Ines et al. [16] also found that LAI assimilation could not always improve simulated aboveground dry biomass and yield predictions, particularly during dry conditions as the root zone soil moisture could not meet the increased water demand that resulted from improved canopy growth. Likewise, Nearing et al. [15] referred to the failure of LAI and soil moisture data assimilation in improving yield estimates, especially in water-limited environments, pointing out similar reasons as Ines et al. [16]. From a different perspective, Trombetta et al. [55] made use of remote sensed LAI data derived from the MODIS satellite images for calibrating/validating a hydrological model at the plot scale. Remote sensing LAI data, after being converted into canopy cover, was used as an alternative to field measurements during the calibration/validation process, with results suggesting this approach as a viable alternative for characterizing landscape heterogeneity (crop variability) at larger scales.

Uncertainty of Model Estimates
The previous sections showed that the impacts of LAI assimilation on MOHID-Land final estimates of T a , E a and aboveground biomass were substantial and that some filtering would be eventually needed for improving the quality of assimilated data. The uncertainty analysis carried out using a Monte Carlo method confirmed these early findings, with Figure 7 showing a relatively large uncertainty of final model estimates when LAI assimilation was performed during the first two dates (23/04/2015, 09/05/2015) of 2015 crop growing season. In these dates, T a values ranged from 460 to 527 mm, while E a values varied between 102 and 184 mm. Likewise, the aboveground dry biomass showed also considerable variation, ranging from 37,978 to 47,347 kg ha −1 . From those dates onwards, the uncertainty of final model estimates decreased, being relatively small when LAI assimilation was carried out at the end of the crop cycle. Filtering of the assimilation data would thus be important during the earlier crop stages, becoming irrelevant as the crop reaches the end of its life cycle.
The large uncertainty observed on model final estimates was already expected when LAI assimilation was performed during the earlier dates since the crop was still at the initial and development stages and thus is growth cycle was not yet well defined. Yet, the Monte Carlos simulation results showed that the assimilation of too low LAI values could lead the model to greatly underestimate crop transpiration and aboveground dry biomass, while soil evapotranspiration would be greatly overestimated. Hence, this can be quite problematic in the absence of additional information to update model simulations throughout the crop season if new satellite images are no longer available (e.g., due to cloud cover). In this case, the model will never be able to further update simulations of T a , E a and aboveground dry biomass, producing quite substantial errors.

Conclusions
Remote sensing technology can provide valuable information for hydrological modeling at the field and regional scales by better characterizing the spatial and temporal variability of soils, land uses and climate, which otherwise are difficult to portray. This study showed that LAI assimilation from NDVI derived satellite data improved MOHID-Land estimates of the soil water balance and simulations of crop height and aboveground dry biomass during the early stages of the crop growing period. However, data assimilation was never sufficient to improve model simulations in the absence of a crop calibrated dataset, failing to simulate the entire growing season when calibrated potential head units (PHU) were missing or even crop maximum height when crop parameterization was misadjusted.
LAI data assimilation led also to great uncertainty on final estimates of crop transpiration, soil evaporation and the aboveground dry biomass when solely performed during the initial stages of the crop growing period. Although model uncertainty then decreased as LAI assimilation was being carried out closer to the end of the crop cycle, results showed that this approach may lead to complete erroneous estimates of the soil water balance and crop yields even when local calibrated soil and crop datasets are used. Therefore, while LAI remote sensing data can help defining MOHID-Land's input parameters, additional data sources should be accessed for complementing such characterization. The implementation of the MOHID-Land model at the regional scale cannot depend solely on inputs from the LAI data assimilation as estimates may diverge substantially from reality.

Conclusions
Remote sensing technology can provide valuable information for hydrological modeling at the field and regional scales by better characterizing the spatial and temporal variability of soils, land uses and climate, which otherwise are difficult to portray. This study showed that LAI assimilation from NDVI derived satellite data improved MOHID-Land estimates of the soil water balance and simulations of crop height and aboveground dry biomass during the early stages of the crop growing period. However, data assimilation was never sufficient to improve model simulations in the absence of a crop calibrated dataset, failing to simulate the entire growing season when calibrated potential head units (PHU) were missing or even crop maximum height when crop parameterization was misadjusted.
LAI data assimilation led also to great uncertainty on final estimates of crop transpiration, soil evaporation and the aboveground dry biomass when solely performed during the initial stages of the crop growing period. Although model uncertainty then decreased as LAI assimilation was being carried out closer to the end of the crop cycle, results showed that this approach may lead to complete erroneous estimates of the soil water balance and crop yields even when local calibrated soil and crop datasets are used. Therefore, while LAI remote sensing data can help defining MOHID-Land's input parameters, additional data sources should be accessed for complementing such characterization. The implementation of the MOHID-Land model at the regional scale cannot depend solely on inputs from the LAI data assimilation as estimates may diverge substantially from reality.