Estimation and Assessment of the Root Zone Soil Moisture from Near-Surface Measurements over Huai River Basin

: Root zone soil moisture (RZSM) is a vital variable for agricultural production, water resource management and runoff prediction. Satellites provide large-scale and long-term near-surface soil moisture retrievals, which can be used to estimate RZSM through various methods. In this study, we tested the utility of an exponential ﬁlter (ExpF) using in situ soil moisture by optimizing the optimal characteristic time length T_opt for different soil depths. Furthermore, the parameter analysis showed that T_opt correlated negatively with precipitation and had no signiﬁcant correlation with selected soil properties. Two approaches were taken to obtain T_opt: (1) optimization of the Nash–Sutcliffe efﬁciency coefﬁcient (NSE); (2) calculation based on annual average precipitation. The precipitation-based T_pre outperformed the station-speciﬁc T_opt and stations-averaged T_opt. To apply the ExpF on grid scale, the precipitation-based T_pre considering spatial variability was adopted in the ExpF to obtain RZSM from a new soil moisture dataset RF_SMAP_L3_P (Random Forest Soil Moisture Active Passive_L3_Passive) continuous in time and space over Huai River Basin. Finally, the performance of RF_SMAP_L3_P RZSM (0–100 cm) was evaluated using in situ measurements and compared with mainstream products, for instance, Soil Moisture Active Passive (SMAP) and Soil Moisture and Ocean Salinity Level 4 (SMOS L4) RZSM. The results indicated that RF_SMAP_L3_P RZSM could captured the temporal variation of measured RZSM best with R value of 0.586, followed by SMAP L4, which had the lowest bias value of 0.03, and SMOS L4 signiﬁcantly underestimated the measured RZSM with bias value of − 0.048 in the basin. Higher accuracy of RF_SMAP_L3_P RZSM was found in the ﬂood period compared with the non-ﬂood period, which indicates a better application for ExpF in wetter weather conditions.


Introduction
Soil moisture is a critical state variable in land-atmosphere interactions due to its pivotal role in hydrological cycle, exchange of energy and moisture fluxes, weather and climate systems [1][2][3].In particular, root zone soil moisture (RZSM) has important applications in agricultural drought monitoring and prediction, water resource management, and data assimilation for runoff prediction [4][5][6].Huai River Basin (HRB) is an important grain production area in China, where floods and droughts occur frequently that seriously threaten agricultural production.
Various techniques (e.g., gravimetric, time/frequency domain reflectometry) provide in situ soil moisture at fixed locations [4,7], so it is hard to map soil moisture at large spatial scale due to high spatial-temporal variability resulting from the interactions and feedbacks between ecohydrological processes.The spatial-temporal variability of soil moisture is crucial for application in water resource management, agricultural production and ecosystem sustainability and is dominated by different factors across different spatial scales, among which the effect of precipitation distribution on the spatial-temporal soil moisture variability is non-negligible [8,9].In addition, the relation between the coefficient of variation (CV) of mean spatial soil moisture and mean spatial soil moisture often shows a hysteresis pattern, but the occurrence of hysteresis pattern decreases with increasing spatially soil heterogeneity and even disappears completely for the fully heterogeneous soil [10].Satellite remote sensing techniques enable us to monitor soil moisture globally, for instance, Soil Moisture Active Passive (SMAP) and Soil Moisture and Ocean Salinity (SMOS).However, they can only capture soil moisture for the top few centimeters (~5 cm) [11][12][13].Recently, satellite soil moisture sensing has widely been used.For instance, estimating irrigation magnitude by assimilating satellite soil moisture into the land surface model (LSM) or the soil water balance equation [14]; detecting irrigation signals using satellite soil moisture [15]; retrieving rainfall from satellite soil moisture based on the soil water balance equation [16]; monitoring and predicting extreme events (e.g., agricultural drought and heat waves) [17][18][19][20].Similarly, satellite soil moisture could be used to estimate RZSM due to the strong coupling strength between surface soil moisture (SSM) and RZSM [21][22][23][24][25].However, quantities of SMAP and SMOS surface soil moisture are missing, spatially affected by radio frequency interference (RFI), especially in Europe and Asia (most areas of China, including the Huai River Basin) [26], which brings great challenges to the application of satellite soil moisture.Therefore, Wang et al. [27] filled the SMAP soil moisture gaps using random forest and created the RF_SMAP_L3_P surface soil moisture, continuous in time and space, which could be used to estimate RZSM over HRB.Both SMOS and SMAP provide continuous spatial and temporal Level 4 (L4) RZSM, but neither have been validated in the HRB and the accuracy is uncertain.
Statistics-based methods are the simplest among the three types of methods, which establish the linear relationship between RZSM and SSM or multiple atmospheric forcing variables.However, the accuracy of statistics-based methods is the lowest due to the simple model structure and lack of physical mechanisms.Previous studies show that data-driven methods can predict RZSM with good accuracy, but the disadvantages are obvious: (1) the accuracy of an ANN declines sharply once the data used to predict are outside the training conditions; (2) the trained ANN model has low transferability, limited by local climatic conditions [31,32].By comparison, physics-based methods are a better choice for the solid physical mechanism and higher accuracy, in which assimilating remote sensing data into the land surface model may be the most accurate [36][37][38].However, this method needs a wide range of input data (precipitation, humidity, etc.) and is not suitable for areas where measured data are scarce.In addition, the method has a complex model structure and is computationally expensive.Karandish et al. [39] compare a process-based numerical model (HYDRUS-2D), machine learning (SVM) and multiple regression model (MLR), they reported that process-based numerical models are best at predicting soil moisture, followed by machine learning models and then MLR.Zhang et al. [29] compared three methods (ANN, LR and ExpF) for vertical extrapolation of surface soil moisture and indicated that the ExpF outperforms ANN and LR for capturing the relative variability and correlations between soil moisture at different depths.Tian et al. [2] adopted ANN, CDF and ExpF to estimate subsurface soil moisture from surface soil moisture and reported that ExpF best captures the temporal variation of subsurface soil moisture.
The ExpF was proposed by Wagner et al. [34] on the basis of a two-layer soil water balance equation and widely used to estimate profile soil moisture from near-surface soil moisture with little input and reasonable accuracy at several fixed locations [2,6,11,29,33,35,40].However, few studies yet exist on estimating RZSM from SSM on a basin scale.
When the ExpF was used to estimate RZSM from satellite-derived soil moisture grid data on a basin scale, it was crucial to obtain the characteristic time length T_opt for each grid unit.Hence, it was necessary to figure out the main controlling factors on T_opt.Albergel et al. [33] pointed that a weak connection between T_opt and climatic conditions may exist.Ford et al. [40] reported that T_opt was sensitive to near-surface soil moisture conditions and tended to be smaller when surface soil moisture conditions were extremely dry or wet.Wang et al. [41] reported that T_opt was negatively correlated with sand fraction and bulk density, and positively correlated with clay fraction and soil organic matter at Automated Weather Data Network (AWDN) stations, but no significant correlation was found at the Soil Climate Analysis Network (SCAN) stations due to the higher spatial variability in precipitation.However, Tian et al. [2] drew the opposite conclusions, namely that T_opt was negatively correlated with clay fraction and soil organic matter, and positively correlated with bulk density.Therefore, the main driver of T_opt may depend on a combination of local specific climatic conditions and soil properties.For the HRB, further research is needed on the main controlling factors of the T_opt and how to derive the T_opt for each grid at basin-scale from the controlling factors.
This study seeks to provide the following information: (1) how to determine the T_opt suitable for the HRB at fixed locations; (2) how to obtain the T_opt with high spatial variability for each grid at basin-scale, considering climatic conditions and soil properties; (3) how to obtain a relatively accurate RZSM dataset for the HRB.

Study Area
The Huai River Basin (HRB) is a typical humid and subhumid region in eastern China, bordering the Yellow Sea and covering an area of 27 × 10 4 km 2 .The ranges of longitude and latitude for HRB are 111 • 55 to 121 • 25 E and 30 • 55 to 36 • 36 N, respectively (Figure 1).The HRB is located in the transitional zone between northern subtropical (north HRB) and warm temperate climates (south HRB).The average annual precipitation ranges from 600 to 1000 mm with an average of 888 mm, and the annual evaporation ranges from 900 to 1500 mm [42].There are typical north-south precipitation and evaporation gradients in opposite directions [43][44][45].Precipitation is mostly concentrated in June and July.The heterogeneity of precipitation and evaporation leads to dynamic temporal and spatial variations soil moisture; thus, frequent droughts and floods occur in the HRB.The territory is relatively flat, mainly comprising plains surrounded by mountains [42].

In Situ Measurements
The in situ soil moisture measurements were collected at 31 stations from 1 April 2015 to 31 December 2019.At each station, soil moisture is measured daily by time domain reflectometry (TDR) soil moisture probes installed horizontally at depths of 5, 20, 40 and 100 cm.Most of the in situ stations are located on the Huaibei plain where the main land-cover is wheat and maize.Hence, the soil moisture measurements are characterized by similar underlying surface and climate conditions.
Daily precipitation and near-surface air temperature are available on the China Meteorological Data Network (http://data.cma.cn/,accessed on 20 December 2022), and quality control procedures are executed at each station.The grid dataset is obtained by interpolating spatially, using the method of partial thin-plate smoothing splines from more than 2400 national ground meteorological station observations after quality controls and corrections.The average coverage rate of gauging stations located in a grid cell is 38% across the whole of China, but up to 77% in the eastern part of China where the HRB is located.The rainfall data has a mean RMSE of 0.49 mm and R of 0.93 significant at p < 0.01 [46].The air temperature data have a mean bias of ±0.2 • C and RMSE of 0.2-0.5 • C [47].

In Situ Measurements
The in situ soil moisture measurements were collected at 31 stations from 1 April 2015 to 31 December 2019.At each station, soil moisture is measured daily by time domain reflectometry (TDR) soil moisture probes installed horizontally at depths of 5, 20, 40 and 100 cm.Most of the in situ stations are located on the Huaibei plain where the main landcover is wheat and maize.Hence, the soil moisture measurements are characterized by similar underlying surface and climate conditions.
Daily precipitation and near-surface air temperature are available on the China Meteorological Data Network (http://data.cma.cn/,accessed on 20 December 2022), and quality control procedures are executed at each station.The grid dataset is obtained by interpolating spatially, using the method of partial thin-plate smoothing splines from more than 2400 national ground meteorological station observations after quality controls and corrections.The average coverage rate of gauging stations located in a grid cell is 38% across the whole of China, but up to 77% in the eastern part of China where the HRB is located.The rainfall data has a mean RMSE of 0.49 mm and R of 0.93 significant at p < 0.01 [46].The air temperature data have a mean bias of ±0.2 °C and RMSE of 0.2-0.5 °C [47].
For soil properties, information of a gridded dataset with a resolution of 30″ × 30″ (about 1 km at the equator) containing the information of eight layers from 0 to 230 cm was compiled using the soil properties from and surface modeling over China [48].

RF_SMAP_L3_P Surface Soil Moisture
The RF_SMAP_L3_P near-surface soil moisture dataset generated by Wang et al. [27] was produced by using NASA-derived data (e.g., soil moisture, precipitation, land surface temperature) to drive a random forest model to estimate missing SMAP values for the For soil properties, information of a gridded dataset with a resolution of 30" × 30" (about 1 km at the equator) containing the information of eight layers from 0 to 230 cm was compiled using the soil properties from and surface modeling over China [48].

RF_SMAP_L3_P Surface Soil Moisture
The RF_SMAP_L3_P near-surface soil moisture dataset generated by Wang et al. [27] was produced by using NASA-derived data (e.g., soil moisture, precipitation, land surface temperature) to drive a random forest model to estimate missing SMAP values for the Huai River Basin and used as the input for the exponential filter in the study.The RF_SMAP_L3_P near-surface soil moisture dataset has a temporal resolution of 1 day and spatial resolution of 0.25 • × 0.25 • .The RF_SMAP_L3_P near-surface soil moisture dataset was evaluated with the method of in situ validation (accuracy: ubRMSD ≈ 0.05) and triple collocation analysis (accuracy: ubRMSE ≈ 0.04) by Wang et al. [27], which meets the accuracy requirement of the SMAP mission (0.04 m 3 /m 3 ) for soil moisture retrievals.

SMAP L4 SSM and RZSM Products
The SMAP Level 4 (L4) surface (0-5 cm) and root zone (0-100 cm) soil moisture product is generated by assimilating SMAP L1C_TB brightness temperature data into the NASA Catchment land surface model [49].The land surface model is driven with surface meteorological forcing data from the Goddard Earth Observing System (GEOS) forward processing system, including corrected GEOS precipitation with the Center for Climate Prediction Unified (CPCU) 0.5 • daily precipitation product.The L4 soil moisture product is available from 31 March 2015 to present at a resolution of 9 km and 3 h.The SMAP L4 soil moisture product is distributed by the National Snow and Ice Data Center (NSIDC) (https://nsidc.org/,accessed on 3 January 2023).

SMOS L3 and L4 Products
The SMOS satellite, equipped with a Microwave Imaging Radiometer using an Aperture Synthesis (MIRAS), directly sensed soil moisture in the 0 to 5 cm soil layer.CATDS (Centre Aval de Traitement des Données SMOS) produced and distributed a global, 0.25-degree, daily resolution of the Level 4 root zone soil moisture (L4 RZSM) product from 14 January 2010 to present.The SMOS L4 RZSM (0-100 cm) is obtained from SMOS L3 3-day average SSM (0-5 cm) [13], through a modified formulation of the exponential filter that integrates soil texture.The water bucket model considers the soil profile as three layers (0-5 cm, 5-40 cm and 40-100 cm).The scaled 0-5 cm layer soil moisture was applied to the water bucket model to obtain soil moisture in the of 5-40 cm layer, and a similar procedure is executed for soil moisture in the 40-100 cm layer.The RZSM is depth-weighted average of soil moisture in the three layers.SMOS L3 and L4 products are both available at CATDS (https://www.catds.fr/,accessed on 3 January 2023).

Methodology 2.3.1. The Recursive Exponential Filter
The exponential filter proposed by Wagner et al. [34] is a two-layer water bucket model, which assumes that the water flux of the two layers is proportional to the difference between SSM and RZSM.Albergel et al. [50] first used a recursive exponential filter equation to estimate RZSM from the remotely sensed surface layer, which is written as: where SW I m,t n and SW I m,t n−1 are predicted root zone soil wetness index (e.g., 20 cm, 40 cm and 0-100 cm) at times t n and t n−1 , ms t n is normalized near-surface soil wetness index (5 cm) at t n , t n is the time index.ms t n and SW I m,t n are both degree of saturation ranging from 0 to 1, which are normalized by daily volumetric soil moisture with maximum and minimum values observed across the entire observation period at each in-situ station, the gain K t n at time t n is calculated in a recursive form by: where T is the characteristic time length in units of day and represents the transfer time from the surface layer to the root zone layer, a comprehensive parameter affected by hydrological processes (precipitation and others) and hydrogeological conditions (soil texture and other properties) [33].L is the soil layer depth of the root zone, and C is the pseudo diffusion coefficient of soil water.For the initialization of the filter, set In this study, various T (0-60 days) values were used to optimize the exponential filter, and the optimal T T opt value was obtained at each station when the Nash-Sutcliffe efficiency coefficient (NSE) between predicted SW I m and observed SW I o peaked.Furthermore, the exponential filter's code, provided by Brocca et al. [53], is available at (Satellite soil moisture validation-hydrology (cnr.it),accessed on 3 January 2023).

Calculation of Profile Soil Moisture (RZSM) Values
The in situ profile soil moisture (0-100 cm) or RZSM is calculated with a depthweighted method using the soil moisture measured at depths of 5, 20, 40 and 100 cm [54].The equation is calculated by: where θ RZSM is the volumetric soil moisture for the 0-100 cm layer (m 3 /m 3 ), θ n is the volumetric soil moisture at the nth observation depth (m 3 /m 3 ), and L n is the soil layer thickness between adjacent observation depths (cm).

Precipitation-Based T pre
We first optimized the station-specific T opt value using in situ measurements at a few fixed locations.However, affected by precipitation, T opt exhibited high spatial variability among the stations.In order to apply the ExpF on each grid unit, we needed to obtain T opt for each grid.T opt was mainly driven by precipitation, so different statistical models, such as linear regression and exponential function, between precipitation and T opt were established to obtain precipitation-based T pre values for each grid from grid precipitation.A summary of the different statistical models is reported in Table S1 where y 1 , y 2 , y 3 and y 4 represent T opt (day) at depths of 20, 40, 100 and 0-100 cm, respectively.x represents annual average precipitation (mm).

The Spatial-Temporal Variability of Soil Moisture
Figure 2a,b displays the evolution of mean spatial soil moisture and coefficient of variation (CV) with time.There is a hysteresis pattern between the CV of spatial soil moisture and mean spatial soil moisture, which could be attributed to differential soil moisture dynamics and lateral flow redistribution and the heterogeneity in soil properties [8,10].In addition, the root zone soil moisture leads to a slight increase in the occurrence of hysteresis comparing with surface soil moisture in this study.The CV values of spatial soil moisture at depths of 5, 20, 40 and 100 cm across all stations are calculated and are 0.163, 0.150, 0.159 and 0.134, respectively.The root zone soil moisture has a smaller CV than the surface soil moisture, which is significantly affected by strong land-atmosphere interactions.And a smaller CV could lead to an increase in the occurrence of hysteresis.In general, a low precipitation event would mask the hysteresis pattern comparing with a high precipitation event.Figure 2c,d shows the frequency distribution of mean temporal soil moisture and CV of temporal soil moisture.The root zone soil moisture (100 cm) shows similar characteristics to surface soil moisture but has higher range of mean temporal soil moisture (Figure 2c). Figure 2d shows that deeper soil moisture has a smaller range of CV of temporal soil moisture than surface soil moisture and lower frequency distribution.Moreover, the relation between frequency distribution and mean temporal soil moisture shows clear bimodality, but the relation between frequency distribution and CV of temporal soil moisture shows clear unimodality and being skewed to the smaller CV of the temporal soil moisture.precipitation event.Figure 2c, d shows the frequency distribution of mean temporal soil moisture and CV of temporal soil moisture.The root zone soil moisture (100 cm) shows similar characteristics to surface soil moisture but has higher range of mean temporal soil moisture (Figure 2c). Figure 2d shows that deeper soil moisture has a smaller range of CV of temporal soil moisture than surface soil moisture and lower frequency distribution.Moreover, the relation between frequency distribution and mean temporal soil moisture shows clear bimodality, but the relation between frequency distribution and CV of temporal soil moisture shows clear unimodality and being skewed to the smaller CV of the temporal soil moisture.

Cross-Correlation Coefficient Calculations
Figure 3 displays the cross-correlation coefficients between SSM and RZSM at various depths for selected stations.In general, the coupling strength between SSM and RZSM weakened with time lag (up to 50 days) and soil depth, respectively.The maximum 5-20 cm cross correlation coefficient typically occurred with a 0-day lag and varied from 0.66 to 0.97 with an average value of 0.87 across the study stations.The maximum 5-40 cm cross correlation coefficients were lower than those for 5-20 cm, ranging from 0.52 to 0.90 with an average value of 0.71.The maximum 5-100 cm cross correlation coefficients

Cross-Correlation Coefficient Calculations
Figure 3 displays the cross-correlation coefficients between SSM and RZSM at various depths for selected stations.In general, the coupling strength between SSM and RZSM weakened with time lag (up to 50 days) and soil depth, respectively.The maximum 5-20 cm cross correlation coefficient typically occurred with a 0-day lag and varied from 0.66 to 0.97 with an average value of 0.87 across the study stations.The maximum 5-40 cm cross correlation coefficients were lower than those for 5-20 cm, ranging from 0.52 to 0.90 with an average value of 0.71.The maximum 5-100 cm cross correlation coefficients ranged from 0.32 to 0.54 with an average value of 0.43.The results were consistent with previously reported conclusions [25,40].In addition, we found an interesting phenomenon.When time lag ranged from 0 to 10 days, the 5-20 cm cross correlation coefficient was higher than those for 5-40 cm and 5-100 cm, respectively.However, for time lags between 10 and 50 days, the 5-100 cm cross correlation coefficients exceeded those for 5-20 cm and 5-40 cm, which could be explained by the fact that response time of the 5-100 cm soil moisture was longer than those for 5-20 cm and 5-40 cm.

Controls on the Coupling Strength
We further analyzed the controlling factors on the coupling strength of soil moisture between SSM and RZSM at various depths.Average annual precipitation, average annual air temperature, relative humidity and evapotranspiration were investigated on how they affected the coupling strength.The relationships between maximum cross correlation coefficients and correlated impact factors are presented in Figure 4.No significant correlation was found between relative humidity, evapotranspiration and maximum cross correlation, so it is not exhibited.
previously reported conclusions [25,40].In addition, we found an interesting phenomenon.When time lag ranged from 0 to 10 days, the 5-20 cm cross correlation coefficient was higher than those for 5-40 cm and 5-100 cm, respectively.However, for time lags between 10 and 50 days, the 5-100 cm cross correlation coefficients exceeded those for 5-20 cm and 5-40 cm, which could be explained by the fact that response time of the 5-100 cm soil moisture was longer than those for 5-20 cm and 5-40 cm.

Controls on the Coupling Strength
We further analyzed the controlling factors on the coupling strength of soil moisture between SSM and RZSM at various depths.Average annual precipitation, average annual air temperature, relative humidity and evapotranspiration were investigated on how they affected the coupling strength.The relationships between maximum cross correlation coefficients and correlated impact factors are presented in Figure 4.No significant correlation was found between relative humidity, evapotranspiration and maximum cross correlation, so it is not exhibited.

Controls on the Coupling Strength
We further analyzed the controlling factors on the coupling strength of soil moisture between SSM and RZSM at various depths.Average annual precipitation, average annual air temperature, relative humidity and evapotranspiration were investigated on how they affected the coupling strength.The relationships between maximum cross correlation coefficients and correlated impact factors are presented in Figure 4.No significant correlation was found between relative humidity, evapotranspiration and maximum cross correlation, so it is not exhibited.The left panel of Figure 4 displays the scatter plots between maximum cross correlation coefficient and average annual precipitation calculated with the daily observations over all in situ stations.There was a significant negative correlation between precipitation and soil moisture coupling strength of 5-20 cm (R 2 = 0.26), 5-40 cm (R 2 = 0.49) and 5-100 cm (R 2 = 0.53), respectively.Moreover, it seemed that precipitation had a stronger influence on the coupling strength between SSM and deeper RZSM.The negative correlation between cross correlation coefficient and average annual precipitation indicated that the coupling strength between SSM and deeper RZSM was stronger in dry regions (northwest) than that in wet regions (southeast), which confirmed the results in Nebraska but contrasted with that of Oklahoma reported by Ford et al. [40].As mentioned above, the soil moisture might affect the patterns of precipitation by controlling the evapotranspiration and other energy fluxes.Boé et al. [17] indicated that the precipitation tended to be larger in dry soil conditions.Taylor et al. [55] also found that the afternoon precipitation preferentially fell over dry soil regions where convective events driven by increased sensible heat flux over dry soil occurred frequently.Thus, the negative correlation between precipitation and coupling strength could be explained by the negative soil moisture-precipitation feedback caused by the specific circulation regime or convective events.
The right panel of Figure 4 shows similar characteristics but for maximum cross correlation coefficient and average annual air temperature ( • C).There was a negative correlation between air temperature and soil moisture coupling strength of 5-20 cm (R 2 = 0.18), 5-40 cm (R 2 = 0.26) and 5-100 cm (R 2 = 0.33), respectively.The difference is that the coefficient of determination (R 2 ) associated with air temperature was constantly lower than precipitation at different depths, which showed that precipitation might have a stronger control on coupling strength between SSM and RZSM than air temperature.One reason for the negative correlation between coupling strength and air temperature could be explained as follows: The net radiation heat was partitioned into latent heat flux for evapotranspiration and sensible heat flux for rising air temperature.Therefore, the latent heat flux consumed by dry soil was less than wet soil; the remaining sensible heat flux for dry soil was more than wet soil, which increased air temperature [56][57][58].In addition, the hydraulic connection and coupling strength of dry soil is weaker than wet soil, so the maximum cross-coefficient and air temperature is negatively correlated.Finally, due to the strong land surface-atmosphere interactions, precipitation and air temperature might have stronger effect on the coupling strength for 5-20 cm soil moisture than 5-40 cm and 5-100 cm soil moisture, so R 2 increased with soil depth.
The cross-correlation analysis above implied that SSM had a strong connection with the RZSM over in situ stations in HRB.Similar results were reported by [24,25].Thus, it is promising to establish links between SSM and RZSM.However, there are obvious outliers in the scatterplots for Figure 4a,b.First, these in situ stations are mostly located in or close to the mountainous regions of the HRB (e.g., northeast, southwest and northwest of the HRB), where the underlying surface and local weather conditions are different from that in the plain region.Thus, these differences result in heat and moisture exchange between land surface and atmosphere distinctly different in mountainous and plain regions, respectively.In general, the underlying surface in mountainous regions is more complex than that in plain regions, which impedes the transfer and exchange of heat and moisture in the vadose zone, and further weakens the response of coupling strength to precipitation or air temperature.Moreover, due to the stronger land-atmosphere interactions in the surface layer than the root zone layer, the mixed effect of precipitation and air temperature on the coupling strength for 5-20 cm soil moisture leads to the low correlation coefficient between the coupling strength and precipitation or air temperature, i.e., the coupling strength for 5-20 cm soil moisture is not dominated by sole precipitation or air temperature.

Analysis of T opt
In the study, SW I o at 5 cm was used to predict SW I m at 20, 40, 100 cm and profile average values at the in situ stations.Figure 5 shows that NSE between SW I o and SW I m at different depths varied with time for each station.The station-specific T opt was obtained when the NSE peaked.The statistics of T opt at different depth and evaluation metrics are presented in Table 1.For the HRB stations, the T opt ranged from 1 to 25 days with an average value of 4 days at 20 cm, from 1 to 60 days with an average value of 8 days at 40 cm, from 2 to 60 days with an average value of 36 days at 100 cm, and from 1 to 60 days with an average value of 11 days for the profile average value.The T opt at 20 cm was smaller than results from Wagner et al. [34] and Wang et al. [41], who used SW I o at 5 cm to predict SW I m at 20 cm with T opt = 15 and 15.8 days, respectively.The T opt at 40 cm was also smaller than previous results.For instance, Albergel et al. [33] used SW I o at 5 cm to estimate SW I m at 30 cm in SMOSMANIA and SMOSREX networks by setting T opt = 6 days, and Tian et al. [2] used SW I o at 5 cm to predict SW I m at 40 cm with T opt = 12.4 days.By contrast, The T opt at 100 cm was also larger than the results reported by Wagner et al. [34], who used SW I o at 5 cm to estimate the SW I m at 100 cm with T opt = 20 days.The reasoning can be explained as follows: the soil column is divided into the plough layer (0-20 cm), black soil layer (20~50 cm) and the lime concretion soil layer (50~100 cm) due to the soil stratification in the HRB, so the relative impervious layer hinders the infiltration of soil water and hence results in a larger T opt at 100 cm [59].Table 1 also lists the statistics of T opt using surface SW I o to predict root zone SW I m and corresponding NSE reported in the previous literature.Overall, T opt tended to increase and NSE tended to decrease with soil depth.
cm was smaller than results from Wagner et al. [34] and Wang et al. [41], who used  at 5 cm to predict  at 20 cm with  = 15 and 15.8 days, respectively.The  at 40 cm was also smaller than previous results.For instance, Albergel et al. [33] used  at 5 cm to estimate  at 30 cm in SMOSMANIA and SMOSREX networks by setting  = 6 days, and Tian et al. [2] used  at 5 cm to predict  at 40 cm with  = 12.4 days.By contrast, The  at 100 cm was also larger than the results reported by Wagner et al. [34], who used  at 5 cm to estimate the  at 100 cm with  = 20 days.The reasoning can be explained as follows: the soil column is divided into the plough layer (0-20 cm), black soil layer (20~50 cm) and the lime concretion soil layer (50~100 cm) due to the soil stratification in the HRB, so the relative impervious layer hinders the infiltration of soil water and hence results in a larger  at 100 cm [59].Table 1 also lists the statistics of  using surface  to predict root zone  and corresponding NSE reported in the previous literature.Overall,  tended to increase and NSE tended to decrease with soil depth.T opt varied considerably over different in situ stations.In order to apply the exponential filter on large spatial scale, so we evaluated the performance of the exponential filter using the stations-averaged T opt and precipitation-based T pre for specific depth.Figure 6 shows boxplot of NSE, RMSE and MBE between SW I o and SW I m for two different T opt .The NSE first increased with T, then decreased to a steady value when the T reached 60 days.For stations-averaged T opt , the NSE ranged from 0.03 to 0.89 with an average value of 0.53 at 20 cm, from −0.01 to 0.81 with an average value of 0.46 at 40 cm, from −1.26 to 0.60 with an average value of 0.19 at 100 cm, and from −0.67 to 0.84 with an average value of 0.42 for the profile average value.The NSE, at depth of 20 cm was larger than that of 40, 100 cm and profile average value, which was consistent with the trend of coupling strength between SSM and RZSM.For precipitation-based T pre , the NSE between SW I o and SW I m ranged from 0.37 to 0.94 with an average value of 0.63 at 20 cm, from 0.24 to 0.82 with an average value of 0.59 at 40 cm, from −0.74 to 0.71 with an average value of 0.32 at 100 cm, and from −0.38 to 0.84 with an average value of 0.55 for the profile average value.On the whole, the precipitation-based T pre outperformed stations-averaged T opt .A summary of the statistical metrics is shown in Table 2.

Controlling Factors on T opt
The characteristic time length T opt is the unique variable of the exponential filter, so it is pivotal to figuring out that variables mainly control the T opt , which reflects the time scale of soil moisture dynamics.Soil depth, soil properties (bulk density, clay/sand/silt fraction, soil organic matter and porosity) and precipitation are used to investigate how T opt is affected.Figure 7 shows the boxplot of T opt at depths of 20, 40, 100 cm and the profile average value over all in situ stations.We are confident it infers that T opt tends to increase with soil depth.The stations-averaged T opt are 4, 8, 36 and 11 days, respectively.No significant correlation was found between soil properties and T opt (Figure S1), so it was not shown here.

Controlling Factors on 𝑇
The characteristic time length  is the unique variable of the exponential filter, so it is pivotal to figuring out that variables mainly control the  , which reflects the time scale of soil moisture dynamics.Soil depth, soil properties (bulk density, clay/sand/silt fraction, soil organic matter and porosity) and precipitation are used to investigate how  is affected.Figure 7 shows the boxplot of  at depths of 20, 40, 100 cm and the profile average value over all in situ stations.We are confident it infers that  tends to increase with soil depth.The stations-averaged  are 4, 8, 36 and 11 days, respectively.No significant correlation was found between soil properties and  (Figure S1),  Precipitation is the most important driver for soil moisture variation.Figure 8 shows that T opt was negatively correlated with average annual precipitation at all depths.The coefficients of determination (R 2 ) between T opt and average annual precipitation were (R 2 = 0.37) at 20 cm, (R 2 = 0.33) at 40 cm, (R 2 = 0.30) at 100 cm and (R 2 = 0.31) for profile average value.The quantitative analysis between T opt and precipitation helps us better comprehend how soil moisture and precipitation participate in land-atmosphere interactions and affect each other.There is a feedback mechanism (positive or negative) between soil moisture and precipitation, which contributes to predicate droughts and floods from the soil moisture [18, 19,55].For in situ stations with higher precipitation, the hydraulic connection between near-surface and root zone soil water was enhanced due to the frequent precipitation, which improved the infiltration rate for soil water and provided faster drainage conditions.Therefore, a smaller T opt was found for in situ stations with higher precipitation.The abnormally large T opt values mainly exist in the mountainous regions or north of the HRB with less precipitation.For the in situ stations in the mountainous regions, the precipitation reaches the land surface quickly form surface runoff due to the effect of topography (e.g., steep slope), and less precipitation infiltrates into the soil.In addition, the water infiltrating into the soil may be hindered by the relatively impermeable layer (rock formation) and form subsurface runoff.Both reduce the response of T opt value to precipitation.For the in situ stations in the northern HRB, which commonly has less precipitation than that in the southern and eastern HRB bordering the Yellow Sea due to the geographic location.Therefore, less precipitation reaches the land surface and infiltrate into the vadose zone, which takes more time from surface layer to root zone layer, thus producing a large T opt value.
Atmosphere 2023, 14, x FOR PEER REVIEW 13 of 22 Precipitation is the most important driver for soil moisture variation.Figure 8 shows that  was negatively correlated with average annual precipitation at all depths.The coefficients of determination (R 2 ) between  and average annual precipitation were (R 2 = 0.37) at 20 cm, (R 2 = 0.33) at 40 cm, (R 2 = 0.30) at 100 cm and (R 2 = 0.31) for profile average value.The quantitative analysis between  and precipitation helps us better comprehend how soil moisture and precipitation participate in land-atmosphere interactions and affect each other.There is a feedback mechanism (positive or negative) between soil moisture and precipitation, which contributes to predicate droughts and floods from the soil moisture [18, 19,55].For in situ stations with higher precipitation, the hydraulic connection between near-surface and root zone soil water was enhanced due to the frequent precipitation, which improved the infiltration rate for soil water and provided faster drainage conditions.Therefore, a smaller  was found for in situ stations with higher precipitation.The abnormally large  values mainly exist in the mountainous regions or north of the HRB with less precipitation.For the in situ stations in the mountainous regions, the precipitation reaches the land surface quickly form surface runoff due to the effect of topography (e.g., steep slope), and less precipitation infiltrates into the soil.In

Evaluation of RF_SMAP_L3_P SSM Product
The in situ SSM was used to validate the reconstructed RF_SMAP_L3_P SSM, SMAP L4 SSM and SMOS L3 SSM from 2015 to 2019.Evaluation metrics of R, Bias, RMSE and ubRMSE were used to evaluate the accuracy of specific SSM products.Figure 9 (left) displays the SSM time series of model products and in situ measurements.On the whole, there was a significant underestimation of SSM by the SMOS L3 product, but for days with high precipitation, the SMOS L3 product overestimated in situ SSM.SMAP L4 SSM significantly overestimated the in situ measurements when precipitation was low. Figure 9 (right) shows the scatterplots between model SSM and in situ measurements.RF_SMAP_L3_P SSM had the highest R of 0.63, followed by SMAP L4 SSM (R = 0.608) and SMOS L3 (R = 0.445).SMAP L4 SSM had the lowest ubRMSE of 0.032, followed by RF_SMAP_L3_P SSM (ubRMSE = 0.040), and both satisfied the accuracy requirement of 0.04 (m 3 /m 3 ).SMOS L3 had ubRMSE = 0.046.Compared to other model products, the reconstructed RF_SMAP_L3_P SSM best captured the temporal variations of in situ soil moisture and had the lowest MBE.Although RF_SMAP_L3_P had a higher ubRMSE than that of SMAP L4 SSM, we think that it could be used as the exponential filter input to estimate RZSM at different depths.

Evaluation of RF_SMAP_L3_P SSM Product
The in situ SSM was used to validate the reconstructed RF_SMAP_L3_P SSM, SMAP L4 SSM and SMOS L3 SSM from 2015 to 2019.Evaluation metrics of R, Bias, RMSE and ubRMSE were used to evaluate the accuracy of specific SSM products.Figure 9 (left) displays the SSM time series of model products and in situ measurements.On the whole, there was a significant underestimation of SSM by the SMOS L3 product, but for days with high precipitation, the SMOS L3 product overestimated in situ SSM.SMAP L4 SSM significantly overestimated the in situ measurements when precipitation was low. Figure 9 (right) shows the scatterplots between model SSM and in situ measurements.RF_SMAP_L3_P SSM had the highest R of 0.63, followed by SMAP L4 SSM (R = 0.608) and SMOS L3 (R = 0.445).SMAP L4 SSM had the lowest ubRMSE of 0.032, followed by RF_SMAP_L3_P SSM (ubRMSE = 0.040), and both satisfied the accuracy requirement of 0.04 (m 3 /m 3 ).SMOS L3 had ubRMSE = 0.046.Compared to other model products, the reconstructed RF_SMAP_L3_P SSM best captured the temporal variations of in situ soil moisture and had the lowest MBE.Although RF_SMAP_L3_P had a higher ubRMSE than that of SMAP L4 SSM, we think that it could be used as the exponential filter input to estimate RZSM at different depths.

Estimation and Evaluation of RZSM from RF_SMAP_L3_P SSM
The scaled RF-SMAP_L3_P SSM (0-5 cm) was adopted to estimate root zone SWI (20 cm, 40 cm, 100 cm and 0-100 cm) using the exponential filter method from 2015 to 2019, the precipitation-based  ( = 4 at 20 cm, 8 at 40 cm, 36 at 100 cm, 11 at 0-100 cm) were considered as the unique parameter to participate the calculation of root zone SWI, then root zone SWI was rescaled linearly to volumetric soil moisture, which was

Estimation and Evaluation of RZSM from RF_SMAP_L3_P SSM
The scaled RF-SMAP_L3_P SSM (0-5 cm) was adopted to estimate root zone SWI (20 cm, 40 cm, 100 cm and 0-100 cm) using the exponential filter method from 2015 to 2019, the precipitation-based T pre (T pre = 4 at 20 cm, 8 at 40 cm, 36 at 100 cm, 11 at 0-100 cm) were considered as the unique parameter to participate the calculation of root zone SWI, then root zone SWI was rescaled linearly to volumetric soil moisture, which was used to compare to in situ volumetric soil moisture.
Figure 10 displays the stations-averaged time series and scatterplots of RZSM at different depths, generally speaking, the accuracy of RZSM estimates decreased with soil depth, which was consistent with coupling strength.The evaluation metrics of R were 0.554, 0.500 and 0.290; those of MBE were −0.022, −0.025 and −0.064; and of RMSE were 0.038, 0.039 and 0.071 at depths of 20, 40 and 100 cm, respectively.However, the ubRMSE decreased with soil depth, probably because the bias increased faster than RMSE with soil depth.In general, compared to in situ RZSM, the RZSM estimates at all depths derived from the exponential filter were lower.Figure 11 shows the boxplots of evaluation metrics over all in situ stations.Generally speaking, the accuracy of RZSM estimates decreased with soil depth.The evaluation metric of R ranged from 0.19 to 0.59 with an average value of 0.40 at depth of 20 cm, from 0 to 0.54 with an average value of 0.35 at depth of 40 cm, from −0.15 to 0.39 with an average value of 0.20 at depth of 100 cm.The average ubRMSE was 0.045 at 20 cm (ranging from 0.033 to 0.069), 0.046 at 40 cm (from 0.032 to 0.075), 0.041 at 100 cm (from 0.021 to 0.068).Table 3 provides a summary of evaluation metrics.Figure 11 shows the boxplots of evaluation metrics over all in situ stations.Generally speaking, the accuracy of RZSM estimates decreased with soil depth.The evaluation metric of R ranged from 0.19 to 0.59 with an average value of 0.40 at depth of 20 cm, from 0 to 0.54 with an average value of 0.35 at depth of 40 cm, from −0.15 to 0.39 with an average value of 0.20 at depth of 100 cm.The average ubRMSE was 0.045 at 20 cm (ranging from 0.033 to 0.069), 0.046 at 40 cm (from 0.032 to 0.075), 0.041 at 100 cm (from 0.021 to 0.068).Table 3 provides a summary of evaluation metrics.
In this section, we evaluated the accuracy of RF_SMAP_L3_P RZSM, SMAP L4 RZSM and SMOS L4 RZSM compared to in situ RZSM.The left panel of Figure 12 shows the time series comparison between in situ RZSM and model products, the right panel of Figure 12 are scatterplots.RF_SMAP_L3_P RZSM had the highest R of 0.586, followed by SMAP L4 SSM (R = 0.540) and SMOS L3 (R = 0.359), and the lowest ubRMSE of 0.023, followed by RF_SMAP_L3_P SSM (ubRMSE = 0.027) and SMOS L3 (ubRMSE = 0.027), all of them satisfied the accuracy requirement of 0.04 (m 3 /m 3 ).There was no doubt that the R and ubRMSE for RZSM was lower than that for SSM, which was consistent with the conclusion drown by Reichle et al. [61].In general, all model products could roughly reproduce the in situ RZSM.It was clear that the SMOS L4 RZSM was constantly lower than in situ measurements, which was similar to the SMOS L3 SSM, and it didn't respond very well to precipitation.The underestimation of SMOS L4 SSM might be explained by the following two reasons.Firstly, SMOS L3 SSM underestimated the in situ measurements.
Secondly, the root zone SWI derived from exponential filter was lower than that of the in situ measurements.SMAP L4 and RF_SMAP_L3_P RZSM both captured the temporal variation of RZSM better than SMOS L4 RZSM; both of them matched the high in situ measurements well, but slightly overestimated the low in situ measurements.In conclusion, though showing slightly higher MBE (0.04) and RMSE (0.046) of RF_SMAP_L3_P RZSM than SMAP L4 RZSM (MBE = 0.030, RMSE = 0.046), RF_SMAP_L3_P RZSM had the highest R and the lowest ubRMSE (<0.04 m 3 /m 3 ) among all RZSM products; we believe that RF_SMAP_L3_P RZSM captured the temporal variation of in situ measurements best.Figure 11 shows the boxplots of evaluation metrics over all in situ stations.Generally speaking, the accuracy of RZSM estimates decreased with soil depth.The evaluation metric of R ranged from 0.19 to 0.59 with an average value of 0.40 at depth of 20 cm, from 0 to 0.54 with an average value of 0.35 at depth of 40 cm, from −0.15 to 0.39 with an average value of 0.20 at depth of 100 cm.The average ubRMSE was 0.045 at 20 cm (ranging from 0.033 to 0.069), 0.046 at 40 cm (from 0.032 to 0.075), 0.041 at 100 cm (from 0.021 to 0.068).Table 3 provides a summary of evaluation metrics.Because floods and droughts occurred frequently in the Huai River Basin, we divided the research period into a flood period (May-September) and non-flood period (October-April) to investigate the performance of different RZSM products.Figure 13 displays the boxplots of R, MBE, RMSE and ubRMSE between in situ RZSM and model RZSM products over all in situ stations.RF_SMAP_L3_P RZSM obtained the highest R mean and lowest ubRMSE mean than the other products, followed by SMAP L4 RZSM.SMOS L4 RZSM performed worse than the former due to a lower R mean and higher ubRMSE mean, and the negative bias SMOS L4 RZSM indicated that it underestimated the subsurface soil moisture.All the evaluation metrics of SMAP L4 and RF_SMAP_L3_P RZSM in the flood period had better performance than that of the full period and non-flood period.In the flood period, the average R value was 0.564 with an SD of 0.101 for RF_SMAP_L3_P, 0.470 with an SD of 0.104 for SMAP.The average ubRMSE value was 0.033 with an SD of 0.007 for RF_SMAP_L3_P, 0.039 with an SD of 0.008.By comparison, among three research periods, SMOS L4 RZSM performed better in the non-flood period.In the non-flood period, the average R value for SMOS L4 RZSM was 0.200 with an SD of 0.167, and the average ubRMSE value was 0.037 with an SD of 0.007.For SMOS L4 RZSM, though the R of the non-flood period was higher than that of flood period, the same went for the SD, which might be caused by temporally uneven precipitation in the non-flood period.
drown by Reichle et al. [60].In general, all model products could roughly reproduce the in situ RZSM.It was clear that the SMOS L4 RZSM was constantly lower than in situ measurements, which was similar to the SMOS L3 SSM, and it didn't respond very well to precipitation.The underestimation of SMOS L4 SSM might be explained by the following two reasons.Firstly, SMOS L3 SSM underestimated the in situ measurements.Secondly, the root zone SWI derived from exponential filter was lower than that of the in situ measurements.SMAP L4 and RF_SMAP_L3_P RZSM both captured the temporal variation of RZSM better than SMOS L4 RZSM; both of them matched the high in situ measurements well, but slightly overestimated the low in situ measurements.In conclusion, though showing slightly higher MBE (0.04) and RMSE (0.046) of RF_SMAP_L3_P RZSM than SMAP L4 RZSM (MBE = 0.030, RMSE = 0.046), RF_SMAP_L3_P RZSM had the highest R and the lowest ubRMSE (<0.04 m 3 /m 3 ) among all RZSM products; we believe that RF_SMAP_L3_P RZSM captured the temporal variation of in situ measurements best.Because floods and droughts occurred frequently in the Huai River Basin, we divided the research period into a flood period (May-September) and non-flood period (October-April) to investigate the performance of different RZSM products.Figure 13 displays the boxplots of R, MBE, RMSE and ubRMSE between in situ RZSM and model RZSM products over all in situ stations.RF_SMAP_L3_P RZSM obtained the highest R mean and lowest ubRMSE mean than the other products, followed by SMAP L4 RZSM.SMOS L4 RZSM performed worse than the former due to a lower R mean and higher ubRMSE mean, and the negative bias SMOS L4 RZSM indicated that it underestimated the subsurface soil moisture.All the evaluation metrics of SMAP L4 and RF_SMAP_L3_P RZSM in the flood period had better performance than that of the full period and non-flood period.In the flood period, the average R value was 0.564 with an SD of 0.101 for RF_SMAP_L3_P, 0.470 with an SD of 0.104 for SMAP.The average ubRMSE value was 0.033 with an SD of 0.007 for RF_SMAP_L3_P, 0.039 with an SD of 0.008.By comparison, among three research periods, SMOS L4 RZSM performed better in the non-flood period.In the non-flood period, the average R value for SMOS L4 RZSM was 0.200 with an SD of 0.167, and the average ubRMSE value was 0.037 with an SD of 0.007.For SMOS L4 RZSM, though the R of the non-flood period was higher than that of flood period, the same went for the SD, which might be caused by temporally uneven precipitation in the non-flood period.

Conclusions
In this study, we assessed the utility of the recursive exponential filter method using in situ soil moisture measurements from 2015 to 2019 in the HRB.In addition, the impact of controlling factors on the unique parameter  was investigated.The filter method was applied at a watershed scale by using a new RF_SMAP_L3_P soil moisture dataset (0-5 cm) to estimate RF_SMAP_L3_P RZSM over the Huai River Basin.Finally, we evalu-

Figure 1 .
Figure 1.Location of study area (a) and distribution of in situ soil moisture stations in the Huai River Basin (b), which is divided into a grid of 0.25° × 0.25°.The soil moisture stations probes were buried at depths of 5, 20, 40 and 100 cm.

Figure 1 .
Figure 1.Location of study area (a) and distribution of in situ soil moisture stations in the Huai River Basin (b), which is divided into a grid of 0.25 • × 0.25 • .The soil moisture stations probes were buried at depths of 5, 20, 40 and 100 cm.

Figure 2 .
Figure 2. The left panel (a,b) shows the coefficient of variation (CV) of spatial soil moisture and mean spatial soil moisture at the depths of 5, 20, 40 and 100 cm evolve with time.The right panel (c,d) shows the frequency distribution of CV of temporal soil moisture and mean temporal soil moisture, respectively.

Figure 2 .
Figure 2. The left panel (a,b) shows the coefficient of variation (CV) of spatial soil moisture and mean spatial soil moisture at the depths of 5, 20, 40 and 100 cm evolve with time.The right panel (c,d) shows the frequency distribution of CV of temporal soil moisture and mean temporal soil moisture, respectively.

Figure 3 .
Figure 3. Cross correlation coefficients of soil moisture between near-surface (5 cm) and selected root zone depths (20, 40 and 100 cm) varying with lag time (from 0 to 50 days) at sample stations (a-i).

Figure 3 .
Figure 3. Cross correlation coefficients of soil moisture between near-surface (5 cm) and selected root zone depths (20, 40 and 100 cm) varying with lag time (from 0 to 50 days) at sample stations (a-i).

Figure 3 .
Figure 3. Cross correlation coefficients of soil moisture between near-surface (5 cm) and selected root zone depths (20, 40 and 100 cm) varying with lag time (from 0 to 50 days) at sample stations (a-i).

Figure 4 .
Figure 4.The left panel (a,c,e) shows the scatter plots between maximum cross correlation coefficient and average annual precipitation over in situ stations, the right panel (b,d,f) is for maximum cross correlation coefficient and average annual air temperature.

Figure 5 .
Figure 5. NSE between  and  at the depths of 20 (a), 40 (b), 100 cm (c) and for the profile average value (d). standardized by daily volumetric soil moisture at the depth of 5 cm is used to estimated  at the depths of 20 cm, 40 cm, 100 cm and for the profile average value.Each color represents a station.

Figure 5 .
Figure 5. NSE between SW I o and SW I m at the depths of 20 (a), 40 (b), 100 cm (c) and for the profile average value (d).SW I o standardized by daily volumetric soil moisture at the depth of 5 cm is used to estimated SW I m at the depths of 20 cm, 40 cm, 100 cm and for the profile average value.Each color represents a station.

Figure 6 .
Figure 6.Box plots of NSE (a-d), RMSE (e-h) and MBE (i-l) at depths of 20 cm, 40 cm, 100 cm and for the profile average value for the station-specific T, stations-averaged T and precipitation-based T.

Figure 6 .
Figure 6.Box plots of NSE (a-d), RMSE (e-h) and MBE (i-l) at depths of 20 cm, 40 cm, 100 cm and for the profile average value for the station-specific T, stations-averaged T and precipitation-based T.

Figure 7 .
Figure 7. Boxplot of  at the depths of 20, 40, 100 cm and for the profile average value for the station-specific  ."+" represent outliers of  .The  represented by each boxplot is divided into six parts by the minimum value, lower quartile (25th percentile), median (50th percentile), upper quartile (75th percentile), maximum value, respectively.

Figure 7 .
Figure 7. Boxplot of T opt at the depths of 20, 40, 100 cm and for the profile average value for the station-specific T opt ."+" represent outliers of T opt .The T opt represented by each boxplot is divided into six parts by the minimum value, lower quartile (25th percentile), median (50th percentile), upper quartile (75th percentile), maximum value, respectively.

Atmosphere 2023 , 22 Figure 8 .
Figure 8. Scatter plots between  and average annual precipitation over all stations at depths of 20 (a), 40 (b), 100 (c) and profile average value (d), each circle represents a soil moisture station, color lines represent the trends of the scattered points.

Figure 8 .
Figure 8. Scatter plots between T opt and average annual precipitation over all stations at depths of 20 (a), 40 (b), 100 (c) and profile average value (d), each circle represents a soil moisture station, color lines represent the trends of the scattered points.Atmosphere 2023, 14, x FOR PEER REVIEW 15 of 22

Figure 9 .
Figure 9. Left panel (a,c) shows the time series comparison between in situ SSM and model products (SMAP L4 and SMOS L3 SSM); right panel (b,d) is the scatterplots.Blue circle represents SMOS L3 SSM; green cross represents SMAP L4 SSM; cyan triangle represents RF_SMAP_L3_P SSM.

Figure 9 .
Figure 9. Left panel (a,c) shows the time series comparison between in situ SSM and model products (SMAP L4 and SMOS L3 SSM); right panel (b,d) is the scatterplots.Blue circle represents SMOS L3 SSM; green cross represents SMAP L4 SSM; cyan triangle represents RF_SMAP_L3_P SSM.

Atmosphere 2023 , 22 Figure 10 .
Figure 10.Left panel (a,c,e) shows the time series comparison between in situ RZSM and RF_SMAP_L3_P RZSM at depths of 20, 40 and 100 cm; right panel (b,d,f) is for the scatterplot.

Figure 10 .
Figure 10.Left panel (a,c,e) shows the time series comparison between in situ RZSM and RF_SMAP_L3_P RZSM at depths of 20, 40 and 100 cm; right panel (b,d,f) is for the scatterplot.

Figure 10 .
Figure 10.Left panel (a,c,e) shows the time series comparison between in situ RZSM and RF_SMAP_L3_P RZSM at depths of 20, 40 and 100 cm; right panel (b,d,f) is for the scatterplot.

Figure 11 .
Figure 11.The evaluation metrics of R (a), MBE (b), RMSE (c) and ubRMSE (d) at the depths of 20, 40, 100 cm between  normalized by in situ soil moisture and  estimated by RF_SMAP_P_L3 surface soil moisture.

Figure 11 .
Figure 11.The evaluation metrics of R (a), MBE (b), RMSE (c) and ubRMSE (d) at the depths of 20, 40, 100 cm between SW I o normalized by in situ soil moisture and SW I m estimated by RF_SMAP_P_L3 surface soil moisture."+" represent outliers of T opt .

Figure 12 .
Figure 12.Same as Figure 9, but for the RZSM (0-100 cm).Left panel (a,c) shows the time series of in situ observations and RZSM model products.Right panel (b,d) shows the scatterplot.

Figure 12 .
Figure 12.Same as Figure 9, but for the RZSM (0-100 cm).Left panel (a,c) shows the time series of in situ observations and RZSM model products.Right panel (b,d) shows the scatterplot.
when52]ere is a long time interval between the adjacent observations at time t n and t n−1 , K n approaches unity, and the new estimate SW I m,t n takes the value of the new observation ms t n[51,52].Finally, SWI values at different depths are rescaled to volumetric water content.

Table 1 .
Statistics of the characteristic time length T opt using near-surface SWI to predict SWI at various root zone depths and corresponding NSE values.

Table 2 .
Statistical metrics of NSE, RMSE and MBE at depths of 20, 40, 100 cm and profile average value, calculated with observed  standardized by daily volumetric soil moisture and predicted  from normalized near-surface soil moisture with the exponential filter.

Table 2 .
Statistical metrics of NSE, RMSE and MBE at depths of 20, 40, 100 cm and profile average value, calculated with observed SW I o standardized by daily volumetric soil moisture and predicted SW I m from normalized near-surface soil moisture with the exponential filter.

Table 3 .
Summary of metrics (R, MBE, RMSE and ubRMSE) between estimated SW I m and observed SW I o at depths of 20, 40 and 100 cm.