Improving Estimation of Soil Moisture Content Using a Modified Soil Thermal Inertia Model

There has been substantial research for estimating and mapping soil moisture content (SMC) of large areas using remotely sensed images by developing models of soil thermal inertia (STI). However, it is still a great challenge to accurately estimate SMC because of the impact of vegetation canopies and vegetation-induced shadows in mixed pixels on the estimates. In this study, a new method was developed to increase the estimation accuracy of SMC for an irrigated area located in YingKe of Heihe, China, using ASTER data. In the method, an original model of estimating bare STI was modified by decomposing a mixed pixel into three components, bare soil, vegetated soil, and shaded soil, as well as extracting their fractions using a spectral unmixing analysis and then deriving their fluxes. Moreover, the 90 m spatial resolution thermal images were scaled down to the 15 m spatial resolution by data fusion of a discrete wavelet transform (DWT) and re-sampling using the nearest neighbor method (NNM). The modified model was compared with the original model based on the mean absolute error (MAE) and relative root mean square error (RRMSE) between the SMC estimates and observations from 30 validation soil samples. The results indicated that compared to the original model based on the parallel dual layer, the modified STI model based on the serial dual layer statistically significantly decreased the MAE and RRMSE of the SMC estimates by 63.0–63.2% and 63.0–63.5%, respectively. The 15 m spatial resolution thermal bands obtained by the DWT data fusion provided more detailed information of SMC but did not significantly improve its estimation accuracy than the 15 m spatial resolution thermal bands by re-sampling using NNM. This implied that the novel method offered insights on how to increase the accuracy of retrieving SMC estimates in vegetated areas.


Introduction
Variation of soil moisture content (SMC) affects energy balance of land surfaces, soil erosion, and vegetation growth [1]. Thus, it is very important to retrieve the information of SMC. Traditionally, however, the information is often obtained using soil samples from sampled locations, which is time-consuming and costly. The traditional method also ignores the spatial distributions and patterns of SMC. On the other hand, remotely sensed images with spatiotemporal coverages of a study area provide the possibility for rapidly and cost-efficiently mapping SMC and monitoring its dynamics at a regional, national, and global scale [2,3]. For this purpose, substantial research has been conducted, and the used remote sensing data include optical and thermal infrared images and microwave observations to estimate spatial distributions of SMC over large areas [4][5][6].
Because the dielectric property of soils is closely related to SMC, various methods have been developed for estimating SMC using microwave images [7,8]. Microwave remote sensing is suitable for estimating large-scale SMC due to its capability of penetrating surface soil and being less disturbed by clouds and sparse vegetation cover [9,10]. However, microwave images are easily affected by ground roughness and dense vegetation canopy [11,12].
Moreover, optical and thermal image-based methods for mapping SMC became popular because the methods can provide maps of SMC with a wide range of spatial resolutions and a longterm history. Previous studies have shown that SMC has a linear or non-linear relationship with spectral reflectance of surfaces [13,14]. Thus, spectral-vegetation-index-based models are developed to estimate SMC. Gao et al. [13] used red-near infrared reflectance of soil to estimate SMC over a vegetated area. However, surface reflectance has a weak relationship with SMC when there is a dense vegetation cover. The methods are also limited by weather conditions such as clouds and night time.
There are also the methods that combine optical and thermal infrared (TIR) images to estimate SMC [15][16][17][18]. Thermal inertia is affected by the characteristics of surface layers, including thermal conductivity and capacity of heat storage. The heat energy from the surface layers is propagated to and stored in sub-surfaces during days and then returned to the surfaces during nights. Thus, the capacity of the sub-surfaces is characterized by thermal inertia. The surface heat capacity and thermal conductivity vary depending on SMC. The TIR methods are developed based on the sensitivity of land surface temperature (LST) to surface SMC [2]. These kinds of methods consist of 1) estimating LST using TIR images; 2) deriving soil latent and sensible fluxes using LST; 3) estimating soil thermal inertia (STI) using soil latent and sensible fluxes; and 4) developing an empirical model that accounts for the relationship of SMC with STI.
Based on soil thermal characteristics and infrared images, Watson et al. [19] proposed the first thermal inertia model. Based on a generalized theory in the study of Price [20], thermal inertia can be calculated using remote sensing data at a regional and global scale. The thermal properties of soils can also be estimated using the model proposed by Pratt [21] and the results are dependent on the changes in composition, porosity, and moisture content of the soils. However, this model is determined by many physical variables, and among the variables, the estimation of wind speed and humidity using remote sensing data is limited. Therefore, the model of Pratt [21] is hard to use. Moreover, by making an assumption that the coefficients of variation in energy fluxes into the atmosphere and the ground are constant, Price [22] proposed an Apparent Thermal Inertia model that can be only applied to bare soil dominant areas [23]. Based on the phase angle of diurnal temperature change, in addition, Xue and Cracknell [24] advanced the estimation of thermal inertia by presenting a real-time model.
The air-soil interface energy exchange implies a method of improving the estimation of surface STI using satellite data [25]. The estimation accuracy of thermal inertia and SMC can be further increased using the model of Zhang et al. [26] in which surface sensibility and latent fluxes with differential thermal inertia are taken into account. However, the model neglects the impact of vegetation and soil. In the model of Cai et al. [6], the maximum or minimum temperature is used to estimate thermal inertia, which makes it possible that the estimation is not dependent on the time for a satellite overpassing the study area. A relatively new thermal inertia model proposed by Liu et al. [27] introduced surface sensibility, latent fluxes and a parallel two-layer model into a thermal conductivity equation. Minacapilli et al. [28] conducted a laboratory experiment to evaluate the thermal inertia model for estimating surface SMC and found that this model offered great potential for mapping SMC and monitoring its dynamics using visible-near-infrared and TIR data. Although a great deal of research has been conducted and several thermal inertial models have been available for retrieving SMC [15][16][17][18], the models are only applicable in bare soil areas and lack the ability of accurately mapping SMC in the vegetated areas due to mixed pixels and vegetation-induced shadows.
The TIR-based methods have advantages of various spatial resolution satellite images available and relate SMC to thermal inertia. However, the weak relationship of TIR images with SMC in the densely vegetated areas impedes the applications of the TIR methods. So far, there have been no reports that deal with how to mitigate the impact of vegetation-induced shadows on estimation of SMC and improving the mapping of SMC in vegetated areas.
This study aimed to develop a novel method to generate the spatial distribution of surface SMC in both bare soil and vegetated areas by modifying the STI model presented by Liu et al. [27] and reducing the effect of vegetation-induced shadows on estimation of SMC. The proposed method was developed by decomposing mixed pixels into three components, bare soil, vegetated soil, and shaded soil, and taking into account the effects of vegetation and corresponding shadows on the estimation of SMC. The improved model was verified by comparing the measured and estimated values of SMC obtained using Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) data in a selected study area.

Materials and Methods
In this study, we proposed a modified STI model to improve the estimation of SMC. The flow chart of the method is shown in Figure 1. The methodological framework consists of data collection, data pre-processing, derivation of models, modification of the STI model based on decomposition of mixed pixels and derivation of components at a finer spatial resolution, estimation of soil sensible flux and soil latent flux, and estimation and validation of SMC using the modified model. In addition, a comparison of the modified model with the original model was carried out.

Study Site and Data
This study was conducted in an irrigated area located in Yingke of Heihe River Basin, Northwestern China (Figure 2a). This study area falls in the temperate zone characterized by a semiarid climate, where summers are hot, and winters are cold and dry. The high elevation and aridity lead to great variation of diurnal temperature with the mean annual temperature varying from 7.0 to 10.0℃ and the maximum temperature of 40℃. The annual precipitation is 140 mm, concentrating in May to October. Due to dry winters, snowing rarely occurs. This area was dominated by wheat, corn, and other crops ( Figure 2b).
In Figure 2c,d, the study area is shown using the ASTER images acquired on June 4 th and July 10 th of 2008, respectively. In this experiment, a total of 40 sample plots (Figure 2e and 2f) were selected from 4 sampled blocks of 180 m × 180 m (marked with 1, 2, 3, and 4 in Figure 1c and 1d). Soil samples were collected on June 4 th and July 10 th of 2008, respectively, which led to a total of 80 soil samples. The samples fell in in three types of land cover, including wheat, corn, and wheat/corn fields. The sample plots of 15 m × 15 m spatial resolution matched the pixels of the ASTER fusion data mentioned next in size. The locations of the plots were obtained using a global positioning system receiver. Within each of the sample plots, SMC was measured at 5 points in the field with one point at the plot center and other 4 at the middle locations of two diagonal lines from the plot center to four corners and the mean value of the SMC measurements from the 5 points was used as the SMC value of the sample plot. Based on previous experiments [29], a 20 cm depth of soil layer led to the stable measurements of SMC. At each point, SMC at a depth of 20 cm was thus measured in the field with time-domain reflectometry. The time-domain reflectometry sensor for collecting the measurements of SMC was calibrated according to the results of the study by Wu et al. [1]. There were 50 soil samples (purple plots in Figure 2e and 2f) chosen based on land cover types and used to compare and select the models that were used to account for the relationship between thermal inertia and SMC using a leave-one cross validation. The left 30 soil samples were used for validation.
Moreover, within each of the sample plots, an Analytical Spectral Device with a spectroradiometer of 15° and 25° field of views was used to measure hyperspectral reflectance values of the wavelength range from 300 nm to 2500 nm for different cover types. A thermo-radiometer working in the interval of 800 nm to 1400 nm and with a field of view of larger than 5° was used to measure the temperatures of various cover types. At the same time, the data for atmospheric, meteorological, and micrometeorological variables including wind speed, air temperature, and net radiation were collected in the field.
In this study, ASTER images dated on June 4 th of 2008 and July 10 th of 2008 were acquired with a total of 14 bands covering visible through TIR wavelength regions. Five 90 m × 90 m TIR bands and three 15 m × 15 m visible bands were used to estimate land surface temperature. The radiance calibration of each ASTER-TIR band q was carried out based on the equation L (i, j) = Gain × (DN (i, j) − 1), where i and j were the i th row and the j th column of the location in the image, Gain = 0.006882 , Gain = 0.006780 , Gain = 0.006590 , Gain = 0.005693 , and Gain = 0.0055 , and DNq(i,j) is the digital number value of the pixel (i,j) [30]. The atmospheric correction of the ASTER images including VNIR and thermal bands was conducted using MODTRAN4.0. The corrected ASTER-TIR bands at the spatial resolution of 90 m × 90 m were re-sampled using a nearest neighbor method (NNM) to create ASTER-TIR images at a spatial resolution of 15 m × 15 m, which kept the original ASTER-TIR values at the 90 m resolution.

Multiscale Soil Thermal Inertia Model
As a physical variable, STI characterizes the impedance to the variation of temperature. If a heat transfer is given, usually, the higher the STI values, the smaller the changes of temperature and vice versa. The STI is calculated as follows: where P, k, c, and ρ are the STI (J  m  s ⁄  K ), the thermal conductivity (J  m  s  K ), the specific heat of the material (J kg  K ), and the density (kg m ), respectively. Though STI in Equation (1) can be obtained from in situ measurements, it is difficult to be acquired for a large region. Remote sensing can provide the potential of obtaining large-scale STI values based on a conductive heat transfer equation.
The existing thermal models are based on the assumption of one-dimensional periodic heating of a uniform half-space. A diffusion equation (Equation (2)) with boundary conditions is utilized to estimate the soil temperature [19]: where T (x, t) and T (0, t) are the soil temperatures at the depth x and 0 below the surface and time t. The D (= ) represents the thermal diffusivity of the half-space. The A, S , C , and θ indicate the surface albedo, the solar constant, the atmospherically optical transmission, and the solar zenith, respectively. The A and B are dependent on the conditions of the atmosphere and surface with a linear relationship with the surface soil temperature: A + BT (0, t). In Equation (2), an assumption of a flat Lambertian surface underneath a horizontally homogeneous atmosphere is made. Carslow et al. [31] provided a solution of the thermal conduction Equation (2) for the temperature of the ground surface. Based on the ground surface temperature equation, Liu and Zhao [27] proposed a new STI model expressed as where LE, H, and w are, respectively, the bare soil latent, sensible fluxes, and the rate of the earth's rotation. T (0, t) is the surface bare soil temperature. Liu and Zhao [27] considered that each pixel consisted of the bare soil area and vegetated area. A parallel dual-source model was utilized to estimate the soil latent and sensible fluxes in bare soil areas, which neglects the shaded soil flux in the vegetated area. Thus, to improve the STI model so as to apply it to the vegetated areas, in this study, each pixel was decomposed into bare, vegetated, and shaded soil areas. A serial dual-source model was employed to reduce the effects of vegetation and obtain the estimates of the shaded soil latent and sensible fluxes in the vegetated areas. At the same time, to improve the estimation accuracy of the STI model for SMC, a discrete wavelet transform (DWT) was used to obtain a finer spatial resolution thermal inertia image. The STI could be written as where LE and H are, respectively, the soil latent and sensible fluxes in the soil and vegetated area, A is the surface soil albedo in the soil and vegetated area, and T (0, t) is the surface soil temperature.

Estimation of Finer Spatial Resolution Soil Temperature
Wavelet transform of ASTER thermal data The LST is a key physical parameter. To increase the estimation accuracy of the surface component temperatures, the DWT of ASTER TIR images was conducted, which led to finer spatial resolution TIR data. The DWT included the wavelet decomposition of the images and an inverse wavelet transform that resulted in a new image. The wavelet decomposition of an image means that the image is first divided into four components, including the localized high-frequency point features, the localized vertical features, the localized horizontal details, and the lower frequency features. The lower frequency features are then further split at higher levels of the decomposition. For the detail of DWT, readers can be referred to Chang et al. [32].
In this research, as examples three high-frequency components of the ASTER TIR band 14 and green band 1 are shown in Figure 3.  Figure 3k,l provide the localized horizontal features of the June 4 th and July 10 th images, respectively. The ratios of three high-frequency components between the visible green band 1 and TIR band 14 were, respectively, close to the constants of -0.178147, -1.302275, and -0.611796, implying that the high-frequency information of the ASTER TIR data was very similar to that of the ASTER green band data. In fact, the high-frequency components transformed from the 15 m ASTER green band were the same as those from the 90 m ASTER TIR band because the highfrequency components represent the detailed surface features in the whole study area. The 15 m spatial resolution ASTER-TIR products were obtained by fusing the three high-frequency data of the 15 m ASTER green band and the low frequency data of the 90 m ASTER-TIR image ( Figure 4). Compared with the original ASTER data, the fusion image showed more details of the surface features.

Retrieval of land surface component temperatures
Mixed pixels often contain multiple land cover types, which impedes the improvement of estimation accuracy of SMC. Although the ASTER-TIR images were scaled down from a 90 m spatial resolution to a 15 m spatial resolution, the impact of mixed pixels in the images on mapping SMC cannot be ignored. The temperatures of land cover types within a mixed pixel i would be integrated and determine the average temperature of the mixed pixel. The integration could be linear or nonlinear. For simplification, in this study it was assumed that the fluxes of the vegetated area (L ), bare soil (L ) and shaded soil (L ) in each mixed pixel had linear contributions to the heat flux (L ) of the mixed pixel. The area fractions of vegetation, bare soil, and shadow were derived using a linear spectral unmixing analysis. The LST was then obtained from the following equation [33]: where ε , ε , and ε , respectively, represent vegetation, bare soil, and shaded soil emissivity at each mixed pixel. The T , T , and T are the temperatures of pure vegetation, bare soil, and shaded soil within the mixed pixel, respectively. σ is the Boerziman constant (σ = 5.67 × 10 Wm K ). f and f are the fractions of the vegetated and bare soil areas within the mixed pixel. In this study, estimating the land surface component values of temperature and emissivity was conducted using a genetic algorithm (GA) as a nonlinear multi-parameter optimization problem. For the detail of GA, readers can be referred to Liu et al. [34].

Estimation of Soil Sensible Flux
In the vegetated area, a series dual-source energy balance model was driven to estimate soil latent and sensible fluxes. In this model, we considered water vapor and heat as two sources and supposed that the bottom water and heat only come through the top layer [35]. The water and heat were then superimposed on each other. Moreover, the surface net radiance flux, the surface sensible heat flux and the surface latent flux in the reference level were respectively obtained by summing the corresponding components at each layer ( Figure 5). T , T , and T are the atmospheric temperature at the reference level, the surface soil temperature, and the vegetation aerodynamic temperatures, respectively. The r , r , and r represent, respectively, the aerodynamic resistance, the resistance of the whole canopy boundary layer, and the aerodynamic resistance between canopy source and soil surface. e , e , and e represent, respectively, the water vapor pressure at the reference level, canopy height, and soil surface).
In the bare soil area (Figure 5a), the soil sensible heat flux (H ) was written as [36] where T is the bare soil temperature, T is the atmospheric temperature at the reference level, ρ and C are, respectively, the air density (1.2 kg • m ) and the air heat capacity (1003.2 J • kg ), and r is the aerodynamic resistance obtained using the empirical model of Sauer et al. [37].
In the series dual-source energy balance model, H is obtained by adding the soil sensible flux (H ) and canopy sensible flux (H ). Based on the resistance network shown in Figure 5, the soil sensible flux under vegetation cover (H ) is [38] H = H + H (7) where ρ and C are, respectively, the air density (1.2 kg • m ) and the air heat capacity (1003.2 J • kg ). H and H present, respectively, the canopy-covered soil sensible flux and the vegetation sensible flux. T , T , and T are, respectively, the air aerodynamic temperature in the canopy height, the canopy-covered soil, and vegetation aerodynamic temperatures. r and r represent, respectively, the resistance of the whole canopy boundary-layer and the aerodynamic resistance between canopy source and soil surface. Finally, the soil sensible flux (H ) was acquired by the following equation: H = H + H .

Estimation of Soil Latent Flux
The soil latent heat flux LE (= LE + LE ) was derived using the energy residual method, Equation (10) ( Figure 5) [35].
where the soil net radiance flux R (= R + R ) was estimated by where R and A are the surface net radiance flux and the surface albedo. S and C are, respectively, a solar constant (1367 Wm ) and the atmospheric optical transmission (0.562). The θ is the solar zenith (20°); R is the incident longwave radiation (Wm ) and is calculated according to the biomass equation (R = aT ) [39], where a = 5.3 × 10 Wm K , and T is the atmospheric temperature (=29°). ε ( = ε + ε ) is the surface emissivity . ε and ε are, respectively, the emissivity of bare soil and shaded soil. T (=T + T ) is the surface soil temperature. The surface flux exchange G (= G + G ) can be calculated with the model [40] G = β × R , and β = 0.2 [41] in this study.

Estimation and Evaluation of Soil Moisture
In this study, the STI model (Equation (3)) was modified to accurately derive the spatial distribution of SMC in both bare soil and vegetated areas, which led to the modified model (Equation (4)). The modification was conducted by taking into account the effect of vegetation-induced shadows on the estimation of surface SMC. The original model and the modified model were, respectively, used to generate the estimates of STI. On the other hand, the change of STI caused the change of SMC. Thus, the relationship of SMC with STI derived from ASTER images can be developed and used to create the estimates of SMC. In this study, four models, including a linear, logarithm, power, and exponential model [42], were compared and selected using a leave-one cross validation method [43] based on the field measurements of 50 sample plots in the study area. The derivation of STI and its relationships with SMC was conducted using the ASTER TIR images at the spatial resolution of 15 m × 15 m by the NNM re-sample method and the DWT data fusion, respectively.
The accuracies of SMC estimates obtained from the original and modified models with two images at the 15 m spatial resolution obtained using NNM and DWT, respectively, were evaluated. Integrating the original and modified models with two images led to four combinations, including 1) the original-model-based bare STI using the 15 m resolution TIR bands from NNM (OBSTI_NNM); 2) the modified-model-based STI using the 15 m resolution TIR bands from the NNM (MSTI_ NNM); 3) the original-model-based bare STI using the 15 m resolution TIR bands from the DWT (OBSTI_DWT); and 4) the modified-model-based STI using the 15 m resolution TIR bands from the DWT (MSTI_ DWT). The evaluations and comparisons were conducted using the coefficient of determination (R 2 ), RMSE, relative RMSE (RRMSE), and mean absolute error (MAE) between the estimated and observed values of SMC. In addition, the mean value of SMC predictions was also utilized. The mean absolute error was defined as the mean value of the absolute differences of the estimates from the observations. The RMSE was defined as the square root of the average squared differences between the estimated and observed values. The RRMSE indicated the per-unit RMSE in percentage.

Surface Albedo
The broadband albedo of land surface is an important variable in this study. At present, the conversion models from narrowband to broadband albedos [44] are not available. There are two alternatives to derive the total shortwave albedo, that is, using an empirical relationship of the surface total shortwave albedo with the spectral variables from remotely sensed images and utilizing the radiative transfer simulation with a number of surface reflectance spectra. In this study, the surface albedo was defined as the ratio of irradiance fluxes based on an assumption that narrowband albedos are linearly correlated with broadband ones. The surface narrowband albedos were first derived using the MODTRAN4.0 model and then converted to broadband albedos by taking the spectral distribution solar irradiance at the surface as a weighted function [33]. The obtained broadband albedo, α, was expressed as follows: = 0.468594 + 0.303217 + 0.228189 (12) where α , α , and α are, respectively, the narrowband albedos for ASTER band 1, band 2, and band 3. The albedo results are shown in Figure 6.

Soil Temperature
In Equation (5), the vegetation canopy fraction was estimated using ASTER band 1, band 2, and band 3 as follows [34]: where ρ is the spectral reflectance of a mixed pixel in the i th band, and ρ , ρ , and ρ are, respectively, the spectral reflectance values of vegetation, bare soil, and shaded soil under vegetation canopy for the i th band. In order to obtain more accurate spectral reflectance values, atmospheric corrections for the ASTER band 1, band 2, and band 3 were carried out by the Modtran model. Then, a linear spectral unmixing analysis was applied to derive the area fractions ( f , f , and f ) of vegetation canopy, soil, and shaded soil (Figure 7). Due to the lack of field measurements of three area fractions, a total of 32 sample areas was randomly selected. The spectral reflectance values of the mixed pixels were estimated based on the linear combination of the area fractions with the spectral values of the pure pixels as the training samples for three endmembers. The obtained estimates were compared with the actual values of the mixed pixels, and the determination coefficients were used as an indirect validation of the area fractions. It was found that the mean coefficients of determination varied from 0.78 to 0.85 for the ASTER band 1, band 2, and band 3 dated on June 4 th and July 10 th of 2008. This indicated that the area fractions obtained using the linear spectral unmixing analysis were potentially reliable. In order to obtain the unknown parameters(T , T , T , ε , ε , and ε )in Equation (5), the GA that solves nonlinear optimization problems was driven. The convergence property and speed of GA depended mainly on the given parameters, including the population size, mutation probability, crossover, etc. Because the input parameters of GA are not biophysically and mathematically meaningful, a random trial was utilized to search for the most accurate parameters [34]. In this study, the initial parameters of GA were set as follows: . The number of the maximum runs and the input parameters for the population size, crossover probability, and mutation probability were 250, 128, 0.9, and 0.02, respectively. The resulting temperatures are demonstrated in Figure 8.

Soil Latent and Sensible Fluxes
The soil thermal transfer equation was driven to obtain the surface soil inertia, which needed the soil sensible flux, soil latent flux, and net radiant flux. In the bare soil, the soil sensible flux was obtained from Equation (6). Here, r was estimated from an experimental model [37]: where h is the height for measuring air temperature and wind speed u above the canopy height of 2 m. As the roughness length for momentum transfer, z equals to 0.01 m. The measured wind speed above the canopy height is 3.92m/s. The bare soil sensible flux is shown in Figure 9a.
In the vegetated area, the shaded soil sensible flux was obtained from Equation (9) based on the TSEB described by Cammalleri et al. [35]. The r in Equation (9) was estimated using the model in the studies of Kustas and Norman and Kondo and Ishida [40,45]: where b was set up to be 0.012 for natural surfaces, c is the coefficient and equal to 0.0025 m • s • K / [46,47], and the measured surface wind speed (u ) is 1.2m/s in the experiment field. The shaded soil sensible flux is shown in Figure 9b. Thus, the total soil flux of the sensible heat for the whole surface was estimated as the sum of the shaded soil sensible flux and the bare soil sensible flux in Figure 9c. Moreover, Equation (10) and Equation (11) were respectively used to derive the values of soil latent flux and the soil net radiant flux that are shown in Figure 10 and Figure 11.

The Estimation of Surface Soil Moisture Content
Based on the obtained spectral albedo, sensible flux, latent flux, and temperatures, Equation (7) for soil sensible flux under vegetation cover and Equation (8) for canopy sensible flux were used to estimate the STI and the bare STI using two 15 m spatial resolution TIR images by NNM and DWT. In this study, the thermal inertia varied from 0 to 1. The spatial distributions of the normalized STI for four combinations formed by the original and modified STI models and two 15 spatial resolution images are shown in Figure 12. After STI was obtained using the downscaled images, the regional-scale SMC was generated using the obtained relationship of STI with SMC. In this study, the relationship was modeled and optimized by comparing a linear model, a logarithm model, a power model, and an exponential model in Table 1. A leave-one cross validation method was utilized to select the most accurate model. Table 1. The comparison of four models to estimate soil moisture content (SMC) using the 50 soil samples and leave-one cross validation for four combinations: OBSTI_NNM-the originalmodel-based bare STI using the 15 m resolution thermal infrared bands by re-sampling using the nearest neighbor method; OBSTI_DWT-the original-model-based bare STI using the 15 m resolution fusion thermal infrared bands; MSTI_NNM-the modified-model-based STI using the 15 m resolution thermal infrared bands by re-sampling using the nearest neighbor method; and MSTI_DWT-the modified-model-based STI using the 15 m resolution fusion thermal infrared bands. R 2 and RMSE (J • m • s / • K ): coefficient of determination and root mean square error between the estimates and observations of SMC.

Model
Linear For OBSTI_NNM and OBSTI_DWT, there were no statistically significant differences of RMSE among the four models, and the linear model led to slightly smaller RMSE value (Table 1). On the other hand, the exponential model resulted in the most accurate estimates of SMC for both MSTI_NNM and MSTI_DWT, and their RMSE values were statistically significantly smaller than those using the logarithm and power models but not than that using the linear model. Thus, the exponential mode most accurately accounted for the relationship of SMC with the modified-modelbased STI, while the linear model most accurately explained the relationship of SMC with the original-model-based bare STI. The results of the models are shown in Figure 13. Given the same model, the image obtained by DWT only slightly reduced the RMSE values compared with that by NNM. The spatial distributions of SMC estimates derived using the original and modified models with 15 m resampling and fusion spatial resolution TIR bands were compared in Figure 14. Given the original model, the SMC estimates from the image dated on July 10 th (Figures 14e and 14f) are overall greater than those from the image dated on June 4 th (Figures 14a and 14b), while for the modified model, the SMC estimates from the image dated on July 10 th (Figures 14g and 14h) are slightly smaller than those from the image dated on June 4 th (Figures 14c and 14d). In June and July, the study area was dominated by wheat and corn, respectively. The SMC varied very much depending on the canopy density of the crops. Generally, the crop-dominated areas had greater SMC values. In addition, precipitation also affected the SMC values. Given the same model (the original or modified model) and the same image date (June 4 th or July 10 th ), the spatial patterns of the SMC predictions looked similar (Figure 14a vs. Figure 14b, Figure 14c vs. Figure 14d, Figure 14e vs. Figure 14f, Figure  14g vs. Figure 14h,). However, the 15 m fusion image by DWT resulted in slightly more detailed information of SMC prediction. A total of 30 test soil samples was utilized to evaluate the SMC estimates from the ASTER data by calculating MAE and RRMSE between the observations and estimates in Figure 15. The sample mean of SMC was 23.57% with a standard deviation of 5.80% and a confidence interval of 21.56-25.58% at the significance level of 0.05. All the mean estimates of SMC statistically did not significantly differ from the sample mean, but the mean estimates of 23.85% and 23.58% from the modified model with the 15 m spatial resolution images by NNM and DWT, respectively, were closer to the sample mean than those from the original model. Moreover, the original model without the consideration of the effect of vegetation-induced shadows on estimation of SMC led to an MAE of 3.51% and 3.42% and RRMSE values of 18.95% and 18.60% for the images by NNM and DWT, respectively, while the modified model resulted in a corresponding MAE of 1.30% and 1.26% and RRMSE values of 6.92% and 6.89%. The MAE and RRMSE values from the original model were statistically significantly larger than those from the modified model at a significant level of 0.05. This implied that the modified model significantly increased the estimation accuracy of SMC compared with the original model. Given the same model (the original or the modified model), however, the estimation accuracies of SMC by the images from NNM and DWT were statistically similar to each other.

Discussion
The widely used Apparent Thermal Inertia model was developed to estimate SMC using remotely sensed images in bare soil or scarcely vegetated regions [18,23]. Due to its ignorance of the effect from vegetation canopy-induced shadows, it cannot be applied to accurately estimate SMC in the densely vegetated areas. The STI model proposed in this study significantly contributed to the effort being made to improve the applications of the Apparent Thermal Inertia model from bare soil and sparsely vegetated areas to densely vegetated areas by introducing a vegetation-induced shadow component into the modified model using a spectral unmixing analysis. Some other models (e.g., real thermal model) were also devised to acquire the estimates of SMC in vegetated areas [24,26,27]. However, the models did not consider the serial dual-layer effectiveness for vegetated areas, and thus, the estimation accuracy of SMC retrieved by the models for vegetated areas was limited. The new approach provides a promising solution because it does not only take into account the effect of vegetation canopy-induced shadows, but also considers the serial dual-layer effectiveness for vegetated areas and used more detailed soil temperature information derived from the downscaled thermal images.
Moreover, Zhang et al. [4] used the information of red and near-infrared bands to develop a normalized difference vegetation index (NDVI)-albedo model and to retrieve the estimates of SMC, and the results were highly consistent with the field measurements at 5 cm soil depth. However, the limited penetration ability of the spectral reflectance might degrade the quality of SMC estimates. Because the estimates of SMC from the NDVI-albedo model were related to NDVI change, it was difficult to accurately estimate SMC in the irrigated areas using the NDVI-albedo model according to the widely spread triangular scatter distribution of the albedo values against the NDVI values ( Figure  16). The approach proposed in this study improved the estimation of SMC in the irrigated soil areas because of using the thermal images and the consideration of the shadows under the vegetation canopies. There were also some simple methods for the estimation of SMC using the relationships of thermal image-derived LST with NDVI and other vegetation indices [4,48]. However, the LST-NDVIbased methods require a large number of SMC values that well characterize various land cover types, and the scatter plot of the LST against NDVI often has a wide spread (i.e., Figure 17) that cannot be utilized to accurately obtain the estimates of SMC. The proposed approach in this study modeled the relationship of SMC with STI and the values of STI were derived using the modified model in which the shadows of vegetation canopies were extracted by the shade fraction from the spectral unmixing analysis. That is, the improved inertial model (Equation (4)) theoretically takes into account bare soil, vegetated soil, and shaded soil to obtain the estimates of shaded soil latent and sensible fluxes in the vegetated areas. Thus, it has no limitations of empirical model data and can be applied to other areas with any kind of vegetation types and canopy densities. In this study, the values of SMC obtained by four inertial methods in bare soil regions varied relatively little. However, in the vegetated area, there was a great variation, and the values of SMC from the modified models MSTI_NNM and MSTI_DWT were closer to the field measurements than those from the original models OBSTI_NNM and OBSTI_DWT, which attributes to the consideration of shaded soil effects. This further indicated that the modified model was more reliable to retrieve SMC. Moreover, although the 15 m spatial resolution thermal bands from DWT showed more detailed information than those from NNM, given the modified or original model, the former only slightly decreased the values of MAE and RRMSE compared with those from the latter (Figure 15). That is, the 15 m spatial resolution thermal bands obtained by DWT did not significantly improve the estimation accuracy of SMC compared to the 15 m spatial resolution thermal bands by NNM. The main reason might be due to the flat study area with relatively simple and uniform topographic and crop canopy features, in which most of the wheat and corn areas had sizes of larger than 90 m × 90 m, and thus, the DWT made the spatial information more available but did not significantly change the values of the thermal band pixels compared with those from NNM.
It had to be pointed out that in the modified model of this study, the relationship of soil moisture with the modified model derived STI was most accurately fit by an exponential model. The exponential model showed a statistically significant relationship of using STI to estimate SMC, but lacked a clearly biophysical mechanism, which might have limited the capacity of its predictions in the case of no field measurements available. On the other hand, when the modified model is applied to other regions, its calibration based on some field measurements is needed. Moreover, the application of the proposed method was limited to wheat, corn, and wheat/corn fields, and the application of a future study should be expanded to other land cover types. In addition, to ensure the reliability of the data collected, the collection of data (such as spectroscopy and temperature measurements) was performed within a half hour before and after the satellite's pass. Thus, in the experiment we only measured a total of 30 sample points used for validation within the limited time.
In the future, more field measurements should be collected to further validate this method.

Conclusions
In this study, the modified STI model was developed to improve the estimation of SMC in the vegetated and irrigated region located in the basin of Heihe River, Northwestern China. The modified model reduced the impact of vegetation on the estimation of SMC by extracting the fraction of shadows due to vegetation canopies using a spectral unmixing analysis, which overcame the gap that currently exists in the original STI model. The results showed that compared to those from the original STI model, the modified STI model significantly decreased the MAE and RRMSE values of SMC by 63.0-63.2% for MAE and 63.0-63.5% for RRMSE ( Figure 15). The 15 m spatial resolution thermal bands obtained by DWT data fusion provided more detailed information of SMC but did not significantly improve the estimation of SMC compared to the 15 m spatial resolution thermal bands re-sampled by NNM. This study implied that the new approach can offer the great potential to increase the accuracy of retrieving SMC estimates in the vegetated areas.