Retrieval and Multi-scale Validation of Soil Moisture from Multi-temporal SAR Data in a Semi-Arid Tropical Region

The current study presents an algorithm to retrieve surface Soil Moisture (SM) from multi-temporal Synthetic Aperture Radar (SAR) data. The developed algorithm is based on the Cumulative Density Function (CDF) transformation of multi-temporal RADARSAT-2 backscatter coefficient (BC) to obtain relative SM values, and then converts relative SM values into absolute SM values using soil information. The algorithm is tested in a semi-arid tropical region in South India using 30 satellite images of RADARSAT-2, SMOS L2 SM products, and 1262 SM field measurements in 50 plots spanning over 4 years. The validation with the field data showed the ability of the developed algorithm to retrieve SM with RMSE ranging from 0.02 to 0.06m/m for the majority of plots. Comparison with the SMOS SM showed a good temporal behaviour with RMSE of approximately 0.05 m/m and a correlation coefficient of approximately 0.9. The developed model is compared and found to be better than the change detection and delta index model. The approach does not require calibration of any parameter to obtain relative SM and hence can easily be extended to any region having time series of SAR data available. Remote Sens. 2015, 7 8129


Introduction
Knowledge of Soil Moisture (SM) is useful for climatology, meteorology, hydrology and agriculture [1].Wood et al. [2] pointed out the requirement of higher resolution (1 km 2 or finer) observations of land surface variables as one of six grand challenges ahead for hydrology.Estimation of SM over large region and time periods through in situ observations is cumbersome, while satellite remote sensing through optical or microwave sensors can estimate SM with a broad spatial coverage and repeat temporal coverage [3].The optical sensors are hampered by cloud cover, while the microwave sensors have all weather penetration capability.Passive microwave sensors (e.g., SMOS) provide SM globally at a relatively coarser spatial resolution and can not meet the requirement of higher resolution SM.Active microwave sensors (e.g., RADARSAT-2) can meet the requirement of higher resolution SM though with a relatively coarse temporal resolution (2-4 weeks) [4].
Retrieval of SM from active microwave is difficult as apart from SM, measured BC is also influenced by surface roughness and vegetation canopy structure [5].Several physically based, empirical and semi-empirical models have been developed to retrieve SM.Physically based models have been applied to retrieve the SM successfully [6,7], but these models require extensive information regarding the vegetation and soil surface, which might be difficult to obtain at a larger scale.Several empirical, regression or machine learning based models have been used to retrieve the SM with relative success (e.g., [8][9][10][11][12]).A major disadvantage of these models is the requirement of extensive in situ SM to calibrate the models.The resulting models are also site specific and can not be used for other climate, soil and vegetation conditions.An overview of the commonly used approaches for retrieving SM along with their physical principles, advantages and limitations can be found in [13].
For the regional estimation of SM, models based on change detection (CD) have been developed to retrieve SM by utilizing the temporal data [14][15][16].These models utilize the difference between a dry and wetter image BC to minimize the impact of soil and vegetation on BC.SM is retrieved by assuming a linear relationship between the difference in BC and the SM [10].The coefficients of the linear relationship are obtained by regression and require site specific calibration.By using a long term temporal data, [14] developed a model whose coefficients are related to the minimum and maximum observed SM, which are computed from the soil information and hence avoided the requirement of a site specific calibration.The delta index (DI) approach proposed by [16] normalizes the difference in BC by the BC in dry condition to obtain directly the SM without requiring any calibration and soil information.All these models assume a linear relationship between SM and the change in BC, which might not hold true in some cases (e.g., [6,11,17,18]).
In the current study, an empirical CDF Transformation (CT) is proposed to normalize the BC.CDF transformation is used here as an optimal multi-linear estimate between backscatter coefficient and surface soil moisture enabling the spatialization of the model over the area of interest with an intrinsic parameter estimation.The use of CDF approach not as a stochastic model but as an explicit model has been used in previous studies like [19] for normalization of incidence angle.The relationship between BC and SM may be nonlinear but it is a monotonic relationship.CT can convert a monotonic nonlinear relationship into linear and avoids the need of the assumption of linearity used in the CD based approaches.Another advantage of using empirical CT is that it does not rely only on one point (dry image) or two points (dry and wet image) to normalize the BC, rather it utilizes the entire time series for normalization.This minimizes the error introduced by the possible uncertainty in the dry or wet image.Similar to the CD approach, empirical CT also provides the relative SM, which needs to be converted into absolute SM by using the soil information and does not require site specific calibration.
The main objectives of the current study are to: (i) asses the accuracy of CT model retrieved SM at higher resolution from BC time series, (ii) comparison of the performance of CT model with existing similar approaches (CD and DI model) in retrieving SM at higher resolution, (iii) evaluate the impact of number of images available on the SM retrieval accuracy, and (iv) study the impact of uncertainty in the BC (arising by ignoring the temporal variation of soil roughness and vegetation) on the retrieved SM at higher resolution.The models are applied in retrieving SM using 30 images of RADARSAT-2 spanning over four years in an experimental watershed in South India.A multi-scale validation is performed: at local scale by comparing the retrieved SM with the field data (SM measured in 50 plots) and at coarse scale by upscaling and comparing with the available SM products from SMOS.

Study Area and Data Set
The study is applied over Berambadi watershed located near to Gundlupet town in Chamarajanagar district of Karnataka state in South India (Figure 1).It is part of the ongoing project "Assimilation of Multi-satellite at Berambadi watershed for Hydrology And land Surface experiments (AMBHAS)" (www.ambhas.com).The main objectives of AMBHAS is to validate remote sensing products and to integrate them into a land data assimilation system.For the present study, an area of 625 km 2 (25 km × 25 km) is selected around the Berambadi watershed.The study area lies in semi-arid climate zone with a mean annual dominant South-West monsoon rainfall of 800 mm.Based on the latest Köppen-Geiger updated world climate classification [20], the study area is classified as AWh (Equatorial, Desert/arid, Dry).It has Bandipur National park at its west boundary.The main three soil types in the area are comprising of black, red and rocky/weathered soils, as identified by the geophysical studies.These are representative of the soil types for granitic/gnessic lithology found in South India [21].The main land use classes encountered in the area are: dense/closed forest, scrub forest, land with scrub, Kharif (summer) crop, double crop and plantation.During Kharif (summer) and Rabi (winter) marigold, sunflower, finger millet, maize, garlic, sugarcane, sorghum, water-melon, lentils, and groundnut etc., are grown [22,23].Digital Elevation Model (DEM) obtained by processing the Cartosat-1 data at a spatial resolution of 30 m is available for entire India [24].The DEM for the study area is shown in Figure 1.The elevation shows a significant variation with the difference in minimum and maximum elevation being approximately 600 m.The soil map prepared by Karnataka State Remote Sensing Application Centre (KSRSAC), Bangalore, India based on panchromatic and Linear Imaging Self Scanner (LISS) III merged, Indian Remote Sensing (IRS) satellite images at the scale of 1:50000 is used in this study [25].The soil map of the study area is shown in Figure 2a.The soil in the watershed comprises of Sandy Clay, Clay, Sandy Clay Loam, Clay Loam, Sandy Loam and Loamy Sand.The land cover map available at a spatial resolution of 1 km from ECOCLIMAP which is used in the SMOS retrieval processor [26] divides the land cover in the study area into two classes, (i) nominal surface (low vegetation and bare soil) and (ii) forest (dense vegetation).The spatial distribution of nominal and forest surface is shown in Figure 2b.The left side of the study area, which is covered by Bandipur National park, is shown as forest surface, and the right side of the study area, which is mainly covered by crops or is barren, is shown as nominal surface.Using the Rosetta pedo-transfer function and fraction of clay, silt and sand, van Genuchten (VG) soil hydraulic parameters are computed for the study area [27].These are used to compute the wilting point and field capacity.Map of the field capacity and wilting point is shown in Figure 2c-d

Field Data Set
The SM is measured using the SM-300 theta probes (Delta-T devices; Delta-T Devices Ltd, Cambridge, UK), which measures SM averaged over 0 to 5 cm depth.Local calibration of the Delta-T device using 76 gravimetric measurements showed good performance with a correlation coefficient of 0.91 and RMSE of 0.031 m 3 /m 3 .SM is measured manually within a window of 3 hours of the satellite pass at three locations in each field plot, and these measurements are averaged to get the plot averaged SM.The SM is measured in 50 field plots in the watershed, and location of the monitoring field plots inside the watershed is shown in the Figure 1.The fields are chosen to be representative of the spatial heterogeneity of soil and crop types while maintaining a reasonable accessibility.The size of field plots varied approximately from 0.07 ha to 1.8 ha with an average value of 0.38 ha.Throughout the article, SM is presented in volumetric unit (m 3 /m 3 ).

Satellite Data Set
RADARSAT-2 is a C-band radar having a frequency of 5.40 GHz (wavelength equal to 5.6 cm).A total of 30 RADARSAT-2 images were acquired during the period from December 2009 to August 2013.Details (acquisition date, pixel/line spacing, pass direction, incidence angle) of the RADARSAT-2 images used in the study are given in Table 1.The local overpass time of ascending and descending direction is 6:30 PM and 6:30 AM respectively.All the images are fine quad polarization geocoded products.Incidence angle of the images varied between 21 to 25 degree.Lambert's law of optics is used to correct the BC for the incidence angle variation [28], which has been successfully used in previous studies (e.g., [29]).All the images are corrected to a reference incidence angle of 23 degree.Since, the incidence angle is already close to 23 degree, a minor correction having absolute maximum of 0.15 dB is observed.Median filter performs better for denoising salt and pepper noise and Wiener filter works better for Gaussian and speckle noise [30].Consequently, RADARSAT-2 data are filtered using the median and the Wiener filters with a 7 × 7 window size.The correction for topography is performed using the NEST toolbox [31].Radar shadow occurs when the local incidence angle is higher than 90 degree and layover occurs when the incidence angle is higher than local slope [32].No area is found to be affected by radar shadow.Approximately 0.4% and 1.2% area is found to be affected by layover for an incidence angle of 25 and 20 degree respectively, and is masked out.Plot scale BC is obtained by taking the median of the data sampled at 5 m grid.Throughout the article BC is presented in dB.For the computation of SM for entire study area, RADARSAT-2 images are aggregated to a grid size of 20 m.
SMOS-L2 version 551 SM products from ascending and descending overpasses are used in the study.SMOS is a L-band radiometer operating at a frequency of 1.4 GHz (wavelength equal to 21 cm) [26].The SM at a temporal resolution of 3 days and at a spatial resolution of approximately 40 km from SMOS is available from January 2010.The Discrete Global Grid (DGG) of the SMOS data products is Icosahedral Snyder Equal Area (ISEA) 15 km grid.The ID number of the DGG lying inside the study area is 3160498, which has the latitude of 11.765 N and longitude of 76.590 E. The SM data are filtered for Radio Frequency Interferences (RFI) by excluding data with RFI probability higher than 10%.Retrievals with higher uncertainty are also excluded using a threshold of 0.05 for the data quality index (DQX).3. Methodology

SM Retrieval Models
BC is a function of the SM, soil roughness and vegetation.Past studies have shown the existence of nonlinear relationship between SM and BC over bare soils [6,17] and over vegetated areas [11].A generalized nonlinear equation relating the BC to SM, as proposed by [18], can be written as, where, BC is the backscatter coefficient (dB) and SM is the soil moisture (m 3 /m 3 ).Subscript i is the index for the position in space and t is the index for the time.The parameters (P 1, P 2, P 3) are a function of soil roughness and vegetation [33].In this formulation, the parameters are a function of space and time, and rendering their calibration is difficult.In the present study, an alternative formulation is adopted to avoid the estimation of time and space variant parameters.For sparse vegetation and when the incidence angle is low, the effect of vegetation can be ignored [34,35].For the agricultural fields, the soil roughness is considered constant during the cropping period, and its effect is minimized by using the temporal data over the same field.Impact of these two assumptions on the SM retrieval accuracy is discusses in the Section 4.3.Under such conditions, Equation ( 1) can be re-formulated as, In this formulation, the parameters are only dependant on location (i).It should be noted that the parameters of Equation (2) does not vary with time, while that of Equation ( 1) are time variant.
Equaton ( 2) is a monotonically increasing function, which can be transformed into a linear function by CDF transformation (CT) [36].This is presented schematically in Figure 3. Since the parameters of Equation ( 2) are constant in time and vary only in space, the computation of CDF should be done using the temporal data over same spatial location.By doing the CDF transformation of Equation ( 2), the nonlinearity can be avoided.The CDF transformation of BC results in a variable, ranging between 0 to 1 (see Figure 3), which corresponds to the minimum and maximum relative SM respectively.Hence, this transformed BC can be assumed as the relative SM, where, RSM is the relative SM.
Relative SM can be converted into absolute SM using the minimum and maximum soil moisture data [14]: where, SM min,i and SM max,i are the minimum and maximum observed soil moisture for a location i.
After replacing the RSM i,t from Equation ( 3), the resulting equation to retrieve the SM from BC can be written as, In an arid/semi-arid climate, the minimum surface SM could be 0.5 of the wilting point (θ wp ) [37].In the humid climate, the minimum SM could be higher than the θ wp and can go up to the θ s (saturated SM).In the case of arid/semi-arid climate, the maximum surface SM could be approximated to the field capacity (θ f c ), given the fact that the satellite pass occurs approximately every four weeks, and it would be very unlikely that SM would be above the field capacity except few local fields.Hence, in semi-arid regions with high temporal dynamic of soil moisture due to high evaporation level and high intensity of rainfall event, the SM can be computed as, where BC is the backscatter coefficient (dB).The CDF of BC is computed using the temporal data over the same spatial location.Given a time series of BC data having same incidence angle along with the soil (θ wp and θ f c ) data, SM can be estimated using the Equation ( 6), which avoids the estimation of pixel specific parameters (P 1 i , P 2 i , and P 3 i ).Nonparametric kernel density estimator is used to compute the CDF of BC [38].
The suggested approach is compared with DI and CD approaches.The delta index model to retrieve the SM by [16] is expressed as, where, BC i,dry is the backscatter coefficient at a dry soil condition.SM can be retrieved using the change detection method [14] as, where, BC i,wet is the backscatter coefficient representing the wet soil condition.The highest and lowest observed value of BC i time series are used for BC i,wet and BC i,dry respectively [14].Similar to the CT approach, Equation ( 4) is used to convert the RSM into SM .

Multi-Scale Validation of RADARSAT-2 Retrieved SM
A multi-scale validation of the retrieved SM is performed using the field measured and SMOS SM.Since, the spatial resolution of the SM retrieved from RADARSAT-2 is finer (20 m) than the spatial resolution of the soil moisture from SMOS (40 km), considering that, the soil moisture from the RADARSAT-2 is up-scaled to the coarser scale.Since, RADARSAT-2 and SMOS have different bands and hence different penetration depths, they need to be compared and corrected for bias [39].
There are mainly three kind of heterogeneity (soil type, land cover and antenna footprint) within a SMOS node that impacts the up-scaling of soil moisture [40].The up-scaled soil moisture SM t can be computed as, where, SH is the combined effect of different spatial heterogeneities and is computed as, where, LC is the land cover value (0 for forest surface; 1 for nominal surface), CF is the clay fraction which is used to account for soil texture variability, M AF is the mean antenna footprint.The mean antenna footprint for SMOS has a value of 1 in the center of SMOS grid, and it decreases as the distance from the center of pixel increases, and has a value of around 0.5 at a distance of 20 km [40].
Table 2. Range (minimum-maximum) of spatial heterogeneity for up-scaling the RADARSAT-2 soil moisture using eight strategies.A value of 1 represents that spatial heterogeneity is neglected by assuming a constant value for all the pixels.
S.N.Mean Antenna Footprint Land Cover Soil Texture Eight strategies are used to up-scale the RADARSAT-2 soil moisture.They are presented in Table 2.For example, strategy 1 does not consider any source of heterogeneity and assumes a value of 1 for all the pixels for all source of heterogeneity, and strategy 6 consider the mean antenna footprint and soil texture while neglecting the land cover heterogeneity by assuming a value of 1 for all the pixels.All the eight sets of up-scaled RADARSAT-2 soil moisture product are compared with the SMOS soil moisture while separating ascending and descending passes.The best up-scaling strategy corresponding to lowest RMSE and highest correlation is identified.

Results and Discussion
All the four polarizations are used to retrieve the soil moisture using all the three models.First, results for HH polarization from the three (CT, CD and DI) model are described.Then, the results from CT models are discussed in detail for spatial and temporal behavior.Followed by the discussion of retrieval error from all the three models and for all the four polarizations, along with the sensitivity analysis (effect of number of available data and uncertainty in the BC).

Results for σ HH
The retrieved soil moisture from the three models are compared with field measured soil moisture for the 50 plots through Taylor diagram, presented in Figure 4.The standard deviations are represented through their ratio over the standard deviations of the observations, and hence should ideally be equal to 1.The observed correlation coefficient from all the three model is similar and range between 0.4 to 0.8 for most of the plots.CT and CD models showed the normalized standard deviation relatively closer to the ideal value of 1, while DI model showed relatively higher value for most of the plots.CT and CD models outperformed the DI model for almost all the plots.No significant difference is observed between the performance of CT and CD models.3. Histogram of these indices is presented in Figure 6.The retrieved soil moisture showed a good comparison with the field measured soil moisture.The bias showed approximately uniform distribution with a mean and median approximately equal to 0. Approximately half of the plots showed absolute bias lesser than 0.03.The nil average bias observed from 50 plots, can be assumed to hold good for the up-scaled soil moisture.Most of the plots (approximately 70%) showed RMSE lesser than 0.09, with the median equal to 0.07.Correlation for more than 80% of the plots is higher than 0.4, with a median of 0.61.Similar range of correlation is observed in [41] over National Airborne Field Experiment (NAFE) 2005 by retrieving the SM using the change detection approach with the ENVISAT Advanced Synthetic Aperture Radar (ASAR) Global Monitoring (GM) data which has the spatial resolution of 1 km.Though a slightly higher RMSE was observed in [41] with RMSE of 0.14 and 0.15 m 3 /m 3 for croplands and pastures respectively.Study by [42] using the change detection and ENVISAT ASAR GM data over Oklahoma also reported the same range of correlation.A relatively poor performance of the CT model in few plots (3,4,5,11,16,23,43,44) can be attributed to the fact that these plots often had extensive irrigation, sometimes resulting in ponding like condition.The ponding type condition decreases the roughness and hence decreases the BC, which is opposite of the general trend, i.e., increase in soil moisture results in the increase in BC.Another possible reason might be the enhanced impact of difference in sensing depth between field and RADARSAT-2 measurements in irrigated areas [43].Poor performance in the plot no.19 could be attributed to the fact that it is situated in a foothill.Though correction for topography is performed, it can still be improved by a higher resolution DEM.Most of the plots having higher bias showed a significant correlation coefficient suggesting the effect of scaling the relative soil moisture into the absolute soil moisture using the soil information.This is consistent with the finding of [14], where scaling the RSM into SM with the help of soil information resulted in a bias dependent on soil type.The major source of error in the soil information could be due to the erroneous delineation of the soil classes and error coming from the pedo-transfer function.A better soil map could reduce this bias and hence provide a lower RMSE.Spatial behavior of the retrieved soil moisture for all the 30 RADARSAT-2 images is shown in the Figure 7.The soil moisture shows a significant variability both in space and as well as in time, with the soil moisture ranging from 0.06 to 0.32 (m 3 /m 3 ).Relatively higher soil moisture is observed in the valley of the watershed.Agricultural part of the study area shows a relatively higher variation in SM compared to the forested part, which is consistent with the observed temporal variation (maximum − minimum) of 7.4 dB and 5.4 dB in σ HH for the agricultural and forested part of the study area respectively.The relatively low variation observed in the SM in forested part could be due to the low vegetation penetration capability of C band.Temporal behavior of SM is also compared with the district average monthly precipitation provided by the Indian Meteorological Department (IMD) (http: //www.imd.gov.in/section/hydro/distrainfall/districtrain.html).Figure 8 shows the temporal variation of the mean retrieved SM using the CT model along with the monthly rainfall.For all the four years, the peak rainfall is observed around the month of September-November and minimum rainfall around December-February.The monthly rainfall showed a bi-modal behavior, showing a second peak around the month of March-May.The cumulative annual rainfall for the year 2010, 2011, 2012 and 2013 are 900.5 mm, 758.8 mm, 529.5 mm, and 678.9 mm respectively.Year 2012 received lowest rainfall during the study period.The soil moisture showed a temporal behavior similar to the one shown by monthly rainfall with soil moisture for the December-March being relatively low.The soil moisture for the month of May-October was usually high except year 2012 which was a relatively dry year as shown by the monthly rainfall.A trend of decreasing soil moisture in the dry months (December 2009 to February 2010) is observed which is consistent with the fact that these months observed very low rainfall.Approximately, monthly rainfall lesser than 50 mm resulted in a decrease in soil moisture in time, and a monthly rainfall higher than 100 m showed the increasing trend.For the monthly rainfall between 50 mm and 100 mm, the trend was found to be dependent on the initial condition of soil moisture.

Evaluation in terms of Data Availability
All the three models considered in the current study rely on time series of images to retrieve soil moisture.It is thus desirable to know how the number of images available impact the retrieval accuracy.Soil moisture is retrieved with different window size ranging from 3 to 30 at an interval of 3. Similar window size is used to retrieve soil moisture from all the three models considered and for all the four polarization.Figure 9 shows the RMSE for the three model considered and for all the four polarization.Delta Index (DI) model performed relatively poor for all the four polarizations.Change detection (CD) and CT model showed similar performance at higher window size (more than 12 images).CT model outperformed the CD method for smaller window size (less than 12), which could be due to the fact that for CD only two data points are used for normalization while for CT entire time series is used for normalization.It is thus highly recommended to use the CT approach for retrieving SM when less than 12 images are available.The images used for retrieving the SM should encompass the temporal variability (dry and wet conditions) present in the SM.

Impact of Vegetation and Roughness
The three models considered in the study assume negligible impact of the temporal variation in the vegetation and soil roughness.For C-band, at an incidence angle of around 15 degree this assumption is reasonable [44,45].At an incidence angle higher than 15 degree, the temporal variation of soil roughness and vegetation might impact the BC.The impact will be relatively higher for a relatively higher incidence angle.Studies using the change detection approach observed that the major source of SM retrieval error is the noise of the backscatter measurements and the impact of seasonal vegetation cover is about one order of magnitude smaller.
Taconet et al. [46] suggested that for wheat canopy, at an incidence angle of 21 degree, BC requires a correction of 1 dB per 1 kg/m 2 .The biomass for the agricultural plots in the study area are less than approximately 3 kg/m 2 .A sensitivity analysis is performed to evaluate the impact of uncertainty in the BC on the soil moisture retrieval accuracy.The time series of BC is perturbed with random variable having normal distribution with zero mean and a given standard deviation.Same perturbed time series is used in retrieving the soil moisture from the three models.Figure 10 shows the RMSE in retrieving the soil moisture for different level of standard deviation (std.) of error in the BC for the three models and all the four polarization.DI model showed significant increase in the RMSE with the increase in the standard deviation of error, which could be due to the reason that it normalize the data with one data point and any error in that particular data point will introduce the error in entire time series of retrieved soil moisture.CD and CT showed the similar behavior showing a marginal increase (RMSE increased approximately from 0.07 to 0.10 for the increase in standard deviation of error from 0 to 3.5 respectively) in the RMSE with the increase in standard deviation of error.The retrieval of soil moisture can be improved by accounting for the contribution of vegetation in BC by using Water Cloud model [47].Such a correction requires crop specific parameters, and a crop map at the watershed scale.Since, no crop map is available for the study area, and data only from the lower incidence angle is used which can penetrate the modest vegetation, no correction for vegetation is applied in the current work.Kim et al. [48] estimated the vegetation water content from Radar Vegetation Index (RVI) based on a linear regression model.The RVI is less impacted by environmental condition [49], and is defined as, where, σ is BC in linear unit and subscripts refer to the polarization.RVI can be used as a proxy to quantify the impact of higher and lower stages of vegetation.To quantify the effect of the different stages of vegetation, the RMSE for the retrieved soil moisture is computed for the lower and higher values of RVI.For each plot, RVI is classified into two groups by using the threshold of median.Then, RMSE are computed separately for these two group for all the 50 plots.Two tailed Welch's t-test is used to compare if the expected value of RMSE for lower and higher vegetation are statistically identical.The computed t-statistic for the test is 1.586 with p-value of 0.116.It means that at 5% significance level the mean are identical.This could be explained by a relatively lower incidence angle of RADARSAT-2 images used in the study.It can be concluded that the error introduced by not correcting for vegetation might be bringing relatively lower uncertainty compared to other sources (e.g., soil information).RMSE for each image is computed by comparing the retrieved SM from HH polarization using the CT model.Temporal behavior of the computed RMSE along with the mean (averaged over 50 plots) RVI is presented in Figure 11.Mean RVI shows the seasonal cropping patters observed in the study area, with peak observed around September in all the four years.A moderate correlation of 0.45 is observed between the time series of RVI and RMSE.This could be probably due to the presence of different types of vegetation resulting in differences in scattering behavior, including other factors (soil roughness, soil parameters and soil wetness).However, a relatively higher value of RMSE is observed when there is a seasonal peak in the RVI.This suggests that the moderate relationship between the RVI and RMSE can not be used to quantify the retrieval error, however the RVI can be used to identify the images having the relatively higher vegetation biomass and mask them.Four images belonging to the seasonal peak value of mean RVI are masked which have RMSE of 0.094, 0.086, 0.079 m 3 /m 3 .Using Integral Equation Model (IEM) [6], the behavior of σ HH is modeled in terms of incidence angle and soil roughness mean standard deviation height (H rms ) by assuming the exponential correlation function and a correlation length of 6 cm and SM equal to 0.20 m 3 /m 3 .H rms is assumed to vary from 0.6 to 1.2 cm.Simulated behavior of σ HH is shown in Figure 12.It can be seen from the figure, the minimum variation in the σ HH due to the H rms is observed at an incidence angle of about 17 degree, and higher or lower incidence angle than this results in a relatively higher variation of σ HH .At an incidence angle of 23 degree, which is used in the current study, a saturation of σ HH is observed at higher value of H rms than 1.0 cm.A variation of 2.12 dB in σ HH is observed for H rms varying from 0.6 to 1.2 cm, and the variation reduces to 0.5 dB for the range from 0.8 to 1.2 cm.It should be noted that though the absolute value of BC depends upon the SM, the difference in BC due to the H rms is independent of SM [50].A similar impact of variability in soil roughness was observed by using the IEM model [51].
From Figure 10, it can be inferred that an error of 1 dB standard deviation in BC results in increase in RMSE by 0.01 m 3 /m 3 approximately.Often agricultural field experiences H rms > 0.6 cm [52-56], except soil with a higher (more than 50%) slit content as reported in [50,57].In the study area, all soil tillages correspond to medium or high roughness and therefore the impact of temporal variation in soil roughness can be neglected.To investigate the validity of assumption of maximum SM being equal to field capacity and minimum SM being equal to 0.5 of wilting point, frequency distribution of field measured SM is analyzed.It is observed that the field measured SM remains lower than the field capacity for 92% of the cases, and higher than the 0.5 of wilting point for 87% of the cases.A possible reason for SM falling outside the range of 0.5 of wilting point and field capacity could be the calibration error of 0.031 m 3 /m 3 observed in the theta probe.For few field plots which were irrigated close to the satellite acquisition date, SM exceeded the field capacity and is observed to be closer to saturation value.The assumption that maximum soil moisture can be assumed to be equal to the field capacity might not hold true in the case of very dry or very wet region, and would need suitable change in the maximum soil moisture which can be calibrated as a function of field capacity for that region.Moran et al. [4] reported that the use of difference between dry and wet season BC can normalize the effects of surface roughness and topography.In the current study, since a CDF approach is used, which normalize the BC not only using the wet and dry season BC but using the entire measured time series, the impact of surface roughness and topography can be reduced considering that the RADARSAT-2 images are at approximately same incidence angle.The CT approach has the advantage that it can work in the case of relationship between SM and BC being nonlinear and avoids the identification of dry and wet season images.The CT approach also explicitly account for the soil type, by normalizing using the field capacity and wilting point map.This is similar to the normalization performed by Dobson and Ulaby [58] using the field capacity only, the current approach being different in the sense that it uses wilting point also for the normalization.The normalization with the wilting point ensures the lower range of soil moisture into a physical range.Since current approach does not require any calibration parameter, it has the potential to be applied globally in the case of modest vegetation.

Validation at SMOS Scale
The soil moisture retrieved from RADARSAT-2 is up-scaled to the coarse scale to compare with the soil moisture from SMOS.The up-scaling is performed for the eight strategies mentioned in Table 2.The correlation coefficient and RMSE for the comparison are shown in Figure 13.To analyze the impact of SMOS pass direction (ascending/descending) the comparison is performed separately for only ascending pass data, for only descending pass data and for all the passes.The descending pass of SMOS showed a better comparison (a relatively higher correlation coefficient and a relatively lower RMSE) for all the eight strategies.Strategy number 3, 4, 7 and 8 outperformed the other strategies (a relatively higher correlation coefficient and a relatively lower RMSE) and the difference among these four best strategies (3, 4, 7 and 8) is very minimal.The best strategies showed a RMSE of approximately 0.051 m 3 /m 3 and a correlation coefficient of approximately 0.88.The common factor among these four strategies is land cover, which means that the land cover is the significant factor while aggregating the soil moisture from the scale of RADARSAT-2 to the scale of SMOS.This is consistent with the fact that SMOS level 2 did have the contribution from the forest area [59,60].The under performance of ascending overpass is explained by the presence of radio frequency interferences.It is useful to note that this result can not be generalized for other regions.
The comparison of RADARSAT-2 retrieved SM with SMOS retrieved SM offers the opportunity to make a multi-scale (local to regional) validation with independent sources.However, the spatial resolution and representative depth is different.SM retrieved from RADARSAT-2 is upscaled based on the approach proposed by Al Bitar et al. [40] to bring it to the spatial resolution of SMOS.Representative depth of SM from RADARSAT-2 is typically representative of 0-2 cm [7] and from SMOS is typically representative of 0-5 cm [61].Other studies associate more a 0-3 cm representative depth for soil moisture from L-band radiometers (SMOS, SMAP, Aquarius) [62,63].The difference in representative depth might introduce the difference in estimated SM [64,65].Nevertheless, the correlation between soil moisture at these depths is high [66].To remove the impact of representative depth a bias correction using CDF matching is performed.In addition, at L-band and in passive mode (radiometer) the impact of vegetation is expected to be lower than C-band acquisitions [67].Finally the comparison at SMOS scale offers the opportunity to make a multi-scale (local to regional) validation with independent sources.Figure 13.Comparison of the RADARSAT-2 soil moisture aggregated using eight strategies with the SMOS retrieved soil moisture in the ascending, descending and both the passes.

Conclusions
A methodology is developed to retrieve soil moisture from active microwave based on the CT approach.This approach requires a set of several backscatter images over time and soil (wilting point and field capacity) map but doest not require calibration of any parameter for a semi-arid region.The CT approach to retrieve soil moisture from active microwave is validated against the field data collected during 30 campaigns (concurrently with RADARSAT-2 overpass) in 50 monitored field plots over a span of four years.The mean RMSE is around 0.04-0.06m 3 /m 3 .Some of the plots showed a significant bias though having a good correlation due to the error in soil information.A better soil map could highly improve RMSE.The model is based on the assumption that minimum soil moisture is half of the wilting point and maximum soil moisture is equal to the field capacity in semi-arid region.This assumption might not hold true for very dry or very wet regions, and might require calibration of the minimum and maximum soil moisture in terms of the wilting point and field capacity.A relatively higher RMSE is observed when the vegetation is at peak, and such images can be masked using the RVI data or the crop cycle information.The impact of soil roughness variation can be ignored when H rms is higher than 0.8 cm all the time.
The CT model is compared with the change detection and delta index models.CT model is found to be significantly better than delta index model.The performance of CT and CD is similar when sufficient data are available, but CT model outperformed the change detection model when relatively lesser number of data (less than 9 images) are available.This suggest the use of CT model for the region with sparse temporal data.
The impact of various source of spatial heterogeneities (soil texture, land cover and mean antenna foot print) is tested on the soil moisture aggregation from the scale of RADARSAT-2 to the scale of SMOS.It is found that there is significant impact of heterogeneity on the aggregation.Up-scaling with land-cover is impacting most with RMSE of approximately 0.05 and correlation coefficient of approximately 0.9.
This study highlights the importance of testing various spatial heterogeneities on the aggregation scheme.The impact of land cover on the aggregation can not be generalized for other regions as it depends on several factors.The quality of the coarse scale microwave data from SMOS depends on many effects like the presence of Radio frequency interference, the surface of water bodies in the area,the freezing of the soil, the presence of snow, the distance from the coast and the density of the vegetation cover [40,68].Thus, even though the applied methodology for testing the impact of the heterogeneity is simple, it needs to be considered with care as it may prove inapplicable in certain conditions.

Figure 1 .
Figure 1.Location of the study area inside India (shown in inset) and Digital Elevation Model (DEM) of the study area along with the location of field plots and SMOS DGG.
. The wilting point varied from 0.10 to 0.16 m 3 /m 3 , and the field capacity varied from 0.20 to 0.32 m 3 /m 3 .(a) Soil classes (b) Land-cover (c) Wilting point (d) Field capacity

Figure 2 .
Figure 2. Soil and land-cover map for the study area.In the land-cover, NO is the nominal surface (bare soil and low vegetation) and FO is the forest.(a) Soil classes, (b) Land-cover, (c) Wilting point, (d) Field capacity.

Figure 3 .
Figure 3. Transformation of monotonically increasing nonlinear function to linear function using the CDF transformation.Here X represents the SM, Y corresponds to backscatter coefficient, and F Y (Y ) is the CDF transformed backscatter coefficient.

Figure 4 .
Figure 4. Taylor diagram comparing the retrieved soil moisture from CT, CD and DI model with the field measured soil moisture in 50 plots.

Figure 5 .
Figure 5. Scatter plot of the soil moisture retrieved using the CT model from the RADARSAT-2 against the field measured data in 50 plots.

Figure 6 .
Figure 6.Histogram of bias, RMSE and correlation coefficient computed between soil moisture retrieved using CT and measured soil moisture.(a) Histogram of bias, (b) Histogram of RMSE, (c) Histogram of correlation.

Figure 9 .
Figure 9.Effect of window size on soil moisture retrieval accuracy (CT-CDF transformation based model, CD-change detection, DI-delta index).

Figure 10 .
Figure 10.Effect of error in backscatter coefficient on the retrieval accuracy (CT-CDF transformation based model, CD-change detection, DI-delta index).

Figure 11 .
Figure 11.Temporal behaviour of mean RVI, and RMSE computed by comparing the retrieved SM from HH polarization by using the CT model for each image.

Figure 12 .
Figure 12.Variation in σ HH in terms of incidence angle and H rms .

Table 1 .
Details of the RADARSAT-2 images used in the analysis.

Table 3 .
Model performance indices of the CT model using the RADARSAT-2 HH polarization for the 50 plots.Unit for RMSE and Bias is m 3 /m 3 .