1. Introduction
Surface soil moisture (θ
s) refers to the water held in the space between soil particles within the first few centimeters of the surface soil. This variable plays a fundamental role across spatial scales. At the plot and hillslope scales, it drives deeper-soil infiltration rates, runoff generation type and flux rate, soil evaporation, shallow-root plant transpiration, and surface energy flux partitioning [
1] among other processes. At the regional scale, it is a fundamental factor in sustaining and ending droughts but also in triggering or enhancing floods and mass movements [
1]. While soil moisture only accounts for a very small fraction (approximately 0.05%) of the total quantity of water within the global hydrological cycle, its uneven distribution (in space and time) plays a critical role in the climate and hydrologic systems [
1,
2]. Society depends on accurate measurements of soil moisture. Its correct estimation benefits precision agriculture through precise irrigation and fertilization [
3,
4]. At broader spatial scales, it enhances runoff and flood forecasting [
5,
6], drought monitoring and prediction [
7,
8], numerical weather forecasts [
9,
10,
11,
12], landslides [
13,
14], and wildfire predictions [
15,
16,
17].
Currently, soil water content estimates can be obtained through three primary approaches: (1) in situ measurements, (2) remote sensing retrievals, and (3) land surface model (LSM) outputs. In situ soil moisture measurements have the ability to provide high, spatial, and temporal resolution of soil moisture at different depths [
18,
19,
20]. There are several regional in situ networks designed for monitoring soil moisture within the United States, including the various state Mesonets, the Atmospheric Radiation Measurement Southern Great Plains (ARM-SGP), and the Soil Climate Analysis Network (SCAN) [
20]. Within the state of Oklahoma, in addition to the state-wide network, finer-scale networks, such as the U.S. Department of Agriculture Agricultural Research Service Little Washita and Fort Cobb networks, are designed to have a higher density of stations over a smaller spatial domain [
20]. Furthermore, field campaign activities, including the Southern Great Plains (SGP) hydrology experiments in 1997 and 1999, are sources of short-term, multiscale soil moisture measurements [
18]. Despite all these efforts in building field-, regional- and national-scale soil moisture networks, the number of stations and their spatial coverage are still very limited by their inability to provide spatial representativeness of neighboring areas due to the high spatial heterogeneity of soil moisture [
18,
19,
21]. One alternative to resolving the issue of complex spatial variability is the use of geostatistical techniques to interpolate (or extrapolate) in situ soil moisture measurements to neighboring areas. Nonetheless, results are often inaccurate when the spatial interpolations rely only on distance-related covariance functions [
8,
21,
22,
23,
24] which is usually the case.
Land surface models, on the other hand, can provide soil moisture estimates at various depths with fixed spatiotemporal resolution. Spatially, resolutions (usually ranging from 1 to 10
2 km
2 pixel size) are appropriate, but their model result quality is conditioned by the limited spatial resolution of the forcing inputs (e.g., remotely sensed precipitation fields of 1 km pixel size, hindering sub 1 km
2 variability) [
25]. Finally, remote sensing-based soil moisture products from various orbital sensors working on different spectral bands (e.g., microwave, thermal, and optical) provide global-scale soil water content measurements within the first 1–10 cm of soil depth (i.e., θ
s) [
25], commonly with 1–100 km
2 spatial resolution. Among them, microwave remote sensing techniques have gained momentum over the past 20 years with their advantages in the fast and extensive retrieval of θ
s [
26]. However, all microwave remote sensing soil moisture measurements using C, X, and L bands only measure soil moisture in the top five cm (or less) of the soil under low to moderate vegetation cover [
27]. In summary, each source of soil moisture observations has its strengths and weaknesses. However, none of them, at least by themselves, are adequate for providing accurate θ
s data since their performance differs across diverse spatiotemporal scales and landcover types. Therefore, it is novel and useful to combine these (or, in the future, other) three (or more) independent data sources to capitalize on their individual strengths across scales and land surface types.
Traditionally, θ
s evaluations are carried out through direct comparisons of satellite or LSM outputs against point ground observations. However, the accuracy of such comparisons is often limited by the spatial representativity of each (i.e., point versus pixel)) [
22]. Therefore, the metrics obtained from such comparisons may not truly reflect the error characteristics of the target soil moisture product. In response to this challenge, Scipal et al. [
9] first proposed to use the triple collocation (TC) error estimation technique in soil moisture applications. TC analysis is a method for estimating the random error variances of three spatially and temporally collocated measurement systems of the same geophysical variable without treating any one system as perfectly observed “truth”. Using the same assumptions as TC, McColl, et al. [
28] developed the extended triple collocation (ETC), which provides the Pearson correlation coefficient as an additional validation metric to the root mean square error. The ETC has been widely used in the validation of satellite-based soil moisture retrievals in recent years. For example, Chen et al. [
29] applied ETC-based validation techniques to the soil moisture active/passive (SMAP) Level 2 soil moisture product at five SMAP core validation sites and obtained an unbiased estimation of the satellite-versus-truth correlation metric. Chen et al. [
30] adopted the ETC and conducted a global-scale assessment and inter-comparison of the SMAP Level 3, soil moisture ocean salinity (SMOS) Level 3, and advanced SCATterometer (ASCAT) Level 2 soil moisture products. Wu et al. [
31] presented an ETC-based comprehensive assessment of SMAP, European Space Agency (ESA) Climate Change Initiative (CCI) Soil Moisture, and SMOS with in situ measurements in China. Xu et al. [
32] conducted a global scale ETC-based evaluation of eight root zone soil moisture products, including GLDAS Noah, ERA-5, MERRA-2, NCEP R1, NCEP R2, JRA-55, SMAP level 4, and SMOS level 4 datasets.
Due to the characteristics of the above-mentioned three main sources of soil moisture measurement (in situ, land surface model, and satellite), their data quality and representativeness vary over different land cover types. For example, the Oklahoma Mesonet site standards minimize the influence of urban landscapes, irrigation, forests, bare soil, fast-growing vegetation, and large bodies of water [
33]. It is suggested that vegetation at the site should be uniform and low growing, such as short grasses [
34]. Therefore, soil moisture measurements at the Oklahoma Mesonet sites may not well represent SM variations over bare soil, crops, forests, and other fast-growing vegetation. On the other hand, the vegetation classification of the NLDAS land surface model was derived from the global, 1 km, AVHRR-based, 13-class vegetation database of the UMD (Noah; [
35]). For each 1/8° grid cell, Noah uses the most predominant vegetation class [
36]. Xia et al. [
37] evaluated 20 years (January 1985–December 2004) of NLDAS-2 model-simulated soil moisture with in situ measurements over the continental United States and concluded that the performance for all models was better in the Southeast, Great Plains, Midwest, and Northwest, and lower in the Southwest and the Northeast with their dominant vegetation cover as forest, grassland, a mixture of cropland and grasslands, grassland, open shrubland, and forest, respectively. Zhang et al. [
38] conducted a comprehensive validation of the SMAP Level 3 SM product with ground measurements over varied climates and landscapes from 1 April 2015, to 31 March 2018. Results showed that SMAP level 3 SM products had better performance over grassland than over cropland. In summary, these three benchmark and popular soil moisture products (e.g., Mesonet, Noah, and SMAP) are subject to representation inadequacies over various geographic locations and land cover types, and a correct interpretation of their value requires an in-depth understanding of their scope. Therefore, there is a visible gap in the literature regarding independent evaluations of triplets (or more) of soil moisture products at the state or regional level to determine their value and, upon performance, explore the possibility of merging them into a better multi-source product that outperforms each one individually.
This manuscript is the first of a series of two with the overarching goal of cross-evaluating three θ
s products of different spatial resolutions, independently across various land cover types and climatic regions within the state of Oklahoma (U.S) to then capitalize on their value for a further multi-product merge. Specifically, this first article will conduct a comprehensive assessment of the satellite SMAP_L3 (SMAP), land surface NOAH model (Noah), and Mesonet soil moisture (Mesonet) at daily and seasonal timescales using the triple collocation method. The results of this study are expected to provide a basis for objective data merging to capitalize on the strengths of multi-sensor multiplatform θ
s products. The rest of the article is organized as follows:
Section 2 shows the details of the data and study area;
Section 3 describes methods and data processing;
Section 4 presents the results and analysis;
Section 5 provides a discussion; and
Section 6 offers some conclusions and suggestions for future work.
5. Discussion
The TC is a measurement assessment method for estimating the random error variances of three spatially and temporally collocated sampling systems of the same geophysical variable without treating any one system as perfectly observed “truth” [
52]. This method has been widely used in the validation of both satellite-based and model-output variables in recent years [
29,
30,
31,
32]. Despite the fact that the TC has been advancing, several knowledge gaps still exist in relation to its application in soil moisture measurements, including: (1) the lack of understanding of in situ soil moisture product representativity and seasonal performance variability; and (2) the influence of different land cover types on the data quality or simulation skill of each product. This article addresses these two knowledge gaps by conducting a comprehensive assessment of the satellite SMAP L3_SM_P_E, land surface NLDAS_NOAH0125_H, and Oklahoma Mesonet soil moisture products across the state of Oklahoma at daily and seasonal timescales using the TC method evaluated over different land cover types during more than four consecutive years of simultaneous measurements.
The period-integrated TC intercomparison results for Mesonet, Noah, and SMAP over nine Oklahoma state climate divisions (
Figure 4 and
Table 4) indicate that Noah provided the best performance in the central, northeast, and east-central climate divisions of the state. The same pattern was found in the seasonal TC intercomparison results during the fall season (
Figure 5,
Figure 6,
Figure 7 and
Figure 8). This suggests that it might be inappropriate to regard interpolated Mesonet measurements as the benchmark in the central, northeast, and east-central regions of Oklahoma. The reasons why Noah shows better quality and representativeness in these regions could be due to: (1) the Oklahoma Mesonet site standards require the sites to be far away from urban landscapes, irrigation, forests, bare soil, fast-growing vegetation, and large bodies of water to minimize those influences [
33,
34]. (2) The majority of land cover types in these climate divisions are grassland with urban landscapes, pasture/hay, and deciduous forest (
Figure 2), where Noah performs better. (3) The Mesonet product used in our TC intercomparison was interpolated from point measurements to match the spatial resolution of SMAP (9 km) using an ordinary kriging method [
21,
40]. This interpolation method did not consider auxiliary variables, including soil type, land cover, and topography, that affect the true variation of surface soil moisture. Therefore, our interpolated Mesonet product might not be able to well represent the true soil moisture geographical variations in the central, northeast, and east-central regions.
The seasonal TC intercomparison results for Mesonet, Noah, and SMAP over nine climate divisions (
Table 5,
Table 6,
Table 7 and
Table 8) indicate that in spring and winter, Mesonet has higher mean CC values than Noah, while Noah provides lower averaged RMSE values than Mesonet. According to McColl et al. [
28], this suggests that, while the interpolated Mesonet estimates of true soil moisture are noisier than those of Noah (making the Mesonet’s RMSE with the unknown truth slightly higher), the interpolated Mesonet has higher Pearson correlation coefficients with the unknown truth, a quantity that is proportional to the unbiased signal-to-noise ratio of Mesonet in the context of the TC method.
Both the period-integrated TC intercomparison and the seasonal TC intercomparison results show that SMAP exhibits the third-highest performance over all climate divisions across all seasons. The reasons for SMAP’s lowest performance could be due to: (1) Microwave remote sensing that is responsive to a surface (~5 cm) soil moisture in regions (as opposed to the 10 cm Mesonet and 0–10 cm integrated Noah sample depth) with sparse to moderate vegetation density [
46,
53,
54,
55]. Additionally, the wetter the soil, the shorter the soil sample depth, as the L-band microwave penetration appears to be affected by water content [
55,
56]. (2) There are challenges with retrievals in areas with complex topography, dense vegetation, near water bodies, or cities [
57,
58]. Finally, the stripe patterns shown in the CC
mean and RMSE maps for the SMAP assessment were found to be an artifact of the SMAP product gridding, possibly reinforced by the contrasting signal attenuation given by the strong west-to-east vegetation density gradient from pastures and sparse trees to dense forests [
46,
54,
55]. On a geographical basis, similar patterns of lower (higher) CC and RMSE are shared among the three products, meaning that their performance decreases (increases) in tandem with the TC’s unknown truth. Therefore, despite the relatively low performance of SMAP, we think its inclusion in a multisensory blend is beneficial due to the spatially consistent correlation structures presented with the other two independent products.
In terms of their performance over different land cover types (
Figure 9,
Figure 10 and
Figure 11), Mesonet provided the best estimates for volumetric soil moisture over shrub/scrub, grassland, and cultivated crops, because the Oklahoma Mesonet site standards minimize the influence of urban landscapes, irrigation, forests, bare soil, fast-growing vegetation, and large bodies of water [
33]. It is suggested that vegetation at the Mesonet sites should be uniform and low-growing, such as short grasses [
34]. Noah provided the best estimates of volumetric soil moisture over hay/pasture, deciduous forest, mixed forest, and evergreen forest.
Although we conducted and analyzed both the 6 a.m. and 6 p.m. results, only those corresponding to the 6 a.m. time stamp are shown in this study due to the high similarity of the comparative maps of the CCmean and RMSE. This effect obeys the combination of three factors: (1) consistent estimations without significant differences for the same day between 6 a.m. and 6 p.m. individually across systems (i.e., Mesonet, Noah, and SMAP), (2) temporal persistence of the surface soil moisture values across hours, making correlations and similar errors between 6 a.m. and 6 p.m., and (3) interstorm periods, including 6 a.m. and 6 p.m., being more frequent than storm periods and therefore, producing similar results for 6 a.m. and 6 p.m.
Some limitations of this study are: (1) the use of a low number of years (approximately four, dictated by the availability of SMAP) that might not statistically represent the interannual climatic variability of the study region and, therefore, its effects on the estimation of soil moisture, (2) the inherent, systematic, and perhaps correlated (nonrandom) errors across the different measuring platforms, and (3) the fact that the adopted Mesonet product was interpolated from point-scale ground stations to a spatial resolution of SMAP (9 km) using an ordinary kriging method. To minimize the possible negative effect of these limitations, future work could use more years of analysis, conduct a statistical independence test (outside the TC), and interpolate Mesonet with regression kriging approaches, including independent predictors, such as soil properties, land cover, topography, and precipitation to increase the accuracy of the interpolated product. Overall, this study provides a stepping stone for merging these three independent products by acknowledging that spatial representativity is important when wrongly assuming that a point-scale measurement can be up-scaled to a pixel-scale estimation and that different land cover types are critical drivers of soil moisture variability that entail the blend of multisensor products as opposed to a one-size-fits-all approach.