SMAP Soil Moisture Product Assessment over Wales, U.K., Using Observations from the WSMN Ground Monitoring Network

: Soil moisture (SM) is the primary variable regulating the soil temperature (ST) differences between daytime and night-time, providing protection to crop rooting systems against sharp and sudden changes. It also has a number of practical applications in a range of disciplines. This study presents an approach to incorporating the effect of ST for the accurate estimation of SM using Earth Observation (EO) data from NASA’s SMAP sensor, one of the most sophisticated satellites currently in orbit. Linear regression analysis was carried out between the SMAP-retrieved SM and ground-measured SM. Subsequently, SMAP-derived ST was incorporated with SMAP-derived SM in multiple regression analysis to improve the SM retrieval accuracy. The ability of the proposed method to estimate SM under different seasonal conditions for the year 2016 was evaluated using ground observations from the Wales Soil Moisture Network (WSMN), located in Wales, United Kingdom, as a reference. Results showed reduced retrieval accuracy of SM between the SMAP and ground measurements. The R 2 between the SMAP SM and ground-observed data from WSMN was found to be 0.247, 0.183, and 0.490 for annual, growing and non-growing seasons, respectively. The values of RMSE between SMAP SM and WSMN observed SM are reported as 0.080 m 3 m − 3 , 0.078 m 3 m − 3 and 0.010 m 3 m − 3 , with almost zero bias values for annual, growing and non-growing seasons, respectively. Implementation of the proposed scheme resulted in a noticeable improvement in SSM prediction in both R 2 (0.558, 0.440 and 0.613) and RMSE (0.045 m 3 m − 3 , 0.041 m 3 m − 3 and 0.007 m 3 m − 3 ), with almost zero bias values for annual, growing and non-growing seasons, respectively. The proposed algorithm retrieval accuracy was closely matched with the SMAP target accuracy 0.04 m 3 m − 3 . In overall, use of the new methodology was found to help reducing the SM difference between SMAP and ground-measured SM, using only satellite data. This can provide important assistance in improving cases where the SMAP product can be used in practical and research applications.


Introduction
Soil moisture, particularly surface soil moisture (SSM), is a very important environmental parameter playing a key role in a number of physical processes in the Earth system, including water and carbon cycles, affecting the climate directly or indirectly [1]. It has Catchment land surface model. Srivastava et al. [35] provided a detailed evaluation of land surface models and satellite soil moisture from Soil Moisture and Ocean Salinity (SMOS) through data fusion for improved predictions of catchment-based soil moisture deficit (SMD). Similarly, another study by Srivastava et al. [36] showed the performance of several machine learning algorithms for the prediction of catchment SMD using SMOS and land surface temperature derived from the Moderate Resolution Imaging Spectroradiometer (MODIS) with improved performance for SMD estimation during validation.
Petropoulos et al. [30] evaluated the SMOS soil moisture product accuracy using in situ soil moisture observed by the REMEDHUS International Soil Moisture Network (ISMN) at different influencing parameters, including seasonality and radio frequency. The highest accuracy of SMOS soil moisture was achieved in the autumn season followed by summer, winter, and spring seasons, and reduced the radio frequency interference (RFI) effect by filtering out the high RFI fraction. Petropoulos et al. [31] assessed the accuracy of the SMOS global operational product of soil moisture for different seasons and land cover patterns. Validation was performed with the in situ observations taken by the CarboEurope ground observational network. They also evaluated the land cover, seasonality, and RFI effect on SMOS product.
The National Aeronautics and Space Administration (NASA) launched the Soil Moisture Active Passive (SMAP) satellite on 31 January 2015. SMAP consists of L-band active (radar) and passive (radiometer) microwave instruments [34]. It provides the daily global SSM with a depth of 0-5 cm at 10 km and 40 km spatial resolution. Colliander et al. [37] selected the core 34 validation sites for the SMAP SSM product validation. Authors have reported an RMSE between observed SM and SMAP radiometer-based SM, observed SM and radar-based SM, and observed SM and combined radar-radiometer SM of 0.04 m 3 m −3 , 0.06 m 3 m −3 , 0.04 m 3 m −3 , respectively. Chan et al. [38] enhanced the spatial resolution of SMAP passive SM product from 36km to 9km by the Backus-Gilbert optimal interpolation technique using antenna temperature (Ta) data in original SMAP Level 1B Brightness Temperature. The enhanced brightness temperature was used as a baseline for the SM retrieval algorithm. The enhanced SM was compared with in situ data for different seasons and biomes. RMSE, correlation coefficient and bias were found to be 0.040 m 3 m −3 , 0.80 and −0.015 m 3 m −3 between developed enhanced SM (with 9 km grid) and soil moisture product (with 36 km grid). However, to the best of our knowledge, there are very few studies concerned so far with the evaluation of this product in an oceanic climatic zone and also outside of United States-based validation sites. This is despite the availability of suitable validated observations from global ground observational networks. A particularly useful area to perform such investigation is Wales, in the United Kingdom, because it would help understand how useful EO-based SSM products would be for a wide range of purposes, such as livestock protection, yield prediction, flood forecasting and human health. One such network providing validated observations suitable is the automated Wales Soil Moisture Network (WSMN) [28].
In purview of the above, the present study objectives are to: (1) evaluate the SM and surface temperature (ST) data using in situ measurements from WSMN and SMAP satellite data; (2) assess the effect of seasonality of SM and ST using SMAP satellite data for annual, growing and non-growing seasons; and (3) develop a model to retrieve SM using SMAP data based on multiple regression analysis, and evaluate SM prediction accuracy by this model at the WSMN.

Study Area and InSitu Datasets
WSMN is an in situ data network located in west Wales, U.K., covering a wide area with latitude 51.03611 • to 53.65022 • N and longitude −5.64258 • to −1.90814 • W. Wales is a mountainous country situated on the western side of central southern Great Britain. Different weather conditions are present in Wales, U.K., throughout the year, such as clouds, winters, and warm summers. The average annual temperaturesare10 • C and15 • C Sustainability 2021, 13, 6019 4 of 18 in winter and summer, respectively. The region on which WSMN is installed has a wide range of rock types and a variation in climate [39].
WSMN was established between 2009 and 2013 with 9 different stations near the area of Aberystwyth in Wales. WSMN currently consists of 9 stations spread across 5 sites ( Figure 1) situated in the wider area of the region. A detailed description of the site characteristics composing the ground monitoring network can be found in [28]. Briefly, Sites 1 and 2 are agricultural grassland sites near to the Gogerddan campus of Aberystwyth University. Sites 3 and 4 are of bioenergy crops at the northern outskirts of Aberystwyth. Site 5 is situated under grassland of Miscanthus plots. Site 6 is situated under the willow at the eastern edge of Aberystwyth, near Llanbadarn campus of Aberystwyth. Sites 7, 8 and 9 are situated at grassland on the Pwllpeiran Research Farm, near to the Devil's Bridges. Time domain reflectometry (TDR) has been used for the measurement of soil moisture. TDR is installed horizontally at 5-10 cm depth in the soil surface [28]. In particular, soil moisture sensor depths are 5 cm, 5 cm, 5 cm, 5 cm, 10 cm, 10 cm, 10 cm, 10 cm, and 5 cm for site 1, site 2, site 3, site 4, site 5, site 6, site 7, site 8 and site 9, respectively. A detailed description of test sites is provided in Table 1.

Study Area and InSitu Datasets
WSMN is an in situ data network located in west Wales, U.K., covering a wide area with latitude 51.03611° to 53.65022° N and longitude −5.64258° to −1.90814°W. Wales is a mountainous country situated on the western side of central southern Great Britain. Different weather conditions are present in Wales, U.K., throughout the year, such as clouds, winters, and warm summers. The average annual temperaturesare10 °C and15 °C in winter and summer, respectively. The region on which WSMN is installed has a wide range of rock types and a variation in climate [39].
WSMN was established between 2009 and 2013 with 9 different stations near the area of Aberystwyth in Wales. WSMN currently consists of 9 stations spread across 5 sites (Figure 1) situated in the wider area of the region. A detailed description of the site characteristics composing the ground monitoring network can be found in [28]. Briefly, Sites 1 and 2 are agricultural grassland sites near to the Gogerddan campus of Aberystwyth University. Sites 3 and 4 are of bioenergy crops at the northern outskirts of Aberystwyth. Site 5 is situated under grassland of Miscanthus plots. Site 6 is situated under the willow at the eastern edge of Aberystwyth, near Llanbadarn campus of Aberystwyth. Sites 7,8 and 9 are situated at grassland on the Pwllpeiran Research Farm, near to the Devil's Bridges. Time domain reflectometry (TDR) has been used for the measurement of soil moisture. TDR is installed horizontally at 5-10 cm depth in the soil surface [28]. In particular, soil moisture sensor depths are 5 cm, 5 cm, 5 cm, 5 cm, 10 cm, 10 cm, 10 cm, 10 cm, and 5 cm for site 1, site 2, site 3, site 4, site 5, site 6, site 7, site 8 and site 9, respectively. A detailed description of test sites is provided in Table 1.  In this study, the in situ SM and ST data were obtained by taking the averages from Comins-Coch WSMN site 1 and 2 over agriculture/grasslands land cover for the year 2016. The whole dataset was obtained from International Soil Moisture (ISMN), a global data sharing and distribution platform. The collected data were separated into the growing and non-growing seasons. The growing season starts from March to November in the United Kingdom. The non-growing season was taken as from December to February. The average yearly temperature of the non-growing season was less than 6 • C.

Satellite Datasets
SMAP provides measurements of brightness temperature at spatial resolution (~36 km) scales with a temporal resolution of 3days global coverage. The SMAP satellite is equipped with passive (radiometer) microwave instruments with the L-band (1.41 GHz). The radiometric-based algorithms provide the estimation of near-surface soil moisture (0-5 cm depth) data products. The thermodynamic temperature at the uppermost layer of the Earth's surface is called land surface temperature (LST). It is commonly measured by the thermal radiance obtained from thermal infrared sensors over clear sky conditions [40]. The SMAP passive soil moisture product(L2_SM_P) [41][42][43] was used for extracting the temporal data of soil moisture and surface temperature for the year 2016 for WSMN soil moisture sites in the Wales, U.K. Surface temperature information was imputed to SMAP from the GMAO GEOS-5 model for the SMAP passive soil moisture algorithm. The SMAP SM and ST datasets have been downloaded from NASA Earth science data (https://earthdata.nasa.gov/) from January to December months for the year 2018. Furthermore, the 8-day composite MODIS global evapotranspiration (ET) product (MOD16A2) [44,45] with a spatial resolution of 1 km was also used in the present study to understand the changes in ET with the surface soil moisture and temperature. The aggregation process was applied to convert the ET spatial resolution of 1 km to 36 km within the SMAP pixel size. The cubic spline interpolation technique was applied to improve the temporal resolution (8 day to daily) of ET. The 16-day composite Global MODIS Aqua Vegetation Indices product (MYD13A1) was used for the extracting of normalized difference vegetation index (NDVI) data with a spatial resolution of 500 m. The ET and NDVI datasets have been downloaded from January to December months for the year 2018 (https://lpdaacsvc.cr.usgs.gov/appeears/). The mean aggregation process was applied to convert the NDVI spatial resolution 500 m to 36 km within the SMAP pixel size. The cubic spline interpolation technique was applied to improve the temporal resolution (16 days to daily) of NDVI. The Google Earth engine tool was used to extract the daily GPM rainfall data (NASA/GPM_L3/IMERG_V06). The processing of in situ and satellite datasets has been processed in open access Python libraries (pandas, numpy, matplotlib, rasterio, gdal, sklearn, and scipy) in Jupyter notebook.

Statistical Analysis
In this study, the SMAP SM and ST data were compared with the WSMN in situ data using a series of appropriate statistical metrics summarized in Table 2. The coefficient of determination (R 2 ) is the proportion of the variance in the dependent variable that is predictable from the independent variable, and it varies between 0 and 1. A higher value is an indicator of a good prediction. The bias measures the average tendency of the estimated values to be larger or smaller than their observed values. The optimum value of bias is 0.0 and the smaller value of bias indicates accurate model prediction [46]. Root-mean-square error (RMSE) is another statistical parameter frequently used to measure the differences between estimated values by a model or an estimator and the observed values [47]. The scatter or mean standard deviation shows the bias free error between observed and retrieved variables. The lower values of scatter depict good retrieval values. These specific statistical metrics were selected to be used in our study because they have also been used in many other similar studies [2,5,8,10,[29][30][31]. Table 2. Statistical measures used to assess the agreement between the predicted estimates and the in situ observations. Subscripts i = 1. N denotes the individual observations, P denotes the predicted values, and O denotes the "observed" values. The horizontal bar denotes the mean value.

Name Description Mathematical Definition
Bias/MBE Bias (accuracy) or mean average error

Multiple Regression Analysis
Multiple regression analysis is a statistical procedure to establish the relationship between several independent variables and one dependent variable. The goal of multiple regression analysis is to model a dependent variable as a function of several independent variables. The independent variables may be continuous or categorical [48,49]. The estimates generated through multiple regression analysis are called coefficients. Multiple regression analysis requires at least two or more independent variables [50].
In multiple regression analysis, the relative weight age of each independent variable on the dependent variable is computed by the computation of variance in the dependent variable with respect to the variation in each of the independent variables. Mathematically, the multiple regression equation can be explained as in Equation (1), where b i (i = 1, 2, . . . , n) are the regression coefficients. The values of regression coefficients show the relative importance for the changes in the dependent variable due to the changes in each independent variable with the relationship. The term c represents the y-intercept [51].
In the present study, WSMN in situ soil moisture was considered as the dependent variable, and SMAP SM and ST were considered as the independent variables for the multiple regression analysis to establish the relationship between in situ SM and satellite SM and ST for accurate retrieval.  Figure 4a,b, it is observed that the temporal variation of ET directly and accurately follows the temporal variation of SMAP ST and WSMN ST. The temporal variation of ET roughly inversely follows the temporal variation of SMAP SM and WSMN SM. The ET rate is more dependent on the ST than SM. The rate of change in ET follows the rate of change of ST. The SM is directly connected with the process of ET, which can be potentially evaporated from the ground surface; thus, it is clearly stated that when ET increases, SM decreases. The NDVI is a good indicator for vegetation growth. The higher value of ET is observed during the growing season because evaporation is the process of transferring water stored in the surface of canopies, stems, branches, and soil surface to the atmosphere. The soil surface loses water with the highest ET rate. The results reported herein indicate that SM, ET, and ST are correlated with each other, and variability in these parameters is due to the cumulative effect of various factors such as wind speed, humidity, precipitation, water table, drainage pattern, etc.

Seasonal Assessment
indicator for vegetation growth. The higher value of ET is observed during the growing season because evaporation is the process of transferring water stored in the surface of canopies, stems, branches, and soil surface to the atmosphere. The soil surface loses water with the highest ΕΤ rate. The results reported herein indicate that SM, ET, and ST are correlated with each other, and variability in these parameters is due to the cumulative effect of various factors such as wind speed, humidity, precipitation, water table, drainage pattern, etc.

Performance Assessment of Datasets and Algorithm Development for Annual, Growing and Non-Growing Seasons
A linear regression analysis was carried out between the SMAP (SM, ST) and WSMN (SM, ST) data for annual, growing and non-growing seasons using Equation (2). Table 3 shows the value of linear regression coefficients and performance indices (reported in Table 2) between the SMAP (SM or ST) and WSMN (SM or ST) data for annual, growing and non-growing seasons. Figure 5a,b shows the scatter plots between WSMN ST and SMAP ST and WSMN SM and SMAP SM for the annual datasets, respectively. Figure 6a A poor correlation was found between the WSMN-and SMAP-derived SM for annual, growing and non-growing seasons. However, a higher value of R 2 was found between the SMAP SM and WSMN SM for the non-growing season than the growing season because the vegetation cover hampered the soil moisture retrieval accuracy. Additionally, a poor correlation was found between WSMN and SMAP ST during the non-growing season, which exists during the winter season in Wales, U.K. The winter season months (December to February) are the coldest months of the year. During this period, small snowfall events occurred in the morning. The differences SMAP ST and WSMN ST existed due to the error sources available at various modeling and measurement levels. The average of skin soil temperature and 0-10 cm layer of soil temperature was considered for the computation of SMAP soil temperature from the GEOS-5 data at their native 0.25 • × 0.3125 • grids. These SMAP soil temperature values closely represent the temperature in the 0-5 cm layer of soil [52]. The two-dimensional bilinear interpolation technique was applied to compute surface temperature in the required grids of 36km. The four error sources (in situ sensor error, upscaling error, depth correction error and model error) are available in the SMAP soil temperature during the assessing of in situ soil temperature data. The error in the in situ sensor may be due to calibration error, uncertainty in the depth of measurement, or sensor disturbance. The upscaling error includes the assumption taken as the point measurement and can be used to represent an average over a larger pixel area. Depth correction error exists due to the extrapolation technique used to compute the soil temperature values at various vertical depths using measured or modeled soil temperature values. Finally, all these error sources are responsible for the model error, which was quantified with the in situ measurements.

WSMN( ST or SM ) = A * SMAP( ST or SM ) + B
(2) Table 3. Values of performance indices and coefficients of linear and multiple regression analysis for Equations (2) and (3). Figure No.

Linear regression analysis for SMAP (SM) and WSMN (SM) data (Equation (2))
Annual          The SMAP ST was incorporated into the multiple regression analysis with the SMAP SM to improve the accuracy for the retrieval of SM in Wales, U.K. Separate algorithms were developed for the assessment of SMAP satellite data for the retrieval of WSMN SM for the annual, growing and non-growing seasons. The soil moisture retrieval model using SMAP data is given by Equation (3). Table 2 shows the values of multiple regression coefficients and performance indices. Figure 8a-c shows the scatter plot between model-retrieved SM and WSMN-observed SM for annual, growing and non-growing season, respectively. The soil moisture retrieval accuracy increased after incorporating the SMAP ST into Equation (2). Statistical analysis was carried out between the WSMN-observed SM and Equation (2)-retrieved SM to validate the performance of the adopted approach. The values of R 2 between WSMN SM and retrieved SM (by Equation (3)) are reported as 0.558, 0.440 and 0.613 for annual, growing and non-growing season, respectively. However, the values of RMSE between WSMN SM and retrieved SM were found to be 0.045 m 3 m −3 , 0.041 m 3 m −3 and 0.007 m 3 m −3 , with almost zero bias values for annual, growing and non-growing season, respectively. A noticeable improvement in the agreement between the retrieved SM and WSMN SM is reported. Notably, higher R 2 values and the lower RMSE were found for the non-growing season than the growing season. The latter indicates that the non-growing season data are more highly correlated with the in situ datasets than the growing season. The vegetation cover in the growing season reduced the retrieval accuracy of SM more for the growing season than the non-growing season.

Discussion
A time-series approach was adopted over the locations for validating the existing SMAP SM and ST product with the in situ measured WSMN SM and ST on a daily basis due to the ground-measured SM data having adequate length. This procedure provides the matching between SMAP SM and ST values and in situ WSMN SM and ST measurements with sparse network for a particular day, with the assumption that those locations are geophysical similar in characteristics over the footprint of the sensors. The coarse resolution (36 km) of SMAP SM and ST values may be validated using the point-based WSMN SM and ST measurements for homogeneous and naturally rainfed sites. The requirement of water is fulfilled by only natural rainfall over all the WSMN sites. Figure 4 shows regular rainfall events by the GPM daily rainfall data throughout the year 2016.
In this study, an approach was presented that enabled incorporating the effect of ST for the accurate estimation of SM using NASA's SMAP EO data. A linear regression analysis was carried out between the SMAP-retrieved SM and ground-measured SM. SMAPderived ST was incorporated with the SMAP-derived SM in the multiple regression analysis to improve the SM retrieval accuracy. The proposed method's ability to estimate SM under different seasonal conditions was evaluated for the year 2016 at the WSMN validated ground observational network located in Wales, United Kingdom. Figure 9 shows the spatial SMAP SM and retrieved SM maps over the study area using average SM values for annual, growing and non-growing seasons. Higher spatial variation of SM was found for the non-growing season than growing season because high

Discussion
A time-series approach was adopted over the locations for validating the existing SMAP SM and ST product with the in situ measured WSMN SM and ST on a daily basis due to the ground-measured SM data having adequate length. This procedure provides the matching between SMAP SM and ST values and in situ WSMN SM and ST measurements with sparse network for a particular day, with the assumption that those locations are geophysical similar in characteristics over the footprint of the sensors. The coarse resolution (36 km) of SMAP SM and ST values may be validated using the point-based WSMN SM and ST measurements for homogeneous and naturally rainfed sites. The requirement of water is fulfilled by only natural rainfall over all the WSMN sites. Figure 4 shows regular rainfall events by the GPM daily rainfall data throughout the year 2016.
In this study, an approach was presented that enabled incorporating the effect of ST for the accurate estimation of SM using NASA's SMAP EO data. A linear regression analysis was carried out between the SMAP-retrieved SM and ground-measured SM. SMAP-derived ST was incorporated with the SMAP-derived SM in the multiple regression analysis to improve the SM retrieval accuracy. The proposed method's ability to estimate SM under different seasonal conditions was evaluated for the year 2016 at the WSMN validated ground observational network located in Wales, United Kingdom. Figure 9 shows the spatial SMAP SM and retrieved SM maps over the study area using average SM values for annual, growing and non-growing seasons. Higher spatial variation of SM was found for the non-growing season than growing season because high rainfall events occur during January and February. A single-channel radiative transfer algorithm was used for the retrieval of SMAP soil moisture using the radiometric brightness temperature. The accurate values of various auxiliary or input datasets required for the accurate retrieval of soil moisture. The SMAP soil moisture showed the wet conditions due to discrepancies in the auxiliary datasets over the study area (Wales, U.K.). In the present study, the SMAP soil moisture and SMAP soil temperature were combined with each other and the linear multiple regression equation to obtain the actual values of soil moisture. The coefficients C (0.1229, 0.1048, and 0.2133) and D (−0.0156, −0.0139 and 0.0027) were found for the multiple regression analysis during annual, growing and nongrowing datasets. The coefficients C and D are associated with the SMAP soil moisture and SMAP soil temperature in the multiple regression equation, respectively. The negative values of coefficient D are responsible for the decrease in retrieved soil moisture compared to the SMAP soil moisture for the annual and growing datasets. The positive value of coefficient D is responsible for increasing the value of retrieved soil moisture compared to the SMAP soil moisture for the non-growing datasets. Colliander et al. [37] worked on validating SMAP surface soil moisture products using 34 core sites, which provided in situ soil moisture measurements. Eighteen of these sites were used as primary validation sites, and the rest were used as secondary information. They reached the conclusion that the correspondence between SMAP products and in situ measurements is better when the weighted average of all stations within one pixel is used. Furthermore, they also discovered that when they tested the most representative station, the results were close to those obtained with the average. Finally, they concluded that soil moisture values over a range of conditions, such as seasonal variations and differences in drying and wetting patterns, are better approached by average-based soil moisture. Seasonality was also considered as an important factor by [53], when SMAP product validation is concerned. On the other hand, wetting and drying variations, as well as the overall soil moisture, are better reflected by the most representative station of the field network. Similar conclusions were reached earlier by [54]. This could be a valid justification for the differences presented in this study between the observed correlation of SMAP products and in situ measurements mainly in growing and non-growing seasons, and suggests the need for further trials in the future, based on the conclusions by [37], therefore improving even more upon the proposed model of this study.
Another factor that must be taken under consideration in the attempt to justify the differences in correlation between SMAP SM products and in situ measurements and the necessity of this study's proposed model is the specific environment and climate of each test site. As presented by [55], SMAP and SMOS SM products, compared with in situ measurements, presented a high level of similarity in semiarid regions in northeast Brazil, but in other environments, e.g., rainy and forested (such as Amazonia and Atlantic forests), these are dissimilar. They also emphasized that sites with sand surfaces presented similar SM estimations, while in areas of tropical forest, SMAP and SMOS products showed important limitations, thus leading to the conclusion that these products are sensitive to surface soil moisture, but adequately assess the soil water balance dynamic in semiarid areas. Therefore, the fact that the chosen test site of this study in Wales, U.K., was not semiarid and varied in land use/cover (grassland, crops, etc.) and soil type could offer one more explanation about the correlation differences between SMAP products and field values of SM. Wu et al. [56] also reached a similar conclusion about the importance of the background environment in their attempt to validate SMAP products with sparse networks in China. Briefly, they found that the best performance of SMAP products was over open shrubland, and the worst was over broad leaf forest land cover types. This supports the application of the proposed model in the attempt to overcome the base environmentclimate obstacles. In their recent study, [57] tested SMAP products in India, in an area of extreme seasonal variability fluctuating from very wet to dry soil mainly for the paddy regions, with field measurements. They found that SMAP products worked well during the non-growing season but underperformed in the paddy growing season. Moreover, they argued that the vegetation water content climatology and low clay fraction did not match the real values in the baseline algorithm of SMAP, leading them to the conclusion that it might be the main reason for errors and biases in the SMAP SM products. Along with their proposals on ways to improve the retrieval algorithm, they emphasized the need for further testing of SMAP products in other parts of the world with different soil backgrounds. The present research meets this suggestion, by testing SMAP products in Wales, U.K., and highlights a way to calibrate SMAP products and overcoming possible internal algorithmic issues by introducing a regression model. Finally, it must be stated that the work of [57] once more proves the deviations between field data and SMAP products in growing and non-growing seasons, as this research work did too. Another factor that must be taken under consideration in the attempt to justify the differences in correlation between SMAP SM products and in situ measurements and the necessity of this study's proposed model is the specific environment and climate of each test site. As presented by [55], SMAP and SMOS SM products, compared with in situ measurements, presented a high level of similarity in semiarid regions in northeast Brazil, but in other environments, e.g., rainy and forested (such as Amazonia and Atlantic for- Overall, to understand and analyze the existence of differentiations between SMAP products and field measurements, several factors must be taken under consideration. Deviations such as those found in this research between WSMN SM and SMAP SM or convergences such as WSMN ST and SMAP ST are highly connected to the background regime of the study area, its unique characteristics, and, from another view, how well the retrieval algorithms of the satellite products fit the needs and requirements of the area under study. Many efforts have been made to research the aforementioned factors and validate SMAP and other SM satellite products in various test sites around the world; most of them reached similar conclusions [27,38,43,[58][59][60]. Thus, land use/cover, soil type and texture, climate (rainfall, temperature, wind, etc.), and wetting and drying conditions are amongst the primary factors influencing the validity and adaptation of SMAP products in a specific test site. The seasonal deviations found in this research were attempted to be minimized by proposing a regression model that tries to adjust field values to the SMAP-derived data, overcoming the unique influence of each of the previously mentioned factors to the highest level possible. Therefore, a well-adjusted model is produced that can be used to derive adequately improved values from SMAP products for areas with similar characteristics such as the one studied here. In order to further improve and expand the proposed regression algorithm, more validation sites are needed, ideally with different background characteristics.
Overall, the potential of having appropriate, validated, and optimized EO data, such as SMAP, for the accurate retrieval of SM, is of high importance when sustainability goals are addressed. Environmental, ecosystem, agriculture, resources, food security, etc., and sustainable management, planning and decision-making [2,[4][5][6][7][8][9]11,12] are actions highly supported by EO data, and in this case, by SMAP data, where soil moisture is implicated.

Conclusions
In this study, the SMAP-derived SM and ST predictions were compared with collocated ground observations from the WSMN ground monitoring network operating in mid-Wales, U.K., for one year of observations (2016). As part of this investigation, the seasonality effects on the annual, growing and non-growing seasons were assessed. Next, a method for retrieving SM from the SMAP satellite based on multiple regression analysis was developed, the accuracy of which was assessed using reference data from the WSMN.
The temporal variation of the ST from SMAP followed that of WSMN for both growing and non-growing seasons. A random temporal variation was observed between the SMAP-derived SM and WSMN SM for the growing season. Similarly, a random temporal variation was observed between the SMAP surface temperature and the ground-measured ST from WSMN for the non-growing season. The temporal variation of the ET followed the variation of the SMAP ST and WSMN ST. The temporal variation of the ET roughly followed the inverse variation of the SMAP SM and WSMN SM. A lack of good correlation was reported between the SMAP surface temperature and WSMN ST for the non-growing season, whereas a low correlation was found between the SMAP SM and WSMN SM for growing and non-growing seasons. The SMAP-derived ST was incorporated with the SMAP-derived SM in the multiple regression analysis to improve the SM retrieval accuracy. A noticeable improvement was reported in the retrieval of SM using SMAP data for annual, growing and non-growing seasons.
The present study provides a contemporary and easy-to-adopt approach for obtaining satisfactorily accurate soil moisture using only satellite data, thus reducing the burden of expensive field experiments and time for the development of in situ soil moisture networks. However, further work is required in performing the same approach in a range of other ecosystems and environments, which will allow us to draw more definite conclusions on the method capabilities and potential added value. Furthermore, the use of machine learning techniques (neural networks, support vector machines, etc.), along with a dynamic and constantly updated database of ground measurements in various testing sites (with different background characteristics) around the world, could be investigated to continually optimize and upgrade the proposed regression model. This will be the subject of future work.