A Spatial Downscaling Framework for SMAP Soil Moisture Based on Stacking Strategy
Abstract
:1. Introduction
2. Study Area and Data
2.1. Study Area
2.2. Research Data
2.2.1. Ground Measured SM Data
2.2.2. SMAP SM Data
2.2.3. Modis Products
2.2.4. Sentinel-1 Data
2.2.5. Topographic Data
2.2.6. Soil Property Data
2.2.7. Precipitation Data
2.2.8. Other SM Products
3. Methods
3.1. Data Preparation
3.2. Machine Learning Methods
3.2.1. XGBoost
3.2.2. LightGBM
3.2.3. CatBoost
3.2.4. Stacking Model
3.3. Construction of Downscaling Framework Based on Stacking Strategy
- Data preparation: in Section 3.1, we performed quality control, gap-filling, and SG filtering on MODIS data through the GEE platform, simplifying the land cover types and removing the water bodies.
- Data processing: To standardize the spatial resolution of all predictors to 1 km and 9 km, the MCD12Q1 dataset was aggregated using the mode method and the remaining datasets were aggregated using the mean method. The coordinate systems for all datasets were unified to the UTM projection system with WGS84 datum.
- Sample generation: The 9 km features and SMAP L4-SM were sampled according to the terrain partition (HB, JH, WX, YJ, WN). Besides remotely sensed features, DOY was added to indicate the generation time of features.
- Model construction: Taking the WN region as an example, the samples collected in this region were randomly divided into the training set and test set with a ratio of 7:3. XGBoost, LightGBM, and CatBoost were trained using the five-fold cross validation method and fused through the Stacking strategy. The test set was not involved in training and only used for model evaluation. It is worth mentioning that, to test the effectiveness of terrain partitioning, we also trained a Stacking model without partitioning.
- Model application: The trained Stacking models were then applied to the 1 km resolution features to generate downscaled SM for each region, and finally the downscaled results were merged by date.
- Validation: The downscaling results were validated using the measured SM from 87 sites, together with precipitation data. Furthermore, we compared the SMCI1.0 and SMAP D-SM products as well as the downscaled SM without partitioning.
3.4. Evaluation Method
4. Results
4.1. Validation of Downscaling Framework
4.2. Overall Performance of Downscaled SM
4.3. Temporal Dynamics of Downscaled SM
4.4. Spatial Distribution of Downscaled SM
5. Discussion
5.1. Analysis of Input Predictors
5.2. Uncertainty of This Study
6. Conclusions
- The framework incorporated three ML models, XGBoost, LightGBM, and CatBoost. Comparison revealed that the Stacking model achieved the highest accuracy in all regions, followed by XGBoost, CatBoost, and LightGBM. Validation with measured SM showed that 60% of the sites were highly correlated with downscaled SM (R > 0.6), with an average ubRMSE of 0.040 m3/m3, which satisfied the accuracy requirements of the SMAP products. Moreover, the downscaling results outperformed the available 1 km resolution SM products (SMCI1.0, SMAP D-SM) and method without partitioning (downscaled SM (WP)).
- Both downscaled SM and downscaled SM (WP) exhibited temporal consistency with SMAP L4-SM and responded positively to rainfall events. They also mitigated the systematic bias of the SMAP L4 product, but downscaled SM (WP) performed inconsistently and sometimes even aggravated the bias. The spatial pattern analysis indicated that the downscaled SM preserved the overall trend of SMAP L4-SM while enriching the details. The downscaled SM was higher in the humid regions and lower in the semi-humid regions, which agreed with the actual situation.
- Among the 15 predictors, DOY, clay, and DEM were the most important and determined the overall distribution of SM. It is worth noting that the relationship between DEM and SM was complex; they exhibited a negative correlation in the plains and a positive correlation in the mountains. LST and Albedo reflected the dynamics of surface energy and showed a negative correlation with SM in all regions. VV polarization was less affected by vegetation and thus captured SM changes more effectively than VH. NSDSI was found to be more sensitive to SM than the other spectral indices, and together they regulated local variations and spatial heterogeneity of SM.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Huntington, T.G. Evidence for intensification of the global water cycle: Review and synthesis. J. Hydrol. 2006, 319, 83–95. [Google Scholar] [CrossRef]
- Seneviratne, S.I.; Corti, T.; Davin, E.L.; Hirschi, M.; Jaeger, E.B.; Lehner, I.; Orlowsky, B.; Teuling, A.J. Investigating soil moisture-climate interactions in a changing climate: A review. Earth Sci. Rev. 2010, 99, 125–161. [Google Scholar] [CrossRef]
- Dorigo, W.A.; Wagner, W.; Hohensinn, R.; Hahn, S.; Paulik, C.; Xaver, A.; Gruber, A.; Drusch, M.; Mecklenburg, S.; van Oevelen, P.; et al. The International Soil Moisture Network: A data hosting facility for global in situ soil moisture measurements. Hydrol. Earth Syst. Sci. 2011, 15, 1675–1698. [Google Scholar] [CrossRef]
- Qin, J.; Yang, K.; Lu, N.; Chen, Y.Y.; Zhao, L.; Han, M.L. Spatial upscaling of in-situ soil moisture measurements based on MODIS-derived apparent thermal inertia. Remote Sens. Environ. 2013, 138, 1–9. [Google Scholar] [CrossRef]
- Paloscia, S.; Pettinato, S.; Santi, E.; Notarnicola, C.; Pasolli, L.; Reppucci, A. Soil moisture mapping using Sentinel-1 images: Algorithm and preliminary validation. Remote Sens. Environ. 2013, 134, 234–248. [Google Scholar] [CrossRef]
- Abbaszadeh, P.; Moradkhani, H.; Zhan, X.W. Downscaling SMAP Radiometer Soil Moisture Over the CONUS Using an Ensemble Learning Method. Water Resour. Res. 2019, 55, 324–344. [Google Scholar] [CrossRef]
- Liu, J.; Chai, L.N.; Lu, Z.; Liu, S.M.; Qu, Y.Q.; Geng, D.Y.; Song, Y.Z.; Guan, Y.B.; Guo, Z.X.; Wang, J.; et al. Evaluation of SMAP, SMOS-IC, FY3B, JAXA, and LPRM Soil Moisture Products over the Qinghai-Tibet Plateau and Its Surrounding Areas. Remote Sens. 2019, 11, 792. [Google Scholar] [CrossRef]
- Entekhabi, D.; Njoku, E.G.; O’Neill, P.E.; Kellogg, K.H.; Crow, W.T.; Edelstein, W.N.; Entin, J.K.; Goodman, S.D.; Jackson, T.J.; Johnson, J.; et al. The Soil Moisture Active Passive (SMAP) Mission. Proc. IEEE 2010, 98, 704–716. [Google Scholar] [CrossRef]
- Zhao, W.; Wen, F.; Cai, J. Methods, progresses and challenges of passive microwave soil moisture spatial downscaling. Natl. Remote Sens. Bull. 2022, 26, 1699–1722. [Google Scholar] [CrossRef]
- Peng, J.; Loew, A.; Merlin, O.; Verhoest, N.E.C. A review of spatial downscaling of satellite remotely sensed soil moisture. Rev. Geophys. 2017, 55, 341–366. [Google Scholar] [CrossRef]
- Sabaghy, S.; Walker, J.P.; Renzullo, L.J.; Jackson, T.J. Spatially enhanced passive microwave derived soil moisture: Capabilities and opportunities. Remote Sens. Environ. 2018, 209, 551–580. [Google Scholar] [CrossRef]
- Srivastava, P.K.; Han, D.W.; Ramirez, M.R.; Islam, T. Machine Learning Techniques for Downscaling SMOS Satellite Soil Moisture Using MODIS Land Surface Temperature for Hydrological Application. Water Resour. Manag. 2013, 27, 3127–3144. [Google Scholar] [CrossRef]
- Wei, Z.S.; Meng, Y.Z.; Zhang, W.; Peng, J.; Meng, L.K. Downscaling SMAP soil moisture estimation with gradient boosting decision tree regression over the Tibetan Plateau. Remote Sens. Environ. 2019, 225, 30–44. [Google Scholar] [CrossRef]
- Karthikeyan, L.; Mishra, A.K. Multi-layer high-resolution soil moisture estimation using machine learning over the United States. Remote Sens. Environ. 2021, 266, 19. [Google Scholar] [CrossRef]
- Rao, P.Z.; Wang, Y.C.; Wang, F.; Liu, Y.; Wang, X.Y.; Wang, Z. Daily soil moisture mapping at 1 km resolution based on SMAP data for desertification areas in northern China. Earth Syst. Sci. Data 2022, 14, 3053–3073. [Google Scholar] [CrossRef]
- Yan, R.; Bai, J.J. A New Approach for Soil Moisture Downscaling in the Presence of Seasonal Difference. Remote Sens. 2020, 12, 2818. [Google Scholar] [CrossRef]
- Gao, B.H.; He, Y.; Chen, X.Y.; Zheng, X.Y.; Zhang, L.F.; Zhang, Q.; Lu, J.G. Landslide Risk Evaluation in Shenzhen Based on Stacking Ensemble Learning and InSAR. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023, 16, 1–18. [Google Scholar] [CrossRef]
- Zhang, Y.Z.; Ma, J.; Liang, S.L.; Li, X.S.; Liu, J.D. A stacking ensemble algorithm for improving the biases of forest aboveground biomass estimations from multiple remotely sensed datasets. GISci. Remote Sens. 2022, 59, 234–249. [Google Scholar] [CrossRef]
- Zhen, M.; Yi, M.; Luo, T.; Wang, F.; Yang, K.; Ma, X.; Cui, S.; Li, X. Application of a Fusion Model Based on Machine Learning in Visibility Prediction. Remote Sens. 2023, 15, 1450. [Google Scholar] [CrossRef]
- Bentejac, C.; Csorgo, A.; Martinez-Munoz, G. A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 2021, 54, 1937–1967. [Google Scholar] [CrossRef]
- Huang, S.Z.; Zhang, X.; Wang, C.; Chen, N.C. Two-step fusion method for generating 1 km seamless multi-layer soil moisture with high accuracy in the Qinghai-Tibet plateau. ISPRS J. Photogramm. Remote Sens. 2023, 197, 346–363. [Google Scholar] [CrossRef]
- Huang, G.M.; Wu, L.F.; Ma, X.; Zhang, W.Q.; Fan, J.L.; Yu, X.; Zeng, W.Z.; Zhou, H.M. Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions. J. Hydrol. 2019, 574, 1029–1041. [Google Scholar] [CrossRef]
- Li, H.M.; Zhang, G.L.; Zhong, Q.C.; Xing, L.Q.; Du, H.Q. Prediction of Urban Forest Aboveground Carbon Using Machine Learning Based on Landsat 8 and Sentinel-2: A Case Study of Shanghai, China. Remote Sens. 2023, 15, 284. [Google Scholar] [CrossRef]
- Carlson, T. An overview of the “triangle method” for estimating surface evapotranspiration and soil moisture from satellite imagery. Sensors 2007, 7, 1612–1629. [Google Scholar] [CrossRef]
- Bai, J.Y.; Cui, Q.; Zhang, W.; Meng, L.K. An Approach for Downscaling SMAP Soil Moisture by Combining Sentinel-1 SAR and MODIS Data. Remote Sens. 2019, 11, 2736. [Google Scholar] [CrossRef]
- Zhang, Y.F.; Liang, S.L.; Zhu, Z.L.; Ma, H.; He, T. Soil moisture content retrieval from Landsat 8 data using ensemble learning. ISPRS J. Photogramm. Remote Sens. 2022, 185, 32–47. [Google Scholar]
- Wang, C.H.; Lin, Q.G.; Wang, L.B.; Jiang, T.; Su, B.D.; Wang, Y.J.; Mondal, S.K.; Huang, J.L.; Wang, Y. The influences of the spatial extent selection for non-landslide samples on statistical-based landslide susceptibility modelling: A case study of Anhui Province in China. Nat. Hazard. 2022, 112, 1967–1988. [Google Scholar] [CrossRef]
- Kedzior, M.; Zawadzki, J. Comparative study of soil moisture estimations from SMOS satellite mission, GLDAS database, and cosmic-ray neutrons measurements at COSMOS station in Eastern Poland. Geoderma. 2016, 283, 21–31. [Google Scholar] [CrossRef]
- Reichle, R.; De Lannoy, G.; Koster, R.D.; Crow, W.T.; Kimball, J.S.; Liu, Q.; Bechtold, M. SMAP L4 Global 3-hourly 9 km EASE-Grid Surface and Root Zone Soil Moisture Geophysical Data, 7th ed.; Distributed by NASA National Snow and Ice Data Center Distributed Active Archive Center, 2022. [Google Scholar] [CrossRef]
- Wang, Z.S.; Schaaf, C.B.; Sun, Q.S.; Shuai, Y.M.; Roman, M.O. Capturing rapid land surface dynamics with Collection V006 MODIS BRDF/NBAR/Albedo (MCD43) products. Remote Sens. Environ. 2018, 207, 50–64. [Google Scholar] [CrossRef]
- Fischer, G.; Nachtergaele, F.; Prieler, S.; van Velthuizen, H.T.; Verelst, L.; Wiberg, D. Global Agro-ecological Zones Assessment for Agriculture (GAEZ 2008); IIASA: Laxenburg, Austria; FAO: Rome, Italy, 2008. [Google Scholar]
- Chen, L.; Zhang, K.L.; Zhang, Z.D.; Cao, Z.H.; Ke, Q.H. Response of soil water movement to rainfall under different land uses in karst regions. Environ. Earth Sci. 2023, 82, 17. [Google Scholar] [CrossRef]
- Shangguan, W.; Li, Q.; Shi, G. A 1 km Daily Soil Moisture Dataset over China Based on Situ Measurement (2000–2020); National Tibetan Plateau Data Center, 2022. [Google Scholar]
- Lakshmi, V.; Fang, B. SMAP-Derived 1-km Downscaled Surface Soil Moisture Product, Version 1; Distributed by NASA National Snow and Ice Data Center Distributed Active Archive Center, 2023. [Google Scholar] [CrossRef]
- Chen, J.; Jonsson, P.; Tamura, M.; Gu, Z.; Matsushita, B.; Eklundh, L. A simple method for reconstructing a high-quality NDVI time-series data set based on the Savitzky-Golay filter. Remote Sens. Environ. 2004, 91, 332–344. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. Lightgbm: A highly efficient gradient boosting decision tree. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 8–9 December 2017; p. 30. [Google Scholar]
- Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased boosting with categorical features. Adv. Neural Inf. Process. Syst. 2018, 6637–6647. [Google Scholar]
- Xu, M.Y.; Yao, N.; Yang, H.X.; Xu, J.; Hu, A.N.; de Goncalves, L.G.G.; Liu, G. Downscaling SMAP soil moisture using a wide & deep learning method over the Continental United States. J. Hydrol. 2022, 609, 22. [Google Scholar]
- Liu, X.W.; Zhang, X.Y.; Chen, S.Y.; Sun, H.Y.; Shao, L.W. Subsoil compaction and irrigation regimes affect the root-shoot relation and grain yield of winter wheat. Agric. Water Manag. 2015, 154, 59–67. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S.I. A Unified Approach to Interpreting Model Predictions. Adv. Neural Inf. Process. Syst. 2017, 30, 1–10. [Google Scholar]
- Shangguan, Y.L.; Min, X.X.; Shi, Z. Inter-comparison and integration of different soil moisture downscaling methods over the Qinghai-Tibet Plateau. J. Hydrol. 2023, 617, 17. [Google Scholar] [CrossRef]
- Xu, X.L.; Ma, K.M.; Fu, B.J.; Song, C.J.; Liu, W. Relationships between vegetation and soil and topography in a dry warm river valley, SW China. Catena 2008, 75, 138–145. [Google Scholar] [CrossRef]
- Fu, H.; Yuan, G.X.; Ge, D.B.; Li, W.; Zou, D.S.; Huang, Z.R.; Wu, A.P.; Liu, Q.L.; Jeppesen, E. Cascading effects of elevation, soil moisture and soil nutrients on plant traits and ecosystem multi-functioning in Poyang Lake wetland, China. Aquat. Sci. 2020, 82, 10. [Google Scholar] [CrossRef]
- Chia, R.W.; Yong, L.J.; Jang, J.; Lee, S.-b. Effects of land use change on soil moisture content at different soil depths. J. Geol. Soc. Korea 2022, 58, 117–135. [Google Scholar] [CrossRef]
- Zhao, W.; Sanchez, N.; Lu, H.; Li, A.N. A spatial downscaling approach for the SMAP passive surface soil moisture product using random forest regression. J. Hydrol. 2018, 563, 1009–1024. [Google Scholar] [CrossRef]
- Guan, X.D.; Huang, J.P.; Guo, N.; Bi, J.R.; Wang, G.Y. Variability of Soil Moisture and Its Relationship with Surface Albedo and Soil Thermal Parameters over the Loess Plateau. Adv. Atmos. Sci. 2009, 26, 692–700. [Google Scholar] [CrossRef]
- Sadeghi, M.; Jones, S.B.; Philpot, W.D. A linear physically-based model for remote sensing of soil moisture using short wave infrared bands. Remote Sens. Environ. 2015, 164, 66–76. [Google Scholar] [CrossRef]
- Yang, Z.J.; He, Q.S.; Miao, S.Q.; Wei, F.; Yu, M.X. Surface Soil Moisture Retrieval of China Using Multi-Source Data and Ensemble Learning. Remote Sens. 2023, 15, 2786. [Google Scholar] [CrossRef]
- Hu, F.M.; Wei, Z.S.; Zhang, W.; Dorjee, D.; Meng, L.K. A spatial downscaling method for SMAP soil moisture through visible and shortwave-infrared remote sensing data. J. Hydrol. 2020, 590, 11. [Google Scholar] [CrossRef]
- Joshi, C.; Mohanty, B.P. Physical controls of near-surface soil moisture across varying spatial scales in an agricultural landscape during SMEX02. Water Resour. Res. 2010, 46, 21. [Google Scholar] [CrossRef]
- Piles, M.; Camps, A.; Vall-Llossera, M.; Corbella, I.; Panciera, R.; Rudiger, C.; Kerr, Y.H.; Walker, J. Downscaling SMOS-Derived Soil Moisture Using MODIS Visible/Infrared Data. IEEE Trans. Geosci. Remote Sens. 2011, 49, 3156–3166. [Google Scholar] [CrossRef]
Dataset | Variable | Spatial Resolution | Temporal Resolution |
---|---|---|---|
SMAP L4-SM | SM | 9 km | 3 h |
MOD09A1 * | NDVI; NDWI; NSDSI; EVI | 500 m | 8 d |
Sentinel-1 * | VV; VH | 10 m | 12 d |
MOD11A1 * | LST (Land Surface Temperature) | 1 km | Daily |
MOD15A2H * | LAI (Leaf Area Index) | 500 m | 8 d |
MCD43A3 * | Albedo | 500 m | Daily |
CHIRPS * | Precipitation | 0.05° | Daily |
HWSD | Sand; Silt; Clay | 1 km | —— |
SRTM * | DEM (Digital Elevation Model) | 90 m | —— |
SMCI1.0 | SM | 1 km/9 km | Daily |
SMAP D-SM | SM | 1 km | Daily |
MCD12Q1 * | LC (Land Cover) | 500 m | Yearly |
Original | Reclassified | |||
---|---|---|---|---|
Value | Class | Area (km2) | Class | Value |
0 | Water | 4242.68 | Water | 0 |
1 | Evergreen Needleleaf Forest | 17,272.22 | Forest | 1 |
2 | Evergreen Broadleaf Forest | 11,409.17 | ||
3 | Deciduous Needleleaf Forest | 14.01 | ||
4 | Deciduous Broadleaf Forest | 27,420.06 | ||
5 | Shrub | 0.00 | ||
6 | Grass | 2532.65 | Grassland | 2 |
7 | Cereal Croplands | 45,961.34 | Cropland | 3 |
8 | Broadleaf Croplands | 26,324.97 | ||
9 | Urban and Built-Up | 4938.73 | Urban | 4 |
10 | Permanent Snow and Ice | 0.00 | ||
11 | Non-Vegetated Lands | 8.01 |
Region | HB | JH | WX | YJ | WN | ||
---|---|---|---|---|---|---|---|
Number of samples | 81,064 | 76,062 | 38,794 | 60,043 | 54,972 | ||
XGBoost | Training set | R | 0.964 | 0.966 | 0.988 | 0.984 | 0.991 |
RMSE | 0.017 | 0.014 | 0.012 | 0.014 | 0.010 | ||
Test set | R | 0.952 | 0.950 | 0.978 | 0.976 | 0.987 | |
RMSE | 0.020 | 0.016 | 0.015 | 0.017 | 0.012 | ||
LightGBM | Training set | R | 0.919 | 0.931 | 0.967 | 0.961 | 0.974 |
RMSE | 0.026 | 0.019 | 0.019 | 0.022 | 0.017 | ||
Test set | R | 0.917 | 0.924 | 0.963 | 0.958 | 0.973 | |
RMSE | 0.026 | 0.020 | 0.020 | 0.023 | 0.018 | ||
CatBoost | Training set | R | 0.955 | 0.955 | 0.978 | 0.977 | 0.987 |
RMSE | 0.020 | 0.016 | 0.015 | 0.017 | 0.012 | ||
Test set | R | 0.951 | 0.948 | 0.975 | 0.974 | 0.986 | |
RMSE | 0.020 | 0.017 | 0.016 | 0.018 | 0.013 | ||
Stacking | Training set | R | 0.967 | 0.967 | 0.987 | 0.984 | 0.991 |
RMSE | 0.016 | 0.013 | 0.012 | 0.014 | 0.010 | ||
Test set | R | 0.959 | 0.954 | 0.981 | 0.979 | 0.989 | |
RMSE | 0.018 | 0.015 | 0.014 | 0.016 | 0.011 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xu, J.; Su, Q.; Li, X.; Ma, J.; Song, W.; Zhang, L.; Su, X. A Spatial Downscaling Framework for SMAP Soil Moisture Based on Stacking Strategy. Remote Sens. 2024, 16, 200. https://doi.org/10.3390/rs16010200
Xu J, Su Q, Li X, Ma J, Song W, Zhang L, Su X. A Spatial Downscaling Framework for SMAP Soil Moisture Based on Stacking Strategy. Remote Sensing. 2024; 16(1):200. https://doi.org/10.3390/rs16010200
Chicago/Turabian StyleXu, Jiaxin, Qiaomei Su, Xiaotao Li, Jianwei Ma, Wenlong Song, Lei Zhang, and Xiaoye Su. 2024. "A Spatial Downscaling Framework for SMAP Soil Moisture Based on Stacking Strategy" Remote Sensing 16, no. 1: 200. https://doi.org/10.3390/rs16010200
APA StyleXu, J., Su, Q., Li, X., Ma, J., Song, W., Zhang, L., & Su, X. (2024). A Spatial Downscaling Framework for SMAP Soil Moisture Based on Stacking Strategy. Remote Sensing, 16(1), 200. https://doi.org/10.3390/rs16010200