Optimizing Bare Soil Mosaics for Clay Prediction via Environmental Covariates and Variable Selection
Highlights
- Multi-temporal bare soil mosaics and regional environmental covariates in combination with variable selection methods were tested for clay mapping across croplands.
- Combining mosaic with regional proxies increased model performance.
- The modified greedy feature selection outperformed a combination of variance inflation factor and recursive feature elimination techniques and resulted in the best model performance (RMSE = 8.73%, R2 = 0.42, RPD = 1.34).
- Integrating other environmental variables besides mosaics is an important step toward achieving maximum prediction accuracy, highlighting pedogenic controls beyond reflectance.
- In addition to including other environmental variables besides mosaics, it is important to consider the method of variable selection for soil mapping tasks.
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Area
2.2. Soil Data
2.3. Spatial Modeling Framework
2.3.1. Multi-Temporal Remote Sensing Data
2.3.2. Regional Covariate Stack
2.3.3. Mapping Scenarios
2.4. Variable Selection Techniques
2.5. Evaluation of Model Performance
3. Results
3.1. Summary Statistics of Clay Content Across Croplands
3.2. Final Temporal Mosaic
3.3. Performance of Tested Scenarios
3.4. Key Environmental Covariates
3.5. Digital Maps of Clay Content
4. Discussion
5. Conclusions
- The per-pixel-based bare soil mosaic using a single threshold (NDVI < 0.23) outperformed other temporal alternatives and resulted in RMSE = 10.03%, R2 = 0.25, and RPD = 1.16. Hence, single RS data explained only 25% of the clay variation across croplands.
- Using regional covariates representing soil-forming factors, the models demonstrated comparable performance (RMSE = 9.77%, R2 = 0.27, and RPD = 1.18). Among them, MODIS and Landsat variables were the most important; i.e., RS data representing vegetation conditions remained key predictors among other environmental factors.
- Combining a multi-temporal mosaic and a regional set of covariates showed different accuracy depending on the variable selection method. Using all variables without filtering led to a model accuracy of RMSE = 9.45%, R2 = 0.33, and RPD = 1.24, whereas excluding variables after the VIF and RFE procedures slightly reduced the accuracy (RMSE = 9.65%, R2 = 0.31, RPD = 1.21). This is explained by the fact that the excluded variables (mainly highly correlated) also contained useful information.
- The model that used bare soil mosaic and regional variables with a combination of the MGFS technique increased R2 to 0.42 (RMSE = 8.73%, RPD = 1.34), highlighting the importance of the variable selection method. Under this predictive model, NIR bands and relief were the most important. This confirms that other pedogenic factors should be used beyond reflectance.
- We recommend that future studies incorporate multi-temporal remote sensing data alongside complementary covariates that represent other soil-forming factors, while also adopting a more robust variable selection strategy. This workflow is applicable to other cropland areas with analogous climate and terrain conditions, where bare soil mosaics can be effectively combined with regional covariates and variable selection techniques for DSM tasks.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| RS | Remote sensing |
| VIF | Variance inflation factor |
| RFE | Recursive feature elimination |
| NIR | Near-infrared |
| RMSE | Root mean square error |
| RPD | Ratio of performance to deviation |
| R2 | Coefficient of determination |
| MGFS | Modified greedy feature selection |
| SOC | Soil organic carbon |
| NDVI | Normalized Difference Vegetation Index |
| NBR2 | Normalized Burn Ratio 2 |
| S2WI | Soil Surface Moisture Index |
| AOI | Area of interest |
| BSI | Bare Soil Index |
References
- Dixon, J.B. Roles of Clays in Soils. Appl. Clay Sci. 1991, 5, 489–503. [Google Scholar] [CrossRef]
- Zanini, M.; Priori, S.; Petito, M.; Cantalamessa, S. Digital Soil Mapping for Precision Agriculture Using Multitemporal Sentinel-2 Images of Bare Ground. In Proceedings of the 2023 IEEE International Workshop on Metrology for Agriculture and Forestry (MetroAgriFor), Pisa, Italy, 6–8 November 2023; pp. 232–236. [Google Scholar]
- Schweizer, S.A.; Mueller, C.W.; Höschen, C.; Ivanov, P.; Kögel-Knabner, I. The Role of Clay Content and Mineral Surface Area for Soil Organic Carbon Storage in an Arable Toposequence. Biogeochemistry 2021, 156, 401–420. [Google Scholar] [CrossRef]
- Ayoubi, S.; Milikian, A.; Mosaddeghi, M.R.; Zeraatpisheh, M.; Zhao, S. Impacts of Clay Content and Type on Shear Strength and Splash Erosion of Clay–Sand Mixtures. Minerals 2022, 12, 1339. [Google Scholar] [CrossRef]
- Kriuchkov, N.R.; Makarov, O.A. Modeling Dynamics of Soil Erosion by Water Due to Soil Organic Matter Change (1980–2020) in the Steppe Zone of Russia. Agronomy 2023, 13, 2527. [Google Scholar] [CrossRef]
- Wiesmeier, M.; Urbanski, L.; Hobley, E.; Lang, B.; von Lützow, M.; Marin-Spiotta, E.; van Wesemael, B.; Rabot, E.; Ließ, M.; Garcia-Franco, N.; et al. Soil Organic Carbon Storage as a Key Function of Soils—A Review of Drivers and Indicators at Various Scales. Geoderma 2019, 333, 149–162. [Google Scholar] [CrossRef]
- Johannes, A.; Matter, A.; Schulin, R.; Weisskopf, P.; Baveye, P.C.; Boivin, P. Optimal Organic Carbon Values for Soil Structure Quality of Arable Soils. Does Clay Content Matter? Geoderma 2017, 302, 14–21. [Google Scholar] [CrossRef]
- Mäkipää, R.; Menichetti, L.; Martínez-García, E.; Törmänen, T.; Lehtonen, A. Is the Organic Carbon-to-Clay Ratio a Reliable Indicator of Soil Health? Geoderma 2024, 444, 116862. [Google Scholar] [CrossRef]
- Wenzel, W.W.; Golestanifard, A.; Duboc, O. SOC: Clay Ratio: A Mechanistically-Sound, Universal Soil Health Indicator across Ecological Zones and Land Use Categories? Geoderma 2024, 452, 117080. [Google Scholar] [CrossRef]
- McBratney, A.B.; Mendonça Santos, M.L.; Minasny, B. On Digital Soil Mapping. Geoderma 2003, 117, 3–52. [Google Scholar] [CrossRef]
- Chen, Q.; Vaudour, E.; Richer-de-Forges, A.C.; Arrouays, D. Spectral Indices in Remote Sensing of Soil: Definition, Popularity, and Issues. A Critical Overview. Remote Sens. Environ. 2025, 329, 114918. [Google Scholar] [CrossRef]
- Vaudour, E.; Gomez, C.; Lagacherie, P.; Loiseau, T.; Baghdadi, N.; Urbina-Salazar, D.; Loubet, B.; Arrouays, D. Temporal Mosaicking Approaches of Sentinel-2 Images for Extending Topsoil Organic Carbon Content Mapping in Croplands. Int. J. Appl. Earth Obs. Geoinf. 2021, 96, 102277. [Google Scholar] [CrossRef]
- Gasmi, A.; Gomez, C.; Lagacherie, P.; Zouari, H.; Laamrani, A.; Chehbouni, A. Mean Spectral Reflectance from Bare Soil Pixels along a Landsat-TM Time Series to Increase Both the Prediction Accuracy of Soil Clay Content and Mapping Coverage. Geoderma 2021, 388, 114864. [Google Scholar] [CrossRef]
- Vaudour, E.; Gomez, C.; Loiseau, T.; Baghdadi, N.; Loubet, B.; Arrouays, D.; Ali, L.; Lagacherie, P. The Impact of Acquisition Date on the Prediction Performance of Topsoil Organic Carbon from Sentinel-2 for Croplands. Remote Sens. 2019, 11, 2143. [Google Scholar] [CrossRef]
- Chinilin, A.; Lozbenev, N.; Shilov, P.; Fil, P.; Levchenko, E.; Kozlov, D. Synergetic Use of Bare Soil Composite Imagery and Multitemporal Vegetation Remote Sensing for Soil Mapping (A Case Study from Samara Region’s Upland). Land 2024, 13, 2229. [Google Scholar] [CrossRef]
- Azizi, K.; Garosi, Y.; Ayoubi, S.; Tajik, S. Integration of Sentinel-1/2 and Topographic Attributes to Predict the Spatial Distribution of Soil Texture Fractions in Some Agricultural Soils of Western Iran. Soil Tillage Res. 2023, 229, 105681. [Google Scholar] [CrossRef]
- Suleymanov, A.; Telyagissov, S.; Asylbaev, I.; Mirsayapov, R.; Suleymanov, R.; Keshavarzi, A.; Tuktarova, I.; Belan, L. Multi-Scenario Modeling of Soil Organic Carbon in Semi-Arid Croplands with Uncertainty Quantification and Model Interpretation. Remote Sens. Appl. Soc. Environ. 2026, 41, 101903. [Google Scholar] [CrossRef]
- Zhang, X.; Chen, S.; Xue, J.; Wang, N.; Xiao, Y.; Chen, Q.; Hong, Y.; Zhou, Y.; Teng, H.; Hu, B.; et al. Improving Model Parsimony and Accuracy by Modified Greedy Feature Selection in Digital Soil Mapping. Geoderma 2023, 432, 116383. [Google Scholar] [CrossRef]
- Kasraei, B.; Schmidt, M.G.; Zhang, J.; Bulmer, C.E.; Filatow, D.S.; Arbor, A.; Pennell, T.; Heung, B. A Framework for Optimizing Environmental Covariates to Support Model Interpretability in Digital Soil Mapping. Geoderma 2024, 445, 116873. [Google Scholar] [CrossRef]
- Demir, S.; Sahin, E.K. An Investigation of Feature Selection Methods for Soil Liquefaction Prediction Based on Tree-Based Ensemble Algorithms Using AdaBoost, Gradient Boosting, and XGBoost. Neural Comput. Appl. 2023, 35, 3173–3190. [Google Scholar] [CrossRef]
- Ferhatoglu, C.; Miller, B.A. Choosing Feature Selection Methods for Spatial Modeling of Soil Fertility Properties at the Field Scale. Agronomy 2022, 12, 1786. [Google Scholar] [CrossRef]
- Broeg, T.; Don, A.; Scholten, T.; Erasmi, S. Reducing Bias in Cropland Soil Organic Carbon and Clay Predictions Using Sentinel-2 Composites and Data Balancing. Remote Sens. Environ. 2026, 333, 115109. [Google Scholar] [CrossRef]
- Gomez, C.; Vaudour, E.; Féret, J.-B.; de Boissieu, F.; Dharumarajan, S. Topsoil Clay Content Mapping in Croplands from Sentinel-2 Data: Influence of Atmospheric Correction Methods across a Season Time Series. Geoderma 2022, 423, 115959. [Google Scholar] [CrossRef]
- Suleymanov, A.; Chen, Q.; Richer-de-Forges, A.; Arrouays, D.; Vaudour, E.; Asylbaev, I.; Mirsayapov, R.; Telyagissov, S.; Valiev, G.; Shagaliev, R.; et al. Synergetic Integration of Multi-Temporal Remote Sensing Mosaic and Conventional Soil Map for Mapping Organic Carbon Content in Chernozems. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 28358–28374. [Google Scholar] [CrossRef]
- Kasraei, B.; Schmidt, M.G.; Saurette, D.D.; Bulmer, C.E.; Zhang, J.; Pennell, T.; John, K.; Heung, B. Advancing Digital Soil Mapping with Multi-Year Crop Cover Data: Impacts on Model Accuracy and Soil Interpretation. Geoderma 2025, 461, 117481. [Google Scholar] [CrossRef]
- Wang, X.; Zhou, F.; Chen, S.; Deng, X.; Ren, Z.; Duan, S.-B.; Wang, M.; Shi, Z. Advancing Provincial Cropland Soil Mapping with Temporal Satellite Data Integration. Soil Use Manag. 2025, 41, e70108. [Google Scholar] [CrossRef]
- GOST 12536-2014; Grounds Methods for Laboratory Determination of Granulometric (Grain Size) and Microaggregate Composition. The Interstate Council for Standardization, Metrology and Certification: Minsk, Belarus, 2014.
- Kachinsky, N.A. Mechanical and Microagregate Composition of Soils Methods for Study; USSR Academy of Science Publishers: Moscow, Russia, 1958; p. 193. (In Russian) [Google Scholar]
- Tucker, C. Red and Photographic Infrared Linear Combinations for Monitoring Vegetation. Remote Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef]
- Key, C.; Benson, N. Landscape Assessment: Ground Measure of Severity, the Composite Burn Index; and Remote Sensing of Severity, the Normalized Burn Ratio. In FIREMON: Fire Effects Monitoring and Inventory System; General Technical Reports; USDA Forest Service, Rocky Mountain Research Station: Ogden, UT, USA, 2006; p. LA 1-51. [Google Scholar]
- Roy, P.; Rikimaru, A.; Miyatake, S. Tropical Forest Cover Density Mapping. Trop. Ecol. 2002, 43, 39–47. [Google Scholar]
- Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-Scale Geospatial Analysis for Everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
- Suleymanov, A.; Valiev, G.; Shagaliev, R.; Suleymanov, R.; Belan, L. Three-Dimensional Mapping of Key Soil Properties with Multi-Stage Validation and Big Data. CATENA 2026, 266, 109949. [Google Scholar] [CrossRef]
- Poggio, L.; de Sousa, L.M.; Batjes, N.H.; Heuvelink, G.B.M.; Kempen, B.; Ribeiro, E.; Rossiter, D. SoilGrids 2.0: Producing Soil Information for the Globe with Quantified Spatial Uncertainty. Soil 2021, 7, 217–240. [Google Scholar] [CrossRef]
- Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V. Gene Selection for Cancer Classification Using Support Vector Machines. Mach. Learn. 2002, 46, 389–422. [Google Scholar] [CrossRef]
- Curto, J.; Pinto, J. The Corrected VIF. J. Appl. Stat. 2011, 38, 1499–1507. [Google Scholar] [CrossRef]
- Drobnič, F.; Kos, A.; Pustišek, M. On the Interpretability of Machine Learning Models and Experimental Feature Selection in Case of Multicollinear Data. Electronics 2020, 9, 761. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. 2025. Available online: https://www.r-project.org/ (accessed on 15 January 2026).
- Kuhn, M. Building Predictive Models in R Using the Caret Package. J. Stat. Softw. 2008, 28, 1–26. [Google Scholar] [CrossRef]
- Gholizadeh, A.; Žižala, D.; Saberioon, M.; Borůvka, L. Soil Organic Carbon and Texture Retrieving and Mapping Using Proximal, Airborne and Sentinel-2 Spectral Imaging. Remote Sens. Environ. 2018, 218, 89–103. [Google Scholar] [CrossRef]
- Gaab, J.; Ruiz, L.; Sekhar, M.; Dharumarajan, S.; Masika Tutondele, J.; Ebengo Okoto, M.; Gomez, C. Detection of Short-Term Changes in Soil Clay Content at Field Scale with Sentinel-2 Enables Mapping of the Reservoir Sediment Reuse in Cropland. Geoderma 2025, 464, 117630. [Google Scholar] [CrossRef]
- Zhou, T.; Geng, Y.; Chen, J.; Pan, J.; Haase, D.; Lausch, A. High-Resolution Digital Mapping of Soil Organic Carbon and Soil Total Nitrogen Using DEM Derivatives, Sentinel-1 and Sentinel-2 Data Based on Machine Learning Algorithms. Sci. Total Environ. 2020, 729, 138244. [Google Scholar] [CrossRef] [PubMed]
- Adeline, K.R.M.; Gomez, C.; Gorretta, N.; Roger, J.-M. Predictive Ability of Soil Properties to Spectral Degradation from Laboratory Vis-NIR Spectroscopy Data. Geoderma 2017, 288, 143–153. [Google Scholar] [CrossRef]
- Tümsavaş, Z.; Tekin, Y.; Ulusoy, Y.; Mouazen, A.M. Prediction and Mapping of Soil Clay and Sand Contents Using Visible and Near-Infrared Spectroscopy. Biosyst. Eng. 2019, 177, 90–100. [Google Scholar] [CrossRef]
- Hobley, E.; Prater, I. Estimating Soil Texture from Vis–NIR Spectra. Eur. J. Soil Sci. 2018, 70, 83–95. [Google Scholar] [CrossRef]
- Hanberry, B.B. Practical Guide for Retaining Correlated Climate Variables and Unthinned Samples in Species Distribution Modeling, Using Random Forests. Ecol. Inform. 2024, 79, 102406. [Google Scholar] [CrossRef]
- Khaledian, Y.; Miller, B.A. Selecting Appropriate Machine Learning Methods for Digital Soil Mapping. Appl. Math. Model. 2020, 81, 401–418. [Google Scholar] [CrossRef]




| Approach | Driver | Mosaic Name | Condition for Selection |
|---|---|---|---|
| Per-pixel | Bare soil | PerPix_Bare_NDVI023_best_FULLSTACK | NDVI ≤ T1 * (NDVI p10 ≈ 0.23). Best: min NDVI |
| Per-pixel | Bare soil | PerPix_Bare_NDVI023_FULLSTACK_mean | NDVI ≤ T1 (NDVI p10 ≈ 0.23). Aggregation: mean. |
| Per-pixel | Bare soil | PerPix_Bare_NDVI023_FULLSTACK_median | NDVI ≤ T1 (NDVI p10 ≈ 0.23). Aggregation: median. |
| Per-pixel | Bare soil | PerPix_Bare_NDVI029_best_FULLSTACK | NDVI ≤ T2 ** (NDVI p25 ≈ 0.29). Best: min NDVI |
| Per-pixel | Bare soil | PerPix_Bare_NDVI029_FULLSTACK_mean | NDVI ≤ T2 (NDVI p25 ≈ 0.29). Aggregation: mean. |
| Per-pixel | Bare soil | PerPix_Bare_NDVI029_FULLSTACK_median | NDVI ≤ T2 (NDVI p25 ≈ 0.29). Aggregation: median. |
| Per-pixel | Bare soil | PerPix_Bare_BSI_best_FULLSTACK | Mask: cloud/shadow-screened scenes. Best: max BSI. |
| Per-pixel | Bare soil | PerPix_Bare_BSI_FULLSTACK_mean | Mask: cloud/shadow-screened scenes. Aggregation: mean. |
| Per-pixel | Bare soil | PerPix_Bare_BSI_FULLSTACK_median | Mask: cloud/shadow-screened scenes. Aggregation: median. |
| Per-pixel | Bare soil | PerPix_Bare_NBR2_best_FULLSTACK | NDVI ≤ T2 (NDVI p25 ≈ 0.29) + GVI1 *** > 0 & GVI2 **** > 0 + NBR2 < 0.16. Best: min NBR2 |
| Per-pixel | Bare soil | PerPix_Bare_NBR2_FULLSTACK_mean | NDVI ≤ T2 (NDVI p25 ≈ 0.29) + GVI1 > 0 & GVI2 > 0 + NBR2 < 0.16. Aggregation: mean. |
| Per-pixel | Bare soil | PerPix_Bare_NBR2_FULLSTACK_median | NDVI ≤ T2 (NDVI p25 ≈ 0.29) + GVI1 > 0 & GVI2 > 0 + NBR2 < 0.16. Aggregation: median NBR2. |
| Per-pixel | Driest soil | PerPix_Driest_S2WI_best_FULLSTACK | NDVI ≤ T2 (NDVI p25 ≈ 0.29). Best: min S2WI. |
| Per-pixel | Driest soil | PerPix_Driest_S2WI_FULLSTACK_mean | NDVI ≤ T2 (NDVI p25 ≈ 0.29). Aggregation: mean S2WI. |
| Per-pixel | Driest soil | PerPix_Driest_S2WI_FULLSTACK_median | NDVI ≤ T2 (NDVI p25 ≈ 0.29). Aggregation: median S2WI. |
| Per-pixel | Bare soil | PerPix_RuleStrict_FULLSTACK_median | NDVI < 0.23 + NBR2 < 0.16 + BSI > 0.20. Aggregation: median. |
| Per-pixel | Bare soil | PerPix_RuleStrict_bestS2WI_FULLSTACK | NDVI < 0.23 + NBR2 < 0.16 + BSI > 0.20. Best: min S2WI. |
| Per-pixel | Driest soil | PerPix_Driest_S2WI_NBR2minLexi_best_FULLSTACK | NDVI ≤ T2 (NDVI p25 ≈ 0.29) + (GVI1 > 0 & GVI2 > 0). Keep per-pixel NBR2 minima, then Best: min S2WI. |
| Per-pixel | Driest soil | PerPix_Driest_S2WI_NBR2minLexi_FULLSTACK_mean | NDVI ≤ T2 (NDVI p25 ≈ 0.29) + (GVI1 > 0 & GVI2 > 0). Candidates: per-pixel NBR2 minima. Aggregation: mean. |
| Per-pixel | Driest soil | PerPix_Driest_S2WI_NBR2minLexi_FULLSTACK_median | NDVI ≤ T2 (NDVI p25 ≈ 0.29) + (GVI1 > 0 & GVI2 > 0). Candidates: per-pixel NBR2 minima. Aggregation: median. |
| Per-date | Bare soil | PerDate_NDVI023_FULLSTACK | NDVI ≤ T1 (NDVI p10 ≈ 0.23). Date ranking: AOI mean NDVI (min). |
| Per-date | Bare soil | PerDate_NDVI029_FULLSTACK | Mask: NDVI ≤ T2 (NDVI p25 ≈ 0.29). Date ranking: AOI mean NDVI (min). |
| Per-date | Bare soil | PerDate_Bare_NBR2_FULLSTACK | Mask: NDVI ≤ T2 (NDVI p25 ≈ 0.29) + (GVI1 > 0 & GVI2 > 0). Date ranking: AOI mean NBR2 (min). |
| Per-date | Bare soil | PerDate_BSI_FULLSTACK | Mask: NDVI ≤ T2 (NDVI p25 ≈ 0.29). Date ranking: AOI mean BSI (max). |
| Per-date | Driest soil | PerDate_Driest_S2WI_FULLSTACK | Mask: NDVI ≤ T2 (NDVI p25 ≈ 0.29). Date ranking: AOI mean S2WI (min). |
| Per-date | Driest soil | PerDate_Driest_S2WI_NBR2thr_FULLSTACK | Scene filter: AOI mean NBR2 < 0.16; Mask: NDVI ≤ T2 (NDVI p25 ≈ 0.29). Ranking: AOI mean S2WI (min). |
| Scenario/Model, № | Covariates | Algorithm | Variable Selection Approach |
|---|---|---|---|
| 1 | Spectral bands from bare soil mosaic (Sentinel-2) | PLS, RF, KNN, Cubist, SVM | - |
| 2 | Regional covariates | PLS, RF, KNN, Cubist, SVM | RFE, VIF |
| 3 | The best mosaic from scenario/model № 1 + all regional covariates | RF | - |
| 4 | The best mosaic from scenario/model № 1 + regional covariates | RF | RFE, VIF |
| 5 | The best mosaic from scenario/model № 1 + regional covariates | RF | MGFS |
| Parameter | Min, % | Max, % | Mean, % | Median, % | SD, % |
|---|---|---|---|---|---|
| Clay | 23.9 | 78.1 | 56.5 | 59.6 | 11.2 |
| Bare Soil Mosaic | Description | Number of Training Samples | Algorithm | RMSE, % | R2 | RPD |
|---|---|---|---|---|---|---|
| 1 | PerPix_Bare_NDVI023_FULLSTACK_median | 181 | RF | 10.03 | 0.25 | 1.16 |
| 2 | PerPix_Bare_NDVI023_FULLSTACK_median | 181 | SVM | 10.11 | 0.24 | 1.15 |
| 3 | PerPix_Bare_NDVI023_FULLSTACK_median | 181 | Cubist | 10.16 | 0.23 | 1.14 |
| 4 | PerDate_NDVI023_FULLSTACK | 172 | SVM | 10.21 | 0.20 | 1.11 |
| 5 | PerPix_Bare_NBR2_FULLSTACK_median | 182 | SVM | 10.28 | 0.19 | 1.09 |
| Blue (B2) | Green (B3) | Red (B4) | RE1 (B5) | RE2 (B6) | RE3 (B7) | NIR (B8) | RE4 (B8A) | SWIR 1 (B11) | SWIR 2 (B12) |
|---|---|---|---|---|---|---|---|---|---|
| −0.31 *** | −0.29 *** | −0.29 *** | −0.32 *** | −0.34 *** | −0.35 *** | −0.36 *** | −0.35 *** | −0.24 ** | 0.06 |
| Scenario/Model, № | Covariates | Variable Selection Approach | Number of Training Samples | Algorithm | RMSE, % | R2 | RPD |
|---|---|---|---|---|---|---|---|
| 1 | Spectral bands from bare soil mosaic (Sentinel-2) | - | 181 | RF | 10.03 | 0.25 | 1.16 |
| 2 | Regional covariates | RFE, VIF | 181 | RF | 9.77 | 0.27 | 1.18 |
| 3 | The best mosaic from scenario/model № 1 + all regional covariates | - | 181 | RF | 9.45 | 0.33 | 1.24 |
| 4 | The best mosaic from scenario/model № 1 + regional covariates | RFE, VIF | 181 | RF | 9.65 | 0.31 | 1.21 |
| 5 | The best mosaic from scenario/model № 1 + regional covariates | MGFS | 181 | RF | 8.73 | 0.42 | 1.34 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Suleymanov, A.; Kriuchkov, N.; Asylbaev, I.; Suleymanov, R. Optimizing Bare Soil Mosaics for Clay Prediction via Environmental Covariates and Variable Selection. Remote Sens. 2026, 18, 1503. https://doi.org/10.3390/rs18101503
Suleymanov A, Kriuchkov N, Asylbaev I, Suleymanov R. Optimizing Bare Soil Mosaics for Clay Prediction via Environmental Covariates and Variable Selection. Remote Sensing. 2026; 18(10):1503. https://doi.org/10.3390/rs18101503
Chicago/Turabian StyleSuleymanov, Azamat, Nikita Kriuchkov, Ilgiz Asylbaev, and Ruslan Suleymanov. 2026. "Optimizing Bare Soil Mosaics for Clay Prediction via Environmental Covariates and Variable Selection" Remote Sensing 18, no. 10: 1503. https://doi.org/10.3390/rs18101503
APA StyleSuleymanov, A., Kriuchkov, N., Asylbaev, I., & Suleymanov, R. (2026). Optimizing Bare Soil Mosaics for Clay Prediction via Environmental Covariates and Variable Selection. Remote Sensing, 18(10), 1503. https://doi.org/10.3390/rs18101503

