1. Introduction
Soil organic matter (SOM, or soil organic carbon, SOC) is a cornerstone of soil fertility, regulating nutrient cycling, water retention, and microbial activity, which are essential for supporting crop productivity, enabling sustainable farming practices, and ensuring global food security [
1,
2]. Accurate, spatiotemporal monitoring of SOM is critical for guiding sustainable management strategies, yet traditional soil sampling and laboratory analysis techniques, while providing high precision, are costly, labor-intensive, and limited by their point-measurement nature [
3,
4]. Geostatistical methods have been employed to predict the spatial distribution of SOM based on the soil sampling points [
5,
6]. However, the accuracy of these predictions depends on the number and spatial arrangement of sampling points, and in regions with highly heterogeneous soil landscapes, modeling SOM variograms becomes difficult [
7,
8].
In recent years, the rapid advancement of remote sensing (RS) technology has provided powerful methods for predicting SOM. For instance, Demattê, et al. [
9] leveraged time-series aggregation of multitemporal satellite data on Google Earth Engine (GEE) platform to increase bare soil coverage from 0.5% in single scene to 68% across the study area, overcoming spatiotemporal bare soil constraints in RS. Luo, et al. [
10] processed Landsat 8 imagery using multitemporal median composites for SOM retrieval in the Songnen Plain and investigated the temporal effects of images. Furthermore, Ma et al. [
11] developed separate prediction models for SOM in drylands and paddy fields, using an optimal image synthesis method, achieving a coefficient of determination (R
2) of 0.74 for drylands and 0.52 for paddy fields in the Sanjiang Plain. These studies used bare soil images or multitemporal median composites to mitigate the impact of adverse factors—such as soil moisture and surface residues—on RS imagery, thereby enhancing characterization accuracy [
12,
13]. Although soil VisNIR spectra (350–2500 nm) with high spectral resolution measured in laboratory have been measured in many regions, and several regional or global spectral libraries have been established [
14,
15,
16,
17], their application has largely been limited to point-based predictions of soil properties. It remains to be investigated whether fusing RS imagery and soil VisNIR spectra can improve prediction accuracy.
Fusing RS imagery and soil VisNIR spectra to predict soil properties typically involves three steps: (1) extracting spectral variables from the soil VisNIR spectra; (2) spatially mapping these point-based variables to align with RS imagery; and (3) developing a predictive model by combining the spatialized variables with RS bands. In the first step, laboratory-measured soil VisNIR spectra typically have hundreds or thousands of bands due to high spectral resolution. This high dimensionality necessitates that these bands be selected or transformed into spectral variables suitable for fusion with RS imagery. For instance, Peng et al. [
18] identified the 1930 nm wavelength as the most important spectral band for SOC prediction based on a Cubist regression tree model, which was then combined with SPOT5 and Landsat 8 images to improve the prediction accuracy in Denmark’s Skjern River catchment. This fusion approach reduced the root mean square error (RMSE) from 3.6% to 2.8% and increased R
2 from 0.46 to 0.59 compared to using only RS imagery and auxiliary environmental data. However, relying on one or a few wavelengths may fail to capture the full spectral information relevant to SOM/SOC, and the fusion performance can be area-specific. These limitations have driven the need for methods characterizing a full range of soil VisNIR spectra. Shi, et al. [
19] applied principal component analysis (PCA) to derive principal components (PCs) and integrated them with time-sequential Landsat images to build a fusion model for lead prediction in urban topsoil, which achieved an R
2 of 0.55 and an RMSE of 25.84 mg/kg, outperforming the accuracy based solely on RS (R
2 = 0.30, RMSE = 35.47 mg/kg) or VisNIR spectra (R
2 = 0.42, RMSE = 32.03 mg/kg). Similarly, Xu, et al. [
20] combined the PCs from field soil spectra with Gaofen-1 satellite imagery to estimate SOM, with a better concordance correlation coefficient of 0.72 for SOM prediction compared to 0.53 using Gaofen-1 imagery alone. Nevertheless, PCA is an unsupervised method that does not account for target soil properties when extracting PCs. This limitation highlights the need to employ a supervised method, such as partial least square regression (PLSR), to extract spectral variables closely related to SOM.
Spatializing extracted spectral variables from soil VisNIR spectra is necessary for fusing them with RS imagery, with two approaches often used in the literature. One is to employ a digital soil mapping method, which uses a machine learning model to relate the spectral variables to environmental covariates. For instance, Rossel and Chen [
21] mapped the extracted PCs derived from soil VisNIR spectra to a three-arc-second grid across Australia based on 31 climate, topography, vegetation, and geology covariates. Ten-fold cross-validation showed high accuracy for PC1 and PC3 with R
2 of 0.48 and 0.42, but moderate accuracy for PC2 with R
2 of 0.31. Alternatively, the second approach uses a geostatistical method like kriging to map the spectral variables based on their spatial autocorrelation. For example, Peng, et al. [
18] applied an empirical bayesian kriging (EBK) to interpolate the spectral values in 1930 nm, achieving an RMSE of 0.04% and explaining approximately 60% of the variance in the independent validation. Similarly, Xu, et al. [
20] employed ordinary kriging (OK) to spatially interpolate the extracted PCs; however, the interpolation accuracy was not reported. Nevertheless, considering the requirement of spatial stationarity for OK, a low accuracy can be expected in highly heterogeneous regions [
22,
23]. Therefore, the spatial mapping method for the extracted spectral variables should be investigated. Residual kriging (RK), which models the spatial trend using a machine learning method and then interpolates the residuals by kriging, may improve the accuracy of mapped spectral variables. Furthermore, the literature generally used all of the first PCs extracted from soil VisNIR spectra for fusing with RS bands. For example, Shi, et al. [
19] and Xu, et al. [
20] combined the first three (explaining 96.7% spectral variance) and first four PCs (explaining 99.32% spectral variance), respectively, for SOM prediction when fusing with RS bands. A potential issue of using all PCs is that many may lack a strong relationship with SOM. Therefore, the spectral variables should be selected based on their relationship with target soil properties to improve the accuracy fusing RS imagery and soil VisNIR spectra.
To enhance the accuracy of SOM predictions, this study proposes a novel fusion method that integrates measured soil VisNIR spectra with RS imagery. This study focuses on addressing the limitations of characterizing soil VisNIR spectra by a supervised dimensionality reduction of PLSR and spatializing spectral variables by RK. Specifically, the objectives include the following: (1) employing PLSR to extract the spectral variables of soil VisNIR spectra closely related to SOM, which could strengthen the rationale of using laboratory spectra; (2) using RK to enhance the spatialization accuracy of extracted point-based spectral variables, which could decrease the uncertainty in the following fusion modeling; and (3) combing the RS bands and the mapped spectral variables to predict SOM, with the first i and ith variables compared to find a valid fusion strategy. This method proposed may contribute to monitoring soil properties and help for sustainable soil resource management.
2. Materials and Methods
2.1. Study Area
To validate the fusion performance of soil VisNIR spectra and RS imagery, this research was conducted in two contrasting agricultural regions in China: Da’an City, Jilin Province, and Fengqiu County, Henan Province (
Figure 1).
Da’an City is situated in the Western Songnen Plain of Northeast China, with an area of approximately 4879 km
2. The topography is gently sloping, with elevations ranging from 120 to 210 m—higher in the west and lower in the east [
24]. The region experiences a cold temperate continental monsoon climate, with a mean annual temperature of 4.5 °C and annual precipitation of approximately 410 mm, primarily concentrated between June and August. According to the World Reference Base for Soil Resources (WRB), the dominant soil types include Solonetz, Chernozems, and Kastanozems, which often exhibit high salinity and low organic matter content [
25]. Agricultural practices primarily utilize a single-cropping system, with maize (
Zea mays L.) and soybean (
Glycine max L.) as the main crops. Plowing generally begins in late April or early May following the snowmelt period in recent years with increasing adoption of conservation tillage practices to enhance SOM sequestration and mitigate soil degradation. The growing season peaks between July and August, and harvest occurs in late September or early October. Therefore, in Da’an, the months from March to June represent the potential temporal window for bare soil surface detection.
Fengqiu County is located in the North China Plain and features flat alluvial terrain with elevations ranging from 45 to 110 m, covering an area of approximately 1220 km
2. The region has a warm temperate monsoon climate, with a mean annual temperature of 14.2 °C and average annual precipitation of 610 mm, most of which falls during the summer months of July and August. The soils—classified as Fluvisols and Cambisols under the WRB system—are derived from Yellow River alluvial deposits, supporting intensive agricultural production [
26,
27]. The dominant cropping system is a double-cropping rotation of winter wheat (
Triticum aestivum L.) and summer maize [
28]. Generally, wheat is sown in October, and maize is planted in June after the wheat harvest. The peak growing season spans July to August for maize and March to April for wheat. After the wheat harvest, substantial residues are often left on the soil surface, whereas after the maize harvest, only limited corn residues are partially retained. Therefore, compared to Da’an, Fengqiu lacks a distinct temporal window of bare soil exposure. However, the period from September to February after the maize harvest offers a more favorable window for bare soil exposure than the period following the wheat harvest.
2.2. Soil Sampling and Analysis
In Da’an, 100 topsoil samples (0–20 cm depth) were collected in 2023, while 117 samples were collected from Fengqiu in 2014. For both regions, sampling sites (
Figure 1) were selected to ensure spatial representativeness while accounting for field accessibility. Non-agricultural areas such as roads and shelterbelts were avoided to minimize mixed-pixel effects in the subsequent RS analysis. Each soil sample was composited from 3–5 subsamples collected within a 5 m radius of each site. The coordinates of each sampling site were recorded using a sub-meter precision GPS device. Soil samples were transported to the laboratory, where they were air-dried, ground, and passed through a 2-mm sieve to ensure uniformity for subsequent soil spectral analysis and SOM measurement. The SOM content was determined using the potassium dichromate oxidation method [
29].
2.3. VisNIR Spectra Acquisition and Processing
Soil spectral measurements were conducted using an ASD FieldSpec 4 spectrometer (Malvern Panalytical Ltd., Malvern, UK), which covers a wavelength range of 350–2500 nm with a nominal interval of 1 nm. The soil spectra in Fengqiu were sourced from Wang and Pan [
30]. For the Da’an samples, measurements were taken by placing soils in a 5.6-cm diameter tray with a 75 W halogen lamp at a zenith angle of 45° in a dark room. A spectral probe with an 8° field of view was positioned vertically 18 cm above the sample. Each soil sample was rotated 90° between four sequential scans, and the average of these spectra was used as the final representative spectrum. Instrument calibration was performed every 30 min using a Spectralon reference panel.
To mitigate the impact of low signal-to-noise ratios at the spectral extremes, the wavelength regions 350–399 nm and 2401–2500 nm were excluded; consequently, only the 400–2400 nm range was used for model development. Additionally, the Savitzky–Golay smoothing algorithm with a window size of 11 and a polynomial order of 2 was applied to the measured spectra to reduce noise and enhance data quality [
31]. It was conducted in the R package prospectr (version 0.2.8).
2.4. Landsat 8 Image Acquisition and Processing
Landsat 8, launched in 2013, is equipped with the Operational Land Imager and the Thermal Infrared Sensor. It provides multispectral imagery at a 30-m spatial resolution for visible, near-infrared, and shortwave infrared bands, and 100 m for thermal bands, making it suitable for monitoring soil properties. In this study, eight spectral bands were selected for analysis: the surface reflectance bands (SR_B1 to SR_B7) and the thermal band (ST_B10).
To address the limited availability of cloud-free and bare soil observations in agricultural regions, Landsat 8 imagery was composited over multiyear windows. This strategy improves spatial coverage and spectral robustness, as single-date images often provide insufficient usable data. A comprehensive compositing strategy was designed for each study area based on their distinct sampling years, climate, and cropping patterns (
Table 1). The strategy involved creating multiyear composites for different rolling time windows (e.g., 3-year, 4-year) and testing various monthly combinations within the critical agricultural periods for each region to identify the optimal temporal window for bare soil exposure. The agricultural systems differed significantly: Da’an utilized a single-cropping system, whereas Fengqiu practiced double-cropping. Consequently, a wider range of monthly combinations was evaluated in Fengqiu (
n = 21) than in Da’an (
n = 10) to retrieve soil spectral information from RS imagery. The total number of unique year–month combinations was 252 for Fengqiu and 120 for Da’an.
To ensure image quality, the pixels contaminated by clouds and cloud shadows were masked. Median compositing was then applied to the remaining clear-sky pixels from all images within each specified temporal combination to generate cloud-free composites for subsequent analysis. All image processing was conducted on the Google Earth Engine (GEE) platform, which provides efficient access to the data archive and robust geospatial analysis tools [
9,
10].
2.5. The Fusion Method
This study proposes a novel method of fusing soil VisNIR spectra and RS bands to predict SOM (
Figure 2). The procedure consists of three main steps:
Step 1: Extraction of spectral variables from the soil VisNIR spectra. We employed a supervised method, PLSR, to extract spectral variables (i.e., latent variables, LVs), which might have strong correlations with SOM. In this study, the number of LVs was determined arbitrarily by a threshold of 70% for the cumulative variance in SOM explained by the spectra. Furthermore, PCA—a commonly used technique for extracting spectral variables (i.e., PCs) in the literature—was employed as a benchmark to evaluate the performance of PLSR. An equivalent number of PCs and LVs were extracted for a direct comparison.
Step 2: Spatialization of PCs and LVs using RK. For each PC or LV, we first used a random forest (RF) model based on composites of multitemporal Landsat 8 images to obtain the spatial trend. Then, the residuals from the RF model were interpolated using OK based on their spatial autocorrelation. The final spatialized map for each LV or PC was generated by summing the predicted trend surface and the kriged residual map.
Step 3: Establishing fusion models by integrating PCs/LVs and RS bands. We implemented two fusion strategies. The first involved combining the first i PCs/LVs with the RS bands. The second involved selectively combining individual PCs/LVs (ith PC/LV) with the RS bands. This strategy was motivated by the fact that not all PCs/LVs have a strong relationship with SOM.
2.6. Model Calibration and Evaluation
Random forest was selected to calibrate the two types of relationships: (1) between PCs/LVs and RS bands, and (2) between SOM and the combined set of RS bands and PCs/LVs. As an ensemble machine learning algorithm, RF constructs multiple decision trees and aggregates predictions through voting or averaging to increase model accuracy and stability. RF has been successfully and widely used to map soil properties in the literature. In this study, the number of trees was tested using the values 500 and 1000, and the node size was tested from 1 to 10. The number of variables available for splitting at each node (mtry) was tested across all possible integer values from one to the total number of input variables. All these parameters were optimized simultaneously using a grid search approach with cross-validation.
A 10-fold cross-validation approach was employed to evaluate the prediction accuracy. The evaluation metrics included R
2 and RMSE:
where
and
are the observed and predicted SOM values, respectively,
is the mean of the observed values, and
n is the number of samples. A higher R
2 and a lower RMSE indicate better predictive performance of the model for SOM.
3. Results
3.1. Statistics of SOM Contents
The mean SOM content was 16.21 and 16.40 g/kg for Da’an and Fengqiu, respectively. Despite similar means in these two regions, the SOM content in Da’an exhibited greater variability, ranging from 4.34 to 36.00 g/kg and standard deviation (SD) with 7.40 g/kg, compared to Fengqiu’s range of 8.60 to 28.40 g/kg (SD = 3.01 g/kg). This difference in variability was quantified by the coefficient of variation (CV), which was 45.66% for Da’an (indicating moderate variability) and 18.36% for Fengqiu (indicating low variability), as classified by [
32]. The skewness values for Da’an (0.65) and Fengqiu (0.41) indicated that the SOM values for both regions were approximately normally distributed; consequently, no data transformation was applied prior to modeling (
Figure 3).
3.2. Extracting Spectral Variables from VisNIR Spectra
Soils from Da’an demonstrated lower reflectance compared to those from Fengqiu, particularly in the visible region (400–780 nm,
Figure 4) where SOM and iron oxides strongly absorb radiation. The greater variability was demonstrated in soil spectra in Da’an, probably resulting from its higher heterogeneity of soil properties (e.g., SOM content). In contrast, Fengqiu’s spectra showed higher overall reflectance with less variability.
We employed both PLSR and PCA to extract spectral variables for subsequent fusion with RS bands. For Da’an, the first six LVs derived from PLSR collectively explained 70.64% of the variance in SOM, with individual contributions of LV1 to LV6 being 30.77%, 10.69%, 9.91%, 3.89%, 2.23%, and 13.15%, respectively. These same LVs accounted for 99.97% of the variance in the soil spectra. For Fengqiu, the first five LVs explained 74.08% of the variance in SOM, with respective contributions of 20.03%, 5.49%, 16.04%, 23.06%, and 9.46% for LV1 to LV5, while capturing 99.94% of the spectral variance. For comparison, an equivalent number of PCs were extracted using PCA. The first six PCs explained 99.98% variance of soil spectra for Da’an, while the first five PCs explained 99.95% variance of soil spectra for Fengqiu.
The correlations between PCs/LVs with SOM are presented in
Figure 5. For Da’an, both PC1 and LV1 showed the strongest correlations with SOM (r = −0.53 and r = 0.55, respectively). While the correlations of subsequent LVs generally decreased with component order, LV6 showed an increased correlation (r = 0.36). A similar pattern was observed for PCs, with PC1, PC2, PC3, and PC5 following comparable trends to LVs. However, PC4 (r = −0.08) and PC6 (r = 0.02) demonstrated substantially weaker correlations compared to LV4 (r = 0.20) and LV6 (r = 0.36). In Fengqiu, LV1, LV3, and LV4 exhibited relatively strong and consistent correlations with SOM (ranging from 0.40 to 0.48), while LV5 showed a moderate correlation (r = 0.31) and LV2 demonstrated the weakest relationship (r = 0.23). The correlation patterns for PCs differed notably from LVs: while PC1 and PC2 showed similar correlations to LVs, PC3 (r = −0.05) and PC4 (r = 0.35) were substantially lower than LV3 (r = 0.40) and LV4 (r = 0.48). Interestingly, PC5 (r = 0.50) exceeded the correlation of LV5 (r = 0.31).
3.3. Spatialization of PCs and LVs
The spatialization of PCs and LVs was performed using RK, which was tested based on all composite images with unique year–month combinations (
Table 1). For comparison, we applied OK for spatializing PCs and LVs, which was commonly used in the literature.
Table 2 presents the optimal results selected from all of the year–month combinations based cross-validation performance. The results show that RK consistently yielded higher spatialization accuracy than OK in both areas. Substantial improvements were observed particularly for intermediate PCs/LVs: in Da’an, the R
2 values for PC2/LV2 increased from 0.18/0.21 with OK to 0.47/0.50 with RK, while PC3/LV3 showed remarkable improvement from −0.03/−0.01 to 0.44/0.44. Similarly, in Fengqiu, the R
2 for PC2/LV2 increased from 0.32/0.39 to 0.52/0.54.
Despite these overall improvements, certain PC/LV maintained relatively low spatialization accuracy even with RK. Specifically, PC6/LV6 in Da’an achieved R
2 values of 0.17/0.06, and PC1/LV1 in Fengqiu remained low at 0.18/0.23. Notably, PC1/LV1, despite exhibiting the strongest correlation with SOM content, showed limited spatialization accuracy, with R
2 values improving from 0.14/0.14 to 0.36/0.33 in Da’an and from 0.09/0.05 to 0.18/0.23 in Fengqiu. These limitations may consequently affect the performance of subsequent fusion models. The maps of all PCs and LVs for both areas are presented in
Figure A1 and
Figure A2.
3.4. SOM Predictions Using the Fusion Method
We evaluated two fusion strategies (four variants), i.e., RS + first
i PCs/LVs and RS +
ith PC/LV (i = 6 for Da’an and = 5 for Fengqiu). For each multiyear window, the optimal spatialization of PCs/LVs selected among all available monthly combinations were fused with their corresponding RS composite images across all unique year–month combinations (
Table 1). The significance of differences in mean R
2 and RMSE between the RS method (using RS bands alone) and the four fusion variants was assessed using one-way ANOVA with post-hoc tests at a 5% significance level. The results are presented in
Figure 6 and
Table 3.
In Da’an, both fusion strategies significantly improved the prediction performance of SOM compared to the RS method (mean R
2 = 0.28, RMSE = 6.23 g/kg). The RS + first
i LVs method achieved the highest accuracy, with a mean R
2 of 0.39 and RMSE of 5.76 g/kg, representing a 39.3% improvement in R
2 and a 7.5% reduction in RMSE. This was closely followed by RS + first
i PCs (mean R
2 = 0.37, and RMSE = 5.85 g/kg). Among the results of four fusion variants—RS + first
i PCs, RS +
ith PC, RS + first
i LVs, and RS +
ith LV, statistical analysis indicated that RS + first
i LVs significantly outperformed RS +
ith PC (
p < 0.05), though no other pairwise differences reached significance. However, LV-based methods demonstrated greater predictive stability than PC-based methods, as reflected by lower SD in both R
2 and RMSE values (
Table 3).
In Fengqiu, the fusion method also enhanced the prediction accuracy of SOM compared to the RS method (mean R2 = 0.10, and RMSE = 2.84 g/kg), though the improvements were more modest than those observed in Da’an. The RS + first i PCs method yielded the highest mean R2 (0.21) and lowest RMSE (2.67 g/kg), representing a 110% improvement in R2 and a 6.0% reduction in RMSE. The RS + first i LVs method performed similarly (mean R2 = 0.20, and RMSE = 2.67 g/kg). No significant differences were observed between RS + first i PCs and RS + first i LVs or between RS + ith PC and RS + ith LV, while the RS + first i PCs method was better than the RS + ith PC method and the RS + first i LVs method outperformed the RS + ith LV method. Meanwhile, the fusion strategy employing multiple variables (first i PCs/LVs) consistently outperformed that using a single variable (ith PC/LV), highlighting the importance of incorporating comprehensive spectral information for SOM prediction in this region. It should be noted that the SD of R2 and RMSE for LV-based methods were similar to that for PC-based methods.
For both study areas, the fusion method achieved substantially higher peak performance compared to the RS method (
Table 3). The RS + first
i PCs method yielded the most substantial improvement, reaching a maximum R
2 of 0.57 and minimum RMSE of 4.82 g/kg, indicating a 5.6% increase in predictive accuracy and a 3.2% reduction in error compared to the best result of the RS method (R
2 = 0.54, and RMSE = 4.98 g/kg). The LV-based fusion method (RS +
ith LV) also demonstrated strong performance with maximum R
2 of 0.55 and minimum RMSE of 4.94 g/kg. In Fengqiu, both RS + first
i LVs and RS +
ith LV achieved the highest R
2 value of 0.40, representing a 14.3% improvement over the maximum R
2 of 0.35 for the RS method. In addition, the RS + first
i LVs method produced the smallest RMSE (2.31 g/kg), a 4.5% reduction from the RS method’s minimum of 2.42 g/kg. These improvements are visually corroborated by the scatter plots in
Figure 7, which shows reduced dispersion and tighter clustering along the 1:1 line for fusion-based predictions compared to the results of the RS method. Therefore, these results demonstrate that the fusion method not only improves average accuracy but also enhances the best-case predictive performance across temporal combinations.
3.5. Temporal Stability of Fusion Method
We further investigated the temporal stability of the fusion method’s performance across different multiyear windows, a critical consideration for scenarios where RS imagery availability may be limited in certain years. For each window, the optimal SOM prediction results from all available monthly combinations are shown for the RS method and the fusion method with four variants (RS + first
i PCs, RS +
ith PC, RS + first
i LVs, and RS +
ith LV) in
Figure 8. The results, in terms of R
2 and RMSE, clearly demonstrate that the fusion method consistently enhanced prediction accuracy compared to the RS method in both Da’an and Fengqiu.
In Da’an, the RS method showed moderate but variable performance with a mean R2 of 0.41 (range: 0.30–0.54) and a mean RMSE of 5.63 g/kg (range: 4.98–6.15 g/kg). Both PC-based fusion methods significantly improved predictive accuracy, compared to the RS method. The RS + first i PCs method achieved a higher mean R2 of 0.46 (range: 0.28–0.57) and a lower mean RMSE of 5.41 g/kg (range: 4.82–6.23 g/kg). Notably, it achieved the highest performance in the 2018–2022 window (R2 = 0.57, and RMSE = 4.82 g/kg). Similarly, the RS + ith PC method yielded a mean R2 of 0.46 (range: 0.35–0.56) and a mean RMSE of 5.39 g/kg (range: 4.90–5.93 g/kg), with peak performance in the 2019–2024 window (R2 = 0.56, and RMSE = 4.90 g/kg). The LV-based fusion methods also enhanced predictions and exhibited greater temporal stability. The RS + first i LVs method produced a mean R2 of 0.47 (range: 0.41–0.54) and a mean RMSE of 5.38 g/kg (range: 5.02–5.68 g/kg), with the best performance in the 2018–2023 window (R2 = 0.54, and RMSE = 5.02 g/kg). The RS + ith LV method resulted in a mean R2 of 0.46 (range: 0.40–0.55) and a mean RMSE of 5.41 g/kg (range: 4.94–5.73 g/kg), peaking in the 2018–2023 window (R2 = 0.55, and RMSE = 4.94 g/kg). The lower variability of R2 and RMSE for LV-based methods (e.g., SD of R2 = 0.04 for RS + first i LVs) confirm their superior stability across years compared to PC-based and RS methods.
In Fengqiu, the RS method performed poorly and inconsistently across multiyear windows, with a mean R2 of 0.24 (range: 0.17–0.35) and a mean RMSE of 2.61 g/kg (range: 2.42–2.73 g/kg). All fusion variants substantially improved accuracy. The PC-based fusion methods increased the mean R2 to 0.30 (RS + first i PCs; range: 0.24–0.38) and 0.29 (RS + ith PC; range: 0.24–0.37), while reducing the mean RMSE to 2.51 g/kg (range: 2.36–2.61 g/kg) and 2.53 g/kg (range: 2.37–2.62 g/kg), respectively. The RS + first i PCs method achieved the highest R2 (0.38) and the lowest RMSE (2.36 g/kg) in the 2013–2015 window, and the RS + ith PC method achieved a mean R2 (0.37) and RMSE (2.37 g/kg) in the same window. The LV-based fusion methods also showed notable improvements. The RS + first i LVs method attained a mean R2 of 0.29 (range: 0.22–0.40) and a mean RMSE of 2.52 g/kg (range: 2.31–2.64 g/kg), with optimal performance in the 2013–2015 window (R2 = 0.40, and RMSE = 2.31 g/kg). The RS + ith LV method achieved a mean R2 of 0.29 (range: 0.21–0.40) and a mean RMSE of 2.53 g/kg (range: 2.32–2.66 g/kg), excelling in the 2013–2015 window (R2 = 0.40, and RMSE = 2.32 g/kg).
3.6. SOM Distributions in Two Areas
The SOM maps in Da’an and Fengqiu (
Figure 9) were generated by the optimal models of the RS method, the RS + PC method, and the RS + LV method. These models were selected through 10-fold cross-validation across all temporal combinations based on the highest validation R
2 and low RMSE values. Their corresponding RF parameters are listed in
Table A1. In Da’an, overall, all methods consistently revealed a distinct east-high, west-low pattern in SOM (
Figure 9a–c). This pattern is primarily driven by land use. High salinity in the Western areas led to lower SOM. In contrast, the central parts show moderately higher SOM content due to cultivation. The highest SOM values are consistently predicted along the northwestern and eastern boundaries, which were mainly characterized by high-quality drylands and paddy fields, respectively. Compared to the RS method, the RS + PC fusion result exhibits a larger spatial extent of low-value SOM patches in the mid-Western and Southwestern regions. The RS + LV fusion method shows a slight expansion in the range of high-value regions, particularly evident along the Northern and Eastern boundaries. This enhancement might better capture the expected high SOM in the paddy fields.
In Fengqiu, the resulting maps show a similar spatial pattern of SOM across the different methods. The high SOM values are predominantly located in the Northeastern and south–central sectors of the county (
Figure 9d–f). This pattern can be attributed to agricultural practices and proximity to branches of the Yellow River. Conversely, the lowest SOM values are found along the Yellow River in the south and southeastern areas. These areas can be affected by river flooding. Compared to the RS method, the RS + PC result sharpens the spatial contrasts. It shows an expanded area of low-value SOM in the central–northern region. The RS + LV method shows noticeably higher SOM levels in the Central and Northern regions. The spatial extent of the low-value area in the mid-western region is intermediate between those predicted by the RS method and the RS + PC method, suggesting a more balanced and potentially more accurate representation of the spatial distribution.