1. Introduction
Soil organic matter (SOM) constitutes a fundamental component of terrestrial ecosystems, regulating soil fertility, carbon sequestration, nutrient retention, and structural stability. However, SOM is chemically heterogeneous, comprising distinct fractions with varying stability, reactivity, and environmental functions. These fractions can be operationally classified into humic acid (HA), fulvic acid (FA), and Humin, each exhibiting unique molecular characteristics and ecological roles [
1]. Humic acid, characterized by higher molecular weight and aromatic content, contributes significantly to soil buffering capacity and heavy metal complexation. Fulvic acid, with smaller molecular weight and greater solubility, serves as a mobile carrier for carbon and nitrogen transport in soil solutions. Humin, the most recalcitrant fraction, remains strongly bound to mineral surfaces and represents a major long-term carbon sink [
2]. This functional differentiation within SOM directly influences soil physicochemical processes, microbial dynamics, and the overall carbon balance of terrestrial ecosystems [
3,
4].
Traditional wet chemical methods for quantifying SOM and its fractions face substantial limitations that constrain their application in contemporary research. Spectroscopic techniques offer promising alternatives for SOM quantification by exploiting the distinct absorption and scattering properties of soil constituents across the visible–near-infrared (vis–NIR) and mid-infrared (MIR) regions. In vis–NIR spectroscopy, absorption features arise from electronic transitions and overtones of molecular vibrations associated with organic matter, clay minerals, iron oxides, and carbonates [
5]. Numerous studies have demonstrated that vis–NIR can predict soil organic carbon (SOC) with sufficient accuracy for rapid field screening applications [
6,
7]. Recent work has extended this capability to humus fractions, with reports indicating satisfactory prediction of FA (R
2 = 0.73), HA (R
2 = 0.56) and Humin (R
2 = 0.83) using vis–NIR combined with machine learning algorithms [
8,
9]. Mid-infrared spectroscopy, which captures fundamental molecular vibrations of functional groups, has shown robust performance in predicting SOC and its fractions across large spectral libraries and diverse soil types [
10,
11]. Comparative analyses suggest that MIR often outperforms other spectral ranges for multiple soil properties and can effectively reveal compositional differences among carbon fractions [
12].
Multi-sensor spectral fusion has emerged as a strategy to overcome the limitations of single-domain spectroscopy by integrating complementary information from different spectral regions. The rationale underlying fusion approaches is that vis–NIR captures electronic transitions and overtone-related features, whereas MIR provides direct information on molecular vibrations and functional groups. By combining these complementary sensitivities, fusion methods theoretically offer more comprehensive representations of soil composition [
5,
13]. However, despite the conceptual appeal, empirical evidence regarding the efficacy of spectral fusion remains inconsistent and often contradictory. While some studies report substantial improvements in SOC prediction accuracy through vis–NIR–MIR fusion under controlled laboratory conditions [
14,
15], others observe limited benefits or even decreased accuracy in soil classification tasks, attributed to overfitting and increased model complexity [
16]. Moreover, the majority of fusion studies have focused on bulk SOM or total organic carbon, with little attention to individual SOM fractions and the mechanisms governing their spectral behavior.
Despite these advances, several critical knowledge gaps persist. First, it remains unclear which specific SOM fraction predominantly governs the spectral predictability of total SOM. For instance, Humin has been hypothesized to display more stable and soil-type–independent spectral responses than other SOM fractions due to its strong mineral associations [
17]. However, to our knowledge, direct empirical evidence demonstrating consistent spectral behavior that can be unequivocally attributed to Humin remains scarce. Second, the existing fusion literature has focused almost exclusively on bulk SOM or total organic carbon, with minimal consideration of individual SOM fractions such as Humin, humic acid, or fulvic acid. This lack of mechanistic understanding and fraction-level evaluation represents a critical knowledge gap and highlights the need for studies that explicitly test whether fusion genuinely enhances the predictability of distinct SOM components rather than merely increasing spectral dimensionality. Third, the underlying mechanisms that explain why fusion sometimes improves or degrades prediction accuracy are poorly understood. Addressing these gaps is essential for developing mechanistically informed spectroscopic methods for soil carbon monitoring and for identifying the structural and chemical determinants of SOM spectral signatures.
This study was designed to address these research gaps through three specific objectives: (1) to compare the predictive performance of vis–NIR, MIR, and their fusion for SOM and its fractions (HA, FA, Humin) using both full spectra and variable-importance-based wavelength selection, (2) to identify which SOM component predominantly determines the overall predictability of SOM, and (3) to elucidate the mechanisms underlying changes in prediction accuracy resulting from vis–NIR–MIR spectral fusion.
Figure 1 illustrates the overall research framework and analytical workflow employed in this investigation.
2. Materials and Methods
2.1. Study Area and Soil Sampling
The study area is situated in the Yangzi region of Nanchang City, Jiangxi Province, southeastern China, within the Gan River floodplain of the middle Yangtze River Basin (
Figure 2). The region is characterized by a flat terrain with minimal topographic relief, and soils are predominantly classified as Fluvisols, developed from Quaternary alluvial and fluvial deposits, with local occurrences of Cambisols and Gleysols on poorly drained areas, and Anthrosols formed under long-term intensive cultivation. In areas influenced by weathered red clay and sandstone materials, Acrisols/Alisols are also present. The key land use type from which samples were taken is cropland. The climate is humid subtropical monsoon, featuring hot and humid summers, mild winters, and abundant precipitation concentrated in spring and early summer. The study area represents a major peri-urban agricultural zone supplying a substantial proportion of Jiangxi Province’s fresh vegetable production, with intensive long-term cultivation history and high cropping intensity. These characteristics make the region an important representative site for investigating soil fertility dynamics, organic matter turnover, and sustainable soil management practices in the middle Yangtze River basin.
Soil sampling was conducted triennially beginning in 2019. Soil samples were collected following a stratified sampling design to capture local spatial variability. A total of 93 samples were taken from the topsoil (0–20 cm) across the study area. Each sampling point was georeferenced using a handheld GPS (±3 m accuracy), and three subsamples collected within a 2 m radius were composited to reduce small-scale microsite variability. Sampling locations were selected to represent contrasting land use types and soil conditions, thereby incorporating heterogeneity in texture, mineralogy, and organic matter content. All samples were air-dried at room temperature, ground to pass through a 2 mm sieve, and stored in sealed containers prior to chemical analysis and spectroscopic measurements.
2.2. Chemical Analysis
Soil organic matter content was determined using the potassium dichromate-sulfuric acid external heating method in accordance with Chinese agricultural standard NY/T 1121.6-2006 [
18]. Organic carbon obtained by dichromate oxidation was converted to SOM using the conventional conversion factor of 1.724. Humus fractions, including HA, FA, and Humin, were determined through alkaline extraction and acid fractionation following Chinese forestry standard [
19]. These operationally defined fractions represent chemically distinct pools with different environmental stabilities and functions.
2.3. Spectroscopic Measurement and Preprocessing
Visible–near-infrared spectra were acquired using a FieldSpec ProFR spectrometer (Analytical Spectral Devices, Boulder, CO, USA), covering the wavelength range of 350–2500 nm, with spectral resolutions of 3 nm (350–1000 nm) and 10 nm (1000–2500 nm). Before spectral acquisition, the instrument was allowed to warm up for 30 min to stabilize the detector. Prior to each measurement, the spectrometer was calibrated using a Spectralon panel with 99 percent reflectance. For each soil sample, thirty spectral measurements were collected at three random positions with ten internal replicates per position, and the average spectrum was used as the final representative spectrum. Spectra from wavelength regions 350–399 nm and 2451–2500 nm were excluded due to low signal-to-noise ratios, retaining the 400–2400 nm range for analysis.
Mid-infrared spectra were measured using an Agilent 4300 Handheld FTIR spectrometer (Agilent Technologies, Santa Clara, CA, USA) with a spectral range of 650–4000 cm−1. Sample preparation and measurement protocols were identical to those employed for vis–NIR spectroscopy to ensure consistency. Both vis–NIR reflectance (R) and MIR absorbance spectra were transformed using log(1/R) to approximate Beer–Lambert behavior and improve linearity with respect to analyte concentration. To reduce high-frequency instrumental noise, Savitzky–Golay smoothing with a second-order polynomial and an 11-point window was applied as the first preprocessing step. A first-derivative transformation (Savitzky–Golay, second-order polynomial) was then used to correct baseline shifts and enhance subtle absorption features relevant to SOM fractions. Spectra were subsequently normalized using standard normal variate (SNV) to minimize scattering effects associated with particle size and soil texture. Finally, noisy and low-signal regions at the edges of the detectors (e.g., 350–400 nm and >2450 nm in vis–NIR) were excluded. All spectra were resampled to uniform 10 nm intervals to facilitate spectral integration and fusion.
2.4. Predictive Modeling
Partial least squares regression (PLSR) was employed as the primary predictive modeling approach. PLSR is a widely used multivariate statistical method that projects both predictor variables (spectral reflectance) and response variables (soil properties) into a new latent space, identifying directions in the predictor space that maximize covariance with the response [
20]. This approach is particularly well-suited for spectroscopic data characterized by high dimensionality, multicollinearity, and high noise-to-signal ratios. The optimal number of latent variables was determined via leave-one-out cross-validation, with the component corresponding to the lowest RMSECV selected as the final model.
To identify the most informative spectral wavelengths and reduce model complexity, the least absolute shrinkage and selection operator (LASSO) was applied for variable selection. LASSO introduces an L1 penalty term that shrinks regression coefficients and performs automatic feature selection by forcing less important coefficients to exactly zero [
21,
22]. The LASSO objective function is defined as follows:
where y represents the target variable (SOM or its fractions), X denotes the spectral matrix, β is the coefficient vector, and λ is the regularization parameter controlling the degree of sparsity. LASSO hyperparameters were tuned using a log-spaced λ grid and 10-fold cross-validation with fixed random seeds to ensure reproducibility. The optimal λ was selected using the one-standard-error rule. All analyses were conducted using the glmnet package with version-controlled scripts. Selected wavelengths identified by LASSO were subsequently used to build PLSR models for comparison with full-spectrum approaches.
Spectral fusion was implemented using direct concatenation of vis–NIR and MIR spectra, creating a combined spectral matrix for model calibration. Both full-spectrum fusion and LASSO-based variable selection from the fused spectra were evaluated. This approach allows comparison of information gain from fusion relative to single-domain spectroscopy. In addition, advanced fusion approaches were not tested because our study aimed to establish a baseline assessment under limited sample size, where complex fusion methods would likely overfit and exceed the scope of this initial investigation (
Table 1).
2.5. Spectral Multicollinearity Diagnosis
Singular value decomposition (SVD) was applied to the spectral matrix X to quantify spectral redundancy and evaluate multicollinearity. The SVD factorization is expressed as X = UΣV^T, where Σ is a diagonal matrix of singular values arranged in descending order. The condition number, defined as the ratio of the largest to smallest singular value (σ
1/σ
n), indicates the degree of multicollinearity, with values exceeding 30 suggesting strong redundancy and potential regression instability [
20]. Cumulative singular value energy was calculated as the proportion of total spectral variance explained by the first k singular values [
28], providing a quantitative measure of effective information dimensionality.
2.6. Model Evaluation Metrics
Model performance was assessed using multiple complementary metrics. The coefficient of determination (R2) quantifies the proportion of variance in the observed values explained by the model. Root mean square error (RMSE) and mean absolute error (MAE) provide measures of absolute prediction error in the same units as the target variable. The ratio of performance to interquartile range (RPIQ), calculated as (Q3 − Q1)/RMSE, offers a standardized performance metric less sensitive to extreme values than traditional RPD. Concordance correlation coefficient (CCC) evaluates both precision and accuracy by measuring agreement between observed and predicted values along the 1:1 line, combining correlation and bias components. Bias quantifies systematic over- or under-prediction. The Kennard–Stone (KS) algorithm was used to split the data into calibration (70 percent) and independent validation (30 percent) subsets to assess model generalizability.
All analyses were conducted in R (version 4.4.1) using the packages prospectr (v0.3.2), glmnet (v4.1-8), and pls (v2.8-2).
4. Discussion
4.1. Humin Fraction Determines SOM Spectral Predictability
This study provides compelling evidence that the spectroscopic predictability of total SOM is fundamentally determined by the Humin fraction rather than by humic or fulvic acids. Both vis–NIR and MIR spectroscopy achieved high prediction accuracy for SOM and Humin (R
2 = 0.79–0.93, CCC = 0.85–0.95); the prediction accuracy achieved in this study for SOM is broadly comparable to the best-performing results reported in the past decade of soil spectroscopy research (
Table 7), whereas FA remained unpredictable (R
2 < 0.24) and HA showed only moderate predictability (R
2 = 0.37–0.39). This differential predictability arises from the distinct chemical and spectroscopic properties of these fractions. Humin contains abundant and spectrally active functional groups, including aromatic and aliphatic C–H, C=O, and C=C moieties, which produce strong and characteristic absorption features in both the vis–NIR and MIR regions [
29]. The large number of characteristic wavelengths selected for Humin (
Figure 3) and the high regression coefficient amplitudes (
Figure 4,
Figure 6 and
Figure 8) confirm its strong spectral signature, indicating a stronger and more coherent spectral signal for Humin and helping to explain its superior predictability.
Equally important, Humin constitutes approximately 40–69 percent of total SOM in the study samples (
Table 2), making it the dominant fraction by mass. This high proportional abundance means that variations in Humin content directly translate into variations in total SOM and that the strong spectral predictability of Humin is effectively transferred to bulk SOM. The structural stability of Humin, arising from strong associations with Fe/Al oxides and clay minerals [
30], further contributes to its consistent spectral behavior across samples. These findings strongly support the conclusion that accurate quantitative estimation of SOM depends primarily on the predictability and relative contribution of the Humin fraction [
31], rather than on the summed contributions of all fractions.
4.2. Chemical Characteristics Explain Fulvic Acid Unpredictability
The poor spectroscopic predictability of fulvic acid observed in this study can be attributed to its unique chemical and structural characteristics. Fulvic acid exhibits low molecular weight, high polarity, and a diffuse distribution of functional groups, resulting in weak and broad infrared absorption bands that are easily masked by overlapping signals from water, clay minerals, and other soil constituents [
32]. The small number of characteristic wavelengths selected for FA (
Figure 3) and the low regression coefficient amplitudes (
Figure 4,
Figure 6 and
Figure 8) provide empirical evidence for its weak spectral signature. Moreover, FA represents a relatively small (mean approximately 3 g kg
−1) and chemically unstable fraction within total SOM (
Table 2), further diminishing its spectral detectability and statistical predictability.
Comparative studies support this interpretation. Although both HA and FA contain aliphatic and aromatic structures, HA is more chemically condensed, exhibits stronger absorption features, and demonstrates greater structural stability [
32]. Fulvic acid, by contrast, shows weaker absorption, more dispersed functional groups, and lower resistance to microbial decomposition [
9]. These intrinsic chemical differences fundamentally limit the capacity of vis–NIR and MIR spectroscopy to quantify FA, regardless of the modeling approach or wavelength selection strategy. Alternative analytical techniques, such as fluorescence spectroscopy or nuclear magnetic resonance, may be more suitable for characterizing FA due to their sensitivity to specific structural and electronic properties.
Table 7.
Summary of representative SOM/SOC spectroscopy fusion studies.
Table 7.
Summary of representative SOM/SOC spectroscopy fusion studies.
| Research | Study Area | Soil Type | Spectral Range | Model(s) | SOC/SOM R2 | Fractions Included |
|---|
| [15] | China | Paddy and upland soils | vis–NIR + MIR | PLSR, SVM | 0.78–0.90 | No |
| [23] | China | Red soils, paddy soils | vis–NIR + MIR | RF, Cubist | 0.60–0.80 | No |
| [24] | Australia | Mixed agricultural soils | vis–NIR + MIR | PLSR, SVM | 0.70–0.85 | No |
| [14] | China | Paddy soils | vis–NIR + MIR | PLSR | 0.85–0.86 | No |
| [21] | China | Red, black, calcareous soils | MIR | PLSR | 0.75–0.88 | No |
| [33] | Germany | Loess, Cambisols | vis–NIR + MIR | PLSR | 0.60–0.75 | No |
| [26] | Brazil | Oxisols, Ferralsols | vis–NIR + MIR | Ensemble, Stacking | 0.80–0.90 | No |
| [22] | China | Mixed farmland soils | vis–NIR | CNN, PLSR | 0.78–0.86 | No |
| [27] | Egypt | Arid sandy/clay soils | vis–NIR + MIR | PLSR, RF, SVR | ~0.85 | No |
4.3. Spectral Fusion Introduces Redundancy Without Information Gain
The failure of vis–NIR–MIR fusion to enhance prediction accuracy in this study contrasts with some previous reports but aligns with theoretical expectations when individual spectral regions already contain comprehensive information. Singular value decomposition analysis (
Figure 10) revealed that direct concatenation of vis–NIR and MIR spectra did not substantially increase the effective information dimensionality. Instead, fusion introduced spectral redundancy because both regions captured overlapping compositional information, albeit through different physical mechanisms (electronic transitions in vis–NIR versus molecular vibrations in MIR). The reduction in the total number of selected wavelengths after fusion compared to separate regional models (
Table 6) confirms this redundancy.
Additionally, direct concatenation created scale imbalance between spectral regions due to differences in measurement units (reflectance versus absorbance), wavelength density, and signal amplitude. The dominance of MIR-derived wavelengths in fused models (approximately 80 percent of selected features from MIR;
Table 6) indicates that the MIR signal effectively overwhelmed the vis–NIR contribution, preventing complementary information integration. This scale imbalance can introduce noise and degrade model performance, particularly when one spectral region is already sufficiently informative. Advanced fusion strategies, such as hierarchical modeling, weighted concatenation, or deep learning architectures that learn optimal region-specific transformations, may overcome these limitations by explicitly addressing scale and redundancy issues [
13]. However, for the soil samples and target properties examined in this study, single-domain spectroscopy (particularly MIR with wavelength selection) provided optimal prediction accuracy with reduced model complexity.
4.4. Implications for Soil Carbon Monitoring
The findings of this study have important implications for spectroscopic soil carbon monitoring programs and SOM research. First, the strong predictability of SOM and Humin using either vis–NIR or MIR spectroscopy confirms the viability of rapid, non-destructive SOM quantification across diverse agricultural and environmental applications. Mid-infrared spectroscopy with LASSO-based wavelength selection emerged as the most accurate approach (validation R2 up to 0.93 for Humin), suggesting that portable MIR sensors could enable cost-effective field-scale SOM assessment. Second, the component-specific predictability patterns identified here underscore the need to shift from bulk SOM estimation toward fraction-specific modeling approaches that explicitly account for the compositional heterogeneity and differential spectral behaviors of SOM pools.
Third, the central role of the Humin fraction in determining SOM predictability suggests that future spectroscopic studies should prioritize mechanistic investigations of Humin-specific spectral features and their relationships to long-term carbon stabilization processes. Understanding which functional groups and mineral associations within Humin generate the strongest spectral signals could inform targeted wavelength selection and improve model interpretability. Finally, the limited benefits of spectral fusion observed here indicate that resources may be better allocated toward optimizing single-domain spectroscopy, expanding spectral libraries, and developing robust calibration transfer methods rather than pursuing multi-sensor fusion for routine SOM monitoring applications.
4.5. Study Limitations and Future Directions
Several limitations should be acknowledged. The moderate sample size (n = 93) and geographic restriction to subtropical croplands in southeastern China limit the generalizability of the findings to other soil types, climatic zones, and land use systems. Validation with larger, more diverse datasets encompassing contrasting soil parent materials, textures, and management histories is essential to confirm whether the Humin-driven predictability of SOM represents a universal phenomenon or is context-specific. Additionally, the operational fractionation scheme employed (alkaline extraction and acid fractionation) provides chemically defined pools that may not directly correspond to ecologically meaningful SOM fractions defined by turnover rates or functional roles. Complementary studies integrating spectroscopy with advanced characterization techniques such as solid-state nuclear magnetic resonance, pyrolysis–gas chromatography–mass spectrometry, or synchrotron-based X-ray spectroscopy could elucidate the specific molecular structures responsible for strong Humin spectral signatures. Future research should also explore alternative fusion strategies, including hierarchical ensemble modeling and deep learning architectures, to determine whether more sophisticated integration methods can overcome the redundancy and scale imbalance limitations identified in this study. In addition, comprehensive uncertainty quantification will be addressed in future work.
5. Conclusions
This study systematically evaluated the capability of visible–near-infrared, mid-infrared, and fused spectroscopy for predicting soil organic matter and its chemically defined fractions. The major conclusions are as follows. First, both vis–NIR and MIR spectroscopy were able to predict soil organic matter and the Humin fraction, with MIR showing the best quantitative agreement with reference measurements (R2 up to 0.93); fulvic acid cannot be reliably quantified by any spectroscopic approach due to weak spectral signatures arising from low molecular weight, diffuse functional group distribution, and small fractional abundance. Second, the reliable predictability of total SOM derives primarily from the strong spectral response and high proportional contribution of the Humin fraction, which comprises approximately 50 percent of SOM and exhibits abundant spectrally active functional groups. Third, direct fusion of vis–NIR and MIR spectra does not enhance prediction accuracy because both spectral regions already contain comprehensive, overlapping information, and fusion introduces redundancy and scale imbalance without increasing effective dimensionality.
These findings establish that accurate spectroscopic estimation of SOM fundamentally depends on the predictability and relative abundance of the Humin fraction, providing new mechanistic understanding of soil carbon monitoring applications. Future research should validate these results across diverse soil types and climatic regions, investigate the specific molecular structures within Humin responsible for strong spectral signals, and develop component-specific calibration models that explicitly account for SOM compositional heterogeneity. For routine soil organic matter assessment, optimized single-domain spectroscopy (particularly MIR with variable selection) offers greater accuracy and simplicity than multi-sensor fusion approaches.