1. Introduction
Understanding vegetation–soil interactions is critical for assessing ecosystem stability, as they regulate water retention, nutrient cycling, and soil conservation [
1]. However, these interactions are increasingly under pressure due to climate change and human activities, which exacerbate land degradation, especially in subtropical and semi-arid regions [
2]. Consequently, vegetation–soil interactions become more dynamic and complex, making it essential to comprehend them for effective land restoration and conservation [
3].
Global studies highlighted the bidirectional feedback between vegetation and soil interactions [
4,
5,
6]. Vegetation relies on soil for nutrients and water, while its growth alters soil structure, fertility, and chemistry [
7,
8]. This interaction is particularly critical in the context of land degradation and erosion. Recent research has shifted from static descriptions to process-driven approaches, emphasizing feedback mechanisms shaped by climate and land use [
9,
10,
11]. Different vegetation and soil types play distinct roles in degradation processes [
12,
13,
14]. For instance, grasslands in arid regions improve soil moisture retention, while forests sustain soil fertility and organic matter accumulation. In subtropical and tropical zones, deforestation, agriculture, and over-exploitation accelerate erosion, degrade soil structure, and reduce biodiversity [
15,
16]. In subtropical regions, acidification and salinization often hinder vegetation recovery, while in arid regions, water and nutrient scarcity limit plant growth and soil restoration [
17]. These differences underscore the need for region-specific management strategies.
The application of remote sensing technology has greatly advanced research on vegetation–soil interactions [
18,
19,
20,
21]. As spatial resolution improves and sensor technologies advance, scientists can now monitor vegetation dynamics and soil conditions over large spatial scales with greater precision [
22,
23,
24]. The integration of multispectral remote sensing, Light Detection and Ranging (LiDAR), thermal infrared technologies, and Synthetic Aperture Radar (SAR) systems, such as Sentinel-1, plays a crucial role in monitoring soil moisture, particularly in areas with dense vegetation or cloud cover. SAR data provides complementary insights that improve the accuracy of vegetation–soil interaction studies. Remote sensing indices, such as the Normalized Difference Vegetation Index (NDVI), Red-Edge NDVI, and the Normalized Difference Moisture Index (NDMI), help track relationships between vegetation growth and soil moisture and nutrients, thereby facilitating more accurate assessments of land degradation processes [
25,
26,
27]. The integrating of Geographic Information Systems (GIS), spatial modeling, and multivariate statistical analysis further enables remote sensing data to support process-based monitoring, prediction, and restoration of land degradation [
28,
29,
30]. High-resolution remote sensing technologies, particularly Sentinel-2 satellite data, hold significant potential for monitoring land degradation, analyzing vegetation changes, and guiding ecological restoration [
31,
32]. Additionally, multitemporal analysis of remote sensing data captures seasonal variations, crucial for improving the accuracy of land cover classification, especially for Red-Bed Bare Land (RBBL) [
33].
In southern China, red-bed desertification is a severe form of landscape degradation, particularly in areas with fragile soils and slow ecological recovery [
33,
34,
35]. Red-bed landscapes consist of highly weathered and poorly cemented red sandstones and siltstones, which are prone to wind and water erosion due to their low cohesion [
36,
37]. Combined pressures from climate change and human activities further exacerbate soil vulnerability in these regions, leading to intensified soil erosion and gradual vegetation cover reduction [
38]. Vegetation loss accelerates soil degradation, which in turn exacerbates vegetation decline, creating a cycle of continuous ecosystem deterioration [
39,
40]. Climate change and human activities amplify soil vulnerability, increasing erosion and reducing vegetation cover, which accelerates the degradation process.
The Nanxiong Basin, located in the northern part of Guangdong Province, China, provides an ideal setting for examining vegetation–soil interactions in red-bed landscapes. This basin exhibits pronounced elevation gradients (80–600 m) and heterogeneous topography, which support diverse vegetation types ranging from mature coniferous–broadleaf forests to degraded xerophytic grasslands [
17,
41,
42]. These changes have not only altered the ecological structure of the region but also markedly reduced the soil’s regenerative capacity, creating a self-reinforcing cycle of degradation [
35,
40] While remote sensing imagery and field surveys have revealed distinct spatial patterns of ecological decline, the mechanisms underlying the interplay of bedrock fragility, soil nutrient depletion, topographic variability, and human disturbance remain insufficiently understood.
Prior studies have not investigated how soil fertility gradients relate to vegetation patterns in red-bed regions, which is the primary knowledge gap this study aims to fill. Existing research primarily focuses on spatial distribution patterns derived from remote sensing, while the dynamic process of vegetation–soil feedbacks has not been fully explored. Specifically, the coupled effects of rock fragility, soil nutrient depletion, topographic variation, and anthropogenic disturbance in red-bed landscapes have not been comprehensively examined [
43]. Furthermore, studies explicitly addressing the mechanisms linking different vegetation types to soil physico-chemical properties are relatively scarce [
44,
45]. To address these gaps, this study is the first to integrate remote sensing classification with soil fertility gradients in red-bed regions and proposes a process-driven degradation mechanism model, offering new insights into the dynamics of ecological degradation in the Nanxiong Basin.
The specific objectives of this study are as follows: (1) to classify vegetation types in the Nanxiong Basin using NDVI data, with accuracy assessed through confusion matrix analysis; (2) to quantitatively examine key soil properties, including pH, soil organic matter (SOM), available nitrogen (AN), and available phosphorus (AP), across different vegetation zones in order to clarify the relationship between vegetation types and soil fertility; and (3) to investigate feedback mechanisms between vegetation and soil conditions using principal component analysis (PCA), thereby elucidating soil changes and ecological degradation dynamics during vegetation decline. This research integrated remote sensing technology with field-based soil data to construct a process-oriented vegetation–soil interaction model. This model not only advanced our understanding of vegetation degradation processes in the red-bed regions of the Nanxiong Basin but also provided a theoretical framework and practical methodology for land management and ecological restoration in similar landscapes. By focusing on a representative red-bed basin, this study contributed novel insights into vegetation–soil feedback mechanisms, ecological degradation pathways, and restoration strategies, thus offering a scientific basis for regional restoration practices and environmental policy development.
2. Materials and Methods
2.1. Study Area
The Nanxiong Basin (25.0–25.2°N, 114.0–114.5°E) in northern Guangdong, South China, is a west–east–elongated catchment drained by the Zhen River, with elevations ranging from approximately 71 to 502 m. The terrain transitions from low hills and eroded terraces to steeper midslopes, producing ridge–valley relief with narrow alluvial floors (
Figure 1). The basin is underlain by highly weathered and weakly cemented Cretaceous red beds, primarily composed of sandstone and siltstone. Thin, acidic, and poorly aggregated soils have developed on these substrates, which, combined with steep slopes and intense convective monsoon storms (~1800 mm yr
−1 concentrated between May and September; mean annual temperature 18–22 °C), promote overland flow, sheetwash, and progressive bedrock exposure. Historically, the region supported evergreen broadleaf and mixed conifer–broadleaf forests. However, since the mid-twentieth century, deforestation, overgrazing, slope farming, quarrying, and road construction have caused extensive vegetation loss, particularly at low to mid elevations. This has resulted in mosaics of xerophytic grassland and red-bed bare land around Nanxiong city and along the main valley. More recently, targeted afforestation, slope stabilization, and erosion-control projects in selected subcatchments have created sharp contrasts between recovering landscapes and degradation-locked patches. These spatial heterogeneities provide a robust template for stratified sampling, remote-sensing classification validation, and the analysis of vegetation–soil thresholds in this representative red-bed landscape.
2.2. Soil Sampling and Laboratory Analysis
In October 2020, thirty soil plots were established using a stratified sampling design across five vegetation types (six plots per type), with sufficient spacing between plots to minimize spatial autocorrelation. At each site, a 20 m × 20 m quadrat was delineated, and five topsoil subsamples (0–20 cm) were collected along an S-shaped transect after removing surface litter and coarse fragments. The five cores were homogenized to obtain a composite sample per plot (approximately 1 kg). Samples were sealed in polyethylene bags, transported to the laboratory within 24 h, air-dried at room temperature, gently disaggregated, and sieved through a 2 mm mesh. A subsample for organic matter determination was further ground to 0.15 mm. All analyses were performed in triplicate, and the results were averaged.
Soil physicochemical properties were measured following standardized protocols. Soil pH was determined in a 1:2.5 soil-to-water suspension using a glass electrode after equilibration [
46]. Soil salinity (SS) and electrical conductivity (EC) were measured in water extracts (soil–water = 1:5, mass basis) with a calibrated conductivity meter [
47,
48]. Soil organic matter (SOM) was quantified by potassium dichromate oxidation with external heating [
35]. Total potassium (TK) was obtained after acid digestion and quantified by flame photometry; available potassium (AK) was extracted with 1 mol L
−1 ammonium acetate and measured by flame photometry. Available nitrogen (AN) was analyzed by alkaline hydrolysis diffusion, and available phosphorus (AP) by the molybdenum–antimony colorimetric method following sodium bicarbonate extraction (Olsen-type) [
37]
2.3. Remote Sensing Data Acquisition and Preprocessing
To accurately capture the spatial heterogeneity of vegetation recovery in the Nanxiong Basin, this study integrated multi-source remote sensing datasets with terrain information. The primary data source was the Sentinel-2 MSI Level-2A surface reflectance product, obtained from the Copernicus Open Access Hub. Data were acquired in October 2020, corresponding to the peak vegetation growth season, thereby ensuring an optimal representation of vegetation conditions.
To preserve spectral integrity and minimize atmospheric interference, a stringent cloud cover threshold of <5% was applied during image selection. This rigorous criterion ensured the reliability of vegetation analysis while maintaining high data quality and consistency. The Sentinel-2 MSI sensor provides multiple spectral bands, including Blue, Green, Red, Red Edge, Near-Infrared, and Shortwave Infrared, which are particularly effective for vegetation monitoring. In this study, the following bands were primarily utilized: B2 (Blue), B3 (Green), B4 (Red), B5–B8A (Red Edge and Near-Infrared), and B11–B12 (Shortwave Infrared). These bands are essential for characterizing vegetation attributes such as canopy health, structural properties, and soil moisture conditions.
Based on these bands, several vegetation indices were derived, including NDVI, Red-Edge NDVI, and NDMI, to evaluate vegetation growth and associated environmental changes. Red-Edge NDVI was included to improve the sensitivity of our analysis to vegetation stress, which is often more pronounced in the red-edge spectrum. This index was particularly useful for areas with varying vegetation densities and moisture levels. NDMI was employed to assess vegetation moisture content, which is vital for understanding vegetation health in semi-arid and degraded areas like the Nanxiong Basin. NDMI provided a complementary metric to NDVI by focusing on moisture levels, thereby improving the precision of vegetation classifications, especially in drier areas. These additional indices, particularly Red-Edge NDVI and NDMI, were used alongside NDVI to improve classification precision and better assess vegetation health and moisture conditions. Furthermore, we integrated field validation with field missions to complement the remote sensing data, ensuring a higher degree of accuracy in vegetation classification. While Google Earth high-resolution imagery was used as a secondary reference, field missions were conducted to gather ground truth data, providing more precise validation of the vegetation conditions observed from satellite images. Additionally, the use of multitemporal analysis was considered to enhance classification accuracy, particularly for distinguishing Red-Bed Bare Land (RBBL). By incorporating imagery from different time points, seasonal variations in vegetation growth and soil moisture conditions can be more effectively captured, which in turn improves classification performance. This approach is expected to significantly improve the overall classification accuracy, particularly in areas prone to degradation, such as RBBL.
NDVI was calculated for each pixel using the following formula:
where NIR represents the Near-Infrared band (B8A) and Red represents the Red band (B4). NDVI values range from −1 to +1, with higher values indicating healthier and denser vegetation, while lower values signify sparse or bare land.
Preprocessing of the raw satellite data included atmospheric correction and reflectance normalization to mitigate atmospheric artifacts and ensure inter-image consistency. Additionally, all images underwent precise geometric correction to enable comparability across multi-temporal datasets. These preprocessing steps were conducted using the Sentinel Application Platform (SNAP) and Google Earth Engine (GEE), both of which are well-suited for large-scale satellite data processing.
2.4. NDVI-Based Vegetation Classification Method
Vegetation in the Nanxiong Basin was mapped using an NDVI-based workflow designed to distinguish physiognomic types that reflect both cover and vitality. NDVI was selected due to its sensitivity to photosynthetic activity and canopy density, making it ideal for separating bare surfaces from herbaceous and woody vegetation under subtropical conditions. Preprocessed Sentinel-2 imagery was used to calculate pixel-wise NDVI, as detailed in
Section 2.3.
The NDVI thresholds were determined using statistical analysis methods and supported by Drori et al. (2020) [
49] to ensure that each vegetation type was accurately represented. These thresholds were further fine-tuned using representative field samples to reflect the unique ecological characteristics of the Nanxiong Basin. The final NDVI threshold ranges used in this study, which have been widely validated in similar ecosystems, were as follows: Red-Bed Bare Land (RBBL, NDVI < 0.15), Subtropical Xerophytic Grassland (SXG, 0.15–0.29), Subtropical Montane Deciduous Thorn Shrubland (SMDT, 0.29–0.39), Subtropical Deciduous–Evergreen Broadleaf Forest (SDEBF, 0.39–0.47), and Subtropical Coniferous–Broadleaf Mixed Forest (SCBMF, 0.47–0.66). These thresholds allowed for clear differentiation between vegetation types, especially in regions where threshold partitions overlapped.
A supervised classification approach was employed, using either Maximum Likelihood or Support Vector Machine (SVM) classifiers, with a radial basis function (RBF) kernel. The classifier was trained using 50 representative training samples per class, drawn from field plots and high-resolution optical imagery, to improve classification accuracy. A post-classification filtering step was then applied to remove noise and refine class boundaries, eliminating misclassified pixels and ensuring adherence to minimum mapping units. This process further improved the classification accuracy and reduced salt-and-pepper noise in the final results. To ensure high-quality analysis, a <5% cloud cover filter was applied to the Sentinel-2 imagery. This filtering process resulted in the removal of 25 Sentinel-2 scenes, leaving 75 images available for final analysis. By using only cloud-free imagery, we improved the overall accuracy and reliability of the classification results. The number of removed and remaining images has been included in the revised manuscript to provide more clarity on the cloud cover filtering process.
To rigorously evaluate classification performance, an error matrix (confusion matrix) was constructed based on stratified random sampling. For each vegetation class, 100 independent reference points were sampled uniformly within mapped polygons, yielding a total of 500 validation samples. This sampling design controlled for class imbalance and ensured sufficient precision per class. Each sample point’s classification label was verified through visual interpretation of the most recent, cloud-free high-resolution imagery available in Google Earth, with dates closest to the Sentinel-2 acquisition. During the verification process, interpreters cross-checked canopy texture, tone, and context against field notes where available. Built structures and impervious surfaces were classified as non-vegetated and assigned to the RBBL class to ensure that the classification accurately distinguished vegetated and bare or sealed surfaces. Classification accuracy was evaluated using overall accuracy, producer’s accuracy, user’s accuracy, and Kappa coefficient (
Figure 2).
2.5. Statistical Analysis
PCA was applied to soil properties (pH, SOM, AN, AP, SS, EC, and K) to identify the major patterns of soil fertility variation and to examine the relationships between soil variables and vegetation types. PCA was conducted using a standardized correlation matrix, and the principal components that accounted for the majority of variance were extracted to reveal the dominant factors driving soil changes. To confirm whether observed differences among vegetation types were statistically significant, analysis of variance (ANOVA) was conducted for each soil property. When the assumptions of normality or homogeneity of variances were not met, a non-parametric alternative, such as the Kruskal–Wallis test, was applied. All statistical analyses were performed with SPSS software (version 26), and statistical significance was determined at p < 0.05.
2.6. Workflow Overview
In this study, we constructed a systematic analysis workflow that integrates soil sampling, remote sensing data processing and analysis, vegetation classification, and Principal Component Analysis (PCA) (
Figure 3). First, soil samples were collected and laboratory analyses were conducted to obtain various soil property data. Then, remote sensing data acquisition and preprocessing were performed, and vegetation indices (such as NDVI) were calculated for vegetation classification. Subsequently, PCA was applied to reveal the main factors driving soil changes, and statistical analysis was conducted on soil properties across different vegetation types. Each step in the workflow is interrelated, aiming to provide a comprehensive framework for analyzing soil and vegetation dynamics.
3. Results
3.1. NDVI-Based Vegetation Classification and Spatial Pattern Analysis
To understand the spatial distribution and ecological patterns of vegetation in the Nanxiong Basin, this study used NDVI-based methods for vegetation classification and spatial pattern analysis, providing insights into vegetation degradation and the distribution of various vegetation types. NDVI, widely recognized for its ability to reflect vegetation health and density, was the core metric used to delineate vegetation categories. Five vegetation types were identified: RBBL, SXG, SMDT, SDEBF, and SCBMF, as summarized in
Table 1.
The spatial distribution of these vegetation types is illustrated in
Figure 4, which highlights distinct ecological gradients, ranging from intact forest ecosystems (SDEBF and SCBMF) to highly degraded bare lands (RBBL). The results indicated that the central and southern parts of the basin are predominantly covered by broadleaf forests, whereas the northern and northeastern regions are largely occupied by bare lands. Specifically, RBBL accounts for approximately 3.59% of the total area, concentrated in the northern and northeastern sectors, and reflects severe land degradation. SXG, covering 10.49% of the area, is mainly distributed across the western and central zones and exhibits the characteristics of xerophytic grassland. SMDTS, occupying 20.68% of the basin, is primarily found in the central and southern regions, functioning as a transitional zone between forests and grasslands. SDEBF is concentrated in the southern basin and covers 33.38% of the area, representing the most extensive vegetation cover. SCBMF is mainly distributed in the southeastern part of the basin, accounting for 31.86%, and reflects high vegetation integrity and ecological function.
Table 1 further presents the NDVI statistics for each vegetation type, revealing distinct differences in NDVI ranges and mean values. The RBBL category, characterized by minimal or no vegetation cover, recorded the lowest NDVI values (<0.15), indicative of severe degradation. In contrast, SCBMF exhibited the highest NDVI range (0.47–0.66) with a mean NDVI of 0.51, signifying dense and healthy vegetation. Vegetation types such as SXG and SDEBF displayed intermediate NDVI values (0.15–0.47), with SDEBF representing a substantial share of the basin (34.40%).
The classification accuracy of the NDVI-based model was calculated using a confusion matrix (
Table 2), in which the predicted vegetation categories with reference data were compared. The overall accuracy (OA) reached 77.8%, and the Kappa coefficient was 0.723, indicating substantial agreement between predicted and observed vegetation types. Producer’s accuracy ranged from 64.0% for RBBL to 97.1% for SCBMF, reflecting higher accuracy for more prevalent vegetation classes. User’s accuracy was particularly high for SXG and SMDT (89.0%), suggesting that these types were consistently identified by the classification model.
These findings underscore the pronounced ecological gradients within the Nanxiong Basin, from severely degraded bare lands to highly productive forests. Areas with robust vegetation cover, particularly in the southern basin, are likely sustained by favorable ecological conditions and effective conservation measures. Conversely, northern regions dominated by RBBL remain vulnerable to soil degradation and require targeted ecological restoration efforts.
The ability of remote sensing to accurately capture these vegetation patterns provides insights into the basin’s ecological dynamics, supporting evidence-based management and restoration strategies in red-bed landscapes.
3.2. Variability in Soil Fertility Across NDVI-Based Vegetation Types
Along the NDVI gradient, soil properties displayed clear patterns of variation, with soil pH showing the most pronounced contrasts. SCBMF soils were near-neutral to slightly alkaline, whereas RBBL was strongly acidic. Statistical tests indicated that SCBMF had significantly higher pH values than all other vegetation types, while SDEBF also showed higher pH compared to SMDTS, SXG, and RBBL (
p < 0.05 to
p < 0.01), consistent with the significance brackets in
Figure 5a. The 95% confidence intervals for pH values in SCBMF were 6.99 ± 0.26, while in RBBL, they were 4.32 ± 0.12, indicating substantial variation across vegetation types.
For salinity-related variables, a different trend emerged. Although SCBMF tended to exhibit higher SS and EC, most between-class contrasts were not significant due to high within-class variability. The only significant difference was between SCBMF and SDEBF for SS, while EC differences remained generally non-significant, as indicated by the wide error bars in
Figure 5b,c. SOM provided the clearest separation among vegetation types. SOM values declined progressively from forests to shrub–grassland types, and finally to bare land. SCBMF recorded the highest SOM (22.29 ± 1.90), SDEBF was intermediate (13.46 ± 0.93), and SMDTS, SXG, and RBBL clustered at the lowest levels. All pairwise comparisons involving SCBMF were significant, and SDEBF also exceeded the three open-vegetation types (
p < 0.01;
Figure 5d). This pattern supports the notion that greater vegetation cover in forests enhances carbon sequestration and nutrient accumulation in soils.
Available macronutrients showed patterns similar to SOM. Both AN and AP were significantly higher in SCBMF (AN: 66.71 ± 4.50, AP: 19.58 ± 3.31), with SDEBF forming a secondary tier (AN: 44.83 ± 3.58, AP: 4.93 ± 2.44). In contrast, shrubland, xerophytic grassland, and bare land consistently exhibited low AN and AP, and AP was nearly absent in some RBBL samples. Statistical tests showed strong differences between SCBMF and other vegetation types, as well as between SDEBF and the three open-vegetation types (
p < 0.01;
Figure 5e,f). Potassium (K) showed greater variability. Total potassium (TK) differed significantly only between SCBMF and SDEBF (
p < 0.05). The 95% confidence intervals for TK in SCBMF were 41.74 ± 3.38, and for SDEBF, they were 29.76 ± 1.41. For available potassium (AK), wide within-class dispersion reduced between-class separation, with the only significant contrast being higher values in SCBMF compared to RBBL (AK: 163.86 ± 33.09, RBBL: 47.49 ± 8.77,
p < 0.01). These patterns are consistent with
Figure 5g,h, where RBBL displayed the greatest dispersion and SXG and SMDTS exhibited moderate variability.
In summary, NDVI-based vegetation classes correspond to a consistent soil fertility sequence for SOM, AN, and AP: SCBMF ranked highest, followed by SDEBF, whereas SMDTS, SXG, and RBBL represented progressively lower fertility levels. Along this sequence, soil pH shifted towards increasing acidity from forests to bare red-bed surfaces. By contrast, SS, EC, TK, and AK showed substantial within-class variation, limiting their discriminating power among vegetation types. These findings reinforce the reliability of the NDVI “greenness” gradient as a proxy for soil fertility, with fertility declining systematically from closed forest ecosystems to severely degraded surfaces.
3.3. Analysis of Soil Property Differences and Their Relationship with Vegetation Types
PCA was conducted on eight key soil properties (pH, SS, EC, SOM, AN, AP, TK, AK) to assess fertility gradients and their relationship with vegetation types across the Nanxiong Basin. PCA was selected because it effectively integrates correlated soil attributes, reduces dimensionality, and identifies the dominant factors underlying soil–vegetation feedbacks. By condensing multidimensional datasets into a few principal components, PCA retains the majority of variance while simplifying interpretation, which is particularly advantageous for complex soil–vegetation interactions.
The analysis extracted two dominant principal components: PC1, explaining 56.7% of the variance, represents the soil fertility gradient, whereas PC2, explaining 20.4%, reflects ionic balance. Together they account for 77.1% of the total variance (
Figure 5). Loading analysis indicated that SOM, AN, AP, and pH were strongly and positively associated with PC1, defining a gradient from nutrient-rich, weakly acidic to neutral soils towards nutrient-poor, strongly acidic substrates. Although SS and EC also loaded positively on PC1, their contributions were weaker and showed partial alignment with the negative PC2. By contrast, AK was the main positive contributor to PC2, while TK and EC loaded negatively.
The ordination biplot revealed distinct vegetation–soil associations. SCBMF and SDEBF clustered in the positive PC1 space, reflecting high SOM, AN, and AP with reduced acidity. SMDTS and SXG shifted towards the negative PC1 region, partially overlapping due to shared low nutrient levels. RBBL was positioned at the extreme negative ends of both PC1 and PC2, representing the most degraded, acidic, and nutrient-poor soils.
The 95% confidence ellipses in
Figure 6 further emphasize the separation between forested and open vegetation types. SCBMF formed a compact ellipse entirely within the fertile quadrant of PC1, indicating homogeneous soil enrichment. SDEBF showed a wider spread towards the origin, suggesting greater heterogeneity. In contrast, shrubland and grassland ellipses overlapped extensively, reflecting minimal edaphic distinction. RBBL remained isolated at the infertile end of the ordination space.
Biplot correlations were consistent with univariate tests (
Section 3.2). SOM, AN, and AP clustered tightly with pH, confirming their co-enrichment under forest cover and depletion in degraded surfaces. SS and EC, despite their positive loadings on PC1, contributed little to class discrimination because of high within-group variance. AK, which loaded positively on PC2, successfully differentiated forest sites with higher exchangeable K from open vegetation and bare soils, whereas TK exhibited a weaker, opposite trend.
Overall, PCA showed the strong correspondence between NDVI-based vegetation classes and soil fertility. Forested types (SCBMF, SDEBF), characterized by higher NDVI values, corresponded to high SOM, AN, AP, and near-neutral pH, while degraded classes (SMDTS, SXG, RBBL) aligned with nutrient depletion and acidity. Salinity and potassium variables exerted secondary but more variable effects. These findings demonstrate that NDVI-derived “greenness” reliably reflects soil fertility variation at the landscape scale, providing a robust quantitative basis for monitoring ecosystem degradation and restoration.
3.4. Process-Driven Mechanism Analysis of Vegetation–Soil Differentiation in the Nanxiong Basin
The ecological degradation mechanisms in the Nanxiong Basin are driven by a complex interplay of environmental and anthropogenic factors, resulting in a continuous gradient of vegetation and soil variation across the landscape. This study explores how these factors contribute to the transition from intact forest ecosystems to degraded red-bed surfaces. By synthesizing findings from
Section 3.1,
Section 3.2 and
Section 3.3, we reveal how the interplay between rock properties, natural influences, and human disturbance drives vegetation–soil differentiation, offering critical insights into the underlying processes of ecological degradation in the region. The NDVI map displays a concentric landscape structure, with bare-rock cores encircled by xerophytic grassland and thorn shrubland, and an outer matrix of broadleaf and conifer–broadleaf forests. Along the transition sequence from RBBL → SXG → SMDTS → SDEBF/SCBMF, soil pH decreases and both SOM and plant-available nitrogen (AN) and phosphorus (AP) decline sharply. Salinity, EC, and potassium (K) fluctuate considerably within vegetation types but contribute only weakly to their differentiation. The PCA results further confirm that the first principal component represents a composite fertility axis, where forest samples score positively and shrub–grassland and bare-land samples score negatively, demonstrating the strong correspondence between NDVI “greenness” and soil fertility. The process diagram (
Figure 7) situates these patterns within a broader framework of ecological degradation mechanisms. The weakly cemented red-bed sandstones and mudstones of the Nanxiong Basin weather rapidly, exhibit low structural strength, and are highly sensitive to water. During the dry season, physical disintegration generates loose debris on slopes. With the onset of the monsoon, sheetwash and rilling mobilize and transport this material, preferentially removing fine, organic-rich fractions. This process progressively thins the soil mantle and ultimately exposes the underlying bedrock. The spatial clustering of RBBL along dissected slopes and gullies, flanked by SXG and SMDTS on the NDVI map, represents the geomorphic expression of this “weathering–transport–exposure” chain.
This hydroclimatic sequence is driven by seasonal rainfall. In the dry season, reduced vegetation cover and litter input lower soil cohesion and increase erodibility, whereas high-intensity rainfall in the wet season amplifies overland flow and incision. Together, these processes generate a cyclical pattern of material storage during dry months and material export during wet months. The stepwise declines in SOM, AN, and AP along the forest-to-shrub–grassland transition (
Section 3.2) suggest selective depletion of nutrient-rich fines and nutrient leaching, accompanied by a reduction in base saturation and progressive soil acidification. In the PCA biplot, the positive loadings of SOM, AN, AP, and pH along the same axis underscore their coordinated changes. In contrast, SS, EC, and K contribute minimally to vegetation type separation, suggesting that they function more as modulators of soil conditions than as primary drivers of vegetation–soil differentiation.
Human disturbance accelerates this degradation process. Forest clearing, repeated cultivation, and fuelwood extraction reduce canopy cover and root reinforcement, thereby increasing runoff and sediment supply. These changes facilitate the transition from SDEBF to SMDTS and SXG, while also increasing the susceptibility of gully heads and convex upper slopes to erosion. The concentric NDVI sequence and the clear PCA separation between forested and open vegetation types together reflect a disturbance-driven, rainfall-amplified degradation trajectory.
Once the threshold between forest and shrubland is crossed (NDVI ≈ 0.39), negative feedbacks dominate. The loss of organic matter reduces aggregate stability and water-holding capacity, while infiltration decreases and surface runoff intensifies. These changes constrain vegetation recovery and accelerate soil acidification, leaving sites trapped in low-NDVI states. The recurrent occurrence of the RBBL → SXG → SMDTS → SDEBF/SCBMF sequence around bedrock exposures, and the clustering of degradation hotspots along gullies and strongly dissected slopes, are consistent with these negative feedback dynamics.
We therefore propose an integrated process framework that links soil properties, seasonal hydroclimatic dynamics, and human disturbance as co-drivers of vegetation–soil differentiation in the Nanxiong Basin (
Figure 6). Weak lithology facilitates rapid weathering and low structural strength, while seasonal rainfall governs the alternating storage and export of materials on slopes. Human activities exacerbate erosion by reducing canopy cover and root reinforcement. Together, these processes lead to soil thinning, nutrient depletion, and acidification, creating a spatio-temporal sequence from forest to shrubland, grassland, and ultimately bare rock. This framework connects the NDVI-based vegetation map with measured soil data, explains the observed concentric landscape structure, and provides a solid foundation for identifying sensitive areas and monitoring degradation stages across the basin.
4. Discussion
4.1. Reliability and Improvement of NDVI-Based Classification
The NDVI stratification applied in this study successfully delineated five vegetation–surface categories along the degradation gradient of the Nanxiong Basin. The resulting NDVI intervals and spatial gradients are consistent with established spectral behaviors, where values near zero reflect barren surfaces, intermediate values characterize shrub–grass mosaics, and higher values indicate dense forests. The overall accuracy of 77.8% and the Kappa coefficient of 0.723 place our scheme within the performance envelope of optical-index-led classifications. Our findings confirm the strong correlation between NDVI and soil fertility gradients, a relationship that has been documented in digital soil mapping studies [
50]. However, the confusion matrix revealed systematic errors concentrated at the transitional boundaries between shrubland and mixed forests, consistent with the spectral ambiguity reported in other subtropical mountainous environments [
51]. The producer’s accuracy of 64.0% for RBBL suggests particular vulnerability to omission errors, while SCBMF achieved the highest accuracy (97.1%). This indicates that forest canopies with dense, mixed structures are more easily separated than heterogeneous bare or grass–shrub mosaics.
Previous studies have demonstrated that incorporating multi-temporal NDVI and SAR backscatter, or applying ensemble machine learning classifiers, can markedly reduce these confusion errors, especially in areas with complex vegetation cover [
52]. These refinements should be prioritized in future monitoring of humid red-bed terrains. In particular, integrating Sentinel-1 SAR data would improve classification accuracy, especially in areas with dense vegetation or persistent cloud cover, where optical remote sensing data may face limitations. Moreover, combining Red-Edge NDVI and NDMI could provide complementary insights into vegetation health and moisture levels, further enhancing vegetation discrimination and improving land cover classification accuracy, as suggested in [
53]. Additionally, as demonstrated by Bai et al. [
54], multi-temporal analysis has proven highly effective in addressing the challenges of classifying transitional zones between different vegetation types. By incorporating seasonal variations in vegetation growth, multi-temporal analysis helps provide a more dynamic and accurate classification model. Similar studies, such as Wakulińska [
55] has highlighted how temporal data can mitigate misclassification of transitional zones and improve overall classification precision. These studies emphasize the importance of temporal data in improving the classification of complex vegetation mosaics and transitional areas, which remain challenging in optical-based NDVI classification.
In conclusion, while the NDVI-based classification performed well, the integration of additional remote sensing indices, multi-temporal data, and SAR backscatter will significantly enhance the classification’s reliability, especially in challenging environments like the Nanxiong Basin. Future work should prioritize these refinements to ensure more accurate and robust vegetation mapping in the region.
4.2. Soil Fertility Gradients and Acidification Signals
Soil analyses revealed that SOM, AN, and AP co-varied positively with pH, forming a fertility axis aligned with NDVI gradients. Forest types (SCBMF, SDEBF) clustered at the high-fertility, weakly acidic end, while shrub–grasslands and bare red-bed land clustered at the low-fertility, strongly acidic end. This is consistent with ecological succession studies in subtropical South China, where forest restoration improves SOM accumulation, nutrient cycling, and soil aggregation [
56]. The extremely acidic and nutrient-depleted soils observed in RBBL echo soil quality assessments in the Nanxiong red-bed desertification areas [
33] These patterns mirror broader trends in subtropical China, where whole-profile acidification and base cation depletion are widely reported [
57]. Importantly, our results support NDVI as a first-order proxy for fertility gradients. This is consistent with digital soil mapping studies that have documented robust NDVI–SOC correlations across diverse landscapes [
58]. These linkages strengthen the case for NDVI-based monitoring of fertility decline in humid badlands.
The loss of SOM not only leads to a decline in soil fertility but also has profound implications for carbon sequestration. As SOM decreases, the soil’s capacity to store carbon significantly diminishes, potentially leading to increased carbon release and exacerbating climate change. In the Nanxiong Basin, the significantly lower SOM levels observed in RBBL and xerophytic grassland (SXG, SMDT) suggest that these areas have weaker carbon sequestration potential. In contrast, the higher SOM content in forest types (such as SCBMF and SDEBF) indicates these regions have stronger carbon sequestration abilities. Further research should explore how soil and vegetation restoration measures can enhance carbon sequestration in these degraded areas, thereby mitigating environmental change.
Therefore, the changes in soil fertility and acidification not only reveal the ecological impacts within the degradation process of the Nanxiong Basin but also highlight the critical role of SOM in regulating regional carbon cycles and carbon storage.
4.3. Process Attribution: Lithological Sensitivity, Hydroclimatic Forcing, and Human Disturbance
The observed concentric NDVI pattern, with bare rock cores surrounded by xerophytic grasslands and thorn-shrub mosaics, reflects a typical weathering–transport–exposure chain in humid red-bed terrains. Particularly during dry periods, weakly cemented sandstones and mudstones undergo rapid physical disintegration, with fine particles being selectively mobilized and transported by intense monsoonal rains. This process leads to the preferential loss of organic-rich materials, explaining the observed thinning and acidification of the soil mantle in RBBL regions. This process aligns with findings from erosion monitoring in the Nanxiong Basin, which highlights the dominant role of storm events in driving slope retreat [
59,
60].
At the landscape scale, this event-driven denudation process explains the clustering of RBBL patches along gullies and steep slopes, as well as the isolation of barren land at the infertile end of PCA ordination. Human activities, particularly deforestation, repeated cultivation, and fuelwood harvesting, further weaken canopy cover and root cohesion, exacerbating runoff and soil erosion. Relevant watershed-scale studies in Nanxiong demonstrate that integrated soil and water conservation strategies can significantly mitigate erosion processes, effectively countering these negative feedback loops [
34]
Overall, our findings support the “disturbance–rainfall–lithology” interaction framework as the primary driver of vegetation–soil differentiation in humid red-bed terrains. This framework emphasizes how climate change and human activities, through altering soil properties and vegetation cover, exacerbate the feedback mechanisms between soil degradation and vegetation decline.
4.4. Implications, Limitations, and Future Directions
Two key implications emerged from this study. First, NDVI gradients can serve as effective rapid screening tools for soil fertility, particularly for indicators such as SOM, AN, and AP. This facilitates targeted sampling and restoration efforts in vegetation-transition zones and gully heads. Second, given the event-driven erosion regime, integrating different remote sensing data and soil chemical indicators can help assess degradation thresholds, particularly across the shrub-forest boundary, where ecological states remain unstable.
However, several limitations should be acknowledged. First, this study relied on a single-season NDVI dataset, which may not fully represent long-term degradation patterns, as vegetation phenology and soil conditions vary seasonally. Using a single 2020 scene may lead to over- or under-estimation of degradation severity. Therefore, future studies should integrate multi-temporal optical data to better capture seasonal and inter-annual variations. Second, the relatively small sample size and the reliance on the first principal component axis (explaining 56.7% of the variance) for soil fertility analysis may limit the comprehensive understanding of soil fertility gradients. Future research could enhance soil fertility assessment accuracy by increasing the sample size, incorporating additional soil chemical parameters, or using higher-order principal components in the analysis. Another limitation is the spatial representativeness of the 30 soil plots used in this study. Although these plots provide valuable insights into soil fertility and degradation, they may not fully capture the spatial heterogeneity of the entire basin. Given the complexity of soil properties and ecological variations within the basin, a more extensive network of soil sampling plots or remote sensing data may be needed to ensure comprehensive spatial coverage, particularly in more remote or ecologically distinct areas. Moreover, similar vegetation–soil feedback mechanisms have been identified in temperate forest ecosystems [
61], suggesting that the findings of this study may have broader applicability, particularly for other humid red-bed regions or similar subtropical areas.
Looking ahead, future research should focus on integrating multi-temporal optical indices, SAR data, and expanded soil chemistry parameters to improve the accuracy of fertility gradient delineation. Additionally, the installation of slope-catenary monitoring systems to track rainfall, infiltration, and micro-erosion would allow for direct testing of the negative feedback loop involving organic matter loss, reduced infiltration, increased runoff, and soil acidification.
Overall, our study supports the conclusion that weak lithology, monsoonal forcing, and anthropogenic disturbance jointly shape the observed banded landscape patterns, through the integration of NDVI mapping, soil fertility contrasts, and principal component analysis (PCA) ordination. This integrated framework provides a foundation for understanding the underlying mechanisms and offers a practical basis for prioritizing restoration interventions in the context of increasing hydroclimatic variability.