3.1. Extraction of Salinization Information and Spatiotemporal Dynamics
3.1.1. Salinity Inversion and Accuracy Assessment
Following the equations presented in
Table 2, we computed four remote sensing-based salinity monitoring index models (SDI1, SDI2, SDI3, and SDI4). The resulting inversion values were subsequently categorized into five classes of soil salinization—non-saline, mildly saline, moderately saline, severely saline, and saline soils—using the Jenks natural breaks classification method (
Table 3). Based on this classification scheme, spatial distribution maps of soil salinization derived from each of the four models were then generated (
Figure 4).
The inversion results reveal notable spatial differences among the four models. Under the SDI1 model, saline soil and severely saline soil are predominantly concentrated in the northern part of the study area, while the southern region is mainly characterized by non-saline soil, mildly saline soil, and moderately saline soil. The SDI2 model exhibits a broadly similar spatial pattern to SDI1; however, differences are observed in the spatial extent and density of saline and severely saline soils. In contrast, the SDI3 model shows a markedly different pattern, with non-saline soil dominating the northern region and extensive distributions of moderately saline soil, severely saline soil, and saline soil occurring in the south. Under the SDI4 model, saline soil is mainly concentrated in the north-central area, while moderately and severely saline soils dominate the central and southern regions. These differences reflect the varying sensitivities and extraction characteristics of different spectral models and provide a spatial basis for identifying the most suitable model for soil salinization monitoring in this study area.
Given the differences among the models, field-measured soil data were used to evaluate their inversion accuracy. According to China’s soil salinization classification standards, and with reference to Soil and Agricultural Chemistry Analysis and Methods, soil salinization in coastal, semi-humid–semi-arid, and arid regions can be classified into five categories: non-saline soil (salt content < 1.0 g·kg−1), mildly saline soil (1.0–2.0 g·kg−1), moderately saline soil (2.0–4.0 g·kg−1), severely saline soil (4.0–6.0 g·kg−1), and saline soil (>6.0 g·kg−1). Based on these criteria, the salinization grades were assigned to the field sampling sites and compared with the corresponding model predictions to assess inversion accuracy.
The evaluation and comparison of the four monitoring models using 87 field validation samples yielded significant findings (
Table 4). Among the tested models, SDI1 demonstrated the highest inversion precision, achieving an overall accuracy (OA) of 86.21% and correctly classifying 75 samples. In contrast, the classification performance of the remaining models followed a decreasing trend: SDI2 (74.71%), SDI3 (62.07%), and SDI4 (54.02%). These results identify SDI1 as the optimal model for soil salinization monitoring in the study area.
A detailed error analysis was further conducted on the SDI1 model using a full confusion matrix (
Table 5) and linear regression (
Figure 5). The confusion matrix reveals high consistency between the inverted results and field observations, with User’s Accuracy (UA) and Producer’s Accuracy (PA) for all salinity grades exceeding 73%. Notably, the UA for non-saline soil reached 94.74%, and the UA for the “saline soil” category remained robust at 80.00%. Furthermore, the linear regression analysis confirms the model’s reliability, yielding a coefficient of determination (R
2) of 0.840 and a root mean square error (RMSE) of 0.103 (
p < 0.001). This indicates that SDI1 model can accurately and stably reflect the actual spatiotemporal dynamics of soil salinity.
3.1.2. Spatiotemporal Variation in Soil Salinization in Zhanhua District
To systematically analyze the spatiotemporal evolution of soil salinization in Zhanhua District over the past 30 years, the optimal SDI1 model was used to invert soil salinization patterns from 1993 to 2023 (
Figure 6). Standard deviational ellipse and mean center trajectory analyses were applied to investigate changes in the spatial orientation and distribution of salinization (
Figure 7), while the proportional area changes in each salinization grade were statistically quantified (
Figure 8).
The spatial analysis indicates that soil salinization in the study area exhibits pronounced spatial heterogeneity, characterized by a general pattern of “lighter in the south and heavier in the north.” The southern region is dominated by non-saline soil and mildly saline soil, whereas the northern coastal areas, influenced by seawater intrusion and highly mineralized shallow groundwater, show concentrated distributions of severely saline soil and saline soil. These patterns are closely associated with strong salt accumulation and salt-return processes.
Over the 30-year study period, severely saline soil and saline soil expanded during the early stages and subsequently declined in the later stages. In contrast, the distribution of non-saline soil and mildly saline soil gradually expanded from south to north. Overall, soil salinization in Zhanhua District followed a phased evolution pattern, transitioning from a condition of “severe in the north and mild in the south” to “overall expansion,” and ultimately to “improvement in the north and optimization in the south.” Notably, in the northern coastal zone, intensive aquaculture activities and surface water accumulation have aggravated local salt accumulation, resulting in persistently high salinization levels (
Figure 6).
The standard deviational ellipse analysis show that the spatial distribution of soil salinization exhibits a clear northeast–southwest elongation trend. However, the mean center of salinization remained relatively stable throughout the study period, consistently located near Fengjia Town in Zhanhua District, indicating limited overall spatial migration of salinization intensity (
Figure 7).
From a temporal perspective, the area proportions of different salinization grades remained relatively stable over the 30-year period. The proportion of non-saline soil remained at approximately 13%, with only minor fluctuations. Mildly saline soil accounted for roughly 16–30% of the total area and exhibited relatively stable changes. Moderately saline soil showed more pronounced interannual variability, ranging from 20% to 32%, with an overall trend of initial expansion followed by contraction. Severely saline soil experienced the most substantial changes between 2003 and 2013, while the proportion of saline soil remained largely unchanged (
Figure 8). Although the overall area proportions of the salinization grades did not vary dramatically, their spatial redistribution reflects the dynamic and complex nature of soil salinization processes in the study area.
3.2. Analysis of Driving Factors
In this study, the accuracy assessment of the XGBoost model was conducted by randomly sampling geospatial grid points to form training and testing datasets. Results from the training dataset indicate that the model achieved a high level of goodness fit, with an MSE of 0.0062 and an R
2 reaching 0.8063, indicating that approximately 80.63% of the variation in soil salinization can be explained by the model. Although model performance on the test dataset showed a slight decrease relative to the training dataset, the predictive accuracy remained satisfactory, indicating acceptable generalization capability (
Table 6,
Figure 9 and
Figure 10). Overall, these results demonstrate that the XGBoost model provides a reliable fit to the data and is suitable for analyzing the driving factors of soil salinization in the study area.
Based on the XGBoost-SHAP analysis, the relative contributions of environmental and anthropogenic factors to soil salinization in Zhanhua District exhibit clear hierarchical differences and spatial heterogeneity (
Figure 11,
Figure 12 and
Figure 13). According to the global SHAP feature importance results, evaporation is the dominant driving factor, with a SHAP value of 0.3357. This indicates that evaporation plays a primary role in controlling soil salinization in the region. The SHAP dependence plot for evaporation shows a strong positive contribution within the range of 0–1000, suggesting that increased evaporation intensifies soil moisture loss and promotes salt accumulation at the surface. However, at extremely high values (>1000), the SHAP contribution becomes negative, possibly reflecting altered salt migration processes and vegetation responses under extreme arid conditions.
Precipitation and population density represent secondary driving factors, with SHAP values of 0.1732 and 0.1518, respectively. Their SHAP dependence plots reveal pronounced nonlinear effects. Precipitation exhibits positive SHAP values within the range of 100–400 mm, indicating that moderate rainfall in this semi-humid region enhances soil salinization, with relatively low and concentrated rainfall during the flood season. Under these conditions, precipitation is insufficient to effectively leach salts from the soil profile but increases soil moisture, thereby intensifying evaporation and accelerating surface salt accumulation. In contrast, excessive precipitation (>500 mm) induces a net desalination effect. This threshold aligns with regional pedological studies in the Yellow River Delta, which indicate that when cumulative rainfall during the summer monsoon exceeds 500 mm, the downward leaching of salts via soil infiltration and surface runoff outweighs the evaporative concentration, leading to a measurable reduction in surface soil salinity [
34]. Population density shows a positive contribution within the range of 1000–4000, reflecting the intensification of soil salinization associated with high-intensity human activities, such as irrigated agriculture. In contrast, lower population density (<1000) exhibits a negative contribution, likely due to the salt interception and buffering effects of natural vegetation cover.
Soil moisture, land use type, and temperature constitute a medium-influence group, with SHAP values of 0.0984, 0.0473, and 0.0464, respectively. Soil moisture displays a positive contribution within the range of 0.2–0.35 range, indicating that moderate soil moisture facilitates salt dissolution and upward transport. However, SHAP values become negative under very low (<0.15) or very high (>0.35) moisture conditions, as insufficient moisture limits salt mobilization, while excessive moisture promotes salt leaching. Land use types such as cropland, forestland, and grassland exhibit negative SHAP values, suggesting that rational agricultural management and vegetation cover effectively suppress soil salinization through evapotranspiration regulation and biological salt amelioration. In contrast, water bodies, construction land, and unused land show positive SHAP values, as water convergence, impervious surface coverage and direct bare soil exposure promote salt accumulation. While temperature and precipitation exhibit a strong climatic coupling (r = 0.81), they represent competing physical mechanisms in the soil–water–salt balance. Temperature primarily drives upward salt capillary migration via evaporation, whereas precipitation promotes downward leaching. The SHAP analysis captures the relative trade-off between these forces. Temperature demonstrates a U-shaped response, at high values (>14 °C), it exacerbates salinization by accelerating surface evaporation, while at low values (<2 °C), its positive contribution is likely linked to salt accumulation during freeze–thaw cycles, a characteristic phenomenon of the Yellow River Delta’s seasonal climate.
The night-time light index, slope, and elevation exhibit relatively low contributions, with SHAP values of 0.0271, 0.0268, and 0.0224, respectively. The nighttime light index shows a positive SHAP value within the range of 10–70, indicating a positive relationship between human activity intensity and salinization. Slope contributes positively at low gradients (0–5°), reflecting the tendency of gentle terrain to facilitate water and salt accumulation. Elevation shows an overall negative contribution, suggesting that higher-altitude areas experience lower salinization risk due to improved drainage conditions. Overall, the results indicate that climatic and anthropogenic forces exert stronger controls on soil salinization patterns in Zhanhua District than topographic factors.
Correlation analysis further reveals interrelationships among the driving factors of soil salinization in Zhanhua District, characterized by climatic dominance and interactions between human and natural factors (
Figure 14). A significant positive correlation exists between evaporation and precipitation (r = 0.63), while temperature and precipitation exhibit an even stronger positive correlation (r = 0.81). These relationships reflect the inherent coupling of high temperature, strong evaporation, and concentrated precipitation under the warm-temperate monsoon climate of the study area. This climatic synergy fundamentally explains the combined driving effects of these factors on soil salinization, whereby elevated temperatures enhance evaporation and surface salt accumulation, while concentrated rainfall during the flood season induces complex water–salt transport processes.
In terms of human–nature interactions, population density and nighttime light intensity show a moderate positive correlation (r = 0.38), indicating spatial consistency in human activity intensity. Land use type is negatively correlated with evaporation (r = −0.50), suggesting that different land use patterns regulate soil salinization by modifying surface evaporation processes. This finding is consistent with the salt-suppressing effects of vegetation cover identified in the SHAP analysis.
Several factors exhibit relatively independent behavior. Soil moisture shows weak correlations with major climatic variables, with correlation coefficients of −0.20 with evaporation and 0.07 with temperature. Elevation displays a moderate positive correlation with slope (r = 0.46) but weak correlations with other variables, indicating that these factors influence soil salinization primarily through their intrinsic properties rather than through strong interactions with other drivers.
3.3. Prediction of Soil Salinization Under Future Climate Scenarios
Based on the XGBoost-SHAP analysis of driving factors, evaporation and precipitation were identified as the dominant climatic controls on soil salinization and were therefore selected as the core predictors for future simulations. Evaporation and precipitation data under four shared socioeconomic pathway scenarios—SSP1-2.6, SSP2-4.5, SSP3-7.0, and SSP5-8.5—were used to predict the spatial evolution of soil salinization in Zhanhua District for the mid-century (2050) end-of-century (2100) periods.
To ensure the reliability of the Random Forest (RF) model for future projections, a hindcast validation was performed (
Figure 15). Given the lack of district-wide historical field data, SDI1—previously calibrated against field-measured total salt content—was used as the reference baseline. The scatter plot shows that most data points are distributed close to the 1:1 reference line. With a sample size of 231,741, the model achieved a coefficient of determination (R
2) of 0.823 and a Root Mean Square Error (RMSE) of 0.153 (
Figure 15), indicating that 82.3% of the variance in soil salinization can be explained by the model and that prediction errors are relatively small. These results confirm the reliability of the Random-Forest-based prediction framework for regional-scale soil salinization simulations and provide a robust basis for future scenario analysis.
The prediction results indicate that by 2050, soil salinization patterns under different emission scenarios have already diverged substantially (
Table 7;
Figure 16). Under the SSP1-2.6 low-emission scenario, non-saline soil accounts for 19.84% of the total area, mainly distributed in the southwestern part of the study area. The proportions of severely saline soil and saline soil are 9.31% and 17.91%, respectively, and are largely confined to narrow zones along the northern coastline, representing the lowest overall salinization intensity among the four scenarios. In contrast, under the SSP5-8.5 high-emission scenario, the proportion of non-saline soil is slightly lower at 19.45%, while saline soil and severely saline soil account for 17.88% and 9.51%, respectively, showing modest expansion in the northern region. At the same time, moderately saline soil increases to 19.69% and extends toward the central part of the study area.
The SSP2-4.5 scenario exhibits the most pronounced salinization by 2050, with the proportion of severely saline soil reaching 22.48%, substantially higher than that under the SSP5-8.5 scenario. This suggests a stronger expansion of highly salinized areas in the northern coastal zone under the intermediate-emission pathway. In contrast, the SSP3-7.0 scenario is characterized by a relatively high proportion of moderately saline soil, accounting for 25.67% of the total area.
By 2100, soil salinization patterns under all scenarios exhibit further evolution. Under the SSP1-2.6 scenario, the proportion of non-saline soil increases to 22.95%, representing a significant expansion relative to 2050, while the proportion of saline soil decreases to 15.28%. This indicates a sustained long-term mitigation of salinization risk under low-emission conditions. Under the SSP5-8.5 scenario, the proportion of non-saline soil rises to 22.33%, but the proportion of severely saline soil increases to 20.77%, and saline soil decreases to 13.23%, resulting in an overall salinization intensity that remains higher than that under SSP1-2.6 scenario.
The SSP2-4.5 scenario shows salinization characteristics increasingly similar to those of SSP1-2.6 by 2100, with severely saline soil and saline soil accounting for 18.24% and 15.28%, respectively. In contrast, the SSP3-7.0 scenario continues to be dominated by moderately saline soil, which reaches the highest proportion among all scenarios at 28.68%.