Next Article in Journal
Citizen Science Tools Reveal Changes in Estuarine Water Quality Following Demolition of Buildings
Previous Article in Journal
Cloud Phase Recognition Based on Oxygen A Band and CO2 1.6 µm Band
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling the Spatial Dynamics of Soil Organic Carbon Using Remotely-Sensed Predictors in Fuzhou City, China

1
State Key Laboratory for Subtropical Mountain Ecology of the Ministry of Science and Technology and Fujian Province, Fujian Normal University, Fuzhou 350007, China
2
School of Geographical Sciences, Fujian Normal University, Fuzhou 350007, China
3
Department of Natural Resource Managemnt, Wolkite University, P.O. Box 07, Wolkite, Ethiopia
4
China-Europe Center for Environment and Landscape Management, Fuzhou 350007, China
5
College of Environmental Science & Engineering, Fujian Normal University, Fuzhou 350007, China
6
Department of Land Management and Landscape Architecture, Faculty of Environmental Engineering and Land Surveying, University of Agriculture in Krakow, 253c Balicka Street, 30-149 Krakow, Poland
7
Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Ave, Ottawa, ON K1A 0C6, Canada
*
Author to whom correspondence should be addressed.
Remote Sens. 2021, 13(9), 1682; https://doi.org/10.3390/rs13091682
Submission received: 22 March 2021 / Revised: 20 April 2021 / Accepted: 22 April 2021 / Published: 27 April 2021
(This article belongs to the Section Environmental Remote Sensing)

Abstract

:
Assessing the spatial dynamics of soil organic carbon (SOC) is essential for carbon monitoring. Since variability of SOC is mainly attributed to biophysical land surface variables, integrating a compressive set of such indices may support the pursuit of an optimum set of predictor variables. Therefore, this study was aimed at predicting the spatial distribution of SOC in relation to remotely sensed variables and other covariates. Hence, the land surface variables were combined from remote sensing, topographic, and soil spectral sources. Moreover, the most influential variables for prediction were selected using the random forest (RF) and classification and regression tree (CART). The results indicated that the RF model has good prediction performance with corresponding R2 and root-mean-square error (RMSE) values of 0.96 and 0.91 mg·g−1, respectively. The distribution of SOC content showed variability across landforms (CV = 78.67%), land use (CV = 93%), and lithology (CV = 64.67%). Forestland had the highest SOC (13.60 mg·g−1) followed by agriculture (10.43 mg·g−1), urban (9.74 mg·g−1), and water body (4.55 mg·g−1) land uses. Furthermore, soils developed in bauxite and laterite lithology had the highest SOC content (14.69 mg·g−1). The SOC content was remarkably lower in soils developed in sandstones; however, the values obtained in soils from the rest of the lithologies could not be significantly differentiated. The mean SOC concentration was 11.70 mg·g−1, where the majority of soils in the study area were classified as highly humus and extremely humus. The soils with the highest SOC content (extremely humus) were distributed in the mountainous regions of the study area. The biophysical land surface indices, brightness removed vegetation indices, topographic indices, and soil spectral bands were the most influential predictors of SOC in the study area. The spatial variability of SOC may be influenced by landform, land use, and lithology of the study area. Remotely sensed predictors including land moisture, land surface temperature, and built-up indices added valuable information for the prediction of SOC. Hence, the land surface indices may provide new insights into SOC modeling in complex landscapes of warm subtropical urban regions.

Graphical Abstract

1. Introduction

Soil organic carbon (SOC) is essential for the normal functioning of ecosystems [1]. It also plays a significant role in global C dynamics and climate change study as it stores the largest total carbon pool of terrestrial ecosystems [2]. In the urban context, the existence of an optimum SOC content is a critical factor for greening projects and is also a good indicator of the state of urban ecosystems and soil quality [3]. In general, SOC spatial distribution is influenced by climatic, geomorphological, lithological, and land-use factors. In particular, land-use change due to intensive human activities including urbanization and industrialization processes hugely impacts its spatial distribution [4].
In line with this, estimation of the spatial variability of SOC in urban areas has attracted the interests of many studies. For instance, Chen et al. [5] studied the SOC densities in urban built-up areas of 35 Chinese cities. The study reported the carbon storage of 35 cities in china including the Fuzhou city (i.e., the current study area). Similarly, Wang et al. [6] determined the spatial variation of SOC on a hilly coastal landscape of Wafangdian, Liaoning Province, and reported higher contents toward the mountainous areas. Additionally, they confirmed strong influence of land use on the spatial variation of SOC. Raciti et al. [7] also made a comprehensive assessment of SOC contents of residential areas and stated that soils in residential areas with past agricultural use had a higher capability to sequester carbon. Likewise, Xia et al. [8] studied the spatial variations of SOC in relation to land-use change in eastern regions of China and confirmed that land-use change from and into a paddy field had a high impact on SOC variability.
Even though many previous researches estimated SOC in urban areas globally and nationally, the current study area has limited data. Moreover, there is a need to find the optimal set of environmental predictors to remotely determine the spatial variability of topsoil organic carbon; thus, assessing the suitability of such predictors, as well as other environmental covariates, is an imperative task to SOC mapping in urban environments. To that end, McBratney et al. [9] also highly recommended the need for new studies so as to select suitable environmental covariates for digital soil mapping. Furthermore, even though the previous studies used multisource datasets, various biophysical land surface variables, which can provide information on the SOC distribution, were generally ignored. For instance, soil moisture, land surface temperature, and built-up index were omitted. Since SOC is highly influenced by long-term mean soil temperature and moisture than any other factors, integrating them may provide new insight for SOC mapping [10]. The previous SOC prediction studies overlooked soil moisture and temperature indices perhaps due to a shortage of high-resolution soil moisture and temperature data. In particular, the traditional ground-based soil-moisture observation networks produce sparse soil-moisture data for smaller regions. Meanwhile, some studies used atmospheric temperature and precipitation as a proxy to measure soil temperature and moisture [11,12]. However, they also had very coarse resolution for smaller geographic scales. In the meantime, the remote sensing products of soil moisture such as Advanced Scatterometer (ASCAT) and Soil Moisture and Ocean Salinity (SMOS) have coarse resolutions (i.e., tens of kilometers) [13]. However, optical/thermal infrared (TIR) sensor products have higher spatial resolutions (meters to kilometers) and could be a good solution for regional and local applications [14]. Variables derived from optical sensor products including vegetation temperature condition index (VTCI) and land surface temperature (LST) can be a better choice to obtain the soil moisture and temperature information.
Therefore, the main aim of this study was to identify the role of remotely sensed variables for SOC prediction and to understand the contribution of the landform, land use, and lithology on the spatial variation of SOC in the coastal city of Fuzhou, China. The soil samples were collected using purposively distributed sample points that represent the dominant land cover, lithology, and landform of the study area. To that end, biophysical land surface variables such as vegetation temperature condition index (VTCI), land surface temperature (LST), and normalized difference built-up index (NDBI) were integrated with other environmental covariates obtained from multiple sources including remote sensing, topography, and proximal sensing. Furthermore, the most important environmental variables were selected and used to estimate the spatial distribution of SOC using a random forest and CART model.

2. Materials and Methods

2.1. Description of the Study Area and Sampling Locations

Fuzhou district is located in the southeastern coastal area of China in the estuary of the Minjiang River. It is the capital city of the Fujian province (Figure 1) and serves as a central city for the Western Taiwan Straits Economic Zone with a total area of 11,462.41 km2. It is geographically located at 118°08′ E to 120°31′ E longitude and 25°15′ N to 26°29′ N latitude in the southeast of China. It neighbors Ningde and Nanping to the north, Quanzhou and Putian to the south, and Sanming to the west. The city has 13 administrative regions comprising six districts, one county-level city, and six counties. The current study site includes all regions except Pingtan County.
The climatic condition of the area belongs to the humid subtropical maritime climate, with an annual average temperature of 16–20 °C and an annual average precipitation of 900–1200 mm.
It is also covered by acidic volcanic rocks and Cretaceous sandstones from the Jurassic Period [15]. The soil of the upper part is dominated by red soil [16], whereas mountainous regions have mainly red and laterite soil.
Additionally, the area is characterized by complex topographic features. The northern, western, and southern parts of the study area are dominated by mountains, whereas the eastern part is mainly plain landform [16].

2.2. Soil Sampling and Laboratory Analysis

Topsoil samples (0–20 cm depth) were collected from all administrative counties of Fuzhou City except Pingtan Island in February and March. The soil samples were collected using purposively distributed sample points that represent the dominant land cover, lithology, and landform of the study area. The sampling sites were spread out across the whole study area (Figure 1). The land use and the topographic and lithology maps were superimposed using ArcGIS 10.3 to identify sample locations. A total of 244 sample sites were selected as predetermined sampling points. However, due to complex topography, land use, lack of budget, and accessibility problems, 121 samples were collected. A hand-held global position system (GPS) receiver was used to identify sample points and to capture the location information. Then, the samples were transported to the laboratory, air-dried, and sieved. The samples were separated for spectroscopic measurements and chemical analysis. The SOC was determined using a dry combustion method using a CNS elemental analyzer (Flash EA 1112 NC-Soil, Thermo Fisher Scientific, Pittsburgh, PA, USA). Spectral soil data were measured using an indoor spectral measurement in the wavelength ranges of 350 nm to 2500 nm using the FieldSpec-3 spectroradiometer. The sampling intervals were set to be 1 nm, and, as a result, a total of 2151 bands were produced.

2.3. Description and Preprocessing of Landsat-8 Images, and Soil Spectral Data Transformations

Landsat images are commonly used for SOC predictions due to their high spectral resolution [17]. This study also used Landsat-8 data of Operational Land Imager (OLI) Level 1 and Thermal Infrared Sensor (TIRS). The images acquired in December 2013 having less cloud cover (i.e., <10%) with path and raw numbers of 119 and 42 were downloaded from the United States Geological Survey archives (https://earthexplorer.usgs.gov/; accessed on 12 June 2018). The radiometric calibration was performed using ENVI. Radiometric calibration was done by converting the images into the top-of-atmosphere (TOA) reflectance using radiometric rescaling coefficients provided in the product metadata (MTL) file [18].
Moreover, bandwidth of the spectral soil data ranged from 350 nm to 2500 nm. Different spectral transformations such as multiplicative scatter correction (MSC), first derivative, second derivative, normalization, and continuum removal were applied to remove background noise. Moreover, smoothing approaches of the Savitzky–Golay filtering algorithm (SG) with a second-order polynomial and averaging were performed across a 10-band window to remove the complexity of bandwidths by eliminating redundancy between adjacent bands and compressing band data without losing information [19].

2.4. Data Sources, Software, and Extraction of Environmental Covariates

The study integrated covariates obtained from multiple sources including remote sensing, digital elevation models (DEMs), soil maps, soil spectral data, and other sources (e.g., OSM layers). The environmental covariates were extracted using the equations provided in Table 1.
A 10 m resolution digital elevation model (DEM) was obtained from the Fuzhou city surveying department. The aerial images for DEM generation were acquired using combined digital camera (i.e., Canon EOS M) systems with resolutions of 18 MP and a 22.3/14.9 CMOS image sensor mounted on UAVs. The flight was planned on sunny days where there was low cloud cover, and the online maps were used for flight planning purpose. The UAV captured photos on the basis of the plan within 2 s intervals so as to remove blurred photos. Then, the DEM was generated using photogrammetric techniques while the ground control points (GCPs) were used for verification. The high-resolution DEM data were used to extract landform, hydrological, and spatial indices Moreover, the topography was classified using the approaches described in FAO guidelines for soil description [46] (Figure 2).
Additional landform characteristics of the study area were described using different landscape metrics including slope, relief, curvature, TWI, and TPI (Table 1). Furthermore, proximity to essential features such as industries, landfill sites, water-points, and port points was calculated as the Euclidean distance from sampling points [43].
The land use/cover of the study area was classified using a supervised image classification technique with the maximum-likelihood classification method. Representative ground control points (i.e., 50 points for each class) were captured to compare the class signatures. Out of 50 ground control points, 40 were used for classification calibration, whereas the remaining 10 were used for validation. The land use was classified into four predominant land use/cover types of urban, vegetation (forest), agriculture, and water bodies (Figure 3).
According to the classification results, 34.44% of the area is covered by vegetation mainly consisting of forestland, whereas the remaining area is covered by urban land (29.40%), agricultural land (21.80%), and water bodies (14.30%). The detailed classification results and the accuracy of classification are given in Table 2.
The biophysical indices including land surface temperature (LST), vegetation temperature condition index (VTCI), and normalized difference built-up index (NDBI) were used as a proxy to measure land temperature, moisture, and the extent of built-up areas, respectively. The land surface temperature (LST) of the study area was calculated using TIR bands of Landsat 8 (Figure 4), as described in Table 1 [29].
The LST had minimum, maximum, and average values of 6.25 °C, 15.15 °C, and 10.65 °C with a CV of 47% for the given season. The LST distribution pattern shows that the higher values were observed toward the urbanized areas (e.g., downtown), whereas it decreased toward the mountains and vegetated land. The VTCI was calculated as stated in Table 1, and the minimum, maximum, and mean VTCI values were 0.04, 0.99, and 0.51, respectively. The lowest VTCI value (0.04) was extracted from the highly urbanized area (downtown), whereas the highest value (0.99) was obtained from the forestland of the study area (Figure 5).
Additionally, the normalized differences built-up index (NDBI) was used to extract the extent of built-up areas (Figure 6) using a threshold value of 0.038 [47]. Additionally, the spectral bands of the Landsat-8 image were used along with soil spectral data.
All environmental covariate maps and point datasets were converted into similar formats and stacked using ArcGIS 10.3 software. All datasets were resampled into 30 m pixels using ‘bilinear resampling’ for continuous data and ‘nearest’ for categorical data, before converting them into the same coordinate and projection systems in the R environment [48].

2.5. Statistical Analysis, Spatial Modeling, and Validation

The summary statistics were calculated using standard descriptive statistics for the variates. The coefficient of variation (CV) and the Pearson correlation coefficients were used to measure the spread of the mean and the linear dependence between SOC values and environmental covariates, respectively. Similarly, Wilding (1985) used CV values to characterize the variability into classes of low (0–15%), moderate (16–35%), and high (>36%) SOC variability [49].
A combination of random forest and CART was used for prediction, performance evaluation, and selection of the essential variables. Random forest avoids overfitting and provides reliable out-of-bag (OOB) error estimates, avoiding the need for an independent validation dataset [50].
Here, 75% of soil samples were used for the training of a model and identifying essential variables for prediction, and then the model was applied to the validation set. Additionally, CART, was implemented using the R package “rpart” to improve interpretability [51]. The regression forest was generated with a minimum split factor of 4. Similarly, Wiesmeier et al. [52] used a combination of CART and RF for SOM determination using limited samples in semi-arid steppes in northern China.
The model parameters (i.e., ntree, mtry, and nodesize) were optimized using both tuning and manual adjustment, which were done iteratively on training datasets in R [53]. Liaw and Wiener [54] confirmed that applying the tuning function can improve the results of the model. Hence, according to previous studies, the tuning function was applied to training data and used to select the number of mtry and number of the trees grown. Finally, the mtry and the total number of trees selected by tune function to grow forest were crosschecked with manual mtry entries. The trained and developed model was first applied to the training dataset and used to identify essential variables for the prediction. The variable importance curve was implemented recursively 15 times, and the highly influential variables that frequently appeared at the top rank were selected accordingly [55].
A tree was built from a bootstrap sample of the original dataset, which allows for robust error estimation with the remaining test of out-of-bag (OOB) samples. The OOB samples were predicted from the bootstrap samples, and the mean square error (MSEOOB) that depends on the samples omitted from the bootstrapped OOB samples was computed as stated in Equation (1).
MSE OOB = n 1 i = 1 n ( z i z ^ i OOB ) 2 ,
where n is the number of observations, zi is the average prediction of the i-th observation, and zi OOB is the average prediction of the i-th observation from all trees for which the observation was OOB.
Additionally, the percentage of explained variance (Varex) was calculated as
Var ex = 1 MSE OOB Var z ,
where, Varz is the total variance of the response variable.
The ntree parameter (the number of trees in the forest) was adjusted using the mean squared error (MSE values) as a measure of the prediction accuracy of the RF model (Figure 7).
Similar to this study, the MSE error estimate was used in the validation procedure of another study [56]. Wiesmeier et al. [52] also used MSEoob for validation of the RF model in the prediction of soil organic matter. The OOB estimate was used to evaluate prediction performance as it solves the problem of collecting an independent validation dataset [57]. Many previous studies that used RF for SOC prediction rarely used cross-validation since RF internally estimates errors during the running of the model. For instance, Minasny et al. [58] reviewed studies that focused on SOC prediction and reported that more than half of the studies did not show validation.
The values of ntree and mtry were selected to be 500 and 52, respectively, as a large number of trees is recommended for datasets with sophisticated features and when the emphasis is on identifying essential variables (Figure 8).
The statistical indices of root-mean-square error (RMSE) and mean error (ME) were used, as stated in Equations (3) and (4), to evaluate the performances of the model.
ME = 1 n i = 1 n ( x i ^ y i ) ,
RMSE = 1 n i = 1 n ( ( x i ^ y i ) 2 ,
where n is the number of data points, x i represents the measured values,   y i represents the predicted values,   x i ¯ is the mean measured value, and y i ¯ is the mean predicted values.

3. Results

3.1. Spatial Prediction of SOC and Model Validation

The result of prediction was evaluated on the training and test datasets. The statistics of validation are provided in Table 3.
The ME of the training set varied from 0.05 mg·g−1 to 0.06 mg·g−1 with a mean value of 0.04 mg·g−1, while the RMSE values ranged from 1.35 mg·g−1 to 1.38 mg·g−1 with a mean value of 1.37 mg·g−1. The ME of the test set varied from 0.3 mg·g−1 to 0.43 mg·g−1 with a mean value of 0.4 mg·g−1, while the RMSE values ranged from 0.94 mg·g−1 to 0.97 mg·g−1 with a mean value of 0.96 mg·g−1. In addition, the tuned random forest model applied to select optimum number of mtry and number of trees was verified. Accordingly, the out-of-bag (OOB) errors were 0.44, 0.22, 0.11, and 0.09 for mtry values of 9, 17, 34, and 52, respectively, implying that the use of more predictor variables led to a lower OOB error (Figure 9) and better accuracy for prediction of SOC in the study area.
The results indicate that the RF had excellent performance for SOC prediction. The SOC variability map predicted by the RF model using selected important predictors is shown in Figure 10.
The mean concentration of SOC was 11.70 mg·g−1, with a minimum value of 0.74 mg·g−1 and a maximum value of 45.80 mg·g−1. It was highly variable with a coefficient of variation (CV) of 81% and a standard deviation of 8.92. The SOC concentration was grouped into six levels (i.e., low, moderate, high, humus, and organo-humus classes.) [59]. As can be portrayed from the reclassification map (Figure 11) of SOC level distribution, a shallow level of SOC content was concentrated in the middle of the study area, where the highly urbanized downtown with high NDBI values is located, comprising about 2.01% of the total area.
A moderate SOC level was predicted in the surroundings of the urbanized area, whereas a high SOC level was predicted in forested (vegetated) areas. According to the reclassification results, 0.01% of the study area had organic SOC levels. The remaining area was covered by moderate, high, humus, and organo-humus SOC levels, with proportions of 5.11%, 2.17%, 47.51%, and 43.19%, respectively.
The humus SOC level covered about half of the study area (47.51%), spreading from east to west through the middle of the study area. The organo-humus SOC level also constituted almost half of the study area (43.19%), mainly concentrated in mountainous areas of the southern and northern parts, whereas a moderate SOC level was located in the central parts.

3.2. Importance of Environmental Variables

Based on the IncNodePurity index model, the 15 most important variables for prediction of SOC distribution in the study area were VTCI, ASTAVI, EVI, lithology, NDBI, TPI, slope, X2224, LST, Band4, X424., RVI, TWI, Band7, and NDVI (Figure 12).
According to the result, the VTCI index was the most crucial variable for the prediction of SOC among all 52 predictors, followed by VIs, lithology, and NDBI. The remaining variables, such as TPI, slope, X2224, LST, Band4, X424, RVI, TWI, Band7, and NDVI, occupied the remaining top 15 positions as influential variables. NDBI and LST were among the influential variables with rankings of fifth and ninth.
Similarly, the topographic variables of TPI and slope had a high impact ranking in sixth and seventh, respectively, while TWI was 13th.

3.3. Influence of Topographic and Vegetation Indices

The topographic and hydrological factors such as slope, curvature, aspect, TPI, MBI, and TWI of the study area had high variability with increased CV (Table 4). Slope, aspect, and MBI were more variable (with >50 % CV) than curvature, TPI, and TWI, which had CV values < 50%. Additionally, the Pearson correlation values of the slope, curvature, aspect, TPI, MBI, and TWI were variable. Slope, curvature, and MBI were negatively correlated to SOC, while aspect, TPI, and TWI were positively correlated.
However, only MBI and curvature were significantly correlated with SOC at (p < 0.05). Moreover, the LS factor was highly variable, with a CV of 68%. On the other hand, all VIs had a very low CV except for RVI and BI indices, which had slight variability. Additionally, the Pearson correlation values showed that NDVI, ASTAVI, GI, EVI, SAVI, CTVI, TVI, NRVI, RVI, and TSAVI_91 had a significant positive correlation with SOC concentration at p < 0.05 (Table 4).

3.4. SOC Distribution across Landform, Land Use, and Lithology

The distribution of SOC content across landforms, land use, and lithology was examined. The result shows that its distribution was highly variable across landforms with a mean CV of 78.67%. The highest SOC variation was recorded within medium-gradient mountains (SM) with a CV of 122.08% (Table 5), followed by high-gradient mountains (TM) (12.64 mg·g−1).
Considerably high SOC contents were predicted in plain, medium-gradient hills (SH), and medium-gradient mountains (SM) with mean values of 11.94 mg·g−1, 11.88 mg·g−1, and 10.90 mg·g−1, respectively. On the contrary, a lower proportion of SOC was predicted in high-gradient hills (TH) and shoreline (WR) with mean values of 7.47 mg·g−1 and 1.67 mg·g−1, respectively.
With respect to land use, forestland had the highest mean SOC contents (13.60 mg·g−1) than other land uses followed by agricultural (10.43 mg·g−1), urban (9.74 mg·g−1), and water body (4.55 mg·g−1) (Table 6).
Furthermore, the highest amount of SOC was recorded in weathered residuum (dominated by bauxite and laterite) with a mean SOC content of 14.69 mg·g−1. Similarly, an increased amount of SOC was predicted on fluvial (14.52 mg·g−1) and shale lithology (13.57 mg·g−1). However, unconsolidated marine rock, granite, gneiss (migmatite), and pyroclastic (ignimbrite) lithology had lower contents of SOC with proportions of 10.99 mg·g−1, 11.61 mg·g−1, 11.91 mg·g−1, and 12.04 mg·g−1, respectively (Table 7).
The lowest SOC content was predicted in sandstone (greywacke) or arkose lithology (5.53 mg·g−1), inland water or lake deposits (7.47 mg·g−1), and siltstone, mudstone, or claystone (7.92 mg·g−1).

4. Discussion

4.1. Spatial Variability of SOC

The results of this study indicated that the RF model has good prediction performance with corresponding R2 and RMSE values of 0.96 and 0.91 mg·g−1, respectively. This result is consistent with previous studies, which reported RF as an accurate model for prediction of SOC. For instance, Nawar and Mouazen [60] reported high accuracies of RF with R2 = 0.98 and RMSE = 0.10% for SOC prediction. Likewise, Forkuor et al. [55] reported a better prediction accuracy for RF than other machine learning models with R2 of 0.26 and RMSE of 0.51. Moreover, the spatial distribution map of SOC had a similar trend with distribution of selected environmental factors including soil temperature, soil moisture, and the extent of impervious surface (see Figure 4, Figure 5 and Figure 6) suggesting their strong relationships. The built-up area had the lowest SOC content, but the forested and mountainous regions had the highest SOC content suggesting a high influence of impervious surface for such disparities.

4.2. Importance of Environmental Variables

VTCI was the most essential variable for the prediction of SOC among the 52 environmental predictors. The main reason for the superior influence of VTCI on the spatial prediction of SOC distribution may be attributed to its precision in measuring the crop water status and the subsequent impacts of soil moisture on the aboveground biomass [39]. Moreover, since VTCI is derived on the basis of the relationship between land surface temperature (LST) and vegetation index (NDVI), it could provide more information for the spatial prediction of SOC. Alvarez and Lavado [61] also reported that the SOC contents of the topsoil were highly correlated with moisture to temperature ratios. However, the wetness index (WI), derived as tasseled cap transformation (TCT), provided little information for prediction of SOC distribution in the study area. This result suggests that an understanding of soil moisture derived in combination with the vegetation and temperature ratio (VTCI) may be a better predictor for SOC mapping in a similar environment.
The reason for the high importance of VTCI for SOC mapping may be due to its control on the extent of vegetation cover (i.e., quantity and quality of OM entering into the soil), the rate of mineralization, and litter decomposition [62]. Moreover, since soil moisture is an essential element for microbial growth, it may facilitate the degradation of plant and animal residues that improve SOC contents [10], even though a continuous microbial activity may increase mineralization, causing depletion of SOC in the long term [20].
ASTAVI and EVI were the second and third most influential variables for the prediction of SOC distribution. The reason for their influence may be due to their ability to provide proxies for measuring aboveground biomass, which may influence SOC contents stored in the soil as a litter [63]. Additionally, the characteristics of ASTAVI (i.e., low sensitivity to soil backgrounds) may contribute to deriving more information related to SOC contents than other vegetation indices [64]. Similarly, the enhanced vegetation index (EVI) has lower soil and atmospheric effects than the other VIs used in this study. The exclusively stronger influences of the ASTAVI and EVI than the other vegetation indices suggest that VIs with minimized brightness-related soil effects (i.e., ASTAVI and EVI) may perform better than RVI, NDVI, and others (Table 1) for SOC prediction in complex landscapes such as Fuzhou city. However, previous studies did not separately use VIs on the basis of their strength in reducing background effects for predictions of SOC; rather, they used a mixture of both. For instance, Peng et al. [65] also confirmed that EVI was one of the top predictors. Compared to remote sensing raw-bands, vegetation indices performed better perhaps due to their ability to accurately inferring crop/vegetation/bare soil characteristics. This result suggested that VIs were good predictors, but VIs that significantly remove soil color effects were better predictors for SOC contents.
NDBI was one of the top predictors for SOC distribution since impervious surfaces may impact the spatial distribution of SOC. This result shows that the NDBI index can be used for SOC prediction at the city scale in a complex urban landscape where the land-use change into built-up areas is prominent. Similarly, previous studies reported that land-use change associated with urbanization processes was profoundly influencing total carbon fluxes. For instance, Raciti et al. [66] compared the carbon (C) pools in residential areas with similar soil type to forested reference sites and reported substantial variability. Hence, the selection of the NDBI variable among the top predictors suggests that land surface variables may be among highly influential predictors for modeling soil properties in complex urban landscapes. This result also suggests that residential and nonresidential areas may have a diverse pattern of SOC distribution.
Additionally, NDBI and SOC had a significant positive correlation (r = 0.26, p < 0.05), showing their relationship. The high extent of impervious surface may impact SOC contents since increased human disturbances in combination with artificial materials from buildings cause variability. Similarly, Yan et al. [3] confirmed the importance of impervious surface on SOC quantification in their study. In line with this, a few previous studies reported the negative impacts of impervious surface on SOC and microbial activities. For instance, Wei et al. [20] reported substantially low concentrations of SOC in impervious areas than open soils.
Likewise, the LST was among the most influential variables, whereby SOC showed decreasing trends when LST increased. This result is in agreement with previous research that confirmed temperature as an important variable for SOC contents [67].
The soil spectral data with 2224 nm and 424 nm wavelength, along with LANDSAT-8 raw spectral data (red and shortwave infrared 2), were among influential predictors. Similarly, other studies confirmed hyperspectral remote sensing data as key predictors for SOC [68,69]. Peng et al. [65] also reported that Landsat bands combined with VIS–NIR were efficient for the prediction of SOC in the topsoil.
Generally, biophysical land surface indices (i.e., VTCI, LST, and NDBI), brightness removed vegetation indices (i.e., ASTAVI and EVI), topographic indices (i.e., TPI and Slope), soil spectral bands (i.e., 424 nm and 2224 nm) were the most influential variables for SOC prediction in the study area, suggesting their potential importance in similar complex urban landscapes.

4.3. Influence of Topographic and Vegetation Indices

MBI (r = −0.18, p < 0.05) and curvature (r = −0.19, p < 0.05) were significantly negatively correlated with SOC contents, emphasizing the influence of the prominent variability of the topographic and hydrological factors of the study area. Another study also confirmed the high influence of landscape and hydrological variables on the SOC patchiness [70]. Furthermore, there was a negative correlation between slope and SOC that can be explained by erosion and deposition processes [71]. Similarly, Stevens [72] reported a positive correlation between SOC contents and aspect, TPI, and TWI. The positive relationship between aspect and SOC contents is explained by the direction of slopes with respect to the sun, which may cause variations in temperature, leading to decomposition. Shaded slopes may have a lower decomposition due to a low soil temperature. Low soil wetness may result in a decrease in SOC since it affects microbial activity [72], which may have a positive impact on the composition of organic materials. Variability of the LS factor may also impact the amount of deposit [73]. Compared with topographic variables, vegetation indices were better correlated with SOC contents.

4.4. SOC Distribution across the Landform, Land Use, and Lithology

The prediction of increased SOC contents in high-gradient mountains (TM), medium-gradient hills (SH), and medium-gradient mountains (SM) may be due to the availability of a large share of vegetation cover in these landforms. Additionally, landform-related lithological, moisture, and temperature variations may have influenced the SOC distribution along the altitudinal gradient [74]. The reason for decent SOC contents in plain landform may be associated with the abundance of fluvial materials. Additionally, the dominant soils were paddy soil (paddy field), with a small proportion of red soil and plaster fields (i.e., cultivate fields covered by plaster sheet). In particular, the paddy field of the area has undergone intensive agricultural practices where the application of organic fertilizers may have influenced the SOC content of the landform [8].
Low SOC contents in the high-gradient hills (TH) may be attributed to the impact of complex land use in the area and disturbance posed by intensive human activities. Kemen port, transportation hubs, and large development projects in proximity may have contributed to the decrease in SOC contents [75]. Additionally, Luoyuan Bay located in this area may also have contributed to the low SOC distribution in this landform. Wu et al. [76] reported that the soil sediments in Luoyuan Bay had a high level of eutrophication, suggesting that the SOC contents of this area may be washed into the surrounding water body. Another reason could be the increased use of nitrogen fertilizers and fossil fuels in the surrounding areas, which may have stimulated the loss of organic carbon from terrestrial soils into the surrounding water bodies through erosion [76].
In general, landform variations may contribute to changes in soil properties (clay content), human activities (i.e., land use), vegetation quality and quantity, and alteration of climatic variables (temperature and precipitations), which may affect the SOC distribution. Similarly, a previous study reported that landform elements play a significant role in the variability of SOC [77]. Even though the variability of SOC depends on geologic, climatic, topographic, vegetation, and land-use variables, landform plays a crucial role in modifying all these factors.
Additionally, the SOC distribution was highly variable across the land use. The study area was characterized by a highly urbanized downtown, which was mainly covered by impervious surfaces, to a hilly and mountainous area, which was dominated by plantation forests. Therefore, these land use/cover dynamics might contribute to SOC variations in the area. Similarly, Chuai et al. [35] reported a higher SOC density in towns, woodland, paddy land, and shallow water areas due to industrial and human influence.
Additionally, weathered residuum had the highest content of SOC. This may be due to possession of aluminosilicate red soil (acid red soil), Fe, and permeate paddy soil. Fe leaching, coupled with the high activities of microorganisms, may lead to the increased content of SOC in the weathered residuum, bauxite, and laterite lithology. The increase in soil acidity limits the mineralization of organic matter, reducing microbial activity and the rate of mineralization. On the other hand, the low oxygenation of soils such as “paddy soils” also limits microbial activity, slowing down the mineralization of organic matter. A previous study also reported increased SOC in bauxite lithology due to increased pyrite, owing to redox reactions [78].
Moreover, a considerable amount of the area dominated by laterite lithology has been used for agriculture, where there is cultivation of paddy rice and a resulting long-term application of feedlot manure to cropland. Therefore, the farm management systems may have affected the SOC contents. Similar results were reported by Lui et al. [79], who stated that long-term fertilization practices profoundly influenced the SOC content of red soil of southern China. However, the lowest content of SOC in the sandstone-dominated area may be attributed to its minimal weathering [80]. Similarly, a high concentration of SOC in siliceous red soil, red clay, and yellow-red soils can be related to their high binding capability to organic matter. Additionally, the high content of silicon material may have influenced the chemical and physical properties, since the soils of this area originated from arenaceous rock. The fluvial lithology was mainly located in the eastern parts of the Mingjian riverbanks of the study area, where the landform was dominated by plain (LP) surface. The reason for the lower concentration of SOC contents in acid red soil, silicon aluminum red soil, and red sand soils may be related to the higher content of sand and the low decomposition rate. This result is consistent with Zhang et al. [81], who reported similar results in the hilly red soil region of South China.
Similarly, previous studies reported consistent results regarding the contribution of landform, land use, and lithology to the spatial variability of SOC [50,55].

5. Conclusions

This study was aimed at predicting the spatial distribution of SOC in relation to environmental covariates, including land temperature, soil moisture, and extent of urbanization using the RF and CART models in the coastal city of Fuzhou city, China. To that end, a compressive set of biophysical land surface variables such as LST, VTCI, and NDBI was combined with other environmental covariates. The environmental covariates extracted from remote sensing, topography, and soil spectral sources were used to predict the SOC distribution and to select the most influential variables for the spatial prediction.
The results indicated that the RF had excellent performance for SOC prediction. The SOC content of the study area was highly variable owing to the heterogeneity of the landform, land use, lithology, surface temperature, soil moisture contents, and the rate of built-up area. Biophysical variables including soil moisture status index (VTCI), adjusted transformed soil-adjusted vegetation index (ASTAVI), enhanced vegetation index (EVI), lithology, and built-up index (NDBI) were the five most influential predictors contributing to the prediction of SOC in the study area. The results suggested that the biophysical land variables of VTCI, LST, and NDBI were good predictors. Additionally, the selection of NDBI as one of the essential predictors may provide further insight to predict SOC in residential areas.
The current study derived biophysical land variables such as soil moisture, land surface temperature, and human influence using substantially improved indices, which were often ignored in the prediction of SOC in previous studies. However, these indices were among the most influential variables for the prediction of the spatial distribution of SOC in a complex coastal urban environment of Fuzhou city. Even though the variables were derived from high-resolution Landsat-8 images, the result might be further improved in future studies by using better-resolution images such as Sentinel products.
This result shows that similar approaches and biophysical land variables can be used in other regional- and local-level SOC prediction studies in similar subtropical coastal environments.

Author Contributions

Conceptualization, T.H.S., J.S. (Jinming Sha), and X.L.; methdology, T.H.S., J.S. (Jinming Sha), and X.L.; software, T.H.S., Z.B., and X.L.; Validation, T.H.S., J.S. (Jinming Sha), and X.L.; formal analysis, T.H.S.; Investigation, T.H.S., J.S. (Jinming Sha), and X.L.; resources, J.S. (Jinming Sha), and T.N.; data curation, T.H.S., J.S. (Jinming Sha), X.L, and Z.B; writing—original draft preparation; T.H.S.; writing—review and editing, T.N., J.S. (Jiali Shang), and A.B.A.; supervision, J.S. (Jinming Sha) and X.L.; project administration, J.S. (Jinming Sha); funding acquisition, J.S. (Jinming Sha) and T.N. All authors read and approved the final manuscript.

Funding

This study was supported by the following projects: “Environmental monitoring and assessment of land use/land cover change impact on ecological security using geospatial technologies” (2018YFE0184300), supported by the Chinese Ministry of Science and Technology (MOST) (2020–2022), “GIS and Remote Sensing for Sustainable Forestry and Ecology (SUFOGIS)” (598838-EPP-1-2018-EL-EPPKA2-CBHE-JP), supported by EU Erasmus+ Project, October 2018–October 2021, and “Soil heavy metal monitoring based on hyperspectral data by use of GWR model” (2018I0005), supported by the Fujian Provincial Department of the Science and Technology, 2018–2020.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

This paper was produced under scientific activity and cooperation carried out by T. Noszczyk with Fujian Normal University (China), and Wolkite University (Ethiopia).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tiessen, H.; Cuevas, E.; Chacon, P. The role of soil organic matter in sustaining soil fertility. Nature 1994, 371, 783–785. [Google Scholar] [CrossRef]
  2. Oyonarte, C.; Aranda, V.; Durante, P. Soil surface properties in Mediterranean mountain ecosystems: Effects of environmental factors and implications of management. For. Ecol. Manag. 2008, 254, 156–165. [Google Scholar] [CrossRef]
  3. Yan, Y.; Kuang, W.; Zhang, C.; Chen, C. Impacts of impervious surface expansion on soil organic carbon—A spatially explicit study. Sci. Rep. 2015, 5, 17905. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Pouyat, R.; Groffman, P.; Yesilonis, I.; Hernandez, L. Soil carbon pools and fluxes in urban ecosystems. Environ. Pollut. 2002, 116, S107–S118. [Google Scholar] [CrossRef]
  5. Chen, W.Y. The role of urban green infrastructure in offsetting carbon emissions in 35 major Chinese cities: A nationwide estimate. Cities 2015, 44, 112–120. [Google Scholar] [CrossRef]
  6. Wang, S.; Zhuang, Q.; Jia, S.; Jin, X.; Wang, Q. Spatial variations of soil organic carbon stocks in a coastal hilly area of China. Geoderma 2017, 314, 8–19. [Google Scholar] [CrossRef]
  7. Raciti, S.M.; Hutyra, L.R.; Rao, P.; Finzi, A.C. Inconsistent definitions of “urban” result in different conclusions about the size of urban carbon and nitrogen stocks. Ecol. Appl. 2012, 22, 1015–1035. [Google Scholar] [CrossRef]
  8. Xia, X.; Yang, Z.; Xue, Y.; Shao, X.; Yu, T.; Hou, Q. Spatial analysis of land use change effect on soil organic carbon stocks in the eastern regions of China between 1980 and 2000. Geosci. Front. 2017, 8, 597–603. [Google Scholar] [CrossRef] [Green Version]
  9. McBratney, A.; Mendonça Santos, M.; Minasny, B. On digital soil mapping. Geoderma 2003, 117, 3–52. [Google Scholar] [CrossRef]
  10. Curtin, D.; Beare, M.H.; Hernandez-Ramirez, G. Temperature and Moisture Effects on Microbial Biomass and Soil Organic Matter Mineralization. Soil Sci. Soc. Am. J. 2012, 76, 2055–2067. [Google Scholar] [CrossRef]
  11. Wang, D.; He, N.; Wang, Q.; Lü, Y.; Wang, Q.; Xu, Z.; Zhu, J. Effects of Temperature and Moisture on Soil Organic Matter Decomposition Along Elevation Gradients on the Changbai Mountains, Northeast China. Pedosphere 2016, 26, 399–407. [Google Scholar] [CrossRef]
  12. Schrumpf, M.; Kaiser, K.; Schulze, E.-D. Soil Organic Carbon and Total Nitrogen Gains in an Old Growth Deciduous Forest in Germany. PLoS ONE 2014, 9, e89364. [Google Scholar] [CrossRef] [Green Version]
  13. Bartalis, Z.; Wagner, W.; Naeimi, V.; Hasenauer, S.; Scipal, K.; Bonekamp, H.; Figa, J.; Anderson, C. Initial soil moisture retrievals from the METOP-A Advanced Scatterometer (ASCAT). Geophys. Res. Lett. 2007, 34. [Google Scholar] [CrossRef] [Green Version]
  14. Zhang, D.; Tang, R.; Zhao, W.; Tang, B.; Wu, H.; Shao, K.; Li, Z.L. Surface soil water content estimation from thermal remote sensing based on the temporal variation of land surface temperature. Remote Sens. 2014, 6, 3170–3187. [Google Scholar] [CrossRef] [Green Version]
  15. Zhuo, L.; Tao, H.; Wei, H.; Chengzhen, W. Plantation in Fujian Province, China. PLoS ONE 2016, 11, 151527. [Google Scholar]
  16. Liu, Y.; Su, C.; Zhang, H.; Li, X.; Pei, J. Interaction of Soil Heavy Metal Pollution with Industrialisation and the Landscape Pattern in Taiyuan City, China. PLoS ONE 2014, 9, e105798. [Google Scholar]
  17. Roy, D.P.; Wulder, M.A.; Loveland, T.R.; Woodcock, C.E.; Allen, R.G.; Anderson, M.C.; Helder, D.; Irons, J.R.; Johnson, D.M.; Kennedy, R.; et al. Landsat-8: Science and product vision for terrestrial global change research. Remote Sens. Environ. 2014, 145, 154–172. [Google Scholar] [CrossRef] [Green Version]
  18. U.S. Department of Interior. Using the USGS Landsat 8 Product. 2013. Available online: http://landsat.usgs.gov (accessed on 27 April 2021).
  19. Savitzky, A.; Golay, M.J. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1951, 40, 1627–1639. [Google Scholar] [CrossRef]
  20. Wei, Z.-Q.; Wu, S.-H.; Zhou, S.-L.; Li, J.-T.; Zhao, Q.-G. Soil Organic Carbon Transformation and Related Properties in Urban Soil Under Impervious Surfaces. Pedosphere 2014, 24, 56–64. [Google Scholar] [CrossRef]
  21. Jenness, J. Topographic Position Index (tpi_jen.avx_extension) for Arcview 3.x, v.1.3a, Jenness Enterprises [EB/OL]. 2006. Available online: http://www.jennessent.com/arcview/tpi.htm (accessed on 12 June 2018).
  22. Rasul, A.; Balzter, H.; Smith, C. Spatial variation of the daytime Surface Urban Cool Island during the dry season in Erbil, Iraqi Kurdistan, from Landsat 8. Urban Clim. 2015, 14, 176–186. [Google Scholar] [CrossRef] [Green Version]
  23. Thompson, J.A.; Bell, J.C.; Butler, C.A. Digital elevation model resolution: Effects on terrain attribute calculation and quantitative soil-landscape modeling. Geoderma 2001, 100, 67–89. [Google Scholar] [CrossRef]
  24. Wan, Z.; Wang, P.; Li, X. Using MODIS Land Surface Temperature and Normalized Difference Vegetation Index products for monitoring drought in the southern Great Plains, USA. Int. J. Remote Sens. 2004, 25, 61–72. [Google Scholar] [CrossRef]
  25. Avdan, U.; Jovanovska, G. Algorithm for Automated Mapping of Land Surface Temperature Using LANDSAT 8 Satellite Data. J. Sens. 2016, 2016, 1480307. [Google Scholar] [CrossRef] [Green Version]
  26. Rondeaux, G.; Steven, M.; Baret, F. Optimization of soil-adjusted vegetation indices. Remote Sens. Environ. 1996, 55, 95–107. [Google Scholar] [CrossRef]
  27. Ford, W.M.; Menzel, M.A.; Odom, R.H. Elevation, aspect, and cove size effects on southern Appalachian salamanders. Southeast. Nat. 2002, 1, 315–324. [Google Scholar] [CrossRef]
  28. Weier, J.; Herring, D. Measuring Vegetation (NDVI and EVI). NASA Earth Observatory. 2000. Available online: http://earthobservatory.nasa.gov/Features/MeasuringVegetation/ (accessed on 27 April 2021).
  29. Chejarla, V.R.; Maheshuni, P.K.; Mandla, V.R. Quantification of LST and CO2 levels using Landsat-8 thermal bands on urban environment. Geocarto Int. 2016, 31, 913–926. [Google Scholar] [CrossRef]
  30. Pennock, D.J.; Zebarth, B.J.; De Jong, E. Landform classification and soil distribution in hummocky terrain, Saskatchewan, Canada. Geoderma 1987, 40, 297–315. [Google Scholar] [CrossRef]
  31. Sørensen, R. Topographical Influence on Soil Chemistry; Sveriges lantbruksuniv: Uppsala, Sweden, 2006. [Google Scholar]
  32. Taghizadeh-Mehrjardi, R.; Nabiollahi, K.; Kerry, R. Digital mapping of soil organic carbon at multiple depths using different data mining techniques in Baneh region, Iran. Geoderma 2016, 266, 98–110. [Google Scholar] [CrossRef]
  33. Möller, M.; Volk, M. Effective map scales for soil transport processes and related process domains—Statistical and spatial characterization of their scale-specific inaccuracies. Geoderma 2015, 247, 151–160. [Google Scholar] [CrossRef]
  34. Li, X.; McCarty, G.W.; Karlen, D.L.; Cambardella, C.A. Topographic metric predictions of soil redistribution and organic carbon in Iowa cropland fields. Catena 2018, 160, 222–232. [Google Scholar] [CrossRef]
  35. Chuai, X.-W.; Huang, X.-J.; Wang, W.-J.; Zhang, M.; Lai, L.; Liao, Q.-L. Spatial Variability of Soil Organic Carbon and Related Factors in Jiangsu Province, China. Pedosphere 2012, 22, 404–414. [Google Scholar] [CrossRef]
  36. Mróz, M.; Sobieraj, A. Comparison of Several Vegetation Indices Calculated on the Basis of a Seasonal Spot XS Time Series, and Their Suitability for Land Cover and Agricultural Crop Identification. Tech. Sci. 2004, 7, 39–66. [Google Scholar]
  37. Van Khoa, P.; Hoa, N.H.; Tuan, D.A. Using remote sensing indices to reduce effects of hillshade on LANDSAT 8 imagery. Available online: https://vnuf.edu.vn/documents/454250/1808434/9.pdf (accessed on 27 April 2021).
  38. Payero, J.O.; Neale, C.M.U.; Wright, J.L. Comparison of eleven vegetation indices for estimating plant height of alfalfa and grass. Appl. Eng. Agric. 2004, 20, 385–393. [Google Scholar] [CrossRef] [Green Version]
  39. Patel, N.R.; Parida, B.R.; Venus, V.; Saha, S.K.; Dadhwal, V.K. Analysis of agricultural drought using vegetation temperature condition index (VTCI) from Terra/MODIS satellite data. Environ. Monit. Assess. 2012, 184, 7153–7163. [Google Scholar] [CrossRef]
  40. Ahmad, F. Spectral vegetation indices performance evaluated for Cholistan Desert. J. Geogr. Reg. Plan. 2012, 5, 165–172. [Google Scholar]
  41. Mróz, M.; Sciences, A.S.-T. Undefined Comparison of several vegetation indices calculated on the basis of a seasonal SPOT XS time series, and their suitability for land cover and agricultural crop identification. Tech. Sci. 2004, 7, 39–66. [Google Scholar]
  42. Kandwal, R.; Jeganathan, C.; Tolpekin, V.; Kushwaha, S.P.S. Discriminating the invasive species, ‘Lantana’ using vegetation indices. J. Indian Soc. Remote Sens. 2009, 37, 275–290. [Google Scholar] [CrossRef]
  43. Peuquet, D.J. An algorithm for calculating minimum Euclidean distance between two geographic features. Comput. Geosci. 1992, 18, 989–1001. [Google Scholar] [CrossRef]
  44. Barsi, J.; Lee, K.; Kvaran, G.; Markham, B.; Pedelty, J. The Spectral Response of the Landsat-8 Operational Land Imager. Remote Sens. 2014, 6, 10232–10251. [Google Scholar] [CrossRef] [Green Version]
  45. Erbek, F.S.; Özkan, C.; Taberner, M. Comparison of maximum likelihood classification method with supervised artificial neural network algorithms for land use activities. Int. J. Remote Sens. 2004, 25, 1733–1748. [Google Scholar] [CrossRef]
  46. FAO. Guidelines for Soil Description, 4th ed.; Food and Agriculture Organization of the United Nations: Rome, Italy, 2006; ISBN 92-5-105521-1. [Google Scholar]
  47. Grover, A.; Singh, R. Monitoring Spatial Patterns of Land Surface Temperature and Urban Heat Island for Sustainable Megacity. Environ. Urban. ASIA 2016, 7, 38–54. [Google Scholar] [CrossRef] [Green Version]
  48. Hengl, T.; Heuvelink, G.B.M.; Kempen, B.; Leenaars, J.G.B.; Walsh, M.G.; Shepherd, K.D.; Sila, A.; MacMillan, R.A.; De Jesus, J.M.; Tamene, L.; et al. Mapping soil properties of Africa at 250 m resolution: Random forests significantly improve current predictions. PLoS ONE 2015, 10, e0125814. [Google Scholar] [CrossRef] [PubMed]
  49. Nielsen, D.R.; Bouma, J. Soil spatial variability. In Proceedings of the Workshop of the ISSS (Int. Society of Soil Science) and the SSSA (Soil Science Society of America), Las Vegas, NV, USA, 30 November–1 December 1985. [Google Scholar]
  50. Grimm, R.; Behrens, T.; Märker, M.; Elsenbeer, H. Soil organic carbon concentrations and stocks on Barro Colorado Island—Digital soil mapping using Random Forests analysis. Geoderma 2008, 146, 102–113. [Google Scholar] [CrossRef]
  51. Breiman, L. Classification and regression trees. In Encyclopedia of Ecology; CRC Press: Boca Raton, FL, USA, 1984. [Google Scholar]
  52. Wiesmeier, M.; Barthold, F.; Blank, B.; Kögel-Knabner, I. Digital mapping of soil organic matter stocks using Random Forest modeling in a semi-arid steppe ecosystem. Plant Soil 2011, 340, 7–24. [Google Scholar] [CrossRef]
  53. Sorenson, P.T.; Small, C.; Tappert, M.C.; Quideau, S.A.; Drozdowski, B.; Underwood, A.; Janz, A. Monitoring organic carbon, total nitrogen, and pH for reclaimed soils using field reflectance spectroscopy. Can. J. Soil Sci. 2017, 97, 241–248. [Google Scholar] [CrossRef]
  54. Liaw, A.; Wiener, M. Classification and Regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  55. Forkuor, G.; Hounkpatin, O.K.L.; Welp, G.; Thiel, M. High resolution mapping of soil properties using Remote Sensing variables in south-western Burkina Faso: A comparison of machine learning and multiple linear regression models. PLoS ONE 2017, 12, e0170478. [Google Scholar] [CrossRef]
  56. Probst, P.; Boulesteix, A.L. To tune or not to tune the number of trees in random forest. J. Mach. Learn. Res. 2018, 18, 6673–6690. [Google Scholar]
  57. Ridgeway, G. gbm: Generalized Boosted Regression Models. R Packag. Version 2006, 1, 55. [Google Scholar]
  58. Minasny, B.; McBratney, A.B.; Malone, B.P.; Wheeler, I. Chapter One—Digital Mapping of Soil Carbon. Adv. Agron. 2013, 118, 1–47. [Google Scholar]
  59. Zhang, L.; Liu, Y.; Li, X.; Huang, L.; Yu, D.; Shi, X.; Chen, H.; Xing, S. Effects of soil map scales on simulating soil organic carbon changes of upland soils in Eastern China. Geoderma 2018, 312, 159–169. [Google Scholar] [CrossRef]
  60. Nawar, S.; Mouazen, A. Comparison between Random Forests, Artificial Neural Networks and Gradient Boosted Machines Methods of On-Line Vis-NIR Spectroscopy Measurements of Soil Total Nitrogen and Total Carbon. Sensors 2017, 17, 2428. [Google Scholar] [CrossRef] [PubMed]
  61. Alvarez, R.; Lavado, R.S. Climate, organic matter and clay content relationships in the Pampa and Chaco soils, Argentina. Geoderma 1998, 83, 127–141. [Google Scholar] [CrossRef]
  62. Jenny, H.; Amundson, R. Factors of Soil Formation: A System of Quantitative Pedology; Courier Corporation: North Chelmsford, MA, USA, 1994. [Google Scholar]
  63. Sun, W.; Li, X.; Niu, B. Prediction of soil organic carbon in a coal mining area by Vis-NIR spectroscopy. PLoS ONE 2018, 13, e0196198. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Purevdorj, T.S.; Tateishi, R.; Ishiyama, T.; Honda, Y. Relationships between percent vegetation cover and vegetation indices. Int. J. Remote Sens. 1998, 19, 3519–3535. [Google Scholar] [CrossRef]
  65. Peng, Y.; Xiong, X.; Adhikari, K.; Knadel, M.; Grunwald, S.; Greve, M.H. Modeling Soil Organic Carbon at Regional Scale by Combining Multi-Spectral Images with Laboratory Spectra. PLoS ONE 2015, 10, e0142295. [Google Scholar] [CrossRef] [Green Version]
  66. Raciti, S.M.; Groffman, P.M.; Jenkins, J.C.; Pouyat, R.V.; Fahey, T.J.; Pickett, S.T.A.; Cadenasso, M.L. Accumulation of Carbon and Nitrogen in Residential Soils with Different Land-Use Histories. Ecosystems 2011, 14, 287–297. [Google Scholar] [CrossRef]
  67. Gray, J.M.; Bishop, T.F.A.; Wilson, B.R. Factors Controlling Soil Organic Carbon Stocks with Depth in Eastern Australia. Soil Sci. Soc. Am. J. 2015, 79, 1741. [Google Scholar] [CrossRef] [Green Version]
  68. Vohland, M.; Ludwig, M.; Thiele-Bruhn, S.; Ludwig, B.; Vohland, M.; Ludwig, M.; Thiele-Bruhn, S.; Ludwig, B. Quantification of Soil Properties with Hyperspectral Data: Selecting Spectral Variables with Different Methods to Improve Accuracies and Analyze Prediction Mechanisms. Remote Sens. 2017, 9, 1103. [Google Scholar] [CrossRef] [Green Version]
  69. Gabriel, J.L.; Zarco-Tejada, P.J.; López-Herrera, P.J.; Pérez-Martín, E.; Alonso-Ayuso, M.; Quemada, M. Airborne and ground level sensors for monitoring nitrogen status in a maize crop. Biosyst. Eng. 2017, 160, 124–133. [Google Scholar] [CrossRef]
  70. Cheng, S.; Fang, H.; Zhu, T.; Zheng, J.; Yang, X.; Zhang, X.; Yu, G. Effects of soil erosion and deposition on soil organic carbon dynamics at a sloping field in Black Soil region, Northeast China. Soil Sci. Plant Nutr. 2010, 56, 521–529. [Google Scholar] [CrossRef]
  71. Rhanor, T. Topographic Position and Land Cover Effects on Soil Organic Carbon Distribution of Loess-Veneered Hillslopes in the Central United States. Ph.D. Dissertation, Southern Illinois University Carbondale, Carbondale, IL, USA, 2013. [Google Scholar]
  72. Stevens, F.; Bogaert, P.; Van Oost, K.; Doetterl, S.; Van Wesemael, B. Regional-scale characterization of the geomorphic control of the spatial distribution of soil organic carbon in cropland. Eur. J. Soil Sci. 2014, 65, 539–552. [Google Scholar] [CrossRef]
  73. Yazidhi, B. A Comparative Study of Soil Erosion Modelling in Lom Kao-Phetchabun, Thailand; ITC: Kolkata, India, 2003.
  74. Bui, L.V.; Stahr, K.; Clemens, G. A fuzzy logic slope-form system for predictive soil mapping of a landscape-scale area with strong relief conditions. Catena 2017, 155, 135–146. [Google Scholar] [CrossRef]
  75. Peterson, B.J.; Melillo, J.M. The potential storage of carbon caused by eutrophication of the biosphere. Tellus B 2017, 37, 117–127. [Google Scholar] [CrossRef]
  76. Wu, H.-Y.; Chen, K.-L.; Zhang, P.; Fu, S.-F.; Hou, J.-P.; Chen, Q.-H. Eco-environmental quality assessment of Luoyuan Bay, Fujian province of East China based on biotic indices. Ying yong sheng tai xue bao = J. Appl. Ecol. 2013, 24, 825–831. [Google Scholar]
  77. Sherwood, W.C.; Hartshorn, A.S.; Eaton, L.S. Soils, geomorphology, landscape evolution, and land use in the Virginia Piedmont and Blue Ridge. In The Mid-Atlantic Shore to the Appalachian Highlands: Field Trip Guidebook for the 2010 Joint Meeting of the Northeastern and Southeastern GSA Sections; Geological Society of America: Boulder, CO, USA, 2010; pp. 31–50. [Google Scholar]
  78. Ellahi, S.S.; Taghipour, B.; Nejadhadad, M.; Salamab Ellahi, S.; Taghipour, B.; Nejadhadad, M. The Role of Organic Matter in the Formation of High-Grade Al Deposits of the Dopolan Karst Type Bauxite, Iran: Mineralogy, Geochemistry, and Sulfur Isotope Data. Minerals 2017, 7, 97. [Google Scholar] [CrossRef] [Green Version]
  79. Liu, Y.; Gao, P.; Zhang, L.; Niu, X.; Wang, B. Spatial heterogeneity distribution of soil total nitrogen and total phosphorus in the Yaoxiang watershed in a hilly area of northern China based on geographic information system and geostatistics. Ecol. Evol. 2016, 6, 6807–6816. [Google Scholar] [CrossRef]
  80. Kidd, D.; Malone, B.; McBratney, A.; Minasny, B.; Webb, M. Operational sampling challenges to digital soil mapping in Tasmania, Australia. Geoderma Reg. 2015, 4, 1–10. [Google Scholar] [CrossRef]
  81. Zhang, Y.; Yu, Q.; Ma, W.; Chen, L. Atmospheric deposition of inorganic nitrogen to the eastern China seas and its implications to marine biogeochemistry. J. Geophys. Res. 2010, 115, D00K10. [Google Scholar] [CrossRef]
Figure 1. The study area map: (A) Fujian Province, China; (B) Fuzhou city; (C) sample points.
Figure 1. The study area map: (A) Fujian Province, China; (B) Fuzhou city; (C) sample points.
Remotesensing 13 01682 g001
Figure 2. The topographic description map of the study area.
Figure 2. The topographic description map of the study area.
Remotesensing 13 01682 g002
Figure 3. The land use/cover map of the study area.
Figure 3. The land use/cover map of the study area.
Remotesensing 13 01682 g003
Figure 4. LST map of the study area.
Figure 4. LST map of the study area.
Remotesensing 13 01682 g004
Figure 5. Vegetation temperature condition index (VTCI) map of the study area.
Figure 5. Vegetation temperature condition index (VTCI) map of the study area.
Remotesensing 13 01682 g005
Figure 6. The normalized difference built-up index (NDBI) map of the study area.
Figure 6. The normalized difference built-up index (NDBI) map of the study area.
Remotesensing 13 01682 g006
Figure 7. MSE error vs. number of trees.
Figure 7. MSE error vs. number of trees.
Remotesensing 13 01682 g007
Figure 8. The optimal value of mtry with respect to the out-of-bag error estimate (OOB error) for random forest (ntree = 500) with a tuning function applied to the model.
Figure 8. The optimal value of mtry with respect to the out-of-bag error estimate (OOB error) for random forest (ntree = 500) with a tuning function applied to the model.
Remotesensing 13 01682 g008
Figure 9. Number of trees adjusted using tuning parameters.
Figure 9. Number of trees adjusted using tuning parameters.
Remotesensing 13 01682 g009
Figure 10. SOC variability map predicted using RF.
Figure 10. SOC variability map predicted using RF.
Remotesensing 13 01682 g010
Figure 11. The classes or levels of SOC distribution in the study area.
Figure 11. The classes or levels of SOC distribution in the study area.
Remotesensing 13 01682 g011
Figure 12. Variable importance measured as a function of IncNodePurity index.
Figure 12. Variable importance measured as a function of IncNodePurity index.
Remotesensing 13 01682 g012
Table 1. Explanation of environmental covariates and their derivation equations.
Table 1. Explanation of environmental covariates and their derivation equations.
Variable ExplanationFormula (Value)Reference
Normalized difference built-up index N D B I = ( S W I R N I R ) ( S W I R + N I R ) [20]
Topographic position index (TPI) T P I = M o n 1 M n n [21]
Greenness index GI = 0.33 × B 2 0.35 × B 3 0.45 × B 4 + 0.69 × B 5 B 6 × 0.024 B 7 × 0.26 [22]
Curvature Curvature = 2 ( D + E ) × 100 [23]
Vegetation temperature condition index V T C I = L S T N D V I ,   M a x L S T N D V I L S T N D V I ,   M a x L S T N D V I , M i n [24]
Proportion of Vegetation P V = ( N D V I N D V I m i n ) ( N D V I M a x N D V I m i n ) 2 [25]
Adjusted transformed soil-adjusted Vegetation Index A S T A V I = 1.22 × ( N I R 1.22 × R 0.03 ) / ( 1.22 × N I R + R 1.22 × 0.03 + 0.08 × ( 1 + 1.22 2 ) [25]
Emissivity E = 0.004     PV   + 0.986 [25]
Soil-adjusted vegetation index S A V I = ( B 5 B 3 ) × ( L ) B 5 + B 4 + L [26]
Aspect Aspect   = 57.29578 tan A 2 ( [ z / y ] , [ z / x ] ) [27]
Normalized difference vegetation index N D V I = ( N I R R ) ( N I R + R ) [28]
Land surface temperature L S T = ( B T ) ( 1 + ( 0.00115 BT / 1.4388 ) × ln ( e ) ) [29]
Profile curvature P Curv = p 2 r + 2 pqs + p 2 tp 2 + q 21 + p 2 + q 2 [30]
Brightness temperature BT = ( 1321.078 ) / Ln ( ( 774.8853 / toa ) + 1 ) ) 273.15 ) [25]
Slope S l o p e = tan A ( [ z / y ] 2 + [ z / x ] 2 ) ) × 57.29578 [31]
Enhanced vegetation index E V I = ( NIR R ) / ( NIR + C 1 × R C 2 × B + L ) [28]
Brightness index BI   = 0.36 × B 2 + 0.3972 × B 3 + 0.39 × B 4 + 0.697 × B 5 + B 6 × 0.23 + B 7 × 0.16 [22]
Wetness index WI = 0.26 × B 2 + 0.21 × B 3 + 0.09 × B 4 + 0.06 × B 5 B 6 × 0.76 B 7 × 0.53 [32]
Mass balance index MBI = 1 + / [ 1 + ln ( 1 + S T I i n S T I s ) ] [33]
Topographic wetness index T W I = ln [ A s / tan β ] [34]
Length slope (LS) factor L S   F a c t o r = n + 1 ( CA ) 22.130.4 × sin × G ( 0.0896 × 1.3 ) [35]
Thiam’s transformed vegetation index T T V I = | ( NDVI   + 0.5 ) | [36]
Corrected transformed vegetation index CTVI   = [ ( NDVI + 0.5 )   /   abs ( NDVI + 0.5 ) ] × [ abs ( NDVI   + 0.5 ) ] ^ 0.5 [37]
Normalized ratio vegetation index TVI   = [ ( NIR R )   /   ( NIR   +   R ) ] 0.5 + 0.5 [38]
Normalized ratio vegetation index N R V I = ( R V I 1 ) ( R V I + 1 ) [38]
Ratio vegetation index R V I = R N I R [38]
Difference vegetation index D V I = N I R R [39]
Transformed soil-adjusted vegetation index (Baret and Guyot, 1991) T S V I 91 = a ( N I R a R b ) / R + a ( N I R b ) + 0.08 ( 1 + a 2)[40]
Transformed soil-adjusted vegetation index (Baret et al. 1989) T S V I 89 = T S V I = a ( NIR aR b ) / [ R + a ( NIR b ) + 0.08 ( 1 + a 2 ) ] [26]
Perpendicular vegetation index (Perry and Lautenschlager, 1984) P V I 84 = 1 / sqrt ( a 2 + 1 )   ×   ( NIR     aR   +   b ) [41]
Perpendicular vegetation index (Qi, et al-, 1994) PVI _ 94 = ( ρ G i r , s ρ P i r ) 2 + ( ρ G r , s ρ P r ) 2 [41]
Perpendicular vegetation index (Richardson and Wiegand, 1977) P V I R = ( N I R a X R E D + b ) 1 + ( a X a ) [42]
Distance from industries, landfillDST_Inds = proximity analysis[43]
Distance from primary roadsDST_PR = proximity analysis[43]
Distance from water bodies, canal, dam, ditch, drain, riverbank, river, stream, waterfall, weirDST_RCD = proximity analysis[43]
Aerosol band 1Coastal aerosol (0.443 µm)[44]
Blue bandBlue (450–510 nm)[44]
Green bandGreen (530–590 nm)[44]
Red bandRed (640–670 nm)[44]
NIR bandNear infrared (NIR) (850–880 nm)[44]
SWIR-1 bandSWIR-1 (1570–1650 nm)[44]
SWIR-2 bandSWIR-2 (2110–2290 nm)[44]
Land-use typesSupervised the maximum-likelihood method[45]
Landform typesTPI-based landform classification[21]
Soil pHpH meter
LithologyFAO soil database
Spectral soil dataWavelength (from 350 to 2500 nm with 1 nm interval)
Table 2. Land-cover classification of the study area.
Table 2. Land-cover classification of the study area.
Main Land CoverArea (ha)Area Proportion (%)Com. ErrorOm. ErrorAcc.
Agriculture28,198.7121.803.040.58
Forest44,562.5134.442.130.7495.52
Water bodies18,580.5914.360.150.38
Urban land38,034.3629.401.23.08
Com. Error, commission error; Om. Error, omission error; Acc., accuracy.
Table 3. Accuracy assessment of SOC prediction using RF.
Table 3. Accuracy assessment of SOC prediction using RF.
DatasetsIndicatorsMin25%Mean75%Max.SD
Training setME0.050.040.040.030.060.003
RMSE1.351.361.371.371.380.026
R20.68 0.76 0.033
Test setME0.30.40.40.430.360.001
RMSE0.940.950.960.960.970.024
R20.88 0.91 0.920.021
N = 121 observations; ME, mean error; RMSE, root-mean-square error of SOC predicted.
Table 4. Descriptive statistics of selected environmental variables.
Table 4. Descriptive statistics of selected environmental variables.
Variables (Unit)Min.Max.MeanzSDCVyrL.B.U.B.Coeff.p−ValuesR2
SOC (mg·g1)0.7045.8011.708.920.811-----
Slope (°)0.00103.5137.5523.550.58−0.16−0.330.020.020.090.03
Curvature−5.805.940.222.110.36−0.19*−0.36−0.010.040.040.04
Aspect−1.00357.92160.70117.890.670.16−0.020.330.030.080.03
TPI−2.292.10−0.010.930.410.10−0.080.270.010.290.01
MBI−0.840.870.060.540.61−0.18*−0.350.000.030.050.03
TWI−1.9511.556.793.660.430.17−0.010.340.030.060.03
LS Factor0.0021.79110.6337.4730.68−0.06−0.230.120.680.960.01
NDVI−0.470.890.640.240.190.20*−0.36−0.020.040.030.04
ASTAVI−0.701.350.960.350.180.23*−0.39−0.050.050.010.05
WI0.010.210.110.040.39−0.14−0.310.040.020.130.02
GI−0.090.270.120.080.350.28*−0.44−0.110.080.000.08
BI0.020.390.200.070.40−0.11−0.280.070.010.230.01
EVI−0.070.550.290.150.380.28*−0.43−0.100.080.000.08
SAVI−0.481.130.780.290.200.24*−0.40−0.070.060.010.06
CTVI0.421.121.000.110.160.22*−0.39−0.050.050.010.05
TVI0.091.371.210.170.280.22*−0.38−0.040.050.010.05
NRVI−0.320.760.520.190.200.24*−0.40−0.070.060.010.06
RVI0.517.183.771.610.460.27*−0.43−0.100.070.000.07
TSAVI_910.010.730.530.150.250.26*−0.42−0.080.070.000.07
PV0.000.590.050.081.680.14−0.040.310.020.130.02
LST (°C)6.2515.1510.652.110.470.09−0.090.270.010.320.01
VTCI0.040.990.510.230.480.09−0.090.270.010.320.01
NDBI−0.650.07−0.300.160.440.26*0.080.410.070.000.07
DST_INDST (km)1.0562.5916.3512.15 −0.12−0.290.060.010.200.01
DST_PR (km)0.0743.649.779.010.88−0.05−0.230.120.000.550.00
DST_RCD (km)0.0425.053.384.921.46−0.13−0.310.050.020.140.02
Band2824.941552.341057.66172.560.700.16−0.020.330.030.080.03
Band3496.941503.39826.82213.150.610.10−0.080.280.010.250.01
Band4308.631526.16647.02270.150.770.13−0.040.310.020.140.02
Band5404.823859.932202.38780.760.42−0.23*−0.39−0.050.050.010.05
Band6123.382683.871230.17543.240.50−0.02−0.190.160.000.870.00
Band754.041817.46671.75397.670.640.09−0.090.260.010.350.01
pH0.009.104.821.810.380.12−0.060.290.010.190.01
N = 121 observations; Min, minimum; Max, maximum; zSD, standard deviation; CV, coefficient of variation; yr, correlation coefficient; L.B., lower boundary; U.B., upper boundary; Coeff, coefficient; p-values in bold are different from 0 with a significance level of α = 0.05; * Pearson correlation coefficient significant at 95% confidence interval.
Table 5. SOC distribution in mg·g−1 across the landforms.
Table 5. SOC distribution in mg·g−1 across the landforms.
LandformMinMaxMeanSDCV (%)
TM3.7034.0612.648.5267.43
LP1.6529.4711.946.7856.79
SH0.7045.8011.889.2878.09
SM2.2926.2410.9013.31122.08
TH1.701.701.70--
WR3.8311.127.485.1568.96
Table 6. SOC distribution in mg·g−1 as a function of land use.
Table 6. SOC distribution in mg·g−1 as a function of land use.
Land UseMinMaxMeanSDCV (%)
Agriculture0.7039.5710.438.3780
Forestland1.6545.8013.608.4962
Urban area1.7026.59.749.83101
Water bodies0.68.894.555.86129
Table 7. SOC (mg·g−1) distribution across soils developed in different lithologies.
Table 7. SOC (mg·g−1) distribution across soils developed in different lithologies.
LithologyMinMaxMeanSDCV (%)
Pyroclastic, ignimbrite1.3445.8012.049.7380.80
Sandstone, greywacke, arkose1.7014.275.544.5381.80
Fluvial2.6329.4713.579.9773.51
Gneiss, migmatite0.7031.4711.918.6172.30
Granite4.3915.2411.614.9842.87
Shale3.7034.0614.529.5865.98
Siltstone, mudstone, claystone1.6512.037.934.4556.14
Inland water or lakes deposits3.8311.127.485.1568.96
Marine unconsolidated rock8.7313.2611.003.2029.10
Weathered residuum, bauxite, laterite2.1235.6714.6911.0675.30
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sodango, T.H.; Sha, J.; Li, X.; Noszczyk, T.; Shang, J.; Aneseyee, A.B.; Bao, Z. Modeling the Spatial Dynamics of Soil Organic Carbon Using Remotely-Sensed Predictors in Fuzhou City, China. Remote Sens. 2021, 13, 1682. https://doi.org/10.3390/rs13091682

AMA Style

Sodango TH, Sha J, Li X, Noszczyk T, Shang J, Aneseyee AB, Bao Z. Modeling the Spatial Dynamics of Soil Organic Carbon Using Remotely-Sensed Predictors in Fuzhou City, China. Remote Sensing. 2021; 13(9):1682. https://doi.org/10.3390/rs13091682

Chicago/Turabian Style

Sodango, Terefe Hanchiso, Jinming Sha, Xiaomei Li, Tomasz Noszczyk, Jiali Shang, Abreham Berta Aneseyee, and Zhongcong Bao. 2021. "Modeling the Spatial Dynamics of Soil Organic Carbon Using Remotely-Sensed Predictors in Fuzhou City, China" Remote Sensing 13, no. 9: 1682. https://doi.org/10.3390/rs13091682

APA Style

Sodango, T. H., Sha, J., Li, X., Noszczyk, T., Shang, J., Aneseyee, A. B., & Bao, Z. (2021). Modeling the Spatial Dynamics of Soil Organic Carbon Using Remotely-Sensed Predictors in Fuzhou City, China. Remote Sensing, 13(9), 1682. https://doi.org/10.3390/rs13091682

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop