Next Article in Journal
Intelligent, Flexible Artificial Throats with Sound Emitting, Detecting, and Recognizing Abilities
Previous Article in Journal
Multipoint Energy-Balanced Laser-Ultrasonic Transducer Based on a Thin-Cladding Fiber
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating the Heavy Metal Contents in Entisols from a Mining Area Based on Improved Spectral Indices and Catboost

1
School of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250101, China
2
College of Geodesy and Geomatics, Shandong University of Science and Technology, Qingdao 266590, China
3
Institute of Resource and Environmental Engineering, Hebei Geo University, Shijiazhuang 050031, China
4
School of Earth Sciences and Engineering, Sun Yat-sen University, Zhuhai 519000, China
5
School of Natural Sciences, Faculty of Science and Engineering, Macquarie University, Sydney, NSW 2109, Australia
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(5), 1492; https://doi.org/10.3390/s24051492
Submission received: 18 January 2024 / Revised: 15 February 2024 / Accepted: 22 February 2024 / Published: 25 February 2024
(This article belongs to the Section Remote Sensors)

Abstract

:
In the study of the inversion of soil multi-species heavy metal element concentrations using hyperspectral techniques, the selection of feature bands is very important. However, interactions among soil elements can lead to redundancy and instability of spectral features. In this study, heavy metal elements (Pb, Zn, Mn, and As) in entisols around a mining area in Harbin, Heilongjiang Province, China, were studied. To optimise the combination of spectral indices and their weights, radar plots of characteristic-band Pearson coefficients (RCBP) were used to screen three-band spectral index combinations of Pb, Zn, Mn, and As elements, while the Catboost algorithm was used to invert the concentrations of each element. The correlations of Fe with the four heavy metals were analysed from both concentration and characteristic band perspectives, while the effect of spectral inversion was further evaluated via spatial analysis. It was found that the regression model for the inversion of the Zn elemental concentration based on the optimised spectral index combinations had the best fit, with R2 = 0.8786 for the test set, followed by Mn (R2 = 0.8576), As (R2 = 0.7916), and Pb (R2 = 0.6022). As far as the characteristic bands are concerned, the best correlations of Fe with the Pb, Zn, Mn and As elements were 0.837, 0.711, 0.542 and 0.303, respectively. The spatial distribution and correlation of the spectral inversion concentrations of the As and Mn elements with the measured concentrations were consistent, and there were some differences in the results for Zn and Pb. Therefore, hyperspectral techniques and analysis of Fe elements have potential applications in the inversion of entisols heavy metal concentrations and can improve the quality monitoring efficiency of these soils.

1. Introduction

Entisols occur in every part of the world and are often affected by serious soil erosion or extensive agricultural activities [1]. Therefore, the detection of their quality is of great practical importance. With the increasing human demand for mineral resources, the accumulation of mining waste and slag has not only degraded land resources but has also degraded the soil ecology around mining areas via heavy metal pollution [2]. Hyperspectral remote sensing technology provides an efficient approach to the in situ monitoring of soil heavy metal contents as it provides information with good timeliness, spectral resolution, and measurement range [3,4,5].
Malley and Williams were the first to employ hyperspectral remote sensing technology to quantitatively invert the heavy metal concentrations in lake sediments, achieving good prediction accuracy [6]. Since then, many scholars have used hyperspectral remote sensing technology for the determination of heavy metal contents in soil and have made great progress [7,8,9,10]. Studies on the direct inversion of heavy metal concentrations in soil using hyperspectral remote sensing have found that derivative processing achieves better results than spectral data preprocessing with S-G convolution smoothing, inverse logarithm, multiple scattering correction, and envelope removal [11,12,13]. Basically, the characteristic bands of target elements are obtained by correlation analysis of the target heavy metal elements with spectral transform data and are then used for the estimation of elemental concentrations [14,15,16,17]. Later, other studies found that the construction of multi-band spectral indices using the characteristic bands of target elements could improve the inversion of soil heavy metal concentrations [18,19]. Peng et al. conducted a Pearson’s correlation analysis on the spectra and soil heavy metal concentration, and a total of 13 feature bands were determined for the inversion model [20]. Sawut et al. have confirmed that the dual-band spectral indices could be applied to the estimation of the content of As [19]. The theoretical basis for using hyperspectral remote sensing technology in soil heavy metal research is being gradually improved [21]. As far as inversion models are concerned, classical models of univariate and multivariate statistical analysis were mostly used in early studies [22,23,24]. However, given the influence of soil mineral composition, particle size, water content, and organic matter concentration on soil spectra, machine learning algorithms have been increasingly used in the field of quantitative estimation to improve the retrieval efficiency of heavy metal concentrations [25,26]. In addition, solar radiation conditions may affect the hyperspectral measurement. For example, clouds may change the illumination regime from directed illumination to diffuse illumination conditions and affect hyperspectral measurements [27]. By using statistical models and machine learning algorithms to study the correlation between soil heavy metal element concentrations and hyperspectral reflectance data, a commendable level of accuracy in estimating soil heavy metal content can also be achieved based on laboratory hyperspectral data [28], airborne hyperspectral imagery [29], and spaceborne hyperspectral imagery [28,30]. The airborne hyperspectral imagery can offer finer details [29], laboratory hyperspectral data is less susceptible to interference and can provide higher accuracy [28], and spaceborne hyperspectral imagery is more suitable for cost-effective large-scale pollution assessments [28,30]. However, regarding the problem of redundant and unstable spectral features due to interaction effects among soil elements, the optimisation of different elemental feature bands or spectral indices is not often considered in the inversion of heavy metal concentrations. In this study, radar plots of characteristic-band Pearson coefficients (RCBP) were used to screen combinations of the three-band spectral indices of different heavy metals. Combined with the Catboost algorithm, which provides automatic optimisation of parameter weights, the present research estimated heavy metal concentrations (Mn, Zn, As, and Pb) in soils using hyperspectral remote sensing.
Previous studies indicate that indirect measurements of heavy metal concentrations using hyperspectral remote sensing techniques are possible and that the relationships between heavy metal concentrations and soil components (e.g., clay minerals, carbonate minerals, iron oxides, and organic matter) are well established. These components have a strong absorption effect on metal cations and are key factors influencing soil spectral morphological characteristics. This means that the characteristic bands of the corresponding soil spectral curve response are easy to identify. Strong correlations between heavy metals and soil components may lead to better measurement [31,32,33,34]. Due to the complex composition of clay and carbonate minerals, most studies have constructed models for estimating indirect inversion based on the concentrations of Fe elements or organic matter and their characteristic bands [3,9,35,36]. Xia et al. analysed the correlation between eight types of heavy metal elements and organic matter concentrations, compared the locations of sensitive bands for different heavy metal elements and organic matter, and developed a partial least squares regression model for the inversion of heavy metal concentrations [37]. Cheng et al. studied soils in suburban Wuhan, Hubei Province, China, using hyperspectral techniques and found that the spectral estimates of Cd concentrations were strongly correlated with organic matter, while Cr and As were closely correlated with Fe, but the concentrations of Pb, Zn, and Cu were weakly correlated with either organic matter or Fe, leading to poor estimation results [35]. Shen et al. analysed the relationship between Fe and heavy metal concentrations and predicted Fe concentrations using transformed spectral data [38]. They then predicted heavy metal Cu concentrations based on the predicted Fe concentrations. However, there are relatively few such studies in entisols. Therefore, we aimed to analyse the correlations between Fe and the concentrations and characteristic bands of four heavy metals (Pb, As, Zn, and Mn. We used Pearson coefficients and radar plots to assess the importance of Fe elements to the estimation of their concentrations in entisols.
In summary, the redundancy and instability of the spectral features of the heavy metal elements in the entisols necessitate the development of a concentration inversion model to optimise the combination of spectral indices and parameter weights. Furthermore, the effect of Fe concentration on the inversion of other heavy metal elements content in entisols (e.g., Mn, Zn, As, and Pb) based on the hyperspectral method requires further investigation. In this study, we developed the inversion model of heavy metal concentrations (Mn, Fe, Zn, As, and Pb) by collecting spectral curves and composition data from entisols in a metal mining-impacted area in Harbin, China, aiming to demonstrate the feasibility of monitoring entisols quality using hyperspectral remote sensing technology. This study is divided into three parts:
(1)
The three-band spectral indices with good correlations with concentrations of Mn, Zn, As, and Pb were extracted. The combinations of spectral indices were screened by RCBP to invert the concentrations of each heavy metal using the Catboost algorithm.
(2)
The correlations between Fe concentration and elemental concentrations (and characteristic bands) of Mn, Zn, As, and Pb were established through Pearson coefficient analysis and radar plotting.
(3)
The spatial distribution and correlation of heavy metal elements in entisols in a metal mining-impacted area were determined using the spectral inversion concentration data of Mn, Zn, As, and Pb in soils.

2. Materials and Methods

2.1. Study Area and Dataset

The study area, approximately 8500 m2 in size, was situated in a maise field (Figure 1), located 1.5 km northwest of the Bailing Cu-Zn deposit in Xiaoling County, A’cheng District, Harbin City, Heilongjiang Province, China. The deposit is a skarn deposit, and the ore mineralogy is dominated by magnetite, sphalerite, arsenopyrite, and chalcopyrite, with minor pyrite and hematite occurrences [39]. The average ore grades in the main ore body for Fe, Cu, Zn, and As are 28.45%, 0.5%, 1.88%, and 5.51%, respectively [39]. The exploration of the deposit was started in the 1980s and has been exploited since 2009 [39]. Soils of the maise field are Entisols per the US Soil Taxonomy [40]. The research area has a continental climate (Dwa, according to the Koppen-Geiger climate classification), with dry and cold winters and hot and humid summers [41]. Precipitation is mainly concentrated during summer (June to September), and the annual precipitation is approximately 518 mm [39]. A small stream with a width of 1–2 m (during the dry season) passes by the deposit and is located along the west side of the studied maise field. The wastewater from the deposit forms a branch of the stream. The studied maise field is located in a low-lying area, where monsoons flood during the rainy season (June to September), likely submerging the overbank on the west side of the maise field.
A total of 95 soil samples were collected from the maise field at a depth of 0–20 cm, following a roughly 10 × 10 m grid pattern. The collected soil samples were then oven-dried at 65 °C for 72 h, ground using a rubber mallet, and sieved using a 0.18 mm mesh. The sieved soil sample then underwent ex situ pXRF analysis using a Niton XL3t 950 (Thermo-Fisher Scientific, Waltham, MA, USA) pXRF analyser. The sieved soil samples were collected in a homemade sample cup with a diameter of 23 mm and a height of 25 mm [42]. The sample cup was sealed by polyethylene films (tens of mm, with nearly no compositional interference and high X-ray transmission rate), which have been used in previous studies for protecting the pXRF detection window [43,44]. The sample cup was placed in a test stand and then underwent PXRF analysis (aperture up) using line power at 220 VAC [45]. The dwell time for pXRF analysis was 90 s, and eight replicate measurements were conducted for each soil sample. In order to monitor drift, insertion of certified reference material (CRM) 180–661 was conducted at regular intervals. The recoveries for As, Pb, and Fe are 87–91%, 92–99%, and 91–94%, respectively.
The ex situ pXRF data quality was also examined by the ICP-MS and ICP-AES method conducted at ALS Minerals (Guangzhou, China) per Chinese Standard SY/T 6404-2018 [46]. To this end, 0.25 g samples were first digested using a perchloric and nitric acid mixture (As3+ was oxidated to As5+, preventing As evaporation). Then, hydrofluoric acid was added, and the digestion vessels were heated using an EG 20A-PLUS hotplate produced by LabTech. Then, hydrochloric acid was added to the residues (after heating evaporation) and brought to a constant volume. Lastly, the prepared solutions underwent ICP-MS and ICP-AES analysis using an Aglient 7900 instrument. Arsenic, Mn, Pb, and Zn were determined using the ICP-MS method, and Fe was determined using the ICP-AES method. The LODs of the ICP-MS method for As, Mn, Pb, and Zn are 0.2 mg kg−1, 5 mg kg−1, 0.5 mg kg−1, and 2 mg kg−1, respectively. The LOD for Fe of the ICP-AES method is 100 mg kg−1. The accuracy of ICP-MS and ICP-AES measurement was confirmed by measuring three certified reference materials for soils (GBM908-10 supplied by Geostats Pty Ltd., O’Connor, WA, Australia, MRGeo08 supplied by ALS Minerals, Vancouver, BC, Canada, OREAS-25a supplied by Ore Research and Exploration Pty Ltd., Bayswater North, VIC, Australia) at regular intervals. The recovery percentages (ICP determined/CRM reported) for As, Fe, Mn, Pb, and Zn are 98–112%, 100–103%, 99–103%, 94–100%, and 97–105%, respectively. Furthermore, replicate ICP-MS/ICP-AES measurements were conducted on random samples to observe the standard deviations of the ICP-MS measurements. Standard deviations (and relative standard deviations) of replicate measurements can reflect the uncertainty of ICP-MS analysis. The average relative standard deviations of replicate ICP-MS measurements were 1.22%, 0.76%, 1.65%, 2.97%, and 5.61% for As, Fe, Mn, Pb, and Zn, respectively. The ex situ pXRF determined Mn, Fe, Zn, As, and Pb concentrations were highly correlated with those determined by ICP-MS. The R2 values for Mn, Fe, Zn, As, and Pb were 0.99, 0.96, 0.97, 0.99, and 0.93, respectively, indicating a good pXRF data quality. Therefore, the Mn, Fe, Zn, As, and Pb concentrations determined by ex situ pXRF were quite reliable and used for further comparison.
The dried and sieved soil samples were measured by using a FieldSpec 4 Hi-Res portable geophysical spectrometer produced by ASD (Analytical Spectral Devices) corporation in the USA with a wavelength of 350–2500 nm in midnoon (9:30 a.m.–14:30 p.m.) on a clear and sunny day in summer. The 95 soil samples (dried and sieved) were laid in 10 × 10 cm square size flat on a black cloth to reduce the reflection interference. The spectrometer’s fibre probe with a view angle of 25° was positioned approximately 18 cm perpendicular to the surface of the sample. Two people with dark clothes were involved in the scanning process, with one operating a laptop connecting FieldSpec 4 Hi-Res portable ASD geophysical spectrometer and another preparing the samples for measurement. Each soil sample underwent 10 replicate measurements to reduce errors, and the average of the spectra was obtained for the following study.

2.2. Methods

Figure 2 shows a flowchart of the whole procedure, which included three steps: (i) first-order derivative pre-processing of spectral data and acquisition of Pb, As, Zn, Mn and Fe elemental concentrations. (ii) Extraction of three-band spectral indices of Pb, Zn, Mn and As concentrations using a three-dimensional slicing method and screening of three-band spectral index combinations based on radar plots of the characteristic band Pearson coefficients (RCBP) using MATLAB R2017b and Excel 2021. (iii) Inversion of the heavy metal concentrations of Pb, As, Zn, and Mn based on the preferred combination of spectral indices and Catboost model, followed by the correlation of Fe with the characteristic bands and concentrations of the four heavy metal elements using Pearson coefficients and radar plots. (iv) Spatial analysis of the concentrations of the four heavy metals based on the concentrations of spectral inversion.

2.2.1. Preferred Combination of Spectral Indices Based on RCBP

Analysis of the existing results revealed that among the soil spectral pre-processing methods, excellent results were achieved by the derivative processing technique, which effectively eliminated the environmental background interference [11]. The first-order derivative data of the spectra were used to construct highly accurate three-band spectral indices with the inversion of soil heavy metal concentrations [18]. First, the collected soil spectra were processed for first-order derivatives. Then, to make full use of the spectral information, three combined forms of three-band spectral indices (SI) were constructed: SI1: (Ri − Rj)/(Ri + Rk), SI2: (Ri − Rj) + (Rk − Rj) and SI3: Ri/(Rj + Rk) (Figure 3a). The spectral indices that correlated well with the concentrations of Pb, As, Zn, and Mn were retained by traversing all possible triple-band combinations over the entire band range (Figure 3b).
RCBP was used to screen the combination of three-band spectral indices, which included the following steps. Based on the spectral indices of the four heavy metals that were retained (Figure 3b), the Pearson correlation coefficients of each element concentration and its three-band spectral index were obtained. The spectral indices were ranked from the largest to smallest correlation coefficient (Figure 3b), and then the spectral index combinations were determined to obtain the characteristic bands (Figure 3c). Pb, As, Zn and Mn were taken as the target elements in turn, and the correlation coefficients between the characteristic bands of the target elements and those of other elements were calculated under the condition of maintaining the same number of spectral index combinations (Figure 3d). The RCBPs between the target element and the other three elements were drawn separately (Figure 3e), and the spectral index combinations of the target element were further screened according to the principle of minimum correlation (Figure 3f). These spectral index combinations were used to construct inversion models of the concentrations of Pb, As, Zn and Mn in the soil. In Figure 3, Pearson coefficient 1 is the correlation between heavy metal element concentrations and spectral indexes. Pearson coefficient 2 is the correlation coefficient between the characteristic bands of the target element and those of other elements calculated according to the same number of spectral index combinations. The target element was cycled between As, Pb, Mn and Zn, with As used here as an example. The first index combination is the SI with the best correlation with the target element, the Top 2 index combinations are the first two SI with good correlation with the target element, and so on. The top 5 index combinations are the top five SI with good correlation with the target element.

2.2.2. Catboost Determination of Heavy Metal Concentrations

Catboost is a new open-source machine learning library proposed by Yandex in 2017, which consists of Categorical and Boosting components. It is a gradient-boosted decision trees (GBDT) framework based on a symmetric decision tree algorithm. It not only has fewer parameters and high accuracy but also supports categorical variables. The efficient and reasonable handling of categorical features is also its main advantage. The Catboost model overcomes the gradient bias and prediction offset problems of the traditional Boosting framework, which reduces the occurrence of overfitting and improves the accuracy and generalizability of the model. In the model training process, Catboost uses a serial approach to integrate multiple base learners. The training sample set remains the same in each round, and the sample weights are continuously updated by the learning results of the previous round, thus gradually reducing the bias caused by noise points. There is a dependency between the multiple weak learners generated by the training, and the final result is obtained by weighting the regression values of all the weak learners (Figure 4). The construction, training, and testing of the Catboost is completed in an integrated development environment based on Python and Anaconda.
The coefficient of determination (R2) and the root-mean-square error (RMSE) were chosen to assess the stability and accuracy of the models. The R2 indicates the stability of the model, with values closer to 1 indicating greater stability. The RMSE indicates the accuracy of the model, with smaller values indicating greater accuracy. The formulae for calculating the R2 and RMSE are as follows.
R 2 = i = 1 n ( y ^ i y ¯ ) 2 / i = 1 n ( y i y ¯ ) 2 R M S E = 1 n i = 1 n ( y ^ i y i ) 2
where y ^ i is the predicted values, y ¯ is the average of the observed values, y i is the observed values, and n is the number of predicted/observed values. The ratio of training set to test set in this study is 8:2.

2.2.3. Spatial Analysis

The accuracy of the inversion results was evaluated from three aspects: spatial distribution, spatial correlation, and spatial clustering. The spatial correlation and clustering of the four heavy metal elements were analysed. Spatial interpolation methods are widely used in resource and disaster management and environmental management. The inverse distance-weighted average interpolation (IDW) method is based on the theory of the first law of geography. This considers that a predicted point becomes less affected with distance from an observation point and is one of the more applied interpolation methods [47].

3. Results

3.1. Spectral Index Combination Preference and Heavy Metal Concentration Assessment

The spectral index combination optimisation is completed using the RCBP method, with the first step being the extraction of SIs. The spectra of all soil samples were processed for first-order derivatives, and then the correlations between the first-order derivative data and the measured concentrations of Pb, Zn, Mn, As, and Fe elements were analysed. The spectral indices with good correlations with each element concentration were extracted. The top five groups of spectral indices with good correlations with the five elemental contents were obtained separately (Table 1). The spectral indices were found to be SI2 and SI3 for Pb concentration, SI2 for Zn concentration, SI1 and SI3 for both Mn and Fe, and SI1 for As. According to the correlation coefficient r, the As concentration showed the best correlation with the spectral indexes, followed by Zn, Mn and Pb.
The latter is to optimise the spectral characteristic bands of each element. To reduce the redundancy of spectral features due to the existence of interactions among soil elements, the spectral indices of Pb, As, Zn and Mn extracted from Table 1 were combined according to the ranking of correlation coefficients (taking Pb as an example, the optimal three-band spectral index is the first index in the table; i.e., ① (RFD1278 − RFD622) + (RFD1530 − RFD622). The top two spectral index combinations are ① (RFD1278 − RFD622) + (RFD1530 − RFD622) and ② (RFD466 − RFD622) + (RFD1530 − RFD622). The top three spectral indices were combined as ① (RFD1278 − RFD622) + (RFD1530 – RFD622), ② (RFD466 − RFD622) + (RFD1530 − RFD622) and ③ (RFD450 − RFD622) + (RFD1530 − RFD622). The top four spectral index combinations are ① ② ③ ④ and the top five spectral index combinations are ① ② ③ ④ ⑤). With combinations of the same number of spectral indices, Pb, As, Zn and Mn were taken in turn as the target element, and the Pearson coefficients between the characteristic bands of the target elements and those of the other three elements were calculated and analysed. Based on the results, separate radar plots of the correlation coefficients between Pb, As, Zn and Mn were obtained (Figure 5). Figure 5 shows that the top three spectral index combinations were suitable for selection to ensure a small correlation between the characteristic bands of Pb and those of each other element; arsenic is the same case. The smallest correlation between the characteristic bands of Zn and those of each other element indicates the top two spectral index combinations and the same is true for Mn.
The top three well-correlated spectral index combinations of Pb and As were used to construct concentration inversion models with characteristic bands of 450, 466, 622, 1278 and 1530 nm for Pb and 622, 746, 930, 938, 1102, 1122 and 1274 nm for As. The top two well-correlated spectral index combinations of Zn and Mn were used for the inversion of elemental concentrations, with characteristic bands of 450, 622, 630, and 1230 nm for Zn and 1318, 1646, 1806, 2271, 2275, and 2383 nm for Mn. The Catboost algorithm was used to invert the elemental concentrations, and the accuracy of the test set used for each elemental concentration inversion model is shown in Figure 6. The regression model with the best fit was obtained as the inversion of Zn concentration, with an R2 of 0.8786 for the test set, followed by Mn (R2 = 0.8576), As (R2 = 0.7916) and Pb (R2 = 0.6022).

3.2. Correlations between Fe and Heavy Metal Elements in Terms of Concentrations and Spectra

The correlations between Fe and the Pb, As, Zn and Mn heavy metal elements in terms of concentration and characteristic bands were analysed using Pearson coefficients. The correlations between concentrations are shown in Figure 7.
The correlations of the characteristic bands of Fe with those of Pb, As, Zn and Mn were analysed using Pearson coefficients and radar plots (Figure 8). The spectral indices ① (in Table 1) of Pb have the best correlation with Fe, with a correlation coefficient of 0.837, followed by the ② and ④. The ③ and ⑤ have weaker correlations with Fe, with correlation coefficients of 0.276 and 0.131, respectively. The spectral indices ④ (in Table 1) of Mn showed the best correlation with Fe, with a correlation coefficient of 0.542, and the ②, ③, ①, and ⑤ showed weaker correlations with Fe, with correlation coefficients of 0.299, 0.289, 0.221 and 0.199, respectively. The spectral indices ④ (in Table 1) of Zn elements have the best correlation with Fe, with a correlation coefficient of 0.711, followed by the ⑤ and ④, while the ③ and ① have the weakest correlations with Fe, with correlation coefficients of 0.22 and 0.004, respectively. The spectral indices ① (in Table 1) for As correlate best with Fe, with a correlation coefficient of 0.303, followed by the ⑤, ③ and ②, and the ④ correlate least with Fe, with a correlation coefficient of 0.089. In summary, Fe has the best correlation with Pb in the 458, 654 and 1090 nm characteristic bands (r = 0.837). In the 458, 474, 654 and 1090 nm characteristic bands, Fe has the best correlation with Zn (r = 0.711). In the 458, 474, 522, 630, 654, 930, 1090, 1330 and 1694 nm characteristic bands, Fe has the highest correlation with Mn (r = 0.542). The correlations between Fe and As in the characteristic bands are low, with coefficients of r < 0.303.

3.3. Analysis of the Spatial Distribution of Soil Heavy Metal Concentrations Based on Spectral Inversion

3.3.1. Spatial Distribution

To test the effect of spectral inversion, inverse distance-weighted interpolation of the inverse concentrations of Pb, As, Zn, and Mn at 95 sampling points was performed to obtain the spatial distribution of soil heavy metal contents (Figure 9). It can be seen from Figure 9 that after interpolation, the distributions of measured and spectral-predicted As and Mn values are in good agreement, with the concentrations of As being generally higher in the west and lower in the east, while the concentrations of Mn are generally higher in the west and north and lower in the central, east and south. The spatial distributions of spectral-predicted and measured concentrations of Pb and Zn are also relatively consistent; however, for Pb, the predicted concentrations are locally lower than the measured concentrations. For Zn, the predicted concentrations are locally higher than the measured concentrations.

3.3.2. Spatial Correlation Analysis

Based on the measured and spectral-predicted concentrations, Moran’s I values were used to determine the global Moran’s I values for the four heavy metals (Pb, As, Zn, and Mn) in the soils of the study area (Table 2). The global Moran’s I are all >0, indicating that all four heavy metals show positive spatial correlations, i.e., larger (smaller) elemental contents are more likely to be clustered [48], but their correlations are low. The Moran’s I of the predicted and measured concentrations of As and Mn are similar, while those of the predicted concentrations of Pb and Zn are much smaller than those of the measured concentrations. The P-values of the measured concentrations of the four heavy metals are <0.05, and the Z-scores are >1.96, indicating that there is a certain spatial aggregation of each heavy metal element [49]. The P-values of the predicted concentrations of As and Mn are both <0.05, and the Z-scores are both >1.96, but the P-values of the predicted concentrations of Pb and Zn are both >0.05 with Z-scores < 1.96. This indicates that in terms of spatial correlation, the predictions of As and Mn concentrations are good, while those of Pb and Zn concentrations are not very satisfactory.

3.3.3. Spatial Clustering Analysis

The global Moran’s I only tell us whether the space appears clustered, discrete, or randomly distributed, but not where it appears [50]. Therefore, further local Moran’s I analysis was conducted to determine where outliers or agglomerations occur. Figure 10 shows the spatial distribution characteristics of the aggregation types of the four heavy metals. The spatial clustering of the measured and predicted concentrations of As and Mn are in good agreement, with most points being insignificant, followed by a high-high aggregation and one point with an anomalously high value. The spatial clustering of the measured and predicted concentrations of Pb and Zn differed, with most points showing consistent insignificance. However, the distributions of high-high aggregation, low-low aggregation, high anomalies, and low anomalies of the predicted and measured concentrations of Pb differed significantly. The distributions of high-high aggregation and high anomalies of measured and predicted concentrations of Zn also differed significantly. This indicates that the predicted concentrations of As and Mn are better than those of Pb and Zn from the spatial clustering analysis.

4. Discussion

To retrieve the heavy metal concentrations in soils using hyperspectral technology, previous research mainly focused on the extraction of characteristic bands, comparison of spectral transformation methods, construction of spectral indexes, and comparison of inversion models; the main emphasis typically lies in comparing the accuracy of inversion algorithms. Malmir et al. developed the partial least square regression (PLSR) models using spectral reflectance of soil, and the models performed a good prediction of Cu and Zn concentrations in soil [8]. Zhou et al. found that the combination of the first derivative and the random forest model performed the highest inversion accuracy for the six heavy metals (Mn, Cu, Zn, Pb, Cr and Ni) [36]. Pyo et al. suggested that the characteristic bands of Al, Cu, As, Zn, Pb, Cd, Ni, Hg, and Cr partly overlap [25]. For the inversion of Cu in soils, the indirect inversion using Fe concentrations based on the backpropagation neural network (BPNN) model performed higher predictive accuracy than direct inversion using spectral characteristic bands of Cu [38]. Bian et al. showed that the BPNN model was better than the PLSR model in retrieving Cu, Zn and Pb concentrations, while the optimal spectral transformation methods for these elements are different [3]. Normally, the predictive accuracy of inversion models could be reflected by the comparison between measured and predicted concentrations (and the derived spatial distribution maps) [19,38]. The selection of the optimal combination of spectral parameters and models tends to vary for different soil heavy metals. Few studies have delved into a detailed examination of the similarities and differences in characteristic bands among different heavy metals. Additionally, there is limited investigation on strategies to minimise redundancy in characteristic bands and to extract the unique spectral responses of each heavy metal element. The improvement of this study is that the three–band spectral index combination of Pb, As, Mn and Zn was improved by using the developed RCBP and Catboost methods, which reduced the correlation between the characteristic bands of heavy metals and avoided interaction compared to previous studies (Table 3). The correlation between Fe and Pb, As, Mn and Zn was studied in terms of concentration and characteristic band, which supported the indirect inversion of heavy metal concentrations. Besides direct comparison between measured and predicted concentrations, the present study further conducted spatial correlation analysis and cluster analysis to evaluate the established inversion models for Pb, As, Mn, and Zn.

5. Conclusions

This study used RCBP to screen three-band spectral index combinations of four elements (Pb, Zn, Mn, and As) in entisols. We analysed the correlations between each element and Fe in terms of concentrations and characteristic bands, inverted the concentration of each element using the Catboost algorithm, and evaluated the effect of spectral inversion with the help of spatial distribution feature analysis. The conclusions of the study are as follows.
When each of the four elements was used as the target element to reduce the correlation between the characteristic bands of the target element and other elements, the extracted characteristic bands of Pb were 450, 466, 622, 1278, and 1530 nm, those of As were 622, 746, 930, 938, 1102, 1122, and 1274 nm, those of Zn were 450, 622, 630, and 1230 nm, and those of Mn were 1318, 1646, 1806, 2271, 2275, and 2383 nm. Based on combinations of the spectral indices of each element, the regression model of the inversion of the Zn concentration had the best fit, with R2 = 0.8786 for the test set, followed by the models for Mn (R2 = 0.8576), As (R2 = 0.7916) and Pb (R2 = 0.6022).
Fe concentrations had the strongest correlation with Mn concentration (r = 0.82), followed by Zn and As, while the weakest correlation was with Pb (r = 0.373). In terms of the characteristic bands, Fe concentrations were most strongly correlated with Pb in the 458, 654 and 1090 nm characteristic bands (r = 0.837). In the 458, 474, 654 and 1090 nm characteristic bands, Fe had the strongest correlation with Zn (r = 0.711). In the 458, 474, 522, 630, 654, 930, 1090, 1330 and 1694 nm characteristic bands, Fe has the highest correlation coefficient with Mn (r = 0.542). The correlation between Fe and As in the characteristic bands was weak (r < 0.303).
The distributions of the measured and spectral-predicted concentrations are consistent for all studied elements (As, Mn, Pb, and Zn), but the predicted concentrations for Pb and Zn exhibited some differences compared to the measured concentrations in local areas. In terms of spatial correlations, the predictions of As and Mn concentrations were good, and those of Pb and Zn were weaker. In the spatial clustering analysis, the predictions of As and Mn concentrations were better than those of Pb and Zn.

Author Contributions

J.Z., J.F. and Y.Z. performed sample preparation and analysis using a portable ASD geophysical spectrometer. Z.Y. performed sample collection, sample preparation, and pXRF analysis. J.Z., J.F. and F.M. assisted with data analysis and experiment design. S.Z. conceived the study and was involved in the review and editing of the manuscript. He contributes as a corresponding author. P.F. conceived the study, designed the experiment, analysed the data, and prepared the manuscript. She contributes as the first author. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the National Natural Science Foundation of China (42101388), Shandong Top Talent Special Foundation (0031504), Shandong Provincial Natural Science Foundation, China (ZR2022MD070), Youth Innovation Team Project of Higher School in Shandong Province (2022KJ201), the National Key Research and Development Program of China (2023YFC2906402), the National Natural Science Foundation of China (42050103), and the China Geological Survey Project (20190459).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Dataset available on request from the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Dai, C.; Liu, Y.; Wang, T.; Li, Z.; Zhou, Y.; Deng, J. Quantifying the structural characteristics and hydraulic properties of shallow Entisol in a hilly landscape. Int. Agrophys. 2022, 36, 105–113. [Google Scholar] [CrossRef]
  2. Hu, B.; Shao, S.; Ni, H.; Fu, Z.; Hu, L.; Zhou, Y.; Min, X.; She, S.; Chen, S.; Huang, M.; et al. Current status, spatial features, health risks, and potential driving factors of soil heavy metal pollution in China at province level. Environ. Pollut. 2020, 266, 114961. [Google Scholar] [CrossRef]
  3. Bian, Z.; Sun, L.; Tian, K.; Liu, B.; Zhang, X.; Mao, Z.; Huang, B.; Wu, L. Estimation of Heavy Metals in Tailings and Soils Using Hyperspectral Technology: A Case Study in a Tin-Polymetallic Mining Area. Bull. Environ. Contam. Toxicol. 2021, 107, 1022–1031. [Google Scholar] [CrossRef]
  4. Benedet, L.; Silva, S.H.G.; Mancini, M.; dos Santos Teixeira, A.F.; Inda, A.V.; Demattê, J.A.; Curi, N. Variation of properties of two contrasting Oxisols enhanced by pXRF and Vis-NIR. J. S. Am. Earth Sci. 2022, 115, 103748. [Google Scholar] [CrossRef]
  5. Yang, H.; Xu, H.; Zhong, X. Prediction of soil heavy metal concentrations in copper tailings area using hyperspectral reflectance. Environ. Earth Sci. 2022, 81, 183. [Google Scholar] [CrossRef]
  6. Malley, D.F.; Williams, P.C. Use of Near-Infrared Reflectance Spectroscopy in Prediction of Heavy Metals in Freshwater Sediment by Their Association with Organic Matter. Environ. Sci. Technol. 1997, 31, 3461–3467. [Google Scholar] [CrossRef]
  7. Chakraborty, S.; Weindorf, D.C.; Deb, S.; Li, B.; Paul, S.; Choudhury, A.; Ray, D.P. Rapid assessment of regional soil arsenic pollution risk via diffuse reflectance spectroscopy. Geoderma 2017, 289, 72–81. [Google Scholar] [CrossRef]
  8. Malmir, M.; Tahmasbian, I.; Xu, Z.; Farrar, M.B.; Bai, S.H. Prediction of soil macro- and micro-elements in sieved and ground air-dried soils using laboratory-based hyperspectral imaging technique. Geoderma 2019, 340, 70–80. [Google Scholar] [CrossRef]
  9. Song, L.; Jian, J.; Tan, D.J.; Xie, H.B.; Luo, Z.F.; Gao, B. Estimate of heavy metals in soil and streams using combined geochemistry and field spectroscopy in Wan-sheng mining area, Chongqing, China. Int. J. Appl. Earth Obs. 2015, 34, 1–9. [Google Scholar] [CrossRef]
  10. Zhang, S.; Shen, Q.; Nie, C.; Huang, Y.; Wang, J.; Hu, Q.; Ding, X.; Zhou, Y.; Chen, Y. Hyperspectral inversion of heavy metal content in reclaimed soil from a mining wasteland based on different spectral transformation and modeling methods. Spectrochim. Acta Part A 2019, 211, 393–400. [Google Scholar] [CrossRef]
  11. Liu, J.; Zhang, Y.; Wang, H.; Du, Y. Study on the prediction of soil heavy metal elements content based on visible near-infrared spectroscopy. Spectrochim. Acta Part A 2018, 199, 43–49. [Google Scholar] [CrossRef] [PubMed]
  12. Qian, J.; Liu, X.; Zhang, J.; Zhou, W.; Li, J. Constructions of hyperspectral remote sensing monitoring models for heavy metal contents in farmland soil in Zhangjiagang City. Acta Agric. Zhejiangensis 2020, 32, 1437–1445. [Google Scholar] [CrossRef]
  13. Wu, M.Z.; Li, X.M.; Sha, J.M. Spectral Inversion Models for Prediction of Total Chromium Content in Subtropical Soil. Spectrosc. Spect. Anal. 2014, 34, 1660–1666. [Google Scholar]
  14. Dong, J.; Dai, W.; Xu, J.; Li, S. Spectral Estimation Model Construction of Heavy Metals in Mining Reclamation Areas. Int. J. Environ. Res. Public Health 2016, 13, 640. [Google Scholar] [CrossRef] [PubMed]
  15. Lamine, S.; Petropoulos, G.P.; Brewer, P.A.; Bachari, N.E.I.; Srivastava, P.K.; Manevski, K.; Kalaitzidis, C.; Macklin, M.G. Heavy Metal Soil Contamination Detection Using Combined Geochemistry and Field Spectroradiometry in the United Kingdom. Sensors 2019, 19, 762. [Google Scholar] [CrossRef] [PubMed]
  16. Wang, J.; Cui, L.; Gao, W.; Shi, T.; Chen, Y.; Gao, Y. Prediction of low heavy metal concentrations in agricultural soils using visible and near-infrared reflectance spectroscopy. Geoderma 2014, 216, 1–9. [Google Scholar] [CrossRef]
  17. Zhang, C.; Ren, H.; Dai, X.; Qin, Q.; Li, J.; Zhang, T.; Sun, Y. Spectral characteristics of copper-stressed vegetation leaves and further understanding of the copper stress vegetation index. Int. J. Remote Sens. 2019, 40, 4473–4488. [Google Scholar] [CrossRef]
  18. Fu, P.; Yang, K.; Meng, F.; Zhang, W.; Cui, Y.; Feng, F.; Yao, G. A new three-band spectral and metal element index for estimating soil arsenic content around the mining area. Process Saf. Environ. 2022, 157, 27–36. [Google Scholar] [CrossRef]
  19. Sawut, R.; Kasim, N.; Abliz, A.; Hu, L.; Yalkun, A.; Maihemuti, B.; Qingdong, S. Possibility of optimized indices for the assessment of heavy metal contents in soil around an open pit coal mine area. Int. J. Appl. Earth Obs. 2018, 73, 14–25. [Google Scholar] [CrossRef]
  20. Peng, Y.; Zhao, L.; Hu, Y.; Wang, G.; Wang, L.; Liu, Z. Prediction of soil nutrient contents using visible and near-infrared reflectance spectroscopy. ISPRS Int. J. Geo-Inf. 2019, 8, 437. [Google Scholar] [CrossRef]
  21. Chakraborty, S.; Li, B.; Weindorf, D.C.; Morgan, C.L. External parameter orthogonalisation of Eastern European VisNIR-DRS soil spectra. Geoderma 2019, 337, 65–75. [Google Scholar] [CrossRef]
  22. Bilgili, A.V.; Akbas, F.; van Es, H.M. Combined use of hyperspectral VNIR reflectance spectroscopy and kriging to predict soil variables spatially. Precis. Agric. 2010, 12, 395–420. [Google Scholar] [CrossRef]
  23. Hou, L.; Li, X.; Li, F. Hyperspectral-based Inversion of Heavy Metal Content in the Soil of Coal Mining Areas. J. Environ. Qual. 2019, 48, 57–63. [Google Scholar] [CrossRef]
  24. Pandit, C.M.; Filippelli, G.M.; Li, L. Estimation of heavy-metal contamination in soil using reflectance spectroscopy and partial least-squares regression. Int. J. Remote Sens. 2010, 31, 4111–4123. [Google Scholar] [CrossRef]
  25. Pyo, J.; Hong, S.M.; Kwon, Y.S.; Kim, M.S.; Cho, K.H. Estimation of heavy metals using deep neural network with visible and infrared spectroscopy of soil. Sci. Total Environ. 2020, 741, 140162. [Google Scholar] [CrossRef] [PubMed]
  26. Tan, K.; Ma, W.; Chen, L.; Wang, H.; Du, Q.; Du, P.; Yan, B.; Liu, R.; Li, H. Estimating the distribution trend of soil heavy metals in mining area from HyMap airborne hyperspectral imagery based on ensemble learning. J. Hazard. Mater. 2021, 401, 123288. [Google Scholar] [CrossRef] [PubMed]
  27. Aasen, H.; Bolten, A. Multi-temporal high-resolution imaging spectroscopy with hyperspectral 2D imagers–From theory to application. Remote Sens. Environ. 2018, 205, 374–389. [Google Scholar] [CrossRef]
  28. Guo, H.; Yang, K.; Wu, F.; Chen, Y.; Shen, J. Regional Inversion of Soil Heavy Metal Cr Content in Agricultural Land Using Zhuhai-1 Hyperspectral Images. Sensors 2023, 23, 8756. [Google Scholar] [CrossRef] [PubMed]
  29. Tan, K.; Wang, H.; Chen, L.; Du, Q.; Du, P.; Pan, C. Estimation of the spatial distribution of heavy metal in agricultural soils using airborne hyperspectral imaging and random forest. J. Hazard. Mater. 2020, 382, 120987. [Google Scholar] [CrossRef] [PubMed]
  30. Wu, F.; Wang, X.; Liu, Z.; Ding, J.; Tan, K.; Chen, Y. Assessment of heavy metal pollution in agricultural soil around a gold mining area in Yitong County, China, based on satellite hyperspectral imagery. J. Appl. Remote Sens. 2021, 15, 042613. [Google Scholar] [CrossRef]
  31. Chen, M.; Pan, Y.X.; Huang, Y.X.; Wang, X.T.; Zhang, R.D. Spatial Distribution and Sources of Heavy Metals in Soil of a Typical Lead-Zinc Mining Area, Yangshuo. Huan Jing Ke Xue 2022, 43, 4545–4555. [Google Scholar] [CrossRef]
  32. Liu, Y.; Luo, Q.; Cheng, H. Application and development of hyperspectral remote sensing technology to determine the heavy metal content in soil. J. Agro-Environ. Sci. 2020, 39, 2699–2709, (In Chinese with English Abstract). [Google Scholar]
  33. Rathod, P.H.; Müller, I.; Van der Meer, F.D.; de Smeth, B. Analysis of visible and near infrared spectral reflectance for assessing metals in soil. Environ. Monit. Assess. 2016, 188, 558. [Google Scholar] [CrossRef] [PubMed]
  34. Siebielec, G.; McCarty, G.W.; Stuczynski, T.I.; Reeves, J.B., III. Near- and Mid-Infrared Diffuse Reflectance Spectroscopy for Measuring Soil Metal Content. J. Environ. Qual. 2004, 33, 2056–2069. [Google Scholar] [CrossRef] [PubMed]
  35. Cheng, H.; Shen, R.; Chen, Y.; Wan, Q.; Shi, T.; Wang, J.; Wan, Y.; Hong, Y.; Li, X. Estimating heavy metal concentrations in suburban soils with reflectance spectroscopy. Geoderma 2019, 336, 59–67. [Google Scholar] [CrossRef]
  36. Zhou, W.; Yang, H.; Xie, L.; Li, H.; Huang, L.; Zhao, Y.; Yue, T. Hyperspectral inversion of soil heavy metals in Three-River Source Region based on random forest model. Catena 2021, 202, 105222. [Google Scholar] [CrossRef]
  37. Xia, F.; Peng, J.; Wang, Q.; Zhou, L.-Q.; Zhou, S. Prediction of heavy metal content in soil of cultivated land: Hyperspectral technology at provincial. J. Infrared Millim. Waves 2015, 34, 593–599. [Google Scholar]
  38. Shen, Q.; Xia, K.; Zhang, S.; Kong, C.; Hu, Q.; Yang, S. Hyperspectral indirect inversion of heavy-metal copper in reclaimed soil of iron ore area. Spectrochim Acta Part A 2019, 222, 117191. [Google Scholar] [CrossRef]
  39. Tang, M. Ore-forming Regularities and Mineralization Forecast of the Bailing Cu-Zn Deposit in A’Cheng Area, Heilongjiang Province. Master’s Thesis, Jilin University, Changchun, China, 2012. (In Chinese with English Abstract). [Google Scholar]
  40. US-Department-of-Agriculture. Soil Taxonomy—A Basic System of Soil Classification for Making and Interpreting Soil Surveys. In Agriculture Handbook; US-Department-of-Agriculture: Washington, DC, USA, 1999; Volume 436, pp. 96–105. [Google Scholar]
  41. Peel, M.C.; Finlayson, B.L.; McMahon, T.A. Updated world map of the Köppen-Geiger climate classification. Hydrol. Earth Syst. Sc. 2007, 11, 1633–1644. [Google Scholar] [CrossRef]
  42. Zhou, S.; Cheng, Q.; Weindorf, D.C.; Yuan, Z.; Yang, B.; Sun, Q.; Zhang, Z.; Yang, J.; Zhao, M. Elemental assessment of dried and ground samples of leeches via portable X-ray fluorescence. J. Anal. Atom. Spectrom. 2020, 35, 2573–2581. [Google Scholar] [CrossRef]
  43. Zhou, S.; Cheng, Q.; Weindorf, D.C.; Yang, B.; Yuan, Z.; Yang, J. Determination of trace elements concentrations in organic materials of “intermediate-thickness” via portable X-ray fluorescence spectrometry. J. Anal. Atom. Spectrom. 2022, 37, 2461–2469. [Google Scholar] [CrossRef]
  44. Zhou, S.; Yuan, Z.; Cheng, Q.; Weindorf, D.C.; Zhang, Z.; Yang, J.; Zhang, X.; Chen, G.; Xie, S. Quantitative analysis of iron and silicon concentrations in iron ore concentrate using portable X-ray fluorescence (XRF). Appl. Spectrosc. 2020, 74, 55–62. [Google Scholar] [CrossRef] [PubMed]
  45. Zhou, S.; Yuan, Z.; Cheng, Q.; Zhang, Z.; Yang, J. Rapid in situ determination of heavy metal concentrations in polluted water via portable XRF: Using Cu and Pb as example. Environ. Pollut. 2018, 243, 1325–1333. [Google Scholar] [CrossRef]
  46. National Standard SY/T 6404-2018; Analytical Method for Metal Elements in Rock by ICP-AES and ICP-MS. National Energy Administration of China: Beijing, China, 2018.
  47. Zhang, S. Comparative analysis of spatial interpolation methods for soil heavy metals: A case study of Yanggu county. Geomat. Spat. Inf. Technol. 2020, 43, 148–150. (In Chinese) [Google Scholar]
  48. Liu, Z.; Fei, Y.; Shi, H.D.; Mo, L.; Qi, J.X.; Wang, C. Source apportionment of soil heavy metals in Rucheng county of Hunan province based on UNMIX model combined with moran index. Res. Environ. Sci. 2021, 34, 2446–2458, (In Chinese with English Abstract). [Google Scholar]
  49. Khosravi, Y.; Zamani, A.A.; Parizanganeh, A.H.; Yaftian, M.R. Assessment of spatial distribution pattern of heavy metals surrounding a lead and zinc production plant in Zanjan Province, Iran. Geoderma Reg. 2018, 12, 10–17. [Google Scholar] [CrossRef]
  50. Xiang, L.; Su, Q.; Sun, G.; Zhang, Y.; Ding, J.; Wang, Q.; Yu, J. Characteristics and causes of soil heavy metal pollution in typical pharmaceutical enterprise gathering areas. Environ. Chem. 2022, 41, 2022–2034, (In Chinese with English Abstract). [Google Scholar]
  51. Gannouni, S.; Rebai, N.; Abdeljaoued, S. A spectroscopic approach to assess heavy metals contents of the mine waste of Jalta and Bougrine in the North of Tunisia. J. Geogr. Inf. Syst. 2012, 4, 242–253. [Google Scholar] [CrossRef]
  52. Omran, E.S.E. Inference model to predict heavy metals of Bahr El Baqar soils, Egypt using spectroscopy and chemometrics technique. Model. Earth Syst. Environ. 2016, 2, 1–17. [Google Scholar] [CrossRef]
  53. Wang, C.; Li, W.; Guo, M.; Ji, J. Ecological risk assessment on heavy metals in soils: Use of soil diffuse reflectance mid-infrared Fourier-transform spectroscopy. Sci. Rep. 2017, 7, 40709. [Google Scholar] [CrossRef]
Figure 1. Location of the study area and sampling site distribution.
Figure 1. Location of the study area and sampling site distribution.
Sensors 24 01492 g001
Figure 2. Flowchart of the analysis process.
Figure 2. Flowchart of the analysis process.
Sensors 24 01492 g002
Figure 3. Preferred combination of spectral indices for heavy metal elements based on RCBP.
Figure 3. Preferred combination of spectral indices for heavy metal elements based on RCBP.
Sensors 24 01492 g003
Figure 4. Schematic diagram of Catboost algorithm.
Figure 4. Schematic diagram of Catboost algorithm.
Sensors 24 01492 g004
Figure 5. Radar maps of correlation coefficients between the characteristic bands of the target elements and other elements. (a) Target element = Pb, (b) Target element = As, (c) Target element = Zn, (d) Target element = Mn.
Figure 5. Radar maps of correlation coefficients between the characteristic bands of the target elements and other elements. (a) Target element = Pb, (b) Target element = As, (c) Target element = Zn, (d) Target element = Mn.
Sensors 24 01492 g005aSensors 24 01492 g005b
Figure 6. Comparison of the measured concentrations and those estimated by the Catboost algorithm for each heavy metal in the test set (As: p-value < 0.05; Mn: p-value < 0.01; Pb: p-value < 0.05; Zn: p-value < 0.05).
Figure 6. Comparison of the measured concentrations and those estimated by the Catboost algorithm for each heavy metal in the test set (As: p-value < 0.05; Mn: p-value < 0.01; Pb: p-value < 0.05; Zn: p-value < 0.05).
Sensors 24 01492 g006
Figure 7. Pearson correlation matrix of elemental concentrations (Fe, Pb, As, Zn, and Mn) in soils from entisols in a metal mining-impacted area in China.
Figure 7. Pearson correlation matrix of elemental concentrations (Fe, Pb, As, Zn, and Mn) in soils from entisols in a metal mining-impacted area in China.
Sensors 24 01492 g007
Figure 8. Correlations between characteristic bands of Fe and those of Pb, As, Zn, and Mn based on different spectral index combinations.
Figure 8. Correlations between characteristic bands of Fe and those of Pb, As, Zn, and Mn based on different spectral index combinations.
Sensors 24 01492 g008
Figure 9. Spatial distributions of soil heavy metal contents (measured and predicted values).
Figure 9. Spatial distributions of soil heavy metal contents (measured and predicted values).
Sensors 24 01492 g009
Figure 10. Spatial distributions of clusters of predicted and real concentrations of the four soil heavy metals.
Figure 10. Spatial distributions of clusters of predicted and real concentrations of the four soil heavy metals.
Sensors 24 01492 g010
Table 1. The top five groups of spectral indexes have good correlations with the five elemental contents.
Table 1. The top five groups of spectral indexes have good correlations with the five elemental contents.
ElementSpectral IndicesRange of r Values
Pb① (RFD1278 − RFD622) + (RFD1530 − RFD622)0.5893−0.5946
② (RFD466 − RFD622) + (RFD1530 − RFD622)
③ (RFD450 − RFD622) + (RFD1530 − RFD622)
④ RFD1278/(RFD534 − RFD1682)
⑤ (RFD1278 − RFD666) + (RFD1530 − RFD666)
Zn① (RFD450 − RFD622) + (RFD1230 − RFD622)0.6608−0.6744
② (RFD466 − RFD630) + (RFD1530 − RFD630)
③ (RFD462 − RFD622) + (RFD746 − RFD622)
④ (RFD450 − RFD622) + (RFD1530 − RFD622)
⑤ (RFD450 − RFD622) + (RFD1198 − RFD622)
Mn① RFD2271/(RFD2275 + RFD1806)0.6440−0.6595
② (RFD1646 − RFD2383)/(RFD1646 + RFD1318)
③ (RFD1806 − RFD2263)/(RFD1806 + RFD2275)
④ (RFD1094 − RFD1638)/(RFD1094 + RFD1106)
⑤ RFD1342/(RFD2275 + RFD1806)
As① (RFD1274 − RFD1102)/(RFD1274 + RFD930)0.6657−0.6756
② (RFD622 − RFD746)/(RFD622 + RFD1122)
③ (RFD930 − RFD938)/(RFD930 + RFD1274)
④ (RFD1274 − RFD1682)/(RFD1274 + RFD930)
⑤ (RFD622 − RFD738)/(RFD622 + RFD1122)
Fe① RFD458/(RFD1090 + RFD654)0.6077−0.6380
② RFD474/(RFD1090 + RFD654)
③ (RFD522 − RFD630)/(RFD522 + RFD934)
④ (RFD1694 − RFD930)/(RFD1694 + RFD1330)
⑤ RFD458/(RFD1310 + RFD654)
p-value < 0.001.
Table 2. Summary of the global Moran’s I values of the four heavy metal elements in soil.
Table 2. Summary of the global Moran’s I values of the four heavy metal elements in soil.
Heavy Metal ElementExpected IndexVarianceZ-Scorep-ValueMoran’s I
AsReal−0.01110.00483.68740.00020.2450
Pre−0.01110.00503.59560.00030.2440
MnReal−0.01110.00503.79310.00010.2563
Pre−0.01110.00503.16190.00150.2120
PbReal−0.01110.00542.89370.00380.2017
Pre−0.01110.00531.28350.19930.0821
ZnReal−0.01110.00512.01080.04430.1323
Pre−0.01110.00520.65220.51420.0358
Table 3. Selected spectral wavelengths in literature for estimating heavy metal concentrations.
Table 3. Selected spectral wavelengths in literature for estimating heavy metal concentrations.
Heavy Metals StudiedHeavy Metals Studied Selected Spectral Bands (nm)Reference
Pb, Zn, Cu, Cd, Mn800, 1300[24]
Pb, Zn, Mn500, 610[51]
Cd, Cu, Pb, Zn, Ni, Mn, Cr, Co, Fe538, 578, 630, 870, 1900, 2240, 2376[52]
Al480, 500, 565, 610, 680, 750, 1000, 1430, 1755, 1887, 1920, 1950, 2210, 2260[9]
Cu480, 500, 610, 750, 860, 1300, 1430, 1920, 2150, 2260
Cr480, 500, 610, 715, 750, 860, 1300, 1430, 1755, 1920, 1950
Fe, Cu, Zn, Hg486, 424, 1546, 1632, 1462, 1658, 1736, 1832, 1924, 2360[53]
Cd552, 698, 814, 1042, 1370, 1546, 1722, 1868, 2360, 2924
Ni552, 698, 814, 1042, 1332, 1546, 1722, 1868, 2360, 2924
Pb450, 466, 622, 1278, 1530This research
As622, 746, 930, 938, 1102, 1122, 1274
Zn450, 622, 630, 1230
Mn1318, 1646, 1806, 2271, 2275, 2383
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fu, P.; Zhang, J.; Yuan, Z.; Feng, J.; Zhang, Y.; Meng, F.; Zhou, S. Estimating the Heavy Metal Contents in Entisols from a Mining Area Based on Improved Spectral Indices and Catboost. Sensors 2024, 24, 1492. https://doi.org/10.3390/s24051492

AMA Style

Fu P, Zhang J, Yuan Z, Feng J, Zhang Y, Meng F, Zhou S. Estimating the Heavy Metal Contents in Entisols from a Mining Area Based on Improved Spectral Indices and Catboost. Sensors. 2024; 24(5):1492. https://doi.org/10.3390/s24051492

Chicago/Turabian Style

Fu, Pingjie, Jiawei Zhang, Zhaoxian Yuan, Jianfei Feng, Yuxuan Zhang, Fei Meng, and Shubin Zhou. 2024. "Estimating the Heavy Metal Contents in Entisols from a Mining Area Based on Improved Spectral Indices and Catboost" Sensors 24, no. 5: 1492. https://doi.org/10.3390/s24051492

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop