Study on Spectral Response and Estimation of Grassland Plants Dust Retention Based on Hyperspectral Data

Abstract: Accurate monitoring of plant dust retention can provide a basis for dust pollution control and environmental protection. The aims of this study were to analyze the spectral response features of grassland plants to mining dust and to predict the spatial distribution of dust retention using hyperspectral data. The dust retention content was determined by an electronic analytical balance and a leaf area meter. The leaf reflectance spectrum was measured by a handheld hyperspectral camera, and the airborne hyperspectral data were obtained using an imaging spectrometer. We analyzed the difference between the leaf spectra before and after dust removal. The sensitive spectra of dust retention on the leaf and canopy scales were determined through two-dimensional correlation spectroscopy (2DCOS). The competitive adaptive reweighted sampling (CARS) algorithm was applied to select the feature bands of canopy dust retention. The estimation model of canopy dust retention was built through random forest regression (RFR), and the dust distribution map was obtained based on the airborne hyperspectral image. The results showed that dust retention enhanced the spectral reflectance of leaves in the visible wavelengths but weakened the reflectance in the near-infrared wavelengths. Owing to canopy structure and multiple scattering, the sensitive spectra of dust retention differed slightly between the canopy and leaves. Nevertheless, the sensitive spectra of both leaves and the canopy were closely related to dust and plant physiological parameters. The estimation model constructed through 2DCOS-CARS-RFR showed higher precision than genetic algorithm-random forest regression (GA-RFR) and simulated annealing algorithm-random forest regression (SAA-RFR). Spatially, the amount of canopy dust increased and then decreased with increasing distance from the mining area, reaching a maximum within 300–500 m. 
This study not only demonstrated the importance of extracting feature bands based on the response of plant physical and chemical parameters to dust, but also laid a foundation for the rapid and non-destructive monitoring of grassland plant dust retention.


Introduction
Mineral coal is the second most widely used fossil fuel in the world and the first in terms of reserves for future use [1]. In the last five years, the annual global production was about 7702 Mt [2]. Coal mining enhances the development of the world economy. However, the mining, loading, unloading, and transportation of open-pit coal mines cause large amounts of fugitive dust. Dust not ...
Additionally, leaf dust retention reflects the degree of dust pollution of individual plants, while canopy dust retention reflects the response features of vegetation composition to dust. Applying the estimation model of canopy dust retention to spectral imaging data yields a regional dust distribution map. Leaf and canopy dust retention thus have different application fields, but little attention has been paid to the scale difference in the spectral responses of plant dust retention.
The disparity between the number of hyperspectral bands (usually more than 100) and the number of sampling points is enormous, which may lead to overfitting during modeling. Some spectral studies [22,41] have shown that regression models built using the optimal bands deliver better results than those built using the full spectrum or vegetation indices. Therefore, finding the exact feature bands of plant dust retention from limited samples and then building a stable estimation model has become the key to obtaining accurate spatial distribution maps of plant dust retention. Two-dimensional correlation spectroscopy (2DCOS) is a spectral analysis method developed by Noda [42] in the 1980s. This method extends the dynamic spectrum generated by a perturbation (time, temperature, concentration, etc.) to two dimensions (synchronous and asynchronous spectra), so that overlapping parts of the spectrum can be resolved [43]. Through the synchronous and asynchronous spectra, we can analyze the sensitivity of different spectral peaks to the perturbation and the relative order of their changes. At present, two-dimensional correlation analysis plays an increasingly important role in the discrimination of medicinal materials [44] and in the analysis of the binding of metallic elements to dissolved organic matter (DOM) [43]. The competitive adaptive reweighted sampling (CARS) algorithm [45] is a variable selection method that has been successfully applied to identify the feature bands of soil heavy metals and plant physicochemical parameters [22,46]. This paper is the first to evaluate the feasibility of combining two-dimensional correlation spectroscopy and the CARS algorithm (2DCOS-CARS) to determine the feature bands of plant dust retention and improve its inversion accuracy.
The objectives of this study were (1) to investigate the spectral response features of dust retention on the leaf and canopy scales; (2) to demonstrate the effectiveness of 2DCOS-CARS in selecting wavebands for canopy dust retention modeling; and (3) to monitor dust retention spatially based on the airborne hyperspectral image. Figure 1 shows the workflow of the analysis.

Study Areas
Considering the validity and universality of the establishment method, we selected four plant species from two typical grassland areas in the Inner Mongolia Autonomous Region, China (Figure 2). The Xilinhot grassland area is located in the Shengli Coalfield and its surrounding area in the northern suburb of Xilinhot City. The geographical coordinates of this region are 43°54′15″-44°13′52″ north latitude and 115°24′26″-116°26′30″ east longitude, at an elevation of 960-1270 m. The area is located in the mid-latitude westerly air zone and belongs to the semi-arid continental climate in the middle temperate zone. The minimum average monthly temperature is -21.64 °C, the maximum average monthly temperature is 19.0 °C, and the average annual rainfall is 294.74 mm. Strong winds occur in spring, with a predominant northwestern wind direction and a wind speed of 2.1-8.4 m/s, having an average speed of 3.5 m/s; the instantaneous maximum wind speed is 36.6 m/s.

The HulunBuir grassland area is located in Hulunbuir City, Inner Mongolia Autonomous Region, southeast of the Hulunbuir Dongming mine. The geographical coordinates are 49°24′22″-49°25′37″ north latitude and 119°38′27″-119°40′32″ east longitude, at an elevation of 583-651 m. This area has a vulnerable and unstable ecological environment that is characterized by a typical arid or semiarid climate. The average annual temperature is between -2.4 and 2.2 °C, while the highest and lowest temperatures recorded are 17 and -48.5 °C, respectively. The predominant wind direction is northwest, the instantaneous maximum wind speed is 20 m/s, and the annual average wind speed is more than 3 m/s.

Dust Retention Content and Leaf Spectrum Measurement
In the Xilinhot grassland, the two dominant plant species, Leymus chinensis and Cleistogenes squarrosa, were selected, and 66 samples were collected. In the HulunBuir grassland, Potentilla acaulis and Scutellaria scordifolia were selected, and 70 samples were collected; each sample contained two leaf types. These four plant species are important forage plants on natural grasslands in eastern Inner Mongolia. They are characterized by high productivity, cold resistance, and salt and alkali tolerance. Their leaves unfold, making it easy to obtain a reflectance spectrum and dust retention content. Although Stipa krylovii is also one of the main grassland species, its leaves are curly and needle-like, and their area is too small to measure; therefore, it was excluded from this experiment.
Healthy, disease-free, and insect-free leaves were selected during collection. Three leaves were collected from each plant for each sample. Real-time kinematic (RTK) positioning was used to record the coordinates of the sampling points. The leaves of the same plant in each sample were sealed in a centrifuge tube and transported to the laboratory. Subsequently, spectral measurement, weighing, dust removal, and leaf area measurement were performed. All these measurements were performed on the day of sample collection to avoid the influence of leaf water loss and physiological parameter changes on the results.
In the darkroom, a halogen lamp was used as the light source, and the leaf spectral reflectance was measured with the SPECIM IQ (Handheld Hyperspectral Camera, SPECIM, Finland). The SPECIM IQ provides surface radiance measurements in 204 spectral bands between 397 and 1004 nm with a full width at half maximum (FWHM) of 7 nm. Only the bands in the range of 450-1000 nm were selected for analysis, as the beginning and ending parts of the spectrum presented high amounts of noise caused by the optical instrument and experimental environment. The local correction maximization denoising method [47] was used to smooth the spectrum. The leaf weight was determined with an electronic analytical balance (1/10,000 g scale) and recorded as $W_1$. After weighing, the dust on the front side was cleaned off with a soft brush, and the leaf was weighed a second time ($W_2$). Leaf area (cm²) was measured using a leaf area meter (CI-202, CID, USA) and recorded as $S$. The dust retention content (DRC) of a leaf was determined using Equation (1); the average value of all collected leaves in the sample was used as the dust retention content of the canopy.
$$\mathrm{DRC} = \frac{W_1 - W_2}{S} \qquad (1)$$
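The weighing procedure above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function names and the example leaf values are hypothetical, and the conversion from cm² to m² is an assumption made here so that the result is expressed in g/m², the unit used when results are reported.

```python
def dust_retention_content(w1_g, w2_g, area_cm2):
    """Dust mass per unit leaf area in g/m^2 (assumed unit conversion).

    w1_g: leaf mass before dust removal (g)
    w2_g: leaf mass after dust removal (g)
    area_cm2: leaf area from the leaf area meter (cm^2)
    """
    if area_cm2 <= 0:
        raise ValueError("leaf area must be positive")
    area_m2 = area_cm2 / 10_000.0  # 1 m^2 = 10,000 cm^2
    return (w1_g - w2_g) / area_m2


def canopy_drc(leaves):
    """Mean DRC over all leaves of one sample.

    leaves: list of (w1_g, w2_g, area_cm2) tuples.
    """
    values = [dust_retention_content(*leaf) for leaf in leaves]
    return sum(values) / len(values)


# Hypothetical sample with three leaves (values are made up for illustration).
sample = [(0.2530, 0.2500, 12.0), (0.1810, 0.1795, 10.0), (0.3020, 0.2990, 15.0)]
print(round(canopy_drc(sample), 3))
```

Averaging per-leaf DRC values, rather than pooling mass and area first, follows the sentence stating that the average of all collected leaves is used as the canopy value.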

Airborne Hyperspectral Data Acquisition and Preprocessing
Since the Xilinhot grassland area is near Xilinhot Airport, which is a no-fly zone, it is impossible to collect airborne hyperspectral data via an unmanned aerial vehicle (UAV). Therefore, only in the HulunBuir grassland, Wind4 (UAV, DJI, China), equipped with SPECIM FX10 (Imaging Spectrometer, SPECIM, Finland), was used to collect canopy hyperspectral data. The SPECIM FX10 was configured with a spectral range of 397-1003 nm and a field of view (FOV) of 38°. The sensor was used in 4 × 2 binning mode, and the acquisition time was approximately 11:00 a.m. on 13 August 2019 (windless, cloudless, and appropriate sunshine). The flight altitude of the UAV was set at 117 m above the ground, and the spectral resolution and spatial resolution of the hyperspectral image were 5.5 nm and 0.16 m, respectively.
Radiometric calibration and geolocating were performed by the CaliGeoPro tool (Data Processing Tool, SPECIM, Finland). That is, the digital number (DN value) of the original hyperspectral data was converted into radiance by using the dark signal and calibration file. The position and attitude data recorded by the inertial navigation system (INS), and the digital elevation model (DEM) were used to calculate the geodetic coordinates of the pixels by a collinearity equation. Then, the hyperspectral data were registered with the high-resolution orthophoto as the reference. The radiance of the hyperspectral image was converted into reflectance data by using the average spectrum of the whiteboard obtained synchronously. Finally, the flight lines were mosaicked together using the "georeferenced mosaicking" method [23]. The DEM and orthophotos used in this research were acquired by Phantom 4 RTK (UAV, DJI, China).

Two-dimensional Correlation Spectroscopy
We used 2DCOS to analyze the sensitive spectral range of plant dust retention. In this study, the dust retention content of plants was taken as the perturbation, and the spectral variation $y(v, t)$ was a function of the spectral variable $v$ (wavelength) and the perturbation $t$ (dust content). The dynamic spectrum can be defined as follows:
$$\tilde{y}(v, t) = y(v, t) - \bar{y}(v) \qquad (2)$$
where $\bar{y}(v)$ is the reference spectrum, typically expressed by the average spectrum at the variable $v$.
The two-dimensional correlation intensity $X(v_1, v_2)$ is obtained from the correlation analysis of two independent spectral variations at $v_1$ and $v_2$, as shown in Equation (3):
$$X(v_1, v_2) = \phi(v_1, v_2) + i\,\psi(v_1, v_2) \qquad (3)$$
where $\phi(v_1, v_2)$ is the synchronous intensity, which indicates the similarity of the spectral intensity changes under the influence of perturbation $t$; and $\psi(v_1, v_2)$ is the asynchronous intensity, which indicates the difference in spectral intensity changes. The spectral coordinates, intensities, and signs of correlation peaks appearing on the two-dimensional correlation spectrum can be interpreted by a series of principles [43]. The synchronous spectrum is symmetrical about the main diagonal and consists of auto-peaks located along the main diagonal and cross-peaks located at off-diagonal positions. The intensity of an auto-peak indicates the degree of spectral change caused by the perturbation. A cross-peak represents the degree of coordinated change in spectral intensity at two different spectral variables ($v_1$, $v_2$). A positive cross-peak indicates that the intensities at the corresponding spectral coordinates change in the same direction, while a negative cross-peak indicates that they change in opposite directions. The asynchronous spectrum is antisymmetric about the main diagonal and indicates the sequence in which the two spectral signals change. When the signs at the spectral coordinate ($v_1$, $v_2$) are the same in the synchronous and asynchronous spectra, the spectral intensity at $v_1$ changes before that at $v_2$. This order is reversed when the signs are opposite. If cross-peaks appear only in the synchronous spectrum, the spectral changes occur simultaneously. If cross-peaks appear only in the asynchronous spectrum, the spectral intensities change in opposite directions, but the sequence of the changes cannot be determined [48,49].
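To make the synchronous and asynchronous intensities concrete, the following pure-Python sketch computes both maps from a small set of spectra using the discrete Hilbert-Noda transformation. It is an illustration only and is not the 2D Shige implementation; it assumes the spectra are ordered by increasing dust content and treated as evenly spaced along the perturbation, and the function name and toy data are hypothetical.

```python
import math


def two_d_correlation(spectra):
    """Return (synchronous, asynchronous) maps for m spectra of n bands.

    spectra: list of m rows (ordered by increasing perturbation),
             each a list of n reflectance values.
    """
    m, n = len(spectra), len(spectra[0])
    if m < 2:
        raise ValueError("at least two spectra are required")
    # Dynamic spectra: subtract the mean (reference) spectrum.
    mean = [sum(row[c] for row in spectra) / m for c in range(n)]
    dyn = [[row[c] - mean[c] for c in range(n)] for row in spectra]
    # Hilbert-Noda transform along the perturbation axis:
    # N[j][k] = 0 if j == k, else 1 / (pi * (k - j)).
    hil = [[0.0] * n for _ in range(m)]
    for j in range(m):
        for k in range(m):
            if k != j:
                w = 1.0 / (math.pi * (k - j))
                for c in range(n):
                    hil[j][c] += w * dyn[k][c]
    sync = [[sum(dyn[j][a] * dyn[j][b] for j in range(m)) / (m - 1)
             for b in range(n)] for a in range(n)]
    asyn = [[sum(dyn[j][a] * hil[j][b] for j in range(m)) / (m - 1)
             for b in range(n)] for a in range(n)]
    return sync, asyn


# Toy data: band 0 rises and band 1 falls as dust content increases.
toy = [[0.10, 0.40, 0.50], [0.12, 0.38, 0.50],
       [0.15, 0.35, 0.51], [0.19, 0.30, 0.52]]
sync, asyn = two_d_correlation(toy)
print(sync[0][1] < 0)  # opposite directions give a negative cross-peak
```

The diagonal of `sync` holds the auto-peak intensities, and `asyn` is antisymmetric by construction, which matches the properties described above.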
The reflectance spectrum was transformed into a two-dimensional correlation spectrum using the 2D Shige software (Kwansei-Gakuin University, Japan). Then, the synchronous and asynchronous spectra were drawn with MATLAB R2018b.

Feature Bands Selection and Estimation Model
The CARS algorithm was used to extract the feature bands within the sensitive spectral ranges obtained by 2DCOS. It is a variable selection method that imitates Darwin's "survival of the fittest" principle of evolution. In CARS, several wavelength subsets are selected utilizing the exponential decreasing function and the adaptive reweighted sampling in an iterative and competitive way. Subsequently, each subgroup is modeled through the cross-validation method, and the subset with the smallest root mean square error is the optimal subset.
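The exponentially decreasing function mentioned above controls how many wavelengths survive each sampling run. The sketch below shows only that schedule, in the form commonly given for CARS (all variables kept in the first run, two in the last); the adaptive reweighted sampling and the cross-validated modeling steps are omitted, and the function name and the example numbers (30 candidate bands, 50 runs) are hypothetical.

```python
import math


def cars_retention_schedule(n_vars, n_runs):
    """Number of variables retained in each CARS sampling run.

    Uses r_i = a * exp(-k * i) with r_1 = 1 and r_N = 2 / n_vars,
    so a = (n_vars / 2) ** (1 / (N - 1)) and k = ln(n_vars / 2) / (N - 1).
    """
    if n_vars < 2 or n_runs < 2:
        raise ValueError("need at least two variables and two runs")
    a = (n_vars / 2.0) ** (1.0 / (n_runs - 1))
    k = math.log(n_vars / 2.0) / (n_runs - 1)
    return [max(2, round(n_vars * a * math.exp(-k * i)))
            for i in range(1, n_runs + 1)]


schedule = cars_retention_schedule(n_vars=30, n_runs=50)
print(schedule[0], schedule[-1])
```

The schedule removes variables quickly in the early runs and slowly in the later ones, which is why the subset sizes examined near the end of the procedure differ by only a few bands.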
To verify the effectiveness of 2DCOS-CARS in extracting feature bands, the genetic algorithm (GA) and the simulated annealing algorithm (SAA) [50] were used to select wavebands across the whole spectral range. Subsequently, the bands selected by the three methods were used as the input variables to build estimation models of dust retention, and the modeling results were compared.
Random forest regression (RFR) [51,52] was used to construct the estimation model of dust retention content. RFR uses the bootstrap resampling method to generate multiple samples from the original training set, and each new sample is modeled with a decision tree. The predictions of all decision trees are then combined, and their average value (for regression) or the most frequent value (for classification) is used as the final prediction.
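The bootstrap-and-average idea can be illustrated with a deliberately simplified ensemble. The sketch below is not a full random forest and not the MATLAB model used in the study: each "tree" is a single-split regression stump fitted on a bootstrap sample with a random subset of bands, and all names and data are hypothetical.

```python
import random


def fit_stump(X, y, feature_ids):
    """Best single-split regression stump over the given features."""
    best = None
    for f in feature_ids:
        values = sorted(set(row[f] for row in X))
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2.0
            left = [t for row, t in zip(X, y) if row[f] <= thr]
            right = [t for row, t in zip(X, y) if row[f] > thr]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = (sum((t - lm) ** 2 for t in left)
                   + sum((t - rm) ** 2 for t in right))
            if best is None or sse < best[0]:
                best = (sse, f, thr, lm, rm)
    if best is None:  # no usable split: fall back to the sample mean
        mean = sum(y) / len(y)
        return (None, 0.0, mean, mean)
    return best[1:]


def fit_forest(X, y, n_trees=50, n_features=2, seed=0):
    """Fit stumps on bootstrap samples with random feature subsets."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap resample
        feats = rng.sample(range(p), min(n_features, p))
        forest.append(fit_stump([X[i] for i in idx], [y[i] for i in idx], feats))
    return forest


def predict(forest, row):
    """Average the predictions of all stumps (regression)."""
    preds = [lm if f is None or row[f] <= thr else rm
             for f, thr, lm, rm in forest]
    return sum(preds) / len(preds)


# Hypothetical reflectance at three bands and dust content (g/m^2).
X = [[0.10, 0.50, 0.30], [0.12, 0.47, 0.30], [0.14, 0.45, 0.30], [0.17, 0.41, 0.30],
     [0.19, 0.38, 0.30], [0.22, 0.36, 0.30], [0.25, 0.33, 0.30], [0.28, 0.30, 0.30]]
y = [5.0, 9.0, 14.0, 20.0, 27.0, 33.0, 41.0, 50.0]
forest = fit_forest(X, y, n_trees=25, seed=1)
print(predict(forest, X[0]) < predict(forest, X[-1]))
```

A production model would use fully grown trees with feature sampling at every split; the stump version is kept here only because it shows resampling and averaging in a few lines.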
Because samples with high dust retention strongly affect model accuracy, the sample set was augmented. In this paper, two pixels in the image adjacent to each original sample were selected as co-selected samples. According to Tobler's First Law of Geography [53], geographical things or attributes are mutually related in spatial distribution, so the dust data of the original sample were taken as the value of the co-selected samples. Because the spectra of the two newly selected samples closely approximate that of the original sample, the error introduced in this way is small. Two thirds of the samples were randomly selected as the calibration set, and the remaining one third as the validation set. The coefficient of determination (R²), the root mean square error (RMSE), and the residual predictive deviation (RPD) were selected as the evaluation standards. When RPD ≥ 2.0, the model has excellent prediction ability; when 1.4 ≤ RPD < 2.0, the model can roughly distinguish between high and low values; when RPD < 1.4, the model cannot predict the sample [22]. All calculations were performed in MATLAB R2018b.
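The three evaluation standards can be computed as in the sketch below. The formulas are assumptions in the sense that the text does not spell them out: R² is taken as 1 - SS_res/SS_tot, and RPD as the sample standard deviation of the observed values divided by the RMSE. The function names and example values are hypothetical.

```python
import math


def regression_metrics(observed, predicted):
    """Return R^2, RMSE and RPD for paired observed/predicted values.

    Assumes predictions are not perfect (RMSE > 0) and observations vary.
    """
    n = len(observed)
    mean_obs = sum(observed) / n
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    rmse = math.sqrt(ss_res / n)
    sd = math.sqrt(ss_tot / (n - 1))  # sample standard deviation
    return {"r2": 1.0 - ss_res / ss_tot, "rmse": rmse, "rpd": sd / rmse}


def rpd_class(rpd):
    """Map RPD to the three categories quoted in the text."""
    if rpd >= 2.0:
        return "excellent"
    if rpd >= 1.4:
        return "rough"
    return "unreliable"


metrics = regression_metrics([10.0, 20.0, 30.0, 40.0], [12.0, 18.0, 33.0, 39.0])
print(round(metrics["r2"], 3), rpd_class(metrics["rpd"]))
```

Some studies report R² as the squared Pearson correlation instead; the two definitions agree only for an unbiased linear fit, so the choice should be stated alongside the results.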
Remote Sens. 2020, 12, 2019
Table 1 summarizes the dust retention content measured for the four plant species and the canopy. The dust retention content ranged from 0.353 to 51.425 g/m² for Leymus chinensis, 1.813 to 52.810 g/m² for Cleistogenes squarrosa, 2.441 to 62.064 g/m² for Potentilla acaulis, 0.532 to 47.312 g/m² for Scutellaria scordifolia, and 1.486 to 54.688 g/m² for the canopy. Cleistogenes squarrosa had the highest mean dust retention content (18.656 g/m²). According to the Nielsen rule [54], the coefficients of variation (CV) of the four plant species and the canopy were between 51.665% and 91.930%, all of which indicate moderate-intensity variation.
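As a small worked illustration of the variability statistic, the sketch below computes a coefficient of variation and assigns a variation level. The thresholds (below 10% weak, 10% to 100% moderate, above 100% strong) are the ones commonly quoted for this rule and are an assumption here, as is the use of the sample standard deviation; the data are made up.

```python
import statistics


def coefficient_of_variation(values):
    """CV in percent: sample standard deviation divided by the mean."""
    return statistics.stdev(values) / statistics.mean(values) * 100.0


def variation_level(cv_percent):
    """Classify CV using assumed thresholds of 10% and 100%."""
    if cv_percent < 10.0:
        return "weak"
    if cv_percent <= 100.0:
        return "moderate"
    return "strong"


example = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # hypothetical DRC values
cv = coefficient_of_variation(example)
print(round(cv, 2), variation_level(cv))
```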

Comparison of Leaf Spectra before and after Dust Removal
The average spectral reflectance of dusty and non-dusty (dust removed) leaves of each plant species was calculated (Figure 3). Since the comparison was made between the spectral reflectance of the same leaf before and after dust removal, the influences of chlorophyll content, leaf health, relative water content, and other factors were excluded. The reflectance spectra of dusty and non-dusty leaves were similar, with both having typical plant spectral characteristics, albeit with differences in specific bands. Dusty leaves had higher reflectance than non-dusty leaves in the range of 450-700 nm; at 700-1000 nm, the reflectance of dusty leaves was clearly lower than that of non-dusty leaves, and the two spectra intersected at 697-730 nm. These results show that dust enhances leaf reflectance in the visible wavelengths and reduces it in the near-infrared wavelengths.
To more clearly analyze the difference between the spectra of dusty and non-dusty leaves, principal component analysis (PCA) [55] was used to transform the original spectrum into new effective principal components (PCs) containing as much as possible of the total variation. Figure 4 shows the PCA bi-plot that was depicted based on the first two main PCs (more than 90% of the variance) reflecting the relationship between the dusty and non-dusty leaves. The results show that the dusty and non-dusty spectra of the same leaf are close to each other, and the effect of dust removal on the spectrum is not significant.

Two-dimensional Correlation Analysis of Dust Retention in Leaves
The two-dimensional correlation spectra of Leymus chinensis, Cleistogenes squarrosa, Potentilla acaulis, and Scutellaria scordifolia are shown in Figure 5. According to the synchronous spectra, the four plant species showed auto-peaks at 531-566 and 736-787 nm, respectively, with a positive cross-peak at (531-566, 736-787 nm), indicating that the spectrum in these two ranges was most sensitive to the dust retention on leaves, and their reflectance changed in the same direction with the variation in the dust retention content.
In the asynchronous spectra, the samples of the four plant species formed a positive cross-peak at (531-566, 736-787 nm). According to the Noda rule [56], under the influence of dust retention, the spectral intensity change at 531-566 nm occurs before that at 736-787 nm. In addition, Leymus chinensis showed negative cross-peaks at the following ranges: (

Two-dimensional Correlation Analysis of Dust Retention in the Canopy
Based on the coordinates measured by the RTK method, the canopy spectrum of each sample was extracted from the airborne hyperspectral image. Then, the two-dimensional correlation spectrum of canopy dust retention was constructed (Figure 6). The sensitive spectral ranges of the canopy deviated from those of the leaves: the auto-peaks of the canopy in the synchronous spectrum were located at 488-526, 649-687, and 747-802 nm, indicating that the spectral reflectance in these three ranges was the most sensitive to canopy dust retention. In addition, there were positive cross-peaks between the three regions, which indicates that the spectral reflectance in these ranges changed in the same direction under the influence of dust retention.

Estimate of Canopy Dust Retention Based on Feature Analysis
Effectively extracting the feature bands of canopy dust retention and constructing a high-precision estimation model are the critical steps in large-scale canopy dust retention monitoring. Based on the results of Section 3.3.2, the sensitive spectra of canopy dust retention were 488-526, 649-687, and 747-802 nm. Ten wavebands centered at 488, 499, 520, 654, 671, 687, 752, 763, 780, and 796 nm were extracted from these regions by the CARS algorithm. The selected wavebands were then used for RFR modeling, and the results were compared with those of RFR models developed with the bands optimized through GA and SAA (Table 2). The prediction accuracy shows that the RFR model was affected by the waveband selection method. Compared with GA-RFR and SAA-RFR, the accuracies of the calibration and validation sets using 2DCOS-CARS-RFR were generally better, as indicated by the higher R² and lower RMSE values. In addition, the RPD value of 2DCOS-CARS-RFR was 2.357, while those of GA-RFR and SAA-RFR were 1.772 and 1.962, respectively. These results show that 2DCOS-CARS can be used to extract the feature bands of dust retention and that the resulting estimation model improves prediction ability when only a small number of samples is available. In contrast, the other two estimation models can only roughly distinguish the level of dust retention.

Spatial Distribution Features of Canopy Dust
The task of this study was to retrieve the dust content in vegetated areas. To avoid interference from invalid estimates for bare soil, the NDVI was used to distinguish vegetated areas (NDVI > 0.4) from non-vegetated areas (NDVI ≤ 0.4). The estimation model constructed by 2DCOS-CARS-RFR was applied to the airborne hyperspectral image to obtain the spatial distribution map of canopy dust retention (Figure 7). The estimated dust content ranged from 7.207 to 52.135 g/m². The high-value area was mainly distributed within 900 m of the mining area, where the area with a value higher than 24.121 g/m² accounted for 82.218%. Beyond 900 m, however, the area with a value exceeding 24.121 g/m² accounted for only 41.401%. To further analyze the relationship between canopy dust and the distance to the mining area, the mining area was taken as the buffer zone center, and a 200 m step was used to compute the average dust content of each buffer zone (Figure 8). The average dust content in each buffer zone ranged from 21.116 to 30.424 g/m². With increasing distance from the mining area, the dust retention content increased and then decreased, reaching the maximum value of 30.424 g/m² within the range of 300-500 m, which is consistent with the dust distribution trend reported by Niu et al. [57]. Dust is subjected to gravity during propagation, and larger particles settle first; therefore, the maximum dust content appears at a certain distance from the mining area. Smaller particles continue to spread and gradually fall, resulting in decreasing dust content with increasing distance from the mine.
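The masking and buffer-zone averaging described above can be sketched as follows. This is an illustration under stated assumptions rather than the processing chain used in the study: the red and near-infrared reflectances used for NDVI, the pixel tuple layout, the `start_m` offset of the first zone, and all example values are hypothetical.

```python
def ndvi(red, nir):
    """Normalized difference vegetation index from red and NIR reflectance."""
    return (nir - red) / (nir + red)


def mean_dust_by_buffer(pixels, step_m=200.0, start_m=0.0, ndvi_threshold=0.4):
    """Average estimated dust content of vegetated pixels per distance zone.

    pixels: iterable of (distance_m, red, nir, dust_g_m2) tuples, where
            distance_m is the distance from the mining area.
    Returns {(zone_start_m, zone_end_m): mean dust content}.
    """
    sums = {}
    for distance_m, red, nir, dust in pixels:
        if distance_m < start_m or nir + red == 0:
            continue
        if ndvi(red, nir) <= ndvi_threshold:
            continue  # non-vegetated pixel, excluded from the estimate
        zone = int((distance_m - start_m) // step_m)
        total, count = sums.get(zone, (0.0, 0))
        sums[zone] = (total + dust, count + 1)
    return {
        (start_m + z * step_m, start_m + (z + 1) * step_m): total / count
        for z, (total, count) in sorted(sums.items())
    }


# Hypothetical pixels: (distance to mine in m, red, NIR, estimated dust in g/m^2).
pixels = [(350, 0.05, 0.45, 30.0), (420, 0.06, 0.40, 32.0),
          (450, 0.20, 0.25, 99.0),  # NDVI about 0.11, masked as bare soil
          (650, 0.05, 0.50, 22.0)]
print(mean_dust_by_buffer(pixels, step_m=200.0, start_m=100.0))
```

With `start_m=100.0`, the zones fall on 100-300 m, 300-500 m, and so on, which is one way to obtain a 300-500 m zone from a 200 m step; the actual zone boundaries used for Figure 8 are not stated in the text.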
These features mentioned above are consistent with previous findings by Peng et al. [31] and Yan et al. [37]. Due to the influence of chlorophyll, the plant reflectance spectrum forms absorption valleys in the blue and red bands, with a reflection peak near the green band. The near-infrared band is repeatedly scattered by the complex cell structure inside the leaf to form a highly reflective platform [58]. Most dust is in the form of solid particles such as soil and coal powder. Due to their shielding effect [59], part of the light cannot enter the leaves, reducing the influences of chlorophyll and cell structure on spectral reflectance. Besides, the dust has a scattering effect in the visible wavelength. Therefore, the reflectivity of dusty leaves is higher than that of non-dusty leaves in the visible range and lower in the near-infrared range. According to Wang et al. [36], the spectral reflectance of non-dusty leaves at 450-700 nm was higher than that of dusty leaves, which is different from the results in this paper. This discrepancy might result from the differences in dust chemical composition and particle size features in various study areas [60,61], as well as the dust retention and adsorption capacities of different leaves.  Figure 3 shows that although the reflectance spectra of the dusty and non-dusty leaves were similar, the reflectance of dusty leaves at 450-700 nm was higher than that of non-dusty leaves, while at 700-1000 nm, it was lower than that of the non-dusty leaves. These features mentioned above are consistent with previous findings by Peng et al. [31] and Yan et al. [37]. Due to the influence of chlorophyll, the plant reflectance spectrum forms absorption valleys in the blue and red bands, with a reflection peak near the green band. The near-infrared band is repeatedly scattered by the complex cell structure inside the leaf to form a highly reflective platform [58]. Most dust is in the form of solid particles such as soil and coal powder. 
Due to their shielding effect [59], part of the light cannot enter the leaves, reducing the influences of chlorophyll and cell structure on spectral reflectance. Besides, the dust has a scattering effect in the visible wavelength. Therefore, the reflectivity of dusty leaves is higher than that of non-dusty leaves in the visible range and lower in the near-infrared range. According to Wang et al. [36], the spectral reflectance of non-dusty leaves at 450-700 nm was higher than that of dusty leaves, which is different from the results in this paper. This discrepancy might result from the differences in dust chemical composition and particle size features in various study areas [60,61], as well as the dust retention and adsorption capacities of different leaves.
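The NDVI masking and 200 m buffer averaging described for the dust distribution map can be illustrated with a minimal sketch. It assumes each pixel is available as a (distance, NIR reflectance, red reflectance, estimated dust) tuple and that buffer rings start at 0 m; the function names, the ring origin, and the example pixels are hypothetical and are not taken from the study.

```python
from collections import defaultdict

NDVI_THRESHOLD = 0.4   # vegetated if NDVI > 0.4 (threshold stated in the text)
RING_WIDTH_M = 200.0   # buffer step stated in the text


def ndvi(nir, red):
    """Normalized difference vegetation index from NIR and red reflectance."""
    total = nir + red
    return (nir - red) / total if total != 0 else 0.0


def ring_means(pixels, ring_width=RING_WIDTH_M, threshold=NDVI_THRESHOLD):
    """Average dust content per distance ring, using vegetated pixels only.

    `pixels` is an iterable of (distance_m, nir, red, dust_g_m2) tuples.
    Returns {ring_start_m: mean_dust}, ordered by distance.
    Assumption: rings start at 0 m; the study's ring origin may differ.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for distance_m, nir, red, dust in pixels:
        if ndvi(nir, red) <= threshold:
            continue  # non-vegetated pixel: the dust estimate is not valid
        ring_start = int(distance_m // ring_width) * ring_width
        sums[ring_start] += dust
        counts[ring_start] += 1
    return {start: sums[start] / counts[start] for start in sorted(sums)}


# Made-up pixels for illustration only (not data from the study).
example_pixels = [
    (50.0, 0.45, 0.08, 20.0),
    (150.0, 0.50, 0.10, 24.0),
    (350.0, 0.48, 0.07, 30.0),
    (380.0, 0.20, 0.15, 99.0),  # low NDVI, excluded by the mask
    (450.0, 0.52, 0.09, 32.0),
]
example_means = ring_means(example_pixels)
```

Masking before averaging keeps bare-soil pixels from biasing the ring means, which is the purpose of the NDVI threshold in the text.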

Sensitive Spectral Analysis of Leaf and Canopy Dust Retention
The sensitive spectral ranges of dust retention in leaves and the canopy were determined by two-dimensional correlation analysis. To verify the rationality of the selected sensitive spectra of leaf dust retention, we set up several dust collecting tanks near the mining area and numbered them successively (D1-D18). Since the amount of dust collected in some tanks was too small for the reflectance spectrum to be measured, only five samples (D1-D5) were measured in the darkroom (Figure 9). The spectral curve of each sample was smooth, with the reflectance gradually increasing, and the slope changed around 580 and 780 nm. The sensitive spectra of dust were determined by 2DCOS (Figure 10). There were auto-peaks within the regions of 583-617 and 746-777 nm in the synchronous spectrum, which deviate from the positions of the sensitive spectra of leaf dust retention. This is because the spectrum of dusty leaves is affected both by dust shielding and by the influence of dust on plant physicochemical parameters. Naidoo et al. [7] and Neves et al. [62] found that dust can block leaf surface pores, inhibiting photosynthesis and transpiration rates, which further leads to a decrease in chlorophyll, carotenoids, and relative water content [5,6]. The dust retention sensitive spectral range of 531-566 nm contains the characteristic bands of carotenoids and equivalent water thickness (EWT) [63,64]. Some sensitive bands of chlorophyll and carotenoids lie within 736-787 nm [39,63–65]. The above results show that the sensitive spectra of leaf dust retention are closely related to dust and the physicochemical parameters of leaves.
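How auto-peaks arise in a synchronous spectrum can be shown with a small sketch. It assumes the generalized 2DCOS formulation (mean-centered dynamic spectra, synchronous matrix equal to their cross-product divided by m − 1) with dust amount as the perturbation. The preprocessing actually used in the study is not described in this section, and the function names, the `fraction` cutoff, and the example spectra are hypothetical.

```python
def synchronous_spectrum(spectra):
    """Synchronous 2D correlation matrix of perturbation-ordered spectra.

    `spectra` is a list of m spectra (each a list of n band reflectances),
    ordered by the perturbation (assumed here to be dust amount).
    Returns an n x n matrix as nested lists.
    """
    m = len(spectra)
    if m < 2:
        raise ValueError("need at least two spectra")
    n = len(spectra[0])
    # Reference spectrum: mean over the perturbation axis.
    mean = [sum(s[k] for s in spectra) / m for k in range(n)]
    # Dynamic spectra: deviation of each spectrum from the reference.
    dynamic = [[s[k] - mean[k] for k in range(n)] for s in spectra]
    return [
        [sum(d[i] * d[j] for d in dynamic) / (m - 1) for j in range(n)]
        for i in range(n)
    ]


def auto_peak_bands(sync, wavelengths, fraction=0.5):
    """Wavelengths whose diagonal value is at least `fraction` of the maximum."""
    diagonal = [sync[i][i] for i in range(len(sync))]
    cutoff = fraction * max(diagonal)
    return [w for w, p in zip(wavelengths, diagonal) if p >= cutoff]


# Made-up reflectances for illustration only (not data from the study).
wavelengths_nm = [500, 550, 600, 750]
example_spectra = [
    [0.10, 0.12, 0.10, 0.50],
    [0.10, 0.14, 0.10, 0.46],
    [0.10, 0.16, 0.10, 0.42],
]
sync = synchronous_spectrum(example_spectra)
sensitive = auto_peak_bands(sync, wavelengths_nm, fraction=0.2)
```

In this toy case the diagonal is large only at the two bands that change with the perturbation, and the off-diagonal value between them is negative because one band rises while the other falls. The asynchronous spectrum is omitted from the sketch.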
The sensitive spectral ranges of leaf dust retention extracted in this experiment are similar to those reported in previous studies, but there are also differences. For example, Jing et al. [28] recommended using the spectral ranges of 450-500, 550-600, 750-1000, and 1100-1300 nm to estimate the leaf dust retention content of three plant types in southern China, which is consistent with some of the sensitive spectra in this paper. Peng et al. [31] studied the correlation between reflectance and dust retention content on elm leaves in Alaer, China, and their results were similar to those in this paper: there was a highly significant correlation at 400-708 and 754-1050 nm. Li et al. [32] studied the relationship between dust retention content and reflectance of poplar leaves in Beijing, China. They found that the correlation at 450-527 nm was statistically significant, while that at 528-567 nm was not, which differs from the results of this study. These discrepancies might have arisen because our study focused on herbaceous species, while previous studies mainly investigated tree species. There are significant differences in chemical and physiological characteristics, as well as in dust retention ability, between the leaves of herbaceous and tree species [66,67], leading to differences in leaf spectral characteristics.
Due to the different reflection mechanisms of the canopy and leaves, the sensitive spectra of canopy dust retention are different from those of leaves. The spatial resolution of the airborne hyperspectral data was 0.16 m, and most of the pixels were mixed pixels. The sensor receives multiple scattering spectra of various plant leaves in the canopy [68]. In addition to the effect of dust on the photosynthetic pigments and water content, the influence of dust retention on the canopy structure (mainly leaf area index (LAI)) will also change the spectral features [40,69]. Based on the monitoring results of the canopy's physical and chemical parameters, the wavelength range of 488-526 nm includes sensitive bands of chlorophyll and carotenoids [26,64]. The range of 649-687 nm contains some sensitive bands of chlorophyll, carotenoids, and LAI [26]; some characteristic bands of phosphorus involved in photosynthesis are also located in this spectral range [70]. Some sensitive bands of chlorophyll, carotenoids, gravimetric water content (GWC), and LAI are within the range of 747-802 nm [26,41,64,71]. In summary, the sensitive spectra of canopy dust retention reflect the response features of plant physicochemical parameters to dust.

Accuracy Evaluation of the Estimation Model
Using the bands selected by 2DCOS-CARS, GA, and SAA as independent variables, the estimation models of canopy dust retention were constructed. Compared with the other two methods, the 2DCOS-CARS-RFR model had better accuracy (the R², RMSE, and RPD values of the validation set were 0.820, 3.910 g/m², and 2.357, respectively). To compare the positions of the selected bands and their importance in the estimation models more intuitively, we normalized the importance and marked it on the full spectrum (Figure 11). The feature bands selected by 2DCOS-CARS were concentrated near the sensitive spectra of dust and plant physicochemical parameters. They had clear physical significance, and the resulting quantitative model was more precise. The bands selected by GA and SAA similarly included bands adjacent to the sensitive spectra of dust and plant physicochemical parameters, but the selection was generally broad and ambiguous across the full spectrum. The importance of each band varied greatly, and more than half of the bands had a relative importance of less than 0.4, indicating that the model training was unbalanced: only a few bands were mainly involved in the modeling, while the remaining bands played a small role. We suggest that GA and SAA are susceptible to interference from other factors (e.g., water absorption bands or elements in leaves), which introduces irrelevant bands or misses bands that are physically related to dust retention and thus makes the estimation model less accurate.
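The three validation statistics quoted above can be computed as in the following sketch. It assumes the common definitions: R² as one minus the ratio of residual to total sum of squares, RMSE over n samples, and RPD as the sample standard deviation of the measured values divided by the RMSE. The definitions actually used in the study are not spelled out in this section, and the function name and example values are hypothetical.

```python
import math
import statistics


def validation_metrics(measured, estimated):
    """Return (R2, RMSE, RPD) for paired measured and estimated values.

    Assumed definitions (not confirmed by the source):
      R2   = 1 - SS_residual / SS_total
      RMSE = sqrt(SS_residual / n)
      RPD  = sample standard deviation of measured / RMSE
    """
    if len(measured) != len(estimated) or len(measured) < 2:
        raise ValueError("need two equally long sequences with n >= 2")
    n = len(measured)
    residual_ss = sum((m - e) ** 2 for m, e in zip(measured, estimated))
    mean_measured = statistics.fmean(measured)
    total_ss = sum((m - mean_measured) ** 2 for m in measured)
    rmse = math.sqrt(residual_ss / n)
    r2 = 1.0 - residual_ss / total_ss
    rpd = statistics.stdev(measured) / rmse
    return r2, rmse, rpd


# Made-up values for illustration only (not data from the study).
demo_measured = [10.0, 20.0, 30.0, 40.0]
demo_estimated = [12.0, 18.0, 33.0, 37.0]
demo_r2, demo_rmse, demo_rpd = validation_metrics(demo_measured, demo_estimated)
```

Under these definitions R² and RPD are closely linked, since R² is approximately 1 − 1/RPD² for large samples; the reported pair (0.820 and 2.357) is consistent with that relation.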
The comparison between the measured dust content and the value estimated by the 2DCOS-CARS-RFR model is shown in Figure 12. The majority of the data in the calibration set and the validation set were distributed closely around the 1:1 line, which shows that the estimation model can perform well for plants with different leaf parameters and structures. However, the calibration set was closer to the 1:1 line than the validation set, as confirmed by their slope values (0.781 for the calibration set and 0.721 for the validation set). Compared with the measured dust content, the results for high dust content (>23 g/m²) were overestimated, while those for low dust content (≤23 g/m²) were underestimated. This is a typical problem when using regression analysis. The use of spectral differences to distinguish the amount of canopy dust is based on the premise that spectral reflectance has a stable relationship with plant characteristics and environmental factors. It ignores the phenomena of "different things with the same spectrum" and "the same thing with different spectra" caused by the complex combination of phytochemical composition and structural parameters. The spatial heterogeneity of ecological relationships also affects the results of spatial analysis [72]. Therefore, using a unified relationship will inevitably lead to high variance and an unbalanced estimation.
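The slope comparison against the 1:1 line can be reproduced with an ordinary least-squares fit. The sketch below assumes the estimated values are regressed on the measured values; the axis assignment in Figure 12 is not stated in this section, and the function name and example values are hypothetical.

```python
def ols_line(measured, estimated):
    """Least-squares slope and intercept of `estimated` regressed on `measured`.

    Assumption: measured values on the x-axis, estimated values on the y-axis.
    """
    n = len(measured)
    if n != len(estimated) or n < 2:
        raise ValueError("need two equally long sequences with n >= 2")
    mean_x = sum(measured) / n
    mean_y = sum(estimated) / n
    sxx = sum((x - mean_x) ** 2 for x in measured)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(measured, estimated))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Made-up values for illustration only (not data from the study).
fit_measured = [10.0, 20.0, 30.0, 40.0]
fit_estimated = [14.0, 21.0, 28.0, 35.0]
fit_slope, fit_intercept = ols_line(fit_measured, fit_estimated)
```

A slope of exactly 1 with zero intercept corresponds to the 1:1 line, so the distance of the fitted slope from 1 is one simple way to compare how closely the calibration and validation sets follow it.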
Compared with the results of other studies using hyperspectral data to predict dust retention in leaves (e.g., R² > 0.90 for the validation set) [28,31], there is still room for improvement in the prediction accuracy. This may be because, although 2DCOS-CARS selected more effective feature bands, it still missed some information relevant to dust retention inversion. In addition, using the canopy spectrum as the input variable introduces the complexity of spectral mixing among various plants, and the uncontrolled external conditions in the field also reduce the prediction accuracy. Therefore, choosing the best feature extraction algorithm and minimizing data errors through preprocessing will be the focus of future work.

Figure 11. Wavebands extracted by the three methods (axis label: relative importance; panel label: 2DCOS-CARS).

Future Work
In this study, the changes in leaf and canopy spectra under the influence of dust retention were analyzed through 2DCOS. However, the mechanisms behind the differences in the direction and order of these changes are not clear. In the future, we will continue research to determine the cause of these differences. Moreover, this method will be applied to a broader area to test its applicability to grasslands with different environmental conditions and plant compositions.
We have pointed out that there is an unbalanced estimation problem in using optical remote sensing to retrieve the dust retention content of a canopy. According to previous research results [73–75], incorporating structural features (e.g., from LiDAR) can effectively improve the accuracy and rationality of estimates of canopy chemical properties. In addition, the spatial distribution of dust depends on wind speed, wind direction, terrain, and other factors [76,77]. Therefore, applying multi-source data such as canopy structure, meteorological factors, and terrain factors to invert canopy dust retention will become one of our research focuses.
Although UAVs play an increasingly important role in spatial element monitoring, the area of spectral imaging data obtained in each flight is small because of their limited endurance, so such data cannot cover the whole grassland or support larger-scale research. In recent years, with the progress of sensor technology, the Sentinel-2 multispectral sensor launched by ESA and the GF-5 hyperspectral sensor launched by China provide an opportunity for large-scale vegetation dust monitoring. Combining the results of this study with satellite remote sensing to build a "satellite-aerial-terrestrial" dust monitoring system is an important direction of our future research.

Conclusions
Exploring the response features of grassland plants to mining dust and monitoring the spatial distribution of plant dust retention are of great significance for the protection of grassland ecology and the environment. The spectral response features of dust retention were analyzed at the leaf and canopy scales through hyperspectral measurements. Then, we constructed the estimation model of canopy dust retention and obtained its spatial distribution map based on airborne hyperspectral data. The research conclusions are as follows:
(1) Dust retention increases the reflectance of grassland plant leaves in the visible wavelengths and decreases it in the near-infrared wavelengths. The two-dimensional correlation spectra of leaves and the canopy are different because of their different reflection mechanisms. The leaves of four plant species had auto-peaks at 531-566 and 736-787 nm in the synchronous spectra. In the synchronous spectrum of the canopy, auto-peaks appeared at 488-526, 649-687, and 747-802 nm, indicating that these spectral ranges are most sensitive to plant dust retention;
(2) Choosing the appropriate modeling bands is the key to estimating dust retention content accurately. The feature bands selected by the 2DCOS-CARS method were closely related to dust and plant physiological parameters. The estimation model constructed through 2DCOS-CARS-RFR showed higher robustness and accuracy, compared with GA-RFR and SAA-RFR;
(3) The high-value areas of canopy dust were mainly distributed within 900 m of the mining area. With increasing distance from the mining area, the dust retention content of the canopy increased and then decreased, reaching a maximum of 30.424 g/m² within 300-500 m.