Next Article in Journal
A Black Ice Detection Method Based on 1-Dimensional CNN Using mmWave Sensor Backscattering
Next Article in Special Issue
Study of the Buried Basin C-H, Based on the Multi-Source Remote Sensing Data
Previous Article in Journal
A Local-Sparse-Information-Aggregation Transformer with Explicit Contour Guidance for SAR Ship Detection
Previous Article in Special Issue
Population of Degrading Small Impact Craters in the Chang’E-4 Landing Area Using Descent and Ground Images
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantitative Inversion of Lunar Surface Chemistry Based on Hyperspectral Feature Bands and Extremely Randomized Trees Algorithm

1
School of Earth Sciences and Resources, China University of Geosciences, Beijing 100083, China
2
Beijing Key Laboratory of Development and Research for Land Resources Information, Beijing 100083, China
3
College of Geomatics, Xi’an University of Science and Technology, Xi’an 710054, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(20), 5248; https://doi.org/10.3390/rs14205248
Submission received: 12 August 2022 / Revised: 15 October 2022 / Accepted: 17 October 2022 / Published: 20 October 2022
(This article belongs to the Special Issue Planetary Geologic Mapping and Remote Sensing)

Abstract

:
In situ resource utilization (ISRU) is required for the operation of both medium and long-term exploration missions to provide metallic materials for the construction of lunar base infrastructure and H2O and O2 for life support. The study of the distribution of the lunar surface elements (Fe, Ti, Al, and Si) is the basis for the in situ utilization of mineral resources. With the arrival of the era of big data, the application of big data concepts and technical methods to lunar surface chemistry inversion has become an inevitable trend. This paper is guided by big data theory, and the Apollo 17 region and the area near the Copernicus crater are selected for analysis. The dimensionality of the first-order differential spectral features of lunar soil samples is reduced based on Pearson correlation analysis and the successive projections algorithm (SPA), and the extremely randomized trees (Extra-Trees) algorithm is applied to Chang’E-1 Interference Imaging Spectrometer (IIM) data to establish a prediction model for the lunar surface chemistry and generate FeO, TiO2, Al2O3, and SiO2 distribution maps. The results show that the optimum number of variables for FeO, TiO2, Al2O3, and SiO2 is 17, 5, 8, and 30, respectively. The accuracy of the Extra-Trees model using the best variables was improved over that of the original band model, with determination coefficients (R2) of 0.962, 0.944, 0.964, and 0.860 for FeO, TiO2, Al2O3, and SiO2, and root mean square errors (RMSEs) of 1.028, 0.672, 0.942, and 0.897, respectively. The modeling feature variables and model preference methods in this study can improve the inversion accuracy of chemical abundance to some extent, demonstrating the potential of IIM data in predicting chemical abundance and providing a good data basis for lunar geological evolution studies and ISRU.

1. Introduction

From ancient times to the present day, humankind has never stopped exploring the vastness of the universe. The Moon, the closest celestial body to the Earth, has always been a prime target for human astronomical activities and is naturally the first step for humankind to step out of the cradle of the Earth and into the vastness of the universe [1]. As humankind is currently facing a shortage of resources, the exploitation and utilization of the rich lunar reserves of mineral resources can effectively alleviate the problem of resource shortages for humankind in the future. Given the high cost of Earth–Moon transportation, in situ resource utilization (ISRU) has become a fundamental technological guarantee for the establishment of a lunar base [2,3,4]. Currently, the most important ISRU project is to obtain H2O and O2 for life support through ilmenite. However, increasing human understanding and knowledge of the moon has revealed that any oxide can also be reduced to its constituent elements through reduction processes and produce H2O and O2 as byproducts, which is a key issue worthy of study [5]. Therefore, the elements of greatest interest to us are the lunar surface elements Fe, Ti, Al, and Si. The study of the distribution of Fe, Ti, Al, and Si is the basis for assessing the prospects for the exploitation of lunar mineral resources and the rational exploitation of resources. Fe, Ti, Al, and Si can all be used as sources of building materials for lunar base construction, with Fe mainly existing in lunar mafic minerals, Ti accounting for a large proportion of the high-titanium mare basalt [6,7], almost all Al on the lunar surface found in highland plagioclase [8], and Si is always present in the form of silicates in nearly every rock and mineral grain on the Moon. Therefore, this paper focuses on the inversion of Fe, Ti, Al, and Si on the lunar surface to study the distribution and abundance of mineral resources and to lay the foundation for a better assessment of the lunar mineral resources.
Currently, multisource remote sensing techniques based on γ-ray, X-ray, and multi/hyperspectral data are widely used for lunar surface chemistry inversion [9,10,11,12,13,14,15,16]. Compared with high-energy techniques, multi/hyperspectral data have the advantage of high resolution; therefore, many studies invert chemistry abundances based on multi/hyperspectral data. The outcomes of these studies include FeO and TiO2 distribution maps with a resolution of 100 m obtained using Clementine data [17]; FeO, TiO2, Al2O3, SiO2, CaO, and MgO distribution maps with a resolution of 200 m obtained based on IIM data [18,19,20,21]; FeO distribution maps with a resolution of 20 m obtained based on Moon Mineralogy Mapper (M3) data [10]; a TiO2 distribution map with a resolution of 400 m obtained from the Lunar Reconnaissance Orbiter Camera (LROC) Wide Angle Camera (WAC) [16]; and FeO, TiO2, Al2O3, CaO, and MgO distribution maps with a resolution of 59 m obtained based on Kaguya Multiband Imager (MI) data [22,23].
Among the four elements found on the lunar surface, namely, Fe, Ti, Al, and Si, Fe and Ti are transition metals that can undergo ligand field transitions (d-d transitions). Accompanying this electronic transition from low to high energy levels, the transition metals Fe2+ and Ti3+ produce absorption features in the visible to near-infrared range [18,24,25,26]; thus, the spectral features of the lunar surface can be directly interpreted based on mineralogical principles [7,24,26]. Early quantitative spectroscopic analysis of lunar samples demonstrates the potential for inversion of lunar surface chemistry and mineral components using band ratio methods [27]. Currently, this approach has been widely used for FeO and TiO2 inversion [16,17,19,28,29,30,31,32]. However, elemental abundance inversion is not limited to the chromophore elements Fe and Ti. Although nonchromophore elements do not have absorption characteristics in the visible NIR, they affect reflectance [18]. Therefore, many studies have used regression models to establish statistical relationships between elemental abundance and spectral characteristics to predict chemical abundance [20,21,33,34]. With continuous research, a variety of machine learning models have been applied for oxide inversion. Regardless of whether the input variables are full or feature bands, machine learning methods have displayed good performance in inversion [7,22,35,36,37]. For example, Korokhin et al. [7] proposed a nonlinear method based on an artificial neural network (ANN) inversion of TiO2 and avoided problems related to the limited number of bands and the subjective selection of band combinations compared with the traditional linear regression method. Zhang et al. [37] used principal component analysis (PCA) combined with a support vector machine (SVM) to estimate the abundance of chemical compositions (SiO2, Al2O3, FeO, MgO, and TiO2) and their maturity indicators (Is/FeO), where PCA was used to downscale the reflectance spectra of lunar soil samples for screening and the SVM was used to build a predictive oxide content model. Wang et al. [22] used a particle swarm optimization-support vector machine (PSO-SVM) algorithm based on KAGUYA MI data to generate oxide abundance maps with a spatial resolution of up to 59 m/pixel, and the map was relatively free of topographic shadows.
The selection of appropriate modeling feature variables and machine learning models is particularly important for the accuracy of oxide inversion. Feature band selection methods can improve the efficiency and accuracy of machine learning models to some extent. However, methods that combine machine learning and multi/hyperspectral data for the inversion of the lunar surface oxide content do not fully consider the preferential selection of model feature variables. For example, Shkuratov [33] used multiple linear regression models for three bands of 750, 915, and 965 nm from SMART-1 data. Wang & Niu [35] used 19 (571 nm-865 nm) bands of IIM data to construct a model after removing anomalous bands. Wu [18] selected 22 bands in the spectral range of 561–918 nm for modeling based on the magnitude of the signal-to-noise ratio. Sun et al. [20] screened four single bands and three band ratios with high correlations in the 513 nm-891 nm range based on Pearson correlation coefficients for modeling. Most of the above methods do not consider the interrelationships between pairs of spectral features or between oxides and spectral features, and adding all available bands to a model can have a negative impact on the accuracy and generalization ability of the model. In response to the above research deficiencies, Pearson correlation coefficients are adopted in this paper for the initial screening of spectral features. Then, input parameters are secondarily screened based on clustering analysis combined with the successive projections algorithm (SPA) to reduce the covariance interference among spectral features while ensuring a high correlation between input spectral features and oxides.
The extremely randomized trees (Extra-Trees) machine learning algorithm is selected to build the oxide inversion model and improve the oxide inversion accuracy. The Extra-Trees algorithm can effectively describe the small-sample, high-dimensional, and complex nonlinear relationship between the oxide content and spectral reflectance by integrating several weak learners to obtain a strong learner [38]. This approach shows a clear advantage in the inversion of the oxide content. For model sample set partitioning, Jin et al. [39], Li [27,40], and Zhang et al. [37] divided the training and test sets according to particle size. Zhou et al. [41] randomly selected 2/3 of the samples as the training set for modeling and 1/3 of the samples as the validation set. These segmentation methods are often random. In this paper, we use sample set partitioning based on the joint x-y distance (SPXY) algorithm to partition the samples in a way that maximizes the characterization of the sample distribution and improves the stability of the model.
The inversion of lunar surface chemistry based on the concept and analysis methods of big data is an inevitable trend in future development. The selection of suitable modeling feature variables and machine learning models is crucial for achieving high-accuracy oxide inversion. Guided by big data theory, big data concepts and technical methods are applied to lunar chemistry inversion, and a prediction method for feature band selection combined with an Extra-Trees model is proposed. First, the first-order difference bands are initially screened according to the Pearson correlation coefficient, and they are then downscaled using the bisecting K-means (BKM) algorithm combined with the SPA to determine the best band combination. Second, the Lunar Soil Characterization Consortium (LSCC) training sample is used to build an Extra-Trees prediction model. Finally, the model is applied to predict the oxide abundance in the Apollo 17 region of the lunar surface and the region near the Copernicus crater. The experimental results are compared with those of representative models [15,17,30,31], and it is found that the model shows good agreement with them, providing a new idea and method for the inversion of oxide abundances.

2. Materials and Methods

This paper proposes a method to invert the distribution of lunar surface chemistry, targeting the need for ISRU. Figure 1 shows the technological flow chart of this paper, which includes the following main steps:
(1)
Feature band selection: The sensitive regions of each chemistry were initially screened according to Pearson correlation coefficients, and then clustering analysis combined with SPA was used for secondary screening to determine the best combination of bands.
(2)
Construction of an Extra-Trees model: Seventy-six LSCC samples with reflectance and oxide content data were used as model inputs for training and testing.
(3)
Prediction of chemical abundance: The IIM reflectance data were put into the model to estimate the lunar surface chemical abundance.
Figure 1. Flow chart of lunar surface chemical inversion.
Figure 1. Flow chart of lunar surface chemical inversion.
Remotesensing 14 05248 g001

2.1. Data Description

2.1.1. LSCC Data

The actual measured samples are important for studying the material components of the lunar surface and are the only standard for testing the effectiveness of the inversion. The LSCC soil sample data are stored in a spectral library created through the spectral analysis of samples collected from the six Apollo lunar landings. The samples include 10 samples of highland lunar soil from Apollo 14 and Apollo 16 and 9 samples of mare lunar soil from Apollo 11, 12, 15, and 17 [42]. Since the lunar surface spectral characteristics are influenced by the particle size of the material, the LSCC screened the measured samples for particle size and divided them into four groups, namely, <10 μm, 10–20 μm, 20–45 μm, and <45 μm, for a total of 76 subsamples [27,42,43,44]. All measured spectra of lunar surface samples were obtained at the RELAB laboratory at Brown University using a phase angle of 30° at a sampling interval of 5 nm in the spectral range of 0.3–2.6 μm [40,41,42]. Composition data for the LSCC soils used in this paper are shown in Table 1.

2.1.2. IIM Data

The IIM, one of the payloads of Chang’E-1, has achieved the first international application of interferometric imaging spectroscopy in the field of planetary exploration. The wavelength range of IIM data is 480–960 nm, with a spectral resolution of 335 cm−1, a total of 32 bands, and a spatial resolution of 200 m/pixel [18,45,46]. Critical to establishing a statistical relationship between the hyperspectral band characteristics of the samples and the mineral element content is the requirement that the laboratory-measured LSCC samples and the IIM image absolute calibrations have the same photometric calibration [7,21,33,47,48]. The IIM 2C data downloaded from the Ground Application System of the Lunar Exploration Project have been preprocessed with radiometric correction, geometric correction, and photometric correction [46,49]. The IIM data used in this paper are based on the processing of 2C data with further recalibration processing, which includes reflectance correction using Apollo samples. Therefore, the photometric conditions between the IIM image and the LSCC sample data are the same after preprocessing, and the laboratory-measured LSCC samples can be used together with the IIM data.
IIM data quality evaluation is also an important prerequisite for performing scientific inversion. As indicated by signal-to-noise ratio analysis, the first four bands of IIM 2C data contain considerable noise, and the 32nd band has the highest noise level (lowest signal-to-noise ratio observed). Therefore, 26 bands of data in the range between 513 and 891 nm (bands 5–30) are used in this paper.

2.2. Feature Band Selection

The reflectance spectra used in this paper are between 513 and 891 nm, and band difference calculation results in 325 spectral bands. If all these spectral bands are input into the model for prediction, the number of bands will be high, as will the correlations between adjacent bands, which will inevitably lead to an increase in the redundancy of spectral information and adversely affect the accuracy and generalization of the model [37]. Therefore, it is necessary to filter the sensitive bands that play a key role in the model.
Pearson correlation analysis is used to correlate the differentially transformed spectral features and the oxide content to find the band differences with correlation coefficients that pass the significance test at the 0.01 level. The results are used to identify the sensitive feature regions of the differentially transformed spectra.
Due to the large number and continuous distribution of selected wavelengths, secondary screening is required within the sensitive regions identified with Pearson correlation analysis. Considering the random nature of the initial bands of the SPA, to reduce the possibility of the selection of invalid initial bands, the range of random initial bands is restricted to the corresponding clusters by combining the above algorithm with a clustering algorithm. Fine screening is performed separately in each cluster to eliminate the presence of covariance bands. The feature wavelengths selected by the SPA are those that are most representative.

2.2.1. Bisecting K-Means Algorithm

The bisecting K-means algorithm is an improvement on and expansion of the K-means algorithm, and it essentially bifurcates the selected clusters until the specified number of clusters is reached while satisfying a criterion based on the sum of the squared error (SSE) [50]. The SSE is defined as shown in Equation (1). The greatest advantage of this algorithm over the traditional K-means method is that it is simple and fast to implement and finds the globally optimal solution. Therefore, in this paper, the dichotomous K-means algorithm is used for the clustering analysis of sensitive feature regions with differences, and the initial K value is set to 3.
S S E = i = 1 k x C i d i s t ( c i , x ) 2
where k is the number of clusters, c i   is the cluster center of cluster   C i , and x is a sample in that cluster.

2.2.2. Successive Projections Algorithm

The successive projections algorithm (SPA) is a forward iterative feature variable selection method that minimizes the covariance in vector space, minimizes the redundant information in the original spectral matrix, eliminates the effect of covariance, and can be used for spectral feature wavelength screening [51,52]. The SPA starts with one wavelength and introduces the wavelength with the largest projection vector into the set of wavelength variables in each iteration by analyzing the projection of the vector until a specified number of wavelengths is reached [53,54,55]. The exact procedure of the algorithm is as follows.
1.
In the 1st iteration ( n = 1 ), any column of wavelength k ( 0 ) is chosen and denoted as x j , where j = 1 , , J .
2.
The wavelengths that are not included in the set are identified as s = { j , 1 j J , j { k ( 0 ) , , k ( n 1 ) } } .
3.
The projection of the initialized band x j with an unselected wavelength in orthogonal space is calculated as
P x j = x j ( x j T x k ( n 1 ) ) x k ( n 1 ) ( x k ( n 1 ) T x k ( n 1 ) ) 1
4.
The maximum wavelength of the projection vector is calculated:
k ( n ) = arg ( max ( P x j ) )
5.
n = n + 1 ; if n < N , return to step 2.
6.
The final combination of wavelength variables is determined.

2.3. Sample Subset Partition

The division of the sample data set will affect the accuracy of the model estimates to some extent. The SPXY algorithm was proposed by Galvao et al. [56]. The objective is to use the physical-chemical variable y and the spectral variable x to calculate the intersample distance, fully characterize the sample distribution, effectively cover the multidimensional vector space, increase intersample variability and representativeness, and improve model stability [57,58]. The distance equation is as follows [56]:
d x ( p , q ) = j = 1 J   [ x p ( j ) x q ( j ) ] 2   p , q   [ 1 , N ]
d y ( p , q ) = ( y p y q ) 2 = | y p y q |   p , q   [ 1 , N ]
To ensure that the variables x and y give the same weight to a sample, d x ( p , q ) and d y ( p , q ) are divided by the maximum value in the data set, respectively. Thus, the formula after normalization is
d x y ( p , q ) = d x ( p , q ) m a x p , q [ 1 , N ] d x ( p , q ) + d y ( p , q ) m a x p , q [ 1 , N ] d y ( p , q )   p , q   [ 1 , N ]
In this study, the SPXY algorithm was used to divide the training and test sets of 76 LSCC samples based on the four oxide contents as y variables and the spectral features as x variables, and the specific division results are shown in Table 2. The training set accounts for 75% of all samples and contains 57 samples, and the test set accounts for 25% of all samples and contains 19 samples. The statistical parameters of the oxide content in the test set are generally within the same ranges as those of the training set, and the sample set is divided reasonably and effectively, which helps improve the stability and reliability of the model.

2.4. Extra-Trees Regression

Extra-Trees is an integrated algorithm based on decision trees with good generalization and robustness and was proposed by Geurts et al. [38]. The Extra-Trees algorithm is trained using all training samples, and different decision trees are constructed according to different features. The score of each random classification node for K random features is calculated with the Score function (Equation (7)), and the node with the highest score is selected as the splitting node [59,60]. When a new sample is input, multiple decision trees in the integrated model score it, and the final prediction is based on the average of all decision tree predictions [61].
S c o r e R ( s , S ) = v a r { y | S } | S l | | S | v a r { y | S l } | S r | | S | v a r { y | S r } v a r { y | S }
where v a r { y | S } is the variance of the output y in sample S and l and r denote the left and right divergence values of the nodes, respectively.
Unlike random forests that build each tree by sampling with feedback, Extra-Trees uses the entire training sample to establish each tree, which can effectively reduce the effects of the bias and variance of the sample set [62]. Moreover, unlike random forests that obtain the best bifurcation within a random subset, Extra-Trees obtains the bifurcation of each decision tree completely randomly [38].
In this paper, the Extra-Trees algorithm is used as the lunar surface oxide content prediction model, and the flow chart of this model is shown in Figure 2. The best combination of bands after secondary screening is input into the model, and each oxide content is output to construct the Extra-Trees oxide content prediction model. The implementation of the Extra-Trees model in this study is based on the ExtraTreesRegressor algorithm provided in the sklearn package of the Python language. The algorithm has two main parameters, including the minimum size of the samples for splitting the nodes, n m i n , and the size of the randomly selected attributes for each node, k [38]. According to Geurts’s experimental results, the default parameter settings are generally satisfactory in terms of accuracy and computational efficiency; therefore, the default parameters are used.

2.5. Evaluation Indicators

The accuracy and predictive ability of the model are mainly evaluated in terms of the coefficient of determination (R2) and root mean square error (RMSE). R2 is used to evaluate the correlation between the predicted and true values of the sample, and the closer R2 is to 1, the higher the correlation between the predicted and true values (Equation (8). The RMSE is used to evaluate the predictive ability of the model for a given data set, and the smaller the RMSE is, the better the predictive ability of the model [37,52].
R 2 = 1 i = 1 m ( c i ˇ c i ) 2 i = 1 m ( c i c i ¯ ) 2
R M S E = i = 1 m ( c i ˇ c i ) 2 n
where c i ˇ , c i and c i ¯ denote the true, predicted, and mean values of the i th   sample, respectively, and n   denotes the number of samples.

3. Results

3.1. Correlation Coefficients between Elements and Reflectance

In this paper, IIM reflectivity data in the range of 513–891 nm were used as the basis for the estimation and analysis of the lunar surface oxide content. First, the standard bidirectional emissivity data for 76 LSCC samples (300–2600 nm) were resampled into the wavelength range of IIM data (513–891 nm) using a linear interpolation method. The spectral data after resampling were differentially transformed to obtain the reflectivity spectral band difference.
The linear correlation coefficients between LSCC reflectance and the content were calculated (Figure 3). The results showed that the correlation coefficients did not vary significantly within 513–891 nm, and the correlation coefficients between FeO, TiO2, Al2O3, and SiO2 and reflectance remained at approximately 0.75, 0.58, 0.79, and 0.25, respectively. Additionally, positive correlations were observed between the Al2O3 and SiO2 contents and spectral reflectance, and negative correlations were observed between the FeO and TiO2 contents and spectral reflectance.
The correlations between the elemental contents and LSCC reflectance are mainly concentrated in the grain size range of 10–20 μm, as the optical information at these sizes is most similar to that of the bulk soil [18,63]. Figure 4 demonstrates the relationship between the 891 nm reflectance and the content of each oxide (FeO and TiO2) in the LSCC samples of size 10–20 μm. The FeO and TiO2 contents decrease with increasing reflectance, highlighting the negative correlations between the FeO and TiO2 contents and spectral reflectance. However, the relationship between elements and IIM reflectance is not necessarily linear. For some oxides, it is difficult to describe the complex relationship between them and the spectra based on conventional linear regression [36], and suitable nonlinear models are needed to invert the major oxide contents.
Moreover, there are also correlations among elements. As shown in Figure 5, there is a significant inverse correlation between the FeO and Al2O3 contents and a significant inverse correlation between TiO2 and SiO2. These findings are consistent with the conclusion that there are positive correlations between the Al2O3 and SiO2 contents and spectral reflectance and negative correlations between the FeO and TiO2 contents and spectral reflectance, as shown in Figure 3.
Figure 6 shows the correlations between the reflectance band differences and the major oxides, and the color of each point in the figure indicates the magnitude of the correlation coefficient between the band differences and the oxide contents. The results show that the band differences display the highest correlations with FeO and Al2O3 and the lowest correlations with TiO2 and SiO2. The maximum value of the correlation coefficient between FeO and the band difference is located at 541 nm–532 nm (0.861); most of the correlation coefficients between Al2O3 and the band difference of IIM are approximately 0.80, and the maximum value is 0.870. Additionally, the correlation between TiO2 and the band difference of IIM decreases from the lower left to the upper right corner, and the maximum value appears at 532 nm–522 nm (0.710). The correlation between SiO2 and the band difference is poor, with a maximum value of 0.427, and most of the correlation coefficients are approximately 0.40. In general, for the four oxides, the high correlations are mainly concentrated in the lower left corner, and the band differences among the longer bands (upper right corner in the figure) are poorly correlated. Correlation calculations for the original bands and all band differences in the IIM spectral range separately show that band differences can be used to effectively enhance the correlations between reflectance and the oxide contents to determine the best combination of bands for subsequent modeling.

3.2. Screening of Feature Bands

The band differences that passed the 0.01 significance test were screened twice, and the sensitive bands identified through BKM clustering analysis combined with SPA are shown in Table 3. These bands are used as the inputs to the model.

3.3. Establishment and Evaluation of the Extra-Trees Model

The spectral features obtained based on the band reduction screening were used as the independent variables for the Extra-Trees modeling analysis. Additionally, the lunar surface oxide content was used as the dependent variable, and the SPXY algorithm was applied to divide the sample set and establish the Extra-Trees model. To effectively analyze the results of variable screening, the modeling results for the original bands were also applied for comparison. The prediction results (Figure 7 and Figure 8) show that the coefficients of determination R2 of the FeO, TiO2, Al2O3, and SiO2 prediction models after differential transformation and variable filtering were 0.962, 0.944, 0.964, and 0.860, respectively, which are all greater than 0.850, indicating good prediction ability for the four oxide contents. The model accuracy is improved compared with that obtained based on the original bands, further verifying the importance of the differential transformation of the original bands and reduced-dimension filtering of the variable information. The modeling accuracy of feature band selection was compared with the accuracy reported by Wu [18] (Table 4), and the accuracy for all four oxide contents after differential transformation and variable screening was improved. Therefore, in this paper, the differential variables obtained based on the SPA were adopted as the inputs to the model for prediction.

3.4. Extra-Trees Modelling in the Apollo 17 Area

The Apollo 17 area is close to the junction of Serenitatis and Tranquillitatis, and the coordinates of the landing site are (30°44′58.3″E, 20°9′50.5″N). The geological environment of the area is complex; according to the analysis of Apollo17 samples, the rock types in the area are mainly overlying mare basalts and orange-grey breccia [64].

3.4.1. Extra-Trees Modelling of FeO

Fe mainly exists in lunar mafic minerals, and understanding the abundance and distribution of Fe can aid in understanding the nature and origin of the Moon and provide an indicator of lithology to distinguish between mare and highland [65]. In this paper, Extra-Trees modeling is performed with the effective bands obtained by the Pearson correlation coefficients and SPA. Due to the large number of bands of IIM data that undergo band-difference transformation, the coverages overlap. The sensitive bands that play a key role in the model are first screened, and the model is then trained and tested; however, the regression results are not the only reference used to evaluate the model. To further verify the accuracy of the model, it was applied to the Apollo 17 area (landing site coordinates: 30°44′58.3″ E, 20°9′50.5″ N) to evaluate the accuracy of FeO inversion.
The FeO training and testing accuracies for the Extra-Trees model after variable screening are shown in Figure 8a, where the blue dots represent the training set, and the red dots represent the test set. The coefficient of determination (R2) for the test set is 0.962, and the RMSE is approximately 1.028. The FeO inversion results (Figure 9) show that the predicted FeO abundance in the region near the Apollo 17 landing site ranges from 7.64 to 15.72 wt.%, and the FeO content of the mare area is significantly higher than that of the highland area, with good consistency regarding the distribution of FeO content values in comparison with the results of Lucey et al. [17].

3.4.2. Extra-Trees Modelling of TiO2

The distribution of the TiO2 content varies greatly among different types of mare basalts; thus, it is the key to classifying mare basalt types and is important for the exploitation of lunar ilmenite resources [6]. The optimal band selection for TiO2 is similar to that in the FeO model. First, the 287 bands that passed the p < 0.01 significance test were used to establish the sensitive region for TiO2, and the best band combination was selected by clustering analysis and applying the SPA. The results are shown in Table 4.
The TiO2 training and testing accuracies of the Extra-Trees model after variable screening are shown in Figure 8b, where the coefficient of determination (R2) for the test set is 0.944 and the RMSE is approximately 0.672. The model was applied for TiO2 inversion in the test area (Figure 10), and the results show that the TiO2 abundance ranges from 2.21 to 9.37 wt.%; the high-value areas are mainly located in the mare, and the low-value areas are mainly in the highland, which is generally similar to the results reported by Lucey et al. [17]. The inversion results display a similar trend, but the lower limit value of TiO2 is higher, and the contents are slightly higher in the lower left corner of the figure, which may be due to the effect of topographic shadowing from the light angle.

3.4.3. Extra-Trees Modelling of Al2O3

Almost all Al on the lunar surface is present in plagioclase, and the distribution of the Al2O3 content is related to the formation and evolution of the lunar crust; notably, levels are highest in highland calcarenite rocks and intermediate in mare basalts and can be used to assist in distinguishing mare basalts from highland plagioclase. To assess the accuracy of its inversion, an initial screening based on Pearson correlation coefficients was carried out, followed by BKM clustering analysis combined with the SPA algorithm to find the best band combinations for Al2O3. From Table 3, it can be seen that the eight best band differences were finally screened and used as input to the model for modeling.
The training and testing accuracies of the Al2O3 Extra-Trees model after variable screening are shown in Figure 8c. The coefficient of determination (R2) for the test set with this method is 0.964, and the RMSE is approximately 0.942. The model was applied to the Apollo 17 region for Al2O3 inversion, and the results (Figure 11) show that the Al2O3 abundance obtained with this method ranges from 11.75–24.43 wt.%, with a mean value of 14.70 wt.%. The abundance of Al2O3 in the highlands region is lower than that in the mare region, and this correlates negatively with FeO.

3.4.4. Extra-Trees Modeling of SiO2

SiO2 is in nearly every rock and mineral grain on the Moon as the fundamental component of silicate minerals. The optimal waveband selection for SiO2 is similar to that for FeO, and the optimum number of bands for SiO2 is shown in Table 3, with a total of 30 band differences.
The training and testing accuracies of the SiO2 Extra-Trees model after variable screening are shown in Figure 8d. The R2 value for the test set obtained with this method is 0.860, and the RMSE is approximately 0.897. The model was applied for SiO2 inversion in the Apollo 17 region (Figure 11c), and the results show that the SiO2 abundance obtained with this method ranges from 41.08 to 45.67 wt.%, with a mean value of 43.84 wt.%. SiO2 is commonly found in lunar rocks, with higher levels in the highland than in the mare, which is negatively correlated with TiO2.

3.5. Oxide Content Mapping for the Copernicus Crater Region

3.5.1. Regional Distribution of Lunar Surface Chemistry

Using ArcGIS as the platform, a 1:2,500,000 lunar geological map of the study area was mapped (Figure 12), which covers a geographical area of approximately 125 × 104 km2 at longitudes of −52° to 6.6° W and latitudes of −2.8° to 28° N. The study area is mainly occupied by the Copernicus crater, Kepler crater, and Aristarchus crater, located in the transition zone between the mare and the lunar highlands in the south of the Oceanus Procellarum. The study area is rich in geomorphic features and material types, and the rocks are mainly mare basalts, followed by KREEP rocks and ferroan anorthositic suite. Linear structures are widely developed and distributed and are of great importance to studies of lunar diagenesis and geological evolution.
In the study area, the mare basalt is widely distributed in the lower and flat region, including south of Oceanus Procellarum and the impact basin floor, basically occupying the entire study area. These mare basalts formed between 19 Ga and 37 Ga ago according to the isotopic age data of lunar samples and the dating results of crater size-frequency distribution (CSFD) for mare basalt units. Meanwhile, a large number of tectonic developments are present, and the main linear formations are rills, wrinkle ridges, and crater-floor fractures, with wrinkle ridges being the most numerous, widespread, and characteristic linear structures, mainly within the mare basalt. Rills are also widespread and distributed around the wrinkle ridges; the circular structures are mainly domes, volcanic vents, craters, and impact basins.
The main geological evolution in the study area includes endodynamic geological evolution processes, such as differentiation of magma ocean, plutonic magmatism, and volcanism, and exodynamic geological evolution processes such as impact. During the Magma-Oceanian period, the primary ferroan anorthositic crust mainly formed, followed by the formation of the Copernicus-H basin in the Aitkenian, the Imbrain basin in the Imbrian, the Timocharis crater in the Eratothenian, and the Copernicus, Kepler, and Aristarchus craters in the Copernican.
An Extra-Trees model based on variable screening was used to invert the oxide contents in the region near the Copernicus crater using IIM data, and frequency histograms were plotted based on the oxide contents (Figure 13 and Figure 14).
Figure 13 shows the predicted results of oxide contents (FeO, TiO2, Al2O3, and SiO2) in the Copernicus crater region. As shown in Figure 13, the FeO content ranges from 3.61 to 17.68 wt.%, with a mean value of 12.51 wt.%. The FeO content in the mare is significantly higher than that in the highland, with the lower FeO content mainly located in the Copernicus crater, Aristarchus crater, and Kepler crater floor, and the high FeO content area mainly located on the southwest side of Kepler crater. The frequency histogram in Figure 14 also shows a clear bimodal distribution, with the first peak at approximately 12.80 wt.% and the second peak at approximately 15.1 wt.%. The TiO2 content ranges from 0.38 to 8.88 wt.%, with a mean value of 2.60 wt.%. Its content is significantly higher in the mare than in the highland, and there is a positive correlation between the FeO content in areas with high TiO2 content in the mare basalt area. The low TiO2 content areas are mainly located around the Copernicus, Aristarchus, and Kepler craters, and the high TiO2 content areas are mainly located on the southwestern side of the Kepler crater and the northeastern side of the Copernicus crater, while the rest of the mare has moderate TiO2 content. The frequency histogram distribution shows a clear bimodal distribution, with the first peak at approximately 2.16 wt.% and the second peak at approximately 2.82 wt.%. The Al2O3 content ranges from 10.27–29.09 wt.%, with a mean value of 13.56 wt.%. It is clearly higher in the highland than in the mare, with an inverse correlation with FeO content. The high-value areas are mainly located in the vicinity of craters, with lower levels in the mare, especially on the southwestern and western sides of the Kepler crater. The frequency histogram of Al2O3 shows a clear bimodal distribution, with the first peak of approximately 13.57 wt.% and the second peak of approximately 18.25 wt.%. The SiO2 content ranges from 39.82 to 47.01 wt.%, with an average value of 39.53 wt.%. The SiO2 content is high throughout the study area, ranging from 39.82 to 47.01 wt.%, with a mean value of 39.53 wt.%. The content is significantly higher in the highland than in the mare, which shows an inverse correlation with TiO2. The high-value areas are mainly located within the crater and its sputtering, with lower levels in the rest of the area. The SiO2 first peak of the frequency histogram is approximately 41.11 wt.%, and the second peak is approximately 44.25 wt.%.

3.5.2. Comparison with Previous Works

Figure 15 and Figure 16 show the FeO and TiO2 content distributions generated based on the different data/methods. First, the correlation analysis was carried out with the FeO content distribution maps generated in this paper (Table 5). The correlations between the FeO inversion results in this paper and the inversion results of Clementine, IIM (band ratio), and LP GRS data were 0.91, 0.89, and 0.90, respectively, and the correlation was basically stable at approximately 0.90. The correlations between the TiO2 inversion and the Clementine and IIM (band ratio) inversions (Table 6) were 0.61 and 0.65, respectively. Due to the low resolution of the GRS TiO2 data, this correlation was not analyzed. As seen in Figure 15, the FeO and TiO2 contents of the inversions in this paper are lower than the mean values of the inversions of other models. To better compare the discrepancy between the inversion results of this paper and the Clementine FeO results. the IIM FeO map is compared with the Clementine FeO product resampled to the same resolution (0.5°), and the correlation between the two products is 91%, with good agreement, as shown in Figure 17. The difference between the results of this paper and Clementine may be due to the influence of topographic shadows caused by the camera angle, post-image processing, and differences in the oxide inversion algorithm. Additionally, using IIM data, the difference between the results of this paper and the band ratio-based method is mainly due to differences in the oxide inversion algorithm and post-image processing. In contrast to the IIM optical image data, which can only capture the material component of the lunar surface, the LP GRS data are not affected by topographic shadows, light angles, etc., and can be detected to a depth of 20–30 cm below the lunar surface [15]. Therefore, the difference between the results of this paper and the LP GRS is mainly because the GRS data are not affected by topographic shadows and because the GRS data are deeper than the IIM data. The results from different data and methods indicate that, despite the different absolute abundance ranges, their relative abundances are similar to each other [19,33].
The existence of negative correlations between FeO and Al2O3 and TiO2 and SiO2 was verified. Specifically, two-dimensional density plots of FeO and Al2O3 and two-dimensional density plots of TiO2 and SiO2 were established (Figure 18). The results show that the correlation between FeO and Al2O3 is 75%, and the correlation between TiO2 and SiO2 is 76%. These results are consistent with the previous conclusion that Fe is negatively correlated with Al content and that a decreasing Si phenomenon tends to occur along with the enrichment of Ti elements [64].

4. Discussion

In this paper, a new method of feature band selection combined with Extra-Trees is presented to predict oxide contents on the lunar surface. The results show that the proposed method has good potential for assessing the distribution of oxide contents using spectral features.

4.1. Comparison with Other Similar Studies

Previous studies mainly used two bands to retrieve oxide contents according to the absorption characteristics of mineral elements in the visible near-infrared (VNIR) band. For example, 750 nm and 950 nm [17], 757 nm and 891 nm [21], and 757 nm and 918 nm [19] were used to retrieve the FeO content. Additionally, wavelengths of 415 nm and 750 nm [17,67], 531 nm and 757 nm [19], and 321 nm and 415 nm [16] were used to retrieve TiO2 content. Due to the complexity of the lunar soil composition, the ability to describe the nonlinear relationships between spectral features and oxide contents from only two bands is very limited [36], and the accuracy of composition inference is highly dependent on the selected inversion models [68]. In addition, nonchromophore elements do not display absorption characteristics in the VNIR region, which makes it difficult to use most methods in nonchromophore element (Al and Si) inversion. Currently, some scholars [18,20,54] have established the relationships between spectral features and oxide contents based on regression models to invert these oxide contents. Therefore, the inversion of oxide contents on the lunar surface using regression models has become a popular research topic.
Spectral pretreatment methods can improve the correlations between spectral features and oxide contents, as has been well established in previous studies. For example, Li [40] attempted to improve the performance of a model by using a log(1/reflectance) transformation approach. Pieters [26] and Sun et al. [20] applied a band-ratio approach to improve the above correlations. Wu [18] experimentally demonstrated that the correlations obtained considering band differences were superior to those obtained with the band-ratio technique. Therefore, the first-order band-difference transformation method was chosen in this paper to improve the correlations between spectral features and oxide contents. After the preprocessing of spectral data, feature band selection is crucial. However, previous studies paid little attention to the optimization of feature variables in models. In this study, on the basis of earlier methods [20,37], the Pearson correlation coefficient and SPA are used to select characteristic bands, thus ensuring that high correlations between input spectral features and oxides are retained and reducing the multicollinearity among spectral features. With expanding research, a variety of machine-learning models have been applied to the inversion of lunar surface oxides. The Extra-Trees algorithm selected in this paper can effectively describe the nonlinear characteristics of the studied features and provides strong generalization ability and robustness. This feature band selection approach combined with the Extra-Trees prediction method for predicting oxide contents on the lunar surface yielded reasonable accuracy for the sample test set (R2 values of 0.962, 0.944, 0.964, and 0.86 for FeO, TiO2, Al2O3, and SiO2, respectively, and RMSEs of 1.028, 0.672, 0.942, and 0.897, respectively) and the regional scale (Apollo17 area; the area near the Copernicus crater).

4.2. Future Prospects

(1)
The LSCC sample data used in this paper are mainly distributed in the low-latitude area of the lunar nearside, and the number of samples is small; consequently, the distribution characteristics of the oxide contents on the lunar surface cannot be comprehensively reflected, and there are limitations in both quantity and region. This leads to some uncertainty in the prediction results. In the future, more samples should be obtained to supplement the sample data limitations in certain regions of the Moon, such as at middle and high latitudes, and increase the number of samples, which is expected to improve the accuracy of chemical abundance distribution characteristics.
(2)
There are spectral anomalies at the edges of different orbit images of IIM hyperspectral data, and factors such as solar azimuth and topographic relief will lead to shadows in the images, which will inevitably increase errors in the inversion results. In future research, we should consider how to mitigate the spectral anomalies and topographic shadows in IIM data or explore the potential of using higher-quality and higher-resolution remote sensing data for inversion, such as the M3 data obtained with the Indian satellite Chandrayaan-1 and the MI data obtained with Kaguya in Japan, to improve the prediction accuracy of the model.
(3)
The continuity of hyperspectral data greatly enriches the amount of information available in remote sensing data, but it can also lead to issues such as information redundancy and high correlations between bands. Therefore, determining how to obtain the best combination of sensitive bands is an important step in the application of hyperspectral data. The sensitive band screening method applied in this paper provides a reference for other chemistry inversion research. In the future, more band screening algorithms can be applied for feature selection with lunar hyperspectral data to reasonably select the best number and combination of bands and improve accuracy.
(4)
In the era of big data, big data theory and technology are important tools for solving practical problems. The application of machine learning and deep learning algorithms for lunar chemistry inversion is still in its infancy. Although the Extra-Trees model developed in this paper provides good prediction ability, there is still room for further improvement. Thus, a future development direction is to complete lunar surface oxide inversion through better machine learning and deep learning methods.

5. Conclusions

The plan to return to the Moon in the new era is to develop and utilize lunar resources, establish a lunar base, and use the Moon as a springboard to gradually carry out deep space exploration. Through comprehensive remote sensing exploration of the Moon to analyze and assess the distribution of lunar mineral resources, it can provide a long-term stable resource reserve for the sustainable development of human society. In this paper, using IIM data, the original reflectance as the initial input and the oxide contents of LSCC samples as the desired outputs, we use a combination of the mathematical transformation of spectral data (first-order difference method), a feature band selection technique (with the SPA) and a machine learning modeling method (Extra-Trees). The correlations between reflectance and oxide contents are improved to identify the difference bands most sensitive to oxide levels and construct a prediction model with excellent inversion ability for lunar surface oxide (FeO, TiO2, Al2O3, and SiO2) contents. This approach can solve the problems of low accuracy and poor generalizability encountered with traditional modeling methods based on raw spectral data. The main conclusions of this paper are as follows.
(1)
The correlation calculations for the original bands and all band differences in the IIM spectral range separately show that considering the band differences can enhance the correlations between reflectance and oxide contents. Thus, the best combination of spectral bands can be used in subsequent modeling, and the accuracy of the model can be improved. Moreover, the calculated interelement correlations suggest that Fe is negatively correlated with Al and that Si depletion is often accompanied by the enrichment of Ti.
(2)
In total, 325 band differences were initially screened using Pearson correlation coefficients, and then secondary downscaling screening was performed according to BKM combined with SPA. Consequently, 17, 5, 8, and 30 feature bands were retained for the four oxides (FeO, TiO2, Al2O3, and SiO2, respectively) for modeling. The SPA encompasses most of the spectral information associated with samples, effectively reduces the complexity of modeling and reduces the covariance interference among spectral features.
(3)
Machine learning algorithms are increasingly integrated with lunar surface chemistry inversion in the big data era. Big data undoubtedly provide new data-driven research methods and can effectively overcome the relatively limited numbers of lunar samples. Moreover, the high dimensional and complex nonlinear relationships between oxide contents and spectral reflectance can be better described. In this paper, we apply the Extra-Trees algorithm for chemistry inversion to predict the distribution of chemistry on the lunar surface. The results show that the R2 values of the test sets for FeO, TiO2, Al2O3, and SiO2 are 0.962, 0.944, 0.964, and 0.860, respectively, and the RMSE values are 1.028, 0.672, 0.942, and 0.897, respectively, improving the modeling accuracy over the original bands.
(4)
The average contents of the four oxides (FeO, TiO2, Al2O3, and SiO2) in the region near the Copernicus crater are 12.51 wt.%, 2.60 wt.%, 13.56 wt.%, and 39.53 wt.%, respectively. The oxide content distributions display obvious variations, and the frequency histograms show a clear bimodal distribution. The comparison of the present result with representative models shows that the model in this paper provides good agreement in the inversion of oxide abundance, thus providing a new idea and method for the inversion of oxide abundance.

Author Contributions

Conceptualization, S.W. and J.C.; methodology, S.W.; software, S.W.; validation, S.W., J.C., and L.L.; formal analysis, S.W.; resources, C.Z. and R.H.; data curation, S.W., J.C., and C.Z.; writing—original draft preparation, S.W.; writing—review and editing, S.W., J.C., and L.L.; visualization, S.W., C.Z., and Q.Z.; supervision, S.W., L.L., and R.H.; funding acquisition, J.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Geological Survey Project of China Geological Survey, grant No. 42932022001, and the Natural Science Basis Research Plan in Shaanxi province of China, grant No. 2020JQ-742.

Data Availability Statement

Chang’E-1 IIM data are provided by “The Science and Application Center for Moon and Deep Space Exploration” at http://Moon.bao.ac.cn/.

Acknowledgments

The authors gratefully acknowledge Zongcheng Ling for providing the improved Chang’E-1 IIM imagery and Chang Liu and Yiwen Pan for their guidance in mineralogy. The authors also thank the cartographers who compiled the geological map of the Moon.

Conflicts of Interest

The authors declare no conflictd of interest.

References

  1. Rossi, A.P.; van Gasselt, S. (Eds.) Planetary Geology; Springer International Publishing: Cham, Switzerland, 2018; ISBN 978-3-319-65177-4. [Google Scholar]
  2. Anderson, S.L.; Sansom, E.K.; Shober, P.M.; Hartig, B.A.D.; Devillepoix, H.A.R.; Towner, M.C. The Proposed Silicate-Sulfuric Acid Process: Mineral Processing for In Situ Resource Utilization (ISRU). Acta Astronaut. 2021, 188, 57–63. [Google Scholar] [CrossRef]
  3. Mills, J.N.; Katzarova, M.; Wagner, N.J. Comparison of Lunar and Martian Regolith Simulant-Based Geopolymer Cements Formed by Alkali-Activation for in-Situ Resource Utilization. Adv. Space Res. 2022, 69, 761–777. [Google Scholar] [CrossRef]
  4. O’Leary, B. Mining the Apollo and Amor Asteroids. Science 1977, 197, 363–366. [Google Scholar] [CrossRef] [PubMed]
  5. Anand, M.; Crawford, L.A.; Balat-Pichelin, M.; Abanades, S.; Westrenen, W.V.; Peraudeau, G.; Jaumann, R.; Seboldt, W. A Brief Review of Chemical and Mineralogical Resources on the Moon and Likely Initial in Situ Resource Utilization (ISRU) Applications. Planet. Space Sci. 2012, 74, 42–48. [Google Scholar] [CrossRef]
  6. Melendrez, D.E.; Johnson, J.R.; Larson, S.M.; Singer, R.B. Remote Sensing of Potential Lunar Resources: 2. High Spatial Resolution Mapping of Spectral Reflectance Ratios and Implications for Nearside Mare TiO2 Content. J. Geophys. Res. 1994, 99, 5601. [Google Scholar] [CrossRef]
  7. Korokhin, V.V.; Kaydash, V.G.; Shkuratov, Y.G.; Stankevich, D.G.; Mall, U. Prognosis of TiO2 Abundance in Lunar Soil Using a Non-Linear Analysis of Clementine and LSCC Data. Planet. Space Sci. 2008, 56, 1063–1078. [Google Scholar] [CrossRef]
  8. Taylor, S.R. The Unique Lunar Composition and Its Bearing on the Origin of the Moon. Geochim. Cosmochim. Acta 1987, 51, 1297–1309. [Google Scholar] [CrossRef]
  9. Athiray, P.S.; Narendranath, S.; Sreekumar, P.; Dash, S.K.; Babu, B.R.S. Validation of Methodology to Derive Elemental Abundances from X-Ray Observations on Chandrayaan-1. Planet. Space Sci. 2013, 75, 188–194. [Google Scholar] [CrossRef]
  10. Bhatt, M.; Mall, U.; Wöhler, C.; Grumpe, A.; Bugiolacchi, R. A Comparative Study of Iron Abundance Estimation Methods: Application to the Western Nearside of the Moon. Icarus 2015, 248, 72–88. [Google Scholar] [CrossRef]
  11. Chevrel, S.D.; Pinet, P.C.; Daydou, Y.; Maurice, S.; Lawrence, D.J.; Feldman, W.C.; Lucey, P.G. Integration of the Clementine UV-VIS Spectral Reflectance Data and the Lunar Prospector Gamma-Ray Spectrometer Data: A Global-Scale Multielement Analysis of the Lunar Surface Using Iron, Titanium, and Thorium Abundances: Lunar Surface Multielement Analysis. J. Geophys. Res. 2002, 107, 15-1–15-14. [Google Scholar] [CrossRef]
  12. Lawrence, D.J.; Feldman, W.C.; Barraclough, B.L.; Binder, A.B.; Elphic, R.C.; Maurice, S.; Thomsen, D.R. Global Elemental Maps of the Moon: The Lunar Prospector Gamma-Ray Spectrometer. Science 1998, 281, 1484–1489. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Lemelin, M.; Lucey, P.G.; Song, E.; Taylor, G.J. Lunar Central Peak Mineralogy and Iron Content Using the Kaguya Multiband Imager: Reassessment of the Compositional Structure of the Lunar Crust: Lunar Central Peak Mineralogy and Iron. J. Geophys. Res. Planets 2015, 120, 869–887. [Google Scholar] [CrossRef]
  14. Narendranath, S.; Athiray, P.S.; Sreekumar, P.; Kellett, B.J.; Alha, L.; Howe, C.J.; Joy, K.H.; Grande, M.; Huovelin, J.; Crawford, I.A.; et al. Lunar X-Ray Fluorescence Observations by the Chandrayaan-1 X-Ray Spectrometer (C1XS): Results from the Nearside Southern Highlands. Icarus 2011, 214, 53–66. [Google Scholar] [CrossRef]
  15. Prettyman, T.H.; Hagerty, J.J.; Elphic, R.C.; Feldman, W.C.; Lawrence, D.J.; McKinney, G.W.; Vaniman, D.T. Elemental Composition of the Lunar Surface: Analysis of Gamma Ray Spectroscopy Data from Lunar Prospector: Lunar Elemental Composition. J. Geophys. Res. 2006, 111, E12. [Google Scholar] [CrossRef]
  16. Sato, H.; Robinson, M.S.; Lawrence, S.J.; Denevi, B.W.; Hapke, B.; Jolliff, B.L.; Hiesinger, H. Lunar Mare TiO2 Abundances Estimated from UV/Vis Reflectance. Icarus 2017, 296, 216–238. [Google Scholar] [CrossRef]
  17. Lucey, P.G.; Blewett, D.T.; Jolliff, B.L. Lunar Iron and Titanium Abundance Algorithms Based on Final Processing of Clementine Ultraviolet-Visible Images. J. Geophys. Res. 2000, 105, 20297–20305. [Google Scholar] [CrossRef]
  18. Wu, Y. Major Elements and Mg# of the Moon: Results from Chang’E-1 Interference Imaging Spectrometer (IIM) Data. Geochim. Cosmochim. Acta 2012, 93, 214–234. [Google Scholar] [CrossRef]
  19. Yan, B.; Xiong, S.Q.; Wu, Y.; Wang, Z.; Dong, L.; Gan, F.; Yang, S.; Wang, R. Mapping Lunar Global Chemical Composition from Chang’E-1 IIM Data. Planet. Space Sci. 2012, 67, 119–129. [Google Scholar] [CrossRef]
  20. Sun, L.; Ling, Z.; Zhang, J.; Li, B.; Chen, J.; Wu, Z.; Liu, J. Lunar Iron and Optical Maturity Mapping: Results from Partial Least Squares Modeling of Chang’E-1 IIM Data. Icarus 2016, 280, 183–198. [Google Scholar] [CrossRef]
  21. Xiang, W.; Jianping, C.; Yanbo, X.; Yongchun, Z.; Bokun, Y.; Yunzhao, W. Inversion of the Main Mineral Compositions and Subdivision of Tectonic Units on Lunar LQ-4 Based on Chang’e Data. Acta Geol. Sin. Engl. Ed. 2015, 89, 1882–1894. [Google Scholar] [CrossRef]
  22. Wang, X.; Zhang, J.; Ren, H. Lunar Surface Chemistry Observed by the KAGUYA Multiband Imager. Planet. Space Sci. 2021, 209, 105360. [Google Scholar] [CrossRef]
  23. Lemelin, M.; Lucey, P.G.; Miljković, K.; Gaddis, L.R.; Hare, T.; Ohtake, M. The Compositions of the Lunar Crust and Upper Mantle: Spectral Analysis of the Inner Rings of Lunar Impact Basins. Planet. Space Sci. 2019, 165, 230–243. [Google Scholar] [CrossRef]
  24. Burns, R.G. Mineralogical Applications of Crystal Field Theory, 2nd ed.; Cambridge University Press: Cambridge, UK, 1993; ISBN 978-0-521-43077-7. [Google Scholar]
  25. Jaumann, R. Spectral-Chemical Analysis of Lunar Surface Materials. J. Geophys. Res. 1991, 96, 22793. [Google Scholar] [CrossRef]
  26. Pieters, C. Statistical Analysis of the Links among Lunar Mare Soil Mineralogy, Chemistry, and Reflectance Spectra. Icarus 2002, 155, 285–298. [Google Scholar] [CrossRef]
  27. Li, L. Partial Least Squares Modeling to Quantify Lunar Soil Composition with Hyperspectral Reflectance Measurements. J. Geophys. Res. 2006, 111, E04002. [Google Scholar] [CrossRef] [Green Version]
  28. Blewett, D.T.; Lucey, P.G.; Hawke, B.R.; Jolliff, B.L. Clementine Images of the Lunar Sample-Return Stations: Refinement of FeO and TiO2 Mapping Techniques. J. Geophys. Res. 1997, 102, 16319–16325. [Google Scholar] [CrossRef]
  29. Gillis-Davis, J.J.; Lucey, P.G.; Hawke, B.R. Testing the Relation between UV–Vis Color and TiO2 Content of the Lunar Maria. Geochim. Cosmochim. Acta 2006, 70, 6079–6102. [Google Scholar] [CrossRef]
  30. Ling, Z.; Zhang, J.; Liu, J.; Zhang, W.; Bian, W.; Ren, X.; Mu, L.; Liu, J.; Li, C. Preliminary Results of FeO Mapping Using Imaging Interferometer Data from Chang’E-1. Chin. Sci. Bull. 2011, 56, 376–379. [Google Scholar] [CrossRef] [Green Version]
  31. Ling, Z.; Zhang, J.; Liu, J.; Zhang, W.; Zhang, G.; Liu, B.; Ren, X.; Mu, L.; Liu, J.; Li, C. Preliminary Results of TiO2 Mapping Using Imaging Interferometer Data from Chang’E-1. Chin. Sci. Bull. 2011, 56, 2082–2087. [Google Scholar] [CrossRef] [Green Version]
  32. Lucey, P.G.; Taylor, G.J.; Malaret, E. Abundance and Distribution of Iron on the Moon. Science 1995, 268, 1150–1153. [Google Scholar] [CrossRef]
  33. Shkuratov, Y.G. Composition of the Lunar Surface as Will Be Seen from SMART-1: A Simulation Using Clementine Data. J. Geophys. Res. 2003, 108, 5020. [Google Scholar] [CrossRef]
  34. Sun, L.; Ling, Z. Partial Least Squares Modeling of Lunar Surface FeO Content with Clementine Ultraviolet-Visible Images. In Planetary Exploration and Science: Recent Results and Advances; Jin, S., Haghighipour, N., Ip, W.-H., Eds.; Springer: Berlin/Heidelberg, Germany, 2015; pp. 1–20. ISBN 978-3-662-45051-2. [Google Scholar]
  35. Wang, X.; Niu, R. Lunar Titanium Abundance Characterization Using Chang’E-1 IIM Data. Sci. China Phys. Mech. Astron. 2012, 55, 170–178. [Google Scholar] [CrossRef]
  36. Xia, W.; Wang, X.; Zhao, S.; Jin, H.; Chen, X.; Yang, M.; Wu, X.; Hu, C.; Zhang, Y.; Shi, Y.; et al. New Maps of Lunar Surface Chemistry. Icarus 2019, 321, 200–215. [Google Scholar] [CrossRef]
  37. Zhang, X.; Li, C.; Lü, C. Quantification of the Chemical Composition of Lunar Soil in Terms of Its Reflectance Spectra by PCA and SVM. Chin. J. Geochem. 2009, 28, 204–211. [Google Scholar] [CrossRef]
  38. Geurts, P.; Ernst, D.; Wehenkel, L. Extremely Randomized Trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef] [Green Version]
  39. Jin, M.; Ding, X.; Han, H.; Pang, J.; Wang, Y. An Improved Method Combining Fisher Transformation and Multiple Endmember Spectral Mixture Analysis for Lunar Mineral Abundance Quantification Using Spectral Data. Icarus 2022, 380, 115008. [Google Scholar] [CrossRef]
  40. Li, L. Quantifying Lunar Soil Composition with Partial Least Squares Modeling of Reflectance. Adv. Space Res. 2008, 42, 267–274. [Google Scholar] [CrossRef]
  41. Zhou, P.; Zhao, Z.; Wei, G.; Huo, H. Mineral Content Estimation of Lunar Soil of Lunar Highland and Lunar Mare Based on Diagnostic Spectral Characteristic and Partial Least Squares Method. Appl. Sci. 2022, 12, 1197. [Google Scholar] [CrossRef]
  42. Taylor, L.A.; Pieters, C.M.; Morris, R.V.; Keller, L.P.; Wentworth, S.J. Integration of the Chemical and Mineralogical Characteristics of Lunar Soils with Reflectance Spectroscopy. In Proceedings of the Lunar and Planetary Science Conference, Houston, TX, USA, 15–19 March 1999. [Google Scholar]
  43. Lawrence, D.J.; Feldman, W.C.; Elphic, R.C.; Little, R.C.; Prettyman, T.H.; Maurice, S.; Lucey, P.G.; Binder, A.B. Iron Abundances on the Lunar Surface as Measured by the Lunar Prospector Gamma-Ray and Neutron Spectrometers: Iron Abundances on the Lunar Surface. J. Geophys. Res. 2002, 107, 13-1–13-26. [Google Scholar] [CrossRef]
  44. Taylor, L.A.; Pieters, C.; Patchen, A.; Taylor, D.H.; Morris, R.V.; Keller, L.P.; Mckay, D.S. Mineralogical Characterization of Lunar Highland Soils. Lunar Planet. Sci. 2003, 34, 1774. [Google Scholar]
  45. Tingyan, X.; Fujiang, L.; Li, L.; Chan, L.; Ying, Z.; Le, Q.; Rong, Y.; Xiaopo, Z.; Meng, Z.; Qi, Z. Prognosis of Ti Abundance in Sinus Iridum Using a Nonlinear Analysis of Chang’ E-1 Interference Imaging Spectrometer Imagery. Earth Space Sci. 2015, 2, 187–193. [Google Scholar] [CrossRef]
  46. Zhu, F.; Liu, J.H.; Ren, X.; Chen, T.Q.; Liu, J. Spectrum Reconstruction for Chang’e-1 Imaging Interferometer Data Using Modified Periodogram Method. In Proceedings of the International Conference on Electronics, Palanga, Lithuania, 13–15 June 2016. [Google Scholar]
  47. Pieters, C.M. The Moon as a Spectral Calibration Standard Enabled by Lunar Samples: The Clementine Example. New Views Moon 2 Underst. Moon Through Integr. Divers. Datasets 1999, 47–49. [Google Scholar]
  48. Pieters, C.M. Bidirectional Spectroscopy of Returned Lunar Soils: Detailed “Ground Truth” for Planetary Remote Sensors. In Proceedings of the Lunar and Planetary Science Conference, Houston, TX, USA, 18–22 March 1991. [Google Scholar]
  49. Wu, Y.; Zheng, Y.; Zou, Y.; Chen, J.; Xu, X.; Tang, Z.; Xu, A.; Yan, B.; Gan, F.; Zhang, X. A Preliminary Experience in the Use of Chang’E-1 IIM Data. Planet. Space Sci. 2010, 58, 1922–1931. [Google Scholar] [CrossRef]
  50. Steinbach, M.; Karypis, G.; Kumar, V. A Comparison of Document Clustering Techniques; Technical Report; University of Minnesota: Minneapolis, MN, USA, 2000. [Google Scholar]
  51. Araújo, M.C.U.; Saldanha, T.C.B.; Galvão, R.K.H.; Yoneyama, T.; Chame, H.C.; Visani, V. The Successive Projections Algorithm for Variable Selection in Spectroscopic Multicomponent Analysis. Chemom. Intell. Lab. Syst. 2001, 57, 65–73. [Google Scholar] [CrossRef]
  52. Ji, S.; Gu, C.; Xi, X.; Zhang, Z.; Hong, Q.; Huo, Z.; Zhao, H.; Zhang, R.; Li, B.; Tan, C. Quantitative Monitoring of Leaf Area Index in Rice Based on Hyperspectral Feature Bands and Ridge Regression Algorithm. Remote Sens. 2022, 14, 2777. [Google Scholar] [CrossRef]
  53. de Araújo Gomes, A.; Galvão, R.K.H.; de Araújo, M.C.U.; Véras, G.; da Silva, E.C. The Successive Projections Algorithm for Interval Selection in PLS. Microchem. J. 2013, 110, 202–208. [Google Scholar] [CrossRef]
  54. Xu, Y.; Wang, J.; Xia, A.; Zhang, K.; Dong, X.; Wu, K.; Wu, G. Continuous Wavelet Analysis of Leaf Reflectance Improves Classification Accuracy of Mangrove Species. Remote Sens. 2019, 11, 254. [Google Scholar] [CrossRef] [Green Version]
  55. Guo, P.; Li, T.; Gao, H.; Chen, X.; Cui, Y.; Huang, Y. Evaluating Calibration and Spectral Variable Selection Methods for Predicting Three Soil Nutrients Using Vis-NIR Spectroscopy. Remote Sens. 2021, 13, 4000. [Google Scholar] [CrossRef]
  56. Galvao, R.; Araujo, M.; Jose, G.; Pontes, M.; Silva, E.; Saldanha, T. A Method for Calibration and Validation Subset Partitioning. Talanta 2005, 67, 736–740. [Google Scholar] [CrossRef]
  57. Yang, Z.; Xiao, H.; Zhang, L.; Feng, D.; Zhang, F.; Jiang, M.; Sui, Q.; Jia, L. Fast Determination of Oxide Content in Cement Raw Meal Using NIR Spectroscopy with the SPXY Algorithm. Anal. Methods 2019, 11, 3936–3942. [Google Scholar] [CrossRef]
  58. Chen, W.; Chen, H.; Feng, Q.; Mo, L.; Hong, S. A Hybrid Optimization Method for Sample Partitioning in Near-Infrared Analysis. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2021, 248, 119182. [Google Scholar] [CrossRef] [PubMed]
  59. Galelli, S.; Castelletti, A. Assessing the Predictive Capability of Randomized Tree-Based Ensembles in Streamflow Modelling. Hydrol. Earth Syst. Sci. 2013, 17, 2669–2684. [Google Scholar] [CrossRef]
  60. Calhoun, P.; Hallett, M.J.; Su, X.; Cafri, G.; Levine, R.A.; Fan, J. Random Forest with Acceptance–Rejection Trees. Comput. Stat. 2020, 35, 983–999. [Google Scholar] [CrossRef]
  61. Simm, J.; Magrans De Abril, I.; Sugiyama, M. Tree-Based Ensemble Multi-Task Learning Method for Classification and Regression. IEICE Trans. Inf. Syst. 2014, 97, 1677–1681. [Google Scholar] [CrossRef] [Green Version]
  62. Liang, T.; Liang, S.; Zou, L.; Sun, L.; Li, B.; Lin, H.; He, T.; Tian, F. Estimation of Aerosol Optical Depth at 30 m Resolution Using Landsat Imagery and Machine Learning. Remote Sens. 2022, 14, 1053. [Google Scholar] [CrossRef]
  63. Taylor, L.A.; Pieters, C.M.; Keller, L.P.; Morris, R.V.; McKay, D.S. Lunar Mare Soils: Space Weathering and the Major Effects of Surface-Correlated Nanophase Fe. J. Geophys. Res. 2001, 106, 27985–27999. [Google Scholar] [CrossRef]
  64. Hörz, F.; Grieve, R.; Heiken, G.; Spudis, P.; Binder, A. Lunar Source Book; Cambridge University Press: Cambridge, UK, 1991. [Google Scholar]
  65. Johnson, J.R.; Larson, S.M.; Singer, R.B. Remote Sensing of Potential Lunar Resources: 1. Near-side Compositional Properties. J. Geophys. Res. Planets 1991, 96, 18861–18882. [Google Scholar] [CrossRef]
  66. Ji, J.; Guo, D.; Liu, J.; Chen, S.; Ling, Z.; Ding, X.; Han, K.; Chen, J.; Cheng, W.; Zhu, K.; et al. The 1:2,500,000-Scale Geologic Map of the Global Moon. Sci. Bull. 2022, S2095927322002316. [Google Scholar] [CrossRef]
  67. Otake, H.; Ohtake, M.; Hirata, N. Lunar Iron and Titanium Abundance Algorithms Based on SELENE (KAGUYA) Multiband Imager Data. In Proceedings of the Annual Lunar and Planetary Science Conference, Woodlands, TX, USA, 19–23 March 2012. [Google Scholar]
  68. Ma, M.; Li, B.; Chen, S.; Lu, T.; Lu, P.; Lu, Y.; Jin, Q. Global Estimates of Lunar Surface Chemistry Derived from LRO Diviner Data. Icarus 2022, 371, 114697. [Google Scholar] [CrossRef]
Figure 2. Flow chart of the Extra-Trees model.
Figure 2. Flow chart of the Extra-Trees model.
Remotesensing 14 05248 g002
Figure 3. Linear correlation coefficients between the contents of the four oxides and LSCC reflectance data.
Figure 3. Linear correlation coefficients between the contents of the four oxides and LSCC reflectance data.
Remotesensing 14 05248 g003
Figure 4. Relationship between the 891 nm reflectance and the (a) FeO and (b) TiO2 contents in LSCC samples with 10–20 μm grain sizes.
Figure 4. Relationship between the 891 nm reflectance and the (a) FeO and (b) TiO2 contents in LSCC samples with 10–20 μm grain sizes.
Remotesensing 14 05248 g004
Figure 5. Correlations between (a) FeO and Al2O3 and (b) TiO2 and SiO2 in the LSCC samples with 10–20 μm grain sizes.
Figure 5. Correlations between (a) FeO and Al2O3 and (b) TiO2 and SiO2 in the LSCC samples with 10–20 μm grain sizes.
Remotesensing 14 05248 g005
Figure 6. Correlation coefficients between spectral differences and chemical abundances. Each point represents the band difference between the x− and y−axes, and the intensity of each point is the correlation between the band difference and chemical abundance.
Figure 6. Correlation coefficients between spectral differences and chemical abundances. Each point represents the band difference between the x− and y−axes, and the intensity of each point is the correlation between the band difference and chemical abundance.
Remotesensing 14 05248 g006
Figure 7. Training and testing accuracy of the Extra-Trees model with chemical abundance based on the original waveband: (a) FeO, (b) TiO2, (c) Al2O3, and (d) SiO2.
Figure 7. Training and testing accuracy of the Extra-Trees model with chemical abundance based on the original waveband: (a) FeO, (b) TiO2, (c) Al2O3, and (d) SiO2.
Remotesensing 14 05248 g007
Figure 8. Training and testing accuracy of the Extra-Trees model with chemical abundance based on feature band selection: (a) FeO, (b) TiO2, (c) Al2O3, and (d) SiO2.
Figure 8. Training and testing accuracy of the Extra-Trees model with chemical abundance based on feature band selection: (a) FeO, (b) TiO2, (c) Al2O3, and (d) SiO2.
Remotesensing 14 05248 g008
Figure 9. Apollo 17 area: (a) IIM 757 nm reflectance image, (b) FeO contents from Lucey et al. [17], and (c) FeO prediction results of the Extra-Trees model based on feature band selection.
Figure 9. Apollo 17 area: (a) IIM 757 nm reflectance image, (b) FeO contents from Lucey et al. [17], and (c) FeO prediction results of the Extra-Trees model based on feature band selection.
Remotesensing 14 05248 g009
Figure 10. Apollo 17 area: (a) IIM 757 nm reflectance image, (b) TiO2 contents from Lucey et al. [17], and (c) TiO2 prediction results of the Extra-Trees model based on feature band selection.
Figure 10. Apollo 17 area: (a) IIM 757 nm reflectance image, (b) TiO2 contents from Lucey et al. [17], and (c) TiO2 prediction results of the Extra-Trees model based on feature band selection.
Remotesensing 14 05248 g010
Figure 11. Apollo 17 area: (a) IIM 891 nm reflectance image, (b) Al2O3, (c) SiO2 prediction results of the Extra-Trees model based on feature band selection.
Figure 11. Apollo 17 area: (a) IIM 891 nm reflectance image, (b) Al2O3, (c) SiO2 prediction results of the Extra-Trees model based on feature band selection.
Remotesensing 14 05248 g011
Figure 12. A 1:2,500,000 geological map [66] of the study area.
Figure 12. A 1:2,500,000 geological map [66] of the study area.
Remotesensing 14 05248 g012
Figure 13. Predicted results of oxide contents in the study area: (a) FeO, (b) TiO2, (c) Al2O3 and (d) SiO2.
Figure 13. Predicted results of oxide contents in the study area: (a) FeO, (b) TiO2, (c) Al2O3 and (d) SiO2.
Remotesensing 14 05248 g013
Figure 14. Smoothed histograms of four oxide contents in the study area.
Figure 14. Smoothed histograms of four oxide contents in the study area.
Remotesensing 14 05248 g014
Figure 15. Distribution of FeO based on different data/methods.
Figure 15. Distribution of FeO based on different data/methods.
Remotesensing 14 05248 g015
Figure 16. Distribution of TiO2 based on different data/methods.
Figure 16. Distribution of TiO2 based on different data/methods.
Remotesensing 14 05248 g016
Figure 17. Two-dimensional scatter plot of the difference between the Clementine FeO and IIM FeO abundances.
Figure 17. Two-dimensional scatter plot of the difference between the Clementine FeO and IIM FeO abundances.
Remotesensing 14 05248 g017
Figure 18. (a) Two-dimensional density plot between IIM FeO and IIM Al2O3 and (b) Two-dimensional density plot between IIM TiO2 and IIM SiO2.
Figure 18. (a) Two-dimensional density plot between IIM FeO and IIM Al2O3 and (b) Two-dimensional density plot between IIM TiO2 and IIM SiO2.
Remotesensing 14 05248 g018
Table 1. Composition data for LSCC soils used in this paper.
Table 1. Composition data for LSCC soils used in this paper.
RegionMissionNumberSize (μm)FeOTiO2Al2O3SiO2
mareApollo 1110,084<1012.007.2515.9042.10
10–2014.707.9413.2041.20
20–4515.508.3012.0041.30
Apollo 1212,001<1012.502.7814.9046.00
10–2015.902.9612.3045.00
20–4516.903.2011.0045.30
12,030<1014.303.0113.9046.20
10–2017.203.3210.7046.30
20–4517.603.7410.5046.10
Apollo 1515,041<1011.001.7916.4046.60
10–2014.401.8813.5046.20
20–4515.202.0312.5046.10
15,071<109.591.5717.1046.90
10–2015.401.8812.9045.70
20–4515.602.3312.4045.80
Apollo 1770,181<1012.706.5415.4041.50
10–2015.507.8812.7040.40
20–4516.008.1111.5040.70
71,061<1014.807.8913.8040.20
10–2017.508.9410.8039.50
20–4518.509.489.3339.20
71,501<1013.508.2714.5040.40
10–2016.409.8311.6039.00
20–4517.8010.709.9438.40
79,221<1011.305.8315.9042.30
10–2015.007.2112.9040.90
20–4515.807.3811.6040.50
highlandApollo 1414,141<107.661.5119.2049.20
10–209.461.7117.2048.40
20–4511.601.9615.0047.20
14,163<108.832.0718.9047.20
10–2010.101.8817.0047.40
20–4511.502.0015.4047.10
14,259<107.822.0219.3047.90
10–209.711.9617.4047.50
20–4511.001.9915.8047.10
14,260<108.101.9419.1047.80
10–209.841.9817.3047.50
20–4510.701.8616.3047.40
Apollo 1661,221<103.640.5028.5044.50
10–204.400.5427.5044.50
20–454.620.5627.2044.50
61,141<103.660.5927.4044.90
10–205.140.6425.6044.60
20–455.150.5826.1044.50
62,231<103.630.5827.4045.00
10–204.860.6126.3044.70
20–455.310.5825.7044.50
64,801<103.840.6127.7044.80
10–204.780.6826.3044.50
20–454.820.6326.5044.60
67,461<103.350.3429.4044.50
10–204.640.3927.8044.10
20–454.930.4427.3044.40
67,481<103.610.4229.1044.50
10–204.040.4028.4044.40
20–455.190.4926.7044.70
Table 2. Statistical parameters of sample set partitioning.
Table 2. Statistical parameters of sample set partitioning.
ElementsTraining SetTest Set
CountMaxMinMeanStdCountMaxMinMeanStd
FeO5718.503.3510.505.011916.403.6610.834.40
TiO25710.700.343.493.21197.940.352.382.41
Al2O35729.409.3318.906.831927.4011.6016.524.96
SiO25749.2038.4044.352.751947.6040.4044.902.31
Table 3. Final screening results for feature bands.
Table 3. Final screening results for feature bands.
OxideClustering
Categories
Number of BandsSelected Band (nm)
FeOCategory 15/91522–583, 631–777, 645–721, and 659–842, and 659–891
Category 29/123513–757, 513–819, 522–659, 551–721, 551–777, 561–842, 606–777, 606–891, and 618–819
Category 33/99532–583, 739–819, and 739–891
TiO2Category 11/81513–631
Category 21/119522–673
Category 33/87606–659, 721–819, and 739–797
Al2O3Category 11/91522–631
Category 24/123513–891, 561–797, 606–757, and 645–842
Category 33/98513–561, 739–891, and 739–819
SiO2Category 110/125513–618, 522–594, 532–583, 541–572, 551–645, 561–673, 572–689, 583–631, 606–659, and 631–757
Category 212/46513–631, 513–673, 522–659, 522–689, 532–645, 551–721, 561–739, 572–721, 572–757, 583–777, 606–739, and 618–777
Category 38/32513–721, 513–777, 522–705, 522–819, 532–739, 551–757, 561–777, and 583–797
Table 4. Comparison of the accuracy of the four chemical abundance tests with the results of Wu [18].
Table 4. Comparison of the accuracy of the four chemical abundance tests with the results of Wu [18].
FeO (wt.%)TiO2 (wt.%)Al2O3 (wt.%)SiO2 (wt.%)
Calibration in this work (Extra-Trees model based on feature band selection)
R21110.974
RMSE0.01200.0320.438
Validation in this work (Extra-Trees model based on feature band selection)
R20.9620.9440.9640.860
RMSE1.0280.6720.9420.897
Calibration by Wu [18]
R20.900.690.920.76
RMSE1.582.001.680.75
Validation by Wu [18]
R20.880.590.900.67
RMSE1.762.261.920.91
Table 5. Correlation coefficients based on different data/methods for FeO inversion.
Table 5. Correlation coefficients based on different data/methods for FeO inversion.
ParametersStatistical Values of Predicted FeO
This WorkClementine [17]IIM (Band Ratio) [30]LP GRS [15]
Average12.5113.9813.815.82
Standard Deviation4.75.616.196.26
Correlation Coefficients10.910.890.9
Table 6. Correlation coefficients based on different data/methods for TiO2 inversion.
Table 6. Correlation coefficients based on different data/methods for TiO2 inversion.
ParametersStatistical Values of Predicted TiO2
This WorkClementine [17]IIM (Band Ratio) [31]LP GRS [15]
Average2.64.12.682.87
Standard Deviation1.362.941.641.67
Correlation Coefficients10.610.650.48
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wu, S.; Chen, J.; Li, L.; Zhang, C.; Huang, R.; Zhang, Q. Quantitative Inversion of Lunar Surface Chemistry Based on Hyperspectral Feature Bands and Extremely Randomized Trees Algorithm. Remote Sens. 2022, 14, 5248. https://doi.org/10.3390/rs14205248

AMA Style

Wu S, Chen J, Li L, Zhang C, Huang R, Zhang Q. Quantitative Inversion of Lunar Surface Chemistry Based on Hyperspectral Feature Bands and Extremely Randomized Trees Algorithm. Remote Sensing. 2022; 14(20):5248. https://doi.org/10.3390/rs14205248

Chicago/Turabian Style

Wu, Shuangshuang, Jianping Chen, Li Li, Cheng Zhang, Rujin Huang, and Quanping Zhang. 2022. "Quantitative Inversion of Lunar Surface Chemistry Based on Hyperspectral Feature Bands and Extremely Randomized Trees Algorithm" Remote Sensing 14, no. 20: 5248. https://doi.org/10.3390/rs14205248

APA Style

Wu, S., Chen, J., Li, L., Zhang, C., Huang, R., & Zhang, Q. (2022). Quantitative Inversion of Lunar Surface Chemistry Based on Hyperspectral Feature Bands and Extremely Randomized Trees Algorithm. Remote Sensing, 14(20), 5248. https://doi.org/10.3390/rs14205248

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop