Next Article in Journal
Running Vacuum and H4 Inflation
Previous Article in Journal
Protoplanet and Proto-Brown Dwarf Clumps in Gravitationally Unstable Protoplanetary Disks of Various Metallicity
Previous Article in Special Issue
Modal Frequency and Damping Identification of the FAST Cabin-Cable System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using Machine Learning for Lunar Mineralogy-I: Hyperspectral Imaging of Volcanic Samples

by
Fatemeh Fazel Hesar
1,2,
Mojtaba Raouf
2,3,4,*,
Peyman Soltani
5,
Bernard Foing
2,3,
Michiel J. A. de Dood
5 and
Fons J. Verbeek
1
1
Leiden Institute of Advanced Computer Science, P.O. Box 9504, 2300 RA Leiden, The Netherlands
2
ILEWG LUNEX-EuroMoonMars Earth-Space, ESTEC European Space Agency, Keplerlaan 1, 2201 AZ Noordwijk, The Netherlands
3
Leiden Observatory, Leiden University, P.O. Box 9513, 2300 RA Leiden, The Netherlands
4
School of Astronomy, Institute for Research in Fundamental Sciences (IPM), Tehran 19395-5746, Iran
5
Huygens-Kamerlingh Onnes Laboratory, Leiden University, Niels Bohnweg 2, 2333 CA Leiden, The Netherlands
*
Author to whom correspondence should be addressed.
Universe 2025, 11(4), 117; https://doi.org/10.3390/universe11040117
Submission received: 31 January 2025 / Revised: 22 March 2025 / Accepted: 28 March 2025 / Published: 2 April 2025
(This article belongs to the Special Issue Planetary Radar Astronomy)

Abstract

:
This study examines the mineral composition of volcanic samples similar to lunar materials, focusing on olivine and pyroxene. Using hyperspectral imaging (HSI) from 400 to 1000 nm, we created data cubes to analyze the reflectance characteristics of samples from Vulcano, a volcanically active island in the Aeolian archipelago, north of Sicily, Italy, categorizing them into nine regions of interest (ROIs) and analyzing spectral data for each. We applied various unsupervised clustering algorithms, including K-Means, hierarchical clustering, Gaussian mixture models (GMMs), and spectral clustering, to classify the spectral profiles. Principal component analysis (PCA) revealed distinct spectral signatures associated with specific minerals, facilitating precise identification. The clustering performance varied by region, with K-Means achieving the highest silhouette score of 0.47, whereas GMMs performed poorly with a score of only 0.25. Non-negative matrix factorization (NMF) aided in identifying similarities among clusters across different methods and reference spectra for olivine and pyroxene. Hierarchical clustering emerged as the most reliable technique, achieving a 94% similarity with the olivine spectrum in one sample, whereas GMMs exhibited notable variability. Overall, the analysis indicated that both the hierarchical and K-Means methods yielded lower errors in total measurements, with K-Means demonstrating superior performance in estimated dispersion and clustering. Additionally, GMMs showed a higher root mean square error (RMSE) compared to the other models. The RMSE analysis confirmed K-Means as the most consistent algorithm across all samples, suggesting a predominance of olivine in the Vulcano region relative to pyroxene. This predominance is likely linked to historical formation conditions similar to volcanic processes on the Moon, where olivine-rich compositions are common in ancient lava flows and impact-melt rocks. These findings provide a deeper context for mineral distribution and formation processes in volcanic landscapes.

1. Introduction

The study of mineral spectra, particularly pyroxene and olivine, is important for understanding the geological composition of celestial bodies like the Moon [1]. This understanding not only characterizes their history and evolution but also enhances our knowledge of their potential resources [2]. For example, the lunar surface predominantly comprises basaltic rocks rich in ferromagnetic minerals such as pyroxene and olivine, which relate to the Moon’s volcanic activity and mantle composition [3]. Olivine-rich compositions are common in ancient lava flows and impact-melt rocks [4]. This is especially true in lunar geology, where olivine plays a crucial role in the Moon’s crust, particularly in areas influenced by ancient volcanic activity and meteorite impacts. These findings enhance our understanding of mineral distribution and formation processes in volcanic landscapes.
The interpretation of lunar data heavily relies on correlations with samples returned from the Moon and analyzed in terrestrial laboratories [5]. Hyperspectral imaging has emerged as a powerful tool for analyzing the mineralogical composition of geological samples. Unlike traditional multispectral imaging, which captures data in a limited number of broad spectral bands, hyperspectral imaging acquires data across hundreds of narrow, contiguous spectral bands. This high spectral resolution enables the detection of subtle variations in mineral composition, making it particularly useful for studying complex geological materials such as pyroxene and olivine [6]. Hyperspectral data, when combined with advanced machine learning techniques, can significantly enhance the accuracy and efficiency of mineral identification and classification. These terrestrial analogs allow for a better understanding of the spectral characteristics of lunar materials under varying conditions. Optical instruments measuring reflected visible-to-near-infrared (VNIR) radiation are sensitive to surface mineralogy due to distinctive absorption features in this wavelength range. Pyroxenes, for instance, display diagnostic absorptions at 1 and 2 µm, while olivine exhibits a combined absorption near 1 µm [7]. These absorption features shift predictably based on mineral composition [8].
Recent advances in machine learning have revolutionized the analysis of hyperspectral data, enabling more accurate and efficient mineral identification. Machine learning algorithms, particularly unsupervised clustering methods, have proven effective in identifying patterns and structures within complex spectral datasets. Techniques such as K-Means clustering, hierarchical clustering, and Gaussian mixture models are widely used to group similar spectral signatures, facilitating the identification of distinct mineral phases [9]. These methods are particularly valuable for analyzing lunar samples, where the spectral signatures of minerals can be subtle and overlapping. The primary objective of this research is to enhance the understanding of lunar geology through detailed mineral analysis, with a particular focus on olivine and pyroxene. This study focuses on a dataset comprising pyroxene and olivine samples collected from Vulcano, Italy, part of the Aeolian Islands, as terrestrial analogs for lunar materials [10]. Such volcanic regions provide valuable insights for optimizing hyperspectral measurements of lunar rocks due to their compositional similarities with basaltic lunar surfaces.
This paper is structured as follows: Section 2 details the methodology employed in our analysis; Section 3 presents the results of our findings; and Section 5 and Section 6 provide a discussion and conclusions of the implications of these results.

2. Data and Methodology

This study utilizes an advanced hyperspectral imaging system to capture detailed spectral and spatial data from volcanic samples. The analysis is based on hyperspectral image cube data obtained from volcanic samples collected in Vulcano, a volcanically active island in the Aeolian archipelago, north of Sicily, Italy, primarily consisting of pyroxenes and olivines. This dataset allows for in-depth examination of the significance of various wavelengths in mineral identification, contributing to our understanding of mineralogy. Moreover, the findings highlight similarities between terrestrial and lunar mineralogy, enhancing our knowledge of these geological materials.

2.1. Hyperspectral Imaging Setup

This system includes a grating-based hyperspectral camera with a CMOS sensor (Specim FX10 - SPECIM, Spectral Imaging Ltd. from Oulu, Finland), halogen lamps, and a 20 cm wide lab scanner. The hyperspectral imager functions as a line-scan camera with 1024 pixels, generating 224 spectral images across a wavelength range of 400–1000 nm, with a spectral full-width at half maximum (FWHM) of 5.5 nm. Functioning as a line-scan camera, it captures 1024 spatial pixels per hyperspectral frame or scanned cross-track line. To ensure accuracy and minimize imaging artifacts, the entire setup was operated in a controlled dark room environment, with samples carefully positioned on the lab scanner platform for imaging. The hyperspectral imaging system employed in this study, based on the Specim FX10, is designed to provide detailed mineral composition of volcanic samples from Vulcano, a volcanically active island in the Aeolian archipelago, north of Sicily, Italy. The hyperspectral imaging system employed in this study enables the identification of subtle spectral features associated with various minerals. As illustrated in Figure 1, the central component of this setup is the grating-based hyperspectral imager (Specim FX10). This capability allows for identifying subtle spectral features associated with various minerals. The system also includes essential components such as halogen lamps for consistent illumination, a motorized transportation plate for precise sample positioning, and a 20 cm wide lab scanner for facilitating the line-scan imaging process. The experimental arrangement in Figure 1 highlights the meticulous organization of the system’s components. The halogen lamps are strategically positioned to ensure optimal lighting, while the transportation plate facilitates the accurate movement of samples under the imaging apparatus. Additionally, preprocessing steps, including dark current correction and radiometric calibration using a white reference panel, enhance the accuracy of the reflectance data collected. This structured approach enables the extraction of reflectance values that are used for analyzing the mineralogical characteristics of the samples.

2.2. Data Cube Creation with Hyperspectral Imaging (HSI)

Hyperspectral imaging transcends traditional imaging by capturing far more detailed spectral information. Instead of a simple RGB (red, green, blue) value per pixel, each spatial location (x, y) within the image is assigned a spectrum of reflectance or radiance values across a wide range of wavelengths. This expansion from a two-dimensional array of pixels to a three-dimensional data cube is what allows for the creation of rich, detailed datasets. Each ‘voxel’ (volume pixel) in this cube holds a complete spectral signature, providing a far more comprehensive representation of the material at that specific location. The spectral dimension ( λ ) of the cube represents the reflection of light measured at various wavelengths, often spanning the visible and near-infrared regions of the electromagnetic spectrum. A total of 448 contiguous wavelength bands are recorded by the camera, which is binned to 224 effective wavelength bands, creating a detailed spectral fingerprint for each location. This detailed spectral information is for distinguishing materials based on their diagnostic spectral features, a technique particularly valuable in fields like geology and mineralogy.
Figure 2 presents a hyperspectral image (400–1000 nm) of volcanic samples from Vulcano, a volcanically active island in the Aeolian archipelago, north of Sicily, Italy, illustrating the effectiveness of hyperspectral imaging in revealing detailed mineral compositions. The RGB color images utilize typical ranges of R: 650 nm, G: 550 nm, and B: 450 nm to approximate true-color visualization, in line with standard practice, highlighting the extracted regions of interest (ROIs). The samples’ extracted regions of interest (ROIs) are highlighted by green numbers overlaid on the image, with detailed spectral analysis presented in the left panel. Nine samples, marked by green numbers, were selected for an in-depth spectral analysis of pyroxene and olivine compositions. The left panel of the figure shows the median spectra for the selected regions depicted in the right panel. Notably, at least 80% of the volcanic samples align with the median spectra for pyroxene and olivine, suggesting that these minerals should ideally be present in all samples, except for regions 5 and 7, which require further analysis. The database template for pyroxene and olivine, as reported by [11], is overlaid in different colors to represent various ranges of olivine–pyroxene compositions. The percentage of olivine (100% pyroxene) composition is indicated with the corresponding color in the figure.

2.3. Significance of Spectral Data

The reflective properties of different minerals and rocks on the Moon, particularly at specific optical wavelengths, enable us to distinguish between them, building on prior studies of pyroxene and olivine’s spectral features [7,8]. This work applies these features to hyperspectral data (400–1000 nm) from Vulcano samples, using machine learning to classify mineral compositions across specific ROIs.
This capability is useful for lunar studies, as understanding the composition of the Moon’s surface can provide its geological history and processes. For instance, the ultraviolet and blue ranges (400–500 nm) are particularly useful for identifying minerals such as pyroxene and olivine, which exhibit diagnostic absorption features in these wavelengths. These minerals are often found in the lunar highlands and are significant indicators of the Moon’s volcanic history. Additionally, the violet range (400–420 nm) aids in studying the presence of impact glasses formed during meteoroid impacts, which are formed when meteoroids strike the lunar surface at high velocities, melting and fusing the surrounding materials.
The green wavelength range (500–550 nm) is significant for detecting iron-bearing minerals like basalt, which helps scientists identify regions rich in basaltic material, predominantly found in the lunar maria. These dark, flat plains were formed by ancient volcanic activity, providing evidence of the Moon’s geological evolution. Furthermore, the yellow range (570–590 nm) is essential for assessing the maturity of lunar regolith, indicating the extent of space weathering and alteration that surface materials have experienced due to continuous bombardment by meteoroids and solar radiation. The orange range (590–630 nm) highlights the presence of iron oxides, such as hematite, which can form due to both meteoroid impacts and solar wind interactions with the lunar surface.
Hyperspectral imaging and analysis of reflectance spectra at specific wavelengths facilitate the identification of several key minerals on the lunar surface, enhancing understanding of the Moon’s geology and its response to meteoroid impacts. Olivine, which is rich in magnesium and iron, is typically detected between 600 and 700 nm and is commonly found in igneous rocks such as basalt and peridotite. Plagioclase feldspar, also appearing in the 600–700 nm range, is prevalent in many igneous rocks formed from the solidification of molten rock. This mineral exhibits a range of colors and a vitreous-to-pearly luster, making it an important component of the lunar crust [5]. Pyroxene minerals display distinct absorption features in both the visible and near-infrared ranges, aiding in their identification in reflectance spectra. These minerals are critical in understanding the thermal history of the Moon [3].

2.4. Data Acquisition and Preprocessing

This study acquired hyperspectral data from volcanic samples in the Vulcano region, Italy, capturing reflectance spectra across 400–1000 nm. The data cubes have a three-dimensional structure of 818 × 1024 × 224, with 224 representing the number of spectral bands. The linear scanner, with a 200 mm width and an exact 200 mm scanning range in the x-direction, captures data where each pixel measures approximately 0.25 mm in the x-dimension (calculated as 200 mm/818 pixels = 0.24 mm) and ∼0.2 mm in the y-dimension (calculated as 200 mm/1024 pixels = 0.195 mm, rounded to 0.2 mm for simplicity).
The pipeline details of the analysis in this study are shown in Figure 3. We implemented a comprehensive preprocessing pipeline for the hyperspectral cube data, focusing on enhancing data integrity and optimizing clustering performance. This process began with normalizing reflectance values using white and dark reference measurements, effectively addressing illumination variations and sensor noise that can obscure spectral features. Calibration steps, including dark current correction and radiometric calibration, ensured accurate spectral analyses by mitigating systematic errors that could influence subsequent clustering results. Additional techniques, such as outlier detection and removal, were employed to eliminate anomalous data points that could skew results. This meticulous approach ensured high-quality, well-structured hyperspectral data, laying a robust foundation for reliable clustering outcomes and enhancing our ability to accurately interpret the mineralogical characteristics of volcanic materials. Data acquisition utilized a line scanning technique, with each sample positioned on a motorized slider table. As the table advanced, the camera captured the spectral data in a sequential line-by-line manner, producing an initial hyperspectral image. Calibration minimized noise and ensured the accuracy of reflectance values, achieved through dark and white reference measurements. The dark reference was established by measuring the signal from the detector when no light was present, while the white reference was obtained using a Teflon whiteboard/Spectralon white reference with 99% reflectance (shown in the bottom right panel of Figure 1). Subsequently, the calibration process normalized the hyperspectral images using the following formula:
Reflectance = ( Raw Dark ) ( White Dark ) ,
where Raw represents the initial signal that includes both the actual light from the sample and any noise or background signals.

2.5. Machine Learning: Clustering Methods

We employed a range of unsupervised machine learning clustering algorithms to analyze hyperspectral data (400–1000 nm) from volcanic samples, aiming to identify mineral compositions, primarily olivine and pyroxene. The procedure involved preprocessing data cubes to ensure quality, followed by applying various clustering methods to group similar spectral signatures. We utilized K-Means for its efficiency in partitioning large datasets, hierarchical clustering for its structured approach through dendrograms, Gaussian mixture models (GMMs) for probabilistic modeling of complex shapes, and spectral clustering for detecting non-convex clusters using eigenvalues. To optimize processing time and reduce redundancy in adjacent spectral channels without data loss, we applied dimensionality reduction techniques such as principal component analysis (PCA) and non-negative matrix factorization (NMF). This comprehensive framework not only preserved key spectral features but also validated clusters against reference spectra, ultimately enhancing our understanding of the mineralogical characteristics present in the samples.
By utilizing multiple clustering approaches, we aimed to extract relevant features from the data, improve analytical accuracy, and gain a deeper the spectral characteristics of the volcanic samples. This multifaceted strategy allowed for a comprehensive exploration of the complexities and variabilities inherent in the data. The use of diverse models reveals different aspects of the dataset, thereby enhancing our understanding of spectral characteristics, as illustrated in the clustering maps. Relying solely on a single method could introduce bias, so employing multiple models validates our findings and ensures consistency in cluster identification across approaches. Given the complexity of hyperspectral data, which often display non-linear relationships and variable cluster shapes, it is essential to apply different algorithms to capture a broader range of structures.
Among the methods utilized, K-Means clustering stands out for its computational efficiency, particularly suited for the large datasets typical of hyperspectral imaging. This algorithm partitions data points into clusters by minimizing within-cluster variance, effectively delineating distinct divisions within the data. This approach is particularly effective for identifying mineralogical features, with a focus on pyroxenes and olivines. Additionally, hierarchical clustering is employed to construct a dendrogram that visually represents relationships among the data. This technique computes distances between clusters using various linkage criteria, offering the hierarchical structure of the mineral compositions. The hierarchical clustering map enriches our understanding of how different mineral types interrelate, revealing patterns that may be obscured by other methods. To further enhance our analysis, GMMs are utilized. GMMs effectively model the data’s probability distribution as a mixture of Gaussian distributions, capturing complex cluster shapes. Lastly, spectral clustering uses the eigenvalues of a similarity matrix, enabling dimensionality reduction while preserving significant data structures. Collectively, these methodologies provide a robust framework for understanding the mineralogical characteristics of volcanic samples.
We employed dimensionality reduction techniques to optimize our data analysis, utilizing PCA to retain significant spectral information while minimizing redundancy and enhancing computational efficiency. Additionally, we explored NMF as a complementary method, which decomposes the dataset into non-negative components that align with the physical interpretation of spectral data. NMF not only aids in reducing dimensionality and managing redundancy but also enhances interpretability by facilitating the identification of specific mineral signatures. Furthermore, it improves the signal-to-noise ratio, allowing for more accurate mineral identification by separating meaningful spectral information from noise. By combining NMF with PCA, we enhance our ability to extract end-member spectra, providing a clearer understanding of the mineralogy in our samples.

PCA Analysis

Principal component analysis (PCA) was applied to the hyperspectral data extracted from various regions of interest within the volcano dataset. The first two principal components (PC1 and PC2) were visualized to assess the underlying structure and variability of the data. Figure 4 illustrates the PCA results for each region, providing information on the homogeneity and distribution of the data. Each point in the scatter plot represents a pixel within the region, colored according to its cluster membership as determined by the clustering method (K-Means, hierarchical, GMM, and spectral), with n = 4 1 clusters. The spread of the points indicates the diversity of spectral characteristics, while the clustering reveals distinct groups of similar spectral signatures. The performance of these clustering methods was evaluated using the silhouette score, which measures how similar an object is to its own cluster compared to other clusters. Additionally, root mean squared error (RMSE) was employed as an error metric to quantify prediction accuracy, aiding in fine-tuning spectral interpretations against instrumental verification.
The first method, K-Means clustering, partitions the data into distinct clusters by minimizing the variance within each cluster. It is widely used due to its simplicity and efficiency, making it suitable for large datasets. For example, in region 8, K-Means achieved a silhouette score of 0.47, the highest among all methods, suggesting excellent cluster separation. Next, we applied hierarchical clustering, which builds a hierarchy of clusters either through a bottom-up (agglomerative) or top-down (divisive) approach. This method provides a dendrogram representation, allowing for a visual exploration of the data’s structure and the number of clusters. In region 7, hierarchical clustering achieved a silhouette score of 0.45, indicating well-separated clusters, the highest among other regions. We also utilized the GMM, a probabilistic model that assumes all data points are generated from a mixture of several Gaussian distributions with unknown parameters. GMM is particularly advantageous for soft clustering, where data points can belong to multiple clusters with different probabilities, making it more flexible than K-Means. Note that in region 8, GMM had the highest silhouette score of 0.47, the same as K-Means, indicating similarity in both methods’ clustering. Finally, we employed spectral clustering, which utilizes the eigenvalues of a similarity matrix to reduce dimensionality before applying a clustering algorithm. This method effectively identifies clusters in non-convex shapes, leveraging the relationships between points in the original high-dimensional space. In region 7, spectral clustering achieved a silhouette score of 0.44, demonstrating moderate cluster separation.

2.6. Non-Negative Matrix Factorization for Similarity Analysis

To quantify the mineral composition of our volcanic samples, we used non-negative matrix factorization (NMF) to measure the similarity between the spectral signatures of clusters and reference spectra for olivine and pyroxene from [12]. NMF is a technique that breaks down complex data, like our hyperspectral reflectance measurements, into simpler, non-negative components. In this study, it helped us determine how much each cluster resembles pure olivine, pure pyroxene, or a mixture of both. We applied NMF to the median spectra of clusters obtained from K-Means, hierarchical, GMMs, and spectral clustering (Section 2.5). The output provided similarity percentages (e.g., 90–100% for olivine in several clusters; see Figure 5), which we visualized in a heatmap to compare the performance of each clustering method. This step enhances our ability to draw parallels with lunar samples, where precise mineral identification is essential. By linking unsupervised clustering results to known mineral spectra, NMF supports our goal of understanding the mineralogical makeup of Vulcano samples and their relevance to lunar geology (see Section 5). In similarity analysis, NMF uncovers patterns among datasets by representing complex relationships in a more interpretable format. By factorizing the user–item interaction matrix, NMF helps identify similarities between different datasets based on their underlying features. This method is especially useful for non-negative data, such as reflectance measurements in spectral analysis. The resulting factor matrices enable the computation of similarity scores, allowing for effective comparison and clustering of similar items or observations.

3. Results

3.1. Mineral Composition Similarity

The vibrancy of the spectral data, depicted in Figure 6, shows the reflectance properties of the minerals (olivine–pyroxene compositions) for the rectangle selected in the first region (size: 120 × 170). The spectra, plotted against wavelength, reveal distinct peaks and troughs indicative of mineral presence. For instance, olivine exhibits characteristic peaks in the near-infrared range, while pyroxene displays unique spectral signatures. The black line serves as a baseline for comparison, enhancing the understanding of mineral distribution across volcanic terrains. We assessed mineral composition similarity, defined as the percentage match between clustered spectra and reference spectra of olivine and pyroxene [12], using NMF (Section 2.6). We focused on these minerals due to their prevalence in lunar basaltic rocks and Vulcano samples. Spectra were compared to establish robust correlations, as shown in Figure 2. This analysis established robust correlations between the average spectra of the selected region and known minerals, specifically focusing on pyroxene and olivine. We utilized reference spectra from [11], representing typical lunar-relevant compositions (e.g., forsterite-rich olivine, enstatite-rich pyroxene), acknowledging that these mineral groups vary in Mg-Fe and Ca-Mg-Fe ratios and may transform over time. These spectra detail the reflectance properties of olivine and pyroxene mixtures, guiding our similarity analysis.
We compared this existing data with the spectra obtained from each cluster in our models to assess the percentage composition of pyroxene and olivine (represented by dashed colored-line spectra in the bottom panels of Figure 6), thereby reporting the similarity proportions (from 1% to 100%) using NMF techniques described in Section 2.6.
The results revealed high similarity scores in certain regions, indicating a significant presence of these minerals. Such findings are critical for interpreting the geological characteristics of the volcano region, as the prevalence of pyroxene and olivine is often associated with specific volcanic processes and conditions. Analyzing these spectral features not only aids in mineral identification but also informs geological interpretations, contributing to a broader understanding of lunar geology. Figure 5 presents a heatmap illustrating the similarity in mineral compositions (olivine percentage) across the four clustering algorithms. The analysis indicates that all methods perform well with pure olivine compositions, achieving similarity percentages above 90% in several clusters. However, when examining mixed compositions of olivine and pyroxene, variability in performance becomes evident. K-Means and hierarchical clustering showed better adaptability to mixed data compared to GMM, which struggled with lower similarity scores in certain cases.

3.2. The Overall Consequences for All Regions

To improve the accuracy of our results, we selected 10 random regions from each of the nine rocks (regions). For each sample, we identified the best performance and highest probability for olivine–pyroxene composition (maximum similarity) as described above in Section 3.1. We recorded the median and standard deviation ( σ ) of the error values from 10 iterations of sub-sampling for each region (1–9). This approach allowed us to quantify the optimal percentage of olivine–pyroxene composition for each region, incorporating all parts of each area. Specifically, in region 1, K-Means indicates 100% olivine with 98.09% similarity, while in region 6, it shows 75% olivine (25% pyroxene) with 95.56% similarity, highlighting distinct mineral distributions. Table 1 provides a comprehensive summary of the olivine-rich percentage across the nine regions, analyzed using four clustering methods: Hierarchical, K-Means, Gaussian mixture model (GMM), and spectral clustering. Hierarchical clustering consistently achieved maximum compositions of 99% in the first two regions, demonstrating its effectiveness in accurately capturing olivine–pyroxene ratios. These ratios were derived by comparing clustered spectra to reference compositions via NMF and averaging over 10 iterations (Table 1).
K-Means also performed admirably, reaching a maximum of 98% in region 1, although it recorded 75% olivine (25% pyroxene) in region 6 with 96% similarity, a similar finding to the spectral method in this region. In contrast, GMM exhibited significant variability, particularly in region 2, where it reported only 25% olivine (i.e., 75% pyroxene) with 98% similarity but a high σ error of ∼13 compared to other methods, highlighting its limitations in handling complex mineral distributions. Statistical metrics further illustrate the strengths and weaknesses of each method. The median values indicate that hierarchical and K-Means clustering yielded similar central tendencies for olivine–pyroxene composition, with K-Means often showing slightly lower median values. Notably, K-Means exhibited lower standard deviations ( σ ), indicating more consistent results across iterations, especially in regions 1 and 4. In region 2, GMM’s high σ of 12.94 underscores its instability in this context, contrasting with the more robust performance of the other methods.

3.3. Evaluation of Methods

The primary evaluation metric employed is the root mean squared error (RMSE), which quantifies the differences between predicted cluster assignments and true labels. The RMSE is calculated as follows:
R M S E = 1 n i = 1 n ( y i y ^ i ) 2 ,
where y i represents the true label and y ^ i denotes the predicted label for each instance. The results indicate varying performance among the models across different clusters, as shown in Table 1. K-Means consistently displayed the lowest RMSE values across most regions, indicating its accuracy in predicting olivine–pyroxene compositions. For instance, K-Means recorded an RMSE of 0.43 in region 5 and 0.38 in region 9, underscoring its precision. Overall, the results highlight the superiority of hierarchical clustering and K-Means in estimating olivine–pyroxene compositions, while GMM’s variability raises concerns about its reliability. These findings enhance our understanding of volcanic regions’ geological characteristics and formation processes.
The bar chart in Figure 7 depicts the RMSE values for the evaluated clustering methods. GMM recorded the highest median RMSE at 0.52, suggesting potential issues with its underlying assumptions about data distribution. The hierarchical and spectral methods followed closely, with a median RMSE of 0.51, while K-Means demonstrated the lowest at 0.50. The standard error bars associated with each method underscore the variability of the RMSE. This research underscores the importance of selecting appropriate clustering algorithms based on the specific characteristics of mineral compositions.

4. Comparison of Lunar Volcanism Minerals

Our analysis suggests a predominance of olivine in the Vulcano region, potentially linked to historical conditions akin to lunar volcanic processes. These findings resonate with the conclusions of [13,14], who documented the origins of mare lavas on the Moon and noted the significance of olivine-rich compositions, emphasizing that both studies recognize the importance of olivine in volcanic activity. Ref. [15] provides valuable insights into lunar mineralogy by mapping ilmenite abundance using data from the Chandrayaan-1 M3 mission. The study’s findings on the relationship between ilmenite and TiO2 abundances can be related to our study, which focuses on olivine and pyroxene in volcanic samples analogous to lunar materials. Both studies highlight the significance of specific minerals in understanding the Moon’s geological history and composition. The methodologies used in [15] for analyzing absorption features parallel our approach with hyperspectral imaging, emphasizing the importance of spectral data in mineral identification. Furthermore, their conclusion that titanium is not exclusively found in ilmenite suggests broader implications for understanding other titanium-bearing minerals, which may relate to the mineral composition we observe in our Vulcano samples. Ref. [16] focuses on the quantitative estimation of elemental concentrations on the lunar surface using hyperspectral near-infrared (NIR) imaging. The study aims to map the abundances of iron (Fe), calcium (Ca), and magnesium (Mg) with a high accuracy using data from the Moon Mineralogy Mapper (M3) on the Chandrayaan-1 mission. By analyzing the NIR reflectance of the lunar regolith, the study develops a robust combination of spectral parameters that account for soil maturity effects, calibrating these parameters against elemental abundances from the Lunar Prospector Gamma Ray Spectrometer (LP GRS) and the Kaguya GRS (KGRS). The results yield global elemental maps that align well with previous data, providing critical insights into lunar geology and evolution. This study relates to our findings by reinforcing the importance of hyperspectral imaging and spectral data analysis in understanding lunar mineralogy. Both our research and that in [16] utilize advanced techniques to extract meaningful mineralogical information from spectral data, ultimately enhancing our understanding of the Moon’s surface composition. That study’s emphasis on minimizing artifacts from space weathering aligns with our goal of accurately identifying olivine and pyroxene in volcanic analogs. Additionally, the elemental maps generated in [16] could complement our mineral composition analyses, offering a more comprehensive picture of the processes that shaped both lunar and terrestrial volcanic landscapes. Additionally, our results align with insights from [17], who employed machine learning techniques to cluster lunar mineral data from the Moon Mineral Mapper (M3) [15] and in [16], highlighting the effectiveness of K-Means in spectral classification. Their focus on global mineral distribution complements our localized analysis, providing a broader context for understanding lunar analogs. Ref. [18] further contributes to this discourse by exploring the spectral characteristics of iron-bearing minerals and offering methods for distinguishing between olivine and pyroxene based on diagnostic absorption features. Together, these studies advance our comprehension of mineral distribution and formation mechanisms in both terrestrial and lunar volcanic landscapes, enhancing our understanding of the similarities between volcanic processes on Earth and the Moon.
Future research should refine mineral definitions and test the methodology across a wider range of spectra to improve its applicability to lunar geology. Our forthcoming second paper will directly compare lunar observations and samples, providing insights into the relationship between terrestrial and lunar mineralogy.

5. Discussion

This study conducted a thorough investigation into the mineralogical composition of volcanic samples from Vulcano, a volcanically active island in the Aeolian archipelago, north of Sicily, Italy, with a particular focus on the unique spectral characteristics of each sample. By systematically selecting representative sections and dividing them into nine distinct regions of interest, we effectively computed average spectral data that captured the mineralogical diversity present in the samples. This structured approach provided a nuanced understanding of variations in mineral compositions across different areas of the volcanic material. The results underscore the efficacy of various clustering algorithms applied to the analysis of olivine and pyroxene compositions. Using principal component analysis (PCA) as a preliminary step, we successfully reduced the dimensionality of complex spectral data obtained from volcanic samples, which are terrestrial analogs to lunar minerals. This reduction facilitated clearer identification of patterns in mineral compositions, as illustrated in Figure 4. The clustering algorithms assessed included K-Means, hierarchical clustering, Gaussian mixture model (GMM), and spectral clustering. The performance of these algorithms varied across different regions, as indicated by silhouette scores ranging from 0 to 1, with higher scores suggesting better-defined clusters.
K-Means clustering effectively partitions data into distinct clusters by minimizing variance within each cluster, and it is favored for its simplicity and efficiency with large datasets. For instance, in region 8, K-Means achieved a silhouette score of 0.47, the highest among all methods, indicating excellent cluster separation. Hierarchical clustering, which builds a hierarchy of clusters through either a bottom-up or top-down approach, provided a visual representation of data structure in a dendrogram format. In region 7, this method achieved a silhouette score of 0.45, reflecting well-separated clusters. The Gaussian mixture model (GMM), a probabilistic model assuming that data points arise from a mixture of several Gaussian distributions, showed flexibility in clustering. Notably, in region 8, GMM matched K-Means with a silhouette score of 0.47, indicating similar clustering performance. Finally, spectral clustering, which utilizes eigenvalues of a similarity matrix for dimensionality reduction before clustering, achieved a score of 0.44 in region 7, demonstrating moderate cluster separation.
Consequently, this study demonstrated the efficacy of hyperspectral imaging (400–1000 nm) combined with unsupervised clustering algorithms to analyze volcanic samples from Vulcano, Italy, as analogs to lunar materials. K-Means and hierarchical clustering outperformed GMM and spectral clustering in identifying olivine and pyroxene compositions, with K-Means achieving the lowest RMSE considering the scatter error. Our findings suggest a predominance of olivine, consistent with lunar volcanic processes, enhancing our understanding of mineral distribution. Future work should refine mineral definitions and test the methodology against diverse spectra to strengthen its applicability to lunar geology. Additionally, our second paper in this series will provide a direct comparison with lunar observations and samples, contributing valuable insights into the relationship between terrestrial and lunar mineralogy.

6. Conclusions

Overall, the analysis indicates that all methods performed well with pure olivine compositions, achieving similarity percentages above 90% in several clusters. However, when examining mixed compositions of olivine and pyroxene, performance variability became evident. K-Means and hierarchical clustering displayed better adaptability to mixed data, while GMM struggled with lower similarity scores in certain instances. These findings reveal significant details about the mineralogical composition of volcanic samples from the region, particularly in identifying lunar minerals like pyroxene and olivine [19]. The strong correlations established between spectral data and these minerals enhance our understanding of the region’s volcanic activity and its geological evolution over time, with implications for lunar geology [20]. Our findings indicate a predominance of olivine in the Vulcano region, likely linked to volcanic processes akin to those on the Moon, as highlighted by [13]. In ref. [15], they map ilmenite abundance using Chandrayaan-1 M3 data, offering insights into lunar mineralogy that complement our focus on olivine and pyroxene. Both studies emphasize the importance of specific minerals in understanding the Moon’s geological history. Likewise, ref. [16] quantitatively estimates elemental concentrations on the lunar surface using hyperspectral NIR imaging, reinforcing the significance of spectral data for mineral identification. The study’s global elemental maps enhance our understanding of lunar composition and align with our findings on volcanic analogs. In terms of mineral identification, the analysis predominantly recognized olivine in the samples, though some methods also detected pyroxene. Olivine-rich compositions are common in ancient lava flows and impact-melt rocks, as noted in previous studies [4,21]. This is relevant to lunar geology, where olivine is a significant component of the Moon’s crust, especially in areas shaped by ancient volcanic activity and meteorite impacts. Together, these studies deepen our understanding of mineral distribution and formation mechanisms in both terrestrial and lunar landscapes, underscoring the similarities in volcanic processes on Earth and the Moon.
The findings also highlight that the choice of clustering algorithm significantly impacts performance metrics; specifically, K-Means is optimal for certain clusters, while hierarchical and spectral methods excel in others. In the literature, Support Vector Machines (SVMs) have demonstrated superiority for extraterrestrial bodies [22], particularly when combined with Joint Mutual Information Maximization (JMIM) for higher accuracy [23]. While our unsupervised approach suits exploratory analysis, future work could integrate SVM and JMIM to validate clustering results against labeled data, enhancing real-world applicability.

Author Contributions

Conceptualization: F.F.H.; methodology: F.F.H. and M.R.; software: F.F.H. and M.R.; validation: M.R., B.F. and F.J.V.; formal analysis: F.F.H.; investigation: F.F.H.; resources: M.R. and P.S.; data curation: M.R., P.S. and M.J.A.d.D.; writing—original draft preparation: F.F.H. and M.R.; writing—review and editing: M.R. and M.J.A.d.D.; visualization: F.F.H.; supervision: M.R., F.J.V. and B.F.; project administration: F.F.H. and M.R.; funding acquisition: F.F.H. and B.F. All authors have read and agreed to the published version of the manuscript.

Funding

P.S. and M.J.A.D. acknowledge support from the Dutch National Science Agenda under project NWA.1418.22.022 which is made possible by financial support of the Dutch Research Council (NWO).

Data Availability Statement

The data underlying this article will be shared on reasonable request to the corresponding author.

Acknowledgments

F.F.H. and M.R. would like to express heartfelt appreciation to EuroSpaceHub and LUNEX EuroMoonMars Earth Space Innovation for their generous funding and unwavering support. We sincerely thank Esther Cheng and Chenming Zhou from VGCC School for their assistance in collecting Vulcano samples and supporting the analysis.

Conflicts of Interest

The authors declare no conflicts of interest.

Note

1
The choice of n = 4 clusters is optimal, as both performance metrics and visual assessments indicate superior clustering quality compared to other cluster numbers.

References

  1. Taylor, L.A.; Pieters, C.M.; Keller, L.P.; Morris, R.V.; McKay, D.S. Lunar mare soils: Space weathering and the major effects of surface-correlated nanophase Fe. J. Geophys. Res. Planets 2001, 106, 27985–27999. [Google Scholar]
  2. Jaumann, R.; Hiesinger, H.; Anand, M.; Crawford, I.; Wagner, R.; Sohl, F.; Jolliff, B.; Scholten, F.; Knapmeyer, M.; Hoffmann, H.; et al. Geology, geochemistry, and geophysics of the Moon: Status of current understanding. Planet. Space Sci. 2012, 74, 15–41. [Google Scholar] [CrossRef]
  3. Moriarty, D.P.; Dygert, N.; Valencia, S.N.; Watkins, R.N.; Petro, N.E. The search for lunar mantle rocks exposed on the surface of the Moon. Nat. Commun. 2021, 12, 4659. [Google Scholar] [CrossRef]
  4. McSween, H.Y.; Wyatt, M.B.; Gellert, R.; Bell III, J.F.; Morris, R.V.; Herkenhoff, K.E.; Crumpler, L.S.; Milam, K.A.; Stockstill, K.R.; Tornabene, L.L.; et al. Characterization and petrologic interpretation of olivine-rich basalts at Gusev Crater, Mars. J. Geophys. Res. Planets 2006, 111. [Google Scholar] [CrossRef]
  5. Pieters, C.M. Composition of the lunar highland crust from near-infrared spectroscopy. Rev. Geophys. 1986, 24, 557–578. [Google Scholar] [CrossRef]
  6. van der Meer, F. Remote-sensing image analysis and geostatistics. Int. J. Remote Sens. 2012, 33, 5644–5676. [Google Scholar] [CrossRef]
  7. Adams, J.B. Visible and near-infrared diffuse reflectance spectra of pyroxenes as applied to remote sensing of solid objects in the solar system. J. Geophys. Res. 1974, 79, 4829–4836. [Google Scholar] [CrossRef]
  8. Cloutis, E.A.; Gaffey, M.J. Pyroxene spectroscopy revisited - Spectral-compositional correlations and relationship to geothermometry. J. Geophys. Res. Planets 1991, 96, 22809–22826. [Google Scholar] [CrossRef]
  9. Plaza, A.; Plaza, J.; Paz, A.; Sánchez, S. Parallel Hyperspectral Image and Signal Processing [Applications Corner]. IEEE Signal Process. Mag. 2011, 28, 119–126. [Google Scholar] [CrossRef]
  10. Spampinato, L.; Calvari, S.; Oppenheimer, C.; Boschi, E. Volcano surveillance using infrared cameras. Earth Sci. Rev. 2011, 106, 63–91. [Google Scholar] [CrossRef]
  11. Mandon, L.; Beck, P.; Quantin-Nataf, C.; Dehouck, E.; Thollot, P.; Loizeau, D.; Volat, M. ROMA: A Database of Rock Reflectance Spectra for Martian In Situ Exploration. Earth Space Sci. 2022, 9, e01871. [Google Scholar] [CrossRef] [PubMed]
  12. Korda, D.; Penttilä, A.; Klami, A.; Kohout, T. Neural network for determining an asteroid mineral composition from reflectance spectra. Astron. Astrophys. 2023, 669, A101. [Google Scholar] [CrossRef]
  13. Head, J.W., III. Lunar volcanism in space and time. Rev. Geophys. Space Phys. 1976, 14, 265–300. [Google Scholar] [CrossRef]
  14. Corley, L.M.; McGovern, P.J.; Kramer, G.Y.; Lemelin, M.; Trang, D.; Gillis-Davis, J.J.; Taylor, G.J.; Powell, K.E.; Kiefer, W.S.; Wieczorek, M.; et al. Olivine-bearing lithologies on the Moon: Constraints on origins and transport mechanisms from M3 spectroscopy, radiative transfer modeling, and GRAIL crustal thickness. Icarus 2018, 300, 287–304. [Google Scholar] [CrossRef]
  15. Surkov, Y.; Shkuratov, Y.; Kaydash, V.; Korokhin, V.; Videen, G. Lunar ilmenite content as assessed by improved Chandrayaan-1 M3 data. Icarus 2020, 341, 113661. [Google Scholar] [CrossRef]
  16. Bhatt, M.; Wöhler, C.; Grumpe, A.; Hasebe, N.; Naito, M. Global mapping of lunar refractory elements: Multivariate regression vs. machine learning. Astron. Astrophys. 2019, 627, A155. [Google Scholar] [CrossRef]
  17. Thoresen, F.; Drozdovskiy, I.; Cowley, A.; Laban, M.; Besse, S.; Blunier, S. Insights into Lunar Mineralogy: An Unsupervised Approach for Clustering of the Moon Mineral Mapper (M3) spectral data. arXiv 2024, arXiv:2411.03186. [Google Scholar] [CrossRef]
  18. Zhang, X.; Cloutis, E. Near-infrared Spectra of Lunar Ferrous Mineral Mixtures. Earth Space Sci. 2021, 8, e2020EA001153. [Google Scholar] [CrossRef]
  19. Clénet, H.; Pinet, P.; Daydou, Y.; Heuripeau, F.; Rosemberg, C.; Baratoux, D.; Chevrel, S. A new systematic approach using the Modified Gaussian Model: Insight for the characterization of chemical composition of olivines, pyroxenes and olivine–pyroxene mixtures. Icarus 2011, 213, 404–422. [Google Scholar] [CrossRef]
  20. Dhingra, D. The New Moon: Major Advances in Lunar Science Enabled by Compositional Remote Sensing from Recent Missions. Geosciences 2018, 8, 498. [Google Scholar] [CrossRef]
  21. Staid, M.I.; Pieters, C.M.; Besse, S.; Boardman, J.; Dhingra, D.; Green, R.; Head, J.W.; Isaacson, P.; Klima, R.; Kramer, G.; et al. The mineralogy of late stage lunar volcanism as observed by the Moon Mineralogy Mapper on Chandrayaan-1. J. Geophys. Res. Planets 2011, 116. [Google Scholar] [CrossRef]
  22. Sabat-Tomala, A.; Raczko, E.; Zagajewski, B. Comparison of Support Vector Machine and Random Forest Algorithms for Invasive and Expansive Species Classification Using Airborne Hyperspectral Data. Remote Sens. 2020, 12, 516. [Google Scholar] [CrossRef]
  23. Bhatt, P.; Maclean, A.; Dickinson, Y.; Kumar, C. Fine-Scale Mapping of Natural Ecological Communities Using Machine Learning Approaches. Remote Sens. 2022, 14, 563. [Google Scholar] [CrossRef]
Figure 1. Overview of the hyperspectral imaging setup. (Left) The hyperspectral imaging system (HSI) with labeled light sources (halogen lamps) and components. (Top right) Diagram of the experimental arrangement, clearly annotating the area camera, spectrograph, halogen lamps, transportation plate with sample, and connected computer for data acquisition. (Bottom right) Close-up view of a sample under illumination. The white reference was obtained using a Teflon whiteboard/Spectralon reference with 99% reflectance.
Figure 1. Overview of the hyperspectral imaging setup. (Left) The hyperspectral imaging system (HSI) with labeled light sources (halogen lamps) and components. (Top right) Diagram of the experimental arrangement, clearly annotating the area camera, spectrograph, halogen lamps, transportation plate with sample, and connected computer for data acquisition. (Bottom right) Close-up view of a sample under illumination. The white reference was obtained using a Teflon whiteboard/Spectralon reference with 99% reflectance.
Universe 11 00117 g001
Figure 2. (Right) The image displays the RGB reflectance spectra (combining 450, 550, and 650 nm) for nine distinct regions from Vulcano, a volcanically active island in the Aeolian archipelago, north of Sicily, Italy, covering the wavelength range of 400–1000 nm. (Left) The average spectral responses of these regions reveal variations in reflectance attributed to differences in mineral composition, particularly the amounts of olivine and pyroxene present. The percentage reported for each spectrum represented by a dashed colored line indicates the proportion of olivine in the olivine–pyroxene composition template from [11]. The solid colored lines represent the median spectrum for each region, labeled with specific numbers in the right panel.
Figure 2. (Right) The image displays the RGB reflectance spectra (combining 450, 550, and 650 nm) for nine distinct regions from Vulcano, a volcanically active island in the Aeolian archipelago, north of Sicily, Italy, covering the wavelength range of 400–1000 nm. (Left) The average spectral responses of these regions reveal variations in reflectance attributed to differences in mineral composition, particularly the amounts of olivine and pyroxene present. The percentage reported for each spectrum represented by a dashed colored line indicates the proportion of olivine in the olivine–pyroxene composition template from [11]. The solid colored lines represent the median spectrum for each region, labeled with specific numbers in the right panel.
Universe 11 00117 g002
Figure 3. Hyperspectral data analysis pipeline for volcanic samples. This figure illustrates the systematic approach used to process and analyze hyperspectral images, including calibration, normalization, and the application of various clustering algorithms. The pipeline ultimately predicts the mineral composition, specifically focusing on pyroxene and olivine, enhancing our understanding of volcanic material characteristics.The color blocks represent clusters for each method, while the lines in the top right diagram show the spectra for each pixel in the selected region.
Figure 3. Hyperspectral data analysis pipeline for volcanic samples. This figure illustrates the systematic approach used to process and analyze hyperspectral images, including calibration, normalization, and the application of various clustering algorithms. The pipeline ultimately predicts the mineral composition, specifically focusing on pyroxene and olivine, enhancing our understanding of volcanic material characteristics.The color blocks represent clusters for each method, while the lines in the top right diagram show the spectra for each pixel in the selected region.
Universe 11 00117 g003
Figure 4. The visualization presents the PCA-based clustering results for volcanic hyperspectral data across nine regions (regions 1–9), evaluated using four methods: K-Means, hierarchical, Gaussian mixture model (GMM), and spectral clustering, with n = 4 clusters. Each panel corresponds to a specific region and clustering technique, accompanied by silhouette scores to assess cluster quality (with higher scores indicating better-defined clusters). Notable findings include strong clustering performance in region 8 (K-Means: 0.47) and region 8 (GMM: 0.47), while weaker performance is observed in region 1 (spectral: 0.21) and region 6 (spectral: 0.27).
Figure 4. The visualization presents the PCA-based clustering results for volcanic hyperspectral data across nine regions (regions 1–9), evaluated using four methods: K-Means, hierarchical, Gaussian mixture model (GMM), and spectral clustering, with n = 4 clusters. Each panel corresponds to a specific region and clustering technique, accompanied by silhouette scores to assess cluster quality (with higher scores indicating better-defined clusters). Notable findings include strong clustering performance in region 8 (K-Means: 0.47) and region 8 (GMM: 0.47), while weaker performance is observed in region 1 (spectral: 0.21) and region 6 (spectral: 0.27).
Universe 11 00117 g004
Figure 5. Comparative analysis of mineral composition (the percentage of olivine (100% pyroxene)) similarity across clustering models using NMF. This heatmap illustrates the effectiveness of the four clustering algorithms—K-Means, hierarchical, Gaussian mixture model (GMM), and spectral clustering—in grouping mineral compositions, specifically olivine (O) and pyroxene (P), across different clusters (C1–C4). Each cell represents the similarity percentage for specific mineral mixtures, with color gradients indicating the strength of mineral presence from high (red) to low (blue). The results highlight the unique clustering tendencies of each algorithm.
Figure 5. Comparative analysis of mineral composition (the percentage of olivine (100% pyroxene)) similarity across clustering models using NMF. This heatmap illustrates the effectiveness of the four clustering algorithms—K-Means, hierarchical, Gaussian mixture model (GMM), and spectral clustering—in grouping mineral compositions, specifically olivine (O) and pyroxene (P), across different clusters (C1–C4). Each cell represents the similarity percentage for specific mineral mixtures, with color gradients indicating the strength of mineral presence from high (red) to low (blue). The results highlight the unique clustering tendencies of each algorithm.
Universe 11 00117 g005
Figure 6. (Top) Maps illustrating the effectiveness of the four clustering algorithms—K-Means, hierarchical, Gaussian mixture model (GMM), and spectral clustering—in grouping mineral compositions into n = 4 clusters within a selected 140 × 190-pixel area of volcano region 1. The color blocks represent clusters for each clustering method. (Bottom) Comparative visualization of olivine and pyroxene compositions (dashed lines, see Figure 2) across clustering models, shown by solid lines, with shading representing the 1- σ error.
Figure 6. (Top) Maps illustrating the effectiveness of the four clustering algorithms—K-Means, hierarchical, Gaussian mixture model (GMM), and spectral clustering—in grouping mineral compositions into n = 4 clusters within a selected 140 × 190-pixel area of volcano region 1. The color blocks represent clusters for each clustering method. (Bottom) Comparative visualization of olivine and pyroxene compositions (dashed lines, see Figure 2) across clustering models, shown by solid lines, with shading representing the 1- σ error.
Universe 11 00117 g006
Figure 7. The bar chart presents the median root mean squared error (RMSE) values for four clustering methods with standard deviation error: GMM, hierarchical, K-Means, and spectral.
Figure 7. The bar chart presents the median root mean squared error (RMSE) values for four clustering methods with standard deviation error: GMM, hierarchical, K-Means, and spectral.
Universe 11 00117 g007
Table 1. A comparative analysis was conducted on the various clustering methods—hierarchical, K-Means, Gaussian mixture model (GMM), and spectral clustering—to evaluate the olivine–pyroxene composition across nine distinct regions over 10 iterations. The analysis reports key metrics, including maximum similarity, median values, and standard deviation ( σ ) for each region and method. Additionally, the root mean square error (RMSE) values are provided in the final column for all methods.
Table 1. A comparative analysis was conducted on the various clustering methods—hierarchical, K-Means, Gaussian mixture model (GMM), and spectral clustering—to evaluate the olivine–pyroxene composition across nine distinct regions over 10 iterations. The analysis reports key metrics, including maximum similarity, median values, and standard deviation ( σ ) for each region and method. Additionally, the root mean square error (RMSE) values are provided in the final column for all methods.
RegionMethodOlivine (100% Pyroxene)Max SimilarityMedian σ RMSE
1Hierarchical100.0099.6697.363.300.54
1GMM100.0097.8393.641.780.55
1K-Means100.0098.0993.794.120.52
1Spectral90.0097.6995.620.800.53
2Hierarchical100.0099.2089.195.320.51
2GMM25.0097.5579.2512.940.54
2K-Means100.0097.0888.944.080.53
2Spectral100.0094.7780.191.980.52
3Hierarchical100.0085.8479.543.20.53
3GMM100.0087.9978.631.490.50
3K-Means100.0092.7181.351.530.48
3Spectral100.0094.7681.926.270.50
4Hierarchical100.0096.3095.400.540.50
4GMM90.0097.6989.664.210.52
4K-Means100.0095.5889.844.050.53
4Spectral90.0096.3393.045.280.49
5Hierarchical100.0084.5072.603.720.46
5GMM100.0079.7676.652.200.44
5K-Means100.0084.3681.102.040.43
5Spectral100.0084.7079.352.940.45
6Hierarchical100.0097.3976.522.620.50
6GMM100.0093.1077.132.150.51
6K-Means75.0095.5695.560.000.54
6Spectral75.0096.8996.890.000.55
7Hierarchical100.0094.6483.243.810.52
7GMM100.0094.6487.132.300.50
7K-Means100.0089.8586.571.400.49
7Spectral100.0094.4783.712.610.54
8Hierarchical90.0093.7988.992.180.52
8GMM90.0096.4391.101.830.55
8K-Means75.0095.6886.096.780.57
8Spectral90.0095.4694.282.270.52
9Hierarchical100.0090.2475.572.910.38
9GMM100.0090.2872.002.760.40
9K-Means100.0090.2871.452.850.38
9Spectral100.0090.7273.902.920.36
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fazel Hesar, F.; Raouf, M.; Soltani, P.; Foing, B.; de Dood, M.J.A.; Verbeek, F.J. Using Machine Learning for Lunar Mineralogy-I: Hyperspectral Imaging of Volcanic Samples. Universe 2025, 11, 117. https://doi.org/10.3390/universe11040117

AMA Style

Fazel Hesar F, Raouf M, Soltani P, Foing B, de Dood MJA, Verbeek FJ. Using Machine Learning for Lunar Mineralogy-I: Hyperspectral Imaging of Volcanic Samples. Universe. 2025; 11(4):117. https://doi.org/10.3390/universe11040117

Chicago/Turabian Style

Fazel Hesar, Fatemeh, Mojtaba Raouf, Peyman Soltani, Bernard Foing, Michiel J. A. de Dood, and Fons J. Verbeek. 2025. "Using Machine Learning for Lunar Mineralogy-I: Hyperspectral Imaging of Volcanic Samples" Universe 11, no. 4: 117. https://doi.org/10.3390/universe11040117

APA Style

Fazel Hesar, F., Raouf, M., Soltani, P., Foing, B., de Dood, M. J. A., & Verbeek, F. J. (2025). Using Machine Learning for Lunar Mineralogy-I: Hyperspectral Imaging of Volcanic Samples. Universe, 11(4), 117. https://doi.org/10.3390/universe11040117

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop