Highlights
What are the main findings?
- We present a machine learning framework to map the spatial distribution of minerals on Mars.
- Our framework utilises the Self-Organising Map and k-means clustering to identify clusters of spectral signatures of minerals.
What are the implications of the main findings?
- Our framework can identify the spatial distribution of minerals on Mars, even in complex spectral environments.
- The SOM model output is interpretable and provides intuitive guidance for mineral exploration.
Abstract
Planetary exploration missions have acquired a growing amount of remote sensing data, offering a reliable basis for studying the geological evolution of planetary bodies such as Mars. In recent years, machine learning models have emerged as powerful tools for remote sensing by providing scalable and adaptive solutions for planetary science. We present a machine learning approach to map the spatial distribution of minerals on Mars, representing a step toward large-scale automated mineral mapping. Although existing CRISM dimensionality reduction methods are useful, the feature space remains high-dimensional, and relying on RGB overlays limits the ability to preserve and detect complex relationships, increasing the risk of missing important spectral patterns. Our framework utilises the Self-Organising Map (SOM) model and k-means clustering to identify clusters of spectral signatures, which may correspond to distinct minerals. It reduces dimensionality to a two-dimensional grid while preserving key high-dimensional patterns and relationships, providing a more reliable and interpretable basis for semi-automated analysis than RGB overlays. Although the clusters can be labelled by referencing a spectral library, our framework does not require labelled data and can operate in an unsupervised manner. The framework retains full spectral dimensionality of input features. The results indicate that our framework can identify the spatial distribution of minerals on Mars, even in complex spectral environments with overlapping features. Moreover, the SOM model output is interpretable rather than a black box, providing intuitive guidance for mineral exploration when applied in a semi-automated workflow.
1. Introduction
Since the start of its mission in 2006, the Compact Reconnaissance Imaging Spectrometer for Mars (CRISM) [1] has delivered high-resolution hyperspectral observations of the Martian surface. Hyperspectral sensors capture data across hundreds of spectral bands, which have enabled better mineral identification than multi-spectral techniques [2]. This advancement, combined with the expanding availability of hyperspectral datasets, has led to a growing body of machine learning research focused on hyperspectral remote sensing [3,4]. The CRISM Targeted Empirical Record (TER) product suite, released in 2016, provides atmospherically and photometrically corrected hyperspectral data for targeted regions of Mars. This data offers valuable information on the composition and spatial distribution of surface minerals, thereby contributing to our understanding of the planet’s geological history and past environmental conditions [5,6,7].
The value of hyperspectral data for mineralogical investigation is not unique to Mars, with similar frameworks applied in other planetary contexts. For example, the Moon Mineralogy Mapper is a state of-the-art imaging spectrometer designed to provide high-spectral-resolution data across large areas of the Moon’s surface, with the goal of identifying mineral and rock compositions [8]. A comprehensive understanding of the lunar surface is essential in preparation for future crewed Artemis missions [9] that aim to establish a long-term presence on the Moon [10]. On Earth, terrestrial mineral exploration has exploited hyperspectral data from airborne and satellite sensors, demonstrating the value of such workflows for geological mapping [11]. Furthermore, Thomas et al. [12] highlighted the application of hyperspectral infrared analysis in the study of hydrothermal alteration, underscoring the broad applicability of spectral interpretation methods across planetary surfaces.
The extraction of meaningful spectral–spatial information from hyperspectral data has traditionally relied on expert judgment and manual processes [3,13], which may not be suitable for large-scale mineral mapping due to the risk of error or omission [6]. For targeted CRISM observations, which contain 544 spectral bands, this challenge is compounded by dimensionality. Summary products and RGB overlays have been developed to reduce data into a more manageable form and enable rapid mineral detection [14], particularly when spectral signatures in a region align well with the chosen overlays. However, these approaches still leave a high-dimensional feature space, and because only three dimensions are visualised at a time, higher-order relationships are lost and subtle or overlapping features may be overlooked, leaving the same risks of error or omission unresolved. Challenges such as these are a key reason machine learning and deep learning methods have emerged as valuable tools in remote sensing [2,15], owing to their ability to capture complex, nonlinear relationships in high-dimensional remote sensing data [15]. Deep learning methods, in particular, have enabled automated feature extraction and improved classification performance [3,4]. For example, Shirmard et al. [16] utilised Convolutional Neural Networks (CNNs) and multispectral remote sensing data to classify lithological units, with almost all test samples predicted correctly, a task that can require a large number of participants and resources if performed manually.
Hyperspectral remote sensing on Mars faces many of the same challenges experienced with terrestrial and lunar datasets, including issues such as size, high dimensionality, noise, atmospheric effects [17], and spatial variability of spectral signatures [11]. Additional complications include coarse spatial resolution, spectral ambiguity from overlapping or subdued absorption features, and surface contaminants such as dust cover and complex mixtures [7,13,18,19,20]. However, the absence of extensive validation data is more distinctive in the case of Mars, which makes Mars a valuable test bed for evaluating advanced data-driven methods in sparse and noise-prone environments. The scarcity of labelled data limits the effectiveness of supervised approaches and motivates the exploration of alternative strategies such as transfer learning and unsupervised methods for geospatial analysis. However, these strategies do not always generalise well to the target domain [21] and still benefit from some labelled data for fine-tuning purposes [22]. In this context, unsupervised learning—particularly clustering-based approaches—have become alternatives for hyperspectral data analysis [13].
Self-Organising Maps (SOMs) [23] are unsupervised neural networks that can be used for the clustering of high-dimensional data onto a two-dimensional grid while preserving topological relationships [23,24,25]. SOMs have been applied across a variety of remote sensing and Earth science contexts. SOMs have been used to cluster hyperspectral data from Baffin Island, achieving reliable differentiation of spectral features and outperforming other techniques in the presence of substantial noise [26]. In geoscientific studies, SOMs have shown promise in clustering geochemical data to identify prospective gold deposits in northern Finland [27]. More recently, SOMs have been employed to identify potential landing sites near the lunar South Pole by integrating multi-source remote sensing data, revealing spatial patterns constrained by scientific and engineering considerations [28]. These examples highlight the versatility of SOMs as a tool for unsupervised pattern recognition and feature extraction in remote sensing, geological, and planetary science research.
In this study, we present an unsupervised machine learning framework that applies SOM and k-means clustering to group spectral signatures and map the spatial distribution of minerals on Mars, moving beyond pixel-based classification and conventional RGB overlays. Even when summary products reduce the dimensionality of CRISM spectral data, the feature space remains high-dimensional, and the viewing of only three dimensions at a time cannot reliably capture or preserve complex relationships. Our framework focuses on selected regions of Mars where diagnostically interpretable remote sensing data is available for mineral classification. It retains full dimensionality of the input space, producing topology-preserving clusters that can be partially labelled using the CRISM Type Spectra Library [29] through a post-processing spectral matching step. In this way, clusters are interpretable, can be correlated with known mineral signatures, and allow for the identification of mineral distributions, even in complex spectral environments with overlapping features. Overall, the framework provides a semi-automated, scalable, and interpretable approach for exploring Martian mineralogy in the absence of extensive ground-truth data.
The rest of this paper is organised as follows: Section 2 provides background and related work, while Section 3 describes the methodology, including data processing and modelling. Section 4 presents the results, and Section 5 provides a discussion, which is followed by a conclusion in Section 6.
Target Regions, Identified Minerals, and Image Adjustments
Table 1 and the accompanying map indicate the referenced target regions, the location of specific minerals, and adjustments made to image files:
Table 1.
Target regions and, where applicable, the original [X, Y] pixel coordinates of mineral locations shown in remote sensing and overlay images used in this research. These coordinates correspond to positions prior to any image rotation. For more details, refer to Section 3.1. * The presence of Fe-olivine, Fe-oxides, or other mafic silicate minerals remains speculative and is not confirmed.
A surface image view of Mars showing the targeted regions is presented in Figure 1.
Figure 1.
Global view of Mars showing enlarged and approximate locations of the target regions.
2. Background and Related Work
2.1. CRISM
Aboard NASA’s Mars Reconnaissance Orbiter (MRO), CRISM [1] is a high-resolution visible-to-near-infrared (VNIR) imaging spectrometer providing up to 544 spectral bands across the 362–3920 nanometre (nm) range, operating in targeted and mapping modes. The mission has produced numerous data records, such as EDRs (Experiment Data Records), TRDRs (Targeted Reduced Data Records), TERs (Targeted Empirical Records), and MTRDRs (Map-Projected Targeted Reduced Data Records), each reflecting different levels of calibration and spatial processing [30]. These datasets have played an important role in the exploration of Martian minerals by enabling the identification of minerals associated with past aqueous activity [30].
2.2. Summary Products for Data Processing
A key challenge is the variability in raw hyperspectral data, which can arise from factors such as atmospheric effects, illumination differences, surface scattering [31], and other environmental or acquisition-related influences. Spectral parameters can be used to overcome some of these difficulties, as it may not be necessary to assess the entire spectrum to identify a mineral, instead focusing on specific spectral features that can be described by a single parameter value [32]. Spectral parameters are widely used in CRISM data products and have proven to be a successful method of collapsing spectral variation into a limited number of dimensions to identify the presence of a mineral on Mars [14]; we refer to the spectral parameters as summary products to remain consistent with Viviano et al. [14]. These summary products offer several advantages over raw hyperspectral bands by (1) emphasising mineralogically meaningful absorption features, (2) reducing dimensionality while preserving domain-relevant information, and (3) improving comparability across regions with differing photometric conditions [33]. Although summary products offer these advantages over raw hyperspectral bands, developing them is inherently time-consuming, requires expert geological knowledge, and is subject to differing expert opinions on optimal formulations [13].
We compute BD1750_2 using hyperspectral data from the MRO CRISM Type Spectra Library as an illustrative example to demonstrate the intent of a summary product. This summary product estimates absorption near 1750 nm as follows:
The a and b parameters are weighting factors with . The value of b is approximately , and the exact ratio depends on the specific selected wavelengths, which is influenced by the kernel width. Figure 2a,b illustrate the absorption feature.

Figure 2.
(a) Gypsum spectrum highlighting the 1750 nm absorption feature. The different coloured lines indicate regions where CRISM measures the spectrum in consistent 6.55 nm increments, with breaks marking interruptions in the regular sampling pattern. (b) Zoomed-in view showing the interpolation anchors used to identify the absorption feature. (c) BD1750_2 values for 31 minerals, with only gypsum and alunite showing strong responses, demonstrating the diagnostic role of this spectral product in mineral detection.
As shown in Figure 2c, the summary product is sensitive primarily to gypsum and alunite among the 31 MRO CRISM Type Spectra Library minerals, consistent with earlier studies [14]. Notably, the absorption features remain clear, even when applied to un-normalised TER-3 data, demonstrating robustness—though normalisation is recommended if a reliable reference pixel can be identified.
2.3. Unsupervised Machine Learning
Dimensionality reduction methods such as Principal Component Analysis (PCA) [26,34] and Minimum Noise Fraction (MNF) [17] have been used to process hyperspectral data. PCA extracts orthogonal components based on variance, which can inadvertently prioritise noise over subtle but important absorption features. MNF aims to separate signal from noise more explicitly but may still obscure localised mineralogical signatures.
Traditional clustering algorithms such as PCA, k-means [15,34], Gaussian Mixture Models (GMMs) [13,34], and DBSCAN (Density-based clustering) have been applied to hyperspectral datasets [34,35]. However, these methods come with limitations, including the need for hyperparameter tuning and assumptions about the underlying distribution of materials. Spectral unmixing approaches such as Linear Mixing Models (LMMs) [36] offer an alternative by modelling each pixel as a linear mixture of spectra from a predefined set of end members. However, their effectiveness is limited by the linear assumption [37] and reliance on a fixed spectral library, which may not capture the substantial spectral variability experienced in practice [37,38,39]. A SOM model can mitigate some of these limitations via the preservation of topological relationships. For instance, an over-dimensionalised SOM will produce an organised map of similar neurons (refer to Section 2.3.1; a neuron can be thought of as a representative vector in this instance) that can be grouped or visually identified, meaning hyperparameter tuning is not as problematic. Similarly, topology separates high-confidence regions with more distinct mineral signatures from areas with more complex and mixed surface compositions, allowing for the identification of regions where spectral variability is not as problematic.
2.3.1. Self-Organised Map (SOM)
A SOM model projects high-dimensional inputs onto a two-dimensional topological grid while preserving neighbourhood relationships [23,24,25,40]. A SOM can be envisioned as a two-dimensional jigsaw puzzle, where the completed puzzle represents an organised map of the dataset. Each puzzle piece corresponds to a neuron, acting as a reference vector or prototype that summarises a cluster of similar data points. Conceptually, a SOM neuron is akin to the centroid of a k-means cluster; however, unlike k-means, a SOM preserves the topological relationships between neurons, arranging the pieces so that their relative positions reflect the original high-dimensional input structure. In this way, the SOM maintains the integrity of the underlying patterns while providing a coherent, interpretable two-dimensional representation. Therefore, SOM models are valued for their interpretability and have been successfully applied in remote sensing tasks such as cloud type classification [40], land cover classification [41], identifying distinct types of regolith and bedrock [42], and even reconstructing underwater sound speed profiles [43]. Some key advantages of a SOM model in this context include the following:
- 1.
- High-dimensional Input Handling: A SOM model can process high-dimensional data directly, without requiring dimensionality reduction (aside from computational performance considerations). Although SOMs typically use Euclidean distance (which assumes the linearity of the input space), other distance metrics can be adopted.
- 2.
- Topological Preservation: Unlike traditional clustering algorithms, a SOM model preserves the topological structure of the input space. This is particularly useful in hyperspectral analysis and can reveal unique mineral clusters in complex situations.
- 3.
- Robustness to Over-dimensioning: The SOM’s structured output space enables intuitive interpretation, even when the number of neurons exceeds the number of distinct clusters. Over-dimensioning leads to finer topological granularity rather than fragmented or scattered clusters, so coherent patterns can be visually observed.
- 4.
- Representative Abstraction: In large-scale clustering (e.g., 270,000 × 29 summary product vectors into 100 clusters), summarising each group meaningfully can be difficult. Simple averaging of cluster members may blur subtle spectral distinctions. In contrast, SOMs assign each neuron a weight vector that captures the core spectral characteristics of the clustered instances, as well as, to a lesser extent, those of nearby clusters, without the need for post-processing.
These features make SOM models particularly compelling for mineral analysis. They support unsupervised classification in an environment where there is little to no validation data whilst yielding spatially coherent, interpretable groupings that reflect physically meaningful spectral variations.
The SOM training (learning/optimisation) process is illustrated in Figure 3a. For a given input instance, the best matching neuron, also known as the Best Matching Unit (BMU), is identified based on the Euclidean distance between the input vector and each neuron’s weight vector. The neurons located within a neighbourhood radius () of the BMU are then updated based on the neurons’ distance from the BMU in the two-dimensional map grid (Figure 3a). The degree of update depends on the current learning rate and the neurons’ distance from the BMU.
Figure 3.
(a) The Best Matching Unit (BMU, red) and its radius of influence (Image A). During training, the weight vectors of neighbouring neurons are updated, with the adjustment strength decreasing with distance from the BMU. Neuron coordinates remain fixed, but their weight vectors move closer together (Image B). (b) The topological arrangement of neurons in a 50 × 50 self-organising map. Together, these panels visualise SOM training and neuron organisation.
Let denote the input vector, and let i and b represent the coordinates of neurons i and the BMU, respectively. The Euclidean distance between the coordinates of neurons i and b is given by
where and are the row and column coordinates of neuron i and and are the row and column coordinates of the BMU.
We use the Gaussian distribution to define the neighbourhood function that captures the strength of the update for neuron i at time t, as given by
We then update the weight vector () of a neuron within the neighbourhood using the following expression:
If the red neuron represents the BMU for the current input (Figure 3a), then the weight vectors of all neurons within the neighbourhood radius () are adjusted toward the input vector, scaled by their respective update factor (). Although the weight vectors of the neurons are updated, their positions on the grid remain fixed; however, the weight vectors shift toward the BMU. We retain the neuron map, including the weight vectors for each neuron, once the SOM model has completed training. We then map each pixel to a specific neuron based on the BMU for that instance.
Figure 3a presents the SOM topology maps visualised as two-dimensional arrays, where each pixel corresponds to a neuron in the model. A key consideration when configuring an SOM model is determining the number of neurons. Although limited research exists on the optimal size, a commonly cited heuristic suggests using approximately neurons, where N is the number of training samples [44].
In the case of an image with 250,000 valid pixels, this equates to around 2500 neurons, prompting our choice of a 50 × 50 grid. Our preliminary tests indicated that a 25 × 25 grid produced similar results, while grids smaller than 10 × 10 led to a loss of detail. As shown in Figure 3b, neuron indexing begins at 0 in the top-left corner and ends at 2499 in the bottom-right corner. Although our study does not refer to specific neuron coordinates, Figure 3b illustrates the standard arrangement.
2.3.2. SOM Competitive Learning
The following example illustrates the competitive learning strategy of the SOM model. In Table 2, the first grid lists numbers sequentially from 1 to 25, while the second shows the total Euclidean distance of each number from all others. While clustering similar numbers, the SOM also seeks to preserve the topological structure indicated by the second grid. Thus, numbers 1 and 25—being most dissimilar to the rest—are projected toward opposite corners; numbers 2–8 and 18–24 are positioned farther away from the centre; and the more similar middle numbers, i.e., 9–17, are grouped more centrally relative to these dominant topological patterns, effectively pushing the more distinct numbers outward.
Table 2.
Left table: Numbers ordered in a 5 × 5 grid from 1 to 25. Right table: Total Euclidean distance of each number from every other number.
If a grid is randomly initialised with numbers 1–25, training with uniformly sampled instances should lead the SOM to cluster similar numbers whilst converging on preservation of these patterns. In Table 3, the first grid shows the random initialisation, while the second shows the ordering after 100,000 training iterations.
Table 3.
Left table: Randomly initialised numbers from 1 to 25. Right table: SOM ordering after 100,000 training iterations.
The SOM successfully clusters similar numbers, with values less than 5 grouped in one corner and those above 20 in the opposite corner. Beyond this neighbourhood clustering, the model preserves the key topological structure—1 and 25 are projected as far apart as possible, occupying opposing corners, while 2–8 and 18–23 are positioned away from the central numbers. The diagonal and midlines form the core region, with extreme values in opposing corners, where the central numbers are located. Even within this central zone, the SOM creates sub-clusters, grouping lower teens in the bottom left and higher teens in the top right while still maintaining the overall topological relationships. In this way, the SOM not only clusters similar numbers but also preserves their relative topological structure—a principle that naturally extends to higher-dimensional datasets, such as spectral summary products.
2.3.3. Biased Instance Selection
For the example in Table 3, we draw new instances uniformly. When instances are biased, the SOM’s ability to cluster and learn topological patterns is reduced. For the following scenario (Table 4), the number 20 is drawn with 75% probability, while the remaining numbers share the remaining probability uniformly. Large topological patterns are still observed, but clustering is hampered as neurons converge toward 20, resulting in lower-order patterns becoming less evident.
Table 4.
Left table: Randomly initialised numbers from 1 to 25. Right table: SOM ordering after 100,000 training iterations with training instances biased toward 20 (drawn with 75% probability). Large-scale topological patterns are preserved, but neuron convergence toward 20 dilutes the ability to observe lower-order topological patterns.
2.3.4. Implications for Mineral Analysis
In mineral mapping, mixed spectra resemble the central numbers—broadly similar or relatively close in Euclidean distance (see the right-hand table of Table 2)—whereas unique spectra are analogous to 1 and 25, differing strongly from all others. As in the 5 × 5 example, the SOM projects these distinct numbers to the map’s corners and edges, since their BMUs consistently differ from the rest, while clustering similar spectra together. This approach allows the SOM to preserve the dataset’s topological relationships while effectively segregating distinct mineral groups, though care must be taken to avoid bias in the training samples.
3. Methodology
3.1. Mars Coordinate System
In this study, we use unprojected TER-3 images, in contrast to other sources that use map-projected MTRDR images, which are geo-referenced to the spherical surface of Mars (Viviano et al. [14]). Map-projected images result in a visually distorted appearance for the same target region; therefore, interpretation must be practised with care, often through comparison with topographic features or by using custom-coloured visualisations that assist in aligning spatial references. Refer to Figure 1 for a global map of Mars identifying the analysed target regions.
The coordinate system we used for Mars is [X,Y], corresponding to row × column indexing as shown in Figure 4. This aligns with the dimensions of an image file when viewed numerically.
Figure 4.
Pixel coordinate system used in this study.
We refer to Figure 1 for known mineral locations referenced in this study—these locations were originally sourced from [14]. We convert the [Y,X] coordinates provided by [14] into our [X,Y] format to ensure consistency and reproducibility. These locations serve as the source of the hyperspectral reference spectra used in the MRO CRISM Type Spectra Library for the specified minerals. In some cases, we rotated the unprojected TER-3 images by 180 degrees to match the orientation presented in [14]. We explicitly noted the rotations in the corresponding image descriptions for transparency and reproducibility, and Table 1 summarises the adjustments.
3.2. SOM Model Inputs
We use summary products derived from specific spectral absorption features [7] to cluster minerals, as these products are designed to highlight key mineral signatures. Although MTRDR image files include such products—and it was initially our intention to use them all—preliminary analysis revealed discrepancies between many MTRDR outputs and the spectral formulas intended to generate them. Consequently, we recalculated 29 summary products from TER-3 files (refer to Section 3.3), following the set used by Viviano et al. [14] (Figure A1), including those required for the RGB overlays, as these products provided greater confidence in representing the intended spectral features. Adjustments and discrepancies relative to CRISM DPIS definitions [33] for these 29 summary products are detailed in Appendix B.
Although this set provides a practical baseline, it is not sufficient for comprehensive mineral discrimination in a machine learning context. The SOM architecture can handle higher-dimensional inputs, which is desirable for reducing collinearity between mineral signatures and capturing more nuanced spectral information. Achieving this requires confidence in the input parameters and, ideally, a standardised automated procedure for generating them, which lies beyond the scope of this study but represents an important avenue for future work.
All 29 features were retained without further dimensionality reduction (e.g., PCA or feature selection) to preserve the full input space. This ensures that the SOM remains sensitive to subtle yet potentially important variations in mineral signatures across target regions. Figure 5 illustrates how these summary products were used as input to the SOM model.
Figure 5.
This heat map illustrates how summary products serve as input features. Rather than displaying all 29 products for each pixel, it shows a subset of recalculated summary products for selected minerals from the MRO CRISM Type Spectra Library, highlighting spectral variation across mineral types. Each horizontal row can be thought of as an individual pixel from a TER-3 image file.
3.3. Data Preprocessing
We utilise two CRISM datasets in the TER format, taking the following issues into consideration:
- 1.
- Summary product recalculation: We use Targeted Empirical Record—version 3 (TER-3) files, which enable us to recalculate summary products from the original spectral measurements. TER-3 is a specific CRISM image type [33] that consists of un-projected hyperspectral observations that have been corrected for atmospheric and photometric effects for Mars.
- 2.
- Spectral labelling: We use the MRO CRISM Type Spectra Library to label clusters, which provides reference spectra for each mineral. Specifically, we use the numerator from the library—the source of which is also a pixel in a TER-3 image—where the mineral has been previously verified [29]. These spectra serve as ground-truth references to support the identification of mineral clusters detected by the SOM model. Ideally, each pixel (numerator) would be normalised using a reference pixel (denominator) and the ratio would be used to calculate summary products. Whilst the library provides a denominator for each reference spectrum, we did not adopt this approach due to uncertainty about reliably identifying a reference pixel in each TER-3 image. If done incorrectly, this could complicate comparisons between the summary products calculated from targeted TER-3 image files and library reference spectra. Spectra for each mineral contained within the library are converted into a summary product for direct comparison.
We applied a unified preprocessing pipeline to both datasets to ensure compatibility between the summary products and the reference spectra, consisting of the following steps (Figure 6, Step 2).
- We mask pixels that have a specific value of 65,535 in any frame, as this indicates invalid or missing spectral data. These pixels are excluded from analysis, and summary products are not calculated for them.
- We compute summary products for each pixel from the unmasked pixels.
- We clip any summary product values below zero to zero, although the formula for summary products does not specify a minimum bound. We refer to BD1750_2 in Equation (1), which can yield negative values, as absorption features are inherently positive by definition. This clipping approach is consistent with recommendations in the literature [14].
- We normalise all summary products to a range of [0, 1] to eliminate dimensional disparities and standardise inputs for the model.
Figure 6.
Machine learning framework illustrating the progression from raw inputs to SOM model outputs. In Step 2, we mask out the pixels with a label value of 65,535; therefore, we do not compute summary products for them. Furthermore, we clip negative summary product values to zero before normalisation. Step 4 uniformly samples a total of N pixels from each cluster generated in Step 3, repeating this process X times. Sequential updates in Step 4.2 imply that we train the SOM model one pixel at a time.
Although normalisation of summary products (input features) to the [0, 1] interval promotes consistency, it also overemphasises certain mineral signatures. Ideally, the inputs should be scaled with reference to a global range of summary product values, not local target regions. For example, if the maximum values of BD1750_2 in target regions “A” and “B” were 0.005 and 0.1, respectively, the local scaling would independently scale both maximums to 1, effectively treating them as equivalent. In contrast, global scaling performed across both regions would result in scaled values of 0.05 for region “A” and 1 for region “B”. In this case, global scaling preserves the relative difference in absorption strength, maintaining the weaker BD1750_2 response in region “A” rather than inflating it to match region “B” under local scaling.
The SOM model in our framework produces several outputs: pixels identified by specific neurons corresponding to particular minerals can be visualised with false-colour or RGB overlays; topological maps representing the neuron grid can be adapted to display correlations, analytical metrics such as hit rate. Furthermore, it produces group analyses that reveal mineral associations projected toward the corners of the SOM, providing insight into the model’s topological structure.
3.4. Unsupervised Machine Learning Framework
After processing the data, as shown in our framework (Figure 6), we apply k-means clustering to group pixels based on spectral similarity. Experimentally, we select the number of clusters such that the smallest cluster contains approximately ten percent of valid pixels in an image (Figure 6—Step 3). For each training iteration (repeated “X” times), we randomly and equally sample from each cluster formed in Step 3 to generate a batch of “N” instances for a single SOM training update. This approach ensures that no single spectral group dominates the training set, maintaining diversity across iterations (Figure 6—Step 4). Clustering before sampling mitigates redundancy in the training data, preventing premature neuron convergence and allowing the SOM to learn a more meaningful representation of the spectral space (refer to Section 2.3.2 for a basic example as to why this is important).
We use each instance in a training batch to sequentially train the SOM model. We then integrate the SOM output with the MRO CRISM Type Spectra Library to produce labelled mineral clusters and topology maps. We then provide an analysis of the minerals clustered through the framework (Figure 6—Step 5).
Each neuron in our SOM model is associated with a 29-dimensional weight vector, corresponding to the summary products used as input features. We initialise these weight vectors with random values in a range of [0, 1], which assists with initial learning, as the summary products used as inputs are restricted to this range. Although this random initialisation is typically sufficient for model convergence, more structured initialisation methods could potentially accelerate learning and enhance clustering performance. We employ a competitive learning strategy based on the principles introduced by Kohonen [45] and extended to clustering tasks by Vesanto [24] to get a better organised topological map from the SOM. This involves a two-stage training process designed to first establish a rough global organisation, then refine local cluster distinctions.
- Stage 1—Rough Topological Organisation: In the initial stage, neurons are updated with a high learning rate and a large neighbourhood radius, facilitating a coarse topological structure that guides the model in later stages.
- Stage 2—Refinement: The learning rate progressively decays, and the neighbourhood radius shrinks, enabling the model to fine-tune and converge on subtle distinctions in mineral characteristics.
3.4.1. SOM Implementation
We enforce the initial organisation of the randomly initialised weights by 50 preliminary model training iterations before commencing formal training. We select the initial learning rate as and the radius as for a SOM map. The initial learning rate and radius of updated neuron nodes gradually decrease.
The SOM model undergoes 1000 iterations of formal training to cluster instances (Step 4 of Figure 6). We use 5000 instances in each iteration, randomly but equally sampled from each cluster formed in Step 3 of Figure 6. The number of clusters formed during the pre-training instance clustering process may vary, but typically, around 20 clusters are generated, with the smallest cluster containing about 2000 instances. Over 1000 iterations, each instance in the smallest cluster is expected to be used for training approximately 125 times, or once every ten iterations, ensuring that small clusters are adequately represented and not overlooked by the SOM model during training.
We relied on heuristic estimates [44] to determine the SOM model dimensions (hyperparameters such as length and width) and did not delve into hyperparameter optimisation due to a SOM’s topological organisation, which is tolerant of over-dimensioning by grouping similarly weighted neurons within, effectively forming a cluster of clusters. Our visual inspection of a neuron heat map suggests that a 50 × 50 SOM is likely over-dimensioned, as there are often topological clusters of high correlation among many neurons. Nonetheless, the impact of moderate over-dimensioning appears minimal, as the SOM’s topological structure maintains meaningful organisation by grouping highly correlated neurons into hierarchical clusters. These can be further clustered by external algorithms or interpreted through visual inspection of the neuron heat map. However, there are instances in which a 50 × 50 SOM is required to isolate a particular mineral—notably, Epidote in target region FRT0000CBE5, which was identified in additional analyses not included here. Therefore, under-dimensioning the SOM, where distinct clusters cannot be effectively separated, likely poses a greater risk than over-dimensioning, which primarily results in redundant but topologically organised clusters.
3.4.2. Cluster Labelling
In order to correlate the clusters with known mineral types, we compare the trained neuron weight vectors with processed reference spectra from the CRISM MRO Type Spectra Library. We compute the correlation scores by comparing the 29 normalised summary products of each neuron’s weight vector with the reference spectra. We assigned the neurons with higher correlation scores, indicating greater spectral similarity, tentative mineral labels. This approach enables the partial labelling of unsupervised clusters, enhancing their interpretability without requiring training data.
3.4.3. Threshold for Mineral Confidence Levels
We focused on regions identified by the top five neurons with the highest correlation to the reference spectrum to assess the spatial distribution of a labelled mineral. These neurons generally exhibited strong spectral similarity, particularly in areas where the mineral was previously confirmed to exist. However, simply selecting the top five neurons can yield misleading results in regions where the mineral is not present. Therefore, we excluded neurons with low correlation scores, even if they ranked among the top five for a given mineral. In an attempt to refine and automate this process, we explored an alternative approach.
- Correlation Calculation: We calculate the correlation of all 2500 neurons with each of the 31 mineral reference spectra in the CRISM MRO Type Spectra Library, resulting in 77,500 individual correlation values.
- Global Thresholding: The neurons are considered to have a high correlation with a mineral if their correlation exceeds the 95th percentile of all neurons. The median correlation value was also used as a threshold, with neurons correlating with the median being excluded from further consideration.
- Z-Score Transformation: For each mineral, the correlation values for each neuron were converted to z-scores (for just that mineral). A neuron was deemed to belong to a mineral if the following conditions were met:
- –
- The neuron had a high correlation with the mineral and its z-score exceeded the mean z-score for that mineral across all neurons (indicating a strong and representative correlation).
- –
- Alternatively, if the mineral’s correlation with the neuron was low, the z-score had to exceed 2 to be considered relevant.
- Rationale for the Approach: This method was adapted based on our observations that some minerals appeared to be widely distributed across a target region, while others were highly localised. The approach aimed to capture both scenarios by emphasising stronger correlations for widely distributed minerals and allowing for more flexibility with more isolated minerals.
Despite experimenting with this approach, we found that selecting the top five neurons was generally sufficient for most analyses, as our focus was on regions where the presence of specific minerals had already been reported. This method offered a straightforward yet effective way to associate SOM neuron weight vectors with known mineral signatures, delivering reliable results, without the need for complex thresholding in most cases.
3.4.4. Surface Dynamics and Temporal Variability
CRISM TER-3 images represent a snapshot in time and do not account for the temporal variability of the Martian surface, which may alter surface reflectance properties over time. No adjustments were applied in this study, and CRISM TER-3 data were used as provided. Future work may incorporate temporal correction methods to address surface dynamics such as dust deposition and seasonal ice variation.
4. Results
We present five investigations to demonstrate the ability of our framework, which utilises a SOM model to detect the spatial distribution of minerals on Mars, highlighting its advantages for mineral exploration.
- 1.
- A visual comparison of the SOM model’s ability to detect the spatial distribution of the mineral illite with reference to the distribution identified by Viviano et al. [14];
- 2.
- The SOM model’s ability to detect the specific location of the mineral low-ca pyroxene, as previously identified by Viviano et al. [14];
- 3.
- An application of the SOM model’s topological structure to identify the presence of Mg-carbonate on Mars, as previously identified by Viviano et al. [14];
- 4.
- A qualitative review of the topological organisation of mineral groups within the SOM model’s neuron grid;
- 5.
- Implementation of a semi-automated framework for mapping mineral spatial distributions without reliance on RGB overlays, leveraging the SOM’s ability to project high-dimensional spectral patterns onto a two-dimensional grid while simultaneously clustering pixels.
4.1. Mineral Classification
Figure A3 as reproduced from Viviano et al. [14] provides an opportunity to evaluate the SOM model’s ability to identify the spatial distribution of Martian minerals. Figure 7a shows the spatial distribution of the mineral illite as identified by the SOM model, along with our attempt to replicate image D of Figure A3. Although illite is intended to appear as pale yellow, in our overlay (Figure 7a), it appears to render as a colour closer to pale pink (a consequence of local scaling). The overlay reveals a strong alignment between the highlighted pale-pink areas and the clusters identified by the SOM model—particularly clusters 0 and 2.
Figure 7.
Our study uses image data from Viviano et al. [14] as a source of ground truth for comparison with outputs derived from the SOM model. For ease of comparison, we show a snapshot of the referenced image in the top-right corner of the corresponding panel and provide a larger version in Appendix A. We highlight the limitations of these visual comparisons in Section 3.1. Both images are rotated 180 degrees to align with Image D from Figure A3 and have a resolution of 390 × 640 pixels, with the X-axis oriented vertically to match image rows. (a) Unprojected map of FRT0000634B with an overlay based on Image D of Figure A3. Faint pale-pink areas indicate illite, with the clustering box reducing the SOM model’s ability to resolve thin strips of this mineral. (b) Pixel locations of illite identified by the top five neurons most correlated with the processed MRO CRISM Type Spectra Library. Thin white strips demonstrate the SOM model’s capacity to resolve subtle spectral differences within regions of spectrally similar pixels, as indicated by the surrounding pink shading in (a).
Although the polygonal boundary includes broad pale-pink zones—especially within cluster 0—in Figure 7a, the SOM model appears more precise in its identification of illite. Hence, Figure 7b highlights the specific regions where the SOM model detected illite as white pixels. This emphasises the model’s accuracy in pinpointing even thin strips of illite, which is not demonstrated by the broader polygon-based representation.
Clusters 1 and 2 in Figure 7 highlight potential illite in areas not previously verified. For all clusters, these pixels originate from the five SOM neurons most associated with illite, which are neurons 2462, 2461, 2463, 2460, and 2454—listed from highest to lowest correlation with the MRO CRISM Type Spectra Library spectrum for illite. Four neurons (2460–2463) form a compact block along the bottom row of the SOM neuron grid. The other neuron, 2454, lies apart from this block. While spectrally similar, it shows weaker and absorptions and belongs to a very similar, albeit a different SOM cluster. This separation is important—it indicates that neuron 2454 is closely related to illite-like spectra (possibly more mixed) but distinct enough for the SOM to preserve it as a separate cluster. Figure 8 illustrates this, with most high- spectra grouped to cluster 0 (containing verified illite; refer to Figure 7), with clusters 1 and 2 (refer to Figure 7) representing spectra with reduced absorption. The existence of both SOM topological clusters demonstrates how the SOM can capture subtle internal variation in high-dimensional data. Achieving a fully automated delineation will likely require more discriminating summary products, as with the current set, neuron 2454 correlates almost identically with neurons 2460 and 2463 when referenced against the illite spectrum from the MRO CRISM Type Spectra Library.
Figure 8.
Reference to clusters shown in Figure 7. The SOM grid contains 2500 neurons. In this rotated 3D orientation, neuron [0, 0] is at the left-hand corner, [50, 50] at the right-hand corner, [50, 0] at the top apex, and [0, 50] at the bottom apex. (a) Neuron weight values for , with both clusters showing a distinct spike—their separation indicates additional high-dimensional differences. (b) Neuron weight values for , where cluster 0 shows elevated absorption, while clusters 1 and 2 are more subdued and not directly visible from this view, leading the SOM to topologically separate them.
Using SOM output in this semi-automated approach, neurons 2412 and 2461–2464 are identified as best representing the high-quality pixels within the SOM topological cluster containing verified illite. Although neuron 2412 introduces a separate spatial distribution cluster (cluster 1 in Figure 9), it is topologically connected to the main group and, thus, likely represents illite, albeit with slightly more mixed spectra. The topological connection of neuron 2412 is illustrated in Figure 9 by the included snippet, which shows the pixels mapped by these neurons and how the cluster appears on the SOM-derived two-dimensional grid for .
Figure 9.
SOM-driven semi-automated selection of neurons 2412 and 2461–2464, representing the SOM-identified topological cluster containing a pixel verified as illite. White pixels indicate individual pixels clustered by these neurons. The snippet shows the SOM neuron grid with absorption, highlighting a separation between neuron 2454 and the neighbouring cluster of neurons (2461–2464) with more elevated absorption. The snippet is positioned over an otherwise uninformative area of the image as per previous images. This image is rotated 180 degrees to align with Image D from Figure A3 and has a resolution of 390 × 640 pixels, with the x-axis oriented vertically to align with the image rows.
Table 1 in Viviano et al. [14] identifies low-Ca pyroxene as present at the coordinate location [174, 528] within a 9 × 9 pixel area in target region FRT000064D9. This location appears to visually align with the cyan regions in Image H from Figure A2, offering an additional opportunity to evaluate the SOM model’s ability to identify the spatial distribution of a mineral. Although this region is not explicitly labelled in Image H from Figure A2, its association with low-Ca pyroxene is visually supported by the coordinate reference and Martian surface features. We note that Figure A2 is map-projected, while the coordinate reference is not.
Figure 10a shows the region identified by the SOM model, overlaid with our replication of Image H from Figure A2. The highlighted area corresponds to the top five neurons most strongly correlated with the processed MRO CRISM Type Spectra Library sample for low-Ca pyroxene.
Figure 10.
We use image data from Viviano et al. [14] as a source of ground truth for comparison with outputs derived from the SOM model. For ease of comparison, a snapshot of the referenced image is shown in the top-right corner of the corresponding panel, with a larger version provided in Appendix A. Readers are reminded of the limitations of these visual comparisons, as discussed in Section 3.1. (a) Overlay based on Image H of Figure A2, showing cyan areas of low-Ca pyroxene identified by the top five correlated neurons. The maximum pixel correlation in Cluster 0 is 0.87. (b) Overlay using the top fifty correlated neurons, where white pixels indicate those clustered by the neurons and underlying cyan areas indicate low-Ca pyroxene. Correlation values range from 0.81 to 0.87, with the latter in Cluster 2. Both images are unprojected, rotated 180 degrees to align with Image H from Figure A2, and have a resolution of 480 × 640 pixels, with the x-axis oriented vertically to match image rows.
Although the SOM model correctly identifies the target region with the top five neurons, this is likely an example of over-dimensioning the SOM model. Regarding Figure 11, many neurons in the top left-hand corner of the SOM model are correlated with the mineral and, ideally, should be clustered to a smaller number of neurons. Figure 10b shows the identified spatial distribution of the top 50 neurons. Overall, the SOM model appears to effectively capture the spatial distribution of low-Ca pyroxene and topologically organises those pixels within the map.
Figure 11.
Heat map of SOM neurons showing correlation with low-Ca pyroxene in region FRT000064D9 based on comparison with the MRO CRISM Type Spectra Library. Each pixel in this image corresponds to a neuron in the SOM model, starting with neuron 0 at the top-left corner [0, 0] and ending with neuron 2499 at the bottom-right corner [50, 50], comprising a total of 2500 neurons.
4.2. SOM Model Structure and Mineral Organization
Fe-Olivine appears to be widespread in target region FRT00003E12, with a substantial majority of SOM neurons exhibiting a strong correlation with the mineral. Rather than assessing the correlation of each neuron with a particular mineral, Figure 12a considers the hit rate of each neuron; this reveals a very distinct topological cluster with a relatively low hit rate in the upper-left corner of the SOM.
Figure 12.
(a) Number of times each neuron was the BMU, with lower hit counts concentrated in the upper-left quadrant. (b) Heat map of SOM neurons showing correlation with Mg-carbonate based on comparison with the MRO CRISM Type Spectra Library. Neurons correlated with Mg-carbonate cluster in the top-left quadrant, whereas Fe-olivine would otherwise correlate to neurons in the top-right quadrant. Each pixel in these images corresponds to a neuron in the SOM model, numbered from neuron 0 at [0, 0] in the top left to neuron 2499 at [50, 50] in the bottom right, for a total of 2500 neurons.
The only mineral mapped to this distinct region is Mg-carbonate. While Mg-carbonate shares activity in OLINDEX3 and BD1300 with Fe-olivine, it additionally exhibits strong responsiveness to BD2290 and several other summary products to which Fe-olivine does not. This spectral distinction is echoed in the SOM structure, which topologically separates Mg-carbonate into a unique cluster of neurons, mirroring the spectral separation intentionally embedded into our overlay design. As shown in Figure 12b, the SOM clusters Mg-carbonate into the top-left quadrant and away from Fe-olivine, which would otherwise correlate to neurons in the top-right quadrant.
The extended moderate neuron correlation with Mg-carbonate in Figure 12b is likely due to the extensive presence of Fe-olivine in target region FRT00003E12, as Mg-carbonate responds to many of the same summary products as Fe-olivine. Figure 13 presents the CR2 overlay to demonstrate the isolated nature of Mg-carbonate in this region, following the approach suggested by Viviano et al. [14] in Figure A1, which proposes this RGB overlay to distinguishing carbonate minerals. This overlay uses the summary products (MIN2295_2480 and MIN2345_2537) and CINDEX2 as RGB components, highlighting Mg-carbonate in areas clearly distinct from the widespread Fe-olivine. Prior to our study, we were unaware of any previous identification of Mg-carbonate at this location and were guided to its presence via the topological relationships preserved by the SOM model. We found that the mineral had, indeed, been previously reported at this site, and the SOM model correctly delineated its known distribution. Table 1 from Viviano et al. [14] identifies Mg-carbonate as present at the coordinate location [103, 136] within a 5 × 5 pixel area in target region FRT00003E12.
Figure 13.
Unprojected map view of FRT00003E12 using the recommended CR2 overlay, highlighting potentially Mg-carbonate-rich areas. Unlike other images, no individual pixels are marked in this overlay. The yellow regions are a direct result of the overlay and represent isolated and distinct mineral clustering in this area. The SOM-delineated regions were generated by selecting the top 15 neurons—those with a correlation with the reference spectra exceeding 0.8, i.e., the highest correlating neurons located in the top-left corner of the neuron heat map.
Similar to the pattern identified in the paragrpah “Implications for Mineral Analysis” of Section 2.3.2, Figure 14a shows the inferred distribution of minerals mapped for target region FRT0000634B, where each neuron follows the application of our thresholding criterion based on correlation strength and z-score. A distinct pattern is revealed in which minerals localise around the corners and edges of the SOM model.
Figure 14.
Mineral mapping, correlation, and hit rate for each neuron in target region FRT0000634B. Each pixel in these images corresponds to a neuron in the SOM model, starting with neuron 0 at the top-left corner [0, 0] and ending with neuron 2499 at the bottom-right corner [50, 50], comprising a total of 2500 neurons. (a) The number of minerals mapped to each neuron. Neurons with multiple mapped minerals suggest a high degree of feature correlation among their mineral groups. (b) Neuron correlation with Al-smectite. (c) The number of times each neuron was identified as the BMU. The SOM model appears to cluster uninformative neurons near the centre of the map, where we observe the highest hit counts. (d) Neuron correlation with talc.
Figure 15 shows the frequency with which each mineral group is associated with neurons in the four corners of the SOM model, revealing a hierarchical pattern in the clustering. Note: ice is not present in this region; however, ice-related summary products are highly correlated with other minerals (e.g., bassanite and others). As a result, neurons representing these minerals or mixed spectra may also appear to be correlated with ices. This reflects the limitations of the utilised summary products, where collinearity persists (see Section 3.2).
- 1.
- The top-left corner of Figure 15 is notably sparse in phyllosilicate minerals, which are primarily mapped to neurons in opposing corners.
- 2.
- Sulfates and ices (the latter represented by only two instances in the MRO CRISM Type Spectra Library) are more prevalent in the upper regions of the SOM model, with ices absent from the bottom corners.
Figure 15.
An analysis of the mineral categories associated with the neurons located at each corner of the SOM model for target area FRT0000634B.
We examine a correlation heat map of the minerals from the processed MRO CRISM Type Spectra Library to better understand the structure and organisation (refer to Figure 16) and note a pattern similar to that of the SOM model:
- 1.
- Phyllosilicates exhibit high intra-group correlation, except for kaolinite and Al-smectite, which are mostly negatively correlated with their peers.
- 2.
- Both H2O and CO2 ices are correlated with many sulfates, plus a few other minerals for CO2 ice.
Figure 16.
Correlation heatmap of processed MRO Library minerals.
As phyllosilicates are highly correlated (which is why some neurons are mapped to multiple minerals), neurons activated by these minerals tend to draw in neighbouring neurons, causing the phyllosilicates to cluster together. This clustering occurs throughout the SOM model, except in the top-left corner, where phyllosilicates are sparse (Figure 15). Similarly, the SOM model appears to group ices and sulfates along the top of the map. Refer to Section 2.3.2 for an illustrative example of the patterns preserved by the SOM model’s competitive learning process.
Further examination reveals that kaolinite and Al-smectite are negatively correlated with their group. The SOM model topologically separates spectra correlated with these minerals into the top-right and bottom-left corners of the grid (Figure 14b), away from spectra correlated with other phyllosilicates, such as talc, which cluster in the bottom-right corner (Figure 14d). Without the negatively correlated kaolinite and Al-smectite, the entire upper region of the SOM would likely be free of spectra that correlate with phyllosilicates. This separation highlights the model’s ability to align with inherent patterns in the data. Importantly, when attempting to identify an unknown mineral, such a topological structure may provide clues about the mineral group to which the unknown mineral belongs.
We also examined the frequency with which neurons were activated and noticed an inverse relationship between the neurons that likely represent a mineral (Figure 14a) and those that receive the most hits (Figure 14c). This may indicate that many pixels contain mixed data, making it difficult to reliably identify anything unique. We believe this is an important observation and a key reason for selecting samples from k-means clustered data when training the SOM model as described. By doing so, we can better represent groups (with few instances) that are likely to contain minerals, as opposed to the majority of instances, which appear to be uninformative and potentially redundant.
Moving Beyond RGB Overlays and into High-Dimensional Clustering
The SOM compresses high-dimensional hyperspectral data onto a two-dimensional grid, preserving multi-dimensional spectral patterns and revealing clusters of distinct spectra. We illustrate this with an example target region FRT00007375 to demonstrate the SOM’s ability to delineate mineral distributions beyond conventional RGB visualisations, which are inherently limited in capturing patterns within the full high-dimensional input space. This region was selected at random from the TER catalogue, without prior knowledge of its location or mineralogy.
For target region FRT00007375, the U-Matrix derived from the SOM output highlights unique spectral behaviour associated with pixels clustered by neurons 276 and 607, as annotated in Figure 17a. The U-Matrix captures differences between neighbouring neurons, with bright regions marking transitions between distinct spectral groups, allowing areas of unique spectra in the high-dimensional data to be identified at a glance. Although neurons 276 and 607 occupy different locations on the SOM grid, they share strong similarities across most summary products, including a pronounced spike in BD2230 (Figure 17c).
Figure 17.
Orientation note: In panels (b–d), the X-axis corresponds to the vertical axis and the Y-axis to the horizontal axis in panel (a). (a) SOM U-Matrix for target region FRT00007375 with annotated neurons. Each pixel represents a neuron in the 50 × 50 grid, numbered from 0 at the top left [0, 0] to 2499 at the bottom right [50, 50]. (b) R3920 reflectance across the SOM grid. (c) BD2230 reflectance, showing spikes for both neurons 276 and 607. (d) BD2265 reflectance, where only neuron 276 exhibits a strong absorption feature.
Due to the persistent spectral similarities, pixels corresponding to these neurons would appear nearly identical in a standard RGB overlay, as only a single dimension of the 29-feature space captures a meaningful difference. The SOM, however, separates them clearly—neuron 276 exhibits a BD2265 spike, whereas neuron 607 does not. Their positions on the SOM grid illustrate the preservation of high-dimensional patterns—for example, a region in the top right of the SOM (bottom left of Figure 17b) shows low R3920 reflectance, rising toward the grid centre.
These neurons had to be separated but only in locations consistent with other multi-dimensional factors. The underlying pixels are clustered by neurons that retain their similar R3920 values (0.36883 and 0.255), with the neurons positioned along a contour in the two-dimensional neuron grid consistent with these R3920 values. In this way, the SOM preserves both local spectral differences and broader pattern continuity. Although the BD2230 and BD2265 spikes may be artefacts, this example demonstrates the SOM’s ability to capture fine-scale high-dimensional variations while maintaining overall spectral relationships.
Many of the spectra in this region are broadly similar, with strong associations with summary products linked to bassanite and analcime (persistent OLINDEX3 and BD1900R2 and consistently elevated SINDEX2, among others). Because the SOM employs competitive learning, unique spectra are positioned toward the edges and corners of the map (refer to Section 2.3.2), separating them from common patterns. Consequently, cluster A in Figure 17a stands out. This cluster is more distinct than the U-Matrix alone, which suggests that the SOM allocated many neurons to separate this group (Figure 18a), producing gradual transitions between neighbouring neurons. The bottom-left panel of Figure 19 shows that this cluster captures Fe-silicate and/or phyllosilicate spectra, separated from sulphates, further underscoring its significance.
Figure 18.
SOM outputs for target region FRT00007375. (a) Neuron hit rate, showing how often each neuron was identified as the BMU, with many neurons allocated to unique spectra in the bottom-left corner. (b) Rotated 3D view of the same SOM grid to emphasise BD1300 responses; the dense cluster corresponds to the bottom-left corner of panel (a). (c) Neuron correlation with Fe-olivine, aligning closely with the low-hit-rate cluster in panel (a). (d) Neuron correlation with margarite, showing a more diffuse correlation than for Fe-olivine. Panels (a,c,d) show the SOM 50 × 50 neuron grid.
Figure 19.
Mineral categories associated with the neurons located at each corner of the SOM model for target region FRT00007375. Unlike other corners, sulphate-like spectra are lacking, whereas Fe-silicate and phyllosilicate spectra are more abundant.
Under local scaling, the high-dimensional patterns in the bottom-left corner of the SOM grid (Figure 17a) are characterised by generally high R3920 reflectance, except for a small sub-cluster with much lower values (visible as the dip in the top right of Figure 17b). This zone also exhibits BD1400 absorption, moderate BD1435 absorption, and low BD2235 absorption. Many of these features correspond to margarite (Figure 18d) and serpentine, particularly the elevated R3920 reflectance. The defining characteristic, however, is the presence of spectra with BD1300 absorption, confined strictly to this zone (Figure 18b). Notably, both Mg-olivine and Fe-olivine correlate strongly with neurons in this cluster (Figure 17c), although the distinct dip in R3920 within one sub-cluster warrants further investigation beyond the scope of this work.
In a semi-automated workflow, an analyst would be immediately drawn to the sub-cluster of lower R3920 reflectance in this region, as the U-Matrix in Figure 17a clearly highlights distinct changes in neuron weights slightly left of the bottom-left corner. Meanwhile, other SOM-derived analytics, such as those shown in Figure 18a–c, indicate that these neurons belong to a larger, coherent cluster, illustrating how the combination of topological and spatial analyses enhances the interpretability of high-dimensional spectral data.
Figure 20 shows the pixels most strongly represented by this cluster—specifically, neurons exceeding the 97.5th percentile of BD1300 values. We also highlight the subset associated with strong BD1300 but reduced R3920 reflectance by displaying pixels clustered by neurons 2404, 2405, and 2453–2456. These pixels occupy different locations yet appear consistent with the topography—clusters with higher R3920 reflectance are more concentrated and predominantly located in the fissuresof the northern terraces (relative to the image), whereas pixels with lower R3920 reflectance appear to wrap around the bluff and appear more concentrated in isolated depressions within the terraces. While we do not have the expertise to confirm this, it is plausible that the mineral may have weathered away from the concentrated regions with higher R3920 reflectance over time. Pixel coordinates from the described topological patterns for further investigation including the following:
- Higher R3920 reflectance values (selected at random, aside from trying to select a pixel from each cluster):
- –
- Neuron 2350: [19, 112] (cluster 0), [29, 181] (cluster 1)
- –
- Neuron 2450: [32, 196] (cluster 1), [35, 134] (cluster 0)
- –
- Neuron 2458: [52, 287] (cluster 3), [61, 288] (cluster 3);
- Lower R3920 reflectance values (refer to Figure 20b—no pixels were selectively chosen from clusters as the clusters are drawn solely to highlight the locations of identified pixels, not density):
- –
- Neuron 2453: [100, 439], [227, 380]
- –
- Neuron 2455: [125, 557], [190, 566].
Figure 20.
Unprojected map views of FRT00007375 showing pixels identified by the SOM. (a) Pixels associated with neurons exceeding the 97.5th percentile of BD1300 values, with density-based clusters suggesting concentrated pixels of this spectral feature, along with other high-dimensional patterns, the SOM has clustered with neurons in this region of the SOM model. (b) Delineated clusters highlighting individual pixels from a sub-cluster within the broader bottom-left cluster shown in Figure 18a, characterised by lower R3920 values—this is not intended as a spatial distribution map but to draw attention to the location of these individual pixels. Images are 390 × 640 pixels, with the x-axis oriented vertically to match image rows.
This example illustrates how the SOM’s preservation of high-dimensional relationships within a two-dimensional projection can guide the identification of potential mineral groups and their spatial distribution. While this analysis cannot confirm the presence of specific minerals—requiring detailed mineralogical validation—it demonstrates that, under local scaling, the SOM identified a unique hyperspectral cluster within the target region. Recognising and preserving such clusters provides a powerful tool to guide subsequent mineral exploration.
In contrast, RGB overlays do not necessarily preserve multidimensional relationships unless directly captured by the three chosen components. In this region, for example, an MAF overlay (RGB composite using OLINDEX3, LCPINDEX2, and HCPINDEX2; see Figure A1) designed to visualise Fe-oxides and mafic silicate minerals (Figure 21) provides limited insight and fails to capture sub-patterns such as R3920 variations as presented here. The SOM, in contrast, preserves spectral relationships across the map, clearly delineating clusters of similar spectra and pointing toward areas for targeted investigation.
Figure 21.
Unprojected map view of FRT00007375 showing the MAF overlay, intended to highlight Fe-silicate regions (Figure A1). The overlay provides limited guidance compared to the SOM, which clearly identifies concentrated clusters of spectra potentially corresponding to Fe-oxides and mafic silicates (Figure 20). Image resolution is 390 × 640 pixels, with the x-axis oriented vertically to match image rows.
This capability underscores the potential of unsupervised machine learning to move beyond qualitative, visually driven methods for the mapping of mineral distributions, toward a quantitative, data-driven framework for automated planetary mineral exploration.
4.3. K-Means Clustering
Compared to the SOM model, k-means clustering is significantly more efficient at clustering instances. However, k-means lacks the topological organisation capabilities of a SOM model—a feature that proved useful when identifying Mg-carbonate in target region FRT00003E12 and in capturing unique spectra in target region FRT00007375. In the latter case, the sub-cluster of spectra with lower R3920 reflectance would likely be separated into a distinct cluster under k-means, losing its relationship to the primary cluster. This topological structure enables neighbouring neurons to group similar instances and appears to result in a more focused and specialised clustering. Across various cluster/neuron sizes, the SOM model:
- 1.
- Demonstrates stable clustering of instances identified as Mg-carbonate, with both SOM training iterations appearing to show reduced variation in clustering assignments (notably, the SOM results remain close to the mean and well within the confidence interval generated from 30 independent k-means trials; see Figure 22a);
- 2.
- Generates tighter clusters, as indicated by a mean weighted distance that is consistently and significantly lower than that of k-means (see Figure 22b);
- 3.
- Allocates more neurons to represent distinct and unique instances, capturing greater detail (see Figure 22c).
Figure 22.
Comparison of clustering performance between 30 independent k-means trials and two SOM model series (SOM and SOM ) across varying cluster/neuron counts. We present the mean performance as solid lines, and the shaded confidence interval for k-means represents the 5th and 95th percentiles across the 30 trials. We calculate error metrics using only clusters or neurons with correlation values with the summary products derived from the Mg-carbonate spectrum in the MRO CRISM Type Spectra Library above 0.75. (a) Total number of instances assigned to clusters (K-Means) or neurons (SOM) identified as representing Mg-carbonate. (b) Mean weighted distance, indicating the compactness and cohesion of clusters (K-Means) or neurons (SOM). (c) Total number of clusters (K-Means) or neurons (SOM) matching Mg-carbonate across tested model sizes. (d) The execution time for SOM across tested model sizes.
The execution time (runtime) of the two models is not directly comparable, as the SOM model includes a greater number of hyperparameters, which can significantly affect training time. In this study, we fixed the training at 1000 batches of 5000 sequentially updated instances per batch, an arguably excessive configuration. A key factor contributing to the extended runtime of the SOM is its requirement for sequential updating, which prevents parallel processing. All models were developed and executed using PyCharm Community Edition 2024.3.4 with Python 3.12.4 on a Windows 11 Pro system equipped with a 12th Generation Intel Core i7 processor (12 physical cores, 20 threads) and 32 GB of RAM. Under this architecture, model runtimes did not consistently scale with model size—in some cases, larger SOMs completed faster than smaller ones. This counterintuitive behaviour is likely due to external factors influencing performance during long training periods potentially related to PyCharm’s memory management, CPU cache utilisation, or other system-level resource allocation effects. For the models referred to as SOM in Figure 22, the training times are specified in Table 5 and displayed in Figure 22d.
Table 5.
SOM model execution time (hours and minutes) by number of neurons.
As shown in Figure 23 and Table 6 and Table 7, an SOM size of 30 × 30 (900 neurons) appears sufficient to cluster Mg-carbonate in this region, as the improvement in normalised Mean Weighted Distance (MWD) is minimal beyond this point. We note that the numerical gradient of the normalised MWD flattens beyond this point. The target image contained 460 × 640 pixels, of which 273,001 were valid (i.e., not equal to 65,535 in any layer), out of a total of 307,200. This suggests that the commonly used heuristic of neurons, where N is the number of valid pixels (see Section 3.4.1) may be excessive in this context.
Figure 23.
Performance trends in SOM clustering with larger SOM configurations. (a) Normalised MWD, calculated by dividing MWD by the number of neurons in each SOM configuration. (b) Gradient of the normalised MWD for SOM series #1 and SOM series #2 across larger SOM configurations, indicating that beyond a 30 × 30 SOM size, the incremental improvement in MWD becomes minimal.
Table 6.
Comparison of normalised mean weighted distance (MWD) and its gradient for SOM series #1 across different neuron counts.
Table 7.
Comparison of normalised mean weighted distance (MWD) and its gradient for SOM series #2 across different neuron counts.
In target region FRT00003E12, sufficient occurrences of Mg-carbonate–like spectra enabled the SOM to achieve fine separation across numerous clusters (Figure 12a). Illite in target region FRT0000634B provides a counterexample, where the SOM showed mixed results relative to k-means in clustering spectra correlated with illite. Neither method identified clusters exceeding a 0.70 correlation with the summary products derived from the illite spectrum in the MRO CRISM Type Spectra Library when the number of neurons or clusters was below 625. Beyond this threshold, the SOM consistently produced a four-neuron cluster (with only one exception across model sizes), whereas k-means did not reliably identify clusters until 1600 clusters were generated (Figure 24a–c), so confidence intervals are reported only from that point.
Figure 24.
Comparison of clustering performance between 30 independent k-means trials and two SOM model series (SOM #1 and SOM #2) across varying cluster/neuron counts. Mean performance is shown as solid lines. For k-means, the shaded region represents the 5th and 95th percentiles across the 30 trials. Illite was not reliably clustered by k-means at fewer than 1600 clusters. We calculate error metrics using only clusters or neurons with correlation with the summary products derived from the illite spectrum in the MRO CRISM Type Spectra Library above 0.70. (a) Total number of instances assigned to clusters (K-Means) or neurons (SOM) identified as representing illite. (b) Mean weighted distance, indicating the compactness and cohesion of clusters (K-Means) or neurons (SOM). (c) Total number of clusters (K-Means) or neurons (SOM) matching illite across tested model sizes. (d) The execution time for SOM #1 across tested model sizes.
Despite similar clustering performance, unlike k-means, the SOM readily provided the overall topological patterns that were supportive of illite (Figure 8) and did delineate a specific small cluster with elevated absorption of both and (amongst other parameters), which contained the verified instance of illite (Figure 8 and the snippet in Figure 9), but with only 26 higher quality illite pixels as clustered by neurons 2461–2464, there is not likely to be enough instances of illite to generate enough BMU hits to better delineate this cluster as occurred with Mg-carbonate.
5. Discussion
The objective of this research was to identify the spatial distribution of minerals on Mars using tools better suited to capture patterns in high-dimensional feature space. Our approach builds on existing dimensionality reduction methods, such as summary products and RGB overlays, but further reduces dimensionality to a two-dimensional grid while preserving high-dimensional relationships, which cannot be achieved by current methods. Our framework quantitatively delineates mineral distributions, forming topology-preserving clusters from the full spectral data in an unsupervised manner.
This research has demonstrated that unsupervised models can identify geologically meaningful patterns in processed hyperspectral data, with resulting clusters that can be partially labelled by referencing spectral libraries such as the MRO CRISM Type Spectra Library. This capability helps bridge a gap in extraterrestrial mineral exploration and identification where validation data are limited, such as on Mars. Our findings show that the SOM model is well-suited for the mapping of mineralogical features. It not only detects regions associated with target minerals but also provides a topological structure that aids in interpreting mineral relationships. For example, analysis of target area FRT00003E12 highlighted the model’s ability to isolate Mg-carbonate as a distinct mineral cluster within a region overwhelmingly dominated by Fe-/Mg-Olivine (see Section 4.2). In target region FRT00007375, the SOM guided us to a cluster consistent with Fe-olivine (Section Moving Beyond RGB Overlays and into High-Dimensional Clustering), including a sub-cluster with lower R3920 reflectance that may correspond to weathered variants of the same mineral. We also identified a potential pattern where unique spectra representing minerals tend to be clustered by neurons located toward the edges of the SOM and exhibit relatively low hit rates. These observations may be useful for identifying unknown minerals in future research and are consistent with the competitive learning strategy employed by an SOM model (Section 2.3.2).
In this study, we identified several areas where further research is needed to enhance the accuracy and reliability of mineral detection and spatial distribution analysis on Mars using CRISM hyperspectral data:
- This study utilised the set of summary products outlined in Figure A1 for reasons discussed in Section 3.2; however, significant spectral overlap remains among many minerals, which limits the ability of clustering algorithms to differentiate between compositionally similar but distinct mineral types. Summary products highlight key absorption features for mineral presence but should also capture reflectance features to identify non-mineral-specific spectral signatures.
- An expert review of the formulae to compute summary products is warranted, as current versions rely on conflicting or overlapping definitions from multiple sources. For example, in some instances, the same summary product has a different formula specified in [14] compared to [33]. Although this research addressed several known issues (refer to Appendix B), uncertainty remains as to whether some summary products accurately capture the intended spectral features. This ambiguity risks leading clustering algorithms to identify misleading or spurious patterns, ultimately undermining the reliability of the results.
- Ideally, a machine learning approach to deriving summary products by automatically identifying key spectral features may be a preferred approach compared to expert review. Unlike traditional, expert-driven methods, which are time-consuming and subjective and may miss complex patterns [13], machine learning methods should be capable of generating summary products that overcome these shortcomings.
- Normalising hyperspectral data using a representative reference (or “bland”) pixel within each target region has the potential to enhance diagnostic absorption features, a technique widely used in CRISM-based analyses [14,29]. In this study, summary products were derived directly from un-normalised TER-3 hyperspectral data. While this approach simplifies preprocessing, it may reduce the clarity of absorption features and hinder mineral differentiation. Future work should explore automated methods to reliably identify representative bland pixels within each region to facilitate normalisation.
- For each target region, summary product values were scaled locally, i.e., normalised between 0 and 1 based on the minimum and maximum values within each target region. Although this method is effective for within-region clustering, it introduces inconsistencies when comparing results across multiple regions. For instance, two regions may appear to contain similar mineral signatures when scaled locally, but global scaling could reveal significant differences. Summary products derived from hyperspectral data from the MRO CRISM Type Spectra Library should be scaled according to the same global limits.
- We used several approaches to automatically identify neurons that could represent certain minerals. We addressd some of the existing limitations, including the local scaling and the reliability of certain summary products. Future research could focus on refining the post-processing of SOM outputs for more rigorous, reliable, and automated mineral identification.
- Our results show that the SOM model organises pixels into stable topological patterns, often clustering similar mineral types in nearby neurons. This raises the possibility of accelerating SOM model training by initialising weights based on an informed prior structure. Further investigation into pre-initialisation techniques may improve training efficiency and output quality.
- A promising avenue for future work involves training a global SOM model exclusively on verified mineral spectra, independent of any specific region. This “Master SOM model” could then be deployed to new Martian sites for rapid classification and mapping. Such a model would enhance consistency, reduce preprocessing time, and allow for more direct comparisons between studies and across target areas. Due to the global nature of this approach, normalised hyperspectral data would be required before deriving summary products.
- A critical challenge in planetary mineralogy is the identification of pixels with spectral profiles that do not closely match any known hyperspectral samples. Developing a systematic approach to detect and flag distinct but unknown spectral signatures might be important for the discovery of new or unknown mineral types. The existing SOM model will delineate such clusters; however, in an automated framework, it can try and label that cluster as the closest match in the MRO CRISM Type Spectra Library.
6. Conclusions
Our study advances the understanding of Martian mineral detection by demonstrating how unsupervised learning with a SOM model can reveal both the spatial distribution and hierarchical organisation of mineral deposits from remote sensing hyperspectral data. Our results show that compared to k-means clustering, the SOM model allocates more resources to unique spectra, producing tighter and more distinct clusters. This improvement stems from a defining feature of the SOM model and its ability to preserve topological relationships, which allows subtle differences among similar spectral signatures to be maintained. Our results show that the SOM model retains complex relationships across the full feature space, providing richer structural and visual detail. This is an advancement when compared to RGB overlay methods, which capture only a subset of high-dimensional patterns. Although more computationally intensive, our SOM-based framework represents a step toward scalable and interpretable mapping of Martian minerals, offering a valuable tool for future planetary mineral classification and exploration efforts.
Author Contributions
T.L. was responsible for feature selection, development of the Self-Organising Map (SOM) model, data acquisition and preprocessing, and the implementation of all Python code. He also drafted and co-authored the technical sections of the manuscript, including the Methodology, Results, and related analyses. R.C. contributed to the conceptualisation of the study, project supervision, experimental design, data analysis, and editing of the final manuscript draft. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable, as this study did not involve humans or animals.
Data Availability Statement
Original CRISM TER-3 data, processed summary product outputs, and SOM model outputs are available for download via Zenodo: Target region FRT00003E12—https://zenodo.org/records/16397494 (accessed on 15 October 2025); Target region FRT000064D9—https://zenodo.org/records/16397520 (accessed on 15 October 2025); Target region FRT0000634B—https://zenodo.org/records/16397583 (accessed on 15 October 2025); Target region FRT0000737—https://zenodo.org/records/17156526 (accessed on 15 October 2025). Python code, including visual overlay generation and topological SOM analysis is available on GitHub at: https://github.com/sydney-machine-learning/mars-minerals/releases/tag/v2.0.0 (accessed on 15 October 2025).
Acknowledgments
We thank Haoyu Sun from UNSW Sydney for earlier contributions. We thank Ehsan Farahbakhsh from the University of Sydney During the preparation of this study, the authors used the public versions of ChatGPT-4 and ChatGPT-5 to assist with some phrasing refinements; the generation of some figure and table captions; and the re-formatting of figures, tables, and equations per MDPI guidelines. The authors have reviewed and edited the output and take full responsibility for the content of this publication.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| BMU | Best Matching Unit |
| CNN | Convolutional Neural Network |
| CRISM | Compact Reconnaissance Imaging Spectrometer for Mars |
| DBSCAN | Density-Based Spatial Clustering of Applications with Noise |
| DPSIS | Data Product Summary Image Specification |
| EDR | Experiment Data Record |
| ICA | Independent Component Analysis |
| LMM | Linear Mixing Model |
| MNF | Minimum Noise Fraction |
| MRO | Mars Reconnaissance Orbiter |
| MTRDR | Map-Projected Targeted Reduced Data Record (CRISM) |
| MWD | Mean Weighted Distance |
| nm | Nanometre |
| PCA | Principal Component Analysis |
| SOM | Self-Organising Map |
| TER | Targeted Empirical Record |
| TER-3 | Targeted Empirical Record V ersion 3 |
| TRDR | Targeted Reduced Data Record |
| VNIR | Visible-to-Near-Infrared |
Appendix A. Visualisation of Mineral Locations
The figures and captions that follow are reproduced from [14].
Figure A1.
Summary product sensitivity matrices for each type spectra class (A–F). The top set of rows of each class show the response of analog laboratory material and the bottom set of rows show the corresponding response of CRISM MICA type spectra for each spectral parameter (columns). Each parameter is linearly scaled and has a fixed lower stretch limit of zero (black indicates a band depth at or below zero) and an upper limit of the highest response of the distribution (white). To the right of each matrix is the representative browse product RGB(s) (see Table 2 from Viviano et al. [14] for abbreviations) for that class set.
Figure A2.
A detailed example of summary and browse product development. The top row shows (A) CRISM MTRDR scene FRT0000B385 in false color stepped through (B) the old HYD browse product with a “global” stretch applied, (C) the same HYD product with the new “local” stretch applied, (D) the new parameter formulations for the revised HYD product, and (E) the new HYD product after noise filtering results. (B–E) include a zoom-up box of a small region illustrating fine-scale changes that occur with improved stretching logic, refined parameter formulations, and noise filtering. The bottom row shows (F) CRISM MTRDR scene FRT000064D9 in false color stepped through (G) the old MAF browse product with global stretch applied, (H) the new MAF product with local stretch and noise filtering, and (I) extracted and ratioed CRISM I/F spectrum from a particular location in this scene (white arrow, (H)) and comparable laboratory spectra (RELAB) of high-Ca pyroxene (HCP) material. The old HCPINDEX displays no detection for this material ((G), blue channel), whereas the new HCPINDEX2 shows a strong detection of HCP ((H), blue channel). Figure and caption reproduced from [14].
Figure A3.
A particularly diverse CRISM scene (FRT0000634B) revealing the utility of new summary product formulations. Although (A) the VNIR color stretch of this scene is somewhat homogenous, there are at least (B) seven type spectra represented in the image (locations of extracted spectra shown in subsequent panels with arrows). While (C) the standard PHY browse product reveals some hydroxylated silicate diversity, (D–F) custom combinations of new summary products are able to visually distinguish subtle spectral differences in the scene. Figure and caption reproduced from [14].
Appendix B. Summary Products
Table A1 presents the 29 recalculated CRISM summary products used in this study.
Table A1.
Summary product utilised.
Table A1.
Summary product utilised.
| OLINDEX3 | BD1300 | LCPINDEX2 | HCPINDEX2 |
| BD1400 | BD1435 | BD1500_2 | BD1750_2 |
| BD1900_2 | BD1900R2 | BD2100_2 | BD2165 |
| BD2190 | MIN2200 | BD2210_2 | D2200 |
| BD2230 | BD2250 | MIN2250 | BD2265 |
| BD2290 | D2300 | BD2355 | SINDEX2 |
| MIN2295_2480 | MIN2345_2537 | BD2500_2 | CINDEX2 |
| R3920 |
With reference to the CRISM DPSIS [33], the following corrections were applied either to align with the summary product outputs contained in the MTRDR files or to amend discrepancies in published summary products that did not conform to the DPSIS definitions.
- OLINDEX3—The following kernel widths are used: R1210:7, R1250:7, R1263:7, R1276:7, R1330:7 anchored at R1750:7, and R1862:7. The parameter is calculated as 0.1 × RB1210 + 0.1 × RB1250 + 0.2 × RB1263 + 0.2 × RB1276 + 0.4 × RB1330.
- HCPINDEX2—The anchor points are R1690:7 and R2530:7, and the parameter is calculated as 0.1 × RB2120 + 0.1 × RB2140 + 0.15 × RB2230 + 0.3 × RB2250 + 0.2 × RB2430 + 0.15 × RB2460
- BD1900R2—A kernel width of 5 is used for anchor bands R1850 and R2060.
- BD2100_2—The kernel width of R2250:3 is used in place of R2250:5.
- MIN2200—The kernel width of R2210:3 is used in place of R2120:3
- BD2230—The kernel width of R2235:3 is used in place of R2230:3
- BD2265—The kernel width of R2340:5 is used in place of R2295:5.
- R3920 is not specifically a summary product but is recommended by Viviano et al. [14] for the IC2 Browse Product and was calculated using a kernel width of 5.
Further details regarding the discrepancies identified in some of these summary products are discussed in the PDS Geosciences Node Community at the following URL: https://geoweb.rsl.wustl.edu/community/index.php?/topic/3761-calculation-of-bd1900_2-in-mtrdr-data/ (accessed on 18 October 2025).
Appendix C. SOM Updates
The decay (), learning rate (), and radius update () functions used in our SOM models are expressed as follows (with T representing the total number of iterations and t a specific iteration):
The initial parameters for a SOM model were chosen to be and . These initial parameters ensure that a strong topological organisation occurs during the early stages of training. As recommended in the literature, an initial radius that is half of the largest dimension of the array is recommended [45]. In this case, a radius of 25 results in half of all neurons in the model being updated for each sequentially trained instance. Furthermore, an initial learning rate of 0.5 causes neuron weights to change rapidly. As the radius and learning rate decrease over time, the long-term impact of large initial values is mitigated; however, starting values that are too small can cause the SOM model to get stuck in a ”metastable” configuration [45] preventing the SOM from learning and achieving meaningful topological organisation, which the model may not recover from in subsequent learning. The value of in Equation (1) was chosen to ensure that approximately 25%—or 250 iterations—of formal training had a radius of less than 1, meaning 75%—or approximately 750 iterations—of the training involved updating the neurons around the BMU.
Appendix D. Data
With reference to the GitHub repository linked in the Data Availability Statement, all results can be reproduced using the original CRISM TER-3 data by following the steps outlined below. Alternatively, both the CRISM TER-3 data and corresponding SOM outputs for the target regions analysed in this study are available for direct download from Zenodo, as also detailed in Section Data Availability Statement. The following steps (Appendix D, Appendix E and Appendix F) outline how to reproduce the results using our Python code and the original CRISM TER-3 data for a selected target region.
To begin, navigate to the MRO_Spectra_Library_Conversion folder and run the CRISM_Replication_Library.py script to process MRO CRISM Spectra Library Data.
Download CRISM TER-3 Image Files for a Target Region
Follow these steps to download the necessary CRISM TER-3 data for a specific region:
- 1.
- Visit https://ode.rsl.wustl.edu/mars/productsearch (accessed on 18 October 2025).
- 2.
- In Step 1, select TER as the product type.
- 3.
- In Step 2, define the target region by entering the appropriate Product ID, such as FRT00003E12* for target region FRT00003E12.
- 4.
- Review the search results.
- 5.
- Download all files containing "if" in the file name, such asfrt0000cbe5_07_if166j_ter3.img.
After downloading the TER-3 data, follow these steps to convert the data into summary products:
- 1.
- Consolidate all downloaded TER-3 files into a single folder. There should be an img (500+ mb), lbl, and hdr file in the same folder.
- 2.
- Open the CRISM_Spectral_Conversion.py script located in theCRISM_Spectral_Conversion folder.
- 3.
- Specify the path to the TER-3 .img file and set the desired output directory in the script.
- 4.
- Execute the script.
Note: This process may take several hours to complete.
Appendix E. SOM
To train the SOM, follow the steps below. Summary product data must first be prepared by either generating it according to the procedure outlined in Appendix D or by using the preprocessed summary product files provided for a target region (see the Section Data Availability Statement).
- 1.
- Open the som_training.py script located in the SOM_Training folder.
- 2.
- Set the path to the .h5 file, which contains preprocessed summary products for the target region.
- 3.
- Configure the SOM training parameters as necessary.
- 4.
- Run the script to train the SOM.
Note: This process may take many hours for a 50 × 50 SOM with 1000 iterations. However, training can be paused and resumed thanks to built-in functionality within the SOM class (as defined in som_class.py). The class accepts the inputs (int_iter and int_weights), which allow training to resume from a specific iteration (int_iter) using the corresponding set of neuron weight vectors (int_weights). During training, the model is automatically saved at every 10th iteration.
Appendix F. Visualisation of Mineral Locations
The following steps outline the process for visualising the spatial distribution of specific minerals on the Martian surface. SOM data must first be prepared either by generating it using the procedure described in Appendix E or by using the provided SOM output files for a target region (see the Section Data Availability Statement).
- 1.
- Open the Layer_Visuals.py script located in the SOM_Analysis folder.
- 2.
- Specify the relevant data sources (including the original TER-3 image file—required for False views), which should be familiar from previous steps.
- 3.
- Under the section titled Identify Spatial Distribution of a Specific Mineral, configure the settings for your analysis.
- 4.
- The output will be a false-colour or custom image displaying the identified spatial distribution of the nominated mineral on the Martian surface and target pixel location if you specify such a target.
Appendix F.1. SOM Hierarchical Analysis
For hierarchical analysis of the SOM model, perform the following steps:
- 1.
- Open the SOM_analysis.py script located in the SOM_Analysis folder.
- 2.
- Specify the relevant data sources from the previous steps (excluding the original TER-3 image file and summary product outputs contained in the h5 file).
- 3.
- Run the script to perform a comprehensive topological analysis of the SOM model.
Appendix F.2. K-Means Comparison
To perform a k-means comparison, follow these steps:
- 1.
- Ensure the SOM model is trained before running this process, as the inputs (reshaped_data_loc and reshaped_locations_loc) are generated during the SOM training.
- 2.
- Open the kmeans.py script located in the KMeans folder.
- 3.
- Specify the relevant data locations and mineral information.
- 4.
- Run the script to perform the K-means comparison.
References
- Murchie, S.; Arvidson, R.; Bedini, P.; Beisser, K.; Bibring, J.P.; Bishop, J.; Boldt, J.; Cavender, P.; Choo, T.; Clancy, R.; et al. Compact reconnaissance imaging spectrometer for Mars (CRISM) on Mars reconnaissance orbiter (MRO). J. Geophys. Res. Planets 2007, 112. [Google Scholar] [CrossRef]
- Li, S.; Song, W.; Fang, L.; Chen, Y.; Ghamisi, P.; Benediktsson, J.A. Deep learning for hyperspectral image classification: An overview. IEEE Trans. Geosci. Remote Sens. 2019, 57, 6690–6709. [Google Scholar]
- Huang, Y.; Zhou, X.; Xi, B.; Li, J.; Kang, J.; Tang, S.; Chen, Z.; Hong, W. Diverse-region hyperspectral image classification via superpixelwise graph convolution technique. Remote Sens. 2022, 14, 2907. [Google Scholar] [CrossRef]
- Chen, Z.; Yang, H.; Liu, Q.; Liu, Y.; Zhu, M.; Liang, X. Deep Learning for Hyperspectral Image Classification: A Critical Evaluation via Mutation Testing. Remote Sens. 2024, 16, 4695. [Google Scholar]
- Ehlmann, B.L.; Edwards, C.S. Mineralogy of the Martian surface. Annu. Rev. Earth Planet. Sci. 2014, 42, 291–315. [Google Scholar] [CrossRef]
- Wu, X.; Mustard, J.; Tarnas, J.; Zhang, X.; Das, E.; Liu, Y. Imaging Mars analog minerals’ reflectance spectra and testing mineral detection algorithms. Icarus 2021, 369, 114644. [Google Scholar] [CrossRef]
- Pelkey, S.; Mustard, J.; Murchie, S.; Clancy, R.; Wolff, M.; Smith, M.; Milliken, R.; Bibring, J.P.; Gendrin, A.; Poulet, F.; et al. CRISM multispectral summary products: Parameterizing mineral diversity on Mars from reflectance. J. Geophys. Res. Planets 2007, 112. [Google Scholar] [CrossRef]
- Pieters, C.M.; Boardman, J.; Buratti, B.; Chatterjee, A.; Clark, R.; Glavich, T.; Green, R.; Head, J., III; Isaacson, P.; Malaret, E.; et al. The Moon mineralogy mapper (M3) on chandrayaan-1. Curr. Sci. 2009, 96, 500–505. [Google Scholar]
- Li, S.; Moriarty, D.P., III; Pieters, C.M.; Klima, R.L.; Dapremont, A.M. A controlled mosaic of moon mineralogy mapper (M3) reflectance data in the lunar polar regions for understanding the mineralogy and water of the Artemis exploration zone. Icarus 2025, 440, 116668. [Google Scholar]
- NASA. Artemis—Humans in Space. 2025. Available online: https://www.nasa.gov/humans-in-space/artemis/ (accessed on 22 July 2025).
- Bedini, E. The use of hyperspectral remote sensing for mineral exploration: A review. J. Hyperspectral Remote Sens. 2017, 7, 189–211. [Google Scholar]
- Thomas, M.; Walter, M.R. Application of Hyperspectral Infrared Analysis of Hydrothermal Alteration on Earth and Mars. Astrobiology 2004, 2. [Google Scholar] [CrossRef]
- Gao, A.F.; Rasmussen, B.; Kulits, P.; Scheller, E.L.; Greenberger, R.; Ehlmann, B.L. Generalized unsupervised clustering of hyperspectral images of geological targets in the near infrared. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 19–25 June 2021; pp. 4294–4303. [Google Scholar]
- Viviano, C.E.; Seelos, F.P.; Murchie, S.L.; Kahn, E.G.; Seelos, K.D.; Taylor, H.W.; Taylor, K.; Ehlmann, B.L.; Wiseman, S.M.; Mustard, J.F.; et al. Revised CRISM spectral parameters and summary products based on the currently detected mineral diversity on Mars. J. Geophys. Res. Planets 2014, 119, 1403–1431. [Google Scholar] [CrossRef]
- Shirmard, H.; Farahbakhsh, E.; Müller, R.D.; Chandra, R. A review of machine learning in processing remote sensing data for mineral exploration. Remote Sens. Environ. 2022, 268, 112750. [Google Scholar] [CrossRef]
- Shirmard, H.; Farahbakhsh, E.; Heidari, E.; Beiranvand Pour, A.; Pradhan, B.; Müller, D.; Chandra, R. A comparative study of convolutional neural networks and conventional machine learning models for lithological mapping using remote sensing data. Remote Sens. 2022, 14, 819. [Google Scholar] [CrossRef]
- Peyghambari, S.; Zhang, Y. Hyperspectral remote sensing in lithological mapping, mineral exploration, and environmental geology: An updated review. J. Appl. Remote Sens. 2021, 15, 031501. [Google Scholar] [CrossRef]
- Ke, T.; Zhong, Y.; Song, M.; Wang, X.; Zhang, L. Mineral detection based on hyperspectral remote sensing imagery on Mars: From detection methods to fine mapping. ISPRS J. Photogramm. Remote Sens. 2024, 218, 761–780. [Google Scholar] [CrossRef]
- Yoldi, Z.; Pommerol, A.; Poch, O.; Thomas, N. Reflectance study of ice and Mars soil simulant associations—I. H2O ice. Icarus 2021, 358, 114169. [Google Scholar] [CrossRef]
- Wellington, D.; Bell, J., III. Patterns of surface albedo changes from Mars reconnaissance orbiter Mars color imager (MARCI) observations. Icarus 2020, 349, 113766. [Google Scholar] [CrossRef]
- Ma, Y.; Chen, S.; Ermon, S.; Lobell, D.B. Transfer learning in environmental remote sensing. Remote Sens. Environ. 2024, 301, 113924. [Google Scholar] [CrossRef]
- Zhang, T.; Gao, P.; Dong, H.; Zhuang, Y.; Wang, G.; Zhang, W.; Chen, H. Consecutive pre-training: A knowledge transfer learning strategy with relevant unlabeled data for remote sensing domain. Remote Sens. 2022, 14, 5675. [Google Scholar] [CrossRef]
- Agarwal, P.; Skupin, A. Self-Organising Maps: Applications in Geographic Information science; John Wiley & Sons: Chichester, West Sussex, UK, 2008. [Google Scholar]
- Vesanto, J.; Alhoniemi, E. Clustering of the self-organizing map. IEEE Trans. Neural Netw. 2000, 11, 586–600. [Google Scholar] [CrossRef]
- Danielsen, A.S.; Johansen, T.A.; Garrett, J.L. Self-organizing maps for clustering hyperspectral images on-board a cubesat. Remote Sens. 2021, 13, 4174. [Google Scholar] [CrossRef]
- Iranzad, A. Hyperspectral Mineral Identification Using SVM and SOM. Master’s Thesis, Brock University, St. Catharines, ON, Canada, 2013. [Google Scholar]
- Raatikainen, M.; Sarala, P.; Ranta, J.P. Self-organizing map modelling and prospectivity mapping of surface geochemical data in Au and multi-metal mineral exploration: Example from northern Finland. Geochem. Explor. Environ. Anal. 2025, 25, geochem2024-055. [Google Scholar] [CrossRef]
- Liu, H.; Wang, Y.; Wen, S.; Zhang, S.; Zhu, K.; Liu, J. Comprehensive Evaluation of the Lunar South Pole Landing Sites Using Self-Organizing Maps for Scientific and Engineering Purposes. Remote Sens. 2025, 17, 1579. [Google Scholar] [CrossRef]
- Viviano-Beck, C.E.; Seelos, F.P.; Murchie, S.L.; Kahn, E.G.; Seelos, K.D.; Taylor, H.W.; Taylor, K.; Ehlmann, B.L.; Wiseman, S.M.; Mustard, J.F.; et al. MRO CRISM Type Spectra Library. NASA Planetary Data System. 2015. Available online: https://crismtypespectra.rsl.wustl.edu (accessed on 15 October 2025).
- Seelos, F.P.; Seelos, K.D.; Murchie, S.L.; Novak, M.A.M.; Hash, C.D.; Morgan, M.F.; Arvidson, R.E.; Aiello, J.; Bibring, J.P.; Bishop, J.L.; et al. The CRISM investigation in Mars orbit: Overview, history, and delivered data products. Icarus 2024, 419, 115612. [Google Scholar] [CrossRef]
- Parente, M.; McGuire, P.C.; Bishop, J.L.; Murchie, S.L. Analysis of the effects of spectral mixture resolution on the detection of phyllosilicates on Mars using Mars Express OMEGA and Mars Reconnaissance Orbiter CRISM data. Planet. Space Sci. 2008, 56, 1009–1020. [Google Scholar]
- Alemanno, G. Spectral Parameters, Solar System Planets. In Encyclopedia of Astrobiology; Springer: Berlin/Heidelberg, Germany, 2023; pp. 2814–2815. [Google Scholar]
- Murchie, S.; Stein, T.; Ward, J. Mars Reconnaissance Orbiter: CRISM Data Product Software Interface Specification, Version 1.3.7.7; Technical report, PDS Geosciences Node; Washington University in St. Louis and Johns Hopkins University Applied Physics Laboratory: Laurel, MD, USA, 2024. [Google Scholar]
- Barve, S.; Webster, J.M.; Chandra, R. Reef-insight: A framework for reef habitat mapping with clustering methods using remote sensing. Information 2023, 14, 373. [Google Scholar] [CrossRef]
- Wan, S.; Wang, Y.P. The comparison of density-based clustering approach among different machine learning models on paddy rice image classification of multispectral and hyperspectral image data. Agriculture 2020, 10, 465. [Google Scholar] [CrossRef]
- Keshava, N.; Mustard, J.F. Spectral unmixing. IEEE Signal Process. Mag. 2002, 19, 44–57. [Google Scholar] [CrossRef]
- Hong, D.; Yokoya, N.; Chanussot, J.; Zhu, X.X. An augmented linear mixing model to address spectral variability for hyperspectral unmixing. IEEE Trans. Image Process. 2018, 28, 1923–1938. [Google Scholar] [CrossRef] [PubMed]
- Haijen, X.; Koirala, B.; Tao, X.; Scheunders, P. A Two-step Linear Mixing Model for Unmixing under Hyperspectral Variability. arXiv 2025, arXiv:2502.17212. [Google Scholar] [CrossRef]
- Zou, S.; Zare, A. Hyperspectral unmixing with endmember variability using partial membership latent dirichlet allocation. In Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 6200–6204. [Google Scholar]
- Zhang, W.; Wang, J.; Jin, D.; Oreopoulos, L.; Zhang, Z. In Proceedings of the 2018 IEEEA Deterministic Self-Organizing Map Approach and Its Application on Satellite Data Based Cloud Type Classification. International Conference on Big Data (Big Data), Seattle, WA, USA, 10–13 December 2018; pp. 2027–2034. [Google Scholar]
- Jayashree, S.; Maya, K.V.; Indira, K.; Dinesh, P. Self Organizing Map based Land Cover Clustering for Decision-Level Jaccard Index and Block Activity based Pan-Sharpened Images. J. Indian Soc. Remote Sens. 2025, 53, 549–569. [Google Scholar] [CrossRef]
- Cracknell, M.; Reading, A.M.; De Caritat, P. Multiple influences on regolith characteristics from continental-scale geophysical and mineralogical remote sensing data using Self-Organizing Maps. Remote Sens. Environ. 2015, 165, 86–99. [Google Scholar] [CrossRef]
- Li, H.; Qu, K.; Zhou, J. Reconstructing sound speed profile from remote sensing data: Nonlinear inversion based on self-organizing map. IEEE Access 2021, 9, 109754–109762. [Google Scholar] [CrossRef]
- Shalaginov, A.; Franke, K. A new method for an optimal som size determination in neuro-fuzzy for the digital forensics applications. In Proceedings of the Advances in Computational Intelligence: 13th International Work-Conference on Artificial Neural Networks, IWANN 2015, Palma de Mallorca, Spain, 10–12 June 2015; Proceedings, Part II 13. Springer: Cham, Switzerland, 2015; pp. 549–563. [Google Scholar]
- Kohonen, T. Self-Organizing Maps, 3rd ed.; Springer Series in Information Sciences; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).