To comprehensively evaluate the performance and applicability of the proposed mineral hyperspectral unmixing method, an experimental framework incorporating both simulated datasets and real-world observational datasets was constructed. The simulated datasets provide benchmark scenarios with known endmember signatures and abundance ground truth, enabling systematic assessment of algorithm accuracy and robustness under varying noise levels, mixing mechanisms, and endmember variability conditions. In contrast, real hyperspectral datasets are employed to validate the applicability and stability of the proposed method under realistic spectral characteristics, surface complexity, and sensor noise conditions.
2.1.1. Simulated Dataset
A hyperspectral simulation framework was constructed using mineral spectra from the ENVI 5.6 USGS spectral library, covering the wavelength range of 0.4–2.5 μm, to evaluate the proposed method under controlled conditions. The simulated mineral set includes amphibole, chlorite, epidote, kaolinite, muscovite, serpentine, and zoisite, which together represent a diverse group of common alteration and rock-forming minerals relevant to hydrothermal environments.
To define class-level reference spectra, all spectral samples belonging to each mineral category were first grouped according to their mineral names, and an L2-medoid was selected as the representative spectrum of each class. This procedure ensures that the reference endmembers correspond to physically measured spectra rather than synthetic averages, thereby preserving realistic spectral shapes while providing a stable spectral baseline for subsequent simulation and evaluation. This concept originates from the Partitioning Around Medoids (PAM) clustering framework proposed by Rousseeuw and Kaufman [
14], ensuring internal consistency within each class while providing a stable spectral baseline for subsequent mixing behavior analysis.
To generate abundance maps, a spatially correlated Dirichlet random field, which has been widely adopted as a prior model for abundance vectors in hyperspectral unmixing and Bayesian mixture modeling frameworks [
15,
16], was used so that the abundance vectors naturally satisfy non-negativity and sum-to-one constraints while preserving local spatial continuity. Gamma sampling was first performed on a coarse spatial grid, followed by interpolation and smoothing to introduce spatial correlation and generate abundance fields with realistic regional structures. A top-k sparsification strategy was then introduced to limit the number of active mineral components per pixel, and a proportion of pure pixels was additionally injected to better approximate natural scenes dominated by one or a few primary components. The final abundance vectors were normalized pixel-wise, ensuring their suitability as direct inputs for the subsequent physical spectral mixing models.
To simulate intra-class endmember variability, one spectral instance from each mineral class was randomly sampled at each pixel, yielding pixel-wise endmember perturbations associated with mineralogical variability, compositional differences, grain-size variation, and surface-condition effects. This design enables the simulated dataset to more realistically represent the spectral heterogeneity commonly observed in natural mineral assemblages.
Mixed spectra were then generated using two alternative mixing mechanisms: (1) a linear mixing model in reflectance space and (2) a Hapke-based nonlinear model in the single-scattering albedo domain. In the linear mixing model, the observed reflectance is computed as the abundance-weighted linear combination of endmember reflectance spectra. In contrast, the Hapke-based model first performs linear mixing in the single-scattering albedo domain, followed by forward modeling to recover reflectance, thereby more realistically representing the nonlinear scattering behavior of particulate media. The bidirectional reflectance factor (BRF) is expressed as:
where
r denotes the bidirectional reflectance,
w is the single-scattering albedo,
μ0 and
μ are the cosines of the incidence and emergence angles, respectively,
g is the phase angle,
B(
g) represents the opposition effect function,
p(
g) is the particle phase function, and
H(
μ0) and
H(
μ) are the multiple-scattering functions [
17].
Based on radiative transfer theory, the Hapke model integrates single-scattering albedo, phase functions, multiple scattering approximations, and surface roughness corrections to describe the scattering and reflection behavior of particulate surfaces. Its core assumption is that the radiance received by the sensor arises not only from single scattering but also from multiple inter-particle interactions and shadowing effects, allowing for a more realistic interpretation of the optical properties observed in hyperspectral imagery. Consequently, the Hapke model is widely regarded as one of the representative physical nonlinear mixing models in hyperspectral unmixing studies and is particularly suitable for explaining nonlinear effects induced by multiple scattering [
15,
18].
Following spectral mixing, the framework provides PCA (Principal Component Analysis)-based RGB visualization to facilitate qualitative validation of spatial structures and spectral characteristics.
Figure 1 shows the overall simulated mineral abundance distribution, while
Figure 2 presents the abundance distributions of individual mineral endmembers and their corresponding spectral curves in the simulated data. The final output is stored in .mat format and includes the mixed hyperspectral data cube, abundance maps, endmember instance indices, class-wise endmember sets, central endmember spectra, wavelength information, full width at half maximum (FWHM), and complete generation parameters. This comprehensive data structure enables direct application to tasks such as endmember extraction, spectral unmixing, and nonlinear mixing analysis, providing a controllable and reproducible experimental environment for algorithm evaluation and theoretical investigation.
2.1.2. Real-World Datasets
To evaluate the proposed method under realistic mineral exploration conditions, an airborne hyperspectral dataset acquired by the Shortwave Airborne Spectral Imager (SASI; ITRES, Calgary, AB, Canada) over the Liuyuan region, Gansu Province, China, was used. The SASI sensor covers the shortwave infrared region from approximately 950 to 2450 nm with 144 contiguous bands, which is suitable for detecting diagnostic absorption features of hydroxyl-bearing and carbonate minerals.
The study area is located in the Fangshankou–Liuyuan region of Guazhou County, Jiuquan City, Gansu Province, China (95°10′–95°46′E, 41°08′–41°16′N). The region is characterized by arid climatic conditions, sparse vegetation cover, and extensive rock exposure, all of which are favorable for airborne mineral mapping. Geologically, the area contains abundant metallic and non-metallic mineral occurrences and exhibits strong lithological and structural complexity, making it a representative testbed for mineral hyperspectral unmixing in natural scenes [
19,
20,
21].
Because the study area is characterized by sparse vegetation and extensive rock exposure, the real-data experiment primarily evaluates the proposed method under exposed or weakly vegetated geological surface conditions. The effects of dense green or dry vegetation cover were not explicitly considered in the present analysis.
The airborne SASI imagery used in this study has a spatial resolution of 2.25 m. After removing spectral bands severely affected by water-vapor absorption and low signal quality, 29 effective bands were retained for analysis. These retained bands preserve the principal spectral characteristics of the target minerals while reducing noise contamination.
For real-data validation, muscovite and carbonate were selected as representative target minerals because they are widely distributed in the study area and because corresponding field and laboratory reference information is available. Field and laboratory identifications from samples DWHNG-01, DWHNG-03, DWHNG-08, DWHNG-09, LY-18-1, LY-18-2, and LY-18-3 were used as external evidence to assess the geological plausibility of the estimated endmembers and abundance maps. It should be noted that no quantitative abundance ground truth is available for the real dataset; therefore, the real-data evaluation focuses primarily on spectral consistency and geological plausibility rather than strict abundance-retrieval accuracy.