Spatial–Spectral Fusion 3D Signal Compensation for Moon Mineralogy Mapper (M3) Hyperspectral Images in Low-Signal Lunar Polar Regions

Ni, Rui; Meng, Tingyu; Zhao, Fei; Dang, Yanan; Zhang, Wenbin; Lu, Pingping

doi:10.3390/rs18050682

Open AccessArticle

Spatial–Spectral Fusion 3D Signal Compensation for Moon Mineralogy Mapper (M3) Hyperspectral Images in Low-Signal Lunar Polar Regions

by

Rui Ni

^1,2

,

Tingyu Meng

^1,*

,

Fei Zhao

^1,2

,

Yanan Dang

¹

,

Wenbin Zhang

^1,2 and

Pingping Lu

^1,2

¹

National Key Laboratory of Microwave Imaging, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China

²

School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2026, 18(5), 682; https://doi.org/10.3390/rs18050682

Submission received: 5 January 2026 / Revised: 10 February 2026 / Accepted: 12 February 2026 / Published: 25 February 2026

(This article belongs to the Special Issue Advances in Scene Understanding with Hyperspectral Remote Sensing: From Data Benchmarks to Applications)

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

We propose SSF-3DSC, a lunar-tailored spatial–spectral fusion 3D signal-compensation network integrating a spectral compensation module (SCM), multi-scale spatial attention (MSA), and cascaded 3D residual refinement (C3D-RCM).
On paired low-/high-SNR M3 south polar cubes, SSF-3DSC improves both spatial coherence and spectral fidelity compared with representative terrestrial HSI denoisers and a spectral-only lunar baseline in the lunar south polar region.

What are the implications of the main findings?

Explicit spatial–spectral fusion mitigates speckle-like artifacts of spectral-only compensation and yields more reliable reconstructions in photon-starved polar observations.
The compensated M3 products better support regional-scale polar analysis (e.g., landing-site characterization and mineral abundance mapping) by expanding the usable information in low-signal areas.

Abstract

Hyperspectral images (HSIs) from the lunar polar regions are frequently compromised by low signal-to-noise ratio (SNR) under adverse illumination, limiting their utility for scientific analysis. Existing spectral-only compensation approaches operate without spatial context, leading to speckle-like artifacts that degrade spatial consistency and constrain subsequent applications. To address this limitation, we propose SSF-3DSC, a spatial–spectral fusion 3D signal-compensation framework tailored for lunar HSIs to simultaneously restore spectral fidelity and spatial consistency under extreme low-illumination conditions. To the best of our knowledge, this represents the first deep learning framework specifically engineered for joint spatial–spectral restoration in the photon-starved regime. SSF-3DSC integrates three specialized components: a spectral compensation module (SCM) for restoring spectral fidelity, a multi-scale spatial attention (MSA) module for capturing hierarchical spatial patterns, and a cascaded 3D residual convolutional module (C3D-RCM) for refining spatial–spectral representations. Trained on paired low- and high-SNR Moon Mineralogy Mapper (M3) data cubes from the lunar south polar region, SSF-3DSC employs synergistic spatial–spectral fusion to achieve high-fidelity reconstruction, significantly outperforming a spectral-only lunar baseline (Paired-CycleGAN). Regional-scale experiments demonstrate its ability to recover both spatially coherent geological structures and spectrally reliable mineral abundance maps. By establishing a new benchmark for lunar HSI restoration under low-illumination conditions, this work enhances the scientific utility of low-signal M3 data and enables robust quantitative investigations into the Moon’s challenging polar regions.

Keywords:

hyperspectral imaging; signal compensation; spatial–spectral fusion; Moon Mineralogy Mapper; lunar exploration; lunar south polar region

1. Introduction

The lunar south polar region has attracted intense scientific interest due to its unique geological context and the potential to contain water ice. It lies along the rim of the South Pole–Aitken (SPA) basin, the Moon’s oldest and largest impact structure [1], which is believed to have excavated lower-crustal or upper-mantle materials, providing a unique window into lunar internal composition and evolution [2,3,4]. Furthermore, the Moon’s slight axial tilt (

1 . 54^{°}

) results in permanently shadowed regions (PSRs) within polar craters [5]. These PSRs are devoid of direct sunlight and act as “cold traps” (

T < 110

K), capable of preserving water ice and other volatiles over geologic timescales [6]. This has been proved by observations from neutron absorption to ultraviolet reflectance [7,8,9,10,11,12,13,14].

Therefore, exploration of the south polar region offers both practical resources and fundamental scientific insights, driving numerous recent missions (e.g., NASA Artemis and Chang’e-7) to focus on this target.

Hyperspectral sensors can provide continuous visible to near-infrared (VIS-NIR) spectral measurements to characterize polar volatiles and ancient crustal materials. The Moon Mineralogy Mapper (M3), a hyperspectral imaging spectrometer onboard India’s Chandrayaan-1, acquired VIS-NIR reflectance measurements spanning nearly the entire lunar surface, including polar regions [15]. Its comprehensive spectral coverage enables reliable identification of key lunar minerals, including pyroxene, plagioclase, olivine, and spinel [16,17]. This identification is based on their diagnostic absorption features (Figure 1a). M3 additionally covers critical OH/H₂O absorption bands around 1300, 1500, and

2000 nm

[18,19], which are essential for discriminating hydration signatures from lunar regolith responses (Figure 1b). Collectively, these capabilities establish M3 data as an indispensable dataset for investigating both lunar mineralogy and potential water ice deposits.

However, low solar elevation angles in the polar region generate widespread topographic shadows in M3 imagery, where their spectral signatures are primarily derived from secondary scattered light rather than direct illumination. The low intensity of this secondary illumination results in a significantly degraded signal-to-noise (SNR) from these regions. Consequently, mineralogical mapping derived from M3 data has been confined within 0∼

70^{°}

N/S [21]. For higher latitude regions, conservative approaches have been adopted by either excluding low-SNR areas entirely or applying strict filters to remove degraded spectra. Specifically, the spectral SNR index (SNRI), used to filter out unreliable M3 measurements [22,23], and the integral of the squared second derivative (ISSD), used to identify noise- and shadow-affected pixels, have been proposed.

However, the proportion of low-SNR pixels increases markedly toward high latitudes. Studies show that over one-quarter of regions above

80^{°}

S lack high-quality observations even before excluding high-incidence-angle anomalies [24]. This inherent degradation severely limits the utility of M3 for polar scientific investigations. Therefore, effective methods to recover these low-SNR data are essential for utilizing the full potential of M3 observations.

Research on the enhancement or restoration of lunar HSI data, especially for low-SNR polar data, remains limited. While terrestrial HSI shadow compensation and denoising strategies provide a practical foundation, their application to the lunar environment faces challenges due to fundamentally different illumination physics. Current terrestrial compensation methods generally fall into two categories. Physical model-based approaches [25,26], which rely on explicit reflectance models with prior knowledge of scene geometry and sensor data, are hampered in lunar polar environments since scattered light from crater walls and adjacent topographic highs violates the direct-illumination assumptions of photometric models. Limited contextual information and sensor constraints further hinder the feasibility of direct spectral recovery. Spectral unmixing-based approaches [27,28], reconstructing shadowed spectra by estimating the endmember abundances from adjacent illuminated pixels, fail to account for the nonlinear photon interactions caused by multiple scattering and indirect illumination by linear unmixing models. The relatively homogeneous lunar regolith further diminishes endmember separability. To address these limitations, deep learning has emerged as a promising alternative.

As a data-driven approach, deep learning is especially well-suited to capture the complex spatial–spectral interactions in degraded HSI data [29]. Among them, convolutional neural networks (CNNs) have become the prevailing paradigm for terrestrial hyperspectral restoration, benefiting from their inherent ability to automatically learn hierarchical spatial–spectral features and model complex nonlinear relationships directly from data [30,31]. One of the first CNN architectures for HSI denoising was proposed by Xie et al. [32], followed by an autoencoder-based shadow-compensation approach that learns relighting functions from paired shadowed and illuminated spectra under diverse atmospheric conditions [33]. A residual-learning-based denoising CNN (DnCNN) that reformulates denoising by estimating noise residuals rather than directly reconstructing clean images further established the utility of CNNs in HSI restoration [34]. Subsequent developments have further improved HSI denoising performance. An HSID-CNN [35] learns nonlinear mappings via feature fusion across spectral bands and spatial neighborhoods, while an HSI-SDeCNN [36] exploits 3D spatial–spectral patches from target and neighboring bands. These designs achieve robust noise suppression and restoration beyond traditional band-wise filters or purely 2D spatial denoisers [36]. These methods highlight the importance of joint spectral–spatial processing. Nevertheless, most CNN-based models lack explicit attention mechanisms to adaptively weight spatial–spectral features, limiting their performance in photon-starved imaging scenarios (i.e., a photon-scarce condition where insufficient signal photons are recorded due to extremely low illumination).

Recently, Transformer architectures have been introduced to HSI restoration tasks for their capabilities to capture long-range, global dependencies with their powerful self-attention mechanisms. Liang et al. [37] proposed a spatial–spectral Transformer for HSI denoising, leveraging the self-attention mechanism to capture long-range spatial and spectral correlations. Lai et al. [38] developed a Transformer-based framework to address a spectrum of HSI restoration tasks, including denoising, inpainting, and super-resolution. These approaches highlight the advantages of Transformers in capturing global information. However, the direct application of Transformers to typical terrestrial HSIs introduces substantial computational complexity and parameter overheads [39]. Beyond these, they are prone to over-fitting in data-scarce scenarios [40,41]. This substantial data requirement constitutes a fundamental limitation in lunar HSI compensation, where the availability of paired, high-quality observational data is inherently limited. Moreover, Transformers may struggle to preserve subtle local spectral variations, which are crucial for lunar mineral identification [42,43], potentially compromising the fidelity of restored signals.

Beyond CNNs and Transformers, diffusion models have recently emerged as a distinct paradigm in restoration techniques for hyperspectral and optical remote sensing. Built on denoising diffusion probabilistic models (DDPMs) [44], these approaches employ an iterative denoising–sampling process to reconstruct high-quality images from degraded observations. In hyperspectral restoration, diffusion priors have been actively explored: DDS2M [45] and HIR-Diff [46] introduce self-supervised and unsupervised priors for spatial–spectral modeling, while HSR-Diff [47] leverages conditional diffusion for spectrally aware super-resolution. However, despite their impressive generative capabilities, these approaches incur substantial computational costs and significant inference latency due to their iterative nature [48]. Critically, they are data-hungry and, owing to inherent stochastic sampling, struggle to guarantee the deterministic preservation of subtle spectral signatures and fine-grained spatial–spectral coherence, which are essential for reliable quantitative analysis.

Nevertheless, deep learning methodologies for terrestrial HSI denoising and shadow-compensation cannot be directly transferred to lunar applications due to the unique environment of lunar polar regions. Terrestrial HSI restoration models are primarily designed to handle idealized noise patterns such as Gaussian, impulse, or stripe noise under relatively consistent illumination and diverse surface compositions [49]. In particular, many CNN-based denoising networks implicitly assume additive Gaussian noise model. In contrast, lunar polar HSIs—especially under low illumination—exhibit fundamentally different degradation characteristics. Photon-starvation induces signal-dependent noise that is better described by Poisson statistics than by additive Gaussian perturbations [50]. Moreover, complex topographic shadowing leads to extreme illumination variability, and the relatively homogeneous regolith reduces spectral contrast [51]. Under such weak illumination, the measured HSI signals can approach the sensor noise floor, making restoration an ill-posed inverse problem in which small modeling mismatches may induce disproportionate spectral distortions. This domain gap further constrains the direct transferability of recent transformer- and diffusion-based restorers. Transformer architectures, despite their powerful global modeling capability, often rely on terrestrial datasets for pre-training; consequently, the learned spectral–spatial dependencies may not readily generalize to lunar mineral surfaces with fundamentally different spectral properties. Likewise, diffusion models learn data priors from terrestrial training distributions and can therefore be more prone to producing Earth-like textures or artifacts when applied to lunar polar scenes. Addressing such signal-dependent, spatially varying noise and subtle spectral signatures requires adaptive modules, such as attention mechanisms, to selectively enhance diagnostic information. Therefore, these challenges underscore the need for lunar-specific restoration frameworks and motivate the design of tailored deep learning models.

As the first attempt to address lunar HSI degradation, we previously proposed a CycleGAN-based compensation network tailored to M3 low-SNR spectra [24]. This paired-CycleGAN formulated compensation as a spectral translation between low- and high-SNR domains, utilizing adversarial learning to model their nonlinear relationship. Despite improving the spectral fidelity of low-signal pixels substantially, it operated solely in the spectral domain without exploiting spatial context, which resulted in spatial inconsistency across neighboring pixels and speckle-like artifacts in homogeneous regions. These limitations underscore the methodological boundary of spectral-only methods: their inability to preserve spatial coherence critical for faithful hyperspectral reconstruction. Therefore, an integrated approach that jointly exploits inter-band spectral dependencies and local spatial context is imperative for robust lunar HSI enhancement.

In this study, a novel spatial–spectral fusion 3D signal compensation (SSF-3DSC) network is proposed to overcome the limitations of spectral-only approaches. To the best of our knowledge, this is the first deep residual network specifically designed for lunar hyperspectral data. SSF-3DSC integrates spectral and spatial information by processing the entire 3D HSI cube (

B \times H \times W

), thereby avoiding pixel- or spectrum-wise isolation. Its architecture consists of three components: (1) a spectral compensation module (SCM) that restores individual spectral signatures; (2) a multi-scale spatial attention (MSA) module that emphasizes salient features across scales; (3) a cascaded 3D residual convolutional module (C3D-RCM) that performs final spatial–spectral fusion and reconstruction. By formulating compensation as a 3D cube reconstruction task, SSF-3DSC simultaneously restores spectral fidelity and enforces spatial coherence, thereby mitigating the spatial artifacts inherent to spectral-only methods. Its architecture and staged training strategy are tailored to the key challenges of lunar compensation: limited training data and complex, non-Gaussian degradations induced by photon starvation. Comprehensive experiments demonstrate that this spatial–spectral approach yields coherent and reliable reflectance reconstructions in low-signal polar regions. The main contributions of this work are summarized as follows:

(1): Spatial–spectral Cooperative Restoration Architecture: An end-to-end architecture is developed to simultaneously restore spatial and spectral information in low-SNR lunar regions, thereby addressing the trade-off between spectral fidelity and spatial consistency in low-illumination HSIs over homogeneous, regolith-covered surfaces.
(2): Dual-Branch Noise Compensation: A dual-branch module is proposed to decouple degradation characteristics and separate optimization pathways for spatial and spectral reconstruction, with joint spatial–spectral loss ensuring balanced fidelity across both domains.
(3): Cross-Scale Residual Feature Fusion: A cross-scale residual feature fusion strategy is proposed to enhance multi-scale feature representation by integrating pixel-wise spectral features with spatial context across scales, thereby improving the extraction and integration of spatial–spectral details.
(4): Superior Performance and Extensive Scientific Applicability: This work provides the first validation of deep learning-based methods for restoring spatial–spectral features in low-SNR lunar polar HSIs, demonstrating their feasibility for both quantitative enhancement and regional-scale scientific applications.

The rest of this paper is organized as follows. Section 2 introduces the M3 dataset and preprocessing. Section 3 presents the proposed SSF-3DSC framework. Section 4 reports experimental evaluations, including comparisons, ablation studies, and regional applications. Section 6 concludes the paper and discusses future research directions. An overview of the complete workflow is shown in Figure 2.

2. Data and Preprocessing

2.1. Data Source

Onboard the Chandrayaan-1 spacecraft, JPL’s M3 imaging spectrometer measures reflected radiance from the lunar surface over a wavelength range of

0.46

–

2.98

µm. This instrument features two operational modes with distinct spectral characteristics: the high-resolution “Target Mode” employs

10 nm

sampling across 256 channels, while the “Global Mode” utilizes variable spectral sampling–

20 nm

for shorter wavelengths and

40 nm

for longer wavelengths–across 85 channels [52]. Data acquisition followed a systematic schedule based on optimal illumination periods, designated as optical periods (OPs). The mission divided observations into five distinct OPs (OP1A, OP1B, OP2A, OP2B, and OP2C) to accommodate variations in phase angle, spacecraft altitude, and solar elevation angle [53]. High-resolution imagery (

140 m / pixel

) from OP1B and OP2A predominantly covered the lunar nearside, whereas OP2C provided lower-resolution data (

280 m / pixel

) with nearly complete coverage of the south polar region. Through global mode operations, M3 achieved comprehensive imaging of over 95% of the lunar surface before mission interruption.

M3 Level 2 data products provide pixel-level radiance factor (RADF [54]) observations by converting the at-sensor radiance from Level 1 to standardized “reflectance” values at an incidence angle of

30^{°}

and an emission angle of

0^{°}

. This product delivers pixel-specific, resampled, and photometrically calibrated reflectance data, maintaining consistency with analogous datasets from the SELENE Spectral Profiler [55]. However, although thermal correction algorithms have been developed to address limitations in the M3 Level 2 thermal adjustments [56,57], these corrections primarily work in mid- and low-latitude regions. The impact of thermal distortions remains negligible in PSRs near the lunar poles [10].

For this study, which focuses on low-signal areas within the lunar south polar area, we utilized all available M3 Level 2 data from OP2C. This dataset comprises 99 images providing comprehensive coverage of regions above

70^{°}

S latitude (Figure 3).

2.2. Preprocessing

The raw M3 Level 2 hyperspectral frames were processed using a multi-step preprocessing pipeline to ensure data quality and reliability for subsequent analyses. A major challenge in original M3 image analysis stems from signal variability across spectral bands, which commonly manifests as “stripe noise”. This stripe noise pattern exhibits dense, non-periodic characteristics that exist across both spectral bands and spatial locations. To address this issue, we employed a multi-step preprocessing strategy combining spectral-domain anomaly correction and spatial-domain Fourier filtering [58] to effectively suppress these irregular stripe artifacts (Figure 4). In this pipeline, band-level spectral outliers are first detected via a local outlier factor (LOF) method [59] and corrected using median filtering followed by light 1D Gaussian smoothing, after which band-wise 2D Fourier filtering is applied to remove residual striping. Detailed implementation procedures for this preprocessing pipeline are documented by Ni et al. [24]. Figure 5a demonstrates the effectiveness of this preprocessing pipeline by comparing the reflectance spectrum of a specific point before (blue solid line) and after stripe artifact removal (orange solid line). This approach was systematically executed across the entire M3 hyperspectral dataset, effectively addressing spatial noise components throughout the data cube.

In addition to noise artifacts, the M3 imagery acquired during OP2C also suffers from geometric mis-registration, characterized by positional offsets of up to 5 km. This issue arises primarily due to the malfunction of the spacecraft’s star trackers, substantially affecting the spatial alignment between M3 hyperspectral data and other lunar geospatial datasets. To resolve these discrepancies, geometric corrections were implemented by leveraging location (LOC) and observation (OBS) ancillary data developed by Gaddis et al. [60], which are publicly accessible at https://asc-pds-services.s3.us-west-2.amazonaws.com/mosaic/m3/geometric_restoration/index.html (accessed on 11 February 2026). Utilizing these ancillary data achieves sub-pixel accuracy for M3 images, effectively eliminating multi-kilometer positional errors. Readers seeking a detailed account of the correction methodology and related considerations are referred to Malaret et al. [61] and Ni et al. [24].

2.3. Dataset Construction

After denoising and co-registration, the signal-to-noise ratio index (SNRI) [22] of the south polar M3 data was computed for each pixel to quantify its spectral quality. The SNRI is defined as the mean relative deviation between the observed spectrum and its smoothed counterpart.

S N R I = \frac{\sum_{i = n}^{m} |\frac{r_{i}^{'} - r_{i}}{r_{i}^{'}}|}{m - n},

(1)

where

r_{i}

represents the original reflectance (RADF) at band i, while

r_{i}^{'}

denotes the corresponding smoothed reflectance. n and m represent the first and last bands used for SNRI calculation, and

| \cdot |

indicates the absolute value. To mitigate the impact of spectral anomalies present in the first two bands [62] and the substantial fluctuations in 3 µm water absorption signatures observed in M3 polar region data [63], the SNRI assessment is restricted to wavelengths spanning 0.58–2.5 µm. This spectral constraint necessitates setting the algorithm parameters to

n = 2

and

m = 73

, ensuring optimal signal quality evaluation.

SNRI values were calculated for all OP2C observations spanning the south polar region, enabling a complete characterization of spectral quality variations across the entire polar dataset. We applied a threshold on SNRI to partition the data into high-quality (SNRI < 0.1) and low-quality (0.1 < SNRI < 0.5) regions. In practice, pixels with SNRI in the range 0.1–0.5 are identified as the primary targets for compensation, since these areas experience limited solar irradiance yet retain meaningful signal. The upper bound of 0.5 is adopted as a conservative cutoff supported by our prior extended-SNRI evaluation [24], which shows a rapid deterioration in compensation reliability once SNRI exceeds 0.5, indicating an increasingly noise-dominated and information-insufficient regime. Figure 5b presents the M3 scene M3G20090718T054612 as an exemplary case to show the spatial distribution of data quality evaluated by SNRI, where lower SNRI values correspond to higher signal quality.

In our prior work [24], a pixel-to-pixel CycleGAN was trained to map low-SNR spectra to high-SNR spectra. Through the enhancement of spectral fidelity, the proposed CycleGAN framework operates on individual pixels and thus inevitably introduces spurious speckle and discontinuities in the whole image.

To address the aforementioned limitations, this study proposes a signal compensation framework that leverages 3D image cubes to synergistically integrate spatial and spectral features for preserving spatial continuity while enhancing spectral fidelity. The foundation of this approach is a paired dataset constructed from meticulously co-registered M3 observations. Specifically, all high- and low-quality observations across the south polar region (latitudes > 70°S) were combined into mosaics through averaging after sub-pixel co-registration. This averaging process homogenizes illumination variations and averages out residual noise, yielding spatially aligned high-/low-signal image pairs across the lunar south polar region. From the overlapping regions of these high- and low-SNR mosaics (shown in green in Figure 6a), we acquired the paired observational data used for training and validation. Crucially, since Level 2 products are topographically and photometrically corrected, the observed spectral variations in these paired datasets primarily reflect differences in illumination conditions rather than confounding effects from topographic and viewing geometry. This ensures that the network learns a robust mapping between signal degradation states.

The resulting dataset samples have spatial dimensions of 32 × 32 pixels and spectral dimensions comprising 83 bands, following the discard of the initial two bands that exhibited anomalous reflectance characteristics. The sampling protocol prioritized maximum data collection. Accordingly, a 32 × 32 sampling window was systematically moved across the overlapping regions between high- and low-SNR images, with adjacent samples overlapped by three pixels on both horizontal and vertical axes, yielding to approximately 10% spatial overlap between neighboring sample cubes. Furthermore, a quality criterion was enforced, specifying that the proportion of null pixels within each sample must not exceed 30%. This threshold, determined empirically to balance sample availability and data integrity, minimizes instability in gradient descent optimization [64] and reduces the attenuation of sequential dependencies in spatial data caused by missing values [65]. To ensure a rigorous and scientifically relevant evaluation of our network’s generalization capability, 2 of 13 candidate landing regions for NASA’s Artemis III mission [66]—De Gerlache Rim 1 and Connecting Ridge Extension—were designated as the independent test zones (blue squares in Figure 6b). Training samples were strictly excluded from these entire regions to guarantee an unbiased evaluation of the model’s generalization capability across spatially independent areas. Through the systematic sampling approach described above, 766 rigorously co-registered pairs of low- and high-SNR HSI cubes were extracted from overlapping regions of the M3 dataset. Among these, 689 pairs were allocated for training, while the remaining 77 were reserved for testing to ensure unbiased evaluation. This partitioning maintains a training-to-testing ratio of approximately 9:1. Before inputting the training samples into the model, random rotations (by multiples of 90°) and horizontal/vertical flipping augmentations were applied. While preserving the physical validity of spectral features, these geometric augmentations effectively enhance dataset diversity and mitigate the risk of overfitting by improving the network’s robustness to variations in spatial orientation and perspective.

3. Methodology

Signal compensation for lunar HSI can be conceptualized as a domain-to-domain translation problem. The fundamental challenge is to reconstruct high-fidelity data cubes from observations, degraded by suboptimal illumination, while preserving both spatial–spectral integrity and accuracy. To this end, the compensation process can be modeled as a learned transformation

Φ

that maps a hyperspectral cube l from the low-SNR domain

L

to its high-fidelity counterpart h in the high-SNR domain

H

. This relationship is expressed as follows:

h = Φ (l), l \in L and h \in H

(2)

through the learning of

Φ

, and the network is trained to restore the information content of low-quality input cubes l, producing spatially coherent and spectrally accurate outputs, h.

Building on this formulation, this section presents the proposed SSF-3DSC framework. Section 3.1 outlines the network architecture, Section 3.2 introduces the staged training strategy, Section 3.3 describes the loss functions, Section 3.4 summarizes implementation details, and Section 3.5 details the metrics for compensation evaluation.

3.1. Spatial–Spectral Fusion 3D Signal Compensation (SSF-3DSC) Framework Architecture

A novel deep learning network is proposed to increase the SNR of hyperspectral imagery of the lunar surface under low illumination conditions. The training samples were subsequently fed into the proposed network, whose architecture consists of three main components: a spectral compensation module (SCM), a multi-scale spatial attention module (MSA), and a cascaded 3D residual convolutional module (C3D-RCM). Together, these modules learn a robust compensation mapping that reconstructs high-SNR output. A binary validity mask is applied throughout the framework to exclude null pixels from intermediate operations and loss computations, ensuring that only reliable data contributes to the compensation process. Specifically, the validity mask is constructed from the preprocessed M3 cube by assigning 1 to pixels with finite, physically meaningful reflectance values and 0 to invalid/no-data pixels; it is then applied consistently to gate forward propagation and to exclude invalid pixels from all loss computations. Figure 7 shows the specific structure of the proposed SSF-3DSC framework.

3.1.1. Spectral Compensation Module (SCM)

The spectral compensation module (SCM) focuses on enhancing the spectral features of HSI cubes, which are critical for lunar surface analysis due to their role in characterizing surface composition. The SCM is implemented using the fully connected neural network (FCNN) architecture and serves as the first-stage spectral compensator in our framework. To capture spectral reflectance across all pixels, this module flattens the input cube along its spatial dimensions (h and w), treating each pixel’s spectrum as an independent sample. This results in a set of spectral vectors (batch size × h × w), each with 83 bands, which are processed through an FCNN.

Previous results have shown that the FCNN architecture with a global receptive field is effective in capturing long-range spectral correlations [67]. This FCNN-based design mirrors the generator network proposed by Ni et al. [24], which demonstrated robust performance in extracting and reconstructing spectral features within a CycleGAN framework for low signal spectra compensation in the lunar south polar region. The overall architecture of SCM is displayed in Figure 8, and the details of the spectral feature extraction block are presented in Table 1. Each layer, except the output layer, employs a leaky rectified linear unit (LReLU) activation function to introduce non-linearity [49], while the output layer uses a Sigmoid activation to constrain the compensated spectra within a normalized range. After completing the spectral feature processing through the FCNN, the module reshapes the output back to the original 3D cube (i.e., restoring the flattened spectral vectors back to their original spatial–spectral arrangement). This complete spectral compensation process can be mathematically expressed as follows:

Y_{SCM} = R (σ (F C N N (vec (X_{masked}))), [B, h, w]),

(3)

where

vec (\cdot)

denotes flattening only along the spatial dimensions

[h, w]

while preserving the channel dimension B,

R (\cdot, [\cdot])

denotes reshaping into the specified dimensions,

X_{masked}

represents the masked low-SNR input,

σ

denotes the sigmoid activation function, and

Y_{SCM}

is the spectrally compensated output. By operating purely on spectral signatures, the SCM corrects band-wise intensity deviations and improves spectral fidelity before any spatial processing is applied.

3.1.2. Multi-Scale Spatial Attention Module (MSA)

While the SCM focuses on spectral reconstruction, it may introduce spatial artifacts such as speckle noise. The multi-scale spatial attention module (MSA) is designed to recover and refine spatial features that may be degraded in low-signal HSIs, thereby mitigating artifacts introduced during spectral reconstruction. Inspired by CBAM’s spatial attention [68], the proposed MSA incorporates parallel multi-scale convolutional kernels directly into the spatial-attention generation process. These features are then fused via a

1 \times 1

projection to generate a unified attention map, facilitating scale-aware saliency. This module employs a multi-scale convolutional approach combined with a spatial attention mechanism to extract spatial patterns at varying spatial scales, mitigating speckle noise or spatial inconsistencies.

MSA first performs channel-wise average and max pooling along the spectral dimension of the masked input cube (Figure 9), yielding two complementary single-channel spatial maps: (i) an average-pooled map encodes global illumination/background context; (ii) a max-pooled map highlights salient high-contrast structures and edge features. Each map is then processed through parallel convolutional layers with kernel sizes of

3 \times 3

,

5 \times 5

, and

7 \times 7

to capture multi-scale spatial features. The resulting feature maps–three from average pooling and three from maximum pooling, plus the original pooled maps–are concatenated along the channel dimension, forming an 8-channel feature map.

A

1 \times 1

convolution, followed by batch normalization and a sigmoid activation, transforms this concatenated feature map into a spatial attention map

M_{S}

. Values of

M_{S}

are between 0 and 1, with higher values assigning greater weights to locations prioritized during compensation. The computation is expressed as follows:

\begin{matrix} M_{S} & = σ (B N (C o n v_{1 \times 1} (C o n c a t (P_{multi - avg} (X_{masked}), P_{multi - \max} (X_{masked}))))), \end{matrix}

(4)

where

P_{multi - avg} (\cdot)

and

P_{multi - \max} (\cdot)

denote the multi-scale features from average and maximum pooling, respectively.

C o n c a t

represents channel-wise concatenation,

C o n v_{1 \times 1}

is the

1 \times 1

convolution,

B N

is batch normalization, and

σ

is the sigmoid function.

In essence, the MSA module serves as a spatial regularizer by learning a dynamic attention map. This map acts as a sophisticated weighting mask that selectively enhances salient spatial details (e.g., edges and textures) while suppressing noise and spatial distortions across the image, ensuring these crucial details are preserved and refined in the final output.

3.1.3. Cascaded 3D Residual Convolutional Module (C3D-RCM)

The C3D-RCM integrates the spectral and spatial features extracted by the two above modules, performing comprehensive spatial–spectral feature fusion. Constructed upon a 3D ResNet backbone [69], this module utilizes a cascaded multi-scale convolutional design to effectively aggregate cross-band and cross-pixel dependencies, thereby optimizing the fusion of fine spectral details with broad spatial structures. First, the masked attention map

M_{S}

is applied to the SCM output, via element-wise multiplication:

Y_{fusion} = Y_{SCM} ⊙ M_{S},

(5)

where

Y_{fusion} \in R^{B \times h \times w}

denotes the fused feature map (i.e., the spatial–spectral feature map in Figure 7). This attention mechanism selectively amplifies informative regions while attenuating less relevant or noisy regions, thereby optimizing feature representation for subsequent processing. Subsequently, the C3D-RCM maps the band axis of the fused feature cube to the volumetric depth (i.e., D = B) while initializing the feature-channel dimension to C = 1; then, it employs a two-stage processing pipeline based on 3D residual convolutions to jointly model spatial–spectral correlations. As shown in Figure 10, the feature extraction stage employs a cascade of 3D residual convolutional (3D-RC) blocks that increase the feature-channel depth from 1 to 32, 64, and 128, thereby increasing capacity to aggregate inter-band and inter-pixel dependencies while preserving the

B \times h \times w

resolution. The subsequent feature-refinement stage systematically reduces it to 1, effectively consolidating the learned high-dimensional representations into a single-channel compensated cube while enforcing spatial coherence and spectral fidelity. Each 3D-RC block comprises two

3 \times 3 \times 3

convolutions with ReLU activations and an identity shortcut to stabilize training (Figure 7). These 3D filters enable joint spatial–spectral processing by exploiting correlations across both spatial and spectral dimensions simultaneously [70].

The final output

Y_{out}

is computed by adding the output of our C3D-RCM to the masked low-SNR input, embodying the residual learning paradigm:

Y_{out} = Y_{masked} + f_{C 3 D - RCM} (Y_{fusion}) .

(6)

The residual connections in this module mean that the network actually learns to predict the difference (residual) between the low-SNR input and the desired high-SNR output, rather than predicting the high-SNR image directly. This residual learning strategy is commonly used in image denoising tasks—the model focuses on estimating the noise component, which is then added to (or subtracted from) the input to obtain the clean result [71]. By learning the SNR improvement as a residual mapping, the RCM facilitates faster convergence and avoids over-smoothing, ensuring that the fine details present in the high-SNR target are recovered in the final output.

3.2. Staged Training Strategy

Given the complexity of the multi-module architecture and to mitigate the risk of any single component overpowering others during initial training, a phased training strategy is adopted to incrementally build the network’s capability:

Stage 1—Spectral Pre-training: The SCM is trained alone while parameters in the MSA and C3D-RCM are kept frozen. In this stage, the network learns to perform accurate spectral compensation on low-SNR cubes, essentially refining each pixel’s spectral signature to match high-SNR characteristics using the high-SNR target as a reference. The isolation of the spectral sub-network first ensures that the SCM provides a solid foundation of spectral fidelity before introducing spatial learning.

Stage 2—Spectral–Spatial Joint Training: The MSA is unfrozen and jointly optimized with the already pre-trained SCM, while the C3D-RCM remains frozen. In this stage, the model learns to restore spatial features and suppress artifacts through the MSA on top of the spectral corrections from Phase 1. Jointly optimizing SCM and MSA allows the network to balance spectral reconstruction with spatial denoising, yielding an output both spectrally accurate and spatially clean.

Stage 3—End-to-End Fine-Tuning: The C3D-RCM is unfrozen, and the entire architecture is trained end-to-end. In this final phase, all modules (SCM, MSA, and C3D-RCM) are optimized together so that the 3D residual convolutional layers can adjust the combined spatial–spectral features and refine the residual mapping. This end-to-end fine-tuning allows the full model to converge to an optimal solution that coherently integrates spectral and spatial enhancements.

3.3. Loss Function

The ill-posed, low-signal lunar environment and inherent spatial–spectral coupling, combined with staged modular training, necessitate specialized composite losses. These loss functions were devised for each training phase to optimize different aspects of the training process. The loss functions were strategically chosen to align with the objectives of the SCM, MSA, and C3D-RCM, emphasizing spectral fidelity, spatial coherence, and overall reconstruction quality across the three-stage training process. A masking mechanism is applied to all loss computations to restrict calculations to valid pixels. Below, the following subsections detail the loss functions employed in each phase and their specific contributions to the training objective.

3.3.1. Stage 1: Spectral Feature Extraction

In the first training stage, the model is optimized with loss terms that emphasize spectral accuracy. The loss based on the mean spectral angle mapper (MSAM) is used to preserve spectral shape by measuring the angle between the reconstructed and reference spectral vectors [72]. This loss is defined as the mean of spectral angles (in radians) over all pixels, effectively penalizing spectral distortions irrespective of intensity. Reconstruction error (RE) is added as a pixel-wise fidelity term, typically implemented as a mean squared error between the denoised image and ground truth [73]. Minimizing RE ensures that low absolute differences in reflectance or intensity, improving spectral amplitude fidelity. To encourage smoothness of the compensated spectra and minimize fluctuations, a first-order total variation (TV) [24] loss is included. First-order TV loss penalizes abrupt spikes between adjacent spectral bands, thereby promoting spectral continuity and robustness in the reconstructed spectral profiles. The above three loss functions are expressed as follows:

\begin{matrix} L_{MSAM} (C, h, l) & = E_{h \sim P_{data} (h)} [arccos \frac{\sum_{j = 1}^{B} (h_{j}^{flat} \cdot C (l_{j}^{flat}))}{∥ h_{j}^{flat} ∥_{2} \cdot {∥ C (l_{j}^{flat}) ∥}_{2}}], \end{matrix}

(7)

\begin{matrix} L_{RE} (C, h, l) & = E_{h \sim P_{data} (h)} [\sum_{j = 1}^{B} {∥C (l_{j}^{flat}) - h_{j}^{flat}∥}^{2}], \end{matrix}

(8)

L_{TV} (C, h, l) = E_{l \sim P_{data} (l)} [\frac{1}{B - 1} \sum_{j = 1}^{B - 1} {∥C (l_{j}^{flat}) - C (l_{j + 1}^{flat}) - (h_{j}^{flat} - h_{j + 1}^{flat})∥}_{2}^{2}],

(9)

where

h_{j}^{flat}

and

l_{j}^{flat}

represent the flattened spectra of the image cubes h and l, respectively, where j denotes the spectral band index and B is the total number of spectral bands in the sample cube.

C (\cdot)

denotes the proposed compensation network.

{∥ \cdot ∥}_{2}

denotes the

ℓ_{2}

norm.

E_{h \sim P_{data} (h)}

and

E_{l \sim P_{data} (l)}

indicate expectations over the probability distribution of high- and low-signal cubes computed as masked mini-batch means over valid pixels for all losses, where

P_{data}

represents the underlying data distribution.

Consequently, the combined loss for Stage 1 is formulated as follows:

\begin{matrix} L_{Stage 1} & = ω_{MSAM} \cdot L_{MSAM} + ω_{RE} \cdot L_{RE} + ω_{TV} \cdot L_{TV}, \end{matrix}

(10)

where

ω_{(\cdot)}

are positive scalar weights that control the contribution of their respective loss terms.

3.3.2. Stage 2: Spatial Feature Restoration

The second stage introduces additional loss functions targeting toward preserving spatial structure on top of

L_{Stage 1}

. Multi-scale structural similarity index (MS-SSIM) loss is incorporated to enhance structural similarity across different scales [74]. MS-SSIM is a full-reference image quality metric that evaluates the similarity between two images by considering structural information, luminance, and contrast at multiple resolutions, making it effective for capturing both local and global perceptual quality. Here, 1 − MS-SSIM is used as the loss term, so a higher MS-SSIM index yields a lower loss, encouraging the network to produce outputs with structural characteristics closely matching the ground truth at coarse and fine scales.

L_{MS - SSIM} = 1 - E [{(\prod_{l = 0}^{M - 1} S_{ℓ})}^{\frac{1}{M}}],

(11)

S_{ℓ} = \frac{(2 μ_{h, ℓ} \cdot μ_{C (l), ℓ} + C_{1}) (2 σ_{h, C (l), ℓ} + C_{2})}{(μ_{h, ℓ}^{2} + μ_{C (l), ℓ}^{2} + C_{1}) (σ_{h, ℓ}^{2} + σ_{C (l), ℓ}^{2} + C_{2})},

(12)

where M denotes the number of spatial scales,

S_{ℓ}

represents the SSIM value at the ℓ-th layer, and

μ_{(\cdot), ℓ}

,

σ_{(\cdot), ℓ}^{2}

, and

σ_{h, c (l), ℓ}

are the local mean, variance, and cross-covariance statistics, respectively, calculated between the compensated output

C (l)

and the reference high-signal hyperspectral image cube h. The stability constants

C_{1}

and

C_{2}

, defined as

C_{1} = {(K_{1} L)}^{2}

and

C_{2} = {(K_{2} L)}^{2}

[75], with parameters

K_{1} = 0.01

,

K_{2} = 0.03

, and a dynamic range of

L = 1

, are consistent with the physical properties of reflectance (radiance).

To preserve high-frequency details, a Laplacian pyramid loss is incorporated, which adeptly captures spatial details across multiple scales, thereby enhancing texture and edge preservation at various resolution levels. This loss is computed by decomposing images into a Laplacian pyramid and summing the differences at each level [76], thereby penalizing errors in both low-frequency (blurry) content and high-frequency textures:

L_{Lap} = \sum_{ℓ = 1}^{L} \frac{1}{N_{ℓ}} \sum_{b, h, w} | L_{h, B, H, W}^{(ℓ)} - L_{C (l), B, H, W}^{(ℓ)} |,

(13)

L^{(ℓ)} = I^{(ℓ)} - G (I^{(ℓ)}),

(14)

where

L_{h, B, H, W}

and

L_{C (l), B, H, W}

denote the Laplacian pyramid components for the high-SNR and compensated images;

N_{ℓ}

is the total number of valid pixels in layer l, while L represents the total number of pyramid layers. The input cubes at each layer are denoted by

I^{(ℓ)}

, and

G (\cdot)

represents the Gaussian blur operation [77].

Consequently, the total loss in Stage 2 becomes the following:

L_{Stage 2} = L_{Stage 1} + ω_{MS - SSIM} \cdot L_{MS - SSIM} + ω_{Lap} \cdot L_{Lap},

(15)

with placeholder weights balancing new terms. The MS-SSIM loss

L_{MS - SSIM}

guides the network to restore spatial structures and textures (improving visual similarity), while the Laplacian pyramid loss

L_{Lap}

ensures that both global structure and fine details are recovered at multiple scales.

3.3.3. Stage 3: End-to-End Training

In the final stage, the entire network is fine-tuned with an aggregated loss to balance the spectral and spatial objectives. A Log-Cosh loss term is introduced to stabilize training and attenuate outliers. The Log-Cosh loss is defined as follows:

L_{\log - \cosh} = \frac{1}{B} \sum_{j = 1}^{B} log (cosh (h_{j} - C {(l)}_{j})) .

(16)

The Log-Cosh function behaves similarly to mean squared error (MSE, i.e.,

ℓ_{2}

loss) for small residuals, yielding smooth gradients, while approximating mean absolute error (MAE, i.e.,

ℓ_{1}

loss) for larger residuals, thus providing robustness against outliers [78]. Unlike MAE, which is non-differentiable at zero, and MSE, which is overly sensitive to extreme values, Log-Cosh has twice-differentiability across its domain, ensuring optimization stability. This smooth gradient property facilitates better coordination among multiple loss functions during the end-to-end training phase.

By including Log-Cosh into the overall objective, extreme errors are further suppressed, yielding a more balanced trade-off between spectral fidelity and spatial sharpness. The end-to-end training loss is a weighted sum of all prior loss terms (MSAM, RE, TV, MS-SSIM, and Laplacian) plus the Log-Cosh term:

L_{Stage 3} = L_{Stage 1} + L_{Stage 2} + ω_{\log - \cosh} \cdot L_{\log - \cosh},

(17)

where

ω_{\log - \cosh}

is a positive scalar weight that modulates the contribution of the Log-Cosh term.

3.4. Training Details

The SSF-3DSC framework was implemented using PyTorch (v2.5.1). Training was conducted with a batch size of 32 on an NVIDIA A40 GPU, with each training stage running for 1000 iterations.

The Adam optimizer was employed to minimize the loss function (17), configured with an initial learning rate of

1 \times 10^{- 4}

, weight decay of

1 \times 10^{- 5}

, and momentum parameters of

β_{1} = 0.9

,

β_{2} = 0.999

, and

ε = 10^{- 5}

. To ensure stable convergence, a cosine-annealing learning rate scheduler (CosineAnnealingLR) was utilized, which progressively reduces the learning rate from an initial value of

1 \times 10^{- 4}

to a minimum threshold of

1 \times 10^{- 6}

over 1000 epochs. This smooth decay strategy eliminates the training instabilities typically associated with step-decay methods, ensuring stable convergence while facilitating effective parameter fine-tuning in later training phases.

During the training process, we sequentially unfreezed specific modules (SCM, MSA, and C3D-RCM) to facilitate progressive learning. To address the challenge of multi-objective optimization across different training stages, weighting parameters were re-initialized by a scale-normalization heuristic similar to GradNorm [79]: a short warm-up run was first used to record the mean magnitude of each potential loss term, including those not yet active. After which coefficients

ω (\cdot)

were set so that all weighted losses lay in the same order of magnitude. These statistics were continuously updated during subsequent training, allowing the recorded values to be reused for Stages 2 and 3 without additional warm-up, thereby ensuring that newly introduced objectives (i.e., MS-SSIM, Laplacian, and Log-Cosh) received sufficient gradient signal without overwhelming previously optimized terms. Coefficients for computing running averages of the gradient were all set to 0.5, with specific weighting parameters for each loss component represented as shown in Table 2.

3.5. Evaluation Metrics

To comprehensively assess the spectral compensation performance, we employ a suite of established evaluation metrics commonly used in hyperspectral imaging analysis. The metrics are categorized into spatial-domain and spectral-domain measures to provide both perceptual and quantitative assessments. Reference-based metrics are computed with respect to high-signal observations serving as the radiometric benchmark, derived from overlapping acquisitions under favorable illumination and geometrically co-registered to ensure pixel-level correspondence with the compensated low-signal test data.

3.5.1. Spatial-Domain Metrics

Peak Signal-to-Noise Ratio (PSNR)

PSNR quantifies reconstruction fidelity by measuring distortion suppression relative to maximum signal power. Operating at an image-wide scale, PSNR is expressed in decibels (dB), where higher PSNR values indicate superior reconstruction fidelity. Typically, an improvement of 1–2 dB in PSNR corresponds to visually discernible enhancement [80].

Feature Similarity Index Matrix (FSIM)

FSIM evaluates the preservation of perceptually significant spatial features, such as structural information and edge integrity. Calculated globally across the entire image, FSIM represents a perceptual quality measure that quantifies similarity by integrating phase congruency and gradient magnitude to assess similarities between reference and processed images [81]. FSIM values range from 0 to 1, with higher values indicating superior preservation of salient features and edge structures.

3.5.2. Spectral-Domain Metrics

Mean Relative Absolute Error (MRAE)

This metric quantifies spectral deviation by computing the average relative error between reconstructed and reference spectral values on a pixel-wise basis, normalized by the true signal intensity. This produces a dimensionless indicator of spectral distortion, with lower values signifying higher fidelity and 0 indicating perfect recovery. Particularly valuable in hyperspectral analysis, MRAE exhibits higher sensitivity to errors in low-intensity spectral regions [82], which are critical for lunar polar region exploration.

Erreur Relative Globale Adimensionnelle de Synthèse (ERGAS)

This metric quantifies the overall spectral quality of the reconstructed image with heightened sensitivity to band-to-band spectral distortions. As a global, image-level quality indicator, ERGAS is a dimensionless global error metric which quantifies spectral fidelity by calculating normalized root mean square error (RMSE) across all spectral bands. Lower ERGAS values indicate reduced spectral distortion, with 0 representing perfect reconstruction. ERGAS is widely adopted in HSI applications due to its sensitivity to spectral artifacts and subtle spectral variations that may be overlooked by spatial-domain metrics [83].

4. Experimental Results

To validate the effectiveness of the proposed framework, we conducted comprehensive experiments comprising comparative analysis and real-data evaluations. To the best of our knowledge, this study represents the first study to explicitly integrate spatial–spectral information for M3 HSI signal compensation in low-illumination lunar polar regions. In the absence of dedicated fusion architectures tailored for this task, we adopt several representative terrestrial HSI restoration networks as architectural baselines to investigate how conventional spatial–spectral modeling units perform under lunar polar degradation conditions.

We benchmarked SSF-3DSC against five representative deep-learning baselines. These include three CNN-based restoration backbones (3D-DnCNN [34], HSID-CNN [35], and HSI-SDeCNN [36]), widely used for terrestrial hyperspectral denoising, and a diffusion-based generative model (conditional DDPM, cDDPM [44]), which represents recent probabilistic restoration approaches. We also included paired-CycleGAN [24], the first deep-learning method for spectral-domain compensation in lunar HSIs, to examine the contributory role of spatial information under low-illumination conditions.

To ensure a fair comparison, all baselines were re-implemented and trained from scratch under identical conditions, including the same paired lunar dataset, data splits, preprocessing procedures, and optimization settings (e.g., optimizer configuration and learning-rate schedules). Because a validated degradation model for lunar polar HSIs is not yet available, we refrain from adopting standard terrestrial simulation protocols for either training or performance evaluation. Unlike terrestrial denoising benchmarks that assume idealized noise models under stable illumination, M3 observations in shadowed polar regions are governed by photon-starved, illumination-dependent, and inherently non-linear degradations induced by extreme illumination variability and topographic occlusion. Consequently, conventional synthetic-noise simulations are not physically faithful to this context. Therefore, we benchmarked all methods on a domain-faithful paired dataset derived from real lunar observations, ensuring that both training and assessment reflect the operational constraints of lunar polar exploration. Performance was evaluated using quantitative metrics, visual comparisons, and spectral profile analyses, with detailed results presented in the following subsections.

4.1. Performance Evaluation of Spatial–Spectral Quality Metrics

Performance was evaluated on the test set using four metrics: mean PSNR (M-PSNR), mean FSIM (M-FSIM), MRAE, and ERGAS. Generally, higher values for MPSNR and MFSIM indicate better spatial similarity, whereas lower values for MRAE and ERGAS signify higher spectral fidelity. These baselines were selected to cover representative restoration paradigms, including widely adopted supervised CNN denoisers (3D-DnCNN, HSID-CNN, and HSI-SDeCNN) and a diffusion-based probabilistic model (cDDPM), thereby facilitating a systematic evaluation of how standard terrestrial priors generalize to photon-starved M3 polar observations under paired real-observation supervision. Table 3 lists the compensation results of different networks on the test set, where the best performance for each metric is marked in bold, and the second-best is underlined.

The results in Table 3 demonstrate that our proposed framework significantly outperforms 3D input-based networks [34,35,36], the spectral dimension-exclusive compensation approach [24], and the diffusion-based cDDPM baseline [44] across all evaluation metrics. Specifically, our proposed network achieves the highest MPSNR (27.68 dB) and MFSIM (0.9452), representing improvements of 1.85 dB and 0.0654 over the strongest baseline (3D-DnCNN), respectively. In contrast, HSI-SDeCNN and HSID-CNN obtained relatively low spatial performance, lagging behind with MPSNRs of 24.38 dB and 22.86 dB and MFSIMs of 0.8024 and 0.8038. Paired-CycleGAN, trained exclusively on one-dimensional spectral data, exhibits limited spatial fidelity (MPSNR = 22.9880, MFSIM = 0.7937) due to the absence of spatial information in its input. The notably low spatial quality metrics underscore the critical importance of incorporating spatial contextual features in HSI compensation tasks. The cDDPM performs worst among all methods, with extremely low spatial scores (M-PSNR = 6.54 dB, M-FSIM = 0.659), indicating that the diffusion prior fails to recover meaningful spatial structures under the photon-starved, data-limited conditions of the lunar polar regions.

Concurrently, the proposed framework excels in spectral fidelity restoration. Specifically, SSF-3DSC achieves the lowest (best) ERGAS of 24.42 and MRAE of 17.54, reducing relative spectral distortion by over 15% compared with the 3D-DnCNN (ERGAS = 28.78; MRAE = 21.58) and by more than 40% versus HSID-CNN and HSI-SDeCNN. The diffusion-based cDDPM exhibits the most severe spectral distortion, yielding the highest ERGAS (168.43) and MRAE (148.23) values among all evaluated methods. Notably, paired-CycleGAN—a network optimized exclusively for spectral fidelity—yields the second-best spectral restoration performance (ERGAS = 27.10; MRAE = 20.67). This superior performance of our model over paired-CycleGAN can be attributed to the spectral contextual information from neighboring pixels explicitly incorporated by SSF-3DSC through its spatial–spectral modeling architecture. Compared to approaches that solely utilize spectral dimension information, the integration of mutual spatial constraints from adjacent pixels enables SSF-3DSC to effectively mitigate both global spectral distortion (quantified by ERGAS) and inter-band relative amplitude errors (measured by MRAE).

The synergistic performance achieved across spatial–spectral quality metrics demonstrates that our framework effectively balances spatial consistency with spectral fidelity, comprehensively validating the effectiveness of our proposed network architecture for low-signal compensation in M3 data.

4.2. Performance Evaluation of Visual Quality

To provide an intuitive visualization of the spatial effects achieved by various network architectures for lunar polar signal compensation, we performed spectral integration over the full wavelength range to generate grayscale images (as illustrated in Figure 11). Supplementary Figure S1 provides enlarged views of representative regions to highlight fine-scale detail preservation. Compared to the generation of false-color images using selected bands, grayscale images eliminate the subjective bias associated with arbitrary band selection. A unified contrast stretching operation was applied to both the compensated results and original images to facilitate qualitative assessment of the full-band energy levels of each pixel following compensation. The eight cases shown in Figure 11 cover a broad range of diverse lunar surface conditions—from deep-shadow, low-SNR areas, and illumination boundaries to texture-rich terrains and heterogeneous regions with varying contrast levels—enabling visual assessment under diverse spatial and illumination conditions. Figure 11a shows the low-SNR images, while Figure 11b shows the paired high-signal images. Figure 11c–h displays the resulting images obtained after applying different compensation methods. To facilitate systematic visual assessment, we annotate representative topographic features using colored bounding boxes. White boxes delineate salient geomorphological ROIs in the paired dataset. Colored boxes provide a qualitative guide to restoration quality: red indicates failure (no recognizable features), yellow indicates partial restoration (coarse structures without fine-scale texture), and green indicates successful restoration with preserved fine-grained morphology. It should be noted that this color coding serves solely as a visual guide and does not constitute a quantitative categorization; visual impressions should be interpreted together with the objective metrics in Table 3 to provide a complete evaluation.

Visual inspection reveals that 3D-DnCNN and the proposed method outperform all other evaluated networks in terms of overall compensation quality. In particular, HSI-SDeCNNs reconstruct only conspicuous brightness modulations associated with prominent topographic variations (e.g., in Cases 4 and 5) and fail to reproduce fine-scale topographic details, particularly the subtle terrain undulations in Case 2. In comparison, 3D-DnCNNs demonstrates an improved capability to restore more detailed topographic relief features (particularly in Cases 4, 5, and 6). Nevertheless, they still struggle with the reconstruction of fine textural details and other smaller-scale morphological features (Cases 1 and 3). Paired-CycleGANs, trained exclusively on one-dimensional spectral data without spatial context, produce compensation results exhibiting scattered spatial noise artifacts that obscure the majority of characteristic topographic features. Despite these artifacts, paired-CycleGANs achieve a closer approximation of the overall image brightness (grayscale intensity) to the references than the aforementioned three networks, indicating superior fidelity in restoring the integrated spectral intensity per pixel—consistent with its demonstrated effectiveness in mineral-detection applications [24]. The diffusion-based cDDPM baseline exhibits the poorest performance, with its artifacts exacerbated into pervasive high-frequency granular noise that nearly precludes the reconstruction of all recognizable topographic features. This result suggests that the learned diffusion prior struggles to effectively model the signal-dependent degradation patterns characteristic of photon-starved and data-scarce lunar polar observations. In contrast, the proposed method yields compensation results that exhibit notable consistency in multi-scale topographic features, textural details, and global brightness relative to the reference data. For instance, it successfully reconstructs fine-scale features, including wrinkle ridges in Case 6, the slope in the lower-left portion of Case 1, and detailed topographic variations in Case 4. This comprehensive restoration capability arises from the MSA module’s integration of multi-scale spatial features, coupled with the joint optimization of MS-SSIM and Laplacian-pyramid loss functions. Although the proposed method demonstrates effective signal compensation capabilities, cascaded 3D convolutions and MS-SSIM/Log-Cosh losses introduce a mild spatial smoothing effect. This smoothing tendency, however, is mitigated by the residual formulation and Laplacian-pyramid objective in C3D-RCM, which guide the network to suppress noise rather than blur structural features, thereby preserving meso-scale topographic boundaries (e.g., crater rims, ridges, and slopes) at the M3 OP2C resolution. Compared to other evaluated approaches, it better preserves fine-scale morphological details and achieves superior overall reconstruction quality.

Overall, the superior visual fidelity achieved by SSF-3DSC highlights its potential to improve the interpretability of HSI data acquired under the low-illumination conditions of the lunar poles.

4.3. Performance Evaluation of Spectral Profiles

The fidelity of spectral signatures is paramount for the robust interpretation of HSI, particularly in lunar missions targeting objectives such as quantitative mineralogical mapping, hydroxyl/waterice abundance estimation, and elucidating geological evolution. Accurate spectral compensation is the cornerstone for discriminating the physicochemical properties of diverse lunar surface materials under low-illumination conditions, where attenuated signals compromise the spectral characteristics of shadowed terrain and PSRs near the lunar poles.

To evaluate the spectral restoration capability of various network architectures, a comparative analysis of representative pixel spectral profiles was conducted. Figure 12 presents the band-wise reflectance profiles at the pixel level (29, 9) within the representative test samples for HSI-SDeCNN, HSID-CNN, 3D-DnCNN, paired-CycleGAN, and our proposed SSF-3DSC. The vertical axis (‘Reflectance’) represents reflectance values, while the horizontal axis denotes the band index corresponding to M3 spectral channels. Due to the substantial spectral distortion exhibited by the cDDPM baseline, it is excluded from the spectral profile plots to prevent visual clutter from obscuring the other spectral profiles; its inability to reconstruct reliable spectral signatures has been adequately demonstrated by the quantitative evaluation in Section 4.1. As illustrated, the spectral profile compensated by the proposed network (blue dashed line) demonstrates the highest fidelity to the high-signal reference (red solid line). In contrast, all competing networks introduce varying degrees of spectral distortion, including the over or underestimation of overall reflectance levels, as well as the introduction of artificial fluctuations in spectral profiles. Specifically, HSID-CNN (green dashed line) exhibits pronounced spectral oscillations and spurious absorption bands; HSI-SDeCNN (brown dashed line) and 3D-DnCNN (purple dashed line) exhibit substantial deviations in spectral morphology and absorption band positions compared to the reference spectrum (e.g., Cases 2, 4, and 8), while simultaneously introducing systematic reflectance bias through over or underestimation (e.g., Cases 1, 3, and 4). Although paired-CycleGAN (orange dashed line) retains varying degrees of reflectance bias (Cases 1, 2, and 7), it significantly outperforms the aforementioned networks by delivering more accurate restoration of global spectral morphology and enhanced fidelity in local absorption band center recovery, thus representing the closest competing approach to our proposed SSF-3DSC. Such reflectance bias, however, is an inherent artifact of its single-pixel processing, which precludes the stabilizing influence of spatial regularization. In contrast, SSF-3DSC exploits spatial context from neighboring pixels to impose structural constraints that effectively suppress fluctuations in reflectance levels. The superior spectral fidelity achieved by SSF-3DSC relative to the purely spectral paired-CycleGAN underscores the critical role of spatial context in enabling robust hyperspectral signal reconstruction.

Combined with the spatial assessments in Section 4.2, these results further substantiate our network’s comprehensive signal-compensation capability in the lunar polar setting, which is essential for reliable lunar surface characterization under challenging illumination conditions and in low-signal regions. It is worth noting that the compared baselines are strong and widely adopted restoration backbones, and our results should be interpreted through the lens of domain specificity. The observed performance gap primarily reflects domain-specific degradations: lunar polar observations exhibit distinct, illumination-driven, and signal-dependent effects. By explicitly incorporating spatial–spectral fusion mechanisms tailored to this setting, SSF-3DSC is better suited to lunar signal compensation than general-purpose restorers, resulting in more reliable compensation under lunar polar conditions.

4.4. Ablation Study

To validate the contribution of constituent terms of our proposed architecture to the overall performance for signal compensation, a comprehensive ablation study was conducted to examine the individual and combined effects of these elements. Because the MS-SSIM, Laplacian-pyramid, and Log-Cosh losses are progressively incorporated into Stages 2 and 3 with distinct roles (Section 3.3.2 and Section 3.3.3), our ablation analysis primarily focuses on architectural modules and the staged-training strategy, while the effects of these loss terms are interpreted through their stage-wise objectives: structural similarity, multi-scale detail preservation, and robust end-to-end optimization, respectively. As detailed in Section 3.1, our architecture consists of three components (SCM, MSA, and C3D-RCM), which are designed to capture spectral-dimensional representations, multi-scale spatial representations, and perform spatial–spectral feature extraction and reconstruction of HSI cubes, respectively. Furthermore, end-to-end training was incorporated into the ablation framework to assess the efficacy of our staged training strategy. The effectiveness of both architectural components and training methodologies was comprehensively evaluated using both visual assessment and quantitative performance metrics.

Table 4 summarizes the quantitative performance of ablation experiments across different architectural modules and training strategy configurations, while Figure 13 and Figure 14 illustrate the corresponding spatial–spectral reconstruction effects. The complete architecture incorporating all three modules (SCM, MSA, and C3D-RCM) with staged training achieves optimal performance in both visual quality (Figure 13h) and all evaluation metrics (last column of Table 4). Specifically, this configuration attains the highest PSNR and FSIM, as well as the lowest ERGAS and MRAE, while exhibiting the most similar visual appearance to the ground truth (Figure 13b vs. Figure 13h). The ablation study systematically demonstrates that removing individual modules consistently degrades model performance. In particular, eliminating the SCM results in severe spectral fidelity degradation, with ERGAS deteriorating to 34.07 and MRAE increasing to 25.99, demonstrating the fundamental importance of SCM for spectral compensation and restoration. The spatial domain also exhibits substantial detail loss (Figure 13c). Similarly, though to a lesser extent, excluding the MSA component leads to moderate performance decline in spatial reconstruction (PSNR = 26.63 dB, FSIM = 0.9291), thereby demonstrating the importance of multi-scale attention mechanisms for capturing hierarchical spatial features effectively. Despite achieving the second-best spatial visual performance, Figure 13d exhibits increased blurriness relative to both the proposed method and ground truth, a degradation closely associated with the loss of multi-scale spatial attention features provided by the MSA module. The removal of the C3D-RCM module results in slight performance degradation (PSNR = 27.02 dB, FSIM = 0.9410, ERGAS = 25.98, MRAE = 18.45), as this module is primarily responsible for reconstructing both spectral features and multi-scale spatial features, with the spatial degradation manifesting as increased residual spatial artifacts (noisy pixels, as shown in Figure 13e). Finally, the training strategy itself plays a crucial role. When all three modules are utilized, switching from staged training to an end-to-end training strategy causes dramatic deterioration in all quality metrics and achieves the poorest spatial consistency among all ablation configurations (Figure 13g). This substantial improvement validates the effectiveness of the progressive learning strategy in optimizing the multi-module architecture for addressing complex restoration tasks involving low illumination lunar HSIs.

These results highlight the complementary nature of our proposed modules and training strategy, where their full integration yields superior performance compared to any subset combination. This architectural synergy demonstrates that each module distinctly addresses spectral characteristics, multi-scale spatial details, or feature reconstruction, collectively enabling more accurate and robust signal compensation. It is worth noting that when a specific module was ablated, any loss function terms exclusively designed for or tightly coupled with that module were also removed. The distinct impact of whether these corresponding loss terms are removed alongside the module or not—as a separate consideration from the module’s ablation itself—is beyond the scope of this particular study.

5. Discussion

The preceding sections have rigorously assessed the performance of the proposed network through quantitative metrics, visual inspection, spectral-profile analysis, and ablation studies. This discussion now emphasizes the framework’s practical applicability at regional scales, including the examination of spatial consistency and mineralogical fidelity in previously unsampled test regions, as well as the broader scientific implications, particularly the substantial expansion of usable data coverage in the scientifically critical lunar polar regions.

5.1. Evaluation Based on Spatial Consistency

Recent lunar exploration missions, such as NASA’s Artemis program [84,85,86] and China’s CE-7 mission [87,88,89], have increased scientific interest in potential landing zones within the lunar south polar region. Among the identified scientifically significant areas, De Gerlache Rim 1 and Connecting Ridge Extension were initially identified by NASA in 2022 as 2 of the 13 candidate landing regions for the Artemis III mission [66]. As described in Section 2.3, these regions were designated as independent test zones due to their scientific importance and the availability of both high- and low-signal M3 HSI coverage, providing an optimal testbed for evaluating the spatial consistency of the proposed signal compensation method at regional scales.

To demonstrate the regional-scale effectiveness of our approach, Figure 15 presents a comparative analysis of low-SNR M3 images from challenging low-illumination conditions with our model’s signal-compensated outputs and the corresponding high-SNR reference imagery from the two candidate landing regions. The rectangular frames delineate the boundaries of de Gerlache Rim 1 and the Connecting Ridge Extension, where coverage corresponds to overlapping zones of co-registered low- and high-SNR observations. In contrast, the blue-filled areas mark regions without paired low- and high-SNR coverage. For regional signal compensation, an overlapping patch-based processing strategy was employed to mitigate edge artifacts. The Target regions was partitioned into 32 × 32 pixel patches with a three-pixel overlap, each processed independently through our network. The processed patches were seamlessly reassembled using a gradient-based weighted blending algorithm on the overlapping zones to ensure spatial continuity, implemented as normalized weighted averaging in the overlaps.

Figure 15b presents the regional-scale signal compensation results, revealing substantial improvements in image quality and topographic clarity. The compensated imagery exhibits strong visual correlation with the high-SNR reference data (Figure 15c), with restored topographic features showing remarkable agreement with the underlying LRO LOLA digital terrain model (DTM). This stands in stark contrast to the original low-SNR observations (Figure 15a), where adverse illumination conditions have severely attenuated signal quality, rendering topographic features nearly indiscernible.

Specifically, the rim and interior walls of de Gerlache crater show remarkable correspondence between the compensated results (Figure 15b) and reference imagery (Figure 15c), with the compensated crater rim profile closely matching the underlying DTM topography. Furthermore, our compensation results accurately capture the detailed topographic relief characteristics at the base of the Connecting Ridge Extension. These distinct geological features are almost entirely obscured in the original low-SNR data. Given that spectral integration is performed during the display process with uniform contrast stretching applied across all images, the observed brightness similarity between our compensated results and the ground truth observations further validates the model’s fidelity in accurately restoring the integrated spectral signal intensity.

Nevertheless, despite the generally high compensation quality achieved across this region, a notable exception occurs in the pitted terrain northwest of Spudis crater, located in the upper-right portion of the Connecting Ridge Extension candidate landing site. This region presents uniquely challenging conditions: the compensated imagery displays persistent ambiguity and blurring artifacts that suggest fundamental limitations in signal recovery. This area represents the most poorly illuminated region in the original low-SNR imagery and exhibits significant noise artifacts that persist even in its paired high-SNR counterpart. These observations indicate that, due to its low-lying topography and the inherent challenges of M3’s polar observation conditions, this region failed to yield high-quality observational data throughout the entire OP2C period. Consequently, even the data designated as ’high-SNR’ for this specific locale appear compromised, with the corresponding low-SNR pixels likely approaching the critical lower boundary of the SNRI threshold used for HSI quality evaluation. Therefore, subsequent research should comprehensively examine how SNRI threshold selection influences the reliability and robustness of spectral compensation methodologies.

In summary, the evaluation of spatial consistency validates the efficacy of our proposed architecture for regional-scale lunar HSI compensation. The compensated results exhibit considerable visual correlation with high-SNR reference imagery, effectively restoring obscured geological features in challenging low-illumination conditions. Despite its limitations in instances of extreme signal degradation, the model demonstrates overall robustness in generating spatially coherent and topographically reliable imagery from compromised lunar HSIs.

5.2. Evaluation Based on Mineral Abundance Inversion

To validate the spectral fidelity of our SSF-3DSC framework in preserving mineralogical spectral signatures, we conducted a mineralogical inversion analysis in the Shackleton Crater region. This region presents an ideal testing ground for spectral compensation validation, given its status as a high-priority scientific target with dual significance: PSRs that are candidate sites for water ice deposits [90] and exposures of ancient crustal material. The crater’s complicated topography produces diverse illumination conditions—from direct solar illumination on rims to scattered light in shadowed areas—enabling comprehensive validation of our compensation methodology.

Reflectance anomalies observed on the western wall of Shackleton Crater are typically associated purest anorthosite (PAN)–lunar crustal rock composed of >98% plagioclase [91,92]. Plagioclase is intrinsically very reflective in the visible–NIR (especially around 1050 nm) and exhibits a characteristic Fe²⁺ absorption band near 1250 nm [93]. In practice, these spectral properties mean that regions rich in plagioclase (anorthosite) appear with elevated reflectance at 1050 nm and a stronger absorption at 1249 nm. Hence, the 1050/1249 nm reflectance ratio provides a robust proxy for plagioclase abundance [21]. In other words, higher values of the 1050/1249 nm ratio indicate stronger 1250 nm absorption relative to 1050 nm, a hallmark of plagioclase minerals.

Figure 16 presents a comparative analysis of plagioclase abundance maps: (a) raw low-SNR M3 images (0.1 < SNRI < 0.5), (b) the corresponding SSF-3DSC-compensated results, and (c) a reference mineralogical ratio map based on SELENE/Kaguya multiband imager (MI) observations (adapted from Haruyama et al. [91]). This Kaguya MI ratio map, acquired under direct solar illumination of Shackleton’s upper inner wall, provides a high-fidelity external benchmark for plagioclase distribution under optimal observational conditions. In all maps, a uniform color scale is applied where warm (orange/red) tones mark high 1050/1249 nm ratio values (high plagioclase content), and cool (green/blue) tones mark low ratios (mafic-rich or plagioclase-poor terrain).

The SSF-3DSC compensated ratio map (Figure 16b) effectively reproduces the spatial mineralogical patterns observed in the MI reference (Figure 16c). The highest ratio values are consistently observed on the western inner wall of Shackleton Crater in both datasets, corresponding to known locations of pure anorthosite (PAN) previously identified through LROC NAC observations [91]. This spectral fidelity extends throughout the crater region: elevated plagioclase abundances characterize the northwestern outer rim (displayed as orange-yellow hues), while the outer regions of Shackleton Crater maintain consistently lower values (shown in green). The compensated map preserves the same lateral variations as the MI reference, maintaining relative contrast between adjacent geological units. These results confirm that our compensation approach not only restores the overall signal but also retains the fine-scale spectral relationships essential for mineral mapping.

Conversely, the ratio map derived from low-SNR raw imagery (Figure 16a) is dominated by noise and artifacts, exhibiting anomalous values and substantial spatial irregularities throughout the crater region. Instead of delineating any clear geological units, the map displays a chaotic spatial distribution of random high and low ratio pixels–an expected outcome given the low illumination data quality. This chaotic pattern, coupled with geologically unrealistic abundance estimates (ratios exceeding 1.08 outside the crater rim), renders direct mineralogical interpretation from such signal-degraded data unreliable. This fragmentation is a direct consequence of the poor data quality: without compensation, shadowed spectra produce spurious band depths that overwhelm the true signal.

Despite overall consistency of compensation results, sparse anomalous pixels persist within the delineated region. These outliers, represented by the scattered blue pixels in the central region of Figure 16b, are sparsely scattered in the region beyond the western crater rim, contrasting with the elevated plagioclase signatures characteristic of their geological surroundings. Such localized discrepancies probably arise from the fundamental challenges of processing extremely low-SNR data, where signal attenuation in certain pixels inevitably exceeds the compensation capacity of our current framework architecture. While these sparse anomalies constitute an inherent limitation of the methodology, they remain sufficiently isolated to preserve the integrity of the broader mineralogical interpretation. This finding underscores a critical principle for practical applications: confidence in compensated data should scale with spatial coherence–isolated pixels warrant careful scrutiny, whereas regionally consistent patterns provide more reliable mineralogical insights.

5.3. Usable-Coverage Expansion and Outlook

The analyses presented in previous sections collectively demonstrate that our proposed deep learning framework significantly enhances the restoration quality and spectral fidelity of M3 HSIs acquired under challenging illumination conditions within lunar polar regions. To quantify the practical impact of our approach, we assess the distribution of signal quality across the lunar south polar region by computing the SNRI for each pixel using optimal values from repeated M3 observations.

Table 5 presents the proportional distribution of pixels across different SNRI ranges within two latitudinal zones. The statistics reveal that while high-quality observations (SNRI

< 0.1

) constitute 90% of pixels within 70°S, this proportion drops dramatically to 75% beyond 80°S. More critically, the moderately degraded pixels (

0.1 < SNRI < 0.5

)—previously deemed unusable for quantitative analysis—represent 6.47% and 15.10% of the respective regions. These pixels, concentrated in topographically shadowed areas, contain crucial mineralogical information about the Moon’s polar geology that would otherwise remain inaccessible without spectral compensation.

Building upon our validated compensation capability, we systematically processed all pixels within the 0.1–0.5 SNRI range across the lunar south polar region (70°S). This operational implementation effectively expands the usable M3 coverage from regions with SNRI

< 0.1

to those with SNRI

< 0.5

. This application aims not only to showcase the regional-scale restoration capabilities of our model but also to explore its utility in a real-world scientific context: a preliminary mineral abundance inversion in this newly accessible data regime.

This regional-scale analysis shows that the compensation approach substantially increases usable coverage in high-latitude scenes by incorporating moderate-SNR (SNRI: 0.1–0.5) observations, particularly within 80°S–90°S shadowed terrains, thereby providing a more data-rich basis for subsequent polar studies. By broadening the analyzable pixel set, the expanded dataset may support more reliable, fine-scale mapping in shadowed regions when combined with complementary observations (e.g., higher-resolution imagery or multi-sensor data). While detailed mineralogical interpretation of these newly accessible patches lies outside the present scope, the demonstrated coverage gain provides a practical foundation for future work, potentially facilitating more robust mineralogical mapping and resource assessments in high-latitude terrains.

5.4. Methodological Limitations and Future Directions

Comprehensive evaluation through quantitative metrics, visual assessments, and spectral-profile analyses (Section 4 and Section 5) demonstrates the effectiveness of SSF-3DSC. Despite these improvements, several limitations warrant attention.

First, since our paired-learning approach is trained on data within a fixed SNR range, the network cannot be expected to deliver compensation quality exceeding that of reference observations, constituting an inherent methodological boundary rather than a technical limitation addressable through architectural improvements. Therefore, the method should not be generalized to extremely low-SNR, deep-PSR spectra (typically SNRI > 0.5) and derived application-level inferences (including volatile-related interpretation), where diagnostic absorption signatures may fall below the noise floor and lie outside the learned mapping. Second, isolated anomalous pixels persist sporadically within well-compensated regions, particularly in areas experiencing extreme signal attenuation. Thus, careful contextual inspection is required when performing detailed mineralogical interpretation. Third, the selection of the SNRI threshold, which determines the boundaries for compensation, directly impacts both the coverage and fidelity of recoverable data. Notably, higher SNRI typically corresponds to increasingly noise-dominated spectra, for which restoration reliability may degrade rapidly; therefore, results beyond our validated range (0.1 < SNRI < 0.5) should be interpreted with caution. But, an optimal, universally applicable threshold remains undetermined. Finally, as the model was trained exclusively on M3 OP2C data from the lunar south polar region, its generalization to highly heterogeneous terrains, other polar regions, and alternative hyperspectral sensors remains to be validated through systematic evaluation.

Future work should focus on refine SNRI thresholding for dataset construction and investigate the model’s performance across a continuum of SNR levels to establish clear reliability bounds. Furthermore, extending SSF-3DSC to the lunar north pole and to future hyperspectral instruments targeting shadowed craters will validate the method’s generalizability and advance polar investigations, enabling more reliable water ice detection and resource prospecting. In addition, exploring multi-sensor data fusion (e.g., combining optical HSI with thermal or SAR observations) is expected to further enhance the robustness of compensation in challenging illumination regimes. Methodologically, exploring more advanced attention-guided spatial–spectral fusion backbones may provide additional gains in module synergy and overall performance.

6. Conclusions

This study introduced SSF-3DSC, a deep learning framework for restoring low-illumination M3 HSIs of the lunar south polar region. By jointly exploiting spatial–spectral information through a spectral compensation unit, a multi-scale spatial attention mechanism, and a 3D residual refinement block under a staged training strategy, SSF-3DSC outperforms representative terrestrial hyperspectral denoising models [34,35,36,44] and lunar spectral-only compensation methods [24], achieving higher spatial similarity (MPSNR = 27.68 dB; MFSIM = 0.9452) and superior spectral fidelity (ERGAS = 24.42). Quantitative metrics, visual inspections, spectral-profile analyses, and ablation experiments collectively validate the effectiveness of the architecture and importance of spatial–spectral integration, while regional-scale applications over Artemis III candidate landing regions and Shackleton Crater demonstrate its practical utility for geological interpretation and mineral abundance mapping.

Despite demonstrated improvements, the proposed approach is currently bounded by the quality of reference observations and the specific SNR threshold used for training. Minor artifacts may also persist in areas of extreme signal attenuation. Future work will address these limitations by refining reliability bounds, extending the framework to the lunar north pole and future hyperspectral instruments, and exploring multi-sensor fusion. These efforts aim to advance SSF-3DSC from a novel network into a more robust and generalizable hyperspectral signal compensation framework, facilitating high-fidelity mineralogical and potential water ice detection across expanded and more precisely defined SNR operational ranges.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/rs18050682/s1, Figure S1: Enlarged high-resolution reproduction of Figure 11 (8 representative cases) to facilitate fine-scale visual inspection.

Author Contributions

Conceptualization, R.N. and P.L.; methodology, R.N. and T.M.; software, R.N.; validation, R.N., T.M., and Y.D.; investigation, R.N. and W.Z.; data curation, R.N. and F.Z.; writing—original draft preparation, R.N.; writing—review and editing, R.N., T.M., Y.D., F.Z., and P.L.; supervision, P.L.; funding acquisition, P.L. and T.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by the National Natural Science Foundation of China (Grant Nos. 62401549 and 62495035).

Data Availability Statement

The Moon Mineralogy Mapper (M3) Level 2 data products are publicly available via the NASA Planetary Data System (PDS) Imaging Node: https://pds-imaging.jpl.nasa.gov/volumes/m3.html (accessed on 3 January 2026). The M3 geometric restoration ancillary products, including location (LOC) and observation (OBS) files developed by Gaddis et al. [60], are accessible at https://asc-pds-services.s3.us-west-2.amazonaws.com/mosaic/m3/geometric_restoration/index.html (accessed on 3 January 2026). The LRO LOLA digital terrain model (DTM) data can be downloaded from the LOLA PDS Data Node: https://imbrium.mit.edu/ (accessed on 3 January 2026). Information on the candidate landing regions for NASA’s Artemis III mission, including De Gerlache Rim 1 and Connecting Ridge Extension, is available from NASA at https://www.nasa.gov/news-release/nasa-identifies-candidate-regions-for-landing-next-americans-on-moon/ (accessed on 3 January 2026).

Acknowledgments

The authors would like to acknowledge the Moon Mineralogy Mapper (M3) and LRO LOLA science teams for their efforts in developing and maintaining these valuable lunar remote sensing datasets. The authors also thank the NASA Planetary Data System (PDS) for providing open access to the M3 and LOLA data products used in this study. In addition, the authors acknowledge the developers of the M3 geometric restoration ancillary products (LOC/OBS) for enabling accurate geometric correction of the M3 imagery.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Zhang, P.; Dai, W.; Niu, R.; Zhang, G.; Liu, G.; Liu, X.; Bo, Z.; Wang, Z.; Zheng, H.; Liu, C.; et al. Overview of the Lunar In Situ Resource Utilization Techniques for Future Lunar Missions. Space Sci. Technol. 2023, 3, 0037. [Google Scholar] [CrossRef]
Sinha, M.; Paul, S.; Ghosh, M.; Mohanty, S.N.; Pattanayak, R.M. Automated Lunar Crater Identification with Chandrayaan-2 TMC-2 Images using Deep Convolutional Neural Networks. Sci. Rep. 2024, 14, 8231. [Google Scholar] [CrossRef]
Jolliff, B.L. Science and Exploration of the Moon: Overview. In Oxford Research Encyclopedia of Planetary Science; Oxford University Press: Oxford, UK, 2021. [Google Scholar] [CrossRef]
Guo, H.; Guang, L.; Ding, Y. Moon-based Earth observation: Scientific concept and potential applications. Int. J. Digit. Earth 2018, 11, 546–557. [Google Scholar] [CrossRef]
Paige, D.A.; Siegler, M.A.; Zhang, J.A.; Hayne, P.O.; Foote, E.J.; Bennett, K.A.; Vasavada, A.R.; Greenhagen, B.T.; Schofield, J.T.; McCleese, D.J.; et al. Diviner Lunar Radiometer Observations of Cold Traps in the Moon’s South Polar Region. Science 2010, 330, 479–482. [Google Scholar] [CrossRef]
Mazarico, E.; Neumann, G.A.; Smith, D.E.; Zuber, M.T.; Torrence, M.H. Illumination conditions of the lunar polar regions using LOLA topography. Icarus 2011, 211, 1066–1081. [Google Scholar] [CrossRef]
Gladstone, G.R.; Retherford, K.D.; Egan, A.F.; Kaufmann, D.E.; Miles, P.F.; Parker, J.W.; Horvath, D.; Rojas, P.M.; Versteeg, M.H.; Davis, M.W.; et al. Far-ultraviolet reflectance properties of the Moon’s permanently shadowed regions. J. Geophys. Res. Planets 2012, 117, E00H04. [Google Scholar] [CrossRef]
Hayne, P.O.; Hendrix, A.; Sefton-Nash, E.; Siegler, M.A.; Lucey, P.G.; Retherford, K.D.; Williams, J.P.; Greenhagen, B.T.; Paige, D.A. Evidence for exposed water ice in the Moon’s south polar regions from Lunar Reconnaissance Orbiter ultraviolet albedo and temperature measurements. Icarus 2015, 255, 58–69. [Google Scholar] [CrossRef]
Colaprete, A.; Schultz, P.; Heldmann, J.; Wooden, D.; Shirley, M.; Ennico, K.; Hermalyn, B.; Marshall, W.; Ricco, A.; Elphic, R.C.; et al. Detection of Water in the LCROSS Ejecta Plume. Science 2010, 330, 463–468. [Google Scholar] [CrossRef]
Li, S.; Lucey, P.G.; Milliken, R.E.; Hayne, P.O.; Fisher, E.; Williams, J.P.; Hurley, D.M.; Elphic, R.C. Direct evidence of surface exposed water ice in the lunar polar regions. Proc. Natl. Acad. Sci. USA 2018, 115, 8907–8912. [Google Scholar] [CrossRef]
Qiao, L.; Ling, Z.C.; Head, J.W.; Ivanov, M.A.; Liu, B. Analyses of Lunar Orbiter Laser Altimeter 1,064-nm Albedo in Permanently Shadowed Regions of Polar Crater Flat Floors: Implications for Surface Water Ice Occurrence and Future In Situ Exploration. Earth Space Sci. 2019, 6, 467–488. [Google Scholar] [CrossRef]
Bickel, V.T.; Moseley, B.; Lopez-Francos, I.; Shirley, M. Peering into lunar permanently shadowed regions with deep learning. Nat. Commun. 2021, 12, 5607. [Google Scholar] [CrossRef]
Eke, V.R.; Teodoro, L.F.A.; Elphic, R.C. The spatial distribution of polar hydrogen deposits on the Moon. Icarus 2009, 200, 12–18. [Google Scholar] [CrossRef]
McClanahan, T.P.; Parsons, A.M.; Livengood, T.A.; Su, J.J.; Chin, G.; Hamara, D.; Harshman, K.; Starr, R.D. Evidence for Widespread Hydrogen Sequestration within the Moon’s South Pole Cold Traps. Planet. Sci. J. 2024, 5, 217. [Google Scholar] [CrossRef]
Pieters, C.; Boardman, J.; Buratti, B.; Chatterjee, A.; Clark, R.; Glavich, T.; Green, R.; Head, J.; Isaacson, P.; Malaret, E.; et al. The Moon Mineralogy Mapper (M3) on Chandrayaan-1. Curr. Sci. 2008, 96, 500–505. [Google Scholar]
Adams, J.B. Interpretation of Visible and Near-Infrared Diffuse Reflectance Spectra of Pyroxenes and Other Rock-Forming Minerals. In Infrared and Raman Spectroscopy of Lunar and Terrestrial Minerals; Karr, C., Ed.; Academic Press: New York, NY, USA, 1975; pp. 91–116. [Google Scholar]
Pieters, C.M.; Besse, S.; Boardman, J.; Buratti, B.; Cheek, L.; Clark, R.N.; Combe, J.P.; Dhingra, D.; Goswami, J.N.; Green, R.O.; et al. Mg-spinel lithology: A new rock type on the lunar farside. J. Geophys. Res. Planets 2011, 116, E00G08. [Google Scholar] [CrossRef]
Pommerol, A.; Schmitt, B. Strength of the H₂O near-infrared absorption bands in hydrated minerals: Effects of particle size and correlation with albedo. J. Geophys. Res. Planets 2008, 113, E10009. [Google Scholar] [CrossRef]
Pieters, C.M.; Goswami, J.N.; Clark, R.N.; Annadurai, M.; Boardman, J.; Buratti, B.; Combe, J.P.; Dyar, M.D.; Green, R.; Head, J.W.; et al. Character and Spatial Distribution of OH/H₂O on the Surface of the Moon Seen by M³ on Chandrayaan-1. Science 2009, 326, 568–572. [Google Scholar] [CrossRef]
Hapke, B. Theory of Reflectance and Emittance Spectroscopy, 2nd ed.; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar] [CrossRef]
Lemelin, M.; Lucey, P.G.; Camon, A. Compositional Maps of the Lunar Polar Regions Derived from the Kaguya Spectral Profiler and the Lunar Orbiter Laser Altimeter Data. Planet. Sci. J. 2022, 3, 63. [Google Scholar] [CrossRef]
Li, S.; Milliken, R.E. Water on the surface of the Moon as seen by the Moon Mineralogy Mapper: Distribution, abundance, and origins. Sci. Adv. 2017, 3, e1701471. [Google Scholar] [CrossRef]
Wang, W.; Jin, Q.; Chen, X.; Jiao, H.; Cai, W.; Lu, Y.; Xu, T.; Wu, Y. Character and spatial distribution of mineralogy at the lunar south polar region. Planet. Space Sci. 2024, 240, 105833. [Google Scholar] [CrossRef]
Ni, R.; Zhao, F.; Meng, T.; Du, Y.; Lu, P.; Wang, R. Signal Compensation of Moon Mineralogy Mapper (M3) Under Low Illumination Conditions Using a CycleGAN-Based Network. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 8504–8522. [Google Scholar] [CrossRef]
Adler-Golden, S.M.; Matthew, M.W.; Anderson, G.P.; Felde, G.W.; Gardner, J.A. Algorithm for de-shadowing spectral imagery. In Proceedings of the Imaging Spectrometry VIII, Seattle, WA, USA, 8–10 July 2002; SPIE: Bellingham, WA, USA, 2002; Volume 4816, pp. 203–210. [Google Scholar]
Richter, R.; Müller, A. De-shadowing of satellite/airborne imagery. Int. J. Remote Sens. 2005, 26, 3137–3148. [Google Scholar] [CrossRef]
Liu, Y.; Bioucas-Dias, J.; Li, J.; Plaza, A. Hyperspectral cloud shadow removal based on linear unmixing. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017; IEEE: New York, NY, USA, 2017; pp. 1000–1003. [Google Scholar]
Rüfenacht, D.; Fredembach, C.; Süsstrunk, S. Automatic and accurate shadow detection using near-infrared information. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 36, 1672–1678. [Google Scholar] [CrossRef] [PubMed]
Zhang, Z.; Cao, R.; Sheng, H.; Guo, M.; Shao, Z.; Wu, L. Shadow detection and removal for remote sensing images via multi-feature adaptive optimization and geometry-aware illumination compensation. Expert Syst. Appl. 2025, 282, 127769. [Google Scholar] [CrossRef]
Ilesanmi, A.E.; Ilesanmi, T.O. Methods for image denoising using convolutional neural network: A review. Complex Intell. Syst. 2021, 7, 2179–2198. [Google Scholar] [CrossRef]
Signoroni, A.; Savardi, M.; Baronio, A.; Benini, S. Deep Learning Meets Hyperspectral Image Analysis: A Multidisciplinary Review. J. Imaging 2019, 5, 52. [Google Scholar] [CrossRef]
Xie, W.; Li, Y. Hyperspectral Imagery Denoising by Deep Learning With Trainable Nonlinearity Function. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1963–1967. [Google Scholar] [CrossRef]
Windrim, L.; Ramakrishnan, R.; Melkumyan, A.; Murphy, R.J. A Physics-Based Deep Learning Approach to Shadow Invariant Representations of Hyperspectral Images. IEEE Trans. Image Process. 2018, 27, 665–677. [Google Scholar] [CrossRef]
Zhang, K.; Zuo, W.; Chen, Y.; Meng, D.; Zhang, L. Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising. IEEE Trans. Image Process. 2017, 26, 3142–3155. [Google Scholar] [CrossRef]
Yuan, Q.; Zhang, Q.; Li, J.; Shen, H.; Zhang, L. Hyperspectral Image Denoising Employing a Spatial–Spectral Deep Residual Convolutional Neural Network. IEEE Trans. Geosci. Remote Sens. 2019, 57, 1205–1218. [Google Scholar] [CrossRef]
Maffei, A.; Haut, J.M.; Paoletti, M.E.; Plaza, J.; Bruzzone, L.; Plaza, A. A Single Model CNN for Hyperspectral Image Denoising. IEEE Trans. Geosci. Remote Sens. 2020, 58, 2516–2529. [Google Scholar] [CrossRef]
Liang, H.; Ke, C.; Li, K. Hybrid Spatial-Spectral Neural Network for Hyperspectral Image Denoising. In Proceedings of the European Conference on Computer Vision, Milan, Italy, 29 September–4 October 2024; Springer: Berlin/Heidelberg, Germany, 2024; pp. 278–294. [Google Scholar]
Lai, Y.Y.; Lin, C.H.; Leng, Z.C. Hyper-Restormer: A General Hyperspectral Image Restoration Transformer for Remote Sensing Imaging. arXiv 2023, arXiv:2312.07016. [Google Scholar] [CrossRef]
Wang, R.; Ma, L.; He, G.; Johnson, B.A.; Yan, Z.; Chang, M.; Liang, Y. Transformers for Remote Sensing: A Systematic Review and Analysis. Sensors 2024, 24, 3495. [Google Scholar] [CrossRef] [PubMed]
Nakamura, R.; Kataoka, H.; Takashima, S.; Noriega, E.J.M.; Yokota, R.; Inoue, N. Pre-training Vision Transformers with Very Limited Synthesized Images. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 1–6 October 2023; pp. 20360–20369. [Google Scholar]
Li, Z.; Wallace, E.; Shen, S.; Lin, K.; Keutzer, K.; Klein, D.; Gonzalez, J. Train big, then compress: Rethinking model size for efficient training and inference of transformers. In Proceedings of the International Conference on Machine Learning, Virtual, 13–18 July 2020; PMLR: Norfolk, MA, USA, 2020; pp. 5958–5968. [Google Scholar]
Hong, D.; Han, Z.; Yao, J.; Gao, L.; Zhang, B.; Plaza, A.; Chanussot, J. SpectralFormer: Rethinking Hyperspectral Image Classification with Transformers. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5518615. [Google Scholar] [CrossRef]
Wang, Z.; Gao, F.; Dong, J.; Du, Q. Global and Local Attention-Based Transformer for Hyperspectral Image Change Detection. IEEE Geosci. Remote Sens. Lett. 2025, 22, 5500405. [Google Scholar] [CrossRef]
Ho, J.; Jain, A.; Abbeel, P. Denoising Diffusion Probabilistic Models. arXiv 2020, arXiv:2006.11239. [Google Scholar] [CrossRef]
Miao, Y.; Zhang, L.; Zhang, L.; Tao, D. DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration. arXiv 2023, arXiv:2303.06682. [Google Scholar]
Pang, L.; Rui, X.; Cui, L.; Wang, H.; Meng, D.; Cao, X. HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models. arXiv 2024, arXiv:2402.15865. [Google Scholar] [CrossRef]
Wu, C.; Wang, D.; Mao, H.; Li, Y. HSR-Diff:Hyperspectral Image Super-Resolution via Conditional Diffusion Models. arXiv 2023, arXiv:2306.12085. [Google Scholar]
Hu, X.; Liu, X.; Hong, D.; Duan, Q.; Jiang, L.; Yang, H.; Zhan, D. Recent Advances in Diffusion Models for Hyperspectral Image Processing and Analysis: A Review. arXiv 2025, arXiv:2505.11158. [Google Scholar]
Zhao, M.; Yan, L.; Chen, J. Hyperspectral image shadow compensation via cycle-consistent adversarial networks. Neurocomputing 2021, 450, 61–69. [Google Scholar] [CrossRef]
Moseley, B.; Bickel, V.; López-Francos, I.G.; Rana, L. Extreme Low-Light Environment-Driven Image Denoising over Permanently Shadowed Lunar Regions with a Physical Noise Model. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 6313–6323. [Google Scholar] [CrossRef]
Martinez, A.; Siegler, M.A. A Global Thermal Conductivity Model for Lunar Regolith at Low Temperatures. J. Geophys. Res. Planets 2021, 126, e2021JE006829. [Google Scholar] [CrossRef]
Green, R.O.; Pieters, C.; Mouroulis, P.; Eastwood, M.; Boardman, J.; Glavich, T.; Isaacson, P.; Annadurai, M.; Besse, S.; Barr, D.; et al. The Moon Mineralogy Mapper (M3) imaging spectrometer for lunar science: Instrument description, calibration, on-orbit measurements, science data calibration and on-orbit validation. J. Geophys. Res. Planets 2011, 116, E00G19. [Google Scholar] [CrossRef]
Boardman, J.W.; Pieters, C.M.; Green, R.O.; Lundeen, S.R.; Varanasi, P.; Nettles, J.; Petro, N.; Isaacson, P.; Besse, S.; Taylor, L.A. Measuring moonlight: An overview of the spatial properties, lunar coverage, selenolocation, and related Level 1B products of the Moon Mineralogy Mapper. J. Geophys. Res. Planets 2011, 116, E00G14. [Google Scholar] [CrossRef]
Hapke, B. Bidirectional reflectance spectroscopy: 1. Theory. J. Geophys. Res. Solid Earth 1981, 86, 3039–3054. [Google Scholar] [CrossRef]
Yokota, Y.; Matsunaga, T.; Ohtake, M.; Haruyama, J.; Nakamura, R.; Yamamoto, S.; Ogawa, Y.; Morota, T.; Honda, C.; Saiki, K. Lunar photometric properties at wavelengths 0.5–1.6 µm acquired by SELENE Spectral Profiler and their dependency on local albedo and latitudinal zones. Icarus 2011, 215, 639–660. [Google Scholar] [CrossRef]
Li, S.; Milliken, R.E. An empirical thermal correction model for Moon Mineralogy Mapper data constrained by laboratory spectra and Diviner temperatures. J. Geophys. Res. Planets 2016, 121, 2081–2107. [Google Scholar] [CrossRef]
Bandfield, J.L.; Poston, M.J.; Klima, R.L.; Edwards, C.S. Widespread distribution of OH/H2O on thelunar surface inferred from spectral data. Nat. Geosci. 2018, 11, 173–177. [Google Scholar] [CrossRef]
Shkuratov, Y.; Surkov, Y.; Ivanov, M.; Korokhin, V.; Kaydash, V.; Videen, G.; Pieters, C.; Stankevich, D. Improved Chandrayaan-1 M3 data: A northwest portion of the Aristarchus Plateau and contiguous maria. Icarus 2019, 321, 34–49. [Google Scholar] [CrossRef]
Breunig, M.M.; Kriegel, H.P.; Ng, R.T.; Sander, J. LOF: Identifying density-based local outliers. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, TX, USA, 15–18 May 2000; pp. 93–104. [Google Scholar]
Gaddis, L.R.; Boardman, J.; Malaret, E.; Besse, S.; Kirk, R.; Archinal, B.; Edmundson, K.; Sides, S.; Weller, L. Status of Restoring Moon Mineralogy Mapper Data to Full Spatial and Photometric Accuracy. In Proceedings of the 46th Lunar and Planetary Science Conference (LPSC), The Woodlands, TX, USA, 16–20 May 2015; Volume LPI Contribution No. 1832, p. 2033. [Google Scholar]
Malaret, E.; Battisti, A.; Gaddis, L. 2019 Status of Geometric Restoration of Moon Mineralogy Mapper Data. In Proceedings of the 50th Lunar and Planetary Science Conference (LPSC), The Woodlands, TX, USA, 18–22 March 2019; Volume LPI Contribution No. 2132, p. 2816. [Google Scholar]
Suárez-Valencia, J.E.; Rossi, A.P.; Zambon, F.; Carli, C.; Nodjoumi, G. MoonIndex, an Open-Source Tool to Generate Spectral Indexes for the Moon From M3 Data. Earth Space Sci. 2024, 11, e2023EA003464. [Google Scholar] [CrossRef]
Li, S.; Lucey, P.G.; Fraeman, A.A.; Poppe, A.R.; Sun, V.Z.; Hurley, D.M.; Schultz, P.H. Widespread hematite at high latitudes of the Moon. Sci. Adv. 2020, 6, eaba1940. [Google Scholar] [CrossRef] [PubMed]
Tseng, P.; Yun, S. A coordinate gradient descent method for nonsmooth separable minimization. Math. Program. 2009, 117, 387–423. [Google Scholar] [CrossRef]
Zhu, X.X.; Tuia, D.; Mou, L.; Xia, G.S.; Zhang, L.; Xu, F.; Fraundorfer, F. Deep learning in remote sensing: A comprehensive review and list of resources. IEEE Geosci. Remote Sens. Mag. 2017, 5, 8–36. [Google Scholar] [CrossRef]
NASA. NASA Identifies Candidate Regions for Landing Next Americans on Moon; NASA: Washington, DC, USA, 2022. [Google Scholar]
Meng, Z.; Zhao, F.; Liang, M. SS-MLP: A Novel Spectral-Spatial MLP Architecture for Hyperspectral Image Classification. Remote Sens. 2021, 13, 4060. [Google Scholar] [CrossRef]
Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
Hara, K.; Kataoka, H.; Satoh, Y. Learning spatio-temporal features with 3D residual networks for action recognition. In Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops, Venice, Italy, 22–29 October 2017; pp. 3154–3160. [Google Scholar]
Liu, Z.; Wang, W.; Ma, Q.; Liu, X.; Jiang, J. Rethinking 3D-CNN in Hyperspectral Image Super-Resolution. Remote Sens. 2023, 15, 2574. [Google Scholar] [CrossRef]
Khan, A.; Jin, W.; Haider, A.; Rahman, M.; Wang, D. Adversarial gaussian denoiser for multiple-level image denoising. Sensors 2021, 21, 2998. [Google Scholar] [CrossRef]
Lee, C.M.; Cheng, C.H.; Lin, Y.F.; Cheng, Y.C.; Liao, W.T.; Hsu, C.C.; Yang, F.E.; Wang, Y.C.F. Prompthsi: Universal hyperspectral image restoration framework for composite degradation. arXiv 2024, arXiv:2411.15922. [Google Scholar] [CrossRef]
Paura, V. Hyperspectral Unmixing of Agricultural Images taken from UAV Using Adapted U-Net Architecture. arXiv 2024, arXiv:2409.19701. [Google Scholar] [CrossRef]
Shin, C.J.; Lee, T.B.; Heo, Y.S. Dual Image Deblurring Using Deep Image Prior. Electronics 2021, 10, 2045. [Google Scholar] [CrossRef]
Zhou, W.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef]
Lai, W.S.; Huang, J.B.; Ahuja, N.; Yang, M.H. Deep laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 624–632. [Google Scholar]
Gedraite, E.S.; Hadad, M. Investigation on the effect of a Gaussian Blur in image filtering and segmentation. In Proceedings of the ELMAR-2011, Zadar, Croatia, 14–16 September 2011; IEEE: New York, NY, USA, 2011; pp. 393–396. [Google Scholar]
Jadon, S. A survey of loss functions for semantic segmentation. In Proceedings of the 2020 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), Vina del Mar, Chile, 27–29 October 2020; IEEE: New York, NY, USA, 2020; pp. 1–7. [Google Scholar]
Chen, Z.; Badrinarayanan, V.; Lee, C.Y.; Rabinovich, A. GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks. arXiv 2017, arXiv:1711.02257. [Google Scholar]
Farber, N.; Girod, B.; Villasenor, J. Extensions of ITU-T recommendation H.324 for error-resilient video transmission. IEEE Commun. Mag. 1998, 36, 120–128. [Google Scholar] [CrossRef]
Sara, U.; Akter, M.; Uddin, M.S. Image quality assessment through FSIM, SSIM, MSE and PSNR: A comparative study. J. Comput. Commun. 2019, 7, 8–18. [Google Scholar] [CrossRef]
Lin, Y.T.; Finlayson, G.D. On the Optimization of Regression-Based Spectral Reconstruction. Sensors 2021, 21, 5586. [Google Scholar] [CrossRef] [PubMed]
Carvajal, A.; Garcia-Colon, V. High capacity motors on-line diagnosis based on ultra wide band partial discharge detection. In Proceedings of the 4th IEEE International Symposium on Diagnostics for Electric Machines, Power Electronics and Drives, SDEMPED 2003, Madison, WI, USA, 1–4 June 2003; IEEE: New York, NY, USA, 2003; pp. 168–170. [Google Scholar]
Kumari, N.; Bretzfelder, J.M.; Ganesh, I.; Lang, A.; Kring, D.A. Surface Conditions and Resource Accessibility at Potential Artemis Landing Sites 007 and 011. Planet. Sci. J. 2022, 3, 224. [Google Scholar] [CrossRef]
Huang, Q.; Liu, S.; Yang, X.; Tong, X. Artemis III pre-selected landing sites engineering suitability analysis with illumination, communication and slope, based on LOLA terrain. In Proceedings of the International Conference on Remote Sensing, Mapping, and Geographic Systems (RSMG 2023), Kaifeng, China, 7–9 July 2023; SPIE: Bellingham, WA, USA, 2023; Volume 12815. [Google Scholar]
Boazman, S.J.; Shah, J.; Harish; Gawronska, A.J.; Halim, S.H.; Satyakumar, A.V.; Gilmour, C.M.; Bickel, V.T.; Barrett, N.; Kring, D.A. The Distribution and Accessibility of Geologic Targets near the Lunar South Pole and Candidate Artemis Landing Sites. Planet. Sci. J. 2022, 3, 275. [Google Scholar] [CrossRef]
Rao, W.; Fang, Y.; Peng, S.; Zhang, H.; Sheng, L.; Ma, J. Landing Site Selection Method of Lunar South Pole Region. J. Deep Space Explor. 2022, 9, 571–578. [Google Scholar] [CrossRef]
Liu, N.; Jin, Y.Q. A Statistical Rule of the Stokes Parameters of Pol-SAR for Identifying Flat Surface in PSR. IEEE Geosci. Remote Sens. Lett. 2023, 20, 4002705. [Google Scholar] [CrossRef]
Zhao, F.; Lu, P.; Meng, T.; Dang, Y.; Gao, Y.; Xu, Z.; Wang, R.; Wu, Y. Selection of Landing Sites for the Chang’E-7 Mission Using Multi-Source Remote Sensing Data. Remote Sens. 2025, 17, 1121. [Google Scholar] [CrossRef]
Zuber, M.T.; Head, J.W.; Smith, D.E.; Neumann, G.A.; Mazarico, E.; Torrence, M.H.; Aharonson, O.; Tye, A.R.; Fassett, C.I.; Rosenburg, M.A.; et al. Constraints on the volatile distribution within Shackleton crater at the lunar south pole. Nature 2012, 486, 378–381. [Google Scholar] [CrossRef]
Haruyama, J.; Yamamoto, S.; Yokota, Y.; Ohtake, M.; Matsunaga, T. An explanation of bright areas inside Shackleton Crater at the Lunar South Pole other than water-ice deposits. Geophys. Res. Lett. 2013, 40, 3814–3818. [Google Scholar] [CrossRef]
Yamamoto, S.; Nakamura, R.; Matsunaga, T.; Ogawa, Y.; Ishihara, Y.; Morota, T.; Hirata, N.; Ohtake, M.; Hiroi, T.; Yokota, Y. Massive layer of pure anorthosite on the Moon. Geophys. Res. Lett. 2012, 39, L13201. [Google Scholar] [CrossRef]
Ke, X.; Wang, C.; Du, J.; Yuan, Y.; Xu, X.; Feng, Y.; Xie, H.; Liu, S.; Tong, X. Topographic Correction of the SELENE MI Images with the LOLA DEM around Shackleton Crater. Remote Sens. 2022, 14, 4739. [Google Scholar] [CrossRef]

Figure 1. (a) Representative reflectance spectra of mafic minerals. The following samples from the Reflectance Experiment Laboratory (RELAB) database were utilized: olivine (PO-CMP-026), clinopyroxene (LS-CMP-009), orthopyroxene (PP-RGB-080), glass (LS-CMP-035-A), plagioclase (LR-CMP-217), and ilmenite (LR-CMP-218). (b) The bidirectional reflectance of H₂O ice and lunar regolith intimate mixtures with varying abundances, based on Hapke’s radiative transfer model [20]. Vertical dashed lines indicate characteristic absorption centers of the indicated materials in the VIS–NIR range.

Figure 2. Flowchart of the experimental procedure of this study.

Figure 3. Spatial distribution of M3 Level 2 data coverage above

70^{°}

S in the lunar south polar region. Data are accessible via the Planetary Data System (PDS) at https://pds-imaging.jpl.nasa.gov/volumes/m3.html (accessed on 11 February 2026).

Figure 3. Spatial distribution of M3 Level 2 data coverage above

70^{°}

S in the lunar south polar region. Data are accessible via the Planetary Data System (PDS) at https://pds-imaging.jpl.nasa.gov/volumes/m3.html (accessed on 11 February 2026).

Figure 4. Multi-step preprocessing pipeline for stripe noise suppression in M3 hyperspectral data, combining spectral-domain anomaly correction and spatial-domain Fourier filtering.

Figure 5. (a) Reflectance spectra before and after stripe noise removal using the preprocessing methodology. The blue line denotes a representative spectrum from the original M3 Level 2 HSI, the green line shows the spectrum after local outlier factor (LOF)-based outlier detection [59], and the orange line depicts the final stripe-free spectrum obtained through multi-step preprocessing. The points denote the discrete reflectance values at each spectral band. (b) SNR Index image examples computed from spectrally smoothed spectral data, showing the spatial distribution of signal quality. Low-SNR (high-SNRI) pixels concentrate at high latitudes and within craters. The color scheme classifies pixels as follows: blue (SNRI 0–0.025, high quality), gray (SNRI 0.025–0.1, moderate quality), yellow (SNRI 0.1–0.5, indirectly illuminated areas regions–target region of this study), and red/black (SNRI > 0.5, assumed to lack sufficient spectral information).

Figure 6. (a) High-signal M3 pixels of the lunar south polar region, with green areas indicating overlapping coverage with low-SNR observations. The yellow rectangle marks the subregion enlarged in panel (b). (b) Representative spatial distribution of training sample footprints (red rectangles) within the overlapping regions (green). Blue rectangles indicate the two Artemis III candidate landing sites–De Gerlache Rim 1 and Connecting Ridge Extension–designated as independent test areas.

Figure 7. Flowchart of the proposed SSF-3DSC method for signal compensation in HSI data.

Figure 8. Structure of the spectral compensation module (SCM). The detailed architecture of the spectral feature extraction component is provided in Table 1.

Figure 9. Architecture of the multi-scale spatial attention (MSA) module. Spatial feature weighting is achieved by a dual-pathway (average and maximum channel pooling) combined with multi-scale convolutions.

Figure 10. Structure of a 3D residual convolutional (3D-RC) block. The module consists of two

3 \times 3 \times 3

Conv3D layers with ReLU activations, together with a shortcut path for residual learning. The first convolution updates the channel dimension from

C_{i n}

to

C_{o u t}

, while the second maintains

C_{o u t}

to

C_{o u t}

. The residual addition preserves the spatial–spectral resolution (

B \times h \times w

), enabling volumetric feature learning across neighboring bands and pixels.

Figure 10. Structure of a 3D residual convolutional (3D-RC) block. The module consists of two

3 \times 3 \times 3

Conv3D layers with ReLU activations, together with a shortcut path for residual learning. The first convolution updates the channel dimension from

C_{i n}

to

C_{o u t}

, while the second maintains

C_{o u t}

to

C_{o u t}

. The residual addition preserves the spatial–spectral resolution (

B \times h \times w

), enabling volumetric feature learning across neighboring bands and pixels.

Figure 11. Compensation results for representative HSI cubes from the test set. Each panel is rendered as a grayscale image obtained by integrating reflectance over the full spectral range. All panels share a global linear contrast stretch computed from the collective intensity range of the dataset, ensuring direct brightness comparability. (a) Low-SNR input; (b) high-SNR reference; (c) conditional DDPM; (d) HSI-SDeCNN; (e) HSID-CNN; (f) 3D-DnCNN; (g) paired-CycleGAN; (h) proposed SSF-3DSC. Colored boxes denote the restoration quality: white (reference ROIs), red (failure), yellow (coarse restoration without fine details), and green (successful restoration with textural fidelity).

Figure 12. Comparative spectral restoration results at the pixel level (29, 9) showing the original low-signal spectrum (black), reference high-signal spectrum (red), and outputs from HSI-SDeCNN (brown), HSID-CNN (green), 3D-DnCNN (purple), paired-CycleGAN (orange), and the proposed SSF-3DSC method (blue).

Figure 13. Ablation study results demonstrating spatial restoration quality across different architectural configurations. (a) Original low-signal image. (b) Reference high-signal image. (c) w/o SCM. (d) w/o MSA. (e) w/o C3D-RCM. (f) w/o SCM+MSA. (g) End-to-end training. (h) Proposed method. Colored boxes indicate restoration quality (red: failure; yellow: coarse; green: faithful), consistent with the color scheme in Figure 11.

Figure 14. Ablation study results: spectral profiles at pixel (29, 9) across different architectural configurations: reference high-signal spectrum (red), w/o SCM (blue), w/o MSA (green), w/o C3D-RCM (purple), w/o SCM+MSA (brown), end-to-end training (cyan), and proposed method (orange).

Figure 15. Regional-scale signal compensation over the Artemis III candidate landing regions: de Gerlache Rim 1 and Connecting Ridge Extension (outlined by rectangles). Blue-filled regions indicate zones lacking paired low- and high-SNR coverage. All images are overlaid on LRO LOLA 100-m DTM. (a) Low-SNR M3 observations from poor illumination conditions; (b) compensated imagery produced by our method; (c) high-SNR reference observations from optimal illumination conditions.

Figure 16. Plagioclase abundance distribution maps derived from 1050/1249 nm band ratio analysis in the Shackleton Crater region: (a) unprocessed low-SNR M3 observations; (b) SSF-3DSC compensated results; (c) reference mineralogical map from SELENE/Kaguya MI (adapted from Haruyama et al. [91]). Higher ratio values correspond to intensified 1250 nm Fe²⁺ absorption features that are diagnostic of plagioclase mineralogy. In the inset, the white cross indicates the lunar south pole, and the white stars denote the upper and lower reference portions on the Shackleton crater wall defined in Haruyama et al. [91].

Table 1. The detailed structure of the spectral feature extraction block in SCM.

	Layer	Activation Function	Units
Spectral Feature Extraction Block	Input layer	-	B
	Hidden layer	LReLU	128
	Hidden layer	LReLU	256
	Hidden layer	LReLU	512
	Hidden layer	LReLU	256
	Hidden layer	LReLU	128
	Output layer	Sigmoid	B

Table 2. Stage-wise loss weights used in SSF-3DSC training.

Training Stage	Loss Weights
Training Stage	$ω_{MSAM}$	$ω_{RE}$	$ω_{TV}$	$ω_{SSIM}$	$ω_{Lap}$	$ω_{Log - Cosh}$
1	10	0.25	1	-	-	-
2	10	0.05	1	1	4	-
3	10	0.20	1	5	15	10

Table 3. Quantitative evaluation of the compensation results. The proposed method is shaded in gray, with the best performance highlighted in bold and the second-best underlined. All methods were retrained on the same paired dataset with identical splits, preprocessing, and optimization protocols.

Network	Spatial Quality Metrics		Spectral Quality Metrics
Network	MPSNR	MFSIM	ERGAS	MRAE
cDDPM	6.5351	0.6586	168.4341	148.2302
HSI-SDeCNN	24.3837	0.8024	34.6368	27.5304
HSID-CNN	22.8557	0.8038	41.7882	33.0030
3D-DnCNN	25.8243	0.8798	28.7848	21.5760
Paired-CycleGAN	22.9880	0.7937	27.1028	20.6651
Proposed	27.6820	0.9452	24.4203	17.5404

Table 4. Ablation study: reconstruction accuracy under different module configurations and training strategies. Checkmarks (✔) denote the inclusion of architectural modules (SCM, MSA, and C3D-RCM) and training strategies (end-to-end vs. staged). Gray shading indicates the proposed method. The best performance is highlighted in bold, and the second-best is underlined.

Modules	SCM		✔	✔		✔	✔
	MSA	✔		✔		✔	✔
	C3D-RCM	✔	✔		✔	✔	✔
Training Strategy	End-to-End				✔	✔
Training Strategy	Staged	✔	✔	✔			✔
Metrics	MPSNR	24.4489	26.6313	27.0231	24.9584	19.3300	27.6820
	MFSIM	0.8644	0.9291	0.9410	0.8727	0.8452	0.9452
	ERGAS	34.0659	26.3980	25.9780	32.5890	38.5904	24.4203
	MRAE	25.9856	18.1807	18.4539	24.2632	34.3471	17.5404

Table 5. Proportion of the SNR Index across the lunar south polar region. A higher SNR index indicates lower signal quality for the pixel.

Region	SNR Index
Region	0–0.025	0.025–0.1	0.1–0.5	0.5–1.0	>1.0
Within 70°S	71.75%	18.28%	6.47%	1.42%	2.10%
Within 80°S	43.96%	30.76%	15.10%	4.37%	5.82%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ni, R.; Meng, T.; Zhao, F.; Dang, Y.; Zhang, W.; Lu, P. Spatial–Spectral Fusion 3D Signal Compensation for Moon Mineralogy Mapper (M3) Hyperspectral Images in Low-Signal Lunar Polar Regions. Remote Sens. 2026, 18, 682. https://doi.org/10.3390/rs18050682

AMA Style

Ni R, Meng T, Zhao F, Dang Y, Zhang W, Lu P. Spatial–Spectral Fusion 3D Signal Compensation for Moon Mineralogy Mapper (M3) Hyperspectral Images in Low-Signal Lunar Polar Regions. Remote Sensing. 2026; 18(5):682. https://doi.org/10.3390/rs18050682

Chicago/Turabian Style

Ni, Rui, Tingyu Meng, Fei Zhao, Yanan Dang, Wenbin Zhang, and Pingping Lu. 2026. "Spatial–Spectral Fusion 3D Signal Compensation for Moon Mineralogy Mapper (M3) Hyperspectral Images in Low-Signal Lunar Polar Regions" Remote Sensing 18, no. 5: 682. https://doi.org/10.3390/rs18050682

APA Style

Ni, R., Meng, T., Zhao, F., Dang, Y., Zhang, W., & Lu, P. (2026). Spatial–Spectral Fusion 3D Signal Compensation for Moon Mineralogy Mapper (M3) Hyperspectral Images in Low-Signal Lunar Polar Regions. Remote Sensing, 18(5), 682. https://doi.org/10.3390/rs18050682

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Spatial–Spectral Fusion 3D Signal Compensation for Moon Mineralogy Mapper (M3) Hyperspectral Images in Low-Signal Lunar Polar Regions

Highlights

Abstract

1. Introduction

2. Data and Preprocessing

2.1. Data Source

2.2. Preprocessing

2.3. Dataset Construction

3. Methodology

3.1. Spatial–Spectral Fusion 3D Signal Compensation (SSF-3DSC) Framework Architecture

3.1.1. Spectral Compensation Module (SCM)

3.1.2. Multi-Scale Spatial Attention Module (MSA)

3.1.3. Cascaded 3D Residual Convolutional Module (C3D-RCM)

3.2. Staged Training Strategy

3.3. Loss Function

3.3.1. Stage 1: Spectral Feature Extraction

3.3.2. Stage 2: Spatial Feature Restoration

3.3.3. Stage 3: End-to-End Training

3.4. Training Details

3.5. Evaluation Metrics

3.5.1. Spatial-Domain Metrics

Peak Signal-to-Noise Ratio (PSNR)

Feature Similarity Index Matrix (FSIM)

3.5.2. Spectral-Domain Metrics

Mean Relative Absolute Error (MRAE)

Erreur Relative Globale Adimensionnelle de Synthèse (ERGAS)

4. Experimental Results

4.1. Performance Evaluation of Spatial–Spectral Quality Metrics

4.2. Performance Evaluation of Visual Quality

4.3. Performance Evaluation of Spectral Profiles

4.4. Ablation Study

5. Discussion

5.1. Evaluation Based on Spatial Consistency

5.2. Evaluation Based on Mineral Abundance Inversion

5.3. Usable-Coverage Expansion and Outlook

5.4. Methodological Limitations and Future Directions

6. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI