Article

Hyper-CycleGAN: A New Adversarial Neural Network Architecture for Cross-Domain Hyperspectral Data Generation

1 School of Advanced Technology, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
2 School of Science, Technology and Engineering, University of Sunshine Coast, Petrie, QLD 4502, Australia
3 Department of Computer Science, University of Liverpool, Liverpool L69 3DR, UK
4 Warwick Manufacturing Group (WMG), University of Warwick, Coventry CV4 7AL, UK
* Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(8), 4188; https://doi.org/10.3390/app15084188
Submission received: 31 December 2024 / Revised: 3 April 2025 / Accepted: 5 April 2025 / Published: 10 April 2025

Abstract

The scarcity of labeled training samples poses a significant challenge in hyperspectral image classification. Cross-scene classification has been shown to be an effective approach to tackle the problem of limited sample learning. This paper investigates the use of generative adversarial networks (GANs) to enable collaborative artificial intelligence learning on hyperspectral datasets. We propose and design a specialized architecture, termed Hyper-CycleGAN, for heterogeneous transfer learning across source and target scenes. This architecture establishes bidirectional mappings through efficient adversarial training and merges both source-to-target and target-to-source generators. The proposed Hyper-CycleGAN architecture harnesses the strengths of GANs, along with custom modifications such as the integration of multi-scale attention mechanisms, to enhance feature learning capabilities specifically tailored for hyperspectral data. To address training instability, the discriminator is trained with the Wasserstein generative adversarial network with gradient penalty (WGAN-GP) loss. Additionally, a label smoothing technique is introduced to enhance the generalization capability of the generator, particularly in handling unlabeled samples, thus improving model robustness. Experiments on two real-world cross-scene hyperspectral image datasets validate the effectiveness of the cross-domain Hyper-CycleGAN approach. By addressing the challenge of limited labeled samples in hyperspectral image classification, this research makes significant contributions and provides valuable insights for remote sensing, environmental monitoring, and medical imaging applications.

1. Introduction

Hyperspectral images (HSI) consist of continuous, narrow spectral bands, reflecting not only the spatial information but also the spectral information of remotely sensed objects. With high-resolution spectral information and large-scale spatial information, hyperspectral images have demonstrated outstanding capabilities in many fields, such as geological exploration and medicine [1]. Hyperspectral image classification is an important application. Many machine learning algorithms have been designed and applied to HSI classification. However, the small sample problem still makes HSI classification very challenging.
In the past few years, many deep-learning-based HSI classification methods have achieved impressive results. The authors of [2] proposed a spectral attention network that purposefully suppresses less useful spectral bands and enhances valuable spectral bands to solve the HSI classification problem. In the work of Shen et al. [3], a deep fully convolutional network (FCN) was utilized to take the entire data cube of an HSI as an input, which can then effectively integrate long-range contextual information. The authors of [4] designed a convolutional neural network (CNN)-based network that can merge features at multiple scales. However, CNNs can only handle relationships in spatial neighborhoods. The authors of [5] proposed a deep feature aggregation framework driven by a graph convolutional network (DFAGCN) model. The model first extracts features using a CNN, then introduces a GCN-based model [6], and, finally, fuses the features using a weighted concatenation method. The authors of [7] proposed the small-batch GCN (miniGCN), which trains the network in small batches, extracts different features using a CNN and miniGCN, and fuses the features using three fusion strategies for HSI classification. However, most methods based on deep learning theory require a sufficient number of labeled samples for training; otherwise, it is difficult to achieve the desired classification results. Unfortunately, off-the-shelf labeled hyperspectral images are rare, and labeling new images can be both time-consuming and expensive [8].
A promising alternative direction lies in the cross-domain generation of hyperspectral data. To improve the classification performance on a dataset with a limited number of labeled samples (called the target scene), we can use information from another similar dataset with sufficient labeled samples (called the source scene) [9]. However, because of different imaging sensors, the source and target scenes may exhibit different feature distributions, which leads to a non-negligible cross-scene classification problem. In addition, generated hyperspectral data often contain a large amount of distortion and noise.
In recent years, models based on generative adversarial networks (GANs) [10] have been used for transfer learning in remote sensing. GANs can generate fake samples that are indistinguishable from real ones. Typically, a GAN-based model consists of a generator and a discriminator. It is relatively feasible to transfer images from one scene to another by adversarial learning [11]. The auxiliary classifier GAN (AC-GAN) [12] introduces label information into the training process of the GAN. Another approach, CycleGAN (the cycle-consistent generative adversarial network), proposed by Zhu et al. [13], is a modified GAN that creates a bidirectional mapping between two different scenes. Many GAN-based methods achieve alignment of cross-scene feature distributions by minimizing domain differences [14].
In this paper, we propose and design a novel adversarial neural network architecture for cross-domain data generation, termed Hyper-CycleGAN, which is targeted toward hyperspectral data. Specifically, the Hyper-CycleGAN architecture makes the following contributions:
  • The architecture is built on a bidirectional mapping network that ensures consistency in data distribution between the source and target scenes. The generators are shared within the system to guarantee cyclic consistency and enable efficient reconstruction of samples mapped across domains. Despite improved model performance, it is observed that the data generated by the original CycleGAN generator exhibit noise and poor quality, with a lack of detailed features.
  • Consequently, modifications are made to the original internal module of the generator by incorporating a multi-scale attention mechanism. This adjustment allows for an enhanced capture of subtle relationships between spectral bands, prioritization of salient spectral features, and consideration of the spatial distribution of spectral information.
  • The discriminator employs the PatchGAN [15] architecture for local feature analysis, enhancing detail and structure recognition in hyperspectral images. Meanwhile, the Wasserstein GAN with gradient penalty (WGAN-GP) [16] technique is utilized to stabilize the training process and improve both fidelity and quality of the synthetic hyperspectral data.
  • A hyperspectral data evaluation tool based on the ResNet architecture is subsequently created. This tool employs a complex structure consisting of ResBlocks [17] and DenseBlocks [18] to efficiently capture and interpret the rich spectral information in each hyperspectral data sample, ensuring its performance in processing hyperspectral data analysis tasks and its robustness in processing these tasks.
In summary, our proposed Hyper-CycleGAN architecture harnesses the strengths of CycleGAN, along with custom modifications like the integration of multi-scale attention mechanisms [19], to enhance feature learning capabilities specifically tailored for hyperspectral data. This results in a more robust and efficient image transformation model capable of generating high-quality, realistic hyperspectral data while improving details and reducing noise. Experimental results validate the superiority of our Hyper-CycleGAN architecture over traditional approaches, demonstrating enhanced recognition performance across diverse scenarios from the perspective of hyperspectral data analysis.

2. Background

In this section, we briefly review the related topics, including deep learning for hyperspectral data and adversarial neural networks for hyperspectral data, to give readers the background for a better understanding.

2.1. Deep Learning for Hyperspectral Data

Hyperspectral data, as noted by Mehta et al. [20], differs significantly from traditional RGB imaging by capturing detailed information across numerous narrow spectral bands rather than just three broad ones. This enables precise differentiation of subtle material variations but also introduces challenges due to its high dimensionality and complexity, necessitating specialized mathematical and computational approaches for effective analysis. Researchers have applied numerous machine learning techniques to extract valuable information from these complex datasets [21]. Traditional methods have shown considerable effectiveness in hyperspectral data analysis across various applications. Techniques such as spectral indices and band ratios offer computationally efficient solutions for identifying specific materials [22]. Dimensionality reduction approaches, including principal component analysis (PCA) and independent component analysis (ICA), effectively preserve essential information while reducing data complexity [23]. Furthermore, conventional classifiers, like support vector machines (SVMs) [24] and random forests, demonstrate strong performance in hyperspectral classification, particularly when training samples are limited [25]. These traditional methods typically require fewer computational resources and perform exceptionally well when analyzing materials with well-defined spectral signatures.
A significant challenge in hyperspectral imaging applications stems from the combination of limited training samples and extensive feature sets, making reliable statistical parameter estimation difficult. While traditional methods perform admirably in scenarios with clear spectral separation, they often struggle with complex scenes containing mixed pixels and subtle spectral variations [26]. Deep learning approaches offer complementary capabilities that address these limitations, particularly for applications requiring automated feature extraction from complex datasets. Hyperspectral image classification represents a domain where deep learning has made substantial advances. Unlike conventional methods that rely on manually engineered features and may struggle to capture intricate patterns in hyperspectral data [27], deep learning models automatically learn hierarchical representations from raw data. Convolutional neural networks (CNNs) [28] and recurrent neural networks (RNNs) [29] have demonstrated superior classification accuracy for complex scenes through this automated feature learning capability.
The optimal choice between traditional and deep learning approaches depends on specific application requirements, training data availability, and computational constraints. For example, both traditional mathematical models [30] and deep learning architectures [7] have successfully addressed spectral unmixing tasks. Similarly, anomaly detection benefits from both statistical approaches [31] and deep autoencoder networks [32]. This paper presents a deep-learning-based adversarial neural network architecture for hyperspectral image classification, while acknowledging that traditional methods maintain significant relevance in specific application contexts.

2.2. Adversarial Neural Networks for Hyperspectral Data

In the context of deep learning for hyperspectral data processing, acquiring a substantial volume of labeled data to train supervised learning models presents a significant challenge due to the complexity and diversity of hyperspectral data, which escalates the cost of labeled data acquisition. As Wang et al. [33] pointed out in their research, this challenge has prompted researchers to explore unsupervised learning methods, particularly heterogeneous domain adaptation techniques for hyperspectral image classification. Similarly, Reddy et al. [34] emphasized the difficulty of acquiring labeled data in hyperspectral image processing. To address these challenges, adversarial neural networks have emerged as a viable solution. These networks can be trained without labeled data, learning the distribution and features of data in an unsupervised or semi-supervised manner. Primitive GANs operate adversarially, where the generator network produces realistic hyperspectral data samples, while the discriminator network distinguishes between generated and real data, encouraging the generator to produce increasingly realistic samples. The cycle-consistent generative adversarial network (CycleGAN) [13], a specialized architecture within adversarial neural networks, facilitates domain transformation without requiring paired training data. Recent research has demonstrated CycleGAN’s significant potential in hyperspectral data processing. Chen et al. [35] proposed a CycleGAN-based hyperspectral image super-resolution reconstruction method, while Zhao et al. [36] explored using CycleGAN for hyperspectral image denoising and feature enhancement, providing new insights for unsupervised hyperspectral data processing. For data conversion, let $s \in \mathbb{R}^{W \times H \times B}$ and $t \in \mathbb{R}^{W \times H \times B}$ be features belonging to the source and target scenes, respectively. Here, $W \times H$ is the spatial dimension, and $B$ is the spectral dimension. The aim of CycleGAN is then to learn the mappings $G_{S \to T}$ and $G_{T \to S}$. Overall, CycleGAN consists of two generators, $G_{S \to T}$ and $G_{T \to S}$, and two discriminators, $D_S$ and $D_T$. CycleGAN uses three different loss functions to perform the transformation task, as follows:
Adversarial loss: it aims to transform samples in the source domain into samples in the target domain and learns the mapping $S \to T$, i.e., $G_{S \to T}(s)$:
$$\mathcal{L}_{adv}\left(G_{S \to T}, D_T\right) = \mathbb{E}_{t \sim P_T(t)}\left[\log D_T(t)\right] + \mathbb{E}_{s \sim P_S(s)}\left[\log\left(1 - D_T\left(G_{S \to T}(s)\right)\right)\right]$$
Cycle-consistency loss: it aims to retain the original feature vectors using cyclic losses to ensure that the network is able to learn both the forward and inverse mappings:
$$\mathcal{L}_{cyc}\left(G_{S \to T}, G_{T \to S}\right) = \mathbb{E}_{s \sim P_S(s)}\left[\left\| G_{T \to S}\left(G_{S \to T}(s)\right) - s \right\|_1\right] + \mathbb{E}_{t \sim P_T(t)}\left[\left\| G_{S \to T}\left(G_{T \to S}(t)\right) - t \right\|_1\right],$$
where $\left\|\cdot\right\|_1$ is the L1-norm and $\mathbb{E}[\cdot]$ is the expectation operator.
Identity-mapping loss: It is used to preserve the identity of the target class in the network. It is generally employed to train the network for initial learning, i.e.,
$$\mathcal{L}_{id}\left(G_{S \to T}, G_{T \to S}\right) = \mathbb{E}_{s \sim P_S(s)}\left[\left\| G_{T \to S}(s) - s \right\|_1\right] + \mathbb{E}_{t \sim P_T(t)}\left[\left\| G_{S \to T}(t) - t \right\|_1\right]$$
Full objective: the overall objective of the CycleGAN network is given as:
$$\mathcal{L}_{full} = \lambda_{adv}\,\mathcal{L}_{adv}\left(G_{S \to T}, D_T\right) + \lambda_{adv}\,\mathcal{L}_{adv}\left(G_{T \to S}, D_S\right) + \lambda_{cyc}\,\mathcal{L}_{cyc}\left(G_{S \to T}, G_{T \to S}\right) + \lambda_{id}\,\mathcal{L}_{id}\left(G_{S \to T}, G_{T \to S}\right),$$
where $\lambda_{adv}$, $\lambda_{cyc}$, and $\lambda_{id}$ are the weights associated with the adversarial, cycle-consistency, and identity-mapping losses, respectively. These values are used as hyperparameters in the network during training.
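To make the interplay of these loss terms concrete, the following minimal TensorFlow sketch computes the combined generator objective. It is an illustrative reading of the equations above rather than released code; the handles g_st, g_ts, d_s, d_t, the assumption that the discriminators output logits, and the default loss weights are all assumptions.

```python
import tensorflow as tf

# Assumes discriminators return raw logits.
bce = tf.keras.losses.BinaryCrossentropy(from_logits=True)

def l1(a, b):
    return tf.reduce_mean(tf.abs(a - b))

def generator_objective(s, t, g_st, g_ts, d_s, d_t,
                        lam_adv=1.0, lam_cyc=10.0, lam_id=5.0):
    fake_t = g_st(s)   # G_{S->T}(s)
    fake_s = g_ts(t)   # G_{T->S}(t)

    # Adversarial terms: each generator tries to make the
    # opposite-domain discriminator label its output as real.
    logits_t = d_t(fake_t)
    logits_s = d_s(fake_s)
    adv = (bce(tf.ones_like(logits_t), logits_t)
           + bce(tf.ones_like(logits_s), logits_s))

    # Cycle consistency: a round trip S->T->S (and T->S->T)
    # must reproduce the original cube.
    cyc = l1(g_ts(fake_t), s) + l1(g_st(fake_s), t)

    # Identity mapping: feeding a cube already in the generator's
    # output domain should leave it unchanged.
    idt = l1(g_ts(s), s) + l1(g_st(t), t)

    return lam_adv * adv + lam_cyc * cyc + lam_id * idt
```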

3. The Proposed Hyper-CycleGAN

The proposed Hyper-CycleGAN architecture was designed to classify spectral information. In this architecture, the CycleGAN cycle network was adapted, with specific adjustments, as illustrated in Figure 1. This architecture comprises two streams: the target stream (indicated by orange lines) traverses from the target scene through the source scene and back, whereas the source stream (indicated by blue lines) follows a reverse path. Transforming original images into those of other scenes enables the generation of new hyperspectral data. However, it was observed that images generated by the original CycleGAN generator were marred by noise and poor quality, with a lack of detail. Consequently, the internal modules of the generator were modified, notably through the incorporation of a multi-scale attention mechanism. This mechanism enables the network to concentrate on various parts of the image at differing scales, thus more effectively capturing the details and features characteristic of hyperspectral images.
Generator: Figure 2 depicts the proposed architecture, highlighting the internal configuration of the specially designed Hyper-CycleGAN generator module. Internally, the Hyper-CycleGAN generator employs an encoder–decoder framework consisting of three primary components: an encoder, a multi-scale attention module, and a decoder. The encoder transforms the input image into an intermediary representation or feature vector, typically utilizing multiple convolutional layers to extract features at various levels within the image. Designed to capture inter-band correlations, the multi-scale attention module amplifies focus on specific spectral attributes and enhances the understanding of spectral spatial information, thus improving the quality and precision of the generated hyperspectral data. Conversely, the decoder reverses the encoding process, translating the feature vector back into an image in the target domain through multiple transposed convolutional layers.
Algorithm 1 presents the pseudocode for integrating a multi-scale attention module into the Hyper-CycleGAN generator, which is specifically devised for the synthesis of hyperspectral data. Our methodology addresses the unique challenges intrinsic to hyperspectral data while maintaining the original generator’s capacity to discern and assimilate complex mapping relationships among hyperspectral images. This foundational principle is critical for ensuring the generator’s ongoing effectiveness in enabling seamless transformations across diverse spectral domains.
To elevate the quality and complexity of the synthesized hyperspectral data, we have introduced multi-scale attention modules. These modules are meticulously designed to enhance the generator’s perceptual sharpness and contextual comprehension within the hyperspectral domain. Their design aims to identify correlations between varying wavelengths, focus attention on specific spectral features, and deepen the understanding of spectral spatial information. The intricate interplay between spectral bands is revealed through subtle variations and correlations across different wavelengths. Such interplay might include dependencies between adjacent spectral bands, illustrating how alterations in one band could affect the attributes of neighboring bands. Furthermore, spectral bands may engage in complex interactions, such as absorption and emission patterns, contributing to the scene’s overall spectral signature. The emphasis on prominent spectral features within the multi-scale attention modules is evidenced by the selective amplification of specific spectral components identified as vital for the synthesis process. This entails focusing on distinctive spectral peaks or valleys that correlate with specific materials or substances in the scene. By prioritizing these essential spectral features, the generator can accurately capture the fundamental characteristics of the hyperspectral data, leading to more precise synthesis outcomes.
Algorithm 1: Hyper-CycleGAN generator with multi-scale attention pseudocode
  1: inputs, down_stack, up_stack, last
  2: outputs
  3: x ← inputs
  4: skips ← []
  5: for down in down_stack do
  6:    x ← down(x)
  7:    skips.append(x)
  8: end for
  9: skips ← reversed(skips[:−1])
10: for up, skip in zip(up_stack, skips) do
11:    x ← self_attention(x)
12:    attention_outputs ← []
13:    for i = 0 to num_scales − 1 do
14:       scaled_input ← Conv1D(x, filters = x.shape[−1], kernel_size = i + 1, padding = ‘same’)
15:       attention_output ← MultiHeadAttention(scaled_input, scaled_input)
16:       attention_outputs.append(attention_output)
17:    end for
18:    merged_attention ← Concatenate(attention_outputs)
19:    x ← Concatenate([x, merged_attention])
20:    x ← up(x)
21:    x ← Concatenate([x, skip])
22: end for
23: outputs ← last(x)
24: return outputs
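For readers who prefer executable code, the following TensorFlow sketch renders the multi-scale attention step of Algorithm 1 (lines 11–19); the scale count num_scales and the number of attention heads are illustrative assumptions rather than the paper’s exact configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers

def multi_scale_attention(x, num_scales=3, num_heads=4):
    """x: a (batch, length, channels) sequence of spectral features."""
    outputs = []
    for i in range(num_scales):
        # Re-embed the features at scale i; the growing kernel size
        # (1, 2, 3, ...) widens the receptive field per scale.
        scaled = layers.Conv1D(filters=x.shape[-1], kernel_size=i + 1,
                               padding='same')(x)
        # Self-attention over the rescaled features captures
        # inter-band dependencies at this scale.
        attn = layers.MultiHeadAttention(num_heads=num_heads,
                                         key_dim=x.shape[-1])(scaled, scaled)
        outputs.append(attn)
    # Merge all scales and fuse the context back into the skip path.
    merged = layers.Concatenate()(outputs)
    return layers.Concatenate()([x, merged])
```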
Moreover, integrating spatial contextual cues into the framework fosters a deeper comprehension of the spatial distribution of spectral information. This requires examining how spectral characteristics vary across different spatial locations within a scene. For instance, the spatial arrangement of vegetation, water bodies, or built structures may reveal unique spectral signatures that contribute to the scene’s overall spectral composition. Incorporating spatial contextual information allows the generator to better situate spectral features within their spatial contexts, resulting in more realistic and contextually relevant synthesis outcomes.
In summary, incorporating multi-scale attention modules significantly enhances the Hyper-CycleGAN framework’s capability to synthesize high-quality hyperspectral data. This enhancement facilitates capturing the subtle relationships between spectral bands, prioritizing salient spectral features, and considering the spatial distribution of spectral information. Consequently, this advancement propels the state-of-the-art in hyperspectral data synthesis forward.
Discriminator: The discriminator’s role in hyperspectral data analysis is pivotal for distinguishing between real and synthetically generated hyperspectral data. It scrutinizes the data’s features, identifying genuine hyperspectral characteristics in the actual data while pinpointing discrepancies in the generated data. This discernment capability renders the discriminator an integral component in the training regimen of the generator. In our research, we selected PatchGAN as the foundational architecture for the discriminator module, explicitly optimized for hyperspectral data examination. PatchGAN employs convolutional neural networks (CNNs) to impose specific penalties on N × N feature blocks. Within this architectural framework, CNNs serve as the terminal layers, methodically segmenting the input hyperspectral image into discrete blocks or patches. This segmentation facilitates the independent assessment of each patch, enabling a localized scrutiny of spectral features. By concentrating on smaller segments of the hyperspectral image, PatchGAN not only simplifies the model’s complexity but also enables a more detailed analysis of localized image attributes. Such localized scrutiny significantly augments the framework’s proficiency in capturing the subtle details and structural intricacies inherent to hyperspectral data. Consequently, integrating PatchGAN into our methodology substantially enhanced our ability to discern complex spectral patterns and spatial interrelations within hyperspectral images, thereby improving hyperspectral data analysis and interpretation.
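A hedged sketch of such a PatchGAN-style discriminator is shown below; the band count, filter widths, and depth are assumptions, and the output is a grid of per-patch realism scores rather than a single scalar.

```python
import tensorflow as tf
from tensorflow.keras import layers

def build_patchgan_discriminator(n_bands=103):
    # Accepts hyperspectral cubes of any spatial size.
    inp = layers.Input(shape=(None, None, n_bands))
    x = inp
    for filters in (64, 128, 256):
        # Strided convolutions progressively enlarge each output
        # cell's N x N receptive field over the input.
        x = layers.Conv2D(filters, 4, strides=2, padding='same')(x)
        x = layers.LeakyReLU(0.2)(x)
    # 1-channel map: every spatial cell scores one local patch.
    out = layers.Conv2D(1, 4, strides=1, padding='same')(x)
    return tf.keras.Model(inp, out)
```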
Furthermore, to address potential training challenges and augment overall performance in hyperspectral data analysis, we incorporated Wasserstein GAN with gradient penalty (WGAN-GP) into our discriminator architecture. WGAN-GP is instrumental in refining hyperspectral data synthesis by tackling specific challenges inherent to the generation process. The essence of WGAN-GP lies in the introduction of a gradient penalty term, which constrains the norm of the discriminator’s gradients with respect to its input samples. This constraint effectively moderates the gradient, thus stabilizing the training process and facilitating the generation of more realistic hyperspectral data. WGAN-GP is crucial for overcoming issues related to gradient instability, such as gradient explosion or vanishing gradients, which pose significant challenges in hyperspectral data synthesis. By managing the gradient, WGAN-GP ensures more consistent training dynamics, culminating in enhanced efficiency in data synthesis and superior capabilities in data reconstruction and denoising. Mathematically, the loss function of WGAN-GP can be expressed as:
$$\mathcal{L}_{WGAN\text{-}GP}\left(D_S\right) = \mathbb{E}_{\hat{s} \sim P_{\hat{s}}}\left[D_S(\hat{s})\right] - \mathbb{E}_{s \sim P_s}\left[D_S(s)\right] + \lambda\,\mathbb{E}_{\tilde{s} \sim P_{\tilde{s}}}\left[\left(\left\|\nabla_{\tilde{s}} D_S(\tilde{s})\right\|_2 - 1\right)^2\right]$$
Here, $D_S$ still denotes the discriminator, $P_{\hat{s}}$ is the distribution of the generated data, $P_s$ is the distribution of the real data, $\lambda$ is the weight of the gradient penalty term, and $\tilde{s}$ is a sample drawn from $P_{\tilde{s}}$, the distribution formed by linear interpolation between real and generated samples. The incorporation of the gradient penalty term incentivizes the discriminator to acquire smoother and more stable decision boundaries throughout the training process. This augmentation substantially enhances the training efficiency of the model and elevates the quality of the generated samples.
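The gradient penalty term can be computed as in the following minimal TensorFlow sketch, which assumes 4-D image batches and a discriminator model d; it illustrates the equation above rather than reproducing the exact training code.

```python
import tensorflow as tf

def gradient_penalty(d, real, fake, lambda_gp=10.0):
    # Linear interpolation between real and generated samples: s_tilde.
    eps = tf.random.uniform([tf.shape(real)[0], 1, 1, 1], 0.0, 1.0)
    s_tilde = eps * real + (1.0 - eps) * fake
    with tf.GradientTape() as tape:
        tape.watch(s_tilde)
        scores = d(s_tilde, training=True)
    grads = tape.gradient(scores, s_tilde)
    # Penalize deviation of the per-sample gradient norm from 1.
    norms = tf.sqrt(tf.reduce_sum(tf.square(grads), axis=[1, 2, 3]) + 1e-12)
    return lambda_gp * tf.reduce_mean(tf.square(norms - 1.0))
```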
Hyper-CycleGAN classifier: Upon the generation of new hyperspectral data by the generator, our subsequent endeavor involved the development of a ResNet classifier to rigorously evaluate the quality and fidelity of the generated data. Figure 3 delineates the detailed architecture of our meticulously designed classifier, which was specifically crafted to examine the spectral signatures embedded within the synthesized hyperspectral dataset.
At the heart of our hyperspectral classifier model resides a custom amalgamation of residual blocks (ResBlocks) and dense blocks (DenseBlocks), each meticulously engineered to navigate the intricacies and complexities inherent in hyperspectral data analysis. These architectural elements formed the foundation of our classifier, empowering it to adeptly capture and interpret the rich spectral information contained within each hyperspectral data sample. Within the ResBlocks, our classifier utilized the strength of residual connections to ensure unimpeded information flow and gradient propagation, thereby facilitating effective feature extraction and representation learning. The employment of these residual connections allowed our classifier to effectively counteract the vanishing gradient issue, a common challenge in deep neural networks, thus enhancing its capacity to detect subtle spectral variations and intricate patterns inherent in hyperspectral data. Complementing the ResBlocks, the DenseBlocks are distinguished by their dense connectivity patterns, which enhance feature reuse and enable the extraction of highly discriminative spectral features. Through this dense connectivity, our classifier was poised to harness the extensive spectral information available across different wavelengths, thereby enabling precise discrimination of subtle spectral nuances and accurate classification of hyperspectral data samples.
Moreover, the architecture of our hyperspectral classifier was thoughtfully designed to include spectral-specific layers and operations, ensuring its suitability for tasks associated with hyperspectral data analysis. These spectral-specific adaptations permitted our classifier to effectively capture the unique spectral characteristics intrinsic to hyperspectral imagery, thereby augmenting its overall efficacy and robustness in hyperspectral data classification tasks.
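The following sketch illustrates how such ResBlocks and DenseBlocks might be combined into a compact classifier; the layer widths, block counts, and 9 × 9 patch size are assumptions rather than the paper’s exact configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers

def res_block(x, filters):
    # Residual connection keeps gradients flowing through deep stacks.
    h = layers.Conv2D(filters, 3, padding='same', activation='relu')(x)
    h = layers.Conv2D(filters, 3, padding='same')(h)
    if x.shape[-1] != filters:
        x = layers.Conv2D(filters, 1)(x)  # match channel widths
    return layers.ReLU()(layers.Add()([x, h]))

def dense_block(x, growth=32, n_layers=3):
    # Dense connectivity: every layer sees all previous feature maps,
    # encouraging reuse of discriminative spectral features.
    for _ in range(n_layers):
        h = layers.Conv2D(growth, 3, padding='same', activation='relu')(x)
        x = layers.Concatenate()([x, h])
    return x

def build_classifier(n_bands, n_classes, patch=9):
    inp = layers.Input(shape=(patch, patch, n_bands))
    x = res_block(inp, 64)
    x = dense_block(x)
    x = layers.GlobalAveragePooling2D()(x)
    out = layers.Dense(n_classes, activation='softmax')(x)
    return tf.keras.Model(inp, out)
```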
Hyper-CycleGAN training: Algorithm 2 shows the Hyper-CycleGAN training pseudocode. Our training was divided into two parts: generator–discriminator training and classifier training. In the generator–discriminator training part, two generators and two discriminators were trained simultaneously in an adversarial manner [37]. First, for $\mathcal{L}_{adv}$ (Equation (1)), we replaced the negative log-likelihood objective with a least squares loss [38]. This loss was more stable during training and produced higher-quality results. In particular, for the GAN loss $\mathcal{L}_{adv}\left(G_{S \to T}, D_T\right)$, we trained $G_{S \to T}$ to minimize $\mathbb{E}_{s \sim P_S(s)}\left[\left(D_T\left(G_{S \to T}(s)\right) - 1\right)^2\right]$ and trained $D_T$ to minimize $\mathbb{E}_{t \sim P_T(t)}\left[\left(D_T(t) - 1\right)^2\right] + \mathbb{E}_{s \sim P_S(s)}\left[D_T\left(G_{S \to T}(s)\right)^2\right]$.
Secondly, to mitigate model oscillations and optimize training for hyperspectral data synthesis, we incorporated a strategy proposed in [39]. This strategy involves updating the discriminator using a history of previously generated hyperspectral images rather than only the most recent ones. An image buffer was maintained to store up to 60,000 previously generated hyperspectral samples. Moreover, we introduced label smoothing during the training process to enhance model robustness and generalization capability, particularly in the context of hyperspectral data. Label smoothing adjusts the target labels for real and fake samples to encourage more calibrated predictions from the discriminator. This regularization technique prevents the discriminator from becoming overly confident in its predictions and fosters the generation of diverse and realistic hyperspectral outputs, which is essential for accurate and reliable hyperspectral data synthesis.
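A minimal sketch of the history buffer and one-sided label smoothing described above is given below; the replay probability and the 0.9 smoothing target are assumptions, while the 60,000-sample capacity follows the text.

```python
import random
import tensorflow as tf

class ImageBuffer:
    """Returns a mix of fresh and previously generated samples."""
    def __init__(self, capacity=60000):
        self.capacity, self.store = capacity, []

    def query(self, fake):
        if len(self.store) < self.capacity:
            self.store.append(fake)
            return fake
        if random.random() < 0.5:            # replay an old sample
            idx = random.randrange(len(self.store))
            old, self.store[idx] = self.store[idx], fake
            return old
        return fake                           # or pass the new one through

bce = tf.keras.losses.BinaryCrossentropy(from_logits=True)

def discriminator_loss(real_logits, fake_logits, smooth=0.9):
    # Smoothed "real" target discourages an over-confident discriminator.
    real = bce(smooth * tf.ones_like(real_logits), real_logits)
    fake = bce(tf.zeros_like(fake_logits), fake_logits)
    return real + fake
```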
During the classifier training phase, our focus shifted to harnessing the unique spectral characteristics inherent in hyperspectral data for effective classification tasks. Leveraging a ResNet specifically tailored for hyperspectral data analysis, we decoded the complex spectral signatures embedded within the synthesized dataset produced by the generator, which served as the cornerstone of the classifier’s training regimen. The weight parameters of the ResNet model were initialized using the common random initialization method, giving the classifier the versatility to capture the rich and intricate spectral features encapsulated within hyperspectral data cubes. For optimization, we chose the Adam optimizer, renowned for its efficacy in handling the inherent complexities of high-dimensional data such as hyperspectral imagery, with an initial learning rate of $1 \times 10^{-4}$. We adopted a batch training approach, allowing large volumes of hyperspectral data to be processed efficiently, so that the model could iteratively refine its understanding of the spectral landscape and hone its classification performance with each epoch. Moreover, to guard against overfitting and ensure the generalization of the classifier, we implemented an early stopping strategy with a patience value of 15 epochs, balancing convergence against the risk of premature convergence to suboptimal solutions.
Algorithm 2: Hyper-CycleGAN training loop pseudocode
1: Input:
   - Source domain hyperspectral data S and target domain hyperspectral data T
   - Models: $G_{S \to T}$, $G_{T \to S}$, $D_S$, $D_T$, hyperspectral classifier $f$
   - Hyperparameters: $\lambda_{gp}$, $\lambda_{cycle}$, training iterations N, classifier training iterations M
2: Output:
   - Trained models: $G_{S \to T}$, $G_{T \to S}$, $D_S$, $D_T$, $f$
3: procedure TRAIN HYPER-CYCLEGAN
4:   for n = 1 to N do // Main training loop
5:     Train the discriminators and generators:
6:     Draw a minibatch of samples $\{s^{(1)}, \ldots, s^{(m)}\}$ from domain S
7:     Draw a minibatch of samples $\{t^{(1)}, \ldots, t^{(n)}\}$ from domain T
8:     Compute the discriminator loss on real images:
       $J_{real}^{D} = \frac{1}{m}\sum_{i=1}^{m}\left(D_S\left(s^{(i)}\right) - 1\right)^2 + \frac{1}{n}\sum_{j=1}^{n}\left(D_T\left(t^{(j)}\right) - 1\right)^2$
9:     Compute the discriminator loss on fake images:
       $J_{fake}^{D} = \frac{1}{m}\sum_{i=1}^{m} D_T\left(G_{S \to T}\left(s^{(i)}\right)\right)^2 + \frac{1}{n}\sum_{j=1}^{n} D_S\left(G_{T \to S}\left(t^{(j)}\right)\right)^2$
10:    Compute the WGAN-GP loss term:
       $J_{gp}^{D} = \lambda_{gp}\sum_{k=1}^{K}\left(\left\|\nabla D_S\left(\tilde{s}^{(k)}\right)\right\|_2 - 1\right)^2 + \lambda_{gp}\sum_{k=1}^{K}\left(\left\|\nabla D_T\left(\tilde{t}^{(k)}\right)\right\|_2 - 1\right)^2$
11:    Compute the total discriminator loss with WGAN-GP:
       $J_{total}^{D} = J_{real}^{D} + J_{fake}^{D} + J_{gp}^{D}$
12:    Update the discriminators
13:    Compute the T→S generator loss:
       $J_{G_{T \to S}} = \frac{1}{n}\sum_{j=1}^{n}\left(D_S\left(G_{T \to S}\left(t^{(j)}\right)\right) - 1\right)^2 + \lambda_{cycle} J_{cycle}^{T \to S}$
14:    Compute the S→T generator loss:
       $J_{G_{S \to T}} = \frac{1}{m}\sum_{i=1}^{m}\left(D_T\left(G_{S \to T}\left(s^{(i)}\right)\right) - 1\right)^2 + \lambda_{cycle} J_{cycle}^{S \to T}$
15:    Update the generators
16:    Train the classifier:
17:    for m = 1 to M do // Classifier training sub-loop
18:      Draw a minibatch of samples $\{x^{(1)}, \ldots, x^{(k)}\}$ translated from domain S to T
19:      Compute the classifier loss:
         $L_{cls} = \frac{1}{k}\sum_{i=1}^{k} \mathrm{CrossEntropy}\left(f\left(x^{(i)}\right), y^{(i)}\right)$
20:      Update the classifier parameters to minimize $L_{cls}$
21:    end for
22:  end for
23: end procedure

4. Experiments

To demonstrate the effectiveness of the proposed method, experiments were conducted on two real-world HSI datasets. This section provides the details of the experiments.

4.1. Datasets

The ROSIS Pavia University (RPaviaU) scene was captured by the ROSIS HSI sensor over the University of Pavia, Italy. The ROSIS (Reflective Optics System Imaging Spectrometer) HSI sensor was manufactured by the German Aerospace Center (DLR), located in Cologne, Germany. The data cube size of the RPaviaU scene was 610 × 340 × 103, where the first two dimensions represent the spatial size, while the last dimension is the number of bands. The DAIS Pavia Center (DPaviaC) scene was captured by the DAIS sensor over the center of Pavia City, Italy. The DAIS (Digital Airborne Imaging Spectrometer) sensor was manufactured by the German Aerospace Center (DLR), located in Cologne, Germany. The data cube size of DPaviaC was 400 × 400 × 72. There were seven land cover classes shared between the RPaviaU and DPaviaC scenes, and the detailed land cover classes as well as the number of labeled samples are listed in Table 1. The data cubes and ground truth maps are illustrated in Figure 4. The white areas in the figure indicate pixels that were not involved in the classification.
The EO-1 Hangzhou (EHangzhou) scene was captured by the EO-1 Hyperion HSI sensor over Hangzhou city, Zhejiang, China. The EO-1 Hyperion HSI sensor was developed by NASA (National Aeronautics and Space Administration), headquartered in Washington, DC, USA. The data cube size of the EHangzhou scene was 590 × 230 × 198, where the first two dimensions represent the spatial size, while the last dimension is the number of bands. The ROSIS Pavia HR (RPaviaHR) scene was captured by the ROSIS HSI sensor over Pavia city, Italy. The data cube size of the RPaviaHR scene was 1400 × 512 × 102. There were three common classes between the EHangzhou and RPaviaHR scenes, which were water, ground/building, and vegetation, as shown in Table 2. The data cubes and ground truth maps are illustrated in Figure 5.
Data preprocessing: In the experiments involving the proposed Hyper-CycleGAN, the dataset was systematically partitioned into training and testing subsets to ensure a rigorous evaluation. Specifically, 150 labeled instances per class were randomly sampled from the source domain for training. Similarly, five labeled instances per class were randomly selected from the target domain for training. This random selection process was conducted using stratified sampling to preserve the class distribution. All remaining labeled instances in the target domain, which were not selected for training, were assigned to the test set and reserved solely for performance evaluation. To clarify, “labeled source samples” refer to pixels with ground truth labels from the source hyperspectral image, while “labeled target samples” refer to pixels with ground truth labels from the target hyperspectral image. The random selection was performed independently for each class to ensure balanced representation across all categories.
Z-score normalization was employed as a preprocessing step to standardize the spectral data across different wavelengths, ensuring that each spectral band contributed equally to the overall analysis. Additionally, random data augmentation techniques, such as rotation, flipping, and scaling, were applied to expand the training dataset and enhance the model’s robustness and generalization ability. These strategies were implemented to mitigate overfitting and improve the model’s performance on unseen data. It was ensured that the final classification map (Figure 6) was generated exclusively using the test set data. The model was trained solely on the selected training samples (150 per class from the source domain and 5 per class from the target domain) and subsequently applied to the remaining target samples, which were entirely withheld during training. This strict separation between training and testing data ensured that the classification results accurately represented the model’s generalization ability rather than its memorization of training data. Furthermore, complete isolation of the test set was maintained throughout the evaluation process to prevent information leakage and avoid artificially inflated performance metrics.
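The preprocessing steps described above could be implemented along the following lines; the helper names, the fixed random seed, and the convention that label 0 marks unlabeled pixels are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed for reproducibility (assumption)

def zscore(cube):
    # Standardize each spectral band independently: (x - mean) / std.
    mean = cube.mean(axis=(0, 1), keepdims=True)
    std = cube.std(axis=(0, 1), keepdims=True) + 1e-8
    return (cube - mean) / std

def stratified_sample(labels, per_class):
    # Draw `per_class` labeled pixel indices from every class
    # independently, preserving the class distribution.
    picks = []
    for c in np.unique(labels[labels > 0]):   # 0 = unlabeled (assumption)
        idx = np.flatnonzero(labels == c)
        picks.append(rng.choice(idx, size=min(per_class, idx.size),
                                replace=False))
    return np.concatenate(picks)

def augment(patch):
    # Random rotation and flip applied to spatial axes only,
    # leaving the spectral axis untouched.
    patch = np.rot90(patch, k=rng.integers(4), axes=(0, 1))
    if rng.random() < 0.5:
        patch = patch[::-1]                   # vertical flip
    return patch
```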

4.2. Experimental Settings and Training

In our experiment, we employed the TensorFlow library for implementation, harnessing the computational capabilities of an NVIDIA RTX 4090 GPU equipped with 24 GB of memory. The NVIDIA RTX 4090 used was the reference version of the card manufactured by NVIDIA, headquartered in Santa Clara, CA, USA.
Throughout our experiments, we set λ = 100, which acted as the weight adjustment coefficient in Equation (3), governing the balance between adversarial loss and cycle-consistency loss, specifically tailored for hyperspectral data. For optimization, we utilized the Adam optimizer [40] with a batch size of 70. The selection of the Adam optimizer was motivated by its effectiveness in handling high-dimensional and nonlinear optimization tasks common in hyperspectral data analysis. The choice of batch size may be influenced by factors such as computational resources and the complexity of the hyperspectral dataset. All networks were trained from scratch with a learning rate of 0.0002. The learning rate remained constant for the initial 100 epochs and was linearly decayed to zero over the subsequent 100 epochs. This learning rate decay strategy was chosen to facilitate smooth convergence while preventing overshooting and instability in the training process, crucial for hyperspectral data synthesis tasks.
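The constant-then-linear-decay schedule can be expressed as a small helper wired into the Adam optimizer, as in the following sketch; the per-epoch update hook is an assumption about how the training loop is organized.

```python
import tensorflow as tf

def lr_for_epoch(epoch, base_lr=2e-4, flat=100, decay=100):
    # Constant for the first `flat` epochs, then a linear ramp to zero
    # over the following `decay` epochs.
    if epoch < flat:
        return base_lr
    return base_lr * max(0.0, 1.0 - (epoch - flat) / decay)

optimizer = tf.keras.optimizers.Adam(learning_rate=2e-4)

# Per-epoch update inside the training loop (assumed hook):
# optimizer.learning_rate.assign(lr_for_epoch(epoch))
```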

5. Results and Discussion

This section presents the results and discussion of the experiments described in the previous section. The performance of our approach and compared techniques was evaluated in terms of the classification rate, stability, qualitative measurements of discriminative capabilities, and feature mapping.

5.1. Classification Performance

In this study, we evaluated the classification performance of five different models on a hyperspectral dataset, including the traditional CycleGAN, CycleGAN combined with WGAN-GP, our proposed model, Hyper-CycleGAN, and the additional methods, 3-D-SRNet [41] and Cycle-AC-GAN [42]. The 3-D-SRNet method is a cross-scene hyperspectral image (HSI) classification algorithm based on pretraining and fine-tuning. We used overall accuracy (OA) and average accuracy (AA) as metrics. Table 3 presents the experimental results on the DPaviaC dataset, while Figure 6 shows the results averaged over 10 independent runs on the DPaviaC dataset. Table 4 presents the experimental results on the RPaviaHR dataset. Figure 7 shows the final classification map in the RPaviaU and DPaviaC datasets. Figure 8 shows the final classification map in the EHangzhou and RPaviaHR datasets.
The superiority of adversarial neural networks in handling hyperspectral data is clear, with even the basic CycleGAN outperforming 3-D-SRNet by more than 10% in OA. This highlights the inherent capability of adversarial networks to effectively extract and leverage intricate spectral features for improved classification accuracy. Adversarial networks excel at learning complex representations directly from raw data, a critical advantage in hyperspectral data analysis. Their adaptability and feature extraction prowess make them ideally suited for tasks where traditional methods may struggle to capture subtle spectral patterns. Data analysis showed that Hyper-CycleGAN demonstrated significant improvements in OA and AA compared to the traditional CycleGAN model, with increases of approximately 6.27% and 8.53%, respectively. Compared to the CycleGAN + WGAN-GP model, Hyper-CycleGAN achieved improvements of approximately 5.50% and 7.60% in OA and AA, respectively. These results indicate a substantial performance enhancement of Hyper-CycleGAN in hyperspectral image classification tasks, particularly in terms of average accuracy, where its performance outperformed the other two models. While CycleGAN, as a traditional image transformation model, achieved certain classification performance on hyperspectral datasets, its accuracy was relatively low, primarily due to the model’s failure to effectively capture key features in the spectral data. Although CycleGAN + WGAN-GP slightly improved performance by incorporating Wasserstein GAN and gradient penalty, the improvement was limited. In contrast, Hyper-CycleGAN, by integrating multi-scale attention mechanisms and WGAN-GP loss, significantly enhanced classification performance, representing a breakthrough in hyperspectral image classification tasks. Compared to Cycle-AC-GAN, which consists of a binary domain classifier and an auxiliary land cover classifier, our proposed Hyper-CycleGAN architecture also achieved significant improvements of about 6.27% in OA and 8.53% in AA. This indicates that the proposed method achieved a substantial performance improvement when addressing hyperspectral image classification tasks and further demonstrates its effectiveness.
In addition, Hyper-CycleGAN exhibited superior performance compared to the other two models for this dataset, suggesting that the model architecture effectively utilized the information extracted from hyperspectral images, demonstrating enhanced generalization capabilities. Figure 7 and Figure 8 show the final classification maps of the two datasets, which visually illustrate the classification results produced by the models. Each pixel in the classification map represents a specific land cover class or category predicted by Hyper-CycleGAN based on the spectral features detected in the hyperspectral image. By comparing this classification map with real labels, the consistency and accuracy of the model’s classifications can be observed. This map demonstrates a high degree of consistency with the ground truth labels, especially with respect to the spatial distribution of the vegetation class and ground/building class in Figure 8. The boundaries between the different ground/building classes were coherent and tightly aligned between the two representations, highlighting Hyper-CycleGAN’s ability to accurately delineate ground features. Upon closer inspection, individual pixels precisely corresponded to the ground/building class predicted by Hyper-CycleGAN. The detected spectral features were very similar to those depicted in the ground truth, underscoring the model’s proficiency in leveraging spectral information for accurate classification. In addition, these maps successfully captured subtle variations in ground features, such as vegetation density and urban morphology. These nuances were closely related to the corresponding features present in the real annotations, demonstrating the model’s capacity to capture the intricate details inherent in hyperspectral imagery. Although Hyper-CycleGAN outperformed existing methods in classification on multiple datasets, certain limitations in the classification results remained. To further analyze these limitations, we present in Figure 7c several regions where classification errors occurred, followed by an in-depth exploration of the underlying causes. As can be seen from Figure 7, on the RPaviaU and DPaviaC datasets, the classification failures were primarily concentrated in regions with fuzzy boundaries, small areas, and noise interference. For example, in the boundary areas of different feature classes (e.g., meadows and bare soil), the model was prone to confusion due to the transitional changes in spectral features. In addition, some small areas (e.g., shadows and self-blocking bricks) exhibited deviations in classification results from the real labels due to their small size and complex spectral features. Meanwhile, some areas may be disturbed by noise (e.g., changes in lighting conditions or sensor errors), which hinders the model’s ability to accurately extract features.

5.2. Qualitative Performance

Our Hyper-CycleGAN implementation distinguishes itself from the original CycleGAN by incorporating a generator network structure with a multi-scale attention mechanism, thereby enabling the synthesis of detailed features in hyperspectral data. This specialized module captured both local and global features, thereby enhancing the reconstruction of structural texture details and improving the fidelity of synthesized hyperspectral data. By selectively attending to relevant spectral bands and spatial regions, the attention mechanism facilitated improved differentiation among various classes or categories present in the hyperspectral images. The adaptive nature of this mechanism ensured accurate capture of both fine-grained details and broader contextual information, yielding superior feature representations. Figure 9 illustrates the enhanced discriminative power achieved by our method. After applying the multi-scale attention layer, the features of a patch in the target scene became more specific and contained more detail than before, demonstrating the effectiveness of the multi-scale attention layer in capturing subtle variations in hyperspectral data. Additionally, the spatial distribution of spectral information was faithfully replicated, reflecting accurate mapping of spectral features across the hyperspectral domain.
In addition to capturing more detailed features, the quality of the mappings generated by our Hyper-CycleGAN exhibited outstanding performance. Figure 10 illustrates the spectral maps for our dataset, comprising multiple categories, each characterized by distinct spectral features, including absorption peaks, valleys, and intensity distributions. These spectral maps vividly demonstrate variations in spectral features across different categories, highlighting the importance of these differences in material or substance identification and analysis. Figure 11 displays the results of the hyperspectral spectrograms generated by our Hyper-CycleGAN for the fifth category, representing self-blocking brick spectrograms. Figure 12 displays the results of the hyperspectral spectrograms generated by our Hyper-CycleGAN for the third category, representing vegetation spectrograms. These spectrograms exhibit a remarkable level of fidelity and coherence compared with the original dataset. Through the integration of a multi-scale attention mechanism, our Hyper-CycleGAN demonstrated outstanding performance in capturing intricate spectral features, including subtle variations in reflectance and absorption patterns specific to self-blocking bricks. The synthesized spectrograms maintained a high level of clarity and realism, showcasing our model’s effectiveness in preserving the unique spectral signatures of different materials.
To thoroughly evaluate the performance of our method beyond classification accuracy, we utilized the mean structural similarity index measure (MSSIM) and mean peak signal-to-noise ratio (MPSNR) to measure the perceptual quality of the generated hyperspectral images. As shown in Table 5, Hyper-CycleGAN demonstrated superior performance in hyperspectral image generation, significantly surpassing the traditional CycleGAN. On the DPaviaC dataset, Hyper-CycleGAN achieved MPSNR and MSSIM values of 29.23 and 0.787, respectively, which are notably higher than the corresponding values of 28.317 and 0.735 achieved by CycleGAN. Similarly, on the RPaviaHR dataset, Hyper-CycleGAN achieved MPSNR and MSSIM values of 29.461 and 0.802, respectively, substantially outperforming CycleGAN’s values of 28.342 and 0.729. The improved performance of Hyper-CycleGAN is attributed to its enhanced ability to model spectral and spatial features. Specifically, the higher MSSIM values indicate that Hyper-CycleGAN excelled at maintaining structural consistency in the images, including accurately reproducing brightness, contrast, and local texture details. Furthermore, the higher MPSNR values highlight its effectiveness in reducing noise interference and preserving pixel-level image quality. Beyond individual metrics, Hyper-CycleGAN demonstrated comprehensive advantages in reproducing spectral-spatial distributions with high accuracy. This capability ensured that the generated hyperspectral images closely aligned with real data in terms of visual quality and structural fidelity, thereby setting a new benchmark in the field of hyperspectral image synthesis.
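Both metrics average per-band image-quality scores over the spectral dimension; a minimal sketch, assuming cubes scaled to [0, 1] and spatial sizes large enough for the default SSIM window, is given below.

```python
import tensorflow as tf

def mpsnr_mssim(real, fake):
    """real, fake: (H, W, B) hyperspectral cubes scaled to [0, 1]."""
    psnrs, ssims = [], []
    for b in range(real.shape[-1]):
        # Slice out one band and add a batch axis: (1, H, W, 1).
        r = real[..., b:b + 1][tf.newaxis]
        f = fake[..., b:b + 1][tf.newaxis]
        psnrs.append(tf.image.psnr(r, f, max_val=1.0))
        ssims.append(tf.image.ssim(r, f, max_val=1.0))
    # Mean over all spectral bands gives MPSNR and MSSIM.
    return (float(tf.reduce_mean(psnrs)), float(tf.reduce_mean(ssims)))
```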

5.3. Stability

The integration of the Wasserstein generative adversarial network with gradient penalty (WGAN-GP) loss into the Hyper-CycleGAN architecture resulted in fundamental improvements, significantly enhancing both the stability of training and the quality of generated hyperspectral data. Unlike the original CycleGAN, which relies solely on adversarial loss for training, incorporating WGAN-GP loss presented several pivotal advantages.
Primarily, including a gradient penalty term within WGAN-GP loss effectively mitigated mode collapse, a prevalent issue in traditional generative adversarial network (GAN) training. This phenomenon occurs when the generator fails to capture the full diversity of the target distribution, leading to limited and repetitive sample generation. Our training experiments, as illustrated in Figure 13, revealed instances of mode collapse where models struggled to produce diverse and realistic hyperspectral data samples. However, with WGAN-GP loss integration, we successfully mitigated mode collapse and achieved more stable training dynamics. The gradient penalty term regularized the discriminator by imposing constraints on gradients of generated samples, thus preventing excessive domination by the discriminator during training. Consequently, this enabled the generators to produce a broader range of hyperspectral data samples, leading to improved quality and fidelity. Furthermore, employing the Wasserstein distance metric in WGAN-GP loss offered a more meaningful measure of discrepancy between real and generated data distributions compared to traditional GANs that rely on binary classification. This provided smoother and more informative gradient signals during training, facilitating stable convergence and better-quality synthesis.
In summary, incorporating WGAN-GP loss into the Hyper-CycleGAN framework represents a significant advancement over the original CycleGAN architecture by effectively addressing issues like mode collapse and providing a more meaningful loss metric, thus enhancing the stability, diversity, and quality of generated hyperspectral data and improving the overall model performance in hyperspectral data synthesis tasks.

6. Conclusions and Future Work

This paper proposed Hyper-CycleGAN, which is an innovative adversarial neural network architecture designed specifically for hyperspectral data transformation. Utilizing the bidirectional mapping capabilities of CycleGAN, Hyper-CycleGAN ensured a consistent data distribution between the source and target hyperspectral data, facilitating efficient domain-wide reconstruction. The integration of multi-scale attention mechanisms significantly enhanced the generator’s ability to discern subtle spectral relationships, prioritize important features, and account for spatial distributions, leading to improved quality and fidelity of the hyperspectral data. Moreover, we developed a tool for evaluating hyperspectral data based on the ResNet architecture, enabling precise classification and robust performance in hyperspectral data analysis tasks. Overall, our proposed Hyper-CycleGAN architecture outperformed traditional methods, as demonstrated by its superior classification capabilities in various hyperspectral data analysis scenarios.
For future work, we aim to explore further enhancements to increase the efficiency and effectiveness of hyperspectral data transformation through advanced techniques from other fields, particularly reinforcement learning and evolutionary algorithms. Reinforcement learning approaches could enhance our framework through mechanisms like deep Q-networks that dynamically adjust attention weights based on generated data quality, enabling the model to learn optimal attention patterns across spectral bands. Policy gradient methods could optimize the generator’s transformation strategy in real time, helping the model adapt to variations in input data characteristics, such as seasonal changes in remote sensing applications without requiring retraining. Complementarily, evolutionary algorithms could be implemented to evolve optimal network architectures for the generator and discriminator components. A coevolutionary approach could simultaneously optimize both architectures, leading to more robust adversarial training. This would involve fitness functions incorporating both spectral fidelity measures and classification accuracy, ensuring that evolved architectures maintain high-quality transformations. Future research will also focus on hybrid approaches combining reinforcement learning for parameter optimization with evolutionary algorithms for architecture search, addressing challenges related to scalability and computational efficiency, specifically tailored to hyperspectral data. This strategic approach aims to advance the field and foster applications in real-world scenarios, such as precision agriculture, environmental monitoring, and mineral exploration.

Author Contributions

Conceptualization, Y.H., K.P.S. and L.M.A.; methodology, Y.H. and K.P.S.; software, Y.H.; investigation, Y.H. and K.P.S.; writing—original draft, Y.H., K.P.S. and L.M.A.; writing—review and editing, K.P.S., L.M.A., X.Z. and B.P.; supervision, K.P.S., X.Z. and B.P.; project administration, K.P.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: https://www.ehu.eus/ccwintco/index.php?title=Hyperspectral_Remote_Sensing_Scenes (accessed on 2 April 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhong, C.; Zhang, J.; Wu, S.; Zhang, Y. Cross-scene deep transfer learning with spectral feature adaptation for hyperspectral image classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 2861–2873.
  2. Mou, L.; Zhu, X.X. Learning to pay attention on spectral domain: A spectral attention module-based convolutional network for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 2019, 58, 110–122.
  3. Shen, Y.; Zhu, S.; Chen, C.; Du, Q.; Xiao, L.; Chen, J.; Pan, D. Efficient deep learning of nonlocal features for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 2020, 59, 6029–6043.
  4. Zhang, H.; Yu, H.; Xu, Z.; Zheng, K.; Gao, L. A novel classification framework for hyperspectral image classification based on multi-scale dense network. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Brussels, Belgium, 11–16 July 2021; pp. 2238–2241.
  5. Xu, K.; Huang, H.; Deng, P.; Li, Y. Deep feature aggregation framework driven by graph convolutional network for scene classification in remote sensing. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 5751–5765.
  6. Kipf, T.N.; Welling, M. Semi-supervised classification with graph convolutional networks. arXiv 2016, arXiv:1609.02907.
  7. Hong, D.; Gao, L.; Yao, J.; Zhang, B.; Plaza, A.; Chanussot, J. Graph convolutional networks for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 2020, 59, 5966–5978.
  8. Paoletti, M.E.; Haut, J.M.; Plaza, J.; Plaza, A. Deep learning classifiers for hyperspectral imaging: A review. ISPRS J. Photogramm. Remote Sens. 2019, 158, 279–317.
  9. Liu, T.; Zhang, X.; Gu, Y. Unsupervised cross-temporal classification of hyperspectral images with multiple geodesic flow kernel learning. IEEE Trans. Geosci. Remote Sens. 2019, 57, 9688–9701.
  10. Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial networks. Commun. ACM 2020, 63, 139–144.
  11. Lata, K.; Dave, M.; Nishanth, K.N. Image-to-image translation using generative adversarial network. In Proceedings of the 2019 3rd International Conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 12–14 June 2019; pp. 186–189.
  12. Odena, A.; Olah, C.; Shlens, J. Conditional image synthesis with auxiliary classifier GANs. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 2642–2651.
  13. Zhu, J.Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2223–2232.
  14. Ma, X.; Mou, X.; Wang, J.; Liu, X.; Geng, J.; Wang, H. Cross-dataset hyperspectral image classification based on adversarial domain adaptation. IEEE Trans. Geosci. Remote Sens. 2020, 59, 4179–4190.
  15. Kim, T.; Cha, M.; Kim, H.; Lee, J.K.; Kim, J. Learning to discover cross-domain relations with generative adversarial networks. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 1857–1865.
  16. Gulrajani, I.; Ahmed, F.; Arjovsky, M.; Dumoulin, V.; Courville, A.C. Improved training of Wasserstein GANs. Adv. Neural Inf. Process. Syst. 2017, 30, 5767–5777.
  17. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778.
  18. Huang, G.; Liu, Z.; Van Der Maaten, L.; Weinberger, K.Q. Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 4700–4708.
  19. Guo, Q.; Qiu, X.; Liu, P.; Xue, X.; Zhang, Z. Multi-scale self-attention for text classification. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 7847–7854.
  20. Mehta, N.; Shaik, S.; Devireddy, R.; Gartia, M.R. Single-cell analysis using hyperspectral imaging modalities. J. Biomech. Eng. 2018, 140, 020802.
  21. Varshney, P.K.; Arora, M.K. Advanced Image Processing Techniques for Remotely Sensed Hyperspectral Data; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2004.
  22. Zhang, B.; Wu, D.; Zhang, L.; Jiao, Q.; Li, Q. Application of hyperspectral remote sensing for environment monitoring in mining areas. Environ. Earth Sci. 2012, 65, 649–658.
  23. Hong, D.; Yokoya, N.; Chanussot, J.; Zhu, X.X. An augmented linear mixing model to address spectral variability for hyperspectral unmixing. IEEE Trans. Image Process. 2018, 28, 1923–1938.
  24. Liang, H.; Bao, W.; Shen, X. Adaptive weighting feature fusion approach based on generative adversarial network for hyperspectral image classification. Remote Sens. 2021, 13, 198.
  25. Tong, F.; Zhang, Y. Exploiting spectral–spatial information using deep random forest for hyperspectral imagery classification. IEEE Geosci. Remote Sens. Lett. 2021, 19, 1–5.
  26. Ahmad, M.; Shabbir, S.; Roy, S.K.; Hong, D.; Wu, X.; Yao, J.; Khan, A.M.; Mazzara, M.; Distefano, S.; Chanussot, J. Hyperspectral image classification—Traditional to deep models: A survey for future prospects. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 15, 968–999.
  27. Serpico, S.B.; Moser, G. Extraction of spectral channels from hyperspectral images for classification purposes. IEEE Trans. Geosci. Remote Sens. 2007, 45, 484–495.
  28. Zhang, M.; Li, W.; Du, Q.; Gao, L.; Zhang, B. Feature extraction for classification of hyperspectral and LiDAR data using patch-to-patch CNN. IEEE Trans. Cybern. 2018, 50, 100–111.
  29. Wu, H.; Prasad, S. Convolutional recurrent neural networks for hyperspectral data classification. Remote Sens. 2017, 9, 298.
  30. Uezato, T.; Hong, D.; Yokoya, N.; He, W. Guided deep decoder: Unsupervised image pair fusion. In Proceedings of the European Conference on Computer Vision, Glasgow, UK, 23–28 August 2020; Springer International Publishing: Cham, Switzerland, 2020; pp. 87–102.
  31. Li, S.; Wang, W.; Qi, H.; Ayhan, B.; Kwan, C.; Vance, S. Low-rank tensor decomposition based anomaly detection for hyperspectral imagery. In Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015; pp. 4525–4529.
  32. Ahmad, M.; Khan, A.M.; Hussain, R.; Protasov, S.; Chow, F.; Khattak, A.M. Unsupervised geometrical feature learning from hyperspectral data. In Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence (SSCI), Athens, Greece, 6–9 December 2016; pp. 1–6.
  33. Wang, X.; Li, Y.; Cheng, Y. Hyperspectral image classification based on unsupervised heterogeneous domain adaptation CycleGAN. Chin. J. Electron. 2020, 29, 608–614.
  34. Reddy, T.S.; Anusha, S.; Reddy, E.V.V.; Shivamani, V.V.; Mrudula, V. Hyperspectral image classification based on CycleGAN and EfficientNet. In Proceedings of the 2024 Fourth International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT), Bhilai, India, 11–12 January 2024; pp. 1–6.
  35. Chen, C.; Wang, Y.; Zhang, N.; Zhang, Y.; Zhao, Z. A review of hyperspectral image super-resolution based on deep learning. Remote Sens. 2023, 15, 2853.
  36. Zhao, M.; Yan, L.; Chen, J. Hyperspectral image shadow compensation via cycle-consistent adversarial networks. Neurocomputing 2021, 450, 61–69.
  37. Kaneko, T.; Kameoka, H.; Tanaka, K.; Hojo, N. CycleGAN-VC3: Examining and improving CycleGAN-VCs for mel-spectrogram conversion. arXiv 2020, arXiv:2010.11672.
  38. Mao, X.; Li, Q.; Xie, H.; Lau, R.Y.; Wang, Z.; Paul Smolley, S. Least squares generative adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2794–2802.
  39. Shrivastava, A.; Pfister, T.; Tuzel, O.; Susskind, J.; Wang, W.; Webb, R. Learning from simulated and unsupervised images through adversarial training. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2107–2116.
  40. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980.
  41. Jiang, Y.; Li, Y.; Zhang, H. Hyperspectral image classification based on 3-D separable ResNet and transfer learning. IEEE Geosci. Remote Sens. Lett. 2019, 16, 1949–1953.
  42. Meng, Z.; Ye, M.; Yao, F.; Xiong, F.; Qian, Y. Cross-scene hyperspectral image classification based on cycle-consistent adversarial networks. In Proceedings of the IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; pp. 1912–1915.
Figure 1. The framework of our proposed Hyper-CycleGAN.
Figure 2. Detailed structure of the generator (G) and the discriminator (D) in the model.
Figure 3. Hyper-CycleGAN classifier architecture.
Figure 4. Datasets along with their ground truths. (a) RPaviaU dataset and (b) DPaviaC dataset.
Figure 5. Datasets along with their ground truths. (a) EHangzhou dataset and (b) RPaviaHR dataset.
Figure 6. The results averaged over 10 independent runs.
Figure 7. The color classification map of target pixels in RPaviaU and DPaviaC: (a) target database ground truth, (b) the classification map obtained by the proposed method, and (c) regions of classification failure.
Figure 8. The color classification map of target pixels in EHangzhou and RPaviaHR: (a) target database ground truth and (b) the classification map obtained by the proposed method.
Figure 9. Feature space mapping of the target scene.
Figure 10. The averaged spectral features in the datasets. (a) RPaviaU dataset spectrum, (b) DPaviaC dataset spectrum, (c) EHangzhou dataset spectrum, and (d) RPaviaHR dataset spectrum.
Figure 11. The averaged spectral features of the “self-blocking bricks” class in the RPaviaU and DPaviaC datasets.
Figure 12. The averaged spectral features of the “vegetation” class in the EHangzhou and RPaviaHR datasets.
Figure 13. (a) Original discriminator loss training plot (mode collapse highlighted in the red box). (b) Discriminator training plot using the WGAN-GP loss.
Table 1. Number of labeled samples in each land cover class within the RPaviaU and DPaviaC datasets.
#   Class Name              Labeled Samples
                            RPaviaU    DPaviaC
1   Trees                     3064       2424
2   Asphalt                   6631       1704
3   Bitumen                   1330        685
4   Shadows                    947        241
5   Self-Blocking Bricks       3682       2237
6   Meadows                  18,649       1251
7   Bare Soil                  5029       1475
Table 2. Number of labeled samples in each land cover class within the EHangzhou and RPaviaHR datasets.
#   Class Name           Labeled Samples
                         EHangzhou    RPaviaHR
1   Water                   18,403      22,525
2   Ground/Building         77,450     145,416
3   Vegetation              40,207      22,961
Table 3. OA and AA of each method on the DPaviaC dataset.
Method                 OA (%)   AA (%)
3-D-SRNet              73.67    76.48
CycleGAN               83.75    84.34
CycleGAN + WGAN-GP     84.52    85.27
Cycle-AC-GAN [42]      88.14    88.72
Hyper-CycleGAN         90.02    92.87
Table 4. OA and AA of each method on the RPaviaHR dataset.
Method                 OA (%)   AA (%)
3-D-SRNet              83.54    89.97
CycleGAN               86.73    87.62
CycleGAN + WGAN-GP     87.74    88.72
Cycle-AC-GAN [42]      92.36    96.47
Hyper-CycleGAN         94.67    97.87
Table 5. MPSNR and MSSIM on the DPaviaC and RPaviaHR datasets. “↑” indicates that higher values correspond to better performance.
                      DPaviaC                  RPaviaHR
Method                MPSNR (↑)   MSSIM (↑)    MPSNR (↑)   MSSIM (↑)
Ground Truth          N/A         1            N/A         1
CycleGAN              28.317      0.735        28.342      0.729
Hyper-CycleGAN        29.23       0.787        29.461      0.802
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
