Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (465)

Search Parameters:
Keywords = GAN for feature-enhanced

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 566 KB  
Article
AE-CTGAN: Autoencoder–Conditional Tabular GAN for Multi-Omics Imbalanced Class Handling and Cancer Outcome Prediction
by Ibrahim Al-Hurani, Sara H. ElFar, Abedalrhman Alkhateeb and Salama Ikki
Algorithms 2026, 19(2), 95; https://doi.org/10.3390/a19020095 (registering DOI) - 25 Jan 2026
Abstract
The rapid advancement of sequencing technologies has led to the generation of complex multi-omics data, which are often high-dimensional, noisy, and imbalanced, posing significant challenges for traditional machine learning methods. The novelty of this work resides in the architecture-level integration of autoencoders with [...] Read more.
The rapid advancement of sequencing technologies has led to the generation of complex multi-omics data, which are often high-dimensional, noisy, and imbalanced, posing significant challenges for traditional machine learning methods. The novelty of this work resides in the architecture-level integration of autoencoders with Generative Adversarial Network (GAN) and Conditional Tabular Generative Adversarial Network (CTGAN) models, where the autoencoder is employed for latent feature extraction and noise reduction, while GAN-based models are used for realistic sample generation and class imbalance mitigation in multi-omics cancer datasets. This study proposes a novel framework that combines an autoencoder for dimensionality reduction and a CTGAN for generating synthetic samples to balance underrepresented classes. The process starts with selecting the most discriminative features, then extracting latent representations for each omic type, merging them, and generating new minority samples. Finally, all samples are used to train a neural network to predict specific cancer outcomes, defined here as clinically relevant biomarkers or patient characteristics. In this work, the considered outcome in the bladder cancer is Tumor Mutational Burden (TMB), while the breast cancer outcome is menopausal status, a key factor in treatment planning. Experimental results show that the proposed model achieves high precision, with an average precision of 0.9929 for TMB prediction in bladder cancer and 0.9748 for menopausal status in breast cancer, and reaches perfect precision (1.000) for the positive class in both cases. In addition, the proposed AE–CTGAN framework consistently outperformed an autoencoder combined with a standard GAN across all evaluation metrics, achieving average accuracies of 0.9929 and 0.9748, recall values of 0.9846 and 0.9777, and F1-scores of 0.9922 for bladder and breast cancer datasets, respectively. A comparative fidelity analysis in the latent space further demonstrated the superiority of CTGAN, reducing the average Euclidean distance between real and synthetic samples by approximately 72% for bladder cancer and by up to 84% for breast cancer compared to a standard GAN. These findings confirm that CTGAN generates high-fidelity synthetic samples that preserve the structural characteristics of real multi-omics data, leading to more reliable class balancing and improved predictive performance. Overall, the proposed framework provides an effective and robust solution for handling class imbalance in multi-omics cancer data and enhances the accuracy of clinically relevant outcome prediction. Full article
Show Figures

Figure 1

36 pages, 1564 KB  
Article
Transformer-Based Multi-Source Transfer Learning for Intrusion Detection Models with Privacy and Efficiency Balance
by Baoqiu Yang, Guoyin Zhang and Kunpeng Wang
Entropy 2026, 28(2), 136; https://doi.org/10.3390/e28020136 (registering DOI) - 24 Jan 2026
Abstract
The current intrusion detection methods suffer from deficiencies in terms of cross-domain adaptability, privacy preservation, and limited effectiveness in detecting minority-class attacks. To address these issues, a novel intrusion detection model framework, TrMulS, is proposed that integrates federated learning, generative adversarial networks with [...] Read more.
The current intrusion detection methods suffer from deficiencies in terms of cross-domain adaptability, privacy preservation, and limited effectiveness in detecting minority-class attacks. To address these issues, a novel intrusion detection model framework, TrMulS, is proposed that integrates federated learning, generative adversarial networks with multispace feature enhancement ability, and transformers with multi-source transfer ability. First, at each institution (source domain), local spatial features are extracted through a CNN, multiple subsets are constructed (to solve the feature singularity problem), and the multihead self-attention mechanism of the transformer is utilized to capture the correlation of features. Second, the synthetic samples of the target domain are generated on the basis of the improved Exchange-GAN, and the cross-domain transfer module is designed by combining the Maximum Mean Discrepancy (MMD) to minimize the feature distribution difference between the source domain and the target domain. Finally, the federated transfer learning strategy is adopted. The model parameters of each local institution are encrypted and uploaded to the target server and then aggregated to generate the global model. These steps iterate until convergence, yielding the globally optimal model. Experiments on the ISCX2012, KDD99 and NSL-KDD intrusion detection standard datasets show that the detection accuracy of this method is significantly improved in cross-domain scenarios. This paper presents a novel paradigm for cross-domain security intelligence analysis that considers efficiency, privacy and balance. Full article
Show Figures

Figure 1

57 pages, 4130 KB  
Review
Critical Review of Recent Advances in AI-Enhanced SEM and EDS Techniques for Metallic Microstructure Characterization
by Gasser Abdelal, Chi-Wai Chan and Sean McLoone
Appl. Sci. 2026, 16(2), 975; https://doi.org/10.3390/app16020975 - 18 Jan 2026
Viewed by 168
Abstract
This critical review explores the transformative impact of artificial intelligence (AI), particularly machine learning (ML) and computer vision (CV), on scanning electron microscopy (SEM) and energy dispersive spectroscopy (EDS) for metallic microstructure analysis, spanning research from 2010 to 2025. It critically evaluates how [...] Read more.
This critical review explores the transformative impact of artificial intelligence (AI), particularly machine learning (ML) and computer vision (CV), on scanning electron microscopy (SEM) and energy dispersive spectroscopy (EDS) for metallic microstructure analysis, spanning research from 2010 to 2025. It critically evaluates how AI techniques balance automation, accuracy, and scalability, analysing why certain methods (e.g., Vision Transformers for complex microstructures) excel in specific contexts and how trade-offs in data availability, computational resources, and interpretability shape their adoption. The review examines AI-driven techniques, including semantic segmentation, object detection, and instance segmentation, which automate the identification and characterisation of microstructural features, defects, and inclusions, achieving enhanced accuracy, efficiency, and reproducibility compared to traditional manual methods. It introduces the Microstructure Analysis Spectrum, a novel framework categorising techniques by task complexity and scalability, providing a new lens to understand AI’s role in materials science. The paper also evaluates AI’s role in chemical composition analysis and predictive modelling, facilitating rapid forecasts of mechanical properties such as hardness and fracture strain. Practical applications in steelmaking (e.g., automated inclusion characterisation) and case studies on high-entropy alloys and additively manufactured metals underscore AI’s benefits, including reduced analysis time and improved quality control. Extending prior reviews, this work incorporates recent advancements like Vision Transformers, 3D Convolutional Neural Networks (CNNs), and Generative Adversarial Networks (GANs). Key challenges—data scarcity, model interpretability, and computational demands—are critically analysed, with representative trade-offs from the literature highlighted (e.g., GANs can substantially augment effective dataset size through synthetic data generation, typically at the cost of significantly increased training time). Full article
(This article belongs to the Special Issue Advances in AI and Multiphysics Modelling)
Show Figures

Figure 1

14 pages, 2317 KB  
Article
Shrimp-Derived Chitosan for the Formulation of Active Films with Mexican Propolis: Physicochemical and Functional Evaluation of the Biomaterial
by Alejandra Delgado-Lozano, Pedro Alberto Ledesma-Prado, César Leyva-Porras, Lydia Paulina Loya-Hernández, César Iván Romo-Sáenz, Carlos Arzate-Quintana, Manuel Román-Aguirre, María Alejandra Favila-Pérez, Alva Rocío Castillo-González and Celia María Quiñonez-Flores
Coatings 2026, 16(1), 124; https://doi.org/10.3390/coatings16010124 - 17 Jan 2026
Viewed by 159
Abstract
The development of functional biomaterials based on natural polymers has gained increasing relevance due to the growing demand for sustainable and bioactive alternatives for biomedical and technological applications. In this study, chitosan was obtained from shrimp exoskeletons and used to formulate active films [...] Read more.
The development of functional biomaterials based on natural polymers has gained increasing relevance due to the growing demand for sustainable and bioactive alternatives for biomedical and technological applications. In this study, chitosan was obtained from shrimp exoskeletons and used to formulate active films enriched with Mexican propolis, aiming to evaluate the influence of the extract on the physicochemical and functional properties of the resulting biomaterial. Propolis was incorporated into the chitosan film-forming solution at a final concentration of 1.0% (v/v). The propolis employed met the requirements of the Mexican Official Standard NOM-003-SAG/GAN-2017 regarding flavonoid content, total phenolic compounds, and antimicrobial activity; additionally, it was evaluated through antioxidant activity, hemolysis, and acute toxicity (LD50) assays to provide a broader biological and safety assessment. The extracted chitosan exhibited a degree of deacetylation of 74% and characteristic FTIR spectral features comparable to those of commercial chitosan, confirming the quality of the obtained polymer. Chitosan–propolis films exhibited antimicrobial activity against Staphylococcus aureus, Escherichia coli, and Candida albicans, whereas pure chitosan films showed no inhibitory effect. Thermal analyses (TGA/DSC) revealed a slight reduction in thermal stability due to the incorporation of thermolabile polyphenolic compounds, along with increased thermal complexity of the system. SEM observations demonstrated reduced microbial adhesion and marked morphological damage in microorganisms exposed to the functionalized films. Overall, the incorporation of Mexican propolis enabled the development of a hybrid biomaterial with enhanced antimicrobial performance and potential application in wound dressings and bioactive coatings. Full article
(This article belongs to the Special Issue Coatings with Natural Products)
Show Figures

Graphical abstract

25 pages, 44747 KB  
Article
Small-Sample Thermal Fault Diagnosis Using Knowledge Graph and Generative Adversarial Networks
by Ke Chen, Gang Xu, Yunjie Zhang and Yi Wang
Electronics 2026, 15(2), 355; https://doi.org/10.3390/electronics15020355 - 13 Jan 2026
Viewed by 112
Abstract
The scarcity of fault samples significantly impedes the generalization of data-driven diagnosis models for local thermal imbalances in integrated energy systems. To overcome this limitation, this paper proposes a novel knowledge graph-guided conditional generative adversarial network (KG-GAN) framework. The approach begins by constructing [...] Read more.
The scarcity of fault samples significantly impedes the generalization of data-driven diagnosis models for local thermal imbalances in integrated energy systems. To overcome this limitation, this paper proposes a novel knowledge graph-guided conditional generative adversarial network (KG-GAN) framework. The approach begins by constructing a dynamically updatable fault knowledge graph for district heating systems, which explicitly encapsulates pipeline topology, thermodynamic principles, and fault propagation mechanisms. The derived knowledge embeddings are then fused with physics-based constraints into the adversarial learning process, effectively alleviating the issue of physically implausible sample generation that plagues conventional data-centric models. Experimental validation on a district heating platform, involving four common fault types, demonstrates the superiority of our method. With only 100 samples per fault category, a diagnostic model trained on KG-GAN-generated data achieves a classification accuracy of 91.7%, outperforming a GAN-based baseline by 8.3%. Furthermore, t-Distributed Stochastic Neighbor Embedding (t-SNE) visualization reveals a 92.3% feature distribution consistency between generated and real samples, confirming the method’s capability to enhance diagnostic robustness and physical interpretability under small-sample conditions. Full article
Show Figures

Figure 1

23 pages, 6250 KB  
Article
Refining Open-Source Asset Management Tools: AI-Driven Innovations for Enhanced Reliability and Resilience of Power Systems
by Gopal Lal Rajora, Miguel A. Sanz-Bobi, Lina Bertling Tjernberg and Pablo Calvo-Bascones
Technologies 2026, 14(1), 57; https://doi.org/10.3390/technologies14010057 - 11 Jan 2026
Viewed by 185
Abstract
Traditional methods of asset management in electric power systems rely upon fixed schedules and reactive measurements, leading to challenges in the transparent prioritization of maintenance under evolving operating conditions and incomplete data. In this paper, we introduce a new, fully integrated artificial intelligence [...] Read more.
Traditional methods of asset management in electric power systems rely upon fixed schedules and reactive measurements, leading to challenges in the transparent prioritization of maintenance under evolving operating conditions and incomplete data. In this paper, we introduce a new, fully integrated artificial intelligence (AI)-driven approach for enhancing the resilience and reliability of open-source asset management tools to support improved performance and decisions in electric power system operations. This methodology addresses and overcomes several significant challenges, including data heterogeneity, algorithmic limitations, and inflexible decision-making, through a three-module workflow. The data fidelity module provides a domain-aware pipeline for identifying structural (missing) values from explicit missingness using sophisticated imputation methods, including Multiple Imputation Chain Equations (MICE) and Generative Adversarial Network (GAN)-based hybrids. The characterization module employs seven complementary weighting strategies, including PCA, Autoencoder, GA-based optimization, SHAP, Decision-Tree Importance, and Entropy Weighting, to achieve objective feature weight assignment, thereby eliminating the need for subjective manual rules. The optimization module enhanced the action space through multi-objective optimization, balancing reliability maximization and cost minimization. A synthetic dataset of 100 power transformers was used to validate that the MICE achieved better imputation than other methods. The optimized weighting framework successfully categorizes Health Index values into five condition levels, while the multi-objective maintenance policy optimization generates decisions that align with real-world asset management practices. The proposed framework provides the Transmission and Distribution System Operators (TSOs/DSOs) with an adaptable, industry-oriented decision-support workflow system for enhancing reliability, optimizing maintenance expenses, and improving asset management policies for critical power infrastructure. Full article
(This article belongs to the Special Issue AI for Smart Engineering Systems)
Show Figures

Figure 1

27 pages, 3924 KB  
Article
Research and Optimization of Soil Major Nutrient Prediction Models Based on Electronic Nose and Improved Extreme Learning Machine
by He Liu, Yuhang Cao, Haoyu Zhao, Jiamu Wang, Changlin Li and Dongyan Huang
Agriculture 2026, 16(2), 174; https://doi.org/10.3390/agriculture16020174 - 9 Jan 2026
Viewed by 196
Abstract
Keeping the levels of soil major nutrients (total nitrogen, TN; available phosphorous, AP; and available potassium, AK) in optimum condition is important to achieve the goals of precision agriculture systems. To address the issues of slow speed and low accuracy in soil nutrient [...] Read more.
Keeping the levels of soil major nutrients (total nitrogen, TN; available phosphorous, AP; and available potassium, AK) in optimum condition is important to achieve the goals of precision agriculture systems. To address the issues of slow speed and low accuracy in soil nutrient detection, this study developed a prediction model for soil major nutrients content based on an improved Extreme Learning Machine (ELM) algorithm. This model utilizes a soil major nutrients detection system integrating pyrolysis and artificial olfaction. First, the Bootstrap Aggregating (Bagging) ensemble strategy was introduced during the model integration phase to effectively reduce prediction variance through multi-submodel fusion. Second, Generative Adversarial Networks (GAN) were employed for sample augmentation, enhancing the diversity and representativeness of the dataset. Subsequently, a multi-scale convolutional and Efficient Lightweight Attention Network (ELA-Net) was embedded in the feature mapping layer to strengthen the representation capability of soil gas features. Finally, adaptive hyperparameter tuning was achieved using the Adaptive Chaotic Bald Eagle Optimization Algorithm (ACBOA) to enhance the model’s generalization capability. Results demonstrate that this model achieves varying degrees of performance improvement in predicting total nitrogen (R2 = 0.894), available phosphorus (R2 = 0.728), and available potassium (R2 = 0.706). Overall prediction accuracy surpasses traditional models by 8–12%, with significant reductions in both RMSE and MAE. These results demonstrate that the method can rapidly, accurately, and non-destructively estimate key soil nutrients, providing theoretical guidance and practical support for field fertilization, soil fertility assessment, and on-site decision-making in precision agriculture. Full article
(This article belongs to the Section Agricultural Soils)
Show Figures

Figure 1

25 pages, 14310 KB  
Article
Mouse Data Protection in Image-Based User Authentication Using Two-Dimensional Generative Adversarial Networks: Based on a WM_INPUT Message Approach
by Jinwook Kim and Kyungroul Lee
Electronics 2026, 15(2), 292; https://doi.org/10.3390/electronics15020292 - 9 Jan 2026
Viewed by 166
Abstract
With the rapid evolution of computing technologies and the increased proliferation of online services, secure remote user authentication methods have become essential. Among these methods, password-based authentication remains dominant due to its straightforward implementation and ease of use. Nevertheless, password-based systems are particularly [...] Read more.
With the rapid evolution of computing technologies and the increased proliferation of online services, secure remote user authentication methods have become essential. Among these methods, password-based authentication remains dominant due to its straightforward implementation and ease of use. Nevertheless, password-based systems are particularly prone to credential theft from keylogging attacks, making user passwords easily compromised. To address these risks, image-based authentication methods were developed, allowing users to enter passwords through mouse clicks rather than keyboard input, thereby reducing vulnerabilities associated with conventional password entry. However, subsequent studies have shown that mouse movement and click information can still be obtained using APIs such as the GetCursorPos() function or WM_INPUT message, thus undermining the intended security benefits of image-based authentication. In response, various defense strategies have sought to inject artificial or random mouse data through functions such as SetCursorPos() or by utilizing the WM_INPUT message, in an effort to disguise authentic user input. Despite these defenses, recent machine learning-based attacks have demonstrated that such naïve bogus input can be distinguished from legitimate mouse data with up to 99% classification accuracy, resulting in substantial exposure of actual user actions. To address this, a technique leveraging Generative Adversarial Networks (GAN) was introduced to produce artificial mouse data closely mimicking genuine user input, which has been shown to reduce the attack success rate by roughly 37%, offering enhanced protection for mouse-driven authentication systems. This article seeks to advance GAN-based mouse data protection by integrating multiple adversarial generative models and conducting a comprehensive evaluation of their effectiveness with respect to data processing techniques, feature selection, generation intervals, and model-specific performance differences. Our experimental findings reveal that the enhanced approach reduces attack success rates by up to 48%, marking an 11% performance gain over previous mouse data protection approaches, and providing stronger empirical support that our method offers superior protection for user authentication data compared to prior techniques. Full article
Show Figures

Figure 1

23 pages, 3855 KB  
Article
Visual-to-Tactile Cross-Modal Generation Using a Class-Conditional GAN with Multi-Scale Discriminator and Hybrid Loss
by Nikolay Neshov, Krasimir Tonchev, Agata Manolova, Radostina Petkova and Ivaylo Bozhilov
Sensors 2026, 26(2), 426; https://doi.org/10.3390/s26020426 - 9 Jan 2026
Viewed by 224
Abstract
Understanding surface textures through visual cues is crucial for applications in haptic rendering and virtual reality. However, accurately translating visual information into tactile feedback remains a challenging problem. To address this challenge, this paper presents a class-conditional Generative Adversarial Network (cGAN) for cross-modal [...] Read more.
Understanding surface textures through visual cues is crucial for applications in haptic rendering and virtual reality. However, accurately translating visual information into tactile feedback remains a challenging problem. To address this challenge, this paper presents a class-conditional Generative Adversarial Network (cGAN) for cross-modal translation from texture images to vibrotactile spectrograms, using samples from the LMT-108 dataset. The generator is adapted from pix2pix and enhanced with Conditional Batch Normalization (CBN) at the bottleneck to incorporate texture class semantics. A dedicated label predictor, based on a DenseNet-201 and trained separately prior to cGAN training, provides the conditioning label. The discriminator is derived from pix2pixHD and uses a multi-scale architecture with three discriminators, each comprising three downsampling layers. A grid search over multi-scale discriminator configurations shows that this setup yields optimal perceptual similarity measured by Learned Perceptual Image Patch Similarity (LPIPS). The generator is trained using a hybrid loss that combines adversarial, L1, and feature matching losses derived from intermediate discriminator features, while the discriminators are trained using standard adversarial loss. Quantitative evaluation with LPIPS and Fréchet Inception Distance (FID) confirms superior similarity to real spectrograms. GradCAM visualizations highlight the benefit of class conditioning. The proposed model outperforms pix2pix, pix2pixHD, Residue-Fusion GAN, and several ablated versions. The generated spectrograms can be converted into vibrotactile signals using the Griffin–Lim algorithm, enabling applications in haptic feedback and virtual material simulation. Full article
(This article belongs to the Special Issue Intelligent Sensing and Artificial Intelligence for Image Processing)
Show Figures

Figure 1

20 pages, 2021 KB  
Article
Noise-Conditioned Denoising Autoencoder with Temporal Attention for Bearing RUL Prediction
by Zhongtian Jin, Chong Chen, Aris Syntetos and Ying Liu
Machines 2026, 14(1), 75; https://doi.org/10.3390/machines14010075 - 8 Jan 2026
Viewed by 219
Abstract
Bearings are important elements of mechanical systems and the correct forecasting of their remaining useful life (RUL) is key to successful predictive maintenance. Nevertheless, noise interference during different operating conditions is also a significant problem in predicting their RUL. Existing denoising-based RUL prediction [...] Read more.
Bearings are important elements of mechanical systems and the correct forecasting of their remaining useful life (RUL) is key to successful predictive maintenance. Nevertheless, noise interference during different operating conditions is also a significant problem in predicting their RUL. Existing denoising-based RUL prediction models often show degraded performance when exposed to heterogeneous and non-stationary noise, resulting in unstable feature extraction and reduced generalisation. To address the challenge of heterogeneous and non-stationary noise in bearing RUL prediction, this study proposes a hybrid framework that combines a noise-conditioned convolutional denoising autoencoder (NC-CDAE) and a temporal attention transformer (TAT). The NC-CDAE adaptively suppresses diverse noise types through conditional modulation, while the TAT captures long-term temporal dependencies to enhance degradation trend learning. This synergistic design improves both the noise robustness and temporal modelling capability of the system. To further validate the model under varying conditions, synthetic datasets with different noise intensities were generated using a conditional generative adversarial network (cGAN). Comprehensive experiments show that the proposed NC-CDAE + TAT framework achieves lower and more stable errors than state-of-the-art methods, reducing RMSE by up to 23.6% and MAE by 18.2% on average and maintaining consistent performance (an RMSE between 0.155 and 0.194) across diverse conditions. Full article
Show Figures

Figure 1

15 pages, 2072 KB  
Article
A Ceramic Rare Defect Amplification Method Based on TC-CycleGAN
by Zhiqiang Zeng, Changying Dang, Zebing Ma, Jiansu Li and Zhonghua Li
Sensors 2026, 26(2), 395; https://doi.org/10.3390/s26020395 - 7 Jan 2026
Viewed by 252
Abstract
The ceramic defect detection technology based on deep learning suffers from the problems of scarce rare defect samples and class imbalance. However, the current deep generative image augmentation techniques are limited when applied to the task of augmenting rare ceramic defects due to [...] Read more.
The ceramic defect detection technology based on deep learning suffers from the problems of scarce rare defect samples and class imbalance. However, the current deep generative image augmentation techniques are limited when applied to the task of augmenting rare ceramic defects due to issues such as uneven image brightness and insufficient features of small-sized defects, resulting in poor image quality and limited improvement in detection results. This paper proposes a ceramic rare defect image augmentation method based on TC-CycleGAN. TC-CycleGAN is based on the CycleGAN framework and optimizes the generator and discriminator structures to make them more suitable for ceramic defect features, thereby improving the quality of generated images. The generator is TC-UNet, which introduces the scSE and DehazeFormer modules on the basis of UNet, effectively enhancing the model’s ability to learn the subtle defect features on the ceramic surface; the discriminator is the TC-PatchGAN architecture, which replaces the original BatchNorm module with the ContraNorm module, effectively increasing the discriminator’s sensitivity to the representation of tiny ceramic defect features and enhancing the diversity of generated images. The image quality assessment experiments show that the method proposed in this paper significantly improves the quality of generated defective images. For the concave type images, the FID and KID values have decreased by 49% and 73%, respectively, while for the smoke stains type images, the FID and KID values have decreased by 57% and 63% respectively. The further defect detection experiments results show that when using the data set expanded by the method in this paper for training, the recognition accuracy of the detection model for rare defects has significantly improved. The detection accuracy of the concave and smoke stains types of defects has increased by 1.2% and 3.9% respectively. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

41 pages, 25791 KB  
Article
TGDHTL: Hyperspectral Image Classification via Transformer–Graph Convolutional Network–Diffusion with Hybrid Domain Adaptation
by Zarrin Mahdavipour, Nashwan Alromema, Abdolraheem Khader, Ghulam Farooque, Ali Ahmed and Mohamed A. Damos
Remote Sens. 2026, 18(2), 189; https://doi.org/10.3390/rs18020189 - 6 Jan 2026
Viewed by 393
Abstract
Hyperspectral image (HSI) classification is pivotal for remote sensing applications, including environmental monitoring, precision agriculture, and urban land-use analysis. However, its accuracy is often limited by scarce labeled data, class imbalance, and domain discrepancies between standard RGB and HSI imagery. Although recent deep [...] Read more.
Hyperspectral image (HSI) classification is pivotal for remote sensing applications, including environmental monitoring, precision agriculture, and urban land-use analysis. However, its accuracy is often limited by scarce labeled data, class imbalance, and domain discrepancies between standard RGB and HSI imagery. Although recent deep learning approaches, such as 3D convolutional neural networks (3D-CNNs), transformers, and generative adversarial networks (GANs), show promise, they struggle with spectral fidelity, computational efficiency, and cross-domain adaptation in label-scarce scenarios. To address these challenges, we propose the Transformer–Graph Convolutional Network–Diffusion with Hybrid Domain Adaptation (TGDHTL) framework. This framework integrates domain-adaptive alignment of RGB and HSI data, efficient synthetic data generation, and multi-scale spectral–spatial modeling. Specifically, a lightweight transformer, guided by Maximum Mean Discrepancy (MMD) loss, aligns feature distributions across domains. A class-conditional diffusion model generates high-quality samples for underrepresented classes in only 15 inference steps, reducing labeled data needs by approximately 25% and computational costs by up to 80% compared to traditional 1000-step diffusion models. Additionally, a Multi-Scale Stripe Attention (MSSA) mechanism, combined with a Graph Convolutional Network (GCN), enhances pixel-level spatial coherence. Evaluated on six benchmark datasets including HJ-1A and WHU-OHS, TGDHTL consistently achieves high overall accuracy (e.g., 97.89% on University of Pavia) with just 11.9 GFLOPs, surpassing state-of-the-art methods. This framework provides a scalable, data-efficient solution for HSI classification under domain shifts and resource constraints. Full article
(This article belongs to the Section Remote Sensing Image Processing)
Show Figures

Graphical abstract

14 pages, 1392 KB  
Article
AirSpeech: Lightweight Speech Synthesis Framework for Home Intelligent Space Service Robots
by Xiugong Qin, Fenghu Pan, Jing Gao, Shilong Huang, Yichen Sun and Xiao Zhong
Electronics 2026, 15(1), 239; https://doi.org/10.3390/electronics15010239 - 5 Jan 2026
Viewed by 267
Abstract
Text-to-Speech (TTS) methods typically employ a sequential approach with an Acoustic Model (AM) and a vocoder, using a Mel spectrogram as an intermediate representation. However, in home environments, TTS systems often struggle with issues such as inadequate robustness against environmental noise and limited [...] Read more.
Text-to-Speech (TTS) methods typically employ a sequential approach with an Acoustic Model (AM) and a vocoder, using a Mel spectrogram as an intermediate representation. However, in home environments, TTS systems often struggle with issues such as inadequate robustness against environmental noise and limited adaptability to diverse speaker characteristics. The quality of the Mel spectrogram directly affects the performance of TTS systems, yet existing methods overlook the potential of enhancing Mel spectrogram quality through more comprehensive speech features. To address the complex acoustic characteristics of home environments, this paper introduces AirSpeech, a post-processing model for Mel-spectrogram synthesis. We adopt a Generative Adversarial Network (GAN) to improve the accuracy of Mel spectrogram prediction and enhance the expressiveness of synthesized speech. By incorporating additional conditioning extracted from synthesized audio using specified speech feature parameters, our method significantly enhances the expressiveness and emotional adaptability of synthesized speech in home environments. Furthermore, we propose a global normalization strategy to stabilize the GAN training process. Through extensive evaluations, we demonstrate that the proposed method significantly improves the signal quality and naturalness of synthesized speech, providing a more user-friendly speech interaction solution for smart home applications. Full article
Show Figures

Figure 1

13 pages, 2618 KB  
Article
Multi-Domain Perception Transformer for Generalized Forgery Image Detection
by Qiaoyue Man, Seok-Jeong Gee and Young-Im Cho
Appl. Sci. 2026, 16(1), 533; https://doi.org/10.3390/app16010533 - 5 Jan 2026
Viewed by 182
Abstract
With the rapid advancement of generative AI (AIGC) technology, synthetic images are increasingly approaching real pictures in terms of resolution and semantic consistency. Traditional detection methods face numerous challenges, such as insufficient cross-modal generalization capabilities and difficulty in identifying hidden generative traces. Existing [...] Read more.
With the rapid advancement of generative AI (AIGC) technology, synthetic images are increasingly approaching real pictures in terms of resolution and semantic consistency. Traditional detection methods face numerous challenges, such as insufficient cross-modal generalization capabilities and difficulty in identifying hidden generative traces. Existing solutions primarily design feature extractors for single generative models, struggling to address the complexity of multimodal forgeries. Therefore, we propose a multi-domain feature fusion Transformer network that integrates spatial, frequency, and wavelet transform features and introduce a cross-domain feature fusion module (CDAF) to detect subtle forgery traces in deepfake images. This model demonstrates superior detection performance on current forged images generated by generative adversarial networks (GANs) and diffusion models while exhibiting enhanced robustness. Full article
Show Figures

Figure 1

21 pages, 2696 KB  
Article
Self-Supervised Contrastive Learning and GAN-Based Denoising for High-Fidelity HumanNeRF Images
by Qian Xu, Wenxuan Xu, Meng Huang, Weiqing Yan and Yang Guo
Sensors 2026, 26(1), 249; https://doi.org/10.3390/s26010249 - 31 Dec 2025
Viewed by 353
Abstract
To address the prevalent noise issue in images generated by HumanNeRF, this paper proposes an image denoising method that combines self-supervised contrastive learning and Generative Adversarial Networks (GANs). While HumanNeRF excels in realistic 3D human reconstruction tasks, its generated images often suffer from [...] Read more.
To address the prevalent noise issue in images generated by HumanNeRF, this paper proposes an image denoising method that combines self-supervised contrastive learning and Generative Adversarial Networks (GANs). While HumanNeRF excels in realistic 3D human reconstruction tasks, its generated images often suffer from noise and detail loss due to incomplete training data and sampling noise during the rendering process. To solve this problem, our method first utilizes a self-supervised contrastive learning strategy to construct positive and negative sample pairs, enabling the network to effectively distinguish between noise and human detail features without external labels. Secondly, it introduces a Generative Adversarial Network, where the adversarial training between the generator and discriminator further enhances the detail representation and overall realism of the images. Experimental results demonstrate that the proposed method can effectively remove noise from HumanNeRF images while significantly improving detail fidelity, ultimately yielding higher-quality human images and providing crucial support for subsequent 3D human reconstruction and realistic rendering. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

Back to TopTop