MDPI - Publisher of Open Access Journals

25 pages, 3342 KB

Open AccessArticle

A Novel Spectrum Recognition Model of Spatial Electromagnetic Anomalies Based on VAE-GANGP

by Bin Liu, Jiansheng Bai and Qiongyi Li

Electronics 2026, 15(5), 1062; https://doi.org/10.3390/electronics15051062 - 3 Mar 2026

Viewed by 553

To address the issues of sample imbalance, unstable generation quality, and insufficient feature extraction in spectrum anomaly signal detection under complex electromagnetic environments, this paper proposes a VAE-GANGP identification model that integrates a Variational Autoencoder (VAE) with a Gradient Penalty-based Generative Adversarial Network [...] Read more.

To address the issues of sample imbalance, unstable generation quality, and insufficient feature extraction in spectrum anomaly signal detection under complex electromagnetic environments, this paper proposes a VAE-GANGP identification model that integrates a Variational Autoencoder (VAE) with a Gradient Penalty-based Generative Adversarial Network (GAN-GP). First, the VAE is employed to encode the original spectrum, generating structured latent features that follow a standard normal distribution. This replaces the random noise input in traditional GANs, significantly enhancing the semantic consistency of generated samples and training stability. Second, an adversarial training mechanism based on Wasserstein distance with gradient penalty (WGAN-GP) is introduced, effectively mitigating mode collapse and gradient vanishing, thereby improving the model’s capability to fit complex signal distributions. Furthermore, a multi-objective optimization function combining reconstruction error and adversarial loss is constructed, establishing an end-to-end integrated framework for feature learning, signal reconstruction, and anomaly discrimination. Experiments are conducted using a synthetic dataset comprising various modulation types and simulated environments with different signal-to-noise ratios for systematic validation. The results demonstrate that the spectrum data generated by VAE-GANGP closely matches the distribution of real signals. Under AWGN-dominated synthetic test conditions, the model achieves an anomaly detection accuracy of 98.1%. When evaluated under more realistic channel impairments (phase noise, multipath, impulsive interference), the model maintains competitive performance, outperforming existing methods and demonstrating promising potential for practical electromagnetic spectrum monitoring. Its performance significantly surpasses traditional detection methods and single deep learning models, providing a highly reliable and adaptive solution for spatial electromagnetic spectrum anomaly detection. Full article

(This article belongs to the Section Artificial Intelligence)

► Show Figures

Figure 1

21 pages, 1440 KB

Open AccessArticle

VAE-GAN-Guided Cross-Class Generation: A Class Imbalance Data Augmentation Method for Network Intrusion Detection

by Fuyuan Kang, Tao Feng and Jiaqi Lin

Electronics 2025, 14(11), 2103; https://doi.org/10.3390/electronics14112103 - 22 May 2025

Cited by 6 | Viewed by 3597

Abstract

Network intrusion datasets often face class imbalance issues in intrusion detection tasks, where the number of majority class samples is much higher than minority class samples. Current solutions face notable limitations: traditional normalization weakens the multimodal distribution of continuous features, while mainstream generative [...] Read more.

Network intrusion datasets often face class imbalance issues in intrusion detection tasks, where the number of majority class samples is much higher than minority class samples. Current solutions face notable limitations: traditional normalization weakens the multimodal distribution of continuous features, while mainstream generative models focus excessively on minority class mining while neglecting majority class information. To address these issues, we propose M2M-VAEGAN, which innovatively incorporates a Variational Gaussian Mixture (VGM) model to preserve multimodal characteristics of continuous features. We design a transfer learning framework, pre-training on majority classes to capture general attack patterns, followed by fine-tuning with balanced batches of majority and minority samples to prevent catastrophic forgetting. Additionally, we enhance the VAEGAN architecture with an auxiliary classifier to strengthen conditional information learning. On the NSL-KDD and CIC-IDS2017 datasets, M2M-VAEGAN outperforms methods such as SMOTE, CTGAN, and CTABGAN, achieving a 1.25% to 6.42% improvement in minority class recall. These results demonstrate the effectiveness of the proposed approach. Full article

(This article belongs to the Special Issue Recognition of Patterns and Trends in Multimedia Datasets)

► Show Figures

Figure 1

19 pages, 34572 KB

Open AccessArticle

Suppression of Multiple Reflection Interference Signals in GPR Images Caused by Rebar Using VAE-GAN

by Chuan Li, Qibing Ma, Yawei Wang, Xi Yang, Hao Liu and Lulu Wang

Appl. Sci. 2025, 15(7), 3728; https://doi.org/10.3390/app15073728 - 28 Mar 2025

Cited by 7 | Viewed by 1867

Abstract

Due to the rebars layer’s shielding effect on Ground Penetrating Radar (GPR) waves, the hyperbolic clutter generated by the rebars interferes with the echoes from void beneath them. The overlapping waveforms of both signals result in attenuation and distortion of the void signals, [...] Read more.

Due to the rebars layer’s shielding effect on Ground Penetrating Radar (GPR) waves, the hyperbolic clutter generated by the rebars interferes with the echoes from void beneath them. The overlapping waveforms of both signals result in attenuation and distortion of the void signals, making it difficult to identify void defects under the rebar. This study proposes an unsupervised generative network model based on Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs). Through a shared latent space, mapping is achieved between two image domains, effectively eliminating the multiple reflection interference signals caused by the rebar while accurately reconstructing the void defects, generating GPR B-Scan images without rebar clutter. Additionally, the channel and spatial attention module (CSA) is implemented into the model to help the network to better focus on the essential information in GPR images. The proposed model was validated through ablation and comparative experiments using synthetic data. Finally, real GPR data from the Husa Tunnel were used to verify the model’s effectiveness in practical engineering applications. The results showed that this model is highly effective; it improves the visibility of void defects signals, thereby enhancing the interpretability of GPR data for tunnel lining inspections. Full article

► Show Figures

Figure 1

14 pages, 743 KB

Open AccessArticle

AD-VAE: Adversarial Disentangling Variational Autoencoder

by Adson Silva and Ricardo Farias

Sensors 2025, 25(5), 1574; https://doi.org/10.3390/s25051574 - 4 Mar 2025

Cited by 8 | Viewed by 2568

Abstract

Face recognition (FR) is a less intrusive biometrics technology with various applications, such as security, surveillance, and access control systems. FR remains challenging, especially when there is only a single image per person as a gallery dataset and when dealing with variations like [...] Read more.

Face recognition (FR) is a less intrusive biometrics technology with various applications, such as security, surveillance, and access control systems. FR remains challenging, especially when there is only a single image per person as a gallery dataset and when dealing with variations like pose, illumination, and occlusion. Deep learning techniques have shown promising results in recent years using VAE and GAN, with approaches such as patch-VAE, VAE-GAN for 3D Indoor Scene Synthesis, and hybrid VAE-GAN models. However, in Single Sample Per Person Face Recognition (SSPP FR), the challenge of learning robust and discriminative features that preserve the subject’s identity persists. To address these issues, we propose a novel framework called AD-VAE, specifically for SSPP FR, using a combination of variational autoencoder (VAE) and Generative Adversarial Network (GAN) techniques. The proposed AD-VAE framework is designed to learn how to build representative identity-preserving prototypes from both controlled and wild datasets, effectively handling variations like pose, illumination, and occlusion. The method uses four networks: an encoder and decoder similar to VAE, a generator that receives the encoder output plus noise to generate an identity-preserving prototype, and a discriminator that operates as a multi-task network. AD-VAE outperforms all tested state-of-the-art face recognition techniques, demonstrating its robustness. The proposed framework achieves superior results on four controlled benchmark datasets—AR, E-YaleB, CAS-PEAL, and FERET—with recognition rates of 84.9%, 94.6%, 94.5%, and 96.0%, respectively, and achieves remarkable performance on the uncontrolled LFW dataset, with a recognition rate of 99.6%. The AD-VAE framework shows promising potential for future research and real-world applications. Full article

(This article belongs to the Topic Applications in Image Analysis and Pattern Recognition)

► Show Figures

Figure 1

23 pages, 6852 KB

Open AccessArticle

Reconstruction of Random Structures Based on Generative Adversarial Networks: Statistical Variability of Mechanical and Morphological Properties

by Mikhail Tashkinov, Yulia Pirogova, Evgeniy Kononov, Aleksandr Shalimov and Vadim V. Silberschmidt

Mathematics 2025, 13(1), 7; https://doi.org/10.3390/math13010007 - 24 Dec 2024

Cited by 5 | Viewed by 1605

Abstract

Generative adversarial neural networks with a variational autoencoder (VAE-GANs) are actively used in the field of materials design. The synthesis of random structures with nonrepeated geometry and predetermined mechanical properties is important for solving various practical problems. Geometric parameters of such artificially generated [...] Read more.

Generative adversarial neural networks with a variational autoencoder (VAE-GANs) are actively used in the field of materials design. The synthesis of random structures with nonrepeated geometry and predetermined mechanical properties is important for solving various practical problems. Geometric parameters of such artificially generated random structures can vary within certain limits compared to the training dataset, causing unpredicted fluctuations in their resulting mechanical response. This study investigates the statistical variability of mechanical and morphological characteristics of random 3D models reconstructed from 2D images using a VAE-GAN neural network. A combined multitool method employing different mathematical and statistical instruments for comparison of the reconstructed models with their corresponding originals is proposed. It includes the analysis of statistical distributions of elastic properties, morphometric parameters, and stress values. The neural network was trained on two datasets, containing models created based on Gaussian random fields. Statistical fluctuations of the mechanical and morphological parameters of the reconstructed models are analyzed. The deviation of the effective elastic modulus of the reconstructed models from that of the original ones was less than 5.7% on average. The difference between the median values of ligament thickness and distance between ligaments ranged from 3.6 to 6.5% and 2.6 to 5.2%, respectively. The median value of the surface area of the reconstructed geometries was 4.6–8.1% higher compared to the original models. It is thus shown that mechanical properties of the NN-generated structures retain the statistical variability of the corresponding originals, while the variability of the morphology is highly affected by the training set and does not depend on the configuration of the input 2D image. Full article

(This article belongs to the Special Issue Recent Advances in Machine Learning Methods for Mechanical Engineering)

► Show Figures

Figure 1

12 pages, 1581 KB

Open AccessArticle

Airfoil Shape Generation and Feature Extraction Using the Conditional VAE-WGAN-gp

by Kazuo Yonekura, Yuki Tomori and Katsuyuki Suzuki

AI 2024, 5(4), 2092-2103; https://doi.org/10.3390/ai5040102 - 28 Oct 2024

Cited by 14 | Viewed by 3523

Abstract

A machine learning method was applied to solve an inverse airfoil design problem. A conditional VAE-WGAN-gp model, which couples the conditional variational autoencoder (VAE) and Wasserstein generative adversarial network with gradient penalty (WGAN-gp), is proposed for an airfoil generation method, and then, it [...] Read more.

A machine learning method was applied to solve an inverse airfoil design problem. A conditional VAE-WGAN-gp model, which couples the conditional variational autoencoder (VAE) and Wasserstein generative adversarial network with gradient penalty (WGAN-gp), is proposed for an airfoil generation method, and then, it is compared with the WGAN-gp and VAE models. The VAEGAN model couples the VAE and GAN models, which enables feature extraction in the GAN models. In airfoil generation tasks, to generate airfoil shapes that satisfy lift coefficient requirements, it is known that VAE outperforms WGAN-gp with respect to the accuracy of the reproduction of the lift coefficient, whereas GAN outperforms VAE with respect to the smoothness and variations of generated shapes. In this study, VAE-WGAN-gp demonstrated a good performance in all three aspects. Latent distribution was also studied to compare the feature extraction ability of the proposed method. Full article

► Show Figures

Figure 1

29 pages, 6007 KB

Open AccessArticle

VAE-WACGAN: An Improved Data Augmentation Method Based on VAEGAN for Intrusion Detection

by Wuxin Tian, Yanping Shen, Na Guo, Jing Yuan and Yanqing Yang

Sensors 2024, 24(18), 6035; https://doi.org/10.3390/s24186035 - 18 Sep 2024

Cited by 25 | Viewed by 4621

Abstract

To address the class imbalance issue in network intrusion detection, which degrades performance of intrusion detection models, this paper proposes a novel generative model called VAE-WACGAN to generate minority class samples and balance the dataset. This model extends the Variational Autoencoder Generative Adversarial [...] Read more.

To address the class imbalance issue in network intrusion detection, which degrades performance of intrusion detection models, this paper proposes a novel generative model called VAE-WACGAN to generate minority class samples and balance the dataset. This model extends the Variational Autoencoder Generative Adversarial Network (VAEGAN) by integrating key features from the Auxiliary Classifier Generative Adversarial Network (ACGAN) and the Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP). These enhancements significantly improve both the quality of generated samples and the stability of the training process. By utilizing the VAE-WACGAN model to oversample anomalous data, more realistic synthetic anomalies that closely mirror the actual network traffic distribution can be generated. This approach effectively balances the network traffic dataset and enhances the overall performance of the intrusion detection model. Experimental validation was conducted using two widely utilized intrusion detection datasets, UNSW-NB15 and CIC-IDS2017. The results demonstrate that the VAE-WACGAN method effectively enhances the performance metrics of the intrusion detection model. Furthermore, the VAE-WACGAN-based intrusion detection approach surpasses several other advanced methods, underscoring its effectiveness in tackling network security challenges. Full article

(This article belongs to the Section Sensor Networks)

► Show Figures

Figure 1

25 pages, 18421 KB

Open AccessArticle

Prediction of Operational Noise Uncertainty in Automotive Micro-Motors Based on Multi-Branch Channel–Spatial Adaptive Weighting Strategy

by Hao Hu, Shiqi Deng, Wang Yan, Yanyong He and Yudong Wu

Electronics 2024, 13(13), 2553; https://doi.org/10.3390/electronics13132553 - 28 Jun 2024

Cited by 3 | Viewed by 1731

Abstract

The acoustic performance of automotive micro-motors directly impacts the comfort and driving experience of both drivers and passengers. However, various motor production and testing uncertainties can lead to noise fluctuations during operation. Thus, predicting the operational noise range of motors on the production [...] Read more.

The acoustic performance of automotive micro-motors directly impacts the comfort and driving experience of both drivers and passengers. However, various motor production and testing uncertainties can lead to noise fluctuations during operation. Thus, predicting the operational noise range of motors on the production line in advance becomes crucial for timely adjustments to production parameters and process optimization. This paper introduces a prediction model based on a Multi-Branch Channel–Spatial Adaptive Weighting Strategy (MCSAWS). The model includes a multi-branch feature extraction (MFE) network and a channel–spatial attention module (CSAM). It uses the vibration and noise data from micro-motors’ idle operations on the production line as input to efficiently predict the operational noise uncertainty interval of automotive micro-motors. The model employs the VAE-GAN approach for data augmentation (DA) and uses Gammatone filters to emphasize the noise at the commutation frequency of the motor. The model was compared with Convolutional Neural Networks (CNNs) and Multilayer Perceptrons (MLPs). Experimental results demonstrate that the MCSAWS method is superior to conventional methods in prediction accuracy and reliability, confirming the feasibility of the proposed approach. This research can help control noise uncertainty in micro-motors’ production and manufacturing processes in advance. Full article

(This article belongs to the Special Issue Applications of Artificial Intelligence in Mechanical Engineering)

► Show Figures

Figure 1

22 pages, 10012 KB

Open AccessArticle

Remaining Useful Life Prediction Method Enhanced by Data Augmentation and Similarity Fusion

by Huaqing Wang, Ye Li, Ye Jin, Shengkai Zhao, Changkun Han and Liuyang Song

Vibration 2024, 7(2), 560-581; https://doi.org/10.3390/vibration7020029 - 5 Jun 2024

Cited by 13 | Viewed by 3694

Abstract

Precise prediction of the remaining useful life (RUL) of rolling bearings is crucial for ensuring the smooth functioning of machinery and minimizing maintenance costs. The time-domain features can reflect the degenerative state of the bearings and reduce the impact of random noise present [...] Read more.

Precise prediction of the remaining useful life (RUL) of rolling bearings is crucial for ensuring the smooth functioning of machinery and minimizing maintenance costs. The time-domain features can reflect the degenerative state of the bearings and reduce the impact of random noise present in the original signal, which is often used for life prediction. However, obtaining ideal training data for RUL prediction is challenging. Thus, this paper presents a bearing RUL prediction method based on unsupervised learning sample augmentation, establishes a VAE-GAN model, and expands the time-domain features that are calculated based on the original vibration signals. By combining the advantages of VAE and GAN in data generation, the generated data can better represent the degradation state of the bearings. The original data and generated data are mixed to realize data augmentation. At the same time, the dynamic time warping (DTW) algorithm is introduced to measure the similarity of the dataset, establishing the mapping relationship between the training set and target sequence, thereby enhancing the prediction accuracy of supervised learning. Experiments employing the XJTU-SY rolling element bearing accelerated life test dataset, IMS dataset, and pantograph data indicate that the proposed method yields high accuracy in bearing RUL prediction. Full article

► Show Figures

Figure 1

23 pages, 3371 KB

Open AccessArticle

Zero-Shot SAR Target Recognition Based on a Conditional Generative Network with Category Features from Simulated Images

by Guo Chen, Siqian Zhang, Qishan He, Zhongzhen Sun, Xianghui Zhang and Lingjun Zhao

Remote Sens. 2024, 16(11), 1930; https://doi.org/10.3390/rs16111930 - 27 May 2024

Cited by 6 | Viewed by 2409

Abstract

SAR image target recognition relies heavily on a large number of annotated samples, making it difficult to classify the unseen class targets. Due to the lack of effective category auxiliary information, the current zero-shot target recognition methods for SAR images are limited to [...] Read more.

SAR image target recognition relies heavily on a large number of annotated samples, making it difficult to classify the unseen class targets. Due to the lack of effective category auxiliary information, the current zero-shot target recognition methods for SAR images are limited to inferring only one unseen class rather than classifying multiple unseen classes. To address this issue, a conditional generative network with the category features from the simulated images for zero-shot SAR target recognition is proposed in this paper. Firstly, the deep features are extracted from the simulated images and fused into the category features that characterize the entire class. Then, a conditional VAE-GAN network is constructed to generate the feature instances of the unseen classes. The high-level semantic information shared in the category features aids in generalizing the mapping learned from the seen classes to the unseen classes. Finally, the generated features of the unseen classes are used to train a classifier that can classify the real unseen images. The classification accuracies for the targets of the three unseen classes based on the proposed method can reach 99.80 ± 1.22% and 71.57 ± 2.28% with the SAMPLE dataset and the MSTAR dataset, respectively. The advantage and validity of the proposed architecture are indicated with a small number of the seen classes and a small amount of the training data. Furthermore, the proposed method can be extended to generalized zero-shot recognition. Full article

► Show Figures

Figure 1

15 pages, 1748 KB

Open AccessArticle

Leveraging Dual Variational Autoencoders and Generative Adversarial Networks for Enhanced Multimodal Interaction in Zero-Shot Learning

by Ning Li, Jie Chen, Nanxin Fu, Wenzhuo Xiao, Tianrun Ye, Chunming Gao and Ping Zhang

Electronics 2024, 13(3), 539; https://doi.org/10.3390/electronics13030539 - 29 Jan 2024

Cited by 6 | Viewed by 3363

Abstract

In the evolving field of taxonomic classification, and especially in Zero-shot Learning (ZSL), the challenge of accurately classifying entities unseen in training datasets remains a significant hurdle. Although the existing literature is rich in developments, it often falls short in two critical areas: [...] Read more.

In the evolving field of taxonomic classification, and especially in Zero-shot Learning (ZSL), the challenge of accurately classifying entities unseen in training datasets remains a significant hurdle. Although the existing literature is rich in developments, it often falls short in two critical areas: semantic consistency (ensuring classifications align with true meanings) and the effective handling of dataset diversity biases. These gaps have created a need for a more robust approach that can navigate both with greater efficacy. This paper introduces an innovative integration of transformer models with ariational autoencoders (VAEs) and generative adversarial networks (GANs), with the aim of addressing them within the ZSL framework. The choice of VAE-GAN is driven by their complementary strengths: VAEs are proficient in providing a richer representation of data patterns, and GANs are able to generate data that is diverse yet representative, thus mitigating biases from dataset diversity. Transformers are employed to further enhance semantic consistency, which is key because many existing models underperform. Through experiments have been conducted on benchmark ZSL datasets such as CUB, SUN, and Animals with Attributes 2 (AWA2), our approach is novel because it demonstrates significant improvements, not only in enhancing semantic and structural coherence, but also in effectively addressing dataset biases. This leads to a notable enhancement of the model’s ability to generalize visual categorization tasks beyond the training data, thus filling a critical gap in the current ZSL research landscape. Full article

(This article belongs to the Section Artificial Intelligence)

► Show Figures

Figure 1

21 pages, 4061 KB

Open AccessArticle

Improved Prediction of Aquatic Beetle Diversity in a Stagnant Pool by a One-Dimensional Convolutional Neural Network Using Variational Autoencoder Generative Adversarial Network-Generated Data

by Miao Hu, Shujiao Jiang, Fenglong Jia, Xiaomei Yang and Zhiqiang Li

Appl. Sci. 2023, 13(15), 8841; https://doi.org/10.3390/app13158841 - 31 Jul 2023

Cited by 1 | Viewed by 2105

Abstract

Building a reasonable model for predicting biodiversity using limited data is challenging. Expanding limited experimental data using a variational autoencoder generative adversarial network (VAEGAN) to improve biodiversity predictions for a region is a new strategy. Aquatic beetle diversity in a large >30-year-old artificial [...] Read more.

Building a reasonable model for predicting biodiversity using limited data is challenging. Expanding limited experimental data using a variational autoencoder generative adversarial network (VAEGAN) to improve biodiversity predictions for a region is a new strategy. Aquatic beetle diversity in a large >30-year-old artificial pool that had not had human interference in Nanshe Village (Dapeng Peninsula, Shenzhen City, Guangdong Province, China) was investigated. Eight ecological factors were considered. These were water temperature, salinity, pH, water depth, proportional area of aquatic plants, proportional area of submerged plants, water area, and water level. Field sampling was performed for 1 or 2 days in the middle or late part of each month for a year. A type D net was swept 10 times in the same direction in each ~1 m × ~1 m sample square, generating 132 datasets (experimental data). In total, 39 aquatic beetle species were collected, 19 of which were assigned to Hydrophilidae, 16 to Dytiscidae, 3 to Noteridae, and 1 to Gyrinidae. A one-dimensional convolutional neural network (1-D CNN) was used to assess and predict the grade of the number of individuals and the number of aquatic beetle species. The Bayesian-optimized 1-D CNN established using 112 experimental datasets as the training set and the other 20 datasets as validation and testing sets gave a 74.0% prediction accuracy for the grade of the number of individuals and a 70.0% prediction accuracy for the number of species. The impact of insufficient sample data on the model was assessed using a VAEGAN to expand the training set from 112 to 512 samples, and then the Bayesian-optimized 1-D CNN-based VAEGAN prediction model was re-established. This improved prediction accuracy for the grade of the number of individuals to 86.0% and for the number of species to 85.0%. The grade of the number of individuals’ prediction accuracy was 88.0% and the number of species’ prediction accuracy was 85.0% when the random effects of only obtaining a single individual of a species were excluded. The results indicated that the accuracy of the 1-D CNN in predicting the aquatic beetle species number and abundance from relevant environmental factors can be improved using a VAEGAN to expand the experimental data. Full article

► Show Figures

Figure 1

20 pages, 1187 KB

Open AccessArticle

3D Shape Generation via Variational Autoencoder with Signed Distance Function Relativistic Average Generative Adversarial Network

by Ebenezer Akinyemi Ajayi, Kian Ming Lim, Siew-Chin Chong and Chin Poo Lee

Appl. Sci. 2023, 13(10), 5925; https://doi.org/10.3390/app13105925 - 11 May 2023

Cited by 6 | Viewed by 7610

Abstract

3D shape generation is widely applied in various industries to create, visualize, and analyse complex data, designs, and simulations. Typically, 3D shape generation uses a large dataset of 3D shapes as the input. This paper proposes a variational autoencoder with a signed distance [...] Read more.

3D shape generation is widely applied in various industries to create, visualize, and analyse complex data, designs, and simulations. Typically, 3D shape generation uses a large dataset of 3D shapes as the input. This paper proposes a variational autoencoder with a signed distance function relativistic average generative adversarial network, referred to as 3D-VAE-SDFRaGAN, for 3D shape generation from 2D input images. Both the generative adversarial network (GAN) and variational autoencoder (VAE) algorithms are typical algorithms used to generate realistic 3D shapes. However, it is very challenging to train a stable 3D shape generation model using VAE-GAN. This paper proposes an efficient approach to stabilize the training process of VAE-GAN to generate high-quality 3D shapes. A 3D mesh-based shape is first generated using a 3D signed distance function representation by feeding a single 2D image into a 3D-VAE-SDFRaGAN network. The signed distance function is used to maintain inside–outside information in the implicit surface representation. In addition, a relativistic average discriminator loss function is employed as the training loss function. The polygon mesh surfaces are then produced via the marching cubes algorithm. The proposed 3D-VAE-SDFRaGAN is evaluated with the ShapeNet dataset. The experimental results indicate a notable enhancement in the qualitative performance, as evidenced by the visual comparison of the generated samples, as well as the quantitative performance evaluation using the chamfer distance metric. The proposed approach achieves an average chamfer distance score of 0.578, demonstrating superior performance compared to existing state-of-the-art models. Full article

► Show Figures

Figure 1

26 pages, 16666 KB

Open AccessArticle

Transfer Learning with Attributes for Improving the Landslide Spatial Prediction Performance in Sample-Scarce Area Based on Variational Autoencoder Generative Adversarial Network

by Mansheng Lin, Shuai Teng, Gongfa Chen and David Bassir

Land 2023, 12(3), 525; https://doi.org/10.3390/land12030525 - 21 Feb 2023

Cited by 12 | Viewed by 3928

Abstract

Owing to the complexity of obtaining the landslide inventory data, it is a challenge to establish a landslide spatial prediction model with limited labeled samples. This paper proposed a novel strategy, namely transfer learning with attributes (TLAs), to make good use of existing [...] Read more.

Owing to the complexity of obtaining the landslide inventory data, it is a challenge to establish a landslide spatial prediction model with limited labeled samples. This paper proposed a novel strategy, namely transfer learning with attributes (TLAs), to make good use of existing landslide inventory data, a strategy that is based on a variational autoencoder of a generative adversarial network (VAEGAN) for improving the landslide spatial prediction performance in sample-scarce areas. Different from transfer learning (TL), TLAs are pretraining the model with the data reconstructed by VAEGAN, so that the models learn in advance the landslide attributes of sample-scarce areas. Accordingly, a database containing a total of 986 landslides in three study areas with 14 landslide-influencing factors was established, and each of the three models, i.e., convolutional neural networks (CNNs), bidirectional long short-term memory (BiLSTM) and gated recurrent units (GRUs), was respectively selected as the feature extractor of the VAEGAN to reconstruct the data with attributes and the prediction model to generate the landslide susceptibility maps to investigate and validate the proposed TLA strategy. The experimental results showed that the TLA strategy increased the mean value of evaluators, such as the area under the receiver-operating characteristic (AUROC), F1-score, precision, recall and accuracy by about 2–7% compared with TL, results that indicated that the generated data have the attribute of specific study areas and the effectiveness of TLA strategy in sample-scare areas. Full article

(This article belongs to the Special Issue Application of Artificial Intelligence and Modelling Tools in Landscape Archaeology and Geo-Design)

► Show Figures

Figure 1

21 pages, 2349 KB

Open AccessReview

Improvement of Generative Adversarial Network and Its Application in Bearing Fault Diagnosis: A Review

by Diwang Ruan, Xuran Chen, Clemens Gühmann and Jianping Yan

Lubricants 2023, 11(2), 74; https://doi.org/10.3390/lubricants11020074 - 10 Feb 2023

Cited by 62 | Viewed by 7138

Abstract

A small sample size and unbalanced sample distribution are two main problems when data-driven methods are applied for fault diagnosis in practical engineering. Technically, sample generation and data augmentation have proven to be effective methods to solve this problem. The generative adversarial network [...] Read more.

A small sample size and unbalanced sample distribution are two main problems when data-driven methods are applied for fault diagnosis in practical engineering. Technically, sample generation and data augmentation have proven to be effective methods to solve this problem. The generative adversarial network (GAN) has been widely used in recent years as a representative generative model. Besides the general GAN, many variants have recently been reported to address its inherent problems such as mode collapse and slow convergence. In addition, many new techniques are being proposed to increase the sample generation quality. Therefore, a systematic review of GAN, especially its application in fault diagnosis, is necessary. In this paper, the theory and structure of GAN and variants such as ACGAN, VAEGAN, DCGAN, WGAN, et al. are presented first. Then, the literature on GANs is mainly categorized and analyzed from two aspects: improvements in GAN’s structure and loss function. Specifically, the improvements in the structure are classified into three types: information-based, input-based, and layer-based. Regarding the modification of the loss function, it is sorted into two aspects: metric-based and regularization-based. Afterwards, the evaluation metrics of the generated samples are summarized and compared. Finally, the typical applications of GAN in the bearing fault diagnosis field are listed, and the challenges for further research are also discussed. Full article

(This article belongs to the Special Issue Recent Advances in Machine Learning in Tribology)

► Show Figures

Figure 1

Search Results (19)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (19)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI