Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (19)

Search Parameters:
Keywords = VAEGAN

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
25 pages, 3342 KB  
Article
A Novel Spectrum Recognition Model of Spatial Electromagnetic Anomalies Based on VAE-GANGP
by Bin Liu, Jiansheng Bai and Qiongyi Li
Electronics 2026, 15(5), 1062; https://doi.org/10.3390/electronics15051062 - 3 Mar 2026
Viewed by 553
Abstract
To address the issues of sample imbalance, unstable generation quality, and insufficient feature extraction in spectrum anomaly signal detection under complex electromagnetic environments, this paper proposes a VAE-GANGP identification model that integrates a Variational Autoencoder (VAE) with a Gradient Penalty-based Generative Adversarial Network [...] Read more.
To address the issues of sample imbalance, unstable generation quality, and insufficient feature extraction in spectrum anomaly signal detection under complex electromagnetic environments, this paper proposes a VAE-GANGP identification model that integrates a Variational Autoencoder (VAE) with a Gradient Penalty-based Generative Adversarial Network (GAN-GP). First, the VAE is employed to encode the original spectrum, generating structured latent features that follow a standard normal distribution. This replaces the random noise input in traditional GANs, significantly enhancing the semantic consistency of generated samples and training stability. Second, an adversarial training mechanism based on Wasserstein distance with gradient penalty (WGAN-GP) is introduced, effectively mitigating mode collapse and gradient vanishing, thereby improving the model’s capability to fit complex signal distributions. Furthermore, a multi-objective optimization function combining reconstruction error and adversarial loss is constructed, establishing an end-to-end integrated framework for feature learning, signal reconstruction, and anomaly discrimination. Experiments are conducted using a synthetic dataset comprising various modulation types and simulated environments with different signal-to-noise ratios for systematic validation. The results demonstrate that the spectrum data generated by VAE-GANGP closely matches the distribution of real signals. Under AWGN-dominated synthetic test conditions, the model achieves an anomaly detection accuracy of 98.1%. When evaluated under more realistic channel impairments (phase noise, multipath, impulsive interference), the model maintains competitive performance, outperforming existing methods and demonstrating promising potential for practical electromagnetic spectrum monitoring. Its performance significantly surpasses traditional detection methods and single deep learning models, providing a highly reliable and adaptive solution for spatial electromagnetic spectrum anomaly detection. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

21 pages, 1440 KB  
Article
VAE-GAN-Guided Cross-Class Generation: A Class Imbalance Data Augmentation Method for Network Intrusion Detection
by Fuyuan Kang, Tao Feng and Jiaqi Lin
Electronics 2025, 14(11), 2103; https://doi.org/10.3390/electronics14112103 - 22 May 2025
Cited by 6 | Viewed by 3597
Abstract
Network intrusion datasets often face class imbalance issues in intrusion detection tasks, where the number of majority class samples is much higher than minority class samples. Current solutions face notable limitations: traditional normalization weakens the multimodal distribution of continuous features, while mainstream generative [...] Read more.
Network intrusion datasets often face class imbalance issues in intrusion detection tasks, where the number of majority class samples is much higher than minority class samples. Current solutions face notable limitations: traditional normalization weakens the multimodal distribution of continuous features, while mainstream generative models focus excessively on minority class mining while neglecting majority class information. To address these issues, we propose M2M-VAEGAN, which innovatively incorporates a Variational Gaussian Mixture (VGM) model to preserve multimodal characteristics of continuous features. We design a transfer learning framework, pre-training on majority classes to capture general attack patterns, followed by fine-tuning with balanced batches of majority and minority samples to prevent catastrophic forgetting. Additionally, we enhance the VAEGAN architecture with an auxiliary classifier to strengthen conditional information learning. On the NSL-KDD and CIC-IDS2017 datasets, M2M-VAEGAN outperforms methods such as SMOTE, CTGAN, and CTABGAN, achieving a 1.25% to 6.42% improvement in minority class recall. These results demonstrate the effectiveness of the proposed approach. Full article
(This article belongs to the Special Issue Recognition of Patterns and Trends in Multimedia Datasets)
Show Figures

Figure 1

19 pages, 34572 KB  
Article
Suppression of Multiple Reflection Interference Signals in GPR Images Caused by Rebar Using VAE-GAN
by Chuan Li, Qibing Ma, Yawei Wang, Xi Yang, Hao Liu and Lulu Wang
Appl. Sci. 2025, 15(7), 3728; https://doi.org/10.3390/app15073728 - 28 Mar 2025
Cited by 7 | Viewed by 1867
Abstract
Due to the rebars layer’s shielding effect on Ground Penetrating Radar (GPR) waves, the hyperbolic clutter generated by the rebars interferes with the echoes from void beneath them. The overlapping waveforms of both signals result in attenuation and distortion of the void signals, [...] Read more.
Due to the rebars layer’s shielding effect on Ground Penetrating Radar (GPR) waves, the hyperbolic clutter generated by the rebars interferes with the echoes from void beneath them. The overlapping waveforms of both signals result in attenuation and distortion of the void signals, making it difficult to identify void defects under the rebar. This study proposes an unsupervised generative network model based on Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs). Through a shared latent space, mapping is achieved between two image domains, effectively eliminating the multiple reflection interference signals caused by the rebar while accurately reconstructing the void defects, generating GPR B-Scan images without rebar clutter. Additionally, the channel and spatial attention module (CSA) is implemented into the model to help the network to better focus on the essential information in GPR images. The proposed model was validated through ablation and comparative experiments using synthetic data. Finally, real GPR data from the Husa Tunnel were used to verify the model’s effectiveness in practical engineering applications. The results showed that this model is highly effective; it improves the visibility of void defects signals, thereby enhancing the interpretability of GPR data for tunnel lining inspections. Full article
Show Figures

Figure 1

14 pages, 743 KB  
Article
AD-VAE: Adversarial Disentangling Variational Autoencoder
by Adson Silva and Ricardo Farias
Sensors 2025, 25(5), 1574; https://doi.org/10.3390/s25051574 - 4 Mar 2025
Cited by 8 | Viewed by 2568
Abstract
Face recognition (FR) is a less intrusive biometrics technology with various applications, such as security, surveillance, and access control systems. FR remains challenging, especially when there is only a single image per person as a gallery dataset and when dealing with variations like [...] Read more.
Face recognition (FR) is a less intrusive biometrics technology with various applications, such as security, surveillance, and access control systems. FR remains challenging, especially when there is only a single image per person as a gallery dataset and when dealing with variations like pose, illumination, and occlusion. Deep learning techniques have shown promising results in recent years using VAE and GAN, with approaches such as patch-VAE, VAE-GAN for 3D Indoor Scene Synthesis, and hybrid VAE-GAN models. However, in Single Sample Per Person Face Recognition (SSPP FR), the challenge of learning robust and discriminative features that preserve the subject’s identity persists. To address these issues, we propose a novel framework called AD-VAE, specifically for SSPP FR, using a combination of variational autoencoder (VAE) and Generative Adversarial Network (GAN) techniques. The proposed AD-VAE framework is designed to learn how to build representative identity-preserving prototypes from both controlled and wild datasets, effectively handling variations like pose, illumination, and occlusion. The method uses four networks: an encoder and decoder similar to VAE, a generator that receives the encoder output plus noise to generate an identity-preserving prototype, and a discriminator that operates as a multi-task network. AD-VAE outperforms all tested state-of-the-art face recognition techniques, demonstrating its robustness. The proposed framework achieves superior results on four controlled benchmark datasets—AR, E-YaleB, CAS-PEAL, and FERET—with recognition rates of 84.9%, 94.6%, 94.5%, and 96.0%, respectively, and achieves remarkable performance on the uncontrolled LFW dataset, with a recognition rate of 99.6%. The AD-VAE framework shows promising potential for future research and real-world applications. Full article
(This article belongs to the Topic Applications in Image Analysis and Pattern Recognition)
Show Figures

Figure 1

23 pages, 6852 KB  
Article
Reconstruction of Random Structures Based on Generative Adversarial Networks: Statistical Variability of Mechanical and Morphological Properties
by Mikhail Tashkinov, Yulia Pirogova, Evgeniy Kononov, Aleksandr Shalimov and Vadim V. Silberschmidt
Mathematics 2025, 13(1), 7; https://doi.org/10.3390/math13010007 - 24 Dec 2024
Cited by 5 | Viewed by 1605
Abstract
Generative adversarial neural networks with a variational autoencoder (VAE-GANs) are actively used in the field of materials design. The synthesis of random structures with nonrepeated geometry and predetermined mechanical properties is important for solving various practical problems. Geometric parameters of such artificially generated [...] Read more.
Generative adversarial neural networks with a variational autoencoder (VAE-GANs) are actively used in the field of materials design. The synthesis of random structures with nonrepeated geometry and predetermined mechanical properties is important for solving various practical problems. Geometric parameters of such artificially generated random structures can vary within certain limits compared to the training dataset, causing unpredicted fluctuations in their resulting mechanical response. This study investigates the statistical variability of mechanical and morphological characteristics of random 3D models reconstructed from 2D images using a VAE-GAN neural network. A combined multitool method employing different mathematical and statistical instruments for comparison of the reconstructed models with their corresponding originals is proposed. It includes the analysis of statistical distributions of elastic properties, morphometric parameters, and stress values. The neural network was trained on two datasets, containing models created based on Gaussian random fields. Statistical fluctuations of the mechanical and morphological parameters of the reconstructed models are analyzed. The deviation of the effective elastic modulus of the reconstructed models from that of the original ones was less than 5.7% on average. The difference between the median values of ligament thickness and distance between ligaments ranged from 3.6 to 6.5% and 2.6 to 5.2%, respectively. The median value of the surface area of the reconstructed geometries was 4.6–8.1% higher compared to the original models. It is thus shown that mechanical properties of the NN-generated structures retain the statistical variability of the corresponding originals, while the variability of the morphology is highly affected by the training set and does not depend on the configuration of the input 2D image. Full article
Show Figures

Figure 1

12 pages, 1581 KB  
Article
Airfoil Shape Generation and Feature Extraction Using the Conditional VAE-WGAN-gp
by Kazuo Yonekura, Yuki Tomori and Katsuyuki Suzuki
AI 2024, 5(4), 2092-2103; https://doi.org/10.3390/ai5040102 - 28 Oct 2024
Cited by 14 | Viewed by 3523
Abstract
A machine learning method was applied to solve an inverse airfoil design problem. A conditional VAE-WGAN-gp model, which couples the conditional variational autoencoder (VAE) and Wasserstein generative adversarial network with gradient penalty (WGAN-gp), is proposed for an airfoil generation method, and then, it [...] Read more.
A machine learning method was applied to solve an inverse airfoil design problem. A conditional VAE-WGAN-gp model, which couples the conditional variational autoencoder (VAE) and Wasserstein generative adversarial network with gradient penalty (WGAN-gp), is proposed for an airfoil generation method, and then, it is compared with the WGAN-gp and VAE models. The VAEGAN model couples the VAE and GAN models, which enables feature extraction in the GAN models. In airfoil generation tasks, to generate airfoil shapes that satisfy lift coefficient requirements, it is known that VAE outperforms WGAN-gp with respect to the accuracy of the reproduction of the lift coefficient, whereas GAN outperforms VAE with respect to the smoothness and variations of generated shapes. In this study, VAE-WGAN-gp demonstrated a good performance in all three aspects. Latent distribution was also studied to compare the feature extraction ability of the proposed method. Full article
Show Figures

Figure 1

29 pages, 6007 KB  
Article
VAE-WACGAN: An Improved Data Augmentation Method Based on VAEGAN for Intrusion Detection
by Wuxin Tian, Yanping Shen, Na Guo, Jing Yuan and Yanqing Yang
Sensors 2024, 24(18), 6035; https://doi.org/10.3390/s24186035 - 18 Sep 2024
Cited by 25 | Viewed by 4621
Abstract
To address the class imbalance issue in network intrusion detection, which degrades performance of intrusion detection models, this paper proposes a novel generative model called VAE-WACGAN to generate minority class samples and balance the dataset. This model extends the Variational Autoencoder Generative Adversarial [...] Read more.
To address the class imbalance issue in network intrusion detection, which degrades performance of intrusion detection models, this paper proposes a novel generative model called VAE-WACGAN to generate minority class samples and balance the dataset. This model extends the Variational Autoencoder Generative Adversarial Network (VAEGAN) by integrating key features from the Auxiliary Classifier Generative Adversarial Network (ACGAN) and the Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP). These enhancements significantly improve both the quality of generated samples and the stability of the training process. By utilizing the VAE-WACGAN model to oversample anomalous data, more realistic synthetic anomalies that closely mirror the actual network traffic distribution can be generated. This approach effectively balances the network traffic dataset and enhances the overall performance of the intrusion detection model. Experimental validation was conducted using two widely utilized intrusion detection datasets, UNSW-NB15 and CIC-IDS2017. The results demonstrate that the VAE-WACGAN method effectively enhances the performance metrics of the intrusion detection model. Furthermore, the VAE-WACGAN-based intrusion detection approach surpasses several other advanced methods, underscoring its effectiveness in tackling network security challenges. Full article
(This article belongs to the Section Sensor Networks)
Show Figures

Figure 1

25 pages, 18421 KB  
Article
Prediction of Operational Noise Uncertainty in Automotive Micro-Motors Based on Multi-Branch Channel–Spatial Adaptive Weighting Strategy
by Hao Hu, Shiqi Deng, Wang Yan, Yanyong He and Yudong Wu
Electronics 2024, 13(13), 2553; https://doi.org/10.3390/electronics13132553 - 28 Jun 2024
Cited by 3 | Viewed by 1731
Abstract
The acoustic performance of automotive micro-motors directly impacts the comfort and driving experience of both drivers and passengers. However, various motor production and testing uncertainties can lead to noise fluctuations during operation. Thus, predicting the operational noise range of motors on the production [...] Read more.
The acoustic performance of automotive micro-motors directly impacts the comfort and driving experience of both drivers and passengers. However, various motor production and testing uncertainties can lead to noise fluctuations during operation. Thus, predicting the operational noise range of motors on the production line in advance becomes crucial for timely adjustments to production parameters and process optimization. This paper introduces a prediction model based on a Multi-Branch Channel–Spatial Adaptive Weighting Strategy (MCSAWS). The model includes a multi-branch feature extraction (MFE) network and a channel–spatial attention module (CSAM). It uses the vibration and noise data from micro-motors’ idle operations on the production line as input to efficiently predict the operational noise uncertainty interval of automotive micro-motors. The model employs the VAE-GAN approach for data augmentation (DA) and uses Gammatone filters to emphasize the noise at the commutation frequency of the motor. The model was compared with Convolutional Neural Networks (CNNs) and Multilayer Perceptrons (MLPs). Experimental results demonstrate that the MCSAWS method is superior to conventional methods in prediction accuracy and reliability, confirming the feasibility of the proposed approach. This research can help control noise uncertainty in micro-motors’ production and manufacturing processes in advance. Full article
(This article belongs to the Special Issue Applications of Artificial Intelligence in Mechanical Engineering)
Show Figures

Figure 1

22 pages, 10012 KB  
Article
Remaining Useful Life Prediction Method Enhanced by Data Augmentation and Similarity Fusion
by Huaqing Wang, Ye Li, Ye Jin, Shengkai Zhao, Changkun Han and Liuyang Song
Vibration 2024, 7(2), 560-581; https://doi.org/10.3390/vibration7020029 - 5 Jun 2024
Cited by 13 | Viewed by 3694
Abstract
Precise prediction of the remaining useful life (RUL) of rolling bearings is crucial for ensuring the smooth functioning of machinery and minimizing maintenance costs. The time-domain features can reflect the degenerative state of the bearings and reduce the impact of random noise present [...] Read more.
Precise prediction of the remaining useful life (RUL) of rolling bearings is crucial for ensuring the smooth functioning of machinery and minimizing maintenance costs. The time-domain features can reflect the degenerative state of the bearings and reduce the impact of random noise present in the original signal, which is often used for life prediction. However, obtaining ideal training data for RUL prediction is challenging. Thus, this paper presents a bearing RUL prediction method based on unsupervised learning sample augmentation, establishes a VAE-GAN model, and expands the time-domain features that are calculated based on the original vibration signals. By combining the advantages of VAE and GAN in data generation, the generated data can better represent the degradation state of the bearings. The original data and generated data are mixed to realize data augmentation. At the same time, the dynamic time warping (DTW) algorithm is introduced to measure the similarity of the dataset, establishing the mapping relationship between the training set and target sequence, thereby enhancing the prediction accuracy of supervised learning. Experiments employing the XJTU-SY rolling element bearing accelerated life test dataset, IMS dataset, and pantograph data indicate that the proposed method yields high accuracy in bearing RUL prediction. Full article
Show Figures

Figure 1

23 pages, 3371 KB  
Article
Zero-Shot SAR Target Recognition Based on a Conditional Generative Network with Category Features from Simulated Images
by Guo Chen, Siqian Zhang, Qishan He, Zhongzhen Sun, Xianghui Zhang and Lingjun Zhao
Remote Sens. 2024, 16(11), 1930; https://doi.org/10.3390/rs16111930 - 27 May 2024
Cited by 6 | Viewed by 2409
Abstract
SAR image target recognition relies heavily on a large number of annotated samples, making it difficult to classify the unseen class targets. Due to the lack of effective category auxiliary information, the current zero-shot target recognition methods for SAR images are limited to [...] Read more.
SAR image target recognition relies heavily on a large number of annotated samples, making it difficult to classify the unseen class targets. Due to the lack of effective category auxiliary information, the current zero-shot target recognition methods for SAR images are limited to inferring only one unseen class rather than classifying multiple unseen classes. To address this issue, a conditional generative network with the category features from the simulated images for zero-shot SAR target recognition is proposed in this paper. Firstly, the deep features are extracted from the simulated images and fused into the category features that characterize the entire class. Then, a conditional VAE-GAN network is constructed to generate the feature instances of the unseen classes. The high-level semantic information shared in the category features aids in generalizing the mapping learned from the seen classes to the unseen classes. Finally, the generated features of the unseen classes are used to train a classifier that can classify the real unseen images. The classification accuracies for the targets of the three unseen classes based on the proposed method can reach 99.80 ± 1.22% and 71.57 ± 2.28% with the SAMPLE dataset and the MSTAR dataset, respectively. The advantage and validity of the proposed architecture are indicated with a small number of the seen classes and a small amount of the training data. Furthermore, the proposed method can be extended to generalized zero-shot recognition. Full article
Show Figures

Figure 1

15 pages, 1748 KB  
Article
Leveraging Dual Variational Autoencoders and Generative Adversarial Networks for Enhanced Multimodal Interaction in Zero-Shot Learning
by Ning Li, Jie Chen, Nanxin Fu, Wenzhuo Xiao, Tianrun Ye, Chunming Gao and Ping Zhang
Electronics 2024, 13(3), 539; https://doi.org/10.3390/electronics13030539 - 29 Jan 2024
Cited by 6 | Viewed by 3363
Abstract
In the evolving field of taxonomic classification, and especially in Zero-shot Learning (ZSL), the challenge of accurately classifying entities unseen in training datasets remains a significant hurdle. Although the existing literature is rich in developments, it often falls short in two critical areas: [...] Read more.
In the evolving field of taxonomic classification, and especially in Zero-shot Learning (ZSL), the challenge of accurately classifying entities unseen in training datasets remains a significant hurdle. Although the existing literature is rich in developments, it often falls short in two critical areas: semantic consistency (ensuring classifications align with true meanings) and the effective handling of dataset diversity biases. These gaps have created a need for a more robust approach that can navigate both with greater efficacy. This paper introduces an innovative integration of transformer models with ariational autoencoders (VAEs) and generative adversarial networks (GANs), with the aim of addressing them within the ZSL framework. The choice of VAE-GAN is driven by their complementary strengths: VAEs are proficient in providing a richer representation of data patterns, and GANs are able to generate data that is diverse yet representative, thus mitigating biases from dataset diversity. Transformers are employed to further enhance semantic consistency, which is key because many existing models underperform. Through experiments have been conducted on benchmark ZSL datasets such as CUB, SUN, and Animals with Attributes 2 (AWA2), our approach is novel because it demonstrates significant improvements, not only in enhancing semantic and structural coherence, but also in effectively addressing dataset biases. This leads to a notable enhancement of the model’s ability to generalize visual categorization tasks beyond the training data, thus filling a critical gap in the current ZSL research landscape. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

21 pages, 4061 KB  
Article
Improved Prediction of Aquatic Beetle Diversity in a Stagnant Pool by a One-Dimensional Convolutional Neural Network Using Variational Autoencoder Generative Adversarial Network-Generated Data
by Miao Hu, Shujiao Jiang, Fenglong Jia, Xiaomei Yang and Zhiqiang Li
Appl. Sci. 2023, 13(15), 8841; https://doi.org/10.3390/app13158841 - 31 Jul 2023
Cited by 1 | Viewed by 2105
Abstract
Building a reasonable model for predicting biodiversity using limited data is challenging. Expanding limited experimental data using a variational autoencoder generative adversarial network (VAEGAN) to improve biodiversity predictions for a region is a new strategy. Aquatic beetle diversity in a large >30-year-old artificial [...] Read more.
Building a reasonable model for predicting biodiversity using limited data is challenging. Expanding limited experimental data using a variational autoencoder generative adversarial network (VAEGAN) to improve biodiversity predictions for a region is a new strategy. Aquatic beetle diversity in a large >30-year-old artificial pool that had not had human interference in Nanshe Village (Dapeng Peninsula, Shenzhen City, Guangdong Province, China) was investigated. Eight ecological factors were considered. These were water temperature, salinity, pH, water depth, proportional area of aquatic plants, proportional area of submerged plants, water area, and water level. Field sampling was performed for 1 or 2 days in the middle or late part of each month for a year. A type D net was swept 10 times in the same direction in each ~1 m × ~1 m sample square, generating 132 datasets (experimental data). In total, 39 aquatic beetle species were collected, 19 of which were assigned to Hydrophilidae, 16 to Dytiscidae, 3 to Noteridae, and 1 to Gyrinidae. A one-dimensional convolutional neural network (1-D CNN) was used to assess and predict the grade of the number of individuals and the number of aquatic beetle species. The Bayesian-optimized 1-D CNN established using 112 experimental datasets as the training set and the other 20 datasets as validation and testing sets gave a 74.0% prediction accuracy for the grade of the number of individuals and a 70.0% prediction accuracy for the number of species. The impact of insufficient sample data on the model was assessed using a VAEGAN to expand the training set from 112 to 512 samples, and then the Bayesian-optimized 1-D CNN-based VAEGAN prediction model was re-established. This improved prediction accuracy for the grade of the number of individuals to 86.0% and for the number of species to 85.0%. The grade of the number of individuals’ prediction accuracy was 88.0% and the number of species’ prediction accuracy was 85.0% when the random effects of only obtaining a single individual of a species were excluded. The results indicated that the accuracy of the 1-D CNN in predicting the aquatic beetle species number and abundance from relevant environmental factors can be improved using a VAEGAN to expand the experimental data. Full article
Show Figures

Figure 1

20 pages, 1187 KB  
Article
3D Shape Generation via Variational Autoencoder with Signed Distance Function Relativistic Average Generative Adversarial Network
by Ebenezer Akinyemi Ajayi, Kian Ming Lim, Siew-Chin Chong and Chin Poo Lee
Appl. Sci. 2023, 13(10), 5925; https://doi.org/10.3390/app13105925 - 11 May 2023
Cited by 6 | Viewed by 7610
Abstract
3D shape generation is widely applied in various industries to create, visualize, and analyse complex data, designs, and simulations. Typically, 3D shape generation uses a large dataset of 3D shapes as the input. This paper proposes a variational autoencoder with a signed distance [...] Read more.
3D shape generation is widely applied in various industries to create, visualize, and analyse complex data, designs, and simulations. Typically, 3D shape generation uses a large dataset of 3D shapes as the input. This paper proposes a variational autoencoder with a signed distance function relativistic average generative adversarial network, referred to as 3D-VAE-SDFRaGAN, for 3D shape generation from 2D input images. Both the generative adversarial network (GAN) and variational autoencoder (VAE) algorithms are typical algorithms used to generate realistic 3D shapes. However, it is very challenging to train a stable 3D shape generation model using VAE-GAN. This paper proposes an efficient approach to stabilize the training process of VAE-GAN to generate high-quality 3D shapes. A 3D mesh-based shape is first generated using a 3D signed distance function representation by feeding a single 2D image into a 3D-VAE-SDFRaGAN network. The signed distance function is used to maintain inside–outside information in the implicit surface representation. In addition, a relativistic average discriminator loss function is employed as the training loss function. The polygon mesh surfaces are then produced via the marching cubes algorithm. The proposed 3D-VAE-SDFRaGAN is evaluated with the ShapeNet dataset. The experimental results indicate a notable enhancement in the qualitative performance, as evidenced by the visual comparison of the generated samples, as well as the quantitative performance evaluation using the chamfer distance metric. The proposed approach achieves an average chamfer distance score of 0.578, demonstrating superior performance compared to existing state-of-the-art models. Full article
Show Figures

Figure 1

26 pages, 16666 KB  
Article
Transfer Learning with Attributes for Improving the Landslide Spatial Prediction Performance in Sample-Scarce Area Based on Variational Autoencoder Generative Adversarial Network
by Mansheng Lin, Shuai Teng, Gongfa Chen and David Bassir
Land 2023, 12(3), 525; https://doi.org/10.3390/land12030525 - 21 Feb 2023
Cited by 12 | Viewed by 3928
Abstract
Owing to the complexity of obtaining the landslide inventory data, it is a challenge to establish a landslide spatial prediction model with limited labeled samples. This paper proposed a novel strategy, namely transfer learning with attributes (TLAs), to make good use of existing [...] Read more.
Owing to the complexity of obtaining the landslide inventory data, it is a challenge to establish a landslide spatial prediction model with limited labeled samples. This paper proposed a novel strategy, namely transfer learning with attributes (TLAs), to make good use of existing landslide inventory data, a strategy that is based on a variational autoencoder of a generative adversarial network (VAEGAN) for improving the landslide spatial prediction performance in sample-scarce areas. Different from transfer learning (TL), TLAs are pretraining the model with the data reconstructed by VAEGAN, so that the models learn in advance the landslide attributes of sample-scarce areas. Accordingly, a database containing a total of 986 landslides in three study areas with 14 landslide-influencing factors was established, and each of the three models, i.e., convolutional neural networks (CNNs), bidirectional long short-term memory (BiLSTM) and gated recurrent units (GRUs), was respectively selected as the feature extractor of the VAEGAN to reconstruct the data with attributes and the prediction model to generate the landslide susceptibility maps to investigate and validate the proposed TLA strategy. The experimental results showed that the TLA strategy increased the mean value of evaluators, such as the area under the receiver-operating characteristic (AUROC), F1-score, precision, recall and accuracy by about 2–7% compared with TL, results that indicated that the generated data have the attribute of specific study areas and the effectiveness of TLA strategy in sample-scare areas. Full article
Show Figures

Figure 1

21 pages, 2349 KB  
Review
Improvement of Generative Adversarial Network and Its Application in Bearing Fault Diagnosis: A Review
by Diwang Ruan, Xuran Chen, Clemens Gühmann and Jianping Yan
Lubricants 2023, 11(2), 74; https://doi.org/10.3390/lubricants11020074 - 10 Feb 2023
Cited by 62 | Viewed by 7138
Abstract
A small sample size and unbalanced sample distribution are two main problems when data-driven methods are applied for fault diagnosis in practical engineering. Technically, sample generation and data augmentation have proven to be effective methods to solve this problem. The generative adversarial network [...] Read more.
A small sample size and unbalanced sample distribution are two main problems when data-driven methods are applied for fault diagnosis in practical engineering. Technically, sample generation and data augmentation have proven to be effective methods to solve this problem. The generative adversarial network (GAN) has been widely used in recent years as a representative generative model. Besides the general GAN, many variants have recently been reported to address its inherent problems such as mode collapse and slow convergence. In addition, many new techniques are being proposed to increase the sample generation quality. Therefore, a systematic review of GAN, especially its application in fault diagnosis, is necessary. In this paper, the theory and structure of GAN and variants such as ACGAN, VAEGAN, DCGAN, WGAN, et al. are presented first. Then, the literature on GANs is mainly categorized and analyzed from two aspects: improvements in GAN’s structure and loss function. Specifically, the improvements in the structure are classified into three types: information-based, input-based, and layer-based. Regarding the modification of the loss function, it is sorted into two aspects: metric-based and regularization-based. Afterwards, the evaluation metrics of the generated samples are summarized and compared. Finally, the typical applications of GAN in the bearing fault diagnosis field are listed, and the challenges for further research are also discussed. Full article
(This article belongs to the Special Issue Recent Advances in Machine Learning in Tribology)
Show Figures

Figure 1

Back to TopTop