Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (143)

Search Parameters:
Keywords = graph autoencoders

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
26 pages, 7102 KB  
Article
Sustainable Agile Identification and Adaptive Risk Control of Major Disaster Online Rumors Based on LLMs and EKGs
by Xin Chen
Sustainability 2025, 17(19), 8920; https://doi.org/10.3390/su17198920 - 8 Oct 2025
Viewed by 282
Abstract
Amid the increasing frequency and severity of major disasters, the rapid spread of online misinformation poses substantial risks to public safety, effective crisis management, and long-term societal sustainability. Current methods for managing disaster-related rumors rely on static, rule-based approaches that lack scalability, fail [...] Read more.
Amid the increasing frequency and severity of major disasters, the rapid spread of online misinformation poses substantial risks to public safety, effective crisis management, and long-term societal sustainability. Current methods for managing disaster-related rumors rely on static, rule-based approaches that lack scalability, fail to capture nuanced misinformation, and are limited to reactive responses, hindering effective disaster management. To address this gap, this study proposes a novel framework that leverages large language models (LLMs) and event knowledge graphs (EKGs) to facilitate the sustainable agile identification and adaptive control of disaster-related online rumors. The framework follows a multi-stage process, which includes the collection and preprocessing of disaster-related online data, the application of Gaussian Mixture Wasserstein Autoencoders (GMWAEs) for sentiment and rumor analysis, and the development of EKGs to enrich the understanding and reasoning of disaster events. Additionally, an enhanced model for rumor identification and risk control is introduced, utilizing Graph Attention Networks (GATs) to extract node features for accurate rumor detection and prediction of rumor propagation paths. Extensive experimental validation confirms the efficacy of the proposed methodology in improving disaster response. This study contributes novel theoretical insights and presents practical, scalable solutions for rumor control and risk management during crises. Full article
(This article belongs to the Section Hazards and Sustainability)
Show Figures

Figure 1

15 pages, 405 KB  
Article
Detecting Imbalanced Credit Card Fraud via Hybrid Graph Attention and Variational Autoencoder Ensembles
by Ibomoiye Domor Mienye, Ebenezer Esenogho and Cameron Modisane
AppliedMath 2025, 5(4), 131; https://doi.org/10.3390/appliedmath5040131 - 2 Oct 2025
Viewed by 537
Abstract
Credit card fraud detection remains a major challenge due to severe class imbalance and the constantly evolving nature of fraudulent behaviors. To address these challenges, this paper proposes a hybrid framework that integrates a Variational Autoencoder (VAE) for probabilistic anomaly detection, a Graph [...] Read more.
Credit card fraud detection remains a major challenge due to severe class imbalance and the constantly evolving nature of fraudulent behaviors. To address these challenges, this paper proposes a hybrid framework that integrates a Variational Autoencoder (VAE) for probabilistic anomaly detection, a Graph Attention Network (GAT) for capturing inter-transaction relationships, and a stacking ensemble with XGBoost for robust prediction. The joint use of VAE anomaly scores and GAT-derived node embeddings enables the model to capture both feature-level irregularities and relational fraud patterns. Experiments on the European Credit Card and IEEE-CIS Fraud Detection datasets show that the proposed approach outperforms baseline models by up to 15% in F1-score, achieving values above 0.980 with AUCs reaching 0.995. These results demonstrate the effectiveness of combining unsupervised anomaly detection with graph-based learning within an ensemble framework for highly imbalanced fraud detection problems. Full article
Show Figures

Figure 1

24 pages, 1807 KB  
Article
Defense Strategy Against False Data Injection Attacks on Cyber–Physical System for Vehicle–Grid Based on KNN-GAE
by Qiuyan Li, Dawei Song, Yuanyuan Wang, Di Wang, Weijian Tao and Qian Ai
Energies 2025, 18(19), 5215; https://doi.org/10.3390/en18195215 - 30 Sep 2025
Viewed by 376
Abstract
With the in-depth integration of electric vehicles (EVs) and smart grids, the Cyber–Physical System for Vehicle–Grid (CPSVG) has become a crucial component of power systems. However, its inherent characteristic of deep cyber–physical coupling also renders it vulnerable to cyberattacks, particularly False Data Injection [...] Read more.
With the in-depth integration of electric vehicles (EVs) and smart grids, the Cyber–Physical System for Vehicle–Grid (CPSVG) has become a crucial component of power systems. However, its inherent characteristic of deep cyber–physical coupling also renders it vulnerable to cyberattacks, particularly False Data Injection Attacks (FDIAs), which pose a severe threat to the safe and stable operation of the system. To address this challenge, this paper proposes an FDIA defense method based on K-Nearest Neighbor (KNN) and Graph Autoencoder (GAE). The method first employs the KNN algorithm to locate abnormal data in the system and identify the attacked nodes. Subsequently, Graph Autoencoder is utilized to reconstruct the tampered and contaminated data with high fidelity, restoring the accuracy and integrity of the data. Simulation verification was conducted in a typical vehicle–grid interaction system scenario. The results demonstrate that, compared with various scenarios such as no defense, traditional detection mechanisms, and only location-based data elimination, the proposed KNN-GAE method can more accurately identify and repair all attacked data. It provides reliable data input that is closest to the true values for subsequent state estimation, thereby significantly enhancing the system’s state awareness capability and operational stability after an attack. This study offers new insights and effective technical means for ensuring the security defense of the Vehicle–Grid Interaction Cyber–Physical System. Full article
(This article belongs to the Section E: Electric Vehicles)
Show Figures

Figure 1

22 pages, 5899 KB  
Article
Research on Power Flow Prediction Based on Physics-Informed Graph Attention Network
by Qiyue Huang, Yapeng Wang, Xu Yang, Sio-Kei Im and Jianxiu Cai
Appl. Sci. 2025, 15(19), 10555; https://doi.org/10.3390/app151910555 - 29 Sep 2025
Viewed by 239
Abstract
As an emerging distributed energy system, microgrid power flow prediction plays a crucial role in optimizing energy dispatch and power grid operation. Traditional methods of power flow prediction mainly rely on statistics and time series models, neglecting the spatial relationships among different nodes [...] Read more.
As an emerging distributed energy system, microgrid power flow prediction plays a crucial role in optimizing energy dispatch and power grid operation. Traditional methods of power flow prediction mainly rely on statistics and time series models, neglecting the spatial relationships among different nodes within the microgrid. To overcome this limitation, a Physical-Informed Graph Attention Network (PI-GAT) is proposed to capture the spatial structure of the microgrid, while an attention mechanism is introduced to measure the importance weights between nodes. In this study, we constructed a representative 14-node microgrid power flow dataset. After collecting the data, we preprocessed and transformed it into a suitable format for graph neural networks. Next, an autoencoder was employed for pre-training, enabling unsupervised learning-based dimensionality reduction to enhance the expressive power of the data. Subsequently, the extended data is fed into a graph convolution module with attention mechanism, allowing adaptive weight learning and capturing relationships between nodes. And integrate the physical state equation into the loss function to achieve high-precision power flow prediction. Finally, simulation verification was conducted, comparing the PI-GAT method with traditional approaches. The results indicate that the proposed model outperforms the other latest model across various evaluation indicators. Specifically, it has 46.9% improvement in MSE and 14.08% improvement in MAE. Full article
Show Figures

Figure 1

36 pages, 35564 KB  
Article
Enhancing Soundscape Characterization and Pattern Analysis Using Low-Dimensional Deep Embeddings on a Large-Scale Dataset
by Daniel Alexis Nieto Mora, Leonardo Duque-Muñoz and Juan David Martínez Vargas
Mach. Learn. Knowl. Extr. 2025, 7(4), 109; https://doi.org/10.3390/make7040109 - 24 Sep 2025
Viewed by 433
Abstract
Soundscape monitoring has become an increasingly important tool for studying ecological processes and supporting habitat conservation. While many recent advances focus on identifying species through supervised learning, there is growing interest in understanding the soundscape as a whole while considering patterns that extend [...] Read more.
Soundscape monitoring has become an increasingly important tool for studying ecological processes and supporting habitat conservation. While many recent advances focus on identifying species through supervised learning, there is growing interest in understanding the soundscape as a whole while considering patterns that extend beyond individual vocalizations. This broader view requires unsupervised approaches capable of capturing meaningful structures related to temporal dynamics, frequency content, spatial distribution, and ecological variability. In this study, we present a fully unsupervised framework for analyzing large-scale soundscape data using deep learning. We applied a convolutional autoencoder (Soundscape-Net) to extract acoustic representations from over 60,000 recordings collected across a grid-based sampling design in the Rey Zamuro Reserve in Colombia. These features were initially compared with other audio characterization methods, showing superior performance in multiclass classification, with accuracies of 0.85 for habitat cover identification and 0.89 for time-of-day classification across 13 days. For the unsupervised study, optimized dimensionality reduction methods (Uniform Manifold Approximation and Projection and Pairwise Controlled Manifold Approximation and Projection) were applied to project the learned features, achieving trustworthiness scores above 0.96. Subsequently, clustering was performed using KMeans and Density-Based Spatial Clustering of Applications with Noise (DBSCAN), with evaluations based on metrics such as the silhouette, where scores above 0.45 were obtained, thus supporting the robustness of the discovered latent acoustic structures. To interpret and validate the resulting clusters, we combined multiple strategies: spatial mapping through interpolation, analysis of acoustic index variance to understand the cluster structure, and graph-based connectivity analysis to identify ecological relationships between the recording sites. Our results demonstrate that this approach can uncover both local and broad-scale patterns in the soundscape, providing a flexible and interpretable pathway for unsupervised ecological monitoring. Full article
Show Figures

Figure 1

25 pages, 1804 KB  
Article
Adversarial Reconstruction with Spectral-Augmented and Graph Joint Embedding for Network Anomaly Detection
by Liwei Yu, Jing Wu, Qimei Chen and Guiao Yang
Electronics 2025, 14(18), 3730; https://doi.org/10.3390/electronics14183730 - 21 Sep 2025
Viewed by 399
Abstract
Network anomaly detection is widely used in network analysis and security prevention, in which reconstruction-based approaches have achieved remarkable results. However, attributed networks exhibit highly nonlinear relationships and time dependence over time, which make the anomalies more complex and ambiguous, resulting in anomaly [...] Read more.
Network anomaly detection is widely used in network analysis and security prevention, in which reconstruction-based approaches have achieved remarkable results. However, attributed networks exhibit highly nonlinear relationships and time dependence over time, which make the anomalies more complex and ambiguous, resulting in anomaly detection still facing challenges. To this end, this study proposes an adversarial reconstruction framework with spectral-augmented and graph joint embedding for anomaly detection (GAN-SAGE), which integrates an autoencoder (AE) based on the frequency feature enhanced graph transformer (GT) into the generator for generating adversarial networks (GAN), improving network representation through adversarial training. The first stage of the encoding process captures the frequency domain information of the input timing data through spectral-augmented, and the second stage enhances the modeling capability of spatial structure and graph interaction dependency through multi-attribute coupling and GTs. We conducted extensive experiments on AIOps, SWaT and WADI datasets, demonstrating the effectiveness of GAN-SAGE compared to the state-of-the-art method. The detection performance of GAN-SAGE, respectively, improved by an average of 9.64%, 18.73% and 19.79% in terms of F1-score across the three datasets. Full article
Show Figures

Figure 1

30 pages, 696 KB  
Article
SPADR: A Context-Aware Pipeline for Privacy Risk Detection in Text Data
by Sultan Asiri, Randa Alshehri, Fatima Kamran, Hend Laznam, Yang Xiao and Saleh Alzahrani
Electronics 2025, 14(18), 3725; https://doi.org/10.3390/electronics14183725 - 19 Sep 2025
Viewed by 715
Abstract
Large language models (LLMs) are powerful, but they can unintentionally memorize and leak sensitive information found in their training or input data. To address this issue, we propose SPADR, a semantic privacy anomaly detection and remediation pipeline designed to detect and remove privacy [...] Read more.
Large language models (LLMs) are powerful, but they can unintentionally memorize and leak sensitive information found in their training or input data. To address this issue, we propose SPADR, a semantic privacy anomaly detection and remediation pipeline designed to detect and remove privacy risks from text. SPADR addresses limitations in existing redaction methods by identifying deeper forms of sensitive content, including implied relationships, contextual clues, and non-standard identifiers that traditional NER systems often overlook. SPADR combines semantic anomaly scoring using a denoising autoencoder with named entity recognition and graph-based analysis to detect both direct and hidden privacy risks. It is flexible enough to work on both training data (to prevent memorization) and user input (to prevent leakage at inference time). We evaluate SPADR on the Enron Email Dataset, where it significantly reduces document-level privacy leakage while maintaining strong semantic utility. The enhanced version, SPADR (S2), reduces the PII leak rate from 100% to 16.06% and achieves a BERTScore F1 of 88.03%. Compared to standard NER-based redaction systems, SPADR offers more accurate and context-aware privacy protection. This work highlights the importance of semantic and structural understanding in building safer, privacy-respecting AI systems. Full article
Show Figures

Figure 1

25 pages, 2728 KB  
Article
QAMT: An LLM-Based Framework for Quality-Assured Medical Time-Series Data Generation
by Yi Luo, Yong Zhang, Chunxiao Xing, Peng Ren and Xinhao Liu
Sensors 2025, 25(17), 5482; https://doi.org/10.3390/s25175482 - 3 Sep 2025
Viewed by 858
Abstract
The extensive deployment of diverse sensors in hospitals has resulted in the collection of various medical time-series data. However, these real-world medical time-series data suffer from limited volume, poor data quality, and privacy concerns, resulting in performance degradation in downstream tasks, such as [...] Read more.
The extensive deployment of diverse sensors in hospitals has resulted in the collection of various medical time-series data. However, these real-world medical time-series data suffer from limited volume, poor data quality, and privacy concerns, resulting in performance degradation in downstream tasks, such as medical research and clinical decision-making. Existing studies provide generated medical data as a supplement or alternative to real-world data. However, medical time-series data are inherently complex, including temporal data such as laboratory measurements and static event data such as demographics and clinical outcomes, with each patient’s temporal data being influenced by their static event data. This intrinsic complexity makes the generation of high-quality medical time-series data particularly challenging. Traditional methods typically employ Generative Adversarial Networks (GANs) or Variational Autoencoders (VAEs), but these methods struggle to generate high-quality static event data of medical time-series data and often lack interpretability. Currently, large language models (LLMs) introduce new opportunities for medical data generation, but they face difficulties in generating temporal data and have challenges in specific domain generation tasks. In this study, we are the first to propose an LLM-based framework for modularly generating medical time-series data, QAMT, which generates quality-assured data and ensures the interpretability of the generation process. QAMT constructs a reliable health knowledge graph to provide medical expertise to the LLMs and designs dual modules to simultaneously generate static event data and temporal data, constituting high-quality medical time-series data. Moreover, QAMT introduces a quality assurance module to evaluate the generated data. Unlike existing methods, QAMT preserves the interpretability of the data generation process. Experimental results show that QAMT can generate higher-quality time-series medical data compared with existing methods. Full article
(This article belongs to the Special Issue Sensors Fusion in Digital Healthcare Applications)
Show Figures

Figure 1

19 pages, 17084 KB  
Article
SPADE: Superpixel Adjacency Driven Embedding for Three-Class Melanoma Segmentation
by Pablo Ordóñez, Ying Xie, Xinyue Zhang, Chloe Yixin Xie, Santiago Acosta and Issac Guitierrez
Algorithms 2025, 18(9), 551; https://doi.org/10.3390/a18090551 - 2 Sep 2025
Viewed by 570
Abstract
The accurate segmentation of pigmented skin lesions is a critical prerequisite for reliable melanoma detection, yet approximately 30% of lesions exhibit fuzzy or poorly defined borders. This ambiguity makes the definition of a single contour unreliable and limits the effectiveness of computer-assisted diagnosis [...] Read more.
The accurate segmentation of pigmented skin lesions is a critical prerequisite for reliable melanoma detection, yet approximately 30% of lesions exhibit fuzzy or poorly defined borders. This ambiguity makes the definition of a single contour unreliable and limits the effectiveness of computer-assisted diagnosis (CAD) systems. While clinical assessment based on the ABCDE criteria (asymmetry, border, color, diameter, and evolution), dermoscopic imaging, and scoring systems remains the standard, these methods are inherently subjective and vary with clinician experience. We address this challenge by reframing segmentation into three distinct regions: background, border, and lesion core. These regions are delineated using superpixels generated via the Simple Linear Iterative Clustering (SLIC) algorithm, which provides meaningful structural units for analysis. Our contributions are fourfold: (1) redefining lesion borders as regions, rather than sharp lines; (2) generating superpixel-level embeddings with a transformer-based autoencoder; (3) incorporating these embeddings as features for superpixel classification; and (4) integrating neighborhood information to construct enhanced feature vectors. Unlike pixel-level algorithms that often overlook boundary context, our pipeline fuses global class information with local spatial relationships, significantly improving precision and recall in challenging border regions. An evaluation on the HAM10000 melanoma dataset demonstrates that our superpixel–RAG–transformer (region adjacency graph) pipeline achieves exceptional performance (100% F1 score, accuracy, and precision) in classifying background, border, and lesion core superpixels. By transforming raw dermoscopic images into region-based structured representations, the proposed method generates more informative inputs for downstream deep learning models. This strategy not only advances melanoma analysis but also provides a generalizable framework for other medical image segmentation and classification tasks. Full article
Show Figures

Figure 1

18 pages, 3961 KB  
Article
Multi-Task Graph Attention Net for Electricity Consumption Prediction and Anomaly Detection
by Na Bai, Jian Zhang and Zhaoli Wu
Computers 2025, 14(9), 350; https://doi.org/10.3390/computers14090350 - 26 Aug 2025
Viewed by 554
Abstract
Precise electricity consumption forecasting and anomaly detection constitute fundamental requirements for maintaining grid reliability in smart power systems. While consumption patterns demonstrate quasi-periodic behavior with region-specific fluctuations influenced by environmental factors, existing approaches may fail to systematically model these dynamic variations or quantify [...] Read more.
Precise electricity consumption forecasting and anomaly detection constitute fundamental requirements for maintaining grid reliability in smart power systems. While consumption patterns demonstrate quasi-periodic behavior with region-specific fluctuations influenced by environmental factors, existing approaches may fail to systematically model these dynamic variations or quantify environmental impacts. This limitation results in a compromised prediction accuracy and ambiguous anomaly identification. To overcome these challenges, we propose a novel Multi-Task Graph Attention Network (MGAT) framework leveraging an adaptive entropy analysis. Our methodology comprises four key innovations: (1) the temporal decomposition of consumption data with entropy-based adaptive clustering into predictable low-entropy components (processed via multi-scale attention networks) and volatile high-entropy components; (2) the graph-based representation of high-entropy fluctuations through numerical correlation encoding, complemented by temporal environmental graphs quantifying external influences; (3) the hierarchical fusion of environmental and fluctuation graphs via a specialized Graph Attention Autoencoder that jointly models dynamic patterns and environmental dependencies; (4) the integrated synthesis of all components for simultaneous consumption prediction and anomaly detection. Experiments verify the MGAT’s performance in both forecasting precision and anomaly identification compared to conventional methods. Full article
(This article belongs to the Special Issue Machine Learning Applications in Pattern Recognition)
Show Figures

Figure 1

18 pages, 1061 KB  
Article
Using Causality-Driven Graph Representation Learning for APT Attacks Path Identification
by Xiang Cheng, Miaomiao Kuang and Hongyu Yang
Symmetry 2025, 17(9), 1373; https://doi.org/10.3390/sym17091373 - 22 Aug 2025
Viewed by 763
Abstract
In the cybersecurity attack and defense space, the “attacker” and the “defender” form a dynamic and symmetrical adversarial pair. Their strategy iterations and capability evolutions have long been in a symmetrical game of mutual restraint. We will introduce modern Intrusion Detection Systems (IDSs) [...] Read more.
In the cybersecurity attack and defense space, the “attacker” and the “defender” form a dynamic and symmetrical adversarial pair. Their strategy iterations and capability evolutions have long been in a symmetrical game of mutual restraint. We will introduce modern Intrusion Detection Systems (IDSs) from the defender’s side to counter the techniques designed by the attacker (APT attack). One major challenge faced by IDS is to identify complex attack paths from a vast provenance graph. By constructing an attack behavior tracking graph, the interactions between system entities can be recorded, but the malicious activities of attackers are often hidden among a large number of normal system operations. Although traditional methods can identify attack behaviors, they only focus on the surface association relationships between entities and ignore the deep causal relationships, which limits the accuracy and interpretability of detection. Existing graph anomaly detection methods usually assign the same weight to all interactions, while we propose a Causal Autoencoder for Graph Explanation (CAGE) based on reinforcement learning. This method extracts feature representations from the traceability graph through a graph attention network(GAT), uses Q-learning to dynamically evaluate the causal importance of edges, and highlights key causal paths through a weight layering strategy. In the DARPA TC project, the experimental results conducted on the selected three datasets indicate that the precision of this method in the anomaly detection task remains above 97% on average, demonstrating excellent accuracy. Moreover, the recall values all exceed 99.5%, which fully proves its extremely low rate of missed detections. Full article
(This article belongs to the Special Issue Advanced Studies of Symmetry/Asymmetry in Cybersecurity)
Show Figures

Figure 1

27 pages, 7664 KB  
Article
Autoencoder-like Sparse Non-Negative Matrix Factorization with Structure Relationship Preservation
by Ling Zhong and Haiyan Gao
Entropy 2025, 27(8), 875; https://doi.org/10.3390/e27080875 - 19 Aug 2025
Viewed by 658
Abstract
Clustering algorithms based on non-negative matrix factorization (NMF) have garnered significant attention in data mining due to their strong interpretability and computational simplicity. However, traditional NMF often struggles to effectively capture and preserve topological structure information between data during low-dimensional representation. Therefore, this [...] Read more.
Clustering algorithms based on non-negative matrix factorization (NMF) have garnered significant attention in data mining due to their strong interpretability and computational simplicity. However, traditional NMF often struggles to effectively capture and preserve topological structure information between data during low-dimensional representation. Therefore, this paper proposes an autoencoder-like sparse non-negative matrix factorization with structure relationship preservation (ASNMF-SRP). Firstly, drawing on the principle of autoencoders, a “decoder-encoder” co-optimization matrix factorization framework is constructed to enhance the factorization stability and representation capability of the coefficient matrix. Then, a preference-adjusted random walk strategy is introduced to capture higher-order neighborhood relationships between samples, encoding multi-order topological structure information of the data through an optimal graph regularization term. Simultaneously, to mitigate the impact of noise and outliers, the l2,1-norm is used to constrain the feature correlation between low-dimensional representations and the original data, preserving feature relationships between data, and a sparse constraint is imposed on the coefficient matrix via the inner product. Finally, clustering experiments conducted on 8 public datasets demonstrate that ASNMF-SRP consistently exhibits favorable clustering performance. Full article
(This article belongs to the Section Information Theory, Probability and Statistics)
Show Figures

Figure 1

14 pages, 2118 KB  
Article
Joint Spectral–Spatial Representation Learning for Unsupervised Hyperspectral Image Clustering
by Xuanhao Liu, Taimao Wang and Xiaofeng Wang
Appl. Sci. 2025, 15(16), 8935; https://doi.org/10.3390/app15168935 - 13 Aug 2025
Viewed by 567
Abstract
Hyperspectral image (HSI) clustering has attracted significant attention due to its broad applications in agricultural monitoring, environmental protection, and other fields. However, the integration of high-dimensional spectral and spatial information remains a major challenge, often resulting in unstable clustering and poor generalization under [...] Read more.
Hyperspectral image (HSI) clustering has attracted significant attention due to its broad applications in agricultural monitoring, environmental protection, and other fields. However, the integration of high-dimensional spectral and spatial information remains a major challenge, often resulting in unstable clustering and poor generalization under noisy or redundant conditions. To address these challenges, we propose a Joint Spectral–Spatial Representation Learning (JSRL) framework for robust hyperspectral image clustering. We first perform spectral clustering to generate pseudo-labels and guide a residual Graph Attention Network (GAT) that jointly refines pixel-level spectral and spatial features. We then aggregate pixels into superpixels and employ a Variational Graph Autoencoder (VGAE) to learn structure-aware representations, further optimized via a quantum-behaved particle swarm optimization (QPSO) strategy. This hierarchical architecture not only mitigates spectral redundancy and reinforces spatial coherence, but also enables more robust and generalizable clustering across diverse HSI scenarios. Extensive experiments on multiple benchmark HSI datasets demonstrate that JSRL consistently achieves state-of-the-art performance, highlighting its robustness and generalization capability across diverse clustering scenarios. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

31 pages, 8113 KB  
Article
An Autoencoder-like Non-Negative Matrix Factorization with Structure Regularization Algorithm for Clustering
by Haiyan Gao and Ling Zhong
Symmetry 2025, 17(8), 1283; https://doi.org/10.3390/sym17081283 - 10 Aug 2025
Viewed by 718
Abstract
Clustering plays a crucial role in data mining and knowledge discovery, where non-negative matrix factorization (NMF) has attracted widespread attention due to its effective data representation and dimensionality reduction capabilities. However, standard NMF has inherent limitations when processing sampled data embedded in low-dimensional [...] Read more.
Clustering plays a crucial role in data mining and knowledge discovery, where non-negative matrix factorization (NMF) has attracted widespread attention due to its effective data representation and dimensionality reduction capabilities. However, standard NMF has inherent limitations when processing sampled data embedded in low-dimensional manifold structures within high-dimensional ambient spaces, failing to effectively capture the complex structural information hidden in feature manifolds and sampling manifolds, and neglecting the learning of global structures. To address these issues, a novel structure regularization autoencoder-like non-negative matrix factorization for clustering (SRANMF) is proposed. Firstly, based on the non-negative symmetric encoder-decoder framework, we construct an autoencoder-like NMF model to enhance the characterization ability of latent information in data. Then, by fully considering high-order neighborhood relationships in the data, an optimal graph regularization strategy is introduced to preserve multi-order topological information structures. Additionally, principal component analysis (PCA) is employed to measure global data structures by maximizing the variance of projected data. Comparative experiments on 11 benchmark datasets demonstrate that SRANMF exhibits excellent clustering performance. Specifically, on the large-scale complex datasets MNIST and COIL100, the clustering evaluation metrics improved by an average of 35.31% and 46.17% (ACC) and 47.12% and 18.10% (NMI), respectively. Full article
(This article belongs to the Section Computer)
Show Figures

Figure 1

17 pages, 1455 KB  
Article
Enhanced Graph Autoencoder for Graph Anomaly Detection Using Subgraph Information
by Chi Zhang and Jin-Woo Jung
Appl. Sci. 2025, 15(15), 8691; https://doi.org/10.3390/app15158691 - 6 Aug 2025
Viewed by 1111
Abstract
Graph anomaly detection aims at identifying rare, unusual entities in attributed networks with respect to their patterns or structures that deviate significantly from the majority within a graph. Over the years, extensive efforts in this field have been dedicated to the powerful capability [...] Read more.
Graph anomaly detection aims at identifying rare, unusual entities in attributed networks with respect to their patterns or structures that deviate significantly from the majority within a graph. Over the years, extensive efforts in this field have been dedicated to the powerful capability of attributed networks to model real-world systems. Given the scarcity of labeled anomalies, current research primarily emphasizes model design via unsupervised learning. Graph autoencoders have been widely utilized for such purposes, leveraging the outstanding capabilities of Graph Neural Networks to model graph structured data. However, most existing graph autoencoder-based anomaly detectors do not exploit the nodes’ local subgraph information, limiting their ability to comprehensively understand the network for better representation learning. Moreover, these methods place greater emphasis on the attribute reconstruction process while neglecting the structure reconstruction aspect. This paper proposes an enhanced graph autoencoder framework for graph anomaly detection tasks that incorporates a subgraph extraction and aggregation preprocessing stage to utilize the nodes’ local topological information for enhanced embedding generation and to induce an additional node–subgraph view through model learning. A graph structure learning-based decoder is introduced as the structure decoder for better relationship learning. Finally, during the anomaly scoring stage, a node neighborhood selection technique is applied to enhance the detection performance. The effectiveness of the proposed framework is demonstrated through comprehensive experiments conducted on six commonly used real-world datasets. Full article
(This article belongs to the Special Issue Intelligent Computing for Sustainable Smart Cities)
Show Figures

Figure 1

Back to TopTop