MDPI - Publisher of Open Access Journals

15 pages, 2694 KB

Open AccessArticle

Seismic Facies Recognition Based on Multimodal Network with Knowledge Graph

by Binpeng Yan, Mutian Li, Rui Pan and Jiaqi Zhao

Appl. Sci. 2025, 15(20), 11087; https://doi.org/10.3390/app152011087 - 16 Oct 2025

Seismic facies recognition constitutes a fundamental task in seismic data interpretation, playing an essential role in characterizing subsurface geological structures, sedimentary environments, and hydrocarbon reservoir distributions. Conventional approaches primarily depend on expert interpretation, which often introduces substantial subjectivity and operational inefficiency. Although deep learning-based methods have been introduced, most rely solely on unimodal data—namely, seismic images—and encounter challenges such as limited annotated samples and inadequate generalization capability. To overcome these limitations, this study proposes a multimodal seismic facies recognition framework named GAT-UKAN, which integrates a U-shaped Kolmogorov–Arnold Network (U-KAN) with a Graph Attention Network (GAT). This model is designed to accept dual-modality inputs. By fusing visual features with knowledge embeddings at intermediate network layers, the model achieves knowledge-guided feature refinement. This approach effectively mitigates issues related to limited samples and poor generalization inherent in single-modality frameworks. Experiments were conducted on the F3 block dataset from the North Sea. A knowledge graph comprising 47 entities and 12 relation types was constructed to incorporate expert knowledge. The results indicate that GAT-UKAN achieved a Pixel Accuracy of 89.7% and a Mean Intersection over Union of 70.6%, surpassing the performance of both U-Net and U-KAN. Furthermore, the model was transferred to the Parihaka field in New Zealand via transfer learning. After fine-tuning, the predictions exhibited strong alignment with seismic profiles, demonstrating the model’s robustness under complex geological conditions. Although the proposed model demonstrates excellent performance in accuracy and robustness, it has so far been validated only on 2D seismic profiles. Its capability to characterize continuous 3D geological features therefore remains limited. Full article

► Show Figures

Figure 1

16 pages, 1464 KB

Open AccessArticle

Transient Stability Assessment of Power Systems Built upon a Deep Spatio-Temporal Feature Extraction Network

by Yu Nan, Meng Tong, Zhenzhen Kong, Huichao Zhao and Yadong Zhao

Energies 2025, 18(17), 4547; https://doi.org/10.3390/en18174547 - 27 Aug 2025

Viewed by 609

Abstract

The rapid and accurate identification of power system transient stability status is a fundamental prerequisite for ensuring the secure and reliable operation of large-scale power grids. With the increasing complexity and heterogeneity of modern power system components, system nonlinearity has grown significantly, rendering traditional time-domain simulation and direct methods unable to meet accuracy and efficiency requirements simultaneously. To further improve the prediction accuracy of power system transient stability and provide more refined assessment results, this paper integrates deep learning with power system transient stability and proposes a transient stability assessment of power systems built upon a deep spatio-temporal feature extraction network method. First, a spatio-temporal feature extraction module is constructed by combining an improved graph attention network with a residual bidirectional temporal convolutional network, aiming to capture the spatial and bidirectional temporal characteristics of transient stability data. Second, a classification module is developed using the Kolmogorov–Arnold network to establish the mapping relationship between spatio-temporal features and transient stability states. This enables the accurate determination of the system’s transient stability status within a short time after fault occurrence. Finally, a weighted cross-entropy loss function is employed to address the issue of low prediction accuracy caused by the imbalanced sample distribution in the evaluation model. The feasibility, effectiveness, and superiority of the proposed method are validated through tests on the New England 10-machine 39-bus system and the NPCC 48-machine 140-bus system. Full article

► Show Figures

Figure 1

21 pages, 14585 KB

Open AccessArticle

Unsupervised Contrastive Graph Kolmogorov–Arnold Networks Enhanced Cross-Modal Retrieval Hashing

by Hongyu Lin, Shaofeng Shen, Yuchen Zhang and Renwei Xia

Mathematics 2025, 13(11), 1880; https://doi.org/10.3390/math13111880 - 4 Jun 2025

Cited by 1 | Viewed by 976

Abstract

To address modality heterogeneity and accelerate large-scale retrieval, cross-modal hashing strategies generate compact binary codes that enhance computational efficiency. Existing approaches often struggle with suboptimal feature learning due to fixed activation functions and limited cross-modal interaction. We propose Unsupervised Contrastive Graph Kolmogorov–Arnold Networks (GraphKAN) Enhanced Cross-modal Retrieval Hashing (UCGKANH), integrating GraphKAN with contrastive learning and hypergraph-based enhancement. GraphKAN enables more flexible cross-modal representation through enhanced nonlinear expression of features. We introduce contrastive learning that captures modality-invariant structures through sample pairs. To preserve high-order semantic relations, we construct a hypergraph-based information propagation mechanism, refining hash codes by enforcing global consistency. The efficacy of our UCGKANH approach is validated by thorough tests on the MIR-FLICKR, NUS-WIDE, and MS COCO datasets, which show significant gains in retrieval accuracy coupled with strong computational efficiency. Full article

► Show Figures

Figure 1

20 pages, 10077 KB

Open AccessArticle

A GraphKAN-Based Intelligent Fault Diagnosis Method of Rolling Bearing Under Variable Working Conditions

by Ye Liu, Yanhe Xu, Jie Liu, Hui Qin and Xinqiang Niu

Symmetry 2025, 17(2), 241; https://doi.org/10.3390/sym17020241 - 6 Feb 2025

Cited by 1 | Viewed by 1206

Abstract

Unsupervised domain adaptation (UDA) can effectively address the two main drawbacks of transfer learning: the requirement of a large number of samples collected from different working conditions, and the inherent defects of convolutional neural networks (CNNs). In the realm of UDA, it is essential to leverage three types of information: class labels, domain specifications, and data organization. These components play a vital role in linking the source domain with the target domain. A technique aimed at identifying issues in rolling bearings is presented, employing an integration of CNN-KAN and GraphKAN structures to support the UDA methodology. A cohesive deep learning architecture is employed to represent the three types of information involved in UDA. The initial two types of information are represented through the roles of classifier and domain discriminator. To begin with, an architecture leveraging CNN-KAN is employed to extract features from the incoming signals. Following this, the features obtained from the CNN-KAN architecture are input into a specially developed graph creation layer that constructs instance graphs by analyzing the relationships among the structural characteristics found within the samples. In the following step, an innovative GraphKAN model is applied to illustrate the instance graphs, concurrently employing CORrelation ALignment (CORAL) loss to assess the structural discrepancies among instance graphs from different domains. Results from experiments conducted on two separate datasets demonstrate that the proposed framework surpasses alternative approaches and successfully recognizes transferable characteristics that are advantageous for domain adaptation. Full article

(This article belongs to the Special Issue Symmetry in Bearing Modeling and Intelligent Fault Diagnosis)

► Show Figures

Figure 1

23 pages, 8209 KB

Open AccessArticle

Spatio-Temporal Transformer with Kolmogorov–Arnold Network for Skeleton-Based Hand Gesture Recognition

by Pengcheng Han, Xin He, Takafumi Matsumaru and Vibekananda Dutta

Sensors 2025, 25(3), 702; https://doi.org/10.3390/s25030702 - 24 Jan 2025

Viewed by 2464

Abstract

Manually crafted features often suffer from being subjective, having an inadequate accuracy, or lacking in robustness in recognition. Meanwhile, existing deep learning methods often overlook the structural and dynamic characteristics of the human hand, failing to fully explore the contextual information of joints in both the spatial and temporal domains. To effectively capture dependencies between the hand joints that are not adjacent but may have potential connections, it is essential to learn long-term relationships. This study proposes a skeleton-based hand gesture recognition framework, the ST-KT, a spatio-temporal graph convolution network, and a transformer with the Kolmogorov–Arnold Network (KAN) model. It incorporates spatio-temporal graph convolution network (ST-GCN) modules and a spatio-temporal transformer module with KAN (KAN–Transformer). ST-GCN modules, which include a spatial graph convolution network (SGCN) and a temporal convolution network (TCN), extract primary features from skeleton sequences by leveraging the strength of graph convolutional networks in the spatio-temporal domain. A spatio-temporal position embedding method integrates node features, enriching representations by including node identities and temporal information. The transformer layer includes a spatial KAN–Transformer (S-KT) and a temporal KAN–Transformer (T-KT), which further extract joint features by learning edge weights and node embeddings, providing richer feature representations and the capability for nonlinear modeling. We evaluated the performance of our method on two challenging skeleton-based dynamic gesture datasets: our method achieved an accuracy of 97.5% on the SHREC’17 track dataset and 94.3% on the DHG-14/28 dataset. These results demonstrate that our proposed method, ST-KT, effectively captures dynamic skeleton changes and complex joint relationships. Full article

(This article belongs to the Topic Applied Computer Vision and Pattern Recognition: 2nd Edition)

► Show Figures

Figure 1

17 pages, 3457 KB

Open AccessArticle

Multimodal Information Fusion and Data Generation for Evaluation of Second Language Emotional Expression

by Jun Yang, Liyan Wang, Yong Qi, Haifeng Chen and Jian Li

Appl. Sci. 2024, 14(19), 9121; https://doi.org/10.3390/app14199121 - 9 Oct 2024

Viewed by 1883

Abstract

This study aims to develop an emotion evaluation method for second language learners, utilizing multimodal information to comprehensively evaluate students’ emotional expressions. Addressing the limitations of existing emotion evaluation methods, which primarily focus on the acoustic features of speech (e.g., pronunciation, frequency, and rhythm) and often neglect the emotional expressions conveyed through voice and facial videos, this paper proposes an emotion evaluation method based on multimodal information. The method includes the following three main parts: (1) generating virtual data using a Large Language Model (LLM) and audio-driven facial video synthesis, as well as integrating the IEMOCAP dataset with self-recorded student videos and audios containing teacher ratings to construct a multimodal emotion evaluation dataset; (2) a graph convolution-based emotion feature encoding network to extract emotion features from multimodal information; and (3) an emotion evaluation network based on Kolmogorov–Arnold Networks (KAN) to compare students’ emotion features with standard synthetic data for precise evaluation. The emotion recognition method achieves an unweighted accuracy (UA) of 68.02% and an F1 score of 67.11% in experiments with the IEMOCAP dataset and TTS data. The emotion evaluation model, using the KAN network, outperforms the MLP network, with a mean squared error (MSE) of 0.811 compared to 0.943, providing a reliable tool for evaluating language learners’ emotional expressions. Full article

(This article belongs to the Section Computing and Artificial Intelligence)

► Show Figures

Figure 1

Search Results (6)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (6)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI