Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (6)

Search Parameters:
Keywords = BiGRU-extended transformer

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
19 pages, 700 KB  
Article
BiGRMT: Bidirectional GRU–Recurrent Memory Transformer for Efficient Long-Sequence Anomaly Detection in High-Concurrency Microservices
by Ruicheng Zhang, Renzun Zhang, Shuyuan Wang, Kun Yang, Miao Xu, Dongwei Qiao and Xuanzheng Hu
Electronics 2025, 14(23), 4754; https://doi.org/10.3390/electronics14234754 - 3 Dec 2025
Viewed by 218
Abstract
In high-concurrency distributed systems, log data often exhibits sequence uncertainty and redundancy, which pose significant challenges to the accuracy and efficiency of anomaly detection. To address these issues, we propose BiGRMT, a hybrid architecture that integrates Bidirectional Gated Recurrent Unit (Bi-GRU) with a [...] Read more.
In high-concurrency distributed systems, log data often exhibits sequence uncertainty and redundancy, which pose significant challenges to the accuracy and efficiency of anomaly detection. To address these issues, we propose BiGRMT, a hybrid architecture that integrates Bidirectional Gated Recurrent Unit (Bi-GRU) with a Recurrent Memory Transformer (RMT). BiGRMT enhances local temporal feature extraction through bidirectional modeling and adaptive noise filtering using Bi-GRU, while a RMT component is incorporated to significantly extend the model’s capacity for long-sequence modeling via segment-level memory. The Transformer’s multi-head attention mechanism continues to capture global time dependencies but now with improved efficiency due to the RMT’s memory-sharing design. Extensive experiments on three benchmark datasets from LogHub (Spark, BGL(Blue Gene/L), and HDFS (Hadoop distributed file system)) demonstrate that BiGRMT achieves strong results in terms of precision, recall, and F1-score. It attains a precision of 0.913, outperforming LogGPT (0.487) and slightly exceeding Temporal logical attention network (TLAN) (0.912). Compared to LogPal, which prioritizes detection accuracy, BiGRMT strikes a better balance by significantly reducing computational overhead while maintaining high detection performance. Even under challenging conditions such as a 50% increase in log generation rate or 20% injected noise, BiGRMT maintains F1-scores of 87.4% and 83.6%, respectively, showcasing excellent robustness. These findings confirm that BiGRMT is a scalable and practical solution for automated fault detection and intelligent maintenance in complex distributed software systems. Full article
Show Figures

Figure 1

27 pages, 7274 KB  
Article
Intelligent Identification of Internal Leakage of Spring Full-Lift Safety Valve Based on Improved Convolutional Neural Network
by Shuxun Li, Kang Yuan, Jianjun Hou and Xiaoqi Meng
Sensors 2025, 25(17), 5451; https://doi.org/10.3390/s25175451 - 3 Sep 2025
Cited by 1 | Viewed by 898
Abstract
In modern industry, the spring full-lift safety valve is a key device for safe pressure relief of pressure-bearing systems. Its valve seat sealing surface is easily damaged after long-term use, causing internal leakage, resulting in safety hazards and economic losses. Therefore, it is [...] Read more.
In modern industry, the spring full-lift safety valve is a key device for safe pressure relief of pressure-bearing systems. Its valve seat sealing surface is easily damaged after long-term use, causing internal leakage, resulting in safety hazards and economic losses. Therefore, it is of great significance to quickly and accurately diagnose its internal leakage state. Among the current methods for identifying fluid machinery faults, model-based methods have difficulties in parameter determination. Although the data-driven convolutional neural network (CNN) has great potential in the field of fault diagnosis, it has problems such as hyperparameter selection relying on experience, insufficient capture of time series and multi-scale features, and lack of research on valve internal leakage type identification. To this end, this study proposes a safety valve internal leakage identification method based on high-frequency FPGA data acquisition and improved CNN. The acoustic emission signals of different internal leakage states are obtained through the high-frequency FPGA acquisition system, and the two-dimensional time–frequency diagram is obtained by short-time Fourier transform and input into the improved model. The model uses the leaky rectified linear unit (LReLU) activation function to enhance nonlinear expression, introduces random pooling to prevent overfitting, optimizes hyperparameters with the help of horned lizard optimization algorithm (HLOA), and integrates the bidirectional gated recurrent unit (BiGRU) and selective kernel attention module (SKAM) to enhance temporal feature extraction and multi-scale feature capture. Experiments show that the average recognition accuracy of the model for the internal leakage state of the safety valve is 99.7%, which is better than the comparison model such as ResNet-18. This method provides an effective solution for the diagnosis of internal leakage of safety valves, and the signal conversion method can be extended to the fault diagnosis of other mechanical equipment. In the future, we will explore the fusion of lightweight networks and multi-source data to improve real-time and robustness. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

21 pages, 4725 KB  
Article
A Novel Open Circuit Fault Diagnosis for a Modular Multilevel Converter with Modal Time-Frequency Diagram and FFT-CNN-BIGRU Attention
by Ziyuan Zhai, Ning Wang, Siran Lu, Bo Zhou and Lei Guo
Machines 2025, 13(6), 533; https://doi.org/10.3390/machines13060533 - 19 Jun 2025
Viewed by 695
Abstract
Fault diagnosis is one of the most important issues for a modular multilevel converter (MMC). However, conventional solutions are deficient in two aspects. Firstly, they lack the necessary feature information. Secondly, they are incapable of performing open-circuit fault diagnosis of the modular multilevel [...] Read more.
Fault diagnosis is one of the most important issues for a modular multilevel converter (MMC). However, conventional solutions are deficient in two aspects. Firstly, they lack the necessary feature information. Secondly, they are incapable of performing open-circuit fault diagnosis of the modular multilevel converter with the requisite degree of accuracy. To solve this problem, an intelligent diagnosis method is proposed to integrate the modal time–frequency diagram and FFT-CNN-BiGRU-Attention. By selecting the phase current and bridge arm voltage as the core fault parameters, the particle swarm algorithm is used to optimize the Variational Modal Decomposition parameters, and the fault signal is decomposed and reconstructed into sensitive feature components. The reconstructed signals are further transformed into modal time–frequency diagrams via continuous wavelet transform to fully retain the time–frequency domain features. In the model construction stage, the frequency–domain features are first extracted using the fast Fourier transform (FFT), and the local patterns are captured through a combination with a convolutional neural network; subsequently, the timing correlations are analyzed using bidirectional gated loop cells, and the Attention Mechanism is introduced to strengthen the key features. Simulations show that the proposed method achieves 98.63% accuracy in locating faulty insulated gate bipolar transistors (IGBTs) in the sub-module, with second-level real-time response capability. Compared with the recently published scheme, it maintains stable performance under complex working conditions such as noise interference and data imbalances, showing stronger robustness and practical value. This study provides a new idea for the intelligent operation and maintenance of power electronic devices, which can be extended to the fault diagnosis of other power equipment in the future. Full article
(This article belongs to the Section Electromechanical Energy Conversion Systems)
Show Figures

Figure 1

17 pages, 2333 KB  
Article
Multi-Modal Emotion Recognition Based on Wavelet Transform and BERT-RoBERTa: An Innovative Approach Combining Enhanced BiLSTM and Focus Loss Function
by Shaohua Zhang, Yan Feng, Yihao Ren, Zefei Guo, Renjie Yu, Ruobing Li and Peiran Xing
Electronics 2024, 13(16), 3262; https://doi.org/10.3390/electronics13163262 - 16 Aug 2024
Cited by 6 | Viewed by 3038
Abstract
Emotion recognition plays an increasingly important role in today’s society and has a high social value. However, current emotion recognition technology faces the problems of insufficient feature extraction and imbalanced samples when processing speech and text information, which limits the performance of existing [...] Read more.
Emotion recognition plays an increasingly important role in today’s society and has a high social value. However, current emotion recognition technology faces the problems of insufficient feature extraction and imbalanced samples when processing speech and text information, which limits the performance of existing models. To overcome these challenges, this paper proposes a multi-modal emotion recognition method based on speech and text. The model is divided into two channels. In the first channel, the extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) feature set is extracted from OpenSmile, and the original eGeMAPS feature set is merged with the wavelet transformed eGeMAPS feature set. Then, speech features are extracted through a sparse autoencoder. The second channel extracts text features through the BERT-RoBERTa model. Then, deeper text features are extracted through a gated recurrent unit (GRU), and the deeper text features are fused with the text features. Emotions are identified by the attention layer, the dual-layer Bidirectional Long Short-Term Memory (BiLSTM) model, and the loss function, combined with cross-entropy loss and focus loss. Experiments show that, compared with the existing model, the WA and UA of this model are 73.95% and 74.27%, respectively, on the imbalanced IEMOCAP dataset, which is superior to other models. This research result effectively solves the problem of feature insufficiency and sample imbalance in traditional sentiment recognition methods, and provides a new way of thinking for sentiment analysis application. Full article
(This article belongs to the Section Circuit and Signal Processing)
Show Figures

Figure 1

27 pages, 8479 KB  
Article
Developing an Artificial Intelligence-Based Method for Predicting the Trajectory of Surface Drifting Buoys Using a Hybrid Multi-Layer Neural Network Model
by Miaomiao Song, Wei Hu, Shixuan Liu, Shizhe Chen, Xiao Fu, Jiming Zhang, Wenqing Li and Yuzhe Xu
J. Mar. Sci. Eng. 2024, 12(6), 958; https://doi.org/10.3390/jmse12060958 - 7 Jun 2024
Cited by 10 | Viewed by 2680
Abstract
Accurately predicting the long-term trajectory of a surface drifting buoy (SDB) is challenging. This paper proposes a promising solution to the SDB trajectory prediction based on artificial intelligence (AI) technologies. Initially, a scalable mathematical model for trajectory prediction is developed, transforming the challenge [...] Read more.
Accurately predicting the long-term trajectory of a surface drifting buoy (SDB) is challenging. This paper proposes a promising solution to the SDB trajectory prediction based on artificial intelligence (AI) technologies. Initially, a scalable mathematical model for trajectory prediction is developed, transforming the challenge of predicting trajectory points into predicting velocities in eastward and northward directions. Subsequently, a four-layer trajectory prediction calculation framework (FLTPCF) is established, outlining a complete workflow for the real-time online training of marine environment data and SDBs’ trajectory prediction. Thirdly, for facilitating accurate long-term trajectory prediction, a hybrid artificial neural network trajectory prediction model, named CNN–BiGRU–Attention, integrates a Convolutional Neural Network (CNN), Bidirectional Gated Recurrent Unit (BiGRU), and Attention mechanism (AM), tuned for spatiotemporal feature extraction and extended time-series reasoning. Extensive experiments, including ablation studies, comparative analyses with state-of-the-art models like BiLSTM and Transformer, evaluations against numerical methods, and adaptability tests, were conducted for justifying the CNN–BiGRU–Attention model. The results highlight the CNN–BiGRU–Attention model’s excellent convergence, accuracy, and generalization capabilities in predicting 24, 48, and 72 h trajectories for SDBs with varying drogue statuses and under different sea conditions. This work has great potential to promote the intelligent degree of marine environmental monitoring. Full article
(This article belongs to the Special Issue Applications of Artificial Intelligence in Marine Machinery)
Show Figures

Figure 1

16 pages, 15717 KB  
Article
Signal Reconstruction of Arbitrarily Lack of Frequency Bands from Seismic Wavefields Based on Deep Learning
by Xin Li, Fengjiao Zhang and Liguo Han
Appl. Sci. 2024, 14(11), 4922; https://doi.org/10.3390/app14114922 - 6 Jun 2024
Cited by 1 | Viewed by 1810
Abstract
Due to the limitations of seismic exploration instruments and the impact of the high frequencies absorption by the earth layers during subsurface propagation of seismic waves, recorded seismic data usually lack high and low frequency information that is needed to accurately image geological [...] Read more.
Due to the limitations of seismic exploration instruments and the impact of the high frequencies absorption by the earth layers during subsurface propagation of seismic waves, recorded seismic data usually lack high and low frequency information that is needed to accurately image geological structures. Traditional methods face challenges such as limitations of model assumptions and poor adaptability to complex geological conditions. Therefore, this paper proposes a deep learning method that introduces the attention mechanism and Bi-directional gated recurrent unit (BiGRU) into the Transformer neural network. This approach can simultaneously capture both global and local characteristics of time series data, establish mappings between different frequency bands, and achieve information compensation and frequency extension. The results show that the BiGRU-Extended Transformer network is capable of compensating and extending the synthetic seismic data sets with the limited frequency band. It has certain generalization capabilities and stability and can effectively handle various problems in the data reconstruction process, which is better than traditional methods. Full article
(This article belongs to the Special Issue Seismic Data Processing and Imaging)
Show Figures

Figure 1

Back to TopTop