Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (433)

Search Parameters:
Keywords = Bi-directional gated recurrent units

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
36 pages, 1954 KB  
Article
VeMisNet: Enhanced Feature Engineering for Deep Learning-Based Misbehavior Detection in Vehicular Ad Hoc Networks
by Nayera Youness, Ahmad Mostafa, Mohamed A. Sobh, Ayman M. Bahaa and Khaled Nagaty
J. Sens. Actuator Netw. 2025, 14(5), 100; https://doi.org/10.3390/jsan14050100 - 9 Oct 2025
Viewed by 129
Abstract
Ensuring secure and reliable communication in Vehicular Ad hoc Networks (VANETs) is critical for safe transportation systems. This paper presents Vehicular Misbehavior Network (VeMisNet), a deep learning framework for detecting misbehaving vehicles, with primary contributions in systematic feature engineering and scalability analysis. VeMisNet [...] Read more.
Ensuring secure and reliable communication in Vehicular Ad hoc Networks (VANETs) is critical for safe transportation systems. This paper presents Vehicular Misbehavior Network (VeMisNet), a deep learning framework for detecting misbehaving vehicles, with primary contributions in systematic feature engineering and scalability analysis. VeMisNet introduces domain-informed spatiotemporal features—including DSRC neighborhood density, inter-message timing patterns, and communication frequency analysis—derived from the publicly available VeReMi Extension Dataset. The framework evaluates Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Bidirectional LSTM architectures across dataset scales from 100 K to 2 M samples, encompassing all 20 attack categories. To address severe class imbalance (59.6% legitimate vehicles), VeMisNet applies SMOTE post train–test split, preventing data leakage while enabling balanced evaluation. Bidirectional LSTM with engineered features achieves 99.81% accuracy and F1-score on 500 K samples, with remarkable scalability maintaining >99.5% accuracy at 2 M samples. Critical metrics include 0.19% missed attack rates, under 0.05% false alarms, and 41.76 ms inference latency. The study acknowledges important limitations, including reliance on simulated data, single-split evaluation, and potential adversarial vulnerability. Domain-informed feature engineering provides 27.5% relative improvement over dimensionality reduction and 22-fold better scalability than basic features. These results establish new VANET misbehavior detection benchmarks while providing honest assessment of deployment readiness and research constraints. Full article
Show Figures

Figure 1

31 pages, 7912 KB  
Article
A FIG-IWOA-BiGRU Model for Bus Passenger Flow Fluctuation Trend and Spatial Prediction
by Jie Zhang, Qingling He, Xiaojuan Lu, Shungen Xiao and Ning Wang
Mathematics 2025, 13(19), 3204; https://doi.org/10.3390/math13193204 - 6 Oct 2025
Viewed by 142
Abstract
To capture bus passenger flow fluctuations and address the problems of slow convergence and high error in machine learning parameter optimization, this paper develops an improved Whale Optimization Algorithm (IWOA) integrated with a Bidirectional Gated Recurrent Unit (BiGRU). First, a Logistic–Tent chaotic mapping [...] Read more.
To capture bus passenger flow fluctuations and address the problems of slow convergence and high error in machine learning parameter optimization, this paper develops an improved Whale Optimization Algorithm (IWOA) integrated with a Bidirectional Gated Recurrent Unit (BiGRU). First, a Logistic–Tent chaotic mapping is introduced to generate a diverse and high-quality initial population. Second, a hybrid mechanism combining elite opposition-based learning and Cauchy mutation enhances population diversity and reduces premature convergence. Third, a cosine-based adaptive convergence factor and inertia weight strategy improve the balance between global exploration and local exploitation. Based on the correlation analysis between bus passenger flow and weather condition data in Harbin, and combined with the fluctuation characteristics of bus passenger flow, the data were divided into windows with a 7-day weekly cycle and processed by fuzzy information granulation to obtain three groups of fuzzy granulated window data, namely LOW, R, and UP, representing the fluctuation trend and spatial characteristics of bus passenger flow. The IWOA was employed to optimize and solve parameters such as the hidden layer weights and bias vectors of the BiGRU, thereby constructing a bus passenger flow fluctuation trend and spatial prediction model based on FIG-IWOA-BiGRU. Simulation experiments with 21 benchmark functions and real bus data verified its effectiveness. Results show that IWOA significantly improves optimization accuracy and convergence speed. For bus passenger flow forecasting, the average MAE, RMSE, and MAPE of LOW, R, and UP data are 2915, 3075, and 8.1%, representing improvements over existing classical models. The findings provide reliable decision support for bus scheduling and passenger travel planning. Full article
Show Figures

Figure 1

10 pages, 1464 KB  
Communication
A Signal Detection Method Based on BiGRU for FSO Communications with Atmospheric Turbulence
by Zhenning Yi, Zhiyong Xu, Jianhua Li, Jingyuan Wang, Jiyong Zhao, Yang Su and Yimin Wang
Photonics 2025, 12(10), 980; https://doi.org/10.3390/photonics12100980 - 2 Oct 2025
Viewed by 166
Abstract
In free space optical (FSO) communications, signals are affected by turbulence when transmitted through the atmosphere. Fluctuations in intensity caused by atmospheric turbulence lead to an increase in the bit error rate of FSO systems. Deep learning (DL), as a current research hotspot, [...] Read more.
In free space optical (FSO) communications, signals are affected by turbulence when transmitted through the atmosphere. Fluctuations in intensity caused by atmospheric turbulence lead to an increase in the bit error rate of FSO systems. Deep learning (DL), as a current research hotspot, offers a promising approach to improve the accuracy of signal detection. In this paper, we propose a signal detection method based on a bidirectional gated recurrent unit (BiGRU) neural network for FSO communications. The proposed detection method considers the temporal correlation of received signals due to the properties of the BiGRU neural network, which is not available in existing detection methods based on DL. In addition, the proposed detection method does not require channel state information (CSI) for channel estimation, unlike maximum likelihood (ML) detection technology with perfect CSI. Numerical results demonstrate that the proposed BiGRU-based detector achieves significant improvements in bit error rate (BER) performance compared with a multilayer perceptron (MLP)-based detector. Specifically, under weak turbulence conditions, the BiGRU-based detector achieves an approximate 2 dB signal-to-noise ratio (SNR) gain at a target BER of 106 compared to the MLP-based detector. Under moderate turbulence conditions, it achieves an approximate 6 dB SNR gain at the same target BER of 106. Under strong turbulence conditions, the proposed detector obtains a 6 dB SNR gain at a target BER of 104. Additionally, it outperforms conventional methods by more than one order of magnitude in BER under the same turbulence and SNR conditions. Full article
(This article belongs to the Section Optical Communication and Network)
Show Figures

Figure 1

23 pages, 5971 KB  
Article
Improved MNet-Atten Electric Vehicle Charging Load Forecasting Based on Composite Decomposition and Evolutionary Predator–Prey and Strategy
by Xiaobin Wei, Qi Jiang, Huaitang Xia and Xianbo Kong
World Electr. Veh. J. 2025, 16(10), 564; https://doi.org/10.3390/wevj16100564 - 2 Oct 2025
Viewed by 293
Abstract
In the context of low carbon, achieving accurate forecasting of electrical energy is critical for power management with the continuous development of power systems. For the sake of improving the performance of load forecasting, an improved MNet-Atten electric vehicle charging load forecasting based [...] Read more.
In the context of low carbon, achieving accurate forecasting of electrical energy is critical for power management with the continuous development of power systems. For the sake of improving the performance of load forecasting, an improved MNet-Atten electric vehicle charging load forecasting based on composite decomposition and the evolutionary predator–prey and strategy model is proposed. In this light, through the data decomposition theory, each subsequence is processed using complementary ensemble empirical mode decomposition and filters out high-frequency white noise by using singular value decomposition based on matrix operation, which improves the anti-interference ability and computational efficiency of the model. In the model construction stage, the MNet-Atten prediction model is developed and constructed. The convolution module is used to mine the local dependencies of the sequences, and the long term and short-term features of the data are extracted through the loop and loop skip modules to improve the predictability of the data itself. Furthermore, the evolutionary predator and prey strategy is used to iteratively optimize the learning rate of the MNet-Atten for improving the forecasting performance and convergence speed of the model. The autoregressive module is used to enhance the ability of the neural network to identify linear features and improve the prediction performance of the model. Increasing temporal attention to give more weight to important features for global and local linkage capture. Additionally, the electric vehicle charging load data in a certain region, as an example, is verified, and the average value of 30 running times of the combined model proposed is 117.3231 s, and the correlation coefficient PCC of the CEEMD-SVD-EPPS-MNet-Atten model is closer to 1. Furthermore, the CEEMD-SVD-EPPS-MNet-Atten model has the lowest MAPE, RMSE, and PCC. The results show that the model in this paper can better extract the characteristics of the data, improve the modeling efficiency, and have a high data prediction accuracy. Full article
(This article belongs to the Section Charging Infrastructure and Grid Integration)
Show Figures

Graphical abstract

27 pages, 10646 KB  
Article
Deep Learning-Based Hybrid Model with Multi-Head Attention for Multi-Horizon Stock Price Prediction
by Rajesh Kumar Ghosh, Bhupendra Kumar Gupta, Ajit Kumar Nayak and Samit Kumar Ghosh
J. Risk Financial Manag. 2025, 18(10), 551; https://doi.org/10.3390/jrfm18100551 - 1 Oct 2025
Viewed by 430
Abstract
The prediction of stock prices is challenging due to their volatility, irregular patterns, and complex time-series structure. Reliably forecasting stock market data plays a crucial role in minimizing financial risk and optimizing investment strategies. However, traditional models often struggle to capture temporal dependencies [...] Read more.
The prediction of stock prices is challenging due to their volatility, irregular patterns, and complex time-series structure. Reliably forecasting stock market data plays a crucial role in minimizing financial risk and optimizing investment strategies. However, traditional models often struggle to capture temporal dependencies and extract relevant features from noisy inputs, which limits their predictive performance. To improve this, we developed an enhanced recursive feature elimination (RFE) method that blends the importance of impurity-based features from random forest and gradient boosting models with Kendall tau correlation analysis, and we applied SHapley Additive exPlanations (SHAP) analysis to externally validate the reliability of the selected features. This approach leads to more consistent and reliable feature selection for short-term stock prediction over 1-, 3-, and 7-day intervals. The proposed deep learning (DL) architecture integrates a temporal convolutional network (TCN) for long-term pattern recognition, a gated recurrent unit (GRU) for sequence capture, and multi-head attention (MHA) for focusing on critical information, thereby achieving superior predictive performance. We evaluate the proposed approach using daily stock price data from three leading companies—HDFC Bank, Tata Consultancy Services (TCS), and Tesla—and two major stock indices: Nifty 50 and S&P 500. The performance of our model is compared against five benchmark models: temporal convolutional network (TCN), long short-term memory (LSTM), GRU, Bidirectional GRU, and a hybrid TCN–GRU model. Our method consistently shows lower error rates and higher predictive accuracy across all datasets, as measured by four commonly used performance metrics. Full article
(This article belongs to the Section Financial Markets)
Show Figures

Figure 1

27 pages, 2645 KB  
Article
Short-Text Sentiment Classification Model Based on BERT and Dual-Stream Transformer Gated Attention Mechanism
by Song Yang, Jiayao Xing, Zhaoxia Liu and Yunhao Sun
Electronics 2025, 14(19), 3904; https://doi.org/10.3390/electronics14193904 - 30 Sep 2025
Viewed by 303
Abstract
With the rapid development of social media, short-text data have become increasingly important in fields such as public opinion monitoring, user feedback analysis, and intelligent recommendation systems. However, existing short-text sentiment analysis models often suffer from limited cross-domain adaptability and poor generalization performance. [...] Read more.
With the rapid development of social media, short-text data have become increasingly important in fields such as public opinion monitoring, user feedback analysis, and intelligent recommendation systems. However, existing short-text sentiment analysis models often suffer from limited cross-domain adaptability and poor generalization performance. To address these challenges, this study proposes a novel short-text sentiment classification model based on the Bidirectional Encoder Representations from Transformers (BERTs) and a dual-stream Transformer gated attention mechanism. This model first employs Bidirectional Encoder Representations from Transformers (BERTs) and the Chinese Robustly Optimized BERT Pretraining Approach (Chinese-RoBERTa) to achieve data augmentation and multilevel semantic mining, thereby expanding the training corpus and enhancing minority class coverage. Second, a dual-stream Transformer gated attention mechanism was developed to dynamically adjust feature fusion weights, enhancing adaptability to heterogeneous texts. Finally, the model integrates a Bidirectional Gated Recurrent Unit (BiGRU) with Multi-Head Self-Attention (MHSA) to strengthen sequence information modeling and global context capture, enabling the precise identification of key sentiment dependencies. The model’s superior performance in handling data imbalance and complex textual sentiment logic scenarios is demonstrated by the experimental results, achieving significant improvements in accuracy and F1 score. The F1 score reached 92.4%, representing an average increase of 8.7% over the baseline models. This provides an effective solution for enhancing the performance and expanding the application scenarios of short-text sentiment analysis models. Full article
(This article belongs to the Special Issue Deep Generative Models and Recommender Systems)
Show Figures

Figure 1

29 pages, 2068 KB  
Article
Voice-Based Early Diagnosis of Parkinson’s Disease Using Spectrogram Features and AI Models
by Danish Quamar, V. D. Ambeth Kumar, Muhammad Rizwan, Ovidiu Bagdasar and Manuella Kadar
Bioengineering 2025, 12(10), 1052; https://doi.org/10.3390/bioengineering12101052 - 29 Sep 2025
Viewed by 421
Abstract
Parkinson’s disease (PD) is a progressive neurodegenerative disorder that significantly affects motor functions, including speech production. Voice analysis offers a less invasive, faster and more cost-effective approach for diagnosing and monitoring PD over time. This research introduces an automated system to distinguish between [...] Read more.
Parkinson’s disease (PD) is a progressive neurodegenerative disorder that significantly affects motor functions, including speech production. Voice analysis offers a less invasive, faster and more cost-effective approach for diagnosing and monitoring PD over time. This research introduces an automated system to distinguish between PD and non-PD individuals based on speech signals using state-of-the-art signal processing and machine learning (ML) methods. A publicly available voice dataset (Dataset 1, 81 samples) containing speech recordings from PD patients and non-PD individuals was used for model training and evaluation. Additionally, a small supplementary dataset (Dataset 2, 15 samples) was created although excluded from experiment, to illustrate potential future extensions of this work. Features such as Mel-frequency cepstral coefficients (MFCCs), spectrograms, Mel spectrograms and waveform representations were extracted to capture key vocal impairments related to PD, including diminished vocal range, weak harmonics, elevated spectral entropy and impaired formant structures. These extracted features were used to train and evaluate several ML models, including support vector machine (SVM), XGBoost and logistic regression, as well as deep learning (DL)architectures such as deep neural networks (DNN), convolutional neural networks (CNN) combined with long short-term memory (LSTM), CNN + gated recurrent unit (GRU) and bidirectional LSTM (BiLSTM). Experimental results show that DL models, particularly BiLSTM, outperform traditional ML models, achieving 97% accuracy and an AUC of 0.95. The comprehensive feature extraction from both datasets enabled robust classification of PD and non-PD speech signals. These findings highlight the potential of integrating acoustic features with DL methods for early diagnosis and monitoring of Parkinson’s Disease. Full article
Show Figures

Figure 1

15 pages, 1708 KB  
Article
Fatigue Detection from 3D Motion Capture Data Using a Bidirectional GRU with Attention
by Ziyang Wang, Xueyi Liu and Yikang Wang
Appl. Sci. 2025, 15(19), 10492; https://doi.org/10.3390/app151910492 - 28 Sep 2025
Viewed by 195
Abstract
Exercise-induced fatigue can degrade athletic performance and increase injury risk, yet traditional fatigue assessments often rely on subjective measures. This study proposes an objective fatigue recognition approach using high-fidelity motion capture data and deep learning. This study induced both cognitive and physical fatigue [...] Read more.
Exercise-induced fatigue can degrade athletic performance and increase injury risk, yet traditional fatigue assessments often rely on subjective measures. This study proposes an objective fatigue recognition approach using high-fidelity motion capture data and deep learning. This study induced both cognitive and physical fatigue in 50 male participants through a dual task (mental challenge followed by intense exercise) and collected three-dimensional lower-limb joint kinematics and kinetics during vertical jumps. A bidirectional Gate Recurrent Unit (GRU) with an attention mechanism (BiGRU + Attention) was trained to classify pre- vs. post-fatigue states. Five-fold cross-validation was employed for within-sample evaluation, and attention weight analysis provided insight into key fatigue-related movement phases. The BiGRU + Attention model achieved superior performance with 92% classification accuracy and an Area Under Curve (AUC) of 96%, significantly outperforming the single-layer GRU baseline (85% accuracy, AUC 92%). It also exhibited higher recall and fewer missed detections of fatigue. The attention mechanism highlighted critical moments (end of countermovement and landing) associated with fatigue-induced biomechanical changes, enhancing model interpretability. This study collects spatial data and biomechanical data during movement, and uses a bidirectional Gate Recurrent Unit (GRU) model with an attention mechanism to distinguish between non-fatigue states and fatigue states involving both physical and psychological aspects, which holds certain pioneering significance in the field of fatigue state identification. This study lays the foundation for real-time fatigue monitoring systems in sports and rehabilitation, enabling timely interventions to prevent performance decline and injury. Full article
Show Figures

Figure 1

24 pages, 24348 KB  
Article
State of Health for Lithium-Ion Batteries Based on Explainable Feature Fragments via Graph Attention Network and Bi-Directional Gated Recurrent Unit
by Wenpeng Luan, Hanju Cai, Xiaohui Wang and Bochao Zhao
Sensors 2025, 25(19), 5953; https://doi.org/10.3390/s25195953 - 24 Sep 2025
Viewed by 501
Abstract
Accurate lithium-ion battery state of health estimation is critical for safety and range anxiety mitigation. Existing methods often lack interpretability in the extraction of feature fragments and fail to model spatial correlations between features. To address these gaps, this paper introduces a novel [...] Read more.
Accurate lithium-ion battery state of health estimation is critical for safety and range anxiety mitigation. Existing methods often lack interpretability in the extraction of feature fragments and fail to model spatial correlations between features. To address these gaps, this paper introduces a novel framework centered on interpretable feature engineering and synergistic spatial–temporal learning. The core novelty lies in using incremental capacity (IC) analysis on charging data, captured by onboard sensors, to dynamically select a 0.1 V voltage window based on IC peaks, ensuring the extracted voltage and capacity fragments are physically meaningful. These fragments are then transformed into graph-structured data, enabling a graph attention network and a bi-directional gated recurrent unit to effectively capture both spatial dependencies and temporal degradation trends, with a residual connection optimizing the network. Validation on two public benchmark datasets demonstrates the model’s superiority, achieving an average mean absolute error of 0.561% and a root mean square error of 0.783%. Furthermore, the model exhibits a low computational footprint, requiring only 1.68 MFLOPs per inference, and its fast inference time of 17.55 ms on an embedded platform confirms its feasibility for practical deployment. Full article
Show Figures

Figure 1

19 pages, 4231 KB  
Article
Deep Feature Decoupling Network for Ball Mill Load Signals
by Xiaoyan Luo, Wei Huang, Saisai He, Wencong Xiao and Zhihong Jiang
Machines 2025, 13(10), 881; https://doi.org/10.3390/machines13100881 - 24 Sep 2025
Viewed by 303
Abstract
Accurately identifying the load status of a ball mill is critical for optimizing grinding efficiency and ensuring operational stability. However, the one-dimensional vibration signals collected from ball mills exhibit strong non-stationarity and a high degree of entanglement between multi-scale local transient features and [...] Read more.
Accurately identifying the load status of a ball mill is critical for optimizing grinding efficiency and ensuring operational stability. However, the one-dimensional vibration signals collected from ball mills exhibit strong non-stationarity and a high degree of entanglement between multi-scale local transient features and long-range temporal evolution patterns. To address this, rather than relying on a purely black-box approach, this paper introduces a novel Deep Multi-scale Spatial–Temporal Feature Decoupling Network (DMSTFD-Net) guided by a clear feature decoupling philosophy to enhance model interpretability. The core of DMSTFD-Net lies in its hierarchical collaborative feature refinement mechanism. It first utilizes a one-dimensional residual network (ResNet) to adaptively capture and preliminarily decouple multi-scale spatial characteristics from the raw signal. Subsequently, the extracted high-level feature sequences are fed into a bidirectional gated recurrent unit (Bi-GRU) to decouple high-order temporal dynamic patterns. Experiments on a multi-condition dataset demonstrate that the proposed network achieves a state-of-the-art accuracy of 97.65%. Furthermore, dedicated cross-condition experiments and t-SNE visualizations validate the framework’s effectiveness. The results confirm that DMSTFD-Net provides a powerful, robust, and more interpretable solution for ball mill load identification. Full article
(This article belongs to the Section Advanced Manufacturing)
Show Figures

Figure 1

32 pages, 1238 KB  
Article
GRU-BERT for NILM: A Hybrid Deep Learning Architecture for Load Disaggregation
by Annysha Huzzat, Ahmed S. Khwaja, Ali A. Alnoman, Bhagawat Adhikari, Alagan Anpalagan and Isaac Woungang
AI 2025, 6(9), 238; https://doi.org/10.3390/ai6090238 - 22 Sep 2025
Viewed by 625
Abstract
Non-Intrusive Load Monitoring (NILM) aims to disaggregate a household’s total aggregated power consumption into appliance-level usage, enabling intelligent energy management without the need for intrusive metering. While deep learning has improved NILM significantly, existing NILM models struggle to capture load patterns across both [...] Read more.
Non-Intrusive Load Monitoring (NILM) aims to disaggregate a household’s total aggregated power consumption into appliance-level usage, enabling intelligent energy management without the need for intrusive metering. While deep learning has improved NILM significantly, existing NILM models struggle to capture load patterns across both longer time intervals and subtle timings for appliances involving brief or overlapping usage patterns. In this paper, we propose a novel GRU+BERT hybrid architecture, exploring both unidirectional (GRU+BERT) and bidirectional (Bi-GRU+BERT) variants. Our model combines Gated Recurrent Units (GRUs) to capture sequential temporal dependencies with Bidirectional Encoder Representations from Transformers (BERT), which is a transformer-based model that captures rich contextual information across the sequence. The bidirectional variant (Bi-GRU+BERT) processes input sequences in both forward (past to future) and backward (future to past) directions, enabling the model to learn relationships between power consumption values at different time steps more effectively. The unidirectional variant (GRU+BERT) provides an alternative suited for appliances with structured, sequential multi-phase usage patterns, such as dishwashers. By placing the Bi-GRU or GRU layer before BERT, our models first capture local time-based load patterns and then use BERT’s self-attention to understand the broader contextual relationships. This design addresses key limitations of both standalone recurrent and transformer-based models, offering improved performance on transient and irregular appliance loads. Evaluated on the UK-DALE and REDD datasets, the proposed Bi-GRU+BERT and GRU+BERT models show competitive performance compared to several state-of-the-art NILM models while maintaining a comparable model size and training time, demonstrating their practical applicability for real-time energy disaggregation, including potential edge and cloud deployment scenarios. Full article
(This article belongs to the Section AI Systems: Theory and Applications)
Show Figures

Figure 1

21 pages, 4834 KB  
Article
A Displacement Monitoring Model for High-Arch Dams Based on SHAP-Driven Ensemble Learning Optimized by the Gray Wolf Algorithm
by Shasha Li, Kai Jiang, Shunqun Yang, Zuxiu Lan, Yining Qi and Huaizhi Su
Water 2025, 17(18), 2766; https://doi.org/10.3390/w17182766 - 18 Sep 2025
Viewed by 389
Abstract
Displacement monitoring data is essential for assessing the structural safety of high-arch dams. Existing models, predominantly based on single-model architectures, often lack the ability to effectively integrate multiple algorithms, leading to limited predictive performance and poor interpretability. This study proposes an ensemble learning [...] Read more.
Displacement monitoring data is essential for assessing the structural safety of high-arch dams. Existing models, predominantly based on single-model architectures, often lack the ability to effectively integrate multiple algorithms, leading to limited predictive performance and poor interpretability. This study proposes an ensemble learning framework for dam displacement prediction, combining Hydraulic–Seasonal–Temporal model (HST), Random Forest (RF), and Bidirectional Gated Recurrent Unit (BiGRU) models as base learners. A stacking strategy is employed to enhance predictive accuracy, and the Grey Wolf Optimizer (GWO) is used for hyperparameter optimization. To improve model transparency, the Shapley Additive Explanations (SHAP) algorithm is applied for interpretability analysis. Extensive experiments demonstrate that the proposed ensemble model outperforms individual models, achieving a Root Mean Squared Error (RMSE) of 0.2241 and a Coefficient of Determination (R2) of 0.9993 on the test set. The SHAP analysis further elucidates the contribution of key variables, providing valuable insights into the displacement prediction process and offering a robust technical foundation for arch dam safety monitoring and early risk warning. Full article
Show Figures

Figure 1

20 pages, 2930 KB  
Article
Pain Level Classification from Speech Using GRU-Mixer Architecture with Log-Mel Spectrogram Features
by Adi Alhudhaif
Diagnostics 2025, 15(18), 2362; https://doi.org/10.3390/diagnostics15182362 - 17 Sep 2025
Viewed by 374
Abstract
Background/Objectives: Automatic pain detection from speech signals holds strong promise for non-invasive and real-time assessment in clinical and caregiving settings, particularly for populations with limited capacity for self-report. Methods: In this study, we introduce a lightweight recurrent deep learning approach, namely the [...] Read more.
Background/Objectives: Automatic pain detection from speech signals holds strong promise for non-invasive and real-time assessment in clinical and caregiving settings, particularly for populations with limited capacity for self-report. Methods: In this study, we introduce a lightweight recurrent deep learning approach, namely the Gated Recurrent Unit (GRU)-Mixer model for pain level classification based on speech signals. The proposed model maps raw audio inputs into Log-Mel spectrogram features, which are passed through a stacked bidirectional GRU for modeling the spectral and temporal dynamics of vocal expressions. To extract compact utterance-level embeddings, an adaptive average pooling-based temporal mixing mechanism is applied over the GRU outputs, followed by a fully connected classification head alongside dropout regularization. This architecture is used for several supervised classification tasks, including binary (pain/non-pain), graded intensity (mild, moderate, severe), and thermal-state (cold/warm) classification. End-to-end training is done using speaker-independent splits and class-balanced loss to promote generalization and discourage bias. The provided audio inputs are normalized to a consistent 3-s window and resampled at 8 kHz for consistency and computational efficiency. Results: Experiments on the TAME Pain dataset showcase strong classification performance, achieving 83.86% accuracy for binary pain detection and as high as 75.36% for multiclass pain intensity classification. Conclusions: As the first deep learning based classification work on the TAME Pain dataset, this work introduces the GRU-Mixer as an effective benchmark architecture for future studies on speech-based pain recognition and affective computing. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

35 pages, 6406 KB  
Article
Comparative Study of RNN-Based Deep Learning Models for Practical 6-DOF Ship Motion Prediction
by HaEun Lee and Yangjun Ahn
J. Mar. Sci. Eng. 2025, 13(9), 1792; https://doi.org/10.3390/jmse13091792 - 17 Sep 2025
Viewed by 458
Abstract
Accurate prediction of ship motion is essential for ensuring the safety and efficiency of maritime operations. However, the ship dynamics’ nonlinear, non-stationary, and environment-dependent nature presents significant challenges for reliable short-term forecasting. This study uses a simulated dataset designed to reflect realistic maritime [...] Read more.
Accurate prediction of ship motion is essential for ensuring the safety and efficiency of maritime operations. However, the ship dynamics’ nonlinear, non-stationary, and environment-dependent nature presents significant challenges for reliable short-term forecasting. This study uses a simulated dataset designed to reflect realistic maritime variability to evaluate the performance of recurrent neural network (RNN)-based models—including RNN, LSTM, GRU, and Bi-LSTM—under both single and multi-environment conditions. The analysis examines the effects of input sequence length, downsampling intervals, model complexity, and input dimensionality. Results show that Bi-LSTM consistently outperforms unidirectional architectures, particularly in complex multi-environment scenarios. In single-environment settings, the prediction horizon exceeded 40 s, while it decreased to around 20 s under more variable conditions, reflecting generalization challenges. Multi-degree-of-freedom (DOF) inputs enhanced performance by capturing the coupled nature of ship dynamics, whereas incorporating wave height data yielded inconsistent results. A sequence length of 200 timesteps and a downsampling interval of 5 effectively balanced motion feature preservation with high-frequency noise reduction. Increasing model size improved accuracy up to 256 hidden units and 10 layers, beyond which performance gains diminished. Additionally, Peak Matching was introduced as a complementary metric to MSE, emphasizing the importance of accurately predicting motion extrema for practical maritime applications. Full article
(This article belongs to the Special Issue Machine Learning for Prediction of Ship Motion)
Show Figures

Figure 1

36 pages, 8122 KB  
Article
Human Activity Recognition via Attention-Augmented TCN-BiGRU Fusion
by Ji-Long He, Jian-Hong Wang, Chih-Min Lo and Zhaodi Jiang
Sensors 2025, 25(18), 5765; https://doi.org/10.3390/s25185765 - 16 Sep 2025
Viewed by 745
Abstract
With the widespread application of wearable sensors in health monitoring and human–computer interaction, deep learning-based human activity recognition (HAR) research faces challenges such as the effective extraction of multi-scale temporal features and the enhancement of robustness against noise in multi-source data. This study [...] Read more.
With the widespread application of wearable sensors in health monitoring and human–computer interaction, deep learning-based human activity recognition (HAR) research faces challenges such as the effective extraction of multi-scale temporal features and the enhancement of robustness against noise in multi-source data. This study proposes the TGA-HAR (TCN-GRU-Attention-HAR) model. The TGA-HAR model integrates Temporal Convolutional Neural Networks and Recurrent Neural Networks by constructing a hierarchical feature abstraction architecture through cascading Temporal Convolutional Network (TCN) and Bidirectional Gated Recurrent Unit (BiGRU) layers for complex activity recognition. This study utilizes TCN layers with dilated convolution kernels to extract multi-order temporal features. This study utilizes BiGRU layers to capture bidirectional temporal contextual correlation information. To further optimize feature representation, the TGA-HAR model introduces residual connections to enhance the stability of gradient propagation and employs an adaptive weighted attention mechanism to strengthen feature representation. The experimental results of this study demonstrate that the model achieved test accuracies of 99.37% on the WISDM dataset, 95.36% on the USC-HAD dataset, and 96.96% on the PAMAP2 dataset. Furthermore, we conducted tests on datasets collected in real-world scenarios. This method provides a highly robust solution for complex human activity recognition tasks. Full article
(This article belongs to the Section Biomedical Sensors)
Show Figures

Figure 1

Back to TopTop