MDPI - Publisher of Open Access Journals

17 pages, 4574 KB

Open AccessArticle

Fault Diagnosis Method for Rotating Machinery Based on Threshold-Free Recurrence Distance Visualization Convolutional Neural Network

by Chao Song, Fuzhou Feng, Feng Liu, Ziyu Liu and Hao Hu

Sensors 2026, 26(12), 3815; https://doi.org/10.3390/s26123815 - 16 Jun 2026

Viewed by 286

Abstract

Recursive Plots (RPs) can fully utilize the information of signals on a time scale, but their application involves the issue of manual threshold selection, and different thresholds have a significant impact on the analysis results of recursive plots, which in turn affects the [...] Read more.

Recursive Plots (RPs) can fully utilize the information of signals on a time scale, but their application involves the issue of manual threshold selection, and different thresholds have a significant impact on the analysis results of recursive plots, which in turn affects the accuracy of subsequent fault diagnosis models. Some scholars have proposed the no-threshold recursive plot method to address the above issues, but this method is not comprehensive enough and has limitations. On the basis of RPs, this article proposes a Threshold-Free Recurrence Distance (TFRD), which is combined with a Convolutional Neural Network (CNN) to form a TFRD-CNN rotating machinery fault diagnosis model. The accuracy of the method is tested using bearing vibration data from Western Reserve University, and the effectiveness of the model is verified using a planetary gearbox gear fault dataset. At the same time, the TFRD-CNN method is compared with a Markov Transition Field (MTF), Gramian Angular Fields (GAF), and RP and URP combined with CNN methods. The results show that the TFRD-CNN method has significant advantages. Full article

(This article belongs to the Section Fault Diagnosis & Sensors)

► Show Figures

Figure 1

28 pages, 6635 KB

Open AccessArticle

Advanced Fault Detection of Permanent Magnet Faults in Offshore Wind Turbine Generators Using Finite Element Analysis and Deep Transfer Learning

by Hüseyin Tayyer Canseven, Mustafa Ercire, Merve Cömert, Abdurrahman Ünsal and Nur Sarma

Machines 2026, 14(6), 665; https://doi.org/10.3390/machines14060665 - 8 Jun 2026

Viewed by 228

Abstract

As the offshore wind industry scales toward 15 MW capacity, the reliability of Direct-Drive Permanent Magnet Synchronous Generators (DD-PMSGs) becomes critical. However, real-world run-to-failure data for these massive, multi-pole machines is virtually non-existent, creating a barrier for developing effective data-driven diagnostic systems. This [...] Read more.

As the offshore wind industry scales toward 15 MW capacity, the reliability of Direct-Drive Permanent Magnet Synchronous Generators (DD-PMSGs) becomes critical. However, real-world run-to-failure data for these massive, multi-pole machines is virtually non-existent, creating a barrier for developing effective data-driven diagnostic systems. This study proposes a high-fidelity framework for detecting permanent magnet faults in the International Energy Agency (IEA) 15 MW Reference Wind Turbine. Using Finite Element Analysis (FEA), a dataset (magnetic flux and back electromotive-force (EMF)) capturing the electromagnetic signatures of healthy and faulty states of a PMSG under varying severities is generated. To improve the power of computer vision, 1D time-series signals were transformed into 2D images. Specifically, Gramian Angular Fields (GAFs) and Recurrence Plots (RPs) were applied to magnetic flux density signals, while Markov Transition Fields (MTFs) were applied to back-EMF signals. These representations were then fused into multi-channel Red-Green-Blue (RGB) images and processed via a ResNet-18 Deep Transfer Learning model using a strictly non-overlapping, leakage-free dataset partitioning strategy. The proposed framework achieved a classification accuracy of 99.45% on noise-free data. Furthermore, robustness testing under varying levels of Additive White Gaussian Noise (AWGN) (30 dB, 40 dB, and 50 dB Signal-to-Noise Ratio (SNR)) demonstrated sustained high performance, maintaining over 90% accuracy even under severe 30 dB noise conditions. Comparative analysis proved that this multi-channel fusion significantly outperforms single-channel encoding methods, which collapse under heavy noise, validating the scalability of the framework and applicability for next-generation condition monitoring in harsh offshore environments. Full article

(This article belongs to the Special Issue Digital Twins and Advanced Fault Modeling in the Condition Monitoring of Electric Machines)

► Show Figures

Figure 1

19 pages, 1657 KB

Open AccessArticle

End-to-End Learnable Recurrence Plot for Sleep Stage Classification Using Non-Contact Ballistocardiography

by Jiseong Jeong and Sunyong Yoo

Electronics 2026, 15(9), 1798; https://doi.org/10.3390/electronics15091798 - 23 Apr 2026

Viewed by 411

Abstract

Accurate sleep stage classification is essential for evaluating sleep quality, yet clinical polysomnography is impractical for continuous home-based monitoring. Ballistocardiography (BCG) enables unobtrusive sleep monitoring through sensors embedded in sleep furniture; however, existing BCG-based approaches either rely on complex physiological feature extraction or [...] Read more.

Accurate sleep stage classification is essential for evaluating sleep quality, yet clinical polysomnography is impractical for continuous home-based monitoring. Ballistocardiography (BCG) enables unobtrusive sleep monitoring through sensors embedded in sleep furniture; however, existing BCG-based approaches either rely on complex physiological feature extraction or employ fixed-parameter signal-to-image transformations that cannot adapt to inter-subject variability. This study proposes a learnable recurrence plot (RP) framework for three-stage sleep classification (Wake, NREM, REM) from single-channel BCG signals. The Learnable RP introduces three innovations: multi-scale phase-space reconstruction at physiologically motivated time delays (τ = 5, 10, 20), differentiable per-scale thresholds optimized end-to-end, and attention-based spatial fusion of multi-scale recurrence maps. The framework was evaluated through 10-fold stratified cross-validation across six backbone architectures using 50 overnight recordings. The Learnable RP consistently outperformed four baseline transformation methods (GAF, MTF, Classical RP, Modified RP), achieving an aggregate mean accuracy of 73.60%, with EfficientNet-B5 reaching 78.91%. and 78.91%. Statistical validation across all 24 pairwise comparisons (4 baselines × 6 backbones) confirmed consistent superiority (all p < 0.001). The proposed framework achieves competitive performance without explicit physiological feature engineering, offering a viable path toward end-to-end unobtrusive sleep monitoring. Full article

(This article belongs to the Section Bioelectronics)

► Show Figures

Figure 1

14 pages, 2371 KB

Open AccessArticle

Multimodal Phase-Space Dynamics Fusion for Robust Ischemia Screening: An Edge-AI Paradigm with SERF Magnetocardiography

by Keyi Li, Xiangyang Zhou, Yifan Jia, Ruizhe Wang, Yidi Cao, Jiaojiao Pang, Rui Shang, Yadan Zhang, Yangyang Cui, Dong Xu and Min Xiang

Biosensors 2026, 16(4), 228; https://doi.org/10.3390/bios16040228 - 20 Apr 2026

Viewed by 855

Abstract

Background: Myocardial ischemia (MI) is a major cause of morbidity and mortality worldwide and requires timely and reliable detection. Although Spin-Exchange Relaxation-Free (SERF) magnetocardiography (MCG) provides femtotesla-level sensitivity for identifying non-linear cardiac repolarization anomalies, its clinical deployment is currently impeded by the computational [...] Read more.

Background: Myocardial ischemia (MI) is a major cause of morbidity and mortality worldwide and requires timely and reliable detection. Although Spin-Exchange Relaxation-Free (SERF) magnetocardiography (MCG) provides femtotesla-level sensitivity for identifying non-linear cardiac repolarization anomalies, its clinical deployment is currently impeded by the computational bottlenecks inherent to portable edge platforms. Methods: We propose a “Sensor-to-Image” Edge-AI framework that links quantum sensing with computer vision. Single-channel SERF-MCG signals from a large cohort of 2118 subjects (1135 Healthy, 983 Ischemia) were transformed into phase-space images using three distinct encoding modalities: Recurrence Plots (RP), Gramian Angular Summation Fields (GASF), and Markov Transition Fields (MTF). These visual representations were subsequently analyzed by a streamlined MobileNetV3-Small architecture, optimized for low-latency inference. To maximize diagnostic precision, an adaptive weighted fusion mechanism was engineered to combine the chaotic specificity captured by RP with the morphological sensitivity of GASF through a validation-optimized fixed global weighting strategy. Results: In our experiments, the fusion model achieved an Area Under the Curve (AUC) of 0.865, which was higher than the 1D-CNN baseline (AUC 0.857) and the single-modality models. Notably, the fusion strategy significantly elevated sensitivity to 88.3% while maintaining a specificity of 66.5%. Although specificity is moderate, this trade-off prioritizes high sensitivity to minimize false negatives in pre-hospital screening scenarios. The average inference time was 4.7 ms per sample on a standard CPU, suggesting suitability for real-time Point-of-Care (PoC) scenarios under further on-device validation. Conclusions: The results suggest that multi-view phase-space fusion can capture subtle spatio-temporal changes associated with ischemia. The proposed lightweight framework may support the development of portable SERF-MCG systems with embedded AI screening. Full article

(This article belongs to the Section Biosensor and Bioelectronic Devices)

► Show Figures

Figure 1

32 pages, 5547 KB

Open AccessArticle

GMRVGG: A Bearing Fault Diagnosis Method Based on Tri-Modal Image Feature Fusion

by Ao Li, Yuantao Li, Xiaoli Wang and Jiancheng Yin

Sensors 2026, 26(8), 2426; https://doi.org/10.3390/s26082426 - 15 Apr 2026

Viewed by 354

Abstract

Bearings serve as vital components in rotating machinery. Fault diagnosis of bearings constitutes an essential area within mechanical health monitoring. However, most existing methods rely solely on single-modal data or employ a single signal-to-image conversion technique, leading to insufficient information dimensionality and inadequate [...] Read more.

Bearings serve as vital components in rotating machinery. Fault diagnosis of bearings constitutes an essential area within mechanical health monitoring. However, most existing methods rely solely on single-modal data or employ a single signal-to-image conversion technique, leading to insufficient information dimensionality and inadequate feature representation, which ultimately limits diagnostic accuracy. To address these challenges, this paper proposes a bearing fault diagnosis method (GADF-MTF-RP-VGG16, GMRVGG) based on tri-modal image feature fusion. Specifically, three image conversion techniques—Gramian Angular Difference Field (GADF), Markov Transition Field (MTF), and Recurrence Plot (RP)—are utilized to first convert 1D vibration signals into 2D images. Subsequently, shallow to deep features are extracted and fused through the VGG16 backbone network. Finally, fault diagnosis is achieved by integrating a fully connected classifier layer. The proposed methodology was comprehensively validated on both the Case Western Reserve University (CWRU) and the University of Ottawa datasets, which were augmented with severe 6 dB Gaussian white noise and 6 dB pink noise to simulate complex industrial environments. Under these harsh conditions, the proposed method achieved superior overall accuracies (up to 96.9% on the CWRU dataset and consistently 95.8% on the Ottawa dataset), significantly surpassing conventional single-modal approaches. This effectively addresses the limitations of insufficient feature dimensionality and inadequate representation, establishing a highly reliable and robust solution for intelligent bearing fault diagnosis. Full article

(This article belongs to the Special Issue Advances in Bearing Fault Diagnosis Using Single Sensor Techniques and Sensor Fusion Approaches)

► Show Figures

Figure 1

29 pages, 7368 KB

Open AccessArticle

Method for Emotion Recognition of EEG Signals Based on Recursive Graph and Spatiotemporal Attention Mechanism

by Dong Huang, Lin Xu and Yuwen Li

Brain Sci. 2026, 16(4), 377; https://doi.org/10.3390/brainsci16040377 - 30 Mar 2026

Viewed by 712

Abstract

Emotion recognition plays a crucial role in human–computer interaction and mental health applications. Traditional Electroencephalogram (EEG)-based emotion recognition methods are limited in classification accuracy due to their neglect of the spatiotemporal characteristics of the signals and individual differences. This study proposes a novel [...] Read more.

Emotion recognition plays a crucial role in human–computer interaction and mental health applications. Traditional Electroencephalogram (EEG)-based emotion recognition methods are limited in classification accuracy due to their neglect of the spatiotemporal characteristics of the signals and individual differences. This study proposes a novel EEG emotion recognition framework that integrates spatiotemporal features to enhance performance through the following innovations: (1) the use of a Recurrence Plot (RP) to transform one-dimensional EEG signals into two-dimensional images, enhancing the representation of nonlinear dynamic features; (2) the design of a Spatiotemporal Channel Attention Module (TCSA), which combines temporal convolution, channel, and spatial attention mechanisms to optimize the capture of complex patterns; and (3) the integration of the lightweight and efficient network Efficientnet to construct the TCSA-Efficientnet classification model. On the Database for Emotion Analysis using Physiological Signals (DEAP) dataset, the proposed method achieves accuracy rates of 99.11% and 99.33% for valence and arousal classification tasks, respectively. On the Database for Emotion Recognition Using EEG and Physiological Signals (DREAMER) dataset, the method achieves accuracy rates of 98.08% and 97.49%, outperforming other EEG-based emotion classification models on both datasets. This demonstrates its advantages in accuracy, robustness, and generalization. Full article

(This article belongs to the Section Computational Neuroscience, Neuroinformatics, and Neurocomputing)

► Show Figures

Figure 1

15 pages, 2424 KB

Open AccessArticle

Harnessing Symmetry in Recurrence Plots: A Multi-Scale Detail Boosting Approach for Time Series Similarity Measurement

by Jiancheng Yin, Xuye Zhuang, Wentao Sui and Yunlong Sheng

Symmetry 2026, 18(2), 290; https://doi.org/10.3390/sym18020290 - 4 Feb 2026

Viewed by 389

Abstract

Time series similarity measurement is a fundamental task underpinning clustering, classification, and anomaly detection. Traditional approaches predominantly rely on one-dimensional data representations, which often fail to capture complex structural dependencies. To address this limitation, this paper proposes a novel similarity measurement framework based [...] Read more.

Time series similarity measurement is a fundamental task underpinning clustering, classification, and anomaly detection. Traditional approaches predominantly rely on one-dimensional data representations, which often fail to capture complex structural dependencies. To address this limitation, this paper proposes a novel similarity measurement framework based on two-dimensional image enhancement. The method initially transforms one-dimensional time series into recurrence plots (RPs), converting temporal dynamics into visually symmetric textures, enhancing the temporal information of the one-dimensional time series. To overcome the potential blurring of fine-grained information during transformation, multi-scale detail boosting (MSDB) is introduced to amplify the high-frequency components and textural details of the RP images. Subsequently, a pre-trained ResNet-18 network is utilized to extract deep visual features from the enhanced images, and the similarity is quantified using the Euclidean distance of these feature vectors. Extensive experiments on the UCR Time Series Classification Archive demonstrate that the proposed method effectively leverages image enhancement to reveal latent temporal patterns. This approach leverages the inherent symmetry properties embedded in recurrence plots. By enhancing the texture of these symmetrical structures, the proposed method provides a more robust and informative basis for similarity assessment. Full article

(This article belongs to the Section Mathematics)

► Show Figures

Figure 1

23 pages, 7288 KB

Open AccessArticle

ECA-RepNet: A Lightweight Coal–Rock Recognition Network Using Recurrence Plot Transformation

by Jianping Zhou, Zhixin Jin, Hongwei Wang, Wenyan Cao, Xipeng Gu, Qingyu Kong, Jianzhong Li and Zeping Liu

Information 2026, 17(2), 140; https://doi.org/10.3390/info17020140 - 1 Feb 2026

Viewed by 505

Abstract

Coal and rock recognition is one of the key technologies in mining production, but traditional methods have limitations such as single-feature representation dimension, insufficient robustness, and unbalanced performance in lightweight design under noise interference and complex feature conditions. To address these issues, an [...] Read more.

Coal and rock recognition is one of the key technologies in mining production, but traditional methods have limitations such as single-feature representation dimension, insufficient robustness, and unbalanced performance in lightweight design under noise interference and complex feature conditions. To address these issues, an Efficient Channel Attention Reparameterized Network (ECA-RepNet) based on recurrence plot and Efficient Channel Attention mechanism is proposed. The one-dimensional vibration signal is mapped to the two-dimensional image space through a recurrence plot (RP), which retains the dynamic characteristics of the time series while capturing the complex patterns in the signal. Multi-scale feature extraction and lightweight design are achieved through the reparameterized large kernel block (RepLK Block) and the depthwise separable convolution (DSConv) module. The ECA module is introduced to embed multiple convolutional layers. Through global average pooling, one-dimensional convolution, and dynamic weight allocation, the modeling ability of inter-channel dependencies is enhanced, the model robustness is improved, and the computational overhead is reduced. Experimental results demonstrate that the ECA-RepNet model achieves 97.33% accuracy, outperforming classic models including ResNet, CNN, and MobileNet in parameter efficiency, training time, and inference speed. Full article

► Show Figures

Graphical abstract

18 pages, 1839 KB

Open AccessArticle

Classification of Heart Sound Recordings (PCG) via Recurrence Plot-Derived Features and Machine Learning Techniques

by Abdulmajeed M. Almosained, Turky N. Alotaiby, Rawad A. Alqahtani and Hanan S. Murayshid

Electronics 2026, 15(3), 601; https://doi.org/10.3390/electronics15030601 - 29 Jan 2026

Cited by 2 | Viewed by 1122

Abstract

Early and reliable detection of cardiac disease is crucial for preventing complications and enhancing patient outcomes. Phonocardiogram (PCG) signals, which encode rich information about cardiac function, offer a non-invasive and cost-effective way to identify abnormalities such as valvular disorders, arrhythmias, and other heart [...] Read more.

Early and reliable detection of cardiac disease is crucial for preventing complications and enhancing patient outcomes. Phonocardiogram (PCG) signals, which encode rich information about cardiac function, offer a non-invasive and cost-effective way to identify abnormalities such as valvular disorders, arrhythmias, and other heart pathologies. This study investigates advanced diagnostic methods for heart sound analysis to improve the detection and classification of cardiac abnormalities. In the proposed framework, recurrence plots (RPs) are used for feature extraction, while machine learning algorithms are applied for classification, creating a diagnostic model that can recognize cardiac conditions from composite acoustic signals. This method serves as an efficient alternative to more computationally intensive deep learning methods and other high-dimensional ML-based solutions. Experimental results demonstrate that the multiclass classification task achieves up to 98.4% accuracy, and the binary classification reaches 99.5% accuracy using 2 s signal segments. The techniques assessed in this research demonstrate the potential of automated heart sound analysis as a screening tool in both clinical and remote healthcare settings. Overall, the findings highlight the significance of machine learning in heart sound classification and its potential to facilitate timely, accessible, and cost-effective cardiovascular care. Full article

(This article belongs to the Section Artificial Intelligence)

► Show Figures

Figure 1

22 pages, 5092 KB

Open AccessArticle

Fault Diagnosis Method for Excitation Dry-Type Transformer Based on Multi-Channel Vibration Signal and Visual Feature Fusion

by Yang Liu, Mingtao Yu, Jingang Wang, Peng Bao, Weiguo Zu, Yinglong Deng, Shiyi Chen, Lijiang Ma, Pengcheng Zhao and Jinyao Dou

Sensors 2025, 25(24), 7460; https://doi.org/10.3390/s25247460 - 8 Dec 2025

Cited by 2 | Viewed by 1013

Abstract

To address the limitations of existing fault diagnosis methods for excitation dry-type transformers, such as inadequate utilization of multi-axis vibration data, low recognition accuracy under complex operational conditions, and limited computational efficiency, this paper presents a lightweight fault diagnosis approach based on the [...] Read more.

To address the limitations of existing fault diagnosis methods for excitation dry-type transformers, such as inadequate utilization of multi-axis vibration data, low recognition accuracy under complex operational conditions, and limited computational efficiency, this paper presents a lightweight fault diagnosis approach based on the fusion of multi-channel vibration signals and visual features. Initially, a multi-physics field coupling simulation model of the excitation dry-type transformer is developed. Vibration data collected from field-installed three-axis sensors are combined to generate typical fault samples, including normal operation, winding looseness, core looseness, and winding eccentricity. Due to the high dimensionality of vibration signals, the Symmetrized Dot Pattern (ISDP) method is extended to aggregate and map time- and frequency-domain information from the x-, y-, and z-axes into a two-dimensional feature map. To optimize the inter-class separability and intra-class consistency of the map, Particle Swarm Optimization (PSO) is employed to adaptively adjust the angle gain factor (η) and time delay coefficient (t). Keypoint descriptors are then extracted from the map using the Oriented FAST and Rotated BRIEF (ORB) feature extraction operator, which improves computational efficiency while maintaining sensitivity to local details. Finally, an efficient fault classification model is constructed using an Adaptive Boosting Support Vector Machine (Adaboost-SVM) to achieve robust fault mode recognition across multiple operating conditions. Experimental results demonstrate that the proposed method achieves a fault diagnosis accuracy of 94.00%, outperforming signal-to-image techniques such as Gramian Angular Field (GAF), Recurrence Plot (RP), and Markov Transition Field (MTF), as well as deep learning models based on Convolutional Neural Networks (CNN) in both training and testing time. Additionally, the method exhibits superior stability and robustness in repeated trials. This approach is well-suited for online monitoring and rapid diagnosis in resource-constrained environments, offering significant engineering value in enhancing the operational safety and reliability of excitation dry-type transformers. Full article

(This article belongs to the Collection Sensors and Sensing Technology for Industry 4.0)

► Show Figures

Figure 1

23 pages, 24357 KB

Open AccessArticle

Time Series-to-Image Encoding for Classification Using Convolutional Neural Networks: A Novel and Robust Approach

by Hammoud Al Joumaa, Loui Al-Shrouf and Mohieddine Jelali

Mach. Learn. Knowl. Extr. 2025, 7(4), 155; https://doi.org/10.3390/make7040155 - 28 Nov 2025

Cited by 2 | Viewed by 2867

Abstract

In recent decades, data collection technologies have evolved to facilitate the monitoring and improvement of numerous activities and processes in everyday human life. Their evolution is propelled by the advancement of artificial intelligence (AI), which aims to emulate human intelligence in the execution [...] Read more.

In recent decades, data collection technologies have evolved to facilitate the monitoring and improvement of numerous activities and processes in everyday human life. Their evolution is propelled by the advancement of artificial intelligence (AI), which aims to emulate human intelligence in the execution of related tasks. The remarkable success of deep learning (DL) and computer vision (CV) on image data prompted researchers to consider its application to time series and multivariate data. In this context, time series imaging has been identified as the research field for the transformation of time series data (a one-dimensional data format) into images (a two-dimensional data format). These data can be the variables or features of a system or phenomenon under consideration. State-of-the-art techniques for time series imaging include recurrence plot (RP), Gramian angular field (GAF), and Markov transition field (MTF). This paper proposes a novel, robust, and simple technique of time series imaging using Grayscale Fingerprint Features Field Imaging (G3FI). This novel technique is distinguished by the low resolution of the resulting image and the simplicity of the transformation procedure. The efficacy of the novel and state-of-the-art techniques for enhancing the performance of CNN-based classification models on time series datasets is thoroughly examined and compared. Full article

► Show Figures

Figure 1

29 pages, 7050 KB

Open AccessArticle

Mechanical Fault Diagnosis Method of Disconnector Based on Parallel Dual-Channel Model of Feature Fusion

by Chi Zhang, Hongzhong Ma and Tianyu Hu

Sensors 2025, 25(22), 6933; https://doi.org/10.3390/s25226933 - 13 Nov 2025

Cited by 1 | Viewed by 877

Abstract

Mechanical fault samples of disconnectors are scarce, the fault types vary, and the self-evidence is weak, which leads to a lack of perfect fault diagnosis methods, and hidden defects cannot be found in time. To solve this problem, a mechanical fault diagnosis method [...] Read more.

Mechanical fault samples of disconnectors are scarce, the fault types vary, and the self-evidence is weak, which leads to a lack of perfect fault diagnosis methods, and hidden defects cannot be found in time. To solve this problem, a mechanical fault diagnosis method for disconnectors based on a parallel dual-channel feature fusion model is proposed. Firstly, the optimal parameters for variational mode decomposition (VMD) are obtained using the black-winged kite algorithm (BKA). After the signal decomposition, the kurtosis values of each intrinsic mode function (IMF) are calculated, screened, and reconstructed. The reconstructed signal is input into the gated recurrent unit (GRU) to capture its time-series characteristics. Then, the vibration signal is generated by the recurrence plot (RP) to generate the atlas set and input into the vision Transformer (ViT) to extract its spatial characteristics. Finally, the time-series and spatial characteristics are fused, the multi-head self-attention mechanism is used for training, and softmax is used for fault classification. The measured data results show that the diagnostic accuracy of the model for mechanical fault types reaches 97.9%, which is 3.2%, 4.3%, 1.0%, 2.4%, 2.9%, 1.8%, 2.1%, 9%, and 7.5% higher than the other nine models numbered #2–#10, respectively, verifying its effectiveness and adaptability. Full article

(This article belongs to the Section Fault Diagnosis & Sensors)

► Show Figures

Figure 1

20 pages, 918 KB

Open AccessArticle

MVIB-Lip: Multi-View Information Bottleneck for Visual Speech Recognition via Time Series Modeling

by Yuzhe Li, Haocheng Sun, Jiayi Cai and Jin Wu

Entropy 2025, 27(11), 1121; https://doi.org/10.3390/e27111121 - 31 Oct 2025

Viewed by 1432

Abstract

Lipreading, or visual speech recognition, is the task of interpreting utterances solely from visual cues of lip movements. While early approaches relied on Hidden Markov Models (HMMs) and handcrafted spatiotemporal descriptors, recent advances in deep learning have enabled end-to-end recognition using large-scale datasets. [...] Read more.

Lipreading, or visual speech recognition, is the task of interpreting utterances solely from visual cues of lip movements. While early approaches relied on Hidden Markov Models (HMMs) and handcrafted spatiotemporal descriptors, recent advances in deep learning have enabled end-to-end recognition using large-scale datasets. However, such methods often require millions of labeled or pretraining samples and struggle to generalize under low-resource or speaker-independent conditions. In this work, we revisit lipreading from a multi-view learning perspective. We introduce MVIB-Lip, a framework that integrates two complementary representations of lip movements: (i) raw landmark trajectories modeled as multivariate time series, and (ii) recurrence plot (RP) images that encode structural dynamics in a texture form. A Transformer encoder processes the temporal sequences, while a ResNet-18 extracts features from RPs; the two views are fused via a product-of-experts posterior regularized by the multi-view information bottleneck. Experiments on the OuluVS and a self-collected dataset demonstrate that MVIB-Lip consistently outperforms handcrafted baselines and improves generalization to speaker-independent recognition. Our results suggest that recurrence plots, when coupled with deep multi-view learning, offer a principled and data-efficient path forward for robust visual speech recognition. Full article

(This article belongs to the Special Issue The Information Bottleneck Method: Theory and Applications)

► Show Figures

Figure 1

19 pages, 5198 KB

Open AccessArticle

Research on a Fault Diagnosis Method for Rolling Bearings Based on the Fusion of PSR-CRP and DenseNet

by Beining Cui, Zhaobin Tan, Yuhang Gao, Xinyu Wang and Lv Xiao

Processes 2025, 13(8), 2372; https://doi.org/10.3390/pr13082372 - 25 Jul 2025

Cited by 2 | Viewed by 1208

Abstract

To address the challenges of unstable vibration signals, indistinct fault features, and difficulties in feature extraction during rolling bearing operation, this paper presents a novel fault diagnosis method based on the fusion of PSR-CRP and DenseNet. The Phase Space Reconstruction (PSR) method transforms [...] Read more.

To address the challenges of unstable vibration signals, indistinct fault features, and difficulties in feature extraction during rolling bearing operation, this paper presents a novel fault diagnosis method based on the fusion of PSR-CRP and DenseNet. The Phase Space Reconstruction (PSR) method transforms one-dimensional bearing vibration data into a three-dimensional space. Euclidean distances between phase points are calculated and mapped into a Color Recurrence Plot (CRP) to represent the bearings’ operational state. This approach effectively reduces feature extraction ambiguity compared to RP, GAF, and MTF methods. Fault features are extracted and classified using DenseNet’s densely connected topology. Compared with CNN and ViT models, DenseNet improves diagnostic accuracy by reusing limited features across multiple dimensions. The training set accuracy was 99.82% and 99.90%, while the test set accuracy is 97.03% and 95.08% for the CWRU and JNU datasets under five-fold cross-validation; F1 scores were 0.9739 and 0.9537, respectively. This method achieves highly accurate diagnosis under conditions of non-smooth signals and inconspicuous fault characteristics and is applicable to fault diagnosis scenarios for precision components in aerospace, military systems, robotics, and related fields. Full article

(This article belongs to the Section Process Control, Modeling and Optimization)

► Show Figures

Figure 1

21 pages, 1871 KB

Open AccessArticle

Fusion of Recurrence Plots and Gramian Angular Fields with Bayesian Optimization for Enhanced Time-Series Classification

by Maria Mariani, Prince Appiah and Osei Tweneboah

Axioms 2025, 14(7), 528; https://doi.org/10.3390/axioms14070528 - 10 Jul 2025

Cited by 6 | Viewed by 4825

Abstract

Time-series classification remains a critical task across various domains, demanding models that effectively capture both local recurrence structures and global temporal dependencies. We introduce a novel framework that transforms time series into image representations by fusing recurrence plots (RPs) with both Gramian Angular [...] Read more.

Time-series classification remains a critical task across various domains, demanding models that effectively capture both local recurrence structures and global temporal dependencies. We introduce a novel framework that transforms time series into image representations by fusing recurrence plots (RPs) with both Gramian Angular Summation Fields (GASFs) and Gramian Angular Difference Fields (GADFs). This fusion enriches the structural encoding of temporal dynamics. To ensure optimal performance, Bayesian Optimization is employed to automatically select the ideal image resolution, eliminating the need for manual tuning. Unlike prior methods that rely on individual transformations, our approach concatenates RP, GASF, and GADF into a unified representation and generalizes to multivariate data by stacking transformation channels across sensor dimensions. Experiments on seven univariate datasets show that our method significantly outperforms traditional classifiers such as one-nearest neighbor with Dynamic Time Warping, Shapelet Transform, and RP-based convolutional neural networks. For multivariate tasks, the proposed fusion model achieves macro F1 scores of 91.55% on the UCI Human Activity Recognition dataset and 98.95% on the UCI Room Occupancy Estimation dataset, outperforming standard deep learning baselines. These results demonstrate the robustness and generalizability of our framework, establishing a new benchmark for image-based time-series classification through principled fusion and adaptive optimization. Full article

► Show Figures

Figure 1

Search Results (74)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (74)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI