Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (75)

Search Parameters:
Keywords = Markov transition fields

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
33 pages, 2048 KiB  
Article
Multimodal Hidden Markov Models for Real-Time Human Proficiency Assessment in Industry 5.0: Integrating Physiological, Behavioral, and Subjective Metrics
by Mowffq M. Alsanousi and Vittaldas V. Prabhu
Appl. Sci. 2025, 15(14), 7739; https://doi.org/10.3390/app15147739 - 10 Jul 2025
Viewed by 133
Abstract
This paper presents a Multimodal Hidden Markov Model (MHMM) framework specifically designed for real-time human proficiency assessment, integrating physiological (Heart Rate Variability (HRV)), behavioral (Task Completion Time (TCT)), and subjective (NASA Task Load Index (NASA-TLX)) data streams to infer latent human proficiency states [...] Read more.
This paper presents a Multimodal Hidden Markov Model (MHMM) framework specifically designed for real-time human proficiency assessment, integrating physiological (Heart Rate Variability (HRV)), behavioral (Task Completion Time (TCT)), and subjective (NASA Task Load Index (NASA-TLX)) data streams to infer latent human proficiency states in industrial settings. Using published empirical data from the surgical training literature, a comprehensive simulation study was conducted, with the MHMM (Trained) achieving 92.5% classification accuracy, significantly outperforming unimodal Hidden Markov Model (HMM) variants 61–63.9% and demonstrating competitive performance with advanced models such as Long Short-Term Memory (LSTM) networks 90%, and Conditional Random Field (CRF) 88.5%. The framework exhibited robustness across stress-test scenarios, including sensor noise, missing data, and imbalanced class distributions. A key advantage of the MHMM over black-box approaches is its interpretability by providing quantifiable transition probabilities that reveal learning rates, forgetting patterns, and contextual influences on proficiency dynamics. The model successfully captures context-dependent effects, including task complexity and cumulative fatigue, through dynamic transition matrices. When demonstrated through simulation, this framework establishes a foundation for developing adaptive operator-AI collaboration systems in Industry 5.0 environments. The MHMM’s combination of high accuracy, robustness, and interpretability makes it a promising candidate for future empirical validation in real-world industrial, healthcare, and training applications in which it is critical to understand and support human proficiency development. Full article
(This article belongs to the Special Issue Applications of Artificial Intelligence in Industrial Engineering)
Show Figures

Figure 1

21 pages, 5444 KiB  
Article
Diagnosis of Schizophrenia Using Feature Extraction from EEG Signals Based on Markov Transition Fields and Deep Learning
by Alka Jalan, Deepti Mishra, Marisha and Manjari Gupta
Biomimetics 2025, 10(7), 449; https://doi.org/10.3390/biomimetics10070449 - 7 Jul 2025
Viewed by 437
Abstract
Diagnosing schizophrenia using Electroencephalograph (EEG) signals is a challenging task due to the subtle and overlapping differences between patients and healthy individuals. To overcome this difficulty, deep learning has shown strong potential, especially given its success in image recognition tasks. In many studies, [...] Read more.
Diagnosing schizophrenia using Electroencephalograph (EEG) signals is a challenging task due to the subtle and overlapping differences between patients and healthy individuals. To overcome this difficulty, deep learning has shown strong potential, especially given its success in image recognition tasks. In many studies, one-dimensional EEG signals are transformed into two-dimensional representations to allow for image-based analysis. In this work, we have used the Markov Transition Field for converting EEG signals into two-dimensional images, capturing both the temporal patterns and statistical dynamics of the data. EEG signals are continuous time-series recordings from the brain, where the current state is often influenced by the immediately preceding state. This characteristic makes MTF particularly suitable for representing such data. After the transformation, a pre-trained VGG-16 model is employed to extract meaningful features from the images. The extracted features are then passed through two separate classification pipelines. The first uses a traditional machine learning model, Support Vector Machine, while the second follows a deep learning approach involving an autoencoder for feature selection and a neural network for final classification. The experiments were conducted using EEG data from the open-access Schizophrenia EEG database provided by MV Lomonosov Moscow State University. The proposed method achieved a highest classification accuracy of 98.51 percent and a recall of 100 percent across all folds using the deep learning pipeline. The Support Vector Machine pipeline also showed strong performance with a best accuracy of 96.28 percent and a recall of 97.89 percent. The proposed deep learning model represents a biomimetic approach to pattern recognition and decision-making. Full article
Show Figures

Figure 1

18 pages, 4366 KiB  
Article
sEMG-Based Gesture Recognition Using Sigimg-GADF-MTF and Multi-Stream Convolutional Neural Network
by Ming Zhang, Leyi Qu, Weibiao Wu, Gujing Han and Wenqiang Zhu
Sensors 2025, 25(11), 3506; https://doi.org/10.3390/s25113506 - 2 Jun 2025
Viewed by 511
Abstract
To comprehensively leverage the temporal, static, and dynamic information features of multi-channel surface electromyography (sEMG) signals for gesture recognition, considering the sensitive temporal characteristics of sEMG signals to action amplitude and muscle recruitment patterns, an sEMG-based gesture recognition algorithm is innovatively proposed using [...] Read more.
To comprehensively leverage the temporal, static, and dynamic information features of multi-channel surface electromyography (sEMG) signals for gesture recognition, considering the sensitive temporal characteristics of sEMG signals to action amplitude and muscle recruitment patterns, an sEMG-based gesture recognition algorithm is innovatively proposed using Sigimg-GADF-MTF and multi-stream convolutional neural network (MSCNN) by introducing the Sigimg, GADF, and MTF data processing methods and combining them with a multi-stream fusion strategy. Firstly, a sliding window is used to rearrange the multi-channel original sEMG signals through channels to generate a two-dimensional image (named Sigimg method). Meanwhile, each channel signal is respectively transformed into two-dimensional subimages using Gram angular difference field (GADF) and Markov transition field (MTF) methods. Then, the GADF and MTF images are obtained using a horizontal stitching method to splice these subimages, respectively. The Sigimg, GADF, and MTF images are used to construct a training and testing dataset, which is then imported into the constructed MSCNN model for experimental testing. The fully connected layer fusion method is utilized for multi-stream feature fusion, and the gesture recognition results are output. Through comparative experiments, an average accuracy of 88.4% is achieved using the Sigimg-GADF-MTF-MSCNN algorithm on the Ninapro DBl dataset, higher than most mainstream models. At the same time, the effectiveness of the proposed algorithm is fully verified through generalization testing of data obtained from the self-developed sEMG signal acquisition platform with an average accuracy of 82.4%. Full article
(This article belongs to the Section Biomedical Sensors)
Show Figures

Figure 1

26 pages, 20655 KiB  
Article
CEEMDAN-MRAL Transformer Vibration Signal Fault Diagnosis Method Based on FBG
by Hong Jiang, Zhichao Wang, Lina Cui and Yihan Zhao
Photonics 2025, 12(5), 468; https://doi.org/10.3390/photonics12050468 - 10 May 2025
Viewed by 369
Abstract
In order to solve the problem that the vibration signal of transformer is affected by noise and electromagnetic interference, resulting in low accuracy of fault diagnosis mode recognition, a CEEMDAN-MRAL fault diagnosis method based on Fiber Bragg Grating (FBG) was proposed to quickly [...] Read more.
In order to solve the problem that the vibration signal of transformer is affected by noise and electromagnetic interference, resulting in low accuracy of fault diagnosis mode recognition, a CEEMDAN-MRAL fault diagnosis method based on Fiber Bragg Grating (FBG) was proposed to quickly and accurately evaluate the vibration fault state of transformer.The FBG sends the wavelength change in the optical signal center caused by the vibration of the transformer to the demodulation system, which obtains the vibration signal and effectively avoids the noise influence caused by strong electromagnetic interference inside the transformer. The vibration signal is decomposed into several intrinsic mode functions (IMFs) by complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), and the wavelet threshold denoising algorithm improves the signal-to-noise ratio (SNR) to 1.6 times. The Markov transition field (MTF) is used to construct a training and test set. The unique MRAL-Net is proposed to extract the spatial features of the signal and analyze the time series dependence of the features to improve the richness of the signal feature scale. This proposed method effectively removes the noise interference. The average accuracy of fault diagnosis of the transformer winding core reaches 97.9375%, and the time taken on the large-scale complex training set is only 1705 s, which has higher diagnostic accuracy and shorter training time than other models. Full article
Show Figures

Figure 1

20 pages, 10432 KiB  
Article
Apple Watercore Grade Classification Method Based on ConvNeXt and Visible/Near-Infrared Spectroscopy
by Chunlin Zhao, Zhipeng Yin, Yushuo Tan, Wenbin Zhang, Panpan Guo, Yaxing Ma, Haijian Wu, Ding Hu and Quan Lu
Agriculture 2025, 15(7), 756; https://doi.org/10.3390/agriculture15070756 - 31 Mar 2025
Viewed by 393
Abstract
To address the issues of insufficient rigor in existing methods for quantifying apple watercore severity and the complexity and low accuracy of traditional classification models, this study proposes a method for watercore quantification and a classification model based on a deep convolutional neural [...] Read more.
To address the issues of insufficient rigor in existing methods for quantifying apple watercore severity and the complexity and low accuracy of traditional classification models, this study proposes a method for watercore quantification and a classification model based on a deep convolutional neural network. Initially, visible/near-infrared transmission spectral data of apple samples were collected. The apples were then sliced into 4.5 mm thick sections using a specialized tool, and image data of each slice were captured. Using BiSeNet and RIFE algorithms, a three-dimensional model of the watercore regions was constructed from the apple slices to calculate the watercore severity, which was subsequently categorized into five distinct levels. Next, methods such as the Gramian Angular Summation Field (GASF), Gram Angular Difference Field (GADF), and Markov Transition Field (MTF) were applied to transform the one-dimensional spectral data into two-dimensional images. These images served as input for training and prediction using the ConvNeXt deep convolutional neural network. The results indicated that the GADF method yielded the best performance, achieving a test set accuracy of 98.73%. Furthermore, the study contrasted the classification and prediction of watercore apples using traditional methods with the existing quantification approaches for watercore levels. The comparative results demonstrated that the proposed GADF-ConvNeXt model is more straightforward and efficient, achieving superior performance in classifying watercore grades. Furthermore, the newly proposed quantification method for watercore levels proved to be more effective. Full article
(This article belongs to the Section Digital Agriculture)
Show Figures

Figure 1

18 pages, 5668 KiB  
Article
Low-Voltage Series Arc Fault Detection Based on Multi-Feature Fusion and Improved Residual Network
by Haitao Wang, Juyuan Kang and Yigang Lin
Electronics 2025, 14(7), 1325; https://doi.org/10.3390/electronics14071325 - 27 Mar 2025
Viewed by 427
Abstract
Deep learning-based image classification techniques have been widely utilized in low-voltage AC series-type fault arc detection. However, the transformation of signals into images frequently leads to significant loss of current signal characteristics, thereby compromising arc recognition accuracy. Additionally, uncharacterized signal content may be [...] Read more.
Deep learning-based image classification techniques have been widely utilized in low-voltage AC series-type fault arc detection. However, the transformation of signals into images frequently leads to significant loss of current signal characteristics, thereby compromising arc recognition accuracy. Additionally, uncharacterized signal content may be lost due to multiple factors, including sensor bandwidth limitations, sensor-event distance, and the topological configuration of the circuit where the fault originated. To address this challenge, a novel framework for identifying series-type low-voltage AC fault arcs is presented, which integrates the Markov transfer field (MTF) with multi-feature fusion and an improved residual neural network (ResNet18). This approach employs fast Fourier transform (FFT) to compute magnitude and phase data and then converts the original current signals, magnitude spectrograms, and phase spectrograms into MTF images. An adaptive weighted averaging strategy is subsequently applied to fuse these MTF images, generating composite discriminative features that preserve both amplitude and phase information from the original signals. The proposed system incorporates a convolutional block-based attention mechanism (CBAM) into the ResNet18 architecture to enhance feature representation while reducing training complexity. Extensive experimental evaluations on a diverse dataset demonstrate that the developed method achieves an impressive recognition accuracy of 99.88% for series fault arcs. This result validates the effectiveness of the proposed framework in maintaining critical signal characteristics and improving detection precision compared to existing approaches. Full article
Show Figures

Figure 1

19 pages, 3428 KiB  
Article
Driver Identification System Based on a Machine Learning Operations Platform Using Controller Area Network Data
by Hyunseo Shin, Wangyu Park, Suhong Kim, Juhum Kweon and Changjoo Moon
Electronics 2025, 14(6), 1138; https://doi.org/10.3390/electronics14061138 - 14 Mar 2025
Cited by 2 | Viewed by 708
Abstract
Ensuring vehicle security and preventing unauthorized driving are critical in modern transportation. Traditional driver identification methods, such as biometric authentication, require additional hardware and may not adapt well to changing driving behaviors. This study proposes a real-time driver identification system leveraging a Machine [...] Read more.
Ensuring vehicle security and preventing unauthorized driving are critical in modern transportation. Traditional driver identification methods, such as biometric authentication, require additional hardware and may not adapt well to changing driving behaviors. This study proposes a real-time driver identification system leveraging a Machine Learning Operations (MLOps)-based platform that continuously re-trains a deep learning model using vehicle Controller Area Network (CAN) data. The system collects CAN data, converts them into Markov Transition Field (MTF) images, and classifies drivers using a ResNet-18 model deployed on the Google Cloud Platform (GCP). An automated pipeline utilizing Pub/Sub, GCP Composer, and Vertex AI ensures continuous model updates based on newly uploaded driving data. Our experimental results demonstrate that models trained only on recent data significantly outperform those incorporating historical data, highlighting the necessity of frequent retraining. The intruder detection system effectively identifies unregistered drivers, further enhancing vehicle security. By automating model retraining and deployment, this system provides an adaptive solution that accommodates evolving driving behaviors, reducing reliance on static models. These findings emphasize the importance of real-time data adaptation in driver authentication systems, contributing to enhanced vehicle security and safety. Full article
Show Figures

Figure 1

21 pages, 11183 KiB  
Article
A Three-Channel Feature Fusion Approach Using Symmetric ResNet-BiLSTM Model for Bearing Fault Diagnosis
by Yingyong Zou, Tao Liu and Xingkui Zhang
Symmetry 2025, 17(3), 427; https://doi.org/10.3390/sym17030427 - 12 Mar 2025
Cited by 1 | Viewed by 771
Abstract
For mechanical equipment to operate normally, rolling bearings—which are crucial parts of rotating machinery—need to have their faults diagnosed. This work introduces a bearing defect diagnosis technique that incorporates three-channel feature fusion and is based on enhanced Residual Networks and Bidirectional long- and [...] Read more.
For mechanical equipment to operate normally, rolling bearings—which are crucial parts of rotating machinery—need to have their faults diagnosed. This work introduces a bearing defect diagnosis technique that incorporates three-channel feature fusion and is based on enhanced Residual Networks and Bidirectional long- and short-term memory networks (ResNet-BiLSTM) model. The technique can effectively establish spatial-temporal relationships and better capture complex features in data by combining the powerful spatial feature extraction capability of ResNet and the bidirectional temporal modeling capability of BiLSTM. Specifically, the one-dimensional vibration signals are first transformed into two-dimensional images using the Continuous Wavelet Transform (CWT) and Markov Transition Field (MTF). The upgraded ResNet-BiLSTM network is then used to extract and combine the original one-dimensional vibration signal along with features from the two types of two-dimensional images. Finally, experimental validation is performed on two bearing datasets. The results show that compared with other state-of-the-art models, the computing cost is greatly reduced, with params and flops at 15.4 MB and 715.24 MB, respectively, and the running time of a single batch becomes 5.19 s. The fault diagnosis accuracy reaches 99.53% and 99.28% for the two datasets, respectively, successfully realizing the classification task. Full article
(This article belongs to the Section Engineering and Materials)
Show Figures

Figure 1

14 pages, 5701 KiB  
Article
Finite Element Modeling-Assisted Deep Subdomain Adaptation Method for Tool Condition Monitoring
by Cong Jing, Xin He, Guichang Xu, Luyang Li and Yunfeng Yao
Processes 2025, 13(2), 545; https://doi.org/10.3390/pr13020545 - 15 Feb 2025
Viewed by 525
Abstract
To reduce the experimental costs associated with tool condition monitoring (TCM) under new cutting conditions, a finite element modeling (FEM)-assisted deep subdomain adaptive network (DSAN) approach is proposed. Initially, an FEM technique is employed to construct a cutting tool model for the new [...] Read more.
To reduce the experimental costs associated with tool condition monitoring (TCM) under new cutting conditions, a finite element modeling (FEM)-assisted deep subdomain adaptive network (DSAN) approach is proposed. Initially, an FEM technique is employed to construct a cutting tool model for the new cutting condition (target domain), and the similarity between simulated and experimental data is assessed to obtain valid simulated samples for the target domain. Subsequently, the time–frequency Markov representation method is utilized to extract imaging features from the simulated samples, which serve as input features for the monitoring model. Then, a DSAN model is established to facilitate the transfer from simulation to reality, with the source domain comprising a simulated sample set under new cutting conditions that includes various types of tool conditions obtained through FEM, and the target domain containing only a limited number of normal tool condition samples under new cutting conditions. The application analysis has demonstrated the effectiveness of the proposed method, achieving a classification accuracy of 99%. The proposed approach can significantly reduce experimental costs and obtain high-precision diagnostics of tool conditions with a small sample size. Full article
(This article belongs to the Special Issue Transfer Learning Methods in Equipment Reliability Management)
Show Figures

Figure 1

23 pages, 29165 KiB  
Article
Parallax-Tolerant Weakly-Supervised Pixel-Wise Deep Color Correction for Image Stitching of Pinhole Camera Arrays
by Yanzheng Zhang, Kun Gao, Zhijia Yang, Chenrui Li, Mingfeng Cai, Yuexin Tian, Haobo Cheng and Zhenyu Zhu
Sensors 2025, 25(3), 732; https://doi.org/10.3390/s25030732 - 25 Jan 2025
Viewed by 695
Abstract
Camera arrays typically use image-stitching algorithms to generate wide field-of-view panoramas, but parallax and color differences caused by varying viewing angles often result in noticeable artifacts in the stitching result. However, existing solutions can only address specific color difference issues and are ineffective [...] Read more.
Camera arrays typically use image-stitching algorithms to generate wide field-of-view panoramas, but parallax and color differences caused by varying viewing angles often result in noticeable artifacts in the stitching result. However, existing solutions can only address specific color difference issues and are ineffective for pinhole images with parallax. To overcome these limitations, we propose a parallax-tolerant weakly supervised pixel-wise deep color correction framework for the image stitching of pinhole camera arrays. The total framework consists of two stages. In the first stage, based on the differences between high-dimensional feature vectors extracted by a convolutional module, a parallax-tolerant color correction network with dynamic loss weights is utilized to adaptively compensate for color differences in overlapping regions. In the second stage, we introduce a gradient-based Markov Random Field inference strategy for correction coefficients of non-overlapping regions to harmonize non-overlapping regions with overlapping regions. Additionally, we innovatively propose an evaluation metric called Color Differences Across the Seam to quantitatively measure the naturalness of transitions across the composition seam. Comparative experiments conducted on popular datasets and authentic images demonstrate that our approach outperforms existing solutions in both qualitative and quantitative evaluations, effectively eliminating visible artifacts and producing natural-looking composite images. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

21 pages, 8608 KiB  
Article
Analysis of Cardiac Arrhythmias Based on ResNet-ICBAM-2DCNN Dual-Channel Feature Fusion
by Chuanjiang Wang, Junhao Ma, Guohui Wei and Xiujuan Sun
Sensors 2025, 25(3), 661; https://doi.org/10.3390/s25030661 - 23 Jan 2025
Cited by 1 | Viewed by 1116
Abstract
Cardiovascular disease (CVD) poses a significant challenge to global health, with cardiac arrhythmia representing one of its most prevalent manifestations. The timely and precise classification of arrhythmias is critical for the effective management of CVD. This study introduces an innovative approach to enhancing [...] Read more.
Cardiovascular disease (CVD) poses a significant challenge to global health, with cardiac arrhythmia representing one of its most prevalent manifestations. The timely and precise classification of arrhythmias is critical for the effective management of CVD. This study introduces an innovative approach to enhancing arrhythmia classification accuracy through advanced Electrocardiogram (ECG) signal processing. We propose a dual-channel feature fusion strategy designed to enhance the precision and objectivity of ECG analysis. Initially, we apply an Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN) and enhanced wavelet thresholding for robust noise reduction. Subsequently, in the primary channel, region of interest features are emphasized using a ResNet-ICBAM network model for feature extraction. In parallel, the secondary channel transforms 1D ECG signals into Gram angular difference field (GADF), Markov transition field (MTF), and recurrence plot (RP) representations, which are then subjected to two-dimensional convolutional neural network (2D-CNN) feature extraction. Post-extraction, the features from both channels are fused and classified. When evaluated on the MIT-BIH database, our method achieves a classification accuracy of 97.80%. Compared to other methods, our approach of two-channel feature fusion has a significant improvement in overall performance by adding a 2D convolutional network. This methodology represents a substantial advancement in ECG signal processing, offering significant potential for clinical applications and improving patient care efficiency and accuracy. Full article
(This article belongs to the Section Biomedical Sensors)
Show Figures

Figure 1

19 pages, 5357 KiB  
Article
Planetary Gearboxes Fault Diagnosis Based on Markov Transition Fields and SE-ResNet
by Yanyan Liu, Tongxin Gao, Wenxu Wu and Yongquan Sun
Sensors 2024, 24(23), 7540; https://doi.org/10.3390/s24237540 - 26 Nov 2024
Cited by 4 | Viewed by 909
Abstract
The working conditions of planetary gearboxes are complex, and their structural couplings are strong, leading to low reliability. Traditional deep neural networks often struggle with feature learning in noisy environments, and their reliance on one-dimensional signals as input fails to capture the interrelationships [...] Read more.
The working conditions of planetary gearboxes are complex, and their structural couplings are strong, leading to low reliability. Traditional deep neural networks often struggle with feature learning in noisy environments, and their reliance on one-dimensional signals as input fails to capture the interrelationships between data points. To address these challenges, we proposed a fault diagnosis method for planetary gearboxes that integrates Markov transition fields (MTFs) and a residual attention mechanism. The MTF was employed to encode one-dimensional signals into feature maps, which were then fed into a residual networks (ResNet) architecture. To enhance the network’s ability to focus on important features, we embedded the squeeze-and-excitation (SE) channel attention mechanism into the ResNet34 network, creating a SE-ResNet model. This model was trained to effectively extract and classify features. The developed method was validated using a specific dataset and achieved an accuracy of about 98.1%. The results demonstrate the effectiveness and reliability of the developed method in diagnosing faults in planetary gearboxes under strong noise conditions. Full article
Show Figures

Figure 1

20 pages, 17284 KiB  
Article
Fault-Line Selection Method in Active Distribution Networks Based on Improved Multivariate Variational Mode Decomposition and Lightweight YOLOv10 Network
by Sizu Hou and Wenyao Wang
Energies 2024, 17(19), 4958; https://doi.org/10.3390/en17194958 - 3 Oct 2024
Cited by 3 | Viewed by 1519
Abstract
In active distribution networks (ADNs), the extensive deployment of distributed generations (DGs) heightens system nonlinearity and non-stationarity, which can weaken fault characteristics and reduce fault detection accuracy. To improve fault detection accuracy in distribution networks, a method combining improved multivariate variational mode decomposition [...] Read more.
In active distribution networks (ADNs), the extensive deployment of distributed generations (DGs) heightens system nonlinearity and non-stationarity, which can weaken fault characteristics and reduce fault detection accuracy. To improve fault detection accuracy in distribution networks, a method combining improved multivariate variational mode decomposition (IMVMD) and YOLOv10 network for active distribution network fault detection is proposed. Firstly, an MVMD method optimized by the northern goshawk optimization (NGO) algorithm named IMVMD is introduced to adaptively decompose zero-sequence currents at both ends of line sources and loads into intrinsic mode functions (IMFs). Secondly, considering the spatio-temporal correlation between line sources and loads, a dynamic time warping (DTW) algorithm is utilized to determine the optimal alignment path time series for corresponding IMFs at both ends. Then, the Markov transition field (MTF) transforms the 1D time series into 2D spatio-temporal images, and the MTF images of all lines are concatenated to obtain a comprehensive spatio-temporal feature map of the distribution network. Finally, using the spatio-temporal feature map as input, the lightweight YOLOv10 network autonomously extracts fault features to achieve precise fault-line selection. Experimental results demonstrate the robustness of the proposed method, achieving a fault detection accuracy of 99.88%, which can ensure accurate fault-line selection under complex scenarios involving simultaneous phase-to-ground faults at two points. Full article
Show Figures

Figure 1

16 pages, 6475 KiB  
Article
Exploring Inner Speech Recognition via Cross-Perception Approach in EEG and fMRI
by Jiahao Qin, Lu Zong and Feng Liu
Appl. Sci. 2024, 14(17), 7720; https://doi.org/10.3390/app14177720 - 1 Sep 2024
Cited by 3 | Viewed by 3570
Abstract
Multimodal brain signal analysis has shown great potential in decoding complex cognitive processes, particularly in the challenging task of inner speech recognition. This paper introduces an innovative I nner Speech Recognition via Cross-Perception (ISRCP) approach that significantly enhances accuracy by fusing electroencephalography (EEG) [...] Read more.
Multimodal brain signal analysis has shown great potential in decoding complex cognitive processes, particularly in the challenging task of inner speech recognition. This paper introduces an innovative I nner Speech Recognition via Cross-Perception (ISRCP) approach that significantly enhances accuracy by fusing electroencephalography (EEG) and functional magnetic resonance imaging (fMRI) data. Our approach comprises three core components: (1) multigranularity encoders that separately process EEG time series, EEG Markov Transition Fields, and fMRI spatial data; (2) a cross-perception expert structure that learns both modality-specific and shared representations; and (3) an attention-based adaptive fusion strategy that dynamically adjusts the contributions of different modalities based on task relevance. Extensive experiments on the Bimodal Dataset on Inner Speech demonstrate that our model outperforms existing methods across accuracy and F1 score. Full article
(This article belongs to the Section Electrical, Electronics and Communications Engineering)
Show Figures

Figure 1

24 pages, 2042 KiB  
Article
A Cross-Working Condition-Bearing Diagnosis Method Based on Image Fusion and a Residual Network Incorporating the Kolmogorov–Arnold Representation Theorem
by Ziyi Tang, Xinhao Hou, Xin Wang and Jifeng Zou
Appl. Sci. 2024, 14(16), 7254; https://doi.org/10.3390/app14167254 - 17 Aug 2024
Cited by 4 | Viewed by 1773
Abstract
With the optimization and advancement of industrial production and manufacturing, the application scenarios of bearings have become increasingly diverse and highly coupled. This complexity poses significant challenges for the extraction of bearing fault features, consequently affecting the accuracy of cross-condition fault diagnosis methods. [...] Read more.
With the optimization and advancement of industrial production and manufacturing, the application scenarios of bearings have become increasingly diverse and highly coupled. This complexity poses significant challenges for the extraction of bearing fault features, consequently affecting the accuracy of cross-condition fault diagnosis methods. To improve the extraction and recognition of fault features and enhance the diagnostic accuracy of models across different conditions, this paper proposes a cross-condition bearing diagnosis method. This method, named MCR-KAResNet-TLDAF, is based on image fusion and a residual network that incorporates the Kolmogorov–Arnold representation theorem. Firstly, the one-dimensional vibration signals of the bearing are processed using Markov transition field (MTF), continuous wavelet transform (CWT), and recurrence plot (RP) methods, converting the resulting images to grayscale. These grayscale images are then multiplied by corresponding coefficients and fed into the R, G, and B channels for image fusion. Subsequently, fault features are extracted using a residual network enhanced by the Kolmogorov–Arnold representation theorem. Additionally, a domain adaptation algorithm combining multiple kernel maximum mean discrepancy (MK-MMD) and conditional domain adversarial network with entropy conditioning (CDAN+E) is employed to align the source and target domains, thereby enhancing the model’s cross-condition diagnostic accuracy. The proposed method was experimentally validated on the Case Western Reserve University (CWRU) dataset and the Jiangnan University (JUN) dataset, which include the 6205-2RS JEM SKF, N205, and NU205 bearing models. The method achieved accuracy rates of 99.36% and 99.889% on the two datasets, respectively. Comparative experiments from various perspectives further confirm the superiority and effectiveness of the proposed model. Full article
Show Figures

Figure 1

Back to TopTop