Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (2,424)

Search Parameters:
Keywords = Attention Network Test

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
18 pages, 8745 KB  
Article
Automated Prostate Cancer Detection on T2-Weighted MRI Using a Dual-Stream Attention Network: A Study on Private Saudi Clinical Data and Public Benchmark Datasets
by Saeed Alqahtani, M. A. Jowhari, Yahya.Q. Sabi and Hussein Alshaari
J. Clin. Med. 2026, 15(9), 3327; https://doi.org/10.3390/jcm15093327 (registering DOI) - 27 Apr 2026
Abstract
Background: The steady rise of prostate cancer in Saudi Arabia signals a critical public health shift that requires immediate investment in early detection and prevention to mitigate a future clinical crisis. Accurate diagnosis using multiparametric MRI and PI-RADS scoring remains challenging, as interpretations [...] Read more.
Background: The steady rise of prostate cancer in Saudi Arabia signals a critical public health shift that requires immediate investment in early detection and prevention to mitigate a future clinical crisis. Accurate diagnosis using multiparametric MRI and PI-RADS scoring remains challenging, as interpretations are highly experience-dependent and subspecialized radiologists are limited. Methods: To address this gap, this study introduces a novel Dual-Stream Attention Network designed to automate the classification of low-risk (PIRADS 2-3) versus high-risk (PIRADS 4-5) lesions from T2-weighted MRI. Leveraging a ResNet50 backbone, the architecture employs parallel streams for Local and Global Feature Processing, each enhanced by a Channel-Spatial Attention module to highlight diagnostically relevant regions. These features are integrated through a Cross-Stream Fusion mechanism and a gate-controlled Adaptive Feature Fusion module to optimize multi-scale information. The model was developed and validated on a regional dataset of 3850 images from Jazan Specialist Hospital and Prince Mohammed bin Naser Hospital. This research provides a standardized, high-precision diagnostic path tailored to the Saudi Arabian population, conducted under institutional review board approval (No. 25138). Results: The proposed dual-stream attention network achieved an accuracy of 97.8% on the validation set and 96.4% on the test set, demonstrating high performance and generalization capabilities in classifying prostate lesions from Saudi patient populations. Conclusions: The proposed dual-stream architecture with novel attention and fusion mechanisms demonstrates high effectiveness for prostate cancer classification from T2-weighted MRI in Saudi clinical settings. This represents the first deep learning model specifically trained and validated on Saudi Arabian prostate MRI data, with the potential to address the shortage of specialized expertise and improve diagnostic efficiency in the Kingdom. Full article
(This article belongs to the Special Issue Prostate Cancer: Diagnosis, Clinical Management and Prognosis)
Show Figures

Figure 1

15 pages, 1007 KB  
Article
Fault Location Method for Distribution Networks Based on SimAM-GraphSAGE-GAT
by Wei Bao, Lei Wang, Wei Liu, Qilong Chen, Yanan Yang, Bingxuan Li, Kang Sun and Ming Yang
Energies 2026, 19(9), 2093; https://doi.org/10.3390/en19092093 (registering DOI) - 27 Apr 2026
Abstract
In distribution networks, traditional fault location methods have insufficient anti-interference capability and low accuracy in locating high-resistance grounding faults. To address these issues, a distribution network fault location method on the basis of SimAM-GraphSAGE-GAT is proposed. Firstly, the distribution network topology structure is [...] Read more.
In distribution networks, traditional fault location methods have insufficient anti-interference capability and low accuracy in locating high-resistance grounding faults. To address these issues, a distribution network fault location method on the basis of SimAM-GraphSAGE-GAT is proposed. Firstly, the distribution network topology structure is converted into an adjacency matrix, and the electrical parameters of the faulty line are incorporated as node features into the graph structure of the network. Subsequently, the sampling and aggregation mechanism of GraphSAGE is used for learning node representation. Features are refined using SimAM. As a parameter-free attention mechanism, SimAM improves the ability of the model to capture important fault information. Then, the multi-head attention mechanism of GAT is introduced to enhance the representation of neighborhood relationships. Finally, GraphSAGE is utilized once again for deep aggregation, with a view to localizing faults by node classification. An IEEE 33-node distribution network model is adopted to verify the effectiveness of the algorithm in the experiment. The results show that this method can maintain high positioning accuracy even under the tested conditions, such as high-resistance grounding, noise interference, and data loss. Full article
(This article belongs to the Section F1: Electrical Power System)
Show Figures

Figure 1

19 pages, 8343 KB  
Article
TAHRNet: An Improved HRNet-Based Semantic Segmentation Model for Mangrove Remote Sensing Imagery
by Haonan Lin, Dongyang Fu, Chuhong Wang, Jinjun Huang, Hanrui Wu, Yu Huang and Litian Xiong
Forests 2026, 17(5), 525; https://doi.org/10.3390/f17050525 (registering DOI) - 25 Apr 2026
Abstract
Mangrove represent vital coastal ecosystems that contribute to shoreline stabilization, ecological balance, and environmental management. Nevertheless, the precise delineation of mangrove regions using remote sensing data is often impeded by spectral similarities with intertidal mudflats and aquatic features, alongside the irregular spatial patterns [...] Read more.
Mangrove represent vital coastal ecosystems that contribute to shoreline stabilization, ecological balance, and environmental management. Nevertheless, the precise delineation of mangrove regions using remote sensing data is often impeded by spectral similarities with intertidal mudflats and aquatic features, alongside the irregular spatial patterns and intricate margins of mangrove stands. This research utilizes high-resolution Gaofen-6 (GF-6) satellite observations as the foundational data to develop Triplet Axial High-Resolution Network (TAHRNet), a semantic segmentation architecture derived from the High-Resolution Network with Object-Contextual Representations (HRNet-OCR) framework for mangrove identification. The model integrates a Triplet Attention module to facilitate cross-dimensional feature dependencies and an improved Multi-Head Sequential Axial Attention mechanism to capture long-range spatial context while maintaining structural consistency. Based on evaluations using the test dataset, TAHRNet yielded a Mean Intersection over Union (MIoU) of 92.01% and a Overall Accuracy of 96.38%. Relative to U-Net and SegFormer, the proposed approach showed MIoU improvements of 5.25% and 1.88%, with corresponding Accuracy gains of 2.68% and 0.94%. Further application to coastal mapping in Zhanjiang produced results that align with manual visual interpretation. These findings suggest that TAHRNet is a viable tool for mangrove extraction and can provide technical support for coastal monitoring and ecological analysis. Full article
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)
25 pages, 4226 KB  
Article
From Design to Acceptance: A Full-Scale Analysis of Prestressed Concrete Railway Sleepers According to EN 13230
by Łukasz Chudyba, Wit Derkowski, Tomasz Lisowicz, Łukasz Ślaga and Piotr Piech
Materials 2026, 19(9), 1753; https://doi.org/10.3390/ma19091753 (registering DOI) - 24 Apr 2026
Viewed by 83
Abstract
Prestressed concrete railway sleepers are key structural components that determine the safety, durability, and serviceability of modern railway infrastructure. This study presents a comprehensive investigation of the design, testing, and acceptance of prestressed concrete sleepers in accordance with EN 13230, with particular reference [...] Read more.
Prestressed concrete railway sleepers are key structural components that determine the safety, durability, and serviceability of modern railway infrastructure. This study presents a comprehensive investigation of the design, testing, and acceptance of prestressed concrete sleepers in accordance with EN 13230, with particular reference to the requirements applied on the Polish railway network. The analysis integrates normative provisions, analytical calculations, finite element modeling, and experimental verification, including static, dynamic, and fatigue load tests. Special attention is given to the kt coefficient, which accounts for prestress losses, fatigue degradation, and the development of concrete strength throughout the service life. This coefficient plays a critical role in the acceptance criteria for sleepers during mandatory product testing. The influence of concrete age on the variability of kt is examined, showing that the highest variability occurs within the first 180 days of curing. Full-scale laboratory tests performed on PS-94 sleepers confirm compliance with standard requirements regarding cracking loads, crack width limits, and ultimate load capacity under both exceptional and fatigue loading conditions. Numerical simulations provide additional insight into stress and displacement distributions in critical cross-sections, supporting the experimental findings. The results indicate that most of prestressing force losses occur during the early service period. This observation supports the application of age-dependent acceptance criteria, which may improve conformity assessment procedures for prestressed concrete railway sleepers in contemporary railway engineering practice. Full article
(This article belongs to the Section Construction and Building Materials)
16 pages, 4919 KB  
Article
EA-UNET: An Enhanced and Efficient Model for Left-Turn Lane
by Haowei Wang, Haixin Liu, Fei Wang, Xingbin Chen, Baogang Li and Jiang Liu
Sensors 2026, 26(9), 2642; https://doi.org/10.3390/s26092642 - 24 Apr 2026
Viewed by 114
Abstract
Left-turn lanes are critical elements of urban intersections. Accurate and efficient lane detection is essential for the safe navigation of autonomous vehicles. To address the limitations of existing semantic segmentation algorithms—specifically, inadequate detection accuracy, high computational cost, and vulnerability to environmental disturbances—we propose [...] Read more.
Left-turn lanes are critical elements of urban intersections. Accurate and efficient lane detection is essential for the safe navigation of autonomous vehicles. To address the limitations of existing semantic segmentation algorithms—specifically, inadequate detection accuracy, high computational cost, and vulnerability to environmental disturbances—we propose a lightweight deep convolutional neural network named EA-UNet. First, we replace the standard U-Net encoder with EfficientNet-B0 to enhance feature extraction efficiency. Second, we introduce a novel contextual coordination module, termed MP-ASPP, which integrates a Convolutional Block Attention Module (CBAM) to further refine attention mechanisms. Finally, a comprehensive real-world dataset was constructed by collecting videos and images of left-turn waiting areas during real-vehicle testing. Experimental results demonstrate that EA-UNet significantly outperforms the baseline U-Net and other state-of-the-art models, achieving accurate and efficient segmentation of left-turn lanes even in complex scenes. Full article
(This article belongs to the Section Vehicular Sensing)
21 pages, 2063 KB  
Article
LGA-Net: A Local–Global Aggregation Network for Point Cloud Segmentation of Sheep in Smart Livestock Farming
by Zhou Zhang, Wei Zhao, Jing Jin, Fuzhong Li and Svitlana Pavlova
Agriculture 2026, 16(9), 933; https://doi.org/10.3390/agriculture16090933 - 23 Apr 2026
Viewed by 370
Abstract
Point cloud semantic segmentation is a pivotal technology for realizing non-contact body measurement and refined management of livestock. However, processing sheep point clouds in smart livestock scenarios presents specific challenges, primarily due to non-rigid posture deformations and severe background interference. To address these [...] Read more.
Point cloud semantic segmentation is a pivotal technology for realizing non-contact body measurement and refined management of livestock. However, processing sheep point clouds in smart livestock scenarios presents specific challenges, primarily due to non-rigid posture deformations and severe background interference. To address these issues, this paper proposes a novel symmetric encoder–decoder architecture named Local–Global Aggregation Network (LGA-Net), which achieves high-precision parsing of sheep point clouds by constructing a dual-scale feature aggregation mechanism. First, a Dual Attention Aggregation (DAA) module is designed to jointly encode geometric and color features, significantly enhancing the network’s ability to capture fine-grained local boundaries, such as sheep ears and hooves. Second, a Global Semantic Relation (GSR) module is introduced, utilizing spatial occupancy ratios to establish long-range dependencies, thereby effectively resolving semantic ambiguity caused by posture variations. Furthermore, a plug-and-play Dual-domain Feature Enhancement (DFE) module is proposed. By fusing bilinear interactions between explicit 3D space and implicit feature space, the DFE module constructs a high-pass filtering mechanism to suppress low-frequency background noise. Extensive experiments on a self-constructed point cloud dataset involving two semantic classes (Sheep and Fence) demonstrate that LGA-Net achieves a mIoU of 97.3%, an OA of 99.0%, and a mAcc of 97.8%. These results indicate that the proposed method outperforms existing mainstream algorithms in both segmentation accuracy and robustness. This study not only proposes a feasible solution for precise sheep extraction under the tested experimental conditions, but also provides solid technical support for subsequent automated body measurement and behavior analysis. Full article
(This article belongs to the Section Farm Animal Production)
Show Figures

Figure 1

23 pages, 2091 KB  
Article
A Photovoltaic Power Prediction Method Based on Wavelet Convolutional Neural Networks and Improved Transformer
by Yibo Zhou, Zihang Liu, Zhen Cheng, Hanglin Mi, Zhaoyang Qin and Kangyangyong Cao
Energies 2026, 19(9), 2040; https://doi.org/10.3390/en19092040 - 23 Apr 2026
Viewed by 148
Abstract
The output power of photovoltaic (PV) systems is influenced by various environmental factors, exhibiting strong nonlinearity and non-stationarity, which poses significant challenges for accurate forecasting. To address these issues, this paper proposes a short-term PV power forecasting method based on wavelet convolutional neural [...] Read more.
The output power of photovoltaic (PV) systems is influenced by various environmental factors, exhibiting strong nonlinearity and non-stationarity, which poses significant challenges for accurate forecasting. To address these issues, this paper proposes a short-term PV power forecasting method based on wavelet convolutional neural networks and an improved Transformer. First, the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) is employed to decompose the original PV power sequence into several intrinsic mode functions (IMFs). Fuzzy entropy is then utilized to evaluate the complexity of each component, and subsequences with similar entropy values are reconstructed to reduce the non-stationarity of the original series. Subsequently, Pearson correlation coefficients and the maximal information coefficient (MIC) are applied to capture both linear and nonlinear relationships between each reconstructed component and meteorological features, enabling the selection of strongly correlated variables. On this basis, a wavelet convolutional network (WTConv) is introduced to perform multi-scale decomposition and frequency-band feature extraction on the reconstructed components by integrating wavelet transform with convolution operations, effectively expanding the receptive field and extracting deep-seated features of the sequences. Finally, an improved iTransformer model is adopted for time-series modeling, leveraging its inverted encoding structure and self-attention mechanism to fully capture long-term dependencies among multivariate variables. The proposed model is validated using actual power data from a PV plant in Ningxia, China, across four seasons. Comprehensive experiments, including ablation studies, comparative analyses, loss function convergence evaluation, and Diebold–Mariano significance tests, are conducted to thoroughly assess the model’s effectiveness and superiority. Experimental results demonstrate that the proposed model achieves excellent prediction accuracy and stability in spring, summer, autumn, and winter, showing strong potential for engineering applications. Full article
(This article belongs to the Section A2: Solar Energy and Photovoltaic Systems)
Show Figures

Figure 1

18 pages, 39608 KB  
Article
Denoising Domain Adversarial Network Based on Attention Mechanism for Motor Fault Diagnosis in Real Industrial Environment
by Linjie Jin, Zhengqing Liu, Dawei Gu, Baisong Pan, Qiucheng Wang and Mohammad Fard
Machines 2026, 14(5), 462; https://doi.org/10.3390/machines14050462 - 22 Apr 2026
Viewed by 204
Abstract
Acoustic signal-based fault diagnosis offers a promising non-contact approach for rotating machinery. However, its practical application is usually affected by environmental noise. This paper presented a Denoising Attention Domain Adversarial Network (DDAN) for the robust fault diagnosis of wheel hub motors under severe [...] Read more.
Acoustic signal-based fault diagnosis offers a promising non-contact approach for rotating machinery. However, its practical application is usually affected by environmental noise. This paper presented a Denoising Attention Domain Adversarial Network (DDAN) for the robust fault diagnosis of wheel hub motors under severe noise interference. The proposed framework consists of the following two core modules: a DenseNet-based denoising module that adaptively suppresses background noise while retaining critical fault features, and a Stacked Autoencoder Domain Adversarial Network (SADAN) that integrates channel attention, spatial attention, and multi-head self-attention (MHSA) for refined feature extraction and classification. Such a hierarchical attention mechanism facilitates effective local noise suppression and global dependency capture. Validation on a hub motor fault dataset and publicly available online dataset demonstrates that compared to existing methods, DDAN achieves superior diagnostic accuracy across various noise levels and signal-to-noise ratios, improving SNR from -15.97 dB to 1.24 dB, achieving 82.71% accuracy under low SNR condition, and reaching 84.93% and 83.75% accuracy in cross-domain generalization tests. Furthermore, the comparison of the diagnostic accuracy of audio signals from different acoustic acquisition devices further verifies the practicality and potential of the system in low-cost industrial deployment. Full article
(This article belongs to the Section Electrical Machines and Drives)
Show Figures

Figure 1

15 pages, 916 KB  
Article
Object Re-Identification Method for Air-to-Ground Targets Based on Neighborhood Feature Centralization Attention
by Tian Yao, Yong Xu, Yue Ma, Hongtao Yan, Haihang Xu and An Wang
Computation 2026, 14(5), 96; https://doi.org/10.3390/computation14050096 - 22 Apr 2026
Viewed by 149
Abstract
To address the core challenges in air-to-ground target re-identification (ReID), including network focus on invalid background information, poor adaptability to nonlinear feature distribution, and insufficient cross-domain generalization, this paper proposes a novel air-to-ground ReID framework based on Neighborhood Feature Centralization Attention (NFCA). On [...] Read more.
To address the core challenges in air-to-ground target re-identification (ReID), including network focus on invalid background information, poor adaptability to nonlinear feature distribution, and insufficient cross-domain generalization, this paper proposes a novel air-to-ground ReID framework based on Neighborhood Feature Centralization Attention (NFCA). On the basis of Coordinate Attention, the framework introduces a parameter-free Neighborhood Feature Centralization mechanism to build a lightweight attention module, which enhances cross-feature semantic interaction and suppresses background noise while retaining precise position encoding. It achieves end-to-end direct optimization of sample pair similarity through binary cross-entropy loss, eliminating the proxy task bias of traditional classification loss and adapting to the nonlinear structure of feature space. A multi-source data-driven training strategy is constructed by fusing ReID datasets and general classification datasets, which expands the coverage of feature space and narrows the distribution gap between training data and real air-to-ground scenarios without additional manual annotation. Experiments show that the proposed method achieves leading mAP values on the self-developed UAV air-to-ground dataset JC-1, the public person ReID dataset Market-1501, and the public vehicle ReID dataset VehicleID. Sufficient statistical validation, ablation experiments and cross-domain tests verify the advancement, reliability and generalization of the proposed method in complex air-to-ground scenarios. Full article
Show Figures

Figure 1

32 pages, 3915 KB  
Article
Market-Aware and Topology-Embedded Safe Reinforcement Learning for Virtual Power Plant Dispatch
by Yueping Xiang, Luoyi Li, Yanqiu Hou, Xiaoyu Dai, Wenfeng Peng, Zhuoyang Liu, Ziming Liu, Zicong Chen, Xingyu Hu and Lv He
World Electr. Veh. J. 2026, 17(4), 222; https://doi.org/10.3390/wevj17040222 - 21 Apr 2026
Viewed by 148
Abstract
To address the challenges faced by virtual power plants (VPPs) in uncertain market environments and complex distribution networks, including strong market coupling, difficulty in multi-resource coordination, and strict safety constraints, this paper proposes a Hierarchical Hybrid Intelligent Framework (H2IF). The proposed framework integrates [...] Read more.
To address the challenges faced by virtual power plants (VPPs) in uncertain market environments and complex distribution networks, including strong market coupling, difficulty in multi-resource coordination, and strict safety constraints, this paper proposes a Hierarchical Hybrid Intelligent Framework (H2IF). The proposed framework integrates a market-aware meta-game mechanism, a topology-embedded graph attention coordination method, and a risk-aware soft/hard constraint safety mechanism to achieve economically optimal dispatch of VPPs in complex dynamic scenarios. By explicitly modeling competitive market interactions, the proposed method enhances strategy robustness; by exploiting grid topology priors, it improves multi-agent coordination capability; and by combining differentiable projection with risk-constrained optimization, it jointly ensures operational safety and revenue stability. Simulation results on a modified IEEE 33-bus system demonstrate that H2IF outperforms mainstream deep reinforcement learning methods and rule-based dispatch strategies in overall performance. In the 24 × 300-step testing scenario, H2IF achieves an average single-episode operating cost of 38.23 k$, which is 28.9%, 40.4%, and 26.5% lower than those of MADDPG, SAC, and the rule-based method, respectively, while also yielding the lowest constraint violation level. Ablation studies further verify the effectiveness of each key module in improving profit, reducing operating costs, enhancing tracking performance, and strengthening safety. The results indicate that the proposed method enables coordinated optimization of economy, safety, and robustness for VPP dispatch under uncertain market and operating conditions. Full article
(This article belongs to the Section Marketing, Promotion and Socio Economics)
Show Figures

Figure 1

24 pages, 34048 KB  
Article
Unsupervised Hyperspectral Unmixing Based on Multi-Faceted Graph Representation and Curriculum Learning
by Ran Liu, Junfeng Pu, Yanru Chen, Yanling Miao, Dawei Liu and Qi Wang
Remote Sens. 2026, 18(8), 1250; https://doi.org/10.3390/rs18081250 - 21 Apr 2026
Viewed by 214
Abstract
Hyperspectral unmixing aims to estimate endmember spectra and their corresponding abundance fractions at the subpixel scale, which is a critical preprocessing step for quantitative analysis of hyperspectral remote sensing imagery. While deep learning-based methods have achieved remarkable progress, three fundamental challenges remain: (i) [...] Read more.
Hyperspectral unmixing aims to estimate endmember spectra and their corresponding abundance fractions at the subpixel scale, which is a critical preprocessing step for quantitative analysis of hyperspectral remote sensing imagery. While deep learning-based methods have achieved remarkable progress, three fundamental challenges remain: (i) reliance on a single shared spatial prior that cannot decouple the heterogeneous spatial patterns of different land covers; (ii) the lack of synergy in jointly optimizing endmember extraction and abundance estimation; (iii) the poor robustness of unsupervised training to complex mixtures, noise, and class imbalance. To address these issues, we propose a novel unsupervised unmixing framework that integrates adaptive orthogonal multi-faceted graph representation with curriculum learning. Specifically, we design an Adaptive Orthogonal Multi-Faceted Graph Generator (AOMFG) to learn a set of independent orthogonal graph structures, achieving spatially informed decoupling of land cover patterns. Then, a dual-branch collaborative optimization network is constructed: a Graph Convolutional Network (GCN) branch that incorporates the learned spatial topological priors for abundance estimation, and a 1D Convolutional Neural Network (1DCNN) branch that employs a query-attention mechanism to adaptively aggregate pure spectral features for endmember extraction. Finally, we introduce a three-stage curriculum learning strategy that progressively fine-tunes the model, which significantly enhances its performance. Extensive experiments on three widely used real-world benchmark datasets demonstrate that our proposed framework consistently outperforms state-of-the-art methods in both endmember extraction and abundance estimation accuracy. Comprehensive ablation studies, parameter sensitivity analysis, and noise robustness tests further validate the effectiveness of each core component. Full article
(This article belongs to the Section Remote Sensing Image Processing)
Show Figures

Figure 1

22 pages, 4789 KB  
Article
DTF-STCANet: A Dual Time–Frequency Swin Transformer and ConvNeXt Attention Network for Heart Sound Classification
by Mehmet Nail Bilen, Fatih Mehmet Çelik, Mehmet Ali Kobat and Fatih Demir
Diagnostics 2026, 16(8), 1234; https://doi.org/10.3390/diagnostics16081234 - 21 Apr 2026
Viewed by 211
Abstract
Background/Objectives: Cardiovascular diseases are the leading cause of death worldwide. Therefore, early diagnosis and treatment of these diseases are of critical importance. Stethoscopes are the easiest and fastest medical devices for the initial diagnosis of cardiovascular diseases. However, interpreting heart sounds requires [...] Read more.
Background/Objectives: Cardiovascular diseases are the leading cause of death worldwide. Therefore, early diagnosis and treatment of these diseases are of critical importance. Stethoscopes are the easiest and fastest medical devices for the initial diagnosis of cardiovascular diseases. However, interpreting heart sounds requires considerable expertise. The use of artificial intelligence in healthcare for decision support has increased and become popular recently. Methods: The popular 2016 PhysioNet/CinC Challenge dataset, consisting of phonocardiogram (PCG) signals, was used to implement the proposed approach. Spectrogram and continuous wavelet transform (CWT) images of the PCG signals were first generated. This increased the distinguishability of the data in terms of both time and frequency components. These two-input images were tested on the developed Dual Time–Frequency Swin Transformer–ConvNeXt Attention Network (DTF-STCANet) model. To further improve classification accuracy, the Weighted KNN algorithm was preferred during the classification phase. Results: With the proposed approach, a 99.29% classification accuracy was achieved. Performance was compared with other state-of-the-art models. Conclusions: The proposed approach, through the integration of PCG signals with artificial intelligence, further strengthens the concept of early diagnosis of heart disease. Full article
(This article belongs to the Special Issue Artificial Intelligence in Biomedical Diagnostics and Analysis 2025)
Show Figures

Figure 1

22 pages, 21234 KB  
Article
MFAFENet: A Multi-Sensor Collaborative and Multi-Scale Feature Information Adaptive Fusion Network for Spindle Rotational Error Classification in CNC Machine Tools
by Fei Wang, Lin Song, Pengfei Wang, Ping Deng and Tianwei Lan
Entropy 2026, 28(4), 475; https://doi.org/10.3390/e28040475 - 20 Apr 2026
Viewed by 142
Abstract
Accurate classification of spindle rotational errors is critical for ensuring machining precision and operational reliability of CNC machine tools. However, existing methods face challenges in extracting discriminative feature information from vibration signals due to small inter-class differences and complex electromechanical interference. This paper [...] Read more.
Accurate classification of spindle rotational errors is critical for ensuring machining precision and operational reliability of CNC machine tools. However, existing methods face challenges in extracting discriminative feature information from vibration signals due to small inter-class differences and complex electromechanical interference. This paper proposes a novel deep learning model, MFAFENet, based on multi-sensor collaboration and multi-scale feature information adaptive fusion. Vibration signals from three mounting positions are transformed into time-frequency information representations via Short-time Fourier Transform. The proposed network adaptively fuses multi-scale feature information from parallel branches with different kernel sizes through a branch attention mechanism. An efficient channel attention module is then incorporated to recalibrate channel-wise feature responses. The cross-entropy loss function is employed to optimize the network parameters during training. Experiments on a spindle reliability test bench demonstrate that MFAFENet achieves 93.37% average test accuracy, outperforming other comparative methods. Ablation and comparative studies confirm the effectiveness of each module and the clear advantage of adaptive fusion over fixed-weight multi-scale methods. Multi-sensor fusion further improves accuracy by 7.23% over the best single-sensor setup. The proposed method establishes an effective end-to-end mapping between vibration signals and rotational errors, providing a promising solution for high-precision spindle condition monitoring. Full article
(This article belongs to the Section Multidisciplinary Applications)
Show Figures

Figure 1

16 pages, 8390 KB  
Article
An Adaptive Deep Learning Framework for Multi-Label Chest X-Ray Diagnosis Using a Hybrid CNN–Transformer Architecture and Class-Wise Ensemble Fusion
by Chi-Feng Hsieh, Hsu-Hsia Peng, Yu-Hsiang Tsai, Chia-Ching Chang, Cheng-Hsuan Juan, Hsian-He Hsu and Chun-Jung Juan
Diagnostics 2026, 16(8), 1227; https://doi.org/10.3390/diagnostics16081227 - 20 Apr 2026
Viewed by 239
Abstract
Background/Objectives: To develop and externally evaluate a deep learning framework for multi-label thoracic disease classification on chest radiographs using hybrid convolutional neural network (CNN)–transformer architectures, hierarchical scalar-weighted fusion, and ensemble strategies. Methods: This retrospective, multi-center study utilized publicly available datasets: NIH [...] Read more.
Background/Objectives: To develop and externally evaluate a deep learning framework for multi-label thoracic disease classification on chest radiographs using hybrid convolutional neural network (CNN)–transformer architectures, hierarchical scalar-weighted fusion, and ensemble strategies. Methods: This retrospective, multi-center study utilized publicly available datasets: NIH ChestX-ray14 (112,120 images; 30,805 patients) for model development and internal testing, and CheXpert (223,415 images) plus ChestX-Det10 (3578 images) for external validation. Nine CNN–transformer hybrids were systematically benchmarked, and the proposed model incorporated multi-scale DenseNet121 features, scalar-weighted fusion, positional encodings, and cross-attention. Four post hoc ensemble methods were explored, including a class-wise Top-3 Grid Search. Performance was evaluated using AUROC as the primary metric, along with precision, recall, F1-score, accuracy, specificity, positive predictive value, and negative predictive value. Statistical comparisons were performed using bootstrapped resampling and appropriate parametric or non-parametic tests. Results: On the NIH internal test set, the proposed hybrid model achieved a mean AUROC of 0.8495, which was significantly higher than that of the DenseNet121 baseline (0.8441, p = 0.032). The Top-3 Grid Search ensemble further improved internal performance, achieving a mean AUROC of 0.8577 (p < 0.00001). On external validation, the ensemble consistently outperformed DenseNet121, achieving mean AUROCs of 0.6500 on CheXpert (p < 0.001) and 0.6592 on ChestX-Det10 (p < 0.001). Per-class analysis revealed significant improvements for clinically important conditions such as cardiomegaly, mass, and pneumothorax. Grad-CAM visualizations demonstrated the strong alignment of predicted abnormalities with radiologically relevant regions. Conclusions: This CNN–transformer framework, particularly when combined with class-wise ensemble strategies, provided modest but statistically significant improvements in multi-label chest X-ray classification. External validation suggested partial generalizability across datasets, although performance remained moderate under domain shift. Full article
(This article belongs to the Special Issue Artificial Intelligence in Diagnostic Imaging)
Show Figures

Figure 1

13 pages, 2116 KB  
Article
Rapid Estimation for the Maximum Remaining Capacity of Retired Lithium-Ion Batteries Based on CNN-CBAM-LSTM
by Aqing Li, Penghao Cui, Yifei Cao, Peng Zhou, Lei Yang, Guochen Bian and Zhendong Shao
Batteries 2026, 12(4), 145; https://doi.org/10.3390/batteries12040145 - 20 Apr 2026
Viewed by 219
Abstract
With the continuous increase in the number of Retired Lithium-Ion Batteries (RLBs), accurately estimating their Maximum Remaining Capacity (MRC) has become a key challenge for rapid sorting and secondary utilization. Conventional detection methods often suffer from low efficiency and limited scalability for large-scale [...] Read more.
With the continuous increase in the number of Retired Lithium-Ion Batteries (RLBs), accurately estimating their Maximum Remaining Capacity (MRC) has become a key challenge for rapid sorting and secondary utilization. Conventional detection methods often suffer from low efficiency and limited scalability for large-scale applications. To address these issues, this paper presents a rapid MRC estimation method using a hybrid Convolutional Neural Network (CNN), Conv Block Attention Module (CBAM), and Long Short-Term Memory (LSTM) architecture. The proposed approach extracts key voltage and capacity features from only the initial 30 min charging phase, integrating both factory and laboratory data. Specifically, the CNN captures local temporal patterns, the LSTM models long-term dependencies, and the CBAM adaptively emphasizes critical characteristics. Experimental results demonstrate that the proposed method significantly outperforms traditional approaches, achieving a testing R2 of 98.05% and a Mean Absolute Percentage Error (MAPE) of 1.60%. These results highlight the superior performance of the proposed framework, exhibiting strong potential for high-throughput battery sorting and large-scale health monitoring systems. Full article
Show Figures

Figure 1

Back to TopTop