Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (830)

Search Parameters:
Keywords = SE attention

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 4258 KiB  
Article
A Few-Shot SE-Relation Net-Based Electronic Nose for Discriminating COPD
by Zhuoheng Xie, Yao Tian and Pengfei Jia
Sensors 2025, 25(15), 4780; https://doi.org/10.3390/s25154780 - 3 Aug 2025
Viewed by 62
Abstract
We propose an advanced electronic nose based on SE-RelationNet for COPD diagnosis with limited breath samples. The model integrates residual blocks, BiGRU layers, and squeeze–excitation attention mechanisms to enhance feature-extraction efficiency. Experimental results demonstrate exceptional performance with minimal samples: in 4-way 1-shot tasks, [...] Read more.
We propose an advanced electronic nose based on SE-RelationNet for COPD diagnosis with limited breath samples. The model integrates residual blocks, BiGRU layers, and squeeze–excitation attention mechanisms to enhance feature-extraction efficiency. Experimental results demonstrate exceptional performance with minimal samples: in 4-way 1-shot tasks, the model achieves 85.8% mean accuracy (F1-score = 0.852), scaling to 93.3% accuracy (F1-score = 0.931) with four samples per class. Ablation studies confirm that the 5-layer residual structure and single-hidden-layer BiGRU optimize stability (h_F1-score ≤ 0.011). Compared to SiameseNet and ProtoNet, SE-RelationNet shows superior accuracy (>15% improvement in 1-shot tasks). This technology enables COPD detection with as few as one breath sample, facilitating early intervention to mitigate lung cancer risks in COPD patients. Full article
(This article belongs to the Special Issue Nature Inspired Engineering: Biomimetic Sensors (2nd Edition))
Show Figures

Figure 1

26 pages, 1790 KiB  
Article
A Hybrid Deep Learning Model for Aromatic and Medicinal Plant Species Classification Using a Curated Leaf Image Dataset
by Shareena E. M., D. Abraham Chandy, Shemi P. M. and Alwin Poulose
AgriEngineering 2025, 7(8), 243; https://doi.org/10.3390/agriengineering7080243 - 1 Aug 2025
Viewed by 176
Abstract
In the era of smart agriculture, accurate identification of plant species is critical for effective crop management, biodiversity monitoring, and the sustainable use of medicinal resources. However, existing deep learning approaches often underperform when applied to fine-grained plant classification tasks due to the [...] Read more.
In the era of smart agriculture, accurate identification of plant species is critical for effective crop management, biodiversity monitoring, and the sustainable use of medicinal resources. However, existing deep learning approaches often underperform when applied to fine-grained plant classification tasks due to the lack of domain-specific, high-quality datasets and the limited representational capacity of traditional architectures. This study addresses these challenges by introducing a novel, well-curated leaf image dataset consisting of 39 classes of medicinal and aromatic plants collected from the Aromatic and Medicinal Plant Research Station in Odakkali, Kerala, India. To overcome performance bottlenecks observed with a baseline Convolutional Neural Network (CNN) that achieved only 44.94% accuracy, we progressively enhanced model performance through a series of architectural innovations. These included the use of a pre-trained VGG16 network, data augmentation techniques, and fine-tuning of deeper convolutional layers, followed by the integration of Squeeze-and-Excitation (SE) attention blocks. Ultimately, we propose a hybrid deep learning architecture that combines VGG16 with Batch Normalization, Gated Recurrent Units (GRUs), Transformer modules, and Dilated Convolutions. This final model achieved a peak validation accuracy of 95.24%, significantly outperforming several baseline models, such as custom CNN (44.94%), VGG-19 (59.49%), VGG-16 before augmentation (71.52%), Xception (85.44%), Inception v3 (87.97%), VGG-16 after data augumentation (89.24%), VGG-16 after fine-tuning (90.51%), MobileNetV2 (93.67), and VGG16 with SE block (94.94%). These results demonstrate superior capability in capturing both local textures and global morphological features. The proposed solution not only advances the state of the art in plant classification but also contributes a valuable dataset to the research community. Its real-world applicability spans field-based plant identification, biodiversity conservation, and precision agriculture, offering a scalable tool for automated plant recognition in complex ecological and agricultural environments. Full article
(This article belongs to the Special Issue Implementation of Artificial Intelligence in Agriculture)
Show Figures

Figure 1

21 pages, 4147 KiB  
Article
OLTEM: Lumped Thermal and Deep Neural Model for PMSM Temperature
by Yuzhong Sheng, Xin Liu, Qi Chen, Zhenghao Zhu, Chuangxin Huang and Qiuliang Wang
AI 2025, 6(8), 173; https://doi.org/10.3390/ai6080173 - 31 Jul 2025
Viewed by 235
Abstract
Background and Objective: Temperature management is key for reliable operation of permanent magnet synchronous motors (PMSMs). The lumped-parameter thermal network (LPTN) is fast and interpretable but struggles with nonlinear behavior under high power density. We propose OLTEM, a physics-informed deep model that combines [...] Read more.
Background and Objective: Temperature management is key for reliable operation of permanent magnet synchronous motors (PMSMs). The lumped-parameter thermal network (LPTN) is fast and interpretable but struggles with nonlinear behavior under high power density. We propose OLTEM, a physics-informed deep model that combines LPTN with a thermal neural network (TNN) to improve prediction accuracy while keeping physical meaning. Methods: OLTEM embeds LPTN into a recurrent state-space formulation and learns three parameter sets: thermal conductance, inverse thermal capacitance, and power loss. Two additions are introduced: (i) a state-conditioned squeeze-and-excitation (SC-SE) attention that adapts feature weights using the current temperature state, and (ii) an enhanced power-loss sub-network that uses a deep MLP with SC-SE and non-negativity constraints. The model is trained and evaluated on the public Electric Motor Temperature dataset (Paderborn University/Kaggle). Performance is measured by mean squared error (MSE) and maximum absolute error across permanent-magnet, stator-yoke, stator-tooth, and stator-winding temperatures. Results: OLTEM tracks fast thermal transients and yields lower MSE than both the baseline TNN and a CNN–RNN model for all four components. On a held-out generalization set, MSE remains below 4.0 °C2 and the maximum absolute error is about 4.3–8.2 °C. Ablation shows that removing either SC-SE or the enhanced power-loss module degrades accuracy, confirming their complementary roles. Conclusions: By combining physics with learned attention and loss modeling, OLTEM improves PMSM temperature prediction while preserving interpretability. This approach can support motor thermal design and control; future work will study transfer to other machines and further reduce short-term errors during abrupt operating changes. Full article
Show Figures

Figure 1

26 pages, 3684 KiB  
Article
Creation of Zinc (II)-Complexed Green Tea and Its Effects on Gut Microbiota by Daily Green Tea Consumption
by Tsukasa Orita, Daichi Ijiri, De-Xing Hou and Kozue Sakao
Molecules 2025, 30(15), 3191; https://doi.org/10.3390/molecules30153191 - 30 Jul 2025
Viewed by 333
Abstract
Although Zn (II)-(−)-Epigallocatechin gallate (EGCg) complex (Zn-EGCg) is known for its promising bioactivities, little attention has been paid to its incorporation into daily green tea consumption. In this study, we aimed to incorporate Zn (II) into green tea extract to promote the formation [...] Read more.
Although Zn (II)-(−)-Epigallocatechin gallate (EGCg) complex (Zn-EGCg) is known for its promising bioactivities, little attention has been paid to its incorporation into daily green tea consumption. In this study, we aimed to incorporate Zn (II) into green tea extract to promote the formation of Zn-EGCg complex within the tea matrix. We then investigated how the formation of Zn-complexed green tea extract (Zn-GTE) influences the gut microbiota in a Western diet (WD)-fed mouse model. Structural analyses using ultraviolet–visible spectroscopy (UV–Vis), Fourier-transform infrared spectroscopy (FT-IR), proton nuclear magnetic resonance (1H NMR), and powder X-ray diffraction (PXRD) suggested that Zn (II) interacted with hydroxyl groups of polyphenols within the extract, consistent with Zn-EGCg formation, although the complex could not be unequivocally identified. Under intake levels equivalent to daily consumption, Zn-GTE administration restored WD-induced reductions in alpha-diversity and resulted in a distinct microbial composition compared to treatment with green tea extract (GTE) or Zn alone, as shown by beta-diversity analysis. Linear discriminant analysis Effect Size (LEfSe) analysis revealed increased abundances of bacterial taxa belonging to o_Clostridiales, o_Bacteroidales, and f_Rikenellaceae, and decreased abundances of g_Akkermansia in the Zn-GTE group compared to the GTE group. These findings highlight that Zn-GTE, prepared via Zn (II) supplementation to green tea, may exert distinct microbiota-modulating effects compared to its individual components. This study provides new insights into the role of dietary metal–polyphenol complexes, offering a food-based platform for studying metal–polyphenol interactions under physiologically relevant conditions. Full article
(This article belongs to the Special Issue Health Benefits and Applications of Bioactive Phenolic Compounds)
Show Figures

Graphical abstract

16 pages, 3203 KiB  
Article
Green Synthesised Carbon Nanodots Using the Maillard Reaction for the Rapid Detection of Elemental Selenium in Water and Carbonated Beverages
by Arjun Muthu, Duyen H. H. Nguyen, Aya Ferroudj, József Prokisch, Hassan El-Ramady, Chaima Neji and Áron Béni
Nanomaterials 2025, 15(15), 1161; https://doi.org/10.3390/nano15151161 - 28 Jul 2025
Viewed by 185
Abstract
Selenium (Se) is an essential trace element involved in antioxidant redox regulation, thyroid hormone metabolism, and cancer prevention. Among its different forms, elemental selenium (Se0), particularly at the nanoscale, has gained growing attention in food, feed, and biomedical applications due to [...] Read more.
Selenium (Se) is an essential trace element involved in antioxidant redox regulation, thyroid hormone metabolism, and cancer prevention. Among its different forms, elemental selenium (Se0), particularly at the nanoscale, has gained growing attention in food, feed, and biomedical applications due to its lower toxicity and higher bioavailability compared to inorganic selenium species. However, the detection of Se0 in real samples remains challenging as current analytical methods are time-consuming, labour-intensive, and often unsuitable for rapid analysis. In this study, we developed a method for rapidly measuring Se0 using carbon nanodots (CNDs) produced from the Maillard reaction between glucose and glycine. The fabricated CNDs were water-dispersible and strongly fluorescent, with an average particle size of 3.90 ± 1.36 nm. Comprehensive characterisation by transmission electron microscopy (TEM), Fourier-transform infrared spectroscopy (FTIR), fluorescence spectroscopy, and Raman spectroscopy confirmed their structural and optical properties. The CNDs were employed as fluorescent probes for the selective detection of Se0. The sensor showed a wide linear detection range (0–12.665 mmol L−1), with a low detection limit (LOD) of 0.381 mmol L−1 and a quantification limit (LOQ) of 0.465 mmol L−1. Validation with spiked real samples—including ultra-pure water, tap water, and soft drinks—yielded high recoveries (98.6–108.1%) and low relative standard deviations (<3.4%). These results highlight the potential of CNDs as a simple, reliable, and environmentally friendly sensing platform for trace-level Se0 detection in complex food and beverage matrices. Full article
Show Figures

Graphical abstract

34 pages, 2268 KiB  
Review
Recent Progress in Selenium Remediation from Aqueous Systems: State-of-the-Art Technologies, Challenges, and Prospects
by Muhammad Ali Inam, Muhammad Usman, Rashid Iftikhar, Svetlozar Velizarov and Mathias Ernst
Water 2025, 17(15), 2241; https://doi.org/10.3390/w17152241 - 28 Jul 2025
Viewed by 462
Abstract
The contamination of drinking water sources with selenium (Se) oxyanions, including selenite (Se(IV)) and selenate (Se(VI)), contains serious health hazards with an oral intake exceeding 400 µg/day and therefore requires urgent attention. Various natural and anthropogenic sources are responsible for high Se concentrations [...] Read more.
The contamination of drinking water sources with selenium (Se) oxyanions, including selenite (Se(IV)) and selenate (Se(VI)), contains serious health hazards with an oral intake exceeding 400 µg/day and therefore requires urgent attention. Various natural and anthropogenic sources are responsible for high Se concentrations in aquatic environments. In addition, the chemical behavior and speciation of selenium can vary noticeably depending on the origin of the source water. The Se(VI) oxyanion is more soluble and therefore more abundant in surface water. Se levels in contaminated waters often exceed 50 µg/L and may reach several hundred µg/L, well above drinking water limits set by the World Health Organization (40 µg/L) and Germany (10 µg/L), as well as typical industrial discharge limits (5–10 µg/L). Overall, Se is difficult to remove using conventionally available physical, chemical, and biological treatment technologies. The recent literature has therefore highlighted promising advancements in Se removal using emerging technologies. These include advanced physical separation methods such as membrane-based treatment systems and engineered nanomaterials for selective Se decontamination. Additionally, other integrated approaches incorporating photocatalysis coupled adsorption processes, and bio-electrochemical systems have also demonstrated high efficiency in redox transformation and capturing of Se from contaminated water bodies. These innovative strategies may offer enhanced selectivity, removal, and recovery potential for Se-containing species. Here, a current review outlines the sources, distribution, and chemical behavior of Se in natural waters, along with its toxicity and associated health risks. It also provides a broad and multi-perspective assessment of conventional as well as emerging physical, chemical, and biological approaches for Se removal and/or recovery with further prospects for integrated and sustainable strategies. Full article
(This article belongs to the Section Water Quality and Contamination)
Show Figures

Graphical abstract

19 pages, 5417 KiB  
Article
SE-TFF: Adaptive Tourism-Flow Forecasting Under Sparse and Heterogeneous Data via Multi-Scale SE-Net
by Jinyuan Zhang, Tao Cui and Peng He
Appl. Sci. 2025, 15(15), 8189; https://doi.org/10.3390/app15158189 - 23 Jul 2025
Viewed by 206
Abstract
Accurate and timely forecasting of cross-regional tourist flows is essential for sustainable destination management, yet existing models struggle with sparse data, complex spatiotemporal interactions, and limited interpretability. This paper presents SE-TFF, a multi-scale tourism-flow forecasting framework that couples a Squeeze-and-Excitation (SE) network with [...] Read more.
Accurate and timely forecasting of cross-regional tourist flows is essential for sustainable destination management, yet existing models struggle with sparse data, complex spatiotemporal interactions, and limited interpretability. This paper presents SE-TFF, a multi-scale tourism-flow forecasting framework that couples a Squeeze-and-Excitation (SE) network with reinforcement-driven optimization to adaptively re-weight environmental, economic, and social features. A benchmark dataset of 17.8 million records from 64 countries and 743 cities (2016–2024) is compiled from the Open Travel Data repository in github (OPTD) for training and validation. SE-TFF introduces (i) a multi-channel SE module for fine-grained feature selection under heterogeneous conditions, (ii) a Top-K attention filter to preserve salient context in highly sparse matrices, and (iii) a Double-DQN layer that dynamically balances prediction objectives. Experimental results show SE-TFF attains 56.5% MAE and 65.6% RMSE reductions over the best baseline (ARIMAX) at 20% sparsity, with 0.92 × 103 average MAE across multi-task outputs. SHAP analysis ranks climate anomalies, tourism revenue, and employment as dominant predictors. These gains demonstrate SE-TFF’s ability to deliver real-time, interpretable forecasts for data-limited destinations. Future work will incorporate real-time social media signals and larger multimodal datasets to enhance generalizability. Full article
Show Figures

Figure 1

15 pages, 4874 KiB  
Article
A Novel 3D Convolutional Neural Network-Based Deep Learning Model for Spatiotemporal Feature Mapping for Video Analysis: Feasibility Study for Gastrointestinal Endoscopic Video Classification
by Mrinal Kanti Dhar, Mou Deb, Poonguzhali Elangovan, Keerthy Gopalakrishnan, Divyanshi Sood, Avneet Kaur, Charmy Parikh, Swetha Rapolu, Gianeshwaree Alias Rachna Panjwani, Rabiah Aslam Ansari, Naghmeh Asadimanesh, Shiva Sankari Karuppiah, Scott A. Helgeson, Venkata S. Akshintala and Shivaram P. Arunachalam
J. Imaging 2025, 11(7), 243; https://doi.org/10.3390/jimaging11070243 - 18 Jul 2025
Viewed by 455
Abstract
Accurate analysis of medical videos remains a major challenge in deep learning (DL) due to the need for effective spatiotemporal feature mapping that captures both spatial detail and temporal dynamics. Despite advances in DL, most existing models in medical AI focus on static [...] Read more.
Accurate analysis of medical videos remains a major challenge in deep learning (DL) due to the need for effective spatiotemporal feature mapping that captures both spatial detail and temporal dynamics. Despite advances in DL, most existing models in medical AI focus on static images, overlooking critical temporal cues present in video data. To bridge this gap, a novel DL-based framework is proposed for spatiotemporal feature extraction from medical video sequences. As a feasibility use case, this study focuses on gastrointestinal (GI) endoscopic video classification. A 3D convolutional neural network (CNN) is developed to classify upper and lower GI endoscopic videos using the hyperKvasir dataset, which contains 314 lower and 60 upper GI videos. To address data imbalance, 60 matched pairs of videos are randomly selected across 20 experimental runs. Videos are resized to 224 × 224, and the 3D CNN captures spatiotemporal information. A 3D version of the parallel spatial and channel squeeze-and-excitation (P-scSE) is implemented, and a new block called the residual with parallel attention (RPA) block is proposed by combining P-scSE3D with a residual block. To reduce computational complexity, a (2 + 1)D convolution is used in place of full 3D convolution. The model achieves an average accuracy of 0.933, precision of 0.932, recall of 0.944, F1-score of 0.935, and AUC of 0.933. It is also observed that the integration of P-scSE3D increased the F1-score by 7%. This preliminary work opens avenues for exploring various GI endoscopic video-based prospective studies. Full article
Show Figures

Figure 1

20 pages, 5486 KiB  
Article
SE-TransUNet-Based Semantic Segmentation for Water Leakage Detection in Tunnel Secondary Linings Amid Complex Visual Backgrounds
by Renjie Song, Yimin Wu, Li Wan, Shuai Shao and Haiping Wu
Appl. Sci. 2025, 15(14), 7872; https://doi.org/10.3390/app15147872 - 14 Jul 2025
Viewed by 260
Abstract
Traditional manual inspection methods for tunnel lining leakage are subjective and inefficient, while existing models lack sufficient recognition accuracy in complex scenarios. An intelligent leakage identification model adaptable to complex backgrounds is therefore needed. To address these issues, a Vision Transformer (ViT) was [...] Read more.
Traditional manual inspection methods for tunnel lining leakage are subjective and inefficient, while existing models lack sufficient recognition accuracy in complex scenarios. An intelligent leakage identification model adaptable to complex backgrounds is therefore needed. To address these issues, a Vision Transformer (ViT) was integrated into the UNet architecture, forming an SE-TransUNet model by incorporating SE-Block modules at skip connections between the encoder-decoder and the ViT output. Using a hybrid leakage dataset partitioned by k-fold cross-validation, the roles of SE-Block and ViT modules were examined through ablation experiments, and the model’s attention mechanism for leakage features was analyzed via Score-CAM heatmaps. Results indicate: (1) SE-TransUNet achieved mean values of 0.8318 (IoU), 0.8304 (Dice), 0.9394 (Recall), 0.8480 (Precision), 0.9733 (AUC), 0.8562 (MCC), 0.9218 (F1-score), and 6.53 (FPS) on the hybrid dataset, demonstrating robust generalization in scenarios with dent shadows, stain interference, and faint leakage traces. (2) Ablation experiments confirmed both modules’ necessity: The baseline model’s IoU exceeded the variant without the SE module by 4.50% and the variant without both the SE and ViT modules by 7.04%. (3) Score-CAM heatmaps showed the SE module broadened the model’s attention coverage of leakage areas, enhanced feature continuity, and improved anti-interference capability in complex environments. This research may provide a reference for related fields. Full article
Show Figures

Figure 1

21 pages, 1099 KiB  
Review
The Roles of E3 Ubiquitin Ligases in Cerebral Ischemia–Reperfusion Injury
by Man Li, Xiaoxiao Yu, Qiang Liu, Zhi Fang and Haijun Wang
Int. J. Mol. Sci. 2025, 26(14), 6723; https://doi.org/10.3390/ijms26146723 - 13 Jul 2025
Viewed by 331
Abstract
The temporary or permanent occlusion of cerebral blood vessels results in ischemic stroke (IS). Ischemia per se causes focal neuronal damage, and the subsequent ischemia–reperfusion injury that occurs after blood flow restoration further compromises brain tissue and cells in the neurovascular unit, significantly [...] Read more.
The temporary or permanent occlusion of cerebral blood vessels results in ischemic stroke (IS). Ischemia per se causes focal neuronal damage, and the subsequent ischemia–reperfusion injury that occurs after blood flow restoration further compromises brain tissue and cells in the neurovascular unit, significantly contributing to poor patient outcomes and functional impairments. Current research indicates that the ubiquitin–proteasome system (UPS) plays a crucial role in the pathological processes associated with cerebral ischemia–reperfusion injury (CIRI). Notably, E3 ubiquitin (Ub) ligases, which are essential in the UPS, have garnered increasing attention as potential novel therapeutic targets for treating ischemia–reperfusion damage in the brain. This review focuses primarily on the background of E3 Ub ligases and explores their intricate relationships with the pathological processes of CIRI. Full article
(This article belongs to the Special Issue Latest Advances in Oxidative Stress and Brain Injury)
Show Figures

Figure 1

29 pages, 16473 KiB  
Article
Demographic Change and Commons Governance: Examining the Impacts of Rural Out-Migration on Public Open Spaces in China Through a Social–Ecological Systems Framework
by Xuerui Shi, Gabriel Hoh Teck Ling and Pau Chung Leng
Land 2025, 14(7), 1444; https://doi.org/10.3390/land14071444 - 10 Jul 2025
Viewed by 455
Abstract
Rapid urbanization in China has driven substantial rural population out-migration, raising concerns about its implications for the governance of land commons in villages. While existing studies have acknowledged the effects of migration on rural resource management, little attention has been paid to its [...] Read more.
Rapid urbanization in China has driven substantial rural population out-migration, raising concerns about its implications for the governance of land commons in villages. While existing studies have acknowledged the effects of migration on rural resource management, little attention has been paid to its influence on the self-governance of rural public open spaces (POSs). This study adopts the social–ecological systems (SES) framework to examine how rural out-migration shapes POS self-governance mechanisms. Based on survey data from 594 villagers across 198 villages in Taigu District, partial least squares structural equation modeling (PLS-SEM) and a mediation model grounded in the SES framework were employed for analysis. The results indicate that rural out-migration does not exert a direct impact on POS self-governance. Instead, it negatively influences governance outcomes through full mediation by villager organizations, the left-behind population, collective investment in POSs, and self-organizing activities. Notably, the mediating roles of the left-behind population and self-organizing activities account for 67.38% of the total effect, underscoring their critical importance. Drawing on these insights, the study proposes four policy recommendations to strengthen rural POS self-governance under conditions of demographic transition. This research contributes to the literature by being the first to incorporate an external social factor—rural out-migration—within the SES framework in the context of POS governance, thereby advancing both theoretical and practical understandings of rural commons management. Full article
Show Figures

Figure 1

18 pages, 70320 KiB  
Article
RIS-UNet: A Multi-Level Hierarchical Framework for Liver Tumor Segmentation in CT Images
by Yuchai Wan, Lili Zhang and Murong Wang
Entropy 2025, 27(7), 735; https://doi.org/10.3390/e27070735 - 9 Jul 2025
Viewed by 429
Abstract
The deep learning-based analysis of liver CT images is expected to provide assistance for clinicians in the diagnostic decision-making process. However, the accuracy of existing methods still falls short of clinical requirements and needs to be further improved. Therefore, in this work, we [...] Read more.
The deep learning-based analysis of liver CT images is expected to provide assistance for clinicians in the diagnostic decision-making process. However, the accuracy of existing methods still falls short of clinical requirements and needs to be further improved. Therefore, in this work, we propose a novel multi-level hierarchical framework for liver tumor segmentation. In the first level, we integrate inter-slice spatial information by a 2.5D network to resolve the accuracy–efficiency trade-off inherent in conventional 2D/3D segmentation strategies for liver tumor segmentation. Then, the second level extracts the inner-slice global and local features for enhancing feature representation. We propose the Res-Inception-SE Block, which combines residual connections, multi-scale Inception modules, and squeeze-excitation attention to capture comprehensive global and local features. Furthermore, we design a hybrid loss function combining Binary Cross Entropy (BCE) and Dice loss to solve the category imbalance problem and accelerate convergence. Extensive experiments on the LiTS17 dataset demonstrate the effectiveness of our method on accuracy, efficiency, and visual results for liver tumor segmentation. Full article
(This article belongs to the Special Issue Cutting-Edge AI in Computational Bioinformatics)
Show Figures

Figure 1

21 pages, 5895 KiB  
Article
Improved YOLO-Based Pulmonary Nodule Detection with Spatial-SE Attention and an Aspect Ratio Penalty
by Xinhang Song, Haoran Xie, Tianding Gao, Nuo Cheng and Jianping Gou
Sensors 2025, 25(14), 4245; https://doi.org/10.3390/s25144245 - 8 Jul 2025
Viewed by 420
Abstract
The accurate identification of pulmonary nodules is critical for the early diagnosis of lung diseases; however, this task remains challenging due to inadequate feature representation and limited localization sensitivity. Current methodologies often utilize channel attention mechanisms and intersection over union (IoU)-based loss functions. [...] Read more.
The accurate identification of pulmonary nodules is critical for the early diagnosis of lung diseases; however, this task remains challenging due to inadequate feature representation and limited localization sensitivity. Current methodologies often utilize channel attention mechanisms and intersection over union (IoU)-based loss functions. Yet, they frequently overlook spatial context and struggle to capture subtle variations in aspect ratios, which hinders their ability to detect small objects. In this study, we introduce an improved YOLOV11 framework that addresses these limitations through two primary components: a spatial squeeze-and-excitation (SSE) module that concurrently models channel-wise and spatial attention to enhance the discriminative features pertinent to nodules and explicit aspect ratio penalty IoU (EAPIoU) loss that imposes a direct penalty on the squared differences in aspect ratios to refine the bounding box regression process. Comprehensive experiments conducted on the LUNA16, LungCT, and Node21 datasets reveal that our approach achieves superior precision, recall, and mean average precision (mAP) across various IoU thresholds, surpassing previous state-of-the-art methods while maintaining computational efficiency. Specifically, the proposed SSE module achieves a precision of 0.781 on LUNA16, while the EAPIoU loss boosts mAP@50 to 92.4% on LungCT, outperforming mainstream attention mechanisms and IoU-based loss functions. These findings underscore the effectiveness of integrating spatially aware attention mechanisms with aspect ratio-sensitive loss functions for robust nodule detection. Full article
(This article belongs to the Section Biomedical Sensors)
Show Figures

Figure 1

21 pages, 3406 KiB  
Article
ResNet-SE-CBAM Siamese Networks for Few-Shot and Imbalanced PCB Defect Classification
by Chao-Hsiang Hsiao, Huan-Che Su, Yin-Tien Wang, Min-Jie Hsu and Chen-Chien Hsu
Sensors 2025, 25(13), 4233; https://doi.org/10.3390/s25134233 - 7 Jul 2025
Viewed by 576
Abstract
Defect detection in mass production lines often involves small and imbalanced datasets, necessitating the use of few-shot learning methods. Traditional deep learning-based approaches typically rely on large datasets, limiting their applicability in real-world scenarios. This study explores few-shot learning models for detecting product [...] Read more.
Defect detection in mass production lines often involves small and imbalanced datasets, necessitating the use of few-shot learning methods. Traditional deep learning-based approaches typically rely on large datasets, limiting their applicability in real-world scenarios. This study explores few-shot learning models for detecting product defects using limited data, enhancing model generalization and stability. Unlike previous deep learning models that require extensive datasets, our approach effectively performs defect detection with minimal data. We propose a Siamese network that integrates Residual blocks, Squeeze and Excitation blocks, and Convolution Block Attention Modules (ResNet-SE-CBAM Siamese network) for feature extraction, optimized through triplet loss for embedding learning. The ResNet-SE-CBAM Siamese network incorporates two primary features: attention mechanisms and metric learning. The recently developed attention mechanisms enhance the convolutional neural network operations and significantly improve feature extraction performance. Meanwhile, metric learning allows for the addition or removal of feature classes without the need to retrain the model, improving its applicability in industrial production lines with limited defect samples. To further improve training efficiency with imbalanced datasets, we introduce a sample selection method based on the Structural Similarity Index Measure (SSIM). Additionally, a high defect rate training strategy is utilized to reduce the False Negative Rate (FNR) and ensure no missed defect detections. At the classification stage, a K-Nearest Neighbor (KNN) classifier is employed to mitigate overfitting risks and enhance stability in few-shot conditions. The experimental results demonstrate that with a good-to-defect ratio of 20:40, the proposed system achieves a classification accuracy of 94% and an FNR of 2%. Furthermore, when the number of defective samples increases to 80, the system achieves zero false negatives (FNR = 0%). The proposed metric learning approach outperforms traditional deep learning models, such as parametric-based YOLO series models in defect detection, achieving higher accuracy and lower miss rates, highlighting its potential for high-reliability industrial deployment. Full article
Show Figures

Figure 1

33 pages, 3352 KiB  
Article
Optimization Strategy for Underwater Target Recognition Based on Multi-Domain Feature Fusion and Deep Learning
by Yanyang Lu, Lichao Ding, Ming Chen, Danping Shi, Guohao Xie, Yuxin Zhang, Hongyan Jiang and Zhe Chen
J. Mar. Sci. Eng. 2025, 13(7), 1311; https://doi.org/10.3390/jmse13071311 - 7 Jul 2025
Viewed by 398
Abstract
Underwater sonar target recognition is crucial in fields such as national defense, navigation, and environmental monitoring. However, it faces issues such as the complex characteristics of ship-radiated noise, imbalanced data distribution, non-stationarity, and bottlenecks of existing technologies. This paper proposes the MultiFuseNet-AID network, [...] Read more.
Underwater sonar target recognition is crucial in fields such as national defense, navigation, and environmental monitoring. However, it faces issues such as the complex characteristics of ship-radiated noise, imbalanced data distribution, non-stationarity, and bottlenecks of existing technologies. This paper proposes the MultiFuseNet-AID network, aiming to address these challenges. The network includes the TriFusion block module, the novel lightweight attention residual network (NLARN), the long- and short-term attention (LSTA) module, and the Mamba module. Through the TriFusion block module, the original, differential, and cumulative signals are processed in parallel, and features such as MFCC, CQT, and Fbank are fused to achieve deep multi-domain feature fusion, thereby enhancing the signal representation ability. The NLARN was optimized based on the ResNet architecture, with the SE attention mechanism embedded. Combined with the long- and short-term attention (LSTA) and the Mamba module, it could capture long-sequence dependencies with an O(N) complexity, completing the optimization of lightweight long sequence modeling. At the same time, with the help of feature fusion, and layer normalization and residual connections of the Mamba module, the adaptability of the model in complex scenarios with imbalanced data and strong noise was enhanced. On the DeepShip and ShipsEar datasets, the recognition rates of this model reached 98.39% and 99.77%, respectively. The number of parameters and the number of floating point operations were significantly lower than those of classical models, and it showed good stability and generalization ability under different sample label ratios. The research shows that the MultiFuseNet-AID network effectively broke through the bottlenecks of existing technologies. However, there is still room for improvement in terms of adaptability to extreme underwater environments, training efficiency, and adaptability to ultra-small devices. It provides a new direction for the development of underwater sonar target recognition technology. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

Back to TopTop