Saved Queries

During UAV aerial photography tasks, influenced by flight altitude and imaging mechanisms, the target in images often exhibits characteristics such as small size, complex backgrounds, and small inter-class differences. Under single optical modality, the weak and less discriminative feature representation of targets in drone-captured images makes them easily overwhelmed by complex background noise, leading to low detection accuracy, high missed-detection and false-detection rates in current object detection networks. Moreover, such methods struggle to meet all-weather and all-scenario application requirements. To address these issues, this paper proposes DVIF-Net, a visible-infrared fusion network for small-target detection in UAV aerial images, which leverages the complementary characteristics of visible and infrared images to enhance detection capability in complex environments. Firstly, a dual-branch feature extraction structure is designed based on YOLO architecture to separately extract features from visible and infrared images. Secondly, a P4-level cross-modal fusion strategy is proposed to effectively integrate features from both modalities while reducing computational complexity. Meanwhile, we design a novel dual context-guided fusion module to capture complementary features through channel attention of visible and infrared images during fusion and enhance interaction between modalities via element-wise multiplication. Finally, an edge information enhancement module based on cross stage partial structure is developed to improve sensitivity to small-target edges. Experimental results on two cross-modal datasets, DroneVehicle and VEDAI, demonstrate that DVIF-Net achieves detection accuracies of 85.8% and 62%, respectively. Compared with YOLOv10n, it has improved by 21.7% and 10.5% in visible modality, and by 7.4% and 30.5% in infrared modality, while maintaining a model parameter count of only 2.49 M. Furthermore, compared with 15 other algorithms, the proposed DVIF-Net attains SOTA performance. These results indicate that the method significantly enhances the detection capability for small targets in UAV aerial images, offering a high-precision and lightweight solution for real-time applications in complex aerial scenarios. Full article

14 pages, 2107 KB

Open AccessArticle

Agricultural Knowledge-Enhanced Deep Learning for Joint Intent Detection and Slot Filling

by Mingtang Liu, Shanshan Wu, Wenlong Tian, Shuo Lei and Jiahao Miao

Appl. Sci. 2025, 15(20), 10932; https://doi.org/10.3390/app152010932 (registering DOI) - 11 Oct 2025

Abstract

Intent detection and slot filling are fundamental components for constructing intelligent question-answering systems in agricultural domains. Existing approaches show notable limitations in semantic feature extraction and achieve relatively low accuracy when processing domain-specific agricultural queries with complex terminology and contextual dependencies. To address these challenges, this paper proposes an agricultural knowledge-enhanced deep learning approach that integrates agricultural domain knowledge and terminology with advanced neural architectures. The method integrates HanLP-based agricultural terminology processing with BERT contextual encoding, TextCNN feature extraction, and attention-based fusion. Experimental validation on a curated domain-specific agricultural dataset of 8041 melon cultivation queries demonstrates that the proposed model achieves an accuracy of 79.6%, recall of 80.1%, and F1-score of 79.8%, demonstrating significant improvements (7–22% performance gains) over baseline methods including TextRNN, TextRCNN, TextCNN, and BERT-TextCNN models. The results demonstrate significant potential for advancing intelligent agricultural advisory systems and domain-specific natural language understanding applications, particularly for precision agriculture applications. Full article

(This article belongs to the Section Agricultural Science and Technology)

►▼ Show Figures

Figure 1

21 pages, 4636 KB

Open AccessArticle

Explainable Few-Shot Anomaly Detection for Real-Time Automotive Quality Control

by Safeh Clinton Mawah, Dagmawit Tadesse Aga, Shahrokh Hatefi, Farouk Smith and Yimesker Yihun

Processes 2025, 13(10), 3238; https://doi.org/10.3390/pr13103238 (registering DOI) - 11 Oct 2025

Abstract

Automotive manufacturing quality control faces persistent challenges such as limited defect samples, cross-domain variability, and the demand for interpretable decision-making. This work presents an explainable few-shot anomaly detection framework that integrates EfficientNet-based feature extraction, adaptive prototype learning, and component-specific attention mechanisms to address these requirements. The system is designed for rapid adaptation to novel defect types while maintaining interpretability through a multi-modal explainable AI module that combines visual, quantitative, and textual outputs. Evaluation on automotive datasets demonstrates promising performance on evaluated automotive components, achieving 99.4% accuracy for engine wiring inspection and 98.8% for gear inspection, with improvements of 5.2–7.6% over state-of-the-art baselines, including traditional unsupervised methods (PaDiM, PatchCore), advanced approaches (FastFlow, CFA, DRAEM), and few-shot supervised methods (ProtoNet, MatchingNet, RelationNet, FEAT), and with only 0.63% cross-domain degradation between wiring and gear inspection tasks. The architecture operates under real-time industrial constraints, with an average inference time of 18.2 ms, throughput of 60 components per minute, and memory usage below 2 GB on RTX 3080 hardware. Ablation studies confirm the importance of prototype learning (−4.52%), component analyzers (−2.79%), and attention mechanisms (−2.21%), with K = 5 few-shot configuration providing the best trade-off between accuracy and adaptability. Beyond performance, the framework produces interpretable defect localization, root-cause analysis, and severity-based recommendations designed for manufacturing integration with execution systems via standardized industrial protocols. These results demonstrate a practical and scalable approach for intelligent quality control, enabling robust, interpretable, and adaptive inspection within the evaluated automotive components. Full article

(This article belongs to the Special Issue Recent Trends in Advanced Manufacturing Technologies for Materials Processing and Production)

22 pages, 520 KB

Open AccessReview

Prevalence of Human and Animal African Trypanosomiasis in Nigeria: A Scoping Review

by Chinwe Chukwudi, Elizabeth Odebunmi and Chukwuemeka Ibeachu

Parasitologia 2025, 5(4), 53; https://doi.org/10.3390/parasitologia5040053 (registering DOI) - 11 Oct 2025

Abstract

African trypanosomiasis is a protozoan disease that affects both humans and animals. Human African Trypanosomiasis (HAT) is a Neglected Tropical Disease targeted for elimination in 2030. Although WHO has not reported HAT from Nigeria in the last decade, there are published studies reporting seroprevalence, parasite detection/isolation, and animal reservoirs potentially involved in HAT transmission in Nigeria. Interestingly, the burden of Animal African Trypanosomiasis (AAT) continues to increase. In this study, we synthesized published reports on the prevalence of HAT and AAT in Nigeria from 1993–2021, the trypanosome species involved, the spread of animal reservoirs, and the variability in diagnostic methodologies employed. A scoping review was performed following the methodological framework outlined in PRISMA-ScR checklist. Sixteen eligible studies published between 1993 and 2021 were reviewed: 13 for AAT and 3 for HAT. Varying prevalence rates were recorded depending on the diagnostic methods employed. The average prevalence reported from these studies was 3.3% (HAT), and 27.3% (AAT). Diagnostic methods employed include microscopy, PCR and Card Agglutination Test for Trypanosomiasis (CATT). Cattle, pigs, and dogs were identified as carriers of human-infective trypanosomes. This study highlights the scarcity of HAT epidemiological studies/data from Nigeria, the high prevalence, complex epidemiology, limited attention and surveillance of African Trypanosomiasis in Nigeria. Remarkably, WHO records do not reflect the published data showing evidence of HAT prevalence/cases in Nigeria. Unfortunately, diagnostics challenges and unrealistic disease reporting protocols seem to limit HAT reporting from Nigeria. Therefore, adequately coordinated epidemiological surveys and targeted intervention policies are imperative to ascertain the true epidemiological status of HAT in Nigeria and prevent disease re-emergence towards achieving WHO’s elimination targets. The presence of animal carriers of human-infective trypanosomes underscores the importance of a one-health approach to combat African trypanosomiasis effectively. Full article

(This article belongs to the Special Issue Epidemiology, Diagnosis and Clinical Management of Human Parasitic Infections—2nd Edition)

►▼ Show Figures

Figure 1

25 pages, 4958 KB

Open AccessArticle

YOLO-DPDG: A Dual-Pooling Dynamic Grouping Network for Small and Long-Distance Traffic Sign Detection

by Ruishi Liang, Minjie Jiang and Shuaibing Li

Appl. Sci. 2025, 15(20), 10921; https://doi.org/10.3390/app152010921 (registering DOI) - 11 Oct 2025

Abstract

Traffic sign detection is a crucial task for autonomous driving perception systems, as it directly impacts vehicle path planning and safety decisions. Existing algorithms face challenges such as feature information attenuation and model lightweighting requirements in the detection of small traffic signs at long distances. To address these issues, this paper proposes a dual-pooling dynamic grouping (DPDG) module. This module dynamically adjusts the number of groups to adapt to different input features, combines global average pooling and max pooling to enhance channel attention representation, and uses a lightweight 3 × 3 convolution-based spatial branch to generate spatial weights. Based on a hierarchical optimization strategy, the DPDG module is integrated into the YOLOv10n network. Experimental results on the traffic sign dataset demonstrate a significant improvement in the performance of the YOLO-DPDG network: Compared to the baseline YOLOv10n model, mAP@0.5 and mAP@0.5:0.95 improved by 8.77% and 10.56%, respectively, while precision and recall were enhanced by 6.16% and 6.62%, respectively. Additionally, inference speed (FPS) increased by 11.1%, with only a 4.89% increase in model parameters. Compared to the YOLOv10-Small model, this method achieves a similar detection accuracy while reducing the number of model parameters by 64.83%. This study provides a more efficient and lightweight solution for edge-based traffic sign detection. Full article

(This article belongs to the Section Computing and Artificial Intelligence)

►▼ Show Figures

Figure 1

17 pages, 4020 KB

Open AccessArticle

Lightweight Detection Method of Wheelset Tread Defects Based on Improved YOLOv7

by Peng Yang, Fan Gao, Xinwen Yang, Caidong Wang, Hongjun Yang and Zhifeng Zhang

Appl. Sci. 2025, 15(20), 10903; https://doi.org/10.3390/app152010903 - 10 Oct 2025

Abstract

Accurate online inspection of train wheelset tread defects is challenging owing to the variety and position uncertainty of defects. This study develops an improved YOLOv7 model capable of inspecting various wheelset tread defects with high accuracy and low computation complexity. This model comprises GSConv, a small target enhancement (STE) module, and StyleGAN3. GSConv significantly reduces the model volume while maintaining the feature expression ability, achieving a lightweight structure. The STE module enhances the fusion of shallow features and distribution of attention weights, significantly improving the sensitivity to small-sized defects and positioning robustness. StyleGAN3 enhances small samples by addressing inhomogeneity, thereby generating high-quality defect samples; it overcomes the limitations of traditional amplification methods regarding texture authenticity and morphological diversity, systematically improving the model’s generalization ability under sample scarcity conditions. The model achieves 1.6%, 10.7%, 48.63% and 37.97% higher mean average precision values than YOLOv7, YOLOv5, SSD, and Faster R-CNN, respectively, and the model parameter size is reduced by 73.91, 94.69, 122.11, and 154.91 MB, respectively. Hence, the proposed YOLOv7-STE model outperforms traditional models. Moreover, it demonstrates satisfactory performance in detecting small target defects in different samples, highlighting its potential applicability in online wheel tread defect inspection. Full article

►▼ Show Figures

Figure 1

18 pages, 2920 KB

Open AccessArticle

Frequency Domain Reflectometry for Power Cable Defect Localization: A Comparative Study of FFT and IFFT Methods

by Wenbo Zhu, Baojun Hui, Jianda Li, Tao Han, Linjie Zhao and Shuai Hou

Energies 2025, 18(20), 5346; https://doi.org/10.3390/en18205346 - 10 Oct 2025

Abstract

At present, the development of power cables shows three notable trends: higher voltage, longer distance and more complex environments. Against this backdrop, the limitations of traditional detection techniques in locating local defects have become increasingly apparent. Frequency Domain Reflectometry (FDR) has garnered sustained research attention both domestically and internationally due to its high sensitivity and accuracy in detecting localized defects. This paper aims to compare the defect localization effectiveness of the Fast Fourier Transform (FFT) method and the Inverse Fast Fourier Transform (IFFT) method within FDR. First, the differences between the two methods are analyzed from a theoretical perspective. Then, field tests are conducted on cables of varying voltage levels and lengths, with comparisons made using parameters such as full width at half maximum (FWHM) and signal-to-noise ratio (SNR). The results indicate that the FFT method is more suitable for low-interference or short cables, while the IFFT method is more suitable for high-noise, high-resolution, or long cables. Full article

(This article belongs to the Special Issue Advanced Techniques for Power Transmission, Distribution and Transformation Equipment)

►▼ Show Figures

Figure 1

18 pages, 9861 KB

Open AccessArticle

EH-YOLO: Dimensional Transformation and Hierarchical Feature Fusion-Based PCB Surface Defect Detection

by Chengzhi Deng, You Zhang, Zhaoming Wu, Yingbo Wu, Xiaowei Sun and Shengqian Wang

Appl. Sci. 2025, 15(20), 10895; https://doi.org/10.3390/app152010895 - 10 Oct 2025

Abstract

Small surface defects in printed circuit boards (PCBs) severely affect the reliability of electronic devices, making PCB surface defect detection crucial for ensuring the quality of electronic products. However, the existing detection methods often struggle with insufficient accuracy and the inherent trade-off between detection precision and inference speed. To address these problems, we propose a novel ESDM-HNN-YOLO (EH-YOLO) network based on the improved YOLOv10 for efficient detection of small PCB defects. Firstly, an enhanced spatial-depth module (ESDM) is designed, which transforms spatial-dimensional features into depth-dimensional representations while integrating spatial attention module (SAM) and channel attention module (CAM) to highlight critical features. This dual mechanism not only effectively suppresses feature loss in micro-defects but also significantly enhances detection accuracy. Secondly, a hybrid neck network (HNN) is designed, which optimizes the speed–accuracy balance through hierarchical architecture. The hierarchical structure uses a computationally efficient weighted bidirectional feature pyramid network (BiFPN) to enhance multi-scale feature fusion of small objects in the shallow layer and uses a path aggregation network (PAN) to prevent feature loss in the deeper layer. Comprehensive evaluations on benchmark datasets (PCB_DATASET and DeepPCB) demonstrate the superior performance of EH-YOLO, achieving mAP@50-95 scores of 45.3% and 78.8% with inference speeds of 166.67 FPS and 158.73 FPS, respectively. These results significantly outperform existing approaches in both accuracy and processing efficiency. Full article

►▼ Show Figures

Figure 1

20 pages, 8850 KB

Open AccessArticle

Intelligent Defect Recognition of Glazed Components in Ancient Buildings Based on Binocular Vision

by Youshan Zhao, Xiaolan Zhang, Ming Guo, Haoyu Han, Jiayi Wang, Yaofeng Wang, Xiaoxu Li and Ming Huang

Buildings 2025, 15(20), 3641; https://doi.org/10.3390/buildings15203641 (registering DOI) - 10 Oct 2025

Abstract

Glazed components in ancient Chinese architecture hold profound historical and cultural value. However, over time, environmental erosion, physical impacts, and human disturbances gradually lead to various forms of damage, severely impacting the durability and stability of the buildings. Therefore, preventive protection of glazed components is crucial. The key to preventive protection lies in the early detection and repair of damage, thereby extending the component’s service life and preventing significant structural damage. To address this challenge, this study proposes a Restoration-Scale Identification (RSI) method that integrates depth information. By combining RGB-D images acquired from a depth camera with intrinsic camera parameters, and embedding a Convolutional Block Attention Module (CBAM) into the backbone network, the method dynamically enhances critical feature regions. It then employs a scale restoration strategy to accurately identify damage areas and recover the physical dimensions of glazed components from a global perspective. In addition, we constructed a dedicated semantic segmentation dataset for glazed tile damage, focusing on cracks and spalling. Both qualitative and quantitative evaluation results demonstrate that, compared with various high-performance semantic segmentation methods, our approach significantly improves the accuracy and robustness of damage detection in glazed components. The achieved accuracy deviates by only ±10 mm from high-precision laser scanning, a level of precision that is essential for reliably identifying and assessing subtle damages in complex glazed architectural elements. By integrating depth information, real scale information can be effectively obtained during the intelligent recognition process, thereby efficiently and accurately identifying the type of damage and size information of glazed components, and realizing the conversion from two-dimensional (2D) pixel coordinates to local three-dimensional (3D) coordinates, providing a scientific basis for the protection and restoration of ancient buildings, and ensuring the long-term stability of cultural heritage and the inheritance of historical value. Full article

(This article belongs to the Section Building Materials, and Repair & Renovation)

►▼ Show Figures

Figure 1

14 pages, 6532 KB

Open AccessArticle

The Evaluation of Skin Infiltration in Mycosis Fungoides/Sézary Syndrome Using the High-Frequency Ultrasonography

by Hanna Cisoń, Alina Jankowska-Konsur and Rafał Białynicki-Birula

J. Clin. Med. 2025, 14(20), 7143; https://doi.org/10.3390/jcm14207143 - 10 Oct 2025

Abstract

Background/Objectives: High-frequency ultrasonography (HFUS) has gained increasing attention in dermatology as a non-invasive imaging technique capable of visualizing cutaneous structures with high resolution. In cutaneous T-cell lymphomas (CTCL), including mycosis fungoides (MF)/Sézary syndrome (SS), HFUS may provide an objective method for assessing disease activity and monitoring treatment response. This study aimed to evaluate the clinical utility of HFUS in detecting therapy-induced changes in subepidermal low-echogenic band (SLEB) thickness. Methods: We conducted a prospective, single-center study between May 2021 and May 2025. Thirty-three patients with histologically confirmed MF (n = 31) or SS (n = 2) underwent HFUS at baseline and after 4–8 weeks of treatment. SLEB thickness was measured before (E1) and after early treatment (E2). Patients received systemic agents, phototherapy, or topical regimens. Statistical analysis included mixed-model ANOVA with repeated measures to assess SLEB changes, and post hoc tests were applied to explore the influence of therapy type, age, and gender. Results: Among 31 evaluable patients with MF, HFUS revealed a significant reduction in SLEB thickness after treatment (0.90 ± 1.10 mm vs. 0.69 ± 0.89 mm; F(1,29) = 8.88, p = 0.006, η² = 0.23). The type of early therapy (systemic vs. topical) did not significantly affect outcomes (p = 0.452). Age emerged as a relevant factor: patients ≥ 66 years exhibited higher baseline SLEB values and a significant reduction post-treatment (p < 0.001), whereas no comparable effect was observed in younger patients. Gender did not significantly influence SLEB changes. Conclusions: HFUS is a sensitive and clinically applicable imaging tool for monitoring treatment response in MF/SS. Reductions in SLEB thickness were observed across therapeutic modalities and aligned with early clinical improvement. HFUS may serve as a valuable adjunct to standard clinical and histopathological evaluation in the routine management of MF/SS. Full article

(This article belongs to the Section Dermatology)

►▼ Show Figures

Figure 1

24 pages, 76400 KB

Open AccessArticle

MBD-YOLO: An Improved Lightweight Multi-Scale Small-Object Detection Model for UAVs Based on YOLOv8

by Bo Xu, Di Cai, Kelin Sui, Zheng Wang, Chuangchuang Liu and Xiaolong Pei

Appl. Sci. 2025, 15(20), 10877; https://doi.org/10.3390/app152010877 - 10 Oct 2025

Abstract

To address the challenges of low detection accuracy and weak generalization in UAV aerial imagery caused by complex ground environments, significant scale variations among targets, dense small objects, and background interference, this paper proposes an improved lightweight multi-scale small-object detection model, MBD-YOLO (MBFF module, BiMS-FPN, and Dual-Stream Head). Specifically, to enhance multi-scale feature extraction capabilities, we introduce the Multi-Branch Feature Fusion (MBFF) module, which dynamically adjusts receptive fields through parallel branches and adaptive depthwise convolutions, expanding the receptive field while preserving detail perception. We further design a lightweight Bidirectional Multi-Scale Feature Aggregation Pyramid Network (BiMS-FPN), integrating bidirectional propagation paths and a Multi-Scale Feature Aggregation (MSFA) module to mitigate feature spatial misalignment and improve small-target detection. Additionally, the Dual-Stream Head with NMS-free architecture leverages a task-aligned architecture and dynamic matching strategies to boost inference speed without compromising accuracy. Experiments on the VisDrone2019 dataset demonstrate that MBD-YOLO-n surpasses YOLOv8n by 6.3% in mAP50 and 8.2% in mAP50–95, with accuracy gains of 17.96–55.56% for several small-target categories, while increasing parameters by merely 3.1%. Moreover, MBD-YOLO-s achieves superior detection accuracy, efficiency, and generalization with only 12.1 million parameters, outperforming state-of-the-art models and proving suitable for resource-constrained embedded deployment scenarios. The superior performance of MBD-YOLO, which harmonizes high precision with low computational demand, fulfills the critical requirements for real-time deployment on resource-limited UAVs, showing great promise for applications in traffic monitoring, urban security, and agricultural surveying. Full article

(This article belongs to the Special Issue Application, Optimization and Architecture of Deep Learning Neural Network)

►▼ Show Figures

Figure 1

38 pages, 13748 KB

Open AccessArticle

MH-WMG: A Multi-Head Wavelet-Based MobileNet with Gated Linear Attention for Power Grid Fault Diagnosis

by Yousef Alkhanafseh, Tahir Cetin Akinci, Alfredo A. Martinez-Morales, Serhat Seker and Sami Ekici

Appl. Sci. 2025, 15(20), 10878; https://doi.org/10.3390/app152010878 - 10 Oct 2025

Abstract

Artificial intelligence is increasingly embedded in power systems to boost efficiency, reliability, and automation. This study introduces an end-to-end, AI-driven fault-diagnosis pipeline built around a Multi-Head Wavelet-based MobileNet with Gated Linear Attention (MH-WMG). The network takes time-series signals converted into images as input and branches into three heads that, respectively, localize the fault area, classify the fault type, and predict the distance bin for all short-circuit faults. Evaluation employs the canonical Kundur two-area four-machine system, partitioned into six regions, twelve fault scenarios (including normal operation), and twelve predefined distance bins. MH-WMG achieves high performance: perfect accuracy, precision, recall, and F1 (1.00) for fault-area detection; strong fault-type identification (accuracy = 0.9604, precision = 0.9625, recall = 0.9604, and F1 = 0.9601); and robust distance-bin prediction (accuracy = 0.8679, precision = 0.8725, recall = 0.8679, and F1 = 0.8690). The model is compact and fast (2.33 M parameters, 44.14 ms latency, 22.66 images/s) and outperforms baselines in both accuracy and efficiency. The pipeline decisively outperforms conventional time-series methods. By rapidly pinpointing and classifying faults with high fidelity, it enhances grid resilience, reduces operational risk, and enables more stable, intelligent operation, demonstrating the value of AI-driven fault detection for future power-system reliability. Full article

►▼ Show Figures

Figure 1

24 pages, 1545 KB

Open AccessArticle

Curvature-Aware Point-Pair Signatures for Robust Unbalanced Point Cloud Registration

by Xinhang Hu, Zhao Zeng, Jiwei Deng, Guangshuai Wang, Jiaqi Yang and Siwen Quan

Sensors 2025, 25(20), 6267; https://doi.org/10.3390/s25206267 - 10 Oct 2025

Abstract

Existing point cloud registration methods can effectively handle large-scale and partially overlapping point cloud pairs. However, registering unbalanced point cloud pairs with significant disparities in spatial extent and point density remains a challenging problem that has received limited research attention. This challenge primarily arises from the difficulty in achieving accurate local registration when the point clouds exhibit substantial scale variations and uneven density distributions. This paper presents a novel registration method for unbalanced point cloud pairs that utilizes the local point cluster structure feature for effective outlier rejection. The fundamental principle underlying our method is that the internal structure of a local cluster comprising a point and its K-nearest neighbors maintains rigidity-preserved invariance across different point clouds. The proposed pipeline operates through four sequential stages. First, keypoints are detected in both the source and target point clouds. Second, local feature descriptors are employed to establish initial one-to-many correspondences, which is a strategy that increases correspondences redundancy to enhance the pool of potential inliers. Third, the proposed Local Point Cluster Structure Feature is applied to filter outliers from the initial correspondences. Finally, the transformation hypothesis is generated and evaluated through the RANSAC method. To validate the efficacy of the proposed method, we construct a carefully designed benchmark named KITTI-UPP (KITTI-Unbalanced Point cloud Pairs) based on the KITTI odometry dataset. We further evaluate our method on the real-world TIESY Dataset which is a LiDAR-scanned dataset collected by the Third Railway Survey and Design Institute Group Co. Extensive experiments demonstrate that our method significantly outperforms the state-of-the-art methods in terms of both registration success rate and computational efficiency on the KITTI-UPP benchmark. Moreover, it achieves competitive results on the real-world TIESY dataset, confirming its applicability and generalizability across diverse real-world scenarios. Full article

(This article belongs to the Section Sensing and Imaging)

►▼ Show Figures

Figure 1

17 pages, 7150 KB

Open AccessArticle

DeepFishNET+: A Dual-Stream Deep Learning Framework for Robust Underwater Fish Detection and Classification

by Mahdi Hamzaoui, Mokhtar Rejili, Mohamed Ould-Elhassen Aoueileyine and Ridha Bouallegue

Appl. Sci. 2025, 15(20), 10870; https://doi.org/10.3390/app152010870 - 10 Oct 2025

Abstract

The conservation and protection of fish species are crucial tasks for aquaculture and marine biology. Recognizing fish in underwater environments is highly challenging due to poor lighting and the visual similarity between fish and the background. Conventional recognition methods are extremely time-consuming and often yield unsatisfactory accuracy. This paper proposes a new method called DeepFishNET+. First, an Underwater Image Enhancement module was implemented for image correction. Second, Global CNN Stream (RestNet50) and a Local Transformer Stream were implemented to generate the Feature Map and Feature Vector. Next, a feature fusion operation was performed in the Cross-Attention Feature Fusion module. Finally, Yolov8 was used for fish detection and localization. Softmax was applied for species recognition. This new approach achieved a classification precision of 98.28% and a detection precision of 92.74%. Full article

(This article belongs to the Special Issue Advances in Aquatic Animal Nutrition and Aquaculture)

►▼ Show Figures

Figure 1

23 pages, 2122 KB

Open AccessArticle

PSD-YOLO: An Enhanced Real-Time Framework for Robust Worker Detection in Complex Offshore Oil Platform Environments

by Yikun Qin, Jiawen Dong, Wei Li, Linxin Zhang, Ke Feng and Zijia Wang

Sensors 2025, 25(20), 6264; https://doi.org/10.3390/s25206264 - 10 Oct 2025

Abstract

To address the safety challenges for personnel in the complex and hazardous environments of offshore drilling platforms, this paper introduces the Platform Safety Detection YOLO (PSD-YOLO), an enhanced, real-time object detection framework based on YOLOv10s. The framework integrates several key innovations to improve detection robustness: first, the Channel Attention-Aware (CAA) mechanism is incorporated into the backbone network to effectively suppress complex background noise interference; second, a novel C2fCIB_Conv2Former module is designed in the neck to strengthen multi-scale feature fusion for small and occluded targets; finally, the Soft-NMS algorithm is employed in place of traditional NMS to significantly reduce missed detections in dense scenes. Experimental results on a custom offshore platform personnel dataset show that PSD-YOLO achieves a mean Average Precision (mAP@0.5) of 82.5% at an inference speed of 232.56 FPS. The efficient and accurate detection framework proposed in this study provides reliable technical support for automated safety monitoring systems, holds significant practical implications for reducing accident rates and safeguarding personnel by enabling real-time warnings of hazardous situations, fills a critical gap in intelligent sensor monitoring for offshore platforms and makes a significant contribution to advancing their safety monitoring systems. Full article

(This article belongs to the Section Intelligent Sensors)

►▼ Show Figures

Figure 1

Show export options Show export options

Select all

Export citation of selected articles as:

Error

Oops... you haven't selected anything for export.

Displaying article 1-50 on page 1 of 228.

Go to page 1 2 3 4 5

Search Results (11,379)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI