MDPI - Publisher of Open Access Journals

24 pages, 4735 KB

Open AccessArticle

An Improved YOLO11n-Based Algorithm for Road Sign Detection

by Haifeng Fu, Xinlei Xiao, Yonghua Han, Le Dai, Lan Yao and Lu Xu

Sensors 2026, 26(8), 2543; https://doi.org/10.3390/s26082543 - 20 Apr 2026

Viewed by 264

For vehicle driving scenarios in complex backgrounds, road sign detection faces challenges such as multi-scale targets, long-distances, and low-resolution. To address these challenges, a detection method based on an improved YOLO11n network is proposed. Firstly, to accommodate the multi-scale characteristics of the targets [...] Read more.

For vehicle driving scenarios in complex backgrounds, road sign detection faces challenges such as multi-scale targets, long-distances, and low-resolution. To address these challenges, a detection method based on an improved YOLO11n network is proposed. Firstly, to accommodate the multi-scale characteristics of the targets and improve the network’s ability to detect low-resolution objects and details, a Multi-path Gated Aggregation (MGA) Module is proposed, achieving these objectives via multi-dimensional feature extraction. Secondly, the Neck is improved by designing a network structure that incorporates high-resolution information from the Backbone, thereby enhancing the detection capabilities for small and blurry targets. Finally, an enhanced Spatial Pyramid Pooling—Fast (SPPF) module is proposed, wherein a Group Convolution-Layer Normalization-SiLU structure is integrated across various stages of information passing. By fusing adjacent channel information, it effectively suppresses complex background noise across multiple scales and amplifies road marking features, which consequently boosts the model’s discriminability for distant and obscured targets. Experimental results on a multi-type road sign dataset show that the improved model achieves an mAP@0.5 of 96.96%, which is 1.42% higher than the original model. The mAP@0.5–0.95 and Recall rates are 83.94% and 92.94%, respectively, while the inference speed remains at 134 FPS. Research demonstrates that via targeted modular designs, the proposed approach strikes a superior balance between detection accuracy and real-time efficiency. Consequently, it provides robust technical support for the reliable operation of intelligent vehicle perception systems under complex conditions. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Figure 1

20 pages, 4366 KB

Open AccessArticle

Intelligent Detection of Asphalt Pavement Cracks Based on Improved YOLOv8s

by Jinfei Su, Jicong Xu, Chuqiao Shi, Yuhan Wang, Shihao Dong and Xue Zhang

Coatings 2026, 16(3), 359; https://doi.org/10.3390/coatings16030359 - 12 Mar 2026

Viewed by 439

Abstract

The intelligent detection of asphalt pavement cracks has become increasingly important for ensuring service performance of road infrastructure. Traditional manual detection has significant safety hazards and insufficient accuracy. Furthermore, existing deep learning models still face challenges, including missed detection, false alarms, and poor [...] Read more.

The intelligent detection of asphalt pavement cracks has become increasingly important for ensuring service performance of road infrastructure. Traditional manual detection has significant safety hazards and insufficient accuracy. Furthermore, existing deep learning models still face challenges, including missed detection, false alarms, and poor performance in small target detection under complex conditions. This investigation adopts unmanned aerial vehicles (UAVs) to acquire pavement distress information and develops an intelligent detection approach for asphalt pavement crack based on improved YOLOv8s. First, the Spatial Pyramid Pooling Fast (SPPF) module is replaced with the Spatial Pyramid Pooling Fast with Cross Stage Partial Connections (SPPFCSPC) module in the backbone network to enhance the multi-scale feature fusion capability. Secondly, the Convolutional Block Attention Module (CBAM) module is introduced to the neck network to optimize the feature weights in both channel and spatial attention. Meanwhile, the Efficient Intersection over Union (EIoU) loss is adopted to improve accuracy. Finally, the Crack_Dataset is established, and the ablation experiments are conducted to verify the reliability of the detection model. The research indicates that the improved model achieves Precision, Recall, and mAP@0.5 of 83.9%, 79.6%, and 83.9%, respectively, representing increases of 1.5%, 1.3%, and 1.4%, compared with the baseline model. In comparison with mainstream object detection algorithms such as YOLOv5s and YOLOv8s, the proposed method attains an F1-score, mAP@0.5, and mAP@[0.5–0.95] of 0.82, 83.9%, and 46.6%, respectively, demonstrating a performance improvement. Based on the improved detection model, a pavement crack detection system was designed and implemented using PyQt5. This system supports image, video, and real-time camera input and detection. Full article

(This article belongs to the Special Issue Pavement Surface Status Evaluation and Smart Perception)

► Show Figures

Figure 1

25 pages, 7167 KB

Open AccessArticle

Edge-Enhanced YOLOV8 for Spacecraft Instance Segmentation in Cloud-Edge IoT Environments

by Ming Chen, Wenjie Chen, Yanfei Niu, Ping Qi and Fucheng Wang

Future Internet 2026, 18(1), 59; https://doi.org/10.3390/fi18010059 - 20 Jan 2026

Viewed by 470

Abstract

The proliferation of smart devices and the Internet of Things (IoT) has led to massive data generation, particularly in complex domains such as aerospace. Cloud computing provides essential scalability and advanced analytics for processing these vast datasets. However, relying solely on the cloud [...] Read more.

The proliferation of smart devices and the Internet of Things (IoT) has led to massive data generation, particularly in complex domains such as aerospace. Cloud computing provides essential scalability and advanced analytics for processing these vast datasets. However, relying solely on the cloud introduces significant challenges, including high latency, network congestion, and substantial bandwidth costs, which are critical for real-time on-orbit spacecraft services. Cloud-edge Internet of Things (cloud-edge IoT) computing emerges as a promising architecture to mitigate these issues by pushing computation closer to the data source. This paper proposes an improved YOLOV8-based model specifically designed for edge computing scenarios within a cloud-edge IoT framework. By integrating the Cross Stage Partial Spatial Pyramid Pooling Fast (CSPPF) module and the WDIOU loss function, the model achieves enhanced feature extraction and localization accuracy without significantly increasing computational cost, making it suitable for deployment on resource-constrained edge devices. Meanwhile, by processing image data locally at the edge and transmitting only the compact segmentation results to the cloud, the system effectively reduces bandwidth usage and supports efficient cloud-edge collaboration in IoT-based spacecraft monitoring systems. Experimental results show that, compared to the original YOLOV8 and other mainstream models, the proposed model demonstrates superior accuracy and instance segmentation performance at the edge, validating its practicality in cloud-edge IoT environments. Full article

(This article belongs to the Special Issue Convergence of IoT, Edge and Cloud Systems)

► Show Figures

Figure 1

18 pages, 4205 KB

Open AccessArticle

Research on Field Weed Target Detection Algorithm Based on Deep Learning

by Ziyang Chen, Le Wu, Zhenhong Jia, Jiajia Wang, Gang Zhou and Zhensen Zhang

Sensors 2026, 26(2), 677; https://doi.org/10.3390/s26020677 - 20 Jan 2026

Viewed by 453

Abstract

Weed detection algorithms based on deep learning are considered crucial for smart agriculture, with the YOLO series algorithms being widely adopted due to their efficiency. However, existing YOLO algorithms struggle to maintain high accuracy, while low parameter requirements and computational efficiency are achieved [...] Read more.

Weed detection algorithms based on deep learning are considered crucial for smart agriculture, with the YOLO series algorithms being widely adopted due to their efficiency. However, existing YOLO algorithms struggle to maintain high accuracy, while low parameter requirements and computational efficiency are achieved when weeds with occlusion or overlap are detected. To address this challenge, a target detection algorithm called SSS-YOLO based on YOLOv9t is proposed in this paper. First, the SCB (Spatial Channel Conv Block) module is introduced, in which large kernel convolution is employed to capture long-range dependencies, occluded weed regions are bypassed by being associated with unobstructed areas, and features of unobstructed regions are enhanced through inter-channel relationships. Second, the SPPF EGAS (Spatial Pyramid Pooling Fast Edge Gaussian Aggregation Super) module is proposed, where multi-scale max pooling is utilized to extract hierarchical contextual features, large receptive fields are leveraged to acquire background information around occluded objects, and features of weed regions obscured by crops are inferred. Finally, the EMSN (Efficient Multi-Scale Spatial-Feedforward Network) module is developed, through which semantic information of occluded regions is reconstructed by contextual reasoning and background vegetation interference is effectively suppressed while visible regional details are preserved. To validate the performance of this method, experiments are conducted on both our self-built dataset and the publicly available Cotton WeedDet12 dataset. The results demonstrate that compared to existing algorithms, significant performance improvements are achieved by the proposed method. Full article

(This article belongs to the Section Smart Agriculture)

► Show Figures

Figure 1

28 pages, 3390 KB

Open AccessArticle

SDC-YOLOv8: An Improved Algorithm for Road Defect Detection Through Attention-Enhanced Feature Learning and Adaptive Feature Reconstruction

by Hao Yang, Yulong Song, Yue Liang, Enhao Tang and Danyang Cao

Sensors 2026, 26(2), 609; https://doi.org/10.3390/s26020609 - 16 Jan 2026

Viewed by 731

Abstract

Road defect detection is essential for timely road damage repair and traffic safety assurance. However, existing object detection algorithms suffer from insufficient accuracy in detecting small road surface defects and are prone to missed detections and false alarms under complex lighting and background [...] Read more.

Road defect detection is essential for timely road damage repair and traffic safety assurance. However, existing object detection algorithms suffer from insufficient accuracy in detecting small road surface defects and are prone to missed detections and false alarms under complex lighting and background conditions. To address these challenges, this study proposes SDC-YOLOv8, an improved YOLOv8-based algorithm for road defect detection that employs attention-enhanced feature learning and adaptive feature reconstruction. The model incorporates three key innovations: (1) an SPPF-LSKA module that integrates Fast Spatial Pyramid Pooling with Large Separable Kernel Attention to enhance multi-scale feature representation and irregular defect modeling capabilities; (2) DySample dynamic upsampling that replaces conventional interpolation methods for adaptive feature reconstruction with reduced computational cost; and (3) a Coordinate Attention module strategically inserted to improve spatial localization accuracy under complex conditions. Comprehensive experiments on a public pothole dataset demonstrate that SDC-YOLOv8 achieves 78.0% mAP@0.5, 81.0% Precision, and 70.7% Recall while maintaining real-time performance at 85 FPS. Compared to the baseline YOLOv8n model, the proposed method improves mAP@0.5 by 2.0 percentage points, Precision by 3.3 percentage points, and Recall by 1.8 percentage points, yielding an F1 score of 75.5%. These results demonstrate that SDC-YOLOv8 effectively enhances small-target detection accuracy while preserving real-time processing capability, offering a practical and efficient solution for intelligent road defect detection applications. Full article

(This article belongs to the Section Fault Diagnosis & Sensors)

► Show Figures

Figure 1

23 pages, 91075 KB

Open AccessArticle

Improved Lightweight Marine Oil Spill Detection Using the YOLOv8 Algorithm

by Jianting Shi, Tianyu Jiao, Daniel P. Ames, Yinan Chen and Zhonghua Xie

Appl. Sci. 2026, 16(2), 780; https://doi.org/10.3390/app16020780 - 12 Jan 2026

Viewed by 597

Abstract

Marine oil spill detection using Synthetic Aperture Radar (SAR) is crucial but challenged by dynamic marine conditions, diverse spill scales, and limitations in existing algorithms regarding model size and real-time performance. To address these challenges, we propose LSFE-YOLO, a YOLOv8s-optimized (You Only Look [...] Read more.

Marine oil spill detection using Synthetic Aperture Radar (SAR) is crucial but challenged by dynamic marine conditions, diverse spill scales, and limitations in existing algorithms regarding model size and real-time performance. To address these challenges, we propose LSFE-YOLO, a YOLOv8s-optimized (You Only Look Once version 8) lightweight model with an original, domain-tailored synergistic integration of FasterNet, GN-LSC Head (GroupNorm Lightweight Shared Convolution Head), and C2f_MBE (C2f Mobile Bottleneck Enhanced). FasterNet serves as the backbone (25% neck width reduction), leveraging partial convolution (PConv) to minimize memory access and redundant computations—overcoming traditional lightweight backbones’ high memory overhead—laying the foundation for real-time deployment while preserving feature extraction. The proposed GN-LSC Head replaces YOLOv8’s decoupled head: its shared convolutions reduce parameter redundancy by approximately 40%, and GroupNorm (Group Normalization) ensures stable accuracy under edge computing’s small-batch constraints, outperforming BatchNorm (Batch Normalization) in resource-limited scenarios. The C2f_MBE module integrates EffectiveSE (Effective Squeeze and Excitation)-optimized MBConv (Mobile Inverted Bottleneck Convolution) into C2f: MBConv’s inverted-residual design enhances multi-scale feature capture, while lightweight EffectiveSE strengthens discriminative oil spill features without extra computation, addressing the original C2f’s scale variability insufficiency. Additionally, an SE (Squeeze and Excitation) attention mechanism embedded upstream of SPPF (Spatial Pyramid Pooling Fast) suppresses background interference (e.g., waves, biological oil films), synergizing with FasterNet and C2f_MBE to form a cascaded feature optimization pipeline that refines representations throughout the model. Experimental results show that LSFE-YOLO improves mAP (mean Average Precision) by 1.3% and F1 score by 1.7% over YOLOv8s, while achieving substantial reductions in model size (81.9%), parameter count (82.9%), and computational cost (84.2%), alongside a 20 FPS (Frames Per Second) increase in detection speed. LSFE-YOLO offers an efficient and effective solution for real-time marine oil spill detection. Full article

(This article belongs to the Special Issue Remote Sensing Applications in Agricultural, Earth and Environmental Science, 2nd Edition)

► Show Figures

Figure 1

21 pages, 6330 KB

Open AccessArticle

Improved YOLOv9 with Dual Convolution and LSKA Attention for Robust Small Defect Detection in Textiles

by Chang Xuan, Weimin Shi, Lei Sun, Ji Wu, Yongchao Zhang and Jiajia Tu

Processes 2026, 14(1), 149; https://doi.org/10.3390/pr14010149 - 1 Jan 2026

Viewed by 570

Abstract

To mitigate the challenges of false positives and undetected small-scale defects in fabric inspection, this study proposes an advanced fabric defect detection system that leverages an optimized YOLOv9 algorithm. First, redundant computations are reduced by introducing DualConv to replace standard convolution. Second, the [...] Read more.

To mitigate the challenges of false positives and undetected small-scale defects in fabric inspection, this study proposes an advanced fabric defect detection system that leverages an optimized YOLOv9 algorithm. First, redundant computations are reduced by introducing DualConv to replace standard convolution. Second, the LSKA attention mechanism is incorporated to increase the weight of important features, thereby enhancing the accuracy of small target detection and improving the generalization ability. Additionally, the focal modulation network is employed to replace the fast spatial pyramid module, mitigating the loss of detailed information caused by the feature pooling operation. Furthermore, the conventional feature pyramid network is replaced with bidirectional feature pyramid network, which is utilized for efficient feature fusion, thereby enhancing multiscale feature representation and improving detection accuracy. Finally, the bounding box loss function is optimized by introducing the shape-IoU loss function, which facilitates more rapid model convergence and significantly improves detection accuracy. Experiments conducted on a fabric defect dataset demonstrate that the proposed algorithm yields a 6.7% increase in mAP@0.5 and a 14.7% improvement in mAP@0.5–0.95, while simultaneously reducing the model’s total parameters by 17.8% and computational FLOPs by 14.4%, compared with those of the original algorithm. The improved YOLOv9 model significantly enhances the precision and accuracy of defect detection while maintaining inference speed (55.8 FPS) that meets industrial requirements. Full article

(This article belongs to the Section Process Control and Monitoring)

► Show Figures

Figure 1

22 pages, 3870 KB

Open AccessArticle

Accurate Pose Detection Method for Rail Fastener Clips Based on Improved YOLOv8-Pose

by Defang Lv, Jianjun Meng, Zhenhan Ren, Liqing Yao and Gengqi Liu

Appl. Sci. 2026, 16(1), 276; https://doi.org/10.3390/app16010276 - 26 Dec 2025

Viewed by 390

Abstract

Minor displacements and deflections of rail fastener clips pose a critical risk to railway safety, which are difficult to quantify accurately using traditional object detection methods. This paper proposes an improved YOLOv8-pose-based method, You Only Look Once version 8-pose with GAM, SPPF-Attention, and [...] Read more.

Minor displacements and deflections of rail fastener clips pose a critical risk to railway safety, which are difficult to quantify accurately using traditional object detection methods. This paper proposes an improved YOLOv8-pose-based method, You Only Look Once version 8-pose with GAM, SPPF-Attention, and Wise-IoU (YOLOv8-pose-GSW) for automated and quantitative pose detection of fastener clips. Firstly, a high-precision keypoint detection network is constructed by integrating a Global Attention Mechanism (GAM) into the neck, enhancing the Spatial Pyramid Pooling Fast (SPPF) module to Spatial Pyramid Pooling Fast with Attention (SPPF-Attention) in the backbone, and adopting the Wise Intersection over Union (Wise-IoU) loss function. Subsequently, a posterior verification mechanism based on spatial constraint error is designed to eliminate unreliable detections by leveraging the inherent geometric priors of fasteners. Finally, the deflection angle, longitudinal displacement, and lateral displacement of the clip are calculated from the verified keypoints. Experimental results demonstrate that the proposed method achieves an Average Precision at IoU threshold from 0.5 to 0.95 (AP@0.5:0.95) of 77.5%, representing a 3.6% improvement over the baseline YOLOv8s-pose model, effectively balancing detection accuracy and computational efficiency. This work provides a reliable technical solution for the refined maintenance of rail fasteners. Full article

► Show Figures

Figure 1

23 pages, 5151 KB

Open AccessArticle

Small-Target Detection Algorithm Based on Improved YOLOv11n

by Ke Zeng, Wangsheng Yu, Xianxiang Qin and Siyu Long

Sensors 2026, 26(1), 71; https://doi.org/10.3390/s26010071 - 22 Dec 2025

Cited by 2 | Viewed by 1123

Abstract

Target detection in UAV aerial photography scenarios faces challenges of small targets and complex backgrounds. Thus, we proposed an improved YOLOv11n small-target detection algorithm. First, a detection head is added to the 160 × 160 resolution feature layer, and non-adjacent layer feature is [...] Read more.

Target detection in UAV aerial photography scenarios faces challenges of small targets and complex backgrounds. Thus, we proposed an improved YOLOv11n small-target detection algorithm. First, a detection head is added to the 160 × 160 resolution feature layer, and non-adjacent layer feature is fused via Asymptotic Feature Pyramid Network (AFPN) to alleviate feature loss caused by downsampling and reduce cross-level feature conflicts. Second, the Spatial Channel Attention SPPF (SCASPPF) module replaces the original Spatial Pyramid Pooling-Fast (SPPF) module to highlight key features and suppress irrelevant ones. Moreover, the loss function is enhanced by fusing MPDIoU and InnerIoU to boost detection accuracy. Finally, Inception Deep Convolution (IDC) is adopted to improve the C3k2 module, expanding the model’s receptive field and enhancing small-target detection performance. Experiments on the Visdrone2019 dataset show that the algorithm achieves 39.256% mAP@0.5, 6.689% higher than 32.567% mAP@0.5 of the benchmark model (YOLOv11n). Full article

(This article belongs to the Section Sensor Networks)

► Show Figures

Figure 1

22 pages, 5334 KB

Open AccessArticle

Two-Stage Multi-Label Detection Method for Railway Fasteners Based on Type-Guided Expert Model

by Defang Lv, Jianjun Meng, Gaoyang Meng, Yanni Shen, Liqing Yao and Gengqi Liu

Appl. Sci. 2025, 15(24), 13093; https://doi.org/10.3390/app152413093 - 12 Dec 2025

Cited by 1 | Viewed by 468

Abstract

Railway track fasteners, serving as critical connecting components, have a reliability that directly impacts railway operational safety. To address the performance bottlenecks of existing detection methods in handling complex scenarios with diverse fastener types and co-occurring multiple defects, this paper proposes a Type-Guided [...] Read more.

Railway track fasteners, serving as critical connecting components, have a reliability that directly impacts railway operational safety. To address the performance bottlenecks of existing detection methods in handling complex scenarios with diverse fastener types and co-occurring multiple defects, this paper proposes a Type-Guided Expert Model-based Fastener Detection and Diagnosis framework (TGEM-FDD) based on You Only Look Once (YOLO) v8. This framework follows a “type-identification-first, defect-diagnosis-second” paradigm, decoupling the complex task: the first stage employs an enhanced YOLOv8s with Deepstar, SPPF-attention, and DySample (YOLOv8s-DSD) detector integrating Deepstar Block, Spatial Pyramid Pooling Fast with Attention (SPPF-Attention), and Dynamic Sample (DySample) modules for precise fastener localization and type identification; the second stage dynamically invokes a specialized multi-label classification “expert model” based on the identified type to achieve accurate diagnosis of multiple defects. This study constructs a multi-label fastener image dataset containing 4800 samples to support model training and validation. Experimental results demonstrate that the proposed YOLOv8s-DSD model achieves a remarkable 98.5% mean average precision at an Intersection over Union threshold of 0.5 (mAP@0.5) in the first-stage task, outperforming the original YOLOv8s baseline and several mainstream detection models. In end-to-end system performance evaluation, the TGEM-FDD framework attains a comprehensive Task mean average precision (Task mAP) of 88.1% and a macro-average F1 score for defect diagnosis of 86.5%, significantly surpassing unified single-model detection and multi-task separate-head methods. This effectively validates the superiority of the proposed approach in tackling fastener type diversity and defect multi-label complexity, offering a viable solution for fine-grained component management in complex industrial scenarios. Full article

► Show Figures

Figure 1

14 pages, 2582 KB

Open AccessArticle

Seafood Object Detection Method Based on Improved YOLOv5s

by Nan Zhu, Zhaohua Liu, Zhongxun Wang and Zheng Xie

Sensors 2025, 25(24), 7546; https://doi.org/10.3390/s25247546 - 12 Dec 2025

Viewed by 588

Abstract

To address the issues of false positives and missed detections commonly observed in traditional underwater seafood object detection algorithms, this paper proposes an improved detection method based on YOLOv5s. Specifically, we introduce a Spatial–Channel Synergistic Attention (SCSA) module after the Fast Spatial Pyramid [...] Read more.

To address the issues of false positives and missed detections commonly observed in traditional underwater seafood object detection algorithms, this paper proposes an improved detection method based on YOLOv5s. Specifically, we introduce a Spatial–Channel Synergistic Attention (SCSA) module after the Fast Spatial Pyramid Pooling layer in the backbone network. This module adopts a synergistic mechanism where the channel attention guides spatial localization, and the spatial attention feeds back to optimize channel weights, dynamically enhancing the unique features of aquatic targets (such as sea cucumber folds) while suppressing seawater background interference. In addition, we replace some C3 modules in YOLOv5s with our designed three-scale convolution dual-path variable-kernel module based on Pinwheel-shaped Convolution (C3k2-PSConv). This module strengthens the model’s ability to capture multi-dimensional features of aquatic targets, especially in the feature extraction of small-sized and occluded targets, reducing the false detection rate while ensuring the model’s lightweight property. The enhanced model is evaluated on the URPC dataset, which contains real-world underwater imagery of echinus, starfish, holothurian, and scallop. The experimental results show that compared with the baseline model YOLOv5s, while maintaining real-time inference speed, the proposed method in this paper increases the mean average precision (mAP) by 2.3% and reduces the number of parameters by approximately 2.4%, significantly improving the model’s operational efficiency. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

31 pages, 4757 KB

Open AccessArticle

MFEF-YOLO: A Multi-Scale Feature Extraction and Fusion Network for Small Object Detection in Aerial Imagery over Open Water

by Qi Liu, Haiyang Yu, Ping Zhang, Tingting Geng, Xinru Yuan, Bingqian Ji, Shengmin Zhu and Ruopu Ma

Remote Sens. 2025, 17(24), 3996; https://doi.org/10.3390/rs17243996 - 11 Dec 2025

Cited by 3 | Viewed by 1362

Abstract

Current object detection using UAV platforms in open water faces challenges such as low detection accuracy, limited storage, and constrained computational capabilities. To address these issues, we propose MFEF-YOLO, a small object detection network based on multi-scale feature extraction and fusion. First, we [...] Read more.

Current object detection using UAV platforms in open water faces challenges such as low detection accuracy, limited storage, and constrained computational capabilities. To address these issues, we propose MFEF-YOLO, a small object detection network based on multi-scale feature extraction and fusion. First, we introduce a Dual-Branch Spatial Pyramid Pooling Fast (DBSPPF) module in the backbone network to replace the original SPPF module, while integrating ODConv and C3k2 modules to collectively enhance feature extraction capabilities. Second, we improve small object detection by adding a P2 detection head and reduce model parameters by removing the P5 detection head. Finally, we design an Island-based Multi-scale Feature Fusion Network (IMFFNet) and employ a Coordinate-guided Multi-scale Feature Fusion Module (CMFFM) to strengthen contextual information and boost detection accuracy. We validate the effectiveness of MFEF-YOLO using the public dataset SeaDronesSee and our custom dataset TPDNV. Experimental results show that compared to the baseline model, mAP₅₀ improves by 0.11 and 0.03 using the two datasets, respectively, while model parameters are reduced by 11.54%. Furthermore, DBSPPF and IMFFNet demonstrate superior performance in comparative studies with other methods, confirming their effectiveness. These improvements and outstanding performance make MFEF-YOLO particularly suitable for UAV-based object detection in open waters. Full article

(This article belongs to the Special Issue Deep Learning-Based Small-Target Detection in Remote Sensing)

► Show Figures

Figure 1

19 pages, 10997 KB

Open AccessArticle

YOLO-AEB: PCB Surface Defect Detection Based on Adaptive Multi-Branch Attention and Efficient Atrous Spatial Pyramid Pooling

by Chengzhi Deng, Yingbo Wu, Zhaoming Wu, Weiwei Zhou, You Zhang, Xiaowei Sun and Shengqian Wang

Computers 2025, 14(12), 543; https://doi.org/10.3390/computers14120543 - 10 Dec 2025

Cited by 1 | Viewed by 642

Abstract

The surface defect detection of printed circuit boards (PCBs) plays a crucial role in the field of industrial manufacturing. However, the existing PCB defect detection methods have great challenges in detecting the accuracy of tiny defects under the complex background due to its [...] Read more.

The surface defect detection of printed circuit boards (PCBs) plays a crucial role in the field of industrial manufacturing. However, the existing PCB defect detection methods have great challenges in detecting the accuracy of tiny defects under the complex background due to its compact layout. To address this problem, we propose a novel YOLO-AMBA-EASPP-BiFPN (YOLO-AEB) network based on the YOLOv10 framework that achieves high precision and real-time detection of tiny defects through multi-level architecture optimization. In the backbone network, an adaptive multi-branch attention mechanism (AMBA) is first proposed, which employs an adaptive reweighting algorithm (ARA) to dynamically optimize fusion weights within the multi-branch attention mechanism (MBA), thereby optimizing the ability to represent tiny defects under complex background noise. Then, an efficient atrous spatial pyramid pooling (EASPP) is constructed, which fuses AMBA and atrous spatial pyramid pooling-fast (ASPF). This integration effectively mitigates feature degradation while preserving expansive receptive fields, and the extraction of defect detail features is strengthened. In the neck network, the bidirectional feature pyramid network (BiFPN) is used to replace the conventional path aggregation network (PAN), and the bidirectional cross-scale feature fusion mechanism is used to improve the transfer ability of shallow detail features to deep networks. Comprehensive experimental evaluations demonstrate that our proposed network achieves state-of-the-art performance, whose F₁ score can reach 95.7% and mean average precision (mAP) can reach 97%, representing respective improvements of 7.1% and 5.8% over the baseline YOLOv10 model. Feature visualization analysis further verifies the effectiveness and feasibility of YOLO-AEB. Full article

► Show Figures

Figure 1

18 pages, 2645 KB

Open AccessArticle

Advancing YOLOv8-Based Wafer Notch-Angle Detection Using Oriented Bounding Boxes, Hyperparameter Tuning, Architecture Refinement, and Transfer Learning

by Eun Seok Jun, Hyo Jun Sim and Seung Jae Moon

Appl. Sci. 2025, 15(21), 11507; https://doi.org/10.3390/app152111507 - 28 Oct 2025

Viewed by 1554

Abstract

Accurate angular alignment of wafers is essential in ion implantation to prevent channeling effects that degrade device performance. This study proposes a real-time notch-angle-detection system based on you only look once version 8 with oriented bounding boxes (YOLOv8-OBB). The proposed method compares YOLOv8 [...] Read more.

Accurate angular alignment of wafers is essential in ion implantation to prevent channeling effects that degrade device performance. This study proposes a real-time notch-angle-detection system based on you only look once version 8 with oriented bounding boxes (YOLOv8-OBB). The proposed method compares YOLOv8 and YOLOv8-OBB, demonstrating the superiority of the latter in accurately capturing rotational features. To enhance detection performance, hyperparameters—including initial learning rate (Lr0), weight decay, and optimizer—are optimized using an one factor at a time (OFAT) approach followed by grid search. Architectural improvements, including spatial pyramid pooling fast with large selective kernel attention (SPPF_LSKA), a bidirectional feature pyramid network (BiFPN), and a high-resolution detection head (P2 head), are incorporated to improve small-object detection. Furthermore, a gradual unfreezing strategy is employed to support more effective and stable transfer learning. The final system is evaluated over 100 training epochs and tracked up to 5000 epochs to verify long-term stability. Compared to baseline models, it achieves higher accuracy and robustness in angle-sensitive scenarios, offering a reliable and scalable solution for high-precision wafer-notch detection in semiconductor manufacturing. Full article

(This article belongs to the Special Issue Advances in Machine Learning and Data Mining: Emerging Trends and Applications)

► Show Figures

Figure 1

22 pages, 48696 KB

Open AccessArticle

Liquid Reservoir Weld Defect Detection Based on Improved YOLOv8s

by Zonghang Li, Tao Song, Bin Zhou, Yupei Zhang, Shifan Yu, Songxiao Cao, Zhipeng Xu and Qing Jiang

Sensors 2025, 25(21), 6521; https://doi.org/10.3390/s25216521 - 23 Oct 2025

Viewed by 918

Abstract

The liquid reservoir is a critical component of the automotive air conditioning system, while weld seams on its surface may exhibit different types of defects with various shapes and scales, meaning traditional detection methods struggle to detect them effectively. In this article, we [...] Read more.

The liquid reservoir is a critical component of the automotive air conditioning system, while weld seams on its surface may exhibit different types of defects with various shapes and scales, meaning traditional detection methods struggle to detect them effectively. In this article, we propose a YOLOv8s-based algorithm to detect liquid reservoir weld defects. In order to improve feature fusion within the neck and enhance the model’s capacity to detect defects showing substantial size variations, the neck is optimized through the integration of the improved Reparameterized Generalized Feature Pyramid Network (RepGFPN) and the addition of a small-object detection head. To further improve the capacity of identifying complex defects, the Spatial Pyramid Pooling Fast (SPPF) module in YOLOv8s is substituted with Focal Modulation Networks (FocalNets). Additionally, the Cascaded Group Attention (CGA) mechanism is incorporated into the improved neck to minimize the propagation of redundant feature information. Experimental results indicate that the improved YOLOv8s achieves a 6.3% improvement in mAP@0.5 and a 4.3% improvement in mAP@0.5:0.95 compared to the original model. The AP value for detecting craters, porosity, undercuts, and lack of fusion defects improves by 3.9%, 13.5%, 5.0%, and 2.5%, respectively. We conducted comparative experiments against other state-of-the-art models on the liquid reservoir weld dataset and the steel pipe weld defect dataset, and the results show that our model has outstanding detection performance. Full article

(This article belongs to the Section Fault Diagnosis & Sensors)

► Show Figures

Figure 1

Search Results (102)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (102)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI