Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (102)

Search Parameters:
Keywords = spatial pyramid pooling fast

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
24 pages, 4735 KB  
Article
An Improved YOLO11n-Based Algorithm for Road Sign Detection
by Haifeng Fu, Xinlei Xiao, Yonghua Han, Le Dai, Lan Yao and Lu Xu
Sensors 2026, 26(8), 2543; https://doi.org/10.3390/s26082543 - 20 Apr 2026
Viewed by 264
Abstract
For vehicle driving scenarios in complex backgrounds, road sign detection faces challenges such as multi-scale targets, long-distances, and low-resolution. To address these challenges, a detection method based on an improved YOLO11n network is proposed. Firstly, to accommodate the multi-scale characteristics of the targets [...] Read more.
For vehicle driving scenarios in complex backgrounds, road sign detection faces challenges such as multi-scale targets, long-distances, and low-resolution. To address these challenges, a detection method based on an improved YOLO11n network is proposed. Firstly, to accommodate the multi-scale characteristics of the targets and improve the network’s ability to detect low-resolution objects and details, a Multi-path Gated Aggregation (MGA) Module is proposed, achieving these objectives via multi-dimensional feature extraction. Secondly, the Neck is improved by designing a network structure that incorporates high-resolution information from the Backbone, thereby enhancing the detection capabilities for small and blurry targets. Finally, an enhanced Spatial Pyramid Pooling—Fast (SPPF) module is proposed, wherein a Group Convolution-Layer Normalization-SiLU structure is integrated across various stages of information passing. By fusing adjacent channel information, it effectively suppresses complex background noise across multiple scales and amplifies road marking features, which consequently boosts the model’s discriminability for distant and obscured targets. Experimental results on a multi-type road sign dataset show that the improved model achieves an mAP@0.5 of 96.96%, which is 1.42% higher than the original model. The mAP@0.5–0.95 and Recall rates are 83.94% and 92.94%, respectively, while the inference speed remains at 134 FPS. Research demonstrates that via targeted modular designs, the proposed approach strikes a superior balance between detection accuracy and real-time efficiency. Consequently, it provides robust technical support for the reliable operation of intelligent vehicle perception systems under complex conditions. Full article
(This article belongs to the Section Vehicular Sensing)
Show Figures

Figure 1

20 pages, 4366 KB  
Article
Intelligent Detection of Asphalt Pavement Cracks Based on Improved YOLOv8s
by Jinfei Su, Jicong Xu, Chuqiao Shi, Yuhan Wang, Shihao Dong and Xue Zhang
Coatings 2026, 16(3), 359; https://doi.org/10.3390/coatings16030359 - 12 Mar 2026
Viewed by 439
Abstract
The intelligent detection of asphalt pavement cracks has become increasingly important for ensuring service performance of road infrastructure. Traditional manual detection has significant safety hazards and insufficient accuracy. Furthermore, existing deep learning models still face challenges, including missed detection, false alarms, and poor [...] Read more.
The intelligent detection of asphalt pavement cracks has become increasingly important for ensuring service performance of road infrastructure. Traditional manual detection has significant safety hazards and insufficient accuracy. Furthermore, existing deep learning models still face challenges, including missed detection, false alarms, and poor performance in small target detection under complex conditions. This investigation adopts unmanned aerial vehicles (UAVs) to acquire pavement distress information and develops an intelligent detection approach for asphalt pavement crack based on improved YOLOv8s. First, the Spatial Pyramid Pooling Fast (SPPF) module is replaced with the Spatial Pyramid Pooling Fast with Cross Stage Partial Connections (SPPFCSPC) module in the backbone network to enhance the multi-scale feature fusion capability. Secondly, the Convolutional Block Attention Module (CBAM) module is introduced to the neck network to optimize the feature weights in both channel and spatial attention. Meanwhile, the Efficient Intersection over Union (EIoU) loss is adopted to improve accuracy. Finally, the Crack_Dataset is established, and the ablation experiments are conducted to verify the reliability of the detection model. The research indicates that the improved model achieves Precision, Recall, and mAP@0.5 of 83.9%, 79.6%, and 83.9%, respectively, representing increases of 1.5%, 1.3%, and 1.4%, compared with the baseline model. In comparison with mainstream object detection algorithms such as YOLOv5s and YOLOv8s, the proposed method attains an F1-score, mAP@0.5, and mAP@[0.5–0.95] of 0.82, 83.9%, and 46.6%, respectively, demonstrating a performance improvement. Based on the improved detection model, a pavement crack detection system was designed and implemented using PyQt5. This system supports image, video, and real-time camera input and detection. Full article
(This article belongs to the Special Issue Pavement Surface Status Evaluation and Smart Perception)
Show Figures

Figure 1

25 pages, 7167 KB  
Article
Edge-Enhanced YOLOV8 for Spacecraft Instance Segmentation in Cloud-Edge IoT Environments
by Ming Chen, Wenjie Chen, Yanfei Niu, Ping Qi and Fucheng Wang
Future Internet 2026, 18(1), 59; https://doi.org/10.3390/fi18010059 - 20 Jan 2026
Viewed by 470
Abstract
The proliferation of smart devices and the Internet of Things (IoT) has led to massive data generation, particularly in complex domains such as aerospace. Cloud computing provides essential scalability and advanced analytics for processing these vast datasets. However, relying solely on the cloud [...] Read more.
The proliferation of smart devices and the Internet of Things (IoT) has led to massive data generation, particularly in complex domains such as aerospace. Cloud computing provides essential scalability and advanced analytics for processing these vast datasets. However, relying solely on the cloud introduces significant challenges, including high latency, network congestion, and substantial bandwidth costs, which are critical for real-time on-orbit spacecraft services. Cloud-edge Internet of Things (cloud-edge IoT) computing emerges as a promising architecture to mitigate these issues by pushing computation closer to the data source. This paper proposes an improved YOLOV8-based model specifically designed for edge computing scenarios within a cloud-edge IoT framework. By integrating the Cross Stage Partial Spatial Pyramid Pooling Fast (CSPPF) module and the WDIOU loss function, the model achieves enhanced feature extraction and localization accuracy without significantly increasing computational cost, making it suitable for deployment on resource-constrained edge devices. Meanwhile, by processing image data locally at the edge and transmitting only the compact segmentation results to the cloud, the system effectively reduces bandwidth usage and supports efficient cloud-edge collaboration in IoT-based spacecraft monitoring systems. Experimental results show that, compared to the original YOLOV8 and other mainstream models, the proposed model demonstrates superior accuracy and instance segmentation performance at the edge, validating its practicality in cloud-edge IoT environments. Full article
(This article belongs to the Special Issue Convergence of IoT, Edge and Cloud Systems)
Show Figures

Figure 1

18 pages, 4205 KB  
Article
Research on Field Weed Target Detection Algorithm Based on Deep Learning
by Ziyang Chen, Le Wu, Zhenhong Jia, Jiajia Wang, Gang Zhou and Zhensen Zhang
Sensors 2026, 26(2), 677; https://doi.org/10.3390/s26020677 - 20 Jan 2026
Viewed by 453
Abstract
Weed detection algorithms based on deep learning are considered crucial for smart agriculture, with the YOLO series algorithms being widely adopted due to their efficiency. However, existing YOLO algorithms struggle to maintain high accuracy, while low parameter requirements and computational efficiency are achieved [...] Read more.
Weed detection algorithms based on deep learning are considered crucial for smart agriculture, with the YOLO series algorithms being widely adopted due to their efficiency. However, existing YOLO algorithms struggle to maintain high accuracy, while low parameter requirements and computational efficiency are achieved when weeds with occlusion or overlap are detected. To address this challenge, a target detection algorithm called SSS-YOLO based on YOLOv9t is proposed in this paper. First, the SCB (Spatial Channel Conv Block) module is introduced, in which large kernel convolution is employed to capture long-range dependencies, occluded weed regions are bypassed by being associated with unobstructed areas, and features of unobstructed regions are enhanced through inter-channel relationships. Second, the SPPF EGAS (Spatial Pyramid Pooling Fast Edge Gaussian Aggregation Super) module is proposed, where multi-scale max pooling is utilized to extract hierarchical contextual features, large receptive fields are leveraged to acquire background information around occluded objects, and features of weed regions obscured by crops are inferred. Finally, the EMSN (Efficient Multi-Scale Spatial-Feedforward Network) module is developed, through which semantic information of occluded regions is reconstructed by contextual reasoning and background vegetation interference is effectively suppressed while visible regional details are preserved. To validate the performance of this method, experiments are conducted on both our self-built dataset and the publicly available Cotton WeedDet12 dataset. The results demonstrate that compared to existing algorithms, significant performance improvements are achieved by the proposed method. Full article
(This article belongs to the Section Smart Agriculture)
Show Figures

Figure 1

28 pages, 3390 KB  
Article
SDC-YOLOv8: An Improved Algorithm for Road Defect Detection Through Attention-Enhanced Feature Learning and Adaptive Feature Reconstruction
by Hao Yang, Yulong Song, Yue Liang, Enhao Tang and Danyang Cao
Sensors 2026, 26(2), 609; https://doi.org/10.3390/s26020609 - 16 Jan 2026
Viewed by 731
Abstract
Road defect detection is essential for timely road damage repair and traffic safety assurance. However, existing object detection algorithms suffer from insufficient accuracy in detecting small road surface defects and are prone to missed detections and false alarms under complex lighting and background [...] Read more.
Road defect detection is essential for timely road damage repair and traffic safety assurance. However, existing object detection algorithms suffer from insufficient accuracy in detecting small road surface defects and are prone to missed detections and false alarms under complex lighting and background conditions. To address these challenges, this study proposes SDC-YOLOv8, an improved YOLOv8-based algorithm for road defect detection that employs attention-enhanced feature learning and adaptive feature reconstruction. The model incorporates three key innovations: (1) an SPPF-LSKA module that integrates Fast Spatial Pyramid Pooling with Large Separable Kernel Attention to enhance multi-scale feature representation and irregular defect modeling capabilities; (2) DySample dynamic upsampling that replaces conventional interpolation methods for adaptive feature reconstruction with reduced computational cost; and (3) a Coordinate Attention module strategically inserted to improve spatial localization accuracy under complex conditions. Comprehensive experiments on a public pothole dataset demonstrate that SDC-YOLOv8 achieves 78.0% mAP@0.5, 81.0% Precision, and 70.7% Recall while maintaining real-time performance at 85 FPS. Compared to the baseline YOLOv8n model, the proposed method improves mAP@0.5 by 2.0 percentage points, Precision by 3.3 percentage points, and Recall by 1.8 percentage points, yielding an F1 score of 75.5%. These results demonstrate that SDC-YOLOv8 effectively enhances small-target detection accuracy while preserving real-time processing capability, offering a practical and efficient solution for intelligent road defect detection applications. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

23 pages, 91075 KB  
Article
Improved Lightweight Marine Oil Spill Detection Using the YOLOv8 Algorithm
by Jianting Shi, Tianyu Jiao, Daniel P. Ames, Yinan Chen and Zhonghua Xie
Appl. Sci. 2026, 16(2), 780; https://doi.org/10.3390/app16020780 - 12 Jan 2026
Viewed by 597
Abstract
Marine oil spill detection using Synthetic Aperture Radar (SAR) is crucial but challenged by dynamic marine conditions, diverse spill scales, and limitations in existing algorithms regarding model size and real-time performance. To address these challenges, we propose LSFE-YOLO, a YOLOv8s-optimized (You Only Look [...] Read more.
Marine oil spill detection using Synthetic Aperture Radar (SAR) is crucial but challenged by dynamic marine conditions, diverse spill scales, and limitations in existing algorithms regarding model size and real-time performance. To address these challenges, we propose LSFE-YOLO, a YOLOv8s-optimized (You Only Look Once version 8) lightweight model with an original, domain-tailored synergistic integration of FasterNet, GN-LSC Head (GroupNorm Lightweight Shared Convolution Head), and C2f_MBE (C2f Mobile Bottleneck Enhanced). FasterNet serves as the backbone (25% neck width reduction), leveraging partial convolution (PConv) to minimize memory access and redundant computations—overcoming traditional lightweight backbones’ high memory overhead—laying the foundation for real-time deployment while preserving feature extraction. The proposed GN-LSC Head replaces YOLOv8’s decoupled head: its shared convolutions reduce parameter redundancy by approximately 40%, and GroupNorm (Group Normalization) ensures stable accuracy under edge computing’s small-batch constraints, outperforming BatchNorm (Batch Normalization) in resource-limited scenarios. The C2f_MBE module integrates EffectiveSE (Effective Squeeze and Excitation)-optimized MBConv (Mobile Inverted Bottleneck Convolution) into C2f: MBConv’s inverted-residual design enhances multi-scale feature capture, while lightweight EffectiveSE strengthens discriminative oil spill features without extra computation, addressing the original C2f’s scale variability insufficiency. Additionally, an SE (Squeeze and Excitation) attention mechanism embedded upstream of SPPF (Spatial Pyramid Pooling Fast) suppresses background interference (e.g., waves, biological oil films), synergizing with FasterNet and C2f_MBE to form a cascaded feature optimization pipeline that refines representations throughout the model. Experimental results show that LSFE-YOLO improves mAP (mean Average Precision) by 1.3% and F1 score by 1.7% over YOLOv8s, while achieving substantial reductions in model size (81.9%), parameter count (82.9%), and computational cost (84.2%), alongside a 20 FPS (Frames Per Second) increase in detection speed. LSFE-YOLO offers an efficient and effective solution for real-time marine oil spill detection. Full article
Show Figures

Figure 1

21 pages, 6330 KB  
Article
Improved YOLOv9 with Dual Convolution and LSKA Attention for Robust Small Defect Detection in Textiles
by Chang Xuan, Weimin Shi, Lei Sun, Ji Wu, Yongchao Zhang and Jiajia Tu
Processes 2026, 14(1), 149; https://doi.org/10.3390/pr14010149 - 1 Jan 2026
Viewed by 570
Abstract
To mitigate the challenges of false positives and undetected small-scale defects in fabric inspection, this study proposes an advanced fabric defect detection system that leverages an optimized YOLOv9 algorithm. First, redundant computations are reduced by introducing DualConv to replace standard convolution. Second, the [...] Read more.
To mitigate the challenges of false positives and undetected small-scale defects in fabric inspection, this study proposes an advanced fabric defect detection system that leverages an optimized YOLOv9 algorithm. First, redundant computations are reduced by introducing DualConv to replace standard convolution. Second, the LSKA attention mechanism is incorporated to increase the weight of important features, thereby enhancing the accuracy of small target detection and improving the generalization ability. Additionally, the focal modulation network is employed to replace the fast spatial pyramid module, mitigating the loss of detailed information caused by the feature pooling operation. Furthermore, the conventional feature pyramid network is replaced with bidirectional feature pyramid network, which is utilized for efficient feature fusion, thereby enhancing multiscale feature representation and improving detection accuracy. Finally, the bounding box loss function is optimized by introducing the shape-IoU loss function, which facilitates more rapid model convergence and significantly improves detection accuracy. Experiments conducted on a fabric defect dataset demonstrate that the proposed algorithm yields a 6.7% increase in mAP@0.5 and a 14.7% improvement in mAP@0.5–0.95, while simultaneously reducing the model’s total parameters by 17.8% and computational FLOPs by 14.4%, compared with those of the original algorithm. The improved YOLOv9 model significantly enhances the precision and accuracy of defect detection while maintaining inference speed (55.8 FPS) that meets industrial requirements. Full article
(This article belongs to the Section Process Control and Monitoring)
Show Figures

Figure 1

22 pages, 3870 KB  
Article
Accurate Pose Detection Method for Rail Fastener Clips Based on Improved YOLOv8-Pose
by Defang Lv, Jianjun Meng, Zhenhan Ren, Liqing Yao and Gengqi Liu
Appl. Sci. 2026, 16(1), 276; https://doi.org/10.3390/app16010276 - 26 Dec 2025
Viewed by 390
Abstract
Minor displacements and deflections of rail fastener clips pose a critical risk to railway safety, which are difficult to quantify accurately using traditional object detection methods. This paper proposes an improved YOLOv8-pose-based method, You Only Look Once version 8-pose with GAM, SPPF-Attention, and [...] Read more.
Minor displacements and deflections of rail fastener clips pose a critical risk to railway safety, which are difficult to quantify accurately using traditional object detection methods. This paper proposes an improved YOLOv8-pose-based method, You Only Look Once version 8-pose with GAM, SPPF-Attention, and Wise-IoU (YOLOv8-pose-GSW) for automated and quantitative pose detection of fastener clips. Firstly, a high-precision keypoint detection network is constructed by integrating a Global Attention Mechanism (GAM) into the neck, enhancing the Spatial Pyramid Pooling Fast (SPPF) module to Spatial Pyramid Pooling Fast with Attention (SPPF-Attention) in the backbone, and adopting the Wise Intersection over Union (Wise-IoU) loss function. Subsequently, a posterior verification mechanism based on spatial constraint error is designed to eliminate unreliable detections by leveraging the inherent geometric priors of fasteners. Finally, the deflection angle, longitudinal displacement, and lateral displacement of the clip are calculated from the verified keypoints. Experimental results demonstrate that the proposed method achieves an Average Precision at IoU threshold from 0.5 to 0.95 (AP@0.5:0.95) of 77.5%, representing a 3.6% improvement over the baseline YOLOv8s-pose model, effectively balancing detection accuracy and computational efficiency. This work provides a reliable technical solution for the refined maintenance of rail fasteners. Full article
Show Figures

Figure 1

23 pages, 5151 KB  
Article
Small-Target Detection Algorithm Based on Improved YOLOv11n
by Ke Zeng, Wangsheng Yu, Xianxiang Qin and Siyu Long
Sensors 2026, 26(1), 71; https://doi.org/10.3390/s26010071 - 22 Dec 2025
Cited by 2 | Viewed by 1123
Abstract
Target detection in UAV aerial photography scenarios faces challenges of small targets and complex backgrounds. Thus, we proposed an improved YOLOv11n small-target detection algorithm. First, a detection head is added to the 160 × 160 resolution feature layer, and non-adjacent layer feature is [...] Read more.
Target detection in UAV aerial photography scenarios faces challenges of small targets and complex backgrounds. Thus, we proposed an improved YOLOv11n small-target detection algorithm. First, a detection head is added to the 160 × 160 resolution feature layer, and non-adjacent layer feature is fused via Asymptotic Feature Pyramid Network (AFPN) to alleviate feature loss caused by downsampling and reduce cross-level feature conflicts. Second, the Spatial Channel Attention SPPF (SCASPPF) module replaces the original Spatial Pyramid Pooling-Fast (SPPF) module to highlight key features and suppress irrelevant ones. Moreover, the loss function is enhanced by fusing MPDIoU and InnerIoU to boost detection accuracy. Finally, Inception Deep Convolution (IDC) is adopted to improve the C3k2 module, expanding the model’s receptive field and enhancing small-target detection performance. Experiments on the Visdrone2019 dataset show that the algorithm achieves 39.256% mAP@0.5, 6.689% higher than 32.567% mAP@0.5 of the benchmark model (YOLOv11n). Full article
(This article belongs to the Section Sensor Networks)
Show Figures

Figure 1

22 pages, 5334 KB  
Article
Two-Stage Multi-Label Detection Method for Railway Fasteners Based on Type-Guided Expert Model
by Defang Lv, Jianjun Meng, Gaoyang Meng, Yanni Shen, Liqing Yao and Gengqi Liu
Appl. Sci. 2025, 15(24), 13093; https://doi.org/10.3390/app152413093 - 12 Dec 2025
Cited by 1 | Viewed by 468
Abstract
Railway track fasteners, serving as critical connecting components, have a reliability that directly impacts railway operational safety. To address the performance bottlenecks of existing detection methods in handling complex scenarios with diverse fastener types and co-occurring multiple defects, this paper proposes a Type-Guided [...] Read more.
Railway track fasteners, serving as critical connecting components, have a reliability that directly impacts railway operational safety. To address the performance bottlenecks of existing detection methods in handling complex scenarios with diverse fastener types and co-occurring multiple defects, this paper proposes a Type-Guided Expert Model-based Fastener Detection and Diagnosis framework (TGEM-FDD) based on You Only Look Once (YOLO) v8. This framework follows a “type-identification-first, defect-diagnosis-second” paradigm, decoupling the complex task: the first stage employs an enhanced YOLOv8s with Deepstar, SPPF-attention, and DySample (YOLOv8s-DSD) detector integrating Deepstar Block, Spatial Pyramid Pooling Fast with Attention (SPPF-Attention), and Dynamic Sample (DySample) modules for precise fastener localization and type identification; the second stage dynamically invokes a specialized multi-label classification “expert model” based on the identified type to achieve accurate diagnosis of multiple defects. This study constructs a multi-label fastener image dataset containing 4800 samples to support model training and validation. Experimental results demonstrate that the proposed YOLOv8s-DSD model achieves a remarkable 98.5% mean average precision at an Intersection over Union threshold of 0.5 (mAP@0.5) in the first-stage task, outperforming the original YOLOv8s baseline and several mainstream detection models. In end-to-end system performance evaluation, the TGEM-FDD framework attains a comprehensive Task mean average precision (Task mAP) of 88.1% and a macro-average F1 score for defect diagnosis of 86.5%, significantly surpassing unified single-model detection and multi-task separate-head methods. This effectively validates the superiority of the proposed approach in tackling fastener type diversity and defect multi-label complexity, offering a viable solution for fine-grained component management in complex industrial scenarios. Full article
Show Figures

Figure 1

14 pages, 2582 KB  
Article
Seafood Object Detection Method Based on Improved YOLOv5s
by Nan Zhu, Zhaohua Liu, Zhongxun Wang and Zheng Xie
Sensors 2025, 25(24), 7546; https://doi.org/10.3390/s25247546 - 12 Dec 2025
Viewed by 588
Abstract
To address the issues of false positives and missed detections commonly observed in traditional underwater seafood object detection algorithms, this paper proposes an improved detection method based on YOLOv5s. Specifically, we introduce a Spatial–Channel Synergistic Attention (SCSA) module after the Fast Spatial Pyramid [...] Read more.
To address the issues of false positives and missed detections commonly observed in traditional underwater seafood object detection algorithms, this paper proposes an improved detection method based on YOLOv5s. Specifically, we introduce a Spatial–Channel Synergistic Attention (SCSA) module after the Fast Spatial Pyramid Pooling layer in the backbone network. This module adopts a synergistic mechanism where the channel attention guides spatial localization, and the spatial attention feeds back to optimize channel weights, dynamically enhancing the unique features of aquatic targets (such as sea cucumber folds) while suppressing seawater background interference. In addition, we replace some C3 modules in YOLOv5s with our designed three-scale convolution dual-path variable-kernel module based on Pinwheel-shaped Convolution (C3k2-PSConv). This module strengthens the model’s ability to capture multi-dimensional features of aquatic targets, especially in the feature extraction of small-sized and occluded targets, reducing the false detection rate while ensuring the model’s lightweight property. The enhanced model is evaluated on the URPC dataset, which contains real-world underwater imagery of echinus, starfish, holothurian, and scallop. The experimental results show that compared with the baseline model YOLOv5s, while maintaining real-time inference speed, the proposed method in this paper increases the mean average precision (mAP) by 2.3% and reduces the number of parameters by approximately 2.4%, significantly improving the model’s operational efficiency. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

31 pages, 4757 KB  
Article
MFEF-YOLO: A Multi-Scale Feature Extraction and Fusion Network for Small Object Detection in Aerial Imagery over Open Water
by Qi Liu, Haiyang Yu, Ping Zhang, Tingting Geng, Xinru Yuan, Bingqian Ji, Shengmin Zhu and Ruopu Ma
Remote Sens. 2025, 17(24), 3996; https://doi.org/10.3390/rs17243996 - 11 Dec 2025
Cited by 3 | Viewed by 1362
Abstract
Current object detection using UAV platforms in open water faces challenges such as low detection accuracy, limited storage, and constrained computational capabilities. To address these issues, we propose MFEF-YOLO, a small object detection network based on multi-scale feature extraction and fusion. First, we [...] Read more.
Current object detection using UAV platforms in open water faces challenges such as low detection accuracy, limited storage, and constrained computational capabilities. To address these issues, we propose MFEF-YOLO, a small object detection network based on multi-scale feature extraction and fusion. First, we introduce a Dual-Branch Spatial Pyramid Pooling Fast (DBSPPF) module in the backbone network to replace the original SPPF module, while integrating ODConv and C3k2 modules to collectively enhance feature extraction capabilities. Second, we improve small object detection by adding a P2 detection head and reduce model parameters by removing the P5 detection head. Finally, we design an Island-based Multi-scale Feature Fusion Network (IMFFNet) and employ a Coordinate-guided Multi-scale Feature Fusion Module (CMFFM) to strengthen contextual information and boost detection accuracy. We validate the effectiveness of MFEF-YOLO using the public dataset SeaDronesSee and our custom dataset TPDNV. Experimental results show that compared to the baseline model, mAP50 improves by 0.11 and 0.03 using the two datasets, respectively, while model parameters are reduced by 11.54%. Furthermore, DBSPPF and IMFFNet demonstrate superior performance in comparative studies with other methods, confirming their effectiveness. These improvements and outstanding performance make MFEF-YOLO particularly suitable for UAV-based object detection in open waters. Full article
(This article belongs to the Special Issue Deep Learning-Based Small-Target Detection in Remote Sensing)
Show Figures

Figure 1

19 pages, 10997 KB  
Article
YOLO-AEB: PCB Surface Defect Detection Based on Adaptive Multi-Branch Attention and Efficient Atrous Spatial Pyramid Pooling
by Chengzhi Deng, Yingbo Wu, Zhaoming Wu, Weiwei Zhou, You Zhang, Xiaowei Sun and Shengqian Wang
Computers 2025, 14(12), 543; https://doi.org/10.3390/computers14120543 - 10 Dec 2025
Cited by 1 | Viewed by 642
Abstract
The surface defect detection of printed circuit boards (PCBs) plays a crucial role in the field of industrial manufacturing. However, the existing PCB defect detection methods have great challenges in detecting the accuracy of tiny defects under the complex background due to its [...] Read more.
The surface defect detection of printed circuit boards (PCBs) plays a crucial role in the field of industrial manufacturing. However, the existing PCB defect detection methods have great challenges in detecting the accuracy of tiny defects under the complex background due to its compact layout. To address this problem, we propose a novel YOLO-AMBA-EASPP-BiFPN (YOLO-AEB) network based on the YOLOv10 framework that achieves high precision and real-time detection of tiny defects through multi-level architecture optimization. In the backbone network, an adaptive multi-branch attention mechanism (AMBA) is first proposed, which employs an adaptive reweighting algorithm (ARA) to dynamically optimize fusion weights within the multi-branch attention mechanism (MBA), thereby optimizing the ability to represent tiny defects under complex background noise. Then, an efficient atrous spatial pyramid pooling (EASPP) is constructed, which fuses AMBA and atrous spatial pyramid pooling-fast (ASPF). This integration effectively mitigates feature degradation while preserving expansive receptive fields, and the extraction of defect detail features is strengthened. In the neck network, the bidirectional feature pyramid network (BiFPN) is used to replace the conventional path aggregation network (PAN), and the bidirectional cross-scale feature fusion mechanism is used to improve the transfer ability of shallow detail features to deep networks. Comprehensive experimental evaluations demonstrate that our proposed network achieves state-of-the-art performance, whose F1 score can reach 95.7% and mean average precision (mAP) can reach 97%, representing respective improvements of 7.1% and 5.8% over the baseline YOLOv10 model. Feature visualization analysis further verifies the effectiveness and feasibility of YOLO-AEB. Full article
Show Figures

Figure 1

18 pages, 2645 KB  
Article
Advancing YOLOv8-Based Wafer Notch-Angle Detection Using Oriented Bounding Boxes, Hyperparameter Tuning, Architecture Refinement, and Transfer Learning
by Eun Seok Jun, Hyo Jun Sim and Seung Jae Moon
Appl. Sci. 2025, 15(21), 11507; https://doi.org/10.3390/app152111507 - 28 Oct 2025
Viewed by 1554
Abstract
Accurate angular alignment of wafers is essential in ion implantation to prevent channeling effects that degrade device performance. This study proposes a real-time notch-angle-detection system based on you only look once version 8 with oriented bounding boxes (YOLOv8-OBB). The proposed method compares YOLOv8 [...] Read more.
Accurate angular alignment of wafers is essential in ion implantation to prevent channeling effects that degrade device performance. This study proposes a real-time notch-angle-detection system based on you only look once version 8 with oriented bounding boxes (YOLOv8-OBB). The proposed method compares YOLOv8 and YOLOv8-OBB, demonstrating the superiority of the latter in accurately capturing rotational features. To enhance detection performance, hyperparameters—including initial learning rate (Lr0), weight decay, and optimizer—are optimized using an one factor at a time (OFAT) approach followed by grid search. Architectural improvements, including spatial pyramid pooling fast with large selective kernel attention (SPPF_LSKA), a bidirectional feature pyramid network (BiFPN), and a high-resolution detection head (P2 head), are incorporated to improve small-object detection. Furthermore, a gradual unfreezing strategy is employed to support more effective and stable transfer learning. The final system is evaluated over 100 training epochs and tracked up to 5000 epochs to verify long-term stability. Compared to baseline models, it achieves higher accuracy and robustness in angle-sensitive scenarios, offering a reliable and scalable solution for high-precision wafer-notch detection in semiconductor manufacturing. Full article
Show Figures

Figure 1

22 pages, 48696 KB  
Article
Liquid Reservoir Weld Defect Detection Based on Improved YOLOv8s
by Zonghang Li, Tao Song, Bin Zhou, Yupei Zhang, Shifan Yu, Songxiao Cao, Zhipeng Xu and Qing Jiang
Sensors 2025, 25(21), 6521; https://doi.org/10.3390/s25216521 - 23 Oct 2025
Viewed by 918
Abstract
The liquid reservoir is a critical component of the automotive air conditioning system, while weld seams on its surface may exhibit different types of defects with various shapes and scales, meaning traditional detection methods struggle to detect them effectively. In this article, we [...] Read more.
The liquid reservoir is a critical component of the automotive air conditioning system, while weld seams on its surface may exhibit different types of defects with various shapes and scales, meaning traditional detection methods struggle to detect them effectively. In this article, we propose a YOLOv8s-based algorithm to detect liquid reservoir weld defects. In order to improve feature fusion within the neck and enhance the model’s capacity to detect defects showing substantial size variations, the neck is optimized through the integration of the improved Reparameterized Generalized Feature Pyramid Network (RepGFPN) and the addition of a small-object detection head. To further improve the capacity of identifying complex defects, the Spatial Pyramid Pooling Fast (SPPF) module in YOLOv8s is substituted with Focal Modulation Networks (FocalNets). Additionally, the Cascaded Group Attention (CGA) mechanism is incorporated into the improved neck to minimize the propagation of redundant feature information. Experimental results indicate that the improved YOLOv8s achieves a 6.3% improvement in mAP@0.5 and a 4.3% improvement in mAP@0.5:0.95 compared to the original model. The AP value for detecting craters, porosity, undercuts, and lack of fusion defects improves by 3.9%, 13.5%, 5.0%, and 2.5%, respectively. We conducted comparative experiments against other state-of-the-art models on the liquid reservoir weld dataset and the steel pipe weld defect dataset, and the results show that our model has outstanding detection performance. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

Back to TopTop