Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

Search Results (968)

Search Parameters:
Keywords = lightweight backbone network

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 2747 KB  
Article
Identification of the Picking Stage for Volvariella Volvacea Fruiting Bodies Using an Improved YOLO11n Model
by Haitao Yin, Jinpeng Wang, Bin Zhou, Yongqi Chao and Hongping Zhou
Agriculture 2026, 16(13), 1371; https://doi.org/10.3390/agriculture16131371 (registering DOI) - 23 Jun 2026
Abstract
Accurate and rapid detection of Volvariella volvacea (straw mushroom) fruiting bodies at harvestable maturity is a critical prerequisite for automated industrial cultivation. However, existing detection methods often yield high false-negative and false-positive rates when processing a small-scale, densely distributed, and heavily occluded targets [...] Read more.
Accurate and rapid detection of Volvariella volvacea (straw mushroom) fruiting bodies at harvestable maturity is a critical prerequisite for automated industrial cultivation. However, existing detection methods often yield high false-negative and false-positive rates when processing a small-scale, densely distributed, and heavily occluded targets against complex straw substrate backgrounds. Furthermore, these methods frequently struggle to balance the competing requirements of architectural efficiency (such as parameter volume and computational complexity) and real-time performance for edge computing. To address these challenges, this study proposes a YOLO11n-CPDM, a lightweight detection model based on an improved YOLO11n architecture. The model incorporates synergistic optimizations across feature extraction, fusion, and reconstruction. First, a Dual Coordinate Attention Feature Extraction mechanism is integrated into the C3k2 bottleneck blocks of the backbone network. This enhances target perception in complex, occluded environments by concurrently modeling global context and local salient features. Second, within the neck network, the standard attention module is replaced with the PnPNystraAttention module, coupled with the DySample dynamic upsampling operator. This modification strengthens contextual relationships among multi-scale features and improves spatial consistency during reconstruction while preserving linear computational complexity. Finally, the detection head is optimized using MBConv blocks based on an inverted residual structure to minimize parameter volume. Experimental results on a custom V. volvacea dataset demonstrate that the proposed YOLO11n-CPDM model achieves significant performance gains, with Precision (P), Recall (R), and Mean Average Precision (mAP50) reaching 86.8%, 87.5%, and 88.4%, respectively. These figures represent improvements of 2.7, 3.0, and 3.2 percentage points over the baseline YOLO11n model. Additionally, the model size is reduced to 4.8 MB (a 12.7% decrease), while achieving inference speeds of 42.7 FPS on Jetson AGX Orin and 21.2 FPS on Jetson Nano, outperforming the baseline model on both embedded platforms. Consequently, the proposed model effectively enhances detection performance in complex environments while maintaining excellent lightweight characteristics and deployment flexibility, providing a solid technical foundation for intelligent perception and automated harvesting of V. volvacea. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
27 pages, 7020 KB  
Article
MSA-YOLO: An Optimized UAV Object Detection Algorithm for Low-Visibility Maritime
by Longcheng Huang, Mengguang Liao, Shaoning Li, Chuanguang Zhu and Sichun Long
Remote Sens. 2026, 18(13), 2065; https://doi.org/10.3390/rs18132065 (registering DOI) - 23 Jun 2026
Abstract
Maritime search and rescue is an important component of emergency response frameworks and primarily relies on Unmanned Aerial Vehicles (UAVs) for maritime object detection. However, maritime accidents frequently occur in low-visibility environments, such as foggy or low-light conditions, which lead to low contrast, [...] Read more.
Maritime search and rescue is an important component of emergency response frameworks and primarily relies on Unmanned Aerial Vehicles (UAVs) for maritime object detection. However, maritime accidents frequently occur in low-visibility environments, such as foggy or low-light conditions, which lead to low contrast, blurred object boundaries, and degraded texture representations. Most existing maritime object detection algorithms are developed for natural light scenes, and their performance deteriorates markedly when deployed directly in low-visibility environments, primarily due to reduced image quality that hinders feature extraction and semantic information aggregation. Although several studies incorporate image enhancement techniques prior to detection to improve image quality, these approaches often introduce significant additional computational overhead, limiting their practical deployment on UAV platforms. To tackle these challenges, this paper proposes a lightweight model built upon a recent YOLO framework, termed Multi-Scale Adaptive YOLO (MSA-YOLO), for maritime detection using UAVs in low-visibility environments. The proposed model systematically optimizes the backbone, neck, and detection head networks. Specifically, an improved StarNet backbone is designed by integrating Efficient Channel Attention (ECA) mechanisms and multi-scale convolutional kernels, which strengthen feature extraction capability while maintaining low computational overhead. In the neck network, a high-frequency enhanced residual block branch is inserted into the C3k2 module to capture richer detailed information, while depthwise separable convolution is utilized to further reduce computational cost. Moreover, a non-parametric attention module is incorporated into the detection head to adaptively optimize features in the classification and regression branches. Finally, a joint loss function that combines bounding box regression, classification, and distribution focal losses is utilized to improve detection accuracy and training stability. Experimental results on the constructed AFO, Zhoushan Island, and Shandong Province datasets demonstrate that, relative to YOLOv11-s, MSA-YOLO reduces model parameters and FLOPs by 52.07% and 41.36%, respectively, while achieving improvements of 1.11% and 1.33% in mAP@0.5:0.95 and mAP@0.5. These results indicate that the proposed method effectively balances computational efficiency and detection accuracy, rendering it suitable for practical maritime search and rescue applications in low-visibility environments. Full article
Show Figures

Figure 1

22 pages, 5229 KB  
Article
Extracting Alpine Shrub Using Improved Lightweight DeepLabV3+ Network
by Wangping Li, Xingling Cao, Zhaoye Zhou, Longlong Shi, Xiaodong Wu, Wenbo Wei, Yanjun Bian, Xiuxia Zhang, Niu Wang and Cong Wang
Remote Sens. 2026, 18(12), 2055; https://doi.org/10.3390/rs18122055 (registering DOI) - 22 Jun 2026
Viewed by 149
Abstract
In recent years, shrubland is an important land cover type in alpine regions, while accurate segmentation of shrubs using remote sensing data remain challenging. To address these issues, this study proposes an alpine shrub segmentation method based on an improved lightweight DeepLabV3+ network, [...] Read more.
In recent years, shrubland is an important land cover type in alpine regions, while accurate segmentation of shrubs using remote sensing data remain challenging. To address these issues, this study proposes an alpine shrub segmentation method based on an improved lightweight DeepLabV3+ network, in which MobileNetV2 is used to replace the original backbone to reduce model complexity while maintaining feature representation capability, a channel squeeze-and-excitation (cSE) attention module is introduced to enhance the response to key shrub features and boundary details, and Ghost convolution is incorporated to reduce computational redundancy while preserving segmentation accuracy. Experimental results from both ablation and comparative studies demonstrate that the proposed model achieves a mean intersection over union (MIoU) of 88.47%, mean pixel accuracy (mPA) of 92.93%, F1-score of 91.80%, and overall accuracy of 94.52%, representing improvements of 3.53%, 2.64%, 2.96%, and 1.69%, respectively, over the original DeepLabV3+ model, while also significantly reducing the number of parameters and model size. In addition, independent cross-year validation using unmanned aerial vehicle (UAV) imagery acquired in 2025 suggests that the proposed model has good applicability under similar UAV sensor and acquisition conditions. Overall, this study provides an effective lightweight semantic segmentation approach for alpine shrub segmentation from high-resolution UAV imagery and offers useful technical support for vegetation monitoring in alpine regions such as the Qinghai–Tibet Plateau. Full article
(This article belongs to the Special Issue Advances in Deep Learning Approaches: UAV Data Analysis)
Show Figures

Figure 1

29 pages, 12456 KB  
Article
A Lightweight Drainage Pipe Defect Detection Method Based on an Improved YOLO11 Network
by Rui Xue, Hongtao Fu, Hui Zhao and Chongquan Wang
Information 2026, 17(6), 613; https://doi.org/10.3390/info17060613 (registering DOI) - 21 Jun 2026
Viewed by 85
Abstract
Drainage pipe defect detection is essential for maintaining the normal operation of urban infrastructure. In recent years, deep learning-based object detection methods have provided an effective technical solution for drainage pipe defect recognition. Among them, YOLO-series models have demonstrated strong potential in visual [...] Read more.
Drainage pipe defect detection is essential for maintaining the normal operation of urban infrastructure. In recent years, deep learning-based object detection methods have provided an effective technical solution for drainage pipe defect recognition. Among them, YOLO-series models have demonstrated strong potential in visual detection tasks due to their end-to-end architecture and high inference efficiency. However, directly applying baseline YOLO models may still face challenges such as limited detection accuracy, relatively high model complexity, and insufficient adaptability for lightweight deployment scenarios. To address these issues, this paper proposes a lightweight drainage pipe defect detection method based on an improved YOLO11 network. Rather than treating detection enhancement and model compression as two separate procedures, the proposed method integrates feature enhancement, adaptive pruning, and distillation-based recovery into a unified lightweight detection framework. Specifically, an improved SimAM attention mechanism is introduced into the backbone and integrated with the C3k2 module to construct the C3K2_SWS module, aiming to enhance the representation capability of critical defect features. In the neck network, a focused diffusion pyramid network with a dimension-aware selective fusion structure, termed FDPN-DASI, is designed to strengthen multi-scale feature interactions. In addition, an adaptive-threshold focal loss (ATFL) is introduced to improve the learning capability for hard samples. For efficient deployment, the LAMP pruning algorithm is further improved, and an entropy-guided entropy-adaptive magnitude-based pruning method (EA-LAMP) is proposed to enable adaptive allocation of pruning ratios across different network layers. Moreover, BCKD knowledge distillation is applied after pruning to mitigate the accuracy degradation caused by model compression. Experimental results indicate that the proposed lightweight YOLO11-SFA+EA+BCKD framework achieves a precision of 92.4%, a recall of 88.5%, and an mAP50 of 93.3%, while maintaining a compact model size of 1.6 M parameters and 4.5 G FLOPs. Compared with the baseline model, the proposed method improves precision, recall, and mAP50 by 5.9%, 5.0%, and 4.7%, respectively, while reducing the number of parameters, FLOPs, and model size by 1.0 M, 1.8 G, and 2.1 M, respectively. These results suggest that the proposed framework can improve detection performance while reducing model complexity under the current experimental setting, indicating its potential for lightweight drainage pipe defect detection tasks. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

20 pages, 1947 KB  
Article
Dynamic Distillation-Aided Federated Learning for Intrusion Detection in Heterogeneous Edge Networks
by Fan Wang and Weimin Chen
Electronics 2026, 15(12), 2728; https://doi.org/10.3390/electronics15122728 (registering DOI) - 21 Jun 2026
Viewed by 98
Abstract
Intrusion detection serves as a core technology for securing heterogeneous edge networks, including IoT, industrial edges, and 5G networks. However, existing federated learning-based intrusion detection systems suffer from environmental heterogeneity, limited sample availability, and severe class imbalance—issues that result in inefficient resource allocation [...] Read more.
Intrusion detection serves as a core technology for securing heterogeneous edge networks, including IoT, industrial edges, and 5G networks. However, existing federated learning-based intrusion detection systems suffer from environmental heterogeneity, limited sample availability, and severe class imbalance—issues that result in inefficient resource allocation and compromised detection performance against rare attacks. In this paper, we propose a novel lightweight intrusion detection model for heterogeneous edge networks, named FedNIDS-CNN, which is based on dynamic distillation-aided federated learning with a CNN backbone. In the data preprocessing phase, a two-level class balancing strategy integrating nearest-neighbor interpolation augmentation and adaptive synthetic sampling is employed to ensure distortion-free sample synthesis. For feature and model optimization, principal component analysis (PCA) is used to reduce the dimensionality of traffic features, while a lightweight 1D-CNN is adopted as the base model to alleviate computational overhead on edge devices. During federated training and knowledge aggregation, a dynamic weight distillation loss mechanism is designed to enhance the model’s ability to recognize minority-class attacks. Meanwhile, the federated framework supports client-side local training and server-side weighted soft-label aggregation, enabling effective knowledge fusion across heterogeneous models. Experimental results on the CICIDS2017 dataset demonstrate that the proposed method achieves an accuracy of 98.55% and an F1-score of 98.40%. Benefiting from the soft-label transmission and parameter-free aggregation design, the framework gets rid of the constraint of homogeneous model architecture and natively supports heterogeneous network models and edge devices with different computing capabilities. It also significantly reduces communication traffic and per-round training latency, confirming its excellent real-time performance and applicability in resource-constrained edge environments. Full article
(This article belongs to the Special Issue IoT Security in the Age of AI: Innovative Approaches and Technologies)
Show Figures

Figure 1

26 pages, 35295 KB  
Article
A Lightweight Framework for Tea Shoot Detection and Plucking Point Localization Enabled by Modified YOLOv11s-Seg Model
by Yongmao Huang, Yuankai Luo, Yuanxi Mu and Haiyan Jin
Agriculture 2026, 16(12), 1357; https://doi.org/10.3390/agriculture16121357 (registering DOI) - 20 Jun 2026
Viewed by 217
Abstract
In this work, a lightweight framework enabled by the modified YOLOv11s-seg model for tea shoot detection and plucking point localization is proposed. Detecting tea shoots and localizing plucking points with higher accuracy generally require larger model size and more model parameters, making it [...] Read more.
In this work, a lightweight framework enabled by the modified YOLOv11s-seg model for tea shoot detection and plucking point localization is proposed. Detecting tea shoots and localizing plucking points with higher accuracy generally require larger model size and more model parameters, making it difficult to balance accuracy and lightweighting. To overcome this limitation, a modified lightweight YOLOv11s-seg model is developed. First, the multi-scale edge information enhancement is introduced into the conventional YOLOv11s-seg to extract edge feature better and improve the detection accuracy of tea shoots. Meanwhile, context anchor attention is utilized to modify the cross stage partial spatial attention module in a backbone network to improve the detection capability for small objects. Moreover, the detail calibration reconstruction feature pyramid network is proposed. It utilizes spatial and contextual semantic information to reconstruct and calibrate features in key regions, enhancing the capability for object fusion and recognition at various scales. Furthermore, with the modified model performing instance segmentation to acquire the contour of each tea shoot, the coordinates of the three lowest pixel points in the contour are captured to localize the plucking point based on the average coordinates. In addition, the layer-adaptive magnitude-based pruning (LAMP) method is used to lighten the model. The experimental results show that the LAMP-pruned modified YOLOv11s-seg model with a speedup ratio of 1.5 achieves a mAP@0.5 of 86.5% for tea shoot detection, exhibiting a 4.7 percentage point improvement over the conventional YOLOv11s-seg model. Moreover, it exhibits an accuracy of 81.9% for plucking point localization on the validation and test subsets with 232 images in total, and its number of parameters, model size and floating point operations (FLOPs) separately achieve reductions of 67.3%, 66.2%, and 24.9% over the conventional model as well. Therefore, the proposed LAMP-pruned modified model shows good balance between lightweighting and detection accuracy. Finally, the modified LAMP-pruned YOLOv11s-seg model is deployed on a Jetson Orin NX edge module and measured in a tea plantation, with the measured results exhibiting a detection speed of 34.1 FPS and verifying its availability in practical applications. Full article
(This article belongs to the Special Issue Advances in Precision Agriculture in Orchard)
Show Figures

Figure 1

17 pages, 15918 KB  
Article
ADA-YOLO: An Adaptive Dynamic Aggregation Network for Small Object Detection in UAV Imagery
by Jiajun Chen, Shaochen Jiang, Yongming Li, Sulaiman Tuersunayi and Yong Liu
Sensors 2026, 26(12), 3908; https://doi.org/10.3390/s26123908 (registering DOI) - 19 Jun 2026
Viewed by 270
Abstract
Unmanned Aerial Vehicle (UAV) image object detection holds significant application value in the low-altitude economy, traffic monitoring, intelligent agriculture, and disaster rescue. However, due to the top-down perspective, UAV images typically suffer from challenges such as small target scales, dense object distribution, severe [...] Read more.
Unmanned Aerial Vehicle (UAV) image object detection holds significant application value in the low-altitude economy, traffic monitoring, intelligent agriculture, and disaster rescue. However, due to the top-down perspective, UAV images typically suffer from challenges such as small target scales, dense object distribution, severe occlusions, and complex backgrounds. These issues often limit the recall and localization accuracy of general-purpose detectors when they are directly applied to UAV small-object detection scenarios. To address these aforementioned challenges, this paper proposes an Adaptive Dynamic Aggregation YOLO network, termed ADA-YOLO. The novelty of ADA-YOLO lies in its highly efficient combinatorial design specifically tailored for UAV small object detection, while retaining the efficient backbone of YOLOv8, we systematically reconstruct the neck and detection head to improve accuracy. Specifically, a high-resolution P2 detection branch is incorporated to construct a P2–P5 multi-scale prediction structure. Furthermore, the lightweight DySample dynamic upsampling module is adopted to replace traditional upsampling methods, and an Adaptive Spatial Feature Fusion (ASFF) mechanism is introduced to alleviate semantic conflicts and noise interference during multi-scale feature fusion. This synergistic combination explicitly addresses multi-scale representation challenges and enhances small-object detection performance in complex scenes. Comparative experiments with the baseline YOLOv8n on the VisDrone2019 dataset demonstrate that ADA-YOLO achieves an improvement of 11.3% in mAP@0.5 and 8.2% in mAP@0.5:0.95. The improved model achieves these performance gains with a modest parameter increase and acceptable computational complexity. Finally, ablation experiments further validate the effectiveness of each individual module and their synergistic gains. Full article
(This article belongs to the Section Remote Sensors)
Show Figures

Figure 1

20 pages, 4527 KB  
Article
A Re-Parameterized Lightweight Residual Attention Framework for Resource-Constrained Edge Computing
by Yuze Gao, Jiamin Zhu, Xiaoxiao Liu and Wei Wu
Computers 2026, 15(6), 395; https://doi.org/10.3390/computers15060395 (registering DOI) - 19 Jun 2026
Viewed by 175
Abstract
Edge vision systems require convolutional neural networks (CNNs) that preserve recognition accuracy under strict storage, computation, and latency constraints. Although ResNet18 is a compact residual backbone, direct deployment on resource-constrained devices remains costly, whereas simple channel reduction weakens representation capacity. This study aims [...] Read more.
Edge vision systems require convolutional neural networks (CNNs) that preserve recognition accuracy under strict storage, computation, and latency constraints. Although ResNet18 is a compact residual backbone, direct deployment on resource-constrained devices remains costly, whereas simple channel reduction weakens representation capacity. This study aims to build a deployable ResNet18-based classifier that reduces model complexity while recovering the accuracy lost during compression. We propose a lightweight framework that combines global channel scaling, a re-parameterized attention residual block, and teacher–student knowledge distillation. The proposed block uses multi-branch convolution and squeeze-and-excitation attention during training, then folds the linear branches into a single 3-by-3 convolution for inference. Experiments on CIFAR-100 show that the final model reduces parameters from 11.220 M to 2.841 M, retains comparable Top-1 accuracy (0.7579 vs. 0.7606), improves Top-5 accuracy (0.9340 vs. 0.9253), and reduces graphics processing unit (GPU) batch inference latency from 3.279 ms to 2.161 ms. Deployment on PYNQ-Z2 verifies the complete camera-based CPU-side inference workflow, with an average end-to-end latency of 421.467 ms/frame. The results indicate that residual topology preservation, re-parameterized feature enhancement, and distillation form a practical route for edge-oriented lightweight CNN deployment. Full article
(This article belongs to the Topic Smart Edge Devices: Design and Applications)
Show Figures

Figure 1

29 pages, 7489 KB  
Article
CGMSN: CFAR-Guided Mode-Selective Network for SAR Target Detection
by Lingjuan Yu, Xinya Xiong, Xiaochun Xie, Miaomiao Liang, Xiangchun Yu, Xuan Jiao and Wen Hong
Remote Sens. 2026, 18(12), 2040; https://doi.org/10.3390/rs18122040 - 18 Jun 2026
Viewed by 156
Abstract
Improving detection performance across diverse synthetic aperture radar (SAR) scenes remains challenging because different datasets exhibit different levels of target–background separability. To address this issue, we propose a constant false alarm rate (CFAR)-guided mode-selective network (CGMSN), which selects an appropriate feature-fusion mode according [...] Read more.
Improving detection performance across diverse synthetic aperture radar (SAR) scenes remains challenging because different datasets exhibit different levels of target–background separability. To address this issue, we propose a constant false alarm rate (CFAR)-guided mode-selective network (CGMSN), which selects an appropriate feature-fusion mode according to the CFAR target–background separation margin. Specifically, CFAR is used as an interpretable statistical tool to construct an anomaly response map. The separation margin is then calculated by comparing the average CFAR anomaly responses of annotated target regions and their surrounding contextual backgrounds. Based on this indicator, a You Only Look Once version 8 (YOLOv8)-based mode-selective detector is constructed with three key components. First, a lightweight representation-enhanced backbone that integrates ResNet18 and a dilated convolutional spatial pyramid (DCSP) module is adopted to improve contextual representation while maintaining moderate model complexity. Second, a mode-selective neck (MSN) is designed with three predefined fusion modes, where the appropriate fusion depth is selected according to the CFAR-guided target–background separation margin of each dataset. Third, a complete intersection over the union modulated head (CMH) is developed to enhance classification-regression alignment and suppress clutter-induced responses. Experiments on SAR-Aircraft-1.0, High-Resolution SAR Images Dataset (HRSID), and SAR Ship Detection Dataset (SSDD) indicate that datasets with smaller CFAR target–background separation margins benefit from deeper fusion, while datasets with larger separation margins can adopt shallower fusion. Moreover, the proposed CGMSN achieves superior performance over representative detectors, demonstrating its effectiveness on the evaluated SAR datasets with diverse scene characteristics. Full article
Show Figures

Figure 1

17 pages, 4000 KB  
Article
A Lightweight and High-Precision PCB Surface Defect Detection Method Based on YOLOv8
by Zhenling Wang, Ya Gao, Ying Xiao and Qiurui He
J. Imaging 2026, 12(6), 266; https://doi.org/10.3390/jimaging12060266 - 18 Jun 2026
Viewed by 156
Abstract
In response to the diverse types and large number of PCB surface defects, our paper proposes an improved YOLOv8-based method for PCB surface defect detection. First, a lightweight modification is performed by introducing RepGhostBottleNeck as the lightweight backbone network, which reduces the number [...] Read more.
In response to the diverse types and large number of PCB surface defects, our paper proposes an improved YOLOv8-based method for PCB surface defect detection. First, a lightweight modification is performed by introducing RepGhostBottleNeck as the lightweight backbone network, which reduces the number of parameters in the training model. It should be noted that the term “lightweight” in this paper is relative to the original YOLOv8L baseline. Compared with extremely lightweight detectors, the model in this paper places greater emphasis on the balance between accuracy and efficiency. Additionally, an attention mechanism module and a small object detection head module are added to the backbone network. Furthermore, the loss function of the network is improved. Experimental results show that the improved model achieves an average mAP@0.5 of 0.976, demonstrating high-precision detection on the constructed dataset. Full article
(This article belongs to the Special Issue AI-Driven Image and Video Understanding)
Show Figures

Figure 1

26 pages, 6707 KB  
Article
BDRNet: Background-Aware Dynamic-Scale Routing Network for UAV Remote Sensing Object Detection
by Xuelong Zheng, Faming Shao, Qing Liu, Juying Dai, Yiming Yue, Tao Zhang and Caian Chen
Remote Sens. 2026, 18(12), 1987; https://doi.org/10.3390/rs18121987 - 15 Jun 2026
Viewed by 242
Abstract
Object detection in UAV remote sensing imagery remains challenging due to severe scale variation, dense object distributions, complex background clutter, and localization ambiguity caused by extremely small objects. To address these issues, this paper proposes BDRNet, a lightweight background-aware dynamic-scale routing network for [...] Read more.
Object detection in UAV remote sensing imagery remains challenging due to severe scale variation, dense object distributions, complex background clutter, and localization ambiguity caused by extremely small objects. To address these issues, this paper proposes BDRNet, a lightweight background-aware dynamic-scale routing network for UAV remote sensing object detection. First, a background-aware feature enhancement (BAFE) module is introduced into the backbone to enhance feature representation through horizontal and vertical contextual modeling, improving target-related responses in complex aerial scenes. Second, a dynamic-scale routing pyramid (DSRP) is designed to retain the high-resolution P2 branch and adaptively integrate multi-scale features through spatially dynamic routing, alleviating the loss of fine-grained information and improving the representation of small and scale-varied objects. Third, a scale- and geometry-aware normalized Wasserstein distance (SGNW) loss is proposed by modeling bounding boxes as two-dimensional Gaussian distributions. By incorporating aspect-ratio-guided geometric weighting and scale-aware dynamic fusion, SGNW improves regression stability for small objects while preserving geometric constraints for medium and large targets. Extensive experiments on the VisDrone2019 and UAVDT datasets demonstrate that BDRNet consistently improves detection accuracy over the YOLOv10s detector while maintaining a comparable model size and computational cost. Compared with several mainstream lightweight detectors, BDRNet achieves a favorable accuracy–efficiency trade-off, demonstrating its effectiveness for UAV remote sensing object detection in complex aerial scenarios. Full article
Show Figures

Figure 1

33 pages, 22512 KB  
Article
A Simulation-Based Hybrid Quantum-Classical Channel Attention Network for Reliable Aircraft Skin Defect Recognition
by Shiqi Jiang, Hai Peng, Dingqi Zhang and Yupei Zhu
Technologies 2026, 14(6), 361; https://doi.org/10.3390/technologies14060361 - 13 Jun 2026
Viewed by 211
Abstract
Aircraft skin defect recognition is a safety-critical visual inspection task in which lightweight models must maintain high diagnostic accuracy while suppressing false alarms caused by complex surface textures, illumination variations, and weak defect patterns. This study proposes HQCA-Net, a simulation-based hybrid quantum-classical channel [...] Read more.
Aircraft skin defect recognition is a safety-critical visual inspection task in which lightweight models must maintain high diagnostic accuracy while suppressing false alarms caused by complex surface textures, illumination variations, and weak defect patterns. This study proposes HQCA-Net, a simulation-based hybrid quantum-classical channel attention network for reliable aircraft skin defect recognition. The core component, termed Residual Quantum Channel Attention (RQCA), embeds a 10-qubit variational quantum circuit into a classical ResNet-18 backbone to perform compact and structured nonlinear feature recalibration, introducing only 30 trainable quantum-gate parameters. The quantum circuit is evaluated using state-vector simulation, and this study focuses on model-level feature recalibration, reliability, and robustness within the evaluated dataset rather than implementation on physical quantum hardware. Experiments on a six-class aircraft skin defect dataset show that HQCA-Net achieves 97.93% classification accuracy and a global false positive rate of 0.49%, outperforming ResNet-18 and classical lightweight attention mechanisms including SE, ECA, and SimAM. Additional analyses using confidence calibration, Grad-CAM visualization, Gaussian noise perturbation, few-shot training, and circuit-depth ablation further indicate that the proposed RQCA module improves feature discrimination and false-alarm suppression under compact parameter constraints. These results suggest that the hybrid quantum-classical attention module can serve as a parameter-efficient nonlinear feature recalibration strategy for reliable visual defect inspection under the tested experimental conditions. Full article
(This article belongs to the Section Quantum Technologies)
Show Figures

Figure 1

29 pages, 28758 KB  
Article
Spatio-Temporal Feature Enhancement for Recognizing Strongly Correlated Sequential Actions in Aircraft Assembly
by Jiaming Shi, Xiang Huang, Guoyi Hou, Chengda Guo, Qingxue Wang and Yumin Chen
Sensors 2026, 26(12), 3781; https://doi.org/10.3390/s26123781 - 13 Jun 2026
Viewed by 392
Abstract
The positioning and clamping process in aircraft assembly exhibits pronounced long-term temporal correlations and intense human–machine interactions. Consequently, assembly quality depends heavily on operator compliance and consistency. Capturing long-term, strongly correlated features in complex industrial environments remains a significant challenge. To overcome this, [...] Read more.
The positioning and clamping process in aircraft assembly exhibits pronounced long-term temporal correlations and intense human–machine interactions. Consequently, assembly quality depends heavily on operator compliance and consistency. Capturing long-term, strongly correlated features in complex industrial environments remains a significant challenge. To overcome this, this study proposes a Long-Term Strongly Associated Action Recognition Network (LTSA-Net) tailored for aircraft assembly positioning and clamping tasks. Based on the C3D backbone, the model first incorporates the SimAM attention mechanism and BN modules to significantly enhance focus on critical spatiotemporal features. To address the challenge of capturing long-term temporal dependencies, LTSFEM is designed to extract global temporal information accurately. Furthermore, to balance structural lightweight design with real-time inference requirements, the CWSTB module is integrated to achieve substantial parameter compression. In addition, a dedicated aircraft assembly positioning and clamping dataset was constructed, and a robust training framework was established using the AdamW optimizer and Mixup data augmentation. Experimental results demonstrate that LTSA-Net achieves a recognition accuracy of 98.82% on the LTSA-Dataset, with a per-frame inference time of 42 ms, successfully meeting the dual requirements of high precision and real-time performance in industrial scenarios, and providing a practical technical solution for intelligent monitoring of aircraft assembly processes. Full article
(This article belongs to the Section Industrial Sensors)
Show Figures

Figure 1

21 pages, 4517 KB  
Article
Research on an Online Detection Method of Seed Filling Performance for a Pneumatic Suction Seed Metering Device Based on YOLOv8-MA
by Yuankun Zheng, Yulong Ding, Jizhong Wang, Hanlu Jiang, Weipeng Zhang, Hongze Guo, Shenghe Bai, Liming Zhou, Kang Niu and Lijing Liu
AgriEngineering 2026, 8(6), 240; https://doi.org/10.3390/agriengineering8060240 - 12 Jun 2026
Viewed by 207
Abstract
To address the difficulty of real-time detection of seed-filling performance in pneumatic suction seed metering devices under high-speed operation—where seed targets are tiny, prone to adhesion, and affected by motion blur—this paper proposes a lightweight online detection algorithm, YOLOv8n-MA. First, according to the [...] Read more.
To address the difficulty of real-time detection of seed-filling performance in pneumatic suction seed metering devices under high-speed operation—where seed targets are tiny, prone to adhesion, and affected by motion blur—this paper proposes a lightweight online detection algorithm, YOLOv8n-MA. First, according to the seed adsorption characteristics of the suction holes, the detection targets are divided into three categories: none, one, and two. Second, based on YOLOv8n, the backbone network is replaced with MobileNetV1 to reduce computational cost, and an ACmix attention module is integrated into the Neck to enhance feature representation for the three suction-hole states. Finally, to meet the demand for low-latency inference on resource-constrained devices, the model is deployed on an edge computing controller to achieve real-time detection. Experimental results show that, compared with the original YOLOv8n, the parameters and FLOPs of YOLOv8n-MA are reduced by 34.4% and 59.8%, respectively, while the mean average precision (mAP) is improved by 2.0% to 96.8%, achieving a superior trade-off between accuracy and efficiency over other detection models of the same category, such as YOLOv5n, YOLOv9n, and YOLOv10n. In field tests, the detection accuracy reaches 95.02% at 12 km/h and 92.65% at 15 km/h. The proposed method provides effective technical support for the intelligent monitoring and control of precision seeding under high-speed operation. Full article
Show Figures

Figure 1

30 pages, 68434 KB  
Article
A Lightweight and High-Precision Citrus Detection Model for Unstructured Orchard Environments
by Junjie Yang, Haorong Wu, Dong Lv, Wei Ma, Hao Teng and Dehua Chen
Horticulturae 2026, 12(6), 718; https://doi.org/10.3390/horticulturae12060718 - 11 Jun 2026
Cited by 1 | Viewed by 397
Abstract
This study was conducted to address the challenges of detecting citrus fruits in complex orchard environments characterized by overlap, occlusion, and variable lighting conditions. To tackle these issues, an improved detection model named YOLO-MGP was developed based on the YOLOv8n architecture. Four key [...] Read more.
This study was conducted to address the challenges of detecting citrus fruits in complex orchard environments characterized by overlap, occlusion, and variable lighting conditions. To tackle these issues, an improved detection model named YOLO-MGP was developed based on the YOLOv8n architecture. Four key enhancements were introduced to the core components of the detection framework. First, the primary backbone network was replaced with MobileNetV3, which substantially reduced computational requirements while preserving the capability for multi-scale feature extraction. Second, a C2f-GLU module was incorporated into the neck network. By leveraging Gated Linear Units, this module strengthens the feature selection and fusion processes. Third, an additional P2 detection layer was added to improve the detection of small targets. This modification was complemented by the integration of a Coordinate Attention mechanism, which refines the distribution of feature weights across spatial and channel dimensions. Finally, the CIoU loss was replaced by PIoU to enhance the accuracy of bounding box regression, particularly for occluded and overlapping targets. Experimental results demonstrate that the YOLO-MGP model achieved a precision of 94.2%, a recall of 89.7%, and a mAP50 of 95.7% on our custom citrus dataset. By substantially reducing the number of parameters while maintaining competitive detection performance, the proposed method offers a practical and lightweight solution for fruit detection in automated harvesting systems. Full article
(This article belongs to the Special Issue Emerging Technologies in Smart Agriculture)
Show Figures

Figure 1

Back to TopTop