Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (2,628)

Search Parameters:
Keywords = YOLOv7 network model

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
16 pages, 3207 KB  
Article
Apple Leaf Disease Detection Based on Improved YOLOv11 with DSSA Mechanism
by Yuanyuan Zhang, Jiya Tian and Duanyang Zhang
Plants 2026, 15(12), 1928; https://doi.org/10.3390/plants15121928 (registering DOI) - 22 Jun 2026
Abstract
Visual inspection of apple leaf diseases is inefficient and subjective, limiting large-scale orchard applications. To realize rapid and accurate disease identification, this paper proposes an improved YOLOv11 model integrated with a Dual Sparse Selection Attention (DSSA) module. By embedding the DSSA module into [...] Read more.
Visual inspection of apple leaf diseases is inefficient and subjective, limiting large-scale orchard applications. To realize rapid and accurate disease identification, this paper proposes an improved YOLOv11 model integrated with a Dual Sparse Selection Attention (DSSA) module. By embedding the DSSA module into the key layers of the YOLOv11 backbone network, the model enhances fine-grained feature extraction for small and complex lesions while suppressing background interference. A tailored training strategy with an optimized learning rate and optimizer is designed to ensure stable convergence. Experiments are conducted on a dataset consisting of 7594 images covering four categories: black rot, rust, scab, and healthy leaves. The proposed model achieves precision of 0.973, recall of 0.978, mAP50 of 0.991, and 0.949 mAP50–95, outperforming YOLOv8, YOLOv9, YOLOv10, and the vanilla YOLOv11. Furthermore, a Qt-based visualization system is developed for practical orchard deployment. This method provides a reliable solution for intelligent apple leaf disease detection and smart orchard management. Full article
Show Figures

Figure 1

31 pages, 5802 KB  
Article
Automated Aqueductal CSF Flow Analysis in Spontaneous Intracranial Hypotension: Hemodynamic Quantification and Exploratory Waveform Morphology Assessment Using Cine PC-MRI
by Yi-Jhe Huang, Wen-Hsien Chen, Hung-Chieh Chen and Da-Chuan Cheng
Diagnostics 2026, 16(12), 1939; https://doi.org/10.3390/diagnostics16121939 (registering DOI) - 22 Jun 2026
Abstract
Background/Objectives: Spontaneous intracranial hypotension (SIH) is caused by spinal cerebrospinal fluid (CSF) leakage and is typically diagnosed by clinical presentation and characteristic MRI signs; however, objective tools for monitoring physiological changes and treatment response remain limited. Cine phase-contrast MRI (PC-MRI) enables noninvasive quantification [...] Read more.
Background/Objectives: Spontaneous intracranial hypotension (SIH) is caused by spinal cerebrospinal fluid (CSF) leakage and is typically diagnosed by clinical presentation and characteristic MRI signs; however, objective tools for monitoring physiological changes and treatment response remain limited. Cine phase-contrast MRI (PC-MRI) enables noninvasive quantification of aqueductal CSF dynamics, yet reliable analysis is challenging since the cerebral aqueduct is extremely small and susceptible to low contrast, partial volume effects, and ROI-dependent measurement variability—particularly in SIH where CSF pulsatility is often reduced. Methods: We propose an end-to-end automated framework that integrates (1) a cascade localization–segmentation strategy, consisting of Tiny YOLOv4 detection followed by MultiResUNet segmentation on a YOLOv4-derived cropped ROI; (2) physiology-informed pulsatility-based segmentation (PUBS) to refine anatomical masks into functional flow ROIs; and (3) one-dimensional convolutional neural networks (1D-CNNs) to extract exploratory waveform morphology features from 32-phase cardiac-cycle velocity waveforms. The study includes 39 participants, yielding 59 cine PC-MRI examinations: 11 controls, 28 Pre-treatment SIH scans and 20 Post-treatment Recovery scans. Results: The cascade model significantly improves segmentation robustness compared with a full-image baseline, achieving higher Dice scores and markedly lower boundary errors across cohorts (e.g., Pre-treatment SIH HD95: 1.66 ± 0.74 px vs. 15.37 ± 44.98 px). PUBS refinement reduces quantification deviation from expert manual references in SIH (mean relative error: 7.4% to 5.6%) and improves diagnostic performance for multiple hemodynamic parameters (e.g., downward mean flow AUC: 0.747 to 0.792). For waveform morphology analysis, the end-to-end 1D-CNN classifier was evaluated using repeated-seed participant-level grouped LOOCV. The repeated-seed ensemble prediction showed modest out-of-sample discrimination between Normal controls and Pre-treatment SIH scans, with an AUC of 0.646, a bootstrap 95% confidence interval of 0.455–0.826, and a permutation-test p-value of 0.072. Separately, exploratory analysis of the final baseline-trained 1D-CNN latent space showed marked, apparent Normal-versus-SIH separability and an intermediate recovery distribution in PCA space, suggesting that aqueductal waveform morphology may encode SIH-related physiological information. Conclusions: These findings suggest that SIH-related information may be reflected not only in flow magnitude but also in aqueductal CSF waveform morphology. However, the modest and statistically non-significant out-of-sample performance of the end-to-end 1D-CNN classifier indicates that morphology-based AI features should currently be regarded as exploratory biomarker candidates rather than validated stand-alone diagnostic tools. Larger independent cohorts are required to confirm their reproducibility, physiological meaning, and clinical utility. Full article
Show Figures

Figure 1

22 pages, 6722 KB  
Article
MoLi-Net: A Lightweight Brightness-Aware Model for Chinese Herbal Materials Recognition with an Auxiliary Module for Impurity Detection
by Zilong Xu, Changcheng Jiang, Jianhui Ding, Weiyang Ding and Zhenping Wan
Electronics 2026, 15(12), 2731; https://doi.org/10.3390/electronics15122731 (registering DOI) - 21 Jun 2026
Abstract
Object detection in complex industrial environments is prone to being affected by insufficient dynamic weighting of local and global features, as well as illumination variations and impurities. Moreover, existing models suffer from excessive model complexity, which directly impairs computational efficiency. To more accurately [...] Read more.
Object detection in complex industrial environments is prone to being affected by insufficient dynamic weighting of local and global features, as well as illumination variations and impurities. Moreover, existing models suffer from excessive model complexity, which directly impairs computational efficiency. To more accurately distinguish Chinese herbal materials with diverse morphologies, this paper proposes the MobileAttn module. Drawing on the idea of token representation in the Transformer architecture, this module extracts contextual information through global feature compression, fuses it with tokens to generate a spatial attention map, and realizes dynamic recalibration of convolutional features. This process enhances the feature weights of key semantic regions, suppresses redundant background information, and improves feature discriminability. To address illumination interference, brightness-aware weights are combined with dual-path (channel and spatial) attention for global control, dynamically reducing the impact of illumination; this component is named LightAttn. When Chinese herbal materials contain common industrial unknown impurities (e.g., small stones and weeds), an impurity detection auxiliary module, a post-processing step independent of the main detection network, is proposed. This module refines Non-Maximum Suppression (NMS) logic to distinguish target Chinese herbal materials from interfering impurities. Subsequently, it accurately locates and marks impurities on the conveyor belt, thereby achieving effective unknown impurity detection. Experimental results demonstrate that, compared with the original YOLOv11 on the Chinese herbal materials detection task, the optimized model achieves a 1.7% improvement in the overall mean Average Precision (mAP@0.5:0.95). On a per-class basis, gains are particularly pronounced for certain challenging high-aspect-ratio Chinese herbal materials. Prunella vulgaris and orange peel achieve respective AP improvements of 5.8% and 4.1%. Meanwhile, the model parameter count is reduced by 23.1% and the computational complexity by 20.3%. The F1-Score of the impurity detection results is 86.38%, verifying the effectiveness of the impurity detection auxiliary module. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

26 pages, 5547 KB  
Article
A Lightweight Framework for Tea Shoot Detection and Plucking Point Localization Enabled by Modified YOLOv11s-Seg Model
by Yongmao Huang, Yuankai Luo, Yuanxi Mu and Haiyan Jin
Agriculture 2026, 16(12), 1357; https://doi.org/10.3390/agriculture16121357 (registering DOI) - 20 Jun 2026
Viewed by 86
Abstract
In this work, a lightweight framework enabled by the modified YOLOv11s-seg model for tea shoot detection and plucking point localization is proposed. Detecting tea shoots and localizing plucking points with higher accuracy generally require larger model size and more model parameters, making it [...] Read more.
In this work, a lightweight framework enabled by the modified YOLOv11s-seg model for tea shoot detection and plucking point localization is proposed. Detecting tea shoots and localizing plucking points with higher accuracy generally require larger model size and more model parameters, making it difficult to balance accuracy and lightweighting. To overcome this limitation, a modified lightweight YOLOv11s-seg model is developed. First, the multi-scale edge information enhancement is introduced into the conventional YOLOv11s-seg to extract edge feature better and improve the detection accuracy of tea shoots. Meanwhile, context anchor attention is utilized to modify the cross stage partial spatial attention module in a backbone network to improve the detection capability for small objects. Moreover, the detail calibration reconstruction feature pyramid network is proposed. It utilizes spatial and contextual semantic information to reconstruct and calibrate features in key regions, enhancing the capability for object fusion and recognition at various scales. Furthermore, with the modified model performing instance segmentation to acquire the contour of each tea shoot, the coordinates of the three lowest pixel points in the contour are captured to localize the plucking point based on the average coordinates. In addition, the layer-adaptive magnitude-based pruning (LAMP) method is used to lighten the model. The experimental results show that the LAMP-pruned modified YOLOv11s-seg model with a speedup ratio of 1.5 achieves a mAP@0.5 of 86.5% for tea shoot detection, exhibiting a 4.7 percentage point improvement over the conventional YOLOv11s-seg model. Moreover, it exhibits an accuracy of 81.9% for plucking point localization on the validation and test subsets with 232 images in total, and its number of parameters, model size and floating point operations (FLOPs) separately achieve reductions of 67.3%, 66.2%, and 24.9% over the conventional model as well. Therefore, the proposed LAMP-pruned modified model shows good balance between lightweighting and detection accuracy. Finally, the modified LAMP-pruned YOLOv11s-seg model is deployed on a Jetson Orin NX edge module and measured in a tea plantation, with the measured results exhibiting a detection speed of 34.1 FPS and verifying its availability in practical applications. Full article
(This article belongs to the Special Issue Advances in Precision Agriculture in Orchard)
18 pages, 6162 KB  
Article
YOLO-UTD: A Domain-Specific Detection Framework for Small Objects in UAV Traffic Surveillance
by Hailang Huang, Meng Li, Jiebao Zhang and Yitong Li
Sensors 2026, 26(12), 3931; https://doi.org/10.3390/s26123931 (registering DOI) - 20 Jun 2026
Viewed by 187
Abstract
Detecting objects in drone-captured aerial imagery is particularly formidable due to challenges such as the prevalence of numerous small targets and their dense spatial distribution. To bridge this gap, this paper introduces YOLO-UTD (YOLO-UAV Traffic Detection), a dedicated small object detector tailored for [...] Read more.
Detecting objects in drone-captured aerial imagery is particularly formidable due to challenges such as the prevalence of numerous small targets and their dense spatial distribution. To bridge this gap, this paper introduces YOLO-UTD (YOLO-UAV Traffic Detection), a dedicated small object detector tailored for drone traffic surveillance. Built upon the YOLOv8 framework, the proposed model incorporates three principal enhancements. First, a specialized small-object detection head replaces the original large-object head to increase the sensitivity to fine-grained features. Second, we introduce a shallow-augmented feature pyramid network (SFPN) into the neck module. The SFPN enriches the semantic content of high-resolution shallow features via dense multiscale interactions and CARAFE upsampling, boosting performance on small targets. Finally, a C2fA layer is integrated into the deep backbone stages to adaptively fuse spatial details and semantic context through a dual-path architecture and a cross-attention mechanism, thereby dynamically refining features critical for small objects. Extensive experiments on the VisDrone2019 dataset validate that YOLO-UTD achieves a 3.6% higher mean average precision (mAP) than YOLOv8 while preserving a low parameter footprint, with a particularly significant gain of 5.3% in vehicle detection accuracy. These findings confirm the model’s efficacy and strong potential for application in smart city drone surveillance. Full article
Show Figures

Figure 1

17 pages, 15918 KB  
Article
ADA-YOLO: An Adaptive Dynamic Aggregation Network for Small Object Detection in UAV Imagery
by Jiajun Chen, Shaochen Jiang, Yongming Li, Sulaiman Tuersunayi and Yong Liu
Sensors 2026, 26(12), 3908; https://doi.org/10.3390/s26123908 (registering DOI) - 19 Jun 2026
Viewed by 238
Abstract
Unmanned Aerial Vehicle (UAV) image object detection holds significant application value in the low-altitude economy, traffic monitoring, intelligent agriculture, and disaster rescue. However, due to the top-down perspective, UAV images typically suffer from challenges such as small target scales, dense object distribution, severe [...] Read more.
Unmanned Aerial Vehicle (UAV) image object detection holds significant application value in the low-altitude economy, traffic monitoring, intelligent agriculture, and disaster rescue. However, due to the top-down perspective, UAV images typically suffer from challenges such as small target scales, dense object distribution, severe occlusions, and complex backgrounds. These issues often limit the recall and localization accuracy of general-purpose detectors when they are directly applied to UAV small-object detection scenarios. To address these aforementioned challenges, this paper proposes an Adaptive Dynamic Aggregation YOLO network, termed ADA-YOLO. The novelty of ADA-YOLO lies in its highly efficient combinatorial design specifically tailored for UAV small object detection, while retaining the efficient backbone of YOLOv8, we systematically reconstruct the neck and detection head to improve accuracy. Specifically, a high-resolution P2 detection branch is incorporated to construct a P2–P5 multi-scale prediction structure. Furthermore, the lightweight DySample dynamic upsampling module is adopted to replace traditional upsampling methods, and an Adaptive Spatial Feature Fusion (ASFF) mechanism is introduced to alleviate semantic conflicts and noise interference during multi-scale feature fusion. This synergistic combination explicitly addresses multi-scale representation challenges and enhances small-object detection performance in complex scenes. Comparative experiments with the baseline YOLOv8n on the VisDrone2019 dataset demonstrate that ADA-YOLO achieves an improvement of 11.3% in mAP@0.5 and 8.2% in mAP@0.5:0.95. The improved model achieves these performance gains with a modest parameter increase and acceptable computational complexity. Finally, ablation experiments further validate the effectiveness of each individual module and their synergistic gains. Full article
(This article belongs to the Section Remote Sensors)
Show Figures

Figure 1

29 pages, 6688 KB  
Article
CGMSN: CFAR-Guided Mode-Selective Network for SAR Target Detection
by Lingjuan Yu, Xinya Xiong, Xiaochun Xie, Miaomiao Liang, Xiangchun Yu, Xuan Jiao and Wen Hong
Remote Sens. 2026, 18(12), 2040; https://doi.org/10.3390/rs18122040 - 18 Jun 2026
Viewed by 111
Abstract
Improving detection performance across diverse synthetic aperture radar (SAR) scenes remains challenging because different datasets exhibit different levels of target–background separability. To address this issue, we propose a constant false alarm rate (CFAR)-guided mode-selective network (CGMSN), which selects an appropriate feature-fusion mode according [...] Read more.
Improving detection performance across diverse synthetic aperture radar (SAR) scenes remains challenging because different datasets exhibit different levels of target–background separability. To address this issue, we propose a constant false alarm rate (CFAR)-guided mode-selective network (CGMSN), which selects an appropriate feature-fusion mode according to the CFAR target–background separation margin. Specifically, CFAR is used as an interpretable statistical tool to construct an anomaly response map. The separation margin is then calculated by comparing the average CFAR anomaly responses of annotated target regions and their surrounding contextual backgrounds. Based on this indicator, a You Only Look Once version 8 (YOLOv8)-based mode-selective detector is constructed with three key components. First, a lightweight representation-enhanced backbone that integrates ResNet18 and a dilated convolutional spatial pyramid (DCSP) module is adopted to improve contextual representation while maintaining moderate model complexity. Second, a mode-selective neck (MSN) is designed with three predefined fusion modes, where the appropriate fusion depth is selected according to the CFAR-guided target–background separation margin of each dataset. Third, a complete intersection over the union modulated head (CMH) is developed to enhance classification-regression alignment and suppress clutter-induced responses. Experiments on SAR-Aircraft-1.0, High-Resolution SAR Images Dataset (HRSID), and SAR Ship Detection Dataset (SSDD) indicate that datasets with smaller CFAR target–background separation margins benefit from deeper fusion, while datasets with larger separation margins can adopt shallower fusion. Moreover, the proposed CGMSN achieves superior performance over representative detectors, demonstrating its effectiveness on the evaluated SAR datasets with diverse scene characteristics. Full article
26 pages, 3882 KB  
Article
Remote Sensing Small Object Detection Network Based on Wavelet-Convolution and Fine-Grained Preservation
by Hangyu Li and Tiecheng Song
Information 2026, 17(6), 609; https://doi.org/10.3390/info17060609 (registering DOI) - 18 Jun 2026
Viewed by 155
Abstract
Small object detection in remote sensing imagery is a fundamental task for visual information extraction, yet it remains challenging due to extremely small target scales, complex backgrounds, and the loss of discriminative feature information caused by repeated downsampling. To address these issues, this [...] Read more.
Small object detection in remote sensing imagery is a fundamental task for visual information extraction, yet it remains challenging due to extremely small target scales, complex backgrounds, and the loss of discriminative feature information caused by repeated downsampling. To address these issues, this paper proposes a Wavelet-Convolution and Fine-Grained Preservation Network (WCFPNet) based on YOLOv8n. Specifically, a Wavelet-Convolution Module (WCM) is introduced into the backbone to decompose feature maps into low- and high-frequency sub-bands, thereby enhancing structural feature modeling and preserving subtle target details. To compensate for the weakened fine-grained information after repeated downsampling, an Enhanced Spatial Pyramid Pooling-Fast (ESPPF) module is embedded at the end of the backbone to strengthen multi-scale contextual aggregation. In addition, an Enhanced Feature Pyramid Network (EFPN) is designed in the neck to facilitate the propagation of shallow and intermediate fine-grained features to high-level semantic features through cross-level fusion and the Convolutional Block Attention Module (CBAM). Experiments on the NWPU VHR-10 dataset show that WCFPNet achieves 0.879 mAP@0.5 and 0.515 mAP@0.5:0.95, outperforming YOLOv8n by 1.7 and 2.5 percentage points, respectively. Moreover, the proposed WCFPNet achieves a competitive performance compared with several representative detectors while maintaining moderate model complexity. These results demonstrate the effectiveness of WCFPNet in challenging remote sensing scenes characterized by complex backgrounds, dense object distributions, and weak textures. Full article
Show Figures

Figure 1

20 pages, 6258 KB  
Article
A Lightweight Tea Bud Detector via Cascaded Gated Modulation and Multi-Scale Feature Enhancement
by Zewei Mi and Minming Gu
AI 2026, 7(6), 227; https://doi.org/10.3390/ai7060227 - 18 Jun 2026
Viewed by 167
Abstract
Accurate detection of tea buds is a key technology for enabling automated tea harvesting. However, in natural environments, tea buds present challenges such as scale variation, dense distribution, and high similarity to the background, making it difficult for traditional methods to balance accuracy [...] Read more.
Accurate detection of tea buds is a key technology for enabling automated tea harvesting. However, in natural environments, tea buds present challenges such as scale variation, dense distribution, and high similarity to the background, making it difficult for traditional methods to balance accuracy and efficiency. To address these issues, this paper proposes a lightweight detection framework, PCM-YOLO. The model introduces a cascaded gated feature modulation network into the YOLOv11 architecture, combining feedforward structures and gating mechanisms to selectively emphasize informative features, thereby improving tea bud detection performance. In addition, a feature-enhanced downsampling module is proposed, which employs a stepwise pooling-based feature enhancement mechanism to progressively expand the receptive field while preserving feature resolution, effectively incorporating multi-scale contextual information. Finally, a multi-scale feature enhancement module is designed to reduce the computational complexity of the model while maintaining detection performance as much as possible. Experimental results on public datasets demonstrate notable performance improvements over YOLOv11-N: Precision increases from 86.7% to 90.6% (an absolute increase of 3.9 percentage points), mAP50-95 increases by 1.6%, and the number of parameters is reduced by 20.6%. These results indicate that PCM-YOLO achieves a substantial reduction in model complexity while effectively improving detection accuracy, providing a feasible technical solution for deploying high-precision, real-time tea bud detection systems at the edge in tea plantation environments. Full article
(This article belongs to the Section AI Systems: Theory and Applications)
Show Figures

Figure 1

17 pages, 4000 KB  
Article
A Lightweight and High-Precision PCB Surface Defect Detection Method Based on YOLOv8
by Zhenling Wang, Ya Gao, Ying Xiao and Qiurui He
J. Imaging 2026, 12(6), 266; https://doi.org/10.3390/jimaging12060266 - 18 Jun 2026
Viewed by 139
Abstract
In response to the diverse types and large number of PCB surface defects, our paper proposes an improved YOLOv8-based method for PCB surface defect detection. First, a lightweight modification is performed by introducing RepGhostBottleNeck as the lightweight backbone network, which reduces the number [...] Read more.
In response to the diverse types and large number of PCB surface defects, our paper proposes an improved YOLOv8-based method for PCB surface defect detection. First, a lightweight modification is performed by introducing RepGhostBottleNeck as the lightweight backbone network, which reduces the number of parameters in the training model. It should be noted that the term “lightweight” in this paper is relative to the original YOLOv8L baseline. Compared with extremely lightweight detectors, the model in this paper places greater emphasis on the balance between accuracy and efficiency. Additionally, an attention mechanism module and a small object detection head module are added to the backbone network. Furthermore, the loss function of the network is improved. Experimental results show that the improved model achieves an average mAP@0.5 of 0.976, demonstrating high-precision detection on the constructed dataset. Full article
(This article belongs to the Special Issue AI-Driven Image and Video Understanding)
Show Figures

Figure 1

25 pages, 19355 KB  
Article
REB-Tea: An Intelligent Detection Model for Tea Buds with Clarity and Multi-Scale Feature Enhancement
by Zhuoxun Wu, Jun Lyu, Jingfan Pan, Junyi Luo and Lin Wang
Agriculture 2026, 16(12), 1340; https://doi.org/10.3390/agriculture16121340 - 17 Jun 2026
Viewed by 321
Abstract
Tea bud detection is a fundamental prerequisite for accurate tea yield estimation and intelligent mechanical harvesting. However, existing detection methods face several critical challenges, including ineffective extraction of multi-scale features, weak feature saliency for small tea bud targets, and the prevalent imaging issue [...] Read more.
Tea bud detection is a fundamental prerequisite for accurate tea yield estimation and intelligent mechanical harvesting. However, existing detection methods face several critical challenges, including ineffective extraction of multi-scale features, weak feature saliency for small tea bud targets, and the prevalent imaging issue in which the central regions of tea images are in focus while peripheral areas suffer from defocus blur. These factors collectively result in a high rate of missed detections, severely limiting detection accuracy and subsequent application performance. To overcome these technical bottlenecks, this paper proposes a novel tea bud detection framework, termed REB-Tea, which integrates image clarity optimization with multi-scale feature enhancement. First, the Restormer image restoration network is employed to improve overall image clarity and enhance the discriminative representation of tea bud features. Subsequently, a bidirectional feature pyramid network (BiFPN) structure and an efficient multi-scale attention (EMA) mechanism are incorporated into the neck of the YOLOv5 model to strengthen multi-scale feature fusion and guide the network to focus on fine-grained tea bud features across different scales, thereby improving detection performance for small and densely distributed targets. Experimental results based on 10-fold cross-validation demonstrate that the proposed REB-Tea model achieves an average mAP50 of 95.5% on the Longjing 43 tea test set, representing a 9.9 percentage point improvement over the baseline YOLOv5 model, and Welch’s independent two-sample t-test verifies that this accuracy increment is highly statistically significant. Moreover, the model exhibits reliable detection performance across different tea varieties, including Cuifeng and Fuding White Tea. Specifically, the mAP50 reaches 88.3% on Cuifeng, which shares similar appearance characteristics with Longjing, and 78.1% on Fuding White Tea, which has noticeably different appearance characteristics from Longjing. These results confirm the effectiveness of the REB-Tea framework in addressing challenges such as out-of-focus blurring, weak feature saliency, and multi-scale feature extraction. Overall, the proposed approach significantly enhances tea bud detection accuracy in natural environments and provides robust technical support for intelligent tea harvesting applications. Full article
Show Figures

Figure 1

15 pages, 32174 KB  
Article
YOLO-FSEP: An Improved YOLOv8n Algorithm for Sugar Orange Detection in Orchards
by Tianfa Deng, Jinchao Sun, Qingjuan Zhao and Faguo Huang
Sensors 2026, 26(12), 3848; https://doi.org/10.3390/s26123848 - 17 Jun 2026
Viewed by 101
Abstract
To address the challenges of detecting sugar orange fruits in complex natural orchard environments—where fruits are frequently occluded by leaves and branches and may be mutually occluded due to dense growth, leading to missed detections, false positives, and low detection confidence—we propose an [...] Read more.
To address the challenges of detecting sugar orange fruits in complex natural orchard environments—where fruits are frequently occluded by leaves and branches and may be mutually occluded due to dense growth, leading to missed detections, false positives, and low detection confidence—we propose an improved algorithm based on YOLOv8n, named YOLO-FSEP. A Spatial-Channel Synergistic Attention (SCSA) module is introduced into the main network to enhance feature extraction capabilities; the IoU loss function is replaced with Focal_SIOU to improve the detection accuracy for difficult samples; and an SE attention mechanism is embedded in the detection head, with the addition of a P6 high-resolution detection layer to optimize multi-scale object performance. Experimental results on a self-built sugar orange dataset show that, compared to the baseline YOLOv8n, the improved model achieves a 0.9% increase in accuracy, a 1.3% increase in recall, and a 3.2% increase in mAP50-95, while maintaining an inference speed of 62.6 FPS. To evaluate the model under dynamic conditions, we performed a 200-frame continuous test of the 3D localization pipeline on a laptop with a RealSense D435i camera. The average YOLO inference time was 49.90 ms, post-processing (depth extraction and 3D coordinate conversion) took 0.24 ms, and the total processing time was 50.15 ms. Given that the typical response time for a robotic arm’s single positioning operation is 100–200 ms, this real-time performance meets the dynamic localization requirements of sugar orange harvesting. Full article
(This article belongs to the Special Issue Smart Sensors in Precision Agriculture)
Show Figures

Figure 1

21 pages, 695 KB  
Review
A Structured Review of Deep Learning Approaches and Image-Preprocessing Techniques for Automated Contact Allergy Patch Test Interpretation
by Dominyka Stragyte, Gvidas Mikalauskas, Katrina Gaidulevic, Renata Paukstaitiene, Kestutis Stasaitis, Vidas Raudonis and Skaidra Valiukeviciene
Med. Sci. 2026, 14(2), 322; https://doi.org/10.3390/medsci14020322 - 15 Jun 2026
Viewed by 94
Abstract
Background: Allergic contact dermatitis (ACD) is a common inflammatory skin disease and patch testing (PT) remains the gold standard for its diagnosis; however, PT interpretation is time-consuming and prone to inter-observer variability. Growing advances in digital imaging and artificial intelligence (AI) have [...] Read more.
Background: Allergic contact dermatitis (ACD) is a common inflammatory skin disease and patch testing (PT) remains the gold standard for its diagnosis; however, PT interpretation is time-consuming and prone to inter-observer variability. Growing advances in digital imaging and artificial intelligence (AI) have encouraged the development of automated PT evaluation systems. This review aimed to summarize the use of deep learning networks (DNNs) and image-preprocessing techniques for PT classification. Methods: A literature review was conducted to identify original research published between 2020 and 2025 that applied deep learning algorithms to PT image analysis. Included studies were assessed with respect to model architecture, dataset characteristics, preprocessing strategies, and diagnostic performance. Results: Six original studies employing deep learning for PT image classification met the inclusion criteria. They employed a range of architectures, including YOLOv5x, EfficientNetB0, Xception, and custom CNN models. Reported diagnostic performance varied, with accuracy values ranging from 90% to 99.5%, F1-scores from 0.37 to 0.98, and AUROC values up to 0.94. Despite promising results, models remain unreliable for ICDRG grading, especially for severe reactions, and methodological variability in dataset composition, imaging conditions, preprocessing pipelines, and classification tasks limits comparability across studies. Conclusions: Deep learning shows promise for automated PT interpretation, but further standardized and multicenter studies with detailed preprocessing protocols and comprehensive ICDRG grading are required for clinical implementation. Full article
Show Figures

Figure 1

26 pages, 6707 KB  
Article
BDRNet: Background-Aware Dynamic-Scale Routing Network for UAV Remote Sensing Object Detection
by Xuelong Zheng, Faming Shao, Qing Liu, Juying Dai, Yiming Yue, Tao Zhang and Caian Chen
Remote Sens. 2026, 18(12), 1987; https://doi.org/10.3390/rs18121987 - 15 Jun 2026
Viewed by 228
Abstract
Object detection in UAV remote sensing imagery remains challenging due to severe scale variation, dense object distributions, complex background clutter, and localization ambiguity caused by extremely small objects. To address these issues, this paper proposes BDRNet, a lightweight background-aware dynamic-scale routing network for [...] Read more.
Object detection in UAV remote sensing imagery remains challenging due to severe scale variation, dense object distributions, complex background clutter, and localization ambiguity caused by extremely small objects. To address these issues, this paper proposes BDRNet, a lightweight background-aware dynamic-scale routing network for UAV remote sensing object detection. First, a background-aware feature enhancement (BAFE) module is introduced into the backbone to enhance feature representation through horizontal and vertical contextual modeling, improving target-related responses in complex aerial scenes. Second, a dynamic-scale routing pyramid (DSRP) is designed to retain the high-resolution P2 branch and adaptively integrate multi-scale features through spatially dynamic routing, alleviating the loss of fine-grained information and improving the representation of small and scale-varied objects. Third, a scale- and geometry-aware normalized Wasserstein distance (SGNW) loss is proposed by modeling bounding boxes as two-dimensional Gaussian distributions. By incorporating aspect-ratio-guided geometric weighting and scale-aware dynamic fusion, SGNW improves regression stability for small objects while preserving geometric constraints for medium and large targets. Extensive experiments on the VisDrone2019 and UAVDT datasets demonstrate that BDRNet consistently improves detection accuracy over the YOLOv10s detector while maintaining a comparable model size and computational cost. Compared with several mainstream lightweight detectors, BDRNet achieves a favorable accuracy–efficiency trade-off, demonstrating its effectiveness for UAV remote sensing object detection in complex aerial scenarios. Full article
Show Figures

Figure 1

14 pages, 3727 KB  
Article
Research on Aircraft Fire Detection Method Based on IATF-YOLO
by Wei Zhang, Kai Wang and Xiaosong Song
Fire 2026, 9(6), 255; https://doi.org/10.3390/fire9060255 - 15 Jun 2026
Viewed by 312
Abstract
Aircraft cargo compartment fires constitute a significant type of aviation fire, posing a grave threat to aviation safety. To guard against and respond to such fires, existing aircraft cargo compartments are equipped with smoke detection fire detectors, which rely on perceiving changes in [...] Read more.
Aircraft cargo compartment fires constitute a significant type of aviation fire, posing a grave threat to aviation safety. To guard against and respond to such fires, existing aircraft cargo compartments are equipped with smoke detection fire detectors, which rely on perceiving changes in smoke transmittance to determine the onset of a fire. However, these detectors offer relatively low recognition accuracy and cannot provide a direct visual representation of the fire. In this work, we introduce a fire recognition method built on image sensors and a deep learning model. In light of the irregular shapes of flames and smoke, an improved interactive triplet attention mechanism (ITAM) is integrated into the You Only Look Once version 5 (YOLOv5) model, enhancing the model’s recognition accuracy. Furthermore, the original Neck structure is replaced with an Asymptotic Feature Pyramid Network (AFPN), improving the model’s ability to recognize small targets, which is particularly useful for detecting flames and smoke early in a fire. This paper further improves the model’s recognition accuracy by introducing the Focaler-IoU loss function, which balances the feature learning of hard and easy samples. Therefore, the network model in this paper is named IATF-YOLO. Ablation experiments demonstrate that our algorithm improves accuracy by 2%, while comparative experiments with several mainstream baseline models show that our algorithm achieves a 0.7% accuracy improvement, with a final peak accuracy of 93.6%. Full article
(This article belongs to the Special Issue Relevance and Applicability of AI for Fire Engineering)
Show Figures

Figure 1

Back to TopTop