Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (166)

Search Parameters:
Keywords = computational ghost imaging

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 6871 KB  
Article
GSC-YOLO: A Pedestrian Detection Method for Low-Light Security Surveillance Scenarios
by Wei Qing, Fan Li, Shuang Li and Pengfei Yin
Sensors 2026, 26(10), 2987; https://doi.org/10.3390/s26102987 - 9 May 2026
Viewed by 564
Abstract
Pedestrian detection in nighttime security surveillance and other low-light visual sensing tasks is an important foundation for intelligent perception in complex environments. Under low-light conditions, visible-light images often suffer from missing texture details, intensified noise, and reduced contrast, which can easily lead to [...] Read more.
Pedestrian detection in nighttime security surveillance and other low-light visual sensing tasks is an important foundation for intelligent perception in complex environments. Under low-light conditions, visible-light images often suffer from missing texture details, intensified noise, and reduced contrast, which can easily lead to insufficient target representation, unstable cross-scale feature fusion, and an increased risk of missed detections. Although multimodal schemes, such as RGB–infrared approaches, can improve detection performance by exploiting modal complementarity, they involve relatively high hardware costs, cross-modal calibration complexity, and system integration overhead, which impose deployment limitations in lightweight or cost-sensitive scenarios. Therefore, developing an efficient pedestrian detection method for low-light monocular RGB scenarios is of clear practical value. This study focuses on low-light monocular RGB pedestrian detection and proposes an application-oriented structurally optimized model, termed GSC-YOLO, built upon YOLOv13. First, GhostNetV3 is introduced as the backbone to enhance multi-scale feature representation under weak-texture conditions. Second, a Semantic–Spatial Alignment (SSA) module is designed to improve information compensation and suppress noise during the feature fusion stage. Finally, C2f_Faster is incorporated into the high-level semantic branch to optimize information flow and reduce redundant computation. On the RGB subsets of the two public datasets, LLVIP and KAIST, GSC-YOLO achieves mAP@0.5:0.95 values of 57.70% and 66.61%, respectively, and Recall values of 89.93% and 90.49%, respectively, consistently outperforming the YOLOv13 baseline. The results demonstrate that, under the experimental settings adopted in this study, the proposed method effectively improves pedestrian perception performance in low-light RGB scenes while maintaining favorable real-time inference capability, and may provide a useful reference for front-end vision sensing research in low-altitude intelligent networks. Full article
Show Figures

Figure 1

27 pages, 196460 KB  
Article
LargeStitch: Efficient Seamless Stitching of Large-Size Aerial Images via Deep Matching and Seam-Band Fusion
by Jianglei Zhou, Zhaoyu Wei, Yisen Zhong and Xianqiang He
Remote Sens. 2026, 18(10), 1481; https://doi.org/10.3390/rs18101481 - 9 May 2026
Viewed by 316
Abstract
High-resolution panoramas generated by UAV image stitching are indispensable image resources for remote sensing applications. However, most existing stitching methods are designed for small-size images, making it difficult to process large-size images efficiently, leading to problems such as image feature misalignment and low [...] Read more.
High-resolution panoramas generated by UAV image stitching are indispensable image resources for remote sensing applications. However, most existing stitching methods are designed for small-size images, making it difficult to process large-size images efficiently, leading to problems such as image feature misalignment and low generation efficiency. This paper presents LargeStitch, a novel batch stitching method for large-size UAV images. The method introduces advanced image matching and alignment strategies through deep learning techniques to achieve efficient extraction and accurate alignment of dense features. To further optimize the stitching effect, this paper also proposes a seamless fusion method based on Seam-band, which effectively solves the problem of ghosting and misalignment in the overlapping region of large-size images. In addition, we designed a mask-based pre-stitching image filtering strategy, which optimizes the selection of candidate images to reduce content redundancy, thereby effectively avoiding unnecessary computational overhead and time consumption. The experimental results show that LargeStitch is not only capable of realizing fast stitching of high-precision and large-size aerial images but also significantly outperforms existing methods in terms of stitching quality and processing efficiency, making it a practical solution for realizing high-efficiency and seamless aerial image stitching. Full article
Show Figures

Figure 1

36 pages, 9864 KB  
Article
Orchard-YOLO: A Robust Deep Learning Framework for Fruit Detection Complex Optical and Environmental Degradation
by Yichen Wang, Hongjun Tian, Yuhan Zhou, Yang Xiong, Yichen Li, Manlin Wang, Yijie Yin, Xiaoyin Guo, Jiani Wu, Jiesen Zhang, Ying Tang and Shuai Huang
Photonics 2026, 13(5), 429; https://doi.org/10.3390/photonics13050429 - 27 Apr 2026
Viewed by 585
Abstract
Accurate target perception in unstructured outdoor environments remains a fundamental challenge in computational imaging and machine vision, primarily due to severe optical degradation caused by variable illumination, specular highlights, and dense foliage occlusion. Existing optical sensing systems often struggle to maintain robustness under [...] Read more.
Accurate target perception in unstructured outdoor environments remains a fundamental challenge in computational imaging and machine vision, primarily due to severe optical degradation caused by variable illumination, specular highlights, and dense foliage occlusion. Existing optical sensing systems often struggle to maintain robustness under these physical constraints, especially when deployed on edge devices with strict computational limits. To address these challenges, this paper proposes Orchard-YOLO, a lightweight, computationally efficient object detection network designed to maintain robustness against environmental and optical noise in complex orchard environments. Unlike generic architectures, Orchard-YOLO introduces three architectural enhancements for robust detection: (1) a High-Resolution P2 Detection Head to preserve high-frequency optical details and fine-grained texture cues often lost during digital downsampling; (2) Coordinate Attention (CA) mechanisms integrated into the feature fusion pathway to filter out background optical interference and enhance spatial discrimination for heavily occluded targets; and (3) a Ghost-convolution-based backbone to optimize the inference pipeline for real-time edge processing. Evaluated on a comprehensive multi-fruit dataset under simulated optical stress (including ±50% illumination variation and up to 70% occlusion), Orchard-YOLO achieves 94.8% mAP@0.5. It shows improved robustness under illumination variation and occlusion compared to baseline models, while achieving up to 25 FPS on an NVIDIA Jetson Nano edge device. These results suggest that Orchard-YOLO offers a detection framework suitable for resource-constrained orchard perception. Full article
(This article belongs to the Special Issue Computational Imaging: Photonics and Optical Applications)
Show Figures

Figure 1

21 pages, 28338 KB  
Article
An Enhanced YOLOv8n-Based Approach for Pig Behavior Recognition
by Jianjun Guo, Yudian Xu, Lijun Lin, Beibei Zhang, Piao Zhou, Shangwen Luo, Yuhan Zhuo, Jingyu Ji, Zhijie Luo and Guangming Cheng
Computers 2026, 15(4), 230; https://doi.org/10.3390/computers15040230 - 8 Apr 2026
Viewed by 495
Abstract
Pig behavior statistics can reflect their health status. Conventional approaches depend on manual observation to derive behavioral information from video recordings, a process that demands substantial time and human effort. To overcome these limitations in indoor intensive farming environments, this study introduces an [...] Read more.
Pig behavior statistics can reflect their health status. Conventional approaches depend on manual observation to derive behavioral information from video recordings, a process that demands substantial time and human effort. To overcome these limitations in indoor intensive farming environments, this study introduces an effective approach for recognizing pig behaviors, employing an enhanced YOLOv8n architecture. The approach utilizes advanced object detection algorithms to automatically identify pig behaviors, including stand, lie, eat, fight, and tail-bite, from overhead video footage of the enclosure. First, images of daily pig behaviors are collected using cameras to build a pig behavior dataset. To boost detection accuracy, the SE attention mechanism is embedded within the feature extraction backbone of the YOLOv8n network to enhance its representational capacity, strengthening the model’s capacity to grasp overarching contextual information and improve the expressiveness of extracted features. The GIoU loss function is employed during training to reduce computational cost and accelerate model convergence. Moreover, integrating Ghost convolution into the backbone significantly reduces both computational complexity and the total number of parameters. The experimental findings reveal that the optimized YOLOv8n model contains just 1.71 million parameters, marking a 42.93% reduction relative to the baseline model. Its floating-point operations total 5.0 billion, indicating a 38.27% decrease, while the mean average precision (mAP@50) reaches 96.8%, surpassing the original by 2.6 percentage points. Compared with other widely used YOLO-based object detection frameworks, the proposed approach achieves notably higher accuracy while requiring significantly lower computational resources and model complexity. Full article
(This article belongs to the Section AI-Driven Innovations)
Show Figures

Graphical abstract

36 pages, 15892 KB  
Article
UAV Real-Time Image Recognition Using Lightweight YOLOv11
by Xin-Yu Zhang and Jih-Gau Juang
Appl. Sci. 2026, 16(7), 3468; https://doi.org/10.3390/app16073468 - 2 Apr 2026
Viewed by 499
Abstract
Unmanned aerial vehicles (UAVs) for environmental monitoring typically rely on embedded platforms with limited computational capacity, which constrains the deployment of highly accurate yet computationally demanding object-detection models. To address this challenge and enable real-time image recognition under resource limitations, this study develops [...] Read more.
Unmanned aerial vehicles (UAVs) for environmental monitoring typically rely on embedded platforms with limited computational capacity, which constrains the deployment of highly accurate yet computationally demanding object-detection models. To address this challenge and enable real-time image recognition under resource limitations, this study develops three lightweight neural network architectures based on the YOLOv11 framework. The proposed designs aim to significantly reduce computational complexity and parameter count while maintaining stable and reliable detection performance, thereby improving inference efficiency and deployment flexibility on UAV platforms. YOLOv11-M is selected as the baseline model due to its favorable trade-off between detection accuracy and inference speed. Three lightweight strategies are then proposed and evaluated. First, a Ghost Convolution approach replaces portions of standard convolution with low-cost linear operations, effectively reducing both parameter size and computational overhead during feature extraction. Second, MobileNetV4 is employed as the backbone network; its optimized bottleneck structures and attention mechanisms enable substantial model compression without compromising recognition performance. Third, a MobileOne architecture with reparameterization is introduced, in which multi-branch structures enhance feature learning during training and are subsequently merged into a single-path network for inference, thereby significantly reducing computational cost and improving practical deployability. Full article
(This article belongs to the Section Electrical, Electronics and Communications Engineering)
Show Figures

Figure 1

27 pages, 6255 KB  
Article
Lightweight Safety Helmet Wearing Detection Algorithm Based on GSA-YOLO
by Haodong Wang, Qiang Zhou, Zhiyuan Hao, Wentao Xiao and Luqing Yan
Sensors 2026, 26(7), 2110; https://doi.org/10.3390/s26072110 - 28 Mar 2026
Viewed by 662
Abstract
Electric power station confined spaces are high-risk and complex environments characterized by significant illumination variations. Whether safety helmets are properly worn directly affects the operational safety of workers in confined spaces. However, helmet detection in such environments faces several challenges, including drastic lighting [...] Read more.
Electric power station confined spaces are high-risk and complex environments characterized by significant illumination variations. Whether safety helmets are properly worn directly affects the operational safety of workers in confined spaces. However, helmet detection in such environments faces several challenges, including drastic lighting changes and difficulties in small-object detection. Moreover, existing object detection models typically contain a large number of parameters, making real-time helmet detection difficult to deploy on field devices with limited computational resources. To address these issues, this paper proposes a lightweight safety helmet wearing detection algorithm named GSA-YOLO. To mitigate the effects of severe illumination variation and detail loss in confined spaces, a GCA-C2f module integrating GhostConv and the CBAM attention mechanism is embedded into the backbone network. This design reduces the number of parameters and computational cost while enhancing the model’s feature extraction capability under challenging lighting conditions. To improve detection performance for occluded targets, an improved efficient channel attention (I-ECA) mechanism is introduced into the neck structure, which suppresses irrelevant channel features and enhances occluded object detection accuracy. Furthermore, to alleviate missed detections of small objects and inaccurate localization under low-light conditions, a P2 detection branch is added to the head, and the WIoU loss function is adopted to dynamically adjust the weights of hard and easy samples, thereby improving small-object detection accuracy and localization robustness. A confined space helmet detection dataset containing 5000 images was constructed through on-site data collection for model training and validation. Experimental results demonstrate that the proposed GSA-YOLO achieves an mAP@0.5 of 91.2% on the self-built dataset with only 2.3 M parameters, outperforming the baseline model by 2.9% while reducing the parameter count by 23.6%. The experimental results verify that the proposed algorithm is suitable for environments with significant illumination variation and small-object detection challenges. It provides a lightweight and efficient solution for on-site helmet detection in confined space scenarios, thereby contributing to the reduction in industrial safety accidents. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

25 pages, 233246 KB  
Article
Seamlessly Natural: Image Stitching with Natural Appearance Preservation
by Gaetane Lorna N. Tchana, Damaris Belle M. Fotso, Antonio Hendricks and Christophe Bobda
Technologies 2026, 14(3), 186; https://doi.org/10.3390/technologies14030186 - 19 Mar 2026
Viewed by 473
Abstract
Conventional image stitching pipelines predominantly rely on homographic alignment, whose planar assumption often breaks down in dual-camera configurations capturing non-planar scenes, producing geometric warping, bulging, and structural distortion. To address these limitations, this paper presents SENA (Seamlessly Natural), a geometry-driven image stitching approach [...] Read more.
Conventional image stitching pipelines predominantly rely on homographic alignment, whose planar assumption often breaks down in dual-camera configurations capturing non-planar scenes, producing geometric warping, bulging, and structural distortion. To address these limitations, this paper presents SENA (Seamlessly Natural), a geometry-driven image stitching approach with three complementary contributions. First, we propose a hierarchical affine-based warping strategy that combines global affine initialization, local affine refinement, and a smooth free-form deformation field regulated by seamguard adaptive smoothing. This multi-scale design preserves local shape, parallelism, and aspect ratios, thereby reducing the hallucinated distortions commonly associated with homography-based models. Second, SENA incorporates a geometry-driven adequate zone detection mechanism that identifies regions with reduced parallax directly from the disparity consistency of correspondences filtered by RANSAC, without relying on semantic segmentation or depth estimation. Third, within this zone, anchor-based seamline cutting and segmentation enforce one-to-one geometric correspondence between image pairs, reducing ghosting and smearing artifacts. Extensive experiments demonstrate that SENA achieves 26.2 dB PSNR and 0.84 SSIM, obtains the lowest BRISQUE score (33.4) among compared methods, and reduces runtime by 79% on average across resolutions. These results confirm improved structural fidelity and computational efficiency while maintaining competitive alignment accuracy. Full article
(This article belongs to the Special Issue Image Analysis and Processing)
Show Figures

Figure 1

15 pages, 3994 KB  
Article
Parameter-Reduced YOLOv8n with GhostConv and C3Ghost for Automated Blood Cell Detection
by Jing Yang, Bo Yang, Zhenqing Li, Yoshinori Yamaguchi and Wen Xiao
Bioengineering 2026, 13(3), 321; https://doi.org/10.3390/bioengineering13030321 - 11 Mar 2026
Viewed by 721
Abstract
Accurate detection of blood cells in microscopic images plays a crucial role in automated hematological analysis and clinical diagnosis. Herein, we proposed an improved YOLOv8n-based model for efficient and precise detection of red blood cells (RBCs), white blood cells (WBCs), and platelets in [...] Read more.
Accurate detection of blood cells in microscopic images plays a crucial role in automated hematological analysis and clinical diagnosis. Herein, we proposed an improved YOLOv8n-based model for efficient and precise detection of red blood cells (RBCs), white blood cells (WBCs), and platelets in the BCCD dataset. The baseline YOLOv8n framework was enhanced by integrating GhostConv and C3Ghost modules to reduce model complexity while maintaining high detection performance. A series of ablation experiments were conducted to evaluate the individual and combined effects of these modules on model accuracy and computational efficiency. Experimental results demonstrated that the baseline model achieved an mAP@0.5 of 0.9043 with 3.01 M parameters. After incorporating GhostConv, the model maintained comparable accuracy (mAP@0.5 = 0.9040) with a reduction in parameters to 2.73 M. The C3Ghost integration further decreased parameters to 1.99 M with an mAP@0.5 of 0.8973. The combined model achieved an optimal balance between accuracy (mAP@0.5 = 0.9001) and compactness (1.71 M parameters). Results indicate that the improved YOLOv8n can effectively enhance detection efficiency without sacrificing precision. The proposed lightweight detection framework provides a promising solution for real-time blood cell analysis. Its high accuracy, reduced computational load, and strong generalization ability make it suitable for integration into automated laboratory systems, facilitating rapid and intelligent medical diagnostics in hematology and related biomedical applications. Full article
Show Figures

Figure 1

19 pages, 35815 KB  
Article
YOLOv10-TWD: An Improved YOLOv10n for Terracotta Warrior Recognition
by Yalin Li, Liang Wang, Xinyuan Zhang, Sijie Dong and Xinjuan Zhu
Appl. Sci. 2026, 16(5), 2616; https://doi.org/10.3390/app16052616 - 9 Mar 2026
Viewed by 374
Abstract
To address challenges such as complex backgrounds, partial occlusion, and high similarity of details in Terracotta Warrior image recognition, this paper proposes a lightweight detection method, YOLOv10-TWD, based on an improved YOLOv10n. Specifically, a lightweight Convolution-Attention Fusion Module (CAFMAttention) and a dual-branch feature [...] Read more.
To address challenges such as complex backgrounds, partial occlusion, and high similarity of details in Terracotta Warrior image recognition, this paper proposes a lightweight detection method, YOLOv10-TWD, based on an improved YOLOv10n. Specifically, a lightweight Convolution-Attention Fusion Module (CAFMAttention) and a dual-branch feature extraction structure (DualConv) are integrated into the detection head to enhance the model’s focus on fine-grained features and its discriminative robustness under partial damage conditions. In the Neck network, Ghost-Shuffle Convolution (GSConv) is introduced to compress the computational cost of multi-scale feature fusion while strengthening context-aware capabilities. Experimental results on a self-built Terracotta Warrior dataset demonstrate that the proposed method achieves a 7.63% improvement in mAP@0.5 compared to the baseline YOLOv10n, while simultaneously achieving a 6.66% increase in inference speed. The model achieves high precision alongside significant optimization in inference efficiency, making it well-suited for rapid recognition tasks in cultural heritage and museum scenarios. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

18 pages, 5195 KB  
Article
Computational Ghost Imaging Encryption for Multiple Images Based on Compressed Sensing and Block Scrambling
by Zhipeng Wang, Jiahuan Yang, Ruizhi Ge, Yingying Zhang and Yi Qin
Information 2026, 17(3), 239; https://doi.org/10.3390/info17030239 - 1 Mar 2026
Viewed by 467
Abstract
To achieve high capacity, high speed, and secure image transmission, we propose a multi-image computational ghost imaging (CGI)-based encryption scheme that integrates compressed sensing (CS), block scrambling, and dynamic-salt-driven bidirectional XOR diffusion. First, multiple images are partitioned into 8 × 8 pixel blocks, [...] Read more.
To achieve high capacity, high speed, and secure image transmission, we propose a multi-image computational ghost imaging (CGI)-based encryption scheme that integrates compressed sensing (CS), block scrambling, and dynamic-salt-driven bidirectional XOR diffusion. First, multiple images are partitioned into 8 × 8 pixel blocks, and their spatial structure is disrupted through random scrambling. The scrambled composite image then undergoes pixel-level encryption via two-round bidirectional XOR diffusion, using session-unique keys derived from SHA-256-based dynamic salt, eliminating the statistical characteristics of the original images. Subsequently, each pixel block is subjected to both Gaussian CS and Hadamard-based CGI measurements in parallel, achieving dual-mode compressive encryption and enhancing robustness through measurement redundancy. Finally, only the scrambling key, the XOR-diffusion key, and the compressed measurements are stored; the original image information is thus transformed into unrecognizable measurement data. During the decryption process, the Fast Iterative Shrinkage-Thresholding Algorithm (FISTA) with a Discrete Cosine Transform (DCT) sparse basis is employed for dual-sparse reconstruction from the compressed measurements, recovering the encrypted composite image. An inverse XOR operation is then applied to remove the pixel-level diffusion, followed by block reordering using the scrambling key to restore the original images. Experimental results demonstrate that the proposed scheme enables efficient and secure multi-image transmission while maintaining high decrypted image quality. Security analysis indicates that the scheme possesses high key sensitivity, effectively resisting chosen-plaintext attacks. Histogram uniformity analysis and cropping attack resistance experiments further confirm its excellent statistical security and robustness. Full article
(This article belongs to the Section Information Processes)
Show Figures

Figure 1

30 pages, 18122 KB  
Article
Fine-Grained Age-Class Identification of Moso Bamboo Using an Improved Lightweight YOLO11 Model
by Yingbin Zhang, Xinhuang Zhang, Zhichao Cai, Xi He, Shuwei Chen, Zhengxuan Lai, Kunyong Yu and Riwen Lai
J. Imaging 2026, 12(3), 102; https://doi.org/10.3390/jimaging12030102 - 27 Feb 2026
Viewed by 567
Abstract
Accurate identification of moso bamboo (Phyllostachys edulis) age classes is essential for effective forestry resource management, yet existing methods often struggle to achieve a satisfactory balance between accuracy and computational efficiency under complex field conditions. To address this challenge, this study [...] Read more.
Accurate identification of moso bamboo (Phyllostachys edulis) age classes is essential for effective forestry resource management, yet existing methods often struggle to achieve a satisfactory balance between accuracy and computational efficiency under complex field conditions. To address this challenge, this study proposes a lightweight object detection model, termed YOLO11-GCR, for fine-grained moso bamboo age-class classification based on close-range imagery. The proposed approach builds upon the YOLO11 framework and incorporates Ghost convolution, the Convolutional Block Attention Module (CBAM), and a Receptive Field Block (RFB) to reduce model complexity, enhance discriminative feature representation, and improve sensitivity to subtle texture variations among age classes. A dataset consisting of 9538 annotated bamboo culm images covering four age classes (I-du to IV-du) was constructed and divided into training, validation, and independent test sets with strict spatiotemporal separation. Experimental results indicate that YOLO11-GCR achieves robust detection performance with a lightweight architecture of 2.62 × 106 parameters and 6.2 GFLOPs, yielding an mAP@0.5 of 0.913 and an mAP@0.5–0.95 of 0.895 on the independent test set. Notably, the model demonstrates improved classification stability for visually similar age classes, such as II-du and III-du. Overall, this study presents an efficient and practical imaging-based solution for automated moso bamboo age-class recognition in complex natural environments. Full article
(This article belongs to the Section Computer Vision and Pattern Recognition)
Show Figures

Figure 1

13 pages, 3050 KB  
Article
Research and Application of Coal Gangue Detection Method Based on Improved YOLOv7-Tiny
by Shenglei Hao, Jian Ma, Zhenyang Zhang, Yong Liu, Dongxu Wu, Lehua Zhao, Peng Zhang, Kun Zhang and Mingchao Du
Processes 2026, 14(3), 488; https://doi.org/10.3390/pr14030488 - 30 Jan 2026
Cited by 1 | Viewed by 408
Abstract
Coal gangue sorting is crucial for improving coal quality and reducing environmental pollution; however, traditional methods suffer from resource wastage, high cost, and intensive labor demands. To address these challenges, this paper investigates an image recognition-based coal gangue sorting technique and proposes an [...] Read more.
Coal gangue sorting is crucial for improving coal quality and reducing environmental pollution; however, traditional methods suffer from resource wastage, high cost, and intensive labor demands. To address these challenges, this paper investigates an image recognition-based coal gangue sorting technique and proposes an improved YOLOv7-tiny detection model tailored for edge GPU devices with limited computational power and memory. YOLOv7-tiny is selected as the baseline due to its balanced performance in detection accuracy, architectural maturity, and deployment stability on edge GPUs. Compared to newer lightweight detectors such as YOLOv8-N and YOLOv6-N, YOLOv7-tiny adopts an ELAN-based modular design, which facilitates structural optimization without relying on anchor-free reconstruction or complex post-training strategies, making it particularly suitable for engineering enhancements in real-time industrial sorting under resource constraints. To tackle the limitations in computing and storage, we first introduce an ELAN-PC feature extraction module based on partial convolution and ELAN. Secondly, a GhostCSP module is proposed by integrating cross-stage aggregation and Ghost bottleneck concepts. These modules replace the original ELAN structures in the backbone and neck networks, significantly reducing floating-point operations (FLOPs) and the number of parameters. Furthermore, the SIoU loss function is adopted to replace the original bounding box loss, enhancing detection accuracy. Experimental results demonstrate that compared with the baseline YOLOv7-tiny, the improved model increases mAP0.5 from 86.9% to 88.7% (a gain of 1.8%), reduces FLOPs from 13.2 G to 9.2 G (a decrease of 30%), and cuts parameters from 6.0 M to 4.3 M (a reduction of 28%). In dynamic sorting tests, the model achieves a coal gangue sorting rate of 82.2% with a misclassification rate of 8.1%, indicating promising practical applicability. Full article
(This article belongs to the Section Energy Systems)
Show Figures

Figure 1

22 pages, 25915 KB  
Article
YOLO-Shrimp: A Lightweight Detection Model for Shrimp Feed Residues Fusing Multi-Attention Features
by Tianwen Hou, Xinying Miao, Zhenghan Wang, Yi Zhang, Zhipeng He, Yifei Sun, Wei Wang and Ping Ren
Sensors 2026, 26(3), 791; https://doi.org/10.3390/s26030791 - 24 Jan 2026
Viewed by 683
Abstract
Precise control of feeding rates is critically important in intensive shrimp farming for cost reduction, optimization of farming strategies, and protection of the aquatic environment. However, current assessment of residual feed in feeding trays relies predominantly on manual visual inspection, which is inefficient, [...] Read more.
Precise control of feeding rates is critically important in intensive shrimp farming for cost reduction, optimization of farming strategies, and protection of the aquatic environment. However, current assessment of residual feed in feeding trays relies predominantly on manual visual inspection, which is inefficient, highly subjective, and difficult to standardize. The residual feed particles typically exhibit characteristics such as small size, high density, irregular shapes, and mutual occlusion, posing significant challenges for automated visual detection. To address these issues, this study proposes a lightweight detection model named YOLO-Shrimp. To enhance the network’s capability in extracting features from small and dense targets, a novel attention mechanism termed EnSimAM is designed. Building upon the SimAM structure, EnSimAM incorporates local variance and edge response to achieve multi-scale feature perception. Furthermore, to improve localization accuracy for small objects, an enhanced weighted intersection over union loss function, EnWIoU, is introduced. Additionally, the lightweight RepGhost module is adopted as the backbone of the model, significantly reducing both the number of parameters and computational complexity while maintaining detection accuracy. Evaluated on a real-world aquaculture dataset containing 3461 images, YOLO-Shrimp achieves mAP@0.5 and mAP@0.5:0.95 scores of 70.01% and 28.01%, respectively, while reducing the parameter count by 19.7% and GFLOPs by 14.6% compared to the baseline model. Full article
(This article belongs to the Section Smart Agriculture)
Show Figures

Figure 1

27 pages, 12605 KB  
Article
YOLOv11n-CGSD: Lightweight Detection of Dairy Cow Body Temperature from Infrared Thermography Images in Complex Barn Environments
by Zhongwei Kang, Hang Song, Hang Xue, Miao Wu, Derui Bao, Chuang Yan, Hang Shi, Jun Hu and Tomas Norton
Agriculture 2026, 16(2), 229; https://doi.org/10.3390/agriculture16020229 - 15 Jan 2026
Viewed by 781
Abstract
Dairy cow body temperature is a key physiological indicator that reflects metabolic level, immune status, and environmental stress responses, and it has been widely used for early disease recognition. Infrared thermography (IRT), as a non-contact imaging technique capable of remotely acquiring the surface [...] Read more.
Dairy cow body temperature is a key physiological indicator that reflects metabolic level, immune status, and environmental stress responses, and it has been widely used for early disease recognition. Infrared thermography (IRT), as a non-contact imaging technique capable of remotely acquiring the surface radiation temperature distribution of animals, is regarded as a powerful alternative to traditional temperature measurement methods. Under practical cowshed conditions, IRT images of dairy cows are easily affected by complex background interference and generally suffer from low resolution, poor contrast, indistinct boundaries, weak structural perception, and insufficient texture information, which lead to significant degradation in target detection and temperature extraction performance. To address these issues, a lightweight detection model named YOLOv11n-CGSD is proposed for dairy cow IRT images, aiming to improve the accuracy and robustness of region of interest (ROI) detection and body temperature extraction under complex background conditions. At the architectural level, a C3Ghost lightweight module based on the Ghost concept is first constructed to reduce redundant feature extraction while lowering computational cost and enhancing the network capability for preserving fine-grained features during feature propagation. Subsequently, a space-to-depth convolution module is introduced to perform spatial rearrangement of feature maps and achieve channel compression via non-strided convolution, thereby improving the sensitivity of the model to local temperature variations and structural details. Finally, a dynamic sampling mechanism is embedded in the neck of the network, where the upsampling and scale alignment processes are adaptively driven by feature content, enhancing the model response to boundary temperature changes and weak-texture regions. Experimental results indicate that the YOLOv11n-CGSD model can effectively shift attention from irrelevant background regions to ROI contour boundaries and increase attention coverage within the ROI. Under complex IRT conditions, the model achieves P, R, and mAP50 values of 89.11%, 86.80%, and 91.94%, which represent improvements of 3.11%, 5.14%, and 4.08%, respectively, compared with the baseline model. Using Tmax as the temperature extraction parameter, the maximum error (Max. Error) and mean error (MAE. Error) in the lower udder region are reduced by 33.3% and 25.7%, respectively, while in the around the anus region, the Max. Error and MAE. Error are reduced by 87.5% and 95.0%, respectively. These findings demonstrate that, under complex backgrounds and low-quality IRT imaging conditions, the proposed model achieves lightweight and high-performance detection for both lower udder (LU) and around the anus (AA) regions and provides a methodological reference and technical support for non-contact body temperature measurement of dairy cows in practical cowshed production environments. Full article
(This article belongs to the Section Farm Animal Production)
Show Figures

Figure 1

17 pages, 11104 KB  
Article
Lightweight Improvements to the Pomelo Image Segmentation Method for Yolov8n-seg
by Zhen Li, Baiwei Cao, Zhengwei Yu, Qingting Jin, Shilei Lyu, Xiaoyi Chen and Danting Mao
Agriculture 2026, 16(2), 186; https://doi.org/10.3390/agriculture16020186 - 12 Jan 2026
Viewed by 796
Abstract
Instance segmentation in agricultural robotics requires a balance between real-time performance and accuracy. This study proposes a lightweight pomelo image segmentation method based on the YOLOv8n-seg model integrated with the RepGhost module. A pomelo dataset consisting of 5076 samples was constructed through systematic [...] Read more.
Instance segmentation in agricultural robotics requires a balance between real-time performance and accuracy. This study proposes a lightweight pomelo image segmentation method based on the YOLOv8n-seg model integrated with the RepGhost module. A pomelo dataset consisting of 5076 samples was constructed through systematic image acquisition, annotation, and data augmentation. The RepGhost architecture was incorporated into the C2f module of the YOLOv8-seg backbone network to enhance feature reuse capabilities while reducing computational complexity. Experimental results demonstrate that the YOLOv8-seg-RepGhost model enhances efficiency without compromising accuracy: parameter count is reduced by 16.5% (from 3.41 M to 2.84 M), computational load decreases by 14.8% (from 12.8 GFLOPs to 10.9 GFLOPs), and inference time is shortened by 6.3% (to 15 ms). The model maintains excellent detection performance with bounding box mAP50 at 97.75% and mask mAP50 at 97.51%. The research achieves both high segmentation efficiency and detection accuracy, offering core support for developing visual systems in harvesting robots and providing an effective solution for deep learning-based fruit target recognition and automated harvesting applications. Full article
(This article belongs to the Special Issue Advances in Precision Agriculture in Orchard)
Show Figures

Figure 1

Back to TopTop