- Article
Benchmarking YOLO Models for Crop Growth and Weed Detection in Cotton Fields
- Hassan Raza,
- Muhammad Abu Bakr and
- Sultan Daud Khan
- + 3 authors
Reliable differentiation of crops and weeds is essential for precision agriculture, where real-time detection can minimize chemical inputs and support site-specific interventions. This study presents the large-scale and systematic benchmark of 19 YOLO-family variants, spanning YOLOv3 through YOLOv11, for cotton–weed detection using the Cotton–8 dataset. The dataset comprises 4440 annotated field images with five categories: broadleaf weeds, grass weeds, and three growth stages of cotton. All models were trained under a standardized protocol with COCO-pretrained weights, fixed seeds, and Ultralytics implementations to ensure reproducibility and fairness. Inference was conducted with a confidence threshold of 0.25 and a non-maximum suppression (NMS) IoU threshold of 0.45, with test-time augmentation (TTA) disabled. Evaluation employed precision, recall, mAP@0.5, and mAP@0.5:0.95, along with inference latency and parameter counts to capture accuracy–efficiency trade-offs. Results show that larger models, such as YOLO11x, achieved the best detection accuracy (mAP@0.5 = 81.5%), whereas lightweight models like YOLOv8n and YOLOv9t offered the fastest inference ( 27 msper image) but with reduced accuracy. Across classes, cotton growth stages were detected reliably, but broadleaf and grass weeds remained challenging, especially under stricter localization thresholds. These findings highlight that the key bottleneck lies in small-object detection and precise localization rather than architectural design. By providing the first direct comparison across successive YOLO generations for weed detection in cotton, this work offers a practical reference for researchers and practitioners selecting models for real-world, resource-constrained cotton–weed management.
5 November 2025





