Systematic Evaluation of YOLOv8 Variants for UAV-Based Object Detection
Abstract
1. Introduction
- Comprehensive Model Capacity Analysis: A systematic evaluation of all YOLOv8 variants (11 M to 68 M parameters) is conducted specifically for drone imagery. The results reveal that the largest model (YOLOv8x) dramatically underperforms due to training instability under limited data conditions, while YOLOv8l achieves the best overall performance, reaching a 15.9% mAP50.
- Input Resolution versus Architectural Modifications: The study quantitatively demonstrates that input resolution scaling (640 → 1280) yields a 25% improvement in detection performance, significantly outperforming architectural enhancements such as P2 detection layer addition (+6%).
- Diminishing Returns and Failure Modes: The performance trajectory across model scales is analyzed, revealing clear diminishing returns in model scaling (s → m: +59%; m → l: +31%; l → x: −54%). A detailed analysis of the failure mechanisms shows why oversized models fail in data-constrained scenarios, which is a critical yet underreported phenomenon.
- Practical Deployment Guidelines: Based on the experimental findings, concrete recommendations are derived for various application scenarios. The optimal configuration achieves a 488% improvement over the YOLOv5 baseline and establishes strong and reproducible baselines for future research on the VisDrone benchmark.
2. Related Work
2.1. YOLO Series and Model Selection
2.2. Prior Work on Object Detection
2.2.1. General Object Detection
2.2.2. Small-Object Detection
- Limited Pixel Information: Small objects contain minimal visual information, and their critical features are often lost during convolutional downsampling. For example, standard CNN architectures with five pooling layers reduce a 640 × 640 input image to a 20 × 20 feature representation. At this resolution, a typical 16-pixel object occupies less than one feature map cell, making reliable detection extremely difficult [6].
- Scale Imbalance: Small objects constitute a minority of instances in most common datasets (e.g., 41% of COCO objects are small), leading to a training imbalance in which detectors become biased toward medium- and large-sized objects [18].
- Context Ambiguity: As small objects occupy only a few pixels, they often lack sufficient context for disambiguation. For instance, in urban images, a 10 × 10-pixel blob can represent a pedestrian, a bicycle, or even background noise.
2.2.3. Drone-Based Object Detection
Challenges in Aerial Imagery
Aerial Detection Benchmarks
Existing Methods for Aerial Detection
3. Experimental Results and Analysis
3.1. Experimental Setup
3.2. Model Capacity Effect
Comparison Between YOLOv8 s/m/l Models
3.3. Input Resolution Effect
Comparison Between 640 and 1280 Input Resolutions
3.4. Failure Mechanism of Oversized Models
3.5. Architectural Modification Effect
Addition of P2 Detection Layer
3.6. Comprehensive Comparison
3.6.1. Comparison of Model Variants with YOLOv5 Baseline
3.6.2. Comparison of Various Models with YOLO-HV
- Backbone Architecture: YOLO-HV employs the NextViT backbone, a hybrid CNN-Transformer architecture. Transformers are particularly advantageous for small-object detection owing to their improved ability to model long-range dependencies and integrate the global scene context. Empirical evidence shows that replacing the standard convolutional backbone with NextViT yields a solid 3.1% mAP improvement over its baseline [8]. In contrast, YOLOv8l relies solely on a purely convolutional CSP-based backbone, which may lack such global receptive capabilities.
- Advanced Upsampling: YOLO-HV employs the DyHead module, which performs scale-, spatial-, and task-aware attention to dynamically adjust the detection strategy based on object characteristics. Ablation studies indicate that this dynamic mechanism is the most critical contributor to YOLO-HV’s success, providing an additional 4.1% mAP boost [8]. YOLOv8l, while utilizing a decoupled head, lacks these sophisticated attention mechanisms.
- Advanced Upsampling and Multi-Scale Fusion: YOLO-HV utilizes Content-Aware ReAssembly of Features (CARAFE) for upsampling, alongside specialized multi-scale convolutions (e.g., MSDConv). These modules adaptively reassemble local features and preserve fine-grained details critical for small-object detection, collectively contributing further performance gains [8]. Conversely, YOLOv8l relies on standard bilinear upsampling, which does not employ content adaptivity and therefore retains less spatial detail.
3.7. Speed-Accuracy Trade-Off Analysis
4. Discussion
4.1. Performance Difference Between YOLOv8l and YOLO-HV
4.1.1. Architectural Factors
4.1.2. Methodological Factors
4.1.3. Validation of Gap Decomposition
4.2. Per-Class Performance Analysis
4.2.1. Large Objects (Cars, Buses, and Trucks)
4.2.2. Medium Objects (Vans, Motors)
4.2.3. Small Objects (Pedestrians, Bicycles)
4.2.4. Gap to State-of-the-Art
4.2.5. Category-Specific Insights
4.3. Detailed Analysis of Contributions of Present Work
4.3.1. Performance Effects of Model Capacity
4.3.2. Performance Effects of Resolution vs. Architecture
4.3.3. Identification and Analysis of Diminishing Returns and Failure Modes
4.3.4. Evidence-Based Practical Deployment Guidelines
- Real-time applications (>20 FPS): YOLOv8s @ 640 (6.12% mAP50, 80 FPS) for latency-critical navigation.
- Balanced systems (10–20 FPS): YOLOv8m @ 640 (9.71% mAP50, 50 FPS) for live-traffic monitoring.
- High-accuracy offline analysis (<10 FPS): YOLOv8l @ 1280 (15.9% mAP50, 8 FPS) for forensic investigations.
- Avoid: YOLOv8x @ 1280 unless the dataset contains more than 50 K images, or training instability is explicitly mitigated via gradient accumulation or alternative normalization (e.g., GroupNorm).
4.4. Reframing Absolute Performance
4.5. Limitations and Future Directions
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Du, D.; Zhu, P.; Wen, L.; Bian, X.; Lin, H.; Hu, Q.; Peng, T.; Zheng, J.; Wang, X.; Zhang, Y.; et al. VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops; IEEE: New York, NY, USA, 2019; pp. 213–226. Available online: https://openaccess.thecvf.com/content_ICCVW_2019/html/VISDrone/Du_VisDrone-DET2019_The_Vision_Meets_Drone_Object_Detection_in_Image_Challenge_ICCVW_2019_paper.html (accessed on 14 December 2025).
- Lin, T.-Y.; Maire, M.; Belongie, S.J.; Bourdev, L.D.; Girshick, R.B.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. Microsoft COCO: Common Objects in Context. In Proceedings of the European Conference on Computer Vision; Springer: Berlin/Heidelberg, Germany, 2014; Available online: https://link.springer.com/chapter/10.1007/978-3-319-10602-1_48 (accessed on 14 December 2025).
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2016; pp. 779–788. Available online: https://ieeexplore.ieee.org/document/7780460 (accessed on 14 December 2025).
- Jocher, G. YOLOv5. 2020. Available online: https://github.com/ultralytics/yolov5 (accessed on 14 December 2025).
- Jocher, G.; Chaurasia, A.; Qiu, J. YOLOv8. 2023. Available online: https://github.com/ultralytics/ultralytics (accessed on 14 December 2025).
- Lin, T.-Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature Pyramid Networks for Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2017; pp. 2117–2125. Available online: https://ieeexplore.ieee.org/document/8099589 (accessed on 14 December 2025).
- Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. Cbam: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV); Springer: Berlin/Heidelberg, Germany, 2018; pp. 3–19. Available online: https://openaccess.thecvf.com/content_ECCV_2018/html/Sanghyun_Woo_Convolutional_Block_Attention_ECCV_2018_paper.html (accessed on 14 December 2025).
- Xu, S.; Zhang, M.; Chen, J.; Zhong, Y. YOLO-HyperVision: A Vision Transformer Backbone-Based Enhancement of YOLOv5 for Detection of Dynamic Traffic Information. Egypt. Inform. J. 2024, 27, 100523. [Google Scholar] [CrossRef]
- Li, J.; Xia, X.; Li, W.; Li, H.; Wang, X.; Xiao, X.; Wang, R.; Zheng, M.; Pan, X. Next-Vit: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios. arXiv 2022, arXiv:220705501. [Google Scholar] [CrossRef]
- Wang, J.; Chen, K.; Xu, R.; Liu, Z.; Loy, C.C.; Lin, D. Carafe: Content-Aware Reassembly of Features. In Proceedings of the IEEE/CVF International Conference on Computer Vision; IEEE: New York, NY, USA, 2019; pp. 3007–3016. Available online: https://ieeexplore.ieee.org/document/9010830 (accessed on 14 December 2025).
- Lin, T.-Y.; Goyal, P.; Girshick, R.B.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2017; Available online: https://ieeexplore.ieee.org/document/8417976 (accessed on 14 December 2025).
- Girshick, R.B.; Donahue, J.; Darrell, T.; Malik, J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2013; Available online: https://ieeexplore.ieee.org/document/6909475 (accessed on 14 December 2025).
- Girshick, R. Fast R-Cnn. In Proceedings of the IEEE International Conference on Computer Vision; IEEE: New York, NY, USA, 2015; pp. 1440–1448. Available online: https://ieeexplore.ieee.org/document/7410526 (accessed on 14 December 2025).
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.E.; Fu, C.-Y.; Berg, A.C. SSD: Single Shot MultiBox Detector. In Proceedings of the European Conference on Computer Vision; IEEE: New York, NY, USA, 2015; Available online: https://link.springer.com/chapter/10.1007/978-3-319-46448-0_2 (accessed on 14 December 2025).
- Tan, M.; Pang, R.; Le, Q.V. EfficientDet: Scalable and Efficient Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2019; Available online: https://ieeexplore.ieee.org/document/9156454 (accessed on 14 December 2025).
- Zhang, S.; Chi, C.; Yao, Y.; Lei, Z.; Li, S.Z. Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2019; Available online: https://ieeexplore.ieee.org/document/9156746 (accessed on 14 December 2025).
- Singh, B.; Davis, L.S. An Analysis of Scale Invariance in Object Detection Snip. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2018; pp. 3578–3587. Available online: https://ieeexplore.ieee.org/document/8578475 (accessed on 21 March 2026).
- Liu, S.; Qi, L.; Qin, H.; Shi, J.; Jia, J. Path Aggregation Network for Instance Segmentation. In Proceedings of the 2018, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2018; Available online: https://ieeexplore.ieee.org/document/8579011 (accessed on 21 March 2026).
- Ghiasi, G.; Lin, T.-Y.; Pang, R.; Le, Q.V. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2019; Available online: https://ieeexplore.ieee.org/document/8954436 (accessed on 21 March 2026).
- Wang, Q.; Wu, B.; Zhu, P.; Li, P.; Zuo, W.; Hu, Q. ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2019; Available online: https://ieeexplore.ieee.org/document/9156697 (accessed on 14 December 2025).
- Wang, X.; Girshick, R.B.; Gupta, A.; He, K. Non-Local Neural Networks. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2017; Available online: https://ieeexplore.ieee.org/document/8578911 (accessed on 14 December 2025).
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2021; Available online: https://ieeexplore.ieee.org/document/9710580 (accessed on 14 December 2025).
- Dai, J.; Qi, H.; Xiong, Y.; Li, Y.; Zhang, G.; Hu, H.; Wei, Y. Deformable Convolutional Networks. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2017; Available online: https://ieeexplore.ieee.org/document/8237351 (accessed on 14 December 2025).
- Zhu, X.; Su, W.; Lu, L.; Li, B.; Wang, X.; Dai, J. Deformable DETR: Deformable Transformers for End-to-End Object Detection. arXiv 2020, arXiv:2010.04159. [Google Scholar] [CrossRef]
- Liu, W.; Lu, H.; Fu, H.; Cao, Z. Learning to Upsample by Learning to Sample. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2023; Available online: https://ieeexplore.ieee.org/document/10377871 (accessed on 14 December 2025).
- Ghiasi, G.; Cui, Y.; Srinivas, A.; Qian, R.; Lin, T.-Y.; Cubuk, E.D.; Le, Q.V.; Zoph, B. Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2020; Available online: https://ieeexplore.ieee.org/document/9578639 (accessed on 14 December 2025).
- Bochkovskiy, A.; Wang, C.-Y.; Liao, H.-Y.M. YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv 2020, arXiv:2004.10934. [Google Scholar] [CrossRef]
- Du, D.; Qi, Y.; Yu, H.; Yang, Y.; Duan, K.; Li, G.; Zhang, W.; Huang, Q.; Tian, Q. The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking. In Proceedings of the European Conference on Computer Vision; Springer: Berlin/Heidelberg, Germany, 2018; Available online: https://openaccess.thecvf.com/content_ECCV_2018/html/Dawei_Du_The_Unmanned_Aerial_ECCV_2018_paper.html (accessed on 14 December 2025).
- Xia, G.-S.; Bai, X.; Ding, J.; Zhu, Z.; Belongie, S.J.; Luo, J.; Datcu, M.; Pelillo, M.; Zhang, L. DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2017; Available online: https://ieeexplore.ieee.org/document/8578516 (accessed on 14 December 2025).
- Cai, Z.; Vasconcelos, N. Cascade R-CNN: Delving into High Quality Object Detection. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2017; Available online: https://ieeexplore.ieee.org/document/8578742 (accessed on 14 December 2025).
- Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops; IEEE: New York, NY, USA, 2021; Available online: https://ieeexplore.ieee.org/document/9607487 (accessed on 14 December 2025).
- Singh, B.; Najibi, M.; Davis, L.S. Sniper: Efficient Multi-Scale Training. Adv. Neural Inf. Process. Syst. 2018, 31, 9333–9343. Available online: https://proceedings.neurips.cc/paper/2018/hash/166cee72e93a992007a89b39eb29628b-Abstract.html (accessed on 14 December 2025).
- Liao, J.; Tian, H. Cluster-NMS: Improving Crowded Object Detection through Clustering Pattern. Signal Image Video Process. 2025, 19, 758. [Google Scholar] [CrossRef]
- Lin, Y.; Lin, Y.; Wu, H.; Wu, M. The Enhance-Fuse-Align Principle: A New Architectural Blueprint for Robust Object Detection, with Application to X-Ray Security. Sensors 2025, 25, 6603. [Google Scholar] [CrossRef] [PubMed]


| Model | |||
|---|---|---|---|
| YOLOv8s | YOLOv8m | YOLOv8l | |
| Input | 640 | 640 | 640 |
| mAP50 | 6.12% | 9.71% | 12.70% |
| mAP50-95 | 3.01% | 5.14% | 6.89% |
| P | 16.20% | 21.10% | 25.30% |
| R | 8.50% | 13.00% | 16.50% |
| Training Time | 1.6 h | 2.6 h | 4.2 h |
| Car | 23.60% | 33.50% | 41.40% |
| Bus | 4.15% | 6.83% | 9.11% |
| Van | 8.93% | 16.10% | 19.80% |
| Truck | 8.38% | 12.10% | 18.20% |
| Motor | 4.11% | 8.72% | 12.80% |
| People | 3.14% | 5.04% | 6.45% |
| Tricycle | 2.52% | 5.44% | 6.53% |
| Pedestrian | 2.61% | 4.93% | 7.13% |
| Bicycle | 1.37% | 1.88% | 2.17% |
| Awning-Tricycle | 2.46% | 2.59% | 3.65% |
| Model | ||
|---|---|---|
| YOLOv8l | YOLOv8l | |
| Input | 640 | 1280 |
| mAP50 | 12.70% | 15.90% |
| mAP50-95 | 6.89% | 9.13% |
| P | 25.30% | 30.20% |
| R | 16.50% | 19.90% |
| Training Time | 4.2 h | 14.6 h |
| Car | 41.40% | 39.80% |
| Bus | 9.11% | 13.70% |
| Van | 19.80% | 25.90% |
| Truck | 18.20% | 22.50% |
| Motor | 12.80% | 16.40% |
| People | 6.45% | 10.30% |
| Tricycle | 6.53% | 11.00% |
| Pedestrian | 7.13% | 9.26% |
| Bicycle | 2.17% | 3.93% |
| Awning-Tricycle | 3.65% | 6.57% |
| Model | ||
|---|---|---|
| YOLOv8l | YOLOv8x | |
| Input | 1280 | 1280 |
| mAP50 | 15.90% | 7.32% |
| mAP50-95 | 9.13% | 4.02% |
| P | 30.20% | 13.40% |
| R | 19.90% | 11.60% |
| Training Time | 14.6 h | |
| Car | 39.80% | 16.90% |
| Bus | 13.70% | 6.42% |
| Van | 25.90% | 9.68% |
| Truck | 22.50% | 8.47% |
| Motor | 16.40% | 9.92% |
| People | 10.30% | 6.73% |
| Tricycle | 11.00% | 3.46% |
| Pedestrian | 9.26% | 4.70% |
| Bicycle | 3.93% | 2.26% |
| Awning-Tricycle | 6.57% | 4.59% |
| Model | ||
|---|---|---|
| YOLOv8l | YOLOv8l + P2 | |
| Input | 640 | 640 |
| mAP50 | 12.70% | 13.50% |
| mAP50-95 | 6.89% | 7.54% |
| P | 25.30% | 26.60% |
| R | 16.50% | 16.80% |
| Training Time | 4.2 h | 5.1 h |
| Car | 41.40% | 40.20% |
| Bus | 9.11% | 8.61% |
| Van | 19.80% | 21.30% |
| Truck | 18.20% | 17.50% |
| Motor | 12.80% | 14.80% |
| People | 6.45% | 10.20% |
| Tricycle | 6.53% | 7.22% |
| Pedestrian | 7.13% | 7.69% |
| Bicycle | 2.17% | 2.17% |
| Awning-Tricycle | 3.65% | 5.22% |
| Model | ||||
|---|---|---|---|---|
| YOLOv5s | YOLOv8s | YOLOv8l | YOLOv8l | |
| Input | 640 | 640 | 640 | 1280 |
| mAP50 | 3.76% | 6.12% | 12.70% | 15.90% |
| mAP50-95 | 1.47% | 3.01% | 6.89% | 9.13% |
| P | 11.20% | 16.20% | 25.30% | 30.20% |
| R | 5.50% | 8.50% | 16.50% | 19.90% |
| Training Time | 1.32 h | 1.6 h | 4.2 h | 14.6 h |
| Car | 15.10% | 23.60% | 41.40% | 39.80% |
| Bus | 2.13% | 4.15% | 9.11% | 13.70% |
| Van | 4.87% | 8.93% | 19.80% | 25.90% |
| Truck | 5.00% | 8.38% | 18.20% | 22.50% |
| Motor | 2.03% | 4.11% | 12.80% | 16.40% |
| People | 2.34% | 3.14% | 6.45% | 10.30% |
| Tricycle | 2.06% | 2.52% | 6.53% | 11.00% |
| Pedestrian | 1.02% | 2.61% | 7.13% | 9.26% |
| Bicycle | 0.98% | 1.37% | 2.17% | 3.93% |
| Awning-Tricycle | 1.88% | 2.46% | 3.65% | 6.57% |
| Model | Backbone | Input | Custom Modules | mAP50 | Params | GFLOPs |
|---|---|---|---|---|---|---|
| YOLOv5x | CSPDarknet | 640 | None | 5.1% | 86.7 M | 205.7 |
| Faster R-CNN | ResNet-101 | 1024 | None | 8.3% | 60.1 M | 370.4 |
| RetinaNet | ResNet-101 | 800 | FPN | 9,6% | 56.8 M | 315.2 |
| YOLO-HV | NextViT | 640 | CARAFE, DyHead | 38.1% | - | - |
| YOLOv8l | CSPDarknet | 1280 | None | 15.9% | 43.6 M | 164.9 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, C.-M.; Juang, J.-C. Systematic Evaluation of YOLOv8 Variants for UAV-Based Object Detection. Appl. Sci. 2026, 16, 3559. https://doi.org/10.3390/app16073559
Liu C-M, Juang J-C. Systematic Evaluation of YOLOv8 Variants for UAV-Based Object Detection. Applied Sciences. 2026; 16(7):3559. https://doi.org/10.3390/app16073559
Chicago/Turabian StyleLiu, Chieh-Min, and Jyh-Ching Juang. 2026. "Systematic Evaluation of YOLOv8 Variants for UAV-Based Object Detection" Applied Sciences 16, no. 7: 3559. https://doi.org/10.3390/app16073559
APA StyleLiu, C.-M., & Juang, J.-C. (2026). Systematic Evaluation of YOLOv8 Variants for UAV-Based Object Detection. Applied Sciences, 16(7), 3559. https://doi.org/10.3390/app16073559

