eCBAM and saSIoU Co-Optimized YOLOv11 for Riverine Floating Garbage Classification Under Complex Aquatic Scenarios
Abstract
1. Introduction
1.1. Research Background and Significance
1.2. Related Work
1.2.1. Object Detection in Aquatic Environmental Monitoring
1.2.2. Attention Mechanism for Target Detection
1.2.3. Loss Function Optimization for Bounding Box Regression
1.3. Research Content and Innovations
2. Related Theoretical Foundations
2.1. YOLOv11 Model Architecture
2.2. CBAM Attention Mechanism
2.3. SIoU Loss Function
3. Design of Improved YOLOv11 Model
3.1. Enhanced CBAM (eCBAM) Attention Mechanism
3.1.1. Floating Object Channel Attention (Multi-Scale Channel Attention)
3.1.2. Floating Object Spatial Attention (Boundary-Enhanced Spatial Attention)
3.1.3. Shape-Adaptive Weight Fusion
3.2. Scenario-Adapted saSIoU Loss Function
3.2.1. Angle Sensitivity Enhancement for Large Targets
3.2.2. Shape-Adaptive Mechanism for Irregular Objects
3.2.3. Dynamic Boundary Blur Tolerance
3.3. Overall Architecture of the Improved YOLOv11 Model
4. Experiments and Results Analysis
4.1. Experimental Dataset and Environment
4.1.1. Dataset Construction and Augmentation
4.1.2. Experimental Environment and Parameter Settings
4.2. Evaluation Metrics
4.3. Ablation Experiments
4.3.1. Effect of eCBAM
4.3.2. Effect of saSIoU
4.3.3. Synergistic Effect of eCBAM and saSIoU
4.4. Comparative Experiments
4.4.1. Accuracy Comparison
4.4.2. Precision and Recall Comparison
4.4.3. Efficiency Comparison
4.5. Robustness Experiments
4.6. Visualization Analysis
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Yin, H.; Islam, M.S.; Ju, M. Urban river pollution in the densely populated city of Dhaka, Bangladesh: Big picture and rehabilitation experience from other developing countries. J. Clean. Prod. 2025, 321, 129040. [Google Scholar] [CrossRef]
- UNESCO, UN-Water, World Water Assessment Programme. The United Nations World Water Development Report 2024: Water for Prosperity and Peace. 2024. United Nations Digital Library System. ISBN 9789231006579. Available online: https://digitallibrary.un.org/record/4042870 (accessed on 8 August 2025).
- Hangzhou Lvzhongyou Intelligent Technology Co., Ltd. Governance of Floating Garbage in Urban Rivers: Dilemmas of Traditional Cleaning Methods and the Breakthrough Path of Waterborne Trash Bins. 2025. Available online: http://m.toutiao.com/group/7542790129312727562/?upstream_biz=doubao (accessed on 15 September 2025).
- Du, X.; Xie, Z. Water Floating Garbage Detection Algorithm Based on Improved YOLOv7-Tiny. 2025. Available online: https://www.researchgate.net/publication/374791946_Water_Floating_Garbage_Detection_Algorithm_Based_on_Improved_YOLOv7-Tiny/fulltext/652fd4685d51a8012b52c1fb/Water-Floating-Garbage-Detection-Algorithm-Based-on-Improved-YOLOv7-Tiny.pdf (accessed on 20 September 2025).
- Zhu, J.; Hu, T.; Zheng, L.; Zhou, N.; Ge, H.; Hong, Z. YOLOv8-C2f-Faster-EMA: An improved underwater trash detection model based on YOLOv8. Sensors 2024, 24, 2483. [Google Scholar] [CrossRef] [PubMed]
- Lu, X.T.; Yang, T.; Jin, W.; Liu, J.; Wen, R. Methods for water fluctuation and underwater turbulence degraded imaging. J. Appl. Opt. 2017, 38, 42–55. [Google Scholar] [CrossRef]
- Xian, R.; Tang, L.; Liu, S. Development of a lightweight floating object detection algorithm. Water 2024, 16, 1633. [Google Scholar] [CrossRef]
- Kim, Y.; Kim, S.; Jeon, M. NBBOX: Noisy Bounding Box Improves Remote Sensing Object Detection. arXiv 2025, arXiv:2409.09424v3. [Google Scholar] [CrossRef]
- Li, M.; Wang, Q.; Zhang, W. Research on water surface garbage detection method based on machine vision. Environ. Sci. Technol. 2020, 43, 189–195. [Google Scholar]
- Zhao, C.Z. Research and Implementation of Water Surface Floating Object Detection Method Based on Deep Learning. Master’s Thesis, Hebei University of Engineering, Handan, China, 2025. [Google Scholar] [CrossRef]
- Yi, Z.R.; Yao, D.Y.; Li, G.J.; Ai, J.Y.; Xie, W. Detection and localization for lake floating objects based on CA-faster R-CNN. Multimed. Tools Appl. 2022, 81, 17263–17281. [Google Scholar] [CrossRef]
- Li, H.; Yang, S.P.; Liu, J.J.; Yang, Y.; Kadoch, M.; Liu, T.Y. A Framework and Method for Surface Floating Object Detection Based on 6G Networks. Electronics 2022, 11, 2939. [Google Scholar] [CrossRef]
- Zou, Y.; Wang, X.; Wang, L. A high-quality instance-segmentation network for floating-algae detection using RGB images. Remote Sens. 2022, 14, 6247. [Google Scholar] [CrossRef]
- Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision, Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Yang, Z.; Yin, Y.; Jing, Q.; Shao, Z. A High-Precision Detection Model of Small Objects in Maritime UAV Perspective Based on Improved YOLOv5. J. Mar. Sci. Eng. 2023, 11, 1680. [Google Scholar] [CrossRef]
- Wang, G.; Chen, Y.; Zhang, J.; Li, H.; Liu, S. Multi-scale CBAM for small object detection. Pattern Recognit. Lett. 2021, 146, 10–16. [Google Scholar]
- Liu, Y.; Li, X.; Wang, Z.; Zhang, H.; Chen, L. Edge-enhanced CBAM for medical image segmentation. Comput. Biol. Chem. 2022, 100, 107568. [Google Scholar]
- Zhao, J.; Li, H.; Zhang, L.; Wang, Y.; Chen, S. Reflection-aware attention module for water surface object detection. IEEE Sens. J. 2023, 23, 17452–17461. [Google Scholar]
- Yu, J.H.; Jiang, Y.N.; Wang, Z.Y.; Cao, Z.M.; Huang, T.S. UnitBox: An Advanced Object Detection Network. In Proceedings of the 24th ACM International Conference on Multimedia; ACM: New York, NY, USA, 2016; pp. 516–520. [Google Scholar] [CrossRef]
- Rezatofighi, H.; Tsoi, N.; Gwak, J.Y.; Sadeghian, A.; Reid, I.; Savarese, S. Generalized intersection over union: A metric and a loss for bounding box regression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2019, Long Beach, CA, USA, 16–20 June 2019; pp. 658–666. [Google Scholar]
- Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IoU loss: Faster and better learning for bounding box regression. In Proceedings of the AAAI Conference on Artificial Intelligence 2020, New York, NY, USA, 9–11 February 2020; pp. 12993–13000. [Google Scholar]
- Zheng, Z.; Wang, P.; Liu, W.; Ren, D.W.; Ye, R.G.; Hu, Q.H.; Zuo, W.M. Complete IoU loss: Improving object detection bounding box regression. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 4897–4912. [Google Scholar]
- Gevorgyan, Z. SIoU loss: More powerful learning for bounding box regression. arXiv 2022, arXiv:2205.12740. [Google Scholar] [CrossRef]
- Li, H.; Kong, M.; Shi, Y. Tea bud detection model in a real picking environment based on an improved YOLOv5. Biomimetics 2024, 9, 692. [Google Scholar] [CrossRef]
- Zhang, Y.; Li, J.; Wang, H.; Zhang, X.; Zhou, Y.; Zhang, J. Adaptive SIoU loss for marine debris detection. IEEE J. Ocean. Eng. 2023, 48, 890–901. [Google Scholar]
- Ultralytics. YOLOv11 Official Documentation. 2024. Available online: https://docs.ultralytics.com/models/yolov11/ (accessed on 21 July 2025).
- Redmon, J.; Farhadi, A. YOLOv3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar] [CrossRef]
- Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W.; et al. YOLOv6: A single-stage object detection framework for industrial applications. arXiv 2022, arXiv:2209.02976. [Google Scholar] [CrossRef]
- Ultralytics. YOLOv8 Official Documentation. 2023. Available online: https://docs.ultralytics.com/models/yolov8/ (accessed on 20 June 2025).
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv9: Learning what you want to learn using programmable gradient information. arXiv 2024, arXiv:2402.13616. [Google Scholar] [CrossRef]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. YOLOv10: Real-Time End-to-End Object Detection. arXiv 2024, arXiv:2405.14458. [Google Scholar]








| Target Size | Definition (Pixel Area) | Count | Proportion (%) |
|---|---|---|---|
| Large | >10,000 | 6691 | 72.93 |
| Medium | 1000–10,000 | 1717 | 18.62 |
| Small | <1000 | 778 | 8.45 |
| Parameter | Enviroment Configuration |
|---|---|
| GPU | NVIDIA RTX 4090 |
| CPU | Intel(R) Xeon(R) Gold 5418Y (10 cores) |
| VRAM | 24 G |
| RAM | 120 GB |
| PyTorch | 2.2.2 |
| Python | 3.10 |
| Model | mAP@0.5 (%) | mAP@0.95 (%) | Precision (%) | Recall (%) | FPS | Parameters | GFLOPs |
|---|---|---|---|---|---|---|---|
| YOLOv11 (Baseline) | 83.83 | 52.17 | 79.62 | 81.94 | 112 | 2,590,425 | 6.44 |
| YOLOv11 + eCBAM | 85.52 | 54.69 | 77.39 | 84.90 | 108 | 2,598,864 | 6.44 |
| YOLOv11 + saSIoU | 85.20 | 54.83 | 83.91 | 81.70 | 111 | 2,590,425 | 6.44 |
| Proposed Model | 86.48 | 56.44 | 80.43 | 84.36 | 107 | 2,598,864 | 6.44 |
| Model | mAP@0.5 (%) | mAP@0.95 (%) | Precision (%) | Recall (%) | FPS | Parameters | GFLOPs |
|---|---|---|---|---|---|---|---|
| YOLOv5 | 74.96 | 46.25 | 74.26 | 71.00 | 98 | 2,188,409 | 5.93 |
| YOLOv6 | 73.88 | 46.61 | 73.18 | 69.24 | 85 | 4,160,041 | 11.57 |
| YOLOv9t | 77.67 | 47.51 | 71.10 | 76.47 | 105 | 1,765,513 | 6.70 |
| YOLOv10n | 78.61 | 50.70 | 73.05 | 74.69 | 102 | 2,708,210 | 8.40 |
| YOLOv8n | 82.94 | 52.62 | 79.67 | 80.25 | 109 | 2,690,793 | 6.94 |
| YOLOv11 (Baseline) | 83.83 | 52.17 | 79.62 | 81.94 | 112 | 2,590,425 | 6.44 |
| Proposed Model | 86.48 | 56.44 | 80.43 | 84.36 | 107 | 2,598,864 | 6.44 |
| Model | High-Reflection mAP@0.5 (%) | Turbulent Flow mAP@0.5 (%) | Dense Objects mAP@0.5 (%) | Average mAP@0.5 (%) |
|---|---|---|---|---|
| YOLOv11 | 78.32 | 79.56 | 81.24 | 79.71 |
| YOLOv8n | 77.89 | 78.91 | 80.55 | 79.12 |
| YOLOv10n | 76.45 | 77.82 | 79.33 | 77.87 |
| Proposed Model | 83.67 | 84.21 | 85.19 | 84.36 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhao, Z.; Huang, W.; Li, T.; Zhu, J. eCBAM and saSIoU Co-Optimized YOLOv11 for Riverine Floating Garbage Classification Under Complex Aquatic Scenarios. Appl. Sci. 2026, 16, 651. https://doi.org/10.3390/app16020651
Zhao Z, Huang W, Li T, Zhu J. eCBAM and saSIoU Co-Optimized YOLOv11 for Riverine Floating Garbage Classification Under Complex Aquatic Scenarios. Applied Sciences. 2026; 16(2):651. https://doi.org/10.3390/app16020651
Chicago/Turabian StyleZhao, Ziyu, Wenquan Huang, Teng Li, and Jing Zhu. 2026. "eCBAM and saSIoU Co-Optimized YOLOv11 for Riverine Floating Garbage Classification Under Complex Aquatic Scenarios" Applied Sciences 16, no. 2: 651. https://doi.org/10.3390/app16020651
APA StyleZhao, Z., Huang, W., Li, T., & Zhu, J. (2026). eCBAM and saSIoU Co-Optimized YOLOv11 for Riverine Floating Garbage Classification Under Complex Aquatic Scenarios. Applied Sciences, 16(2), 651. https://doi.org/10.3390/app16020651

