ATDIOU: Arctangent Differential Loss Function for Bounding Box Regression
Abstract
1. Introduction
- We introduce ATDIoU, an arctangent-differential loss for BBR that augments IoU with corner-wise distance supervision. Unlike overlap-only or center-distance objectives, ATDIoU explicitly constrains the box geometry by penalizing the offsets of the corresponding top-left and bottom-right corners through a bounded arctangent-differential mapping, improving localization stability.
- We provide empirical analyses of gradient behavior and optimization dynamics, showing that the arctangent-differential mapping yields smoother, non-saturating gradients under localization errors, which helps reduce box drift during training.
- Extensive experiments verify that integrating ATDIoU into a modern detector, such as YOLOv6, YOLOV10, achieves consistent mAP gains on both the PASCAL VOC and the VisDrone2019 datasets.
2. Related Work
2.1. IoU-Based BBR Losses
2.2. Object Detection
2.3. Vertex-Aware Localization
3. Methods
3.1. Simulation Experiment
3.2. Arctangent Differential Function
3.3. ATDIoU

| Algorithm 1 ATDIoU as BBR loss |
|
4. Experiments
4.1. Datasets
4.2. Experimental Setup
4.3. Metrics
4.4. Experimental Analysis
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Han, B.; Tang, Q.; Song, J.; Wang, Z.; Yang, Y. GAOC: A Gaussian Adaptive Ochiai Loss for Bounding Box Regression. Sensors 2026, 26, 368. [Google Scholar] [CrossRef] [PubMed]
- Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W.; et al. YOLOv6: A single-stage object detection framework for industrial applications. arXiv 2022, arXiv:2209.02976. [Google Scholar] [CrossRef]
- Tang, Q.; Su, C.; Tian, Y.; Zhao, S.; Yang, K.; Hao, W.; Feng, X.; Xie, M. YOLO-SS: Optimizing YOLO for enhanced small object detection in remote sensing imagery. J. Supercomput. 2025, 81, 303. [Google Scholar] [CrossRef]
- Xie, M.; Tang, Q.; Tian, Y.; Feng, X.; Shi, H.; Hao, W. DCN-YOLO: A Small-Object Detection Paradigm for Remote Sensing Imagery Leveraging Dilated Convolutional Networks. Sensors 2025, 25, 2241. [Google Scholar] [CrossRef] [PubMed]
- Wang, S. Effectiveness of traditional augmentation methods for rebar counting using UAV imagery with Faster R-CNN and YOLOv10-based transformer architectures. Sci. Rep. 2025, 15, 33702. [Google Scholar] [CrossRef] [PubMed]
- Önal, O.; Dandıl, E. Unsafe-Net: YOLO v4 and ConvLSTM based computer vision system for real-time detection of unsafe behaviours in workplace. Multimed. Tools Appl. 2025, 84, 34967–34993. [Google Scholar] [CrossRef]
- He, Q.; Xu, A.; Ye, Z.; Zhou, W.; Cai, T. Object detection based on lightweight YOLOX for autonomous driving. Sensors 2023, 23, 7596. [Google Scholar] [CrossRef] [PubMed]
- Shi, Z.; Hu, J.; Ren, J.; Ye, H.; Yuan, X.; Ouyang, Y.; He, J.; Ji, B.; Guo, J. HS-FPN: High frequency and spatial perception FPN for tiny object detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA, 25 February–4 March 2025; Volume 39, pp. 6896–6904. [Google Scholar]
- Wu, D.; Shang, M.; Luo, X.; Wang, Z. An L 1-and-L 2-norm-oriented latent factor model for recommender systems. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 5775–5788. [Google Scholar] [CrossRef] [PubMed]
- Jun, J.H.; Lee, J.S. Effective mitigation of impulsive vibration in satellite electronic units using acoustic black hole structure. Appl. Acoust. 2024, 225, 110165. [Google Scholar] [CrossRef]
- Rezatofighi, H.; Tsoi, N.; Gwak, J.; Sadeghian, A.; Reid, I.; Savarese, S. Generalized intersection over union: A metric and a loss for bounding box regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 16–17 June 2019; pp. 658–666. [Google Scholar]
- Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IoU loss: Faster and better learning for bounding box regression. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 12993–13000. [Google Scholar]
- Zhang, Y.F.; Ren, W.; Zhang, Z.; Jia, Z.; Wang, L.; Tan, T. Focal and efficient IOU loss for accurate bounding box regression. arXiv 2021, arXiv:2101.08158. [Google Scholar] [CrossRef]
- Gevorgyan, Z. SIoU loss: More powerful learning for bounding box regression. arXiv 2022, arXiv:2205.12740. [Google Scholar] [CrossRef]
- Ma, S.; Xu, Y. Mpdiou: A loss for efficient and accurate bounding box regression. arXiv 2023, arXiv:2307.07662. [Google Scholar] [CrossRef]
- Tong, Z.; Chen, Y.; Xu, Z.; Yu, R. Wise-IoU: Bounding box regression loss with dynamic focusing mechanism. arXiv 2023, arXiv:2301.10051. [Google Scholar]
- Wang, J.; Xu, C.; Yang, W.; Yu, L. A normalized Gaussian Wasserstein distance for tiny object detection. arXiv 2021, arXiv:2110.13389. [Google Scholar]
- Yang, X.; Yan, J.; Ming, Q.; Wang, W.; Zhang, X.; Tian, Q. Rethinking rotated object detection with gaussian wasserstein distance loss. In International Conference on Machine Learning; PMLR: Cambridge, MA, USA, 2021; pp. 11830–11841. [Google Scholar]
- Du, D.; Zhu, P.; Wen, L.; Bian, X.; Lin, H.; Hu, Q.; Peng, T.; Zheng, J.; Wang, X.; Zhang, Y.; et al. VisDrone-DET2019: The vision meets drone object detection in image challenge results. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, Seoul, Republic of Korea, 27–28 October 2019. [Google Scholar]
- Everingham, M.; Eslami, S.M.A.; Gool, L.V.; Williams, C.K.I.; Winn, J.; Zisserman, A. The pascal visual object classes challenge: A retrospective. Int. J. Comput. Vis. 2015, 111, 98–136. [Google Scholar] [CrossRef]
- Everingham, M.; Gool, L.V.; Williams, C.K.I.; Winn, J.; Zisserman, A. The pascal visual object classes (voc) challenge. Int. J. Comput. Vis. 2010, 88, 303–338. [Google Scholar] [CrossRef]
- Xu, J.; Ren, H.; Cai, S.; Zhang, X. An improved faster R-CNN algorithm for assisted detection of lung nodules. Comput. Biol. Med. 2023, 153, 106470. [Google Scholar] [CrossRef] [PubMed]
- Qiao, S.; Chen, L.C.; Yuille, A. Detectors: Detecting objects with recursive feature pyramid and switchable atrous convolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 10213–10224. [Google Scholar]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. Ssd: Single shot multibox detector. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Proceedings, Part I 14; Springer International Publishing: Berlin/Heidelberg, Germany, 2016; pp. 21–37. [Google Scholar]
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollar, P. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2980–2988. [Google Scholar]
- Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar] [CrossRef]
- Wang, K.; Liew, J.H.; Zou, Y.; Zhou, D.; Feng, J. Panet: Few-shot image semantic segmentation with prototype alignment. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 9197–9206. [Google Scholar]
- Chen, J.; Mai, H.; Luo, L.; Chen, X.; Wu, K. Effective feature fusion network in BIFPN for small object detection. In Proceedings of the 2021 IEEE International Conference on Image Processing (ICIP); IEEE: Piscataway, NJ, USA, 2021; pp. 699–703. [Google Scholar]
- Law, H.; Deng, J. Cornernet: Detecting objects as paired keypoints. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 734–750. [Google Scholar]
- Tian, Z.; Shen, C.; Chen, H.; He, T. FCOS: Fully convolutional one-stage object detection. In Proceedings of the IEEE International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 9627–9636. [Google Scholar]
- Zhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; Dang, Q.; Liu, Y.; Chen, J. Detrs beat yolos on real-time object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–22 June 2024; pp. 16965–16974. [Google Scholar]
- Zhu, X.; Su, W.; Lu, L.; Li, B.; Wang, X.; Dai, J. Deformable detr: Deformable transformers for end-to-end object detection. arXiv 2020, arXiv:2010.04159. [Google Scholar]
- Zhang, D.; Li, F.; Liu, S.; Zhang, L.; Su, H.; Zhu, J.; Ni, H.Y.-S.L.M. Hino: Det with improved denoising anchor boxes for end-to-end object detection. In Proceedings of the International Conference on Learning Representations, Virtual Event, 25–29 April 2023. [Google Scholar]
- Kong, Y.; Tian, D.; Xiu, J.; Che, X. Nonlinear tracking differentiator based on feedforward and arctangent function. In Proceedings of the 2020 39th Chinese Control Conference (CCC); IEEE: Piscataway, NJ, USA, 2020; pp. 2968–2973. [Google Scholar]







| EIoU | ||||||
| GIoU | ||||||
| SIoU | ||||||
| MPDIoU | ||||||
| DIoU | ||||||
| WIoU | ||||||
| ATDIoU | ||||||
| EIoU | 0.130 ± 0.0009 | 0.374 ± 0.0041 | 0.614 ± 0.0030 | 0.238 ± 0.0011 | 0.396 ± 0.0031 | 0.238 ± 0.0028 |
| GIoU | 0.126 ± 0.0035 | 0.373 ± 0.0051 | 0.631 ± 0.0009 | 0.239 ± 0.0024 | 0.396 ± 0.0035 | 0.246 ± 0.0037 |
| SIoU | 0.134 ± 0.0031 | 0.364 ± 0.0033 | 0.621 ± 0.0035 | 0.236 ± 0.0034 | 0.397 ± 0.0019 | 0.240 ± 0.0009 |
| MPDIoU | 0.129 ± 0.0017 | 0.367 ± 0.0029 | 0.621 ± 0.0010 | 0.234 ± 0.0031 | 0.395 ± 0.0020 | 0.239 ± 0.0012 |
| DIoU | 0.130 ± 0.0028 | 0.372 ± 0.0029 | 0.611 ± 0.0020 | 0.236 ± 0.0012 | 0.397 ± 0.0011 | 0.236 ± 0.0036 |
| WIoU | 0.131 ± 0.0023 | 0.373 ± 0.0030 | 0.621 ± 0.0026 | 0.244 ± 0.0035 | 0.403 ± 0.0040 | 0.245 ± 0.0019 |
| ATDIoU | 0.135 ± 0.0018 | 0.377 ± 0.0006 | 0.626 ± 0.0027 | 0.242 ± 0.0005 | 0.404 ± 0.0026 | 0.247 ± 0.0002 |
| 0.135 ± 0.0045 | 0.377 ± 0.0015 | 0.626 ± 0.0068 | 0.242 ± 0.0012 | 0.404 ± 0.0065 | 0.247 ± 0.0005 |
| NWD | 0.425 | 0.341 | 0.335 | 0.175 |
| GWD | 0.431 | 0.346 | 0.337 | 0.179 |
| EIoU | 0.430 | 0.348 | 0.339 | 0.182 |
| GIoU | 0.429 | 0.349 | 0.338 | 0.180 |
| SIoU | 0.426 | 0.343 | 0.333 | 0.178 |
| DIoU | 0.433 | 0.348 | 0.339 | 0.181 |
| ATDIoU | 0.439 | 0.353 | 0.343 | 0.185 |
| Mean ↑ | Std ↓ | CV ↓ | Score ↑ | |
|---|---|---|---|---|
| MPDIOU | 1.7476 | 0.1890 | 0.1081 | 1.5771 |
| GIoU | 1.7016 | 0.1952 | 0.1147 | 1.5265 |
| WIoU | 1.6862 | 0.2172 | 0.1288 | 1.4938 |
| DIoU | 1.6757 | 0.2282 | 0.1362 | 1.4748 |
| EIoU | 1.6757 | 0.2282 | 0.1362 | 1.4748 |
| SIoU | 1.6678 | 0.2386 | 0.1431 | 1.4590 |
| ATDIoU | 1.7746 | 0.1853 | 0.1044 | 1.6069 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tang, Q.; Qiang, H.; Tian, Y.; Feng, X.; Hao, W.; Xie, M. ATDIOU: Arctangent Differential Loss Function for Bounding Box Regression. Sensors 2026, 26, 1545. https://doi.org/10.3390/s26051545
Tang Q, Qiang H, Tian Y, Feng X, Hao W, Xie M. ATDIOU: Arctangent Differential Loss Function for Bounding Box Regression. Sensors. 2026; 26(5):1545. https://doi.org/10.3390/s26051545
Chicago/Turabian StyleTang, Qiang, Hao Qiang, Yuan Tian, Xubin Feng, Wei Hao, and Meilin Xie. 2026. "ATDIOU: Arctangent Differential Loss Function for Bounding Box Regression" Sensors 26, no. 5: 1545. https://doi.org/10.3390/s26051545
APA StyleTang, Q., Qiang, H., Tian, Y., Feng, X., Hao, W., & Xie, M. (2026). ATDIOU: Arctangent Differential Loss Function for Bounding Box Regression. Sensors, 26(5), 1545. https://doi.org/10.3390/s26051545

