Anti-UAV Target Tracking with Motion Association Integration
Abstract
1. Introduction
- An integrating motion association target detection and tracking collaboration for anti-UAV tasks is proposed.
- A motion association module is designed, which dynamically assesses target presence through a multi-level confidence evaluation mechanism and rapidly responds to target disappearance.
- To address target perception under occlusion, a grayscale-based verification matching mechanism is introduced, enabling stable tracking even in occluded conditions.
2. Materials and Methods
2.1. Object Detection
2.2. Visual Tracking
2.3. Anti-UAV Detection and Tracking Techniques
3. Methodology
3.1. Overall Framework
3.2. Detection Branch
3.2.1. Contextual Anchor Attention
3.2.2. Space-to-Depth
3.2.3. Small Object Detection Head
3.3. Tracking Branch
3.4. Motion Association Module
3.4.1. Multi-Level Evaluation Mechanism
3.4.2. Verification Matching Mechanism
| Algorithm 1: Motion Association Algorithm Based on Multi-level Evaluation. |
| Input: ) Output: Predicted bounding box in the t-th frame image ; ; then ; end then ; ; else Output (0, 0, 0, 0); end else (0, 0, 0, 0) end |
4. Experiments
4.1. Dataset and Metrics
4.1.1. Dataset
4.1.2. Metrics
4.2. Implementation Details
4.3. Comparison with the State-of-the-Art Methods
4.4. Qualitative Comparisons
4.5. Quantitative Comparisons
4.5.1. Component Effectiveness Analysis
4.5.2. Hyper-Parameters Analysis
4.5.3. Performance Comparison of Different Detection Algorithms
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Zhang, C.; Gao, Q.; Shi, R.; Yue, M. LDHD-Net: A Lightweight Network with Double Branch Head for Feature Enhancement of UAV Targets in Complex Scenes. Int. J. Intell. Syst. 2024, 2024, 7259029. [Google Scholar] [CrossRef]
- Huang, B.; Li, J.; Chen, J.; Wang, G.; Zhao, J.; Xu, T. Anti-UAV410: A Thermal Infrared Benchmark and Customized Scheme for Tracking Drones in the Wild. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 2852–2865. [Google Scholar] [CrossRef]
- Danelljan, M.; Bhat, G.; Khan, F.S.; Felsberg, M. ATOM: Accurate Tracking by Overlap Maximization. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 16–20 June 2019; pp. 4655–4664. [Google Scholar]
- Huang, L.; Zhao, X.; Huang, K. GlobalTrack: A Simple and Strong Baseline for Long-Term Tracking. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 11037–11044. [Google Scholar] [CrossRef]
- Zhu, X.-F.; Xu, T.; Zhao, J.; Liu, J.-W.; Wang, K.; Wang, G.; Li, J.; Wang, Q.; Jin, L.; Zhu, Z.; et al. Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System. arXiv 2023, arXiv:2306.15767. [Google Scholar] [CrossRef]
- Liu, S.; Xu, T.; Zhu, X.-F.; Wu, X.-J.; Kittler, J. Learning Adaptive Detection and Tracking Collaborations with Augmented UAV Synthesis for Accurate Anti-UAV System. Expert Syst. Appl. 2025, 282, 127679. [Google Scholar] [CrossRef]
- Tong, X.; Sun, X.; Zuo, Z.; Su, S.; Wu, P.; Wei, J.; Guo, R. GSFNet: Gyro-Aided Spatial-Frequency Network for Motion Deblurring of UAV Infrared Images. IEEE Trans. Geosci. Remote Sens. 2025, 63, 5003718. [Google Scholar] [CrossRef]
- Jiang, N.; Wang, K.; Peng, X.; Yu, X.; Wang, Q.; Xing, J.; Li, G.; Guo, G.; Ye, Q.; Jiao, J.; et al. Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking. IEEE Trans. Multimed. 2023, 25, 486–500. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
- Chen, Y.; Zhang, G.; Ma, Y.; Kang, J.U.; Kwan, C. Small Infrared Target Detection Based on Fast Adaptive Masking and Scaling with Iterative Segmentation. IEEE Geosci. Remote Sens. Lett. 2022, 19, 1–5. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 26 June–1 July 2016; pp. 779–788. [Google Scholar]
- Wang, Y.; Song, J.; Wang, Y.; Wang, R.; Chen, H. Object Detection Method of Inland Vessel Based on Improved YOLO. J. Mar. Sci. Eng. 2025, 13, 697. [Google Scholar] [CrossRef]
- Qin, Z.; Chen, D.; Wang, H. MCA-YOLOv7: An Improved UAV Target Detection Algorithm Based on YOLOv7. IEEE Access 2024, 12, 42642–42650. [Google Scholar] [CrossRef]
- Ye, T.; Qin, W.; Li, Y.; Wang, S.; Zhang, J.; Zhao, Z. Dense and Small Object Detection in UAV-Vision Based on a Global-Local Feature Enhanced Network. IEEE Trans. Instrum. Meas. 2022, 71, 2515513. [Google Scholar] [CrossRef]
- Wang, M.; Sheng, D.; Yuan, P.; Jin, W.; Li, L. Infrared Imaging Detection for Hazardous Gas Leakage Using Background Information and Improved YOLO Networks. Remote Sens. 2025, 17, 1030. [Google Scholar] [CrossRef]
- Wu, W.; Zhang, J.; Zhou, G.; Zhang, Y.; Wang, J.; Hu, L. ESG-YOLO: A Method for Detecting Male Tassels and Assessing Density of Maize in the Field. Agronomy 2024, 14, 241. [Google Scholar] [CrossRef]
- Sunkara, R.; Luo, T. No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects. In Machine Learning and Knowledge Discovery in Databases; Amini, M.-R., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., Tsoumakas, G., Eds.; Lecture Notes in Computer Science; Springer Nature: Cham, Switzerland, 2023; Volume 13715, pp. 443–459. ISBN 978-3-031-26408-5. [Google Scholar]
- Bhat, G.; Danelljan, M.; Van Gool, L.; Timofte, R. Learning Discriminative Model Prediction for Tracking. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 6181–6190. [Google Scholar]
- Danelljan, M.; Van Gool, L.; Timofte, R. Probabilistic Regression for Visual Tracking. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online, 14–19 June 2020; pp. 7181–7190. [Google Scholar]
- Bhat, G.; Danelljan, M.; Van Gool, L.; Timofte, R. Know Your Surroundings: Exploiting Scene Information for Object Tracking. In Computer Vision–ECCV 2020, Proceedings of the 6th European Conference, Glasgow, UK, 23–28 August 2020; Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M., Eds.; Springer International Publishing: Cham, Switzerland, 2020; Volume 12368, pp. 205–221. [Google Scholar]
- Dong, X.; Shen, J. Triplet Loss in Siamese Network for Object Tracking. In Computer Vision—ECCV 2018; Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2018; Volume 11217, pp. 472–488. ISBN 978-3-030-01260-1. [Google Scholar]
- Li, B.; Yan, J.; Wu, W.; Zhu, Z.; Hu, X. High Performance Visual Tracking with Siamese Region Proposal Network. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 8971–8980. [Google Scholar]
- Li, B.; Wu, W.; Wang, Q.; Zhang, F.; Xing, J.; Yan, J. SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 16–20 June 2019; pp. 4277–4286. [Google Scholar]
- Guo, D.; Wang, J.; Cui, Y.; Wang, Z.; Chen, S. SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online, 14–19 June 2020; pp. 6268–6276. [Google Scholar]
- Fang, H.; Wu, C.; Wang, X.; Zhou, F.; Chang, Y.; Yan, L. Online Infrared UAV Target Tracking with Enhanced Context-Awareness and Pixel-Wise Attention Modulation. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5005417. [Google Scholar] [CrossRef]
- Tao, J.; Chan, S.; Shi, Z.; Bai, C.; Chen, S. FocTrack: Focus Attention for Visual Tracking. Pattern Recognit. 2025, 160, 111128. [Google Scholar] [CrossRef]
- Zhao, J.; Zhang, J.; Li, D.; Wang, D. Vision-Based Anti-UAV Detection and Tracking. IEEE Trans. Intell. Transp. Syst. 2022, 23, 25323–25334. [Google Scholar] [CrossRef]
- Wojke, N.; Bewley, A.; Paulus, D. Simple Online and Realtime Tracking with a Deep Association Metric. In Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 17–20 September 2017; pp. 3645–3649. [Google Scholar]
- Sun, L.; Zhang, J.; Yang, Z.; Fan, B. A Motion-Aware Siamese Framework for Unmanned Aerial Vehicle Tracking. Drones 2023, 7, 153. [Google Scholar] [CrossRef]
- Xu, Y.; Fu, Y. Adapting to Length Shift: FlexiLength Network for Trajectory Prediction. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16 June 2024; pp. 15226–15237. [Google Scholar]
- Fang, H.; Wang, X.; Liao, Z.; Chang, Y.; Yan, L. A Real-Time Anti-Distractor Infrared UAV Tracker with Channel Feature Refinement Module. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Online, 11–17 October 2021; pp. 1240–1248. [Google Scholar]
- Golding, B.; Pouchelon, G.; Bellone, C.; Murthy, S.; Di Nardo, A.A.; Govindan, S.; Ogawa, M.; Shimogori, T.; Lüscher, C.; Dayer, A.; et al. Retinal Input Directs the Recruitment of Inhibitory Interneurons into Thalamic Visual Circuits. Neuron 2014, 81, 1057–1069. [Google Scholar] [CrossRef] [PubMed]
- Cai, X.; Lai, Q.; Wang, Y.; Wang, W.; Sun, Z.; Yao, Y. Poly Kernel Inception Network for Remote Sensing Detection. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16 June 2024; pp. 27706–27716. [Google Scholar]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-Excitation Networks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 7132–7141. [Google Scholar]
- Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Computer Vision—ECCV 2018; Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2018; Volume 11211, pp. 3–19. ISBN 978-3-030-01233-5. [Google Scholar]
- Khodarahmi, M.; Maihami, V. A Review on Kalman Filter Models. Arch. Computat. Methods Eng. 2023, 30, 727–747. [Google Scholar] [CrossRef]
- Yoo, J.-C.; Han, T.H. Fast Normalized Cross-Correlation. Circuits Syst. Signal Process. 2009, 28, 819–843. [Google Scholar] [CrossRef]
- Rezatofighi, H.; Tsoi, N.; Gwak, J.; Sadeghian, A.; Reid, I.; Savarese, S. Generalized Intersection over Union: A Metric and a Loss for Bounding Box Regression. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 16–20 June 2019; pp. 658–666. [Google Scholar]
- Zhao, J.; Li, J.; Jin, L.; Chu, J.; Zhang, Z.; Wang, J.; Xia, J.; Wang, K.; Liu, Y.; Gulshad, S.; et al. The 3rd Anti-UAV Workshop & Challenge: Methods and Results. arXiv 2023, arXiv:2305.07290. [Google Scholar] [CrossRef]
- Yaseen, M. What Is YOLOv8: An in-Depth Exploration of the Internal Features of the next-Generation Object Detector. arXiv 2024, arXiv:2409.07813. [Google Scholar]
- Mayer, C.; Danelljan, M.; Bhat, G.; Paul, M.; Paudel, D.P.; Yu, F.; Van Gool, L. Transforming Model Prediction for Tracking. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 19–24 June 2022; pp. 8721–8730. [Google Scholar]
- Cui, Y.; Jiang, C.; Wang, L.; Wu, G. MixFormer: End-to-End Tracking with Iterative Mixed Attention. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, New Orleans, LA, USA, 19–24 June 2022; pp. 13608–13618. [Google Scholar]










| Environment | Configuration | Specifications |
|---|---|---|
| Software | Programming Environment | Python 3.8.20 |
| Deep Learning Framework | PyTorch 2.4.1 + Torchvision 0.19.1 | |
| Operating System | Ubuntu 20.04 LTS | |
| Hardware | CPU | Intel® Xeon® Platinum 8352V CPU @ 2.10 GHz |
| GPU | NVIDIA GeForce RTX 4090 | |
| GPU Memory | 24,564 MiB |
| Dataset | Methods | IoU | ACC | SR |
|---|---|---|---|---|
| Anti-UAV | DiMP | 0.521 | 0.433 | 0.662 |
| Super-DiMP | 0.603 | 0.536 | 0.791 | |
| SiamCAR | 0.466 | 0.372 | 0.598 | |
| ToMP | 0.427 | 0.355 | 0.563 | |
| MixFormer | 0.572 | 0.497 | 0.737 | |
| GlobalTrack | 0.624 | 0.551 | 0.797 | |
| EDTC | 0.676 | 0.617 | 0.866 | |
| MDTC | 0.672 | 0.626 | 0.875 |
| Methods | Anti-UAV410 | AntiUAV600 | FPS | ||||
|---|---|---|---|---|---|---|---|
| IoU | ACC | SR | IoU | ACC | SR | ||
| ATOM | 0.482 | 0.388 | 0.624 | 0.359 | 0.236 | 0.459 | 110 |
| DiMP | 0.537 | 0.453 | 0.691 | 0.411 | 0.296 | 0.511 | 100 |
| Super-DiMP | 0.573 | 0.497 | 0.743 | 0.428 | 0.320 | 0.536 | 87 |
| KYS | 0.422 | 0.315 | 0.536 | 0.340 | 0.214 | 0.434 | 65 |
| SiamCAR | 0.439 | 0.342 | 0.567 | 0.299 | 0.162 | 0.381 | 125 |
| ToMP | 0.520 | 0.432 | 0.679 | 0.441 | 0.332 | 0.560 | 97 |
| GlobalTrack | 0.620 | 0.554 | 0.808 | 0.463 | 0.363 | 0.582 | 30 |
| MDTC | 0.618 | 0.544 | 0.802 | 0.525 | 0.427 | 0.641 | 94 |
| Methods | Anti-UAV410 | AntiUAV600 | ||||
|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | |
| ATOM | 0.634 | 0.638 | 0.636 | 0.488 | 0.503 | 0.494 |
| DiMP | 0.700 | 0.706 | 0.703 | 0.543 | 0.560 | 0.550 |
| Super-DiMP | 0.752 | 0.759 | 0.755 | 0.571 | 0.588 | 0.578 |
| KYS | 0.544 | 0.548 | 0.545 | 0.460 | 0.474 | 0.466 |
| SiamCAR | 0.575 | 0.579 | 0.577 | 0.404 | 0.415 | 0.409 |
| ToMP | 0.688 | 0.693 | 0.690 | 0.595 | 0.612 | 0.602 |
| GlobalTrack | 0.814 | 0.825 | 0.819 | 0.615 | 0.635 | 0.623 |
| MDTC | 0.892 | 0.802 | 0.835 | 0.711 | 0.647 | 0.670 |
| Dataset | Baseline | IDB | MAM | VMS | IoU | ACC | SR |
|---|---|---|---|---|---|---|---|
| Anti-UAV410 | ✓ | 0.6057 | 0.5341 | 0.7886 | |||
| ✓ | ✓ | 0.6079 | 0.5403 | 0.7954 | |||
| ✓ | ✓ | ✓ | 0.6162 | 0.5412 | 0.8018 | ||
| ✓ | ✓ | ✓ | ✓ | 0.6177 | 0.5432 | 0.8018 | |
| Anti-UAV | ✓ | ✓ | 0.6646 | 0.6197 | 0.8678 | ||
| ✓ | ✓ | ✓ | 0.6718 | 0.6263 | 0.8751 | ||
| ✓ | ✓ | ✓ | ✓ | 0.6719 | 0.6264 | 0.8749 |
| 0.2 | 0.25 | 0.3 | 0.35 | |
| IoU | 0.6696 | 0.6719 | 0.6666 | 0.6703 |
| ACC | 0.6208 | 0.6264 | 0.6164 | 0.6226 |
| SR | 0.8683 | 0.8749 | 0.8655 | 0.8724 |
| 0.005 | 0.01 | 0.015 | 0.02 | 0.025 | |
| IoU | 0.5243 | 0.5253 | 0.5243 | 0.5243 | 0.5244 |
| ACC | 0.4265 | 0.4273 | 0.4265 | 0.4265 | 0.4266 |
| SR | 0.6402 | 0.6411 | 0.6402 | 0.6402 | 0.6402 |
| 0.8 | 0.7 | 0.6 | 0.5 | 0.3 | |
| IoU | 0.6696 | 0.6719 | 0.6696 | 0.6696 | 0.6695 |
| ACC | 0.6209 | 0.6264 | 0.6208 | 0.6208 | 0.6209 |
| SR | 0.8684 | 0.8749 | 0.8683 | 0.8683 | 0.8683 |
| Model | AP50 (%) | AP50–95 (%) |
|---|---|---|
| YOLOv8s | 83.3 | 49.4 |
| YOLOv10s | 85.2 | 51.8 |
| YOLOv12s | 85.7 | 52.1 |
| Faster R-CNN | 79.3 | 48.2 |
| FoveaBOX | 76.0 | 42.6 |
| Deformable DETR | 86.5 | 47.4 |
| Our | 86.8 | 53.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Cao, Y.; Sun, X.; Guo, R.; Dang, Z.; Su, S.; Bu, D. Anti-UAV Target Tracking with Motion Association Integration. Electronics 2026, 15, 839. https://doi.org/10.3390/electronics15040839
Cao Y, Sun X, Guo R, Dang Z, Su S, Bu D. Anti-UAV Target Tracking with Motion Association Integration. Electronics. 2026; 15(4):839. https://doi.org/10.3390/electronics15040839
Chicago/Turabian StyleCao, Yaofu, Xiaoyong Sun, Runze Guo, Zhaoyang Dang, Shaojing Su, and Desen Bu. 2026. "Anti-UAV Target Tracking with Motion Association Integration" Electronics 15, no. 4: 839. https://doi.org/10.3390/electronics15040839
APA StyleCao, Y., Sun, X., Guo, R., Dang, Z., Su, S., & Bu, D. (2026). Anti-UAV Target Tracking with Motion Association Integration. Electronics, 15(4), 839. https://doi.org/10.3390/electronics15040839

