Real-Time 6D Pose Estimation and Multi-Target Tracking for Low-Cost Multi-Robot System

Shan, Bo; Zhao, Donghui; Zhao, Ruijin; Hiroshi, Yokoi

doi:10.3390/s25237130

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

Real-Time 6D Pose Estimation and Multi-Target Tracking for Low-Cost Multi-Robot System

by

Bo Shan

¹,

Donghui Zhao

^1,2,*

,

Ruijin Zhao

¹ and

Yokoi Hiroshi

³

¹

School of Electrical Engineering, Shenyang University of Technology, Shenyang 110178, China

²

Jianghuai Advanced Technology Center, Hefei 230000, China

³

Department of Mechanical Engineering and Intelligent Systems, University of Electro-Communications, Tokyo 182-8585, Japan

^*

Author to whom correspondence should be addressed.

Sensors 2025, 25(23), 7130; https://doi.org/10.3390/s25237130

Submission received: 28 September 2025 / Revised: 8 November 2025 / Accepted: 17 November 2025 / Published: 21 November 2025

(This article belongs to the Special Issue Advanced Sensors for Intelligent Robotic Systems: Vision, Touch, and Dexterous Manipulation)

Download Versions Notes

Abstract

In the research field of multi-robot cooperation, reliable and low-cost motion capture is crucial for system development and validation. To address the high costs of traditional motion capture systems, this study proposes a real-time 6D pose estimation and tracking method for multi-robot systems based on YolPnP-FT. Using only an Intel RealSense D435i depth camera, the system achieves simultaneous robot classification, 6D pose estimation, and multi-target tracking in real-world environments. The YolPnP-FT pipeline introduces a keypoint confidence filtering strategy (PnP-FT) at the output of the YOLOv8 detection head and employs Gaussian-penalized Soft-NMS to enhance robustness under partial occlusion. Based on these detection results, a linearly weighted combination of Mahalanobis distance and cosine distance enables stable ID assignment in visually similar multi-robot scenarios. Experimental results show that, at a camera height below 2.5 m, the system achieves an average position error of less than 0.009 m and an average angular error of less than 4.2°, with a stable tracking frame rate of 19.8 FPS at 1920 × 1080 resolution. Furthermore, the perception outputs are validated in a CoppeliaSim-based simulation environment, confirming their utility for downstream coordination tasks. These results demonstrate that the proposed method provides a low-cost, real-time, and deployable perception solution for multi-robot systems.

Keywords: multi-robot system; RGB sensing; 6D pose estimation; multi-target tracking; real-time perception; low-cost perception

Share and Cite

MDPI and ACS Style

Shan, B.; Zhao, D.; Zhao, R.; Hiroshi, Y. Real-Time 6D Pose Estimation and Multi-Target Tracking for Low-Cost Multi-Robot System. Sensors 2025, 25, 7130. https://doi.org/10.3390/s25237130

AMA Style

Shan B, Zhao D, Zhao R, Hiroshi Y. Real-Time 6D Pose Estimation and Multi-Target Tracking for Low-Cost Multi-Robot System. Sensors. 2025; 25(23):7130. https://doi.org/10.3390/s25237130

Chicago/Turabian Style

Shan, Bo, Donghui Zhao, Ruijin Zhao, and Yokoi Hiroshi. 2025. "Real-Time 6D Pose Estimation and Multi-Target Tracking for Low-Cost Multi-Robot System" Sensors 25, no. 23: 7130. https://doi.org/10.3390/s25237130

APA Style

Shan, B., Zhao, D., Zhao, R., & Hiroshi, Y. (2025). Real-Time 6D Pose Estimation and Multi-Target Tracking for Low-Cost Multi-Robot System. Sensors, 25(23), 7130. https://doi.org/10.3390/s25237130

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Real-Time 6D Pose Estimation and Multi-Target Tracking for Low-Cost Multi-Robot System

Abstract

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI