Research on Six-Degree-of-Freedom Refueling Robotic Arm Positioning and Docking Based on RGB-D Visual Guidance
Abstract
:Featured Application
Abstract
1. Introduction
2. Target Recognition and Pose Estimation of the Fuel Tank Inlet
2.1. Identifying the Target
2.2. Recognition Method and Results Based on RGB-D Camera
3. Experimental Design
3.1. Experimental Platform Setup
3.2. Experimental Procedure Design
3.3. Experimental Procedure
- a1: Target detection is performed using the OpenCV (version 4.5.2.54) method. The success of the target recognition is judged. If successful, proceed to a2; if the target is not recognized, the experiment is considered a failure.
- a2: If a1 successfully recognizes the target, pose estimation is performed on the target’s center point. The robotic arm moves the end effector to the target position for docking. The experiment is considered successful if docking is completed smoothly; otherwise, it is a failure.
- a3: The position and posture of the simulated fuel tank are changed. Under the condition of successful a1, experiment a2 is conducted until docking fails, determining the successful docking range.
- a4: The position and posture of the simulated fuel tank are changed, and experiment a1 is repeated until failure, establishing the applicable range for target recognition.
- a5: The above experiments based on the OpenCV method are replaced with the YOLOv8-based method, and experiments a1 to a4 are conducted accordingly.
3.4. Experimental Results and Discussion
4. Conclusions and Future Prospects
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
No. | Docking Interface Pose (x,y,z,rx,ry,rz) | Recognition Based on OpenCV (x,y,z,rx,ry,rz) | Success Docking | Recognition Based on OpenCV YOLOv8 (x,y,z,rx,ry,rz) | Success Docking |
---|---|---|---|---|---|
1 | (−0.998406,−0.411761,0.129670) (151.82193,36.066860,−44.923798) | (−0.998553,−0.411444,0.129181) (151.82192,36.066859,−44.923798) | Yes | (−0.998373,−0.411554,0.129454) (151.82192,36.066859,−44.923798) | Yes |
2 | (−0.948406,−0.411761,0.129670) (151.82193,36.066860,−44.923798) | (−0.948693,−0.411482,0.129118) (151.82192,36.066859,−44.923798) | Yes | (−0.948590,−0.411581,0.129318) (151.82192,36.066859,−44. 923798) | Yes |
3 | (−0.998406,−0.461761,0.129670) (151.82193,36.066860,−44.923798) | (−0.997705,−0.460317,0.129021) (151.82192,36.066859,−44.923798) | Yes | (−0.998281,−0.460671,0.129232) (151.82192,36.066859,−44.923798) | Yes |
4 | (−0.998406,−0.411761,0.134670) (151.82193,36.066860,−44.923798) | (−0.998493,−0.411566,0.134391) (151.82192,36.066859,−44.923798) | Yes | (−0.998451,−0.411463,0.134775) (151.82192,36.066859,−44.923798) | Yes |
5 | (−0.948406,−0.461761,0.129670) (151.82193,36.066860,−44.923798) | (−0.948016,−0.460288,0.128731) (151.82192,36.066859,−44.923798) | Yes | (−0.948287,−0.462751,0.129043) (151.82192,36.066859,−44.923798) | Yes |
6 | (−0.948406,−0.411761,0.134670) (151.82193,36.066860,−44.923798) | (−0.948021,−0.411056,0.133613) (151.82193,36.066860,−44.923798) | Yes | (−0.948512,−0.411664,0.134539) (151.82193,36.066860,−44.923798) | Yes |
7 | (−0.998406,−0.461761,0.134670) (151.82193,36.066860,−44.923798) | (−0.997156,−0.460437,0.133226) (151.82193,36.066860,−44.923798) | Yes | (−0.998032,−0.461582,0.133492) (151.82193,36.066860,−44.923798) | Yes |
8 | (−1.048406,−0.366761,0.124670) (151.82193,36.066860,−44.923798) | (−1.048839,−0.364972,0.124701) (151.82193,36.066860,−44.923798) | Yes | (−1.047969,−0.366241,0.124848) (151.82193,36.066860,−44.923798) | Yes |
9 | (−1.048406,−0.366761,0.134670) (151.82193,36.066860,−44.923798) | (−1.048952,−0.365831,0.133089) (151.82193,36.066860,−44.923798) | Yes | (−1.048359,−0.366527,0.133977) (151.82193,36.066860,−44.923798) | Yes |
10 | (−0.948406,−0.461761,0.134670) (151.82193,36.066860,−44.923798) | (−0.948199,−0.458838,0.133817) (151.82193,36.066860,−44.923798) | Yes | (−0.948177,−0.463963,0.134895) (151.82193,36.066860,−44.923798) | Yes |
No. | Docking Interface Pose (x,y,z,rx,ry,rz) | Recognition Based on OpenCV (x,y,z,rx,ry,rz) | Successful Docking | Recognition Based on OpenCV YOLOv8 (x,y,z,rx,ry,rz) | Successful Docking |
---|---|---|---|---|---|
1 | (−0.998406,−0.411761,0.129670) (153.82193,36.066860,−44.923798) | (−0.996278,−0.410937,0.127972) (153.34790,36.163685,−44.73578) | Yes | (−0.996978,−0.412937,0.128079) (153.34790,36.163685,−44.73578) | Yes |
2 | (−0.998406,−0.411761,0.129670) (151.82193,34.066860,−44.923798) | (−0.996889,−0.409235,0.127667) (152.03432,34.321348,−45.178932) | Yes | (−0.997782,−0.410883,0.128631) (152.03432,34.321348,−45.178932) | Yes |
3 | (−0.998406,−0.411761,0.129670) (151.82193,36.066860,−42.923798) | (−0.995703,−0.408701,0.126892) (152.27036,36.410983,−43.376418) | Yes | (−0.999132,−0.416915,0.127105) (152.27036,36.410983,−43.376418) | Yes |
4 | (−0.998406,−0.411761,0.129670) (153.82193,34.066860,−44.923798) | (−0.996872,−0.413578,0.127962) (154.08652,34.903678,−44.555324) | Yes | (−0.997451,−0.411463,0.128024) (154.08652,34.903678,−44.555324) | Yes |
5 | (−0.998406,−0.411761,0.129670) (153.82193,36.066860,−42.923798) | (−0.995091,−0.414992,0.128013) (153.25681,36.693251,−42.379042) | Yes | (−0.997891,−0.413732,0.1266031) (153.25681,36.693251,−42.379042) | Yes |
6 | (−0.998406,−0.411761,0.129670) (151.82193,38.066860,−42.923798) | (−0.997351,−0.413742,0.128531) (152.72146,37.73592,−42.823921) | Yes | (−0.997235,−0.411321,0.127745) (152.72146,37.73592,−42.823921) | Yes |
7 | (−0.998406,−0.411761,0.129670) (149.82193,34.066860,−46.923798) | (−0.999492,−0.408388,0.127431) (148.89427,34.672967,−47.678159) | Yes | (−0.998164,−0.415803,0.130572) (148.89427,34.672967,−47.678159) | Yes |
8 | (−0.948406,−0.361761,0.134670) (154.32193,38.066860,−42.923798) | (−0.942015,−0.352419,0.131144) (153.69912,38.472419,−42.603981) | No | (−0.946068,−0.359634,0.133223) (153.69912,38.472419,−42.603981) | Yes |
9 | (−1.048406,−0.461761,0.124670) (149.32193,34.066860,−46.923798) | (−1.0315847,−0.467249,0.128247) (148.97931,33.938642,−47.244901) | No | (−1.049984,−0.463752,0.124639) (148.97931,33.938642,−47.244901) | Yes |
10 | (−0.948406,−0.361761,0.134670) (149.32193,36.066860,−46.923798) | (−0.950131,−0.376181,0.137169) (149.86974,35.971524,−46.760962) | No | (−0.947794,−0.363371,0.134149) (149.86974,35.071524,−46.760962) | Yes |
No. | Docking Interface Pose (x,y,z,rx,ry,rz) | Successful Docking | Time Taken for Recognition Method Based on OpenCV (s) | Successful Docking | Time Taken for Recognition Method Based on YOLOv8 (s) |
---|---|---|---|---|---|
1 | (−0.998406,−0.411761,0.129670) (151.82193,36.066860,−44.923798) | Yes | 10.3 | Yes | 9.6 |
2 | (−0.978406,−0.431761,0.131670) (152.82193,35.066860,−43.923798) | Yes | 10.5 | Yes | 9.9 |
3 | (−0.999406,−0.391761,0.127670) (151.32193,35.566860,−45.923798) | Yes | 10.7 | Yes | 9.8 |
4 | (−1.048406,−0.431761,0.128670) (152.82193,35.066860,−44.423798) | Yes | 10.6 | Yes | 9.9 |
5 | (−1.048406,−0.381761,0.130670) (149.82193,35.566860,−43.423798) | Yes | 10.7 | Yes | 9.8 |
6 | (−0.948406,−0.381761,0.128670) (150.82193,36.566860,−45.423798) | Yes | 10.4 | Yes | 9.7 |
7 | (−1.018406,−0.431761,0.127670) (151.32193,36.566860,−45.023798) | Yes | 10.6 | Yes | 9.9 |
8 | (−0.948406,−0.361761,0.134670) (152.82193,37.066860,−43.923798) | Yes | 10.8 | Yes | 10.0 |
9 | (−0.988406,−0.451761,0.127670) (150.82193,34.066860,−45.923798) | No | — | Yes | 10.2 |
10 | (−0.948406,−0.381761,0.133670) (153.82193,37.066860,−43.923798) | No | — | Yes | 10.1 |
References
- Harris, C.; Stephens, M. A combined corner and edge detector. In Proceedings of the Alvey Vision Conference, Manchester, UK, 31 August–2 September 1988; pp. 147–151. [Google Scholar]
- Lowe, D.G. Object recognition from local scale-invariant features. In Proceedings of the Seventh IEEE International Conference on Computer Vision (ICCV), Kerkyra, Greece, 20–27 September 1999; IEEE: Piscataway, NJ, USA, 1999; Volume 2, pp. 1150–1157. [Google Scholar]
- Bay, H.; Ess, A.; Tuytelaars, T.; Gool, L.V. Speeded-up robust features (SURF). Comput. Vis. Image Underst. 2008, 110, 346–359. [Google Scholar] [CrossRef]
- Dalal, N.; Triggs, B. Histograms of Oriented Gradients for Human Detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition, San Diego, CA, USA, 20–25 June 2005; IEEE: Piscataway, NJ, USA, 2005. [Google Scholar]
- Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Girshick, R. Fast r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017. [Google Scholar]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-End Object Detection with Transformers. arXiv 2020, arXiv:2005.12872. [Google Scholar]
- Liu, Z.; Hu, H.; Lin, Y.; Yao, Z.; Xie, Z.; Wei, Y.; Ning, J.; Cao, Y.; Zhang, Z.; Dong, L.; et al. Swin transformer v2: Scaling up capacity and resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; pp. 12009–12019. [Google Scholar]
- Bochkovskiy, A.; Wang, C.-Y.; Liao, H.-Y.M. YOLOv4: Optimal Speed and Accuracy of Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Seattle, WA, USA, 13–19 June 2020. [Google Scholar]
- Haralick, R.M.; Lee, C.-N.; Ottenberg, K.; Noelle, M. Analysis and solution of the three point perspective pose estimation problem. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Maui, HI, USA, 3–6 June 1991; pp. 592–598. [Google Scholar]
- Hartley, R.; Zisserman, A. Multiple View Geometry in Computer Vision; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
- Jie, S. A Brief History and Overview of Bundle Adjustment. J. Wuhan Univ. Inf. Sci. Ed. 2018, 43, 1797–1810. [Google Scholar]
- Moreno-Noguer, F.; Lepetit, V.; Fua, P. EPnP: An Accurate O(n) Solution to the PnP Problem. Int. J. Comput. Vis. 2009, 81, 155–166. [Google Scholar]
- Zheng, Y.; Kuang, Y.; Sugimoto, S.; Astrom, K.; Okutomi, M. Revisiting the PnP Problem: A Fast, General and Optimal Solution. In Proceedings of the 2013 IEEE International Conference on Computer Vision, Sydney, Australia, 1–8 December 2013; IEEE: Piscataway, NJ, USA, 2014. [Google Scholar]
- Kneip, L.; Li, H.; Seo, Y. UPnP: An Optimal O(n) Solution to the Absolute Pose Problem with Universal Applicability. In Proceedings of the European Conference on Computer Vision, Zurich, Switzerland, 6–12 September 2014; Springer: Cham, Switzerland, 2014. [Google Scholar]
- Li, S.; Xu, C.; Xie, M. A robust O(n)solution to theperspective-n-point problem. IEEE Trans. Onpattern Anal. Mach. Intell. 2012, 34, 1444–1450. [Google Scholar] [CrossRef] [PubMed]
- Kendall, A.; Grimes, M.; Cipolla, R. Posenet: A convolutional network for real-time 6-dof camera relocalization. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 2938–2946. [Google Scholar]
- Kehl, W.; Manhardt, F.; Tombari, F.; Ilic, S.; Navab, N. SSD-6D: Making RGB-based 3D detection and 6d pose estimation greatagain. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; IEEE: New York, NY, USA, 2017. [Google Scholar]
- Do, T.T.; Cai, M.; Pham, T.; Reid, I. Deep-6DPose: Recovering 6D Object Pose from a Single RGB Image. arXiv 2018, arXiv:1802.10367. [Google Scholar]
- Xiang, Y.; Schmidt, T.; Narayanan, V.; Fox, D. PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes. In Proceedings of the Robotics: Science and Systems 2018, Pittsburgh, PA, USA, 26–30 June 2018. [Google Scholar]
- Rad, M.; Lepetit, V. BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 3848–3856. [Google Scholar]
- Tekin, B.; Sinha, S.N.; Fua, P. Real-time seamless single shot 6d object pose prediction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 292–301. [Google Scholar]
- Peng, S.; Liu, Y.; Huang, Q.; Zhou, X.; Bao, H. Pvnet: Pixel-wise voting network for 6dof pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 4561–4570. [Google Scholar]
- Liang, M. Study of a Binocular Vision System for Automotive Fueling Robots. Master’s Thesis, South China University of Technology, Guangzhou, China, 2021. [Google Scholar]
- Ma, Z. Research on Fuel Tank Cap Recognition and Detection Technology Based on Binocular Vision. Master’s Thesis, Jilin University, Changchun, China, 2022. [Google Scholar]
- Wang, X.; Dong, X.; Kong, X.; Zhi, J.; Wang, L. MS-KF Fusion Algorithm for Cone Sleeve Tracking. Appl. Opt. 2013, 34, 951–956. [Google Scholar]
- Scott, G.P.; Henshaw, C.G.; Walker, I.D.; Willimon, B. Autonomous robotic refueling of an unmanned surface vehicle in varying sea states. In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany, 28 September–2 October 2015; pp. 1664–1671. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, M.; Liu, J. Research on Six-Degree-of-Freedom Refueling Robotic Arm Positioning and Docking Based on RGB-D Visual Guidance. Appl. Sci. 2024, 14, 4904. https://doi.org/10.3390/app14114904
Yang M, Liu J. Research on Six-Degree-of-Freedom Refueling Robotic Arm Positioning and Docking Based on RGB-D Visual Guidance. Applied Sciences. 2024; 14(11):4904. https://doi.org/10.3390/app14114904
Chicago/Turabian StyleYang, Mingbo, and Jiapeng Liu. 2024. "Research on Six-Degree-of-Freedom Refueling Robotic Arm Positioning and Docking Based on RGB-D Visual Guidance" Applied Sciences 14, no. 11: 4904. https://doi.org/10.3390/app14114904
APA StyleYang, M., & Liu, J. (2024). Research on Six-Degree-of-Freedom Refueling Robotic Arm Positioning and Docking Based on RGB-D Visual Guidance. Applied Sciences, 14(11), 4904. https://doi.org/10.3390/app14114904