Triplet-Fusion Self-Attention-Enhanced Pyramidal Convolutional Neural Network for Surgical Robot Kinematic Solution
Abstract
1. Introduction
- A compact, flexible multi-DoF single-port instrument arm and its joint–workspace motion mapping for precise control.
- A TFSAM module that augments self-attention with triplet attention, enabling explicit modeling of global cross-dimensional dependencies and long-range interactions.
- A DPCNN with TFSAM and residual connections that captures multi-scale nonlinearities and enhances robustness to modeling uncertainty.
- Extensive comparisons and simulations demonstrating state-of-the-art accuracy and robustness for the targeted surgical IK task.
2. Related Work
2.1. Analytical and Numerical Kinematics
2.2. Neural Network-Based Kinematics
2.3. Attention Mechanisms in Robotics
3. System Description
3.1. Surgical Instrument Arm Model
3.2. Forward Kinematics Analysis
4. Methodology
4.1. Data Sampling Module
4.2. Hybrid Model Architecture Design
4.2.1. DPCNN
4.2.2. Triplet-Fusion Self-Attention Mechanism (TFSAM) Module
4.2.3. DPCNN–TFSAM Collaborative Framework
4.3. Model Evaluation Indexes
4.4. Framework of the Proposed Model
5. Result and Discussion
5.1. Experimental Setup
5.2. Impact of Optimizers on Model Performance
5.3. Comparison of Different Methods
5.4. Computational Efficiency Analysis
5.5. Ablation Experiment
- (1)
- TAM-DPCNN (with the self-attention module removed),
- (2)
- SAM-DPCNN (with the triplet attention module removed),
- (3)
- DPCNN (with all attention mechanisms removed).
5.6. Mechanical Simulation Verification
6. Conclusions and Future Works
- (1)
- An effective kinematic hybrid estimation model is provided. The proposed TFSAM-DPCNN model can effectively fit and predict the end position and positional state of the surgical robot, and the proposed model performs optimally compared with other state-of-the-art models.
- (2)
- The proposed TFSAM attention mechanism effectively extracts features between variables and enhances the model’s adaptability, demonstrating strong performance in predicting multiple joint variables, with R2 values exceeding 0.99.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Liang, Y.; Du, Z.; Wang, W.; Yan, Z.; Sun, L. An improved scheme for eliminating the coupled motion of surgical instruments used in laparoscopic surgical robots. Robot. Auton. Syst. 2019, 112, 49–59. [Google Scholar] [CrossRef]
- Wu, Z.; Zhu, C.; Ding, Y.; Wang, Y.; Xu, B.; Xu, K. A robotic surgical tool with continuum wrist, kinematically optimized curved stem, and collision avoidance kinematics for single port procedure. Mech. Mach. Theory 2022, 173, 104863. [Google Scholar] [CrossRef]
- Vitiello, V.; Lee, S.L.; Cundy, T.P.; Yang, G.Z. Emerging robotic platforms for minimally invasive surgery. IEEE Rev. Biomed. Eng. 2012, 6, 111–126. [Google Scholar] [CrossRef]
- Wu, D.; Zhang, R.; Pore, A.; Alba, D.D.; Ha, X.T.; Li, Z.; Zhang, Y.; Herrera, F.; Ourak, M.; Kowalczyk, W. A review on machine learning in flexible surgical and interventional robots: Where we are and where we are going. Biomed. Signal Process. 2024, 93, 106179. [Google Scholar] [CrossRef]
- Wang, Z.; Gao, Q.; Zhao, H. CPG-inspired locomotion control for a snake robot basing on nonlinear oscillators. J. Intell. Robot. Syst. 2017, 85, 209–227. [Google Scholar] [CrossRef]
- Iliukhin, V.N.; Mitkovskii, K.B.; Bizyanova, D.A.; Akopyan, A.A. The modeling of inverse kinematics for 5 DOF manipulator. Procedia Eng. 2017, 176, 498–505. [Google Scholar] [CrossRef]
- Tang, L.; Zhu, L.; Zhu, X.; Gu, G. Confined spaces path following for cable-driven snake robots with prediction lookup and interpolation algorithms. Sci. China Technol. Sci. 2020, 63, 255–264. [Google Scholar] [CrossRef]
- Cagigas-Muñiz, D. Artificial Neural Networks for inverse kinematics problem in articulated robots. Eng. Appl. Artif. Intell. 2023, 126, 107175. [Google Scholar] [CrossRef]
- Hrdina, J.; Návrat, A.; Vašík, P. Control of 3-link robotic snake based on conformal geometric algebra. Adv. Appl. Clifford Algebras 2016, 26, 1069–1080. [Google Scholar] [CrossRef]
- Tong, Y.; Liu, J.; Liu, Y.; Yuan, Y. Analytical inverse kinematic computation for 7-DOF redundant sliding manipulators. Mech. Mach. Theory 2021, 155, 104006. [Google Scholar] [CrossRef]
- Jain, T.; Jain, J.K.; Roy, D. Joint space redundancy resolution of serial link manipulator: An inverse kinematics and continuum structure numerical approach. Mater. Today Proc. 2021, 38, 423–431. [Google Scholar] [CrossRef]
- Sardana, L.; Sutar, M.K.; Pathak, P.M. A geometric approach for inverse kinematics of a 4-link redundant In-Vivo robot for biopsy. Robot. Auton. Syst. 2013, 61, 1306–1313. [Google Scholar] [CrossRef]
- Lin, P.F.; Huang, M.B.; Huang, H.P. Analytical solution for inverse kinematics using dual quaternions. IEEE Access 2019, 7, 166190–166202. [Google Scholar] [CrossRef]
- Kucuk, S.; Bingul, Z. Inverse kinematics solutions for industrial robot manipulators with offset wrists. Appl. Math. Model. 2014, 38, 1983–1999. [Google Scholar] [CrossRef]
- Park, S.O.; Lee, M.C.; Kim, J. Trajectory planning with collision avoidance for redundant robots using jacobian and artificial potential field-based real-time inverse kinematics. Int. J. Control Autom. 2020, 18, 2095–2107. [Google Scholar] [CrossRef]
- Chen, Y.; Luo, X.; Han, B.; Jia, Y.; Liang, G.; Wang, X. A general approach based on Newton’s method and cyclic coordinate descent method for solving the inverse kinematics. Appl. Sci. 2019, 9, 5461. [Google Scholar] [CrossRef]
- Dereli, S.; Köker, R.I. A meta-heuristic proposal for inverse kinematics solution of 7-DOF serial robotic manipulator: Quantum behaved particle swarm algorithm. Artif. Intell. Rev. 2020, 53, 949–964. [Google Scholar] [CrossRef]
- Huang, H.C.; Chen, C.P.; Wang, P.R. Particle swarm optimization for solving the inverse kinematics of 7-DOF robotic manipulators. In Proceedings of the 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Seoul, Republic of Korea, 14–17 October 2012; IEEE: New York, NY, USA, 2012; pp. 3105–3110. [Google Scholar]
- Aydogmus, O.; Boztas, G. Implementation of singularity-free inverse kinematics for humanoid robotic arm using Bayesian optimized deep neural network. Measurement 2024, 229, 114471. [Google Scholar] [CrossRef]
- Yang, Y.; Li, W.; Song, B.; Zou, Y.; Pan, Y. Enhanced fault tolerant kinematic control of redundant robots with linear-variational-inequality based zeroing neural network. Eng. Appl. Artif. Intell. 2024, 133, 108068. [Google Scholar] [CrossRef]
- Sharkawy, A.N.; Khairullah, S.S. Forward and Inverse Kinematics Solution of A 3-DOF Articulated Robotic Manipulator Using Artificial Neural Network. Int. J. Robot. Control Syst. 2023, 3, 330–353. [Google Scholar] [CrossRef]
- Aysal, F.E.; Çelik, İ.; Cengiz, E.; Oğuz, Y.K. A comparison of multi-layer perceptron and inverse kinematic for RRR robotic arm. Politek. Derg. 2023, 27, 121–131. [Google Scholar] [CrossRef]
- Srisuk, P.; Sento, A.; Kitjaidure, Y. Inverse kinematics solution using neural networks from forward kinematics equations. In 2017 9th International Conference on Knowledge and Smart Technology (KST); IEEE: New York, NY, USA, 2017; pp. 61–65. [Google Scholar]
- Thomas, M.J.; Sanjeev, M.M.; Sudheer, A.P.; ML, J. Comparative study of various machine learning algorithms and Denavit–Hartenberg approach for the inverse kinematic solutions in a 3-PP SS parallel manipulator. Ind. Robot. 2020, 47, 683–695. [Google Scholar] [CrossRef]
- Chawla, I.; Pathak, P.M.; Notash, L.; Samantaray, A.K.; Li, Q.; Sharma, U.K. Inverse and forward kineto-static solution of a large-scale cable-driven parallel robot using neural networks. Mech. Mach. Theory 2023, 179, 105107. [Google Scholar] [CrossRef]
- Wagaa, N.; Kallel, H.; Mellouli, N.D. Analytical and deep learning approaches for solving the inverse kinematic problem of a high degrees of freedom robotic arm. Eng. Appl. Artif. Intell. 2023, 123, 106301. [Google Scholar] [CrossRef]
- Wu, Z.; Li, Z.; Zhu, D.; Liao, Q.; Yao, W. A Multi-Robot Task Allocation Method Based on Graph Attention Network and Unsupervised Learning. In Proceedings of the 2024 IEEE International Conference on Unmanned Systems (ICUS), Nanjing, China, 18–20 October 2024; IEEE: New York, NY, USA, 2024; pp. 1222–1227. [Google Scholar]
- Shao, X.; Xu, L.; Sun, G.; Yao, W.; Wu, L.; Della Santina, C. Self-attention enhanced dynamics learning and adaptive fractional-order control for continuum soft robots with system uncertainties. IEEE Trans. Autom. Sci. Eng. 2025, 22, 18694–18708. [Google Scholar] [CrossRef]
- Zhang, X.; Zheng, L.; Fan, W.; Mao, L.; Li, Z.; Cao, Y. Multi-domain data-driven chatter detection in robotic milling under varied robot poses based on directional attention mechanism. Mech. Syst. Signal Process. 2025, 227, 112406. [Google Scholar] [CrossRef]
- He, H.; Fu, H.; Wang, Q.; Zhou, S.; Liu, W.; Chen, Y. Spatio-temporal transformer-based reinforcement learning for robot crowd navigation. In Proceedings of the 2023 IEEE International Conference on Robotics and Biomimetics (ROBIO), Koh Samui, Thailand, 4–9 December 2023; IEEE: New York, NY, USA, 2023; pp. 1–7. [Google Scholar]
- Wang, W.; Wang, R.; Mao, L.; Min, B.C. Navistar: Socially aware robot navigation with hybrid spatio-temporal graph transformer and preference learning. In Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 1–5 October 2023; IEEE: New York, NY, USA, 2023; pp. 11348–11355. [Google Scholar]
- Žlajpah, L.; Petrič, T. Kinematic calibration for collaborative robots on a mobile platform using motion capture system. Robot. Comput.-Integr. Manuf. 2023, 79, 102446. [Google Scholar] [CrossRef]
- Lu, L.; Li, G.; Xing, P.; Gao, H.; Song, Y.; Zhang, H. Numerical calculation and experimental investigation of the dynamic alignment of ship propulsion shafting based on Latin hypercube stochastic finite element. Ocean Eng. 2024, 296, 116935. [Google Scholar] [CrossRef]
- Xia, W.; Zhang, R.; Zhang, X.; Usman, M. A novel method for diagnosing Alzheimer’s disease using deep pyramid CNN based on EEG signals. Heliyon 2023, 9, e14858. [Google Scholar] [CrossRef]
- Ruder, S. An overview of gradient descent optimization algorithms. arXiv 2016, arXiv:1609.04747. [Google Scholar]
- Johnson, R.; Zhang, T. Deep pyramid convolutional neural networks for text categorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers); Association for Computational Linguistics: Stroudsburg, PA, USA, 2017; Volume 1, pp. 562–570. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 6000–6010. [Google Scholar]
- Pan, M.; Su, T.; Liang, K.; Liang, L.; Yang, Q. Sensorless force estimation of teleoperation system based on multilayer depth Extreme Learning Machine. Appl. Soft Comput. 2024, 157, 111494. [Google Scholar] [CrossRef]
- Shan, G.; Guoyin, Z.; Chengwei, J.; Yanxia, W. SGDAT: An optimization method for binary neural networks. Neurocomputing 2023, 555, 126431. [Google Scholar] [CrossRef]
- Kong, Y.; Yang, L.; Chen, C.; Zhu, X.; Li, D.; Guan, Q.; Du, G. Online kinematic calibration of robot manipulator based on neural network. Measurement 2024, 238, 115281. [Google Scholar] [CrossRef]
- Elkholy, H.A.; Shahin, A.S.; Shaarawy, A.W.; Marzouk, H.; Elsamanty, M. Solving inverse kinematics of a 7-DOF manipulator using convolutional neural network. In Proceedings of the International Conference on Artificial Intelligence and Computer Vision (AICV2020), Cairo, Egypt, 8–10 April 2020; Springer: Cham, Switzerland, 2020; pp. 343–352. [Google Scholar]
- Toquica, J.S.; Oliveira, P.C.A.S.; Souza, W.S.; Motta, J.M.C.O.S.; Borges, D.B.O.L. An analytical and a Deep Learning model for solving the inverse kinematic problem of an industrial parallel robot. Comput. Ind. Eng. 2021, 151, 106682. [Google Scholar] [CrossRef]













| (rad) | (mm) | (mm) | (rad) | |
|---|---|---|---|---|
| 1 | 0 | 0 | ||
| 2 | 0 | 0 | ||
| 3 | 0 | 0 | ||
| 4 | 0 | |||
| 5 | 0 | |||
| 6 | 0 | |||
| 7 | 0 | |||
| 8 | 0 | |||
| 9 | 0 | |||
| 10 | 0 | 0 | 0 |
| Joint Variable/Unit | Value |
|---|---|
| d/mm | 0~300 |
| /rad | /2 |
| /rad | /12 |
| /rad | /2 |
| /rad | /12 |
| /rad | /12 |
| Joint Variable | Evaluation Indicators | ELM | LSTM-EKF | MLP-ANNs | CNN | GRU | TFSAM-DPCNN |
|---|---|---|---|---|---|---|---|
| d | RMSE | 4.2381 | 2.5400 | 4.5329 | 10.8827 | 3.0990 | 2.3766 |
| R2 | 0.9976 | 0.9991 | 0.9972 | 0.9842 | 0.9987 | 0.9993 | |
| MAE | 3.0755 | 1.8864 | 3.1430 | 9.0097 | 2.5429 | 1.7315 | |
| MAPE | 0.2689 | 0.1869 | 0.2825 | 0.5944 | 0.2225 | 0.1125 | |
| SD | 4.2571 | 2.8692 | 4.1833 | 10.8500 | 3.1208 | 2.4078 | |
| RMSE | 0.3332 | 0.1539 | 0.1431 | 0.2442 | 0.1026 | 0.08285 | |
| R2 | 0.8664 | 0.9715 | 0.9749 | 0.9282 | 0.9873 | 0.9917 | |
| MAE | 0.2384 | 0.0774 | 0.0841 | 0.1760 | 0.0413 | 0.03720 | |
| MAPE | 0.8497 | 0.3738 | 0.3898 | 0.6990 | 0.2396 | 0.2216 | |
| SD | 0.3210 | 0.1479 | 0.1394 | 0.2439 | 0.0933 | 0.0838 | |
| RMSE | 0.031 | 0.0058 | 0.0060 | 0.01754 | 0.0029 | 0.0027 | |
| R2 | 0.8322 | 0.9941 | 0.9937 | 0.9461 | 0.9985 | 0.9987 | |
| MAE | 0.0233 | 0.0046 | 0.0046 | 0.0140 | 0.0022 | 0.0020 | |
| MAPE | 0.8544 | 0.3444 | 0.3235 | 0.8760 | 0.0767 | 0.0880 | |
| SD | 0.0260 | 0.0055 | 0.0060 | 0.0166 | 0.0032 | 0.0029 | |
| RMSE | 0.3514 | 0.1589 | 0.1468 | 0.2898 | 0.1022 | 0.08513 | |
| R2 | 0.8496 | 0.9692 | 0.9737 | 0.8976 | 0.9873 | 0.9912 | |
| MAE | 0.2556 | 0.0845 | 0.0841 | 0.216 | 0.0428 | 0.0404 | |
| MAPE | 1.4864 | 0.5336 | 0.731 | 1.2089 | 0.2909 | 0.2529 | |
| SD | 0.3391 | 0.1512 | 0.1466 | 0.2869 | 0.0939 | 0.0869 | |
| RMSE | 0.0165 | 0.0116 | 0.0118 | 0.0482 | 0.0058 | 0.0055 | |
| R2 | 0.988 | 0.9940 | 0.9938 | 0.8974 | 0.9985 | 0.9986 | |
| MAE | 0.0122 | 0.0084 | 0.0089 | 0.0397 | 0.0044 | 0.0042 | |
| MAPE | 0.465 | 0.2473 | 0.2866 | 0.9023 | 0.1124 | 0.1921 | |
| SD | 0.0205 | 0.0111 | 0.0117 | 0.0481 | 0.0057 | 0.0057 | |
| RMSE | 0.0281 | 0.0107 | 0.0121 | 0.0477 | 0.00544 | 0.00556 | |
| R2 | 0.9657 | 0.9951 | 0.9936 | 0.8595 | 0.99874 | 0.99870 | |
| MAE | 0.0202 | 0.0075 | 0.0092 | 0.039 | 0.00416 | 0.00422 | |
| MAPE | 0.6374 | 0.2864 | 0.3378 | 0.9714 | 0.11014 | 0.1780 | |
| SD | 0.0257 | 0.0110 | 0.012 | 0.0474 | 0.00574 | 0.00567 |
| Model | Training Time (Minutes) | Parameters |
|---|---|---|
| ELM | 2.2 | 1900 |
| LSTM-EKF | 134.02 | 804,358 |
| MLP-ANNs | 14.22 | 398,527 |
| CNN | 26.59 | 22,118 |
| GRU | 87.88 | 254,214 |
| TFSAM-DPCNN | 197.63 | 142,660 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Su, T.; Liang, L.; Pan, M.; Fu, C.; Huang, H.; Li, J.; Liang, K. Triplet-Fusion Self-Attention-Enhanced Pyramidal Convolutional Neural Network for Surgical Robot Kinematic Solution. Actuators 2026, 15, 104. https://doi.org/10.3390/act15020104
Su T, Liang L, Pan M, Fu C, Huang H, Li J, Liang K. Triplet-Fusion Self-Attention-Enhanced Pyramidal Convolutional Neural Network for Surgical Robot Kinematic Solution. Actuators. 2026; 15(2):104. https://doi.org/10.3390/act15020104
Chicago/Turabian StyleSu, Tiecheng, Lu Liang, Mingzhang Pan, Changcheng Fu, Hengqiu Huang, Jing’ao Li, and Ke Liang. 2026. "Triplet-Fusion Self-Attention-Enhanced Pyramidal Convolutional Neural Network for Surgical Robot Kinematic Solution" Actuators 15, no. 2: 104. https://doi.org/10.3390/act15020104
APA StyleSu, T., Liang, L., Pan, M., Fu, C., Huang, H., Li, J., & Liang, K. (2026). Triplet-Fusion Self-Attention-Enhanced Pyramidal Convolutional Neural Network for Surgical Robot Kinematic Solution. Actuators, 15(2), 104. https://doi.org/10.3390/act15020104

