1. Introduction
As the core executive units of modern industrial automation systems, robotic manipulators have expanded their application scenarios from traditional structured environments such as welding [1] and material handling [2] to highly dynamic unstructured environments including surgical assistance [3], space exploration [4], and service robots [5]. During the execution of complex tasks, the end-effector of the robotic manipulator must accurately complete high-precision spatial movements along a predefined trajectory. However, robotic manipulators are highly complex multi-input multi-output systems characterized by strong nonlinearity, time-varying dynamics, and strong coupling [6]. Additionally, the structural parameters of a robotic manipulator are difficult to obtain accurately and its dynamic model is partially unknown, resulting in significant uncertainties in the mathematical state-space model describing the motion of the manipulator [7]. Functional configuration design and high-precision trajectory tracking control are effective approaches to address these challenges.
To better achieve trajectory tracking control of robotic manipulators, researchers have proposed various control methods, including fuzzy control [8,9], sliding mode control [10,11], neural network control [12,13], reinforcement learning control [14,15], and model predictive control [16,17].
Xian et al. [18] established a dynamic model for coordinated robotic manipulators using fuzzy set theory and developed an approximate constrained-following servo control to ensure the uniform boundedness and uniform ultimate boundedness of the controlled system, selecting optimal parameters by solving fuzzy-based performance metrics. Zhang et al. [19] proposed a three-dimensional fuzzy active disturbance rejection controller for the mechanical arm of an iron roughneck by adding a three-dimensional fuzzy module to the classical ADRC, in which the controller output is adjusted based on the tracking differential error, the error rate of change, and the error acceleration. Obadina et al. [20] developed a hybrid optimization algorithm for gray-box model identification of robotic manipulators and applied the model to real-time fuzzy trajectory tracking control. Jiang et al. [21] combined the Beetle Antennae Search (BAS) algorithm with Particle Swarm Optimization (PSO), introducing fractional calculus to dynamically adjust the inertia weight and fractional order, thereby achieving high-precision trajectory tracking control for robotic manipulators. However, fuzzy strategies rely heavily on human experience, and parameter selection based on intelligent algorithms is prone to becoming trapped in local optima.
Sun et al. [22] employed radial basis function neural networks (RBFNNs) to compensate online for the uncertainty and unknown dynamics of system parameters. Chen et al. [23] optimized RBFNN parameters using an immune algorithm and designed adaptive-law compensation for backlash-like hysteresis errors and RBFNN approximation errors. However, the high computational complexity of neural network algorithms results in poor real-time tracking performance and presents physical implementation challenges. Carron et al. [24] combined inverse dynamics feedback linearization and data-driven error models with model predictive control; they processed offline data using Gaussian filtering and utilized extended Kalman filtering to estimate residual disturbances online, achieving bias-free tracking of the robotic manipulator. Kang [25] proposed an event-triggered model predictive control strategy for robotic manipulators with model uncertainty and input constraints, where the trigger conditions were defined based on the weights of the predictive model and the predicted tracking error. However, MPC is sensitive to system modeling accuracy, and its time-domain optimization mechanism only solves local optima within a finite time interval, making it difficult to ensure global asymptotic stability of the closed-loop system.
Sliding mode control has been widely applied in robotic manipulator control due to its simple physical implementation, strong robustness, and fast transient response [26]. Kali et al. [27] used a linear sliding mode surface for the trajectory tracking control of robotic manipulators, but this method only guarantees asymptotic convergence of the state [28]. Rapid convergence then requires high gains, whereas terminal sliding mode control can ensure system convergence within finite time. Xu et al. [29] designed a discrete integral terminal sliding mode control law that incorporates delayed estimation of unknown disturbances and discretization errors in the robotic manipulator system and introduces an adaptive switching term to suppress sliding mode chattering effects. However, the chattering inherent in sliding mode control can cause actuator damage or even system instability. To reduce or eliminate chattering, Makrini et al. [30] designed a variable boundary layer (BLT) sliding mode control method, which achieves joint safety and high-performance control by adjusting the torque limit parameters and the expansion factor of the variable boundary layer.
Additionally, since certain system states are unmeasurable in practical applications, system uncertainty can affect transient performance and even lead to instability in robotic manipulator systems. To better achieve disturbance compensation, Fan et al. [31] proposed a fuzzy control strategy based on a spatial extended state observer to address the trajectory tracking control problem of a robotic manipulator capturing a floating object in a microgravity environment. Yin et al. [32] proposed an adaptive non-singular terminal sliding mode control method based on an NDO to observe the internal modeling errors of the robotic manipulator and the external unknown time-varying disturbances acting on each joint for feedforward compensation. Zha et al. [33] estimated the lumped uncertainties of the entire system using a hybrid observer composed of an adaptive time delay estimator based on gradient compensation and a second-order adaptive sliding mode observer; they then constructed an adaptive integral non-singular fast terminal sliding mode algorithm based on the backstepping method to stabilize the system and reduce chattering. Hu et al. [34] achieved real-time disturbance estimation and compensation by combining load torque estimation based on an improved harmonic drive flexibility model with friction compensation using a hybrid friction model, effectively improving the trajectory tracking performance of the robotic manipulator under high-speed variable-load conditions.
The aforementioned methods have achieved good results for robotic manipulators, but with the rapid development of artificial intelligence technology, data-driven machine learning approaches offer new insights for the intelligent tracking control of robotic manipulators. Reinforcement learning does not require the establishment of precise dynamic models, which makes it advantageous for solving sequential decision-making problems under highly nonlinear and uncertain conditions [35]. Viswanadhapalli et al. [36] utilized deep reinforcement learning control based on the Deep Deterministic Policy Gradient (DDPG) framework to achieve precise servo tracking of a flexible robotic manipulator and evaluated the controller performance through hardware-in-the-loop (HIL) testing. Lu et al. [37] proposed an adaptive proportional-integral robust control method based on DDPG, which searches for the optimal controller parameters in a continuous action space using the dynamic information of the robotic manipulator; they also designed a reward function that combines a Gaussian function with Euclidean distance to ensure stable and efficient agent learning. Ren et al. [38] proposed an adaptive sliding mode control method based on DDPG reinforcement learning, leveraging the autonomous learning capability of RL to adaptively adjust the key parameters of the controller online. However, the DDPG strategy is susceptible to Q-value overestimation and insufficient exploration efficiency. The TD3 algorithm introduces a double Q-network, target policy noise, and delayed policy updates, achieving significant improvements in mitigating overestimation bias in the action-value function and enhancing policy stability. Zhu et al. [39] designed an adaptive sliding mode controller based on TD3 parameter optimization for variable-speed trajectory tracking of underactuated vessels under model uncertainty and external environmental disturbances. Fan et al. [40] addressed path tracking of unmanned underwater vehicles by integrating an improved experience replay strategy into TD3 while enhancing learning efficiency through refined regularization methods and dynamic reward functions, achieving faster convergence and superior tracking performance compared with mainstream classical DRL approaches.
Owing to strong nonlinearity, multivariable coupling, and the difficulty of accurately obtaining physical parameters, establishing an accurate dynamic model of robotic manipulators is extremely challenging. At the same time, the time-varying and uncertain nature of friction and external disturbances further complicates modeling. Although traditional control methods achieve satisfactory performance in known systems, their adaptability remains insufficient for unknown or uncertain systems. In order to address the trajectory tracking problem of robotic manipulators under unknown time-varying disturbances and modeling uncertainties, this paper proposes an improved NFTSMC method based on the TD3 algorithm and the NDO. The main contributions are summarized as follows:
(1) For tasks such as precision assembly and high-accuracy positioning that impose strict requirements on position and velocity control, a novel 5-DOF robotic manipulator configuration is designed, which reduces the end-effector load by positioning the joint motors at the front and simplifies dynamic model computation by introducing a prismatic joint.
(2) A nonlinear disturbance observer is employed to estimate the internal modeling errors and external unknown time-varying disturbances of the robotic manipulator, followed by feedforward compensation. An improved non-singular fast terminal sliding mode control law is designed based on the boundary layer technique, and global system stability is analyzed using Lyapunov theory.
(3) Adaptive control is achieved through DRL by proposing an adaptive NDONFT control method based on the TD3 algorithm, which employs a dual Q-network structure and selects the minimum Q-value to effectively mitigate value overestimation in DRL. The learning efficiency and stability of the agent are enhanced by modifying the reward function, thereby avoiding convergence to local optima.
(4) Using the three joints of the designed 5-DOF robotic manipulator as examples, this paper verifies that the proposed method achieves excellent trajectory tracking performance and demonstrates stronger robustness against external unknown sudden disturbances.
The remainder of this paper is organized as follows: Section 2 presents the system principles and mathematical models of the robotic manipulator; Section 3 introduces the design process and stability analysis of the TD3NDONFT algorithm-based controller; Section 4 analyzes the simulation results; and Section 5 summarizes the main contributions of this paper.
3. TD3NDONFT Controller Design
In order to achieve trajectory tracking control of robotic manipulators under complex environmental influences, this paper proposes a non-singular fast terminal sliding mode control method based on the NDO and DRL, as shown in Figure 2. The NDO is used to estimate and compensate for the lumped disturbance, while the autonomous learning capability of DRL is employed to learn the controller parameters.
In the n-link robotic manipulator system, let the desired joint position, velocity, and acceleration be $q_d$, $\dot{q}_d$, and $\ddot{q}_d$, respectively, and the actual joint position, velocity, and acceleration be $q$, $\dot{q}$, and $\ddot{q}$, respectively. Then, the tracking error and its derivative are defined as $e = q_d - q$ and $\dot{e} = \dot{q}_d - \dot{q}$. The error state variables $x_1$ and $x_2$ are constructed from the position error $e$, velocity error $\dot{e}$, and acceleration error $\ddot{e}$ of the robotic manipulator joints, that is, $x_1 = e$ and $x_2 = \dot{e}$ (with $\dot{x}_2 = \ddot{e}$). Then, the error dynamics equation of the manipulator can be written as:
3.1. Design of Nonlinear Disturbance Observer
The disturbance is estimated by correcting the estimated value according to the difference between the estimated output and the actual output. Combining this with Equation (5), the following nonlinear disturbance observer is designed:
where $L(q,\dot{q})$ is the gain matrix of the observer.
Equation (10) involves acceleration signals, which cannot be accurately obtained by directly differentiating velocity signals in practical applications: velocity signals inevitably contain sensor noise, quantization errors, and sampling jitter, and differentiation inherently amplifies high-frequency components, which significantly increases the influence of noise and results in severe distortion of the estimated acceleration. Furthermore, owing to the strong coupling of the manipulator system and the presence of external disturbances, the nonlinear disturbance observer must be designed further so that it accurately estimates the external disturbances in the manipulator system and feeds them back to the controller for disturbance compensation.
Define auxiliary parameter variables as:
where $p(q,\dot{q})$ is the designed function vector. According to [42], the gain matrix $L(q,\dot{q})$ and function vector $p(q,\dot{q})$ of the designed disturbance observer are given by:
where $X$ is an invertible matrix. Substituting Equation (12) into Equation (11) and differentiating yields:
Substituting Equation (10) into Equation (13) yields:
In summary, Equations (11)–(14) constitute the designed nonlinear disturbance observer.
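For concreteness, a minimal discrete-time sketch of such an observer is given below. It assumes the common construction with gain $L = X^{-1}M^{-1}(q)$ and auxiliary vector $p = X^{-1}\dot{q}$; these choices, the function names, and the dynamics interface are illustrative assumptions, not the paper's exact implementation.

```matlab
% Hedged sketch of the NDO update (Equations (11)-(14)); robot.M/C/G are
% assumed handles returning the inertia matrix, Coriolis matrix, and gravity
% vector of the manipulator dynamics M(q)*qddot + C(q,qdot)*qdot + G(q) = tau + d.
function [dhat, z] = ndoStep(z, q, qdot, tau, dt, X, robot)
    M = robot.M(q);                               % inertia matrix
    C = robot.C(q, qdot);                         % Coriolis/centrifugal matrix
    G = robot.G(q);                               % gravity vector
    L = inv(X) / M;                               % assumed gain L = X^{-1} M^{-1}(q)
    p = X \ qdot;                                 % assumed auxiliary vector p = X^{-1} qdot
    dhat = z + p;                                 % disturbance estimate
    zdot = -L * dhat + L * (C * qdot + G - tau);  % internal observer dynamics
    z = z + dt * zdot;                            % forward-Euler integration
end
```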
In order to enable the nonlinear disturbance observer to accurately estimate disturbances, the linear matrix inequality (LMI) method is introduced to solve for $X$. For convenience in the derivation, the explicit dependence of the matrices on $q$ and $\dot{q}$ is omitted in the following steps.
To prove that the observation error converges asymptotically, define the Lyapunov function as:
where the observation error is $\tilde{d} = d - \hat{d}$.
The derivative of the above Lyapunov function is:
According to Equations (11) and (14), the derivative of the observation error is:
Generally, without prior knowledge of the disturbance $d$, it is assumed that the disturbance varies slowly relative to the observer dynamics [43], that is, $\dot{d} \approx 0$. The observer error equation is then obtained as:
Substituting Equation (18) into Equation (16) yields:
Construct the following inequality:
where $\Gamma$ is a symmetric positive definite matrix. Then there exists a positive constant such that the following holds:
It can be seen that the disturbance observer converges exponentially.
Since Equation (20) contains nonlinear terms, it must be converted into a linear matrix inequality before it can be solved. Defining a new matrix variable, Equation (20) can be transformed into:
A sufficient condition for the above to hold is that the resulting matrix inequality is satisfied, which is equivalent to:
Based on the value of $X$ obtained from the solution, the observer gain matrix $L(q,\dot{q})$ and the auxiliary variable $p(q,\dot{q})$ can be calculated.
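As an illustration of this solution step, a Lyapunov-type LMI feasibility problem can be solved numerically. The sketch below uses YALMIP (an assumed tool choice), with placeholder problem data A and Q standing in for the paper's exact LMI, which is not reproduced in the text above.

```matlab
% Hedged sketch: solve a generic Lyapunov-type LMI for a symmetric X > 0
% using YALMIP; A and Q are placeholders for the known problem data.
n = 3;                                     % three joints considered here
A = -eye(n); Q = eye(n);                   % placeholder problem data
X = sdpvar(n, n, 'symmetric');             % LMI decision variable
F = [X >= 1e-6 * eye(n), ...               % positive definiteness of X
     A' * X + X * A <= -Q];                % Lyapunov-type constraint
optimize(F);                               % solve the feasibility problem
Xval = value(X);                           % Xval is then used to compute L and p
```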
3.2. NFTSM Controller Design
In traditional sliding mode control, the sliding surface is generally selected as a linear sliding surface. When the system state reaches the sliding surface, the linear sliding surface ensures that the error converges asymptotically to zero but does not guarantee finite-time convergence. In contrast, the terminal sliding surface guarantees that the error variable converges to zero within finite time.
Therefore, this paper adopts a non-singular fast terminal sliding mode surface [44]:
where the sliding surface exponents are positive odd numbers chosen to satisfy the non-singularity condition, and $\mathrm{sign}(\cdot)$ is the sign function. The derivative of Equation (24) with respect to time is as follows:
Combining Equations (9) and (25) yields:
Multiplying both sides by the inertia matrix $M(q)$ gives:
Because the actual joint acceleration vector is difficult to obtain in practice, and to avoid its appearance in the control law, define the variable:
Substituting Equation (28) into Equation (27) yields:
According to the selected non-singular fast terminal sliding mode surface, the controller is formulated as follows:
Here, the equivalent control term is obtained when the system reaches the sliding surface $s = 0$ and the external disturbance is taken as zero. The proportional gain $k_p$ and integral gain $k_i$ are both positive. The sliding mode control term ensures that the system reaches stability and is employed to further eliminate errors and external disturbances. The robust control term, with robust gain $k_r$, is primarily used to suppress the external disturbances not captured by the nonlinear disturbance observer and the internal model reconstruction errors, thereby enhancing the disturbance rejection capability.
At the same time, by introducing a nonlinear disturbance observer to estimate the disturbances, the control law after adding its compensation is:
From Equations (29) and (31), we can derive that:
In order to limit the control signal to the feasible range of the actuator and avoid output overshoot, thereby preventing input saturation, a saturation function $\mathrm{sat}(\cdot)$ is introduced to replace the sign function in the robust controller. The saturation function remains linear or varies smoothly when the error is small, thereby reducing chattering, and gradually saturates when the error is large, thus preventing shocks caused by abrupt changes. The saturation function is defined as follows:
$$\mathrm{sat}(s) = \begin{cases} \mathrm{sign}(s), & |s| > \Delta \\ s/\Delta, & |s| \le \Delta \end{cases}$$
where $\Delta$ is a positive constant representing the thickness of the boundary layer.
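A minimal vectorized implementation of this saturation function might look as follows; the function and variable names are illustrative.

```matlab
% Boundary-layer saturation: linear inside |s| <= Delta, sign(s) outside.
function y = sat(s, Delta)
    y = s ./ Delta;                 % linear region inside the boundary layer
    outside = abs(s) > Delta;       % elements beyond the boundary layer
    y(outside) = sign(s(outside));  % behaves as the sign function there
end
```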
3.3. Stability Analysis of the Control System and Proof of Finite-Time Convergence
In the stability analysis of the control system, we consider a Lyapunov function in integral form:
Substituting Equation (15) into Equation (34), the derivative of $V$ with respect to time is obtained as:
Considering the skew-symmetric property of the robotic manipulator dynamics, i.e., that $\dot{M}(q) - 2C(q,\dot{q})$ is skew-symmetric, and substituting Equation (32) into Equation (35), we derive:
Substituting Equation (21) into Equation (36) and simplifying yields $\dot{V} \le 0$. According to Lyapunov stability theory, the system state converges asymptotically to the sliding surface $s = 0$. According to LaSalle's invariance principle, when $\dot{V} \equiv 0$, $s = 0$; that is, the tracking error $e$ and its derivative $\dot{e}$ converge to zero.
To prove that the system state converges to the sliding surface within a finite time, consider the Lyapunov function $V_2 = \frac{1}{2} s^{T} s$ and differentiate it to obtain:
Let $t_f$ denote the system convergence time and $s(0)$ represent the initial state, with $s(t_f) = 0$. Integrating the above yields:
The convergence time is thus bounded, and the trajectory tracking error of the robotic manipulator system tends to zero within finite time.
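Although the exact reaching constant is not reproduced above, the finite-time bound follows from the standard argument sketched below, under the assumption that the derivative satisfies $\dot{V}_2 \le -\eta V_2^{1/2}$ for some reaching gain $\eta > 0$:

```latex
% Standard finite-time integration (eta > 0 is an assumed reaching gain):
\dot{V}_2 \le -\eta V_2^{1/2}
\;\Longrightarrow\;
\int_{V_2(0)}^{0} V_2^{-1/2}\,\mathrm{d}V_2 \le -\eta \int_{0}^{t_f} \mathrm{d}t
\;\Longrightarrow\;
t_f \le \frac{2\sqrt{V_2(0)}}{\eta}.
```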
When $|s| > \Delta$, the saturation function behaves as the sign function $\mathrm{sign}(s)$, which does not affect the convergence of the control law. When $|s| \le \Delta$, the convergence speed decreases to some extent, but $s$ has already been confined within the boundary layer. The robotic manipulator system achieves the desired convergence by adjusting $\Delta$, so the saturation function does not affect the final convergence properties of the system [34].
3.4. Deep Reinforcement Learning TD3 Adaptive Control
Reinforcement learning is a key branch of machine learning that obtains control strategies through interaction with the environment. RL problems are usually formulated as a Markov decision process (MDP), which mainly comprises the environment, agent, reward, state, and action [40]. Specifically, at each time step $t$, the robotic manipulator agent selects an action $a_t$ based on the current state $s_t$, and the environment provides a reward $r_t$ quantifying the performance of the robotic manipulator. The ultimate goal of RL is to find an effective control strategy $\pi$ that maximizes the cumulative long-term return $R = \sum_{t=t_0}^{\infty} \gamma^{t-t_0} r_t$, where $\gamma \in [0, 1)$ is the discount factor indicating the importance of future rewards and $t_0$ represents the initial time step.
TD3 is an advanced DRL method designed for control problems with continuous action spaces. It employs an actor-critic architecture, in which the critic evaluates the value function $Q(s_t, a_t)$ of states and actions. Specifically, at time step $t$, $Q^{\pi}(s_t, a_t)$ represents the long-term reward obtained by taking action $a_t$ in state $s_t$ under policy $\pi$. Under this formulation, the Q-function satisfies:
The ultimate goal of DRL is to learn an optimal control strategy $\pi^{*}$ through Equation (40) so that the value function is maximized.
Generally, neural networks (NNs) are employed to solve Equations (39) and (40) within the actor-critic architecture, as shown in Figure 3. TD3 mainly consists of six NNs. Specifically, two evaluate critic networks are used to estimate the Q-functions: $Q_{\theta_1}$ with parameters $\theta_1$ and $Q_{\theta_2}$ with parameters $\theta_2$. Additionally, an evaluate actor $\pi_{\phi}$ with parameters $\phi$ is used for policy updates. Each actor and critic network is paired with a target network to ensure training stability, denoted by $Q_{\theta_1'}$, $Q_{\theta_2'}$, and $\pi_{\phi'}$. The update of the evaluate critics is expressed by the following loss function:
where $N$ is the batch size of transitions selected at each step, the bracketed difference is known as the temporal-difference error (TD error), and $w_i$ is the importance sampling weight (ISW) of the $i$-th sample. The target value estimate is calculated by selecting the minimum of the two estimated Q-values to reduce the overestimation problem of traditional Q-learning. Additionally, to ensure smoother updates of the critics and to prevent overfitting, a noise signal is added as a regularization term to the target action, $\tilde{a} = \pi_{\phi'}(s') + \epsilon$ with $\epsilon \sim \mathrm{clip}(\mathcal{N}(0, \sigma), -c, c)$, where $\sigma$ is the noise standard deviation, $\mathcal{N}$ denotes a normal distribution, and $c$ represents the noise clipping limit.
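The target computation just described can be summarized in a few lines. The sketch below shows the generic clipped double-Q target of TD3; the function and variable names are assumptions for illustration.

```matlab
% Clipped double-Q target with target-policy smoothing (generic TD3 form).
a_next = actorTarget(s_next);                           % target actor action
eps_n  = max(min(sigma * randn(size(a_next)), c), -c);  % clipped Gaussian noise
a_next = max(min(a_next + eps_n, a_max), a_min);        % keep action within bounds
q_min  = min(criticTarget1(s_next, a_next), ...
             criticTarget2(s_next, a_next));            % pessimistic Q estimate
y = r + gamma .* (1 - done) .* q_min;                   % TD target for both critics
```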
The evaluate actor is updated by maximizing the learned Q-function, adopting the deterministic policy gradient for policy updates:
where $J(\phi)$ is the expected return to be maximized, $\nabla_{a} Q$ is the gradient of the Q-value with respect to the action, and $\nabla_{\phi} \pi_{\phi}$ is the gradient with respect to the evaluate actor parameters $\phi$. Additionally, to ensure a stable training process, the target network parameters are updated by soft updating to track the evaluated networks:
where the soft factor determines the update rate of the target networks. It is worth noting that the update frequencies of the actor and critic networks in TD3 differ: the critic networks are updated more frequently than the actor network, so that the Q-value error is reduced before each policy update.
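A sketch of the delayed actor update and the soft target tracking is given below; rho denotes the soft factor (a symbol chosen here to avoid clashing with the joint torque), and all other names are illustrative.

```matlab
% Delayed policy update and Polyak soft updates (d is the policy delay).
if mod(step, d) == 0
    phi = phi + lrActor * actorGradient(batch, phi);  % policy gradient ascent
    theta1T = rho * theta1 + (1 - rho) * theta1T;     % soft-update critic 1 target
    theta2T = rho * theta2 + (1 - rho) * theta2T;     % soft-update critic 2 target
    phiT    = rho * phi    + (1 - rho) * phiT;        % soft-update actor target
end
```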
3.4.1. State Space Design
The state $s_t$ represents the environmental information perceived by the robotic manipulator agent. The objective is to design a controller that integrates sliding mode control based on disturbance-observation compensation with TD3 to reduce the errors between the actual and target positions, as well as between the actual and desired velocities of the robotic manipulator. Based on the design of the NDONFT controller and considering the structural characteristics of the designed 5-DOF robotic manipulator, the first three joints are sufficient to cover most of the target workspace, while the two end joints are mainly used to adjust the posture of the actuator. Therefore, simulation is conducted on the first three joints of the designed robotic manipulator, and the current state is defined as follows:
The first group of observations consists of the joint information of the robotic manipulator, specifically the sine and cosine of the joint position and the joint velocity. By replacing the joint position $q$ with $\sin(q)$ and $\cos(q)$, the discontinuous position measurements can be expressed through a continuous two-dimensional parameterization while implicitly constraining the represented joint position to $[-\pi, \pi]$. This approach greatly reduces the complexity of the DRL training process and facilitates training convergence. The second group of observations includes the joint position error and the error integral; the error integral reflects the long-term deviation of the system, and its inclusion helps to reduce the steady-state error. The third group of observations comprises the action at the previous time step and the disturbance observed at each joint. Observing the action helps prevent training from converging to extreme values and enhances the stability of the strategy, whereas observing the disturbances enables rapid assessment of the tracking performance and timely adjustment of the strategy. It should be noted that the observations are normalized to prevent gradient explosion.
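One plausible assembly of this observation vector is sketched below; the ordering and the normalization bounds are assumptions consistent with the description above.

```matlab
% Illustrative observation for one control step (all *_max bounds assumed).
s_t = [ sin(q); cos(q);          % continuous parameterization of joint positions
        qdot  ./ qdot_max;       % normalized joint velocities
        e     ./ e_max;          % normalized position tracking errors
        e_int ./ e_int_max;      % normalized error integrals (steady-state term)
        a_prev;                  % action applied at the previous time step
        dhat  ./ d_max ];        % normalized NDO disturbance estimates per joint
```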
3.4.2. Action Space Design
Based on the designed nonlinear disturbance observer-based terminal sliding mode control method, the action space of the DRL agent is defined as the proportional gain, integral gain, and robust gain of NDONFT, that is, $a_t = [k_p, k_i, k_r]$.
The network structures of the critic and actor in TD3NDONFT are shown in Figure 4. The two critic networks share the same structure, with an input layer consisting of the state vector $s_t$ and the action vector $a_t$. The four intermediate hidden layers are fully connected with 256, 128, 128, and 64 nodes, respectively, and the output layer produces a one-dimensional Q-value evaluating the action. The input layer of the actor network is the state vector $s_t$; its intermediate hidden layers are fully connected with 256, 128, 64, and 32 nodes, respectively, and its output layer is a parameter vector representing the action vector $a_t$. According to the action parameter constraints, the outputs are passed through a sigmoid layer to restrict actions within (0, 1); the actions are then scaled up by applying gains during simulation.
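For illustration, the actor network described above could be declared in MATLAB's Deep Learning Toolbox as follows. The ReLU activations are an assumption, since the text specifies only the layer widths and the final sigmoid; numObs and numAct are placeholder dimensions.

```matlab
% Actor network: state input, four hidden layers (256/128/64/32), sigmoid output.
actorLayers = [
    featureInputLayer(numObs)       % state vector input
    fullyConnectedLayer(256)
    reluLayer                       % activation choice assumed
    fullyConnectedLayer(128)
    reluLayer
    fullyConnectedLayer(64)
    reluLayer
    fullyConnectedLayer(32)
    reluLayer
    fullyConnectedLayer(numAct)     % one output per controller gain
    sigmoidLayer                    % restricts raw actions to (0, 1)
    ];
```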
3.4.3. Reward Function Design
During DRL training, the reward function plays a crucial role in effectively guiding the agent to meet task requirements. To address the problem of sparse rewards, a composite reward function integrating trajectory tracking control is designed, consisting of the following components.
(1) Tracking error: To reduce the tracking error of the robotic manipulator, a distance-based reward is established according to the distance between the current position and the desired position. The tracking error includes the position tracking error and the velocity tracking error, and the reward is expressed as follows:
where $w_1$, $w_2$, $w_3$, and $w_4$ are the weights assigned to the penalty components of position and velocity. As shown in Equation (46), when the position and velocity tracking errors approach zero, a higher reward is returned; when these errors are large, a lower reward is received.
(2) Influence of the previous time step: To reduce the influence of the previous control input on the current step and to prevent actions from approaching extreme values that would cause the agent to converge to a local optimum, the following term is incorporated into the reward function:
where the associated weight penalizes the magnitude of the action.
(3) Boundary reward: when the joint position exceeds a predefined limit, the following reward function applies:
Through all of the above designs, the total reward for robotic manipulator trajectory tracking is summarized as the sum of the three components above.
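One plausible realization of this composite reward is sketched below; the quadratic penalty forms and the weight names are assumptions consistent with the description, not the paper's exact expressions.

```matlab
% Composite reward: tracking-error term, action term, and boundary term.
r1 = -(wPos * norm(e_pos)^2 + wVel * norm(e_vel)^2);  % higher reward near zero error
r2 = -wAct * norm(a_prev)^2;                          % discourage extreme actions
r3 = -wBound * double(any(q < q_min | q > q_max));    % joint-limit violation penalty
r_total = r1 + r2 + r3;                               % reward returned to the agent
```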
Based on the above design, an NFTSMC method for robotic manipulators combining the NDO and the TD3 algorithm is proposed. The NDO estimates and compensates for lumped disturbances online, while the TD3 algorithm achieves parameter adaptation in terminal sliding mode control. Before training, the evaluate critic networks, the evaluate actor network, the target critic networks, and the target actor network are initialized with random parameters, and the experience replay buffer is initialized. Then, the robotic manipulator dynamics model is loaded as the environment. To improve the adaptive performance of the controller, the desired trajectory of the robotic manipulator is randomly initialized at the beginning of each episode. At each time step, the current state of the robotic manipulator is input into the actor network to obtain the action vector. The action vector is proportionally scaled and input into the sliding mode controller together with the disturbance estimate observed by the NDO. The reward at the current time step is calculated from data including the position error and velocity error, and the next state of the robotic manipulator is then obtained. The tuple generated during this process is stored in the experience replay buffer. When the buffer reaches the predefined size, a mini-batch of tuples is randomly sampled to update the six NNs mentioned above.
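The procedure can be condensed into the following training skeleton; every identifier is illustrative, and the environment is assumed to wrap the Simulink/Simscape manipulator model.

```matlab
% Skeleton of the training procedure (all identifiers are illustrative).
for ep = 1:maxEpisodes
    ref = randomDesiredTrajectory();                % randomized reference each episode
    s = envReset(ref);
    for t = 1:maxSteps
        a = evaluateActor(s) + explorationNoise();  % raw action in (0,1), scaled later
        [sNext, r, done] = envStep(a);              % NDONFT applies gains + NDO estimate
        replayBuffer = storeTransition(replayBuffer, s, a, r, sNext, done);
        if replayBuffer.count >= warmupSize
            batch = sampleMiniBatch(replayBuffer, miniBatchSize);
            updateCritics(batch);                   % minimize TD error on both critics
            if mod(t, policyDelay) == 0
                updateActor(batch);                 % delayed deterministic policy gradient
                softUpdateTargets();                % Polyak averaging of target networks
            end
        end
        s = sNext;
        if done, break; end
    end
end
```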
4. Simulation Studies
The workstation configuration and simulation environment used in this study are as follows: operating system, Windows 11 (64-bit); processor, Intel(R) Core(TM) i7-10700 CPU @ 2.90 GHz; RAM, 32.0 GB; simulation software, MATLAB R2024a, together with the Simulink environment and the Simscape toolbox. All development code is written in MATLAB.
In this section, in order to verify the effectiveness of the proposed algorithm, the two end joints of the designed robotic manipulator are fixed, and simulation is performed using the first three joints as an example. The Simulink and Simscape toolboxes are used to build a simulation model of the controlled robotic manipulator. The simulation model of the designed robotic manipulator trajectory tracking control algorithm is shown in Figure 5.
During the training process, the desired trajectory of the robotic manipulator is set as shown in Equation (49). Since the initial position of the prismatic joint is defined at the base during dynamic modeling, an initial position offset exists for this joint.
The initial positions of the three joints of the robotic manipulator are specified accordingly, and the initial desired positions can be calculated using Equation (49). The physical parameters of the designed manipulator are listed in Table 2.
To demonstrate the robustness of the proposed control scheme, modeling errors, friction, and external disturbances are introduced into the robotic manipulator system. In this study, 80% of the dynamic parameter values constitute the nominal model, while the remaining 20% represent modeling errors in the system dynamics.
The disturbance acting on each joint of the robotic manipulator is assumed to be time-varying and to consist of three components:
where the first component is a small time-varying disturbance, the second is joint friction, and the third is an unknown large time-varying disturbance introduced at 5 s to verify the adaptability and robustness of the proposed control method.
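For reference, a disturbance profile consistent with this description could be generated as follows; all amplitudes, frequencies, and the friction model are assumptions, since the paper's exact expressions are not reproduced here.

```matlab
% Illustrative three-component joint disturbance (all values assumed).
d1 = 0.1 * sin(2 * t);                 % small time-varying disturbance
d2 = 0.2 * sign(qdot) + 0.05 * qdot;   % Coulomb plus viscous joint friction
d3 = 2.0 * (t >= 5);                   % large unknown disturbance applied at 5 s
d  = d1 + d2 + d3;                     % total lumped disturbance per joint
```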
The proposed algorithm is compared with non-singular fast terminal sliding mode control based on disturbance observer (NDONFT), non-singular fast terminal sliding mode control (NFTSM), NDOPID, and PID controllers to verify its effectiveness.
(1) PID: Proportional-integral-derivative controller; the control law is as follows:
For the PID controller, $K_p$, $K_i$, and $K_d$ represent the proportional, integral, and derivative coefficients, respectively. The selection of these parameters mainly relies on experience and tuning, with the principle of minimizing oscillations during the initial position tracking phase while ensuring trajectory tracking accuracy.
(2) NDOPID: PID controller incorporating the nonlinear disturbance observer; the control law is as follows:
For the NDOPID controller, in order to ensure a fair comparison, its basic parameters are selected to match those of the PID controller; the disturbance observer then estimates the disturbances to compensate the PID control law.
(3) NFTSM: Non-singular fast terminal sliding mode control without the disturbance compensation provided by the NDO and the parameter adaptation provided by the TD3 algorithm. The control law is as follows:
To ensure fairness in the controller comparison, the sliding mode surface coefficients are set equal to those in the NDONFT and TD3NDONFT controllers, and the remaining control coefficients are set equal to those in the NDONFT controller.
(4) NDONFT: Non-singular fast terminal sliding mode controller based on the NDO, without the adaptive strategy of TD3. The control law is as follows:
For the NDONFT controller, $X$ is an invertible matrix used to calculate the observer gain matrix and auxiliary variables. The same value of $X$ is used across all controllers involving disturbance observers.
(5) TD3NDONFT: Non-singular fast terminal sliding mode control based on TD3 and the NDO, with adaptive adjustment of the control parameters achieved via the TD3 algorithm. The control law is as follows:
For the TD3NDONFT controller, the adaptive parameters $k_p$, $k_i$, and $k_r$ are obtained via the TD3 algorithm. A sensitivity analysis of the controller parameters is conducted to set the range of parameter variation during training. The training parameters of the TD3 algorithm are listed in Table 3.
The parameter settings of the five control laws mentioned above are listed in Table 4.
Figure 6 shows the reward curve of the TD3 algorithm presented in this paper, which serves as a key indicator for evaluating the training effectiveness of DRL algorithms. During the policy exploration phase, the curve shows considerable volatility with a low and unstable average reward level, indicating that the strategy of the agent has not yet effectively explored action sequences with higher rewards and remains in an exploratory learning phase. Subsequently, the performance enters an optimization phase, with the average reward showing an overall upward trend and volatility gradually decreasing, indicating the strategy begins to learn more effective action selection mechanisms, leading to significant performance improvement. In the strategy convergence phase, the average reward tends to stabilize and the standard deviation converges to a low level, indicating the strategy has reached a stable convergence state, demonstrating strong robustness and generalization capability.
To further compare the dynamic characteristics of the five controllers, this paper evaluates controller performance using the mean absolute error (MAE) of joint position and joint velocity, the integral absolute error (IAE), and the integral time absolute error (ITAE):
$$\mathrm{MAE} = \frac{1}{T}\int_{0}^{T} |e(t)|\,dt, \qquad \mathrm{IAE} = \int_{0}^{T} |e(t)|\,dt, \qquad \mathrm{ITAE} = \int_{0}^{T} t\,|e(t)|\,dt$$
where $T$ represents the total simulation time.
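These indices can be computed directly from the logged error signal; the short sketch below assumes an error time series e sampled at times t over the horizon [0, T].

```matlab
% Performance indices from an error time series e(t) sampled at times t.
MAE  = mean(abs(e));            % mean absolute error
IAE  = trapz(t, abs(e));        % integral of absolute error over [0, T]
ITAE = trapz(t, t .* abs(e));   % time-weighted integral of absolute error
```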
After the training is completed, the desired trajectories of joint 1 and joint 3 of the robotic manipulator, together with the desired trajectory of joint 2, are set for the simulation.
Figure 7 shows the simulation results of joint position tracking and the tracking errors. All five algorithms ensure trajectory tracking within a certain error range before and after the sudden disturbances. Under the different control methods, the tracking errors of the three joints fall within the ranges of [−0.03, 0.03] rad, [−0.01, 0.015] m, and [−0.04, 0.04] rad, respectively, demonstrating a certain degree of disturbance rejection under sudden disturbances. In the initial stage, the PID and NDOPID control methods exhibit significant overshoot, while the NFTSM, NDONFT, and TD3NDONFT controllers track the desired trajectory relatively quickly. At 5 s, the joints experience a large unknown sudden disturbance. The maximum position tracking errors of the three joints after the disturbance are 0.0048 rad, 0.0014 m, and 0.0041 rad for the proposed TD3NDONFT algorithm; 0.0054 rad, 0.0016 m, and 0.0051 rad for the NDONFT method; and 0.0188 rad, 0.0092 m, and 0.0393 rad for the NFTSM method. The results indicate that the proposed control strategy better suppresses external unknown time-varying disturbances.
The MAE indicators for position tracking under the different control strategies are presented in Table 5. Both before and after the sudden disturbances are applied, the proposed control method demonstrates superior tracking performance. During 0–10 s, the MAE of the joint positions using the TD3NDONFT algorithm is reduced by 7.14%, 19.94%, and 6.14% compared with NDONFT, by 64.58%, 88.06%, and 84.53% compared with NFTSM, by 53.35%, 85.40%, and 63.43% compared with NDOPID, and by 70.60%, 88.48%, and 77.70% compared with PID, respectively. The NDONFT algorithm reduces the MAE by 49.76%, 81.77%, and 61.04% compared with NDOPID.
Figure 8 shows the joint velocity tracking of the robotic manipulator and the corresponding errors. In the initial stage, the velocity tracking exhibits a distinct reverse jump. This occurs because the initial state of the robotic manipulator does not match the desired state, which causes the controller to overcompensate at this stage and subsequently enables it to quickly track the desired velocity. Analysis of the velocity tracking error in Figure 8 indicates that although the system velocity experiences significant deviation after the sudden disturbance, all five control methods converge to the desired velocity. The maximum velocity tracking error of joint 1 after the sudden disturbance is 0.0409 rad/s using the proposed TD3NDONFT algorithm, 0.0436 rad/s using the NDONFT method, 0.0506 rad/s using the NFTSM method, 0.0602 rad/s using the NDOPID method, and 0.0691 rad/s using the PID method.
Table 6 shows the MAE performance indicators of velocity tracking under the different control methods. Both before and after the sudden disturbances are applied, the proposed algorithm maintains a smaller velocity MAE after reaching the steady state. Within 0–10 s, the velocity MAE of each joint using the TD3NDONFT algorithm is reduced by 1.78%, 9.10%, and 2.11% compared with NDONFT, by 19.26%, 58.01%, and 44.64% compared with NFTSM, by 14.18%, 37.37%, and 17.03% compared with NDOPID, and by 27.18%, 48.67%, and 32.45% compared with PID, respectively. This indicates that the proposed algorithm achieves high velocity tracking accuracy and robustness.
Figure 9 and Figure 10 show the IAE and ITAE indicators for position and velocity tracking using the five algorithms. The IAE indicators for the three joints of the proposed algorithm are the lowest in the 0–5 s, 5–10 s, and 0–10 s intervals, indicating that the TD3NDONFT algorithm achieves high position and velocity tracking accuracy both before and after experiencing unknown time-varying disturbances. Next, the effectiveness of the TD3 adaptive algorithm, the nonlinear disturbance observer, and the improved non-singular fast terminal sliding mode within the proposed TD3NDONFT algorithm is analyzed.
Within 0–10 s, the TD3NDONFT algorithm reduces the IAE of joint position tracking by 7.16%, 19.97%, and 6.16% compared to NDONFT, and reduces the IAE of velocity tracking by 1.78%, 9.11%, and 2.11%, indicating that introducing the TD3 adaptive algorithm improves the tracking performance of the robotic manipulator. Compared to NFTSM, NDONFT reduces the IAE of position tracking for each joint by 61.91%, 85.11%, and 83.56%, and reduces the IAE of velocity tracking by 17.81%, 53.82%, and 43.47%, respectively. Compared to PID, NDOPID reduces the position tracking IAE by 37.01%, 21.09%, and 39.05%, and the velocity tracking IAE by 15.17%, 18.05%, and 18.60%, indicating the effectiveness of introducing the nonlinear disturbance observer. Compared to NDOPID, NDONFT reduces the position tracking IAE by 49.82%, 81.79%, and 61.11%, and the velocity tracking IAE by 12.63%, 31.11%, and 15.25%, demonstrating the effectiveness of the improved non-singular fast terminal sliding mode. At the same time, the ITAE of position and velocity tracking for the three joints using the TD3NDONFT algorithm is the lowest among the five algorithms.
The variations of the proposed adaptive parameters are shown in Figure 11. The proportional gain $k_p$ is initially small so as to generate low joint torque and avoid excessive initial torque, and then gradually increases to improve joint tracking capability. However, the prismatic joint must overcome its own weight at startup, so a larger initial proportional gain is required there. The integral gain $k_i$ is larger for the prismatic joint than for the revolute joints in order to reduce the steady-state error of the system. Since the nonlinear disturbance observer has already estimated most of the disturbances, only a small robust gain $k_r$ is required to compensate for the remaining disturbance. At 5 s, when the sudden disturbance occurs, all parameters adjust accordingly to maintain tracking accuracy under unknown time-varying disturbances.
Figure 12 shows the control input torque for each joint of the robotic manipulator. In the initial stage, the PID and NDOPID algorithms exhibit large torque oscillations, which may adversely affect the joint actuators and may even cause motor driver overload or failure. At 5 s, when the sudden disturbance occurs, the torques of all five control methods fluctuate significantly. The torque response under the proposed TD3NDONFT algorithm is smoother, indicating that the method maintains good dynamic performance while ensuring steady-state accuracy.
Figure 13 shows the actual disturbances during trajectory tracking and the disturbances estimated by the nonlinear disturbance observer. The nonlinear disturbance observer within the proposed TD3NDONFT algorithm effectively estimates the disturbances affecting each joint of the robotic manipulator. The joint disturbance estimation errors are shown in Figure 14.
To further validate the generalization capability of deep reinforcement learning, desired trajectories with more complex dynamic characteristics are introduced for testing. The desired trajectories for joint 1 and joint 3 of the robotic manipulator, as well as the desired trajectory for joint 2, are set accordingly.
Figure 15 presents the position tracking curves and corresponding tracking errors of each joint under the different control strategies, while Figure 16 illustrates the velocity tracking performance and velocity tracking errors of the joints. The simulation results demonstrate that the proposed method maintains excellent accuracy and robustness even in complex trajectory tracking tasks, fully reflecting its strong generalization performance.