1. Introduction
Teleoperation enables a human operator to control a remote slave manipulator via a local master device, allowing the robotic system to perform tasks in hazardous or inaccessible environments. The core of such systems lies in establishing a high-fidelity bidirectional channel: the master transmits motion commands (e.g., position/velocity), while the slave returns interaction forces or tactile signals, thereby seamlessly integrating human cognition with robotic execution. This technology has been widely applied in nuclear maintenance, space and deep-sea exploration, and telesurgery. From an engineering perspective, the mechanical design of teleoperation systems varies substantially across applications. In on-orbit servicing, redundant manipulators with high degrees of freedom are typically used for satellite capture and maintenance [
1]. In deep-sea exploration, underwater vehicle–manipulator systems (UVMSs) must be pressure-resistant and watertight, while also compensating for hydrodynamic disturbances [
2]. In high-risk settings such as nuclear facilities, heterogeneous master–slave configurations are common: the slave prioritizes payload capacity and durability, whereas the master emphasizes ergonomics and realistic force feedback. In advanced manufacturing and medical rehabilitation [
3], extremely high tracking accuracy and force transparency are required to support precision tasks such as micro-assembly, polishing, and tele-rehabilitation training. Regarding the evolution of bilateral teleoperation control structures and performance metrics, Hokayem and Spong provided a relatively comprehensive historical review, offering a reference for subsequent method comparisons and problem definition [
4]. For haptic interaction devices, Xiong et al. proposed an object-grasping strategy based on an improved stiffness-display device, enhancing the operator’s perception of contact states and improving operational stability [
5].
Realizing these diverse applications hinges on effective bilateral control. A primary objective of high-performance bilateral teleoperation is to ensure both stability and transparency [
6,
7]. While maintaining closed-loop stability, the system must accurately synchronize the slave position with the master motion and faithfully transmit the interaction forces between the slave and the environment to the operator in real time [
8,
9]. Achieving these dual objectives in practice is challenged by several key factors. Time-varying delays inherent in long-distance communication can break passivity, causing phase lag and energy accumulation that may severely compromise stability [
10]. Furthermore, robotic systems exhibit intrinsic nonlinear dynamics, including friction, gravity, and Coriolis/centrifugal force coupling. Payload variations and uncertainties in dynamic parameters can significantly degrade tracking accuracy and controller robustness [
11,
12,
13]. In unstructured, dynamic environments, contact force signals are often contaminated with high-frequency noise and transients. Moreover, environmental dynamics (e.g., stiffness and damping) are typically unknown and time-varying, which persistently degrades force-feedback fidelity and overall transparency.
Figure 1 illustrates a typical master–slave teleoperation architecture and highlights the fundamental interactions among the master, the slave, the communication channel, and the environment.
Significant research efforts have been devoted to addressing the aforementioned challenges. Early studies primarily examined how communication delays affect system stability. Based on passivity theory, Reference [
14] systematically established stability conditions for teleoperation systems with time delays, providing a theoretical foundation for subsequent controller design. Building upon this work, Anderson and Spong [
15] proposed the scattering transformation to preserve passivity and stability under delayed communication. Subsequently, Niemeyer and Slotine [
16] further developed the wave-variable control technique, which maintains passivity in the presence of communication delays by transmitting wave variables rather than direct power signals. Reference [
17] investigated teleoperation systems with heterogeneous master–slave configurations and proposed a coordinated impedance-matching strategy that improves performance under asymmetric structures. Reference [
18] introduced a state-prediction mechanism for synchronization control of multi-degree-of-freedom manipulators under time-varying delays, thereby reducing tracking lag. However, passivity-based and wave-variable approaches often sacrifice transparency in practice, potentially causing position drift, force distortion, and wave reflection [
19,
20]. In this context, Reference [
21] investigated robust control under the combined effects of time-varying delays and model uncertainties. By constructing an adaptive observer to jointly estimate system states and disturbances, it improved stability maintenance under parameter perturbations. To improve tracking performance while ensuring stability, proportional–derivative (PD)-type controllers have been widely adopted in teleoperation systems. Nuño et al. [
22,
23,
24] showed that simple structures such as P + d or PD + d can ensure stability under certain conditions, offering a practical baseline for engineering implementation. To cope with parametric uncertainties in manipulator dynamics, adaptive control schemes have also been developed. For instance, Chen et al. [
25] proposed an adaptive controller integrated with environment force estimation, while Huang et al. [
26] employed a radial basis function neural network (RBFNN) to approximate unknown dynamics. A common limitation of these methods is that they typically guarantee only asymptotic stability, which may lead to insufficient convergence speed in teleoperation tasks requiring rapid response and high dynamic performance. Consequently, finite-time control has attracted attention for its potential to deliver faster convergence and higher precision. Representative works include the adaptive fuzzy finite-time controller by Yang et al. [
27] and the use of nonsingular terminal sliding-mode control by Li et al. [
28] and Zhang et al. [
29] in teleoperation systems. Although these methods provide theoretically improved convergence, many involve complex structures and require tuning multiple parameters, which complicates practical implementation. Hashemzadeh et al. [
30] proposed a nonlinear tracking controller that estimates uncertainties online to enhance robustness against time-varying delays. Nevertheless, like other PD-based and asymptotically convergent designs, this approach may still exhibit limited transient response and disturbance rejection when facing abrupt force changes or significant parameter variations, and it may fail to drive tracking errors into a desired bound within a finite time.
To address these issues, this paper proposes a high-performance control framework for position synchronization and force feedback in complex master–slave teleoperation systems. The main contributions are summarized as follows:
- (a)
An integrated dynamic estimation and communication strategy is proposed. A radial basis function neural network (RBFNN) is employed to estimate online the unknown nonlinear dynamics associated with the manipulators, the operator, and the environment [
25,
31]. Specifically, environment interaction parameters are identified online by a slave-side RBFNN. Instead of transmitting raw force/torque signals (power variables) through the delayed channel, only the learned network weights are communicated to the master, where the environment torque is reconstructed locally using the current master states [
25,
26]. By avoiding direct transmission of power signals over the time-delay channel, the proposed strategy mitigates passivity degradation caused by delayed force feedback and provides a delay-independent basis for stability analysis.
- (b)
A comprehensive super-twisting sliding-mode control (STSMC) scheme is developed. By incorporating STSMC theory [
32,
33], the proposed controller achieves fast convergence, high tracking accuracy, and strong robustness against uncertainties and disturbances in the position-tracking loop, thereby overcoming the asymptotic convergence and slow transient response of conventional proportional–derivative (PD)-based and adaptive methods.
- (c)
A practical force-signal conditioning method is implemented. A first-order low-pass filter is introduced to denoise and parameterize the measured environment interaction force signals [
26]. This preprocessing step suppresses the adverse effects of high-frequency measurement noise and force transients on feedback quality and closed-loop stability.
The remainder of this paper is organized as follows.
Section 2 presents the dynamic modeling of the master–slave teleoperation system and outlines the required theoretical preliminaries.
Section 3 details the controller design and provides a rigorous Lyapunov-based stability analysis.
Section 4 validates the proposed approach through numerical simulations of a two-link manipulator, physics-based simulations of a seven-degree-of-freedom manipulator, and real-world experiments on a seven-degree-of-freedom robotic platform. Finally,
Section 5 concludes the paper and discusses potential directions for future research.
3. Adaptive Super-Twisting Controller Design
3.1. Slave-Side Environmental Torque Model
In practical teleoperation systems, the interaction between the slave manipulator and the environment is subject to significant uncertainties and measurement noise. In particular, the raw force measurements used to estimate the environmental interaction torque are often contaminated by sensor noise and external disturbances. Directly incorporating such signals into the control loop may degrade system robustness and stability.
To mitigate the influence of measurement noise, a first-order discrete-time low-pass filter is introduced to preprocess the measured environmental force signal
. Let
denote the discrete sampling instant. The filtered force signal
is generated according to:
where
is the filter coefficient determining the trade-off between noise attenuation and signal responsiveness.
The filtered signal
is then incorporated into the environmental impedance model to obtain a smoothed estimate of the environmental interaction torque:
where
and
denote the environment stiffness and damping matrices, respectively.
To further improve the estimation accuracy of the environmental torque under modeling uncertainties, a Radial Basis Function Neural Network (RBFNN) is employed to approximate the nonlinear environmental dynamics online. The environmental torque can therefore be represented as follows:
where
denotes the desired weight coefficient matrix of the neural network.
denotes the basis function vector. To minimize the estimation error between the network output and the true environmental torque
, the weight matrix
is adjusted online according to a designed adaptive update law. The adaptive update law is formulated as follows:
where
denotes a bounded, positive-definite adaptive gain matrix and
is a positive constant.
Using the current weight estimate
, the slave-side environmental torque is reconstructed in the form:
To fundamentally mitigate the instability risks associated with time-varying communication delays, a strategy based on Model-Mediated Teleoperation (MMT) is adopted. Instead of transmitting the raw environmental torque , which inherently carries power conjugate variables and closes a delayed energetic loop, we transmit the estimated environmental parameters .
The transmission of the weight matrix represents a flow of information rather than energy. In traditional bilateral control (e.g., position-force architecture), the delayed force signal acts as an active energy source, potentially violating passivity. In contrast, by reconstructing the torque locally at the master side using local states and delayed parameters , the communication channel is effectively decoupled from the power exchange ports because only model parameters (RBF weights) are transmitted. Consequently, the communication delay does not appear as a delayed power-conjugate feedback path; instead, it manifests as a parametric lag in the received weight vector, which produces a reconstruction mismatch in the rendered environmental torque. Under bounded time-varying delays and bounded parameter update rates, this mismatch is bounded and can be lumped into the matched perturbation term in the sliding dynamics. Therefore, the delay primarily influences rendering fidelity (transparency) while the closed-loop interaction stability is ensured by the subsequent robust super-twisting design.
To mitigate the instability introduced by communication delays, the environmental torque is not transmitted directly back to the master. Instead, the learned environmental characteristic parameters—namely, the weight matrix are communicated through the channel. This strategy offers two key advantages. First, since environmental characteristics generally evolve at a lower frequency than the force signal itself, transmitting the weight matrix places a lower demand on communication bandwidth. Second, the delayed channel no longer carries the raw force/torque signal (a power variable); hence, the classical passivity degradation caused by delayed force feedback is mitigated. The residual mismatch introduced by using delayed parameters in the reconstruction is treated as a bounded modeling error and will be accounted for in the subsequent stability analysis.
3.2. Slave-Side Controller Design
To ensure position synchronization under uncertain communication delays, a trajectory planner is deployed on the slave side. This module takes the delayed master joint position
as its input, where
denotes the time-varying forward delay in the communication channel. The planner then generates a smooth reference trajectory, comprising the desired position
and corresponding first and second derivatives
and
, for the slave manipulator to track. A stable second-order low-pass filter is adopted to transform the intermittent master-position commands into a continuously differentiable slave trajectory. The transfer function of the filter is selected as follows:
where
denotes the positive time constant of the filter. Taking the master position
as the input to the filter, define the delayed master joint position as follows:
The output can be expressed as follows:
The corresponding expression in the time domain is given by the following differential equation:
Substitute the delayed definition
:
The generated signals
,
, and
are used as the reference trajectory for the slave manipulator to track. Accordingly, the position tracking error
, and its first derivative with respect to the time
are defined as follows:
Then, the slave-side sliding surface
is designed as follows:
where
is a positive-definite diagonal matrix determining the slope of the sliding surface and
is a user-designed diagonal (or symmetric) positive definite matrix, i.e.,
with
, which shapes the convergence rate of the tracking-error dynamics on the sliding manifold. A larger
accelerates error convergence but may increase control effort and sensitivity to measurement noise.
Differentiating (17) yields the expression for
:
where the model uncertainty term, denoted as
, is defined as follows:
To compensate for model uncertainties, a Radial basis function neural network (RBFNN) is utilized for online approximation, and we have:
where
denotes the desired approximation parameter matrix of the neural network.
denotes the neural network input vector, which is defined as
.
denotes the neural network approximation error vector.
denotes the Gaussian basis function vector, which is defined as
, and
is defined as follows:
where
denote the center vector of the
th hidden-layer neuron, and
is the Gaussian basis function width. Note that the model uncertainty term
encompasses unmodeled dynamics and parametric perturbations in the dynamic model of the slave manipulator.
Based on the above design, the slave controller is constructed as follows:
where
denotes the linear gain matrix,
denotes the gain matrix of the super-twisting algorithm, and
denotes the integral gain matrix. To ensure closed-loop stability and convergence of the RBF neural network weight estimates, the adaptive update law for the slave system is formulated, and we obtain:
where
is a positive-definite diagonal gain matrix.
3.3. Master-Side Controller Design
In a teleoperation system, the master-side controller must achieve two primary objectives: (a) to provide the operator with high-fidelity, transparent force feedback, and (b) to ensure precise tracking of the desired motion trajectory. To this end, this paper develops an impedance-model-based controller that simultaneously renders realistic environmental interaction forces and guarantees stable human–robot interaction.
To cope with the time-varying communication delay, the master receives the environment model parameters (i.e., the neural-network weights) estimated at the slave side with a backward delay . It is emphasized that the physical interaction takes place at the slave–environment interface; therefore, the master does not directly measure the true environmental torque. Instead, the received parameters are used to render an equivalent environment-emulated torque locally for haptic display, which mitigates the phase lag typically introduced by directly transmitting delayed force/torque signals.
Specifically, the slave approximates the interaction torque via a multi-output RBF network, and only the corresponding weight matrix is transmitted to the master. Let
denote the Gaussian basis-function vector evaluated at the current master state, and let
be the estimated weight matrix. Using the delayed weights
, the master computes the rendered torque candidate as follows:
For the special case
, (24) reduces to the conventional scalar dot-product form. For general
-DOF manipulators, the matrix form in (24) is required to ensure dimensional consistency of the reconstructed joint-torque vector. This reconstruction utilizes the current master motion state to generate
, while the learned interaction characteristics are carried by the transmitted weights; consequently, the feedback loop avoids directly introducing delayed force signals and thereby alleviates phase lag in the rendered haptic feedback. Based on the desired impedance model of the master manipulator and using
as the input, the desired trajectory planner for the master is derived as follows:
where
,
, and
denote the inertia, damping, and stiffness matrices, respectively. These matrices are adjustable and can be tuned according to the operator’s preferences, perceptual requirements, and the actual dynamics of the master device to optimize haptic rendering performance. Consequently, the design of the master controller is essentially reduced to ensuring that the actual master trajectory
accurately tracks the desired trajectory
generated by the impedance model. For the master manipulator, the tracking error
and its first-order derivative
can be obtained as follows:
Similarly,
is a positive-definite diagonal sliding-surface gain matrix for the master side. Then, the master-side sliding surface
is designed as follows:
Differentiating (23) with respect to the time yields:
Consistent with the derivation of the slave-side controller, the uncertainties, frictional effects, and unmodeled dynamics present in the master-side dynamic model are aggregated into a lumped model uncertainty term
, which is expressed as follows:
To compensate for model uncertainties, a Radial basis function neural network (RBFNN) is also employed for online approximation and compensation, and we have:
where
denotes the desired weight matrix, and l corresponds to the number of neurons in the hidden layer.
denotes the approximation error vector.
denotes the Gaussian basis function vector. The center vector of the ith hidden layer node is expressed as
, and the width parameter is denoted as
. Then,
can be determined by:
Since the model uncertainty term incorporates both the dynamic uncertainties of the master manipulator and those from the environmental interaction. Accordingly, the input vector for the neural network is chosen as follows: .
Following the design framework established above, the master manipulator controller is formulated as follows:
where
denotes the linear gain matrix,
denotes the gain matrix of the super-twisting algorithm, and
denotes the integral gain matrix.
To ensure the stability of the closed-loop system and the convergence of the neural network weights, the corresponding adaptive update law for the master side is designed as follows:
where
is a positive-definite diagonal gain matrix.
Integrating the master and slave controller designs presented in
Section 3.1 and
Section 3.3, the complete architecture of the bilateral teleoperation control system is illustrated in
Figure 2. The left and right blocks depict the master-side and slave-side controllers, respectively.
3.4. Stability Analysis
This subsection establishes the stability properties of the slave-side and master-side controllers developed in
Section 3.2 and
Section 3.3. It should be emphasized that, due to the presence of the super-twisting integral action, the RBFNN-based adaptive compensation, and the delay-induced reconstruction mismatch on the master side, the resulting closed-loop system is not directly identical to the simplest scalar canonical super-twisting form. Therefore, the analysis proceeds by first rewriting the closed-loop sliding dynamics into an augmented form consisting of a mechanical-energy subsystem and an internal super-twisting state and then invoking the strict Lyapunov framework of [
32,
33] for the corresponding augmented super-twisting dynamics.
Before presenting the main results, the following assumptions are introduced.
Assumption 1
([
34,
35])
. For each , the inertia matrix is symmetric, positive definite, and uniformly bounded. That is, there exist positive constants and such that:Moreover, the matrix is skew-symmetric.
Assumption 2
([
28])
. The RBFNN approximation residual is bounded on the compact operating set, namely: where are unknown but finite constants. Assumption 3
([
36,
37])
. The lumped matched disturbance is bounded, i.e.,where are unknown but finite constants. Assumption 4.
The forward and backward communication delays are bounded: In addition, the transmitted environmental weight estimate has a bounded update rate:and the Gaussian basis vector used in environmental torque reconstruction is bounded: Hence, the delay-induced reconstruction mismatch:satisfies: Therefore, can be absorbed into the matched perturbation term of the master-side sliding dynamics.
Assumption 5
([
32,
33])
. The lumped matched uncertainty entering the sliding dynamics is locally absolutely continuous and has a bounded derivative. Specifically, define:and assume that:where are unknown but finite constants. Assumption 5 is the regularity condition required for invoking the strict Lyapunov theory of the super-twisting algorithm in [
32,
33]. It does not, by itself, imply boundedness of the internal integral state; rather, after the closed-loop sliding dynamics are rewritten in the corresponding augmented form, it allows one to rigorously establish boundedness of the internal super-twisting state and the resulting finite-time or ultimate boundedness properties of the sliding variables.
For compactness, define the element-wise nonlinear vector:
Next, introduce the internal integral states associated with the super-twisting terms:
whose dynamics satisfy:
By substituting the slave-side control law and adaptive law into the slave-side sliding dynamics, and similarly substituting the master-side control law and adaptive law into the master-side sliding dynamics, the two closed-loop subsystems can be written in the unified form:
where:
denotes the neural-network weight estimation error.
To explicitly handle the integral component, define the augmented variable
Equations (50) and (51) show that both the slave-side and master-side closed-loop sliding dynamics admit the same augmented super-twisting structure. Therefore, a unified proof can be carried out for .
Theorem 1.
Under Assumptions 1–5, consider the proposed bilateral teleoperation controller. Suppose that, for each , the matrices , , , , and are positive definite, and the gains and satisfy the standard gain conditions of the super-twisting algorithm in [32,33]. Then the following statements hold simultaneously for both the slave side and the master side: all closed-loop signals are bounded; the sliding variables are uniformly ultimately bounded; in the ideal case , the sliding variables converge to zero in finite time; once the sliding motion is established, the tracking errors converge to zero exponentially. Proof. Consider the composite Lyapunov candidate:
where:
with:
We first compute the derivative of
. Differentiating
yields:
By Assumption 1,
is skew-symmetric, and therefore:
Substituting the unified closed-loop sliding dynamics (50) into (57), one obtains:
Differentiating
with respect to the time yields:
By the trace identity:
It follows that:
Combining (58) and (61), the neural-network parameter estimation term is exactly canceled, and thus:
Summing over
, we obtain:
At this stage, the term cannot be discarded directly. Unlike informal arguments in which the integral term is simply viewed as an additional dissipative component, the present proof explicitly incorporates the integral action into the augmented variable . The boundedness of must therefore be established rigorously.
From (51), for each
:
Since Assumption 5 ensures that
is bounded, and since the gains
and
satisfy the standard gain conditions of the super-twisting algorithm, the strict Lyapunov theory in [
32,
33] guarantees that, for each subsystem
, there exists a positive definite function:
and class-
functions
,
, and
such that:
and
Define the total augmented Lyapunov function:
Therefore, under general bounded perturbations, each augmented pair
is uniformly ultimately bounded; in the ideal case
, each pair
converges to the origin in finite time. In particular, for each
, there exists a finite constant
such that:
Using (70) in (62), and noting that
and
, there exist constants
and
such that:
and
Whenever
, one has:
which implies:
Hence, each
is nonincreasing outside the compact set:
and, therefore, each sliding variable
is uniformly ultimately bounded.
Since
contains quadratic terms in both
and
, the boundedness of
implies boundedness of
and
. Moreover, since
is bounded by (70),
is bounded by Assumptions 2–5, and:
It follows that the internal integral state
is also bounded. Consequently, all closed-loop signals on both the slave side and the master side are bounded.
Next, by the definition of the sliding surface:
The tracking error dynamics satisfy:
Since is positive definite and is uniformly ultimately bounded, (81) represents an exponentially stable linear system driven by a bounded input. Therefore, is uniformly ultimately bounded for both and .
Finally, consider the ideal case
. Then, by the strict Lyapunov theory of the super-twisting algorithm in [
32,
33], each augmented pair
converges to the origin in finite time. Hence, for each
, there exists a finite time
such that:
Accordingly, the tracking error dynamics reduce to:
from which the exponential convergence of both
and
to zero follows immediately.
This completes the proof.
3.5. Controller Parameter Selection
The proposed adaptive super-twisting sliding-mode control scheme contains several design parameters that affect the convergence speed, robustness, and steady-state performance of the teleoperation system. In practice, these parameters can be selected according to the following guidelines.
First, the sliding-surface gain matrices and determine the convergence rate of the tracking error dynamics on the sliding manifold. Larger gains accelerate error convergence but may increase control effort and sensitivity to measurement noise.
Second, the super-twisting gains , , and mainly influence the reaching phase of the sliding motion. Increasing these gains enhances disturbance rejection and speeds up convergence to the sliding surface. However, excessively large gains may introduce control chattering or actuator saturation.
Furthermore, the adaptive gains in the neural-network weight update laws regulate the learning speed of the RBFNN approximation. Larger adaptive gains enable faster compensation of model uncertainties but may amplify oscillations caused by measurement noise.
Finally, the filter coefficient in the environmental force preprocessing module determines the trade-off between noise attenuation and signal responsiveness. A smaller provides stronger noise suppression but introduces additional lag, whereas a larger improves responsiveness at the expense of noise sensitivity.
In this study, the controller parameters are selected empirically to balance tracking performance, robustness, and implementation feasibility. The specific parameter values used in simulations and experiments are provided in
Section 4.
5. Conclusions
Bilateral teleoperation systems are primarily challenged by model uncertainties, environmental disturbances, and transparency degradation. To address these issues, this paper develops an adaptive super-twisting sliding-mode control framework to achieve high-performance position synchronization and force feedback. The proposed scheme incorporates a first-order low-pass filter to preprocess environmental force signals and utilizes an RBF neural network for online torque estimation, effectively mitigating noise and improving estimation fidelity. Furthermore, an adaptive compensation mechanism is integrated into the sliding-mode controller to handle system uncertainties in real time. The closed-loop stability is established via Lyapunov stability theory. Both simulation and experimental results demonstrate that the proposed method maintains high-precision position tracking and ensures accurate environmental force reflection, even under dynamic uncertainties and external disturbances. Compared with conventional PD-based control strategies, the new framework exhibits superior convergence rates, higher steady-state accuracy, and enhanced robustness.
Future research will focus on extending this framework to multi-manipulator cooperative teleoperation and coordinated control under task-level constraints. Additionally, we aim to develop advanced modeling techniques for unstructured environments—accounting for nonlinearities such as friction and backlash—and integrate data-driven learning methods for intent prediction and autonomous parameter tuning.