1. Introduction
Underwater manipulators serve as essential platforms for deep-sea resource exploration and operations, playing an irreplaceable role in underwater infrastructure maintenance, emergency salvage, and scientific sampling tasks [
1,
2,
3]. However, due to their strongly coupled multi-degree-of-freedom nonlinear dynamics and the influence of complex unstructured environments such as ocean currents and contact collisions, accurate modeling and highly robust control of underwater manipulators remain challenging and have become long-standing research topics in the control community [
4]. In particular, underwater operation introduces additional hydrodynamic effects, including added mass, nonlinear fluid damping/drag, and strong coupling forces induced by the surrounding fluid. These effects are commonly described based on marine craft hydrodynamic modeling principles [
5], and drag forces are often represented using Morison-type formulations [
6]. The presence of such uncertainties significantly increases model complexity and degrades tracking accuracy. Compared with conventional control methods such as proportional–integral–derivative (PID) control [
7] and model predictive control (MPC) [
8], which are often limited by model accuracy or computational complexity, sliding mode control (SMC) [
9] has been widely adopted for underwater manipulator trajectory tracking owing to its inherent robustness and insensitivity to parametric uncertainties and external disturbances.
To further enhance the overall performance of sliding mode control in underwater environments, extensive research efforts have been devoted to its improvement. From the perspective of algorithmic optimization, Zhong et al. [
10] and Liu et al. [
11] mitigated chattering and enhanced disturbance rejection by introducing adaptive fuzzy gain regulation and improved reaching laws, respectively. Meanwhile, Zaare et al. [
12] developed a predefined-time nonsingular terminal sliding mode controller, achieving global convergence within a prescribed time. In terms of environmental adaptability, Mao et al. [
13] and Ge et al. [
14] validated the effectiveness of sliding mode strategies under complex hydrodynamic conditions by incorporating disturbance observers and optimized reaching laws to address high-speed currents and wave-induced disturbances.
Despite the effectiveness of these improved approaches, the inherent chattering phenomenon of conventional sliding mode control remains a critical factor limiting its achievable precision, which has motivated the development of terminal sliding mode control [
15,
16] and high-order sliding mode control [
17]. As a representative high-order sliding mode technique, super-twisting sliding mode control (STSMC) employs an integral action to generate continuous control signals, thereby substantially mitigating chattering while preserving robustness. The theoretical foundations and finite-time convergence properties of the super-twisting algorithm for the sliding variable have been systematically established within the framework of higher-order sliding modes [
18]. In recent years, STSMC has been widely applied in underwater robotic systems. Studies by Liao et al. [
19] and Borlaug et al. [
20] have demonstrated that incorporating adaptive mechanisms into STSMC can significantly reduce high-frequency chattering and improve position tracking accuracy.
In addition, active disturbance compensation constitutes another effective pathway for improving control performance. By employing disturbance observers (DOBs) [
21] or sliding mode observers to estimate model uncertainties and external disturbances and provide feedforward compensation, the burden on the feedback controller can be significantly alleviated [
22]. Among various observer designs, the extended state observer (ESO) exhibits notable advantages due to its ability to treat internal uncertainties and external disturbances as a lumped disturbance and estimate them in a unified manner, while relying minimally on an accurate system model. Consequently, ESO has shown great potential in manipulator control applications. For instance, Wang et al. [
23,
24] combined a finite-time ESO with terminal sliding mode control to achieve fast and stable control performance under complex hydrodynamic conditions.
Although these studies have improved trajectory tracking performance to some extent, several practical issues remain insufficiently addressed. First, most STSMC–ESO-based approaches focus primarily on chattering mitigation and disturbance rejection, while actuator torque saturation—an inevitable constraint in practical underwater manipulators—is rarely incorporated into the control design and stability analysis. Actuator saturation may lead to performance degradation, loss of sliding motion, and delayed recovery after saturation release. From a general control perspective, this problem is closely related to the classical anti-windup framework. Second, the interaction between disturbance estimation, sliding dynamics, and saturation recovery is often treated separately rather than within a unified stability-oriented framework. Therefore, achieving coordinated disturbance rejection, chattering mitigation, and saturation-aware robustness remains a critical challenge. To address these issues, this paper proposes a saturation-aware trajectory tracking control scheme that integrates super-twisting sliding mode control (STSMC) with an extended state observer (ESO) and an auxiliary anti-saturation dynamic system. The main contributions of this work are summarized as follows:
An STSMC–ESO composite framework is developed for underwater manipulators, where the ESO estimates lumped disturbances in real time to provide feedforward compensation, thereby reducing reliance on high feedback gains.
An auxiliary anti-saturation dynamic system is introduced to explicitly address actuator torque constraints, improving tracking robustness and accelerating recovery after saturation release.
A unified Lyapunov-based stability analysis of the overall closed-loop system is provided, demonstrating global exponential asymptotic stability under saturation constraints.
The remainder of this paper is organized as follows.
Section 2 presents the dynamic model of the underwater manipulator incorporating hydrodynamic effects and lumped uncertainties.
Section 3 describes the design of the anti-saturation auxiliary system, the ESO, and the super-twisting sliding mode controller, followed by the stability analysis.
Section 4 provides comparative simulation results and discussion. Finally,
Section 5 concludes the paper and outlines future research directions.
4. Simulation Comparisons and Results Analysis
To validate the effectiveness of the proposed super-twisting sliding mode control (STSMC) strategy with input saturation constraints combined with an extended state observer (ESO), a six-degree-of-freedom underwater manipulator system, Oberon7, is selected as the research object. Numerical simulations are conducted in Gazebo 11.10.2 (Open Source Robotics Foundation) under Ubuntu 20.04, as illustrated in
Figure 2.
The model parameters of Oberon7 include link masses, center-of-mass positions, and inertia tensors, which are listed in
Table 1. The motion ranges and maximum angular velocity limits of each joint are given in
Table 2. To ensure that the generated reference trajectories satisfy the physical constraints of the actuators, the amplitude and frequency parameters of the sinusoidal trajectories are individually designed for each joint, such that the resulting trajectories do not exceed the prescribed maximum velocity limits throughout the entire motion.
Table 1 presents the detailed model parameters of the Oberon7 manipulator, including the mass, center-of-mass position, and inertia tensor of each link.
Table 2 summarizes the allowable joint motion ranges and maximum angular velocities.
For performance comparison, three control schemes are considered:
Conventional sliding mode control (SMC);
Super-twisting sliding mode control (STSMC);
STSMC combined with an extended state observer (STSMC + ESO).
The corresponding control laws are given as follows:
To limit the length of this section, only the tracking results of the first three key joints are presented, since the remaining joints exhibit similar response characteristics. The controller parameters for these three joints are listed in
Table 3.
The controller parameters in
Table 3 are determined following a structured bandwidth-based and time-scale separation design procedure.
First, the sliding surface gain λ is selected to ensure that the closed-loop error dynamics are faster than the dominant reference frequency. Accordingly, progressively larger values (4, 6, and 10) are adopted for Joint1–Joint3, providing faster convergence for joints with stronger coupling and higher disturbance sensitivity.
Second, the SMC switching gain k is chosen slightly above the estimated upper bound of the equivalent disturbance magnitude. The larger value for Joint2 (28) reflects its higher disturbance level, while moderate values (18 and 20) are sufficient for the remaining joints.
Third, the STSMC gains and are determined according to the estimated upper bound of the disturbance to ensure rapid convergence of the sliding variable. The gains increase from Joint1 (2, 1) to Joint3 (12, 8), consistent with the increasing disturbance variation rate and coupling intensity.
Fourth, the ESO bandwidth is selected to be on the same order as, or moderately higher than, the closed-loop bandwidth implied by the sliding surface gain (), while avoiding excessive noise amplification. Thus, larger bandwidths (10, 10) are assigned to Joint2 and Joint3, whereas a moderate value (5) is used for Joint1.
Finally, the anti-saturation auxiliary gains and are chosen according to time-scale separation principles such that the saturation recovery dynamics are faster than the nominal closed-loop dynamics. For the joint with higher torque demand (Joint3), larger gains (15, 15) are adopted to accelerate saturation recovery, while moderate gains (1, 1) are sufficient for Joint1 and Joint2.
To ensure that the reference trajectories satisfy amplitude and velocity constraints while maintaining sufficient smoothness, sinusoidal functions are adopted to generate the desired joint trajectories:
where
,
denote the desired joint position, velocity, and acceleration, respectively. The midpoint of the joint motion range is defined as
and the trajectory amplitude is
where
is a scaling factor, chosen as
= 0.8 in this study. The trajectory frequency is determined by
to ensure that the desired velocity does not exceed the specified maximum joint velocity. The phase angle is given by
which guarantees that the trajectory satisfies the initial condition
.
This trajectory design not only satisfies physical constraints but also provides sufficient smoothness and trackability, making it well suited for sliding mode control performance evaluation.
To evaluate the robustness of the proposed controller against modeling inaccuracies and unmodeled dynamics, a sensitivity analysis is conducted. Specifically, key hydrodynamic parameters (including added mass and drag coefficients) are perturbed by ±30% to emulate modeling errors and environmental variability. In addition, external time-varying disturbance terms are injected into the system to simulate non-stationary environmental effects. It should be emphasized that the simulated UVMS operates in a Gazebo underwater environment where hydrodynamic effects and ambient current disturbances are already included through the underwater simulation plugins (e.g., buoyancy, added mass, hydrodynamic damping, and current-induced loads). Therefore, the closed-loop system is continuously subject to environmental disturbances during the entire simulation. In addition to these environment-induced disturbances, we superimpose an explicit sinusoidal disturbance term to provide a repeatable and controlled benchmark for fair comparison among different control strategies. External disturbances are introduced as follows:
To evaluate tracking accuracy, disturbance rejection, and control smoothness, a set of standard time-domain indices is reported for each joint and each controller. We compute the root-mean-square error (RMSE) and ITAE
. To quantify control effort, we report the RMSE of torque. Moreover, to objectively characterize chattering/high-frequency oscillations, we introduce two additional indicators: (i)
computed from the numerical derivative of
, and (ii) the high-frequency energy ratio
, defined as the ratio of spectral energy of
above a cutoff frequency
to the total spectral energy. In this study,
is used. All indices are computed from the logged trajectories using a unified evaluation script to ensure a fair comparison.
Table 4,
Table 5 and
Table 6 report the above metrics for the azimuth, shoulder, and elbow joints, respectively.
Figure 3a,b present the position responses and tracking errors of the elbow joint under the three control schemes. All controllers are capable of tracking the reference trajectory; however, notable performance differences can be observed. During periods of strong disturbances (e.g., t = 15–20 s), conventional SMC exhibits pronounced error fluctuations and inferior steady-state accuracy, with a maximum tracking error of 0.19848 rad and an RMS error of 0.06638 rad.
In contrast, STSMC significantly improves tracking accuracy, reducing the RMS error to 0.03399 rad. When ESO is further incorporated, the STSMC + ESO scheme achieves the best performance, with the RMS error further reduced to 0.0202 rad and the maximum error constrained within 0.21921 rad. Moreover, the error curve remains smooth throughout the entire process, clearly demonstrating the effectiveness of disturbance observation in enhancing tracking accuracy and robustness.
Figure 4 compares the control torque outputs. Conventional SMC exhibits abrupt torque variations around t = 3 s, with a peak value of 193.05 N·m, which may induce actuator shocks. In contrast, STSMC and STSMC + ESO generate significantly smoother control inputs, with peak torques limited to 221.79 N·m and 257.04 N·m, respectively, both remaining within saturation limits. Although STSMC + ESO produces a slightly higher RMS torque (139.23 N·m) and increased control energy consumption (∫|τ|dt = 3895.40 N·m·s), it responds more rapidly to disturbances and exhibits better matching between control effort and disturbance compensation, indicating stronger adaptive and dynamic compensation capabilities.
Figure 5a illustrates the ESO estimation performance, where
denote the estimated position and velocity, and
represents the estimated disturbance. The results show that the ESO achieves rapid convergence of state estimates, and the disturbance estimate closely follows the trend of the external disturbance
after approximately t ≈ 3 s, confirming the observer’s real-time performance and estimation accuracy. It is important to clarify that the lumped disturbance term includes not only the external disturbance input, but also unmodeled dynamics, parametric uncertainties, hydrodynamic coupling effects, friction, and modeling errors. Therefore, a perfect match between the injected disturbance and the ESO estimation is not generally expected.
Figure 5b shows the dynamic evolution of the ESO bandwidth and observer gains, where
reach their nominal operating values at approximately t ≈ 4 s. The adaptive bandwidth design effectively mitigates the initial differentiation peak problem commonly encountered in linear ESO structures.
To clarify the practical advantage of ESO, an ablation study is conducted on the elbow joint by comparing: (i) the proposed STSMC + ESO with moderate feedback gains, and (ii) a high-gain STSMC without ESO, whose gains are increased to compensate for disturbances under the same torque saturation limit
. The results are summarized in
Figure 6 and
Table 7.
As shown in
Table 7, STSMC + ESO achieves smaller RMS/peak tracking errors than the high-gain STSMC baseline. More importantly, the high-gain STSMC (without ESO) exhibits a substantially larger torque variation index (RMS
) and a higher saturation time ratio, indicating more severe chattering and more frequent saturation exposure.
These results demonstrate that the ESO provides active lumped-disturbance compensation in a feedforward manner, which reduces the required feedback gain margin for achieving similar tracking performance. Consequently, the proposed STSMC + ESO yields smoother control effort, mitigates saturation, and improves recovery behavior under disturbance transients.
To evaluate the influence of the ESO bandwidth parameter, a sensitivity study is conducted by varying
among 5, 10, and 15 while keeping other parameters unchanged. As shown in
Table 8, increasing the bandwidth from 5 to 10 significantly improves tracking performance, reducing RMSE by approximately 13.5% and ITAE by 24.1%. However, further increasing the bandwidth to 15 yields only a marginal improvement in tracking accuracy while increasing the high-frequency energy ratio
, indicating enhanced noise amplification. Therefore,
is selected as a compromise between estimation speed and robustness.
The comparative results of the three control schemes for the Shoulder joint are shown in
Figure 7 and
Figure 8. As depicted in
Figure 7a, all approaches are capable of tracking the desired trajectory with generally satisfactory performance, achieving high-precision trajectory regulation overall. The position tracking errors are compared in
Figure 7b. It can be observed that the conventional SMC exhibits a slight divergence in the mid-to-late stage, with a maximum error of 0.24544 rad and an RMS error of 0.04426 rad, indicating relatively large overall deviations. After introducing STSMC, the error is significantly reduced, with the RMS error decreasing to 0.03250 rad. With further incorporation of the ESO compensation mechanism, the error convergence is enhanced, and the RMS error is further reduced to 0.02956 rad, accompanied by smaller error fluctuations and noticeably improved tracking stability.
Figure 8a illustrates the variation in the control input torques. Due to the heavy load and strong coupling of the Shoulder joint with other joints, all three controllers generate control signals with relatively large magnitudes. The maximum output torque of SMC reaches 241.62 N·m, while STSMC and STSMC + ESO reach 293.63 N·m and 343.58 N·m, respectively. Although the latter exhibits higher control energy consumption (the integral control effort is 2640.72 N·m·s), it achieves superior tracking accuracy and faster recovery capability, indicating a favorable trade-off between control cost and performance. Compared with SMC (0.0056), STSMC reduces
to 0.0021 (62% reduction), and STSMC + ESO further reduces it to 0.0014 (75% reduction), indicating a significant attenuation of high-frequency oscillations.
Figure 8b presents the ESO estimation capability for the system states and the lumped disturbance. The estimated position and velocity closely match the true values, demonstrating the effectiveness of the ESO in state observation. Regarding disturbance estimation, due to complex nonlinear coupling and parametric uncertainties affecting the Shoulder joint, the ESO estimate exhibits a certain deviation from the reference disturbance.
It is worth noting that the disturbance estimation error of the Shoulder joint is noticeably larger than that of the other joints. This phenomenon can be mainly attributed to two factors. First, as a proximal joint with higher equivalent inertia and stronger nonlinear coupling, the Shoulder joint is subject to more complex configuration-dependent dynamics, which makes the lumped disturbance inherently more difficult to approximate through a single matched disturbance channel. Second, the Shoulder joint operates under higher torque demand and may approach saturation limits more frequently, while sampling and noise constraints limit the achievable observer bandwidth. These factors introduce additional discrepancy between the nominal observer model and the actual plant dynamics, resulting in a noticeable estimation bias.
It should be emphasized that the ESO estimation does not aim to reproduce the injected disturbance waveform exactly, but to provide a real-time compensation signal for the lumped disturbance. Therefore, the effectiveness of the ESO is evaluated primarily based on closed-loop tracking performance and robustness rather than pointwise estimation accuracy.
The comparative results for the Azimuth joint are shown in
Figure 9 and
Figure 10.
Figure 9a presents the position tracking curves under the three controllers, where all schemes achieve satisfactory tracking performance. Nevertheless, noticeable differences arise in tracking accuracy during disturbance intervals. The corresponding tracking errors are compared in
Figure 9b. The STSMC scheme exhibits faster error convergence, with the maximum error decreasing from 0.03871 rad to 0.03658 rad, and the RMS error decreasing from 0.01586 rad to 0.01054 rad. By further introducing the ESO, the system’s robustness against disturbances is strengthened via state observation and compensation, leading to an RMS error of 0.00915 rad and a reduced maximum error of 0.03026 rad, achieving the best tracking performance among the three methods.
Figure 10a compares the control input responses. Within
,
reaches 120.984 N·m/s and the high-frequency energy ratio
increases to 0.04299, which are respectively 9.02× and 49.40× larger than those in the preceding interval. This confirms the localized chattering behavior of SMC. In contrast, STSMC and STSMC + ESO significantly reduce high-frequency torque activity, resulting in smoother control inputs. Although its RMS torque is slightly higher (9.53 N·m) and the accumulated control energy is 241.36 N·m·s (slightly higher than SMC at 200.25 N·m·s), it provides a clear cost–performance advantage in terms of improved tracking accuracy.
Figure 10b further demonstrates the ESO’s estimation capability for the Azimuth joint. The estimates of position, velocity, and the lumped disturbance are highly consistent with the true values, indicating that the ESO effectively captures unmodeled dynamics and external disturbances, thereby providing reliable support for feedforward compensation.
To verify the effectiveness of the input saturation compensation mechanism under actuator-limited conditions, the elbow joint is selected as a representative case. The maximum torque limit is set to 200 N·m, and the performance of the STSMC + ESO controller without and with saturation compensation is compared. The simulation results are shown in
Figure 10 and
Figure 11.
Figure 11a,b depict the position tracking results and the corresponding tracking errors, respectively. Throughout the overall tracking process, both schemes can largely follow the desired trajectory; however, a clear divergence occurs around t ≈ 24 s, which corresponds to the phase when the control torque transitions from saturation back to the normal operating range. Without compensation, the controller fails to promptly re-establish effective control after the torque recovers, leading to a pronounced short-term trajectory deviation, with a maximum negative error reaching −0.08 rad. In contrast, with the saturation compensation mechanism, the RMS tracking error is reduced from 0.03726 rad to 0.03242 rad, and the controller can rapidly re-enter an effective control region after saturation is released, thereby suppressing the error quickly and restoring the system state.
Figure 12a compares the control torques. Under external disturbances, both schemes generate torque outputs approaching saturation. In the uncompensated case, once the torque reaches the upper bound (200 N·m), the control input cannot be adjusted in time. The recovery time
is defined as the time required for the tracking error to re-enter and remain within a prescribed band (
for 0.5 s) after saturation is released. The anti-saturation scheme reduces the time spent in full saturation from 9.58 s to 7.35 s and shortens the post-saturation recovery time from 2.754 s to 0.274 s, corresponding to a 90.05% reduction in
.
Figure 12b shows the dynamic evolution of the sliding variable s(t). In the uncompensated case, an evident “sliding-mode escape” occurs near 24 s, i.e., the system state temporarily departs from the sliding surface, which manifests as degradation or loss of sliding-mode effectiveness. In contrast, with saturation compensation, s(t) remains bounded with stable oscillations, and the control loop remains effective, demonstrating that the compensation mechanism not only improves error convergence but also enhances sliding-surface maintenance.
In summary, the proposed saturation compensation mechanism can prevent loss of control when the control output approaches the actuator limit. More importantly, it enables rapid restoration of effective control after saturation is released, shortens the transient error recovery process, and prevents error accumulation and trajectory drift caused by hysteresis effects. This property is particularly important for complex underwater operations involving frequent disturbances and large-amplitude maneuvers, highlighting the practical adaptability and safety advantages of the proposed control strategy.
In principle, the proposed anti-saturation mechanism is joint-independent and can be directly applied to all joints. In this work, the elbow joint is selected as a representative case for clarity and brevity.
In this study, the proposed controller is compared with conventional SMC and STSMC, since these methods share the same sliding-mode framework and provide a fair baseline for evaluating the contributions of the ESO compensation and the anti-saturation auxiliary dynamics.
It is worth noting that other advanced approaches have also been reported for underwater manipulator control, such as adaptive fuzzy sliding mode control [
10], Gaussian-process-assisted MPC [
8], and prescribed performance control [
15]. These methods can achieve high tracking accuracy, but typically require either online optimization (MPC), complex rule-based design and tuning (fuzzy-based control), or careful selection of performance functions and parameters (prescribed performance control). Therefore, implementing and tuning these heterogeneous controllers in a fully fair manner is nontrivial and is beyond the scope of this work.
From the perspective of real-time computation, the proposed method remains computationally lightweight. The ESO consists of linear state updates with a time-varying bandwidth, the super-twisting law involves only algebraic operations and integration, and the anti-saturation auxiliary system is a low-order linear dynamic system. Therefore, the proposed controller does not introduce an optimization loop and is suitable for real-time implementation on embedded robotic platforms.