1. Introduction
In recent years, the motion control of unmanned marine vehicles (UMVs) has garnered increasing attention owing to their wide-ranging applications in scientific exploration, marine resource surveying, and defense operations [
1,
2]. To satisfy the demanding requirements of maritime missions for fast, precise, and robust performance, various advanced control strategies have been developed, including dynamic positioning (DP) control [
3], tracking control [
4,
5], and path-following control [
6]. Due to the highly nonlinear and strongly coupled characteristics of ships, DP control has consistently remained a challenging problem.
The finite-time control approach has recently gained popularity due to its high precision and rapid convergence. For UMVs, a resilient finite-time control methodology was proposed in [
7]. Additionally, a nonsingular terminal sliding-mode (TSM) controller augmented with an extended state observer was developed to achieve finite-time convergence [
8]. Furthermore, an adaptive parameter adjustment scheme leveraging both system tracking discrepancies and adaptive estimation errors was introduced, leading to the design of an adaptive finite-time controller based on second-order terminal sliding-mode control (TSMC) [
9,
10]. However, a significant limitation of finite-time control methods is their inherent dependence on the initial state, resulting in settling times that vary with different initial conditions. To address these limitations, Polyakov first proposed the fixed-time stability theory in [
11]. Unlike finite-time stability, fixed-time stability guarantees that the upper bound of convergence time is independent of initial conditions, relying solely on design parameters. In [
12], a novel fixed-time sliding-mode control strategy incorporating a time-varying threshold and additional parameters was proposed to address the trajectory tracking problem for autonomous surface vehicles. Further, an adaptive practical fixed-time sliding-mode controller was developed in [
13] to handle model uncertainties and external disturbances affecting autonomous surface vehicles. In [
14], a fast fixed-time sliding-mode trajectory tracking control strategy is proposed for autonomous surface vehicles, and faster convergence is achieved through a dual-phase state acceleration mechanism with a gain scheduling function. In [
15], a global fixed-time stability result was derived by combining the power and hyperbolic cosine functions, and a fixed-time sliding-mode controller was developed accordingly. Moreover, when sensor measurements and control commands of UMVs are transmitted over bandwidth-limited network channels, signal quantization inevitably introduces quantization errors, degrading the control accuracy and robustness [
16,
17]. However, most existing fixed-time control methods based on terminal sliding mode under quantization conditions can only confine system states within a bounded region, failing to achieve exact convergence to the origin. Therefore, how to design a fixed-time control scheme with a dynamic parameter adjustment strategy is the first motivation of this study.
On the other hand, UMVs are routinely exposed to environmental disturbances, uncertainties in hydrodynamic coefficients, modeling errors, and parameter variations, further complicating accurate dynamic modeling [
18,
19]. To address this problem, recurrent neural networks have emerged as an effective solution due to their capability for approximating unknown parameters [
20,
21]. However, traditional recurrent neural networks structures suffer from short-term memory limitations and are prone to losing critical historical information, thus restricting their effectiveness in capturing highly fluctuating temporal features [
22]. In estimating system uncertainties, long short-term memory networks have shown considerable advantages owing to their superior learning capabilities. In [
23], a long short-term memory estimator integrating a selective update strategy was proposed to capture rapidly fluctuating temporal information for accurate sideslip angle estimation, based on which a learning line-of-sight guidance law for path following was developed. In [
24], a long short-term memory model incorporating an enhanced forget-gate mechanism was developed, demonstrating superior performance in wind power forecasting tasks. Moreover, a read-first long short-term memory architecture introduced in [
25] employs augmented read gates to enhance interactions among control gates, improve feature extraction during the encoding phase, and strengthen long-term memory capabilities. However, traditional long short-term memory networks exhibit certain limitations in capturing temporal dependencies within the gradient flow, which motivates the current study.
With the growing demand for high performance, safety, and reliability in UMV control systems, it is essential to develop effective fault-tolerant control (FTC) strategies, as actuator failures may lead to severe performance degradation or even mission failure. The authors of [
26] propose an FTC strategy that employs an adaptive mechanism to estimate the upper bounds of time-varying stuck faults and external disturbances, without presuming prior knowledge of the lower bounds of thruster faults. The authors of [
27] develop an integral sliding-mode, output-feedback FTC scheme for UMVs subject to unknown dynamics, thruster faults, and external disturbances; the dynamic-positioning objective is ensured via a high-gain compensator using the available outputs. To our knowledge, designing an effective fixed-time FTC scheme for UMVs under quantization errors and ocean disturbances remains a formidable challenge.
Based on the above analysis, this paper proposes a fixed-time FTC strategy utilizing an improved long short-term memory network to address the DP problem of UMVs under signal quantization, disturbances, and input saturation constraints. The main contributions are as follows:
(1) Unlike the long short-term memory networks in [
23,
24], this paper proposes a long short-term memory model integrated with adaptive mixed gradient (AMG) optimization algorithm, which enables more efficient capture of temporal features in gradient dynamics and substantially enhances the network’s capacity for modeling marine disturbances.
(2) Compared with [
16,
17], an improved dynamic quantization adjustment strategy is proposed by presetting the initial quantization threshold, which expands the upper bound of the quantization parameter range and enables more effective compensation for quantization errors.
(3) This paper proposes a fixed-time fault-tolerant controller that ensures the DP error converges to a small neighborhood of the equilibrium point within a finite time, independent of the initial state.
The rest of this article is organized as follows. The problem description and the quantizer model are provided in
Section 2. A long short-term memory network structure combined with a long short-term memory optimization algorithm is developed in
Section 3. The controller design and stability analysis are presented in
Section 4. Simulation studies are demonstrated in
Section 5. Finally,
Section 6 concludes this article.
Notation: We write for the set of real numbers, for the space of n-dimensional real column vectors and for the space of real matrices. For a matrix A, denotes the induced Euclidean norm, and denotes the transpose of A. Let diag be the diagonal matrix with diagonal entries . For , define , where is the sign function. denotes the rounding (integer-valued) function, denotes the minimum of its arguments.
3. Improved LSTM Network
The long short-term memory network, a variant of recurrent neural networks, effectively addresses the challenges of capturing long-term dependencies inherent in sequential data. Central to long short-term memory is its unique gating mechanism, comprising three primary gates: the input gate, forget gate, and output gate. These gates collaboratively regulate the flow and retention of information within the network, dynamically updating the internal cell states. Through this sophisticated gating architecture, long short-term memory significantly mitigates common issues in traditional recurrent neural networks, such as gradient vanishing and exploding, thereby achieving enhanced performance in processing and predicting temporal sequences compared to conventional models. Therefore, to effectively predict the lumped disturbances in system (
15), a long short-term memory network is employed in this paper. The network structure is shown in
Figure 3 and can be expressed as follows.
where
, and
represent bias vectors,
,
and
denote weight matrices, and
is an intermediate variable. The central innovation of long short-term memory lies in its memory cell
, which selectively accumulates and updates state information using three gating mechanisms: the input gate
, forget gate
, and output gate
.
To precisely capture temporal dependencies within the gradient flow, this paper introduces an AMG optimization algorithm, whose detailed implementation is presented in Algorithm 1.
| Algorithm 1 AMG Optimization Algorithm with Temporal Dependencies |
Input: Learning rate ; decay rates ; memory factor ; stochastic objective function is defined to describe the performance index of the system under stochastic disturbances; initial values , , Output: Optimized parameters
- 1:
Initialize , , - 2:
while not converged do - 3:
- 4:
Compute gradient: - 5:
Update learning rate: - 6:
Update first moment: - 7:
Temporal average: - 8:
Update second moment: - 9:
Temporal average: - 10:
Update parameters: - 11:
end while
|
Subsequently, a fixed-time extended state observer is constructed to estimate the thruster faults
, whose dynamics are expressed as follows:
where
and
,
,
, which satisfies
, and
.
denotes the estimate of the thruster fault
. Subsequently, the error dynamics will be derived as follows:
where
,
; according to the Theorem 2 in [
1], the errors
and
will converge to zero in fixed-time
. The convergence time
is
where
,
,
,
,
and
are positive constants.
Remark 3. The exhibits continuous differentiability over time, satisfying the condition , where denotes the upper bound of .
5. Simulation Results
In this section, simulations are conducted on a floating production vessel to demonstrate the effectiveness of the proposed fixed-time FTC strategy. Following [
28], the model matrices are adopted. The specific numerical values of
M,
D,
Q,
B, and the disturbance
used in system (1) are summarized in
Table 1.
The model uncertainties account for
and
is considered as
;
;
. The data of the UMV were collected offline using an ocean disturbance dataset, which was employed to train the long short-term memory neural network in an offline manner. The long short-term memory model consists of five layers, with a seven-dimensional input comprising six input features and one target output. The objective function used for training is the minimization of the mean squared error of the prediction. To update the weights, the adaptive mixed gradient optimization algorithm optimizer is utilized, where the initial learning rate is 0.01 and the momentum factor is 0.7. The total uncertainty is estimated using the proposed long short-term memory network integrated with the AMG optimization algorithm.
Figure 4 illustrates the predicted curve of the total uncertainty obtained by the long short-term memory. The minimal error between the predicted and actual values effectively demonstrates the accuracy of the proposed estimation approach. The following fault scenario is considered: the port main thruster experiences a 10% reduction in effectiveness, and the starboard main propeller is stuck at
, while the remaining thrusters operate normally. The corresponding design parameters are specified as follows:
,
,
,
,
,
,
. All numerical simulations in this paper were carried out in the MATLAB R2024a environment. The gain adjustment follows a performance-driven procedure grounded in Lyapunov analysis. Starting from the prescribed performance pair
, the performance indices are fixed. We then derive a feasible region for the controller gains from the Lyapunov inequality/theorem and compute (or update) the gains by solving the associated LMI with the constraint
. Finally, a scaling refinement is applied under actuator-saturation and anti-chattering requirements to ensure both a certified fixed-time convergence upper bound and practical implementability. To visualize the DP performance, the
Figure 5 shows the UMV trajectory on the 3–D
plane. The initial position is approximately
. Under the proposed controller, the vessel undergoes a brief adjustment and rotation, rapidly converges toward the origin, and performs small refinements in its vicinity. The terminal positioning error is about
. These results indicate that the proposed method achieves smooth, stable dynamic positioning with high-precision convergence in the plane.
As reported in
Table 2, the AMG algorithm delivers the best accuracy: relative to Adaptive Moment (Adam), root-mean-square error (RMSE) and mean absolute error (MAE) drop by 4.5% and 17%, and symmetric mean absolute percentage error (SMAPE) reaches a minimum of 3.46%.
5.1. Hardware-in-the-Loop Validation of the Proposed Control Scheme
To further validate the feasibility of the proposed control method, a hardware-in-the-loop (HiL) simulation platform is established for testing. As shown in
Figure 6, the HiL platform is composed of a Raspberry Pi 4B controller that runs Ubuntu 20.04 and ROS Noetic, and a model computer that runs Windows 11 and MATLAB 2024a. Due to the hardware limitations of the Raspberry Pi and cross-system data transfer requirements, it is necessary to increase the solution step size and modify multiple design parameters. The other settings are the same as described above.
The pose data of the UMV are recorded via data acquisition modules in Simulink and exported to MATLAB for plotting, the specific results are shown in
Figure 7. Although the results are inferior to software simulation, the control objectives are achieved, which demonstrates the feasibility for future engineering applications.
Remark 6. The proposed control method is implemented on a Raspberry Pi 4B development board for hardware-in-the-loop simulation testing. The results demonstrate that the method exhibits a certain degree of real-time capability. Future work will explore model compression techniques (e.g., quantization, pruning) and efficient architectures to enable real-time operation under hardware constraints.
Remark 7. The optimizer’s hyperparameters—learning rate (), momentum (), and decay factor ()—were selected via grid search and validated across simulation scenarios. Performance remained stable within moderate ranges (e.g., ), whereas extreme values led to divergence or oscillation.
5.2. Comparative Analysis of Fixed-Time TSMC and Finite-Time TSMC
The simulation results in
Figure 8 show that the proposed approach achieves noticeably faster convergence of both positions and velocities than the finite-time TSMC and fixed-time TSMC baselines. In particular, relative to the finite-time controller in [
7], the settling time is substantially shortened. This acceleration translates into enhanced closed-loop stability and a quicker transient response properties that are essential for real-time applications demanding high precision and rapid regulation. Furthermore, the corresponding control efforts in
Figure 9 and
Figure 10 remain uniformly bounded, with smaller peaks and quicker decay than the finite-time TSMC baseline.
5.3. Comparative Analysis of Dynamic Adjustment of Quantization Parameters
To assess the proposed scheme for dynamic adjustment of quantization parameters in Algorithm 2, we compare it against the uniform static-quantization controller reported in [
17]. As shown in
Figure 11, the dynamic-tuning strategy drives the state to equilibrium noticeably faster. Continuous adjustment of the quantization parameters suppresses the error throughout the transient, whereas the static design attains stability more slowly and with larger overshoot. These results confirm the superiority of the dynamic adjustment mechanism.
| Algorithm 2 Adjustment Strategy of Quantization Parameter |
- 1:
Input: Select , the parameter is chosen as , , where is a positive constant and is defined in Theorem 1. - 2:
Output: - 3:
; - 4:
; - 5:
while do - 6:
- 7:
if , then - 8:
; ; - 9:
else - 10:
; - 11:
; - 12:
end if - 13:
end while
|
5.4. Comparative Analysis of the Impact of Input Saturation
To further evaluate the performance of the proposed anti-saturation compensation controller, we simulated the controller with and without input-saturation compensation. As depicted in
Figure 12, the saturation-aware scheme drives the position, yaw, and velocity states to the origin more rapidly and with reduced overshoot. After a short transient, all states remain bounded and converge to zero, whereas the saturation-neglecting controller develops increasing oscillations and ultimately destabilizes the system. Hence, explicitly incorporating thruster limits markedly improves closed-loop robustness.
5.5. Comparative Analysis of the Impact of Thruster Faults
In this subsection, we compare a baseline without fault accommodation to the proposed FTC. We consider the following fault scenario: for
, all thrusters operate normally; for
, the port main thruster and aft tunnel thruster I are stuck at
and
, respectively; the starboard main thruster experiences a
loss of effectiveness; the remaining thrusters operate nominally. As shown in
Figure 13 and
Figure 14, without fault handling the surge/sway/yaw velocities and the position/attitude states exhibit pronounced oscillations and noticeable steady-state offsets. In contrast, under the proposed FTC all states and control inputs remain bounded and converge rapidly to the desired equilibrium despite thruster faults, demonstrating closed-loop stability and effective fault mitigation.
5.6. Comparative Analysis of the Impact of Long Short-Term Memory Network and Adaptive Law
To provide a more comprehensive evaluation of the proposed controller, we compared its performance with that of the controller based on the adaptive law presented in [
7,
9]. The simulation results are shown in
Figure 15. As shown in
Figure 15, due to the superior capability of neural network in approximating nonlinear functions, the state curve demonstrates enhanced stability and robustness when subjected to time-varying disturbances. However, the adaptive parameters vary with these time-varying disturbances, which can lead to degraded control performance and cause the system state to diverge.
Remark 8. In the comparative simulations, the same disturbances, input constraints, initial conditions, simulation time, and solution step size are set, although different control methods are used for the comparison, which may have resulted in different design parameters for the controllers. However, all parameters are adjusted as much as possible according to the same standards (such as control accuracy, control input size, and response speed). Therefore, a fair comparison may be guaranteed.