1. Introduction
Marine resource exploitation and intelligent shipping are advancing rapidly, making the intelligence and automation of vessel motion control systems increasingly imperative. Serving as a vital waterborne transportation platform, vessels during navigation are greatly influenced by uncertain model parameters, strong dynamic nonlinearities, and complex, time-varying marine environments (for instance, random disturbances caused by wind, waves, and currents) [
1]. An additional critical challenge arises because many vessels depend exclusively on propellers and rudders, which yields fewer independent control inputs relative to the system’s degrees of freedom—a characteristic underactuated configuration [
2,
3]. This characteristic greatly increases the difficulty of vessel motion control. In practical navigation operations, vessels are often required to strictly follow predefined geometric paths (such as shipping lanes, patrol boundaries, or survey routes). Among various navigation tasks, path following control has attracted substantial attention because it meets the operational demands for flexibility and economy. It now stands as a central research topic in the control of underactuated Unmanned Surface Vehicles (USVs) [
4,
5,
6]. Therefore, under novel architectures with coexistence of internal and external composite uncertainties, designing a path following control scheme capable of estimating and compensating for unknown time-varying sideslip angles, dynamic uncertainties, and environmental disturbances constitutes a critical technical challenge for ensuring the safe and reliable operation of intelligent vessels, possessing significant theoretical value and urgent practical implications. In recent years, numerous domestic and international scholars have proposed diversified strategies for the tracking control of underactuated USVs from various theoretical perspectives. Among these, path following refers to USVs traveling along a predetermined, time-independent spatial path at a specified speed [
7], representing one of the fundamental tasks in underactuated USV maneuvering control [
8]. To achieve this objective, Line-of-Sight (LOS) guidance-based methods have become the mainstream choice. [
9,
10,
11,
12]. LOS guidance fundamentally transforms path following into a heading angle tracking task: by directing the vessel toward a virtual point situated a lookahead distance along the reference path, the cross-track error is driven to converge [
13]. However, underactuated USVs lack independent transverse thrusters; lateral motion is generated through the coupling between longitudinal thrust and rudder forces. This particularity renders many advanced strategies difficult to apply directly. Furthermore, conventional LOS guidance suffers from the major drawback of ignoring sideslip angles. As a result, steady-state cross-track errors persist under environmental disturbances, and sideslip effects inevitably arise during path tracking [
14,
15], which further leads to inevitable sideslip phenomena when vessels track desired paths in complex navigation environments [
16]. Therefore, accurate estimation and effective compensation of sideslip angles constitute the key to improving USV path following performance and enhancing tracking accuracy.
To address this issue, Borhaug E et al. [
12] originally proposed Integral LOS guidance (ILOS), which introduces an integral action to cancel steady-state errors from constant unknown sideslip angles. This approach was later extended to curved paths. However, its effectiveness drops sharply when sideslip angles change rapidly with sea conditions. The Adaptive LOS (ALOS) method was subsequently introduced by Fossen [
10,
17] to estimate the sideslip angle online via adaptive laws and to deliver feedforward compensation. Despite being computationally more efficient than ILOS [
13,
18], the ALOS approach still suffers from limited estimation accuracy. Building upon this foundation, Sun et al. [
19] proposed a time-varying sideslip angle compensation method based on the ALOS that estimates sideslip angles in real time to enhance adaptability to time-varying disturbances. Nevertheless, their guidance law is tailored to straight-line paths and cannot be directly applied to curved following tasks. In another development, Cheng [
20] proposed a hybrid LOS guidance law that combines ILOS with hybrid LOS through a fuzzy system for small and large sideslip angle scenarios, respectively. However, the design of fuzzy switching rules relies on empirical parameter tuning, and frequent threshold crossings of sideslip angles can induce chattering issues. Furthermore, Dong et al. [
21] conducted a systematic review and improvement of LOS guidance law design for sideslip angle compensation, proposing a unified guidance framework applicable to various sea states. Xu et al. [
22] simultaneously considered sideslip angle reduction and input saturation in path following control, achieving active suppression of sideslip angles through nonlinear control methods.
The aforementioned methods essentially belong to indirect compensation. Researchers have further introduced observers for direct reconstruction of sideslip angles. An extended state observer based LOS (ESO-LOS) scheme was introduced by Yu et al. [
23], achieving effective compensation for time-varying sideslip angles. In the work of Wu et al. [
24], a reduced-order extended state observer is employed to perform real-time estimation of time-varying sideslip angles arising from environmental disturbances (e.g., wind, waves, and currents); consequently, an improved ESO-LOS is proposed that enables rapid and precise sideslip angle compensation, allowing underactuated vessels to follow desired paths with high accuracy even under challenging sea states. Building upon this, Martinović [
25] developed an Adaptive State Observer (ASO) combined with LOS guidance (ASO-LOS) for online sideslip angle compensation, validating its effectiveness on underactuated USVs. Žečević et al. [
26] proposed an ASO-based precise sideslip angle compensation method ensuring asymptotic convergence of sideslip angle compensation errors to zero. However, ASO-LOS and similar observer-based estimation methods are essentially passive and reactive, relying on feedback correction of real-time measurements, exhibiting certain lag in response to rapid dynamic changes of sideslip angles. Meanwhile, observer gains require careful tuning for specific operating conditions, and high-gain designs tend to introduce measurement noise sensitivity issues. To address these drawbacks, prediction techniques were introduced by Liu et al. [
27], who put forward a Predicted LOS method which utilizes a predictor to perform online proactive estimation of unknown time-varying sideslip angles, thereby mitigating convergence oscillations that arise from large adaptive gains and producing more precise guidance control laws. Furthermore, to enhance convergence performance, the finite-time theory has been incorporated into LOS guidance [
28]. A Finite-Time ESO-LOS guidance was developed by Yu et al. [
29], demonstrating significant advantages in convergence speed, transient performance, and the precision of sideslip angle compensation. Ma, Y. et al. [
30] proposed a finite-time sideslip observer-based model-free fast path-following control strategy for autonomous surface vehicles subject to large time-varying sideslip. Polyakov, A. and Fridman, L. [
31] studied stability notions and Lyapunov functions for sliding mode control systems. Wang et al. [
32] introduced fixed-time theory into the PLOS framework, designing a Predicted Fixed-time LOS guidance that achieves fixed-time convergence characteristics. Wang et al. [
33,
34] integrated finite-time observers with fixed-time convergence theory, which relaxes constraints on drift angles, allowing for time-varying drift with substantially larger magnitudes, and consequently enhances both the disturbance rejection capability and the accuracy of the path following system.
Motivated by the above discussion, this paper introduces adaptive mechanisms, virtual parameter learning, and prediction techniques, combined with lifelong learning mechanisms, to propose a Predicted Adaptive LOS (PALOS)-based path following control strategy. Through this strategy, online estimation and dynamic feedforward compensation are achieved for time-varying sideslip angles and environmental disturbances such as wind, waves, and currents. Only position information is required for compensation, without the need for direct velocity measurements or knowledge of environmental parameters. As a result, sensor configuration requirements are significantly lowered, while the robustness and reliability of ship motion control systems operating in complex environments are effectively enhanced. The primary contributions of this work are outlined as follows:
A novel PALOS guidance strategy is developed, breaking through the restrictive small sideslip angle constraint. Compared with the ILOS in [
12], ALOS in [
10], and ESO-LOS in [
23], this paper introduces prediction techniques into the LOS guidance framework. Unlike the passive reactive estimation based on extended state observers in [
21] or the direct sideslip angle suppression method in [
22], the proposed predictor enables rapid and precise compensation for large time-varying sideslip angles without requiring high-gain observer tuning. Thereby, steady-state cross-track errors are eliminated, and robustness to complex sea states is enhanced.
An adaptive neural network control scheme with lifelong learning mechanisms is proposed. Adaptive neural networks and virtual parameter learning techniques are employed to approximate online the lumped uncertainties comprising model parameter uncertainties, external disturbances, and input saturation, requiring adaptive update of only two parameters without persistent excitation conditions or complex fuzzy rule design. Unlike the fuzzy-based hybrid LOS method in [
20], which requires empirical rule tuning, the proposed approach substantially lowers computation. Moreover, distinct from the lifelong learning frameworks in [
35,
36], our scheme constructs and retains historical knowledge via feedback terms, enabling knowledge reuse and alleviating catastrophic forgetting.
A self-triggered knowledge retention protocol is developed to schedule the intermittent injection of historical knowledge. Unlike the distributed event-triggered mechanisms requiring continuous monitoring in [
17,
26], the proposed self-triggered scheduling function achieves adaptive scheduling of triggering instants through preset dynamic conditions, enabling on-demand reuse of historical virtual-parameter estimates, significantly reducing communication transmission load, while effectively precluding Zeno behavior.
3. PALOS Guidance for 3-DOF USVs
In this section, the design and analysis of PALOS guidance for 3-DOF USVs are presented. Prior to the guidance law design, the main variables involved in the PALOS-based path following are introduced, and the corresponding schematic diagram based on the conventional LOS is illustrated in
Figure 1. Note that
is the reference path with
being the path variable. In the control design, the path-tangential angle
can be obtained as
with
and
, where
and its first derivative
are both known and bounded. The actual position of USVs is denoted by
. To facilitate the subsequent control scheme design, the position errors, including the longitudinal error and lateral error, are defined as follows:
The sideslip angle , as a bounded unknown time-varying variable, has its estimate maintaining an equivalently bounded property. Let , , and .
By taking the derivative of
and
with respect to time, and incorporating the USV kinematic relations from (
1) yields:
In these equations,
represents the resultant velocity of the USV, which is positive and bounded in practice;
is the path parameter update speed.
In order to mitigate the influence of unknown time-varying sideslip angles on guidance performance, the predictor is formulated as follows:
where
and
denote the prediction errors;
and
are positive constants to be designed;
and
represent the estimates of
and
, respectively. This predictor achieves online estimation of the USV’s lateral error by introducing a position prediction error feedback mechanism, providing compensation basis for the subsequent guidance law design.
The update laws based on prediction errors are designed as:
where
and
are positive design constants.
From the above, the prediction error dynamics can be obtained as:
The above error dynamics equations reflect the coupling relationship between prediction errors and estimation deviations, providing a mathematical foundation for stability analysis.
To address the path following problem of USVs in the presence of unknown time-varying sideslip angles, this subsection devises a predicted compensation mechanism for error dynamics. By introducing auxiliary variables, the predictor dynamic equations shown in (
11)–(
13) are constructed, aiming to achieve online estimation of prediction errors caused by time-varying sideslip angles, model uncertainties, and external environmental disturbances, thereby providing more precise compensation basis for the subsequent controller design. The key advantage of the predictor lies in its capability to indirectly estimate the sideslip angle using only position information without direct measurement, significantly reducing sensor configuration requirements. Proceeding from this foundation, this section proceeds to analyze the convergence performance of the predictor, and the theorem below is stated.
Theorem 1. Consider the predictor system described by (
11).
With appropriately selected design parameters , , and , the prediction errors converge exponentially to a neighborhood of the origin, ensuring that all signals in the system are uniformly ultimately bounded. Proof. Select the Lyapunov candidate function as
. Differentiating
with respect to time and substituting (
13) yields:
After rearrangement, one obtains:
where
is defined as:
From the definition of , when the design parameters satisfy , , and , it follows that . Therefore, is negative definite. Invoking Lyapunov stability arguments, the prediction error dynamics are established to be uniformly ultimately bounded, featuring exponential convergence of all error trajectories to any arbitrarily small neighborhood of the equilibrium. This brings the proof of Theorem 1 to an end. □
Furthermore, the predicted adaptive LOS guidance law can be constructed as follows:
where
is a parameter to be designed which denotes the lookahead distance, and its upper and lower bounds
and
are positive constants to be designed.
Furthermore, to achieve online updating of the path parameter, the update speed
can be designed such that the USV accurately tracks the desired path:
where
being a positive constant to be selected.
Accordingly, the update law for the path parameter
is given as follows:
Based on the completed predictor design and convergence analysis, this subsection further investigates the stability of the guidance subsystem, and presents the following theorem regarding the convergence of the guidance subsystem.
Theorem 2. Consider the guidance subsystem described by (
17).
With appropriately selected design parameters , and , based on the boundedness property of , the system converges exponentially to a neighborhood of the origin. Proof. Construct the Lyapunov candidate function
. Taking the time derivative of
and substituting the guidance error dynamic equations yields:
Combining with Lemma 1, one obtains:
Substituting (
20) yields:
From Lemma 2, it follows that:
Furthermore, combining with Lemma 1, the above inequality can be written as:
Equation (
24) can be rewritten as:
By selecting appropriate design parameters such that
and
, then
where
,
, and
.
Because Theorem 1 guarantees global uniform asymptotic stability for the prediction errors
and
, once the predictor reaches its steady state and the equalities
and
are assumed, the preceding inequality reduces to a simpler one:
Under the above conditions, the designed tuning parameters can ensure that holds, and therefore is negative definite. In line with Lyapunov stability theory, the error signals of the PALOS guidance subsystem are uniformly ultimately bounded; moreover, they converge exponentially to a small region around the origin. This behavior consequently guarantees the stability of the overall system. The above line of reasoning thus completes the proof of Theorem 2. □
4. Control Design and Stability Analysis
Once the feasible desired heading angle has been generated by applying the PALOS guidance law from the previous section, the current section proceeds to incorporate adaptive neural networks, virtual parameter learning, and lifelong learning mechanisms. By utilizing a self-triggered mechanism that drives the transmission of historical information, the lumped uncertainties (i.e., model uncertainties together with external disturbances) are approximated online and compensated in a dynamic manner.
In practical engineering, all actuators inevitably face physical limitations, meaning that the forces and torques delivered by the ship’s propulsion system cannot exceed certain upper bounds. Furthermore, taking Assumption 1 and the vessel’s maneuvering behavior into account, the ship velocity v remains bounded, i.e., , with being a constant.
4.1. Surge Control
To achieve accurate and effective following of desired path, the desired surge velocity generated by surge velocity guidance as
where
and
are positive constants to be designed, which physically represent the upper and lower limits of the surge velocity of the USV, respectively. Meanwhile,
denotes another positive constant to be selected.
Subsequently, the error variable for the surge velocity is defined as follows:
By differentiating (
30) with respect to time and inserting the dynamic model (
2), we arrive at:
Combining with Assumption 2, we let be an unknown function, where the neural network input vector is described by .
According to Lemma 3, introducing the neural network approximation mechanism yields:
where
denotes the output from the hidden layer associated with the radial basis functions of the neural network. The quantity
m indicates the total number of nodes in the network, while
represents its approximation error.
On this basis, we let
. Substituting (
32) into it, and according to Assumption 1, introducing the virtual parameter online learning mechanism, one obtains:
where
is an unknown virtual parameter greater than zero, and
is a known scalar function. Meanwhile,
is the unknown upper limit of the longitudinal external time-varying environmental disturbance, from which it can be inferred that
must be a positive unknown constant.
Therefore, to solve the path following control problem of underactuated USVs under the external time-varying disturbances and model dynamic uncertainties, the surge velocity control law
is designed as follows:
where
denotes the estimate of the adaptive virtual parameter
, and its adaptive law can be constructed as:
We introduce the estimation error equation of the adaptive virtual parameter
, which can be written as
, with
, and
,
,
and
are positive gain coefficients to be designed.
denotes the historical information obtained from the previous task. To circumvent the restrictive assumption of explicit task demarcation, an adaptive triggering schedule is synthesized that autonomously determines when to inject historical virtual-parameter trajectories into the current update law, enabling cross-scenario knowledge accumulation for USV operations without predefined phase boundaries [
38]. The specific expression of the self-triggered mechanism is given subsequently.
4.2. Steering Control
After completing the surge velocity control design, this paper further designs the heading angle tracking control law to realize the turning motion of the USV. One may write the heading angle tracking error as follows:
By differentiating the above expression with respect to time, we obtain:
The virtual yaw velocity is designed as:
where
is a positive design parameter.
Considering that the desired heading angular velocity
contains the virtual control law
to be designed, direct differentiation would inevitably lead to a sharp increase in control law complexity. To address this, dynamic surface control (DSC) technique is introduced, and a first-order filter is designed to filter
:
where
is a time constant;
denotes the first-order filter.
We define the error variable
as the filtering error. Taking the time derivative of the first-order filter
, one obtains
, from which it can be further deduced that:
Here,
constitutes a continuous function with a bounded range [
1] and its supremum is denoted by
. Therefore, the heading angular velocity error variable
can be rewritten as follows:
By differentiating
with respect to time and inserting (
2), the following expression is derived:
Similarly, we define the unknown function as
, where the neural network input vector is described by
. According to Lemma 3, we employ a neural network to perform the approximation. This allows us to deduce:
where
is the hidden layer output vector corresponding to the radial basis neural network functions;
n and
denote the number of nodes and the approximation error of the neural network, respectively.
On this basis, we let
. Substituting (
43) into it, and according to Assumption 1, introducing the single-parameter online learning mechanism, one obtains:
where
is a virtual parameter greater than zero, with
being the unknown upper bound of the yaw external time-varying environmental disturbance, from which it can be inferred that
must be a positive unknown constant. Therefore, the heading angular velocity control law
is designed as follows:
where
denotes the adaptive estimate of the virtual parameter
, and its adaptive law is designed as:
Further, we define its estimation error as
, where
, and
,
,
and
are positive gain coefficients to be designed;
denotes the historical information obtained from the previous task [
38].
To modulate the present adaptation law using accumulated parameter estimates from prior tasks
, the following triggering protocol is designed:
where
. And the term
denotes the update rule for the critic network taken from the earlier learning stage, and
ℓ takes the values of
u or
r. The interval scheduling
is then formulated as follows:
where
,
, and
are parameters to be designed;
is the first-order derivative of the tracking error
at
.
Remark 2. In the designed adaptive law, the self-triggered mechanism is introduced as an information update and transmission scheduling means, avoiding the continuous real-time monitoring of triggering conditions required in event-triggered mechanisms, thereby reducing the computational burden. Meanwhile, through effective retention and transfer of historical information, it helps alleviate the catastrophic forgetting problem [39]. Remark 3. According to (47), the time-scheduling function functions as the inter-trigger interval, quantifying the temporal displacement between the historical knowledge epoch and the present instant. is a function of and , through which the next migration length can be calculated based on , , and the parameters to be designed. According to (48), when the desired path changes relatively slowly and the external disturbance is small, appropriately increasing the triggering threshold parameters , and decreasing can effectively reduce the triggering frequency. Conversely, should the operational environment undergo perturbation or exogenous disturbances escalate, reducing , and increasing can enhance the response capability of the triggering mechanism to error variations. Remark 4. The triggering instants satisfy , and when , is the initial triggering instant. According to the self-triggered protocol (47), the initial instant cannot be calculated from the scheduling function; therefore, it must be predefined. The system then performs the first triggering operation upon reaching this known instant. 4.3. Stability Analysis
This subsection, within the design framework of the vector backstepping method, evaluates the stability of the entire guidance-control closed-loop system under the designed PALOS guidance scheme. In addition, it presents relevant proofs regarding the negative definiteness of the designed Lyapunov function.
Theorem 3. Consider Equations (
1)
and (
2)
and further assume that all previously stated assumptions hold. If the predictor Formula (
11),
the PALOS guidance law (
17), (
29)–(
38),
the designed control laws (
34), (
45),
the adaptive laws (
35), (
46),
and the first-order filter (
39)
are adopted simultaneously, then by properly selecting the design parameters, the resulting closed-loop path following guidance-control system exhibits stable and convergent characteristics. Proof. To investigate the stability of the closed-loop ship path-following control system, this section employs the Lyapunov direct method to construct a composite Lyapunov candidate function containing velocity errors, adaptive parameter errors, and dynamic surface errors as follows:
To facilitate subsequent derivation, we let , .
Differentiating (
49) with respect to time gives:
Substituting (
33), (
44) and (
40) into (
50), the preceding equation can be further expressed as:
Substituting the control laws (
34) and (
45), and the adaptive laws (
35) and (
46) sequentially into (
51), the preceding relation can be reformulated as:
where the specific expressions for
and
are as follows:
If appropriate design parameters are selected such that
,
,
,
,
, and
, it can be ensured that
. Equation (
49) can be further written as:
where
is the initial value of
.
From (
55), one may conclude that the proposed adaptive neural network path following control scheme can ensure the uniform ultimate boundedness stability of all signals in the closed-loop system under the combined effects of external time-varying disturbances and model dynamic uncertainties. Theorem 3 is thus proved. □
From (
48), the minimum triggering interval is:
Equation (
47) implies the existence of a positive constant
satisfying the condition as:
. Plugging this inequality into (
56) then gives:
Therefore, there exists a lower bound for the interval between two consecutive triggers, which can avoid Zeno behavior. Furthermore, the differential term in the scheduling function exists and is bounded.