Direct X-Rudder Path-Following Control for Underactuated AUVs via TIB-CSAC
Abstract
1. Introduction
- (1)
- A direct X-rudder control framework is proposed. In this framework, the X-rudder control policy is learned directly from two-segment path-following information, avoiding the complex hierarchical design of guidance laws and attitude controllers as well as the associated coupled parameter tuning.
- (2)
- A task-informed inductive-bias state encoder is developed to construct structured and task-conditioned representations of two-segment path-following states for direct policy learning. This design reduces the representation burden caused by direct concatenation of heterogeneous state variables and improves sample efficiency.
- (3)
- A multi-head conservative value-evaluation mechanism is introduced to improve value assessment for high-uncertainty state–action pairs, thereby reducing return drawdowns induced by challenging path-following tasks and improving tail-stage convergence stability.
- (4)
- The proposed method is validated in three representative scenarios with distinct path geometries and desired surge velocities. The results show improvements across multiple error metrics, including the maximum absolute error, mean absolute error, tail error, and threshold-exceedance proportion, demonstrating enhanced generalization capability and path-following performance under diverse operating conditions.
2. Problem Statement
2.1. Underactuated AUV Model
2.2. Path Following Objective
3. X-Rudder Path Following Control Based on TIB-CSAC
3.1. Markov Decision Process Design
3.1.1. States
3.1.2. Actions
3.1.3. Reward
3.1.4. Termination
3.2. Task-Informed Inductive-Bias Conservative SAC Algorithm
3.2.1. Task-Informed Inductive-Bias Encoder
3.2.2. Conservative Soft Actor–Critic Algorithm
| Algorithm 1 Training procedure |
| Input: Critic size nQ; time interval Δt; soft-update rate τ; discount factor γ; batch size nB; termination flag dt; Initialize: actor ; critics ; temperature α; target critics ; Replay buffer Ɗ; 1: // Interaction 2: for each environment step do 3: Get current observation st; 4: Sample action ; 5: Execute at, obtain , st+1, dt; 6: If dt = true, reset the environment; 7: Store into Ɗ; 8: // Parameter updates 9: if |Ɗ| > nbat then 10: Sample independent mini-batch ; 11: Sample next action for ( in ); 12: Compute TD target using Equation (43); 13: Update critic using Equations (44) and (45); 14: Update actor using Equation (41); 15: Update α using Equation (42); 16: Soft update target critics using Equations (46) and (47); 17: end if 18: end for |
4. Experiments
4.1. Basic Settings
4.2. Training Results
4.3. Path Following Results
4.3.1. Case 1 Boustrophedon Path
4.3.2. Case 2 Trapezoidal Path
4.3.3. Case 3 Random Path
4.3.4. Metric Analyses
4.4. Ablation Study
4.4.1. Quantitative Analysis of the TIB Encoder
4.4.2. Quantitative Analysis of the Conservative Value Evaluation Mechanism
4.4.3. Overall Ablation Discussion
4.5. Sensitivity Study
4.5.1. Sensitivity Analysis on Critic Size
4.5.2. Sensitivity Analysis on Reward Weights
4.6. Stochastic Disturbance
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
References
- Shen, Z.; Wang, Q.; Dong, S.; Yu, H. Prescribed performance dynamic surface control for trajectory-tracking of unmanned surface vessel with input saturation. Appl. Ocean Res. 2021, 113, 102736. [Google Scholar] [CrossRef]
- He, L.; Zhang, Y.; Li, S.; Li, B.; Yuan, Z. Three-Dimensional Path Following Control for Underactuated AUV Based on Ocean Current Observer. Drones 2024, 8, 672. [Google Scholar] [CrossRef]
- Zheng, J.; Song, L.; Liu, L.; Yu, W.; Wang, Y.; Chen, C. Fixed-time sliding mode tracking control for autonomous underwater vehicles. Appl. Ocean Res. 2021, 117, 102928. [Google Scholar] [CrossRef]
- Rout, R.; Subudhi, B. Design of Line-of-Sight Guidance Law and a Constrained Optimal Controller for an Autonomous Underwater Vehicle. IEEE Trans. Circuits Syst. II Express Briefs 2021, 68, 416–420. [Google Scholar] [CrossRef]
- Wang, L.; Li, S.; Liu, J.; Wu, Q. Data-driven path-following control of underactuated ships based on antenna mutation beetle swarm predictive reinforcement learning. Appl. Ocean Res. 2022, 124, 103207. [Google Scholar] [CrossRef]
- S., P.; Rajendran, S. A unified seakeeping and manoeuvring model with a PID controller for path following of a KVLCC2 tanker in regular waves. Appl. Ocean Res. 2021, 116, 102860. [Google Scholar] [CrossRef]
- He, Y.; Xie, Y.; Pan, G.; Cao, Y.; Huang, Q.; Ma, S.; Zhang, D.; Cao, Y. Depth and Heading Control of a Manta Robot Based on S-Plane Control. J. Mar. Sci. Eng. 2022, 10, 1698. [Google Scholar] [CrossRef]
- Xu, F.; Zhang, L.; Zhong, J. Three-Dimensional Path Tracking of Over-Actuated AUVs Based on MPC and Variable Universe S-Plane Algorithms. J. Mar. Sci. Eng. 2024, 12, 418. [Google Scholar] [CrossRef]
- Jiang, C.; Lv, J.; Wan, L.; Wang, J.; He, B.; Wu, G. An Improved S-Plane Controller for High-Speed Multi-Purpose AUVs with Situational Static Loads. J. Mar. Sci. Eng. 2023, 11, 646. [Google Scholar] [CrossRef]
- He, L.; Xie, M.; Zhang, Y. A Review of Path Following, Trajectory Tracking, and Formation Control for Autonomous Underwater Vehicles. Drones 2025, 9, 286. [Google Scholar] [CrossRef]
- Wang, D.; Shen, Y.; Wan, J.; Sha, Q.; Li, G.; Chen, G.; He, B. Sliding mode heading control for AUV based on continuous hybrid model-free and model-based reinforcement learning. Appl. Ocean Res. 2022, 118, 102960. [Google Scholar] [CrossRef]
- Sun, Y.; Ran, X.; Zhang, G.; Wang, X.; Xu, H. AUV path following controlled by modified Deep Deterministic Policy Gradient. Ocean Eng. 2020, 210, 107360. [Google Scholar] [CrossRef]
- Fang, Y.; Huang, Z.; Pu, J.; Zhang, J. AUV position tracking and trajectory control based on fast-deployed deep reinforcement learning method. Ocean Eng. 2022, 245, 110452. [Google Scholar] [CrossRef]
- Wang, Z.; Li, Y.; Ma, C.; Yan, X.; Jiang, D. Path-following optimal control of autonomous underwater vehicle based on deep reinforcement learning. Ocean Eng. 2023, 268, 113407. [Google Scholar] [CrossRef]
- Wang, Y.; Hou, Y.; Lai, Z.; Cao, L.; Hong, W.; Wu, D. An adaptive PID controller for path following of autonomous underwater vehicle based on Soft Actor–Critic. Ocean Eng. 2024, 307, 118171. [Google Scholar] [CrossRef]
- Zhang, C.; Cheng, P.; Du, B.; Dong, B.; Zhang, W. AUV path tracking with real-time obstacle avoidance via reinforcement learning under adaptive constraints. Ocean Eng. 2022, 256, 111453. [Google Scholar] [CrossRef]
- Zhang, Q.; Lin, J.; Sha, Q.; He, B.; Li, G. Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle. IEEE Access 2020, 8, 24258–24268. [Google Scholar] [CrossRef]
- Dong, N.; Liu, S.; Ip, A.W.H.; Yung, K.L.; Gao, Z.; Juan, R.; Wang, Y. End-to-end autonomous underwater vehicle path following control method based on improved soft actor–critic for deep space exploration. J. Ind. Inf. Integr. 2025, 45, 100792. [Google Scholar] [CrossRef]
- Gu, N.; Wang, D.; Peng, Z.; Wang, J.; Han, Q.-L. Advances in line-of-sight guidance for path following of autonomous marine vehicles: An overview. IEEE Trans. Syst. Man. Cybern. Syst. 2022, 53, 12–28. [Google Scholar] [CrossRef]
- Fan, J.; Li, W. DRIBO: Robust Deep Reinforcement Learning via Multi-View Information Bottleneck. In Proceedings of the 39th International Conference on Machine Learning, Proceedings of Machine Learning Research, Baltimore, MD, USA, 17–23 July 2022; pp. 6074–6102. [Google Scholar]
- Schaul, T.; Horgan, D.; Gregor, K.; Silver, D. Universal Value Function Approximators. In Proceedings of the 32nd International Conference on Machine Learning, Proceedings of Machine Learning Research, Lille, France, 6–11 July 2015; pp. 1312–1320. [Google Scholar]
- Fossen, T.I. Handbook of Marine Craft Hydrodynamics and Motion Control; Wiley: Hoboken, NJ, USA, 2011. [Google Scholar]
- Chai, P.; Sun, Y.; Wu, W.; Zhou, H.; Gao, F. AUV preset performance approximate dynamic programming path following control based on fixed time guidance law. Ocean Eng. 2025, 321, 120425. [Google Scholar] [CrossRef]
- Zhang, L.; Sun, Y.; Chai, P.; Tan, J.; Zheng, H. Prescribed-performance time-delay compensation control for UUV trajectory tracking in main-branch water conveyance tunnel transitions under unknown input delays. Ocean Eng. 2025, 342, 122941. [Google Scholar] [CrossRef]
- Pettersen, K.Y.; Egeland, O. Time-varying exponential stabilization of the position and attitude of an underactuated autonomous underwater vehicle. IEEE Trans. Autom. Control. 1999, 44, 112–115. [Google Scholar] [CrossRef]
- Haarnoja, T.; Zhou, A.; Hartikainen, K.; Tucker, G.; Ha, S.; Tan, J.; Kumar, V.; Zhu, H.; Gupta, A.; Abbeel, P. Soft actor-critic algorithms and applications. arXiv 2018, arXiv:1812.05905. [Google Scholar] [CrossRef]
- Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous control with deep reinforcement learning. arXiv 2015, arXiv:1509.02971. [Google Scholar] [CrossRef]
- An, X.; Xing, H.; Li, H.; Shi, H.; Gu, Y. Construction and verification of the autonomous underwater helicopter’s digital twin system for missions simulation. Ocean Eng. 2025, 341, 122641. [Google Scholar] [CrossRef]
- Ju, H.; Juan, R.; Gomez, R.; Nakamura, K.; Li, G. Transferring policy of deep reinforcement learning from simulation to reality for robotics. Nat. Mach. Intell. 2022, 4, 1077–1087. [Google Scholar] [CrossRef]
- Zhu, W.; Guo, X.; Owaki, D.; Kutsuzawa, K.; Hayashibe, M. A Survey of Sim-to-Real Transfer Techniques Applied to Reinforcement Learning for Bioinspired Robots. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 3444–3459. [Google Scholar] [CrossRef] [PubMed]
- Wu, J.; Zhou, Y.; Yang, H.; Huang, Z.; Lv, C. Human-Guided Reinforcement Learning With Sim-to-Real Transfer for Autonomous Navigation. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 14745–14759. [Google Scholar] [CrossRef]
- Hsu, K.-C.; Ren, A.Z.; Nguyen, D.P.; Majumdar, A.; Fisac, J.F. Sim-to-Lab-to-Real: Safe reinforcement learning with shielding and generalization guarantees. Artif. Intell. 2023, 314, 103811. [Google Scholar] [CrossRef]






















| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Inertia terms | Rudder rate | Δδmax = 0.3491 rad/s | |
| Damping coefficient | Rudder limit | δmax = 0.4189 rad | |
| Damping coefficient | AUV weight | W = 1813 N | |
| Rudder coefficients | Vertical offset |
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Initial AUV states | η0 = 0, ν0 = 0 | Path segments | np = 2 |
| Maximum adjacent heading change | Time step | Δt = 0.2 s | |
| Maximum absolute path pitch angle | Time threshold | ||
| Minimum segment length | Lmin = 100 m | Error threshold | |
| Normalization factors | Reward weights | ||
| Maximum task time limit | Disturbance phase shifts | ||
| Disturbance frequencies | Disturbance phase shifts | ||
| Disturbance frequencies | Disturbance phase shifts |
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Learning rate | lr = 2 × 10−4 | Dynamic encoder | MLP(2 × 32), ReLU |
| Discount factor | γ = 0.99 | Segment branches | MLP(2 × 64), ReLU |
| Critic number | nQ = 6 | Fusion layer | MLP(1 × 64), ReLU |
| Soft update coefficient | τ = 1 × 10−3 | FiLM modulation | g:MLP(1 × 64), ReLU |
| Training episodes | neps = 1200 | Policy network | MLP(2 × 128), ReLU |
| Replay Buffer size | nbuf = 1 × 106 | Critic head network | MLP(2 × 128), ReLU |
| Batch size | nbat = 256 | Target entropy |
| Method | ||||
|---|---|---|---|---|
| TIB-CSAC | −1.7244 × 102 | 1.2368 × 102 | 2.9560 × 102 | −2.6398 × 106 |
| SAC | −2.7927 × 102 | 5.1210 × 102 | 7.3320 × 102 | −6.5231 × 106 |
| DDPG | −5.5372 × 102 | 4.6465 × 102 | 9.3920 × 102 | −9.2165 × 106 |
| Index | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| (m) | 111.79 | 238.38 | 434.02 | 546.22 | 486.21 | 562.53 | 674.69 | 596.73 | 441.31 | 433.10 |
| (m) | 92.60 | 158.75 | 179.64 | 115.61 | −47.99 | −174.84 | −288.56 | −370.22 | −325.50 | −185.18 |
| (m) | 2.56 | 14.46 | 14.68 | 15.56 | 3.68 | 11.19 | 15.01 | 20.85 | 11.28 | 18.72 |
| Index | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 |
| (m) | 464.11 | 375.32 | 450.57 | 628.69 | 794.98 | 883.33 | 1001.32 | 1110.58 | 1264.07 | 1387.24 |
| (m) | −74.54 | 99.49 | 201.61 | 131.19 | 68.23 | −97.93 | −52.65 | 54.26 | 135.41 | 54.28 |
| (m) | 20.23 | 24.20 | 22.87 | 32.63 | 31.45 | 43.51 | 32.77 | 35.95 | 51.05 | 51.89 |
| Case | Metric | TIB-CSAC | LOS+SP | SAC | DDPG |
|---|---|---|---|---|---|
| C1 | MXAE (m) | 1.7299 × 10−1 | 2.6719 × 10−1 | 2.6679 × 10−1 | 4.2489 × 10−1 |
| MAE (m) | 4.1841 × 10−3 | 1.0881 × 10−2 | 1.3561 × 10−2 | 4.7823 × 10−2 | |
| CVaR95 (m) | 4.7757 × 10−2 | 8.6580 × 10−2 | 6.3900 × 10−2 | 2.4309 × 10−1 | |
| Vε | 7.4956 × 10−3 | 1.4532 × 10−2 | 3.0235 × 10−3 | 1.4487 × 10−1 | |
| C2 | MXAE (m) | 3.6035 × 10−1 | 4.8607 × 10−1 | 5.8540 × 10−1 | 5.5639 × 10−1 |
| MAE (m) | 1.0904 × 10−2 | 3.8505 × 10−2 | 2.1844 × 10−2 | 8.1225 × 10−2 | |
| CVaR95 (m) | 1.0198 × 10−1 | 2.5271 × 10−1 | 1.5847 × 10−1 | 4.0743 × 10−1 | |
| Vε | 1.7398 × 10−2 | 1.1855 × 10−1 | 3.2938 × 10−2 | 2.5016 × 10−1 | |
| C3 | MXAE (m) | 3.4549 × 10−1 | 3.7410 × 10−1 | 4.6598 × 10−1 | 6.7275 × 10−1 |
| MAE (m) | 1.2928 × 10−2 | 3.3489 × 10−2 | 3.1507 × 10−2 | 9.6232 × 10−2 | |
| CVaR95 (m) | 1.2817 × 10−1 | 2.1243 × 10−1 | 1.6925 × 10−1 | 4.4115 × 10−1 | |
| Vε | 2.5180 × 10−2 | 8.2075 × 10−2 | 6.1138 × 10−2 | 3.3564 × 10−1 |
| Case | Metric | TIB-CSAC | LOS+SP | SAC | DDPG |
|---|---|---|---|---|---|
| C1 | MXAE (m) | 2.3813 × 100 | 1.0942 × 101 | 1.0892 × 101 | 1.0348 × 101 |
| MAE (m) | 6.1981 × 10−2 | 5.9013 × 10−1 | 4.7040 × 10−1 | 4.8171 × 10−1 | |
| CVaR95 (m) | 8.4313 × 10−1 | 7.1942 × 100 | 5.3551 × 100 | 5.1160 × 100 | |
| Vε | 6.8683 × 10−2 | 3.7126 × 10−1 | 3.8413 × 10−1 | 7.3974 × 10−1 | |
| C2 | MXAE (m) | 1.0324 × 100 | 1.0575 × 101 | 8.3592 × 100 | 1.4654 × 101 |
| MAE (m) | 3.6557 × 10−2 | 7.0307 × 10−1 | 4.6165 × 10−1 | 7.6099 × 10−1 | |
| CVaR95 (m) | 3.8266 × 10−1 | 6.1709 × 100 | 4.2428 × 100 | 7.2433 × 100 | |
| Vε | 6.6016 × 10−2 | 5.7557 × 10−1 | 4.9173 × 10−1 | 7.9612 × 10−1 | |
| C3 | MXAE (m) | 2.9708 × 100 | 1.0196 × 101 | 6.5206 × 100 | 1.0790 × 101 |
| MAE (m) | 8.7358 × 10−2 | 5.9247 × 10−1 | 5.1517 × 10−1 | 7.0218 × 10−1 | |
| CVaR95 (m) | 1.0650 × 100 | 4.9855 × 100 | 4.1604 × 100 | 5.4241 × 100 | |
| Vε | 1.4977 × 10−1 | 5.8885 × 10−1 | 4.6221 × 10−1 | 8.4074 × 10−1 |
| Case | Metric | TIB-CSAC | LOS+SP | SAC | DDPG |
|---|---|---|---|---|---|
| C1 | 3.6415 × 10−2 | 3.5432 × 10−2 | 7.1948 × 10−2 | 1.9965 × 10−2 | |
| 5.9789 × 10−2 | 7.9926 × 10−1 | 2.0554 × 10−2 | 6.3354 × 100 | ||
| 7.5725 × 100 | 1.0895 × 101 | 7.7465 × 100 | 4.2052 × 101 | ||
| 2.7156 × 101 | 1.8364 × 101 | 7.7713 × 101 | 1.4899 × 101 | ||
| 5.5553 × 103 | 5.7613 × 103 | 5.9402 × 103 | 5.6670 × 103 | ||
| 3.7622 × 103 | 3.8948 × 103 | 3.9028 × 103 | 3.8116 × 103 | ||
| C2 | 6.0163 × 10−3 | 1.0941 × 10−2 | 5.5885 × 10−2 | 2.7831 × 10−3 | |
| 3.1153 × 10−1 | 2.3985 × 100 | 1.0248 × 10−1 | 6.8936 × 100 | ||
| 1.8446 × 101 | 3.1841 × 101 | 2.1538 × 101 | 4.7438 × 101 | ||
| 3.2003 × 101 | 2.6966 × 101 | 1.7262 × 102 | 1.9909 × 101 | ||
| 2.2942 × 103 | 2.4531 × 103 | 2.5507 × 103 | 2.4031 × 103 | ||
| 1.2300 × 103 | 1.2704 × 103 | 1.2812 × 103 | 1.2576 × 103 | ||
| C3 | 4.1298 × 10−2 | 2.0751 × 10−2 | 9.3597 × 10−2 | 2.6845 × 10−2 | |
| 6.1293 × 10−1 | 1.5771 × 100 | 1.1750 × 10−1 | 1.3719 × 101 | ||
| 1.6996 × 101 | 1.9188 × 101 | 2.1262 × 101 | 8.9930 × 101 | ||
| 3.4639 × 101 | 2.3822 × 101 | 2.0899 × 102 | 2.6849 × 101 | ||
| 7.5449 × 103 | 7.5713 × 103 | 7.7279 × 103 | 7.5638 × 103 | ||
| 2.4384 × 103 | 2.5830 × 103 | 2.6988 × 103 | 2.5480 × 103 |
| Method | ||||
|---|---|---|---|---|
| SAC | 7.3320 × 102 | −6.5231 × 106 | 5.1210 × 102 | −2.7927 × 102 |
| CSAC | 6.4180 × 102 | −5.4341 × 106 | 1.3339 × 102 | −2.0564 × 102 |
| TIB-SAC | 4.3060 × 102 | −3.9544 × 106 | 3.5529 × 102 | −2.3032 × 102 |
| TIB-CSAC | 2.9560 × 102 | −2.6398 × 106 | 1.2368 × 102 | −1.7244 × 102 |
| nQ | ||||
|---|---|---|---|---|
| 2 | 2.4269 × 103 | 1.2000 × 103 | −7.9829 × 106 | −3.7323 × 103 |
| 4 | 5.3876 × 102 | 3.8780 × 102 | −3.2756 × 106 | −2.7346 × 102 |
| 6 | 1.2368 × 102 | 2.9560 × 102 | −2.6398 × 106 | −1.7244 × 102 |
| 8 | 1.0508 × 102 | 3.3480 × 102 | −2.6063 × 106 | −1.6741 × 102 |
| Case | [kr,1, kr,2, kr,3] | [7.50, 3.75, 3.75] | [3.75, 7.50, 3.75] | [3.75, 3.75, 7.50] | [5.0, 5.0, 5.0] |
|---|---|---|---|---|---|
| C1 | MXAE (m) | 2.8895 × 100 | 3.2273 × 100 | 2.7073 × 100 | 2.3813 × 100 |
| MAE (m) | 9.3567 × 10−2 | 1.2752 × 10−1 | 8.1307 × 10−2 | 6.1981 × 10−2 | |
| CVaR95 (m) | 1.2507 × 100 | 1.1357 × 100 | 1.0947 × 100 | 8.4313 × 10−1 | |
| Vε | 8.7584 × 10−2 | 3.7600 × 10−1 | 7.3903 × 10−2 | 6.8683 × 10−2 | |
| C2 | MXAE (m) | 1.1418 × 100 | 1.6357 × 100 | 1.1187 × 100 | 1.0324 × 100 |
| MAE (m) | 4.0079 × 10−2 | 8.7472 × 10−2 | 5.9850 × 10−2 | 3.6557 × 10−2 | |
| CVaR95 (m) | 4.0264 × 10−1 | 7.0743 × 10−1 | 5.4121 × 10−1 | 3.8266 × 10−1 | |
| Vε | 6.0212 × 10−2 | 2.0276 × 10−1 | 1.1075 × 10−1 | 6.6016 × 10−2 | |
| C3 | MXAE (m) | 2.6098 × 100 | 2.5967 × 100 | 2.6334 × 100 | 2.9708 × 100 |
| MAE (m) | 5.2106 × 10−2 | 6.1310 × 10−2 | 6.4482 × 10−2 | 8.7358 × 10−2 | |
| CVaR95 (m) | 5.2805 × 10−1 | 5.6101 × 10−1 | 6.0352 × 10−1 | 1.0650 × 100 | |
| Vε | 7.0330 × 10−2 | 9.8560 × 10−2 | 9.6979 × 10−2 | 1.4977 × 10−1 |
| Case | [kr,1, kr,2, kr,3] | [7.50, 3.75, 3.75] | [3.75, 7.50, 3.75] | [3.75, 3.75, 7.50] | [5.0, 5.0, 5.0] |
|---|---|---|---|---|---|
| C1 | MXAE (m) | 2.5174 × 10−1 | 1.5607 × 10−1 | 1.8283 × 10−1 | 1.7299 × 10−1 |
| MAE (m) | 1.0644 × 10−2 | 1.0499 × 10−2 | 6.4236 × 10−3 | 4.1841 × 10−3 | |
| CVaR95 (m) | 1.1995 × 10−1 | 5.1650 × 10−2 | 7.6575 × 10−2 | 4.7757 × 10−2 | |
| Vε | 2.4152 × 10−2 | 4.2438 × 10−4 | 1.6045 × 10−2 | 7.4956 × 10−3 | |
| C2 | MXAE (m) | 3.8565 × 10−1 | 3.0402 × 10−1 | 3.7452 × 10−1 | 3.6035 × 10−1 |
| MAE (m) | 1.0350 × 10−2 | 1.6619 × 10−2 | 1.0264 × 10−2 | 1.0904 × 10−2 | |
| CVaR95 (m) | 1.1253 × 10−1 | 9.9581 × 10−2 | 9.9590 × 10−2 | 1.0198 × 10−1 | |
| Vε | 2.0667 × 10−2 | 1.5981 × 10−2 | 1.5450 × 10−2 | 1.7398 × 10−2 | |
| C3 | MXAE (m) | 3.2619 × 10−1 | 2.6435 × 10−1 | 3.2154 × 10−1 | 3.4549 × 10−1 |
| MAE (m) | 1.0312 × 10−2 | 1.9053 × 10−2 | 1.1628 × 10−2 | 1.2928 × 10−2 | |
| CVaR95 (m) | 9.7986 × 10−2 | 8.2500 × 10−2 | 9.0378 × 10−2 | 1.2817 × 10−1 | |
| Vε | 1.6776 × 10−2 | 1.1930 × 10−2 | 1.5267 × 10−2 | 2.5180 × 10−2 |
| Index | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| (m) | 112.13 | 307.52 | 497.67 | 657.61 | 813.02 | 835.02 | 731.12 | 619.40 | 621.43 | 525.10 |
| (m) | 0.00 | −32.65 | −33.77 | 52.53 | 155.41 | 311.17 | 365.13 | 209.29 | 96.76 | 30.63 |
| (m) | 4.72 | 9.98 | 3.04 | 8.81 | 13.64 | 23.01 | 20.88 | 24.59 | 22.71 | 19.30 |
| Index | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 |
| (m) | 408.72 | 315.79 | 229.22 | 138.34 | 161.95 | 218.32 | 283.15 | 423.69 | 509.14 | 595.53 |
| (m) | −33.07 | −152.37 | −264.84 | −196.47 | −52.90 | 98.86 | 197.75 | 133.86 | 224.75 | 297.61 |
| (m) | 16.14 | 6.18 | 4.09 | 4.13 | 12.66 | 12.21 | 10.74 | 22.41 | 14.44 | 11.01 |
| Metric | TIB-CSAC | LOS+SP | SAC | DDPG | |
|---|---|---|---|---|---|
| Vertical | MXAE (m) | 3.0580 × 10−1 | 3.1465 × 10−1 | 3.9217 × 10−1 | 5.5085 × 10−1 |
| MAE (m) | 8.9867 × 10−3 | 2.3031 × 10−2 | 2.5653 × 10−2 | 4.6731 × 10−2 | |
| CVaR95 (m) | 6.6192 × 10−2 | 1.3515 × 10−1 | 1.3048 × 10−1 | 2.6511 × 10−1 | |
| Vε | 7.9594 × 10−3 | 4.0323 × 10−2 | 3.1242 × 10−2 | 1.2729 × 10−1 | |
| Horizontal | MXAE (m) | 3.0449 × 100 | 1.0180 × 101 | 6.7056 × 100 | 1.5886 × 101 |
| MAE (m) | 6.2952 × 10−2 | 5.0292 × 10−1 | 3.0592 × 10−1 | 7.7647 × 10−1 | |
| CVaR95 (m) | 4.6887 × 10−1 | 4.9808 × 100 | 2.6753 × 100 | 6.6688 × 100 | |
| Vε | 1.2621 × 10−1 | 4.2244 × 10−1 | 5.8750 × 10−1 | 9.3249 × 10−1 |
| Metric | TIB-CSAC | LOS+SP | SAC | DDPG |
|---|---|---|---|---|
| 7.3909 × 10−3 | 4.1181 × 10−3 | 5.3944 × 10−2 | 4.9386 × 10−3 | |
| 5.0755 × 10−1 | 2.2047 × 100 | 2.1324 × 10−1 | 1.8090 × 101 | |
| 1.3140 × 101 | 2.5033 × 101 | 3.1093 × 101 | 1.1789 × 102 | |
| 2.5234 × 101 | 1.4796 × 101 | 1.7047 × 102 | 1.6914 × 101 | |
| 7.2190 × 103 | 7.4023 × 103 | 7.4040 × 103 | 7.3943 × 103 | |
| 2.2866 × 103 | 2.3312 × 103 | 2.3302 × 103 | 2.3286 × 103 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tan, J.; Sun, Y.; Zhang, L.; Chai, P.; Liu, Z. Direct X-Rudder Path-Following Control for Underactuated AUVs via TIB-CSAC. J. Mar. Sci. Eng. 2026, 14, 1100. https://doi.org/10.3390/jmse14121100
Tan J, Sun Y, Zhang L, Chai P, Liu Z. Direct X-Rudder Path-Following Control for Underactuated AUVs via TIB-CSAC. Journal of Marine Science and Engineering. 2026; 14(12):1100. https://doi.org/10.3390/jmse14121100
Chicago/Turabian StyleTan, Jiehui, Yushan Sun, Liwen Zhang, Puxin Chai, and Zhan Liu. 2026. "Direct X-Rudder Path-Following Control for Underactuated AUVs via TIB-CSAC" Journal of Marine Science and Engineering 14, no. 12: 1100. https://doi.org/10.3390/jmse14121100
APA StyleTan, J., Sun, Y., Zhang, L., Chai, P., & Liu, Z. (2026). Direct X-Rudder Path-Following Control for Underactuated AUVs via TIB-CSAC. Journal of Marine Science and Engineering, 14(12), 1100. https://doi.org/10.3390/jmse14121100

