Enabling Cooperative Autonomy in UUV Clusters: A Survey of Robust State Estimation and Information Fusion Techniques

Li, Shuyue; López-Benítez, Miguel; Lim, Eng Gee; Ma, Fei; Cao, Mengze; Yu, Limin; Qin, Xiaohui

doi:10.3390/drones9110752

Open AccessReview

Enabling Cooperative Autonomy in UUV Clusters: A Survey of Robust State Estimation and Information Fusion Techniques

by

Shuyue Li

^1,2,3

,

Miguel López-Benítez

^4,5

,

Eng Gee Lim

¹

,

Fei Ma

⁶

,

Mengze Cao

^3,7,

Limin Yu

^1,*

and

Xiaohui Qin

^3,*

¹

Department of Communications and Networking, School of Advanced Technology, Xi’an Jiaotong-Liverpool University (XJTLU), Suzhou 215123, China

²

Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3DR, UK

³

Jiangsu Tsingunited Intelligent Control Technology Co., Ltd., Wuxi 214131, China

⁴

School of Computer Science and Informatics, University of Liverpool, Liverpool L69 3DR, UK

⁵

ARIES Research Centre, Universidad Antonio de Nebrija, 28040 Madrid, Spain

⁶

Department of Applied Mathematics, School of Science, Xi’an Jiaotong-Liverpool University (XJTLU), Suzhou 215123, China

⁷

College of Mechanical and Vehicle Engineering, Hunan University, Changsha 410082, China

^*

Authors to whom correspondence should be addressed.

Drones 2025, 9(11), 752; https://doi.org/10.3390/drones9110752

Submission received: 9 September 2025 / Revised: 20 October 2025 / Accepted: 27 October 2025 / Published: 30 October 2025

(This article belongs to the Section Unmanned Surface and Underwater Drones)

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

Our quantitative analysis reveals a fundamental trade-off in cooperative navigation: batch optimization methods like FGO provide the highest accuracy at a significant computational cost, while robust filters like MCC-KF offer resilience to non-Gaussian noise with much greater efficiency but are prone to long-term drift.
A critical review of intelligent strategies (e.g., DRL, Semantic Communication) and advanced navigation techniques shows that while conceptually powerful, their practical deployment is currently hindered by specific, unresolved challenges such as the sim-to-real gap, hyperparameter instability, and the lack of standardized underwater datasets.

What is the implication of the main finding?

The key implication is that the optimal algorithm choice is a mission-specific engineering decision, not a one-size-fits-all solution; this paper provides the first data-driven framework (via our quantitative comparison) to guide researchers and engineers in making this critical trade-off between accuracy and resource constraints.
This work implies that future research should prioritize the specific, practical bottlenecks identified in this review, and our data-driven roadmap provides concrete, actionable research questions to accelerate the transition of these advanced technologies from theory to robust, real-world application in autonomous underwater systems.

Abstract

Cooperative navigation is a fundamental enabling technology for unlocking the full potential of Unmanned Underwater Vehicle (UUV) clusters in GNSS-denied environments. However, the severe constraints of the underwater acoustic channel, such as high latency, low bandwidth, and non-Gaussian noise, pose significant challenges to designing robust and efficient state estimation and information fusion algorithms. While numerous surveys have cataloged the available techniques, they have remained largely descriptive, lacking a rigorous, quantitative comparison of their performance trade-offs under realistic conditions. This paper provides a comprehensive and critical review that moves beyond qualitative descriptions to establish a novel quantitative comparison framework. Through a standardized benchmark scenario, we provide the first data-driven, comparative analysis of key frontier algorithms—from recursive filters like the Maximum Correntropy Kalman Filter (MCC-KF) to batch optimization methods like Factor Graph Optimization (FGO)—evaluating them across critical metrics including accuracy, computational complexity, communication load, and robustness. Our results empirically reveal the fundamental performance gaps and trade-offs, offering actionable insights for system design. Furthermore, this paper provides in-depth technical analyses of advanced topics, including distributed fusion architectures, intelligent strategies like Deep Reinforcement Learning (DRL), and the unique challenges of navigating in extreme environments such as the polar regions. Finally, leveraging the insights derived from our quantitative analysis, we propose a structured, data-driven research roadmap to systematically guide future investigations in this critical domain.

Keywords:

sensor fusion; state estimation; cooperative control; cooperative navigation; UUV cluster; multi-agent systems; underwater acoustic communication; multi-sensor systems; robust filtering; graph optimization

1. Introduction

1.1. Critical Role of Sensor Fusion in Cooperative Navigation

Achieving robust autonomy and enabling complex cooperative behaviors in modern unmanned systems, from self-driving cars to UAV clusters, are primary research goals. In the most challenging operational domains—the underwater environment—these goals are pushed to their absolute limits by the underlying problem of robust state estimation for cooperative Unmanned Underwater Vehicle (UUV) clusters. While a single UUV is significantly limited in its operational efficiency, robustness, and data resolution for wide-area missions, a cooperative cluster of UUVs can overcome these inherent limitations [1]. However, the very foundation of this cooperation—the ability for vehicles to share sensor data and fuse it into a coherent navigational picture—is severely undermined by the underwater environment. As noted by Saeed et al. [2], the acoustic communication channels that UUVs rely on are not only prone to security threats like jamming and spoofing but are also fundamentally constrained by high latency, low bandwidth, and other non-ideal characteristics. Therefore, developing robust information fusion and state estimation algorithms that can function effectively despite these extreme sensor and communication challenges is not just a strategic need for advancing marine technology but a fundamental problem in the field of sensor fusion itself.

1.2. Motivation and Challenges

The primary motivation for this field stems from a paradigm shift from mono to cluster systems. To break through the bottleneck of monolithic systems, the research has shifted to clusters or swarms consisting of multiple UUVs [3], which brings revolutionary advantages, including mission sharing, enhanced distributed sensing, and improved navigation accuracy through mutual observation [4,5]. However, this cooperation is entirely dependent on overcoming the ubiquitous constraints of underwater acoustic channels, which is the core challenge of the entire field. The Underwater Acoustic Communication (UAC) channel suffers from severe physical limitations [6], including extreme propagation delays (acoustic waves propagate at about 1500 m/s), minimal bandwidth (typically a few kbps), and high sensitivity to multipath propagation and environmental noise.

Together, these characteristics lead to significant delays, packet loss, and frequent link interruptions [7], which constitute fundamental bottlenecks for cluster collaboration. As emphasized by the Office of Naval Research (ONR) [8], these constraints make traditional centralized fusion architectures impractical, creating an urgent need for robust, scalable, and delay-tolerant distributed methods [9]. Furthermore, the complex underwater environment results in sensor measurements with non-Gaussian, heavy-tailed noise, challenging traditional filtering algorithms based on the Gaussian assumption, highlighting that the core of the solution lies in distributed Cooperative Navigation (CN).

Figure 1 visually illustrates the system-level challenges faced by UUV clusters in collaborative navigation. As the only feasible means of underwater long-range communication, acoustic communication inherently suffers from high latency, low bandwidth, and high packet loss rates, which constitute fundamental bottlenecks for cluster collaboration. These unstable communication links not only cause delays and incomplete transmission of information between nodes but also render traditional centralized fusion architectures impractical. Additionally, the complex multipath effects and time-varying noise in the underwater environment directly result in sensor measurement data exhibiting non-Gaussian characteristics, further challenging traditional filtering algorithms based on the Gaussian assumption.

1.3. Overview of Algorithmic Approaches

The research community has dedicated considerable effort to overcoming these challenges, resulting in a rich and diverse landscape of state estimation and information fusion algorithms. These techniques span a wide spectrum, from adaptations of classical Bayesian filtering methods (e.g., Extended Kalman Filter, Unscented Kalman Filter) [10], to more advanced approaches based on robust statistics (e.g., M-estimation, MCC) [11], information theory, and graph-based optimization (e.g., Factor Graph Optimization) [12]. These methods are often integrated within various network architectures, ranging from centralized to fully decentralized schemes [13,14], each presenting a different set of trade-offs.

1.4. Research Gap

While numerous surveys and reviews have been published to catalog this growing body of work, a significant gap remains in the existing literature. Most reviews focus on providing a taxonomy and a high-level, qualitative description of how different algorithms function. However, for a system designer or a mission planner, a critical question is often left unanswered: given the harsh constraints of the UAC channel, how do these algorithms perform relative to one another in terms of the fundamental trade-off between navigation accuracy and system costs (e.g., computational complexity, communication overhead)? There is a distinct lack of a systematic, quantitative comparison framework that can guide the selection of an appropriate algorithm for a specific mission scenario.

1.5. Contribution

To address the aforementioned gap, this paper moves beyond qualitative description to provide a comprehensive, quantitatively grounded reference framework for the design and evaluation of CN systems for UUV clusters. Prior reviews [15,16,17,18] have adeptly cataloged and explained the operational principles of key algorithms, establishing a crucial foundation for the field. This paper aims to build upon and complement these foundational works by introducing a framework focused on a different, yet equally critical, dimension: the direct, quantitative comparison of performance trade-offs. The core contributions of this review are three-fold:

A Novel Quantitative Comparison Framework: We introduce a standardized benchmark scenario and a multi-dimensional assessment methodology to systematically evaluate key algorithms. This framework is built upon concrete performance metrics—Root Mean Square Error (RMSE), CPU usage, communication load, and latency—that directly address the critical trade-offs in UUV system design. The results, encapsulated in our comprehensive comparison and intuitive radar chart visualization (Section 7.3), provide a clear, data-driven foundation for algorithm selection.
A Structured Algorithmic Taxonomy and In-Depth Analysis: We present a detailed taxonomy of state-of-the-art techniques, complemented by an in-depth analysis of pivotal methods. This includes a detailed mathematical derivation of the Maximum Correntropy Criterion (MCC) Kalman Filter (KF) (Section 4.3) and a thorough examination of Robust Filtering Techniques, encompassing adaptive, consensus, and information-theoretic approaches (Section 4.4). This level of technical detail provides the necessary foundation to fully contextualize our quantitative comparisons.
A Data-Informed Future Research Roadmap: Based on the insights gleaned from our quantitative analysis, we identify and discuss key unresolved challenges and promising future research directions. Unlike speculative roadmaps, our recommendations are directly tied to the performance gaps and trade-offs revealed by our comparative framework, highlighting specific areas—such as the development of scalable algorithms that balance accuracy and computational cost—that require focused attention from the research community.

1.6. Paper Structure

The structure of this review is designed to guide the reader logically from the fundamental challenges to the state-of-the-art solutions and future frontiers.

Section 2 begins by detailing the root cause of these challenges: the physical limitations of the underwater acoustic channel.
With the problem defined, Section 3 analyzes the foundational state estimation paradigms.
The core of the review then delves into specific fusion algorithms in Section 4, critically focusing on robust methods.
Building on this foundation, Section 5 investigates how a robust fusion engine enables intelligent behaviors through Deep Reinforcement Learning (DRL) and Task-Oriented Communication (TOC).
Subsequently, Section 6 extends the discussion to complex multi-sensor platforms and extreme environments.
Section 7 first assesses critical system-level issues such as security, and then culminates in our core contribution: a new subsection providing a comprehensive quantitative comparative analysis that synthesizes the preceding discussions into a data-driven evaluation.
Section 8 proposes a forward-looking research roadmap based on the insights from our analysis.
Finally, Section 9 summarizes the main conclusions of this work.

2. Hydroacoustic Communication Channel Modeling and Its Impact on Navigation

2.1. Physical Basis of Hydroacoustic Propagation

Underwater sound propagation is profoundly affected by environmental parameters, with temperature (T), salinity (S), and depth (D, i.e., pressure) being the key determinants of the Sound Speed Profile (SSP). Empirical models such as the Chen–Millero model and the Mackenzie equation [19] are widely used by academics and the engineering community to accurately estimate the speed of sound in specific marine environments. In addition to changes in sound velocity, underwater sound propagation is accompanied by other complex phenomena, including frequency-dependent absorption, geometric diffusion, multipath effects caused by reflection and refraction from the seabed and sea surface, and environmental noise from multiple sources [20]. These complex physical processes, especially the multipath effect, are the leading cause of non-Gaussian, heavy-tailed noise distributions in acoustic measurements, posing a fundamental challenge to traditional filtering algorithms based on Gaussian assumptions.

2.2. Classification of Delay and Packet Loss Models

To support the design and validation of robust algorithms, channel impairments must be mathematically modeled accurately.

2.2.1. Delay Modeling

Communication delay (

τ

) is usually decomposed into deterministic and stochastic parts. The deterministic delay is mainly determined by the propagation distance (d) and the average speed of sound (c), while the stochastic perturbation arises from multipath effects and environmental fluctuations [21]. The delay model has evolved from the simple linear equation

τ = \frac{d}{c}

to a more accurate integral form that accounts for the bending effect of the sound line in a non-uniform sound velocity profile:

τ = \int_{0}^{d} \frac{1}{c (z)} d z

(1)

where

c (z)

is a function of sound speed that varies with depth z. The stochastic term,

δ τ (t)

, is typically described using models such as Markov chains or Gaussian processes to capture its dynamic properties [21]. To estimate the delay more accurately, some methods use Kernel Density Estimation (KDE) [22] to derive the probability distribution of the delay from experimental data without the need for an exact a priori model.

2.2.2. Packet Loss Modeling

In addition to delays, packet loss and intermittent connections are also important characteristics of UAC channels, caused by factors such as deep fading and temporary signal blockage by obstacles (e.g., underwater mountains). In simulations and analyses, communication interruptions are typically modeled using a Bernoulli process (assuming that the packet loss probability is independent for each transmission) or more complex models such as the Gilbert-Elliott model, which uses a two-state Markov chain to describe the channel’s “good” and “bad” states [22]. Accurate packet loss modeling is crucial for designing algorithms that can operate reliably in real marine environments.

2.3. Data-Driven Channel Prediction Methods

Traditional purely physical channel models [23] are challenging in capturing the complex, time-varying, and geographically specific nature of the UAC channel in real time. Building accurate real-time physical models is not only computationally intensive, but also requires a large amount of environmental data (e.g., continuous T-S-D profiles) that are usually difficult to access. As a result, the research field is undergoing a shift from physical modeling to data-driven predictive modeling. Machine learning provides a powerful path to this end by learning predictive models of channels directly from observational data, thereby bypassing complex physical simulations [24]. This allows systems to obtain more adaptive and accurate real-time channel state information, which can then be used to optimize communication protocols or navigation strategies.

Recent research has demonstrated the great potential of machine learning in UAC modeling. For example, researchers have utilized Convolutional Neural Networks (CNNs) to predict acoustic Transmission Loss (TL) [25], enabling effective prediction of the far-field acoustic field given a seafloor topographic profile. Another notable work is applying a Multi-Layer Perceptron-Random Forest (MLP-RF) hybrid model to predict the location and width of acoustic convergence zones. In a complex ocean front environment, the model achieves 82.43% accuracy in predicting the distance of the convergence zone (with an error of less than 1 km) [26], which fully demonstrates the superiority of the data-driven approach in predicting key acoustic features.

2.4. High-Fidelity Simulation: Tools, Techniques, and Datasets

High-fidelity simulation is integral to the Verification and Validation (V&V) cycle of UUV navigation algorithms. Industry-standard acoustic modeling tools, such as the BELLHOP model based on line tracking [27] and the KRAKEN model based on Simple Positive Wave Theory (SPWT), make it possible to accurately compute acoustic propagation paths, crossing times, and multipath effects. To ensure the realism of the simulations, these tools are often combined with real-world oceanographic datasets (e.g., the World Ocean Atlas, WOA) [28] to generate close-to-reality, geographically specific sound velocity profiles. Furthermore, as learning-based multi-intelligent body systems become increasingly complex, high-fidelity hardware-in-the-loop system simulation platforms, such as the Gazebo-based LRAUV Sim [15], become critical. These platforms not only provide realistic physics engines and visualizations, but also support the development and training of complex collaborative behaviors, such as DRL strategies [15], which greatly accelerates the process from algorithms to real-world deployment.

In order to provide a concise and comprehensive reference for researchers, Table 1 summarizes the key impairments of the UAC channel, its direct impact on CN, and commonly used modeling strategies.

2.5. Summary

This section has systematically discussed the physical characteristics of UAC channels and their fundamental constraints on CN. It has first analyzed the key environmental factors that determine the velocity profile of the sound (temperature, salinity, depth) and complex physical phenomena such as multipath effects, propagation delays, bandwidth limitations, and environmental noise [32]. A core conclusion is that complex multipath propagation [32] is the primary cause of non-Gaussian, heavy-tailed noise in acoustic measurements, posing a fundamental challenge to traditional filtering algorithms based on Gaussian assumptions. To support robust algorithm design, the paper reviews mathematical modeling methods for channel damage, where delay modeling ranges from simple deterministic models to integral models considering sound ray bending [21,22], while packet loss modeling introduces techniques such as the Bernoulli process and the Gilbert-Elliott model [22]. Additionally, in response to the limitations of physical models, this section has explored data-driven channel prediction methods, such as using machine learning to predict propagation losses [25,26], and emphasized the critical role of high-fidelity simulation tools (e.g., BELLHOP [27]) and platforms (e.g., LRAUV Sim [15]) in algorithm validation. These physical limitations are the primary source of the non-ideal characteristics, such as non-Gaussian noise and outliers, that challenge conventional sensor fusion algorithms and ultimately hinder the achievement of robust swarm autonomy.

3. State Estimation and Information Reconstruction Under Uncertainty

Reliable state estimation is the bedrock upon which all higher-level autonomous behaviors are built. This section focuses on the core challenge of maintaining an accurate state estimate in the face of the information gaps and asynchrony created by the underwater communication channel.

3.1. Asynchrony and the Challenge of the Information Divide

The inherent shortcomings of the UAC channel directly contribute to a set of core problems in UUV cluster co-navigation, for which information reconstruction techniques were created to address. These challenges mainly include: missing or incomplete state information (e.g., position, velocity) of neighboring nodes due to communication interruptions or packet loss; outdated sensor measurements due to propagation delays; and asynchronous updating of communication topology due to node movement or link quality changes [33]. These information gaps seriously affect the performance and stability of CN systems, making effective estimation and compensation of missing information a critical task.

3.2. Model-Based Prediction and Compensation Techniques

Classical reconstruction methods rely on the UUV’s own motion model to predict and fill the information gaps.

State prediction: when neighbor information is temporarily unavailable, each UUV can use its dynamics or kinematics model for state forward propagation. This includes state prediction using kinetic model-based methods such as Extended Kalman Filter (EKF) or Unscented Kalman Filter (UKF) [34], as well as simpler kinematic methods such as polynomial (e.g., Lagrangian) interpolation based on historical trajectory points, or curve fitting, for short-term position prediction when the kinetic model is unknown or too complex [22].
Delay compensation: In order to deal with received outdated information, researchers have proposed various compensation strategies. One common approach is to consider the communication delay as a measurement bias in the observation model and treat it in the filter update step [35]. Another more refined approach is to use state history augmentation or interpolation to “backtrack” an outdated time-stamped observation to its corresponding historical moment for state updating and covariance correction [32], thus effectively compensating for the latency.

3.3. Paradigm Battle: Recursive Filtering vs. Batch Optimization

In the field of CN, state estimation methods are divided into two main paradigms: Recursive Bayesian Filtering and Batch Optimization. Understanding the fundamental trade-offs between these two paradigms is crucial for designing and selecting appropriate navigation algorithms.

Figure 2 highlights the fundamental differences between Recursive Filtering (RF) and batch optimization (two core state estimation paradigms) in their handling of temporal information. RF, based on the Markov assumption, efficiently performs online state updates but its “forgetful” nature prevents it from correcting early errors retrospectively. In contrast, batch optimization breaks this assumption by jointly modeling all historical data as a large-scale optimization problem, enabling global consistency and effectively correcting errors. These two paradigms involve an inherent trade-off between computational efficiency, memory requirements, and global consistency. The choice of method depends on specific task requirements and available computational resources.

RF methods, such as the classical KF and its variants (EKF, UKF), follow the Markov assumption, i.e., [36], the current state contains sufficient statistics for all historical information. In this framework, the state estimation is performed recursively: the filter makes predictions based on the state of the previous moment and the current control inputs, and then corrects them using the sensor observations of the current moment. Observations are discarded as soon as they are processed, and the system retains only the estimate of the current state (mean and covariance). The strengths of this approach are its computational efficiency and constant memory footprint, making it ideal for real-time online operation. However, its central weakness stems from the fact that the Markov assumption makes it “forgetful”. When dealing with nonlinear systems, the linearization error introduced by methods such as EKF accumulates, and once it occurs, it cannot be corrected by subsequent information, which may lead to overconfident (too small covariance), inconsistent, and ultimately dispersive estimates of the filter.

In contrast, batch optimization methods, especially those based on graph optimization, adopt a completely different philosophy. Instead of following Markov’s assumptions, it constructs the state estimation problem as a large-scale nonlinear least squares problem and jointly optimizes over all states and observations within a time window (or over the entire history) [37]. This approach has become mainstream in the robotics community, especially in the field of Simultaneous Localization and Mapping (SLAM).

Collaborative navigation is essentially a large-scale state estimation problem involving the states of multiple intelligences over a long period of time, as well as massive amounts of measurement information between them. This complex network of relationships can be naturally abstracted as a Factor Graph (FG) [38]. In the FG, the UUVs’ positions at different moments are represented as variable nodes, while their own motion models (e.g., inertial derivation) and relative observations from other UUVs (e.g., acoustic ranging) are represented as factor nodes connecting the variable nodes [39], and each factor node represents a probabilistic constraint between the variables. In this way, the CN problem is elegantly transformed into a large-scale, sparse nonlinear least squares optimization problem [40]. Solving this problem is equivalent to finding the Maximum A Posteriori (MAP) probability estimate for all UUV trajectories.

The fundamental advantage of this approach is its global (or batch) consistency. By considering all constraints simultaneously, the optimization process can “backtrack” and correct earlier state estimates based on subsequent observations, effectively smoothing the entire trajectory and eliminating error accumulation [41]. This allows graph optimization methods to achieve more accurate and robust results than recursive filters when dealing with asynchronous, heterogeneous, and often unreliable data streams.

However, this ability to optimize globally comes at a higher computational cost. As the optimization window (i.e., the length of the history trajectory) increases, the dimensionality of the problem and the computational effort grow rapidly [41], which poses a challenge for real-time applications. To address this issue, the research community has developed techniques such as Sliding-Window Optimization (SWO) and Incremental Smoothing And Mapping (iSAM). SWO limits computational costs by optimizing only a fixed-size window of recent states [42], making it a core component of many modern visual SLAM systems. Meanwhile, iSAM and its successor, iSAM2, avoid performing full batch optimization each time by efficiently updating the numerical solution of the FG [43], significantly reducing computational burden while maintaining high accuracy, making them highly suitable for real-time scenarios.

The development of open-source solver libraries (e.g., GTSAM [39]) has made solving these large-scale FG problems possible in practice. Advanced applications based on graph optimization are constantly emerging. For example, as discussed in the review of cooperative formation control [6], the use of data fusion to manage dynamically changing swarm topologies (such as autonomous dispersion and reorganization of nodes) is a key technology. Another innovative work comes from Wang et al. [44] who integrate the adaptive sliding observation window and MCC into the FG framework to robustly deal with measurement wildcards caused by, e.g., underwater multipath, and heading and velocity errors (YEaVE) due to low-accuracy sensors, and significantly improve the positioning accuracy in harsh environments [44]. In addition, to optimize the path planning of large-scale multi-intelligent body systems, a guidance graph [45] can be introduced to automatically generate guidance, thus improving the throughput without sacrificing the solution quality.

3.4. Learning-Based Information Reconstruction

With the development of machine learning techniques, data-driven information reconstruction methods also show great potential. Recurrent Neural Networks (RNNs), in particular Long Short-Term Memory Networks (LSTMs) and Gated Recurrent Units (GRUs) [46], are capable of learning temporal dynamics patterns of UUV motions in order to predict their trajectories during communication outages. A more cutting-edge approach is the application of Graph Neural Networks (GNNs). GNNs are inherently suited for processing data with graph structure and are capable of explicitly modeling spatial and temporal dependencies between nodes within UUV clusters. By end-to-end training on historical observations, GNNs can learn a powerful state estimation model that outperforms traditional filtering and interpolation methods in terms of trajectory estimation accuracy and generalization in dynamic swarming scenarios with high rates of missing information and frequent changes in network topology [47]. Some studies have also explored self-supervised learning methods [48] to denoise Inertial Measurement Unit (IMU) data in the absence of ground truth, further enhancing the robustness of navigation.

3.5. Summary

This section has focused on addressing the information gap and asynchronous challenges caused by defects in underwater acoustic channels [33]. This section has first reviewed classic model-based information reconstruction techniques, including dynamic or kinematic models used for state prediction during communication interruptions, and delay compensation strategies used to handle outdated information [32]. Subsequently, this section delved into the two major paradigms in the field of state estimation: RF and batch optimization [37]. RF (such as the EKF) is computationally efficient and suitable for online operation, but its Markov assumption makes it prone to divergence due to the accumulation of linearization errors [36]. In contrast, batch optimization methods, represented by FGs, achieve global consistency by jointly optimizing all historical states and observations within a time window, thereby yielding more accurate and robust estimation results, particularly suited for handling asynchronous, unreliable data streams in CN [38]. Finally, this section has introduced cutting-edge learning-based information reconstruction methods, particularly RNNs that can learn UUV motion temporal patterns, and more powerful GNNs [47] that can explicitly model cluster spatio-temporal dependencies. The latter demonstrates the potential to outperform traditional methods in dynamic scenarios with high information loss rates.

4. Fusion Algorithms and Architectures

Having established the foundational state estimation paradigms, this section delves into the specific algorithms and architectural choices that are critical for achieving robust CN in the harsh underwater environment. We will explore fundamental architectural frameworks, review standard Bayesian filtering methods, and then conduct a deep dive into several state-of-the-art robust and information-theoretic techniques. For each core method, we will not only describe its operational principles but also critically analyze its practical applicability and inherent challenges within the constrained UUV domain.

4.1. Fusion Architecture Framework

The choice of fusion architecture is a foundational design decision that dictates how information flows and is processed within a UUV cluster. This choice involves trade-offs between processing paradigms (recursive vs. batch), data integration strategies (loosely vs. tightly coupled), and the challenges of distributed implementation.

Loosely Coupled (LC) vs. Tightly Coupled (TC) Architectures: In a loosely coupled system, each sensor’s data are processed independently to produce a state estimate, and these individual estimates are then fused. This approach is modular and simple but can suffer from information loss [49,50]. In contrast, a tightly coupled architecture fuses raw sensor data directly within a single estimation filter, which is more complex but can achieve higher accuracy by fully exploiting the correlations between different sensor measurements [51,52].
Recursive Filtering vs. Batch Optimization: Recursive filters process measurements sequentially, updating the state estimate at each time step and then discarding the measurement. This is computationally efficient [11,53]. Batch optimization, exemplified by Factor Graph Optimization (FGO), retains all measurements over a time window and solves for the most likely sequence of states that best explains all observations simultaneously [54,55]. FGO models the problem as a bipartite graph of variable nodes (states) and factor nodes (probabilistic constraints from measurements) [55]. While its ability to correct long-term drift is a crucial advantage for extended UUV missions, its batch nature demands higher memory, computational resources, and significant communication overhead to share factor information, making it challenging for resource-constrained UUVs [56,57].
Advanced Topics in Distributed Fusion Frameworks: The practical implementation of these architectures in a distributed UUV network faces further challenges that require advanced solutions. The first is the problem of data incest or “double counting,” which is addressed by algorithms like Covariance Intersection (CI) that conservatively fuse information without requiring knowledge of cross-correlations [58,59]. Second is ensuring the convergence of distributed filters under asynchronous links and frequent packet loss, a key research area for UAC networks [60,61]. Finally, the Information Filter (IF) offers a practical method for communication compression, as its additive nature in the information space simplifies distributed fusion to a simple summation of information vectors from each node [62].

4.2. Bayesian Filtering: The EKF, UKF, and Particle Filters

Bayesian filters are the most widely used state estimation algorithms in the field.

EKF/UKF: EKF and UKF are fundamental tools for dealing with nonlinear systems [63]. In collaborative navigation, they are used to recursively fuse predictions from the UUV’s own motion model with relative measurements from neighboring nodes (e.g., acoustic ranging, azimuthal angle, etc.). The EKF is linearized by a first-order Taylor expansion of a nonlinear function, and is suitable for weakly nonlinear systems, whereas the UKF approximates the state distribution by an Unscented Transform. UKF approximates the state distribution by Unscented Transform, which has higher accuracy for strongly nonlinear systems, but the computational complexity increases accordingly.
Particle Filters (PF): PFs provide a powerful nonparametric solution for complex problems with strongly nonlinear dynamics or non-Gaussian noise distributions [64]. It represents the posterior probability density by a set of random samples (particles) with weights, and can theoretically approximate distributions of arbitrary form. However, its main drawbacks are its huge computational effort and its susceptibility to particle degeneration and sample impoverishment problems in high-dimensional state spaces, requiring a large number of particles to ensure estimation accuracy [65]. This limits its wide application to resource-constrained UUVs.

4.3. Robust Filtering Techniques: Adaptive, Consensus, and Information-Theoretic Approaches

While standard Bayesian filters are optimal under Gaussian assumptions, their performance degrades significantly in the presence of non-Gaussian, heavy-tailed noise. This section focuses on robust techniques designed to overcome this limitation.

Adaptive and consensus-based filters offer partial solutions by either tuning filter parameters in real time [66] or by averaging information across the network [67]. However, a more powerful approach is to use information-theoretic metrics that are inherently robust to outliers. The MCC is a prime example [11]. The MCC-KF reformulates the standard KF to maximize correntropy—a robust similarity measure—instead of minimizing the mean square error [68]. This is achieved by introducing an adaptive weighting mechanism where the Kalman gain is calculated using an effective measurement noise covariance that is inversely weighted by a Gaussian kernel of the measurement residual. When a residual is large (indicating an outlier), its weight approaches zero, effectively causing the filter to “distrust” and ignore that measurement, thus achieving robustness [69].

Figure 3 clearly contrasts the different behaviors of the two optimization objectives, Minimum Mean Square Error (MMSE) and MCC, when dealing with non-Gaussian noise, especially outliers. The quadratic cost function of MMSE makes it highly sensitive to large errors, causing outliers to be overly penalized, which leads to a sharp decline or even divergence in the performance of traditional filters based on it (such as KFs). In contrast, the MCC leverages the properties of the Gaussian kernel function to moderately penalize small errors and rapidly saturate penalties for large errors, thereby fundamentally suppressing the impact of outliers. This principle enables MCC-based filters to exhibit superior robustness in non-ideal channel environments such as underwater multipath propagation.

A crucial practical challenge for MCC-based filters is the selection and stability of the kernel bandwidth

σ

, a key hyperparameter that governs the filter’s robustness. While often set empirically, recent research has focused on adaptive kernel bandwidth methods [70,71] to ensure stability and optimality in unknown, time-varying environments. While MCC offers robustness, its nonlinear weighting complicates the state-vector augmentation methods typically used for handling delayed or out-of-sequence updates [72]. Furthermore, when considering its use as a robust factor in graph optimization, it is less common than standard M-estimators like Huber or Cauchy due to its non-convex nature, which can complicate optimization [73,74,75,76]. In contrast to the Minimum Error Entropy (MEE) criterion, MCC is often computationally simpler, making it more attractive for real-time UUV applications. The primary advantage of the MCC-KF is its inherent resilience to the sudden, large-magnitude errors typical of acoustic multipath effects [77], though this comes at the cost of hyperparameter tuning and potential increases in computational load.

4.4. Information-Theoretic Fusion and Fault Detection: Bhattacharyya Distance

The Bhattacharyya distance (BD) is another information-theoretic metric that quantifies the “distance” or “divergence” between two probability distributions [78,79]. In a UUV network, it can measure the consistency between a vehicle’s own predicted state distribution and the information received from a neighbor. For two Gaussian distributions

p_{1} \sim N (μ_{1}, Σ_{1})

and

p_{2} \sim N (μ_{2}, Σ_{2})

, the BD has a convenient closed-form solution:

D_{B} (p_{1}, p_{2}) = \frac{1}{8} {(μ_{1} - μ_{2})}^{T} Σ^{- 1} (μ_{1} - μ_{2}) + \frac{1}{2} ln (\frac{det (Σ)}{\sqrt{det (Σ_{1}) det (Σ_{2})}})

(2)

where

Σ = \frac{Σ_{1} + Σ_{2}}{2}

. If the calculated BD exceeds a predefined threshold, the neighbor’s data are deemed inconsistent and can be rejected.

The principled selection of this threshold is non-trivial. While it can be derived from statistical distributions (e.g., Chi-squared) under ideal assumptions, adaptive methods like CUSUM or GLR tests [80,81] are often required in practice. Because the BD considers the full distributional information (both mean and covariance), it is inherently more robust to non-Gaussian noise and model mismatch than purely innovation-based consistency checks like the Normalized Innovation Squared (NIS) test. Compared to related metrics like Kullback–Leibler (KL) or Jensen-Shannon (JS) divergence, the BD offers the practical advantages of being a true symmetric metric and having a closed-form solution for Gaussian distributions [82,83], making it more computationally tractable for onboard implementation. For UUV applications, its key appeal lies in providing a principled measure for fault detection, with the main challenge being its higher computational cost compared to a standard NIS check [84].

4.5. Multiresolution Fusion: Wavelet Analysis and Multirate Architectures

The sensor suite of a UUV is inherently multirate and multimodal, and typically consists of high-frequency, high-noise inertial sensors and low-frequency, delayed but drift-free acoustic localization measurements, for which sampling rates are typically 1 Hz or less [85]. Fusing these asynchronous, heterogeneous data streams directly in the time domain is a major engineering challenge.

To address this challenge, the industry has developed Multirate Filtering Architectures (MRFAs). The classic implementation is that the prediction step of the filter operates at a high frequency of the IMU, and utilizes the IMU measurements to perform high-frequency navigation-level extrapolation, thus providing a smooth and continuous state estimation. The filter’s update step, on the other hand, is triggered asynchronously at a lower frequency and is executed only when low-frequency measurements are received from the acoustic positioning system or Global Positioning System (GPS) (for surface-assisted nodes) [86]. This architecture effectively utilizes the strengths of the different sensors: the IMU provides short-term relative motion information, while the acoustic/GPS measurements are used to periodically correct the Inertial Navigation System (INS) for drift errors accumulated over time. However, a key limitation of this architecture is its sensitivity to measurement latency [87]. Significant and time-varying latencies, especially those inherent in UAC channels, can make low-frequency measurements “out of date” by the time they arrive, which, if not handled correctly, can severely affect the filtering accuracy or even lead to divergence.

As a powerful mathematical tool, wavelet analysis provides a new perspective for more refined fusion within a multirate architecture. It can decompose signals into different frequencies, enabling more optimized fusion strategies. Specifically, the application of wavelet analysis in UUV navigation focuses on two aspects: first, as a signal preprocessing tool for noise reduction; second, as a core framework for multi-scale fusion.

IMU signal denoising: The most direct application of the wavelet transform is as an advanced signal preprocessing technique [88]. The output signals of IMUs, especially low-cost Micro-Electromechanical System (MEMS) sensors, contain a large amount of random noise. By utilizing the multiresolution property of the wavelet transform, the signal can be decomposed into different frequency bands. Since the energy of the useful signal is mainly concentrated in the low-frequency part, while the noise is widely distributed in the high-frequency part, the noise can be effectively filtered out by applying soft/hard thresholding methods to the high-frequency wavelet coefficients while maximizing the retention of the useful features of the original signal.
Wavelet-assisted fusion: The wavelet transform provides a unified framework for multi-scale information fusion [16]. In this “wavelet-assisted fusion” strategy, signals from different sensors are first decomposed into the wavelet domain. The fusion process can be carried out at the wavelet coefficient level, e.g., by utilizing high-frequency IMU coefficients for state propagation between high acoustic update rates, and low-frequency acoustic measurements to correct for the low-frequency drift component in the state estimation [26]. This approach not only elegantly handles physical processes on different time scales (e.g., fast maneuvers vs. slow drifts) but also aligns asynchronous data streams through wavelet timing reconstruction. Recent studies have even combined wavelet transforms with neural networks to propose novel network architectures such as WINNet, which takes wavelet-transformed time-frequency features as network inputs along with the original time-domain data [64], providing a richer representation of information for learning-based localization systems. In addition, Wavelet Neural Networks (WNN) are also used to construct motion models of AUVs [17] to cope with system uncertainties.

4.6. Summary

This section has provided a deep dive into the fusion algorithms and architectural frameworks essential for robust CN. We began by outlining the fundamental architectural trade-offs, including loosely vs. tightly coupled integration, recursive filtering vs. batch optimization (identifying FGO as a powerful but resource-intensive batch method), and key challenges in distributed frameworks like data incest and convergence. After reviewing standard Bayesian filters, we focused on advanced robust techniques. We provided a detailed mathematical and critical analysis of information-theoretic methods like the MCC filter and the BD, highlighting their strengths in handling non-Gaussian noise and detecting faults, while also analyzing their practical challenges related to hyperparameter tuning and computational cost. The overarching theme is that there is no one-size-fits-all solution; the choice of algorithm and architecture requires a careful consideration of the trade-offs between accuracy, robustness, and resource constraints inherent to the specific UUV mission.

5. Intelligent Strategies and Adaptive Collaboration

With a robust sensor fusion engine in place to provide reliable state awareness, the UUV swarm can move beyond simple estimation towards the execution of intelligent and adaptive cooperative control strategies.

5.1. DRL for Collaborative Control and Decision-Making

DRL represents a paradigm shift from traditional model-based control to model-free, data-driven learning [89]. It enables UUVs to learn complex adaptive behaviors in dynamic and uncertain environments without the need for precise mathematical models of the system or environment [90]. For UUV clusters, deriving closed-form optimal control policies is a highly dimensional and difficult problem to resolve, given their complex nonlinear fluid dynamics and unknown environmental perturbations [91]. DRL bypasses this challenge by allowing intelligences to learn optimal control policies directly through trial-and-error interactions in (often simulated) environments [92]. This learning process enables the discovery of emergent synergistic behaviors such as formation control, target tracking, and obstacle avoidance that are inherently robust to environmental changes and unmodeled dynamics.

DRL has been remarkably successful in single Autonomous Underwater Vehicle (AUV) missions, such as in autonomous docking missions, where algorithms like Twin Delayed Deep Deterministic Policy Gradient (TD3) have been shown to provide stable, precise control [29]. More importantly, Multi-Agent Reinforcement Learning (Multi-Agent DRL, MADRL) opens up new avenues for solving collaborative tasks. For example, the Multi-Agent DDPG (MADDPG) algorithm has been used to learn cooperative target encirclement strategies [92], while Double Deep Q-Networks (DDQN) have been used to optimize network topology and localization strategies to reduce energy consumption and localization errors [93]. These advances highlight the key synergies between DRL and high-fidelity simulation environments such as LRAUV Sim, which provide the data and environment needed to perform large-scale training of DRL algorithms [15]. A particularly promising research direction is to combine DRL with classical methods, e.g., using the Fisher Information Matrix (FIM) to optimize the trajectory of an Unmanned Surface Vehicle (USV) in order to provide the most informative measurements for a team of AUVs controlled by the RL [29], thus enabling synergistic optimization of control and positioning.

While DRL offers a powerful, model-free approach to optimizing UUV swarm behavior (e.g., path planning, formation control) in complex, dynamic environments, its practical application faces significant hurdles specific to the underwater domain. The high cost and risk of real-world data collection underwater make training prohibitive, often forcing reliance on simulators [94]. However, these simulators may fail to capture the full fidelity of complex acoustic channel physics and unpredictable ocean currents, leading to a significant “sim-to-real” gap [95]. Furthermore, deploying complex neural network policies on embedded UUV hardware presents a major challenge in balancing real-time decision-making performance with constrained energy consumption [96].

5.2. TOC: Bandwidth Optimization Based on the Information Bottleneck Principle

TOC fundamentally reshapes the framework of the communication problem [97]. While the goal of traditional communication is to achieve bit-level accurate data reconstruction, the goal of TOC is to optimize the transmitted signal to maximize the performance of specific downstream tasks (e.g., localization, classification) [29]. This philosophy fits perfectly with the needs of the severely bandwidth-constrained UAC channel, a fundamental bottleneck that cannot be overcome by improving modem technology alone. Transmitting raw or conventionally compressed sensor data (e.g., sonar scan images) is rendered infeasible by its enormous data volume. More importantly, traditional compression methods (e.g., JPEG) are task-independent and aim to maintain human perception quality, which may not be consistent with the information required for navigation tasks.

TOC is theoretically based on the Information Bottleneck (IB) principle and aims to learn a communication strategy that compresses the source data by retaining only the most relevant information for the collaborative task [98]. This allows for extremely high compression rates with minimal degradation in task performance, representing a revolutionary improvement in UUV cluster communication efficiency. A concrete implementation is the Orthogonally constrained Variational Information Bottleneck (O-VIB) framework. The framework utilizes a VIB encoder with Automatic Relevance Determination (ARD) and orthogonality constraints to intelligently compress multiview visual features for Unmanned Aerial Vehicle (UAV) localization, achieving highly accurate localization with a very low bandwidth budget [99]. Although this technique is currently mainly applied to UAV visual navigation, its core idea can be fully generalized to the UUV field [98]. It is foreseeable that similar frameworks will emerge in the future for transmitting highly compressed, mission-relevant features from sonar, IMUs, or acoustic sensors over the UAC channel, thus making the leap from transmitting raw data to transmitting “semantics”.

This approach is particularly promising for UUV clusters as it directly tackles the extreme bandwidth limitations of UAC channels. By transmitting only the information essential for the task (e.g., a target’s semantic label and location, not the raw sonar image), it can achieve orders-of-magnitude compression [100,101]. The critical challenge, however, lies in establishing a shared “semantic dictionary” or context model among UUVs [102] and ensuring the robustness of the semantic extraction and reconstruction process against the channel’s high bit error rates [103]. A misinterpreted semantic message could be more catastrophic for mission success than a traditionally corrupted but partially recoverable message [104].

5.3. Active and Bionic Sensing for Enhanced Situational Awareness

In addition to passively receiving and fusing information, UUV clusters can actively participate in sensing processes to enhance their situational awareness of the environment. In this field, two main research approaches stand out: one is to actively improve perception quality by optimizing the geometric configuration of the swarm, and the other is to draw inspiration from biological swarms in nature to design efficient cooperative behaviors.

Active Cooperative Sensing: In this strategy, UUVs in a cluster are no longer isolated sensing units, but intelligently coordinate each other’s positions and sensing behaviors to achieve globally optimal sensing results [47]. For example, a cluster of UUVs can dynamically adjust its formation to obtain the minimum Dilution of Precision (DOP) for a specific target to maximize the localization accuracy [28]. This approach tightly couples the navigation and sensing tasks so that the cluster “senses” as a whole.
Bio-inspired Cooperative Control: Clusters of organisms such as schools of fish and flocks of birds in nature exhibit efficient and robust cooperative behavior [105]. Inspired by this, researchers have developed bionic control strategies. For example, a novel control scheme utilizes sonar to construct a dynamic interaction topology that enables UUVs to achieve robust formation control and obstacle avoidance by relying only on relative position information (without exchanging velocity information) [31]. This approach greatly reduces the requirement for communication bandwidth and data synchronization and is well suited for underwater environments. Research programs such as the U.S. ONR Collaborative Autonomous Swarming Technology (CAST) are also actively advancing the field [21], exploring new technologies for distributed control, swarm behavior, and CN.

5.4. Summary

This section has explored intelligent strategies that transcend traditional filtering and control frameworks, aiming to enhance the autonomy, robustness, and collaborative efficiency of UUV clusters in dynamic and uncertain environments [89]. First, it has introduced the application of DRL, which represents a paradigm shift from model-based to model-free learning. This enables UUV clusters to autonomously learn complex control strategies, such as cooperative formation and target tracking, through trial-and-error interactions with the environment [90], without the need to establish precise system dynamics models, while also highlighting the significant practical hurdles such as the sim-to-real gap [106,107] and onboard computational constraints [108,109] that currently limit its real-world deployment. Second, this section delved into the revolutionary communication concept of TOC [97]. Unlike traditional communication, which aims for bit-level precise reconstruction, TOC is based on the IB principle, aiming to optimize signal transmission to maximize the performance of downstream tasks (such as navigation) by transmitting only the most task-relevant information [98], significantly improving communication efficiency in extreme bandwidth-constrained environments like acoustic channels, while acknowledging the critical challenges of establishing a shared semantic context [104] and ensuring robustness against high bit error rates [54]. Finally, this section discussed active and biomimetic perception strategies for enhancing situational awareness. This includes “active collaborative perception,” [47], where cluster members intelligently coordinate their positions and sensing behaviors to achieve globally optimal perception effects, and “biomimetic collaborative control,” [31], which mimics the efficient collaborative patterns of natural biological clusters (such as fish schools). The latter can achieve robust formation and obstacle avoidance at extremely low communication costs. Collectively, the discussion underscores that while these intelligent strategies hold immense promise, their successful transition from theory to practice is contingent upon overcoming significant challenges related to data acquisition, simulation fidelity, computational resources, and the need for robust semantic alignment.

6. Heterogeneous Systems and Extreme Environments

Having established a toolbox of robust sensor fusion algorithms, this section extends the discussion to more complex and challenging scenarios. We explore how these fusion principles are applied in heterogeneous multi-sensor platforms (such as UUV-USV teams) and push them to their limits in extreme operational environments like the polar regions and the deep sea, which serve as ultimate testbeds for algorithmic robustness.

6.1. Collaborative Navigation of Heterogeneous Fleets (UUV-USV-AUV)

Currently, the field of underwater autonomous systems is rapidly evolving from homogeneous UUV swarms to more powerful and flexible heterogeneous teams [5]. In these teams, platforms with different capabilities (e.g., surface vs. submerged, sensing vs. actuating platforms) work together to achieve common goals [110]. One of the most prominent and impactful examples is the team of UUVs and USVs.

This combination creates a powerful symbiotic system. UUVs have the key advantages of underwater detection and stealth, but their fundamental limitation is their inability to directly access GPS signals and engage in high-bandwidth communications [111]. In contrast, USVs, while limited to the surface, have continuous access to GPS and utilize high-bandwidth satellite or radio communications [110]. Combined, the USV can act as a mobile “mother ship” or navigation and communications gateway. It provides a GPS-calibrated, accurate mobile acoustic baseline for submerged UUVs, enabling them to be positioned with high accuracy. At the same time, the USV relays data and commands between the UUV cluster and remote human operators [112]. This layered architecture elegantly circumvents many of the fundamental limitations of a purely inter-UUV cooperative system [113], enabling complex missions with long endurance and wide range.

This new architecture also gives rise to new optimization problems. For example, how to plan the trajectory of a USV to minimize the probability of an accidental collision between it and the AUV while maximizing the data transfer rate between the two has become an active research area [112]. In addition, using the UAV’s “eagle eye” view, combined with fuzzy sliding mode control, visual path replanning of USV formations can be realized, which further enhances the cooperative capability of heterogeneous systems [114]. In addition to UUV-USV combinations, other forms of heterogeneous teams also show their value in specific applications [115], such as UUV-UAV-UGV combinations, which can be used for special tasks such as underground or mine mapping.

Figure 4 illustrates the hierarchical operational mode of the heterogeneous UUV-USV collaborative system, a powerful strategy that overcomes the inherent communication and positioning limitations of pure UUV clusters. The USV, positioned on the water surface, continuously receives high-precision GPS signals and high-bandwidth radio communication capabilities, thereby serving as the system’s “navigation anchor” and “communication gateway.” It transmits regular GPS correction information to the underwater UUV cluster via an acoustic link while relaying data and commands. This hierarchical architecture not only addresses the issue of long-term drift in UUVs but also significantly enhances the operational range, robustness, and real-time interaction capabilities of the entire cluster with the external environment.

6.2. Navigation Frontiers I: Polar Region Operations

Operating under the polar ice cap presents a unique and formidable set of challenges that demand specialized navigation strategies [116]. The lack of GNSS is absolute, and the overhead ice sheet creates a severe acoustic multipath environment [117]. Furthermore, navigation at high latitudes introduces fundamental geodetic and attitude reference nuances [118].

Standard navigation frameworks encounter geometric problems due to the convergence of meridians near the poles, making the traditional latitude-longitude coordinate system ill-conditioned [119]. To overcome this, navigation is typically performed on a polar stereographic grid frame, where a 2D Cartesian grid is projected onto the Earth’s surface [49]. Consequently, all heading references must be adapted from True North to Grid North to maintain consistency within the grid projection. This transformation is non-trivial and must be accounted for in all motion models and sensor fusion algorithms [50].

Attitude determination is also severely hampered. The weak horizontal component and rapid fluctuations of the Earth’s magnetic field near the geomagnetic poles render magnetic compasses unreliable for heading estimation [120]. This necessitates a greater reliance on high-grade inertial sensors (e.g., fiber-optic or ring-laser gyroscopes) [121] and periodic heading updates from acoustic systems (e.g., using a long-baseline array deployed from an ice camp) or by surfacing in ice-free zones to obtain a GNSS fix [116].

6.3. Navigation Frontiers II: Deep Sea and Complex Terrain Localization

Deep-sea exploration (>1000 m) and operations in complex terrains like underwater canyons present their own distinct set of challenges, primarily related to acoustic propagation [122] and feature-based navigation [123,124].

In the deep ocean, the SSP is highly stratified and dynamic, with significant variations caused by thermoclines, haloclines, and internal waves [125]. These variations can drastically bend acoustic rays, leading to large, unpredictable positioning errors [126]. For long-endurance missions, relying on a pre-mission SSP is insufficient. This motivates the use of online SSP estimation techniques, where the AUV uses onboard sensors (CTD—Conductivity, Temperature, Depth) to continuously measure and update the acoustic model in real time [127]. Such continuous updates allow for adaptive compensation of ray-bending effects in navigation algorithms [128].

Furthermore, both deep-sea plains and complex canyons pose significant problems for acoustic SLAM. The deep-sea floor is often feature-sparse (“abyssal desert”), making it difficult to find and re-observe unique acoustic landmarks [129]. Conversely, canyons create extreme acoustic multipath and reverberation [129]. In both cases, robust Data Association (DA) is paramount. Misassociating an acoustic return can corrupt the entire map and trajectory estimate. To address this, advanced DA techniques are required. While methods like Joint Probabilistic Data Association (JPDA) can handle multiple potential associations, state-of-the-art approaches based on Random Finite Sets (RFS), such as the Poisson Multi-Bernoulli Mixture (PMBM) filter [130], provide a more rigorous Bayesian framework for joint tracking and mapping in cluttered and uncertain environments [131]. These methods are computationally intensive but offer a promising path toward robust, long-term acoustic SLAM in the most challenging terrains.

6.4. Summary

This section has explored the application of CN principles in advanced and challenging contexts. We first examined the synergistic advantages of heterogeneous fleets, where GNSS-enabled surface assets anchor the underwater network [132]. We then delved into the unique and severe challenges of operating in two of Earth’s most extreme environments: polar regions and the deep sea [133]. To provide a concise overview of the key challenges and enabling technologies discussed, Table 2 provides a comprehensive comparison of the unique challenges and corresponding solution strategies in several advanced operational scenarios discussed in this section. These advanced applications underscore that a truly robust CN system must be adaptable not only to communication failures but also to the fundamental physical and geometric properties of its operational environment [134].

7. System-Level Challenges: Safety, Assessment, and Deployment

Beyond algorithmic design, transitioning a sensor fusion system from theory to real-world deployment requires addressing critical system-level challenges. This section examines these challenges, focusing on how to secure the fusion engine against malicious sensor data, and then establishes a rigorous framework for the quantitative performance evaluation of different fusion strategies.

7.1. Securing CN: Resisting Spoofing and Jamming Attacks

As UUV clusters grow in importance and autonomy in strategic applications, they will inevitably become targets of adversarial attacks [140]. As a result, security can no longer be considered an after-the-fact add-on, but must be a core design consideration for any operationalized co-navigation system.

The entire premise of co-navigation rests on the trusted exchange of information between intelligences. Attackers can destroy this trust in two main ways [141]: first, through jamming attacks, which inject a large amount of noise into the channel, causing Denial-of-Service (DOS) so that nodes cannot communicate effectively with each other; and second, through spoofing attacks, which inject fake and false information into the network to manipulate the cluster’s behavior, inducing it to deviate from its intended task or to collide.

GPS jamming and spoofing are well-known threats to heterogeneous systems containing surface assets such as USVs [142]. Once an attacker succeeds in spoofing a USV that serves as an “anchor point”, the positioning reference of the entire underwater cluster may be shifted, which can lead to the entire cluster being lost. Research has shown that machine learning-based classifiers such as Convolutional Neural Network (CNN) and Long Short-Term Memory Network (LSTM) [143] can effectively detect different types of GPS spoofing attacks (e.g., simple, relay, and complex), providing a layer of protection for surface assets.

Extending these concepts to the underwater domain, similar acoustic threats are faced. Acoustic jamming refers to an attacker using an acoustic source to broadcast strong noise into the channel, drowning out useful communication signals [144]. Acoustic spoofing is even more insidious, as an attacker can replay or fake acoustic ranging signals (ping) or Ultra-Short Baseline (USBL) answer signals [145], thus providing false relative position information to the target UUV. A truly robust system must have the ability to validate information from peers. This goes beyond simple filtering and requires the introduction of new layers of security. As discussed in Section 4.4, fault-detection mechanisms based on information-theoretic metrics, such as barometric distance [146], can serve as an effective defense against spoofing attacks. By quantifying the “informational surprises” associated with each new observation, the system can identify and reject maliciously injected data that is highly inconsistent with the current state beliefs, thus enhancing security at the data level.

Future research directions should include the development of consistency-based anomaly rejection algorithms (e.g., a UUV can cross-check a neighbor’s position derived from its own position with the position broadcast by that neighbor), and the design of trust-based information fusion mechanisms, i.e., [147], to automatically reduce the weight of information from nodes judged to be unreliable.

While standard cryptographic methods can secure data payloads against eavesdropping and modification, they do little to prevent physical-layer attacks like acoustic jamming or signal spoofing, which are particularly effective in the open underwater medium [148,149]. Moreover, traditional security protocols often introduce significant communication overhead and latency, directly conflicting with the primary goal of efficient CN under bandwidth constraints [150,151]. This highlights a fundamental trade-off. Therefore, future research must focus on the co-design of security and navigation algorithms. For instance, leveraging the statistical fault-detection methods discussed in Section 4.4 (like BD) not only enhances robustness against sensor noise but can also serve as a lightweight, intrinsic defense against malicious data injection attacks [152,153], offering a more resource-efficient security paradigm for UUVs.

7.2. Multi-Dimensional Performance Evaluation Framework

In order to systematically evaluate and compare different collaborative navigation systems, a comprehensive performance evaluation framework is needed. This review proposes the following five core evaluation dimensions based on existing research [1]:

Fusion Accuracy: Usually measured by the RMSE between the estimated state (position, velocity, attitude) and the ground truth. This is the most direct metric for evaluating the performance of the algorithm.
Computational Complexity: Measures the consumption of computational resources by the algorithm, which is usually expressed in terms of CPU/GPU processing time per fusion cycle and the memory usage of the algorithm. This is directly related to whether the algorithm can run in real time on resource-constrained UUVs.
Communication Load: A measure of the algorithm’s need for communication bandwidth, usually expressed in terms of the data transfer rate (bit/second) required by each node and the frequency and size of packet exchanges. This is a critical constraint in UAC channels where bandwidth is extremely valuable.
Robustness: The ability of a system to maintain its performance stability in the face of non-ideal conditions such as high communication delays, packet loss, node failure, non-Gaussian sensor noise, and external environmental disturbances.
Scalability: Refers to how key performance metrics (e.g., accuracy, computational complexity, and convergence time) change when the number of nodes in the cluster increases [3]. A scalable algorithm should be able to increase the number of nodes significantly without a sharp drop in performance.

7.3. Comprehensive Quantitative Comparative Analysis of Frontier Methods

To systematically evaluate the performance trade-offs of these algorithms, we propose a comprehensive assessment framework based on a standardized benchmark. This benchmark is designed to replicate the harsh operational environment of UUV CN. Our analysis provides quantitative insights into the critical balance between fusion accuracy, computational cost, communication load, and robustness. The benchmark parameters are precisely calibrated against the UAC channel characteristics, as detailed in Section 2, thereby rigorously incorporating pivotal challenges such as:

High packet loss rates (15–25%) [22]
Severely constrained bandwidth (a few kbps) [1]
Significant propagation delays (∼0.67 ms/m) [154]
Non-Gaussian, heavy-tailed measurement noise from multipath effects [155]

Table 3 presents a multi-dimensional quantitative performance comparison of the core algorithms discussed in this review, clearly revealing the intrinsic trade-offs of different technical routes. The following analysis provides an in-depth interpretation of these trade-offs.

Baseline and Distributed Foundations: EKF and IF
The standard EKF serves as the performance baseline due to its implementation simplicity and high computational efficiency [156]. However, its foundational weakness lies in its strict reliance on Gaussian noise assumptions and linearized models. In the UAC channel’s typical non-Gaussian noise environment, its performance degrades significantly [23]. More critically, it is highly vulnerable to sensor data interruptions; in simulated DVL partial failure scenarios, its RMSE can exhibit catastrophic growth (error increases of over 300%) [154]. The IF, being mathematically dual to the EKF, shows similar performance in centralized scenarios. Its core advantage, however, emerges in distributed systems where the fusion step is remarkably simple—information from different nodes can be directly summed. This property makes it naturally suited for asynchronous and sparse communication environments, thus offering low communication load and strong scalability [157].
Enhancements within the Recursive Framework: AKF and Consensus Filters
The AKF and Consensus Filters represent two distinct approaches to improving robustness within the recursive filtering paradigm. By estimating and compensating for time-varying noise statistics online, the AKF can enhance both accuracy and robustness, with studies showing accuracy improvements of 30–40% in certain navigation scenarios [31,158]. A 2022 study in IEEE Transactions on Vehicular Technology further demonstrates a novel adaptive filter that shows strong performance under non-Gaussian noise [159]. The Consensus Filter, in contrast, reformulates the distributed estimation problem as an average consensus problem on a network graph [157]. Its standout advantage is its high robustness to dynamic network topologies and link failures, a topic explored in recent 2025 publications in IEEE Transactions on Automatic Control concerning completely distributed state estimation [160]. However, this comes at the cost of higher communication overhead, as multiple negotiation iterations are often required to reach consensus.
Specialized Solutions for Harsh Environments: MCC-KF and Wavelet-aided Fusion
To specifically address the hostile underwater environment, robust filters based on more advanced theories have been developed. The Maximum Correntropy Criterion Kalman Filter (MCC-KF) demonstrates unique value by fundamentally employing a cost function that is insensitive to large errors. This provides exceptional robustness against the heavy-tailed, impulsive noise commonly caused by acoustic multipath effects [68,155,161]. This is corroborated by new 2025 research in the IEEE Signal Processing Letters, which proposes a dual-robust-kernel Kalman filter specifically for outlier-robust underwater navigation [162]. Recent studies indicate that in complex non-Gaussian noise environments, MCC-KF-based algorithms can improve navigation accuracy by over 50% compared to traditional filters [163]. Wavelet-aided fusion approaches the problem from a signal processing perspective. By decomposing and reconstructing sensor signals across different frequency scales, it can effectively filter noise and align asynchronous data streams, thereby significantly improving fusion accuracy. Experiments have shown that wavelet denoising, even as a preprocessing step, can reduce the root mean square of position errors by approximately 14% [85].
The High-Accuracy Paradigm: Factor Graph Optimization
FGO represents the current state-of-the-art in terms of achievable accuracy. By abandoning the Markov assumption of recursive filters and jointly optimizing all historical states and measurements within a time window, it achieves global consistency and can effectively correct earlier errors [41,42,43]. In comparable navigation problems, its RMSE can be significantly lower than that of an EKF, with forthcoming work in the 2025 IEEE/ION Position, Location and Navigation Symposium (PLANS) continuing to validate its superior performance in robust navigation frameworks [154]. Critically, its batch-processing nature provides remarkable resilience to sensor failures, with error growth being minimal in DVL failure scenarios [154]. However, this high precision is achieved at the cost of high computational complexity (often >3 times that of an EKF) and latency [156]. Furthermore, its computational complexity scales quadratically with the number of nodes $O (N^{2})$ , limiting its real-time application in very large-scale clusters [41].

To visually represent these critical performance trade-offs, Figure 5 presents the data from Table 3 in the form of a radar chart. The shape of each polygon clearly delineates the performance “fingerprint” of the corresponding algorithm, allowing readers to quickly grasp the strengths and weaknesses of each approach. For instance, the polygon for FG extends far along the “Accuracy” axis but is severely retracted on the “Computational Efficiency” and “Scalability” axes, whereas the MCC-KF exhibits a balanced profile with a notable protrusion on the “Robustness” axis.

7.4. Summary

This section has explored the key challenges faced when converting CN algorithms into practical systems, primarily covering security, performance evaluation frameworks, and comprehensive comparisons of technical approaches. First, it has emphasized the core importance of ensuring the security of CN [140]. Since CN relies on trusted information exchange, this section analyzes the main threats faced by the system—jamming and spoofing attacks [141]. It has been noted that, in addition to traditional defensive measures, the fault-detection mechanisms based on information-theoretic metrics (such as the BD [146]) discussed in Section 4 can also serve as an effective data-layer defense against spoofing attacks by quantifying “information anomalies” to identify and remove malicious data injections.

Second, to systematically evaluate and compare different algorithms, this section has proposed a multi-dimensional performance evaluation framework [1]. This framework includes five core evaluation dimensions: fusion accuracy, computational complexity, communication load, robustness, and scalability [3]. Crucially, to move beyond qualitative descriptions, we introduced a standardized benchmark scenario to ground the comparison in objective, data-driven evidence. This culminated in a new, rigorous quantitative analysis presented in Table 3 and visually synthesized in the radar chart of Figure 5. This new analysis provides a much clearer insight into the critical trade-offs, for instance, revealing that while FG methods offer superior accuracy, the MCC-based filters provide unparalleled robustness against the non-Gaussian noise typical in real-world deployments—a distinction not apparent in purely qualitative assessments.

8. Synthesis, Key Challenges, and Future Roadmap

This paper has systematically reviewed the landscape of robust state estimation and information fusion techniques for cooperative UUV clusters. This final section synthesizes the key findings, particularly those from our quantitative analysis, to identify the most pressing challenges facing the field and proposes a structured, data-driven roadmap for future research.

8.1. Synthesis of Key Findings and Technology Evolution

The comprehensive analysis throughout this paper, culminating in the quantitative comparison in Section 7.3, allows us to synthesize the current state-of-the-art not merely as an evolution of techniques, but as a landscape defined by fundamental trade-offs. Our quantitative results (Table 3 and Figure 5) empirically confirm the critical trade-off between accuracy and resource consumption. Batch optimization methods like FGO or Bundle Adjustment (BA) achieve superior accuracy and global consistency through joint optimization of poses and landmarks [54,164], but this advantage comes at the cost of high computational and communication overhead, making them challenging for real-time deployment on resource-constrained UUV platforms. Conversely, advanced recursive filters such as the Maximum Correntropy Kalman Filter (MCC-KF) demonstrate strong robustness to non-Gaussian noise with significantly lower complexity [53,165,166], yet they remain susceptible to long-term drift without global constraints. This synthesis reveals that technological evolution has not produced a single “best” algorithm, but rather a spectrum of specialized tools, each exhibiting a distinct balance between accuracy, robustness, and resource efficiency [167].

8.2. Challenges and Unsolved Problems

Based on the synthesis above, we can now articulate the key challenges and unsolved problems that are hindering the widespread deployment of large-scale autonomous UUV clusters. These challenges are no longer abstract concepts but are directly informed by the performance gaps identified in our analysis.

The Accuracy-vs-Complexity Gap: This is the most significant challenge. As demonstrated in our quantitative analysis, there is a clear performance gap between theoretically optimal but resource-intensive batch methods and efficient but less accurate filtering techniques [168]. The unsolved problem is how to design algorithms that are both robust for long-duration missions and practical for real-time execution on embedded UUV hardware. Recent approaches attempt to bridge this using techniques like federated fixed-lag smoothing to balance accuracy and computational load [169]
Hyperscale Scalability: Most existing algorithms are validated in simulations with only a few UUVs. The communication and computational complexity associated with scaling to clusters of tens or even hundreds of vehicles (hyperscale) remains a largely unsolved problem [170]. The challenge lies in designing decentralized algorithms whose complexity does not grow exponentially with the number of agents and whose performance does not degrade catastrophically with increased communication latency and packet loss, a key focus of modern cooperative localization research [171].
Energy-Aware Design: The energy budget is arguably the most critical constraint for UUVs. However, the majority of current research focuses on optimizing for navigation accuracy, with energy efficiency often being an afterthought [172]. A key unsolved problem is the lack of a unified framework for the co-design of energy-aware and navigation-aware algorithms that can make intelligent trade-offs between performance and mission endurance, for example, by developing mission planners that explicitly model the vehicle’s energy consumption [173].
Lack of Standardized Benchmarking: As highlighted by our effort to create a comparison framework in Section 7, the field lacks standardized, publicly available underwater datasets and simulation environments. This makes it exceedingly difficult to fairly compare new algorithms, hindering reproducible research and slowing the pace of innovation. While specific efforts have been made to create and characterize datasets [174], the community still needs more comprehensive and widely adopted benchmarks to rigorously validate emerging navigation techniques [175].

8.3. Future Research Trajectories: A Roadmap for the Next Decade

Building upon the specific challenges identified above, we propose a structured research roadmap. To make it concrete and actionable, and to provide a clear, structured presentation, each direction is supported by examples from recent literature and organized into a timeline.

8.3.1. Short-Term Goals (1–3 Years)

Standardized Benchmarking and Datasets: A primary obstacle to progress is the lack of standardized datasets. The community urgently needs to develop benchmark suites—analogous to the KITTI dataset in autonomous driving—that include raw sensor data (IMU, DVL, sonar, acoustic modem logs) from diverse, real-world underwater environments [175,176]. This would directly address the need for better evaluation controls and accelerate innovation.
Energy-Aware Navigation and Communication: Future research should focus on algorithms that explicitly co-optimize for navigation accuracy and energy consumption. This includes developing adaptive sensing strategies where a UUV can power down non-essential sensors, and creating energy-aware communication protocols that schedule transmissions to minimize the power-intensive “wake-up” cycles of acoustic modems [177].

8.3.2. Medium-Term Goals (3–7 Years)

Bridging the Accuracy-Complexity Gap with Hybrid AI: A key research question is: How can we design algorithms that achieve FGO-like accuracy with filter-like efficiency? A promising avenue is the development of Hybrid AI models that merge the theoretical rigor of traditional filters with the powerful nonlinear fitting capabilities of neural networks. For example, recent studies have successfully used LSTMs or Transformers to learn and predict the complex, non-Gaussian noise characteristics of sensors, which are then fed into an adaptive Kalman filter to improve its accuracy [178,179].
Hyperscale Scalability via Decentralized Optimization: To overcome the challenges of large-scale swarms, research should move beyond simple consensus and explore more advanced decentralized optimization techniques. This includes developing communication-efficient distributed graph optimization algorithms and exploring event-triggered communication schemes where UUVs only share information when their uncertainty exceeds a certain threshold, thus drastically reducing network load [180].

8.3.3. Long-Term Goals (7+ Years)

Robust-by-Design Security: As UUV clusters become more autonomous, security becomes paramount. Future work should focus on “robust-by-design” security, moving beyond cryptographic overhead. This involves leveraging the intrinsic robustness of certain estimation algorithms as a defense mechanism. For instance, the statistical fault-detection methods discussed in Section 4.4 can be extended to explicitly detect and reject malicious data injection attacks from compromised nodes, creating a more resource-efficient security paradigm [181].
Quantum Sensing for GNSS-Free Navigation: A transformative long-term goal is the development of navigation systems that are entirely independent of external signals. Quantum sensing offers a revolutionary path forward. Future research should explore the integration of ultra-precise quantum inertial measurement units (e.g., quantum gyroscopes) and gravity gradiometers, which have the theoretical potential to enable long-duration, drift-free dead reckoning, fundamentally changing the paradigm of underwater navigation [182,183].
Ethical Frameworks for Autonomous Swarms: As UUV swarms transition from research to real-world deployment in both civilian and military contexts, it is imperative to develop robust ethical and legal frameworks. This includes research into verifiable decision-making, accountability in multi-agent systems, and establishing rules of engagement for autonomous maritime operations to ensure their safe and responsible use [184].

9. Conclusions

This paper has provided a comprehensive and critical review of the state-of-the-art in robust state estimation and information fusion for cooperative UUV clusters. Moving beyond a purely descriptive survey, the central contribution of this work is the introduction of a novel quantitative comparison framework. By establishing a standardized benchmark scenario and evaluating key algorithms across concrete performance metrics—accuracy, computational complexity, communication overhead, and robustness—this paper provides the first data-driven guide to navigating the fundamental trade-offs inherent in UUV cooperative navigation system design.

Our analysis synthesized several key findings. We provided a deep dive into the architectural and algorithmic trade-offs, from the global consistency of batch optimization methods like FGO to the computational efficiency of robust recursive filters like the MCC-KF. We critically evaluated not only their operational principles but also their practical limitations and hyperparameter sensitivities within the context of the extreme underwater environment. Furthermore, we extended the discussion to advanced applications, including the challenges of operating in heterogeneous fleets and extreme environments such as the polar regions and the deep sea.

By synthesizing the insights from our quantitative analysis, we identified key unsolved problems and proposed a structured, data-driven future research roadmap, highlighting specific, actionable research questions for the next decade. The overarching conclusion of this review is that there is no “one-size-fits-all” solution in this domain. The optimal choice of an algorithm is a mission-specific decision that requires a careful balance of competing performance criteria. It is our hope that the analytical framework and quantitative insights provided in this paper will serve as a valuable resource for researchers and engineers, accelerating the development of the next generation of truly autonomous and robust cooperative underwater systems.

Author Contributions

Conceptualization, S.L., M.L.-B., E.G.L., F.M., L.Y. and X.Q.; methodology, S.L.; software, S.L.; validation, S.L. and L.Y.; formal analysis, S.L.; investigation, S.L.; resources, L.Y. and X.Q.; data curation, S.L. and M.C.; writing—original draft preparation, S.L.; writing—review and editing, S.L., M.L.-B., E.G.L., F.M., M.C., L.Y. and X.Q.; visualization, S.L.; supervision, M.L.-B., E.G.L., F.M., L.Y. and X.Q.; project administration, L.Y. and X.Q.; funding acquisition, L.Y. and X.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by AI University Research Center (AI-URC) and XJTLU Laboratory for Intelligent Computation and Financial Technology through XJTLU Key Program Special Fund (KSF-P-02), Suzhou Municipal Key Laboratory Broadband Wireless Access Technology (BWAT), Jiangsu Data Science and Cognitive Computational Engineering Research Centre, and JITRI Supervision Support Fund (JSF10120220008) of XJTLU-JITRI Academy.

Data Availability Statement

This article is a review based on previously published studies. All data and findings discussed within the text are derived from the cited publications. The figures presented are original illustrations created by the authors to visually summarize and explain key concepts, architectures, and performance trade-offs from the reviewed literature. For further information, please contact the corresponding authors.

Acknowledgments

During the preparation of this manuscript, the authors used Gemini 2.5 Pro for language editing and text structuring. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

Author Xiaohui Qin was employed by Jiangsu TsingUnited Intelligent Control Technology Co., Ltd., Wuxi, Jiangsu 214131, China. Author Mengze Cao was employed by Hunan University Wuxi Intelligent Control Research Institute (WICRI), Wuxi, Jiangsu 214131, China. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AKF	Adaptive Kalman Filter
ARD	Automatic Relevance Determination
AUV	Autonomous Underwater Vehicle
BD	Bhattacharyya Distance
CAST	Collaborative Autonomous Swarming Technology
CN	Cooperative Navigation
CNN	Convolutional Neural Network
DDQN	Double Deep Q-Network
DOP	Dilution of Precision
DOS	Denial-of-Service
DRL	Deep Reinforcement Learning
DVL	Doppler Velocity Log
EKF	Extended Kalman Filter
FG	Factor Graph
FGO	Factor Graph Optimization
FIM	Fisher Information Matrix
GNSS	Global Navigation Satellite System
GPS	Global Positioning System
IB	Information Bottleneck
IF	Information Filter
IMU	Inertial Measurement Unit
INS	Inertial Navigation System
iSAM	Incremental Smoothing And Mapping
KDE	Kernel Density Estimation
KF	Kalman Filter
LC	Loosely Coupled
LSTM	Long Short-Term Memory Network
MAP	Maximum A Posteriori
MCC	Maximum Correntropy Criterion
MEE	Minimum Error Entropy
MEMS	Micro-Electromechanical System
MLP-RF	Multi-Layer Perceptron-Random Forest
MMSE	Minimum Mean Square Error
MSE	Mean Square Error
ONR	Office of Naval Research
O-VIB	Orthogonally constrained Variational Information Bottleneck
PF	Particle Filter
RF	Recursive Filtering
RMSE	Root Mean Square Error
SLAM	Simultaneous Localization and Mapping
SPWT	Simple Positive Wave Theory
SSP	Sound Speed Profile
SWO	Sliding-Window Optimization
TC	Tightly Coupled
TL	Transmission Loss
TOC	Task-Oriented Communication
UAC	Underwater Acoustic Communication
UAV	Unmanned Aerial Vehicle
UKF	Unscented Kalman Filter
USBL	Ultra-Short Baseline
USV	Unmanned Surface Vehicle
UUV	Unmanned Underwater Vehicle
WNN	Wavelet Neural Network
WOA	World Ocean Atlas

References

Zhang, L.; Wu, S.; Tang, C.; Lin, H. UUV Cluster Distributed Navigation Fusion Positioning Method with Information Geometry. J. Mar. Sci. Eng. 2025, 13, 696. [Google Scholar] [CrossRef]
Saeed, K.; Khalil, W.; Al-Shamayleh, A.S.; Ahmed, S.; Akhunzada, A.; Alharthi, S.Z.; Gani, A. A Comprehensive Analysis of Security-Based Schemes in Underwater Wireless Sensor Networks. Sustainability 2023, 15, 7198. [Google Scholar] [CrossRef]
Lin, Y.H.; Chuang, P.C.; Huang, J.Y.T. Simultaneous Depth and Heading Control for Autonomous Underwater Vehicle Docking Maneuvers Using Deep Reinforcement Learning within a Digital Twin System. Comput. Mater. Contin. 2025, 84, 4907–4948. [Google Scholar] [CrossRef]
Heshmat, M.; Saad Saoud, L.; Abujabal, M.; Sultan, A.; Elmezain, M.; Seneviratne, L.; Hussain, I. Underwater SLAM Meets Deep Learning: Challenges, Multi-Sensor Integration, and Future Directions. Sensors 2025, 25, 3258. [Google Scholar] [CrossRef]
Gao, W.; Yang, J.; Liu, J.; Xu, B.; Shi, H. Cooperative location of multiple UUVs based on hydro-acoustic communication delay. Syst. Eng. Electron. 2014, 36, 539–544. [Google Scholar]
Das, B.; Subudhi, B.; Pati, B.B. Cooperative Formation Control of Autonomous Underwater Vehicles: An Overview. Int. J. Autom. Comput. 2016, 13, 199–225. [Google Scholar] [CrossRef]
Goel, S. A Distributed Cooperative UAV Swarm Localization System: Development and Analysis. In Proceedings of the 30th International Technical Meeting of the Satellite Division of The Institute of Navigation (ION GNSS+ 2017), Portland, Oregon, 25–29 September 2017; pp. 2501–2518. [Google Scholar] [CrossRef]
Office of Naval Research. Cooperative Autonomous Swarm Technology (CAST). 2022. Available online: https://www.onr.navy.mil/organization/departments/code-33/division-333/cooperative-autonomous-swarm-technology (accessed on 15 July 2025).
Wang, Z.; Wang, H.; Yuan, J.; Yu, D.; Zhang, K.; Ren, J. Bio-Inspired Cooperative Control Scheme of Obstacle Avoidance for UUV Swarm. J. Mar. Sci. Eng. 2024, 12, 489. [Google Scholar] [CrossRef]
Li, N.; Martínez, J.F.; Meneses Chaus, J.M.; Eckert, M. A Survey on Underwater Acoustic Sensor Network Routing Protocols. Sensors 2016, 16, 414. [Google Scholar] [CrossRef] [PubMed]
Chen, B.; Liu, X.; Zhao, H.; Principe, J.C. Maximum correntropy Kalman filter. Automatica 2017, 76, 70–77. [Google Scholar] [CrossRef]
Ning, B.; Zhao, F.; Luo, H.; Luo, D.; Shao, W. Robust GNSS/INS Tightly Coupled Positioning Using Factor Graph Optimization with P-Spline and Dynamic Prediction. Remote Sens. 2025, 17, 1792. [Google Scholar] [CrossRef]
Cheng, Z.; Chen, G.; Li, X.M.; Ren, H. Consensus-Based Power System State Estimation Algorithm Under Collaborative Attack. Sensors 2024, 24, 6886. [Google Scholar] [CrossRef]
Chen, Y.; Gao, Y.; Gan, K.; Li, M.; Wei, C.; Guo, X.; Zhao, R.; Lu, J.; Che, L. State Estimation for Active Distribution Networks Considering Bad Data in Measurements and Topology Parameters. Energies 2025, 18, 2222. [Google Scholar] [CrossRef]
González-García, J.; Gómez-Espinosa, A.; Cuan-Urquizo, E.; García-Valdovinos, L.G.; Salgado-Jiménez, T.; Cabello, J.A.E. Autonomous Underwater Vehicles: Localization, Navigation, and Communication for Collaborative Missions. Appl. Sci. 2020, 10, 1256. [Google Scholar] [CrossRef]
Theocharidis, T.; Kavallieratou, E. Underwater communication technologies: A review. Telecommun. Syst. 2025, 88, 54. [Google Scholar] [CrossRef]
Zhou, Q.; Ye, Q.; Lai, C.; Kou, G. Cryptography-Based Secure Underwater Acoustic Communication for UUVs: A Review. Electronics 2025, 14, 2415. [Google Scholar] [CrossRef]
Bae, I.; Hong, J. Survey on the Developments of Unmanned Marine Vehicles: Intelligence and Cooperation. Sensors 2023, 23, 4643. [Google Scholar] [CrossRef]
Merveille, F.F.R.; Jia, B.; Xu, Z. Advancements in Underwater Navigation: Integrating Deep Learning and Sensor Technologies for Unmanned Underwater Vehicles. Preprints 2024. [Google Scholar] [CrossRef]
Yu, R.; Liu, Y.; Meng, Y.; Guo, Y.; Xiong, Z.; Jiang, P. Optimal Configuration of Heterogeneous Swarm for Cooperative Detection with Minimum DOP Based on Nested Cones. Drones 2024, 8, 11. [Google Scholar] [CrossRef]
Guo, Q.; Yan, X.; Luo, Q.; Lin, J. Cooperative localization algorithm for multiple AUVs under communication Delay. In Proceedings of the 2025 International Wireless Communications and Mobile Computing (IWCMC), Abu Dhabi, United Arab Emirates, 12–16 May 2025; pp. 31–36. [Google Scholar] [CrossRef]
Li, L.; Li, Y.; Zhang, Y.; Xu, G.; Zeng, J.; Feng, X. Formation control of multiple autonomous underwater vehicles under communication delay, packet discreteness and dropout. J. Mar. Sci. Eng. 2022, 10, 920. [Google Scholar] [CrossRef]
Yan, Z.P.; Liu, Y.B.; Zhou, J.J.; Zhang, W.; Wang, L. Consensus of multiple autonomous underwater vehicles with double independent Markovian switching topologies and time-varying delays. Chin. Phys. B 2017, 26, 040203. [Google Scholar] [CrossRef]
Deo, I.K.; Venkateshwaran, A.; Jaiman, R.K. Predicting transmission loss in underwater acoustics using continual learning with range-dependent conditional convolutional neural networks. J. Acoust. Soc. Am. 2025, 157, 3930–3945. [Google Scholar] [CrossRef] [PubMed]
Xu, W.; Zhang, L.; Wang, H. Machine learning–based feature prediction of convergence zones in ocean front environments. Front. Mar. Sci. 2024, 11, 1337234. [Google Scholar] [CrossRef]
Gallici, M.; Masmitja, I.; Martín, M. Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles. arXiv 2025. [Google Scholar] [CrossRef]
Staff Writer. Uncrewed Marine Vehicles—USV, ASV, UUV, AUV & ROV. Unmanned Syst. Technol. 2025. Available online: https://www.unmannedsystemstechnology.com/expo/uncrewed-marine-vehicles/ (accessed on 28 October 2025).
Mostaani, A.; Vu, T.X.; Sharma, S.K.; Nguyen, V.D.; Liao, Q.; Chatzinotas, S. Task-Oriented Communication Design in Cyber-Physical Systems: A Survey on Theory and Applications. IEEE Access 2022, 10, 133842–133868. [Google Scholar] [CrossRef]
Wang, Q.; Fan, S.; Zhang, Y.; Gao, W.; Wei, J.; Wang, Y. A Novel Adaptive Sliding Observation-Based Cooperative Positioning Algorithm Under Factor Graph Framework for Multiple UUVs. IEEE Trans. Ind. Inform. 2023, 19, 8743–8753. [Google Scholar] [CrossRef]
Ouyang, X.; Zeng, F.; Lv, D.; Dong, T.; Wang, H. Cooperative Navigation of UAVs in GNSS-Denied Area With Colored RSSI Measurements. IEEE Sens. J. 2021, 21, 2194–2210. [Google Scholar] [CrossRef]
Sheng, G.; Liu, X.; Sheng, Y.; Cheng, X.; Luo, H. Cooperative Navigation Algorithm of Extended Kalman Filter Based on Combined Observation for AUVs. Remote Sens. 2023, 15, 533. [Google Scholar] [CrossRef]
Edu, I.R.; Adochiei, F.C.; Obreja, R.; Rotaru, C.; Grigorie, T.L. Inertial Sensor Signals Denoising with Wavelet Transform. INCAS Bull. 2015, 7, 57–64. [Google Scholar] [CrossRef]
Xu, B.; Wang, X.; Guo, Y.; Zhang, J.; Razzaqi, A.A. A Novel Adaptive Filter for Cooperative Localization Under Time-Varying Delay and Non-Gaussian Noise. IEEE Trans. Instrum. Meas. 2021, 70, 1–15. [Google Scholar] [CrossRef]
Zhang, H.; Ji, D.S.; Xie, S.R.; Wang, W.H. A Review of Path Planning for Autonomous Underwater Vehicles. Sensors 2022, 22, 5016. [Google Scholar] [CrossRef]
Yan, T.; Xu, Z.; Yang, S.X. Consensus Formation Tracking for Multiple AUV Systems Using Distributed Bioinspired Sliding Mode Control. IEEE Trans. Intell. Veh. 2023, 8, 1081–1092. [Google Scholar] [CrossRef]
Fox, V.; Hightower, J.; Liao, L.; Schulz, D.; Borriello, G. Bayesian filtering for location estimation. IEEE Pervasive Comput. 2003, 2, 24–33. [Google Scholar] [CrossRef]
Das, A.; Elfring, J.; Dubbelman, G. Real-Time Vehicle Positioning and Mapping Using Graph Optimization. Sensors 2021, 21, 2815. [Google Scholar] [CrossRef]
Li, T.; Zhou, T. Multi-scale fusion framework via retinex and transmittance optimization for underwater image enhancement. PLoS ONE 2022, 17, e0275107. [Google Scholar] [CrossRef]
Hamza, A.B.; He, Y.; Krim, H.; Willsky, A.S. A multiscale approach to pixel-level image fusion. Integr. Comput.-Aided Eng. 2005, 12, 135–146. [Google Scholar] [CrossRef]
Tang, Y.; Gong, J.; Li, Y.; Zhang, G.; Yang, B.; Yang, Z. Wavelet Transform-Based Inertial Neural Network for Spatial Positioning Using Inertial Measurement Units. Remote Sens. 2024, 16, 2326. [Google Scholar] [CrossRef]
Dellaert, F.; Kaess, M. Factor Graphs for Robot Perception. Found. Trends® Robot. 2017, 6, 1–139. [Google Scholar] [CrossRef]
Cadena, C.; Carlone, L.; Carrillo, H.; Latif, Y.; Scaramuzza, D.; Neira, J.; Reid, I.; Leonard, J.J. Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age. IEEE Trans. Robot. 2016, 32, 1309–1332. [Google Scholar] [CrossRef]
Kaess, M.; Johannsson, H.; Roberts, R.; Ila, V.; Leonard, J.J.; Dellaert, F. iSAM2: Incremental smoothing and mapping using the Bayes tree. Int. J. Robot. Res. 2012, 31, 217–235. [Google Scholar] [CrossRef]
Wang, Z.; Wen, Z.; Xia, Q.; Cai, W. Deep Reinforcement Learning Based Multi-UUV Cooperative Control for Target Capturing. In Proceedings of the 2022 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech), Falerna, Italy, 12–15 September 2022; pp. 1–6. [Google Scholar] [CrossRef]
Xu, J.; Xie, G.; Tang, J.; Ding, Y.; Liu, W.; Zhang, S.; Li, Y. Never too Cocky to Cooperate: An FIM and RL-based USV-AUV Collaborative System for Underwater Tasks in Extreme Sea Conditions. arXiv 2025, arXiv:2504.14894. [Google Scholar]
Diao, Y.; Zhang, Y.; Martini, D.D.; Zhao, P.G.; Li, E.L. Task-Oriented Co-Design of Communication, Computing, and Control for Edge-Enabled Industrial Cyber-Physical Systems. arXiv 2025, arXiv:2503.08661. [Google Scholar] [CrossRef]
He, Z.; Wang, B.; Chen, Y.; Li, H.; Xie, Z.; Tan, K.; Ye, J.; Zhu, Y.; Chen, G. Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy. arXiv 2025, arXiv:2507.03159. [Google Scholar]
Savkin, A.V.; Verma, S.C.; Anstee, S. Optimal Navigation of an Unmanned Surface Vehicle and an Autonomous Underwater Vehicle Collaborating for Reliable Acoustic Communication with Collision Avoidance. Drones 2022, 6, 27. [Google Scholar] [CrossRef]
Kang, Y.; Zhao, L.; Cheng, J.; Wu, M.; Fan, X. A Novel Grid SINS/DVL Integrated Navigation Algorithm for Marine Application. Sensors 2018, 18, 364. [Google Scholar] [CrossRef]
Zhao, L.; Kang, Y.; Cheng, J.; Wu, M. A Fault-Tolerant Polar Grid SINS/DVL/USBL Integrated Navigation Algorithm Based on the Centralized Filter and Relative Position Measurement. Sensors 2019, 19, 3899. [Google Scholar] [CrossRef] [PubMed]
Yang, H.; Gao, X.; Huang, H.; Li, B.; Jiang, J. A Tightly Integrated Navigation Method of SINS, DVL, and PS Based on RIMM in the Complex Underwater Environment. Sensors 2022, 22, 9479. [Google Scholar] [CrossRef] [PubMed]
Ding, S.; Zhang, T.; Li, Y.; Xu, S.; Lei, M. Underwater multi-sensor fusion localization with visual-inertial-depth using hybrid residuals and efficient loop closing. Measurement 2024, 238, 115245. [Google Scholar] [CrossRef]
Shaukat, N.; Moinuddin, M.; Otero, P. Underwater Vehicle Positioning by Correntropy-Based Fuzzy Multi-Sensor Fusion. Sensors 2021, 21, 6165. [Google Scholar] [CrossRef] [PubMed]
Peng, X.; Qin, Z.; Tao, X.; Lu, J.; Hanzo, L. A Robust Semantic Text Communication System. IEEE Trans. Wirel. Commun. 2024, 23, 11372–11385. [Google Scholar] [CrossRef]
Grisetti, G.; Kümmerle, R.; Stachniss, C.; Burgard, W. A Tutorial on Graph-Based SLAM. IEEE Intell. Transp. Syst. Mag. 2010, 2, 31–43. [Google Scholar] [CrossRef]
Zhuang, L.; Chen, X.; Lu, W.; Yan, Y. Graph Matching for Underwater Simultaneous Localization and Mapping Using Multibeam Sonar Imaging. J. Mar. Sci. Eng. 2024, 12, 1859. [Google Scholar] [CrossRef]
Zhang, L.; Gao, Y.; Guan, L. Optimizing AUV Navigation Using Factor Graphs with Side-Scan Sonar Integration. J. Mar. Sci. Eng. 2024, 12, 313. [Google Scholar] [CrossRef]
Julier, S.; Uhlmann, J. A non-divergent estimation algorithm in the presence of unknown correlations. In Proceedings of the 1997 American Control Conference (Cat. No.97CH36041), Albuquerque, NM, USA, 4–6 June 1997; Volume 4, pp. 2369–2373. [Google Scholar] [CrossRef]
Sun, S.; Lin, H.; Ma, J.; Li, X. Multi-sensor distributed fusion estimation with applications in networked systems: A review paper. Inf. Fusion 2017, 38, 122–134. [Google Scholar] [CrossRef]
Huang, S.; Yamamoto, K. Innovation Sharing Distributed Kalman Filter with Packet Loss. J. Robot. Mechatron. 2024, 36, 680–688. [Google Scholar] [CrossRef]
Pfaff, F.; Noack, B.; Hanebeck, U.D.; Govaers, F.; Koch, W. Information form distributed Kalman filtering (IDKF) with explicit inputs. In Proceedings of the 2017 20th International Conference on Information Fusion (Fusion), Xi’an, China, 10–13 July 2017; pp. 1–8. [Google Scholar] [CrossRef]
Mahmoud, M.S. Distributed estimation based on information-based covariance intersection algorithms. Int. J. Adapt. Control Signal Process. 2016, 30, 750–778. [Google Scholar] [CrossRef]
Akhihiero, D.; Olawoye, U.; Das, S.; Gross, J. Cooperative Localization for GNSS-Denied Subterranean Navigation: A UAV–UGV Team Approach. Navig. J. Inst. Navig. 2024, 71, 139–160. [Google Scholar] [CrossRef]
Xue, K.; Wu, T. Distributed Consensus of USVs under Heterogeneous UAV-USV Multi-Agent Systems Cooperative Control Scheme. J. Mar. Sci. Eng. 2021, 9, 1314. [Google Scholar] [CrossRef]
Yin, J.; Wu, M.; Wang, J.; Li, Y. Polar Grid Navigation Algorithm for Unmanned Underwater Vehicles. Sensors 2017, 17, 1599. [Google Scholar] [CrossRef]
Särkkä, S. Recursive Bayesian Inference on Stochastic Differential Equations. Ph.D. Thesis, Helsinki University of Technology, Espoo, Finland, 2006. [Google Scholar]
Olfati-Saber, R. Distributed Kalman filtering for sensor networks. In Proceedings of the 2007 46th IEEE Conference on Decision and Control, New Orleans, LA, USA, 12–14 December 2007; pp. 5492–5498. [Google Scholar] [CrossRef]
Hou, B.; He, Z.; Zhou, X.; Zhou, H.; Li, D.; Wang, J. Maximum Correntropy Criterion Kalman Filter for α-Jerk Tracking Model with Non-Gaussian Noise. Entropy 2017, 19, 648. [Google Scholar] [CrossRef]
Jwo, D.J.; Chen, Y.L.; Cho, T.S.; Biswal, A. A Robust GPS Navigation Filter Based on Maximum Correntropy Criterion with Adaptive Kernel Bandwidth. Sensors 2023, 23, 9386. [Google Scholar] [CrossRef]
Fakoorian, S.; Izanloo, R.; Shamshirgaran, A.; Simon, D. Maximum Correntropy Criterion Kalman Filter with Adaptive Kernel Size. In Proceedings of the 2019 IEEE National Aerospace and Electronics Conference (NAECON), Dayton, OH, USA, 15–19 July 2019; pp. 581–584. [Google Scholar] [CrossRef]
Wang, G.; Gao, Z.; Zhang, Y.; Ma, B. Adaptive Maximum Correntropy Gaussian Filter Based on Variational Bayes. Sensors 2018, 18, 1960. [Google Scholar] [CrossRef]
Mu, R.; Chu, Y.; Zhang, H.; Liang, H. A multiple-step, randomly delayed, robust Cubature Kalman filter for spacecraft-relative navigation. Aerospace 2023, 10, 289. [Google Scholar] [CrossRef]
Agarwal, P.; Tipaldi, G.D.; Spinello, L.; Stachniss, C.; Burgard, W. Robust map optimization using dynamic covariance scaling. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, German, 6–10 May 2013; pp. 62–69. [Google Scholar] [CrossRef]
Liu, X.; Qu, H.; Zhao, J.; Yue, P.; Wang, M. Maximum correntropy unscented Kalman filter for spacecraft relative state estimation. Sensors 2016, 16, 1530. [Google Scholar] [CrossRef]
Wang, B.; Hu, T. Online gradient descent for kernel-based maximum correntropy criterion. Entropy 2019, 21, 644. [Google Scholar] [CrossRef] [PubMed]
Xiong, W.; Schindelhauer, C.; So, H.C.; Wang, Z. Maximum correntropy criterion for robust TOA-based localization in NLOS environments. Circuits Syst. Signal Process. 2021, 40, 6325–6339. [Google Scholar] [CrossRef]
Huang, S.; Yang, T.; Huang, C.F. Multipath correlations in underwater acoustic communication channels. J. Acoust. Soc. Am. 2013, 133, 2180–2190. [Google Scholar] [CrossRef]
Bhattacharyya, A. On a measure of divergence between two statistical populations defined by their probability distribution. Bull. Calcutta Math. Soc. 1943, 35, 99–110. [Google Scholar]
Kailath, T. The divergence and Bhattacharyya distance measures in signal selection. IEEE Trans. Commun. Technol. 2003, 15, 52–60. [Google Scholar] [CrossRef]
Gustafsson, F.; Gustafsson, F. Adaptive Filtering and Change Detection; Wiley: New York, NY, USA, 2000; Volume 1. [Google Scholar]
Zhang, Z.; Nie, Y.; Yin, L. A Robust Fault Detection Filter for Linear Time-Varying System with Non-Gaussian Noise. arXiv 2025, arXiv:2504.17648. [Google Scholar]
Feller, W. An Introduction to Probability Theory and Its Applications; John Wiley & Sons: Hoboken, NJ, USA, 1991; Volume 2. [Google Scholar]
Hussein, I.I.; Roscoe, C.; Wilkins, M.P.; Schumacher, P.W., Jr. Track-to-track association using Bhattacharyya divergence. In Proceedings of the Advanced Maui Optical and Space Surveillance Technologies Conference (AMOS 2015), Wailea, HI, USA, 15–18 September 2015. [Google Scholar]
Bi, S.; Beer, M.; Zhang, J.; Yang, L.; He, K. Optimization or Bayesian strategy? Performance of the Bhattacharyya distance in different algorithms of stochastic model updating. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part B Mech. Eng. 2021, 7, 020903. [Google Scholar] [CrossRef]
Davari, N.; Gholami, A.; Shabani, M. Multirate Adaptive Kalman Filter for Marine Integrated Navigation System. J. Navig. 2017, 70, 628–647. [Google Scholar] [CrossRef]
The MathWorks, Inc. IMU and GPS Fusion for Inertial Navigation. MATLAB & Simulink Documentation. 2025. Available online: https://www.mathworks.com/help/nav/ug/imu-and-gps-fusion-for-inertial-navigation.html (accessed on 8 September 2025).
Julier, S.; Uhlmann, J. Fusion of time delayed measurements with uncertain time delays. In Proceedings of the American Control Conference, Portland, Oregon, 8–10 June 2005; Volume 6, pp. 4028–4033. [Google Scholar] [CrossRef]
Abro, G.E.M.; Abdallah, A.M.; Zahid, F.; Ahmed, S. A Comprehensive Review of Next-Gen UAV Swarm Robotics: Optimisation Techniques and Control Strategies for Dynamic Environments. Intell. Autom. Soft Comput. 2025, 40, 99–123. [Google Scholar] [CrossRef]
Sagar, M.M.; Konara, M.; Picard, N.; Park, K. State-of-the-Art Navigation Systems and Sensors for Unmanned Underwater Vehicles (UUVs). Appl. Mech. 2025, 6, 10. [Google Scholar] [CrossRef]
Zhang, Y.; Jiang, H.; Bhatt, V.; Nikolaidis, S.; Li, J. Guidance Graph Optimization for Lifelong Multi-Agent Path Finding. arXiv 2024, arXiv:2402.01446. [Google Scholar] [PubMed]
Holmberg, T.E.; Ioup, E.; Abdelguerfi, M. Knowledge Graph-Based Multi-Agent Path Planning in Dynamic Environments using WAITR. arXiv 2024, arXiv:2412.1946. [Google Scholar]
Torrence, C.; Compo, G.P. A Practical Guide to Wavelet Analysis. Bull. Am. Meteorol. Soc. 1998, 79, 61–78. [Google Scholar] [CrossRef]
Wang, J.; Xu, T.; Wang, Z. Adaptive Robust Unscented Kalman Filter for AUV Acoustic Navigation. Sensors 2020, 20, 60. [Google Scholar] [CrossRef]
Wei, L.; Sun, M.; Peng, Z.; Guo, J.; Cui, J.; Qin, B.; Cui, J.H. A Channel-Aware AUV-Aided Data Collection Scheme Based on Deep Reinforcement Learning. J. Mar. Sci. Eng. 2025, 13, 1460. [Google Scholar] [CrossRef]
Liu, J.; Xu, Y.; Song, S.; Jiang, L. Reducing AUV Energy Consumption Through Dynamic Sensor Directions Switching via Deep Reinforcement Learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA, 25 February–4 March 2025; Volume 39, pp. 18843–18851. [Google Scholar]
Yu, T.; Zhang, Q.; Liu, T. Reinforcement learning approaches in the motion systems of autonomous underwater vehicles. Appl. Ocean Res. 2025, 161, 104682. [Google Scholar] [CrossRef]
Dermin, X.; Lei, G. Wavelet transform and its application to autonomous underwater vehicle control system fault detection. In Proceedings of the 2000 International Symposium on Underwater Technology (Cat. No.00EX418), Online, 26–26 May 2000; pp. 99–104. [Google Scholar] [CrossRef]
Fernandes, M.; Sahoo, S.R.; Kothari, M. Cooperative Localization for Autonomous Underwater Vehicles—A comprehensive review. arXiv 2023, arXiv:2307.06189. [Google Scholar]
Yuan, K.; Wang, Z.J. A Simple Self-Supervised IMU Denoising Method for Inertial Aided Navigation. IEEE Robot. Autom. Lett. 2023, 8, 944–950. [Google Scholar] [CrossRef]
Xie, H.; Qin, Z.; Tao, X.; Letaief, K.B. Task-Oriented Multi-User Semantic Communications. IEEE J. Sel. Areas Commun. 2022, 40, 2584–2597. [Google Scholar] [CrossRef]
Gündüz, D.; Qin, Z.; Aguerri, I.E.; Dhillon, H.S.; Yang, Z.; Yener, A.; Wong, K.K.; Chae, C.B. Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications. IEEE J. Sel. Areas Commun. 2023, 41, 5–41. [Google Scholar] [CrossRef]
Hu, Q.; Zhang, G.; Qin, Z.; Cai, Y.; Yu, G.; Li, G.Y. Robust Semantic Communications With Masked VQ-VAE Enabled Codebook. IEEE Trans. Wirel. Commun. 2023, 22, 8707–8722. [Google Scholar] [CrossRef]
Getu, T.M.; Saad, W.; Kaddoum, G.; Bennis, M. Performance Limits of a Deep Learning-Enabled Text Semantic Communication Under Interference. IEEE Trans. Wirel. Commun. 2024, 23, 10213–10228. [Google Scholar] [CrossRef]
Xin, G.; Fan, P.; Letaief, K.B. Semantic Communication: A Survey of Its Theoretical Development. Entropy 2024, 26, 102. [Google Scholar] [CrossRef]
Shao, J.; Mao, Y.; Zhang, J. Task-Oriented Communication for Multidevice Cooperative Edge Inference. IEEE Trans. Wirel. Commun. 2023, 22, 73–87. [Google Scholar] [CrossRef]
Salvato, E.; Fenu, G.; Medvet, E.; Pellegrino, F.A. Crossing the Reality Gap: A Survey on Sim-to-Real Transferability of Robot Controllers in Reinforcement Learning. IEEE Access 2021, 9, 153171–153187. [Google Scholar] [CrossRef]
Palomeras, N.; Ridao, P. Autonomous Underwater Vehicle Docking Under Realistic Assumptions Using Deep Reinforcement Learning. Drones 2024, 8, 673. [Google Scholar] [CrossRef]
Alajlan, N.N.; Ibrahim, D.M. TinyML: Enabling of Inference Deep Learning Models on Ultra-Low-Power IoT Edge Devices for AI Applications. Micromachines 2022, 13, 851. [Google Scholar] [CrossRef]
Heydari, S.; Mahmoud, Q.H. Tiny Machine Learning and On-Device Inference: A Survey of Applications, Challenges, and Future Directions. Sensors 2025, 25, 3191. [Google Scholar] [CrossRef] [PubMed]
Jamming and Spoofing: Two Threats for Your UAS GNC System. 2024. Available online: https://www.uavnavigation.com/company/blog/jamming-and-spoofing-two-threats-your-uas-gnc-system (accessed on 14 July 2025).
Hickling, T.; Hogan, M.; Tammam, A.; Aouf, N. Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application. arXiv 2025, arXiv:2502.20326. [Google Scholar] [CrossRef]
Polar Challenge: Towards Long-Term Under-Ice Observations in the Polar Regions. 2016. Available online: https://www.unesco.org/en/articles/polar-challenge-towards-long-term-under-ice-observations-polar-regions (accessed on 14 July 2025).
Alrefaei, F.; Alzahrani, A.; Song, H.; Alrefaei, S. A Survey on the Jamming and Spoofing attacks on the Unmanned Aerial Vehicle Networks. In Proceedings of the 2022 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS), Toronto, ON, Canada, 1–4 June 2022; pp. 1–7. [Google Scholar] [CrossRef]
Meheretu, S.E.; Nigussie, E.; Hailesilassie, S.Y.; Gebremeskel, G.B. A Critical Analysis of Spoofing and Jamming Approaches in UAV Navigation. Research Square; PREPRINT (Version 1). 2024. Available online: https://www.researchsquare.com/article/rs-5370811/v1 (accessed on 8 September 2025).
Friess, C.; Niculescu, V.; Polonelli, T.; Magno, M.; Benini, L. Fully Onboard SLAM for Distributed Mapping With a Swarm of Nano-Drones. IEEE Internet Things J. 2024, 11, 32363–32380. [Google Scholar] [CrossRef]
Barker, L.D.L.; Jakuba, M.V.; Bowen, A.D.; German, C.R.; Maksym, T.; Mayer, L.; Boetius, A.; Dutrieux, P.; Whitcomb, L.L. Scientific Challenges and Present Capabilities in Underwater Robotic Vehicle Design and Navigation for Oceanographic Exploration Under-Ice. Remote Sens. 2020, 12, 2588. [Google Scholar] [CrossRef]
Freitag, L.; Koski, P.; Morozov, A.; Singh, S.; Partan, J. Acoustic communications and navigation under Arctic ice. In Proceedings of the 2012 Oceans, Yeosu, Republic of Korea, 21–24 May 2012; pp. 1–8. [Google Scholar] [CrossRef]
Paturel, Y.; Lacambre, J.B.; Patin, F.; Moynagh, C. Inertial navigation at high latitude: Trials and test results. In Proceedings of the OCEANS 2015-MTS/IEEE Washington, Washington, DC, USA, 19–22 October 2015; pp. 1–5. [Google Scholar] [CrossRef]
Yan, Z.; Wang, L.; Wang, T.; Yang, Z.; Chen, T.; Xu, J. Polar Cooperative Navigation Algorithm for Multi-Unmanned Underwater Vehicles Considering Communication Delays. Sensors 2018, 18, 1044. [Google Scholar] [CrossRef] [PubMed]
Fan, S.; Bose, N.; Liang, Z. Polar AUV Challenges and Applications: A Review. Drones 2024, 8, 413. [Google Scholar] [CrossRef]
Miranda, M.; Takei, N.; Miyazawa, Y.; Kozuma, M. Multi-Harmonic Modulation in a Fiber-Optic Gyroscope. Sensors 2023, 23, 4442. [Google Scholar] [CrossRef]
Lunkov, A.; Sidorov, D.; Petnikov, V. Horizontal Refraction of Acoustic Waves in Shallow-Water Waveguides Due to an Inhomogeneous Bottom Structure. J. Mar. Sci. Eng. 2021, 9, 1269. [Google Scholar] [CrossRef]
Zhang, Q.; Kim, J. Feature-Based Global Localization for Underwater Terrain Aided Navigation Using Bag of Words. In Proceedings of the OCEANS 2024, Singapore, 14–18 April 2024; pp. 1–5. [Google Scholar] [CrossRef]
Wang, R.; Wang, J.; Li, Y.; Ma, T.; Zhang, X. Research Advances and Prospects of Underwater Terrain-Aided Navigation. Remote Sens. 2024, 16, 2560. [Google Scholar] [CrossRef]
Wu, S.; Li, Z.; Qin, J.; Wang, M.; Li, W. The Effects of Sound Speed Profile to the Convergence Zone in Deep Water. J. Mar. Sci. Eng. 2022, 10, 424. [Google Scholar] [CrossRef]
Huang, W.; Zhou, J.; Gao, F.; Wang, J.; Xu, T. Experimental Results of Underwater Sound Speed Profile Inversion by Few-Shot Multi-Task Learning. Remote Sens. 2024, 16, 167. [Google Scholar] [CrossRef]
Zhao, S.; Liu, H.; Xue, S.; Wang, Z.; Xiao, Z. Two-Step Correction Based on In-Situ Sound Speed Measurements for USBL Precise Real-Time Positioning. Remote Sens. 2023, 15, 5046. [Google Scholar] [CrossRef]
Bhatt, E.C.; Viquez, O.; Schmidt, H. Under-ice acoustic navigation using real-time model-aided range estimationa). J. Acoust. Soc. Am. 2022, 151, 2656–2671. [Google Scholar] [CrossRef]
Evers, C.; Naylor, P.A. Acoustic SLAM. IEEE/ACM Trans. Audio Speech Lang. Process. 2018, 26, 1484–1498. [Google Scholar] [CrossRef]
Granström, K.; Fatemi, M.; Svensson, L. Poisson Multi-Bernoulli Mixture Conjugate Prior for Multiple Extended Target Filtering. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 208–225. [Google Scholar] [CrossRef]
Mullane, J.; Vo, B.N.; Adams, M.D.; Vo, B.T. A Random-Finite-Set Approach to Bayesian SLAM. IEEE Trans. Robot. 2011, 27, 268–282. [Google Scholar] [CrossRef]
Zhu, Z.; Zhang, L.; Liu, L.; Wu, D.; Bai, S.; Ren, R.; Geng, W. An Efficient Multi-AUV Cooperative Navigation Method Based on Hierarchical Reinforcement Learning. J. Mar. Sci. Eng. 2023, 11, 1863. [Google Scholar] [CrossRef]
Yang, L.; Zhao, S.; Wang, X.; Shen, P.; Zhang, T. Deep-Sea Underwater Cooperative Operation of Manned/Unmanned Submersible and Surface Vehicles for Different Application Scenarios. J. Mar. Sci. Eng. 2022, 10, 909. [Google Scholar] [CrossRef]
Ferri, G.; Paoletti, A.; Orlando, G. Cooperative robotic networks for underwater surveillance: An information-theoretic perspective. IET Radar, Sonar Navig. 2017, 11, 755–766. [Google Scholar] [CrossRef]
Wang, X.; Fan, X.; Shi, P.; Ni, J.; Zhou, Z. An Overview of Key SLAM Technologies for Underwater Scenes. Remote Sens. 2023, 15, 2496. [Google Scholar] [CrossRef]
Curado Teixeira, F.; Quintas, J.; Pascoal, A. AUV terrain-aided navigation using a Doppler velocity logger. Annu. Rev. Control 2016, 42, 166–176. [Google Scholar] [CrossRef]
Anonsen, K.; Hallingstad, O. Terrain Aided Underwater Navigation Using Point Mass and Particle Filters. In Proceedings of the 2006 IEEE/ION Position, Location, And Navigation Symposium, Coronado, CA, USA, 24–27 April 2006; pp. 1027–1035. [Google Scholar] [CrossRef]
Eustice, R.M.; Pizarro, O.; Singh, H. Visually Augmented Navigation for Autonomous Underwater Vehicles. IEEE J. Ocean. Eng. 2008, 33, 103–122. [Google Scholar] [CrossRef]
Ma, T.; Ding, S.; Li, Y.; Fan, J. A review of terrain aided navigation for underwater vehicles. Ocean Eng. 2023, 281, 114779. [Google Scholar] [CrossRef]
Yang, J.; Chen, Y.; Du, S.; Chen, B.; Principe, J.C. IA-LSTM: Interaction-Aware LSTM for Pedestrian Trajectory Prediction. IEEE Trans. Cybern. 2024, 54, 3904–3917. [Google Scholar] [CrossRef] [PubMed]
Mao, G.; Fidan, B. (Eds.) Localization Algorithms and Strategies for Wireless Sensor Networks; IGI Global: Hershey, PA, USA, 2009. [Google Scholar]
Julier, S.J.; Uhlmann, J.K. A New Extension of the Kalman Filter to Nonlinear Systems. In Proceedings of the SPIE-The International Society for Optical Engineering, Orlando, FL, USA, 20–25 April 1997; Volume 3068, pp. 182–193. [Google Scholar] [CrossRef]
Pasek, P.; Kaniewski, P. A review of consensus algorithms used in Distributed State Estimation for UAV swarms. In Proceedings of the 2022 IEEE 16th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET), Lviv-Slavske, Ukraine, 22–26 February 2022; pp. 472–477. [Google Scholar] [CrossRef]
Chu, Z.; Wang, F.; Lei, T.; Luo, C. Path Planning Based on Deep Reinforcement Learning for Autonomous Underwater Vehicles Under Ocean Current Disturbance. IEEE Trans. Intell. Veh. 2023, 8, 108–120. [Google Scholar] [CrossRef]
Fang, Z.; Wang, J.; Ma, Y.; Tao, Y.; Deng, Y.; Chen, X.; Fang, Y. R-ACP: Real-Time Adaptive Collaborative Perception Leveraging Robust Task-Oriented Communications. arXiv 2024, arXiv:2410.04168. [Google Scholar] [CrossRef]
Abci, B.; Nader, J.; El Badaoui El Najjar, M.; Cocquempot, V. Fault-Tolerant Multi-sensor Fusion and Thresholding Based on the Bhattacharyya Distance with Application to a Multi-robot System. In Proceedings of the 15th European Workshop on Advanced Control and Diagnosis, Bologna, Italy, 21–22 November 2018; Springer: Berlin/Heidelberg, Germany, 2018; pp. 347–364. [Google Scholar]
Venanzi, M.; Rogers, A.; Jennings, N.R. Trust-based fusion of untrustworthy information in crowdsourcing applications. In Proceedings of the 2013 International Conference on Autonomous Agents and Multi-Agent Systems, Richland, SC, USA, 6–10 May 2013; AAMAS ’13; pp. 829–836. [Google Scholar]
Xiong, M.; Zhuo, J.; Dong, Y.; Jing, X. A Layout Strategy for Distributed Barrage Jamming against Underwater Acoustic Sensor Networks. J. Mar. Sci. Eng. 2020, 8, 252. [Google Scholar] [CrossRef]
Diamant, R.; Casari, P.; Tomasin, S. Cooperative Authentication in Underwater Acoustic Sensor Networks. IEEE Trans. Wirel. Commun. 2019, 18, 954–968. [Google Scholar] [CrossRef]
Yang, G.; Dai, L.; Wei, Z. Challenges, Threats, Security Issues and New Trends of Underwater Wireless Sensor Networks. Sensors 2018, 18, 3907. [Google Scholar] [CrossRef]
Osanaiye, O.; Alfa, A.S.; Hancke, G.P. A Statistical Approach to Detect Jamming Attacks in Wireless Sensor Networks. Sensors 2018, 18, 1691. [Google Scholar] [CrossRef]
Wang, Q.; Peng, B.; Xie, P.; Cheng, C. A Novel Data-Driven Fault Detection Method Based on Stable Kernel Representation for Dynamic Systems. Sensors 2023, 23, 5891. [Google Scholar] [CrossRef]
Cao, L.; Sheng, W.; Zhang, F.; Du, K.; Fu, C.; Song, P. Face Manipulation Detection Based on Supervised Multi-Feature Fusion Attention Network. Sensors 2021, 21, 8181. [Google Scholar] [CrossRef] [PubMed]
Leland, K.; Taylor, C.; Graas, F.v. Factor-Graph Optimization for Robust Navigation via High-Precision GNSS/IMU Corroboration. In Proceedings of the 2025 IEEE/ION Position, Location and Navigation Symposium (PLANS), Salt Lake City, UT, USA, 28 April–1 May 2025; pp. 97–104. [Google Scholar] [CrossRef]
Fakoorian, S.; Mohammadi, A.; Azimi, V.; Simon, D. Robust Kalman-Type Filter for Non-Gaussian Noise: Performance Analysis With Unknown Noise Covariances. J. Dyn. Syst. Meas. Control 2019, 141, 091011. [Google Scholar] [CrossRef]
Wen, W.; Kan, Y.C.; Hsu, L.T. Performance Comparison of GNSS/INS Integrations Based on EKF and Factor Graph Optimization. In Proceedings of the 32nd International Technical Meeting of the Satellite Division of The Institute of Navigation (ION GNSS+ 2019), Miami, FL, USA, 16–20 September 2019; pp. 3019–3032. [Google Scholar]
Olfati-Saber, R.; Fax, J.A.; Murray, R.M. Consensus and Cooperation in Networked Multi-Agent Systems. Proc. IEEE 2007, 95, 215–233. [Google Scholar] [CrossRef]
Huang, Y.; Zhang, Y.; Xu, B.; Wu, Z.; Chambers, J.A. A New Adaptive Extended Kalman Filter for Cooperative Localization. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 353–368. [Google Scholar] [CrossRef]
Xu, B.; Wang, X.; Zhang, J.; Guo, Y.; Razzaqi, A.A. A Novel Adaptive Filtering for Cooperative Localization Under Compass Failure and Non-Gaussian Noise. IEEE Trans. Veh. Technol. 2022, 71, 3737–3749. [Google Scholar] [CrossRef]
Zhang, L.; Guay, M.; Wang, S.; Lu, M. Completely Distributed State Estimation for Jointly Observable Uncertain Linear Systems. IEEE Trans. Autom. Control 2025, 70, 7063–7070. [Google Scholar] [CrossRef]
Singh, R.K.; Saha, J.; Bhaumik, S. Maximum correntropy polynomial chaos Kalman filter for underwater navigation. Digit. Signal Process. 2024, 155, 104774. [Google Scholar] [CrossRef]
Bao, J.; Mu, X.; Yu, X.; Zhu, Z.; Qin, H. Outlier-Robust Underwater Navigation Using a Dual-Robust-Kernel Kalman Filter. IEEE Signal Process. Lett. 2025, 32, 1485–1489. [Google Scholar] [CrossRef]
Li, P.; Sun, X.; Chen, Z.; Zhang, X.; Yan, T.; He, B. A Robust and Adaptive AUV Integrated Navigation Algorithm Based on a Maximum Correntropy Criterion. Electronics 2024, 13, 2426. [Google Scholar] [CrossRef]
Shi, B.; Lin, W.; Ouyang, W.; Shen, C.; Sun, S.; Sun, Y.; Sun, L. BA-CLM: A Globally Consistent 3D LiDAR Mapping Based on Bundle Adjustment Cost Factors. Sensors 2024, 24, 5554. [Google Scholar] [CrossRef]
Xu, B.; Wang, X.; Zhang, J.; Razzaqi, A.A. Maximum correntropy delay Kalman filter for SINS/USBL integrated navigation. ISA Trans. 2021, 117, 274–287. [Google Scholar] [CrossRef]
Fan, Y.; Zhang, Y.; Wang, G.; Wang, X.; Li, N. Maximum Correntropy Based Unscented Particle Filter for Cooperative Navigation with Heavy-Tailed Measurement Noises. Sensors 2018, 18, 3183. [Google Scholar] [CrossRef]
Chen, W.; Chi, W.; Ji, S.; Ye, H.; Liu, J.; Jia, Y.; Yu, J.; Cheng, J. A survey of autonomous robots and multi-robot navigation: Perception, planning and collaboration. Biomim. Intell. Robot. 2025, 5, 100203. [Google Scholar] [CrossRef]
Paull, L.; Saeedi, S.; Seto, M.; Li, H. AUV Navigation and Localization: A Review. IEEE J. Ocean. Eng. 2014, 39, 131–149. [Google Scholar] [CrossRef]
Chen, S.; Wang, N.; Chen, T.; Yang, Y.; Tian, J. Confidence check-adaptive federated Kalman filter and its application in underwater vehicle integrated navigation. Chin. J. Ship Res. 2022, 17, 203–211. [Google Scholar] [CrossRef]
Paull, L.; Seto, M.; Leonard, J.J. Decentralized cooperative trajectory estimation for autonomous underwater vehicles. In Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, Chicago, IL, USA, 14–18 September 2014; pp. 184–191. [Google Scholar] [CrossRef]
Zhu, J.; Kia, S.S. Cooperative Localization Under Limited Connectivity. IEEE Trans. Robot. 2019, 35, 1523–1530. [Google Scholar] [CrossRef]
Acarer, T. Energy-Aware Path Planning by Autonomous Underwater Vehicle in Underwater Wireless Sensor Networks for Safer Maritime Transportation. Int. J. Interact. Multimed. Artif. Intell. 2024, 8, 15–27. [Google Scholar] [CrossRef]
Thompson, F.; Galeazzi, R.; Guihen, D. Field trials of an energy-aware mission planner implemented on an autonomous surface vehicle. J. Field Robot. 2020, 37, 1040–1062. [Google Scholar] [CrossRef]
Wang, C.; Cheng, C.; Yang, D.; Pan, G.; Zhang, F. Underwater AUV Navigation Dataset in Natural Scenarios. Electronics 2023, 12, 3788. [Google Scholar] [CrossRef]
Radulov, N.; Zhang, Y.; Bujanca, M.; Ye, R.; Luján, M. A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems. In Proceedings of the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Abu Dhabi, United Arab Emirates, 14–18 October 2024; pp. 14225–14232. [Google Scholar] [CrossRef]
Álvarez Tuñón, O.; Marnet, L.R.; Aubard, M.; Antal, L.; Costa, M.; Brodskiy, Y. SubPipe: A Submarine Pipeline Inspection Dataset for Segmentation and Visual-inertial Localization. In Proceedings of the OCEANS 2024, Singapore, 14–18 April 2024; pp. 1–7. [Google Scholar] [CrossRef]
Cao, W.; Chen, K.; Cheng, E. Joint optimization of AoI and energy for AUV-assisted data collection in underwater acoustic sensor networks. Front. Mar. Sci. 2025, 12, 1580751. [Google Scholar] [CrossRef]
Dai, Q.; Wan, R.; Han, S.Y.; Xiao, G.R. A novel adaptive Gaussian sum cubature Kalman filter with time-varying non-Gaussian noise for GNSS/SINS tightly coupled integrated navigation system. Front. Astron. Space Sci. 2025, 12, 1436270. [Google Scholar] [CrossRef]
Chen, K.; Zhang, P.; You, L.; Sun, J. Research on Kalman Filter Fusion Navigation Algorithm Assisted by CNN-LSTM Neural Network. Appl. Sci. 2024, 14, 5493. [Google Scholar] [CrossRef]
Zheng, S.; Liu, S.; Wang, L. Event-triggered distributed optimization for model-free multi-agent systems. Front. Inf. Technol. Electron. Eng. 2024, 25, 214–224. [Google Scholar] [CrossRef]
Tabatabaei, H.; Gallo, A.J.; Al-Dabbagh, A.W. Secure state and output estimation for accommodation of false data injection attacks in large-scale systems. Automatica 2025, 180, 112460. [Google Scholar] [CrossRef]
Bidel, Y.; Zahzam, N.; Bresson, A.; Blanchard, C.; Cadoret, M.; Olesen, A.V.; Forsberg, R. Absolute airborne gravimetry with a cold atom sensor. J. Geod. 2020, 94, 20. [Google Scholar] [CrossRef]
Canuel, B.; Leduc, F.; Holleville, D.; Gauguet, A.; Fils, J.; Virdis, A.; Clairon, A.; Dimarcq, N.; Bordé, C.J.; Landragin, A.; et al. Six-Axis Inertial Sensor Using Cold-Atom Interferometry. Phys. Rev. Lett. 2006, 97, 010402. [Google Scholar] [CrossRef] [PubMed]
Santoni de Sio, F.; van den Hoven, J. Meaningful Human Control over Autonomous Systems: A Philosophical Account. Front. Robot. AI 2018, 5, 15. [Google Scholar] [CrossRef]

Figure 1. Core constraints and challenges of UUV CN.

Figure 2. Flowchart comparing Recursive Filtering and batch optimization paradigms.

Figure 3. Comparison of cost functions for MSE and MCC−based filters. (a) Mean Square Error (MSE) Cost Function; (b) Maximum Correntropy Criterion (MCC) Cost Function. Unlike the quadratic MSE cost, which is highly sensitive to outliers, the MCC cost function saturates for large errors [68], demonstrating its inherent robustness.

Figure 4. Hierarchical Architecture Diagram of Heterogeneous UUV-USV CN System.

Figure 5. Multi-Dimensional Performance Radar Chart of Key CN Algorithms. (Note: For visualization, some metrics are normalized or inverted so a LARGER area corresponds to BETTER overall performance).

Table 1. UAC channel characteristics and their modeling parameters.

Characterization	Impact on CN	Typical Values/Range	Common Modeling Techniques and Key References
Propagation delay	State information is asynchronous, and observations are out of date	Approx. 0.67 ms/m	Deterministic models ( $τ = \frac{d}{c}$ ), integral models, stochastic processes (Gaussian processes, Markov chains) [21], KDE [22]
Bandwidth constraints	Limited amount of communication data, low frequency of information updates	Data rate: typically, a few kbps; Update frequency: often <1 Hz	Shannon capacity model [29], Task-Oriented Communication (TOC) [29]
Multipath fading	Interference, dramatic fluctuations in signal strength, and generation of non-Gaussian heavy-tailed models	Delay extension up to 100–500 ms in deep water or complex terrain	Ray tracing (Bellhop [27]), Rayleigh/Rice fading models [30]
Doppler shift	Carrier frequency shift, synchronization difficulties	1 ms relative velocity at 1 kHz center frequency results in ∼1 Hz frequency shift	Linear frequency shift model, Doppler effect model [8]
Time-varying	Rapidly changing channel characteristics and high adaptive algorithm requirements	Second-to-minute variations, strong correlation with waves, tides, etc.	Time-varying impulse response model, autocorrelation function [31]
Packet loss	Loss of cooperative information	5–30% (depending on environment)	Bernoulli processes, Gilbert-Elliott model [22]

Table 2. Challenges and Strategies for Heterogeneous Systems and Extreme Environment Operations.

Operational Scenarios	Key Navigation and Communication Challenges	Cutting-Edge Strategies/Architectures	Key References
Heterogeneous Fleets (UUV-USV-AUV)	Data fusion between different sensor modalities; Maintaining formation with diverse vehicle dynamics; Communication management.	Tightly coupled fusion architectures; Use of USV as a “mothership” for GNSS reference and communication relay; Adaptive control algorithms.	[50,65,119,124,134]
Polar Region Operations	Meridian convergence rendering lat/lon unusable; Unreliable magnetic compass for heading; Severe acoustic multipath from ice canopy.	Polar stereographic grid frames and Grid North for geodetic reference; High-grade inertial sensors (FOG/RLG); Advanced multipath compensation techniques.	[65,119,124,135,136]
Deep Sea/Complex Terrain	Highly variable and uncertain SSP; Feature-sparse environments for SLAM; Extreme multipath in canyons.	Online SSP estimation for adaptive acoustic modeling; Advanced Data Association (DA) methods like JPDA and PMBM filters for robust acoustic SLAM.	[124,136,137,138,139]

Table 3. Quantitative Performance Comparison of State Estimation Algorithms under a Standardized UUV Navigation Benchmark.

Algorithm Category	Accuracy (Position RMSE)	Relative Comp. Complexity	Comm. Load	Robustness (vs. Non-Gaussian Noise)	Scalability (Complexity Growth)
Algorithm Category	Nominal (m)	DVL ¹ Partial Failure (m)	Comm. Load	Robustness (vs. Non-Gaussian Noise)	Scalability (Complexity Growth)
EKF (Baseline)	2.5	>25.0	1.0×	Medium	Low
AKF	1.8	>20.0	∼1.5×	Medium	Medium
Information Filter (IF)	2.5	>25.0	1.0×	Low	High
Consensus Filter	2.0	∼18.0	∼1.8×	High	High
MCC-KF	1.5	∼10.0	∼2.0×	High	Very High
Wavelet-aided Fusion	1.2	∼8.0	∼2.5×	Medium	High
Factor Graph (Sliding Window)	0.8	1.0	>3.0×	High	High

¹ DVL: Doppler Velocity Log.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, S.; López-Benítez, M.; Lim, E.G.; Ma, F.; Cao, M.; Yu, L.; Qin, X. Enabling Cooperative Autonomy in UUV Clusters: A Survey of Robust State Estimation and Information Fusion Techniques. Drones 2025, 9, 752. https://doi.org/10.3390/drones9110752

AMA Style

Li S, López-Benítez M, Lim EG, Ma F, Cao M, Yu L, Qin X. Enabling Cooperative Autonomy in UUV Clusters: A Survey of Robust State Estimation and Information Fusion Techniques. Drones. 2025; 9(11):752. https://doi.org/10.3390/drones9110752

Chicago/Turabian Style

Li, Shuyue, Miguel López-Benítez, Eng Gee Lim, Fei Ma, Mengze Cao, Limin Yu, and Xiaohui Qin. 2025. "Enabling Cooperative Autonomy in UUV Clusters: A Survey of Robust State Estimation and Information Fusion Techniques" Drones 9, no. 11: 752. https://doi.org/10.3390/drones9110752

APA Style

Li, S., López-Benítez, M., Lim, E. G., Ma, F., Cao, M., Yu, L., & Qin, X. (2025). Enabling Cooperative Autonomy in UUV Clusters: A Survey of Robust State Estimation and Information Fusion Techniques. Drones, 9(11), 752. https://doi.org/10.3390/drones9110752

Article Menu

Enabling Cooperative Autonomy in UUV Clusters: A Survey of Robust State Estimation and Information Fusion Techniques

Highlights

Abstract

1. Introduction

1.1. Critical Role of Sensor Fusion in Cooperative Navigation

1.2. Motivation and Challenges

1.3. Overview of Algorithmic Approaches

1.4. Research Gap

1.5. Contribution

1.6. Paper Structure

2. Hydroacoustic Communication Channel Modeling and Its Impact on Navigation

2.1. Physical Basis of Hydroacoustic Propagation

2.2. Classification of Delay and Packet Loss Models

2.2.1. Delay Modeling

2.2.2. Packet Loss Modeling

2.3. Data-Driven Channel Prediction Methods

2.4. High-Fidelity Simulation: Tools, Techniques, and Datasets

2.5. Summary

3. State Estimation and Information Reconstruction Under Uncertainty

3.1. Asynchrony and the Challenge of the Information Divide

3.2. Model-Based Prediction and Compensation Techniques

3.3. Paradigm Battle: Recursive Filtering vs. Batch Optimization

3.4. Learning-Based Information Reconstruction

3.5. Summary

4. Fusion Algorithms and Architectures

4.1. Fusion Architecture Framework

4.2. Bayesian Filtering: The EKF, UKF, and Particle Filters

4.3. Robust Filtering Techniques: Adaptive, Consensus, and Information-Theoretic Approaches

4.4. Information-Theoretic Fusion and Fault Detection: Bhattacharyya Distance

4.5. Multiresolution Fusion: Wavelet Analysis and Multirate Architectures

4.6. Summary

5. Intelligent Strategies and Adaptive Collaboration

5.1. DRL for Collaborative Control and Decision-Making

5.2. TOC: Bandwidth Optimization Based on the Information Bottleneck Principle

5.3. Active and Bionic Sensing for Enhanced Situational Awareness

5.4. Summary

6. Heterogeneous Systems and Extreme Environments

6.1. Collaborative Navigation of Heterogeneous Fleets (UUV-USV-AUV)

6.2. Navigation Frontiers I: Polar Region Operations

6.3. Navigation Frontiers II: Deep Sea and Complex Terrain Localization

6.4. Summary

7. System-Level Challenges: Safety, Assessment, and Deployment

7.1. Securing CN: Resisting Spoofing and Jamming Attacks

7.2. Multi-Dimensional Performance Evaluation Framework

7.3. Comprehensive Quantitative Comparative Analysis of Frontier Methods

7.4. Summary

8. Synthesis, Key Challenges, and Future Roadmap

8.1. Synthesis of Key Findings and Technology Evolution

8.2. Challenges and Unsolved Problems

8.3. Future Research Trajectories: A Roadmap for the Next Decade

8.3.1. Short-Term Goals (1–3 Years)

8.3.2. Medium-Term Goals (3–7 Years)

8.3.3. Long-Term Goals (7+ Years)

9. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI