1. Introduction
With the growing demand for unmanned underwater operations, autonomous underwater vehicles (AUVs) have been widely utilized in fields such as underwater exploration, environmental monitoring, energy extraction, and marine research [
1,
2,
3]. However, the limitations of traditional single-body AUVs have become increasingly evident in deep-sea or complex environments, highlighting the urgent need for multi-body collaborative systems with greater payload capacity and higher adaptability. As an innovative platform, the Multiple AUVs in Tandem Connection (MATC) system utilizes a modular design and distributed task allocation, enabling the system to flexibly adjust the number of units and their functional distribution based on mission requirements [
4]. This approach offers greater flexibility and scalability compared to traditional AUVs, making it well-suited for challenging underwater operations.
The MATC system comprises multiple interconnected AUV units, each independently capable of carrying sensors, payloads, or other equipment required for specific tasks, thereby enhancing flexibility and payload capacity. Additionally, tandem underwater vehicles offer significant advantages in terms of system fault tolerance [
5], maneuverability [
6,
7], as well as navigation and positioning [
8]. Compared to traditional single-body AUVs, the MATC system demonstrates superior task adaptability, allowing for expansion and adjustments to suit various application scenarios. For example, in underwater exploration or environmental monitoring missions, the arrangement and operational modes of the AUVs can be customized to meet specific mission requirements and environmental conditions.
Due to the interconnected nature of multiple AUV units, the MATC system exhibits the characteristics of a distributed thrust system in its design [
9]. The propellers of each AUV unit, typically located on both sides, work together to propel the entire system forward. Furthermore, the MATC system often incorporates flexible components, such as extendable springs, to minimize rigid collisions between units. Accurately controlling the relative displacement between units and ensuring stable straight-line motion of the entire system are critical for the successful execution of MATC system missions.
However, due to the distributed nature of the system’s propulsion, the relative motion and coupled dynamic effects between units make both overall speed control and inter-unit distance management particularly complex. The relative displacement between units not only impacts the system’s stability but also increases the risk of collisions and deviations from the planned trajectory, especially in high-speed or complex underwater environments. Ensuring steady straight-line motion thus becomes a significant challenge. Moreover, as the number of units’ increases, conventional thrust allocation methods lack the real-time efficiency needed for large-scale systems, highlighting the necessity of improving the efficiency of online thrust allocation methods.
While advancements in thrust allocation methods for distributed propulsion platforms, such as land trains, have been well-documented, to the best of our knowledge, no existing literature directly addresses the specific challenges of thrust allocation in multi-AUV systems with varying unit numbers and dynamic constraints. Land trains provide a representative example of distributed propulsion platforms. With advancements in control theory, thrust allocation methods for land trains have evolved over time. These methods primarily include optimal control [
10], fuzzy control [
11], adaptive control [
12], and data-driven control [
13]. For instance, Liu et al. developed a cooperative model predictive control (CMPC) strategy incorporating a multi-objective rolling optimization scheme [
14]. Key optimization objectives included maximizing overall line capacity, minimizing energy consumption, and enhancing ride comfort for train platoons operating within the railway section. To address the multi-mode operating characteristics of trains, such as acceleration, deceleration, and constant speed, Dong et al. successfully improved train operational efficiency and safety through fuzzy control [
15]. To overcome challenges related to high-precision speed tracking under uncertainties in train parameters and external resistance, Song et al. proposed a robust adaptive control algorithm [
16]. Additionally, Cheng et al. developed a data-driven thrust allocation method that simultaneously ensures the tracking safety of multiple trains and the operational efficiency of individual trains [
17]. However, most existing thrust allocation methods for land trains do not consider thrust constraints, making them unsuitable for direct application to the steady control of MATC systems. Furthermore, current models of tandem underwater vehicles are relatively simple, with fewer platform thrust units, and existing research has primarily focused on attitude control rather than thrust allocation for steady sailing.
AUV units in MATC systems face significant challenges due to thrust input saturation and uncertain ocean current disturbances [
18], which complicate effective thrust allocation. The key innovation of this study lies in its hybrid approach to thrust allocation in MATC systems. For smaller systems with thrust constraints, an improved Genetic Algorithm (GA) is introduced, specifically designed to account for constraint conditions, ensuring precise and feasible solutions [
19,
20]. For larger systems, the Harris Hawks Optimization (HHO)+BP algorithm is employed, enabling efficient offline thrust allocation and significantly reducing computational overhead for real-time implementation. The advancements of this study lie in the development of the hybrid approach that effectively addresses both discrete and continuous optimization problems in thrust allocation for MATC systems. This method demonstrates remarkable scalability, seamlessly adapting to systems with varying numbers of units, from small configurations to large-scale setups.
In certain practical application scenarios, such as fixed-altitude seabed exploration, the motion of the MATC system is primarily confined to the horizontal plane, where the effects of vertical motion are minimal. In addition, by simplifying the model to concentrate on horizontal dynamics, we can more effectively analyze how thrust allocation influences system speed and inter-unit distance. Therefore, this study focuses on controlling straight-line motion within a two-dimensional horizontal plane.
This study not only provides a novel solution to the steady straight-line sailing challenge in MATC systems but also establishes a theoretical foundation for multi-body coordination control in complex underwater environments. The successful application of this method paves the way for the broader deployment of multi-body vehicles in future deep-sea exploration, environmental monitoring, and other underwater tasks, offering substantial academic and practical significance.
2. System Modeling of MATC
To achieve steady straight-line sailing in an MATC system, the first step is to establish a dynamic model of the system. Force analysis, as the foundation of system modeling, plays a critical role in understanding and designing the system. By conducting a thorough force analysis, the interactions and effects of various forces acting on each AUV unit during its movement in water can be identified, providing a theoretical basis for subsequent dynamic modeling and control strategy development. Steady straight-line sailing refers to the entire MATC system moving in a straight line at a constant speed while maintaining stable relative distances between the AUV units. Achieving this requires applying appropriate thrust to each AUV unit, ensuring the system moves uniformly and that the inter-unit distances remain within the desired range. This balance is essential for the stability and efficiency of the MATC system during its operations.
For the purposes of analysis, the following assumptions are made. The shape, center of mass, center of buoyancy, and thrust position of each AUV unit in the MATC system are identical. The initial state assumes that the gravitational force is equal to the buoyant force. The vertical motion is neglected, and the MATC system maintains horizontal straight-line motion during operation. The gravitational and buoyant forces of the inter-unit connection structure are negligible. The relationship between spring deformation and the force between units is accurately described. The thrust applied to each side of the units is synchronized and equal in magnitude. The distances between units are equal when the springs are undeformed. The thrust direction of each unit is forward, and all units share identical performance characteristics. These assumptions simplify the analysis while providing a foundation for the dynamic modeling and control strategy development.
The MATC system is subject to several interconnected forces during straight-line sailing. These forces include thrust, drag, and reactive forces from adjacent units. The thrust is generated by the propellers on both sides of each AUV unit. The drag is caused by fluid resistance. To simplify the coupling effects between units, it is assumed that the reaction forces are caused by spring deformation.
Figure 1 shows an MATC system composed of three AUV units.
Figure 2 shows a schematic of the forces acting on the MATC system. In this figure:
represents the number of AUV units in the MATC system;
represents the length of each AUV unit;
,
, and
represent the total thrust, the left thrust, and the right thrust of the
-th unit, respectively;
represents the fluid drag acting on the
-th unit;
represents the reactive force that unit
exerts on unit
, and
represents the reactive force that unit
exerts on unit
. During straight-line sailing, the following relations are satisfied:
;
.
Based on Newton’s second law, the dynamic equations for each AUV unit are as follows:
where
,
, and
represent the mass, velocity, and acceleration of the
-th unit, respectively.
If the spring coefficient between units is
, the above equation can be rewritten as:
where
and
represent the current distance and the target distance between the
-th unit and its adjacent unit, respectively.
and
are correspond to the compression and elongation states of the spring, respectively.
3. Materials
3.1. Operations of the GA
The GA is an optimization technique inspired by the principles of natural selection and genetics. It seeks approximate or optimal solutions by simulating evolutionary processes such as selection, crossover, and mutation. GAs offer several key advantages. First, they have strong global optimization capabilities, allowing them to avoid local optima and explore a broader solution space. Second, GAs are highly adaptable, capable of solving complex, nonlinear, and multi-modal optimization problems without the need for extensive prior knowledge. They are also well-suited for parallel processing, enabling the simultaneous exploration of multiple solutions. Additionally, GAs are flexible and can be combined with other optimization techniques to address more complex challenges. These attributes make GAs highly effective for a wide range of optimization tasks.
The operations of the GA primarily include the following steps [
21]. (1) Initialization: A set of possible solutions is randomly generated as the initial population. Each individual (solution) is typically represented by a code, such as a binary string. (2) Selection: Individuals are selected based on their fitness. The goal of selection is to retain the best genes and improve the search efficiency of the algorithm. (3) Crossover: One or more offspring are generated from two parent individuals, typically by exchanging parts of their genes. This operation simulates biological reproduction and helps explore the solution space. (4) Mutation: A small, random change is made to an individual’s genes, such as flipping certain bits in a binary encoding. Mutation simulates genetic mutations in nature and helps explore new areas of the solution space. (5) Fitness Evaluation: Each individual’s fitness value is calculated based on the objective function. The fitness value reflects how well the individual solves the problem, often corresponding to the objective function value in optimization problems. (6) Replacement: The newly generated offspring are replaced by individuals in the current population. Replacement strategies affect the population’s diversity and the algorithm’s convergence speed. (7) Termination Condition: The algorithm terminates when certain conditions are met, such as reaching the maximum number of iterations, finding a solution that satisfies the desired accuracy, or when the population’s fitness changes very little.
GA uses selection, crossover, and mutation to filter individuals, retaining those with high fitness values to form new generations. The new generation inherits information from the previous generation but is improved. As the process continues, the fitness of individuals in the population increases until a stopping condition is satisfied. The fitness function serves as the basis for selection in GA, and its formulation directly influences the convergence speed and the quality of the solution.
The selection, crossover, and mutation processes for the MATC system thrust allocation using the GA are as follows:
(1) Selection: A random sampling selection operator is used to select individuals with higher fitness values from a randomly generated thrust allocation population, forming a new thrust allocation population with a higher probability.
(2) Crossover: A single-point crossover operator is used. A crossover point is randomly selected from the binary-encoded thrust allocation population obtained through the selection process, and the binary encodings are exchanged to form the next generation of thrust allocation individuals.
(3) Mutation: A bit-flip mutation operator is used. Random mutation positions are selected from the binary-encoded thrust allocation population obtained after crossover, and the selected bit is flipped to generate the next generation of thrust allocation individuals.
3.2. Objective Function Formulation
In the MATC system, stable straight-line sailing is essential for the successful completion of the mission. To ensure stable operation in complex underwater environments, precise control over the motion of each AUV unit is necessary. During the control process, the formulation of the objective function plays a crucial role. The objective function not only quantifies the system’s performance but also provides a basis for solving optimization problems. In the formulation of the objective function, quadratic objective functions are widely used for such problems due to their excellent mathematical properties, simplicity in solving, and extensive practical applications [
22,
23].
The core requirement for stable straight-line sailing is that the MATC system must maintain a constant speed while the AUV units uphold the ideal distance between them. To achieve this, it is necessary to allocate the thrust of each AUV appropriately and optimize the system to ensure that the overall speed and inter-unit distances meet the preset requirements. The design of the objective function should be closely aligned with these two key factors.
An important goal of the MATC system is to ensure that the distance between each unit remains within the ideal range at all times. This not only ensures the stability of the multi-body AUV system during sailing but also prevents collisions or inefficiencies caused by the units being too close or too far apart. To achieve this goal, the objective function must include a control term for the distance between units. Specifically, the objective function should have a distance deviation term, which represents the difference between the actual distance and the ideal distance. To prevent excessive distance deviations, the objective function applies a weighted penalty to the distance difference using a weighting factor. The magnitude of the weighting factor can be adjusted according to the system’s tolerance, ensuring that the distance control precision meets the desired requirements during the optimization process.
Another goal of the MATC system is to achieve uniform motion of the entire system, avoiding fluctuations in speed during operation. To accomplish this, the objective function must include a term for the overall speed deviation, which measures the difference between the actual speed and the target speed during each optimization cycle. Specifically, if there is a large deviation between the actual overall speed and the target speed within a given cycle, it indicates that the thrust allocation for that cycle is not suitable and needs to be adjusted by the optimization algorithm to achieve the target speed. Therefore, the objective function should include an overall speed deviation term. To prevent excessive fluctuations in speed control, the objective function applies a penalty to the overall speed deviation, encouraging the optimization process to minimize this difference.
Hence, the objective function is defined as follows:
where
and
represent the actual inter-unit distance and the ideal inter-unit distance after one cycle of thrust allocation, respectively;
and
represent the actual overall speed and the target overall speed after one cycle of thrust allocation, respectively;
and
are the weighting factors for the distance deviation and the overall speed deviation, respectively.
By carefully designing the weighting factors, the priorities of these two key factors can be balanced. In practical applications, the selection of the inter-unit distance deviation weighting factor and the overall speed deviation weighting factor will directly affect the optimization results. If the system has a higher precision requirement for speed, the weighting factor for overall speed deviation can be increased, making the optimization process focus more on controlling the overall speed. Conversely, if the system is more sensitive to the distance between AUV units, the weighting factor for distance deviation can be increased, prompting the optimization algorithm to prioritize maintaining the inter-unit distances. Therefore, the reasonable design and adjustment of the weighting factors are crucial to the optimization outcome. In practice, these factors typically need to be fine-tuned through simulations and experiments to ensure that the objective function accurately reflects the system’s real-world requirements and achieves optimal optimization results.
The goal of the optimization process is to minimize the value of the objective function , meaning that all deviation terms in the objective function should be as close to zero as possible. By minimizing the objective function, the system can ensure that the thrust allocation is optimal for each cycle, satisfying both the overall speed requirement and maintaining the ideal inter-unit distances between the AUV units. Therefore, the core task of the optimization is to appropriately allocate thrust such that, after each execution cycle, the system achieves the following objectives: minimizing the deviation between the actual overall speed and the target overall speed, and minimizing the deviation between the actual AUV inter-unit distances and the ideal distances.
Given the MATC system thrust allocation characteristics, initial state, and target state, the optimization process based on the above objective function can determine the appropriate thrust values for each AUV unit for the next execution cycle.
4. Methods
The number of units in MATC systems is uncertain and depends on mission requirements. When the number of units is small and thrust constraints are present, an improved GA that accounts for constraint conditions can be employed. For larger systems (e.g., 30 units), the HHO+BP algorithm is utilized, enabling efficient offline thrust allocation and significantly reducing computational overhead for real-time implementation. This hybrid approach leverages the strengths of each technique: the GA ensures precision for smaller systems, the HHO handles complexity for larger systems, and the BP neural network reduces computational overhead for real-time implementation. This hybrid approach addresses the limitations of existing methodologies by providing a scalable, computationally efficient, and adaptive solution for both small and large MATC systems.
4.1. Improved GA Considering Thrust Saturation Constraints
In the field of robotics control, thrust saturation constraints refer to the situation where the output of a robot’s thrust system (such as motors or servos) is limited by physical or electrical factors. These constraints are typically expressed as the thrust system’s output being unable to exceed a maximum or minimum value. This is due to physical and electrical limitations in the actual thrusts, such as power, current, and speed, preventing infinite increases in output.
The impacts of thrust saturation constraints include the following aspects: (1) When the thrust reaches saturation, the control system may no longer effectively adjust the robot’s motion, leading to increased control errors, slower system response, or instability; (2) Thrust saturation can lead to system instability, particularly in high-dynamic response applications; (3) When the thrust operates in saturation, its efficiency is typically lower, which can result in energy waste and a decrease in the overall system’s efficiency; (4) To address saturation constraints, the design and tuning of the control system become more complex.
Thrust saturation constraints are a significant issue in robot control, with a notable impact on the performance and stability of the control system. Effectively handling these constraints requires considering the relevant factors during the control system design and employing appropriate control strategies. By optimizing the control method and introducing anti-saturation mechanisms, the negative effects of saturation on system performance can be mitigated. In the MATC system, thrust saturation constraints refer to the maximum thrust each AUV unit can provide.
If the thrust saturation limit for each AUV unit is denoted as , the thrust assigned to each AUV unit should satisfy the condition: . Since the GA is difficult to deal with the constraints in optimization, it is necessary to convert the constraint into an unconstrained expression. This can be achieved by creating a penalty function that penalizes the thrust allocation results exceeding the saturation limits (), and adding it to the objective function of the GA for thrust optimization. This allows the determination of the actual thrust each AUV unit will use in the next execution cycle.
The penalty function is also formulated in a quadratic form. Considering the minimization optimization approach of the objective function, the specific term can be expressed as follows:
where
is the penalty factor.
For this penalty function: when , its value is 0, resulting in no penalty and a smaller objective function value; when , the value becomes , introducing a penalty and leading to a larger objective function value.
The objective function with the penalty term is expressed as follows:
By introducing the penalty function, the MATC system can effectively address the thrust allocation problem under thrust constraints and uncertain ocean current disturbances, significantly enhancing the reliability of stable straight-line sailing.
4.2. Offline Model Training to Reduce Online Processing Time
When the number of units in the MATC system is larger, the transportation efficiency and load-carrying capacity will be significantly improved. At this point, reducing the system’s online computational load and enabling the method to converge quickly are key to achieving stable straight-line sailing. Offline model training is an effective method to alleviate online computational demands. BP neural networks offer advantages such as strong generalization, high fault tolerance, and simplified models [
24,
25]. By utilizing the BP neural network model trained offline, the online processing time for stable straight-line sailing can be significantly reduced, thereby greatly lowering the computational complexity of rolling optimization.
The structure of a BP neural network mainly consists of three components: the input layer, hidden layer, and output layer. By collecting a large dataset of thrust allocation, a neural network model can be trained to predict the thrust distribution results. Additionally, the HHO algorithm can be used to optimize the weights and biases of the BP neural network [
26,
27]. By performing a global search on the weights of the BP neural network, HHO helps avoid the local optima issue often encountered by traditional gradient descent methods, leading to a more optimal network model. HHO not only improves the accuracy of network training but also accelerates the convergence rate. Let
denote the thrust that the
-th AUV unit is expected to adopt. The thrust allocation result for the MATC system can then be represented as
, where this allocation scheme will be treated as a candidate solution (i.e., the position of the Harris Hawk) in the HHO algorithm. During the iterative process, HHO will apply various position update strategies in an attempt to converge toward the thrust allocation that minimizes the objective function (
).
The offline model training process is as follows.
Initialize the population of Harris Hawks, where each Hawk represents a candidate thrust allocation solution:
where
is the population size.
Update the positions of the Hawks (i.e., thrust allocation values) at each iteration:
where
is the adjustment term, calculated based on random jumps or spiral trajectories.
For each iteration, verify whether the updated solution satisfies the constraints (e.g.,
). Record the optimal thrust allocation result as:
The system state vector is defined as:
Combine the system states
with the corresponding optimal thrust allocation
to form the training dataset:
where
is the total number of samples.
Use a BP neural network to establish the nonlinear mapping between system states and thrust allocation:
where
is the weight matrix,
is the bias vector,
is the neural network’s mapping function.
The loss function for training is defined as the mean squared error (MSE):
The weights and biases are updated using gradient descent:
where
is the learning rate.
To avoid local optima in gradient descent, the HHO algorithm is used to optimize the initial weights and biases of the neural network. Define the Hawk’s position as the combination of weights and biases:
Use the neural network loss function
as the objective function for HHO and perform global optimization:
Through the offline training process, obtain the final neural network model
, which is used for real-time thrust allocation prediction in online applications:
where
and
are the optimized weights and biases.
Figure 3 shows the flowchart of the offline model training process.
5. Results and Discussion
To verify the effectiveness of the proposed thrust allocation method, comprehensive simulation studies were conducted in the MATLAB (R2021a) environment, focusing on two configurations: a 5-unit and a 30-unit MATC system. The parameter values used in the simulations are as follows:
(1) The initial states: ; ; ; .
(2) The target states: ; .
(3) Parameters of the MATC system: or ; ; ; , , , ; ; .
(4) Parameters of the proposed method: ; Rolling optimization iterations ; When using the GA, the sample size , the number of replacements , the crossover probability , the mutation probability , the distance deviation weighting factor , the overall speed deviation weighting factor , and the penalty factor ; When using the HHO algorithm, the population size , the distance deviation weighting factor , the overall speed deviation weighting factor , and the penalty factor ; When using the neural network algorithm, the number of training iterations , and the learning rate .
(5) Mathematical description of current disturbances: The disturbance force (
) varies with time and can be expressed as:
5.1. Verification of the Improved GA’s Ability to Adjust Inter-Unit Distances and Overall Speed
The improved GA significantly enhances the dynamic optimization of thrust allocation in the MATC system. This ensures stable straight-line sailing by adjusting key parameters such as inter-unit distances and overall speed. This section aims to validate the algorithm’s effectiveness in achieving these objectives, highlighting its ability to manage the non-linear dynamics and interactions between the AUV units within the system.
Figure 4 illustrates the changes in thrust allocation, inter-unit distance, and overall state of a 5-unit MATC system over time using the improved GA, demonstrating its progression toward stable straight-line sailing. From the
Figure 4 results, it can be seen that when using improved GA for thrust allocation, the thrust of each unit stabilizes, the inter-unit distance decreases from 0.13 m to the ideal value of 0.12 m, and the overall speed increases from 1 knot to the target value of 1.5 knots.
The specific data are presented in
Table 1. As shown, the thrust allocation for each unit varies. At 15 s, the thrust values for units 1, 2, 3, 4, and 5 are 2.314 N, 2.498 N, 2.512 N, 2.401 N, and 2.593 N, respectively. These differences are primarily determined by the initial state of the MATC system.
5.2. Verification of the Improved GA’s Ability to Adjust Inter-Unit Distances Under Ocean Current Disturbances
This section examines the effectiveness of the improved GA in dynamically adjusting thrust allocation to maintain the desired inter-unit distances despite these external disturbances. By analyzing the system’s response, we can evaluate the robustness and adaptability of the algorithm in ensuring stable straight-line sailing under challenging environmental conditions.
Figure 5 shows the variation in thrust allocation and inter-unit distances for the 5-unit MATC system under ocean current disturbances as it approaches stable straight-line sailing. Under the improved GA-driven thrust optimization, its thrust continuously adjusts over time to counteract the effects of the disturbance. The inter-unit distances of all units gradually converge from 0.13 m to the ideal 0.12 m.
This result demonstrates that the proposed thrust allocation method effectively mitigates the uncertainty caused by ocean current disturbances, ensuring that the units maintain distances close to the ideal value and preventing excessive compression or stretching between the units. Specific data are shown in
Table 2.
The results from the simulation studies clearly demonstrate that the proposed thrust allocation method can effectively handle complex ocean current disturbances. By dynamically adjusting the thrust values to compensate for the varying forces acting on the system, the method ensures the system remains stable and can adapt to real-time changes in the environment.
5.3. Verification of the Inter-Unit Distance Adjustment Ability of the Improved GA Considering Thrust Constraints
Thrust constraints are an essential consideration in real-world applications of the MATC system, as they ensure the propulsion system operates within safe and efficient limits. This section evaluates the improved GA’s ability to maintain optimal inter-unit distances while respecting these constraints.
Figure 6 shows the thrust allocation and inter-unit distance variations for a 5-unit MATC system, comparing scenarios with and without the inclusion of the penalty function.
Figure 6a shows the thrust allocation results with the penalty function applied, demonstrating that all thrust values remain within 3.5 N. In contrast,
Figure 6b presents the thrust allocation results without the penalty function, where thrust values exceed 3.5 N (e.g., u5).
Figure 6c displays the inter-unit distance variations with the penalty function, revealing significant oscillations due to the constrained thrust values, which limit rapid adjustments. Conversely,
Figure 6d illustrates the inter-unit distance variations without the penalty function, showing smaller oscillations as the thrust values are unconstrained, enabling faster and more flexible adjustments. Despite this, the improved GA offers a significant improvement in managing thrust constraints, which is critical for the stability and efficiency of MATC systems.
Except for Unit 5, which is affected by ocean current disturbances, the thrusts of the other units gradually stabilize. The introduction of the penalty function effectively constrains the optimization process, ensuring that each unit’s thrust stays within the predefined range, thereby maintaining the system’s stability and controllability.
Additionally, these oscillations are primarily caused by the periodic nature of ocean current disturbances acting on Unit 5. In the early stages of system operation, the MATC system rapidly adjusts the thrust allocation, allowing the system to quickly return to the ideal state. However, because of the continuous fluctuations in ocean current, slight oscillations occur before the units achieve the desired inter-unit distance. Over time, these oscillations reduce as the system stabilizes, and the distances between the units converge to the ideal 0.12 m.
Table 3 presents the variation in thrust allocation and inter-unit distance over time with the penalty function applied. As shown, after the thrust allocation begins, the maximum thrust for all units, except for Unit 5, is 3.325 N, and the minimum thrust is 1.884 N, both of which satisfy the thrust saturation constraints. Furthermore, at 15 s, the inter-unit distances are all within the range of 0.122 m to 0.125 m, with a maximum deviation of 0.005 m, indicating an ideal convergence of the distances.
5.4. Verification of the Thrust Allocation Ability of the HHO+BP When the Number of AUV Units in MATC System Is Large
If the offline training process is excluded and the thrust optimization is directly performed using the improved GA, the inter-unit distance variation for the 30-unit MATC system is shown in
Figure 7a. The inter-unit distance variation for the thrust distribution after offline training (HHO+BP) is presented in
Figure 7b.
As shown in
Figure 7a, due to the long computation time for thrust allocation, the MATC system is unable to execute the optimal thrust allocation in time, leading to noticeable oscillations in the allocated thrust and preventing the inter-unit distance from converging to the ideal value. At 10.5 s, the average inter-unit distance is 0.1217 m, the maximum is 0.1653 m (with a relative deviation of 37.75%), and the minimum is 0.1121 m (with a relative deviation of 6.58%). Therefore, directly using online optimization methods for thrust allocation in cases with a large number of AUV units shows significant drawbacks.
In contrast, as shown in
Figure 7b, the inclusion of the offline training process significantly reduces the online optimization time, enabling the system to quickly execute the thrust allocation results. Consequently, the thrust allocation stabilizes rapidly, and the inter-unit distance converges to the ideal value. At 10.5 s, the average inter-unit distance is 0.1200 m, the maximum is 0.1201 m (with a relative deviation of 0.08%), and the minimum is 0.1199 m (with a relative deviation of 0.08%).
Figure 8 shows a comparison of the overall speed variations when directly using the improved GA and when using the offline training method. When the improved GA is applied directly, the straight-line speed fails to converge to the target speed. At 10.5 s, the overall speed is 0.66 knots (with a relative deviation of 56.00%). In contrast, when the offline training method is used, the overall speed quickly converges to the target speed of 1.50 knots.
Both the improved GA and the offline-optimized model (HHO+BP) can complete result computation and output within a single sampling period, demonstrating their real-time capability. The results clearly demonstrate that the improved GA alone struggles to manage the complexity of large-scale systems. In contrast, our hybrid approach (HHO+BP) achieves significantly better performance, with more stable and controlled inter-unit distance variations. Additionally, the overall speed variations are smoother and more consistent in the HHO+BP method compared to the improved GA. This comparison underscores the superiority of our proposed method in handling large-scale thrust allocation problems. While the improved GA is effective for smaller systems, its limitations become apparent in larger configurations, necessitating the integration of HHO+BP to ensure scalability, stability, and computational efficiency. By combining these techniques, our method provides a robust solution for large-scale MATC systems, addressing the challenges that existing approaches fail to resolve.