1. Introduction
Approximately 71% of the Earth’s surface is comprised of water [
1], yet it is estimated that as much as 95% of this vast resource remains unexplored to a similar resolution that is available for the surface of the moon [
2]. Although essential to all known life, the salt water medium which makes up the Earth’s oceans, and affects global climate and weather patterns also provides a hostile environment for exploration [
3]. In particular, the oceans present a hazardous environment for the safe and successful operation of a group of underwater vehicles known as Autonomous Underwater Vehicles (AUVs). Unlike other autonomous platforms (e.g., Unmanned Aerial Vehicles (UAVs) and Unmanned Surface Vehicles (USVs)), AUVs are operated in an environment where access to accurate positional data and a reliable communication medium is severely limited. Nonetheless, AUVs have evolved from research platforms used exclusively by academic institutions in the 1950s through to the present day where these highly autonomous systems are used within the oil and gas industry for pipeline inspection [
4] as well as being utilized for search and recovery missions [
5], maritime surveillance [
6], and sampling previously unexplored regions of the Earth such as under the Arctic ice pack [
7].
However, as a result of AUVs being untethered, the power available to these vehicles is finite and as such, their range and endurance is limited [
8]. While it is now common for AUV mission endurance to be measured in days, the operation of a single vehicle for large spatiotemporal data collection operations, such as oil plume tracking [
9], is inadequate. Therefore, the only way to satisfy this requirement is to operate a groups of AUVs within a multi-vehicle deployment which has been proven to improve the efficiency of current missions as described in [
10,
11].
Despite the improved efficiencies offered by multi-vehicle coordination, the difficulties associated with the underwater environment present a number of substantial technological barriers which must be overcome before fully autonomous multi-vehicle coordination is realized. At present, the greatest of these challenges is establishing a suitable communication strategy among the vehicles which is tolerant to the limitations of the underwater acoustic communication channel [
3,
12]. Nevertheless, a number of sea-trials have been completed since the start of the 21st century that have demonstrated not only the feasibility of operating multiple AUVs within a group structure but also the advantages of doing so [
10,
11].
While the trials discussed above are essential to the realization of a self-coordinating group of AUVs, the costs associated with completing such trials do not present an economically viable method to develop, test and analyse the various potential coordination algorithms associated with the deployment of a group of AUVs. Instead, extensive testing and optimization of the algorithms can be completed in a more time efficient manner using mathematical modelling and simulation techniques [
13].
Mathematical modelling involves representing the dynamics and kinematics of a vehicle through a serious of differential equations that when integrated, simulate the motion of that particular vehicle. When used correctly, mathematical modelling provides a powerful tool through all phases of the design process. Therefore, this paper uses mathematical modelling techniques to accurately represent the dynamics of a particular type of AUV known as RoboSalmon, which has been designed, built and tested at the University of Glasgow [
14].
RoboSalmon is classed as a Biomimetic Autonomous Underwater Vehicle (BAUV). These vehicles adopt the same propulsion and steering mechanisms as real fish [
14,
15,
16,
17,
18] and in doing so, their propulsive mechanism has been found to be more efficient at low speeds when compared with traditional propeller based AUVs [
19] but also poses far greater manoeuvrability characteristics [
20]. As a result, BAUVs are more suited to operating in confined environments and therefore make an ideal candidate vehicle for assessing the feasibility of coordinating multiple vehicles in close proximity to one another.
However, in order for a group of vehicles to complete the large scale spatiotemporal data collection missions discussed above, it is apparent that the vehicles must be able to coordinate themselves to ensure the given survey area is mapped efficiently. To achieve this, the coordination algorithms must ensure that all vehicles within the group are spaced appropriately, move with a common heading angle while avoiding collisions between neighbouring vehicles.
One particular coordination strategy that satisfies the above criteria is related to the behavioural mechanism utilized by fish within large school structures. This mechanism enforces a number of behavioural zones around each fish, which, depending on the distance to its nearest neighbours, results in the fish manoeuvring in an
attractive, orientating or
repulsive manner [
21,
22,
23].
Therefore, it is the aim of this paper to demonstrate that coordination algorithms based on the behavioural mechanisms of fish provides a suitable and easily adaptive method to allow a group BAUVs to be considered self-coordinating. Furthermore, the work presented in this paper shows the benefits of employing reduced fidelity versions of the current validated mathematical model of RoboSalmon for multi-vehicle coordination studies [
20].
This study is presented in this paper in the following manner.
Section 2 briefly describes the RoboSalmon vehicle and compares it with the other types of Unmanned Underwater Vehicles. It then discusses in detail the current mathematical model of RoboSalmon, its run time performance and methodologies implemented to improve its run time. Results from the validation process are presented in order to ensure that the reduced fidelity models still accurately represent the dynamics of the RoboSalmon vehicle.
Section 3 will describe the implementation of the coordination algorithms discussed above with the inclusion of realistic communication constraints.
Section 4 presents the results from the simulations involving the coordination algorithms and finally,
Section 5 presents the conclusions of the work completed in this paper.
3. Implementation of Algorithms for Multi-Vehicle Coordination
As the results in the previous section demonstrated, mathematical models implementing system identification techniques are able to complete simulations in a fraction of the time required by their corresponding high fidelity, white-box models. As a result, these models are particularly suited to completing investigations into multi-vehicle scenarios. Consequently, the following section will describe the implementation of coordination algorithms based on the behavioural mechanisms of fish to allow a group of BAUVs to be considered self-organizing. The algorithms will be implemented within the LUT and ANN models described in the previous section and a comparison between the results obtained from these models and the original high fidelity model will be made in terms of accuracy and simulation execution time.
In order to successfully operate a group of vehicles within a fully autonomous multi-vehicle scenario a number of different features of the overall system design have to be taken into consideration. These features can be broadly categorized into the following groups: Coordination Methodologies, Communication, Vehicle Control and Dynamics.
The interdependencies and relationships which exist among these features is known as the system architecture and is presented diagrammatically in
Figure 12.
Figure 12.
System architecture for coordinating multiple BAUV’s autonomously. Design features within the green box relate to design considerations for each of the individual vehicles within the group. Design features within the red box relate to the features required to allow the vehicles within the group to communication and coordinate with one another.
Figure 12.
System architecture for coordinating multiple BAUV’s autonomously. Design features within the green box relate to design considerations for each of the individual vehicles within the group. Design features within the red box relate to the features required to allow the vehicles within the group to communication and coordinate with one another.
Within
Figure 12, the Vehicle Dynamics and Control section represents the dynamics and kinematics of the RoboSalmon vehicle described in
Section 2.2. While for the navigation section, it is assumed that each vehicle’s position is known and represented by a two-dimensional Cartesian coordinate system.
In addition, apparent from
Figure 12 is that in order to produce autonomous multi-vehicle coordination, a communication strategy has to be established to allow the individual vehicles within the group to share their relevant positional data. Furthermore, coordination algorithms have to be implemented that are able to process and utilize this information to manoeuvre the vehicles to move in the correct direction.
However, as discussed previously, the underwater milieu presents a number of unique characteristics which makes the implementation of these two design features particularly challenging. Therefore, it is the purpose of the following sections to present the communication protocols and coordination algorithms implemented that take into account these challenges and allow the coordination of a school of BAUVs.
The implementation of a communication strategy within any group scenario is essential to the group’s ability to successfully complete the specified task. However, due to the aforementioned physical limitations of the underwater communication channel, the implementation of a successful communication strategy for multi-vehicle cooperation is particularly challenging. Nonetheless, presently there are three strategies utilized within the underwater acoustic channel: Frequency Division Multiple Access (FDMA), Code Division Multiple Access (CDMA) and Time Division Multiple Access (TDMA) [
36]. Evidently, in order to implement these strategies, each of the vehicles would have to be fitted with an acoustic modem. While there are acoustic modems on the market which could fit into the current version of RoboSalmon [
37], it is expected that the size of the vehicle would have to be slightly increased to allow further sensing equipment payload to be incorporated within the vehicle.
While each of the above possess their own operational benefits, this paper involves the implementation of a TDMA protocol for facilitating communication among the different vehicles. The TDMA protocol operates by assigning each vehicle within a group a unique timeslot within which it can transmit its data to the other vehicles. Once each vehicle has transmitted its data once, the process is repeated and the cycle continues until the end of the mission [
37]. The length of the timeslot, t is dependent on the transmission time of the data and the propagation delay due to the speed of sound in water. The timeslot can be calculated using the following equation:
The Transmission Time in the above equation is a constant value based on the size of the data being transmitted (256 bits) and the transmission rate of the acoustic modem (31.2 kbits/s). The second term is known as the propagation delay and determines the maximum time interval required for a packet of data to be transferred between the two vehicles furthest away from one another. As it is not possible to know the exact value of this parameter, a conservative estimation is made based on one of the parameters associated with the coordination algorithms and is discussed below in
Section 4.
3.1. Coordination Algorithms
As discussed earlier, one of the main motivations for operating AUVs within multi-vehicle deployments is to allow the collection of data over large spatiotemporal domains. Consequently, the coordination of the vehicles within the group is of critical importance to ensure not only their safe operation but also to ensure that the data collected is done so in an efficient manner. To achieve this, the algorithms implemented will have to ensure that neighbouring vehicles do not collide with one another but also ensure the formation of a group structure moving with a common directionality.
While several coordination methodologies exist that, if implemented, would satisfy the above criteria, the work presented in this paper will once again take inspiration from nature and implement a coordination strategy based on the behavioural mechanisms known to exist within school structures in nature.
These naturally occurring formations have been known to range in size from small groups containing as little as two individuals to immense structures containing millions of fish often moving with remarkable synchronicity [
38]. While, initially, debate surrounded the exact mechanism used to explain this phenomenon, Aoki’s work published in 1981 presented a set of behavioural mechanisms that when implemented within a simulation model, successfully imitate the behaviours exhibited by fish within school structures in nature [
39].
In his paper, Aoki established that fish must poses three behavioural tendencies in order to produce the schools structures displayed in nature
i.e.,
repulsion,
orientation and
attraction. The specific mechanism presented in Aoki’s paper to trigger the above behaviours is to assign three concentric circles of variable size to each individual and depending on which of these zones is occupied by its nearest neighbours, the individual in the centre will manoeuvre in either a repulsive, orientating or attractive manner as shown in
Figure 13.
Figure 13.
Graphical representation of behavioural mechanisms of fish within school structures. The diagram demonstrates how Fish (2), (3) and (4) would manoeuvre with reference to Fish (1). Fish (2) occupies the Repulsion Zone so it would look to move away from Fish (1). Fish (3) occupies the Attraction Zone and would therefore look to move towards Fish (1). Fish (4) occupies the Orientation Zone and would therefore align its heading angle with that of Fish (1).
Figure 13.
Graphical representation of behavioural mechanisms of fish within school structures. The diagram demonstrates how Fish (2), (3) and (4) would manoeuvre with reference to Fish (1). Fish (2) occupies the Repulsion Zone so it would look to move away from Fish (1). Fish (3) occupies the Attraction Zone and would therefore look to move towards Fish (1). Fish (4) occupies the Orientation Zone and would therefore align its heading angle with that of Fish (1).
Therefore, it is believed that this mechanism can be adopted and implemented as a coordination strategy for the coordinated movement of multiple vehicles. However, it is apparent from
Figure 13 that in order to implement the above mechanism, the individual vehicles within the group have to communicate their positional and heading data to the other vehicles within the group. As presented in [
20], the RoboSalmon vehicle is equipped with an Inertial Measurement Unit (IMU) containing a combination of accelerometers and gyroscopes. Consequently, dead reckoning techniques could be implemented to integrate the signals from these sensors to obtain each vehicle’s positional and orientation data. While dead reckoning techniques produce positional error growth [
40], for the purposes of the investigations completed in this paper, the influence of these errors on the coordination algorithms have not been taken into consideration. As discussed, this occurs through the transmission of data using acoustic communication methods and adopting a TDMA protocol.
3.2. Implementation of Coordination Algorithms
The controller implemented within the Guidance System (
Figure 2) to alter the heading angle of each vehicle requires a desired heading angle as an input. Consequently, the equations implemented to produce either a repulsive, orientating or attractive manoeuvre from each vehicle are required to produce an angular value based on the positional and heading data of its nearest neighbours as well as the behavioural zone which the neighbouring vehicles are occupying.
3.2.1. Repulsive Behaviour
The repulsive behaviour of each vehicle within the group has to ensure that the vehicle will manoeuvre in such a way as to avoid colliding with its nearest neighbour. The equation implemented to achieve this behaviour is shown below in Equation (9).
where
represents the desired heading angle (radians) which is input to the controller within the guidance system and
is the present heading angle of the vehicle. The decision of whether to add or subtract the 45° depends on the relative position between the two vehicles in question.
3.2.2. Orientating Behaviour
On the other hand, the orientating behaviour has to result in each vehicle aligning its heading angle with that of its nearest neighbours and the equation implemented within the algorithm to achieve this is shown below:
3.2.3. Attractive Behaviour
Finally, the attractive behaviour of each vehicle has to result in a manoeuvre that results in each vehicle reducing its distance to its neighbouring vehicles. This is achieved by creating a point in space which is equal to the average x and y positions of each vehicles nearest neighbours. The equations implemented to complete this task is shown below:
where
and
represent the x and y positions of nearest neighbours, respectively. These positions are then used to evaluate the required heading angle,
of the vehicle to manoeuvre in the direction of its nearest neighbour using the equation shown below.
3.2.4. Structure of Coordination Algorithm
The decision process to determine which of the above equations is used and its relationship with the communication protocol described in
Section 3.1 is known as the algorithms structure and is presented below in
Figure 14.
Figure 14.
Algorithm structure.
Figure 14.
Algorithm structure.
Equations (9)–(12) and the structure presented above represent the coordination algorithm implemented based on the behavioural mechanisms of fish within school structures. The following section presents results demonstrating the effectiveness of the above strategy to coordinate multiple BAUVs form a school structure.
4. Results
In the work, presented in this paper, a deployment of 12 vehicles is simulated for two different behavioural zones sizes. The maximum number of nearest neighbours that each vehicle could take into consideration is set to six based on the results from the work completed in [
41,
42]. Furthermore, as demonstrated by Equation (8), the time slot assigned to each vehicle to communicate its relevant data to the remaining vehicles is predetermined and is based on the transmission time of the data and the propagation delay. While the transmission time is constant at 0.008 s, the propagation delay is dependent on the maximum distance between two vehicles within the group. However, since this value cannot be determined and in actual fact varies throughout the simulation, a conservative value for the maximum distance between two vehicles has been selected to be double the size of the attraction zone utilised within the simulation. Finally, the size of the behavioural zones have been selected to ensure that the minimum inter-vehicle distances across the deployment are such that hydrodynamic interactions are negligible based on the results of [
43]. The above parameters are summarized below in
Table 6.
Table 6.
Simulation parameters.
Table 6.
Simulation parameters.
Parameter | Value |
---|
Deployment Size | 12 Vehicles |
Number of Nearest Neighbours | 6 |
Small Behavioural Zone Sizes | 2, 8, 50 m |
Large Behavioural Zone Sizes | 10, 20, 60 m |
Time Slot Size—Small Behavioural Zone | 0.07 s |
Time Slot Size—Large Behavioural Zone | 0.08 s |
Simulations utilizing the above parameters where completed for the three models discussed in this paper: the original high fidelity model, the LUT and ANN models. The following section will present the results from these simulations in terms of the ability of the algorithms to form a group structure but also to analyse the performance of the reduced models compared to the high fidelity model.
In order to determine whether or not the coordination algorithms have been successful, the variation in the average heading angle of the group was calculated using the standard deviation metric. Consequently, if the standard deviation calculated converges to a relatively small value, then it can be concluded with confidence that all the vehicles within the group are moving with a common directionality. However, to definitively demonstrate that the algorithms have been successful, the average distance to each vehicles nearest neighbours should also converge to be within the boundaries of the orientation zone. Therefore, the above metrics where calculated for the results produced from each of the three models discussed in this work and the results are presented below in
Figure 15 and
Figure 16.
Figure 15 and
Figure 16 clearly demonstrate that for each of the three models, the standard deviation and average nearest neighbour metrics behave as expected and converge to a steady state value. This suggests that the coordination algorithms have been successful in promoting the formation of a self-organizing group. Inspecting
Figure 15 and
Figure 16 more closely, it also becomes apparent that for the simulations involving the smaller behavioural zones, the time to convergence is greater when compared to the simulations involving the larger behavioural zones. This behaviour is yet again expected as the initial starting positions are the same for both scenarios and therefore, the time taken for individuals to start occupying one another’s orientations zones will be less when the behavioural zone sizes are larger.
Figure 15.
Comparison between results obtained for validated model, LUT model and ANN model for the standard deviation of heading angle of the group. The figure also presents the results for two different behavioural zone sizes. (
a) Small Behavioural Zone Sizes (
Table 6); (
b) Large Behavioural Zone Sizes (
Table 6).
Figure 15.
Comparison between results obtained for validated model, LUT model and ANN model for the standard deviation of heading angle of the group. The figure also presents the results for two different behavioural zone sizes. (
a) Small Behavioural Zone Sizes (
Table 6); (
b) Large Behavioural Zone Sizes (
Table 6).
Figure 16.
Comparison between results obtained for original model, LUT model and ANN model for the standard deviation of heading angle of the group. The figure also presents the results for two different behavioural zone sizes. (
a) Small Behavioural Zone Sizes (
Table 6); (
b) Large Behavioural Zone Sizes (
Table 6).
Figure 16.
Comparison between results obtained for original model, LUT model and ANN model for the standard deviation of heading angle of the group. The figure also presents the results for two different behavioural zone sizes. (
a) Small Behavioural Zone Sizes (
Table 6); (
b) Large Behavioural Zone Sizes (
Table 6).
In relation to the accuracy of the reduced fidelity models, the above metrics cannot be used to compare the different models. The reason that this is the case is that, although the standard deviation of the heading angle converges to zero for each model, the results from the reduced fidelity models might have an entirely different average heading angle to that of the validated model and therefore, the group may be converge to a different heading angle. Nevertheless, the above figures would suggest that the results produced from the ANN model are slightly less accurate than that of the results produced from the LUT model. This becomes apparent by inspecting
Figure 15 where the evolution of the standard deviation metric is noticeable different for the ANN model when compared with the validated model and LUTs results.
However, in order to accurately quantify the discrepancy between the different models, the
Theil’s Inequality Coefficient (TIC) was calculated again for each of the six states presented in
Table 2 for the results produced from the multi-vehicle simulations. The results are presented below in
Table 7.
Table 7.
Average TIC values obtained from comparison of simulations implementing the large and small behavioural zones.
Table 7.
Average TIC values obtained from comparison of simulations implementing the large and small behavioural zones.
Body and Earth Fixed Variables | Small Behavioural Zones | Large Behavioural Zones |
---|
Artificial Neural Network | Look Up Table | Artificial Neural Network | Look Up Table |
---|
Surge | 0.1023 | 0.0515 | 0.0799 | 0.0399 |
Sway | 0.2475 | 0.1616 | 0.2168 | 0.1375 |
Roll | 0.4604 | 0.3123 | 0.4447 | 0.3185 |
X-Pos | 0.0403 | 0.0117 | 0.0290 | 0.0061 |
Y-Pos | 0.0771 | 0.0342 | 0.0932 | 0.0447 |
Heading Angle | 0.2147 | 0.120 | 0.1978 | 0.1317 |
The results presented in
Table 7 clearly demonstrate that, regardless of the behavioural zone sizes implemented, the TIC values are greater for the comparison between the validated model and ANN model than they are for the comparison between the LUT model and the validated model. This would support the findings discussed above from the visual inspection of
Figure 15.
The reason that the ANN model is not as accurate as the LUT is due to the way in which the neural network was trained. The target data supplied to the ANN for training purposes was the evolution caudal fins position throughout one second corresponding to a specific tail centre deflection angle. In order to ensure the evolution of the caudal fin’s position throughout the vehicle’s entire operational range was supplied to the ANN, only caudal fin positions relating to tail centre deflection angles in five degree intervals where supplied to the network. As a result, the ANN produced a mapping which would be accurate for tail centre deflection angles starting at −90° and ending at 90° in five degree intervals. This was disguised in the results for the open loop manoeuvres where the tail centre deflections angles utilised (
Table 3) coincided with the same angles for which training data were supplied to the ANN.
However, for the simulations involving multi-vehicle scenarios, the commanded tail centre deflection angles could be any value between −90 and 90. Therefore, the ANN was producing values for the caudal fin’s position which it had not been specifically trained to produce and was instead, relying on the mapping obtained during the training phase to be sufficiently accurate.
Furthermore, while the ANN was only supplied training data at five degree intervals, the LUTs constructed contained data for the caudal fins position at one degree intervals. Consequently, the look table model only had to interpolate between one degree intervals in tail centre deflection angle whereas, the ANN was essentially interpolating over five degree intervals.
By comparing the results from
Table 4 and
Table 7, it is also apparent that the TIC values has increased for both the LUT model and ANN model. The reason for this can be explained by the nature of the open loop manoeuvres, whereby the tail centreline is deflected to a particular value where it remains for a considerable period of time. Whereas, within the multi-vehicle scenario, the tail centreline will be transitioning between various angles at a frequency of 1 Hz. Therefore, the fact that more transient behaviour exists within the multi-vehicle simulations would suggest that the system identification techniques employed to represent the transient behaviour are less accurate than those employed to evaluate the steady-state behaviour.
Table 7 also demonstrates that the state parameter which has the greatest discrepancy is the roll rate, with a maximum value of 0.4604. However, closer analyses of the difference between the results produced from the models for this parameter demonstrates that the TIC of 0.4604 value is equivalent to a maximum discrepancy of 6.33°/s.
However, as well as analysing the accuracy of the models for the multi-vehicle simulations, the run time performance of the models has been evaluated. The multi-vehicle simulations evaluated 400 s worth of data and the original model required approximately 40 min to complete, the LUT model required approximately 3.8 min while the ANN model required approximately 4.2 min.
Finally, the overall difference in the results produced from the three plots can be visually analyzed by inspecting the trajectory plots obtained from the results as shown below in
Figure 17.
Figure 17.
Comparison of trajectory plots obtained from the three models. Black trajectory represents validated model, blue line represents the ANN model and the red line represents the LUT Model. (
a) Small Behavioural Zone Sizes (
Table 6); (
b) Large Behavioural Zone Sizes (
Table 6).
Figure 17.
Comparison of trajectory plots obtained from the three models. Black trajectory represents validated model, blue line represents the ANN model and the red line represents the LUT Model. (
a) Small Behavioural Zone Sizes (
Table 6); (
b) Large Behavioural Zone Sizes (
Table 6).
Although
Figure 17 demonstrates that there are differences between the reduced fidelity models and the validated model, the magnitude of the discrepancy is small in comparison to the improvement in the simulation execution time achieved.
5. Conclusions
The work presented in this paper has defined the operational benefits of being able to deploy a self-coordinating group of AUVs for oceanic monitoring purposes, the challenges associated with doing so as well as the current state of the art in the deployment of groups of AUVs.
The paper has also demonstrated the benefits associated with implementing system identification techniques to replace complex functionality within high fidelity mathematical models to allow a drastic reduction in the simulation execution time while still producing results that are representative of the dynamics of the RoboSalmon vehicle. However, the results also demonstrated that of the two system identification techniques implemented, the model implementing LUTs was slightly more accurate when compared to the results obtained from the ANN model.
Furthermore, this paper has also presented the feasibility of implementing a decentralized coordination algorithm based on the behavioural mechanisms of fish to allow a group of BAUVs to be considered self-organizing. The results also demonstrated the ability of the algorithms to coordinate the vehicles to manoeuvre with specific inter-vehicle distances and therefore the algorithms could be implemented to allow a group of BAUVs to completed large-scale spatiotemporal data collection.
Therefore, in summarizing, the results produced in this paper have demonstrated the ability to implement system identification techniques to produce low fidelity mathematical models that dramatically reduce simulation execution time while maintaining an accurate representation of RoboSalmon’s dynamics. As a result, these models are able to produce more efficient simulations when investigating multi-vehicle scenarios. However, the results also demonstrated that of the two system identification techniques implemented, the LUT model was not only more accurate but was also capable of completing simulations quicker than the ANN model.