An Efficient Autonomous Exploration Framework for Unmanned Surface Vehicles in Unknown Waters

Song, Baojian; Zhang, Jiahao; Han, Xinjie; Fan, Yunsheng; Sun, Zhe; Wang, Yingjie

doi:10.3390/jmse12091622

Open AccessArticle

An Efficient Autonomous Exploration Framework for Unmanned Surface Vehicles in Unknown Waters

by

Baojian Song

^1,2

,

Jiahao Zhang

^1,2,

Xinjie Han

^1,2,

Yunsheng Fan

^1,2,*

,

Zhe Sun

^1,2

and

Yingjie Wang

^1,2

¹

School of Marine Electrical Engineering, Dalian Maritime University, Dalian 116026, China

²

Key Laboratory of Technology and System for Intelligent Ships of Liaoning Province, Dalian 116026, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2024, 12(9), 1622; https://doi.org/10.3390/jmse12091622

Submission received: 26 May 2024 / Revised: 21 August 2024 / Accepted: 7 September 2024 / Published: 11 September 2024

(This article belongs to the Section Ocean Engineering)

Download

Browse Figures

Review Reports Versions Notes

Abstract

The detection of unknown waters has been studied and applied in various fields, such as national defense, military operations, engineering surveying and mapping, and scene reconstruction. To improve exploration efficiency in unknown waters, this paper proposes a framework for autonomous exploration using unmanned surface vehicles (USVs). This framework, comprising a multi-stage exploration strategy and a hierarchical navigation strategy, is designed to mitigate the inherent restrictions between the exploration target point and exploration direction in USV operations. These two strategies are optimized for the exploration target point and feasible navigation route to address the problem of the USV’s limited mobility during exploration. Rapidly exploring random tree (RRT) and boundary detection methods are used in the local layer to find the boundary in front of and behind the USV, and the gain of the target point is optimized. The hierarchical navigation method is implemented in the global layer to plan appropriate navigation paths. The proposed method is tested in simulations in several virtual environments and contrasted with the conventional methods currently in use. The findings indicate that our strategy covers more ground more effectively than other methods (our method achieved an exploration efficiency ranging from 4.9 to 5.3

m^{2}

/s, whereas traditional methods ranged from 2.3 to 3.9

m^{2}

/s, which demonstrates that our approach can improve exploration efficiency by up to 200% compared to traditional methods), spending less time exploring while significantly reducing collision probability.

Keywords:

unmanned surface vehicle; multi-phase exploration; hierarchical navigation

1. Introduction

Autonomous exploration usually uses sensor data acquired from the environment with prior knowledge to independently perform path planning and environmental exploration in an unknown environment with minimal path cost and working time. It is crucial for accomplishing target search and rescue activities and is widely utilized in environmental investigation, scene reconstruction, and other domains [1,2]. The complex geometry and structure of the environment can still make it difficult.

Given the extensive attention in diverse disciplines, numerous techniques have been developed to address the problem of autonomous exploration, including boundary-based and random sampling-based methods. The frontier-based exploration method determines that the next target of the agent must be the boundary point between the explored and unexplored areas. Based on randomly sampled observation points, the method does not take the boundary as the target point of exploration, nor does it traverse all nodes in order. Instead, it employs a random sampling strategy to identify the subsequent location in the environment where it can gather the most data. Frontier-based approaches typically employ image processing [3] to identify boundary data [4,5] and carry out boundary detection on the entire global map following each map update. In the early phase of exploration, boundary extraction is relatively fast. However, the computational resources used for boundary extraction will grow geometrically with this incremental expansion and will spend a lot of time waiting for data processing, leading to frequent delays in work. Moreover, the access sequence of boundary points [6,7,8,9] will also greatly affect the efficiency of environmental exploration. Using an observation point method that relies on random sampling is called greedy. This method involves exploratory actions that maximize the return of an unknown area and obtain the next or several immediate target points. As a result, the path created is frequently inefficient, and the need frequently travels back and forth, continually visiting the same area. Moreover, for agents with non-differential speed and insufficient steering ability, a large number of invalid paths will be generated, which greatly reduces exploration efficiency.

In recent years, scholars have combined these two exploration methods to obtain the next best target point by evaluating the unknown gain contained in the boundary target point. Rapidly expanding random tree (RRT) is the most common method [10] for extracting boundary target points. This technique uses the random properties of RRT growth, randomly scattering points in the known map region, causing the random tree to grow along the direction of the random points, eventually covering the exploration area and extending to the boundary of the environment. Recording the location of the treetop is used as the boundary target point. This method does not need to construct the environment map accurately and has good results in two-dimensional and three-dimensional environments, so it has been widely used [11]. RRT, however, is also a fundamentally greedy strategy. It does not consider the entirety of the set continuity and only evaluates all boundary nodes individually. If the exploration process only chooses the boundary point with the greatest unknown gain as the target point, it might turn to other regions with greater benefits for exploration when a particular region is not fully explored, leading to an invalid backtracking phenomenon and reducing exploration efficiency. Additionally, the random growth of trees often results in the frequent omission of the corners of the map, which are likely to contain the greatest orthogonality in any factorial representation of the environment.

In this paper, we propose a more efficient exploration and navigation planning framework. The two exploration methods have limitations for the sake of the characteristics that USV cannot rotate when moving, and it is not easy to adjust the heading angle repeatedly in the course of navigation. The frame is shown in Figure 1. The exploration algorithm takes the incremental map produced by simultaneous localization and mapping (SLAM) and extracts the target points, sending both the global and local target points to the path-planning module. The path-planning module plans an optimal path covering both local and global target points and incrementally expands the map through sensor data during navigation to the target point. This framework focuses on maintaining the local map rather than global maps. In order to improve the timeliness of detection, the USV acquires surrounding boundary information with minimal computational cost and plans a reasonable navigation path through hierarchical navigation. Compared with existing research, the main contributions of this paper are summarized as follows:

(1): Our exploration framework adopts a multi-stage exploration strategy in comparison with the current exploration methods, and it introduces the concepts of the global layer and local layer during the exploration process. The multi-phase exploration strategy is implemented in the local layer. The exploration strategy based on RRT and the frontier detection method based on the image are used to obtain the local boundary observation points. In the global layer, the global optimal observation point is dynamically selected as the end point of global backtracking, and repeated access to multiple regions is constrained by the gain function to minimize invalid backtracking behavior.
(2): We introduce a hierarchical navigation strategy compared to traditional navigation. In the backtracking phase, we allocate local and global boundary observation points at the same time, using the global boundary observation points as the endpoints for the current navigation plan. This can reduce the repeated forward and backward movement of the USV when adjusting its course during exploration.

The rest of this paper is organized as follows. Section 2 provides the related work on autonomous exploration and how our approach can further contribute, and Section 3 defines the problems considered. The solution is described in detail in Section 4 and evaluated in Section 5 with simulations. Finally, Section 6 summarizes and concludes the work of this paper.

2. Problem Formulation

Autonomous exploration methods have significantly evolved in recent years and may be broadly separated into two categories: conventional methods and methods based on machine learning. Among the former methods, the frontier-based method [12,13,14] and the method based on randomly sampled observation points [15,16,17,18,19,20,21] are the mainstream approaches applicable to most scenarios. These select the next target point using acquired map information. The fundamental idea behind machine learning-based methods [22,23,24,25,26] is to govern decision-making through autonomous learning during the exploration phase. However, these methods face significant challenges in practical deployment. One major issue is the difficulty in transferring models trained in controlled environments to more complex, real-world settings. As a result, most experiments with machine learning-based methods are still confined to straightforward indoor environments.

The frontier-based exploration method is one of the earliest developed methods for autonomous exploration. It drives the agent to explore by using boundary points, which contain information about unknown regions, as the target. The method described in [12] involves moving towards boundary points to maximize the collection of environmental information. This method does not consider obstacle information and selects the boundary point with the shortest Euclidean distance as the target point. While this approach cannot ensure the safety of boundary points and often results in repeated visits, it has provided foundational insights for subsequent research. Later improvements introduced various methods to enhance the gain function and path cost [5,27,28,29] to obtain the optimal boundary points. However, this method is prone to local optimization issues and can lead to inefficient backtracking when encountering local minima. Consequently, the exploration efficiency in complex environments remains limited. In another approach, the concept of a traveling salesperson (TSP) is presented [6,7,8]. By sorting all the boundaries reasonably, an optimal exploration path that covers the map to a greater extent and consumes less time can be obtained. However, with the increase in the exploration area, the calculation amount will expand rapidly.

In contrast to the boundary approach, the method based on random sampling of observation points employs random sampling points as direct target sites in free space and does not need to analyze and process all boundaries in the map. After the rapidly exploring random graph (RRG) was proposed [30], RRG has been widely used because it does not require geometric division of the search area, has a high coverage rate and a wide search space range, and is biased toward unexplored areas. In reference [17], RRG is utilized for underground exploration, and RRG is used to effectively identify and optimize exploration paths in local narrow spaces to eliminate mobility limitations caused by multibranched and tunnel-like networks. With reference to [16,17], a random graph [18] is employed as a method of random sampling. The map is continuously built using the ray-tracing method, and an interruption mechanism is introduced to decouple the node gain and draw the agent to the unexplored area to complete the exploration. RRT [31] can also achieve randomness. The receding horizon “next–best–view” planner (RH-NBVP) [15] use of RRT in three-dimensional space expansion generates RRT to find the unknown space that can be passed through. Through the gain function, the most plentiful branch, which has the richest unknown information, is then chosen to explore the unknown space. This method can effectively explore a single area. However, this method is unsuitable for exploring larger environments due to its inherent tendency to overlook areas that have not been thoroughly explored. Reference [32] uses RRT to guide the robot to detect dangerous gas leakage and draw the gas concentration range map. Reference [10] introduces the exploration framework of hierarchical RRT, generates random trees in global and local space, respectively, obtains boundary information through the growth of random trees, and selects the best target points according to boundary revenue. Compared to the boundary detection method based on image processing, this method is more flexible. However, after reaching each observation point, the local spatial RRT tree is reset, leading to the deletion and regeneration of many influential nodes, resulting in computational waste. Moreover, the global tree grows simultaneously with the local tree, which may result in invalid backtracking of the agent, which abandons the local node and goes to the global node with a relatively high gain. Due to the limits of both methods, reference [33] combines frontier-based and random sampling-based exploration methods, using the improved RRT as the path planning algorithm for the UAV. However, this approach can cause discontinuities in the exploration path and reduce exploration efficiency. In the dual-stage viewpoint planner (DSVP) [34], the dynamic pruning random tree strategy is introduced during the local exploration stage to prune nodes that are outside of the local plan, allowing the tree to grow better along the frontier direction. A relocation strategy is proposed in the global exploration phase, which uses the global map to quickly locate the global observation points and keep the nodes generated by RRT in the exploration process to achieve the goal of fast backtracking. In this method, local exploration is the main task at present. When the local exploration is complete, it will be guided to a global unknown area for further exploration, which greatly improves the timeliness of exploration. But inevitably, in the relocation phase, there will be a pause caused by calculating the optimal global target point. To navigate to the selected global boundary point, the non-optimal viable path created by RRT is used, resulting in an invalid path cost and requiring a lot of time.

The contribution of this paper lies in proposing an efficient exploration framework that eliminates ineffective backtracking through a multistage exploration strategy. Additionally, a hierarchical navigation strategy is employed to minimize unnecessary forward and backward adjustments of the USV, achieving the maximum movement expectation in the lowest amount of exploration time. We tested the proposed exploration framework under the Robot Operating System (ROS) and compared it with Image-Based, RRT [10], and their combination method DSVP [34]. The results show that, for low-speed navigation, our results are better than the exploration efficiency of other methods in terms of completing the coverage of the whole environment and can be adapted to relatively high-speed navigation after subsequent improvement.

3. Problem Description

The problem considered within this work consists of exploring a bounded space

V \subset R^{3}

. There are three kinds of regions on the bounded space V known as the free region

V_{f r e e}

, the unknown region

V_{u n k n o w n}

, and the occupied obstacle region

V_{o c c u}

. When working, the USV relies on sensor information and

V_{k n o w n}

to realize its positioning and explore and map

V_{u n k n o w n}

.

Assumption 1.

USVs will be explored in a two-dimensional environment. Therefore, it is better to transform the above area into a two-dimensional grid map for exploration.

The problem can be defined as follows:

Question 1: Given the unknown environment, how can the proposed multiphase exploration strategy be used to acquire observation points and complete the overall environmental exploration?

Question 2: Given the USV’s motion characteristics, how can we use the proposed hierarchical navigation strategy to plan efficient navigation routes?

4. Methodology

Our exploration framework takes inspiration from DSVP and divides the strategy into three phases. In the exploration stage, we use dynamically expanded RRT to create a random tree and use Image-Based boundary detection methods to jointly obtain boundary observation points in the local layer for clustering to integrate adjacent and invalid observation points. We also obtain the best local observation points from the gain function. When the USV moves to the assigned local observation point in the pruning phase, the random tree will be pruned instead of reset. In the effective backtracking phase, when there is no unknown region in the local layer, the USV will backtrack to the best global boundary observation point and continue to complete the exploration. In this phase, the USV will implement the hierarchical navigation strategy. The hierarchical navigation strategy involves planning a dynamic and feasible path covering the current local layer and global layer observation points given the premise of considering the kinematic model of the USV. During the work, the USV has converted among three phases to explore the environment.

4.1. Multiphase Exploration Strategy

4.1.1. Exploration Phase

The exploration phase is used to extend

V_{k n o w n}

. We use dynamic extended RRT and a frontier detection method based on image segmentation to generate observation points around the USV. Figure 2a shows the process of the exploration phase, with the USV as the geometric center and the blue dashed region

L \subset R^{3}

defined as the local layer. In the process of exploration, RRT and frontier detection methods based on image segmentation are used for partition detection.

Define the upper half of the local layer

L_{T} \subset L

and create a dynamic random tree with the current position of the USV

P_{B}

as the root. The nodes generated by each iteration of the random tree after meeting the boundary are the local observation points

X_{L}^{R}

in the upper half of the region. RRT is selected for high search space coverage and, in concert with the pruning stage, the growing parts can be retained each time, reducing the calculation cost. Moreover, due to the motion characteristics of the USV, it is not easy to reach the target point behind itself in the local layer. Therefore, RRT is only used to detect the upper half of the region, and priority is given to assigning observation points to the USV first.

Since the growth range of RRT is specified, the growing branches can better go along the direction of the USV, which is conducive to the acquisition of boundary points. Also, because the limited growth range is specified, the possibility of dropping corners is significantly reduced. In the lower part of the local layer

L_{B} \subset L

, the Canny edge detection method is used, which is consistent with the traditional boundary detection method. Given the grid map and limited exploration area, in this case, it does not take up more memory and has more efficient boundary extraction, and the center of the extracted boundary is selected as the local observation point

X_{L}^{F}

in the lower half of the region.

In the exploration stage, the relationship of each parameter is described as follows:

\{\begin{matrix} X_{L} = \sum_{i = 1}^{h} {X_{L}^{R}}_{i} + \sum_{j = 1}^{k} {X_{L}^{F}}_{j} \\ s . t . {X_{L}^{R}}_{i}, {X_{L}^{F}}_{j} \in {L_{T}, L_{B}, F O V (S_{l}), L}, \end{matrix}

(1)

\{\begin{matrix} R (X_{L}^{i}) = λ T (X_{L}^{i}, P_{B}) I (X_{L}^{i}) - N (X_{L}^{i}, P_{B}) \\ s . t . i \in {R, F}, \end{matrix}

(2)

T (X_{L}^{i}, P_{B}) = \{\begin{matrix} 1 + T_{g a i n}, i f ψ \leq \frac{π}{6} \\ 1 - T_{g a i n}, i f \frac{π}{6} < ψ \leq \frac{π}{2} \\ 0, i f ψ < \frac{π}{2}, \end{matrix}

(3)

T_{g a i n} = \{\begin{matrix} λ_{1} sin ψ, i f ψ \leq \frac{π}{36} \\ cos ψ, i f \frac{π}{36} < ψ \leq \frac{π}{12} \\ λ_{2} sin ψ, i f \frac{π}{12} < ψ \leq \frac{π}{6} . \end{matrix}

(4)

In Equation (1),

X_{L}

is defined as the set of local observation points obtained in the exploration phase. The subscripts of

X_{L}^{R}

and

X_{L}^{F}

, which are obtained by RRT and the edge detection method, represent the order of generating corresponding observation points, and the local observation points are restricted to the local layer. Further,

F O V (S_{l})

is the effective field of view of the sensor, and its coverage is smaller than the planning area of the local layer

L

, while the acquisition process of observation points will be effectively carried out within the scope of

F O V (S_{l}) \cup L

. As a result, the unknown boundary area will be closer to the visual constraint range of

L

, and the unknown area in

L

will be acquired more purposefully. Considering the motion characteristics of the USV with limited turning, in the exploration phase, directions are preferentially allocated to the observation points

X_{L}^{R}

in the current exploration, while

X_{L}^{F}

is distributed behind the USV, so priority is not given. All observation points that have not been visited are imported into the global target point set

X_{G}

with delay and lag. Equations (2)–(4) show the gain function used to calculate each local observation point, which is similar to the method used in [10]. On this basis, the steering gain is added, which is more suitable for the exploration characteristics of the USV.

In Equation (2),

X_{L}^{i} = X_{L}^{R} \cup X_{L}^{F}

is used to denote all observation points in the local range, and the weight

λ

is used to make the gain

T \cdot I

have a similar order of magnitude as the navigation cost N. Additionally,

I (X_{L}^{i})

represents the information gain at the observation point, whose size is related to the selection of the sensor, and it is expressed in the form of the unknown area covered. As shown in Figure 3, the red circular area is defined as the effective field of view of the sensor, and the sum of the yellow and pink areas is defined as the unknown area, which is different from the traditional use of only the yellow area and abandons the small occupied area of the grid. By increasing the information gain, the influence of navigation costs can be reduced, and the USV can go to the local observation point farther away. The information gained from the observation point is expressed as the sum of the yellow and pink areas. Further,

N (X_{L}^{i}, P_{B})

represents the navigation cost of the USV from the current position

P_{B}

to the observation point

X_{L}^{i}

. Here, the feasible path generated by obstacle occlusion is not considered but is only represented by Euclidean distance. Also,

T (X_{L}^{i}, P_{B})

represents the steering gain brought by the observation point

X_{L}^{i}

, whose specific magnitude is given by Equations (3) and (4). In Equation (3),

ψ

represents the radian between the target point and the direction of the USV. Further,

T_{g a i n}

is the steering gain coefficient, and its magnitude is related to

ψ

. In Equation (4),

ψ_{1}

and

ψ_{2}

represent the steering weight coefficient, which is the order of magnitude used for balance calculation. Here, considering that a large steering angle may have a certain impact on navigation, the selection should meet

ψ_{1} > 10 ψ_{2}

to ensure the priority of small angle selection. It can be seen from the selection of

T (X_{L}^{i}, P_{B})

and

T_{g a i n}

that, by the size of

ψ

, the steering gain and the steering weight coefficient are calculated, and the USV is encouraged to go to the observation point with a small angle in the current direction; that is, there is a small amplitude of steering during the voyage, which can cover the map more effectively. After obtaining the best observation point, the USV enters the pruning phase when it moves to point B in Figure 2b after one iteration.

4.1.2. Pruning Phase

The main purpose of pruning is to remove useless RRT nodes from the local layer. The so-called useless nodes are the nodes that are occluded and outside the upper half of the current local layer. As shown in Figure 2b, firstly, the physical structure of the random tree is updated, and the root is converted from A to B. Secondly, all nodes and branches that are blocked by obstacles and outside the upper part of the current local layer are deleted, such as the light blue nodes and branches in the figure. Finally, the empty node without branches is connected to the current root, B. During the pruning process, new local observation points, namely the orange nodes in the graph, are randomly sampled and added to the local observation point set. Meanwhile, local observation points

X_{L}^{G}

that are not in the current local layer are transformed into global observation points and added to the global observation point set

X_{G}

. Note that

X_{L}^{G}

does not consist only of the

X_{L}^{F}

legacy, but also of

X_{L}^{R}

, which is not selected and whose surroundings are not detected during the exploration phase. In the pruning phase, because part of each growth iteration is retained, it will use a smaller amount of computation than the method of regenerating a new local random tree. In addition, the boundary detection algorithm based on image segmentation is collocated. In the process of exploration, the previous phase of

L_{T}

will take part in the composition of the current moment

L_{B}

. This will largely avoid missing unexplored corners due to the random nature of tree growth.

Algorithm 1 defines the detailed process of the pruning phase. The local layer is updated at a constant frequency. At each update, all

X_{L}

values are checked, and the local observation points

X_{L}^{G}

, which were dropped into the global layer during the exploration phase, are imported into

X_{G}

. This procedure corresponds to lines 3 through 8 in Algorithm 1. Then,

P_{B}

is transformed into the root node

V_{i n i t}

, and all RRT nodes in

L

are checked. All nodes V and branches E that are not in

L_{T}

are pruned, and the nodes that are not pruned are directly connected to the root node. In terms of code implementation, we designed a

C o n

function to connect nodes.

Algorithm 1: Pruning Phase

4.1.3. Effective Backtracking Phase

Backtracking is inevitable during exploration, but we can avoid invalid backtracking. So-called invalid backtracking refers to the act of repeatedly moving back and forth between a few regional explorations, which greatly reduces exploration efficiency. The backtracking that occurs when the current subarea is fully explored before entering another subarea can be an effective backtrace.

In the exploration process, when there is no unknown boundary information in

L_{T}

, that is,

X_{L}^{R} = ⌀

, it will change from the local exploration phase to the effective backtracking phase of the global layer. In this phase, the local image observation point

X_{L}^{F}

, the global layer information

G \subset R^{3}

, and the global observation point set information

X_{G}

are used, and the hierarchical navigation strategy is combined to complete the effective backtracking. Here,

X_{L}^{F}

, which is not converted to

X_{L}^{G}

, consists of

X_{L} \in L_{B}

, left over from the exploration phase. Further,

G

is an incremental map obtained from the beginning of exploration until now, which is mainly used to plan the navigation path of the USV in the backtracking phase. Also,

X_{G}

is composed of

X_{L}^{G}

left over from each iteration. In the iteration process, the global observation point with the nearest distance and the least backtracking cost is selected from

X_{G}

as the navigation target point, and a reasonable path is planned through the hierarchical navigation strategy to complete effective backtracking. Here, the selection of the best global observation point follows the generated time order. The time order is determined by marking the time angle index for the global target point

X_{G}^{i} \subset X_{G}

. Each backtracking will search from the observation point generated the previous time in

X_{G}

. In the backtracking phase, the USV sailed in the area of the known, but it may be gathering information to extend other boundary conditions. The planning of the course may use

F O V (S_{l})

to cover the unknown boundary, so it needs to update the list of the global observation point

X_{G}

. If

X_{G} = ⌀

appears, the exploration work has been completed and the exploration will end.

4.2. Hierarchical Navigation Strategy

Under the premise of considering the kinematics model of the USV, the hierarchical navigation strategy plans the navigation path of the USV at the local layer and the global layer, respectively, by assigning local observation points and global observation points simultaneously. The main purpose of this method is to plan a reasonable turning route in the local layer so that the USV can change course efficiently and plan sparse paths in the global layer for the USV to follow and complete effective backtracking, thereby complete environmental exploration faster. The triggering of the hierarchical navigation strategy should satisfy:

\begin{matrix} \{\begin{matrix} B e s t P o i n t X_{L}^{i} = {X_{L}^{R}}_{i} \\ s . t . \sum_{\binom{i = 1}{i \neq j}}^{h} {X_{L}^{R}}_{i} = ⌀ \end{matrix} . \end{matrix}

(5)

In the equation,

B e s t P o i n t X_{L}^{i}

is the local RRT node with the maximum gain R in the

L_{T}

layer in the exploration phase and is the target point of the next voyage of the USV in the exploration process. The second formula indicates that there are no other unknown boundary points

{X_{L}^{R}}_{i} = ⌀

except

B e s t P o i n t X_{L}^{b}

in the current

L_{T}

layer. This makes the USV sail to

B e s t P o i n t X_{L}^{i}

; the local exploration phase is completed and transformed into the backtracking phase. In the traditional backtracking phase, the USV usually needs to repeatedly advance and retreat to adjust its course to the next observation point, which greatly reduces exploration efficiency and can even cause dangerous collisions. This situation can be avoided by using this hierarchical navigation strategy under the premise of considering the kinematic model of the USV.

4.2.1. Local Navigation Planning

In the local layer planning, the planned path

T_{L}

should have two constraints.

Firstly, it meets the requirement that the USV can adjust its course by planning a path with a reasonable turning radius to complete the subsequent planning after sailing to

X_{L}^{b}

, completing the local exploration stage, and realizing dynamic obstacle avoidance at this stage.

Second, the

X_{L}^{F}

information needs to be dynamically detected during local layer planning, and the subsequent decision of the USV will depend on

X_{L}^{F}

. When

X_{L}^{F} \neq ⌀

in the local layer, the

F O V (S_{l})

of the USV in

T_{L}

should cover the remaining

X_{L}^{F}

as much as possible, which is consistent with the representation in lines 9 to 11 of Algorithm 2. Algorithm 3, referenced in Algorithm 2, will be detailed later in the text. And when the

X_{L}^{b}

exploration is completed, local planning will be carried out to go to

X_{L}^{F}

instead of going to the global best observation point.

Algorithm 2: Exploration Phase

As shown in Figure 4, the cyan rectangle represents the coverage area of the local layer. In

L

, the position of the USV at the current moment is

P_{B}

, and

X_{L}^{R}

is the only observation point in the upper half of the local region obtained and clustered by RRT. The planned local path between

P_{B}

and

X_{L}^{R}

refers to the Dubins navigation planning. Also,

P_{B}

and

X_{L}^{R}

are taken as the initial and final points of navigation planning, and the course of points

X_{L}^{R}

is specified as the maximum sideslip angle point. The selection of the

P_{B}

point turning here is related to the global optimal observation point

X_{G}^{b}

. Generally, in order to optimize the

T_{L}

, the same side of

X_{G}^{b}

is used as the turning direction by default, and the set radius, as the radius of the light pink circle shown in Figure 4, is used as the minimum turning radius to solve the Dubins navigation planning problem. After arriving at point

X_{L}^{R}

, because the local target point set is updated and detected at this stage, the USV will make different decisions based on whether

X_{L}^{F}

exists.

When

X_{L}^{F} \neq ⌀

, the local path planning should make the USV at view point

V_{p}

in

T_{L}

satisfy the conditions as follows:

\begin{matrix} V_{p} \in T_{L}, \end{matrix}

(6)

\begin{matrix} \forall X_{L}^{F} s . t . \exists V_{p} X_{L}^{F} i s c o v e r e d b y F O V (S_{l}) a t V_{p} . \end{matrix}

(7)

Equations (6) and (7) require

T_{L}

to pass through the sampled viewpoint

V_{p}

; that is, the randomly generated

V_{p}

is a point on the feasible path, and for any

X_{L}^{F}

, there exists a

V_{p}

so that the USV can observe at least one

X_{L}^{F}

when passing by this point. It should be noted that the selection of Equation (7) may lead to the possibility that multiple viewpoints observe the same

X_{L}^{F}

at the same time, which may cause the planning

T_{L}

to be non-optimal. But in general, the complexity of the water environment is not comparable to the complexity of the indoor environment, so there are no multiple

X_{L}^{F}

values. Therefore, although

T_{L}

is not optimal, it can achieve the desired effect. After sailing to the

V_{p}

, the local boundary detection will continue in the exploration phase, which is consistent with the representation in lines 4 to 9 of Algorithm 3. The track of the resulting

T_{L}

as a whole is shown by the red line in Figure 4.

When

X_{L}^{F} = ⌀

, the planned path should meet the requirements of global backtracking. At this time,

T_{L}

is related to

X_{G}^{b}

and

V_{b}

. In Figure 4, the light green dot

X_{G}^{b}

is defined as the global best observation point, and the green dot

V_{b}

is defined as the boundary point at the intersection of the global path planning and

L

. At this stage, the USV will not perform local boundary detection but only use

V_{b}

as the final target point to perform local stage backtracking, and its curve is shown by the dotted red line.

Algorithm 3: The Strategy of Hierarchical Navigation

4.2.2. Global Navigation Planning

In the global layer planning process, we assume that the motion model of the USV is an integrity constraint, and the USV is abstracted into a point without considering its orientation. Since the navigation is in a known environment, only the optimal track

T_{G}

from the current position

P_{B}

to

X_{G}^{b}

is planned, as shown in the dark green line in Figure 4. This is similar to the usual global planning algorithm. Therefore, here we only use A-Star, a depth-first global path planning algorithm, to generate rough planning for the global layer, while the specific navigation planning has been solved in the planning stage of the local layer. The overall navigation process has been expressed in detail in Algorithm 3.

The workflow of the entire framework is shown in Figure 5. The USV collects environmental information through LiDAR to update the point cloud data. The map module obtains the attitude data of the motion sensor, updates the local and global maps, and extracts and selects the exploration target points. Through the algorithm decision, the selected optimal target point is sent to the path planning module for navigation planning and, in this process, new sensor data are received and new maps are expanded. Then, it starts a new process loop.

5. Simulation

In this section, we conduct environmental exploration simulations in the virtual world of the Gazebo 9 simulation engine [35]. For comparing exploration algorithms, we used an ideal water surface environment without physical plug-ins such as wind, waves, and currents, which will be the focus of subsequent research. The physical characteristics and sensor configurations of the simulated USV [36] are consistent with those in the real world. Unlike ground vehicles that use wheeled odometers to directly obtain mileage information, our USV model employs an extended Kalman filter (EKF) to fuse IMU and GNSS data. This fusion generates odometer (Odom) messages that contain the estimated position and attitude information of the USV in space, which is used for localization through the robot localization package that incorporated EKF and GNSS information. The USV model is shown in Figure 6. The algorithm runs on the Ubuntu 18.04 operating system based on the ROS Melodic distribution, using a laptop computer with an Intel Core i7-4720HQ CPU (2.6 GHz) and 12 GB RAM to compare the performance of each exploration method.

5.1. Simulation Environment

Three water surface environments with different layouts and sizes were created for simulation, and the performance of the proposed exploration framework was verified by each of the three environments. Table 1 lists the characteristics of the simulation environment, and the specific map is shown in Figure 7.

Simple Enclosed Environment (SEE) (35 m × 40 m): Contains simple layouts and relatively narrow enclosures designed to examine the effectiveness of a hierarchical navigation strategy, as shown in Figure 7a.
Large Maze Environment (LME) (120 m × 60 m): Includes long narrow corridors and complex spatial layers. The performance of the exploration strategy is tested by checking whether invalid backtracking is generated, as shown in Figure 7b.
Harbor Basin Environment (HBE) (105 m × 85 m): The harbor basin of the Lingshui Campus of Dalian Maritime University is selected as the simulation environment and drawn at 1:2, aiming at the performance of the complete detection and exploration framework, as shown in Figure 7c.

5.2. Simulation Test Evaluation

Our simulation stipulates that the USV sails at a speed of 2 knots (≈1.03 m/s); a circular area with a radius of 20 m represents the effective field of vision of the sensor

F O V (S_{l})

; the local layer

L

is set as a circular area with a radius of 25 m; and the resolution of the map is set as 0.2 m. In the simulation, we compare two typical exploration methods and their fusion versions to our method (i.e., four methods). The two typical methods use the open source code of the applicable operating system version, and the fusion version has its unique 3D framework, which we wrote according to our understanding of relevant articles.

Image-Based: The Canny edge detection algorithm is used to extract the boundary observation points and search for the boundary points with the highest gain.
RRT: The randomness of global and local RRT trees is used to search for the best boundary observation point across space.
DSVP: The combination of the above two methods. This method adds an Image-Based edge detection algorithm to the global map to reduce backtracking generation and improve exploration efficiency.

Each method was run five times at three random locations for a total of 15 times. If the USV almost stopped moving or arrived at a time limit, the exploration was defined as complete, and the operation was finished. Here we define the time limit as a period after stopping movement (120 s for SEE and 300 s for LME and HBE). The selected results are the three best trajectories of each random point, as shown in Figure 8 (from left to right are: Ours, DSVP, Image-Based, and RRT, respectively), and the evaluation indexes (length of navigation path and exploration area) are the average values of these three times. Figure 8a shows the best trajectory of SEE, where the triangle marks the initial position of exploration. In this simulation, the other three methods are explored with the original navigation algorithm instead of using the hierarchical navigation strategy. The path width at the top left of the map is slightly larger than the turning radius of the USV, and the wall at the top and left of the map forms an impassable narrow path. As can be seen from Figure 8a, except for our method, other methods change course by retreating or even hitting obstacles. Moreover, RRT has obvious ineffective backtracking because it has limitations in acquiring target points, especially for USVs with limited movement.

Figure 9a,b compare the average travel distance and the average detection area of all the methods. Our method completes the environmental exploration after a voyage of 105.9 m in 267.7 s. It can be seen that Image-Based and RRT almost cannot work in the later stage of exploration and cannot cover the whole space every time. This is because the planned path is constrained by the environment, resulting in a smaller average detection area than our method. Because the DSVP is limited by navigation, its exploration effect is also somewhat lacking compared with ours. In addition, the three optimal exploration times and the average exploration time of the above methods are shown in Table 2.

Figure 8b,c show the optimal trajectories of LME and HBE. To reduce the risk of collision with obstacles in the simulation, Dubins local route planning is added to all the planning aspects of the exploration tests. From the best trajectories of DSVP, Image-Based, and RRT in the figure, it can be seen that these methods will produce a certain degree of invalid backtracking, and most of them will produce a large degree of turning in the navigation trajectory. Our method completed the exploration in LME and HBE with 332.45 m in 1182.7 s and 428.9 m in 1508.0 s, respectively. Figure 10 and Figure 11 compare the evaluation indexes of all methods. According to the results, the average sailing speed is not different from other methods, but the exploration can be completed in the shortest time. When the environment is large, our exploration strategy is better than other methods, and the average running time is the smallest. Corresponding Table 3 and Table 4 record the three best exploration times and the average exploration time.

In our comparison, we use two primary metrics to evaluate exploration efficiency:

δ

(

m^{2}

/s) and

r_{δ}

. The metric

δ

(

m^{2}

/s) represents the average exploration efficiency of an algorithm, defined as the average area explored per second during the exploration process. The metric

r_{δ}

indicates the relative efficiency compared to our proposed strategy. Table 5 compares the exploration efficiency of four methods using these metrics. It can be seen that, in the same virtual environment, our method significantly improves exploration efficiency and reduces exploration time compared to the other methods.

6. Conclusions

Aiming at the problem of water surface environment exploration, we propose an exploration framework that includes a multistage exploration strategy and hierarchical navigation strategy. The transition among the exploration, pruning, and practical backtracking phases thoroughly explores the entire environment by introducing the concepts of global and local layers. In the local layer, RRT and boundary detection algorithms detect the boundary before and after the USV’s voyage. The gain of the target point is optimized to reduce the inherent constraint that the target point is different from the exploration direction. In the global layer, the hierarchical navigation strategy is introduced to plan reasonable navigation routes to improve the timeliness of exploration and guide the environment to continue to expand. Finally, we compare the proposed method with three other methods in three large simulation environments (including SEE, LME, and HBE) to verify the effectiveness and robustness of the proposed method. The results show that our method takes less time and covers the area faster than other methods. However, the low-cost 2D LiDAR used in the simulation itself cannot provide the details required for accurate obstacle detection in the real surface environment, and the simulated maritime scene does not include environmental factors such as waves and currents. In the future, we will use these problems as our research direction and, on this basis, consider the impact of near-shallow water shorelines on USV operations as well as the impact of dynamic obstacles on environmental detection, and we will conduct test experiments in real scenes.

Author Contributions

Conceptualization, B.S. and J.Z.; methodology, B.S.; software, J.Z.; validation, B.S., Y.F. and J.Z.; formal analysis, B.S.; investigation, B.S.; resources, Y.F. and X.H.; data curation, B.S. and Y.W.; writing—original draft preparation, J.Z.; writing—review and editing, B.S. and Z.S.; visualization, J.Z.; supervision, Y.F.; project administration, X.H.; funding acquisition, Y.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Key Research and Development Program of China (Grant number 2022YFB4301401), the Key Program for Basic Research of China (Grant number JCKY2023206B026), National Natural Science Foundation of China (Grant number 61976033), Pilot Base Construction and Pilot Verification Plan Program of Liaoning Province of China (Grant number 2022JH24/10200029), Fundamental Research Funds for the Central Universities (Grant number 3132023512), China Postdoctoral Science Foundation(Grant number 2022M710569), Liaoning Province Doctor Startup Fund (Grant number 2022-BS-094).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DSVP	Dual-Stage Viewpoint Planner
HBE	Harbor Basin Environment
LME	Large Maze Environment
RRT	Rapidly Exploring Random Tree
RRT	Rapidly Exploring Random Tree
USV	Unmanned Surface Vehicle

References

Xie, Y.; Liang, X.; Lou, L.; Guo, X. Self-organization method of usv swarm target strike task based on ant colony algorithm. In Proceedings of the 2019 3rd International Symposium on Autonomous Systems (ISAS), Shanghai, China, 29–31 May 2019; pp. 388–393. [Google Scholar]
Chen, X.; Dou, S.; Song, T.; Wu, H.; Sun, Y.; Xian, J. Spatial-Temporal Ship Pollution Distribution Exploitation and Harbor Environmental Impact Analysis via Large-Scale AIS Data. J. Mar. Sci. Eng. 2024, 12, 960. [Google Scholar] [CrossRef]
Tabb, M.; Ahuja, N. Multiscale image segmentation by integrated edge and region detection. IEEE Trans. Image Process. 1997, 6, 642–655. [Google Scholar] [CrossRef] [PubMed]
Keidar, M.; Kaminka, G.A. Robot exploration with fast frontier detection: Theory and experiments. In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems, Valencia, Spain, 4–8 June 2012. [Google Scholar]
Zhang, S.; Zhang, X.; Li, T.; Yuan, J.; Fang, Y. Fast Active Aerial Exploration for Traversable Path Finding of Ground Robots in Unknown Environments. IEEE Trans. Instrum. Meas. 2022, 71, 7502213. [Google Scholar] [CrossRef]
Kulich, M.; Faigl, J.; Preucil, L. On distance utility in the exploration task. In Proceedings of the IEEE International Conference on Robotics & Automation, Shanghai, China, 9–13 May 2011. [Google Scholar]
Kulich, M.; Kubalík, J.; Peuil, L. An Integrated Approach to Goal Selection in Mobile Robot Exploration. Sensors 2019, 19, 1400. [Google Scholar] [CrossRef] [PubMed]
Gao, W.; Booker, M.; Adiwahono, A.; Yuan, M.; Wang, J.; Yun, Y.W. An improved Frontier-Based Approach for Autonomous Exploration. In Proceedings of the International Conference on Control, Automation, Robotics & Vision, Singapore, 18–21 November 2018. [Google Scholar]
Meng, Z.; Qin, H.; Chen, Z.; Chen, X.; Sun, H.; Lin, F.; Ang, M.H. A Two-Stage Optimized Next-View Planning Framework for 3-D Unknown Environment Exploration, and Structural Reconstruction. IEEE Robot. Autom. Lett. 2017, 2, 1680–1687. [Google Scholar] [CrossRef]
Umari, H.; Mukhopadhyay, S. Autonomous robotic exploration based on multiple rapidly-exploring randomized trees. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots & Systems, Vancouver, BC, Canada, 24–28 September 2017; pp. 1396–1402. [Google Scholar]
Qiao, W.; Fang, Z.; Si, B. A sampling-based multi-tree fusion algorithm for frontier detection. Int. J. Adv. Robot. Syst. 2019, 16, 172988141986542. [Google Scholar] [CrossRef]
Yamauchi, B. A frontier-based approach for autonomous exploration. In Proceedings of the IEEE International Symposium on Computational Intelligence in Robotics & Automation, Monterey, CA, USA, 10–11 July 2002. [Google Scholar]
Zhong, P.; Chen, B.; Lu, S.; Meng, X.; Liang, Y. Information-Driven Fast Marching Autonomous Exploration With Aerial Robots. IEEE Robot. Autom. Lett. 2022, 7, 810–817. [Google Scholar] [CrossRef]
Kim, H.; Kim, H.; Lee, S.; Lee, H. Autonomous Exploration in a Cluttered Environment for a Mobile Robot With 2D-Map Segmentation and Object Detection. IEEE Robot. Autom. Lett. 2022, 7, 6343–6350. [Google Scholar] [CrossRef]
Bircher, A.; Kamel, M.; Alexis, K.; Oleynikova, H.; Siegwart, R. Receding horizon next-best-view planner for 3D exploration. In Proceedings of the IEEE International Conference on Robotics & Automation, Stockholm, Sweden, 16–21 May 2016. [Google Scholar]
Dang, T.; Mascarich, F.; Khattak, S.; Papachristos, C.; Alexis, K. Graph-based Path Planning for Autonomous Robotic Exploration in Subterranean Environments. In Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 3–8 November 2020. [Google Scholar]
Dang, T.; Tranzatto, M.; Khattak, S.; Mascarich, F.; Alexis, K.; Hutter, M. Graph-based subterranean exploration path planning using aerial and legged robots. J. Field Robot. 2020, 37, 1363–1388. [Google Scholar] [CrossRef]
Steinbrink, M.; Koch, P.; Jung, B.; May, S. Rapidly-Exploring Random Graph Next-Best View Exploration for Ground Vehicles. In Proceedings of the 2021 European Conference on Mobile Robots (ECMR), Bonn, Germany, 31 August–3 September 2021. [Google Scholar]
Gao, H.; Zhang, X.; Wen, J.; Yuan, J.; Fang, Y. Autonomous Indoor Exploration Via Polygon Map Construction and Graph-Based SLAM Using Directional Endpoint Features. IEEE Trans. Autom. Sci. Eng. 2019, 16, 1531–1542. [Google Scholar] [CrossRef]
Schmid, L.; Pantic, M.; Khanna, R.; Ott, L.; Siegwart, R.; Nieto, J. An Efficient Sampling-Based Method for Online Informative Path Planning in Unknown Environments. IEEE Robot. Autom. Lett. 2020, 5, 1500–1507. [Google Scholar] [CrossRef]
Bhattacharjee, P.; Burger, M.; Börner, A.; Morgenshtern, V.I. Region-of-Interest Prioritised Sampling for Constrained Autonomous Exploration Systems. IEEE Trans. Comput. Imaging 2022, 8, 302–316. [Google Scholar] [CrossRef]
Hu, J.; Niu, H.; Carrasco, J.; Lennox, B.; Arvin, F. Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning. IEEE Trans. Veh. Technol. 2020, 69, 14413–14423. [Google Scholar] [CrossRef]
Li, H.; Zhang, Q.; Zhao, D. Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 2064–2076. [Google Scholar] [CrossRef]
Doroodgar, B.; Liu, Y.; Nejat, G. A Learning-Based Semi-Autonomous Controller for Robotic Exploration of Unknown Disaster Scenes While Searching for Victims. IEEE Trans. Cybern. 2014, 44, 2719–2732. [Google Scholar] [CrossRef]
Niroui, F.; Zhang, K.; Kashino, Z.; Nejat, G. Deep Reinforcement Learning Robot for Search and Rescue Applications: Exploration in Unknown Cluttered Environments. IEEE Robot. Autom. Lett. 2019, 4, 610–617. [Google Scholar] [CrossRef]
Tai, L.; Liu, M. Mobile robots exploration through cnn-based reinforcement learning. Robot. Biomim. 2016, 3, 24. [Google Scholar] [CrossRef]
Cieslewski, T.; Kaufmann, E.; Scaramuzza, D. Rapid exploration with multi-rotors: A frontier selection method for high speed flight. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017. [Google Scholar]
González-Baños, H.H.; Latombe, J.-C. Navigation Strategies for Exploring Indoor Environments. Int. J. Robot. Res. 2002, 21, 829–848. [Google Scholar] [CrossRef]
Zhang, T.; Yu, J.; Li, J.; Wei, J. Upgraded trajectory planning method deployed in autonomous exploration for unmanned aerial vehicle. Int. J. Adv. Robot. Syst. 2022, 19, 172988062211096. [Google Scholar] [CrossRef]
Karaman, S.; Frazzoli, E. Sampling-based algorithms for optimal motion planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
Lavalle, S.M. Rapidly-exploring random trees: A new tool for path planning. Comput. Ence Dept. Oct 1998, 98, 1–4. [Google Scholar]
Selin, M.; Tiger, M.; Duberg, D.; Heintz, F.; Jensfelt, P. Efficient Autonomous Exploration Planning of Large-Scale 3-D Environments. In Proceedings of the International Conference on Robotics and Automation, Montreal, QC, Canada, 20–24 May 2019. [Google Scholar]
Zhao, L.; Yan, L.; Hu, X.; Yuan, J.; Liu, Z. Efficient and High Path Quality Autonomous Exploration and Trajectory Planning of UAV in an Unknown Environment. ISPRS Int. J. Geo-Inf. 2021, 10, 631. [Google Scholar] [CrossRef]
Zhu, H.; Cao, C.; Xia, Y.; Scherer, S.; Zhang, J.; Wang, W. DSVP: Dual-Stage Viewpoint Planner for Rapid Exploration by Dynamic Expansion. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 27 September–1 October 2021; pp. 7623–7630. [Google Scholar] [CrossRef]
Open Robotics. Gazebo Simulator. 2005. Available online: http://gazebosim.org/ (accessed on 25 April 2023).
Lenes, J.H. Autonomous Online Path Planning and Path-Following Control for Complete Coverage Maneuvering of a USV. 2019. Available online: http://hdl.handle.net/11250/2622919 (accessed on 25 April 2023).

Figure 1. Diagram of the navigation framework.

Figure 2. The first two phases of the exploration strategy. (a) In the exploration phase, the coverage of the local layer

L

at the current moment and the observation point

X_{L}

obtained by the fusion exploration method are displayed. (b) In the pruning phase, the random tree and local observation point

X_{L}

are updated in one iteration. The green dots represent the observation points that changed from

X_{L}

to

X_{G}

after one iteration, which are explored in the effective backtracking phase. Subfigures (c–f) represent the realization process of the exploration and pruning phases.

Figure 2. The first two phases of the exploration strategy. (a) In the exploration phase, the coverage of the local layer

L

at the current moment and the observation point

X_{L}

obtained by the fusion exploration method are displayed. (b) In the pruning phase, the random tree and local observation point

X_{L}

are updated in one iteration. The green dots represent the observation points that changed from

X_{L}

to

X_{G}

after one iteration, which are explored in the effective backtracking phase. Subfigures (c–f) represent the realization process of the exploration and pruning phases.

Figure 3. Information gain of boundary observation points.

Figure 4. Hierarchical navigation planning in the backtracking phase.

Figure 5. Schematic diagram of the exploration process.

Figure 6. The simulated USV model.

Figure 7. Simulation environments: (a) SEE, (b) LME, and (c) HBE.

Figure 8. The exploration trajectory of the USV in three typical scenarios. (a–c) depict the optimal trajectories for SEE, LME, and HBE, respectively. The red, green, and black lines represent the exploration trajectories of the USV from different initial positions, with the small triangles marking the starting points of each trajectory. In each row, from left to right, the trajectories are shown for our method, DSVP, Image-Based, and RRT, respectively.

Figure 9. SEE simulation results: (a) shows the relationship between the average sailing distance and time of our method, and (b) is the relationship between mean detection area and time.

Figure 10. LME simulation results. The diagram is similar to the layout in Figure 9.

Figure 11. HBE simulation results. The diagram is similar to the layout in Figure 9.

Table 1. Simulation environment characteristics.

	Large-Scale	Convoluted	Narrow Corridors	Cluttered Obstacles
SEE		✓	✓	✓
LME	✓	✓	✓	✓
HBE	✓

Table 2. Exploration time of four methods in the SEE scene.

Method	Exploration Time			Average Time
Method	Yellow	Green	Black	Average Time
Ours	207.4 s	267.7 s	235.6s	236.9 s
DSVP	395.4 s	274.2 s	390.0 s	353.2 s
Image-Based	375.3 s	382.2 s	439.5 s	399.0 s
RRT	384.8 s	420.3 s	395.2 s	400.1 s

Table 3. Exploration time of four methods in the LME scene.

Method	Exploration Time			Average Time
Method	Yellow	Green	Black	Average Time
Ours	1104.4 s	1182.7 s	812.8 s	1033.3 s
DSVP	1436.4 s	1307.8 s	1083.9 s	1276.0 s
Image-Based	1469.5 s	1356.6 s	1102.8 s	1309.6 s
RRT	2198.2 s	1519.0 s	1610.1 s	1775.7 s

Table 4. Exploration time of four methods in the HBE scene.

Method	Exploration Time			Average Time
Method	Yellow	Green	Black	Average Time
Ours	1508.0 s	1419.7 s	1379.6 s	1435.8 s
DSVP	2117.3 s	1859.5 s	1649.3 s	1875.4 s
Image-Based	2091.7 s	1756.5 s	2665.4 s	2171.2 s
RRT	2463.7 s	3104.3 s	2129.2 s	2565.7 s

Table 5. Comparison of exploration efficiency.

	Ours		DSVP		Image-Based		RRT
	$δ$	$r_{δ}$	$δ$	$r_{δ}$	$δ$	$r_{δ}$	$δ$	$r_{δ}$
SEE	5.0	1	3.3	0.66	2.9	0.58	2.8	0.56
LME	4.9	1	3.9	0.80	3.8	0.78	2.6	0.53
HBE	5.3	1	3.5	0.66	2.6	0.49	2.3	0.43

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Song, B.; Zhang, J.; Han, X.; Fan, Y.; Sun, Z.; Wang, Y. An Efficient Autonomous Exploration Framework for Unmanned Surface Vehicles in Unknown Waters. J. Mar. Sci. Eng. 2024, 12, 1622. https://doi.org/10.3390/jmse12091622

AMA Style

Song B, Zhang J, Han X, Fan Y, Sun Z, Wang Y. An Efficient Autonomous Exploration Framework for Unmanned Surface Vehicles in Unknown Waters. Journal of Marine Science and Engineering. 2024; 12(9):1622. https://doi.org/10.3390/jmse12091622

Chicago/Turabian Style

Song, Baojian, Jiahao Zhang, Xinjie Han, Yunsheng Fan, Zhe Sun, and Yingjie Wang. 2024. "An Efficient Autonomous Exploration Framework for Unmanned Surface Vehicles in Unknown Waters" Journal of Marine Science and Engineering 12, no. 9: 1622. https://doi.org/10.3390/jmse12091622

APA Style

Song, B., Zhang, J., Han, X., Fan, Y., Sun, Z., & Wang, Y. (2024). An Efficient Autonomous Exploration Framework for Unmanned Surface Vehicles in Unknown Waters. Journal of Marine Science and Engineering, 12(9), 1622. https://doi.org/10.3390/jmse12091622

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Efficient Autonomous Exploration Framework for Unmanned Surface Vehicles in Unknown Waters

Abstract

1. Introduction

2. Problem Formulation

3. Problem Description

4. Methodology

4.1. Multiphase Exploration Strategy

4.1.1. Exploration Phase

4.1.2. Pruning Phase

4.1.3. Effective Backtracking Phase

4.2. Hierarchical Navigation Strategy

4.2.1. Local Navigation Planning

4.2.2. Global Navigation Planning

5. Simulation

5.1. Simulation Environment

5.2. Simulation Test Evaluation

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI