3D Dubins Curve-Based Path Planning for UUV in Unknown Environments Using an Improved RRT* Algorithm

Pan, Feng; Cui, Peng; Cui, Bo; Yan, Weisheng; Zhang, Shouxu

doi:10.3390/jmse13071354

Open AccessArticle

3D Dubins Curve-Based Path Planning for UUV in Unknown Environments Using an Improved RRT* Algorithm

by

Feng Pan

^1,*,

Peng Cui

²,

Bo Cui

¹,

Weisheng Yan

¹ and

Shouxu Zhang

¹

School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China

²

SAIC Motor R&D Innovation Headquarters, Shanghai 201804, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2025, 13(7), 1354; https://doi.org/10.3390/jmse13071354

Submission received: 22 May 2025 / Revised: 13 July 2025 / Accepted: 13 July 2025 / Published: 16 July 2025

(This article belongs to the Special Issue Intelligent Measurement and Control System of Marine Robots)

Download

Browse Figures

Versions Notes

Abstract

The autonomous navigation of an Unmanned Underwater Vehicle (UUV) in unknown 3D underwater environments remains a challenging task due to the presence of complex terrain, uncertain obstacles, and strict kinematic constraints. This paper proposes a novel smooth path planning framework that integrates improved Rapidly-exploring Random Tree* (RRT*) with 3D Dubins curves to efficiently generate feasible and collision-free trajectories for nonholonomic UUVs. A fast curve-length estimation approach based on a backpropagation neural network is introduced to reduce computational burden during path evaluation. Furthermore, the improved RRT* algorithm incorporates pseudorandom sampling, terminal node backtracking, and goal-biased exploration strategies to enhance convergence and path quality. Extensive simulation results in unknown underwater scenarios with static and moving obstacles demonstrate that the proposed method significantly outperforms state-of-the-art planning algorithms in terms of smoothness, path length, and computational efficiency.

Keywords:

path planning; obstacle avoidance; RRT*; 3D Dubins curves

1. Introduction

The maritime environment presents significant challenges for autonomous systems, including uncertain weather, limited visibility, and high operational risk [1,2]. In recent years, Unmanned Underwater Vehicle (UUV) technology has seen rapid development in recent years, offering promising capabilities in marine exploration, environmental monitoring, infrastructure inspection, and defense applications [3,4,5,6]. Real-time path planning for an UUV is challenging due to the complexity of unknown underwater environments and the vehicle’s nonholonomic constraints. To enable fully autonomous operations in such conditions, a robust path planning system must generate feasible and smooth trajectories while ensuring collision avoidance in dynamic and cluttered surroundings.

Persistent efforts have been made to address the prescribed disadvantages [7,8,9]. In [8], a collision-free navigation method for dynamic environments was proposed. This method constructs a probabilistic graph model of the workspace and combines the A* search algorithm with the potential field method to generate feasible maneuvering paths. In [9], a two-stage path planning approach was introduced to guide a robot operating in a partially known environment. The approach first builds a visibility graph of the static environment and uses the Dijkstra algorithm to determine a global path. During the path-following phase, a dual-strategy fuzzy controller is employed to continuously generate local collision-free maneuvering paths. In [10], an efficient convex decomposition method was proposed, which divides the free space into convex polyhedral regions and utilizes the Jump Point Search algorithm to plan safe paths. However, these methods rely on search algorithms to determine resulting paths in structured and static environments, making them inefficient when applied to high-dimensional settings.

In recent years, sampling-based path planning methods have gained increasing attention, particularly incremental sampling techniques such as the Rapidly-exploring Random Tree (RRT) and its improved variant, RRT* [11]. These algorithms incrementally construct collision-free trajectories by sampling the configuration space rather than searching over a grid or structured map, allowing them to perform well in unstructured and high-dimensional environments. Their generated paths typically converge probabilistically to an optimal solution. In [12], an asymptotically optimal motion planning method was proposed for robots with linear dynamics, using the RRT* algorithm for global planning and generating sample trajectories between state pairs via an optimal controller. In [13], the RRT* algorithm was enhanced by narrowing the sampled state set, improving performance in finding near-optimal solutions. These approaches primarily focus on static environments. To address dynamic obstacles, Ref. [14] proposed a hybrid method combining an improved Dynamic Window Approach (DWA) with RRT* to plan both global paths and local trajectories for UUVs in dynamic settings. In [15], RRT* was integrated with Bézier curves to achieve path planning in environments with unknown obstacles.

Despite these advances, several challenges remain in the application of incremental sampling-based path planners. First, the generated maneuvers are often vulnerable to unexpected environmental changes, which can lead to wasted computational effort and increased planning cost during replanning. Second, generating a sufficient number of reachable sample nodes is critical for adequately exploring the environment and producing collision-free maneuvers. However, this process can be computationally inefficient and time-consuming due to kinematic constraints and the presence of environmental obstacles [16,17].

To address the aforementioned limitations, this paper proposes a smooth path planning method for UUVs in unknown 3D environments, which integrates an improved RRT* algorithm with 3D Dubins curves. The proposed method operates under a reactive planning and replanning framework to accommodate environmental uncertainty and the absence of prior information. During replanning, collision-free segments of previously generated paths are reused as structural foundations, while smooth maneuver trajectories are constructed after a coarse path is generated using the enhanced RRT*. To improve planning efficiency, the enhanced RRT* algorithm incorporates pseudorandom sampling, terminal node backtracking-based extension, and goal-biased exploration as key procedures, while 3D Dubins curves are employed to guide the path generation process. Finally, a fast and efficient path reshaping process ensures the resulting trajectory is both smooth and collision-free. The main contributions of this paper are summarized as follows:

The geometric characteristics of 3D Dubins curves are analyzed, and a Backpropagation Neural Network (BPNN) is designed to enable fast length estimation of these curves.
A pseudorandom sampling strategy is developed to enhance obstacle avoidance performance during node generation in cluttered underwater environments.
A novel terminal node backtracking mechanism is introduced to maintain the spatio-temporal feasibility of the tree expansion process in dynamic scenarios.

The remainder of this paper is organized as follows: Section 2 presents the problem formulation and mathematical preliminaries. Section 3 describes the proposed path planning method in detail. Simulation results and comparative analysis are shown in Section 4. Finally, Section 5 concludes the paper.

2. Mathematical Formulation

2.1. Problem Statement

The operational environment is denoted by

X

, which consists of the obstacle region

X_{o}

and the collision-free region

X_{f}

. It should be pointed out that there may be both static obstacles and moving obstacles in

X_{o}

. The obstacle region

X_{o}

may include both static and dynamic obstacles. To facilitate computational tractability in path planning, all obstacles are modeled as spheres

X_{o}

, following the approach in [18,19]. The i-th spherical obstacle is denoted by

Ω_{i}

, with its occupied region at time t defined as

Ω_{i} (t) = \{p (t) | ∥ p (t) - o_{i} (t) ∥ \leq r_{i} + δ\} .

(1)

where

o_{i}

denotes the center of the obstacle,

r_{i}

is its radius, and

δ

represents a safety margin reserved for path planning. The complete obstacle region

X_{o}

can be defined as

X_{o} = ⋃_{i = 1}^{N_{o b s}} Ω_{i}

(2)

where

N_{o b s}

denotes the total number of the spherical obstacles. If obstacle i is a moving obstacle,

v_{{o b s}_{i}}

represents its velocity.

To describe the motion of the UUV, let

p

denote its current position in three-dimensional space. The velocity and configuration associated with position

p

are represented by

V (p)

and

C (p)

, respectively, where

V (p)

returns the velocity vector, and

C (p)

describes the UUV’s orientation and heading at that position. In this work, the UUV is modeled as a three-dimensional nonholonomic system subject to constant forward speed and no lateral slip, an assumption consistent with Dubins motion primitives in underwater environments. Accordingly, the heading direction of the UUV is determined by its velocity vector

V (p)

. The kinematic constraint of the UUV is characterized by a minimum turning radius, denoted as

r_{min}

, which defines the tightest possible curvature of its trajectory while maintaining feasible underwater motion dynamics.

To formulate the smooth path planning problem in an unknown underwater environment, it is assumed that the UUV can only perceive obstacle motion information within a limited sensing radius, denoted by

r_{scan}

. Under this constraint, the objective of path planning is to compute a sufficiently smooth and collision-free path

L

that leads the UUV from its current configuration to the target configuration

C (p_{goal})

. This problem can be mathematically expressed as the minimization of the path length subject to obstacle avoidance constraints

\underset{L \cap X_{o}}{arg min} ∥ L ∥ .

(3)

2.2. 3D Dubins Curve Determination

Smooth trajectory generation is critical for UUV path planning, especially under nonholonomic constraints. Commonly used smooth curves include Bézier and B-spline curves, which offer high-order continuity and visual smoothness. However, these curves do not inherently satisfy curvature or heading constraints, and enforcing such constraints often requires additional optimization or post-processing.

In contrast, Dubins curves are specifically constructed to model motion with constant forward velocity and a minimum turning radius, making them a natural fit for nonholonomic vehicles such as UUVs. Therefore, this work adopts 3D Dubins curves as the motion primitive to ensure dynamically feasible and computationally efficient trajectory planning.

Dubins curves are composed of a combination of circular (C) and straight-line (S) segments. In two-dimensional space, the Dubins curve has been proven to provide the shortest feasible path between two specified configurations under a minimum turning radius constraint, making it the optimal solution for smooth path planning in planar environments [20,21,22,23]. However, in three-dimensional or higher-dimensional spaces, the construction of Dubins curves becomes more complex due to the increased number of feasible path configurations and geometric constraints. To address these challenges, this study introduces an efficient numerical technique for constructing 3D Dubins curves by leveraging their geometric properties.

For notational convenience, two auxiliary functions are defined. The normalization function

n (v)

returns the unit vector in the direction of vector

v

, and the sign function

S g n (x)

returns the sign of scalar x. Given the practical significance of 3D Dubins curves in underwater path planning, the geometrical analysis in this paper focuses on the CSC (curve–straight–curve) configuration, which is commonly encountered in optimal Dubins paths [24]. Other configuration types can be derived using a similar approach and are not elaborated here for brevity.

As illustrated in Figure 1, a CSC Dubins path connects two spatial configurations,

C (p_{1})

and

C (p_{2})

, with its straight segment intersecting the initial and terminal circular arcs at points

p_{C_{1}}

and

p_{C_{2}}

, respectively. Two auxiliary lines,

L_{1}

and

L_{2}

, are constructed to pass through

p_{1}

and

p_{2}

, respectively, and they are aligned with the initial and terminal velocity directions

C_{1}

and

C_{2}

. The direction vector of the straight segment (S) is then given by

v_{s} = n (p_{C_{2}} - p_{C_{1}})

. Let L denote the line extending the straight segment. This line intersects with

L_{1}

and

L_{2}

at points

p_{L_{1}}

and

p_{L_{2}}

, respectively, where

p_{L_{1}} \neq p_{L_{2}}

. The analytical expressions for these intersection points can be derived as

\{\begin{matrix} p_{L_{1}} = p_{1} + k_{1} V (p_{1}) \\ p_{L_{2}} = p_{2} + k_{2} V (p_{2}) \end{matrix}

(4)

where

k_{1}

and

k_{2}

are two nonzero constants.

Let the centers of the initial and terminal circular segments

C_{1}

and

C_{2}

be denoted by

o_{1}

and

o_{2}

, respectively. Due to the geometric property of Dubins curves, there are always

o_{1} p_{1} ⊥ p_{L_{1}} p_{C_{1}}

and

o_{2} p_{2} ⊥ p_{L_{2}} p_{C_{2}}

. It derives that

p_{C_{1}} = p_{L_{1}} + k_{1} v_{s}

and

p_{C_{2}} = p_{L_{2}} + k_{2} v_{s}

. Thus, the direction vector

v_{s}

of the straight segment can be written as

v_{s} = n (δ p + k_{2} (V (p_{2}) + v_{s}) - k_{1} (V (p_{1}) + v_{s}))

(5)

where

δ p = p_{2} - p_{1}

denotes the displacement vector between the initial and terminal positions. Equation (5) can be further reformulated as

v_{s} = \pm (δ p + k_{2} V (p_{2}) - k_{1} V (p_{1})) / m (k_{1}, k_{2})

(6)

where

m (k_{1}, k_{2})

represents the normalization factor, defined as

m (k_{1}, k_{2}) = \sqrt{A + k_{1}^{2} + k_{2}^{2} + 2 k_{2} B_{2} - 2 k_{1} B_{1} - 2 k_{1} k_{2} B_{12}}

(7)

with the scalar quantities given by

A = {| δ p |}^{2}

,

B_{2} = V (p_{2}) \cdot δ p

,

B_{1} = V (p_{1}) \cdot δ p

and

B_{12} = V (p_{1}) \cdot V (p_{2})

. Furthermore, Equation (6) implies that the inequality

v_{s} = s (k_{2} - k_{1} \pm m (k_{1}, k_{2})) v_{s}

(8)

Obviously, Equation (8) demands that

k_{2} - k_{1} \pm m (k_{1}, k_{2}) > 0

holds for CSC Dubins curves. Hence, if

q (k_{1}, k_{2})

denotes

A + 2 k_{2} B_{2} - 2 k_{1} B_{1} + 2 k_{1} k_{2} (1 - B_{12})

, the establishment conditions of CSC curves are as presented in Table 1.

Two auxiliary unit vectors,

n_{1}

and

n_{2}

, are introduced to represent the normal directions from the initial and terminal configurations to their respective circular arc centers. These vectors are defined as

\{\begin{matrix} n_{1} = s (k_{1}) n (V (p_{1}) \times v_{s} \times V (p_{1})), \\ n_{2} = s (k_{2}) n (V (p_{2}) \times v_{s} \times V (p_{2})) . \end{matrix}

(9)

where

s (\cdot)

denotes the sign function that ensures correct orientation, and

n (\cdot)

denotes the normalization operator.

Accordingly, the circular centers of

C_{1}

and

C_{2}

can be expressed as

o_{1} = p_{1} + r_{1} n_{1}

and

o_{2} = p_{2} + r_{2} n_{2}

. Considering the spatial relations of

o_{1}

and

o_{2}

with

p_{C_{1}}

and

p_{C_{2}}

, there are

| k_{1} (V (p_{1}) + v_{s}) - r_{1} n_{1} | = r_{1}

and

| k_{2} (V (p_{2}) + v_{s}) - r_{2} n_{2} | = r_{2}

.

Furthermore, since the CSC curve is smooth at

p_{C_{1}}

and

p_{C_{2}}

, it derives that

v_{s} = s (k_{1}) n (V (p_{1}) \times v_{s} \times o_{1} p_{C_{1}})

and

v_{s} = s (k_{2}) n (V (p_{2}) \times v_{s} \times o_{2} p_{C_{2}})

. Given that

v_{s}

is orthogonal to

\vec{o_{1} p_{C_{1}}}

and

\vec{o_{2} p_{C_{2}}}

, the following conditions must be satisfied:

k_{1} v_{s} \cdot V (p_{1}) + k_{1} - r_{1} v_{s} \cdot n_{1} = 0

and

k_{2} v_{s} \cdot V (p_{2}) + k_{2} - r_{2} V_{s} \cdot n_{2} = 0

.

If

v_{s} \neq - V (p_{1})

and

v_{s} \neq - V (p_{2})

, that is to say

k_{1} \neq + \infty

and

k_{2} \neq - \infty

, there are

\{\begin{matrix} (r_{1}^{2} - k_{1}^{2}) / (k_{1}^{2} + r_{1}^{2}) = \pm (B_{1} + k_{2} B_{12} - k_{1}) / m (k_{1}, k_{2}) \\ (r_{2}^{2} - k_{2}^{2}) / (k_{2}^{2} + r_{2}^{2}) = \pm (B_{2} + k_{2} - k_{1} B_{12}) / m (k_{1}, k_{2}) \end{matrix}

(10)

where

m (k_{1}, k_{2})

is defined in the previous subsection, and

B_{1}

,

B_{2}

, and

B 12

are the projection coefficients introduced earlier. By solving the nonlinear system in (10) under the existence conditions summarized in Table 1, feasible pairs of

(k_{1}, k_{2})

can be determined. Each solution corresponds to a candidate CSC-type 3D Dubins curve. The total length of each candidate curve is obtained by summing the lengths of its component segments. The shortest among them defines the optimal Dubins path. For clarity in notation, we denote the 3D Dubins curve connecting

C (p_{i})

and

C (p_{j})

as

T_{i j}

, and its total length as

T_{i j}

.

Remark 1.

The 3D Dubins curve between

C_{1}

and

C_{2}

with given

r_{1}

and

r_{2}

is decided by

k_{1}

and

k_{2}

, specifically, by A,

B_{2}

,

B_{1}

, and

B_{12}

.

To concisely characterize the geometry of a 3D Dubins curve, we define its geometric descriptor vector as

Λ_{T} = [A, B_{1}, B_{2}, B_{12}, r_{1}, r_{2}]

.

2.3. Fast 3D Dubins Curve Length Estimation

To reduce the computational overhead of sample-based path planning, this paper proposes a fast 3D Dubins curve length estimation method based on a Backpropagation Neural Network (BPNN). As stated in Remark 1, a 3D Dubins curve is uniquely determined by its geometric character vector

Λ_{T}

. Leveraging this property, the BPNN takes

Λ_{T}

as its input and outputs an estimated curve length, enabling efficient evaluation without solving the full geometric model. The structure of the proposed BPNN is illustrated in Figure 2. It consists of 1 input layer with 6 elements, 2 hidden layers with 30 and 22 tansig neurons, and 1 output layer with a purelin neuron.

To train the proposed BPNN, a dataset comprising 10,000 3D Dubins curves was generated. Each curve connects two randomly sampled configurations, with the Euclidean distance between positions satisfying

| δ p | \in [2 (r_{1} + r_{2}), 4 (r_{1} + r_{2})]

and turning radii

r_{i} \in {1, 2, 3}

. The network was trained using

90 %

of the dataset, while the remaining

10 %

was used for testing.

To quantitatively evaluate the generalization capability of the trained model, we generated four independent test groups, each consisting of 1000 new 3D Dubins curves. The BPNN-estimated lengths were compared with the numerically computed ground truth values. As summarized in Table 2, the proposed method achieves an average relative error under

5 %

in all test groups, and it demonstrates a computational speedup of nearly

200 \times

compared to traditional numerical methods.

To further illustrate estimation accuracy, Figure 3 shows a detailed curve-wise comparison of computed and estimated lengths for Group 2. The estimated curve follows the ground truth closely across the sampled experiments, confirming the method’s reliability.

Additionally, Figure 4 presents full comparison plots across all four test groups. Each subplot displays the estimated and computed Dubins lengths over 1000 samples. The BPNN consistently captures the global distribution and local variation of Dubins curve lengths, validating its generalization ability.

Across these four experiments, the average relative errors were 4.77%, 4.79%, 4.47%, and 4.02%, respectively. Moreover, the maximum absolute errors observed were 13.15%, 11.79%, 13.11%, and 6.61%—all within acceptable bounds. These results demonstrate that the proposed method offers an excellent trade-off between estimation speed and accuracy. The estimated length of the Dubins path between

C (p_{i})

and

C (p_{j})

is denoted as

{\hat{T}}_{i, j}

throughout this paper, and it is used in the subsequent planning procedures.

3. Efficient Smooth Path Planning

3.1. Overview

The RRT* algorithm is an incremental, sampling-based path planning method, as presented in Algorithm 1, and it guarantees asymptotic global optimality. Inspired by RRT*-based algorithms, this work addresses the problem of smooth path planning, particularly in unknown environments. To ensure the path planning and replanning performance in unknown environments, it takes three aspects of improvements as follows.

From the perspective of implementation, the proposed smooth path planning approach is designed to be responsive to changes in the feasibility of the current path, as illustrated in Figure 5. To improve the efficiency of both path planning and replanning, a hybrid mechanism combining path reuse, rough path planning, and path reshaping is employed. During the path reuse phase, the feasible segment of the previously planned path is retained and utilized to accelerate the replanning process. Subsequently, the precise computation of Dubins curves is deferred, and an enhanced RRT* algorithm is adopted to rapidly generate a rough path, thereby significantly reducing computational overhead. Finally, smooth and kinematically feasible maneuvers are produced by reshaping the rough trajectory based on the motion constraints of the UUV.

Algorithm 1: RRT* Algorithm

1:: $V \leftarrow p (0)$ , $\{E\} \leftarrow \emptyset$
2:: for $i = 1, 2, . . ., n$ do
3:: randomly sample in $X_{f}$ and obtain $p_{r a n d}$
4:: $p_{n s t} \leftarrow \underset{p \in G}{arg min} ∥ p - p_{r a n d} ∥$
5:: steer from $p_{n s t}$ towards $p_{r a n d}$ and obtain $p_{n e w}$
6:: if $E (p_{n s t}, p_{n e w})$ is collision-free then
7:: $\{p_{n e a r}\} \leftarrow p \in G$ and $∥ p - p_{n e w} ∥$
8:: $G \leftarrow p_{n e w}$ and $\{E\} \leftarrow E (p_{n s t}, p_{n e w})$
9:: for $p_{n e a r} \in \{p_{n e a r}\}$ and $E (p_{n e a r}, p_{n e w})$ is collision-free do
10:: if $t (p_{n e a r}) + ∥ E (p_{n e a r}, p_{n e w}) ∥ < t (p_{n e w})$ then
11:: set $p_{n e a r}$ as the parent node of $p_{n e w}$
12:: delete $E (p_{n s t}, p_{n e w})$ and $\{E\} \leftarrow E (p_{n e a r}, p_{n e w})$
13:: end if
14:: end for
15:: for $p_{n e a r} \in \{p_{n e a r}\}$ and $E (p_{n e w}, p_{n e a r})$ is collision-free do
16:: if $t (p_{n e w}) + ∥ E (p_{n e w}, p_{n e a r}) ∥ < t (p_{n e a r})$ then
17:: $G \leftarrow p_{n e w}$ and $\{E\} \leftarrow E (p_{n s t}, p_{n e w})$
18:: end if
19:: end for
20:: end if
21:: end for

The improved RRT* algorithm still utilizes a random tree structure as the foundation for path generation. However, it further incorporates the concept outlined in Remark 2.

Remark 2.

Each terminal node maintains a unique path from the root node.

Motivated by this property, terminal nodes are used to represent the growth state of the tree in the improved RRT* algorithm. These nodes are classified based on their roles in the planning process and stored in the following three sets:

V_{n e w}

,

V_{o p e n}

, and

V_{c l o s e}

. Specifically,

V_{n e w}

contains nodes whose expansion is impeded by obstacles,

V_{o p e n}

includes nodes that are eligible for further extension, and

V_{c l o s e}

holds nodes that have feasible paths to the goal configuration. Accordingly, the improved RRT* algorithm guides the sampling, expansion, and exploration processes using

V_{o p e n}

and

V_{n e w}

, while maintaining feasible path candidates in

V_{c l o s e}

. From an algorithmic perspective, the processes of sampling, expansion, and exploration are also enhanced. To improve the efficiency of generating feasible trajectories in cluttered or partially known underwater environments, collision-checking information is embedded into the pseudorandom sampling strategy. Additionally, a terminal-node backtracking mechanism is introduced to preserve tree connectivity in the presence of moving or newly discovered obstacles during expansion. The exploration process is further accelerated through a goal-biased strategy, activated under specific maturity conditions.

For clarity and ease of reference, the fundamental variables and their corresponding definitions are summarized in Table 3.

3.2. Collision Detection

Collision detection is a fundamental procedure in both path planning and replanning. It verifies the feasibility of candidate maneuvers and determines whether path re-computation is necessary. To enhance the efficiency of the planning process, this paper introduces the concept of maximal collision, which provides a quantitative assessment of collision detection results. For a spherical obstacle

Ω_{i}

, a collision with the UUV occurs if the following condition is satisfied

\underset{p (t) \in L}{arg min} ∥ p (t) - o_{i} (t) ∥ < r_{i}

(11)

Similarly, in an environment containing multiple obstacles, the maneuver

L

is considered infeasible if condition (11) is satisfied for any detected obstacle. This leads to the introduction of the maximal collision concept, which identifies the obstacle that first causes a collision as the maximal collision obstacle, denoted by

i_{m a x}

. Furthermore, the maximal collision time is defined as the moment when the distance between the UUV and the center of the maximal collision obstacle reaches its minimum. Let

t_{m a x}

denote the maximal collision time. Then,

t_{m a x} = \underset{i \leq N_{o b s}}{arg min} ∥ p (t) - o_{i_{m a x}} (t) ∥

(12)

3.3. Path Reuse

Once present path is evaluated to be infeasible, its feasible part can be obtained and denoted by

L_{f}

, which starts from the present configuration of the UUV and ends at the last reachable configuration before the maximal collision occurs.

L_{f}

is reused as the initial part of the random tree and the other part of

L

is discarded.

To reuse

L_{f}

in the improved RRT* algorithm, the nodes of

L_{f}

are recorded in

V_{n e w}

and the collision detection result of

L_{f}

are assigned to these nodes as well. Thus, new branches will grow from multiple nodes to avoid the obstructions. The smooth connections between the reachable nodes are saved as well, and the present configuration of the UUV is set as the root node of

G

which can be written as

C (p_{0})

.

3.4. Improved RRT* Algorithm

3.4.1. Sampling in Obstructive Environment

Sample nodes are generated corresponding to the nodes in

V_{n e w}

, as illustrated in Figure 6.

For each node

C (p_{j}) \in V_{n e w}

, a new sample node is generated along a direction

M (p_{j})

, which is pseudorandomly determined based on two unit vectors

H (p_{j})

and

V (p_{j})

. The vector

H (p_{j})

indicates the forward direction towards the destination and is defined as

H (p_{j}) = n (o_{i_{m a x}} (t_{m a x c}) - p_{j})

.

Meanwhile,

V (p_{j})

represents the direction away from the original path

L

to help avoid collisions, and it is calculated as

V (p_{j}) = n (H (p_{j}) \times V_{rand})

, where

V_{rand} \in R^{3}

is a randomly generated unit vector.

Then, the sampling direction

M (p_{j})

is determined by a pseudorandom linear combination of

H (p_{j})

and

V (p_{j})

as

M (p_{j}) = n (sin θ H (p_{j}) + cos θ V (p_{j})),

(13)

where

θ

denotes the clockwise angle from

V (p_{j})

to

M (p_{j})

and is generated by

θ = 2 π (w^{α} - 0.5)

with

w \in [0, 1]

being a uniform random variable and

α \in [0, 1]

a shaping parameter. A random steering distance

Δ_{new}

is then sampled from the interval

[ϵ, β ϵ]

, where

ϵ = 4 r_{\min}

and

β \geq 1

. Accordingly, the new sample node

C (p_{new})

is generated at the position

p_{j} + Δ_{new} \cdot M (p_{j})

.

To ensure both the validity and efficiency of the sampling process, the spatio-temporal feasibility of the connection between the newly generated node

C (p new)

and its nearest existing node

C (p nst) \in G

is evaluated using a linear uniform motion model to approximate the UUV trajectory.

The candidate path from

C (p nst)

to

C (p new)

is denoted as

D nst, new

, and its traversal cost is estimated by

\hat{T} (p nst, p new)

. The configuration vector

V (p new)

is then updated according to the following rule

V (p_{n e w}) = n (n (p_{n e w} - p_{n s t}) + n (p_{g o a l} - p_{n e w}))

(14)

If the path

D_{nst, new}

is collision-free, the parent node of

C (p_{new})

is set to

C (p_{nst})

, i.e.,

F (p_{new}) = C (p_{nst})

. Otherwise, the node

C (p_{new})

is discarded, and the sampling process is repeated.

In addition, the inefficiency of sampling in highly obstructed environments is also addressed. This issue arises when numerous sampling attempts fail to produce any feasible node, for instance, when a node

C (p_{j})

is located too close to surrounding obstacles. To mitigate this problem, a maximum sampling attempt threshold

η

is introduced. If more than

η

consecutive invalid sample nodes are generated based on a specific node in

V_{new}

, that node will be replaced by its parent node within

V_{new}

to guide further sampling in a more promising direction.

The detailed procedure of the sampling strategy is summarized in Algorithm 2.

Algorithm 2: Sampling procedure

1:: $i t e r \leftarrow 0$
2:: while $i t e r \leq η$ do
3:: $i t e r \leftarrow i t e r + 1$
4:: calculate $M (p_{j})$ and $Δ_{n e w}$ according to $C (p_{j})$
5:: $p_{n e w} \leftarrow p_{j} + Δ_{n e w} M (p_{j})$
6:: $p_{n s t} \leftarrow \underset{p \in G}{arg min} ∥ p - p_{n e w} ∥$
7:: if $D_{n s t, n e w}$ is collision-free then
8:: remove $C (p_{j})$ from $V_{o p e n}$
9:: break
10:: end if
11:: end while
12:: if $C (p_{n e w})$ is valid then
13:: remove $C (p_{j})$ from $V_{o p e n}$ and $G$
14:: add $F (p_{j})$ to $V_{o p e n}$
15:: end if

3.4.2. Expanding via Updating and Optimizing

The expanding procedure consists of two key components as follows: the updating subprocedure and the optimizing subprocedure, as illustrated in Figure 6b and Figure 6c, respectively.

In the updating subprocedure, the newly generated node

C (p_{new})

is connected to a node in

G

that minimizes the total cost to reach it. A set of nearby, reachable nodes in

G

is identified and recorded as

{C_{near}}

, where the Euclidean distance satisfies

∥ D_{near, new} ∥ \in [ϵ, γ ϵ]

. The parameter

γ

is defined as follows:

γ = max [1 + {(ζ (log (N_{G})) / N_{G})}^{1 / ε}, β],

(15)

where

N_{G}

denotes the number of nodes in

G

,

N_{obs}

is the number of detected obstacles, and

ζ = ∥ D_{new, goal} ∥^{3} / \sum_{i = 1}^{N_{obs}} r_{i}^{3}

. Among the nearby nodes,

C (p_{k}) \in {C_{near}}

is selected such that the total estimated cost

t (p_{new})

is minimized

t (p_{n e w}) = t (p_{k}) + {\hat{T}}_{k, n e w}

(16)

where

{\hat{T}}_{k, new}

is the estimated transfer cost from

C (p_{k})

to

C (p_{new})

, and

V (p_{new})

is configured based on Equation (14).

In conventional RRT*, the cost between two nodes is typically evaluated using Euclidean distance. However, such a metric fails to capture the actual traversal cost for nonholonomic vehicles like UUVs. To address this, the proposed method integrates 3D Dubins curve geometry into the tree-building process by replacing the Euclidean cost with an estimated Dubins curve length, denoted as

{\hat{T}}_{k, new}

.

Specifically, during the updating subprocedure, the parent node

C (p_{k})

of the newly generated node

C (p_{new})

is selected by minimizing the accumulated Dubins-based cost as follows:

C (p_{k}) = arg min_{p_{i} \in \{C_{near}\}} t (p_{i}) + {\hat{T}}_{i, new},

(17)

where

t (p_{i})

denotes the cumulative cost from the root to node

p_{i}

, and

{\hat{T}}_{i, new}

is the length of the Dubins curve estimated by the trained BPNN.

This formulation explicitly binds the Dubins curve geometry to the RRT* optimization process. The estimation

{\hat{T}}_{i, new}

captures both positional and directional constraints of the UUV, ensuring that path feasibility is evaluated under realistic kinematic assumptions. As a result, the tree growth is not only collision-aware but also curvature-constrained, leading to smoother and dynamically feasible trajectories.

Finally, the node

C (p_{new})

is added to

V_{new}

, and the node

C (p_{k})

is removed from

V_{open}

or

V_{new}

if it exists in either set.

In the optimizing subprocedure, the feasibility of the graph

G

is maintained by re-evaluating the connectivity of nearby nodes, as illustrated in Figure 6c. For each node

C (p_{l}) \in {C_{near}} ∖ F (p_{new})

, if the rough path

R_{new, l}

is both collision-free and results in a lower traversal cost compared to

t (p_{l})

, then the parent of

C (p_{l})

is updated to

C (p_{new})

. Consequently, the cost is updated to

t (p_{l}) = t (p_{new}) + {\hat{T}}_{new, l}

, the node

C (p_{l})

is added to the re-evaluation set

V_{re}

, and its new parent

F (p_{l})

is stored in the reference set

V_{ref}

.

Changing the parent of a node may influence the spatio-temporal feasibility of its child nodes and may result in the derivation of new rough paths. For a rough path

R_{r, e}

comprising sequential nodes with updated parent relations, from

C_{l_{1}}

to

C_{l_{n}}

, new rough paths may emerge in three distinct types, as illustrated in Figure 7.

The newly formed rough paths after parent update can be categorized into three types, illustrated in Figure 7, as follows:

Type I (blue solid lines): This path segment starts from the root node

C (p_{0})

and ends at a parent-updated node

C_{l_{1}} \in V_{ref}

. It remains unchanged from the original path before optimization. The separation of this segment is caused by a change in the parent of

C_{l_{1}}

, resulting in a disconnection between the former and latter parts of the path. Since it is a retained portion of a previously feasible path, its validity is preserved without requiring re-evaluation.

Type II (gray + green dashed lines): This path starts from the root node

C (p_{0})

, passes through a newly added node

C (p_{j})

and a parent-updated node

C_{l_{1}}

, and ends at the former parent of another updated node

C_{l_{2}}

. The first segment (to

C (p_{j})

) is shown in gray, and the latter segment (original path portion) in green. This path arises when two or more nodes along the original path undergo parent updates, and a new path is formed by combining the prefix path to

C_{l_{1}}

with the suffix path from the parent of

C_{l_{2}}

. Since the node costs are modified during this process, rechecking the feasibility of this path is necessary.

Type III (gray + orange dashed lines): This path starts at

C (p_{0})

, passes through a newly inserted node and a parent-updated node

C_{l_{n}}

, and finally reaches the original goal

C (p_{goal})

. The first segment (up to

C_{l_{n}}

) is gray, and the last part (to the goal) is shown in orange. This type arises when the suffix of a path containing a parent-updated node is combined with the original goal-reaching segment. As the updated node costs differ from the original, feasibility verification is also required.

As highlighted in Remark 2, each new rough path has a one-to-one correspondence with an updated node in

V_{ref}

. The validity of these new rough paths is verified in the subsequent step.

Candidate new paths are obtained by backtracking from nodes in the union set

V_{new} \cup V_{open} \cup V_{close} \cup V_{ref}

toward the initial node

C (p_{0})

. To facilitate this process, three empty sets—

V_{new}^{*}

,

V_{open}^{*}

, and

V_{close}^{*}

—are initialized. These sets serve as the updated counterparts of

V_{new}

,

V_{open}

, and

V_{close}

, respectively, and they are used to store the new terminal nodes of

G

after the expansion procedure.

Paths that do not contain any nodes in

V_{re}

(i.e., paths unaffected by parent updates) are skipped, and their terminal nodes are directly transferred to the corresponding new sets. Only the paths that include parent-updated nodes in

V_{re}

are subject to further evaluation. Among these, collision-free paths are retained, and their terminal nodes are stored in the appropriate new sets accordingly. For infeasible paths, the unreachable segments are discarded, and the last valid node along each path is marked as the new terminal node and saved in

V_{new}^{*}

.

Upon completion of this backtracking process, the spatio-temporal feasibility of

G

is guaranteed, and the updated terminal nodes are consistently tracked and maintained for subsequent planning stages. A pseudo code of the expanding procedure is presented in the Algorithm 3.

The BPNN model introduced in Section 2 is trained to efficiently estimate the Dubins curve length

{\hat{T}}_{i, j}

between two 3D configurations. During RRT* tree expansion, these estimations are used as substitutes for analytically computed Dubins lengths, which are computationally expensive to evaluate at every iteration.

Algorithm 3: Expanding procedure

1:: determine $\{C_{n e a r}\}$
2:: $C (p_{k}) \leftarrow \underset{p_{i} \in \{C_{n e a r}\}}{arg min} t (p) + {\hat{T}}_{i, n e w}$
3:: $F (p_{n e w}) \leftarrow C (p_{k})$ , remove $C (p_{k})$ from $V_{o p e n} ⋃ V_{n e w}$
4:: for $C (p_{l}) \in \{C_{n e a r}\} / C (p_{k})$ do
5:: if $D_{n e w, l}$ is collision-free and $R_{n e w, l}$ is shorter then
6:: store $F (p_{l})$ in $V_{r e f}$ , $F (p_{l}) \leftarrow C (p_{n e w})$
7:: $t (p_{l}) \leftarrow t (p_{n e w}) + \hat{T} (n e w, l)$
8:: end if
9:: end for
10:: for $C (p_{l_{m}}) \in V_{o p e n} \cup V_{n e w} \cup V_{c l o s e} \cup V_{r e f}$ do
11:: backtracking from $C (p_{l_{m}})$ towards $C (p_{0})$
12:: if $R_{p, l_{m}}$ has no node in $V_{r e}$ then
13:: if $R_{p, l_{m}}$ is collision-free then
14:: move $C (p_{l_{m}})$ to $V_{o p e n}^{*}$
15:: else
16:: move the last reachable node to $V_{n e w}^{*}$
17:: end if
18:: else
19:: if $C (p_{l_{m}}) \in V_{o p e n} \cup V_{n e w} \cup V_{c l o s e}$ then
20:: store $C (p_{l_{m}})$ correspondingly to the new set
21:: else
22:: store $C (p_{l_{m}})$ to $V_{o p e n}^{*}$
23:: end if
24:: end if
25:: end for

By integrating the BPNN estimator into the cost function (Equation (17)), the parent selection process becomes both curvature-aware and computationally efficient. This significantly reduces the time complexity of each expansion step while maintaining path smoothness and feasibility. Nodes that would otherwise be chosen based on Euclidean distance are now selected based on a learned approximation of the true traversal cost under kinematic constraints.

Furthermore, the BPNN-estimated lengths guide the rewiring process and cost updates throughout the graph. As the BPNN captures geometric characteristics such as initial/final direction and curvature limits, it ensures that suboptimal parents (with low Euclidean cost but high curvature) are avoided. This results in smoother paths with fewer unnecessary turns or abrupt heading changes.

3.4.3. Exploring Towards Destination

The exploring procedure is designed to be goal-biased, and it operates based on the set

V_{open}^{*}

to derive a sufficient number of candidate rough paths, as illustrated in Figure 6d. For each node

C (p_{l}) \in V {open}^{*}

, the feasibility of the path

R_{l, goal}

is evaluated.

If

R_{l, goal}

is collision-free, it is considered a potentially feasible maneuver, and the corresponding node

C (p_{l})

is moved to

V_{close}^{*}

. Otherwise, if the path is found to be infeasible, further path planning for collision avoidance is required, and the node is instead moved to

V_{new}^{*}

. To ensure a balance between path diversity (sufficiency) and computational tractability (efficiency), the exploring procedure incorporates two mature criteria to control the growth of the graph

G

. Let

N_{V_{close}}

denote the number of nodes in

V_{close}^{*}

. If

N_{V_{c l o s e}}

denotes the number of the nodes in

V_{c l o s e}^{*}

, the mature criteria are expressed as

(1): $N_{V_{close}}$ exceeds a predefined positive threshold $κ$ ;
(2): Both $V_{open}^{*}$ and $V_{new}^{*}$ are empty.

When either criterion is satisfied,

G

is considered to be sufficiently expanded, and the reshaping procedure is subsequently initiated. The pseudocode for the exploring procedure is presented in Algorithm 4.

Algorithm 4: Exploring procedure

1:: for $C (p_{l}) \in V_{o p e n}^{*}$ do
2:: if $N_{V_{c l o s e}} \leq κ$ or $V_{o p e n}^{*} \cup V_{n e w}^{*} \neq \emptyset$ then
3:: if $R_{l, g o a l}$ is collision-free then
4:: remove $C (p_{l})$ from $V_{o p e n}^{*}$ to $V_{c l o s e}^{*}$
5:: else
6:: remove $C (p_{l})$ from $V_{o p e n}^{*}$ to $V_{n e w}^{*}$
7:: end if
8:: end if
9:: end for

3.5. Path Reshape

Rather than reshaping each candidate rough path individually, the proposed method begins by comparing their costs with the current shortest smoothed path. Let

t_{min}

denote the cost of the current shortest smooth path. For the shortest candidate rough path

R_{m, goal}

, if its cost is less than

t_{min}

, it is considered a promising candidate for further optimization.

The component nodes of

R_{m, goal}

, which are generated by the improved RRT* algorithm, are reconfigured and evaluated sequentially. For each node

C (p_{i}) \in R_{m, goal}

, the configuration

V (p_{i})

is redefined according to Equation (14) by substituting

p_{goal}

with the position of its child node in

R_{m, goal}

.

Subsequently, a 3D Dubins curve is determined from the parent node

F (p_{i})

to the current node

C (p_{i})

, and its feasibility is assessed. Let

C (p_{j}) = F (p_{i})

. If the Dubins curve

T_{j, i}

is collision-free, the cost is updated as

t (p_{i}) = t (p_{j}) + T_{j, i},

(18)

and the curve is stored accordingly.

If a collision is detected, the reshaping process for

R_{m, goal}

is terminated, and the infeasible portion of the path—from

C (p_{i})

to the terminal node

C (p_{f})

—is discarded. The collision detection result is also stored in correspondence with

F (p_{i})

for further handling.

Finally, if the reshaped path

L_{m, goal}

is entirely collision-free and its cost is less than

t_{min}

, then

L_{m, goal}

is saved as the new optimal solution, and

t_{min}

is updated accordingly. In addition, after processing all the candidate rough paths, the shortest smooth result, if it exists, is accepted as

L

. If no feasible smooth path is determined, the improved RRT* algorithm will be repeated.

4. Simulation and Discussion

To prove the validity and efficiency of the smooth path planning method, comparative simulation results with state-of-the-art algorithms—3D RRT*, Informed RRT* [13], Stable Sparse RRT* (SST*) [25], Dynamic RRT* [26], and the asymptotically optimal single-query sampling-based replanning algorithm RRTX [27]—in unknown scenarios with static and moving obstacles are provided in this section.

For simplicity of expression, the proposed smooth path planning method is referred to as DRRT*. Its parameters are configured as presented in Table 4.

To ensure fairness, all algorithms employ the same collision detection function and share some common algorithmic parameters. In the comparative studies, the kinematic and dynamic constraints of the UUV are ignored for the 3D RRT*, Informed RRT*, and Dynamic RRT* algorithms. For these methods, 100 additional sample nodes are appended after each planning failure to ensure that a complete resultant path can be generated. The RRTX algorithm also ignores the prescribed constraints. However, its number of sample nodes is fixed at 800 to increase the probability of finding a goal-reachable path. In contrast, the SST* algorithm shares the same kinematic model as the proposed DRRT* method. Its initial number of sample nodes is set to 100. To complete the path planning task, Dubins curves are used to connect the random tree to the terminal configuration. Each algorithm is executed 50 times on a laptop equipped with an Intel i7-9750H CPU, 16GB RAM, and running MATLAB 2019b.

The parameters of the UUV are set as

r_{m i n} = 1 m

and

r_{s c a n} = 7 m

, and the prescribed configurations are

\{\begin{matrix} p = {[0 m, 0 m, 0 m]}^{T}, \\ v = {[1 m / s, 0 m / s, 0 m / s]}^{T}, \end{matrix}

(19)

and

\{\begin{matrix} p_{g o a l} = {[20 m, 0 m, 0 m]}^{T}, \\ V (p_{g o a l}) = {[1 m / s, 0 m / s, 0 m / s]}^{T} . \end{matrix}

(20)

4.1. Scenario 1: Unknown Environment with Static Obstacles

In this scenario, 22 unknown obstacles form a concave obstructive area. The simulation results of the compared algorithms are shown in Figure 8. Figure 8f illustrates the path planning result of the proposed DRRT*, demonstrating its capability to effectively avoid the concave obstruction and generate a feasible, smooth maneuver to reach the goal configuration.

Compared to the other algorithms shown in Figure 8, the path generated by DRRT* exhibits improved smoothness. The average curvature computed over 10 randomized trials is

0.35

,

0.23

,

0.22

,

0.18

, and

0.20

m^{- 1}

for 3D RRT*, informed RRT*, Dynamic RRT*, SST*, and RRTX, respectively, while DRRT* yields the lowest average curvature of

0.16

m^{- 1}

.

The statistical performance of DRRT* and the comparative methods in this scenario is further summarized in Figure 9. Figure 9a presents the path length statistics, where the average lengths of the resultant paths generated by 3D RRT*, informed RRT*, Dynamic RRT*, SST*, and RRTX are

43.60 m

,

57.88 m

,

49.68 m

,

46.48 m

, and

40.54 m

, respectively. In contrast, the average path length obtained by DRRT* is only

26.71 m

. Meanwhile, the computational time consumption of these algorithms is shown in Figure 9b. The average time taken by 3D RRT*, informed RRT*, Dynamic RRT*, SST*, and RRTX to generate collision-free paths is

5.54 s

,

2.15 s

,

3.83 s

,

145.03 s

, and

4.21 s

, respectively. In comparison, DRRT* achieves superior efficiency, requiring only

1.54 s

on average.

4.2. Scenario 2: An Unknown Environment with Both Static and Moving Obstacles

Scenario 2 represents an unknown environment populated with 16 static obstacles and 4 moving obstacles. These obstacles are randomly distributed, and the moving obstacles travel at a constant speed of

1 m / s

.

The simulation results of the path planning algorithms are illustrated in Figure 10. Specifically, Figure 10f shows the path generated by the proposed DRRT* algorithm. Compared to the results of the other algorithms shown in Figure 10, DRRT* produces a smooth and collision-free trajectory that successfully reaches the goal configuration.

A statistical comparison of the algorithms’ performance in this scenario is provided in Figure 11. Figure 11a presents the lengths of the resulting paths. The average path lengths for the 3D RRT*, informed RRT*, Dynamic RRT*, SST*, and RRTX algorithms are

42.2141 m

,

54.6791 m

,

40.2476 m

,

47.5439 m

, and

37.0687 m

, respectively. In contrast, DRRT* achieves a significantly shorter average path length of

23.2824 m

.

The computational cost of each algorithm is compared in Figure 11b. On average, the 3D RRT*, informed RRT*, Dynamic RRT*, SST*, and RRTX algorithms require

3.7574 s

,

2.0170 s

,

3.1541 s

,

81.6918 s

, and

4.0694 s

, respectively, to generate a complete collision-free path. In comparison, DRRT* demonstrates superior efficiency, completing the planning task in an average of only

1.3029 s

.

4.3. Scenario 3: Evaluating Robustness in a High-Density Obstacle Environment

To rigorously evaluate the effectiveness and robustness of the proposed DRRT* algorithm under more demanding environmental conditions, Scenario 3 introduces a significantly more complex test case. This scenario features an expanded workspace and a substantially increased density of obstacles compared to Scenarios 1 and 2, specifically designed to assess the algorithm’s performance in settings representative of challenging real-world navigation tasks.

Within this environment, the spatial dimensions are increased, and both static and moving obstacles are densely distributed to create a highly constrained planning problem with active dynamic hazards. The obstacle configuration builds upon the principles established in Scenario 1, but incorporates additional dynamic obstacles exhibiting complex converging and diverging movement patterns to further test the algorithm’s ability to handle intricate interactions. We also include traditional methods such as APF and modified RRT in this scenario to provide a broader baseline reference, especially under highly constrained and dynamic conditions.

The simulation results for all tested algorithms are presented in Figure 12. As Figure 12h illustrates, the DRRT* algorithm successfully computes a smooth and collision-free trajectory, effectively navigating the dense static obstacles and dynamically avoiding the moving hazards.

Statistical performance metrics for this scenario are summarized in Figure 13 and Table 5. Figure 13a shows that DRRT* achieves the shortest average path length of

26.94 m

, while also demonstrating the lowest average computation time of

1.42 s

(Figure 13b). Table 5 further confirms DRRT’s superior success rate and overall efficiency when compared to traditional APF and modified RRT-based planners.

These results demonstrate that DRRT* maintains its significant performance advantages even in moderately scaled environments characterized by high obstacle density and complex dynamic interactions, highlighting its enhanced robustness. However, the current framework’s performance boundaries are noted: scenarios involving extremely large−scale underwater domains (e.g.,

> 1000 m

range) or exceptionally high numbers of obstacles (e.g.,

> 200

) may challenge its efficiency due to increased memory requirements and sampling complexity, potentially impacting runtime and success rates. Exploring extensions to handle such extreme scales will be the focus of future work.

4.4. Discussion on Scalability and Limitations

Although the proposed DRRT* algorithm demonstrates significant performance improvements in the tested scenarios, it is important to acknowledge the current limitations regarding environmental complexity. The simulation environments primarily involve moderate obstacle densities and confined spatial regions. As obstacle density increases or the environment size expands, several challenges may arise, including increased sampling requirements, higher memory consumption, and longer convergence times.

However, due to the modular structure of the proposed method—including fast Dubins estimation, goal−biased exploration, and partial path reuse—it is inherently scalable. In larger−scale environments, planning performance can be maintained by increasing the sampling budget or employing adaptive pruning strategies. These extensions, along with quantitative evaluations in dense and cluttered environments (e.g.,

200 +

obstacles over

1000 m

scale), are left as future work.

5. Conclusions

In this paper, we addressed the path planning problem for a nonholonomic UUV operating in unknown 3D environments with obstacles. An efficient and smooth path planning approach was proposed by integrating 3D Dubins curves with an enhanced version of the RRT* algorithm. We analyzed the geometric characteristics of 3D Dubins curves and introduced a rapid curve length estimation method based on a backpropagation (BP) neural network. To improve the efficiency of the planning algorithm, several modifications were made to the original RRT*, including pseudorandom sampling, terminal−node backtracking−based extension, and goal−biased exploration. These enhancements significantly reduce computational overhead and improve the algorithm’s collision avoidance capability. Comprehensive simulation experiments validated the effectiveness and feasibility of the proposed method in unknown environments. Future work will focus on integrating this smooth path planning framework with advanced perception systems and robust controllers to further enhance the mobility and adaptability of the UUVs in complex, unstructured outdoor scenarios.

Author Contributions

Methodology, F.P. and P.C.; Software, B.C.; Validation, B.C.; Data curation, P.C., W.Y., and S.Z.; Writing—original draft, F.P.; Writing—review and editing, B.C., W.Y., and S.Z.; Supervision, W.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported, in part, by the Key Research and Development Program of Shaanxi under Grant 2022ZDLGY03-05, and the Shaanxi Innovative Talent Pandeng Program for Young Science and Technology Rising Star Project 2017KJXX-38.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to data management requirements of the funded project.

Conflicts of Interest

Author Peng Cui was employed by the company SAIC Motor R&D Innovation Headquarters. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interes.

References

Wu, B.; Yan, X.; Wang, Y.; Guedes Soares, C. An evidential reasoning-based CREAM to human reliability analysis in maritime accident process. Risk Anal. 2017, 37, 1936–1957. [Google Scholar] [CrossRef]
Chen, X.; Wei, C.; Xin, Z.; Zhao, J.; Xian, J. Ship detection under low-visibility weather interference via an ensemble generative adversarial network. J. Mar. Sci. Eng. 2023, 11, 2065. [Google Scholar] [CrossRef]
Wang, H.; Gao, W.; Wang, Z.; Zhang, K.; Ren, J.; Deng, L.; He, S. Research on Obstacle Avoidance Planning for UUV Based on A3C Algorithm. J. Mar. Sci. Eng. 2024, 12, 142–149. [Google Scholar] [CrossRef]
Zhang, T.; Li, Q.; Zhang, C.; Liang, H.; Li, P.; Wang, T.; Li, S.; Zhu, Y.; Wu, C. Current trends in the development of intelligent unmanned autonomous systems. Front. Inf. Technol. Electron. Eng. 2017, 18, 68–85. [Google Scholar] [CrossRef]
Cui, R.; Yang, C.; Li, Y.; Sharma, S. Adaptive Neural Network Control of AUVs With Control Input Nonlinearities Using Reinforcement Learning. IEEE Trans. Syst. Man Cybern. Syst. 2017, 47, 1019–1029. [Google Scholar] [CrossRef]
Zhang, X.; Wang, Z.; Chen, H.; Ding, H. A Trajectory Tracking and Local Path Planning Control Strategy for Unmanned Underwater Vehicles. J. Mar. Sci. Eng. 2023, 11, 2230. [Google Scholar] [CrossRef]
Mac, T.T.; Copot, C.; Tran, D.T.; De Keyser, R. Heuristic approaches in robot path planning: A survey. Robot. Auton. Syst. 2016, 86, 13–28. [Google Scholar] [CrossRef]
Luis Sanchez-Lopez, J.; Wang, M.; Olivares-Mendez, M.A.; Molina, M.; Voos, H. A Real-Time 3D Path Planning Solution for Collision-Free Navigation of Multirotor Aerial Robots in Dynamic Environments. J. Intell. Robot. Syst. 2019, 93, 33–53. [Google Scholar] [CrossRef]
Mobadersany, P.; Khanmohammadi, S.; Ghaemi, S. A fuzzy multi-stage path-planning method for a robot in a dynamic environment with unknown moving obstacles. Robotica 2015, 33, 1869–1885. [Google Scholar] [CrossRef]
Liu, S.; Watterson, M.; Mohta, K.; Sun, K.; Bhattacharya, S.; Taylor, C.J.; Kumar, V. Planning Dynamically Feasible Trajectories for Quadrotors Using Safe Flight Corridors in 3-D Complex Environments. IEEE Robot. Autom. Lett. 2017, 2, 1688–1695. [Google Scholar] [CrossRef]
Karaman, S.; Frazzoli, E. Sampling-based algorithms for optimal motion planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
Webb, D.J.; van den Berg, J. Kinodynamic RRT*: Asymptotically optimal motion planning for robots with linear dynamics. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation (ICRA), Karlsruhe, Germany, 6–10 May 2013; pp. 5054–5061. [Google Scholar]
Gammell, J.D.; Srinivasa, S.S.; Barfoot, T.D. Informed RRT*: Optimal sampling-based path planning focused via direct sampling of an admissible ellipsoidal heuristic. In Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Chicago, IL, USA, 14–18 September 2014; pp. 2997–3004. [Google Scholar]
Jian, X.; Zou, T.; Vardy, A.; Bose, N. A Hybrid Path Planning Strategy of Autonomous Underwater Vehicles. In Proceedings of the 2020 IEEE/OES Autonomous Underwater Vehicles Symposium (AUV), St. Johns, NL, Canada, 30 September 2020–2 October 2020; pp. 1–6. [Google Scholar]
Lee, H.; Kim, H.; Kim, H.J. Planning and Control for Collision-Free Cooperative Aerial Transportation. IEEE Trans. Autom. Sci. Eng. 2018, 15, 189–201. [Google Scholar] [CrossRef]
Qiao, W.; Fang, Z.; Si, B. Sample-based Frontier Detection for Autonomous Robot Exploration. In Proceedings of the 2018 IEEE International Conference on Robotics and Biomimetics (ROBIO), Kuala Lumpur, Malaysia, 12–15 December 2018; pp. 1165–1170. [Google Scholar]
Zhang, Z.; Wu, D.; Gu, J.; Li, F. A Path-Planning Strategy for Unmanned Surface Vehicles Based on an Adaptive Hybrid Dynamic Stepsize and Target Attractive Force-RRT Algorithm. J. Mar. Sci. Eng. 2019, 7, 132. [Google Scholar] [CrossRef]
Behandish, M.; Ilieş, H.T. Analytic methods for geometric modeling via spherical decomposition. Comput. Aided Des. 2016, 70, 100–115. [Google Scholar] [CrossRef]
Setty, S.; Mudenagudi, U. Example-based 3D inpainting of point clouds using metric tensor and Christoffel symbols. Mach. Vis. Appl. 2018, 29, 329–343. [Google Scholar] [CrossRef]
Kaya, C.Y. Markov-Dubins path via optimal control theory. Comput. Optim. Appl. 2017, 68, 719–747. [Google Scholar] [CrossRef]
Furtuna, A.A.; Balkcom, D.J. Generalizing Dubins Curves: Minimum-time Sequences of Body-fixed Rotations and Translations in the Plane. Int. J. Robot. Res. 2010, 29, 703–726. [Google Scholar] [CrossRef]
Tokekar, P.; Karnad, N.; Isler, V. Energy-optimal trajectory planning for car-like robots. Auton. Robot. 2014, 37, 279–300. [Google Scholar] [CrossRef]
Wehbe, B.; Bazzi, S.; Shammas, E. Novel three-dimensional optimal path planning method for vehicles with constrained pitch and yaw. Robotica 2017, 35, 2157–2176. [Google Scholar] [CrossRef]
Shkel, A.M.; Lumelsky, V. Classification of the Dubins set. Robot. Auton. Syst. 2001, 34, 179–202. [Google Scholar] [CrossRef]
Li, Y.; Littlefield, Z.; Bekris, K.E. Asymptotically optimal sampling-based kinodynamic planning. Int. J. Robot. Res. 2016, 35, 528–564. [Google Scholar] [CrossRef]
Connell, D.; La, H.M. Extended rapidly exploring random tree-based dynamic path planning and replanning for mobile robots. Int. J. Adv. Robot. Syst. 2018, 15, 1729881418773874. [Google Scholar] [CrossRef]
Otte, M.; Frazzoli, E. RRTX: Asymptotically optimal single-query sampling-based motion planning with quick replanning. Int. J. Robot. Res. 2016, 35, 797–822. [Google Scholar] [CrossRef]

Figure 1. Illustration of the 3D CSC Dubins curve. This curve is composed of two circular segments (

C_{1}

and

C_{2}

) and one straight line segment (S). The initial and terminal configurations of this curve are illustrated by green vector and red vector separately.

Figure 1. Illustration of the 3D CSC Dubins curve. This curve is composed of two circular segments (

C_{1}

and

C_{2}

) and one straight line segment (S). The initial and terminal configurations of this curve are illustrated by green vector and red vector separately.

Figure 2. Structure of the designed BPNN for length estimation.

Figure 3. Error graph of the test result of Group 2. The blue bars represent the absolute estimation error of Dubins path lengths across 1000 samples. Red dots indicate outliers with the largest absolute errors. The inset zooms into the first 100 data points for local detail.

Figure 4. Estimated and computed Dubins path lengths over 1000 random samples per group.

Figure 5. Implementation of the proposed planning method triggered by changes in the feasibility of the current trajectory followed by the UUV.

Figure 6. Illustration of the sampling, updating, optimizing, and exploring procedures: (a) Sampling direction generation and new node creation; (b) connection to the parent node with minimum cost; (c) cost optimization by reconnecting neighbors; (d) goal-biased exploration and rough path generation.

Figure 7. Illustration of the three types of newly generated rough paths caused by parent updates during graph optimization. Type I (blue solid line): unchanged segment from the root node to a parent-updated node. Type II (gray + green dashed line): path connecting two updated nodes, combining a new segment with a reused part of the original path. Type III (gray + orange dashed line): updated segment from the root to an intermediate node, followed by the original goal-reaching portion.

Figure 8. The typical simulation results of the comparative algorithms in Scenario 1. (a) Resultant path of 3D RRT*. (b) Resultant path of informed RRT*. (c) Resultant path of dynamic RRT*. (d) Resultant path of SST*. (e) Resultant path of RRTX. (f) Resultant path of DRRT*. In each subfigure, the blue line denotes the final path generated by the corresponding algorithm. The red short line marks the starting position, and the green sphere indicates the goal location. Colored segments such as yellow, red, or green represent intermediate path segments or exploration trajectories during planning. Obstacles are visualized as translucent blue and green spheres.

Figure 9. Statistic results of path planning in Scenario 1. (a) The statistics of resultant path lengths in Scenario 1. (b) The statistics of computation consumptions in Scenario 1.

Figure 10. The typical simulation results of the comparative algorithms in Scenario 2. (a) Resultant path of the 3D RRT*. (b) Resultant path of the informed RRT*. (c) Resultant path of the dynamic RRT*. (d) Resultant path of the SST*. (e) Resultant path of the RRTX. (f) Resultant path of the DRRT*. In each subfigure, the blue line denotes the final path generated by the corresponding algorithm. The red short line marks the starting position, and the green sphere indicates the goal location. Colored segments such as yellow, red, or green represent intermediate path segments or exploration trajectories during planning. Obstacles are visualized as translucent blue and green spheres.

Figure 11. Statistic results of path planning in Scenario 2. (a) Statistics of resultant path lengths in Scenario 2. (b) Statistics of computation consumption in Scenario 2.

Figure 12. Path planning results of the APF, Modified RRT*, and DRRT* algorithms in Scenario 3. (a) Local minimum problem in the APF algorithm. (b) Resultant path of the APF algorithm. (c) Initial path planning result of modified RRT*. (d) Path replanning result of modified RRT*. (e) Resultant path of modified RRT*. (f) Initial path planning result of DRRT*. (g) Path replanning result of DRRT*.

Figure 13. Statistical comparison of the 3D RRT* and DRRT* algorithms in Scenario 3. (a) Path planning time consumption of the prescribed algorithms. (b) Resultant path length of the prescribed algorithms.

Table 1. Establishment conditions of CSC curves.

Case	$v_{s}$	$k_{1} & k_{2}$	Condition
Case 1	$\frac{δ p + k_{2} V (p_{2}) - k_{1} V (p_{1})}{m (k_{1}, k_{2})}$	$k_{2} \geq k_{1}$	none
Case 2		$k_{2} < k_{1}$	$q (k_{1}, k_{2}) > 0$
Case 3	$- \frac{δ p + k_{2} {\vec{V}}_{2} - k_{1} {\vec{V}}_{1}}{m (k_{1}, k_{2})}$	$k_{2} > k_{1}$	$q (k_{1}, k_{2}) < 0$

Table 2. Computation time and error comparison.

Group	Numerical Method	BPNN Method	Avg. Rel. Error
1	39.6973 s	0.2033 s	4.77%
2	38.6567 s	0.1941 s	4.79%
3	38.0622 s	0.1948 s	4.47%
4	39.3130 s	0.1969 s	4.02%

Table 3. Nomenclature of variables in the improved RRT* algorithm.

Variable	Definition
$t (p)$	Return the path length from root to $C (p)$
$D_{i, j}$	Straight line segment that connects $p_{i}$ and $p_{j}$
$R_{i, . . . j, k}$	Unsmoothed path from root to $p_{k}$ via $p_{i}, . . ., p_{j}$
$Q (p)$	Return the collision detection result stored at $C (p)$
$F (p)$	Return the parent node of $C (p)$
$L_{i, . . ., j, k}$	Smooth path from root to $C (p_{k})$ via $C (p_{i}), . . ., C (p_{j})$
$G$	Random tree for path planning

Table 4. Parameter selection.

Parameter	$α$	$ϵ$	$β$	$ε$	$η$	$κ$
Value	$1 / 6$	$4 r_{m i n}$	$1.5$	3	100	1

Table 5. Simulation results in Scenario 3.

Algorithm	Item	Length	Nodes Num	Time Consumption
APF	Max	$32.50$	−	$0.13$ s
	Aver	$31.14$	−	$0.10$ s
	Min	$30.60$	−	$0.09$ s
Modified RRT*	Max	$42.99$	569	$221.31$ s
	Aver	$31.97$	$235.45$	$59.31$ s
	Min	$23.52$	82	$10.97$ s
DRRT*	Max	$39.10$	86	$6.93$ s
	Aver	$26.94$	$18.83$	$1.42$ s
	Min	$22.40$	4	$0.16$ s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pan, F.; Cui, P.; Cui, B.; Yan, W.; Zhang, S. 3D Dubins Curve-Based Path Planning for UUV in Unknown Environments Using an Improved RRT* Algorithm. J. Mar. Sci. Eng. 2025, 13, 1354. https://doi.org/10.3390/jmse13071354

AMA Style

Pan F, Cui P, Cui B, Yan W, Zhang S. 3D Dubins Curve-Based Path Planning for UUV in Unknown Environments Using an Improved RRT* Algorithm. Journal of Marine Science and Engineering. 2025; 13(7):1354. https://doi.org/10.3390/jmse13071354

Chicago/Turabian Style

Pan, Feng, Peng Cui, Bo Cui, Weisheng Yan, and Shouxu Zhang. 2025. "3D Dubins Curve-Based Path Planning for UUV in Unknown Environments Using an Improved RRT* Algorithm" Journal of Marine Science and Engineering 13, no. 7: 1354. https://doi.org/10.3390/jmse13071354

APA Style

Pan, F., Cui, P., Cui, B., Yan, W., & Zhang, S. (2025). 3D Dubins Curve-Based Path Planning for UUV in Unknown Environments Using an Improved RRT* Algorithm. Journal of Marine Science and Engineering, 13(7), 1354. https://doi.org/10.3390/jmse13071354

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

3D Dubins Curve-Based Path Planning for UUV in Unknown Environments Using an Improved RRT* Algorithm

Abstract

1. Introduction

2. Mathematical Formulation

2.1. Problem Statement

2.2. 3D Dubins Curve Determination

2.3. Fast 3D Dubins Curve Length Estimation

3. Efficient Smooth Path Planning

3.1. Overview

3.2. Collision Detection

3.3. Path Reuse

3.4. Improved RRT* Algorithm

3.4.1. Sampling in Obstructive Environment

3.4.2. Expanding via Updating and Optimizing

3.4.3. Exploring Towards Destination

3.5. Path Reshape

4. Simulation and Discussion

4.1. Scenario 1: Unknown Environment with Static Obstacles

4.2. Scenario 2: An Unknown Environment with Both Static and Moving Obstacles

4.3. Scenario 3: Evaluating Robustness in a High-Density Obstacle Environment

4.4. Discussion on Scalability and Limitations

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI