A 3D UAV Path Planning Algorithm Based on Bidirectional RRT* with Adaptive Directional Sampling and Cooperative Dual-Tree Expansion

Zhao, Yaoyu; Huang, Wencong; Chang, Yufang; Qin, Ziyu

doi:10.3390/app16105065

Open AccessArticle

A 3D UAV Path Planning Algorithm Based on Bidirectional RRT* with Adaptive Directional Sampling and Cooperative Dual-Tree Expansion

School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan 430068, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(10), 5065; https://doi.org/10.3390/app16105065

Submission received: 23 April 2026 / Revised: 15 May 2026 / Accepted: 16 May 2026 / Published: 19 May 2026

(This article belongs to the Section Robotics and Automation)

Download

Browse Figures

Versions Notes

Abstract

UAV path planning in complex three-dimensional obstacle environments requires a balance between search efficiency and flight feasibility. However, existing RRT*-based methods often fail to satisfy this requirement, as their random sampling lacks directional guidance and makes limited use of environmental information. To this end, this paper proposes an environment-aware cooperative bidirectional RRT* algorithm (EAC-Bi-RRT*). In the sampling stage, the sampling probability of each direction is adaptively adjusted according to the obstacle distribution across 26 directional sectors and the relative goal orientation, so that the search receives stronger directional guidance. During bidirectional expansion, the two trees are assigned leader and follower roles according to the local expandability on the start and goal sides, and their cooperative search is combined with an environment-adaptive step size and a climbing-angle constraint to balance search efficiency and flight reachability. When an expanding node approaches an obstacle, a repulsive-only local directional correction suppresses oscillation, and the initial path is then smoothed by a curvature-constrained B-spline to form a continuous flight trajectory. Across all test scenarios, EAC-Bi-RRT* achieves a 100% planning success rate. Compared with the baseline algorithms, it reduces planning time by approximately 54–90% and path length by approximately 5–18% while maintaining low average turning angles, which demonstrates competitive overall performance.

Keywords:

unmanned aerial vehicle (UAV); 3D path planning; bidirectional RRT*; adaptive directional sampling; cooperative dual-tree expansion

1. Introduction

Unmanned Aerial Vehicles (UAVs) are widely used in urban logistics [1,2], disaster search and rescue [3,4], infrastructure inspection [5,6], and confined-space operations [7,8]. Unlike ground robots that mainly operate in a 2D plane, UAVs performing these tasks must plan paths in three-dimensional space [9]. The planner needs to consider collision risks from different directions, vertical occlusion caused by obstacle heights, and the kinematic constraints imposed by climbing capability [10,11]. When the search space extends from two dimensions to three, the geometric and topological relationships among obstacles become more complex, and feasible trajectory generation is subject to stricter constraints [12].

In complex 3D environments, sampling-based methods remain widely used for path planning because they adapt well to high-dimensional spaces [13]. RRT [14] and RRT* [15] are two of the most commonly used algorithms in this category. RRT is simple to implement and does not require environment preprocessing, but its random expansion process provides little control over path quality and often produces redundant paths [16,17]. RRT* improves path quality through parent selection and rewiring, yet in cluttered 3D environments its uniform random sampling and fixed-step expansion still limit search efficiency [18]. The piecewise linear paths generated in this way also make it difficult to satisfy kinematic constraints such as the climbing-angle requirement of UAVs. Although many improved variants enhance RRT* from different aspects, they still do not jointly balance search efficiency, path quality, and environmental adaptability in complex 3D workspaces.

In the sampling stage, Gammell et al. [19] proposed Informed-RRT*, which restricts sampling to the heuristic region defined by the current best solution and reduces invalid samples. HBAI-RRT* [20] and G-RRT* [21] further compress the sampling domain by incorporating segmented informed regions or heuristic cost information, thereby improving sampling efficiency. These methods mainly improve the sampling range, but their direct guidance on tree expansion direction remains limited. Jiang et al. [22] proposed FS-RRT, which constrains subsequent sampling angles according to collision feedback generated during expansion. The directional deflection sampling strategy [23] further adjusts the sampling deflection angle according to obstacle distribution along the goal direction to improve the success rate of expansion. Overall, existing methods improve sampling guidance from different perspectives, but how sampling resources should be organized and allocated across different directions in 3D space is still not sufficiently discussed.

In bidirectional search, B-RRT* [24] preserves the asymptotic optimality of RRT* within a dual-tree expansion framework while reducing computational cost through heuristic strategies. IB-RRT* [25] mainly improves the connection process between the two trees, using an intelligent sample insertion mechanism to speed up their connection. These methods shorten the time required to obtain an initial feasible path, but the connection process is still sensitive to randomness and may introduce additional computational overhead. CBQ-RRT* [26] further improves path smoothness by introducing kinematic constraints into bidirectional expansion and optimizing connection quality during tree merging. Cao et al. [27] combined an artificial potential field with a variable-step expansion strategy and adaptively adjusted the expansion direction and step size according to feedback on the distance between nodes and obstacles, thereby improving search efficiency and obstacle avoidance capability in cluttered environments. These methods improve the performance of bidirectional search, but most of them still apply similar sampling and expansion rules to both trees, making it difficult to fully exploit the different local obstacle distributions on the start side and the goal side.

For expansion guidance and obstacle avoidance, combining the artificial potential field with RRT* is a common improvement strategy. Qureshi and Ayaz [28] incorporated the artificial potential field into the random sampling process and used the potential-function gradient to steer samples toward the goal, thereby reducing invalid sampling and accelerating convergence. PF-RRT* proposed by Fan et al. [29] considers the combined effect of random points, the goal, and obstacles during new node generation to further improve convergence speed and path quality. HMA-RRT* [30] further integrates dynamic-region sampling, a hierarchical escape mechanism, heading-angle constraints, and adaptive step-size adjustment to improve search efficiency and planning feasibility in complex environments. The artificial potential field itself is still susceptible to local minima and oscillation near the goal region. Kilic et al. [31] introduced tangential forces, inertial heuristic forces, local minimum detection, and dynamic coefficient adjustment to enhance stable obstacle avoidance in complex environments. Even so, most existing APF-RRT* hybrid methods still retain the dominant role of the attractive term in global expansion; when this mechanism is combined with goal-biased guidance, the two can overlap functionally and may further intensify oscillatory behavior near the goal region.

To address the aforementioned issues, this paper proposes an environment-aware cooperative bidirectional RRT* (EAC-Bi-RRT*) algorithm for UAV path planning in complex 3D environments. The main contributions of this work are summarized as follows:

(1): An adaptive directional sampling strategy is proposed. The strategy partitions the 3D sampling space into 26 spherical sectors, assigns differentiated sampling probabilities by integrating obstacle density and goal-direction information, and further introduces a distance-adaptive goal-bias probability, which raises the effective sampling rate while preserving global exploration capability.
(2): A cooperative dual-tree expansion strategy is developed. It incorporates an adaptive variable step-size mechanism and realizes a one-time automatic assignment of the leader and follower roles based on a local expandability assessment at the start and goal states. In addition, the maximum climbing angle of the UAV is embedded into the expansion process as a hard constraint to ensure the kinematic feasibility of each expansion segment.
(3): A repulsive-only local potential field correction mechanism is introduced. In the vicinity of obstacles, the mechanism uses the repulsive term for directional correction and incorporates a bounded goal influence factor so that the repulsive effect gradually weakens as the node approaches the goal. This design mitigates path oscillation near the goal region.

The paper is organized as follows. Section 2 sets up the 3D path planning problem together with the notation used throughout. RRT* and its main extensions are reviewed in Section 3. Section 4 describes the proposed EAC-Bi-RRT* algorithm, focusing on its three core mechanisms. Simulation studies and their interpretation are given in Section 5, and Section 6 closes the paper with conclusions and directions for further work.

2. Problem Definition

Let the 3D bounded workspace be

X \subset R^{3}

, the obstacle region be

X_{o b s} \subset X

, and the free space be

X_{f r e e} = X ∖ X_{o b s}

. Given a start point

x_{s t a r t} \in X_{f r e e}

and a goal point

x_{g o a l} \in X_{f r e e}

, the goal region is defined as

X_{g o a l} = \{x \in X_{f r e e} | ∥ x - x_{g o a l} ∥ \leq r_{g o a l}\},

(1)

where

r_{g o a l} > 0

is the arrival tolerance radius.

A path is defined as an ordered sequence of

K + 1

waypoints

σ = {x_{0}, x_{1}, \dots, x_{K}}

, where

x_{k} = {(x_{k}, y_{k}, z_{k})}^{T}

,

x_{0} = x_{s t a r t}

, and

x_{K} \in X_{g o a l}

. Consecutive waypoints are connected by straight-line segments to form a piecewise-linear path. For any two consecutive waypoints

x_{k}

and

x_{k + 1}

, the climb angle is defined as

γ_{k} = arctan (\frac{z_{k + 1} - z_{k}}{\sqrt{{(x_{k + 1} - x_{k})}^{2} + {(y_{k + 1} - y_{k})}^{2}}}) .

(2)

Definition 1.

A path σ is feasible if every waypoint and connecting segment lies within the free space, i.e.,

x_{k} \in X_{f r e e}

and

\bar{x_{k} x_{k + 1}} \subset X_{f r e e}

for

k = 0, 1, \dots, K - 1

, and the climb angle of every segment satisfies

| γ_{k} | \leq γ_{m a x}

, where

γ_{m a x}

is the maximum allowable climb angle. Let Σ denote the set of all feasible paths.

Definition 2.

The optimal path

σ^{*}

minimizes the cost over all feasible paths:

σ^{*} = arg min_{σ \in Σ} \sum_{k = 0}^{K - 1} ∥ x_{k + 1} - x_{k} ∥ .

(3)

3. Related Works

3.1. RRT

RRT [14] is one of the foundational algorithms in sampling-based path planning and serves as the basis of the proposed method. Given the start configuration

x_{s t a r t}

, the algorithm initializes a search tree T rooted at

x_{s t a r t}

. At each iteration, a sample

x_{r a n d}

is drawn uniformly from the free space, and the node

x_{n e a r e s t}

in T that is closest to

x_{r a n d}

in Euclidean distance is identified. A new node

x_{n e w}

is then generated by extending from

x_{n e a r e s t}

toward

x_{r a n d}

with a fixed step size

η

. The algorithm first checks whether the segment between

x_{n e a r e s t}

and

x_{n e w}

is collision-free. If no collision is detected,

x_{n e w}

is added to T; otherwise, the current expansion is not executed. The algorithm terminates when the newly generated

x_{n e w}

enters the prescribed radius of the goal

x_{g o a l}

and the connecting segment is collision-free. The feasible path is then obtained by tracing the parent pointers from this node back to the root.

3.2. RRT*

RRT* is an asymptotically optimal extension of RRT, and its overall procedure is summarized in Algorithm 1. After generating

x_{n e w}

through the same sampling and expansion procedure as RRT, RRT* no longer connects it directly to

x_{n e a r e s t}

, but instead performs two additional optimization operations within the overall framework, as illustrated in Figure 1. The first is parent selection (ChooseParent, Algorithm 2), which identifies all neighboring nodes within a given radius of

x_{n e w}

, evaluates the cumulative cost from the start node to

x_{n e w}

through each neighbor, and selects the collision-free node with the minimum cost as the parent of

x_{n e w}

. The second is rewiring (Rewire, Algorithm 3), which checks whether routing through

x_{n e w}

yields a lower-cost path to any neighboring node; if so, and if the connection is collision-free, that node is rewired through

x_{n e w}

. By continuously performing parent selection and rewiring, RRT* gradually reduces the cost of the best path found so far and asymptotically converges to the global optimum as the number of samples becomes sufficiently large.

Algorithm 1 RRT*

Input:

x_{s t a r t}

,

x_{g o a l}

,

X_{o b s}

,

η

, N

Output: Path

σ

or ∅

1:

T \leftarrow {V = {x_{s t a r t}}, E = \emptyset}

2: for

i = 1

N do

3:

x_{r a n d} \leftarrow SAMPLEFREE (X_{f r e e})

4:

x_{n e a r e s t} \leftarrow NEAREST (T, x_{r a n d})

5:

x_{n e w} \leftarrow STEER (x_{n e a r e s t}, x_{r a n d}, η)

6: if

COLLISIONFREE (x_{n e a r e s t}, x_{n e w}, X_{o b s})

then

7:

Q_{n e a r} \leftarrow NEAR (T, x_{n e w}, r_{n})

8:

x_{p a r e n t} \leftarrow CHOOSEPARENT (Q_{n e a r}, x_{n e a r e s t}, x_{n e w})

9:

V \leftarrow V \cup {x_{n e w}}

;

E \leftarrow E \cup {(x_{p a r e n t}, x_{n e w})}

10:

REWIRE (T, Q_{n e a r}, x_{n e w})

11: end if

12: end for

13: return

EXTRACTPATH (T)

Algorithm 2 ChooseParent

Input:

Q_{n e a r}

,

x_{n e a r e s t}

,

x_{n e w}

Output:

x_{p a r e n t}

1:

x_{p a r e n t} \leftarrow x_{n e a r e s t}

;

c_{m i n} \leftarrow COST (x_{n e a r e s t}) + ∥ x_{n e a r e s t} - x_{n e w} ∥

2: for each

x_{n e a r} \in Q_{n e a r}

do

3:

c \leftarrow COST (x_{n e a r}) + ∥ x_{n e a r} - x_{n e w} ∥

4: if

c < c_{m i n}

and

COLLISIONFREE (x_{n e a r}, x_{n e w}, X_{o b s})

then

5:

x_{p a r e n t} \leftarrow x_{n e a r}

;

c_{m i n} \leftarrow c

6: end if

7: end for

8: return

x_{p a r e n t}

Algorithm 3 Rewire

Input:

T = (V, E)

,

Q_{n e a r}

,

x_{n e w}

Output: Updated T

1: for each

x_{n e a r} \in Q_{n e a r}

do

2:

c \leftarrow COST (x_{n e w}) + ∥ x_{n e w} - x_{n e a r} ∥

3: if

c < COST (x_{n e a r})

and

COLLISIONFREE (x_{n e w}, x_{n e a r}, X_{o b s})

then

4:

E \leftarrow E ∖ {(PARENT (x_{n e a r}), x_{n e a r})} \cup {(x_{n e w}, x_{n e a r})}

5: end if

6: end for

7: return T

3.3. APF-RRT*

To alleviate the uninformed expansion behavior of RRT*, APF-RRT* incorporates an artificial potential field (APF) into the tree expansion process. The basic idea is derived from a physical field analogy: the goal is modeled as an attractive source that pulls each node toward the target, while obstacles are modeled as repulsive sources that push the node away once it falls within a preset influence radius. The repulsive effect acts only in the vicinity of obstacles and becomes stronger as the node moves closer to an obstacle. The expansion direction is determined by the combined action of the attractive and repulsive fields, allowing the tree to move toward the goal while bypassing regions with dense obstacles. The pseudocode is given in Algorithm 4.

Algorithm 4 APF-RRT*

Input:

x_{s t a r t}

,

x_{g o a l}

,

X_{o b s}

,

η

, N,

k_{a t t}

,

k_{r e p}

,

ρ_{0}

Output: Path

σ

or ∅

1:

T \leftarrow {V = {x_{s t a r t}}, E = \emptyset}

2: for

i = 1

N do

3:

x_{r a n d} \leftarrow SAMPLEFREE (X_{f r e e})

4:

x_{n e a r e s t} \leftarrow NEAREST (T, x_{r a n d})

5: {APF-guided expansion}

6:

F_{a t t} \leftarrow ATTRACTION (x_{n e a r e s t}, x_{g o a l}, k_{a t t})

7:

F_{r e p} \leftarrow REPULSION (x_{n e a r e s t}, X_{o b s}, k_{r e p}, ρ_{0})

8:

F_{t o t a l} \leftarrow F_{a t t} + F_{r e p}

9:

x_{n e w} \leftarrow x_{n e a r e s t} + η \cdot F_{t o t a l} / ∥ F_{t o t a l} ∥

10: if

COLLISIONFREE (x_{n e a r e s t}, x_{n e w}, X_{o b s})

then

11:

Q_{n e a r} \leftarrow NEAR (T, x_{n e w}, r_{n})

12:

x_{p a r e n t} \leftarrow CHOOSEPARENT (Q_{n e a r}, x_{n e a r e s t}, x_{n e w})

13:

V \leftarrow V \cup {x_{n e w}}

;

E \leftarrow E \cup {(x_{p a r e n t}, x_{n e w})}

14:

REWIRE (T, Q_{n e a r}, x_{n e w})

15: end if

16: end for

17: return

EXTRACTPATH (T)

3.4. Bi-RRT*

Bidirectional search is a common way to improve search efficiency. Bi-RRT* builds two search trees,

T_{1}

and

T_{2}

, from the start point

x_{s t a r t}

and the goal point

x_{g o a l}

, respectively, and brings them gradually closer through alternating expansion. Taking

T_{1}

as an example, each iteration consists of sampling, nearest-neighbor search, and node expansion. If the expanded segment passes collision checking, the new node

x_{n e w}

is inserted into tree

T_{1}

. ChooseParent and Rewire are then applied to optimize the local tree structure. The algorithm then searches in

T_{2}

for the node

x_{c l o s e}

that is nearest to

x_{n e w}

. If the distance between them is smaller than a predefined connection threshold and the connecting segment is collision-free, the two partial paths are concatenated to form a complete path from the start to the goal. Otherwise, the roles of

T_{1}

and

T_{2}

are exchanged in the next iteration, and the process continues until the two trees are connected or the iteration budget is exhausted. Algorithm 5 outlines the full procedure of Bi-RRT*. By growing from both ends, bidirectional search approximately halves the distance that a single tree needs to traverse and can significantly accelerate the discovery of an initial feasible path. However, the two trees use identical sampling strategies and step-size parameters, which prevents them from adapting to local environmental differences. In addition, their connection still relies entirely on random sampling, without any active mechanism for tracking the frontier of the other tree, leaving considerable room for improvement in obstacle-rich environments.

Algorithm 5 Bi-RRT*

Input:

x_{s t a r t}

,

x_{g o a l}

,

X_{o b s}

,

η

,

η_{c o n n e c t}

, N

Output: Path

σ

or ∅

1:

T_{1} \leftarrow {V_{1} = {x_{s t a r t}}, E_{1} = \emptyset}

;

T_{2} \leftarrow {V_{2} = {x_{g o a l}}, E_{2} = \emptyset}

2: for

i = 1

N do

3: {Expand $T_{1}$ }

4:

x_{r a n d} \leftarrow SAMPLEFREE (X_{f r e e})

5:

x_{n e a r e s t} \leftarrow NEAREST (T_{1}, x_{r a n d})

6:

x_{n e w} \leftarrow STEER (x_{n e a r e s t}, x_{r a n d}, η)

7: if

COLLISIONFREE (x_{n e a r e s t}, x_{n e w}, X_{o b s})

then

8:

Q_{n e a r} \leftarrow NEAR (T_{1}, x_{n e w}, r_{n})

9:

x_{p a r e n t} \leftarrow CHOOSEPARENT (Q_{n e a r}, x_{n e a r e s t}, x_{n e w})

10:

V_{1} \leftarrow V_{1} \cup {x_{n e w}}

;

E_{1} \leftarrow E_{1} \cup {(x_{p a r e n t}, x_{n e w})}

11:

REWIRE (T_{1}, Q_{n e a r}, x_{n e w})

12: {Attempt connection}

13:

x_{c l o s e} \leftarrow NEAREST (T_{2}, x_{n e w})

14: if

∥ x_{n e w} - x_{c l o s e} ∥ < η_{c o n n e c t}

and

COLLISIONFREE (x_{n e w}, x_{c l o s e}, X_{o b s})

then

15: return

MERGEPATH (T_{1}, T_{2}, x_{n e w}, x_{c l o s e})

16: end if

17: end if

18:

SWAP (T_{1}, T_{2})

{Alternate trees}

19: end for

20: return ∅

4. The Proposed EAC-Bi-RRT* Algorithm

4.1. Overall Algorithmic Framework

This section presents the EAC-Bi-RRT* algorithm. The proposed method adopts bidirectional RRT* as the basic framework and embeds environment-aware mechanisms into key stages including sampling, tree expansion, near-obstacle correction, and path postprocessing, thereby forming an integrated planning pipeline for complex 3D obstacle environments.

During the sampling stage, the algorithm partitions the local 3D directional space around the expansion node into spherical sectors and assigns differentiated sampling probabilities by jointly considering obstacle distribution and goal-direction information. A distance-adaptive goal bias is further introduced to suppress invalid expansions when the goal direction is obstructed. During the dual-tree expansion stage, the expansion step size is adaptively adjusted according to the local environmental complexity. The leader/follower roles are assigned once at initialization based on a local expandability assessment, so as to enhance the coordination between the two trees and promote rapid connection. In near-obstacle regions, a local potential field containing only a repulsive term corrects the expansion direction, while a climb-angle constraint ensures the physical executability of each expansion. After the two trees are successfully connected, the initial polyline path is further processed by curvature-constrained B-spline smoothing with collision-safe fallback, so as to improve path quality and kinematic feasibility. Figure 2 illustrates the overall workflow of EAC-Bi-RRT*.

On this basis, Algorithm 6 presents the main pseudocode of EAC-Bi-RRT* in order to further clarify the calling relationships and execution order of the constituent modules within the overall planning loop. The following subsections describe the design principles and implementation details of each key mechanism.

Algorithm 6 EAC-Bi-RRT*

Input:

x_{s t a r t}

,

x_{g o a l}

,

X_{o b s}

(obstacles),

X

(workspace), N (max iterations)

Output: Optimized path

σ^{*}

1: {Initialization}

2:

T_{1} \leftarrow {V_{1} = {x_{s t a r t}}, E_{1} = \emptyset}

;

T_{2} \leftarrow {V_{2} = {x_{g o a l}}, E_{2} = \emptyset}

3:

d_{i n i t i a l} \leftarrow ∥ x_{s t a r t} - x_{g o a l} ∥

4:

η_{m a x}^{g l o b a l} \leftarrow η_{0} \cdot (1 - R_{v}) / e^{R_{n}}

{Global complexity assessment, Equation (14)}

5: {Environment-aware role assignment}

6:

η_{s} \leftarrow ADAPTIVESTEP (x_{s t a r t}, X_{o b s})

;

η_{g} \leftarrow ADAPTIVESTEP (x_{g o a l}, X_{o b s})

{Equation (18)}

7: if

η_{s} \geq η_{g}

then

8:

T_{L e a d e r} \leftarrow T_{1}

;

T_{F o l l o w e r} \leftarrow T_{2}

9: else

10:

T_{L e a d e r} \leftarrow T_{2}

;

T_{F o l l o w e r} \leftarrow T_{1}

11: end if

12: {Main loop}

13: for

i = 1

N do

14:

(σ, f o u n d) \leftarrow COOPERATIVEEXPAND (T_{L e a d e r}, T_{F o l l o w e r}, X_{o b s})

15: if

f o u n d

then

16:

σ^{*} \leftarrow GREEDYSHORTCUT (σ, X_{o b s})

{Greedy path pruning}

17:

σ^{*} \leftarrow BSLINESMOOTH (σ^{*}, X_{o b s}, κ_{m a x})

{Curvature-constrained smoothing, Equation (28)}

18: return

σ^{*}

19: end if

20: end for

21: return ∅ {Failure}

4.2. Adaptive Directional Sampling Strategy

We define a local spherical coordinate system centered at the current expansion node

x_{n e a r}

. A spatial direction is described by an azimuth angle

θ \in [0, 2 π)

and an elevation angle

ϕ \in [- π / 2, π / 2]

. Here,

θ

measures the horizontal direction counterclockwise from the positive x-axis, and

ϕ

gives the angular deviation from the horizontal plane, with upward taken as positive. This local frame is used only to parameterize the candidate expansion directions, while all node positions and collision-free checks remain in the global Cartesian frame.

This partition follows the classical 26-neighborhood connectivity in discrete 3D space. The spherical space is divided into five elevation layers. The upper, middle, and lower layers each contain eight azimuthal sectors spaced at

45 °

, while the two polar caps are not further subdivided. Table 1 gives the partitioning scheme, and Figure 3 shows the overall structure.

The center direction vector

d_{i}

specifies sector

S_{i}

. The center directions of the top and bottom polar caps are

d_{0} = {[0, 0, 1]}^{T}

and

d_{25} = {[0, 0, - 1]}^{T}

, respectively. For the upper, middle, and lower layers, the sector center direction depends on the layer elevation parameter

ϕ_{l a y e r}

and the azimuth index

j = 0, 1, \dots, 7

:

d_{l a y e r, j} = (\begin{matrix} cos ϕ_{l a y e r} cos (j \cdot 45 ° + 22.5 °) \\ cos ϕ_{l a y e r} sin (j \cdot 45^{\circ} + 22.5 °) \\ sin ϕ_{l a y e r} \end{matrix}),

(4)

where the upper layer takes

ϕ_{u p p e r} = 45^{\circ}

, the middle layer takes

ϕ_{m i d d l e} = 0 °

, and the lower layer takes

ϕ_{l o w e r} = - 45 °

.

The sampling probability assigned to each sector depends on the obstacle distribution within that sector. To this end, obstacles are first counted within the spherical region centered at

x_{n e a r}

with radius

R_{s e n s e}

and denoted as

O_{s e n s e} = \{O_{j} | ∥ c_{j} - x_{n e a r} ∥ \leq R_{s e n s e}\},

(5)

where

c_{j}

is the center coordinate of the j-th obstacle. For each

O_{j} \in O_{s e n s e}

, the vector from the expansion node to the obstacle center is defined as

v_{j} = {(v_{x}, v_{y}, v_{z})}^{T} = c_{j} - x_{n e a r}

. Its azimuth and elevation angles are computed as

\begin{matrix} θ_{j} & = \{\begin{matrix} arctan (\frac{v_{y}}{v_{x}}), & v_{x} > 0, v_{y} \geq 0, \\ arctan (\frac{v_{y}}{v_{x}}) + 2 π, & v_{x} > 0, v_{y} < 0, \\ arctan (\frac{v_{y}}{v_{x}}) + π, & v_{x} < 0, \\ \frac{π}{2}, & v_{x} = 0, v_{y} > 0, \\ \frac{3 π}{2}, & v_{x} = 0, v_{y} < 0, \end{matrix} \\ ϕ_{j} & = arctan (\frac{v_{z}}{\sqrt{v_{x}^{2} + v_{y}^{2}}}) . \end{matrix}

(6)

The angle

ϕ_{j}

determines the elevation layer of obstacle

O_{j}

. Within that layer,

θ_{j}

determines its azimuthal sector. In this way,

O_{j}

is assigned to the corresponding sector

S_{i}

. After all obstacles are assigned to sectors, the obstacle density of each sector can be written as

ρ_{i} = \frac{| O_{i} |}{N_{t o t a l}},

(7)

where

| O_{i} |

is the number of obstacles assigned to sector

S_{i}

, and

N_{t o t a l} = | O_{s e n s e} |

is the total number of obstacles in the sensing region. A larger

ρ_{i}

means that the direction of sector

S_{i}

is more crowded with obstacles.

Obstacle distribution alone is not sufficient to determine the sampling probability, and the goal direction also needs to be taken into account. To measure the alignment between each sector and the goal direction, the cosine between the sector center direction

d_{i}

and the goal direction is computed as

cos Θ_{i} = d_{i} \cdot d_{g o a l},

(8)

where

d_{g o a l} = (x_{g o a l} - x_{n e a r}) / ∥ x_{g o a l} - x_{n e a r} ∥

is the unit vector pointing from the current node to the goal.

Combining the sector obstacle density

ρ_{i}

and the goal-direction cosine

cos Θ_{i}

, the sampling probability of each sector is defined as

P_{s e c t o r} (i) = \frac{e^{- α \cdot ρ_{i}} \cdot (1 + β \cdot cos Θ_{i})}{\sum_{j = 0}^{25} e^{- α \cdot ρ_{j}} \cdot (1 + β \cdot cos Θ_{j})},

(9)

where

α > 0

is the obstacle sensitivity coefficient and

β \in (0, 1)

is the goal orientation coefficient.

The sampling process consists of two stages: sector selection and intra-sector sampling. First, a target sector

S_{k}

is selected according to the probability distribution

P_{s e c t o r}

via roulette wheel selection. Then, the azimuth angle

θ_{s}

and elevation angle

ϕ_{s}

are uniformly sampled within the angular range of

S_{k}

, and the radial distance

r = R_{s a m p l e} \cdot u^{1 / 3}

(

u \sim U (0, 1)

) is computed with a cube-root transformation to ensure volumetric uniformity inside the sphere. The resulting sample point is

x_{s a m p l e} = x_{n e a r} + r \cdot {[cos ϕ_{s} cos θ_{s}, cos ϕ_{s} sin θ_{s}, sin ϕ_{s}]}^{T} .

(10)

On top of the sector-based directional bias sampling, a distance-adaptive goal-bias probability is further introduced. Before each sampling step, the algorithm selects the goal point directly as the sample with probability

P_{g o a l}

and enters the sector-based directional bias sampling procedure with probability

1 - P_{g o a l}

. The value of

P_{g o a l}

is dynamically adjusted according to the distance from the current search tree to the goal:

P_{g o a l} (t) = P_{m i n} + (P_{m a x} - P_{m i n}) \cdot \frac{d_{c u r r e n t}}{d_{i n i t i a l}},

(11)

where

d_{i n i t i a l}

is the initial distance from the start to the goal,

d_{c u r r e n t}

is the distance from the nearest node in the current search tree to the goal, and

P_{m i n}

and

P_{m a x}

are the lower and upper bounds of the goal-bias probability, respectively.

The sector-based directional bias sampling and the distance-adaptive goal-bias strategy together constitute the adaptive directional sampling strategy. The complete procedure is summarized in Algorithm 7.

Algorithm 7 Adaptive directional sampling

Input: T (search tree),

x_{t a r g e t}

(sampling target),

X_{o b s}

,

d_{i n i t i a l}

Output: Sampled point

x_{s a m p l e}

1: {Stage 1: Distance-adaptive goal-bias probability}

2:

d_{c u r r e n t} \leftarrow {min}_{v \in V} ∥ v - x_{t a r g e t} ∥

3:

P_{g o a l} \leftarrow P_{m i n} + (P_{m a x} - P_{m i n}) \cdot d_{c u r r e n t} / d_{i n i t i a l}

{Equation (11)}

4: if

rand () < P_{g o a l}

then

5: return

x_{t a r g e t}

6: end if

7: {Stage 2: 26-sector direction-biased sampling}

8:

x_{n e a r} \leftarrow arg {min}_{v \in V} ∥ v - x_{t a r g e t} ∥

9:

d_{g o a l} \leftarrow (x_{t a r g e t} - x_{n e a r}) / ∥ x_{t a r g e t} - x_{n e a r} ∥

10: for

i = 0

to 25 do

11:

ρ_{i} \leftarrow | O_{i} | / | O_{s e n s e} |

{Obstacle density of sector

S_{i}

}

12:

cos Θ_{i} \leftarrow d_{i} \cdot d_{g o a l}

{Goal alignment,

d_{i}

from Equation (4)}

13:

w_{i} \leftarrow exp (- α \cdot ρ_{i}) \cdot (1 + β \cdot cos Θ_{i})

{Equation (9)}

14: end for

15:

P_{i} \leftarrow w_{i} / \sum_{j = 0}^{25} w_{j}, \forall i \in {0, \dots, 25}

{Normalize}

16: {Stage 3: Sector selection and intra-sector sampling}

17:

S^{*} \leftarrow ROULETTESELECT ({P_{0}, \dots, P_{25}})

18: Sample

θ_{s}

,

ϕ_{s}

uniformly within the angular range of

S^{*}

19:

r \leftarrow R_{s a m p l e} \cdot u^{1 / 3}, u \sim U (0, 1)

{Cube-root for volumetric uniformity}

20:

x_{s a m p l e} \leftarrow x_{n e a r} + r \cdot {[cos ϕ_{s} cos θ_{s}, cos ϕ_{s} sin θ_{s}, sin ϕ_{s}]}^{T}

{Equation (10)}

21: return

x_{s a m p l e}

4.3. Environment-Adaptive Variable-Step-Size Mechanism

A large step size increases the risk of collision when the search tree expands through narrow passages or obstacle-dense regions. A small step size improves local obstacle avoidance, but it also creates many redundant nodes in open areas and reduces overall search efficiency. The expansion step size therefore follows a three-level adjustment scheme and changes with the local environment.

The 3D volumetric occupancy ratio of the workspace is defined at initialization as

R_{v} = \frac{\sum_{i = 1}^{N_{o b s}} V_{o b s, i}}{V_{t o t a l}},

(12)

where

V_{o b s, i}

and

V_{t o t a l}

denote the volume of the i-th obstacle and the total workspace volume, respectively.

An obstacle number density is further introduced as

R_{n} = \frac{N_{o b s} \cdot V_{u n i t}}{V_{t o t a l}} .

(13)

where

N_{o b s}

is the total number of obstacles and

V_{u n i t}

is the unit volume.

This gives the global upper bound of the step size as

η_{m a x}^{g l o b a l} = \frac{η_{0} \cdot (1 - R_{v})}{e^{R_{n}}},

(14)

where

η_{0}

is a nominal step size related to the map scale.

At each expansion step, the 3D distance from the current node

x_{n e a r}

to the nearest obstacle surface is computed as

d_{o b s}^{3 D} (x) = {min}_{i \in O} (∥ x - c_{i} ∥ - r_{i})

. Let this distance be denoted by d. The locally adaptive step size is then written as

η_{b a s e} = \{\begin{matrix} η_{m a x}^{g l o b a l}, & d \geq D_{s a f e}, \\ η_{m i n} + (η_{m a x}^{g l o b a l} - η_{m i n}) {(\frac{d}{D_{s a f e}})}^{κ}, & d < D_{s a f e}, \end{matrix}

(15)

where

D_{s a f e}

sets the safety distance threshold,

κ

controls the curvature of the mapping, and

η_{m i n}

sets the minimum step size.

If multiple obstacles fall within the sensing range, a local density correction further reduces the step size:

η_{a d j u s t e d} = \frac{η_{b a s e}}{1 + β_{l o c a l} \cdot max (0, n_{l o c a l} - 1)},

(16)

where

n_{l o c a l}

counts the obstacles within the safety distance range, and

β_{l o c a l}

controls the local density decay.

When an expansion results in a collision, a bisection search along the expansion direction is performed to find the farthest collision-free point:

x_{f a l l b a c k}^{(k)} = x_{n e a r} + \frac{η_{a d j u s t e d}}{2^{k}} \cdot d, k = 1, 2, \dots, k_{m a x} .

(17)

The algorithm halves the step size progressively and accepts the first expansion point that is collision-free and satisfies

∥ x_{f a l l b a c k}^{(k)} - x_{n e a r} ∥ \geq η_{m i n}

. If no valid point is found after

k > k_{m a x}

iterations, the current expansion attempt is abandoned.

4.4. Cooperative Dual-Tree Expansion Strategy

In conventional bidirectional RRT*, the two search trees adopt symmetric expansion behavior, sharing the same step size, sampling strategy, and expansion rules without any division of labor. To address this limitation, a cooperative dual-tree expansion strategy is developed that enhances the collaboration efficiency of the two trees through a one-time asymmetric role assignment and a dynamic connection threshold.

During algorithm initialization, the adaptive step sizes at the start and goal states are computed as

\{\begin{matrix} η_{s t a r t} & = AdaptiveStep (x_{s t a r t}) \\ η_{g o a l} & = AdaptiveStep (x_{g o a l}) \end{matrix}

(18)

The role assignment is then determined as follows:

(Leader, Follower) = \{\begin{matrix} (T_{1}, T_{2}), & η_{s t a r t} \geq η_{g o a l}, \\ (T_{2}, T_{1}), & η_{s t a r t} < η_{g o a l}, \end{matrix}

(19)

where the tree with the larger step size is designated as the leader and the other as the follower.

During the planning process, the step sizes of both trees are independently computed by the variable step-size mechanism based on the local environment of their respective current nodes:

\{\begin{matrix} η_{L e a d e r} & = AdaptiveStep (x_{n e a r, L e a d e r}) \\ η_{F o l l o w e r} & = AdaptiveStep (x_{n e a r, F o l l o w e r}) \end{matrix}

(20)

The leader tree uses the sampling strategy described in Section 4.2 to sample toward its natural target direction. The follower tree selects the most recently expanded node of the leader tree

x_{L e a d e r, n e w}

as its sampling target with probability

P_{b i a s}

, and performs random sampling with probability

1 - P_{b i a s}

. The connection test between the two trees uses a dynamic threshold that is coupled with the current step sizes:

η_{c o n n e c t} = γ_{c} \cdot min (η_{L e a d e r}, η_{F o l l o w e r}),

(21)

where

γ_{c}

is the connection threshold coefficient.

The role assignment, sampling target selection, and dynamic connection test together constitute the cooperative dual-tree expansion strategy. The complete procedure is summarized in Algorithm 8.

4.5. Climb-Angle Constraint

The vertical maneuverability of a UAV is strictly limited by its thrust and power budget. To ensure the physical executability of the planned path, this paper embeds the maximum climb angle

γ_{m a x}

as a hard constraint into the expansion process. Given the original expansion direction

d_{r a w} = {[d_{x}, d_{y}, d_{z}]}^{T}

, its climb angle is defined as

γ = arctan (\frac{d_{z}}{\sqrt{d_{x}^{2} + d_{y}^{2}}}) .

(22)

The direction is kept unchanged when

| γ | \leq γ_{m a x}

. When

| γ | > γ_{m a x}

, the direction is projected onto the boundary of the feasible cone:

d_{constrained} = \frac{d^{'}}{∥ d^{'} ∥}, d^{'} = [\begin{matrix} d_{x} \\ d_{y} \\ sign (d_{z}) h_{x y} tan γ_{m a x} \end{matrix}],

(23)

where

h_{x y} = \sqrt{d_{x}^{2} + d_{y}^{2}}

is the magnitude of the horizontal component and

γ_{m a x}

is the maximum allowable climb angle of the UAV.

Algorithm 8 Cooperative dual-tree expansion

Input:

T_{L}

(leader tree),

T_{F}

(follower tree),

x_{g o a l}

,

X_{o b s}

Output: Connection flag, path

σ

1: {Phase 1: Leader tree expansion}

2:

x_{s a m p l e} \leftarrow ADAPTIVEDIRECTIONALSAMPLING (T_{L}, x_{g o a l}, X_{o b s})

{Algorithm 7}

3:

x_{n e a r}^{L} \leftarrow Nearest (T_{L}, x_{s a m p l e})

4:

η_{L} \leftarrow ADAPTIVESTEP (x_{n e a r}^{L}, X_{o b s})

{Equations (15) and (16)}

5:

\hat{d} \leftarrow STEER (x_{n e a r}^{L}, x_{s a m p l e})

6:

\hat{d} \leftarrow REPULSIVECORRECTION (x_{n e a r}^{L}, \hat{d}, x_{g o a l}, X_{o b s})

{Equations (24)–(27)}

7:

\hat{d} \leftarrow CLIMBANGLECLAMP (\hat{d}, γ_{m a x})

{Equation (23)}

8:

(x_{n e w}^{L}, o k_{L}) \leftarrow EXTEND (x_{n e a r}^{L}, \hat{d}, η_{L}, X_{o b s})

{with bisection fallback, Equation (17)}

9: if

o k_{L}

then

10:

T_{L} \leftarrow INSERTANDREWIRE (T_{L}, x_{n e w}^{L}, X_{o b s})

11: end if

12: {Phase 2: Follower tree expansion}

13: if

o k_{L}

then

14:

x_{t a r g e t} \leftarrow x_{n e w}^{L}

15: else

16:

x_{t a r g e t} \leftarrow Nearest (T_{L}, T_{F} . latest)

17: end if

18: if

rand () < P_{b i a s}

then

19:

x_{s a m p l e}^{F} \leftarrow x_{t a r g e t}

{Biased toward leader}

20: else

21:

x_{s a m p l e}^{F} \leftarrow RANDOMSAMPLE (X)

22: end if

23:

x_{n e a r}^{F} \leftarrow Nearest (T_{F}, x_{s a m p l e}^{F})

24:

η_{F} \leftarrow ADAPTIVESTEP (x_{n e a r}^{F}, X_{o b s})

25:

(x_{n e w}^{F}, o k_{F}) \leftarrow STEERANDEXTEND (x_{n e a r}^{F}, x_{s a m p l e}^{F}, η_{F}, x_{g o a l}, X_{o b s})

26: if

o k_{F}

then

27:

T_{F} \leftarrow INSERTANDREWIRE (T_{F}, x_{n e w}^{F}, X_{o b s})

28: end if

29: {Phase 3: Dynamic connection detection}

30:

η_{c o n n} \leftarrow γ_{c} \cdot min (η_{L}, η_{F})

{Equation (21)}

31: for each

(o k, x_{n e w}, T_{o w n}, T_{o t h e r}) \in {(o k_{L}, x_{n e w}^{L}, T_{L}, T_{F}), (o k_{F}, x_{n e w}^{F}, T_{F}, T_{L})}

do

32: if

o k

then

33:

x_{c} \leftarrow Nearest (T_{o t h e r}, x_{n e w})

34: if

∥ x_{n e w} - x_{c} ∥ \leq η_{c o n n}

and

COLLISIONFREE (x_{n e w}, x_{c}, X_{o b s})

then

35: return

(true, MERGEPATH (T_{L}, T_{F}, x_{n e w}, x_{c}))

36: end if

37: end if

38: end for

39: return

(false, \emptyset)

4.6. Repulsive-Only Local Potential Field Correction

Since the sampling strategy in Section 4.2 already provides global directional guidance, this paper removes the attractive term from the conventional APF formulation and activates the repulsive correction only when a node enters the influence range of an obstacle (

ρ < ρ_{0}

). This design avoids the functional overlap between the attractive term and the sampling strategy, as well as the path oscillation that arises near the goal region.

The repulsive force is defined as

F_{r e p} (x) = \{\begin{matrix} k_{r e p} {(\frac{1}{ρ} - \frac{1}{ρ_{0}})}^{2} \cdot ψ (d) \cdot n_{o b s}, & ρ < ρ_{0}, \\ 0, & ρ \geq ρ_{0}, \end{matrix}

(24)

where

ρ

denotes the distance from the current node to the nearest obstacle, and

ρ_{0}

denotes the repulsive influence radius.

k_{r e p}

is the repulsive gain coefficient, and

n_{o b s}

is the unit vector pointing from the obstacle toward the current node.

The goal influence factor

ψ (d)

can be written as

ψ (d) = \frac{d^{n}}{1 + d^{n}},

(25)

where

d = ∥ x - x_{g o a l} ∥

is the distance from the current node to the goal and n is the shape parameter.

Within the near-obstacle range, the correction strength follows a distance-decaying weight

w = {(\frac{ρ_{0} - d_{o b s}}{ρ_{0}})}^{2},

(26)

where

d_{o b s}

represents the distance from the current node to the nearest obstacle. The corrected expansion direction is obtained by

d_{o u t} = \frac{d_{s a m p l e} + w \cdot F_{r e p}}{∥ d_{s a m p l e} + w \cdot F_{r e p} ∥},

(27)

where

d_{s a m p l e}

stands for the normalized sampling direction returned by the sampling strategy.

4.7. Path Smoothing and Safety Fallback

The piecewise-linear path generated by RRT* usually contains large turning-angle changes, which can affect the smoothness of trajectory tracking in UAV flight. A cubic B-spline is therefore used to smooth the planned path.

Using the waypoint sequence

{x_{0}, x_{1}, \dots, x_{K}}

of the piecewise-linear path as control points, the cubic B-spline curve is expressed as

C (t) = \sum_{i = 0}^{K} N_{i, 3} (t) x_{i},

(28)

where

N_{i, 3} (t)

is the cubic B-spline basis function, and t serves as the curve parameter.

The smoothed curve must satisfy the maximum curvature constraint

κ (t) \leq κ_{m a x} = 1 / R_{m i n}

. The curve is then discretized at equal arc-length intervals, and the sampled points are checked against the free space. If a smoothed segment intersects an obstacle, that segment is discarded and replaced with the corresponding segment of the original piecewise-linear path so that the final output path remains collision-free.

4.8. Computational Complexity Analysis

The time cost of the proposed EAC-Bi-RRT* algorithm is primarily affected by the nearest-node search, the near-node search, obstacle density estimation, collision detection, and local rewiring. Let T denote the maximum number of iterations, N the number of nodes in the bidirectional trees, M the number of obstacles in the environment, and

S = 26

the fixed sector count of the adaptive directional sampling. In each iteration, the nearest-node search and the near-node search are performed via a linear traversal of the tree nodes, with a per-iteration cost of

O (N)

. The obstacle density estimation, collision detection, and local repulsive potential field correction scale linearly with the obstacle count, yielding a worst-case complexity of

O (M)

. The adaptive directional sampling only computes sampling probabilities over a fixed set of 26 sectors, giving a constant complexity of

O (S) = O (1)

. The cost of local rewiring is determined by the number of near nodes, and is already absorbed into the near-node search and collision detection steps above.

Consequently, the overall time complexity of EAC-Bi-RRT* can be expressed as

O (T \cdot (N + M))

, where T represents the maximum number of iterations. The proposed adaptive directional sampling, variable step-size, climb-angle constraint, and repulsive-only APF correction modules primarily introduce constant or local environment evaluation overhead, and therefore do not change the asymptotic order of complexity relative to standard RRT* and Bi-RRT*. If spatial indexing structures such as k-d trees or R-trees are introduced, the practical computational cost of the nearest-node search and obstacle queries can be further reduced.

5. Results and Discussion

To evaluate the planning performance of EAC-Bi-RRT* in different 3D obstacle scenarios, this section constructs four simulation environments. All experiments are conducted on the Windows 11 platform with an Intel Core i7-14650HX processor and 16 GB of memory, and MATLAB R2025a is used as the simulation software. To thoroughly validate the effectiveness of the proposed method, all simulations are implemented within a unified RRT* code framework, and EAC-Bi-RRT* is compared with five baseline algorithms, namely GB-RRT*, Bi-RRT*, Bi-APF-RRT*, AAE-RRT* [32], and DPF-Bi-RRT* [33].

Figure 4 illustrates the four types of 3D simulation environments used in this section. Environment 1 is a dense spherical obstacle environment, which is mainly used to evaluate the spatial search capability and local obstacle avoidance performance of the algorithm under densely distributed spherical obstacles. Environment 2 is a regular cylindrical array environment, which is mainly used to examine the corridor selection and path optimization capability of the algorithm in structured and constrained spaces. Environment 3 is a multi-layer building environment, which is mainly used to verify the constrained-space traversal capability and complex-space search performance of the algorithm under multi-level enclosed structures and narrow openings. Environment 4 is a large-scale random obstacle environment, which is mainly used to assess the global search capability and stability of the algorithm in large unstructured spaces.

The unified parameter settings used in the experiments are as follows. For all algorithms, the maximum number of iterations is set to 5000, and each algorithm is independently run 100 times in each environment. For GB-RRT*, Bi-RRT*, Bi-APF-RRT*, and DPF-Bi-RRT*, the fixed expansion step sizes in Environments 1–4 are set to 5 m, 10 m, 12 m, and 30 m, respectively. For AAE-RRT*, the basic step sizes are set in the same way, while its adaptive step-size ranges in Environments 1–4 are set to

[1, 10]

m,

[1, 14]

m,

[2, 20]

m, and

[5, 60]

m, respectively. EAC-Bi-RRT* adopts an environment-aware adaptive step-size strategy, in which the expansion step size is automatically adjusted according to the global environmental complexity and the local obstacle density. The evaluation metrics used in the experiments are path length, planning time, success rate, and average turning angle. Specifically, path length measures the total cost of the planned result, and planning time reflects the search efficiency. Success rate evaluates the ability to obtain a feasible path under given conditions, while the average turning angle quantifies the geometric smoothness of the path. The complete parameter settings used in our experiments, including the key parameters of the proposed EAC-Bi-RRT* and those of all baseline algorithms across the four environments, are summarized in Appendix A (Table A1, Table A2 and Table A3).

5.1. Dense Spherical Obstacle Environment

The workspace size in this scenario is

200 \times 200 \times 200

. Approximately 130 spherical obstacles of different scales are placed in the scene to construct a multi-scale and densely distributed 3D obstacle space. The start point is set to

(0, 0, 0)

, and the goal point is set to

(200, 200, 200)

. Statistics and paths are shown in Table 2 and Figure 5, respectively.

In the dense spherical obstacle environment, the obstacle sizes and distributions are highly random, and the random tree is prone to search oscillation and repeated expansion during local obstacle avoidance. Table 2 shows that EAC-Bi-RRT* achieves an average planning time of only 0.039 s. This represents reductions of 87.34%, 90.05%, and 81.86% relative to GB-RRT*, Bi-RRT*, and Bi-APF-RRT*, respectively, and is also significantly lower than those of AAE-RRT* and DPF-Bi-RRT*. In terms of path smoothness, EAC-Bi-RRT*, DPF-Bi-RRT*, and AAE-RRT* all achieve average turning angles below 2°, which are significantly better than those of GB-RRT*, Bi-RRT*, and Bi-APF-RRT*. This improvement comes from two aspects. The cooperative dual-tree expansion reduces ineffective search in densely obstructed regions. Meanwhile, the adaptive step size adjusts the expansion scale according to the local obstacle distribution. Together, these two mechanisms improve planning efficiency while preserving obstacle avoidance accuracy. Although the average path length and average turning angle of EAC-Bi-RRT* are comparable to those of AAE-RRT* and DPF-Bi-RRT*, its planning time is substantially shorter while maintaining comparable path quality and smoothness.

5.2. Regular Cylindrical Array Environment

The workspace size in this scenario is

500 \times 500 \times 200

. A total of 62 regularly arranged vertical column obstacles are placed in the scene, forming a continuous cylindrical array corridor structure. The start point is set to

(0, 0, 0)

, and the goal point is set to

(500, 500, 150)

. The corresponding statistical results are reported in Table 2.

The main challenge in the regular cylindrical array environment is that the algorithm needs not only to find a feasible path, but also to determine an appropriate traversal direction as quickly as possible within continuously constrained corridors. Figure 6 indicates that some baseline algorithms still produce clear backtracking and detours between the columns, while the path generated by EAC-Bi-RRT* stays closer to the main corridor direction. The quantitative results in Table 2 are consistent with this observation. EAC-Bi-RRT* records an average path length of 768.46 m and an average planning time of 0.184 s, both lower than those of the compared algorithms. The path length reductions are 12.97%, 18.04%, 13.47%, 5.57%, and 11.27% relative to GB-RRT*, Bi-RRT*, Bi-APF-RRT*, AAE-RRT*, and DPF-Bi-RRT*, respectively. This result suggests that EAC-Bi-RRT* can advance more efficiently along feasible passages in the regular cylindrical corridor environment while avoiding unnecessary detours between columns. The adaptive directional sampling strategy steers the random tree along corridor-compatible directions and reduces back-and-forth oscillation as well as repeated search between columns.

5.3. Multi-Layer Building Environment

The workspace size in this scenario is

550 \times 550 \times 400

. The scene contains multi-layer floor slabs, through-columns, guard columns, and block obstacles of different heights, which together form a complex 3D enclosed space with narrow openings. The start point is set to

(0, 0, 0)

, and the goal point is set to

(550, 550, 400)

. Table 2 reports the statistics, and Figure 7 shows the planned paths.

The multi-layer building environment involves the coupled constraints of floor-slab blockage, narrow openings, and 3D obstacles. When expansion toward the goal direction is repeatedly blocked, the random tree tends to remain trapped in locally enclosed regions, which leads to planning failure. Table 2 shows that the success rate of GB-RRT* in this environment is only 6%, while AAE-RRT* reaches only 40%, indicating that relying only on goal bias or adaptive step size is still insufficient for stable traversal in multi-layer constrained spaces. In contrast, EAC-Bi-RRT* achieves a success rate of 100%, demonstrating stronger traversal capability and search stability in complex enclosed environments. EAC-Bi-RRT* also attains the best average path length of 981.90 m. This is 17.95%, 14.01%, and 3.33% shorter than those of Bi-RRT*, Bi-APF-RRT*, and DPF-Bi-RRT*, respectively. Its average turning angle is only 5.48°, which is also markedly lower than the 6.50°–22.10° reported for the above algorithms. In terms of time cost, the average planning time of EAC-Bi-RRT* is 0.805 s. Although this value is not the lowest in this environment and is higher than those of Bi-RRT*, Bi-APF-RRT*, and DPF-Bi-RRT*, it remains lower than those of GB-RRT* and AAE-RRT*. This result indicates that the proposed algorithm does not simply pursue local search speed in strongly constrained 3D scenes, but instead trades a small amount of time for more stable search guidance, a higher success rate, and better path quality. The key reason is that the adaptive goal-bias mechanism actively reduces the bias probability based on local obstacle density when the goal direction is persistently blocked. This causes the random tree to fall back to sector-based directional exploration sampling, enabling it to escape locally constrained regions.

5.4. Large-Scale Random Obstacle Environment

The workspace size in this scenario is

2000 \times 2000 \times 400

. Various obstacles, including cuboids and cylinders, are randomly distributed in the scene to construct a large-scale unstructured environment with nonuniform distribution and local clustering characteristics. The start point is set to

(0, 0, 0)

, and the goal point is set to

(2000, 2000, 50)

. The corresponding statistical results are reported in Table 2.

The large-scale random obstacle environment is characterized by a vast search space, uneven obstacle distribution, and pronounced local clustering. As a result, the algorithm needs not only strong global exploration capability, but also the ability to avoid excessive invalid detours in locally cluttered regions. As shown in Figure 8, GB-RRT*, Bi-RRT*, Bi-APF-RRT*, AAE-RRT*, and DPF-Bi-RRT* all exhibit path deviations of varying degrees when traversing locally clustered obstacle regions, whereas the path generated by EAC-Bi-RRT* is more direct overall. Table 2 shows that EAC-Bi-RRT* maintains a success rate of 100% in this environment and achieves the shortest average path length of 2960.57 m and the fastest average planning time of 0.256 s. Compared with GB-RRT*, Bi-RRT*, and Bi-APF-RRT*, its average path length is reduced by 17.34%, 17.98%, and 12.80%, respectively. Compared with AAE-RRT* and DPF-Bi-RRT*, the average planning time is reduced by 84.48% and 54.44%, respectively. The path quality remains comparable, yet the search efficiency is markedly higher. Specifically, the sector-based sampling provides persistent directional guidance that reduces unnecessary detours in the vast search space. Meanwhile, the adaptive step-size mechanism enables faster expansion in open regions and finer obstacle avoidance in locally clustered areas.

5.5. Statistical Reliability Analysis

To further assess the run-to-run reliability of the experimental results, the coefficient of variation (CV) of path length across the 100 independent trials is summarized in Table 3. The CV is defined as the ratio of standard deviation to mean, with lower values indicating higher run-to-run consistency.

As shown in Table 3, EAC-Bi-RRT* maintains a path-length CV below 3% in all four environments while preserving a 100% success rate, indicating high run-to-run consistency across both densely cluttered and large-scale scenarios. It attains the lowest path-length CV in Environments 2 and 4. In contrast, GB-RRT* and Bi-RRT* exhibit path-length CV values exceeding 5% in the complex environments (Environments 3 and 4), reflecting larger run-to-run variability. The medians of the continuous metrics are close to the corresponding means reported in Table 2, suggesting approximately symmetric distributions without significant long-tail outliers.

5.6. Discussion

Figure 9 compares the average planning time, average path length, and average turning angle of all algorithms in the four environments. From the global statistics, the success rates of GB-RRT* in Environments 2–4 are only 73%, 6%, and 84%, respectively, while AAE-RRT* reaches only 40% in Environment 3. In contrast, EAC-Bi-RRT* maintains a success rate of 100% in all four environments. Compared with Bi-RRT* and Bi-APF-RRT*, the average path length of EAC-Bi-RRT* is reduced by 16.23% and 11.55%, respectively, across the four environments. In Environments 1, 2, and 4, the average planning time of EAC-Bi-RRT* is reduced by 80.98% and 57.49% relative to AAE-RRT* and DPF-Bi-RRT*, respectively. This difference reflects a clear efficiency advantage. In Environment 3, due to the special constraints imposed by the multi-layer enclosed structure, the proposed algorithm trades a higher time cost for a complete success rate and the best path quality, which indicates that its search strategy gives priority to reachability and path cost in strongly constrained scenes.

Figure 10 gives the boxplots of the key metrics of EAC-Bi-RRT* across the four environments. In Environment 1, all metrics show relatively compact boxplots. This suggests good repeatability across runs. In Environment 2, both planning time and path length show low median values with limited dispersion, suggesting that the search process remains stable in the regular corridor environment. In Environment 3, the coupled constraints of multi-layer floor slabs and narrow openings increase the uncertainty of the search path and lead to greater variation in planning time. Nevertheless, the distributions of path length and turning angle remain relatively concentrated, which indicates that the algorithm still maintains good robustness in strongly constrained enclosed spaces. In Environment 4, the boxes for path length and planning time remain compact, confirming stable global search performance in the large-scale unstructured environment.

6. Conclusions

UAV path planning in complex 3D obstacle environments requires not only fast generation of collision-free paths, but also satisfactory path smoothness and kinematic feasibility. To meet these requirements, this paper proposes the EAC-Bi-RRT* algorithm. Built on bidirectional RRT*, the proposed method introduces an environment-aware mechanism to regulate the search process. The result is improved adaptability in complex spaces. Experimental validation in four representative 3D obstacle environments shows that EAC-Bi-RRT* achieves strong stability and environmental adaptability in challenging scenes while maintaining both path quality and path smoothness.

Nevertheless, this study still leaves room for further improvement. The proposed method has several limitations that will be addressed in future research:

(1): Embedded onboard deployment and computational profiling: The proposed method targets embedded onboard computing platforms rather than low-level flight-control microcontrollers. The current MATLAB R2025a prototype runs on a desktop CPU, and systematic profiling of onboard CPU, RAM, and real-time scheduling is left for future work.
(2): Perception uncertainty and real-time replanning: The current simulations focus on static 3D environments and do not yet model sensing noise, localization errors, or dynamic obstacles. Future work will investigate uncertainty-aware environment representation and real-time replanning.
(3): Real-world flight validation: All evaluations in this paper are conducted in MATLAB-based simulation, which cannot reproduce the perception, actuation, and disturbance characteristics of physical flight. Future work will integrate the planner with onboard perception, state estimation, and the flight-control loop, advancing from hardware-in-the-loop testing to outdoor physical-flight experiments.

Author Contributions

Conceptualization, Y.Z. and W.H.; methodology, Y.Z. and W.H.; software, Y.Z.; validation, Y.Z., Y.C. and Z.Q.; formal analysis, Y.Z. and Y.C.; investigation, Y.Z. and Z.Q.; resources, W.H.; data curation, Y.Z. and Z.Q.; writing—original draft preparation, Y.Z.; writing—review and editing, W.H., Y.C. and Z.Q.; visualization, Y.Z. and Y.C.; supervision, W.H. and Y.C.; project administration, W.H.; funding acquisition, W.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 62473133, and the Natural Science Foundation of Wuhan, grant number 2025040601020155.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to thank the School of Electrical and Electronic Engineering, Hubei University of Technology, for its support and resources.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

3D	Three-dimensional
UAV	Unmanned Aerial Vehicle
RRT	Rapidly exploring random tree
RRT*	Rapidly exploring random tree star
APF	Artificial potential field
EAC-Bi-RRT*	Environment-aware cooperative bidirectional RRT*

Appendix A

Table A1. Key parameters used in the proposed EAC-Bi-RRT* algorithm.

Group	Symbol	Meaning	Value	Notes
Adaptive step size	$η_{min}$	Minimum step size	$0.25 η_{max}$	Lower bound
Adaptive step size	$β_{local}$	Local density decay	0.15	Step reduction coefficient
Spherical sector sampling	$P_{min}$	Min goal-bias probability	0.05	Lower bound
Spherical sector sampling	$P_{max}$	Max goal-bias probability	0.40	Upper bound
APF local repulsion	$k_{rep}$	Repulsion coefficient	0.5	Obstacle avoidance gain
Bi-tree cooperation	$γ_{c}$	Connection threshold coeff.	1.5	Dynamic merge criterion
Bi-tree cooperation	$P_{bias}$	Follower bias probability	0.6	Toward leader tree
Path optimization	$κ_{max}$	Maximum curvature	$1 / 80$	Min turning radius = 80 m
Path optimization	$α_{th}$	Sharp turn threshold	$120 °$	Control point insertion
Global constraints	$γ_{max}$	Maximum climb angle	$30 °$	3D flight envelope
Global constraints	$N_{max}$	Maximum iterations	5000	Termination condition

Table A2. Shared parameters for all baseline algorithms across environments.

Parameter	Symbol	Environment 1	Environment 2	Environment 3	Environment 4
Step size	$η$	5	10	12	30
Rewire radius	$γ$	20	25	35	120
Connection tolerance	$d_{conn}$	5	7	10	30
Collision resolution	$Δ_{col}$	1	1	2	4
Maximum iterations	$N_{max}$	5000	5000	5000	5000

Table A3. Algorithm-specific parameters for baseline algorithms.

Method	Parameter	Symbol	Environment 1	Environment 2	Environment 3	Environment 4
GB-RRT*	Goal bias	$P_{g}$	0.20
Bi-RRT*	Swap probability	$P_{s}$	0.50
Bi-APF-RRT*	Goal bias	$P_{g}$	0.10
	Repulsion coefficient	$k_{rep}$	0.30
	Repulsion range	$d_{rep}$	10	14	20	60
AAE-RRT*	Attraction gain	$λ_{a}$	200	1000	1000	1000
	Repulsion gain	$λ_{r}$	100	500	500	500
	Safe distance	$R_{safe}$	5	7	10	30
	Adaptive step range	$[η_{min}, η_{max}]$	[1, 10]	[1, 14]	[2, 20]	[5, 60]
	Collision decay/growth	$λ_{col} / λ_{ncol}$	0.6/2.0
DPF-Bi-RRT*	Attraction coefficient	$k_{att}$	2.0
	Repulsion coefficient	$k_{rep}$	0.8
	Repulsion range	$d_{rep}$	150
	Greedy shortcut window	$n_{g}$	12

References

Boysen, N.; Fedtke, S.; Schwerdfeger, S. Last-mile delivery concepts: A survey from an operational research perspective. OR Spectr. 2021, 43, 1–58. [Google Scholar] [CrossRef]
Mohamed, A.; Mohamed, M. Unmanned Aerial Vehicles in Last-Mile Parcel Delivery: A State-of-the-Art Review. Drones 2025, 9, 413. [Google Scholar] [CrossRef]
Ishiwatari, M. Leveraging Drones for Effective Disaster Management: A Comprehensive Analysis of the 2024 Noto Peninsula Earthquake Case in Japan. Prog. Disaster Sci. 2024, 23, 100348. [Google Scholar] [CrossRef]
Lyu, M.; Zhao, Y.; Huang, C.; Huang, H. Unmanned Aerial Vehicles for Search and Rescue: A Survey. Remote Sens. 2023, 15, 3266. [Google Scholar] [CrossRef]
Luo, Y.; Yu, X.; Yang, D.; Zhou, B. A survey of intelligent transmission line inspection based on unmanned aerial vehicle. Artif. Intell. Rev. 2023, 56, 173–201. [Google Scholar] [CrossRef]
Lyu, C.; Lin, S.; Lynch, A.; Zou, Y.; Liarokapis, M. UAV-based deep learning applications for automated inspection of civil infrastructure. Autom. Constr. 2025, 177, 106285. [Google Scholar] [CrossRef]
Li, J.; Xiong, X.; Yan, Y.; Yang, Y. A Survey of Indoor UAV Obstacle Avoidance Research. IEEE Access 2023, 11, 51861–51889. [Google Scholar] [CrossRef]
Pan, Y.; Hu, K.; Huang, X.; Ying, W.; Xie, X.; Ma, Y.; Zhang, N.; Kang, H. Developing Smart MAVs for Autonomous Inspection in GPS-Denied Constructions. arXiv 2024, arXiv:2408.06030. [Google Scholar] [CrossRef]
Ghambari, S.; Golabi, M.; Jourdan, L.; Lepagnot, J.; Idoumghar, L. UAV Path Planning Techniques: A Survey. RAIRO-Oper. Res. 2024, 58, 2951–2989. [Google Scholar] [CrossRef]
Freitas, E.J.R.; Cohen, M.W.; Neto, A.A.; Guimarães, F.G.; Pimenta, L.C.A. DE3D-NURBS: A Differential Evolution-Based 3D Path-Planner Integrating Kinematic Constraints and Obstacle Avoidance. Knowl.-Based Syst. 2024, 300, 112084. [Google Scholar] [CrossRef]
Liu, Y.; Zhang, H.; Zheng, H.; Li, Q.; Tian, Q. A Spherical Vector-Based Adaptive Evolutionary Particle Swarm Optimization for UAV Path Planning under Threat Conditions. Sci. Rep. 2025, 15, 2116. [Google Scholar] [CrossRef]
Jiang, Y.; Xu, X.-X.; Zheng, M.-Y.; Zhan, Z.-H. Evolutionary Computation for Unmanned Aerial Vehicle Path Planning: A Survey. Artif. Intell. Rev. 2024, 57, 267. [Google Scholar] [CrossRef]
Elbanhawi, M.; Simic, M. Sampling-Based Robot Motion Planning: A Review. IEEE Access 2014, 2, 56–77. [Google Scholar] [CrossRef]
LaValle, S.M. Rapidly-Exploring Random Trees: A New Tool for Path Planning; Technical Report TR 98-11; Computer Science Department, Iowa State University: Ames, IA, USA, 1998; pp. 1–4. [Google Scholar]
Karaman, S.; Frazzoli, E. Sampling-Based Algorithms for Optimal Motion Planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
Xu, T. Recent Advances in Rapidly-Exploring Random Tree: A Review. Heliyon 2024, 10, e32451. [Google Scholar] [CrossRef] [PubMed]
Muhsen, D.K.; Raheem, F.A.; Sadiq, A.T. A Systematic Review of Rapidly Exploring Random Tree RRT Algorithm for Single and Multiple Robots. Cybern. Inf. Technol. 2024, 24, 78–101. [Google Scholar] [CrossRef]
Huang, T.; Fan, K.; Sun, W. Density Gradient-RRT: An Improved Rapidly Exploring Random Tree Algorithm for UAV Path Planning. Expert Syst. Appl. 2024, 252, 124121. [Google Scholar] [CrossRef]
Gammell, J.D.; Srinivasa, S.S.; Barfoot, T.D. Informed RRT*: Optimal Sampling-based Path Planning Focused via Direct Sampling of an Admissible Ellipsoidal Heuristic. In Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Chicago, IL, USA, 14–18 September 2014; pp. 2997–3004. [Google Scholar] [CrossRef]
Lin, Y.; Zhang, L. An Improved Quick Informed-RRT* Algorithm Based on Hybrid Bidirectional Search and Adaptive Adjustment Strategies. Intell. Serv. Robot. 2024, 17, 847–870. [Google Scholar] [CrossRef]
Kyaw, P.T.; Le, A.V.; Mohan, R.E.; Kelly, J. Greedy Heuristics for Sampling-Based Motion Planning in High-Dimensional State Spaces. arXiv 2025, arXiv:2405.03411v3. [Google Scholar]
Jiang, X.; Wang, Z.; Dong, C. A Path Planning Algorithm Based on Improved RRT Sampling Region. Comput. Mater. Contin. 2024, 80, 4303–4322. [Google Scholar] [CrossRef]
Guo, S.; Gong, J.; Shen, H.; Yuan, L.; Wei, W.; Long, Y. DBVSB-P-RRT*: A Path Planning Algorithm for Mobile Robot with High Environmental Adaptability and Ultra-High Speed Planning. Expert Syst. Appl. 2025, 266, 126123. [Google Scholar] [CrossRef]
Jordan, M.; Perez, A. Optimal Bidirectional Rapidly-Exploring Random Trees; MIT CSAIL Technical Report, MIT-CSAIL-TR-2013-021; MIT: Cambridge, MA, USA, 2013. [Google Scholar]
Qureshi, A.H.; Ayaz, Y. Intelligent Bidirectional Rapidly-Exploring Random Trees for Optimal Motion Planning in Complex Cluttered Environments. Robot. Auton. Syst. 2015, 68, 1–11. [Google Scholar] [CrossRef]
Ye, L.; Li, J.; Li, P. Improving Path Planning for Mobile Robots in Complex Orchard Environments: The Continuous Bidirectional Quick-RRT* Algorithm. Front. Plant Sci. 2024, 15, 1337638. [Google Scholar] [CrossRef]
Cao, M.; Mao, H.; Tang, X.; Sun, Y.; Chen, T. A Novel RRT*-Connect Algorithm for Path Planning on Robotic Arm Collision Avoidance. Sci. Rep. 2025, 15, 2836. [Google Scholar] [CrossRef] [PubMed]
Qureshi, A.H.; Ayaz, Y. Potential Functions Based Sampling Heuristic for Optimal Path Planning. Auton. Robot. 2016, 40, 1079–1093. [Google Scholar] [CrossRef]
Fan, J.; Chen, X.; Wang, Y.; Chen, X. UAV Trajectory Planning in Cluttered Environments Based on PF-RRT* Algorithm with Goal-Biased Strategy. Eng. Appl. Artif. Intell. 2022, 114, 105182. [Google Scholar] [CrossRef]
Jiang, Z.; Liu, Q.; Wang, E.; Wang, Y.; Wang, J. HMA-RRT*: A Hybrid Multi-Strategy Adaptive RRT* Algorithm for USV Path Planning in Complex Maritime Environments. J. King Saud Univ. Comput. Inf. Sci. 2025, 38, 15. [Google Scholar] [CrossRef]
Kilic, K.I.; Desoeuvres, A.; Pedersen, C.B.; Vasegaard, A.E.; Nielsen, P. Adaptive Artificial Potential Field Method for Small Autonomous Vehicles. Robot. Auton. Syst. 2026, 198, 105364. [Google Scholar] [CrossRef]
Huang, H.; Shang, Y.; Liu, X.; Liu, X.; Qi, P. An Improved Bi-RRT*-Based Path Planning Algorithm with Adaptive Search Strategy Assignment Mechanism for Ultra-Low-Altitude Penetration of Fixed-Wing Aircraft. Aerosp. Sci. Technol. 2024, 152, 109363. [Google Scholar] [CrossRef]
Ge, L.; Phang, S.K.; Sariff, N. DPF-Bi-RRT*: An Improved Path Planning Algorithm for Complex 3D Environments with Adaptive Sampling and Dual Potential Field Strategy. IEEE Access 2025, 13, 35958–35972. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of RRT* extension. (a) Among all neighbors of

q_{n e w}

,

q_{m i n}

with the lowest cumulative cost is selected as its parent instead of

q_{n e a r e s t}

. (b) After inserting

q_{n e w}

, the neighbor

q_{j}

is rewired from

q_{o l d}

(red dashed) to

q_{n e w}

(green dashed) because the path through

q_{n e w}

has a lower cost.

Figure 1. Schematic diagram of RRT* extension. (a) Among all neighbors of

q_{n e w}

,

q_{m i n}

with the lowest cumulative cost is selected as its parent instead of

q_{n e a r e s t}

. (b) After inserting

q_{n e w}

, the neighbor

q_{j}

is rewired from

q_{o l d}

(red dashed) to

q_{n e w}

(green dashed) because the path through

q_{n e w}

has a lower cost.

Figure 2. Overall flowchart of the EAC-Bi-RRT* algorithm.

Figure 3. Partitioning of the 3D sampling space into 26 sectors.

Figure 4. Four 3D simulation environments used in this study. (a) Dense spherical obstacle environment. (b) Regular cylindrical array environment. (c) Multi-layer building environment. (d) Large-scale random obstacle environment. The red marker indicates the start position and the green marker indicates the goal position.

Figure 5. Path planning results of six algorithms in Environment 1.

Figure 6. Path planning results of six algorithms in Environment 2.

Figure 7. Path planning results of six algorithms in Environment 3.

Figure 8. Path planning results of six algorithms in Environment 4.

Figure 9. Overall comparison of average planning time, path length, and turning angle of all algorithms across four environments.

Figure 10. Boxplot distributions of key metrics of EAC-Bi-RRT* in the four environments.

Table 1. Partitioning of the 26 spherical sectors.

Layer	Sectors	Elevation Range	Count	Description
Top cap	$S_{0}$	$67.5 ° \leq ϕ \leq 90 °$	1	Steep-climb cone
Upper	$S_{1}$ – $S_{8}$	$22.5 ° \leq ϕ < 67.5 °$	8	Primary climbing directions
Middle	$S_{9}$ – $S_{16}$	$- 22.5 ° < ϕ < 22.5 °$	8	Near-horizontal cruise zone
Lower	$S_{17}$ – $S_{24}$	$- 67.5 < ϕ \leq - 22.5$	8	Descending directions
Bottom cap	$S_{25}$	$- 90 ° \leq ϕ < - 67.5$	1	Steep-descent cone

Table 2. Simulation data of six algorithms in different environments.

Environment	Algorithm	Success Rate	Path Length (m)	Time (s)	Avg. Turning Angle (°)
Environment 1	GB-RRT*	100%	367.20 ± 7.94	0.308 ± 0.136	7.33 ± 1.19
	Bi-RRT*	100%	403.07 ± 25.63	0.392 ± 0.148	12.50 ± 2.00
	Bi-APF-RRT*	100%	381.56 ± 13.74	0.215 ± 0.091	10.44 ± 1.61
	AAE-RRT*	100%	355.95 ± 6.91	0.214 ± 0.084	1.40 ± 1.16
	DPF-Bi-RRT*	100%	357.98 ± 7.74	0.122 ± 0.073	1.39 ± 0.88
	EAC-Bi-RRT*	100%	358.94 ± 7.75	0.039 ± 0.017	1.61 ± 0.79
Environment 2	GB-RRT*	73%	882.96 ± 51.13	0.950 ± 0.600	20.52 ± 1.96
	Bi-RRT*	100%	937.65 ± 64.21	0.431 ± 0.151	22.84 ± 2.28
	Bi-APF-RRT*	100%	888.12 ± 34.46	0.298 ± 0.110	20.39 ± 1.44
	AAE-RRT*	100%	813.78 ± 16.20	0.789 ± 0.121	11.75 ± 1.64
	DPF-Bi-RRT*	100%	866.05 ± 27.21	0.368 ± 0.171	8.96 ± 1.88
	EAC-Bi-RRT*	100%	768.46 ± 11.88	0.184 ± 0.064	8.56 ± 0.82
Environment 3	GB-RRT*	6%	1195.45 ± 114.80	1.681 ± 0.701	18.32 ± 1.91
	Bi-RRT*	100%	1196.76 ± 96.82	0.316 ± 0.222	22.10 ± 2.17
	Bi-APF-RRT*	100%	1141.89 ± 75.74	0.330 ± 0.228	19.46 ± 1.77
	AAE-RRT*	40%	989.03 ± 23.71	0.985 ± 0.400	5.55 ± 1.12
	DPF-Bi-RRT*	99%	1015.73 ± 50.86	0.371 ± 0.212	6.50 ± 1.21
	EAC-Bi-RRT*	100%	981.90 ± 26.21	0.805 ± 0.446	5.48 ± 0.81
Environment 4	GB-RRT*	84%	3581.47 ± 205.56	1.377 ± 0.561	25.81 ± 2.43
	Bi-RRT*	100%	3609.52 ± 245.02	1.158 ± 0.405	25.26 ± 2.67
	Bi-APF-RRT*	100%	3395.17 ± 133.10	1.030 ± 0.369	23.16 ± 2.17
	AAE-RRT*	100%	3076.69 ± 84.06	1.649 ± 0.391	9.46 ± 1.84
	DPF-Bi-RRT*	100%	3061.84 ± 101.89	0.562 ± 0.229	7.26 ± 1.47
	EAC-Bi-RRT*	100%	2960.57 ± 46.97	0.256 ± 0.049	5.98 ± 1.40

Table 3. Coefficient of variation (CV, %) of path length over 100 independent trials in the four environments.

Algorithm	Environment 1	Environment 2	Environment 3	Environment 4
GB-RRT*	2.16	5.79	9.60	5.74
Bi-RRT*	6.36	6.85	8.09	6.79
Bi-APF-RRT*	3.60	3.88	6.63	3.92
AAE-RRT*	1.94	1.99	2.40	2.73
DPF-Bi-RRT*	2.16	3.14	5.01	3.33
EAC-Bi-RRT*	2.16	1.55	2.67	1.59

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, Y.; Huang, W.; Chang, Y.; Qin, Z. A 3D UAV Path Planning Algorithm Based on Bidirectional RRT* with Adaptive Directional Sampling and Cooperative Dual-Tree Expansion. Appl. Sci. 2026, 16, 5065. https://doi.org/10.3390/app16105065

AMA Style

Zhao Y, Huang W, Chang Y, Qin Z. A 3D UAV Path Planning Algorithm Based on Bidirectional RRT* with Adaptive Directional Sampling and Cooperative Dual-Tree Expansion. Applied Sciences. 2026; 16(10):5065. https://doi.org/10.3390/app16105065

Chicago/Turabian Style

Zhao, Yaoyu, Wencong Huang, Yufang Chang, and Ziyu Qin. 2026. "A 3D UAV Path Planning Algorithm Based on Bidirectional RRT* with Adaptive Directional Sampling and Cooperative Dual-Tree Expansion" Applied Sciences 16, no. 10: 5065. https://doi.org/10.3390/app16105065

APA Style

Zhao, Y., Huang, W., Chang, Y., & Qin, Z. (2026). A 3D UAV Path Planning Algorithm Based on Bidirectional RRT* with Adaptive Directional Sampling and Cooperative Dual-Tree Expansion. Applied Sciences, 16(10), 5065. https://doi.org/10.3390/app16105065

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A 3D UAV Path Planning Algorithm Based on Bidirectional RRT* with Adaptive Directional Sampling and Cooperative Dual-Tree Expansion

Abstract

1. Introduction

2. Problem Definition

3. Related Works

3.1. RRT

3.2. RRT*

3.3. APF-RRT*

3.4. Bi-RRT*

4. The Proposed EAC-Bi-RRT* Algorithm

4.1. Overall Algorithmic Framework

4.2. Adaptive Directional Sampling Strategy

4.3. Environment-Adaptive Variable-Step-Size Mechanism

4.4. Cooperative Dual-Tree Expansion Strategy

4.5. Climb-Angle Constraint

4.6. Repulsive-Only Local Potential Field Correction

4.7. Path Smoothing and Safety Fallback

4.8. Computational Complexity Analysis

5. Results and Discussion

5.1. Dense Spherical Obstacle Environment

5.2. Regular Cylindrical Array Environment

5.3. Multi-Layer Building Environment

5.4. Large-Scale Random Obstacle Environment

5.5. Statistical Reliability Analysis

5.6. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI