BGSE-RRT*: A Goal-Guided and Multi-Sector Sampling-Expansion Path Planning Algorithm for Complex Environments

Yue, Wenhao; Li, Xiang; Liu, Ziyue; Jiang, Xiaojiang; Pan, Lanlan

doi:10.3390/s26061837

Open AccessArticle

BGSE-RRT*: A Goal-Guided and Multi-Sector Sampling-Expansion Path Planning Algorithm for Complex Environments

by

Wenhao Yue

¹

,

Xiang Li

^2,*

,

Ziyue Liu

¹

,

Xiaojiang Jiang

¹

and

Lanlan Pan

¹

College of Mechanical and Power Engineering, Dalian Ocean University, Dalian 116086, China

²

College of Information Engineering, Dalian Ocean University, Dalian 116086, China

^*

Author to whom correspondence should be addressed.

Sensors 2026, 26(6), 1837; https://doi.org/10.3390/s26061837

Submission received: 11 February 2026 / Revised: 13 March 2026 / Accepted: 13 March 2026 / Published: 14 March 2026

(This article belongs to the Section Sensors and Robotics)

Download

Browse Figures

Versions Notes

Abstract

In complex ground environments, conventional RRT* often suffers from low planning efficiency and poor path quality for robot path planning. This paper proposes BGSE-RRT* (Bi-tree Cooperative, Goal-guided, low-discrepancy Sampling, multi-sector Expansion). First, BGSE-RRT* constructs a nonlinear switching probability via bi-tree cooperative adaptive switching, together with KD-Tree nearest-neighbor acceleration and multi-condition triggering, to adaptively balance global exploration and local convergence. Meanwhile, a goal-guided expansion with dynamic target binding and adaptive step size, under a multi-constraint feasibility check, accelerates the convergence of the two trees. When the goal-guided expansion becomes blocked, BGSE-RRT* generates candidate points in local multi-sector regions using a 2D Halton low-discrepancy sequence and selects the best candidate for expansion; if the multi-sector expansion still fails, a sampling-point-guided expansion is activated to continue advancing and search for a feasible path. Second, B-spline smoothing is applied to improve trajectory continuity. Finally, in five simulation environments and ROS/real-robot joint validation, compared with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*, BGSE-RRT* reduces planning time by up to 84.71%, shortens path length by 2.94–6.88%, and improves safety distance by 20.68–48.33%. In ROS/real-robot validation, the trajectory-tracking success rate reaches 100%.

Keywords:

path planning; RRT*; bi-tree adaptive switching; goal-guided; local multi-sector; sampling-point-guided

1. Introduction

In recent years, autonomous driving has developed rapidly worldwide and is widely regarded as a key direction for the future of transportation systems [1]. The realization of autonomous driving relies on the coordinated operation of intelligent vehicles in perception, decision-making, path planning, and control [2]. With high-definition maps available, lane-level navigation can be achieved using high-precision localization. In this process, the objective of path planning is to generate a collision-free, kinematically constrained, continuous, and optimal path from the start to the goal [3].

In path planning, platforms such as steering vehicles [4], marine vessels [5], and robotic manipulators [6] commonly face physical constraints, including steering limits, obstacle avoidance requirements, dynamic stability, and joint limits [7]. Generating feasible paths under these constraints remains a key challenge, and a variety of solutions have been proposed [8]. However, in complex environments with obstacle-dense layouts and heavy occlusion, planners must jointly consider search efficiency, path quality, and trajectory trackability. Existing methods can be broadly categorized into four classes according to modeling and solution paradigms.

(1) Grid-based search methods (e.g., Dijkstra [9] and A* [10]) discretize the space and search over grid cells. Their accuracy depends on grid resolution: finer grids can yield smoother paths but incur higher computational complexity [11]. Some studies perform secondary optimization after search, which may be inconsistent with the original collision-avoidance result. For example, Li et al. [12] use adaptive pure pursuit to generate shorter and smoother paths based on an A* path; however, due to the discrete nature, it is still difficult to guarantee curvature continuity and trajectory trackability in a principled way, and obstacle-dense scenarios often expose a conflict between resolution and real-time performance.

(2) Optimization-based methods (e.g., artificial potential field (APF) [13]) typically rely on accurate potential-field modeling and establish a mapping between the environment model and path generation [14], showing effectiveness in handling environmental constraints such as obstacle avoidance [15]. However, under complex geometric constraints, it is often difficult to directly couple kinematic constraints with the potential-field framework, and the method is prone to local minima, leading to unstable guidance or local stagnation.

(3) Learning-based methods [16] perform planning in a data-driven manner and can be advantageous in unknown environments [17], but they depend on data coverage and prior trajectory data, and it is difficult to obtain training data that are both generalizable and asymptotically optimal. Therefore, they are often combined with conventional methods in practice. For example, Hao et al. [18] initialized a reinforcement-learning Q-table using APF on top of a sampling method; Tan et al. [19] combined a Q-learning Q-table with an A* grid structure to improve performance. Overall, learning-based outputs are often suboptimal and are mainly used to enhance local capabilities of rule-based methods; under complex environments and safety constraints, generalization stability and interpretability must be carefully balanced.

(4) Sampling-based planners (e.g., RRT and PRM) [20] provide probabilistic completeness and good scalability [21] and are widely used for constrained path planning tasks [22]. The resulting path shape is largely determined by the connection/rewiring strategy [23]; thus, many studies improve connection schemes to satisfy specific constraints, such as cubic spline interpolation [24], circular-arc methods [25], quadratic programming [26], Bézier-curve methods [27], and bio-inspired methods [28]. Dubins curves [29] enforce curvature via a minimum turning radius but typically require specified boundary poses and headings. Some studies introduce kinematic constraints during expansion or connection [30], which may reduce the success rate of initial connectivity; moreover, inconsistency between the terminal heading of one segment and the initial heading of the next can lead to poor trajectory trackability, which is insufficiently addressed in part of the literature.

Moreover, the computational efficiency of sampling-based planners in complex environments strongly depends on the sampling distribution and the quality of heuristic guidance. Related work reduces ineffective expansions mainly from search strategies and sampling strategies [31]. For search strategies, bidirectional planning and bi-tree search structures are used to accelerate feasible-solution discovery. For example, FHQ-RRT [32] speeds up high-quality path generation through strategy design and GD-RRT* [33] improves initial-solution quality and optimization efficiency by combining Gaussian sampling and deep policies; however, multi-tree parallel methods may still produce infeasible paths at the merging stage [34]. For sampling strategies, shrinking the sampling space and pruning are common. Informed-RRT* [35] restricts sampling to an ellipsoidal subset, RRT-FN [36] introduces pruning to control tree size, and DBVSB-P-RRT* [37] improves adaptability in complex environments via adaptive biased sampling, variable step size, and pruning. Nevertheless, many of these improvements focus on a single aspect (e.g., sampling space, tree size, or bias form). In obstacle-dense and narrow-passage scenarios, planners may still suffer from failed goal guidance, blocked expansion, and frequent collision rejection, causing fluctuations in convergence efficiency and path quality; meanwhile, curvature continuity and trajectory trackability often require coordinated search mechanisms.

Beyond these mainstream methods, advanced path planning methods developed in recent years for specific tasks and complex physical constraints also merit attention. For example, Bahwini et al. [38] addressed needle insertion path planning under soft-tissue deformation by establishing a temperature-field model based on bioheat transfer, thereby accounting for the effect of soft-tissue deformation on path generation. Zhong et al. [39] proposed a cellular-neural-network-based method for optimal robot path planning, which enabled real-time path generation through neural dynamics without requiring global search. Building on this work, Hills and Zhong [40] further introduced a thermal modeling method that incorporated the heat conduction process into real-time robot path planning, thereby enhancing responsiveness to changes in dynamic environments. In the field of aerial robots, Zhou et al. [41] proposed a crossover-recombination-based globally optimal brain storm optimization algorithm that combined cubic B-splines, constraint handling, and swarm intelligence optimization for multi-constraint UAV path planning. These studies indicate that path planning has expanded from traditional geometric obstacle avoidance to more complex scenarios involving soft-tissue deformation, thermal-field modeling, neural dynamics, and multi-constraint flight optimization. However, such methods generally rely on specific physical models, tasks, or optimization frameworks, and therefore remain clearly distinct from the tree-based incremental sampling planning problem in known static complex ground environments considered in this study.

In summary, improving RRT* in complex environments faces the following key challenges: unstable and non-adaptive goal guidance that induces ineffective expansions and search stagnation; fixed goal bias that easily triggers local oscillation near obstacles; occluded structures that require more directional and uniform candidate generation to reduce repeated trial and error; and polyline connection/rewiring alone that struggles to achieve both optimality and curvature continuity, thereby degrading trajectory trackability.

To address these issues, this paper proposes BGSE-RRT*, a bi-tree cooperative and multi-mechanism integrated path planning algorithm. Our target problem is 2D planar path planning and trackable trajectory generation for ground mobile robots under known-map constraints, where “complex environments” mainly refer to geometric complexity such as obstacle-dense layouts, heavy occlusion, and narrow passages. The main contributions are as follows: (1) a nonlinear switching probability combined with KD-Tree acceleration and multi-condition triggering enables adaptive balancing between global exploration and local convergence, improving expansion efficiency; (2) a goal-guided expansion accelerates bi-tree convergence, and when expansion becomes blocked, 2D Halton low-discrepancy sampling within multi-sector regions generates candidates and selects the best to enhance obstacle avoidance; and (3) if the multi-sector expansion still fails, a sampling-point-guided expansion continues advancing to find a feasible path, and finally B-spline post-processing improves path continuity.

The remainder of this paper is organized as follows: Section 2 introduces BI-RRT*, APF-RRT*, and BI-APF-RRT* baselines and clarifies the comparison scope with sampling-based planners. Section 3 presents BGSE-RRT*. Section 4 describes the experimental environments and evaluates BGSE-RRT* across multiple scenarios, with comparisons to GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*. Section 5 concludes the paper.

2. Related Work

To address the difficulty of jointly achieving path quality, expansion efficiency, and adaptability in complex environments and to ensure representative and reproducible comparisons, this section selects BI-RRT* [30], APF-RRT* [8], and BI-APF-RRT* [24] as baselines. Specifically, BI-RRT* improves feasible-path discovery via bidirectional expansion; APF-RRT* uses potential-field information to provide heuristic guidance for tree expansion; and BI-APF-RRT* further integrates a “bi-tree structure + potential-field guidance,” which is the closest to our approach. However, BI-APF-RRT* may still encounter blocked expansion or local oscillation in complex environments, leading to slower convergence and fluctuating path quality; therefore, using it as a strongly related baseline better validates the improvements in BGSE-RRT* in complex environments.

2.1. BI-RRT*

RRT* incrementally expands a single search tree outward, whereas BI-RRT* constructs two trees from the start and the goal simultaneously. By alternating expansions and repeatedly attempting to connect the two trees, the search progresses them toward each other from both ends and they merge earlier. Benefiting from this bidirectional growth mechanism, BI-RRT* typically achieves higher expansion efficiency than single-tree methods and can obtain a feasible path faster. However, in complex environments, BI-RRT* may still exhibit slow convergence and unstable path quality. The principle of BI-RRT* is illustrated in Figure 1.

2.2. APF-RRT*

APF-RRT* integrates artificial potential field (APF) guidance into the expansion process of RRT*. It applies attraction to the goal

q_{g o a l}

and the random sample

q_{r a n d}

, constructs repulsion from obstacles, and uses the resultant force

F_{t o t a l}

to guide new-node growth so that expansions move toward the goal while avoiding collisions. The procedure is as follows: initialize the tree T at

q_{s t a r t}

; sample

q_{r a n d}

; find the nearest node

q_{n e a r}

; compute

F_{t o t a l}

by combining attraction and repulsion; and generate a new node

q_{n e w}

along this direction with step size s. If the segment is collision-free, add

q_{n e w}

and perform connection and rewiring by selecting the parent with the minimum accumulated cost in the neighborhood; otherwise, discard the node and resample until termination. APF-RRT* preserves the rewiring capability of RRT* while improving the goal guidance and near-obstacle avoidance tendency; however, it is prone to local minima in complex environments, and potential-field evaluation (e.g., near-obstacle distance) introduces extra computational burden, affecting stability. The principle of APF-RRT* is illustrated in Figure 2.

2.3. BI-APF-RRT*

BI-APF-RRT* grows two search trees

T_{s}

and

T_{g}

rooted at

q_{s t a r t}

and

q_{g o a l}

, respectively, and alternates expansion between them. The algorithm samples

q_{r a n d}

, selects the nearest node

q_{n e a r}

in the current tree, and forms the resultant guidance force

F_{t o t a l}

as a weighted sum of the goal attraction term, the sampling-guidance term, and the obstacle repulsion term. Starting from

q_{n e a r}

, it generates a new node

q_{n e w}

along the normalized direction of

F_{t o t a l}

with a fixed step size s. If

q_{n e w}

is collision-free, it is inserted into the current tree and RRT* rewiring is performed to reduce path cost. When the distance between the nearest nodes of the two trees is below a threshold, a collision-free connection is attempted; if successful, the path is backtracked and returned. Built on the asymptotically optimal rewiring mechanism of RRT, BI-APF-RRT* combines bidirectional expansion and potential-field guidance to enhance goal guidance and obstacle avoidance, thereby obtaining feasible paths faster; nevertheless, in complex environments it may still suffer from blocked expansion and local oscillation, resulting in slower convergence and unstable path quality. The principle of BI-APF-RRT* is illustrated in Figure 3.

2.4. Relationship to Representative SOTA Sampling-Based Planners and Comparison Scope

It should be noted that sampling-based motion planning does not have a single “universal SOTA,” since performance strongly depends on task dimensionality, environment structure, collision-checking cost, and whether dynamic obstacles exist. Surveys and benchmark studies often regard BIT*, FMT*, RRT#, and RRTX (for dynamic replanning) as representative planners with clear theoretical guarantees and broad adoption. BIT* [42] uses batch sampling with implicit random geometric graph search, offering advantages in asymptotic optimality and heuristic contraction; FMT* [43] reduces unnecessary collision checks via pre-sampling and lazy connections; RRT# [44] improves interpretability of optimality convergence through vertex-consistency maintenance; and RRTX [45] provides incremental replanning capability for changing environments. In contrast, BGSE-RRT* is a structural enhancement within the incremental tree-search framework of RRT*: it improves initial connectivity efficiency and path quality in complex static environments through bi-tree cooperative adaptive switching, goal-guided expansion, and local multi-sector low-discrepancy sampling with best-candidate expansion and ensures trajectory continuity via B-spline post-processing.

Following the fairness principle of “same paradigm, same constraints, and same collision-checking model,” our experiments primarily select representative RRT*-family variants that directly correspond to our improvement line as baselines in order to isolate and verify gains brought by the proposed structural mechanisms. Meanwhile, we provide a comparative discussion of BIT*/FMT*/RRT#/RRTX in terms of paradigm differences, applicable conditions, and expected advantages/limitations, thereby clarifying the boundary of our contribution and the scope of comparison.

3. Principle of the BGSE-RRT* Algorithm

Although BI-APF-RRT* optimizes the planning time and path quality of RRT* to a certain extent, it still struggles to effectively balance path quality, expansion efficiency and environmental adaptability. To address this issue, the BGSE-RRT* algorithm proposed in this paper realizes adaptive coordination of bi-tree expansion through bi-tree adaptive switching; accelerates the convergence of the two trees via goal-guided expansion and switches to local multi-sector expansion for comprehensive optimal selection and advancement if the expansion is blocked; activates sampling-point-guided expansion to continue advancing and find a feasible path when the multi-sector expansion fails; and finally adopts B-spline to improve trajectory continuity.

3.1. Bi-Tree Cooperative Adaptive Switching Strategy

Aimed at the problem that the traditional BI-RRT* usually adopts fixed goal bias, which is prone to invalid guidance and expansion stagnation in complex environments, this paper takes the nearest connectable distance between the two trees as the feedback quantity and combines the nonlinear switching probability, KD-Tree nearest-neighbor search and multi-condition triggering mechanism to realize the adaptive balance between global exploration and local convergence.

(1) Nonlinear Switching Probability Model

The switching probability is defined as a piecewise function:

Ω_{s} (D) = \{\begin{matrix} Ω_{max} - (Ω_{max} - Ω_{min}) {(\frac{D}{D_{t h}})}^{2}, 0 \leq D \leq D_{t h} \\ Ω_{min}, D > D_{t h} \end{matrix}

(1)

where D is the minimum distance from the current expansion node

q_{c u r r}

to the connectable nodes in the opposite tree,

D_{t h}

is the switching threshold (

D_{t h} > 0

), and

Ω_{max}

and

Ω_{min}

are the upper and lower limits of the switching probability (

0 < Ω_{min} < Ω_{max} < 1

). When

D ≪ D_{t h}

,

Ω_{s} (D)

increases to

Ω_{max}

to enhance the convergence of the two trees; when

D > D_{t h}

,

Ω_{s} (D)

decreases to

Ω_{min}

to suppress excessive goal bias and maintain exploration capability.

(2) KD-Tree Nearest-Neighbor Search Acceleration

To avoid a nearest-neighbor search in each expansion, a KD-Tree index is established for the node coordinates of the opposite tree. When calculating the distance D or performing switching judgment, the current expansion node

q_{c u r r}

is taken as the query point to execute nearest-neighbor search on the KD-Tree of the opposite tree, which reduces the complexity of nearest-neighbor search from linear scanning to

O (log M_{t e r m})

, where

M_{t e r m}

is the number of nodes in the opposite tree.

(3) Distance Pruning and Collision Detection Optimization

The current optimal connectable distance

d_{m i n}

is maintained for the candidate nearest-neighbor set. For the candidate node

q_{i}

, the Euclidean distance

d (q_{c u r r}, q_{i})

is calculated first:

If (

d (q_{c u r r}, q_{i}) \geq d_{m i n}

), the candidate cannot update the current optimal value, and the collision detection is skipped;

If (

d (q_{c u r r}, q_{i}) < d_{m i n}

), the collision detection of the connecting line segment is executed only at this time, and

d_{m i n} = d (q_{c u r r}, q_{i})

is updated if there is no collision.

Finally,

D = d_{m i n}

is taken, and the corresponding node is regarded as the “nearest connectable node”.

(4) Multi-condition Triggering Logic

To avoid premature switching or long-term non-triggering, the following triggering criterion is constructed:

T r = \{\begin{matrix} T r u e, \{\begin{matrix} t_{i t e r} \geq t_{i t e r, m i n} \\ D < D_{t h} \\ ξ < Ω_{s} (D) \end{matrix} \\ F a l s e, o t h e r w i s e \end{matrix}

(2)

where

t_{i t e r}

is the current number of iterations,

t_{i t e r, m i n}

is the minimum effective exploration iteration threshold, and

ξ \sim U (0, 1)

is a uniformly random number. The three conditions are used to suppress invalid switching in the initial stage, limit the action range of switching, and adaptively adjust the strength of goal guidance according to

Ω_{s} (D)

, thereby improving the stability of the switching mechanism.

(5) Rate of Change Analysis

The derivative of Equation (1) in the interval (

0 \leq D \leq D_{t h}

) is obtained as:

\frac{d Ω_{s} (D)}{d D} = - 2 (Ω_{max} - Ω_{min}) \frac{D}{D_{t h}^{2}}, 0 \leq D \leq D_{t h}

(3)

When D decreases,

Ω_{s} (D)

increases to

Ω_{max}

, thus enhancing goal guidance in the short-distance stage and accelerating the state switching when the two trees approach each other. When

D > D_{t h}

,

Ω_{s} (D) = Ω_{min}

and

d Ω_{s} / d D = 0

, and the two trees cover the unknown space in a relatively independent expansion mode; when

D_{t h} / 2 < D \leq D_{t h}

,

Ω_{s} (D)

increases gradually to conduct a tentative approach with a low probability; when

D \leq D_{t h} / 2

,

Ω_{s} (D)

increases and approaches

Ω_{max}

, realizing the rapid convergence of the two trees.

In summary, this strategy enables the strength of goal bias to be adaptively adjusted with the connectable distance between trees and the search stage through nonlinear switching probability, KD-Tree nearest-neighbor acceleration and multi-condition triggering; provides trigger conditions and switching logic for subsequent expansion strategies; and thus reduces invalid expansion. The principle of bi-tree cooperative adaptive switching is shown in Figure 4.

3.2. Goal-Guided Expansion Strategy

Aimed at the problem that traditional goal-biased sampling is prone to generating a large number of invalid expansions in complex scenarios, which further leads to search stagnation, this strategy introduces dynamic target binding, adaptive step size and a multi-constraint feasibility check to improve the probability of effective expansions toward the target, thus accelerating the convergence of the two trees.

(1) Dynamic Target Binding

Let L be the distance between the nearest nodes of the two trees and

L_{t h}

be the coordination threshold. When

L > L_{t h}

(separate exploration), the start tree

T_{i n i t}

takes the goal

q_{g o a l}

as the expansion target and the goal tree

T_{t e r m}

takes the start

q_{i n i t}

as the expansion target to maintain global exploration of the unknown space; when

L \leq L_{t h}

(cooperative convergence), the target of

T_{i n i t}

is switched to the nearest connectable node

q_{n e a r}

in

T_{t e r m}

, and

T_{t e r m}

is bound to the corresponding nearest connectable node in

T_{i n i t}

at the same time, prompting the two trees to converge rapidly along the shortest connectable direction.

(2) Goal-guided Expansion and Adaptive Step Size

Given the current expansion point

q_{c u r r}

and the expansion target

q_{t a r}

, the original direction vector is defined as:

v_{r a w} = q_{t a r} - q_{c u r r}

(4)

and the unit expansion direction is obtained by normalization:

d i r = \frac{v_{r a w}}{∥ v_{r a w} ∥} (∥ v_{r a w} ∥ \neq 0)

(5)

where

∥ v_{r a w} ∥

is the Euclidean norm. To enable the step size to adaptively change with the local obstacle congestion degree, an environmental complexity index

ζ \in [0, 1]

is introduced in this paper to describe the “local congestion level” of the neighborhood of the current expansion point. Specifically, a local evaluation neighborhood

N (q_{c u r r}, R_{c})

is taken with

q_{c u r r}

as the center (

R_{c}

is a fixed evaluation scale with the same dimension as the planning scale), and two types of information including obstacle proportion and near-obstacle safety distance are counted in this neighborhood: let K detection points

x_{k}

be distributed in

N (q_{c u r r}, R_{c})

, then the obstacle proportion is defined as

ρ_{o c c} = (1 / K) \sum I [x_{k} \in O b s]

; at the same time, the normalized term of the nearest obstacle safety distance is calculated as

ρ_{c l r} = 1 - clip (D_{m i n} (q_{c u r r}, O b s) / R_{c}, 0, 1)

. Finally, the environmental complexity index is obtained as:

ζ = clip (w_{o c c} \cdot ρ_{o c c} + w_{c l r} \cdot ρ_{c l r}, 0, 1), w_{o c c} + w_{c l r} = 1

(6)

A larger

ζ

indicates a more crowded local area or a closer distance to obstacles, while a smaller

ζ

indicates a more open local area. Based on

ζ

, the expansion step size is mapped to a continuous function

δ (ζ)

within the interval

[δ_{m i n}, δ_{m a x}]

: a larger step size is adopted in simple environments (

ζ < a

) to accelerate expansion; a smaller step size is adopted in complex environments (

ζ \geq b

) to enhance obstacle avoidance; and linear interpolation is used for compromise processing in the transition region (

a \leq ζ < b

). The candidate new node is:

q_{n e w} = q_{c u r r} + δ (ζ) d i r

(7)

If the current expansion tree is switched to the other tree, the current node of this tree is taken as

q_{c u r r}

, and the recalculation is performed according to Equations (4)–(7).

(3) Expansion Deviation Angle and Basis for Direction Correction

To quantify the consistency between the local expansion direction and the global target direction, the expansion deviation angle is introduced:

Δ ϕ = arccos (\frac{(q_{t a r} - q_{c u r r}) \cdot (q_{g o a l} - q_{c u r r})}{∥ q_{t a r} - q_{c u r r} ∥ ∥ q_{g o a l} - q_{c u r r} ∥})

(8)

where

q_{g o a l}

denotes the global target of the current expansion tree. A smaller

Δ ϕ

indicates that the expansion direction is closer to the global connectable direction and the path redundancy is lower; a larger

Δ ϕ

indicates an obvious deviation, which triggers local multi-sector expansion for direction correction. If the denominator is 0,

Δ ϕ = 0

is set or the calculation is skipped.

(4) Expansion Success Score

An expansion success score based on step size and safety distance is constructed:

P_{s u c c} = α \cdot e^{- β \cdot δ} + γ \frac{D_{o b s} (q_{a d d}, O)}{D_{m a x} (O)}, α \in [0, 1], α + γ = 1

(9)

where

α, β, a n d γ

are adjustment coefficients;

D_{o b s} (q_{a d d}, O)

is the distance from the new node to obstacles; and

D_{m a x} (O)

is the maximum value of this statistic in the scenario. Equation (9) shows that an excessively large step size will reduce the success score, while a larger average distance to obstacles will increase the success score. Therefore, introducing the environmental complexity

ζ

into

δ

can adaptively balance the expansion speed and obstacle avoidance safety.

It should be emphasized that

ζ

is an environmental quantity obtained online by local geometric/collision detection statistics, rather than a manual parameter tuning for a single scenario; it maintains a large step size in open areas to improve exploration efficiency and automatically shortens the step size in crowded/near-obstacle areas to reduce invalid expansions, thus improving the stability and transferability of goal-guided expansion. Moreover,

EvaluateComplexity (q_{c u r r}, O b s)

in Algorithm 1 is used to calculate the environmental complexity index

ζ

of the current node neighborhood;

AdaptiveStep (ζ)

maps

ζ

to the step size

δ (ζ)

, thereby maintaining a large step size in open areas and automatically shortening the step size in crowded/near-obstacle areas to improve the probability of feasible expansion.

In summary, this strategy promotes the cooperative convergence of the two trees through dynamic target binding and reduces invalid expansions by combining adaptive step size and a multi-constraint feasibility check, thus rapidly improving the connection efficiency under the constraint of safety distance; if the expansion fails, it switches to local multi-sector to generate new feasible expansion directions. The flow of goal-guided expansion is shown in Algorithm 1.

Algorithm 1 Goal-guided expansion.

1:: $L \leftarrow DualTree MinDistance (T_{c u r}, T_{o p p})$
2:: if $L > L_{t h}$ then
3:: if Root $(T_{c u r}) = = q_{i n i t} then q_{t a r} \leftarrow q_{g o a l} else q_{t a r} \leftarrow q_{i n i t}$
4:: else
5:: $q_{t a r} \leftarrow KDTreeNearest (T_{o p p}, q_{c u r r})$
6:: $v_{r a w} \leftarrow q_{t a r} - q_{c u r r}$
7:: if $∥ v_{r a w} ∥ = 0$ then return null
8:: $dir \leftarrow v_{r a w} / ∥ v_{r a w} ∥$
9:: $ζ \leftarrow Evaluate Complexity (q_{c u r r}, Obs)$
10:: $δ \leftarrow AdaptiveStep (ζ)$
11:: $q_{n e w} \leftarrow q_{c u r r} + δ \cdot dir$
12:: If $Feasible Segment (q_{c u r r}, q_{n e w}, Obs, {Dist}_{s a f e}, bounds)$ then
13:: $AddNode (T_{c u r}, q_{n e w}, parent = q_{c u r r})$
14:: $KDTree Insert (T_{c u r}, q_{n e w})$
15:: $Update DualTree Distance (T_{c u r}, T_{o p p})$
16:: return $q_{n e w}$
17:: return null

3.3. Local Multi-Sector Refined Expansion

When the goal-guided expansion fails, this strategy takes the direction from the current node

Q_{0} = (X_{0}, Y_{0})

to the target point

Q_{g o a l} = (X_{g}, Y_{g})

as the center, constructs multi-sector partitions in its forward half-plane, generates candidate points combined with a 2D Halton low-discrepancy sequence, and selects the optimal expansion node through multi-objective comprehensive evaluation, thus improving the obstacle avoidance capability.

The azimuth angle is:

φ_{0} = atan2 (Y_{g} - Y_{0}, X_{g} - X_{0})

(10)

In the polar coordinate system of

Q_{0}

, a core sector centered on the target direction is constructed:

Ψ_{c o r e} = [φ_{0} - θ_{0}, φ_{0} + θ_{0}]

(11)

where

θ_{0}

is the half-angle of the core sector. The denser the obstacles are, the smaller the value of

θ_{0}

is, so as to improve the sampling concentration in the target direction and reduce invalid attempts in the blocked direction. M groups of symmetric sectors are constructed on both sides of the core sector, and the angle interval of the m-th group is:

Ψ_{m} = [φ_{0} + θ_{m - 1}, φ_{0} + θ_{m}] \cup [φ_{0} - θ_{m}, φ_{0} - θ_{m - 1}], m = 1, 2, \dots, M

(12)

with:

θ_{m} = θ_{0} + m Δ θ

(13)

where

Δ θ

is the angle increment. In each sector sub-region, a 2D Halton sequence

v_{j} = (v_{j 1}, v_{j 2}) \in (0, 1)

is used to generate candidate points

Q_{j} = (X_{j}, Y_{j})

. Let

u = v_{j 1}

, then the polar radius and polar angle of the candidate points are:

ρ_{j} = \sqrt{ρ_{m i n}^{2} + u (R_{m a x}^{2} - ρ_{m i n}^{2})}

(14)

φ_{j} = φ_{m i n, m} + v_{j 2} (φ_{m a x, m} - φ_{m i n, m})

(15)

Thus:

X_{j} = X_{0} + ρ_{j} cos φ_{j}, Y_{j} = Y_{0} + ρ_{j} sin φ_{j}

(16)

where

R_{m a x}

is the upper limit of the expansion step size,

ρ_{m i n}

is the inner safety radius, and

[φ_{m i n, m}, φ_{m a x, m}]

is the sector angle boundary.

Feasibility constraints are imposed on the candidate points:

Dist (O b s, Q_{j}) \geq {Dist}_{s a f e}, Dist (Q_{g o a l}, Q_{j}) \leq {Dist}_{g o a l, m a x} + β R_{m a x}, β \in (0, 1)

(17)

where

Dist (O b s, Q_{j})

denotes the distance from the candidate point to the nearest obstacle,

{Dist}_{s a f e}

is the safety distance threshold, and

Dist (Q_{g o a l}, Q_{j})

is the distance from the candidate point to the target point. The candidate point can be used as an expansion node only after passing the subsequent line segment collision detection, where

{Dist}_{g o a l, m a x} = Dist (Q_{0}, Q_{g o a l}) + R_{m a x}

(18)

To adaptively adjust the exploration intensity of different sectors, an obstacle density index is introduced:

η_{m} = \frac{1}{A_{m}} \sum_{O b s \in O_{m}} exp (- \frac{Dist (Q_{0}, O b s)}{R_{0}})

(19)

where

A_{m}

is the area of the sector,

O_{m}

is the set of obstacles in the sector, and

R_{0}

is the scale coefficient; a larger

η_{m}

indicates a more crowded area, and thus the sampling allocation and expansion priority are reduced.

The allocated number of sampling points is constructed as:

N_{m} \propto 1 / (1 + η_{m})

(20)

To screen expansion nodes that take into account safety, linearity, goal guidance and smoothness from the candidate points, a multi-objective comprehensive evaluation function is constructed:

F (Q_{j}) = ω_{1} F_{s} (Q_{j}) + ω_{2} F_{l} (Q_{j}) + ω_{3} F_{g} (Q_{j}) + ω_{4} F_{s m} (Q_{j})

(21)

where

ω = {[ω_{1}, ω_{2}, ω_{3}, ω_{4}]}^{T}

is the weight vector, and each sub-item is defined as follows:

(1) Safety Score

F_{s} (Q_{j}) = \frac{Dist (O b s, Q_{j})}{{Dist}_{m a x}}

(22)

where

{Dist}_{m a x} = max_{Q_{j} \in Q_{m}} Dist (O b s, Q_{j})

(23)

if

{Dist}_{m a x} = 0

,

{Dist}_{m a x} \leftarrow {Dist}_{m a x} + ε

is set to avoid division by zero.

{Dist}_{m a x}

takes the maximum value over the entire set of candidate points.

(2) Linearity Score

F_{l} (Q_{j}) = \frac{1}{1 + Dist (Q_{j}, L_{s t d}) / D_{r e f}}

(24)

where

L_{s t d}

is the reference line from

Q_{0}

to

Q_{g o a l}

and

D_{r e f} = R_{m a x}

is the normalized distance.

(3) Goal Guidance Score

F_{g} (Q_{j}) = \frac{1}{1 + Dist (Q_{j}, Q_{g o a l}) / D_{r e f}}

(25)

(4) Smoothness Score

F_{s m} (Q_{j}) = \frac{1}{1 + φ_{r o t} (Q_{j})}

(26)

where the turning angle is given by the included angle among the parent node

Q_{p a r}

, the current node

Q_{0}

and the candidate point

Q_{j}

:

φ_{r o t} (Q_{j}) = arccos (\frac{{\vec{V}}_{1} \cdot {\vec{V}}_{2}}{∥ {\vec{V}}_{1} ∥ ∥ {\vec{V}}_{2} ∥})

(27)

{\vec{V}}_{1} = Q_{0} - Q_{p a r}

is the direction vector from the parent node to the current node,

{\vec{V}}_{2} = Q_{j} - Q_{0}

is the direction vector from the current node to the candidate point, and

∥ \vec{V} ∥

is the Euclidean norm of the vector. If the denominator is 0,

φ_{r o t} = 0

is set, and interval truncation is performed on the cosine term to ensure numerical stability. Finally, the candidate point that maximizes

F (Q_{j})

is selected as the expansion node.

In terms of weight determination and numerical setting, the priority of “safety > goal guidance > (linearity ≈ smoothness)” is mapped to:

p = [p_{s}, p_{l}, p_{g}, p_{s m}] = [4, 1.5, 3, 1.5]

(28)

and the normalized weight is obtained as:

ω_{k} = \frac{p_{k}}{\sum_{i = 1}^{4} p_{i}}, ω_{k} \geq 0, \sum_{k = 1}^{4} ω_{k} = 1

(29)

A unified weight vector is adopted in the experiments of this paper:

ω = {[0.4, 0.15, 0.3, 0.15]}^{T}

(30)

corresponding to safety, linearity, goal guidance and smoothness, respectively. For different scenario preferences, p can be adjusted on the premise of keeping the normalization rule unchanged.

This strategy enhances the effective exploration of the forward space through “multi-sector partitioning –low-discrepancy sampling –multi-objective screening”, and complements the goal-guided expansion, thus suppressing redundancy and alleviating the stagnation of unidirectional expansion. When the local multi-sector expansion still fails, the sampling-point-guided expansion is triggered to provide continuous and feasible detour directions. The principle of local multi-sector refined expansion is shown in Figure 5.

3.4. Sampling-Point-Guided Expansion Strategy

When the local multi-sector expansion still fails, this paper adopts an advancement mode of “random sampling point guidance + direction normalization + fixed step size” and performs a feasibility check under the constraints of collision, safety distance and boundary to continue advancing and to improve the probability of finding a feasible path.

Let the current expansion node be

Q_{0} = (x_{0}, y_{0})

and the random sampling point be

Q_{r} = (x_{r}, y_{r})

, then the original expansion direction vector is:

λ_{raw} = Q_{r} - Q_{0}

(31)

To eliminate the influence of sampling distance on expansion stability, the normalized direction vector is defined as:

λ = \{\begin{matrix} 0, ∥ λ ∥ = 0 \\ \frac{λ}{∥ λ ∥}, else \end{matrix}

(32)

When

∥ λ ∥ = 0

, the current expansion is terminated to avoid generating duplicate nodes; otherwise, a new node is generated with a fixed step size s:

Q_{new} = Q_{0} + λ s

(33)

Let

Q_{new} = (x_{new}, y_{new})

and define the expansion line segment

L = \bar{Q_{0} Q_{new}}

. To ensure feasibility and safety, collision detection is performed on the line segment L, and the minimum safety distance and boundary constraints are judged:

Γ = \{\begin{matrix} T r u e if \{\begin{matrix} F_{free} (L) = T r u e \\ min_{Q \in L} D i s t (O b s, Q) > D i s t_{safe} \\ X_{min} \leq x_{new} \leq X_{max}, Y_{min} \leq y_{new} \leq Y_{max} \end{matrix} \\ F a l s e else \end{matrix}

(34)

where

F_{free} (L)

indicates that the line segment L has no intersection with the obstacle set Obs and

[X_{min}, X_{max}]

,

[Y_{min}, Y_{max}]

are the planning space boundaries. When

Γ = T r u e

,

Q_{new}

is added to the search tree; otherwise, the expansion is discarded.

To facilitate subsequent path quality evaluation and local rewiring, the safety and target connectivity are recorded synchronously when the node is inserted:

T r e \leftarrow T r e \cup \{\begin{matrix} n_{i} & = I d x (Q_{new}) \\ n_{par} & = I d x (Q_{0}) \\ d_{min} & = min_{Q \in L} D_{obs} (Q) \\ d_{goal} & = ∥Q_{new} - Q_{goal}∥ \end{matrix}\}

(35)

where

n_{i}

is the index of the new node and

n_{par}

is the index of the parent node.

In summary, as a fallback strategy when both goal-guided expansion and local multi-sector expansion are blocked, this strategy realizes stable advancement through direction normalization and fixed step size and screens safe nodes by combining collision detection, safety distance and boundary constraints, providing information support for subsequent reconnection and path evaluation; its flow is shown in Algorithm 2.

Algorithm 2 Sampling-point-guided expansion.

1:: $λ_{r a w} \leftarrow Q_{r} - Q_{0}$
2:: if $∥ λ_{r a w} ∥ < ϵ$ then return null
3:: $Q_{n e w} \leftarrow Q_{0} + s \cdot (λ_{r a w} / ∥ λ_{r a w} ∥)$
4:: if $Q_{n e w} . x \notin [X_{m i n}, X_{m a x}]$ or
5:: $Q_{n e w} . y \notin [Y_{m i n}, Y_{m a x}]$ then return null
6:: $L \leftarrow Segment (Q_{0}, Q_{n e w})$
7:: if not $CollisionFree (L)$ then return null
8:: $d_{m i n} \leftarrow MinDistObs (L)$
9:: if $d_{m i n} \leq {Dist}_{s a f e}$ then return null
10:: $d_{g o a l} \leftarrow ∥ Q_{n e w} - Q_{g o a l} ∥$
11:: $AddNode (T, Q_{n e w}, parent = Q_{0},$
12:: $MinDistObs = d_{m i n}, DistGoal = d_{g o a l}$
13:: return $Q_{n e w}$

3.5. B-Spline Optimization and Smoothing Technology

To eliminate sharp inflection points and local discontinuous segments in the initial polyline path, this paper performs interpolation densification on the path points under the premise of satisfying collision-free and safety distance constraints and adopts a B-spline curve for fitting to obtain a continuous, smooth and trackable trajectory.

Given the maximum allowable spacing

L_{int, \max}

between adjacent control points, when the Euclidean distance

Dist (Q_{i}, Q_{i + 1})

between two adjacent points is greater than

L_{int, \max}

, intermediate control points are inserted in the interval

[Q_{i}, Q_{i + 1}]

, and the number of inserted points is:

n_{int} = max (0, ⌈\frac{Dist (Q_{i}, Q_{i + 1})}{L_{int, \max}}⌉ - 1)

(36)

where

⌈ \cdot ⌉

is the ceiling function. The k-th interpolation point is obtained by linear interpolation:

Q_{i, k} = Q_{i} + \frac{k}{n_{int} + 1} (Q_{i + 1} - Q_{i}), k = 1, \dots, n_{int}

(37)

When

n_{int} = 0

, no intermediate points are inserted, and only the endpoints

Q_{i}, Q_{i + 1}

are retained. The set of control points

{Q_{j}}_{j = 1}^{M}

is obtained after interpolation, and an m-order B-spline is used for geometric smoothing of the path. Let the non-decreasing node vector be

T = {t_{j}}_{j = 1}^{M + m}

, and the first-order basis function is:

B_{j, 1} (τ) = \{\begin{matrix} 1, τ \in [t_{j}, t_{j + 1}) \\ 0, else \end{matrix}

(38)

The high-order basis function is derived by the Cox-deBoor recursion:

B_{j, m} (τ) = \frac{τ - t_{j}}{t_{j + m - 1} - t_{j}} B_{j, m - 1} (τ) + \frac{t_{j + m} - τ}{t_{j + m} - t_{j + 1}} B_{j + 1, m - 1} (τ)

(39)

If the denominator is 0, the coefficient of the corresponding term is set to 0. In the parameter interval

τ \in [t_{m}, t_{M + 1}]

, the smooth trajectory

Γ (τ) = (X (τ), Y (τ))

is:

Γ (τ) = \sum_{j = 1}^{M} Q_{j} B_{j, m} (τ)

(40)

where M is the number of control points and

Q_{j}

is the j-th control point. Due to the convex hull property of B-spline, the trajectory point corresponding to any

τ

satisfies:

Γ (τ) \in conv {Q_{1}, Q_{2}, \dots, Q_{M}}

(41)

where

conv (\cdot)

denotes the convex hull of the control point set. To ensure that the smoothed path still satisfies the collision-free and safety distance constraints, collision detection is performed on the smoothed curve; if a collision occurs, iterative processing is conducted by further densifying control points, reducing the order or falling back to the original polyline path until the constraints are satisfied.

Furthermore, to adapt to different path characteristics, the key parameters are set adaptively:

(1) Maximum segment length setting

The maximum segment length is set according to the total path length

S_{total}

, the segment adjustment coefficient

k_{seg}

and the basic segment length upper limit

L_{base}

:

L_{seg, \max} = L_{base} + \frac{S_{total}}{k_{seg}}

(42)

A longer path can appropriately increase the segment length to reduce the number of segments; a larger

k_{seg}

results in a smaller segment length, thus improving the local fitting accuracy and smoothness.

(2) Adaptive selection of B-spline order

The B-spline order is adaptively selected according to the number of control points:

m = \{\begin{matrix} min (m_{1}, M), M < M_{t h}, \\ min (m_{2}, M), M \geq M_{t h}, \\ min (m_{2}, M), s p e c i a l_{p a t h}, \end{matrix}

(43)

where

M_{t h}

is the control point threshold;

m_{1}

and

m_{2}

correspond to short and long paths respectively; and

s p e c i a l_{p a t h}

denotes scenarios such as narrow passages and sharp turns, where the order is reduced to suppress local oscillation.

(3) Adaptive parameter sampling point sequence construction

To improve the discrete resolution in high-curvature regions, an adaptive parameter sampling point sequence is constructed:

N_{ref} = [M (1 + k_{ψ} κ_{max})], τ_{ref} = linspace (t_{m}, t_{M + 1}, N_{ref})

(44)

where

κ_{max}

is the maximum curvature of the path,

k_{ψ}

is the curvature sensitivity coefficient, and

linspace (\cdot)

denotes equidistant sampling in the interval. A larger curvature or a higher

k_{ψ}

results in denser sampling, thus obtaining a trajectory with smoother dynamic changes in the turning region.

In summary, this paper improves the continuity and smoothness of the trajectory through interpolation densification, B-spline fitting and adaptive parameter adjustment, and naturally connects with the path generated by the aforementioned expansion strategies, providing a more kinematically constrained trackable trajectory for mobile robots. The comparison of paths before and after B-spline optimization is shown in Figure 6.

3.6. Basis for Parameter Setting and Sensitivity Discussion

BGSE-RRT* involves parameters such as thresholds, probabilities, step sizes and weights. To avoid environment-specific “manual parameter tuning”, this paper reuses the same set of parameters and constraints for repeated comparisons in six simulation environments and ROS/real-robot joint validation and explains the parameters in groups according to “sensitivity –meaning –influence direction”. Among them,

Ω_{s} (D)

is the bi-tree cooperative switching probability function (adjusting the rhythm of exploration/convergence), and

Ω

is the multi-objective evaluation weight (only affecting the sorting of candidate points without changing the collision/safety feasibility check).

(1) Safety and Feasibility Parameters (High Sensitivity, Hard Constraints)

Including

D i s t_{s a f e}

, boundary constraints and collision criteria: The expansion edge is directly rejected if it collides with obstacles or the safety distance is insufficient. An increase in

D i s t_{s a f e}

improves the safety margin but compresses the feasible space and may increase the planning time; a decrease in

D i s t_{s a f e}

has the opposite effect but reduces the safety redundancy. Such parameters are determined by the robot size, sensing error and safety boundary and can be transferred across scenarios.

(2) Cooperative Switching and Convergence Parameters (Medium Sensitivity, Stage Control)

The switching probability

Ω

is given by the piecewise function

Ω_{s} (D)

driven by distance feedback, which is only determined by a small number of scale parameters including

Ω_{m i n}

,

Ω_{m a x}

and

D_{t h}

:

Ω

tends to

Ω_{m a x}

as D decreases to enhance convergence and takes

Ω_{m i n}

when

D > D_{t h}

to maintain exploration;

t_{i t e r, m i n}

is used to suppress early jitter. The target binding threshold

L_{t h}

, adaptive step size and direction normalization further reduce the scale sensitivity.

(3) Multi-objective Weights and Post-processing (Low Sensitivity, Preference Items; Including Safety Distance Rechecking)

Normalized multi-objective evaluation is adopted for multi-sector expansion, and a unified weight vector

ω = {[0.4, 0.15, 0.3, 0.15]}^{T}

is used in this paper;

ω

is only used to sort the candidate points that pass the collision/safety screening and usually does not change the feasibility. B-spline smoothing is set adaptively, and collision and safety distance rechecking are performed after smoothing. If necessary, densification/reduction in order/fallback are adopted to ensure that the final trajectory is safe and feasible.

In summary, this paper organizes the parameters with a “three-layer desensitization” mechanism: hard constraints ensure feasibility, medium-sensitivity parameters analytically control exploration/convergence, and the remaining parameters are desensitized through normalization/adaptation and safety distance rechecking; combined with the reuse of unified parameters, it can be shown that the performance gain mainly comes from structural and mechanism design rather than manual parameter tuning.

4. Experiments and Results

4.1. Experimental Setup

To evaluate the planning performance of the proposed BGSE-RRT* in complex static planar environments, we designed five simulation environments and one ROS/real-robot joint validation environment and conducted 200 trials under the same parameter set and constraints, comparing BGSE-RRT* with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*.

A 100 m × 100 m two-dimensional simulation platform was implemented in MATLAB R2024a, and additional ROS and real-robot experiments were performed. The evaluation metrics included planning time, path length, safety distance, steering-angle RMS, iteration count, and number of nodes. Environments 1–3 were dense small-obstacle environments; Environments 4–5 were narrow-passage environments formed by large obstacles; and Environment 6 was the ROS and real-robot joint verification environment.

Environments 1–5 are shown in Figure 7. The qualitative comparisons in Environments 1–5 are shown in Figure 8, the weighted fitness curves are shown in Figure 9, and the averaged results over 200 trials are summarized in Figure 10 and Table 1.

4.1.1. Dense Small-Obstacle Environments

Planning time: Compared with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*, BGSE-RRT* reduces planning time by 84.71%, 43.70%, 28.97%, and 22.45%, respectively. This improvement mainly comes from: (i) the bi-tree cooperative adaptive switching, which adjusts the goal bias strength according to the inter-tree distance and the search stage, and (ii) the fact that when goal-guided expansion is blocked, the local multi-sector refined expansion advances via comprehensive candidate evaluation, thereby reducing invalid expansion and strengthening obstacle avoidance.

Path length: Compared with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*, BGSE-RRT* shortens path length by 3.72%, 6.88%, 6.58%, and 2.94%, respectively. With adaptive switching, global exploration is preserved at long inter-tree distances, while at short distances the expansion target is switched to the nearest connectable node in the opposite tree, guiding both trees to grow along a shorter connection direction. Meanwhile, goal-guided expansion accelerates tree convergence, and the multi-sector mechanism suppresses detour redundancy when blocked.

Safety distance: Compared with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*, BGSE-RRT* increases safety distance by 48.33%, 44.81%, 46.11%, and 39.43%, respectively. This gain is attributed to: (i) explicit safety constraints and direction correction in the multi-constraint feasibility check during goal-guided expansion; (ii) the safety term in the multi-sector candidate evaluation; and (iii) collision and safety distance rechecking after B-spline smoothing, which maintains the safety margin.

In addition, BGSE-RRT* also reduces iteration count and number of nodes, resulting in a more compact tree and stronger trackability.

4.1.2. Large-Obstacle Occluded Environments

Planning time: Compared with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*, BGSE-RRT* reduces planning time by 30.36%, 31.53%, 7.35%, and 33.94%, respectively, consistent with the advantages discussed above.

Path length: Compared with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*, BGSE-RRT* reduces path length by 1.52%, 3.85%, 3.54%, and 1.95%, respectively. Dynamic target binding guides the two trees to advance along a shorter corridor connection direction and the selection mechanism stabilizes feasible detours when blocked, yielding paths closer to passable regions.

Safety distance: Compared with GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*, BGSE-RRT* increases safety distance by 20.68%, 15.32%, 9.58%, and 10.42%, respectively. When blocked, the multi-sector mechanism prioritizes safer directions with a feasibility check, and together with multi-constraint collision checks and the B-spline convex hull property, the trajectory shape remains safe.

BGSE-RRT* also reduces iteration count and number of nodes, achieving a better balance between exploration efficiency and path quality in narrow-passage environments.

4.1.3. ROS and Real-Robot Joint Verification

The experiments are conducted in a 20 m × 40 m ROS simulation environment and a 10 m × 20 m real field site. The real-robot platform is a LanderPi Mecanum mobile robot. Based on ROS, chassis control is realized by sending linear velocity v and angular velocity

ω

via topic

/ c m d_{v e l}

. To match the two-dimensional planar kinematic model in this paper, only the

(v, ω)

channels are used with lateral velocity fixed to

v_{y} = 0

, and the bounds

| v | \leq 0.35

m/s and

| ω | \leq 1.80

rad/s are applied for feasibility check and time calibration. Key robot parameters are: wheel diameter 0.060 m, wheelbase 0.150 m, track width 0.160 m, body size 0.212 × 0.174 × 0.441 m, and mass 1.79 kg. Sensors include a 2D LiDAR (MS200, 360°, angular resolution 0.8°, 7–15 Hz, range 0.1–13.4 m), an IMU (QMI8658), and an AB-phase Hall encoder (13 poles, 1:20). Computation and communication use a Raspberry Pi 5 (Raspberry Pi OS, ROS1 Melodic deployed in Docker). The chassis controller is an STM32 F407VET6 communicating with ROS via serial/USB-serial. The ROS simulation uses the same planar model and the same velocity/angular-velocity constraints as the real robot to ensure the consistency and reproducibility of joint verification.

The ROS path comparison shows that BGSE-RRT* (black) achieves the best overall path quality; GB-RRT* (yellow) produces denser corners and higher redundancy, while BGSE-RRT* outperforms BI-RRT* (green), BI-APF-RRT* (red), and BAI-RRT* (blue). Compared with these methods, BGSE-RRT* reduces the average planning time by 30.68%, shortens the average path length by 4.15%, increases the average safety distance by 32.20%, reduces the average steering-angle RMS by 18.64%, reduces the average iteration count by 90.46%, and reduces the average number of nodes by 67.21%. In real-robot navigation, a single run takes about 26 s with a 100% trajectory-tracking success rate, indicating that the proposed method can generate shorter and safer navigation paths. The joint verification results are shown in Figure 11.

5. Conclusions

BGSE-RRT* establishes a bi-tree cooperative adaptive switching mechanism to achieve a dynamic balance between global exploration and local convergence during searching. Under a multi-constraint feasibility check, it adopts goal-guided expansion to accelerate bi-tree convergence and improve connection efficiency. When goal-guided expansion is blocked, BGSE-RRT* switches to local multi-sector refined expansion, generating candidate expansion points and advancing through selection to enhance obstacle avoidance. If the multi-sector mechanism still fails, sampling-point-guided expansion is activated to suppress invalid exploration and maintain forward progress until a feasible path is found. Finally, B-spline smoothing is applied to improve trajectory continuity.

With these complementary mechanisms, BGSE-RRT* achieves up to an 84.71% reduction in planning time in dense small-obstacle environments, with shorter paths and 39–48% increases in safety distance. In narrow-passage environments, planning time is reduced by 7–34% and safety distance is increased by 10–21%. In ROS and real-robot joint verification, the average planning time is reduced by 30.68%, the path is safer and easier to track, and both iteration count and number of nodes are reduced; a real-robot run takes about 26 s with a 100% trajectory-tracking success rate.

This work primarily targets two-dimensional static planar environments, and systematic extensions to dynamic obstacles and multi-robot cooperative planning remain to be developed. Although the algorithm contains multiple threshold and probability parameters, the risk of manual tuning has been mitigated by using a unified parameter set across all environments and by providing a hierarchical sensitivity discussion; nevertheless, stronger adaptivity and broader verification are still needed under more complex tasks and platform constraints.

Future work will introduce motion prediction and online replanning with explicit evaluation of real-time performance and safety. We will also extend the framework to multi-robot cooperative planning by integrating task allocation and collision-avoidance constraints and will further generalize the goal-guided and multi-sector selection mechanisms to higher-dimensional problems by combining them with spatiotemporal trajectory optimization.

Author Contributions

Conceptualization, Z.L.; Methodology, Z.L. and L.P.; Validation, L.P.; Formal analysis, X.L., X.J. and L.P.; Resources, W.Y. and X.J.; Writing—original draft, W.Y.; Writing—review & editing, W.Y., X.L. and L.P.; Visualization, L.P.; Funding acquisition, X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

No applicable.

Informed Consent Statement

No applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Ji, Y.; Ni, L.; Zhao, C.; Lei, C.; Du, Y.; Wang, W. TriPField: A 3D potential field model and its applications to local path planning of autonomous vehicles. IEEE Trans. Intell. Transp. Syst. 2023, 24, 3541–3554. [Google Scholar] [CrossRef]
Paden, B.; Čáp, M.; Yong, S.Z.; Yershov, D.; Frazzoli, E. A survey of motion planning and control techniques for self-driving urban vehicles. IEEE Trans. Intell. Veh. 2016, 1, 33–55. [Google Scholar] [CrossRef]
Ganesan, S.; Ramalingam, B.; Mohan, R.E. A hybrid sampling-based RRT* path planning algorithm for autonomous mobile robot navigation. Expert Syst. Appl. 2024, 258, 125206. [Google Scholar] [CrossRef]
Zhang, H.; Zhang, Y.; Liu, C.; Zhang, Z. Energy efficient path planning for autonomous ground vehicles with Ackermann steering. Robot. Auton. Syst. 2023, 162, 104366. [Google Scholar] [CrossRef]
Zhang, W.; Shan, L.; Chang, L.; Dai, Y. SVF-RRT*: A stream-based VF-RRT* for USVs path planning considering ocean currents. IEEE Robot. Autom. Lett. 2023, 8, 2413–2420. [Google Scholar] [CrossRef]
Shen, H.; Xie, W.-F.; Tang, J.; Zhou, T. Adaptive manipulability-based path planning strategy for industrial robot manipulators. IEEE/ASME Trans. Mechatron. 2023, 28, 1742–1753. [Google Scholar] [CrossRef]
Teng, S.; Hu, X.; Deng, P.; Li, B.; Li, Y.; Ai, Y.; Yang, D.; Li, L.; Xuanyuan, Z.; Zhu, F.; et al. Motion planning for autonomous driving: The state of the art and future perspectives. IEEE Trans. Intell. Veh. 2023, 8, 3692–3711. [Google Scholar] [CrossRef]
Sánchez-Ibáñez, J.R.; Pérez-Del Pulgar, C.J.; García-Cerezo, A. Path planning for autonomous mobile robots: A review. Sensors 2021, 21, 7898. [Google Scholar] [CrossRef]
Dijkstra, E.W. A note on two problems in connexion with graphs. In Edsger Wybe Dijkstra: His Life, Work, and Legacy; Association for Computing Machinery: New York, NY, USA, 2022; pp. 287–290. [Google Scholar] [CrossRef]
Lin, Z.; Wu, K.; Shen, R.; Yu, X.; Huang, S. An efficient and accurate A-star algorithm for autonomous vehicle path planning. IEEE Trans. Veh. Technol. 2024, 73, 9003–9008. [Google Scholar] [CrossRef]
Mac, T.T.; Copot, C.; Tran, D.T.; De Keyser, R. Heuristic approaches in robot path planning: A survey. Robot. Auton. Syst. 2016, 86, 13–28. [Google Scholar] [CrossRef]
Li, B.; Wang, Y.; Ma, S.; Bian, X.; Li, H.; Zhang, T.; Li, X.; Zhang, Y. Adaptive pure pursuit: A real-time path planner using tracking controllers to plan safe and kinematically feasible paths. IEEE Trans. Intell. Veh. 2023, 8, 4155–4168. [Google Scholar] [CrossRef]
Ma, Q.; Li, M.; Huang, G.; Ullah, S. Overtaking path planning for CAV based on improved artificial potential field. IEEE Trans. Veh. Technol. 2024, 73, 1611–1622. [Google Scholar] [CrossRef]
Guo, H.; Hou, X.; Cao, Z.; Zhang, J. GP3: Gaussian process path planning for reliable shortest path in transportation networks. IEEE Trans. Intell. Transp. Syst. 2021, 23, 11575–11590. [Google Scholar] [CrossRef]
Zong, X.; Liu, Y.; Ye, Z.; Xia, X. Multi-strategy ensemble harris hawks optimization for smooth path planning of mobile robots. Int. J. Mod. Phys. C (IJMPC) 2024, 35, 2450083. [Google Scholar] [CrossRef]
Zhang, H.-Y.; Lin, W.-M.; Chen, A.-X. Path planning for the mobile robot: A review. Symmetry 2018, 10, 450. [Google Scholar] [CrossRef]
Waga, A.; Ba-Ichou, A.; Benhlima, S.; Bekri, A.; Abdouni, J. Efficient autonomous navigation for mobile robots using machine learning. IAES Int. J. Artif. Intell. (IJ-AI) 2024, 13, 3061–3071. [Google Scholar] [CrossRef]
Hao, B.; Du, H.; Yan, Z. A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning. Ocean Eng. 2023, 270, 113632. [Google Scholar] [CrossRef]
Tan, C.S.; Mohd-Mokhtar, R.; Arshad, M.R. Expected-mean gamma-incremental reinforcement learning algorithm for robot path planning. Expert Syst. Appl. 2024, 249, 123539. [Google Scholar] [CrossRef]
Huang, J.-K.; Grizzle, J.W. Efficient anytime CLF reactive planning system for a bipedal robot on undulating terrain. IEEE Trans. Robot. 2023, 39, 2093–2110. [Google Scholar] [CrossRef]
Yu, F.; Shang, H.; Zhu, Q.; Zhang, H.; Chen, Y. An efficient RRT-based motion planning algorithm for autonomous underwater vehicles under cylindrical sampling constraints. Auton. Robot. 2023, 47, 281–297. [Google Scholar] [CrossRef]
Yu, J.; Chen, C.; Arab, A.; Yi, J.; Pei, X.; Guo, X. RDT-RRT: Real-time double-tree rapidly-exploring random tree path planning for autonomous vehicles. Expert Syst. Appl. 2024, 240, 122510. [Google Scholar] [CrossRef]
Song, F.; Yang, X.; Xiang, Z. Autonomous berthing of unmanned surface vehicles based on improved Dubins-RRT algorithm and non-singular terminal sliding mode control. IEEE Access 2023, 11, 43159–43168. [Google Scholar] [CrossRef]
Fan, J.; Chen, X.; Liang, X. UAV trajectory planning based on bi-directional APF-RRT* algorithm with goal-biased. Expert Syst. Appl. 2023, 213, 119137. [Google Scholar] [CrossRef]
Wang, B.; Ju, D.; Xu, F.; Xun, G. CAF-RRT*: A 2D path planning algorithm based on circular arc fillet method. IEEE Access 2022, 10, 127168–127181. [Google Scholar] [CrossRef]
Zhang, Y.; Sun, H.; Zhou, J.; Pan, J.; Hu, J.; Miao, J. Optimal vehicle path planning using quadratic optimization for baidu apollo open platform. In Proceedings of the 2020 IEEE Intelligent Vehicles Symposium (IV); IEEE: Piscataway, NJ, USA, 2020; pp. 978–984. [Google Scholar] [CrossRef]
Niu, T.; Wang, L.; Xu, Y.; Wang, J.; Wang, S. Quintic Bézier curve and numerical optimal solution based path planning approach in seismic exploration. Control Eng. Pract. 2024, 145, 105855. [Google Scholar] [CrossRef]
Hu, G.; Du, B.; Wei, G. HG-SMA: Hierarchical guided slime mould algorithm for smooth path planning. Artif. Intell. Rev. 2023, 56, 9267–9327. [Google Scholar] [CrossRef]
Parlangeli, G.; De Palma, D.; Attanasi, R. A novel approach for 3PDP and real-time via point path planning of Dubins’ vehicles in marine applications. Control Eng. Pract. 2024, 144, 105814. [Google Scholar] [CrossRef]
Wang, J.; Li, B.; Meng, M.Q.-H. Kinematic constrained bi-directional RRT with efficient branch pruning for robot path planning. Expert Syst. Appl. 2021, 170, 114541. [Google Scholar] [CrossRef]
Zhao, P.; Chang, Y.; Wu, W.; Luo, H.; Zhou, Z.; Qiao, Y.; Li, Y.; Zhao, C.; Huang, Z.; Liu, B.; et al. Dynamic RRT: Fast feasible path planning in randomly distributed obstacle environments. J. Intell. Robot. Syst. 2023, 107, 48. [Google Scholar] [CrossRef]
Dong, X.; Wang, Y.; Fang, C.; Ran, K.; Liu, G. FHQ-RRT*: An improved path planning algorithm for mobile robots to acquire high-quality paths faster. Sensors 2025, 25, 2189. [Google Scholar] [CrossRef]
Zhao, X.; Wang, K.; Zhang, P.; Kan, J. An improved RRT* path planning algorithm combining Gaussian distributed sampling and depth strategy for robotic arm of fruit-picking robot. Comput. Electron. Agric. 2026, 240, 111244. [Google Scholar] [CrossRef]
Ma, G.; Duan, Y.; Li, M.; Xie, Z.; Zhu, J. A probability smoothing bi-RRT path planning algorithm for indoor robot. Future Gener. Comput. Syst. 2023, 143, 349–360. [Google Scholar] [CrossRef]
Gammell, J.D.; Srinivasa, S.S.; Barfoot, T.D. Informed RRT: Optimal sampling-based path planning focused via direct sampling of an admissible ellipsoidal heuristic. In Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: Piscataway, NJ, USA, 2014; pp. 2997–3004. [Google Scholar] [CrossRef]
Adiyatov, O.; Varol, H.A. Rapidly-exploring random tree based memory efficient motion planning. In Proceedings of the 2013 IEEE International Conference on Mechatronics and Automation (ICMA); IEEE: Piscataway, NJ, USA, 2013; pp. 354–359. [Google Scholar] [CrossRef]
Guo, S.; Gong, J.; Shen, H.; Yuan, L.; Wei, W.; Long, Y. DBVSB-P-RRT*: A path planning algorithm for mobile robot with high environmental adaptability and ultra-high speed planning. Expert Syst. Appl. 2025, 266, 126123. [Google Scholar] [CrossRef]
Bahwini, T.; Zhong, Y.; Gu, C. Path planning in the presence of soft tissue deformation. Int. J. Interact. Des. Manuf. 2019, 13, 1603–1616. [Google Scholar] [CrossRef]
Zhong, Y.; Shirinzadeh, B.; Yuan, X. Optimal robot path planning with cellular neural network. Int. J. Intell. Mechatron. Robot. 2011, 1, 20–39. [Google Scholar] [CrossRef]
Hills, J.; Zhong, Y. Cellular neural network-based thermal modelling for real-time robotic path planning. Int. J. Agil. Syst. Manag. 2014, 7, 261–281. [Google Scholar] [CrossRef]
Zhou, Q.; Gao, S.; Qu, B.; Gao, X.; Zhong, Y. Crossover recombination-based global-best brain storm optimization algorithm for UAV path planning. Proc. Rom. Acad. Ser. A 2022, 23, 209–218. [Google Scholar]
Gammell, J.D.; Srinivasa, S.S.; Barfoot, T.D. Batch Informed Trees (BIT*). In Proceedings of the 2015 IEEE International Conference on Robotics and Automation (ICRA), Seattle, WA, USA, 26–30 May 2015; pp. 3067–3074. [Google Scholar] [CrossRef]
Janson, L.; Schmerling, E.; Clark, A.; Pavone, M. Fast Marching Tree (FMT*): A Fast Marching Sampling-Based Method for Optimal Motion Planning in Many Dimensions. Int. J. Robot. Res. 2015, 34, 883–921. [Google Scholar] [CrossRef]
Arslan, O.; Tsiotras, P. Use of Relaxation Methods in Sampling-Based Algorithms for Optimal Motion Planning (RRT#). In Proceedings of the 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013; pp. 2421–2428. [Google Scholar] [CrossRef]
Otte, M.; Frazzoli, E. RRTX: Real-Time Motion Planning/Replanning for Environments with Unpredictable Obstacles. Int. J. Robot. Res. 2016, 35, 797–822. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of the BI-RRT* algorithm.

Figure 2. Schematic diagram of the APF-RRT* algorithm.

Figure 3. Schematic diagram of the BI-APF-RRT* algorithm.

Figure 4. Schematic diagram of the bi-tree cooperative adaptive switching strategy.

Figure 5. Schematic diagram of local multi-sector refined expansion.

Figure 6. Comparison diagram of paths before and after B-spline optimization.

Figure 7. Five simulation environments.

Figure 8. Running results of five algorithms.

Figure 9. Weighted fitness curves of five algorithms in Environments 1–5.

Figure 10. Average values of 200 experiments for five algorithms in Environments 1–5.

Figure 11. ROS and real-robot joint verification results.

Table 1. Averages of five algorithm results over 200 experiments in Environments 1–5.

Environment	Algorithm	Path Length /m	Iteration Count/Piece	Number of Nodes /Piece	Safe Distance /m	Planning Time /s	Coefficient of Variation/%
Environment 1	GB-RRT*	154.16	1052.00	653.25	1.20	2.49	31.28
	BI-RRT*	159.57	107.95	125.90	1.31	0.38	22.90
	BI-APF-RRT*	159.58	87.15	113.50	1.12	0.37	18.90
	BAI-RRT*	152.09	96.05	107.85	1.30	0.30	19.24
	BGSE-RRT*	146.13	19.75	46.00	1.85	0.26	4.52
Environment 2	GB-RRT*	163.26	1482.50	773.60	1.19	2.11	26.42
	BI-RRT*	167.82	307.65	220.05	1.17	0.61	40.06
	BI-APF-RRT*	165.67	167.20	139.45	1.18	0.38	24.28
	BAI-RRT*	163.92	227.40	163.15	1.25	0.42	24.74
	BGSE-RRT*	156.80	24.10	49.55	1.69	0.26	18.94
Environment 3	GB-RRT*	169.12	607.55	254.95	0.90	0.37	21.46
	BI-RRT*	175.70	220.25	159.45	0.89	0.36	37.60
	BI-APF-RRT*	176.19	193.85	154.95	1.04	0.32	19.58
	BAI-RRT*	166.62	179.10	136.70	0.95	0.26	16.34
	BGSE-RRT*	165.53	38.9	74.7	1.34	0.24	11.74
Environment 4	GB-RRT*	155.44	289.80	173.70	1.30	0.90	23.16
	BI-RRT*	160.56	88.20	118.80	1.23	0.85	24.20
	BI-APF-RRT*	159.44	100.80	126.25	1.30	0.99	29.16
	BAI-RRT*	155.45	89.65	109.10	1.27	0.87	19.72
	BGSE-RRT*	152.10	22.10	45.85	1.33	1.26	9.96
Environment 5	GB-RRT*	154.03	773.35	128.80	1.07	0.22	38.18
	BI-RRT*	156.41	247.60	121.70	1.25	0.26	31.90
	BI-APF-RRT*	156.49	299.65	144.70	1.31	0.37	55.46
	BAI-RRT*	155.36	187.45	115.25	1.32	0.22	18.80
	BGSE-RRT*	152.66	33.45	67.90	1.53	0.20	12.38

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yue, W.; Li, X.; Liu, Z.; Jiang, X.; Pan, L. BGSE-RRT*: A Goal-Guided and Multi-Sector Sampling-Expansion Path Planning Algorithm for Complex Environments. Sensors 2026, 26, 1837. https://doi.org/10.3390/s26061837

AMA Style

Yue W, Li X, Liu Z, Jiang X, Pan L. BGSE-RRT*: A Goal-Guided and Multi-Sector Sampling-Expansion Path Planning Algorithm for Complex Environments. Sensors. 2026; 26(6):1837. https://doi.org/10.3390/s26061837

Chicago/Turabian Style

Yue, Wenhao, Xiang Li, Ziyue Liu, Xiaojiang Jiang, and Lanlan Pan. 2026. "BGSE-RRT*: A Goal-Guided and Multi-Sector Sampling-Expansion Path Planning Algorithm for Complex Environments" Sensors 26, no. 6: 1837. https://doi.org/10.3390/s26061837

APA Style

Yue, W., Li, X., Liu, Z., Jiang, X., & Pan, L. (2026). BGSE-RRT*: A Goal-Guided and Multi-Sector Sampling-Expansion Path Planning Algorithm for Complex Environments. Sensors, 26(6), 1837. https://doi.org/10.3390/s26061837

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

BGSE-RRT*: A Goal-Guided and Multi-Sector Sampling-Expansion Path Planning Algorithm for Complex Environments

Abstract

1. Introduction

2. Related Work

2.1. BI-RRT*

2.2. APF-RRT*

2.3. BI-APF-RRT*

2.4. Relationship to Representative SOTA Sampling-Based Planners and Comparison Scope

3. Principle of the BGSE-RRT* Algorithm

3.1. Bi-Tree Cooperative Adaptive Switching Strategy

3.2. Goal-Guided Expansion Strategy

3.3. Local Multi-Sector Refined Expansion

3.4. Sampling-Point-Guided Expansion Strategy

3.5. B-Spline Optimization and Smoothing Technology

3.6. Basis for Parameter Setting and Sensitivity Discussion

4. Experiments and Results

4.1. Experimental Setup

4.1.1. Dense Small-Obstacle Environments

4.1.2. Large-Obstacle Occluded Environments

4.1.3. ROS and Real-Robot Joint Verification

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI