1. Introduction
In recent years, autonomous driving has developed rapidly worldwide and is widely regarded as a key direction for the future of transportation systems [
1]. The realization of autonomous driving relies on the coordinated operation of intelligent vehicles in perception, decision-making, path planning, and control [
2]. With high-definition maps available, lane-level navigation can be achieved using high-precision localization. In this process, the objective of path planning is to generate a collision-free, kinematically constrained, continuous, and optimal path from the start to the goal [
3].
In path planning, platforms such as steering vehicles [
4], marine vessels [
5], and robotic manipulators [
6] commonly face physical constraints, including steering limits, obstacle avoidance requirements, dynamic stability, and joint limits [
7]. Generating feasible paths under these constraints remains a key challenge, and a variety of solutions have been proposed [
8]. However, in complex environments with obstacle-dense layouts and heavy occlusion, planners must jointly consider search efficiency, path quality, and trajectory trackability. Existing methods can be broadly categorized into four classes according to modeling and solution paradigms.
(1) Grid-based search methods (e.g., Dijkstra [
9] and A* [
10]) discretize the space and search over grid cells. Their accuracy depends on grid resolution: finer grids can yield smoother paths but incur higher computational complexity [
11]. Some studies perform secondary optimization after search, which may be inconsistent with the original collision-avoidance result. For example, Li et al. [
12] use adaptive pure pursuit to generate shorter and smoother paths based on an A* path; however, due to the discrete nature, it is still difficult to guarantee curvature continuity and trajectory trackability in a principled way, and obstacle-dense scenarios often expose a conflict between resolution and real-time performance.
(2) Optimization-based methods (e.g., artificial potential field (APF) [
13]) typically rely on accurate potential-field modeling and establish a mapping between the environment model and path generation [
14], showing effectiveness in handling environmental constraints such as obstacle avoidance [
15]. However, under complex geometric constraints, it is often difficult to directly couple kinematic constraints with the potential-field framework, and the method is prone to local minima, leading to unstable guidance or local stagnation.
(3) Learning-based methods [
16] perform planning in a data-driven manner and can be advantageous in unknown environments [
17], but they depend on data coverage and prior trajectory data, and it is difficult to obtain training data that are both generalizable and asymptotically optimal. Therefore, they are often combined with conventional methods in practice. For example, Hao et al. [
18] initialized a reinforcement-learning Q-table using APF on top of a sampling method; Tan et al. [
19] combined a Q-learning Q-table with an A* grid structure to improve performance. Overall, learning-based outputs are often suboptimal and are mainly used to enhance local capabilities of rule-based methods; under complex environments and safety constraints, generalization stability and interpretability must be carefully balanced.
(4) Sampling-based planners (e.g., RRT and PRM) [
20] provide probabilistic completeness and good scalability [
21] and are widely used for constrained path planning tasks [
22]. The resulting path shape is largely determined by the connection/rewiring strategy [
23]; thus, many studies improve connection schemes to satisfy specific constraints, such as cubic spline interpolation [
24], circular-arc methods [
25], quadratic programming [
26], Bézier-curve methods [
27], and bio-inspired methods [
28]. Dubins curves [
29] enforce curvature via a minimum turning radius but typically require specified boundary poses and headings. Some studies introduce kinematic constraints during expansion or connection [
30], which may reduce the success rate of initial connectivity; moreover, inconsistency between the terminal heading of one segment and the initial heading of the next can lead to poor trajectory trackability, which is insufficiently addressed in part of the literature.
Moreover, the computational efficiency of sampling-based planners in complex environments strongly depends on the sampling distribution and the quality of heuristic guidance. Related work reduces ineffective expansions mainly from search strategies and sampling strategies [
31]. For search strategies, bidirectional planning and bi-tree search structures are used to accelerate feasible-solution discovery. For example, FHQ-RRT [
32] speeds up high-quality path generation through strategy design and GD-RRT* [
33] improves initial-solution quality and optimization efficiency by combining Gaussian sampling and deep policies; however, multi-tree parallel methods may still produce infeasible paths at the merging stage [
34]. For sampling strategies, shrinking the sampling space and pruning are common. Informed-RRT* [
35] restricts sampling to an ellipsoidal subset, RRT-FN [
36] introduces pruning to control tree size, and DBVSB-P-RRT* [
37] improves adaptability in complex environments via adaptive biased sampling, variable step size, and pruning. Nevertheless, many of these improvements focus on a single aspect (e.g., sampling space, tree size, or bias form). In obstacle-dense and narrow-passage scenarios, planners may still suffer from failed goal guidance, blocked expansion, and frequent collision rejection, causing fluctuations in convergence efficiency and path quality; meanwhile, curvature continuity and trajectory trackability often require coordinated search mechanisms.
Beyond these mainstream methods, advanced path planning methods developed in recent years for specific tasks and complex physical constraints also merit attention. For example, Bahwini et al. [
38] addressed needle insertion path planning under soft-tissue deformation by establishing a temperature-field model based on bioheat transfer, thereby accounting for the effect of soft-tissue deformation on path generation. Zhong et al. [
39] proposed a cellular-neural-network-based method for optimal robot path planning, which enabled real-time path generation through neural dynamics without requiring global search. Building on this work, Hills and Zhong [
40] further introduced a thermal modeling method that incorporated the heat conduction process into real-time robot path planning, thereby enhancing responsiveness to changes in dynamic environments. In the field of aerial robots, Zhou et al. [
41] proposed a crossover-recombination-based globally optimal brain storm optimization algorithm that combined cubic B-splines, constraint handling, and swarm intelligence optimization for multi-constraint UAV path planning. These studies indicate that path planning has expanded from traditional geometric obstacle avoidance to more complex scenarios involving soft-tissue deformation, thermal-field modeling, neural dynamics, and multi-constraint flight optimization. However, such methods generally rely on specific physical models, tasks, or optimization frameworks, and therefore remain clearly distinct from the tree-based incremental sampling planning problem in known static complex ground environments considered in this study.
In summary, improving RRT* in complex environments faces the following key challenges: unstable and non-adaptive goal guidance that induces ineffective expansions and search stagnation; fixed goal bias that easily triggers local oscillation near obstacles; occluded structures that require more directional and uniform candidate generation to reduce repeated trial and error; and polyline connection/rewiring alone that struggles to achieve both optimality and curvature continuity, thereby degrading trajectory trackability.
To address these issues, this paper proposes BGSE-RRT*, a bi-tree cooperative and multi-mechanism integrated path planning algorithm. Our target problem is 2D planar path planning and trackable trajectory generation for ground mobile robots under known-map constraints, where “complex environments” mainly refer to geometric complexity such as obstacle-dense layouts, heavy occlusion, and narrow passages. The main contributions are as follows: (1) a nonlinear switching probability combined with KD-Tree acceleration and multi-condition triggering enables adaptive balancing between global exploration and local convergence, improving expansion efficiency; (2) a goal-guided expansion accelerates bi-tree convergence, and when expansion becomes blocked, 2D Halton low-discrepancy sampling within multi-sector regions generates candidates and selects the best to enhance obstacle avoidance; and (3) if the multi-sector expansion still fails, a sampling-point-guided expansion continues advancing to find a feasible path, and finally B-spline post-processing improves path continuity.
The remainder of this paper is organized as follows:
Section 2 introduces BI-RRT*, APF-RRT*, and BI-APF-RRT* baselines and clarifies the comparison scope with sampling-based planners.
Section 3 presents BGSE-RRT*.
Section 4 describes the experimental environments and evaluates BGSE-RRT* across multiple scenarios, with comparisons to GB-RRT*, BI-RRT*, BI-APF-RRT*, and BAI-RRT*.
Section 5 concludes the paper.
3. Principle of the BGSE-RRT* Algorithm
Although BI-APF-RRT* optimizes the planning time and path quality of RRT* to a certain extent, it still struggles to effectively balance path quality, expansion efficiency and environmental adaptability. To address this issue, the BGSE-RRT* algorithm proposed in this paper realizes adaptive coordination of bi-tree expansion through bi-tree adaptive switching; accelerates the convergence of the two trees via goal-guided expansion and switches to local multi-sector expansion for comprehensive optimal selection and advancement if the expansion is blocked; activates sampling-point-guided expansion to continue advancing and find a feasible path when the multi-sector expansion fails; and finally adopts B-spline to improve trajectory continuity.
3.1. Bi-Tree Cooperative Adaptive Switching Strategy
Aimed at the problem that the traditional BI-RRT* usually adopts fixed goal bias, which is prone to invalid guidance and expansion stagnation in complex environments, this paper takes the nearest connectable distance between the two trees as the feedback quantity and combines the nonlinear switching probability, KD-Tree nearest-neighbor search and multi-condition triggering mechanism to realize the adaptive balance between global exploration and local convergence.
(1) Nonlinear Switching Probability Model
The switching probability is defined as a piecewise function:
where
D is the minimum distance from the current expansion node
to the connectable nodes in the opposite tree,
is the switching threshold (
), and
and
are the upper and lower limits of the switching probability (
). When
,
increases to
to enhance the convergence of the two trees; when
,
decreases to
to suppress excessive goal bias and maintain exploration capability.
(2) KD-Tree Nearest-Neighbor Search Acceleration
To avoid a nearest-neighbor search in each expansion, a KD-Tree index is established for the node coordinates of the opposite tree. When calculating the distance D or performing switching judgment, the current expansion node is taken as the query point to execute nearest-neighbor search on the KD-Tree of the opposite tree, which reduces the complexity of nearest-neighbor search from linear scanning to , where is the number of nodes in the opposite tree.
(3) Distance Pruning and Collision Detection Optimization
The current optimal connectable distance is maintained for the candidate nearest-neighbor set. For the candidate node , the Euclidean distance is calculated first:
If (), the candidate cannot update the current optimal value, and the collision detection is skipped;
If (), the collision detection of the connecting line segment is executed only at this time, and is updated if there is no collision.
Finally, is taken, and the corresponding node is regarded as the “nearest connectable node”.
(4) Multi-condition Triggering Logic
To avoid premature switching or long-term non-triggering, the following triggering criterion is constructed:
where
is the current number of iterations,
is the minimum effective exploration iteration threshold, and
is a uniformly random number. The three conditions are used to suppress invalid switching in the initial stage, limit the action range of switching, and adaptively adjust the strength of goal guidance according to
, thereby improving the stability of the switching mechanism.
(5) Rate of Change Analysis
The derivative of Equation (
1) in the interval (
) is obtained as:
When D decreases, increases to , thus enhancing goal guidance in the short-distance stage and accelerating the state switching when the two trees approach each other. When , and , and the two trees cover the unknown space in a relatively independent expansion mode; when , increases gradually to conduct a tentative approach with a low probability; when , increases and approaches , realizing the rapid convergence of the two trees.
In summary, this strategy enables the strength of goal bias to be adaptively adjusted with the connectable distance between trees and the search stage through nonlinear switching probability, KD-Tree nearest-neighbor acceleration and multi-condition triggering; provides trigger conditions and switching logic for subsequent expansion strategies; and thus reduces invalid expansion. The principle of bi-tree cooperative adaptive switching is shown in
Figure 4.
3.2. Goal-Guided Expansion Strategy
Aimed at the problem that traditional goal-biased sampling is prone to generating a large number of invalid expansions in complex scenarios, which further leads to search stagnation, this strategy introduces dynamic target binding, adaptive step size and a multi-constraint feasibility check to improve the probability of effective expansions toward the target, thus accelerating the convergence of the two trees.
(1) Dynamic Target Binding
Let L be the distance between the nearest nodes of the two trees and be the coordination threshold. When (separate exploration), the start tree takes the goal as the expansion target and the goal tree takes the start as the expansion target to maintain global exploration of the unknown space; when (cooperative convergence), the target of is switched to the nearest connectable node in , and is bound to the corresponding nearest connectable node in at the same time, prompting the two trees to converge rapidly along the shortest connectable direction.
(2) Goal-guided Expansion and Adaptive Step Size
Given the current expansion point
and the expansion target
, the original direction vector is defined as:
and the unit expansion direction is obtained by normalization:
where
is the Euclidean norm. To enable the step size to adaptively change with the local obstacle congestion degree, an environmental complexity index
is introduced in this paper to describe the “local congestion level” of the neighborhood of the current expansion point. Specifically, a local evaluation neighborhood
is taken with
as the center (
is a fixed evaluation scale with the same dimension as the planning scale), and two types of information including obstacle proportion and near-obstacle safety distance are counted in this neighborhood: let
K detection points
be distributed in
, then the obstacle proportion is defined as
; at the same time, the normalized term of the nearest obstacle safety distance is calculated as
. Finally, the environmental complexity index is obtained as:
A larger
indicates a more crowded local area or a closer distance to obstacles, while a smaller
indicates a more open local area. Based on
, the expansion step size is mapped to a continuous function
within the interval
: a larger step size is adopted in simple environments (
) to accelerate expansion; a smaller step size is adopted in complex environments (
) to enhance obstacle avoidance; and linear interpolation is used for compromise processing in the transition region (
). The candidate new node is:
If the current expansion tree is switched to the other tree, the current node of this tree is taken as
, and the recalculation is performed according to Equations (
4)–(
7).
(3) Expansion Deviation Angle and Basis for Direction Correction
To quantify the consistency between the local expansion direction and the global target direction, the expansion deviation angle is introduced:
where
denotes the global target of the current expansion tree. A smaller
indicates that the expansion direction is closer to the global connectable direction and the path redundancy is lower; a larger
indicates an obvious deviation, which triggers local multi-sector expansion for direction correction. If the denominator is 0,
is set or the calculation is skipped.
(4) Expansion Success Score
An expansion success score based on step size and safety distance is constructed:
where
are adjustment coefficients;
is the distance from the new node to obstacles; and
is the maximum value of this statistic in the scenario. Equation (
9) shows that an excessively large step size will reduce the success score, while a larger average distance to obstacles will increase the success score. Therefore, introducing the environmental complexity
into
can adaptively balance the expansion speed and obstacle avoidance safety.
It should be emphasized that is an environmental quantity obtained online by local geometric/collision detection statistics, rather than a manual parameter tuning for a single scenario; it maintains a large step size in open areas to improve exploration efficiency and automatically shortens the step size in crowded/near-obstacle areas to reduce invalid expansions, thus improving the stability and transferability of goal-guided expansion. Moreover, in Algorithm 1 is used to calculate the environmental complexity index of the current node neighborhood; maps to the step size , thereby maintaining a large step size in open areas and automatically shortening the step size in crowded/near-obstacle areas to improve the probability of feasible expansion.
In summary, this strategy promotes the cooperative convergence of the two trees through dynamic target binding and reduces invalid expansions by combining adaptive step size and a multi-constraint feasibility check, thus rapidly improving the connection efficiency under the constraint of safety distance; if the expansion fails, it switches to local multi-sector to generate new feasible expansion directions. The flow of goal-guided expansion is shown in Algorithm 1.
| Algorithm 1 Goal-guided expansion. |
- 1:
- 2:
if then - 3:
if Root - 4:
else - 5:
- 6:
- 7:
if then return null - 8:
- 9:
- 10:
- 11:
- 12:
If then - 13:
- 14:
- 15:
- 16:
return - 17:
return null
|
3.3. Local Multi-Sector Refined Expansion
When the goal-guided expansion fails, this strategy takes the direction from the current node to the target point as the center, constructs multi-sector partitions in its forward half-plane, generates candidate points combined with a 2D Halton low-discrepancy sequence, and selects the optimal expansion node through multi-objective comprehensive evaluation, thus improving the obstacle avoidance capability.
In the polar coordinate system of
, a core sector centered on the target direction is constructed:
where
is the half-angle of the core sector. The denser the obstacles are, the smaller the value of
is, so as to improve the sampling concentration in the target direction and reduce invalid attempts in the blocked direction.
M groups of symmetric sectors are constructed on both sides of the core sector, and the angle interval of the
m-th group is:
with:
where
is the angle increment. In each sector sub-region, a 2D Halton sequence
is used to generate candidate points
. Let
, then the polar radius and polar angle of the candidate points are:
Thus:
where
is the upper limit of the expansion step size,
is the inner safety radius, and
is the sector angle boundary.
Feasibility constraints are imposed on the candidate points:
where
denotes the distance from the candidate point to the nearest obstacle,
is the safety distance threshold, and
is the distance from the candidate point to the target point. The candidate point can be used as an expansion node only after passing the subsequent line segment collision detection, where
To adaptively adjust the exploration intensity of different sectors, an obstacle density index is introduced:
where
is the area of the sector,
is the set of obstacles in the sector, and
is the scale coefficient; a larger
indicates a more crowded area, and thus the sampling allocation and expansion priority are reduced.
The allocated number of sampling points is constructed as:
To screen expansion nodes that take into account safety, linearity, goal guidance and smoothness from the candidate points, a multi-objective comprehensive evaluation function is constructed:
where
is the weight vector, and each sub-item is defined as follows:
(1) Safety Score
where
if
,
is set to avoid division by zero.
takes the maximum value over the entire set of candidate points.
(2) Linearity Score
where
is the reference line from
to
and
is the normalized distance.
(4) Smoothness Score
where the turning angle is given by the included angle among the parent node
, the current node
and the candidate point
:
is the direction vector from the parent node to the current node, is the direction vector from the current node to the candidate point, and is the Euclidean norm of the vector. If the denominator is 0, is set, and interval truncation is performed on the cosine term to ensure numerical stability. Finally, the candidate point that maximizes is selected as the expansion node.
In terms of weight determination and numerical setting, the priority of “safety > goal guidance > (linearity ≈ smoothness)” is mapped to:
and the normalized weight is obtained as:
A unified weight vector is adopted in the experiments of this paper:
corresponding to safety, linearity, goal guidance and smoothness, respectively. For different scenario preferences,
p can be adjusted on the premise of keeping the normalization rule unchanged.
This strategy enhances the effective exploration of the forward space through “multi-sector partitioning –low-discrepancy sampling –multi-objective screening”, and complements the goal-guided expansion, thus suppressing redundancy and alleviating the stagnation of unidirectional expansion. When the local multi-sector expansion still fails, the sampling-point-guided expansion is triggered to provide continuous and feasible detour directions. The principle of local multi-sector refined expansion is shown in
Figure 5.
3.4. Sampling-Point-Guided Expansion Strategy
When the local multi-sector expansion still fails, this paper adopts an advancement mode of “random sampling point guidance + direction normalization + fixed step size” and performs a feasibility check under the constraints of collision, safety distance and boundary to continue advancing and to improve the probability of finding a feasible path.
Let the current expansion node be
and the random sampling point be
, then the original expansion direction vector is:
To eliminate the influence of sampling distance on expansion stability, the normalized direction vector is defined as:
When
, the current expansion is terminated to avoid generating duplicate nodes; otherwise, a new node is generated with a fixed step size
s:
Let
and define the expansion line segment
. To ensure feasibility and safety, collision detection is performed on the line segment L, and the minimum safety distance and boundary constraints are judged:
where
indicates that the line segment L has no intersection with the obstacle set Obs and
,
are the planning space boundaries. When
,
is added to the search tree; otherwise, the expansion is discarded.
To facilitate subsequent path quality evaluation and local rewiring, the safety and target connectivity are recorded synchronously when the node is inserted:
where
is the index of the new node and
is the index of the parent node.
In summary, as a fallback strategy when both goal-guided expansion and local multi-sector expansion are blocked, this strategy realizes stable advancement through direction normalization and fixed step size and screens safe nodes by combining collision detection, safety distance and boundary constraints, providing information support for subsequent reconnection and path evaluation; its flow is shown in Algorithm 2.
| Algorithm 2 Sampling-point-guided expansion. |
- 1:
- 2:
if then return null - 3:
- 4:
if or - 5:
then return null - 6:
- 7:
if not then return null - 8:
- 9:
if then return null - 10:
- 11:
- 12:
- 13:
return
|
3.5. B-Spline Optimization and Smoothing Technology
To eliminate sharp inflection points and local discontinuous segments in the initial polyline path, this paper performs interpolation densification on the path points under the premise of satisfying collision-free and safety distance constraints and adopts a B-spline curve for fitting to obtain a continuous, smooth and trackable trajectory.
Given the maximum allowable spacing
between adjacent control points, when the Euclidean distance
between two adjacent points is greater than
, intermediate control points are inserted in the interval
, and the number of inserted points is:
where
is the ceiling function. The
k-th interpolation point is obtained by linear interpolation:
When
, no intermediate points are inserted, and only the endpoints
are retained. The set of control points
is obtained after interpolation, and an
m-order B-spline is used for geometric smoothing of the path. Let the non-decreasing node vector be
, and the first-order basis function is:
The high-order basis function is derived by the Cox-deBoor recursion:
If the denominator is 0, the coefficient of the corresponding term is set to 0. In the parameter interval
, the smooth trajectory
is:
where
M is the number of control points and
is the
j-th control point. Due to the convex hull property of B-spline, the trajectory point corresponding to any
satisfies:
where
denotes the convex hull of the control point set. To ensure that the smoothed path still satisfies the collision-free and safety distance constraints, collision detection is performed on the smoothed curve; if a collision occurs, iterative processing is conducted by further densifying control points, reducing the order or falling back to the original polyline path until the constraints are satisfied.
Furthermore, to adapt to different path characteristics, the key parameters are set adaptively:
(1) Maximum segment length setting
The maximum segment length is set according to the total path length
, the segment adjustment coefficient
and the basic segment length upper limit
:
A longer path can appropriately increase the segment length to reduce the number of segments; a larger results in a smaller segment length, thus improving the local fitting accuracy and smoothness.
(2) Adaptive selection of B-spline order
The B-spline order is adaptively selected according to the number of control points:
where
is the control point threshold;
and
correspond to short and long paths respectively; and
denotes scenarios such as narrow passages and sharp turns, where the order is reduced to suppress local oscillation.
(3) Adaptive parameter sampling point sequence construction
To improve the discrete resolution in high-curvature regions, an adaptive parameter sampling point sequence is constructed:
where
is the maximum curvature of the path,
is the curvature sensitivity coefficient, and
denotes equidistant sampling in the interval. A larger curvature or a higher
results in denser sampling, thus obtaining a trajectory with smoother dynamic changes in the turning region.
In summary, this paper improves the continuity and smoothness of the trajectory through interpolation densification, B-spline fitting and adaptive parameter adjustment, and naturally connects with the path generated by the aforementioned expansion strategies, providing a more kinematically constrained trackable trajectory for mobile robots. The comparison of paths before and after B-spline optimization is shown in
Figure 6.
3.6. Basis for Parameter Setting and Sensitivity Discussion
BGSE-RRT* involves parameters such as thresholds, probabilities, step sizes and weights. To avoid environment-specific “manual parameter tuning”, this paper reuses the same set of parameters and constraints for repeated comparisons in six simulation environments and ROS/real-robot joint validation and explains the parameters in groups according to “sensitivity –meaning –influence direction”. Among them, is the bi-tree cooperative switching probability function (adjusting the rhythm of exploration/convergence), and is the multi-objective evaluation weight (only affecting the sorting of candidate points without changing the collision/safety feasibility check).
(1) Safety and Feasibility Parameters (High Sensitivity, Hard Constraints)
Including , boundary constraints and collision criteria: The expansion edge is directly rejected if it collides with obstacles or the safety distance is insufficient. An increase in improves the safety margin but compresses the feasible space and may increase the planning time; a decrease in has the opposite effect but reduces the safety redundancy. Such parameters are determined by the robot size, sensing error and safety boundary and can be transferred across scenarios.
(2) Cooperative Switching and Convergence Parameters (Medium Sensitivity, Stage Control)
The switching probability is given by the piecewise function driven by distance feedback, which is only determined by a small number of scale parameters including , and : tends to as D decreases to enhance convergence and takes when to maintain exploration; is used to suppress early jitter. The target binding threshold , adaptive step size and direction normalization further reduce the scale sensitivity.
(3) Multi-objective Weights and Post-processing (Low Sensitivity, Preference Items; Including Safety Distance Rechecking)
Normalized multi-objective evaluation is adopted for multi-sector expansion, and a unified weight vector is used in this paper; is only used to sort the candidate points that pass the collision/safety screening and usually does not change the feasibility. B-spline smoothing is set adaptively, and collision and safety distance rechecking are performed after smoothing. If necessary, densification/reduction in order/fallback are adopted to ensure that the final trajectory is safe and feasible.
In summary, this paper organizes the parameters with a “three-layer desensitization” mechanism: hard constraints ensure feasibility, medium-sensitivity parameters analytically control exploration/convergence, and the remaining parameters are desensitized through normalization/adaptation and safety distance rechecking; combined with the reuse of unified parameters, it can be shown that the performance gain mainly comes from structural and mechanism design rather than manual parameter tuning.
5. Conclusions
BGSE-RRT* establishes a bi-tree cooperative adaptive switching mechanism to achieve a dynamic balance between global exploration and local convergence during searching. Under a multi-constraint feasibility check, it adopts goal-guided expansion to accelerate bi-tree convergence and improve connection efficiency. When goal-guided expansion is blocked, BGSE-RRT* switches to local multi-sector refined expansion, generating candidate expansion points and advancing through selection to enhance obstacle avoidance. If the multi-sector mechanism still fails, sampling-point-guided expansion is activated to suppress invalid exploration and maintain forward progress until a feasible path is found. Finally, B-spline smoothing is applied to improve trajectory continuity.
With these complementary mechanisms, BGSE-RRT* achieves up to an 84.71% reduction in planning time in dense small-obstacle environments, with shorter paths and 39–48% increases in safety distance. In narrow-passage environments, planning time is reduced by 7–34% and safety distance is increased by 10–21%. In ROS and real-robot joint verification, the average planning time is reduced by 30.68%, the path is safer and easier to track, and both iteration count and number of nodes are reduced; a real-robot run takes about 26 s with a 100% trajectory-tracking success rate.
This work primarily targets two-dimensional static planar environments, and systematic extensions to dynamic obstacles and multi-robot cooperative planning remain to be developed. Although the algorithm contains multiple threshold and probability parameters, the risk of manual tuning has been mitigated by using a unified parameter set across all environments and by providing a hierarchical sensitivity discussion; nevertheless, stronger adaptivity and broader verification are still needed under more complex tasks and platform constraints.
Future work will introduce motion prediction and online replanning with explicit evaluation of real-time performance and safety. We will also extend the framework to multi-robot cooperative planning by integrating task allocation and collision-avoidance constraints and will further generalize the goal-guided and multi-sector selection mechanisms to higher-dimensional problems by combining them with spatiotemporal trajectory optimization.