An Adaptive Rapidly-Exploring Random Trees Algorithm Based on Cross-Entropy Optimization

Zhao, Duo; Tang, Qichao; Ma, Lei; Sun, Yongkui; Lei, Jieyu

doi:10.3390/a18100615


by Duo Zhao ¹,*, Qichao Tang ², Lei Ma ¹, Yongkui Sun ¹ and Jieyu Lei ¹

¹ School of Electrical Engineering, Southwest Jiaotong University, Chengdu 611756, China

² School of Electrical Engineering and Information, Southwest Petroleum University, Chengdu 610500, China

* Author to whom correspondence should be addressed.

Algorithms 2025, 18(10), 615; https://doi.org/10.3390/a18100615

Submission received: 13 August 2025 / Revised: 25 September 2025 / Accepted: 26 September 2025 / Published: 29 September 2025

(This article belongs to the Special Issue Swarm Intelligence and Evolutionary Algorithms for Real World Applications (2nd Edition))


Abstract

In this paper, a novel adaptive rapidly-exploring random trees algorithm based on cross-entropy optimization (CE-RRT) is proposed. We seek to provide a low-cost, fast, and effective solution for robot path planning in various complex environments. First, an adaptive sampling strategy is introduced to make the search directional. Then, an adaptive step adjustment strategy is proposed to improve the search efficiency of the algorithm. Finally, the cross-entropy algorithm is introduced to optimize redundant nodes in feasible paths and improve path quality. To verify the feasibility and effectiveness of the proposed algorithm, it is used to solve path planning problems in two two-dimensional environments and one three-dimensional environment. The RRT and RRT* algorithms are used as benchmarks to measure the effectiveness of the three optimization strategies. The simulations demonstrate that the proposed CE-RRT algorithm can effectively improve search efficiency and path quality (paths shortened by 26%, 22.70%, and 49.11%), and that it exhibits particularly strong robustness in three-dimensional environments. In addition, the proposed CE-RRT algorithm can plan a reasonable path for a dual-robot system on the dual Sawyer simulation platform.

Keywords:

rapidly-exploring random trees; sampling-based algorithms; cross-entropy optimization; optimal path planning


1. Introduction

Motion planning is an important research topic in robotics, covering both mobile robots and manipulators [1]. Motion planning for a robot refers to planning a collision-free path within its working environment [2]. In most robot application scenarios, including but not limited to robot navigation [3], industrial automation [4], and autonomous vehicles [5], the performance of the motion planning algorithm is of great significance. Once a robot has identified the task it needs to perform, it must plan a viable path quickly and efficiently. Research on robot motion planning algorithms has therefore received continuous and extensive attention.

Robot motion planning algorithms fall into three main categories: graph-based search algorithms, sampling-based algorithms, and intelligent bionic path planning algorithms. Commonly used graph-based search algorithms include Dijkstra’s algorithm, the A* algorithm [6], and the D* algorithm [7]. These methods build maps containing known environment and obstacle information and then use graph theory to find a feasible path from start to goal in a discrete space. They are guaranteed to find the optimal path on the discretized map if one exists. However, their computational efficiency is low when maps are complex, making them unsuitable for path planning in high-dimensional spaces. Intelligent bionic planning algorithms primarily include neural networks [8], genetic algorithms [9], and ant colony optimization [10]. These algorithms can give robots some autonomy and intelligence in complex, unknown environments. However, they require lengthy search times, are prone to local extrema, and are difficult to tune.

Sampling-based path planning algorithms mainly include the probabilistic roadmap (PRM) [11,12] and rapidly-exploring random trees (RRT) [13] algorithms and their variants, which rely on random sampling rather than discretizing the configuration space [14]. These methods construct a set of trajectories by iteratively adding random samples, and the probability of finding a solution, if one exists, approaches 1 as the number of iterations approaches infinity. They have higher search efficiency and can be used to solve high-dimensional path planning problems. In addition, they can effectively handle path planning problems with non-holonomic constraints. However, the solutions obtained by these algorithms are generally not optimal and are often far from optimal, so further optimization is required. Moreover, PRM is often inefficient when the geometry of the obstacles is not known a priori.

The simplicity and efficiency of the RRT algorithm have made it widely popular and attracted the attention of numerous researchers. A series of improvements have been proposed; based on the expansion direction of the random tree, they can be divided into unidirectional and multi-directional approaches. For unidirectional tree expansion, Kalisiak et al. [15] proposed a new variation of the RRT planner that demonstrates good performance in both loosely constrained and highly constrained environments. Ichnowski et al. [16] presented a parallel RRT (PRRT) for feasible and optimal motion planning designed for modern multicore CPUs. To address the shortcomings of RRT, Karaman et al. [17] proposed the asymptotically optimal RRT* algorithm, which builds on RRT by adding random geometry and pruning optimization theory so that the nodes converge toward the current optimal value. For multi-directional tree expansion, LaValle et al. [18] proposed the bi-directional random tree (bi-RRT) for path planning of high-dimensional manipulators. RRT-connect [19] introduced a greedy expansion idea on top of bi-RRT and sets up cyclic expansion in the target random tree to speed up the connection of the two trees. The RRT algorithm has undergone long-term development and produced many research results. However, a gap remains between the feasible paths it finds and the optimal path, and further research is still needed.

The cross-entropy (CE) algorithm is a heuristic algorithm for stochastic optimization problems that has attracted attention in recent years [20]. Its core idea is to find, through a random sampling process, the predicted distribution that is the same as or closest to the true distribution [21]. It is particularly suitable for optimization problems with continuous variables, and it has lower computational complexity and stronger robustness than other intelligent optimization algorithms [22,23]. Combining the CE algorithm with the RRT algorithm to leverage the advantages of both is therefore a direction worth exploring.

Motivated by the above discussion, this paper proposes the CE-RRT algorithm, which combines the CE algorithm with the RRT algorithm for more efficient, higher-quality path planning. The main contributions of this work are summarized as follows:

(1): An adaptive RRT algorithm based on cross-entropy optimization (CE-RRT) is proposed by introducing three improved strategies. Firstly, an adaptive sampling strategy is introduced to make the search directional. Secondly, an adaptive step adjustment strategy is proposed to improve the search efficiency of the algorithm; it can adaptively adjust the growth step based on obstacle information. Thirdly, a path smoothing method is introduced to reduce redundant nodes in the path, and then the CE algorithm is used to optimize the smoothed path and improve its quality.
(2): A two-dimensional maze environment, a two-dimensional cluttered environment, and a three-dimensional environment are used to test the CE-RRT algorithm. The RRT algorithm is used as a benchmark to measure the effectiveness of the three proposed optimization strategies. The simulation results show that adaptive sampling and adaptive step adjustment strategies can effectively increase the search speed of the algorithm, and the CE algorithm can also significantly improve path quality.

The article consists of six sections, including the introduction (Section 1). Section 2 defines the motion planning problem and introduces the RRT and CE algorithms in more detail. Section 3 proposes the CE-RRT algorithm, and performance tests are presented in Section 4. Section 5 presents a dual-robot welding path planning simulation experiment. Section 6 briefly summarizes the results and outlines future work.

2. Background

This section defines the motion planning problem and provides a more detailed introduction to the RRT algorithm and CE algorithm. These algorithms form the basis of the proposed CE-RRT algorithm.

2.1. Problem Definition

Define $X \subseteq ℝ^{d}$ as the configuration space, $X_{obs}$ as the obstacle region, and $X_{free} = X \setminus X_{obs}$ as the collision-free region (the closed area between $X$ and $X_{obs}$), where d is the dimension of the configuration space, i.e., the number of degrees of freedom of the robot. The tuple $(x_{init}, X_{goal}, X_{obs}, X_{free})$ defines a path planning problem, where $x_{init} \in X_{free}$ is the initial state and $X_{goal} \subset X_{free}$ is the goal region. Once a path point reaches the interior of the goal region $X_{goal}$, the goal state is considered reached. A schematic diagram for a two-dimensional configuration space is shown in Figure 1. Based on the above, a collision-free path, feasible path planning, and optimal path planning are defined as follows:

Definition 1. (Collision-free path): Define a bounded continuous function $f : [0, 1] \to X$ as a path. A path $f(τ)$ is collision-free if $\forall τ \in [0, 1], f(τ) \in X_{free}$.

Definition 2. (Feasible path planning): For a given path planning problem $(x_{init}, X_{goal}, X_{obs}, X_{free})$, feasible path planning is to find a feasible path $f(τ)$ such that $f(τ)$ is collision-free, $f(0) = x_{init}$, and $f(1) \in X_{goal}$.

Definition 3. (Optimal path planning): For a given path planning problem $(x_{init}, X_{goal}, X_{obs}, X_{free})$, optimal path planning is to find a feasible path $f^{*}$ that minimizes the cost, that is, $φ(f^{*}) = \min \{φ(f) : f \text{ is feasible}\}$, where $φ(f)$ is the cost function of path f.

2.2. RRT Algorithm

The RRT algorithm is a sampling-based path planning algorithm; its main process is shown in Algorithm 1. The inputs are the environment information $\{X_{obs}, X_{free}\}$, the initial state $x_{init}$, the goal region $X_{goal}$, the growth step $ε$, and the maximum number of iterations $k_{\max}$. The output is a feasible path $f$ from $x_{init}$ to $X_{goal}$. First, the random tree $x_{tree}$ is initialized and the initial node $x_{init}$ is added to it (steps 1~2). Then, steps 3~15 are repeated until a feasible path is found or the maximum number of iterations $k_{\max}$ is reached. In each iteration, a node $x_{rand}^{i}$ is randomly generated in the collision-free area $X_{free}$ (step 4), and the node $x_{near}^{i}$ closest to $x_{rand}^{i}$ is found on the random tree $x_{tree}$ (step 5). Next, starting from $x_{near}^{i}$, a new node $x_{new}^{i}$ is generated along the direction from $x_{near}^{i}$ to $x_{rand}^{i}$ with step size $ε$ (step 6). In step 7, the segment $x_{near}^{i} \to x_{new}^{i}$ is checked for collisions. If no collision occurs, the flag is set to s = 0 and the new node $x_{new}^{i}$ is added to the random tree $x_{tree}$ (steps 8~10). In steps 11~14, when the new node $x_{new}^{i}$ reaches the goal area $X_{goal}$, the feasible path f is extracted and the loop ends.

Based on the above description, the RRT algorithm has the following drawbacks. First, the search direction is determined by the random node $x_{rand}^{i}$, which is completely random; without a direction guidance mechanism, computing resources are easily wasted. Second, the growth step $ε$ is fixed: too small an $ε$ significantly reduces computational efficiency, while too large an $ε$ easily causes collisions, and there is no mechanism for adapting the step to the environment. In addition, the paths generated by the RRT algorithm often differ significantly from the optimal path, and existing smoothing operations are mostly based on greedy strategies and lack global optimization capability. A series of improvement strategies is therefore needed to address these shortcomings.

Algorithm 1: RRT algorithm
Input: Environment information $\{X_{obs}, X_{free}\}$ , initial state $x_{init}$ , goal region $X_{goal}$ , maximum number of iterations $k_{\max}$ , growth step $ε$
Output: Feasible path $f$
1	$x_{tree} \leftarrow Ø$
2	$x_{tree} \leftarrow [x_{tree}, x_{init}]$
3	for $i \leftarrow 1$ to $k_{\max}$ do
4	$x_{rand}^{i} \leftarrow$ Randomly generated in $X_{free}$
5	$x_{near}^{i} \leftarrow GetNearPoint (x_{rand}^{i}, x_{tree})$
6	$x_{new}^{i} \leftarrow GetNewPoint (ε, x_{rand}^{i}, x_{near}^{i})$
7	$s \leftarrow CollisionDetection (x_{near}^{i}, x_{new}^{i}, X_{obs}, X_{free})$
8	if ( $s = = 0$ , no collision) then
9	$x_{tree} \leftarrow [x_{tree}, x_{new}^{i}]$
10	end if
11	if (arrived $X_{goal}$ ) then
12	$f \leftarrow GetPath (x_{init}, X_{goal}, x_{tree})$
13	break
14	end if
15	end for
16	Return: $f$
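As an illustration, Algorithm 1 can be sketched in Python for a 2D workspace. This is a minimal sketch under stated assumptions, not the authors' implementation: obstacles are modelled as circles, the goal region as a disc, samples that fall inside an obstacle are simply rejected, and the names `rrt`, `collision`, and `segment_hits_circle` are hypothetical.

```python
import math
import random


def segment_hits_circle(p, q, center, radius):
    """Return True if segment p-q comes within `radius` of `center`."""
    dx, dy = q[0] - p[0], q[1] - p[1]
    seg_len2 = dx * dx + dy * dy
    if seg_len2 == 0.0:
        t = 0.0
    else:
        # Parameter of the point on the segment closest to the circle centre.
        t = ((center[0] - p[0]) * dx + (center[1] - p[1]) * dy) / seg_len2
        t = max(0.0, min(1.0, t))
    nearest = (p[0] + t * dx, p[1] + t * dy)
    return math.dist(nearest, center) <= radius


def collision(p, q, obstacles):
    """Collision flag s: 0 if segment p-q is free, 1 otherwise (step 7)."""
    return int(any(segment_hits_circle(p, q, c, r) for c, r in obstacles))


def rrt(x_init, goal, goal_radius, obstacles, bounds, eps, k_max, rng):
    """Basic RRT; returns the node list from x_init to the goal disc, or None."""
    lo, hi = bounds
    tree = [x_init]          # steps 1-2
    parent = [None]          # parent[i] is the index of node i's parent
    for _ in range(k_max):   # step 3
        x_rand = (rng.uniform(lo, hi), rng.uniform(lo, hi))       # step 4
        if any(math.dist(x_rand, c) <= r for c, r in obstacles):
            continue         # assumption: reject samples outside X_free
        i_near = min(range(len(tree)),
                     key=lambda i: math.dist(tree[i], x_rand))    # step 5
        x_near = tree[i_near]
        d = math.dist(x_near, x_rand)
        if d == 0.0:
            continue
        x_new = (x_near[0] + eps * (x_rand[0] - x_near[0]) / d,
                 x_near[1] + eps * (x_rand[1] - x_near[1]) / d)   # step 6
        if collision(x_near, x_new, obstacles) == 0:              # steps 7-10
            tree.append(x_new)
            parent.append(i_near)
            if math.dist(x_new, goal) <= goal_radius:             # steps 11-14
                path, i = [], len(tree) - 1
                while i is not None:
                    path.append(tree[i])
                    i = parent[i]
                return path[::-1]
    return None
```

A call such as `rrt((5.0, 5.0), (45.0, 45.0), 3.0, [((25.0, 25.0), 5.0)], (0.0, 50.0), 2.0, 3000, random.Random(1))` returns either a list of nodes or `None` if the iteration budget runs out first.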

2.3. CE Algorithm

The core idea of the CE algorithm is to find, through a random sampling process, the predicted distribution that is the same as or closest to the true distribution; a normal distribution is mainly used. The traditional CE algorithm is listed in Algorithm 2 [24]. First, appropriate initial values $μ_{0}$ and ${σ^{2}}_{0}$ are selected for the normal distribution based on the range of the variable (steps 1~2). A larger initial variance can be chosen so that the normal distribution covers a larger search range. The main loop then runs until ${σ^{2}}_{t} \leq η$ (steps 4~5); when the variance has converged to such a small value, the predicted distribution is very close to the true distribution. In each iteration, N samples are generated from the normal distribution $x_{t} \sim N(μ_{t - 1}, {σ^{2}}_{t - 1})$ (step 6), and their fitness values are calculated (step 7). After the fitness values are sorted (step 8), the $N_{e}$ samples with the lowest fitness are selected as elite samples (step 9) and used to update and smooth the parameters for the next iteration (steps 10~12). When the termination criterion is met, the mean $μ_{t}$, which approximates the optimal solution, is output as the optimization result.

Algorithm 2: CE optimization algorithm
Input: Initial mean $μ_{0}$ , variance ${σ^{2}}_{0}$ , sample size N, elite sample rate $ρ$ , smoothing coefficient $α$
Output: Mean $μ_{t}$ of generation t
1	$μ_{0} \leftarrow$ Randomly generate a rational number
2	${σ^{2}}_{0} \leftarrow$ Randomly generate a rational number
3	$N_{e} \leftarrow ⌊ρ \times N⌋$ , ( $⌊\cdot⌋$ ) represents rounding down
4	$t \leftarrow 1$ , $η \leftarrow 1.0 \times 10^{- 10}$
5	while ${σ^{2}}_{t} > η$ do
6	$x_{t} = [x_{1}, x_{2}, \dots, x_{N}] \leftarrow$ Generate N samples based on normal distribution
7	$φ_{t} = [φ_{1}, φ_{2}, \dots, φ_{N}] \leftarrow$ Calculate fitness function values
8	$[φ_{t}, I] \leftarrow s o r t (φ_{t})$
9	$[φ_{t}^{'}, I_{e}] \leftarrow$ Select $N_{e}$ individuals with smaller fitness
10	Parameter updating: $μ_{t} \leftarrow \sum_{i = 1}^{N_{e}} x_{I_{e i}} / N_{e}$ , ${σ^{2}}_{t} \leftarrow \sum_{i = 1}^{N_{e}} {(x_{I_{e i}} - μ_{t})}^{2} / N_{e}$
11	Smoothing operation: $μ_{t} \leftarrow α μ_{t} + (1 - α) μ_{t - 1}$ , ${σ^{2}}_{t} \leftarrow α {σ^{2}}_{t} + (1 - α) {σ^{2}}_{t - 1}$
12	$t \leftarrow t + 1$
13	end while
14	Return: $μ_{t}$
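The loop in Algorithm 2 can be sketched for a scalar variable as follows. This is an illustrative sketch rather than a reference implementation: the function name `ce_minimize`, the default parameter values, and the `max_iter` safety cap are assumptions that do not appear in the original.

```python
import random


def ce_minimize(fitness, mu0, var0, n=50, rho=0.2, alpha=0.7,
                eta=1e-10, max_iter=1000, rng=None):
    """Cross-entropy minimization of a scalar function with a normal sampler."""
    rng = rng or random.Random(0)
    n_e = max(1, int(rho * n))        # step 3: number of elite samples
    mu, var = mu0, var0
    for _ in range(max_iter):         # max_iter is a safety cap (assumption)
        if var <= eta:                # step 5: stop once the variance is tiny
            break
        # Step 6: draw N samples from N(mu, var).
        xs = [rng.gauss(mu, var ** 0.5) for _ in range(n)]
        # Steps 7-9: rank by fitness and keep the N_e best (smallest) ones.
        elite = sorted(xs, key=fitness)[:n_e]
        # Step 10: mean and variance of the elite samples.
        mu_e = sum(elite) / n_e
        var_e = sum((x - mu_e) ** 2 for x in elite) / n_e
        # Step 11: smooth the update with coefficient alpha.
        mu = alpha * mu_e + (1 - alpha) * mu
        var = alpha * var_e + (1 - alpha) * var
    return mu
```

For instance, `ce_minimize(lambda x: (x - 3.0) ** 2, mu0=0.0, var0=25.0)` should return a value close to 3, since the sampling distribution contracts around the minimizer.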

3. The Proposed CE-RRT Algorithm

This section introduces the proposed CE-RRT algorithm, including the main process and algorithm details.

3.1. Main Procedure of the Proposed CE-RRT Algorithm

The main procedure of the proposed algorithm is described in Algorithm 3; the framework builds on the RRT algorithm. First, the random tree $x_{tree}$ is initialized, the initial node $x_{init}$ is added to $x_{tree}$, and the collision flag s is initialized, where s = 0 indicates that the newly generated segment is collision-free (steps 1~2). Then, steps 3~16 are repeated until a feasible path is found or the maximum number of iterations $k_{\max}$ is reached. To address the shortcomings of the RRT algorithm, three improvement strategies are proposed in CE-RRT. First, an adaptive sampling strategy adds a sampling direction guidance mechanism to counter the low efficiency caused by purely random sampling (step 4). Second, an adaptive step size adjustment strategy mitigates the low efficiency or frequent collisions caused by a fixed step (step 6). Third, a path smoothing method is applied to the feasible path f to reduce redundant nodes (step 17); the CE algorithm is then used to optimize the smoothed path $f_{smooth}$ and obtain a higher-quality feasible path $f_{optimal}$ (step 18). The following section describes the proposed improvement strategies in detail.

Algorithm 3: CE-RRT algorithm
Input: $\{x_{i n i t}, X_{g o a l}, X_{o b s}, X_{f r e e}\}$ , iterations $k_{\max}$ , initial step $ε_{0}$ , sampling probability $p_{a}$ , step variations $Δ ε_{1}$ and $Δ ε_{2}$
Output: $f_{o p t i m a l} \leftarrow$ The optimal path from $x_{i n i t}$ to $X_{g o a l}$
1	$x_{t r e e} \leftarrow Ø, x_{t r e e} \leftarrow [x_{t r e e}, x_{i n i t}]$
2	$s \leftarrow 0$ , collision sign, no collision
3	for $i \leftarrow 1$ to $k_{\max}$ do
4	$x_{r a n d}^{i} \leftarrow AdaptiveSample (p_{a}, s, X_{g o a l}, X_{f r e e})$
5	$x_{n e a r}^{i} \leftarrow GetNearPoint (x_{r a n d}^{i}, x_{t r e e})$
6	$ε_{i} \leftarrow AdaptiveStep (ε_{i - 1}, Δ ε_{1}, Δ ε_{2}, s)$
7	$x_{n e w}^{i} \leftarrow GetNewPoint (ε_{i}, x_{r a n d}^{i}, x_{n e a r}^{i})$
8	$s \leftarrow CollisionDetection (x_{n e a r}^{i}, x_{n e w}^{i}, X_{o b s}, X_{f r e e})$
9	if ( $s = = 0$ , no collision) then
10	$x_{t r e e} \leftarrow [x_{t r e e}, x_{n e w}^{i}]$
11	end if
12	if (arrived $X_{g o a l}$ ) then
13	$f \leftarrow GetPath (x_{i n i t}, X_{g o a l}, x_{t r e e})$
14	break
15	end if
16	end for
17	$f_{s m o o t h} \leftarrow PathSmooth (f, X_{o b s}, X_{f r e e})$
18	$f_{o p t i m a l} \leftarrow CEOptimize (f_{s m o o t h}, X_{o b s}, X_{f r e e})$
19	Return: $f_{o p t i m a l}$

3.2. Details of Improvement Strategies

This section explains the details of the three proposed improvement strategies.

3.2.1. Adaptive Sampling Strategy

In the RRT algorithm, the random point $x_{rand}$ is generated completely at random, without any directional guidance. As a result, many sampling points are wasted and the search efficiency of the algorithm is reduced. Therefore, a sampling strategy with directional guidance is introduced; it adaptively adjusts the sampling direction based on the obstacle information $X_{obs}$. The details of this sampling strategy are listed in Algorithm 4.

First, the sampling flag r is assigned in steps 1~10: when r = 1, direction-guided sampling is performed, and when r = 2, random sampling is performed. In steps 2~7, if the node generated in the previous iteration did not collide, direction-guided sampling is selected with probability $p_{a}$ at the i-th iteration. In steps 8~10, random sampling is always selected when $x_{new}^{i - 1}$ collided. This is what makes the sampling strategy adaptive. Then, sampling is performed in steps 11~15: when r = 1, $x_{rand}^{i} \leftarrow X_{goal}$ and the random tree grows toward the goal area; when r = 2, a new $x_{rand}^{i}$ is randomly selected from $X_{free}$. Note that if the random tree always grew toward the goal area, it would easily become trapped in a local extremum when encountering obstacles. Sampling toward the goal only with probability $p_{a}$ avoids this problem to a certain extent.

The probability $p_{a}$ for directional sampling was determined through systematic experimentation. We tested values across the range [0.1, 0.9] in the 2D and 3D environments. The results show that $p_{a}$ = 0.7 consistently provided the best balance between directed exploration (improving search speed) and sufficient randomness to avoid local minima and ensure robustness in diverse environments.

Algorithm 4: $AdaptiveSample (p_{a}, s, X_{g o a l}, X_{f r e e})$
Input: $\{x_{i n i t}, X_{g o a l}, X_{o b s}, X_{f r e e}\}$ , collision sign s, adaptive sampling probability $p_{a}$
Output: Random node $x_{r a n d}^{i}$
1	if ( $s = = 0$ , no collision) then
2	$a \leftarrow$ Generate a random number from [0, 1]
3	if $a < p_{a}$ then
4	$r \leftarrow 1$
5	else
6	$r \leftarrow 2$
7	end if
8	else
9	$r \leftarrow 2$
10	end if
11	if $r = = 1$ then
12	$x_{r a n d}^{i} \leftarrow X_{g o a l}$
13	else
14	$x_{r a n d}^{i} \leftarrow$ Randomly generated in $X_{f r e e}$
15	end if
16	Return: $x_{r a n d}^{i}$
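Algorithm 4 is short enough to express directly. The sketch below assumes that the goal region is represented by a single point `goal_point` and that free-space sampling is delegated to a caller-supplied function; both are illustrative choices, and `sample_free_square` is a hypothetical stand-in for sampling in $X_{free}$.

```python
import random


def adaptive_sample(p_a, s, goal_point, sample_free, rng):
    """Return the goal with probability p_a if the last step was collision-free."""
    if s == 0 and rng.random() < p_a:   # steps 1-4: r = 1
        return goal_point               # step 12: sample toward the goal
    return sample_free(rng)             # r = 2, step 14: random sample


def sample_free_square(rng, lo=0.0, hi=50.0):
    """Hypothetical X_free sampler: uniform in an obstacle-free square."""
    return (rng.uniform(lo, hi), rng.uniform(lo, hi))
```

With $p_{a}$ = 0.7 and s = 0, roughly 70% of calls return the goal point; with s = 1 the function always falls back to random sampling.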

3.2.2. Adaptive Step Adjustment Strategy

For a given path planning problem $(x_{init}, X_{goal}, X_{obs}, X_{free})$, the growth step $ε$ of the random tree is fixed in the RRT algorithm. When $ε$ is small, the solving time increases and efficiency drops; when $ε$ is large, new segments easily cross small obstacles, making it difficult to find a feasible path. Therefore, an adaptive step adjustment strategy is introduced to reach the goal region faster. The strategy increases the step where there are fewer obstacles and decreases it where there are more. The details are listed in Algorithm 5. The basic idea is simple. When the node $x_{new}^{i - 1}$ of the previous iteration is a valid, collision-free node, the step size $ε_{i}$ of the current iteration is increased to approach the goal area faster (step 2). When the node $x_{new}^{i - 1}$ is invalid, that is, it collides with an obstacle, the current step $ε_{i}$ is reduced to facilitate local search near the obstacle and find a feasible path (step 4). The step variations $Δ ε_{1}$ and $Δ ε_{2}$ are chosen according to the specific problem.

The step size adjustments $Δ ε_{1}$ (increment) and $Δ ε_{2}$ (decrement) were determined through systematic experimentation across the tested environments. We evaluated $Δ ε_{1} \in [0.1, 0.5]$ and $Δ ε_{2} \in [0.05, 0.15]$ in the 2D and 3D environments. The chosen values ($Δ ε_{1}$ = 0.2, $Δ ε_{2}$ = 0.1) consistently gave the best performance, balancing rapid expansion toward the goal when obstacles are sparse with fine-grained navigation in cluttered regions to avoid collisions. This asymmetry ($Δ ε_{1} > Δ ε_{2}$) reflects the empirical observation that accelerating expansion in open spaces yields greater efficiency gains than reducing steps during collisions, while maintaining overall robustness across diverse environments.

Algorithm 5: $AdaptiveStep (ε_{i - 1}, Δ ε_{1}, Δ ε_{2}, s)$
Input: $\{x_{i n i t}, X_{g o a l}, X_{o b s}, X_{f r e e}\}$ , $ε_{i - 1}$ , step variation $Δ ε_{1}, Δ ε_{2}$ , collision sign s
Output: $ε_{i}$
1	if ( $s = = 0$ , no collision) then
2	$ε_{i} \leftarrow ε_{i - 1} + Δ ε_{1}$
3	else
4	$ε_{i} \leftarrow ε_{i - 1} - Δ ε_{2}$
5	end if
6	Return: $ε_{i}$
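The update in Algorithm 5 is a single additive rule. The sketch below adds lower and upper bounds (`eps_min`, `eps_max`) so that repeated collisions cannot drive the step to zero or below; these bounds are an assumption for illustration and are not part of Algorithm 5.

```python
def adaptive_step(eps_prev, d_eps1, d_eps2, s, eps_min=0.1, eps_max=float("inf")):
    """Grow the step after a collision-free extension, shrink it after a collision."""
    if s == 0:
        eps = eps_prev + d_eps1   # step 2: no collision, take a longer step
    else:
        eps = eps_prev - d_eps2   # step 4: collision, take a shorter step
    # Clamping is an added safeguard (assumption), not in the original listing.
    return min(max(eps, eps_min), eps_max)
```

With the values reported above ($Δ ε_{1}$ = 0.2, $Δ ε_{2}$ = 0.1), a step of 1.0 becomes 1.2 after a collision-free extension and 0.9 after a collision.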

3.2.3. Path Optimization Strategy Based on CE Algorithm

The feasible path f obtained by the RRT algorithm often contains many path points, which makes it inconvenient to optimize directly with the CE algorithm. Therefore, a path smoothing method is first used to remove redundant path points from f; the CE algorithm is then used to optimize the smoothed path $f_{smooth}$ and obtain the optimized path $f_{optimal}$.

(1): Path smoothing method

The details of the path smoothing method are shown in Algorithm 6. First, the smooth path $f_{smooth}$ is initialized. Then, the loop in steps 2~23 iterates until the termination condition $length(f) = length(f_{smooth})$ is met. In each iteration, n is the number of nodes on the feasible path f, and steps 4~19 are the smoothing operations. When the straight segment between two nodes $f(j)$ and $f(k)$ on path f is collision-free, its length is $l_{1}$ (step 8); in steps 9~11, the length of path f between nodes $f(j)$ and $f(k)$ is accumulated as $l_{2}$. In steps 12~14, $l_{1} < l_{2}$ means that the straight segment is shorter, so the path points between $f(j)$ and $f(k)$ can be removed, and these nodes are stored in the set dp. After the loop operation of steps 4~16, all path points that may be removed are stored in dp. Subsequently, duplicate elements in dp are removed (step 17), and the path nodes contained in dp are deleted from path f (step 18). In steps 20~22, the smooth path $f_{smooth}$ is updated after a conditional check, which ends one iteration. Finally, when no more nodes can be deleted from path f, the algorithm stops iterating and outputs the smoothed path $f_{smooth}$.

Algorithm 6: $PathSmooth (f, X_{o b s}, X_{f r e e})$
Input: $\{x_{i n i t}, X_{g o a l}, X_{o b s}, X_{f r e e}\}$ , feasible path f
Output: The smoothed path $f_{s m o o t h}$
1	$f_{s m o o t h} \leftarrow Ø$
2	while $l e n g t h (f) \neq l e n g t h (f_{s m o o t h})$ do
3	$n \leftarrow l e n g t h (f)$ , $d p \leftarrow Ø$
4	for $j \leftarrow 1$ to $n - 2$ do
5	for $k \leftarrow j + 1$ to $n - 1$ do
6	$s \leftarrow CollisionDetection (f (j), f (k), X_{o b s}, X_{f r e e})$
7	if ( $s = = 0$ , no collision) then
8	$l_{1} \leftarrow D i s (f (j), f (k))$ , $l_{2} \leftarrow 0$
9	for $m \leftarrow j$ to $k - 1$ do
10	$l_{2} \leftarrow l_{2} + D i s (f (m), f (m + 1))$
11	end for
12	if $l_{1} < l_{2}$ then
13	$d p \leftarrow [d p; f (j + 1 : k - 1)]$
14	end if
15	end if
16	end for
17	$d p \leftarrow$ Remove duplicate elements
18	$f \leftarrow f - d p$
19	end for
20	if $l e n g t h (f) \neq l e n g t h (f_{s m o o t h})$ then
21	$f_{s m o o t h} \leftarrow f$
22	end if
23	end while
24	Return: $f_{s m o o t h}$
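The shortcut idea behind Algorithm 6 can be sketched as follows. Note that this is a simpler greedy variant, not Algorithm 6 itself: instead of collecting removable nodes in the set dp, it jumps from each kept node to the farthest node reachable by a collision-free straight segment, so every edge of the result is explicitly collision-checked. The function name `smooth_path` and the `is_free(p, q)` callback are hypothetical.

```python
def smooth_path(path, is_free):
    """Greedy shortcutting of a feasible path.

    path    -- list of points; consecutive points are assumed collision-free
    is_free -- callable (p, q) -> True if the straight segment p-q is free
    """
    if len(path) < 3:
        return list(path)
    smoothed = [path[0]]
    j = 0
    while j < len(path) - 1:
        k = len(path) - 1
        # Walk k back toward j + 1; the edge (j, j + 1) is not re-checked
        # because it already belongs to the feasible path.
        while k > j + 1 and not is_free(path[j], path[k]):
            k -= 1
        smoothed.append(path[k])
        j = k
    return smoothed
```

By the triangle inequality a straight shortcut is never longer than the polyline it replaces, so the comparison $l_{1} < l_{2}$ is not needed in this variant.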

(2): Path optimization method based on CE algorithm

The specific process of the path optimization method based on the CE algorithm is shown in Algorithm 7. Taking the two-dimensional path planning problem as an example, the smooth path $f_{smooth}$ can be represented as a matrix of size $n \times 2$, where n is the number of path points. Taking the path length L as the optimization objective, the optimization problem is formulated as:

$\begin{cases} \text{Find } f = [x_{1}, x_{2}, \dots, x_{n}] \\ \text{Minimize } L = \sum_{i = 1}^{n - 1} Dis (x_{i}, x_{i + 1}) \end{cases}$ (1)

where f is a feasible path, $x_{j} \in f$ ($j = 1, 2, \dots, n$) is a path point on f, and L is the length of path f.
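The objective L in Equation (1) is the sum of distances between consecutive path points. A minimal sketch, assuming Dis is the Euclidean distance and using the hypothetical name `path_length`:

```python
import math


def path_length(path):
    """Sum of Euclidean distances between consecutive path points."""
    return sum(math.dist(p, q) for p, q in zip(path, path[1:]))
```

The same function works for 2D and 3D points, since `math.dist` accepts points of any matching dimension.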

In Algorithm 7, the smooth path $f_{smooth}$ is first used as the initial value of the optimization path $f_{optimal}$, and the variance parameter of the CE algorithm is initialized as a matrix of size $(n - 2) \times 2$ (steps 1~3). Only the middle n − 2 nodes are optimized because the start and end points of the path are fixed. Then, the algorithm performs $K_{\max}$ loop iterations in steps 4~29. In steps 5~7, the x and y coordinates of the n − 2 interior nodes of $f_{optimal}$ are stored in the sets px and py, respectively. In steps 8~13, each element of px and py is taken as a mean and the corresponding element of the current variance matrix is taken as the variance; N samples are generated from the resulting normal distributions and stored in the matrices point_x and point_y, each of size $(n - 2) \times N$. In steps 14~21, N paths are assembled from the elements of point_x and point_y and stored in the structure path_offspring, which can be understood as an offspring population of N individuals. In step 22, to retain good characteristics of the population during evolution, the best individual $f_{optimal}$ of the previous generation is merged into path_offspring. In step 23, collision detection is performed on each individual in path_offspring, and the valid, collision-free individuals are stored in a new population path_offspring_new. Then, the path length of each individual in path_offspring_new is calculated (step 24), and the $⌊ρ \times N⌋$ individuals with the smallest path lengths are selected as elite samples and stored in Archive (step 25). The variance parameter is updated according to the elite samples in Archive (step 26), and the individual with the smallest path length is selected to update $f_{optimal}$ (step 27). In step 28, the smoothing operation of the CE algorithm is applied to reduce the probability of falling into a local extremum. This ends one iteration, and the optimized path $f_{optimal}$ is output after $K_{\max}$ iterations.

In step 26, the variance parameter ${σ^{2}}_{i}$ is updated using the elite samples in Archive. Archive contains $⌊ρ \times N⌋$ elite paths (each with n − 2 points after removing the start and end points). ${σ^{2}}_{i}$ is computed as follows:

Grouping by position: For each path point position j (j ∈ [1, n − 2]), create a matrix $C_{j}$ of size $⌊ρ \times N⌋ \times 2$ , where each row corresponds to the (x, y) coordinates of the j-th path point from each elite path.
Variance calculation: The resulting variance matrix ${σ^{2}}_{i}$ is built row by row as

${σ^{2}}_{i} = [sigma (C_{j, 1}), sigma (C_{j, 2})], j \in [1, n - 2]$

where $C_{j, 1}$ and $C_{j, 2}$ are the x and y columns of $C_{j}$, sigma is a function that calculates the variance of a column vector, and the resulting matrix has size $(n - 2) \times 2$ .
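The two steps above can be sketched with the standard library. The sketch assumes the population variance (division by the number of elite paths, as in step 10 of Algorithm 2), which the text does not state explicitly for this step; `update_variance` is a hypothetical name.

```python
from statistics import pvariance


def update_variance(archive):
    """Per-position coordinate variances over the elite paths.

    archive -- list of elite paths, each a list of (x, y) interior points
               (start and end points already removed), all of equal length
    Returns a list of [var_x, var_y] rows, one per interior position j.
    """
    n_interior = len(archive[0])
    sigma2 = []
    for j in range(n_interior):
        # C_j: the j-th interior point of every elite path.
        c_j = [path[j] for path in archive]
        sigma2.append([pvariance([p[0] for p in c_j]),
                       pvariance([p[1] for p in c_j])])
    return sigma2
```

For two elite paths `[(0, 0), (2, 2)]` and `[(2, 0), (2, 4)]`, the x-coordinates at the first position are 0 and 2 (variance 1) and the y-coordinates at the second position are 2 and 4 (variance 1), while the remaining entries are 0.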

Algorithm 7: Path optimization method based on CE algorithm
Input: Sample size N, elite sample rate $ρ$ , smoothness coefficient $α$ , $f_{s m o o t h}$ , maximum number of optimizations $K_{\max}$
Output: The optimized path $f_{o p t i m a l}$
1	$f_{o p t i m a l} \leftarrow f_{s m o o t h}$
2	$n \leftarrow l e n g t h (f_{o p t i m a l})$
3	${σ^{2}}_{0} \leftarrow$ Generate a matrix of size $(n - 2) \times 2$ with all elements set to 1
4	for $i \leftarrow 1$ to $K_{\max}$ do
5	for $j \leftarrow 2$ to $n - 1$ do
6	$p x_{j - 1} \leftarrow f_{o p t i m a l} (j, 1)$ , $p y_{j - 1} \leftarrow f_{o p t i m a l} (j, 2)$
7	end for
8	$p o i n t_x \leftarrow Ø$ , $p o i n t_y \leftarrow Ø$
9	for $j \leftarrow 1$ to $n - 2$ do
10	for $k \leftarrow 1$ to $N$ do
11	$p o i n t_x_{j, k} \leftarrow$ sample from $N (p x_{j}, {σ^{2}}_{i - 1}^{j, 1})$ , $p o i n t_y_{j, k} \leftarrow$ sample from $N (p y_{j}, {σ^{2}}_{i - 1}^{j, 2})$
12	end for
13	end for
14	$p a t h_o f f s p r i n g \leftarrow Ø$
15	for $k \leftarrow 1$ to $N$ do
16	$p a t h_o f f s p r i n g (k) . p a t h (1, :) \leftarrow f_{o p t i m a l} (1, :)$
17	for $j \leftarrow 1$ to $n - 2$ do
18	$p a t h_o f f s p r i n g (k) . p a t h (j + 1, :) \leftarrow [p o i n t_x_{j, k}, p o i n t_y_{j, k}]$
19	end for
20	$p a t h_o f f s p r i n g (k) . p a t h (n, :) \leftarrow f_{o p t i m a l} (n, :)$
21	end for
22	$p a t h_o f f s p r i n g (N + 1) . p a t h \leftarrow f_{o p t i m a l}$
23	$p a t h_o f f s p r i n g_n e w \leftarrow$ Select valid individuals without collision in the population path_offspring
24	$L \leftarrow$ Calculate the path length of each individual in the population path_offspring_new
25	$A r c h i v e \leftarrow$ Select $⌊ρ \times N⌋$ individuals with smaller path lengths as elite samples
26	${σ^{2}}_{i} \leftarrow$ Update variance based on individuals in Archive
27	$f_{o p t i m a l} \leftarrow$ Select the individual with the shortest path length in Archive
28	${σ^{2}}_{i} \leftarrow α {σ^{2}}_{i} + (1 - α) {σ^{2}}_{i - 1}$
29	end for
30	Return: $f_{o p t i m a l}$
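The loop of Algorithm 7 can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the authors' MATLAB code: candidate points are drawn from independent Gaussians centered on the current best path, the variance used for sampling is the smoothed variance from the previous iteration, the sample variance is used in step 26, and `is_collision_free` is a hypothetical user-supplied predicate that must accept the input path.

```python
import math
import random
import statistics


def path_length(path):
    """Sum of Euclidean segment lengths along a list of (x, y) points."""
    return sum(math.dist(a, b) for a, b in zip(path, path[1:]))


def ce_optimize(f_smooth, is_collision_free, N=100, rho=0.3, alpha=0.7,
                K_max=300, seed=0):
    """Cross-entropy refinement of the interior points of a 2D path."""
    rng = random.Random(seed)
    f_opt = [tuple(p) for p in f_smooth]
    n = len(f_opt)
    if n < 3:
        return f_opt  # no interior points to optimize
    n_elite = max(2, math.floor(rho * N))
    var_prev = [[1.0, 1.0] for _ in range(n - 2)]  # step 3: all-ones variance
    for _ in range(K_max):
        # Steps 5-21: sample N candidate paths around the current best path.
        offspring = []
        for _ in range(N):
            interior = [
                (rng.gauss(f_opt[j + 1][0], math.sqrt(var_prev[j][0])),
                 rng.gauss(f_opt[j + 1][1], math.sqrt(var_prev[j][1])))
                for j in range(n - 2)
            ]
            offspring.append([f_opt[0], *interior, f_opt[-1]])
        offspring.append(list(f_opt))  # step 22: keep the incumbent
        # Steps 23-25: drop colliding paths, keep the shortest as elites.
        valid = [p for p in offspring if is_collision_free(p)]
        valid.sort(key=path_length)
        archive = valid[:n_elite]
        if len(archive) < 2:
            continue  # not enough elites to estimate a variance
        # Step 26: per-position, per-coordinate variance over the elites.
        var_new = [
            [statistics.variance([p[j + 1][c] for p in archive])
             for c in (0, 1)]
            for j in range(n - 2)
        ]
        # Step 27: the shortest elite becomes the new incumbent.
        f_opt = archive[0]
        # Step 28: smooth the variance update.
        var_prev = [
            [alpha * vn + (1 - alpha) * vp for vn, vp in zip(rn, rp)]
            for rn, rp in zip(var_new, var_prev)
        ]
    return f_opt
```

Because the incumbent path is always added to the population (step 22), the returned path is never longer than the input path, provided the input itself passes the collision check.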

4. Simulation Experiments and Analysis

In this section, a series of simulation experiments are performed to evaluate the performance of the proposed CE-RRT algorithm. The RRT algorithm is used as a benchmark to measure the effectiveness of the three proposed optimization strategies.

4.1. Environments and Evaluation Indicators

Three environments are used in the simulations: the 2-dimensional (2D) maze-like environment in Figure 2a (starting point (15, 5), ending point (45, 45)), the 2D cluttered environment in Figure 2b (starting point (3, 3), ending point (47, 47)), and the 3-dimensional (3D) environment in Figure 2c (starting point (15, 10, 25), ending point (15, 46, 25)). The three environments have the same size, $50 \times 50$. Two indicators are used to compare the performance of the algorithms: $L_{f}$ is the length of the feasible path f, and $T_{f}$ is the time taken to find path f. Each algorithm stops as soon as a feasible solution is found.
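The two indicators can be computed as in the following sketch. It assumes that a path is stored as a list of waypoints and that the length is the sum of Euclidean segment lengths; the helper names `path_length` and `timed` are hypothetical, and the example waypoints ignore obstacles and only reuse the start and end points of the 2D maze.

```python
import math
import time


def path_length(path):
    """L_f: sum of Euclidean segment lengths (works for 2D and 3D points)."""
    return sum(math.dist(a, b) for a, b in zip(path, path[1:]))


def timed(planner, *args):
    """T_f: run a planner callable and return (path, elapsed seconds)."""
    t0 = time.perf_counter()
    path = planner(*args)
    return path, time.perf_counter() - t0


# Hypothetical two-segment path from (15, 5) to (45, 45).
demo_path = [(15, 5), (15, 45), (45, 45)]
print(path_length(demo_path))  # 70.0
```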

4.2. Comparison Algorithms and Parameter Settings

The CE-RRT is compared with the existing RRT and RRT* algorithms in three environments. In addition, for the three proposed improvement strategies, the improvement algorithms when each strategy exists separately are also added for comparison to verify their effectiveness. ‘A-RRT’, ‘B-RRT’, and ‘C-RRT’ denote RRT algorithms that separately include adaptive sampling strategy, adaptive step size adjustment strategy, and path optimization strategy based on the CE algorithm, respectively.

The detailed parameter settings of all algorithms are listed in Table 1. The parameters of the CE-RRT algorithm are those of A-RRT, B-RRT, and C-RRT combined. All algorithms and experiments are implemented in MATLAB 2021b on a machine with an Intel i9 CPU at 2.3 GHz and 32 GB of memory (hardware sourced in Chengdu, China).

4.3. Simulation of Three Environments

4.3.1. 2D Maze Environment

A sparse environment like a maze is one of the most common working environments for autonomous mobile robots, and the RRT algorithm performs well in it. Six algorithms are applied to the 2D maze environment shown in Figure 2a for simulation, and the performance results are shown in Figure 3. In Figure 3, the generated paths of RRT ($L_{f}$ = 152.74, $T_{f}$ = 29.30), A-RRT ($L_{f}$ = 150.46, $T_{f}$ = 15.96), B-RRT ($L_{f}$ = 153.88, $T_{f}$ = 1.08), C-RRT ($L_{f}$ = 117.06, $T_{f}$ = 30.25), CE-RRT ($L_{f}$ = 113.02, $T_{f}$ = 13.01), and RRT* ($L_{f}$ = 162.73, $T_{f}$ = 10.47) are shown.

The result of the RRT algorithm in Figure 3a is used as a benchmark, and the results in Figure 3b–f are analyzed as follows. In Figure 3b, A-RRT finds a feasible path significantly faster: $T_{f}$ is reduced by 45.53%, from 29.30 s to 15.96 s, which indicates that the adaptive sampling strategy is effective. In Figure 3c, the solving efficiency of B-RRT is greatly improved: $T_{f}$ is reduced by 96.31%, from 29.30 s to 1.08 s, which indicates that the proposed adaptive step adjustment strategy is effective. However, the large step $ε$ tends to make the feasible path f deviate from the optimal solution. In Figure 3d, optimizing path f with the CE algorithm greatly improves path quality: the length $L_{f}$ is reduced by 23.36%, from 152.74 to 117.06. In Figure 3e, the CE-RRT algorithm reduces the path length $L_{f}$ by 26% and the search time $T_{f}$ by 55.60%. In Figure 3f, compared with RRT*, the path of CE-RRT is 30.55% shorter, but its running time is 24.26% longer. This indicates that CE-RRT has a clear advantage in path quality but needs improvement in time efficiency. On the other hand, the RRT* algorithm fails to plan a path in 2% of the runs (Table 2), which indicates poorer robustness.
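The percentages quoted above follow from the single-run values of Figure 3 as relative changes with respect to the baseline. The short check below recomputes them; the helper name `reduction_pct` is hypothetical, and the input values are those quoted in the text.

```python
def reduction_pct(baseline, value):
    """Relative reduction of value with respect to baseline, in percent."""
    return 100.0 * (baseline - value) / baseline


# (L_f, T_f) pairs quoted in the text for the single runs in Figure 3.
rrt = (152.74, 29.30)
a_rrt = (150.46, 15.96)
b_rrt = (153.88, 1.08)
c_rrt = (117.06, 30.25)
ce_rrt = (113.02, 13.01)
rrt_star = (162.73, 10.47)

print(round(reduction_pct(rrt[1], a_rrt[1]), 2))         # 45.53 (T_f, A-RRT)
print(round(reduction_pct(rrt[1], b_rrt[1]), 2))         # 96.31 (T_f, B-RRT)
print(round(reduction_pct(rrt[0], c_rrt[0]), 2))         # 23.36 (L_f, C-RRT)
print(round(reduction_pct(rrt[0], ce_rrt[0]), 2))        # 26.0  (L_f, CE-RRT)
print(round(reduction_pct(rrt[1], ce_rrt[1]), 2))        # 55.6  (T_f, CE-RRT)
print(round(reduction_pct(rrt_star[0], ce_rrt[0]), 2))   # 30.55 (L_f vs RRT*)
print(round(-reduction_pct(rrt_star[1], ce_rrt[1]), 2))  # 24.26 (T_f increase vs RRT*)
```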

The statistical results of 100 simulations are shown as box plots in Figure 4 and Figure 5, and detailed data are listed in Table 2. The ‘Fail’ value is the number of failed runs, where a failure means that the algorithm cannot find the approximate optimal path within 20,000 iterations. Figure 4 and Figure 5 show that the adaptive sampling and adaptive step adjustment strategies effectively reduce the search time of the RRT algorithm, and that the quality of the feasible path improves significantly after optimization with the CE algorithm. Figure 6 shows that the CE-RRT algorithm optimizes the feasible path length $L_{f}$ better than the C-RRT algorithm, which suggests that the other two strategies also contribute to reducing $L_{f}$. Furthermore, as shown in Table 2, the comparatively small standard deviations of CE-RRT indicate that the algorithm has strong robustness.
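The columns of the statistics tables (Mean, Std., Min., Max., Fail) can be derived from per-run records as in the sketch below. The data are hypothetical and not taken from the paper; representing a failed run as `None` and using the sample standard deviation are both assumptions.

```python
import statistics


def summarize(values):
    """Summarize one indicator over repeated runs; None marks a failed run."""
    ok = [v for v in values if v is not None]
    return {
        "mean": statistics.mean(ok),
        "std": statistics.stdev(ok),  # sample standard deviation (assumption)
        "min": min(ok),
        "max": max(ok),
        "fail": len(values) - len(ok),
    }


# Hypothetical path lengths from four runs, one of which failed.
lengths = [110.0, 112.0, None, 114.0]
print(summarize(lengths))
# {'mean': 112.0, 'std': 2.0, 'min': 110.0, 'max': 114.0, 'fail': 1}
```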

4.3.2. 2D Cluttered Environment

Dense and cluttered environments are also common working environments for mobile robots, and complex environments can better test the effectiveness of path planning algorithms. Six algorithms are applied to the 2D cluttered environment shown in Figure 2b for simulation, and the performance results are shown in Figure 7. In Figure 7, the generated paths of RRT ($L_{f}$ = 82.47, $T_{f}$ = 4.15), A-RRT ($L_{f}$ = 70.86, $T_{f}$ = 1.23), B-RRT ($L_{f}$ = 81.05, $T_{f}$ = 0.55), C-RRT ($L_{f}$ = 68.94, $T_{f}$ = 33.17), CE-RRT ($L_{f}$ = 63.75, $T_{f}$ = 30.03), and RRT* ($L_{f}$ = 85.58, $T_{f}$ = 6.91) are shown.

The result of the RRT algorithm in Figure 7a is used as a benchmark, and the results in Figure 7b–f are analyzed as follows. In Figure 7b,c, the efficiency of the algorithm improves significantly, with the time $T_{f}$ reduced by 70.36% and 86.75%, respectively. This indicates that the proposed adaptive sampling and adaptive step adjustment strategies remain effective in the 2D cluttered environment. In Figure 7d,e, the path quality is significantly improved, with the path length $L_{f}$ shortened by 16.41% and 22.70%, respectively, indicating that the CE-based optimization is beneficial. However, the time consumption of C-RRT and CE-RRT also increases significantly. In Figure 7f, compared with the RRT* algorithm, the path of CE-RRT is 25.51% shorter, but the time consumed increases significantly.

The statistical results of 100 simulations are shown as box plots in Figure 8 and Figure 9, and detailed data are listed in Table 3. Figure 8 and Figure 9 show that the search time of A-RRT and B-RRT is significantly reduced, but the length of the feasible paths obtained by B-RRT increases slightly, because larger search steps tend to reduce search accuracy. In addition, C-RRT and CE-RRT have a clear advantage in path length, but they also consume more time. Figure 10 shows that the CE-RRT algorithm still performs better and that it converges after about 20 iterations. Therefore, the number of iterations of the CE algorithm can be reduced to shorten the running time of the entire algorithm.
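One way to act on this observation is to stop the CE loop once the best path length stagnates, instead of always running a fixed number of iterations. The criterion below is a hypothetical illustration and is not part of the published algorithm; the function name `stop_iteration` and the parameters `patience` and `tol` are assumptions.

```python
def stop_iteration(best_lengths, patience=5, tol=1e-3):
    """Return the 1-based iteration at which a stagnation test would stop.

    `best_lengths[k]` is the best path length after iteration k + 1. The loop
    stops when the improvement over the last `patience` iterations is below
    `tol`; if that never happens, the full history length is returned.
    """
    for k in range(patience, len(best_lengths)):
        if best_lengths[k - patience] - best_lengths[k] < tol:
            return k + 1
    return len(best_lengths)


# Hypothetical convergence history of the best path length.
history = [80.0, 75.0, 70.0, 68.0, 67.0, 67.0, 67.0, 67.0, 67.0, 67.0]
print(stop_iteration(history, patience=3))  # 8
```

With such a test, the maximum number of CE iterations would act only as an upper bound on the running time.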

4.3.3. 3D Environment

The 3D environment with obstacles is the main working environment for robotic arms. Six algorithms are applied to the 3D environment shown in Figure 2c for simulation, and the performance results are shown in Figure 11. In Figure 11, the generated paths of RRT ($L_{f}$ = 77.36, $T_{f}$ = 32.90), A-RRT ($L_{f}$ = 66.88, $T_{f}$ = 0.46), B-RRT ($L_{f}$ = 71.96, $T_{f}$ = 0.65), C-RRT ($L_{f}$ = 45.71, $T_{f}$ = 34.94), CE-RRT ($L_{f}$ = 39.37, $T_{f}$ = 12.97), and RRT* ($L_{f}$ = 65.67, $T_{f}$ = 45.50) are shown.

The result of the RRT algorithm in Figure 11a is used as a benchmark, and the results in Figure 11b–f are analyzed as follows. Figure 11b,c shows that the search efficiency of A-RRT and B-RRT is greatly improved, with the time $T_{f}$ reduced by 98.60% and 98.02%, respectively. This indicates that the proposed adaptive sampling and step adjustment strategies are still very effective in a 3D environment. Similarly, as shown in Figure 11d,e, the quality of the feasible paths is also significantly improved, with lengths shortened by 40.91% and 49.11%, respectively. In Figure 11f, the path obtained by the RRT* algorithm is better than that of RRT; however, because of the additional “neighborhood search” and “path reconnection” steps, its time cost also increases significantly. Moreover, the efficiency of CE-RRT is also significantly improved.

The statistical results of 100 simulations are shown in Figure 12 and Figure 13, and detailed data are listed in Table 4. Similar to the conclusion in 2D environments, the proposed adaptive sampling and adaptive step size adjustment strategies can still significantly improve the search efficiency of the algorithm in 3D environments. The proposed optimization strategy based on the CE algorithm can greatly reduce path length and improve path quality. As shown in Table 4, the failure rate of the RRT algorithm in a 3D environment is 16%. The failure rates of A-RRT, B-RRT, and CE-RRT are all 0 after introducing the improved strategy, verifying the effectiveness of the improved strategy. This also indicates that the CE-RRT algorithm has stronger robustness. The failure rate of C-RRT is 7%, as the CE algorithm only optimizes feasible paths and does not affect the failure rate of the algorithm. The failure rate of the RRT* algorithm is 17%, and its performance in the 3D environment needs further improvement. It can be seen from Figure 14 that the running time of the entire algorithm can still be shortened by reducing the number of iterations of the CE algorithm.

The testing environments employed in this study (2D maze, 2D cluttered, and 3D static environments) were deliberately selected to isolate and rigorously evaluate the core performance contributions of the three proposed strategies within the CE-RRT algorithm: adaptive sampling, adaptive step adjustment, and CE-based path optimization. These controlled settings provide a clear baseline for quantifying improvements in search efficiency (reduced time $T_{f}$) and path quality (reduced length $L_{f}$) over standard RRT and RRT* algorithms. The significant performance gains demonstrated in Section 4.3.1, Section 4.3.2 and Section 4.3.3 validate the effectiveness of these strategies in static obstacle configurations.

However, we acknowledge that these environments represent a foundational level of complexity compared to many real-world robotic applications. Real-world scenarios often involve more intricate obstacle geometries, narrow passages (bottlenecks), and, critically, dynamic elements such as moving obstacles or changing goals. While the current study successfully establishes CE-RRT’s capabilities in static environments—a necessary prerequisite—we recognize that testing in environments featuring bottlenecks and moving obstacles is crucial for a more comprehensive evaluation of the algorithm’s robustness and adaptability. Addressing these more complex and realistic scenarios, including the development of dynamic extensions to CE-RRT, is identified as a key direction for future work (as further discussed in Section 6). The results presented here provide the essential groundwork for such subsequent investigations.

5. Welding Path Planning Simulation

In this section, a simulation experiment is designed based on the dual Sawyer robot simulation platform to verify the effectiveness of the proposed CE-RRT algorithm for the welding task of some long welds on a large, complex component.

5.1. Experimental Platform

As shown in Figure 15, a dual robot collaborative welding simulation platform is built using two Sawyer robotic arms in the Gazebo environment based on the Robot Operating System (ROS) [25], equipped with a 2-m-high and 5-m-long slide rail. Four typical welding seams, numbered 1 to 4, are selected for the welding task. The hardware environment is an Intel i5-10500 CPU at 3.1 GHz with 8 GB of memory, the operating system is Ubuntu 16.04, and the implementation is written in C++11.

5.2. Dual Robot Welding Path Planning Experiment

The proposed CE-RRT algorithm is used to plan a feasible path for the two Sawyer robots to complete the welding task based on the experimental platform of the two Sawyer robots in Figure 15 and the selected typical welding seams. The motion planning results of the dual robot based on the CE-RRT algorithm are shown in Figure 16. Figure 16a shows the path planning of the dual robot from the initial state to the welding preparation state, Figure 16b shows the path planning of the primary welding process of the dual robot (the welding seam 2 is assigned to robot 1, and the welding seam 3 is assigned to robot 2), and Figure 16c shows the path planning of the dual robot from the welding preparation state to the initial state. From the results of Figure 16a,c, it can be seen that the workspaces of the two robots partially overlap, but reasonable path planning makes them not collide; this also verifies the effectiveness of the CE-RRT algorithm.

6. Conclusions

In this paper, a novel adaptive RRT algorithm based on CE optimization (CE-RRT) is proposed by introducing three improved strategies. The purpose is to solve robot path planning problems more quickly and stably. A series of experiments are conducted to evaluate the performance of the CE-RRT algorithm in a 2D maze environment, a 2D cluttered environment, and a 3D environment. The RRT algorithm is used as a benchmark to measure the effectiveness of the three proposed optimization strategies. The results demonstrate that adaptive sampling and adaptive step adjustment strategies can effectively increase the search speed of the algorithm, and the CE algorithm can also significantly improve path quality (the path of CE-RRT is shortened by 26%, 22.70%, and 49.11%). Particularly, the statistical results of 100 simulations indicate that the CE-RRT algorithm exhibits stronger stability in 3D environments. In addition, the CE-RRT algorithm is used to plan a reasonable path for the dual robots to perform tasks based on the dual Sawyer robot simulation platform.

While the CE-RRT algorithm demonstrates significant improvements in search efficiency and path quality within the static environments tested in this study, several promising avenues for future research have been identified:

(1): Extension to Dynamic Environments: The most critical next step is to extend CE-RRT to handle dynamic environments with moving obstacles. This requires developing mechanisms for real-time obstacle detection and prediction and integrating dynamic constraints into the adaptive sampling, step adjustment, and path optimization strategies. The robust performance observed in the static 3D environment (Section 4.3.3) provides a strong foundation for tackling this challenge.
(2): Evaluation in More Complex Static Scenarios: Further validation is needed in environments featuring highly intricate geometries, dense obstacle fields, and narrow bottlenecks beyond those tested here.
(3): Parameter Sensitivity Analysis: A systematic investigation into the sensitivity of CE-RRT’s performance to its key parameters across diverse environments is warranted to provide clearer guidelines for practical deployment.
(4): Real-World Implementation: Transitioning the algorithm from simulation to physical robot platforms (beyond the dual-Sawyer simulation) and addressing real-world uncertainties (sensor noise, actuation errors) is essential for demonstrating its practical utility.

Author Contributions

Conceptualization, D.Z. and Q.T.; methodology, D.Z., Q.T. and L.M.; formal analysis, Q.T.; data curation, J.L.; writing—original draft preparation, Q.T.; writing—review and editing, D.Z.; supervision, L.M.; project administration, Y.S.; funding acquisition, Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the Sichuan Province Science and Technology Support Program (No. 2020ZDZX0015).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors have no relevant financial or non-financial interests to disclose.

References

1. Choset, H.; Lynch, K.M.; Hutchinson, S.; Kantor, G.A.; Burgard, W. Principles of Robot Motion: Theory, Algorithms, and Implementations; MIT Press: Cambridge, MA, USA, 2005.
2. LaValle, S.M. Planning Algorithms; Cambridge University Press: Cambridge, UK, 2006; Volume 2, pp. 3671–3678.
3. Mac, T.T.; Copot, C.; Tran, D.T.; De Keyser, R. Heuristic approaches in robot path planning: A survey. Robot. Auton. Syst. 2016, 86, 13–28.
4. Li, Z.; Li, S.; Luo, X. An overview of calibration technology of industrial robots. IEEE-CAA J. Autom. Sin. 2021, 8, 23–36.
5. Gonzalez, D.; Perez, J.; Milanes, V.; Nashashibi, F. A review of motion planning techniques for automated vehicles. IEEE Trans. Intell. Transp. Syst. 2015, 17, 1135–1145.
6. Hart, P.E.; Nilsson, N.J.; Raphael, B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107.
7. Stentz, A. The focussed d* algorithm for real-time replanning. In IJCAI’95: Proceedings of the 14th International Joint Conference on Artificial Intelligence; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 1995; Volume 2, pp. 1652–1659.
8. Wang, J.; Chi, W.; Li, C.; Wang, C.; Meng, M.Q.-H. Neural RRT*: Learning-based optimal path planning. IEEE Trans. Autom. Sci. Eng. 2020, 17, 1748–1758.
9. Nazarahari, M.; Khanmirza, E.; Doostie, S. Multi-objective multi-robot path planning in continuous environment using an enhanced genetic algorithm. Expert Syst. Appl. 2019, 115, 106–120.
10. Miao, C.; Chen, G.; Yan, C.; Wu, Y. Path planning optimization of indoor mobile robot based on adaptive ant colony algorithm. Comput. Ind. Eng. 2021, 156, 107230.
11. Kavraki, L.; Kolountzakis, M.; Latombe, J.-C. Analysis of probabilistic roadmaps for path planning. IEEE Trans. Robot. 1998, 14, 166–171.
12. Kavraki, L.; Svestka, P.; Latombe, J.-C.; Overmars, M. Probabilistic roadmaps for path planning in high-dimensional configuration spaces. IEEE Trans. Robot. 1996, 12, 566–580.
13. LaValle, S. Rapidly-Exploring Random Trees: A New Tool for Path Planning; Research Report 9811. 1998. Available online: https://www.semanticscholar.org/paper/Rapidly-exploring-random-trees-%3A-a-new-tool-for-LaValle/d967d9550f831a8b3f5cb00f8835a4c866da60ad (accessed on 1 September 2025).
14. Jeong, I.-B.; Lee, S.-J.; Kim, J.-H. Quick-RRT*: Triangular inequality-based implementation of RRT* with improved initial solution and convergence rate. Expert Syst. Appl. 2019, 123, 82–90.
15. Kalisiak, M.; van de Panne, M. RRT-blossom: RRT with a local flood-fill behavior. In Proceedings of the 2006 IEEE International Conference on Robotics and Automation (ICRA), Orlando, FL, USA, 15–19 May 2006; pp. 1237–1242.
16. Ichnowski, J.; Alterovitz, R. Scalable multicore motion planning using lock-free concurrency. IEEE Trans. Robot. 2014, 30, 1123–1136.
17. Karaman, S.; Frazzoli, E. Sampling-based algorithms for optimal motion planning. Int. J. Robot. Res. 2011, 30, 846–894.
18. LaValle, S.M.; Kuffner, J.J., Jr. Randomized kinodynamic planning. Int. J. Robot. Res. 2001, 20, 378–400.
19. Kuffner, J.; LaValle, S. RRT-connect: An efficient approach to single-query path planning. IEEE Int. Conf. Robot. Auto. 2000, 2, 995–1001.
20. Rubinstein, R.Y. Optimization of computer simulation models with rare events. Eur. J. Oper. Res. 1997, 99, 89–112.
21. Tang, Q.; Ma, L.; Zhao, D.; Lei, J.; Wang, Y. A multi-objective cross-entropy optimization algorithm and its application in high-speed train lateral control. Appl. Soft Comput. 2022, 115, 108151.
22. Kovaleva, M.; Bulger, D.; Zeb, B.A.; Esselle, K.P. Cross-entropy method for electromagnetic optimization with constraints and mixed variables. IEEE Trans. Antennas Propag. 2017, 65, 5532–5540.
23. Haber, R.E.; del Toro, R.M.; Gajate, A. Optimal fuzzy control system using the cross-entropy method. A case study of a drilling process. Inf. Sci. 2010, 180, 2777–2792.
24. Kroese, D.P.; Porotsky, S.; Rubinstein, R.Y. The cross-entropy method for continuous multi-extremal optimization. Methodol. Comput. Appl. Probab. 2006, 8, 383–407.
25. Quigley, M.; Conley, K.; Gerkey, B.P.; Faust, J.; Foote, T.; Leibs, J.; Wheeler, R.; Ng, A.Y. ROS: An open-source robot operating system. In Proceedings of the ICRA Workshop on Open Source Software, Kobe, Japan, 12–17 May 2009; Volume 3, p. 5.

Figure 1. Path planning in two-dimensional configuration space.

Figure 2. Three simulation environments: (a) 2D maze environment; (b) 2D cluttered environment; (c) 3D environment.

Figure 3. Simulation results of six algorithms in the 2D maze environment.

Figure 4. $L_{f}$ in the 2D maze.

Figure 5. $T_{f}$ in the 2D maze.

Figure 6. Optimization results in the 2D maze.

Figure 7. Simulation results of six algorithms in the 2D cluttered environment.

Figure 8. $L_{f}$ in the 2D cluttered environment.

Figure 9. $T_{f}$ in the 2D cluttered environment.

Figure 10. Optimization results in the 2D cluttered.

Figure 11. Simulation results of six algorithms in the 3D environment.

Figure 12. $L_{f}$ in the 3D environment.

Figure 13. $T_{f}$ in the 3D environment.

Figure 14. Optimization results in the 3D environment.

Figure 15. Dual robot simulation platform.

Figure 16. Path Planning of dual robot welding based on the CE-RRT algorithm: (a) Reaching the welding preparation state; (b) primary welding process; (c) Return to the initial state.

Table 1. The parameter settings for all algorithms.

Algorithm	Parameter settings
RRT	1. Step: $ε = 0.5$; 2. Random sampling probability: $λ = 1.0$; 3. Max. iterations: $k_{\max} =$ 20,000; 4. Independent run times: 100
A-RRT	1. Step: $ε = 0.5$; 2. Random sampling probability: $λ = 0.3$; 3. Adaptive sampling probability: $p_{a} = 0.7$; 4. Max. iterations: $k_{\max} =$ 20,000; 5. Independent run times: 100
B-RRT	1. Initial step: $ε_{0} = 0.5$; 2. Step variation: $△ ε_{1} = 0.2, △ ε_{2} = 0.1$; 3. Range: $ε \in [0.1, 3.5]$; 4. Random sampling probability: $λ = 1.0$; 5. Max. iterations: $k_{\max} =$ 20,000; 6. Independent run times: 100
C-RRT	1. Step: $ε = 0.5$; 2. Random sampling probability: $λ = 1.0$; 3. Max. iterations: $k_{\max} =$ 20,000; 4. Sample size: $N = 100$; 5. Elite sample rate: $ρ = 0.3$; 6. Smoothing coefficient: $α = 0.7$; 7. CE iterations: $K = 300$; 8. Independent run times: 100
RRT*	1. Step: $ε = 0.5$; 2. Domain radius: $g = 5$; 3. Max. iterations: $k_{\max} =$ 20,000; 4. Independent run times: 100

The CE-RRT algorithm includes all three improvement strategies. Due to the randomness of sampling-based algorithms, each algorithm is run 100 times.

Table 2. Statistical results of 100 simulations in the 2D maze.

Algorithm		Mean	Std.	Min.	Max.	Fail
RRT	$L_{f}$	155.7	6.62	140.3	172.3	0
RRT	$T_{f}$	21.98	15.52	11.34	88.56	0
A-RRT	$L_{f}$	152	5.26	137.7	167.8	0
A-RRT	$T_{f}$	15.47	3.25	9.37	25.43	0
B-RRT	$L_{f}$	160.2	10.66	133.2	186.6	0
B-RRT	$T_{f}$	1.652	0.9011	0.298	4.610	0
C-RRT	$L_{f}$	118.9	2.9176	111.3	125.2	0
C-RRT	$T_{f}$	28.49	18.74	13.04	89.51	0
CE-RRT	$L_{f}$	116	3.845	110.5	128.3	0
CE-RRT	$T_{f}$	13.92	3.3734	4.67	17.72	0
RRT*	$L_{f}$	169.91	7.00	155.89	187.35	2
RRT*	$T_{f}$	14.74	6.28	6.02	28.00	2

Table 3. Statistical results of 100 simulations in the 2D cluttered.

Algorithm		Mean	Std.	Min.	Max.
RRT	$L_{f}$	83.42	4.002	77.18	92.37
RRT	$T_{f}$	5.04	10.45	1.806	73.19
A-RRT	$L_{f}$	73.73	4.802	68.64	91.32
A-RRT	$T_{f}$	0.77	0.5207	0.336	3.34
B-RRT	$L_{f}$	84.78	6.22	71.49	104.5
B-RRT	$T_{f}$	0.54	0.1283	0.28	0.878
C-RRT	$L_{f}$	67.8	2.85	63.78	75.35
C-RRT	$T_{f}$	35.96	14.13	6.65	107.4
CE-RRT	$L_{f}$	66.81	2.87	63.61	74.41
CE-RRT	$T_{f}$	30.43	7.09	14.58	64.29
RRT*	$L_{f}$	86.03	4.21	79.23	92.76
RRT*	$T_{f}$	6.93	3.04	1.92	13.71

Table 4. Statistical results of 100 simulations in the 3D environment.

Algorithm		Mean	Std.	Min.	Max.	Fail
RRT	$L_{f}$	74.97	13.28	16.50	102.6	16
RRT	$T_{f}$	37.57	55.44	0.424	159.3	16
A-RRT	$L_{f}$	65.48	10.38	44.64	95.77	0
A-RRT	$T_{f}$	0.708	1.451	0.134	6.616	0
B-RRT	$L_{f}$	74.68	14.61	53.37	118.8	0
B-RRT	$T_{f}$	0.510	1.021	0.123	7.54	0
C-RRT	$L_{f}$	44.18	6.23	15.77	58.58	7
C-RRT	$T_{f}$	27.81	44.82	9.66	163.2	7
CE-RRT	$L_{f}$	44.01	1.45	36.96	53.67	0
CE-RRT	$T_{f}$	15.55	4.77	8.50	30.77	0
RRT*	$L_{f}$	76.47	11.70	55.14	100.40	17
RRT*	$T_{f}$	49.88	45.26	4.17	189.92	17

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
