1. Introduction
The increasing complexity of intelligent manufacturing systems demands more agile and adaptable robots [
1,
2]. However, the current mainstay, the offline robot demonstration method, poses limitations. Despite its efficacy in demonstrating intricate trajectories, this method requires specialized expertise and struggles with environmental changes [
3]. Any alterations to the working environment or robot necessitate a re-demonstration of the data, hindering the robot’s adaptability. Integrating advanced motion planning algorithms into industrial robots emerges as a critical strategy to address the evolving demands of intelligent manufacturing. Enhancing the robots’ capabilities and flexibility reduces the reliance on specialized expertise and enables seamless adaptation to dynamic workspaces, ultimately facilitating the rapid improvement required in modern manufacturing environments.
Decades of research and development have yielded numerous remarkable contributions to motion planning algorithms. Sampling-based motion planning methods (SBMPMs), such as the rapidly exploring random tree (RRT) [
4] and its variants [
5,
6,
7], have successfully solved high-dimensional motion planning problems by sampling in the configuration space. However, SBMPMs compute and optimize collision-free trajectories from scratch each time, which is time-consuming and leads to uncertainty in the planning results. Industrial robots commonly work on several production lines, but each environment is relatively fixed, with potential temporary obstacles being added [
8]. In such a semi-structured setting, discarding previous planning data and planning from scratch each time is not always a sensible approach [
9].
In effect, extensive data accumulated from previous planning, which contains information about similar situations and their respective near-optimal trajectories, can be translated into previous experience to speed up the computation of near-optimal collision-free trajectories. But this valuable resource is scarcely exploited in SBMPMs. In recent years, the evolution of motion planning and artificial intelligence has crossed, leading to the emergence of learning-based methods [
10,
11]. These approaches leverage neural networks’ powerful nonlinear mapping capabilities to accelerate motion planning by learning from extensive datasets [
12,
13]. Despite this, neural network-based planners face a significant challenge: the network mapping accuracy problem. This issue arises from the network’s inherent limitations in directly mapping high-quality trajectories due to limited precision, causing local collisions and oscillations near the target point, potentially leading to the failure of the entire planning path. Some studies alter the mapping result from a trajectory to a distribution, reducing the requirement for network mapping accuracy [
14,
15]. Nevertheless, these distributions usually still need to be processed by SBMPMs further. Alternatively, making recursive calls to the neural network can enhance planning accuracy, but this approach still cannot prevent all unforeseen circumstances [
16]. Furthermore, the network planner rarely escapes collisions when mapped paths result in collisions, demonstrating a weak ability to independently resolve such issues [
17].
This paper proposes a Dempster–Shafer-based hybrid structure planner using a probabilistic sampling network (PSNet) and cooperation with an enhanced artificial potential field method (PSNet-EAPF) to boost the effectiveness of robot motion planning. In the proposed method, a motion planning module (MPM) consisting of a multilayer regression network and an encoder is constructed to break the centralized planning task into multiple sub-distributions in each step, which obeys the Gaussian distribution. Based on the sub-distribution information obtained, the fusion sampling module (FSM) in PSNet is employed to fuse and resample them, producing a globally connectable path to mitigate the network mapping accuracy problem. Additionally, the variable rotational field enhances the artificial potential field and cooperates with PSNet to repair the local path and boost the ability to escape collisions if worst-case scenarios occur in the global path. The experimental investigation demonstrated that the proposed method’s performance is satisfactory and superior to that of other similar planning methods.
The primary contributions of the novel probabilistic sampling network integrated with the enhanced artificial potential field method are summarized as follows:
Experience-driven probabilistic planning: PSNet integrates prior planning experience into probabilistic sampling, accelerating the computation of feasible trajectories and reducing planning time from minutes to seconds.
Robust fusion and Gaussian resampling: A novel fusion sampling module combined with Gaussian resampling corrects local collision errors caused by network inaccuracies, substantially improving success rates and stability.
Adaptive hybrid structure: By coupling the learning-based global planner with an enhanced artificial potential field (EAPF), the method achieves real-time local replanning, overcoming the local minima and goal non-reachability problems of traditional potential fields.
The remainder of this paper is organized as follows. A thorough literature review of related work is introduced in
Section 2. PSNet and EAPF are described in
Section 3. The experimental results are presented in
Section 4. The discussions based on the experimental results are included in
Section 5. Finally, the conclusions of our work are drawn in
Section 6.
2. Related Work
SBMPMs were initially introduced to address the resolution completeness problem, which necessitates complete obstacle geometry, a requirement often difficult to fulfill in real-world environments [
18,
19,
20]. However, in practice, SBMPMs demonstrated their ability to efficiently solve motion planning problems in dynamic environments, making them a promising approach. These methods rely on random sampling and collision-checking techniques to find valid collision-free configuration spaces. Connectable paths can then be efficiently discovered by searching the sampled configuration space. The most classic SBMPMs are RRT and probabilistic roadmaps (PRM) [
21], which model an exploring tree structure and a roadmap in the collision-free space, respectively. In practice, RRT is preferred over PRM because it does not require the generation of precompiled roadmaps, which can often be difficult to compute quickly in online planning problems. Although RRT performs substantially better than PRM, there is still room for improvement, leading to the development of various RRT variants to enhance their capabilities. RRT-connect, an improved algorithm based on RRT, generates two rapidly exploring random trees from the initial and target configurations simultaneously, making it more efficient than RRT [
22]. Wang et al. further improved RRT-connect’s execution efficiency by adding a new sampling node [
23]. Chen et al. provided a similar sampling strategy and introduced a guidance module that biases RRT-connect towards the target point during expansion [
24]. While RRT can find a path, it does not always find the shortest one. The exploring Random Tree Star (RRT*) algorithm was proposed to overcome this limitation, which is asymptotically optimal [
25]. RRT* finds an initial path using RRT and then continuously optimizes it as more samples are added. However, RRT* becomes computationally inefficient as the dimensionality of the planning problem increases. Even with enhanced methods like RRT*-SMART [
26], PF-RRT* [
27], and PQ-RRT* [
28], these approaches still struggle to efficiently solve high-dimensional manipulation problems due to a lack of prior knowledge of the environment.
Recently, researchers have focused on leveraging previous experiences to accelerate new planning tasks instead of starting from scratch [
15,
29]. This paradigm shift aims to capitalize on the valuable data accumulated from prior planning scenarios and their corresponding near-optimal trajectories, enabling more efficient computation of new collision-free paths. Some of the work has focused on modifying traditional SBMPMs. Fisher R et al. combined Dynamic Reachability Maps (DRM) with RRT-connect. It leverages the topological features of previous paths by using a Reeb graph to capture the persistent characteristics of past solutions, thereby improving efficiency in repetitive tasks [
30]. Ref. [
31] proposed an Experience-Based Bidirectional RRT algorithm. The algorithm significantly improves the efficiency and quality of path planning by learning a Gaussian Mixture Model (GMM) from demonstration paths to create an adaptive sampler and using an Experience Graph to capture exploration information in the configuration space. Improvements based on similar motivations have also appeared in the RPM algorithm [
32,
33,
34]. This work has greatly enhanced the practicality of SBMPMs in semi-structured environments, but it remains significantly limited because changes in high-dimensional obstacles are difficult to describe within the method. Additionally, the inherent nature of random sampling in these methods can lead to jerky or redundant motions.
Another aspect of the research emphasizes the use of artificial intelligence techniques to deal with the challenges of high-dimensional obstacle representation and leverage prior planning experiences. Ichter et al. used RRT to sample paths in learned latent spaces [
35], but this method cannot ensure that path solutions exist in the learned latent space. In [
36], a CNN network learned from images to guide sampling. Ref. [
37] employed 3D point cloud reconstruction from image sets by a network and an enhanced MSRRT* algorithm to efficiently determine paths. Other similar studies [
38,
39] aim to interpret and encode visual information via networks. This method circumvents the manual interpretation of obstacles in high-dimensional joint spaces and offers an essential condition for future motion planning. Jetchev et al. described a method for accelerating planning through experience data, where multiple target trajectories were mapped directly from the trained neural network and selected and optimized by an objective function [
14]. This method achieves the expected acceleration effect, but mapping high-quality trajectories directly from the network remains challenging, thereby reducing the efficiency of the subsequent path optimization process. To address this, ref. [
28] introduced encoders to switch the mapping result from a trajectory to a distribution, reducing the requirement for network output accuracy. The trajectory was then generated by sampling over the distribution using SBMPMs such as FMT∗ [
40] and RRT∗. In addition, reference [
13] introduced a Long Short-Term Memory (LSTM) network as the planning module, which recursively invokes planning based on the start and goal positions as retrieval conditions, thereby significantly improving the overall path quality. Meanwhile, Li et al. [
41] proposed a modular path planning framework inspired by natural behaviors, enabling adaptive switching among the most advantageous algorithms under varying planning scenarios. Qureshi et al. [
17] highlighted the benefits of the recursive call generation trajectory approach and proposed a comprehensive motion planning network model (MPNet). This model consists of environmental information processing and motion planning modules (MPnet-Pnet), incorporating environment information, the robot’s initial and desired goal configurations, and recursively calling motion planning modules to generate connectable paths. If the planned path encounters collisions, the method activates the network’s drop layer (MPNet-Drop) or calls SBMPM to find a new local path. In [
42], constraint manifolds were added to MPNet to expand its functionality further. Ying et al. [
43] applied a similar planning network model to dual-arm assembly robots, achieving the desired results. Li et al. [
44] fused the model-predictive control into MPNet (MPC-MPNet), which introduces two models based on neural generators, discriminators, and parallelizable model predictive controllers to improve MPNet’s planning success rate.
MPNet and its variants have addressed some of the mapping accuracy challenges in network planners, but they have not fully resolved the issue. Moreover, when the mapped target path collides, MPNet struggles to escape the collision region using its built-in re-planning method. The work on MPNet has been a great source of inspiration for us, but our proposed method is distinct from previous approaches as it combines a probabilistic sampling network with a Gaussian sampling module. This module is designed to absorb network errors and enhance the network’s ability to navigate around obstacles. Furthermore, we replace the reliance on SBMPMs for local path generation with a novel artificial potential field enhanced by a variable rotational field.
3. Methods
3.1. Problem and Notations Definition
In this study, a multi-joint robot configuration space is denoted as , where d is the dimension of R. The obstacle space and the obstacle-free space are complementary sets () of configuration space. Let be the ordered configurations list of the robot’s configurations,
where T is the list length, and . The is assumed to be the i th configuration list in one
training set, where , and n is the number of configuration lists. corresponds to the j th robot state (]) in the i th configuration list. Then, a robot
motion planning problem in our study can be denoted as , which can be interpreted as finding a feasible path
solution in connecting the to the by a motion planning method. In addition, building a
collision checker into the algorithm is necessary. Due to the high
dimensionality of the configuration space, which generally depends on the
degree of freedom of the multi-joint robot, it is complicated to describe the
obstacles in the configuration space. As a result, collision checkers are
usually built in a Cartesian space, up to three dimensions, rather than a
configuration space. The Cartesian space around the multi-joint robot, also
called the workspace, is denoted as , where m is a Cartesian space dimension. In the
workspace, the obstacle space is denoted as and the obstacle-free space is denoted as . The collision checker is denoted as , which takes robot configuration information , and workspace information , and estimates if they are valid or not. The planned
trajectory refers to the joint motion trajectory of the robot in configuration
space, where each configuration encodes the complete pose of all robot joints.
The resulting trajectory thus represents the robot’s full-body motion rather
than solely the end-effector’s Cartesian path.
3.2. Probabilistic Sampling Network
The global collision-free path is generated from the start state to the goal state by recursively calling the neural planner PSNet. The PSNet comprises two modules: the MPM and the FSM with Gaussian resampling capability. The MPM takes the environmental information, such as multiple views or point clouds from sensors, the robot’s current configuration, and the goal state to output multimodal distributions of the next configuration. Based on this distribution information, the FSM acts as an information arbiter to produce a globally connectable path. The components of PSNet are shown in
Figure 1.
3.2.1. Motion Planning Module:
The motion planning module is a multilayer regression network with an encoder. The environment information
is encoded into the latent space Z through the
encoder, which can be concisely denoted as:
Subsequently, the latent space Z, the robot’s initial configurations
and the target configurations
are concatenated into a new input list
for the remaining network structure. This structure, akin to the Mixture Density Network (MDN) [
45], consists of an LSTM stacking layer (LSTMS) and three independent fully connected layers (IMDN). The LSTMS captures the probability distribution features between the current and next robot states. The IMDN is employed to map the mean
, standard deviation
and weight coefficient
which are multimodal distribution models
of the next robot configuration, based on the input data.
At this time, the robot motion planning problem
is transformed into the
of
, which is equivalent to
:
where
is the number of multimodal distribution models and
is the learned weight coefficient of each model. The
represents l-th multimodal distribution model of the form:
where
and
both learned from the multilayer regression network like
, denoting mean and standard deviation, respectively. Then, the loss function of multilayer regression network training is defined as the negative log-likelihood:
The above process can be simplified as follows:
3.2.2. Fusion Sampling Module
The classical MDN employs multinomial distribution sampling, a statistical method, to select one distribution and determine the next point. However, this stochastic sampling approach overlooks inherent uncertainty: the selected probability distribution inadequately represents the quality of all sampling points. To address this limitation, our study utilizes Dempster–Shafer (DS) theory [
46], recognized for its effectiveness in handling uncertainty, as the core algorithm in our module to replace multinomial distribution sampling, thereby enhancing the robustness of the sampling method.
Firstly, Bayesian estimation theory (BET) is employed to augment the sample. The multimodal distributions
are converted into the prior distribution parameters
and observation distribution parameters
of the BET using a weighted or averaging technique.
Then, the BET-based distribution model
is further calculated:
where
is from the
, and
is from the
. When
is replaced in Equation (6) in the form of distribution, and the mean
and standard deviation
of Bayesian estimation. BET fuses these two sources, yielding:
This produces a fused Gaussian belief , encapsulating both model confidence and observation uncertainty.
Next, we construct the basic probability assignment (BPA) functions of DS theory. In DS theory, the BPA or evidence quantifies the degree of belief assigned to each subset of the frame of discernment. Here, two types of BPA functions are defined:
MDN-derived evidence (
: The first BPA directly maps the original mixture weights to belief assignments.
where
represents any subset of the frame of discernment
(i.e., any element of the power set
) except the singleton sets
. This means we assign zero mass to all compound hypotheses, focusing belief only on individual distributions.
BET-derived evidence (
: The Bayesian estimation is used to derive a Gaussian belief, which is used to assess the likelihood of each Gaussian component.
This is normalized across all l, yielding a set of Bayesian-informed weights:
Then, the second BPA
is derived from the Bayesian estimation.
Thirdly, the BPA functions composed of the original MDN evidence and the BET-derived evidence are combined using the DS combination rule (Equation (13)) to produce the final fused evidence:
where K is:
The final evidence after fusion is expressed as:
The component
with the highest fused belief
is selected, and its corresponding mean
is used as the next robot configuration point
:
Another parameter is passed into the next process.
The inputs and outputs process in the fusion sampling module can be simplified as follows:
The implementation of our fusion sampling method enhances traditional multinomial sampling by rigorously quantifying uncertainty and integrating multiple sources of evidence. Our fusion sampling module combines prior information and observation data averaged across components, thereby explicitly accounting for sampling uncertainty. The DS theory is used to merge the original MDN distribution with Bayesian estimates, effectively resolving conflicts between different sources favoring distinct Gaussian components. This evidence combination technique redistributes probability mass to minimize the impact of outliers and prediction errors. By independently applying fusion to each configuration dimension, our method accommodates the varying uncertainties across different robot joints. The implementation features vectorized operations for computational efficiency and numerical stability, with normalization ensuring valid probability distributions.
3.2.3. Gaussian Sampling Module
The PSNet, through Gaussian resampling, can correct its collision errors. When collision points occur in the robot’s global configuration path generated by the FSM, these points, along with the subsequent point, are recorded as a collision set
. This set is reconstructed as a multimodal Gaussian model using the provided parameters, and resampling is performed to correct the collision points within the path. The expression of the execution process of GSM can be simplified as:
The schematic diagram is shown in
Figure 2.
3.3. An Enhanced Artificial Potential Field
In most cases, the neural planner PSNet successfully computes a path solution. However, for some hard cases where the PSNet fails to find a path between beacon states, an enhanced artificial potential field (EAPF) planner is activated for replanning. The classical artificial potential field (CAPF) method is renowned for its efficiency and precision in local planning [
47], but two significant drawbacks have limited its broader application [
48,
49]. The local minimum field traps the robot before reaching its global goal point. On the other hand, the repulsive field tends to repel the robot when it reaches a goal point close to an obstacle, which causes the goal to be a non-reachable phenomenon. To overcome these issues, a variable rotational field (VRF) is proposed to enhance the artificial potential field method to repair the local path when a failed case occurs in the global path planned by the PSNet.
The attractive and repulsive fields of CAPF can be computed by Equations (20) and (21), respectively.
where
denotes the attractive field, and
and
are the gain coefficient and order of attractive functions, respectively.
where
denotes the repulsive field, and
and
are the gain coefficient and order of repulsive functions, respectively.
Unlike the CAPF, the EAPF simplifies the repulsion field and introduces an extra variable field called the variable rotational field . The EAPF is derived as follows.
The repulsive field of EAPF is simplified as:
Equation (23) means the repulsive influence of unobstructed areas is eliminated, so the goal non-reachable problem does not exist.
In addition, a new variable rotational field
is defined as Equation (23) for solving the local minimum problem.
where
is the obstacle’s perimeter, and k is the coefficient.
Thus, the total potential field can be denoted as:
When the planned global path leads to a worst-case scenario, where PSNet alone cannot guarantee collision avoidance, as depicted in
Figure 3a, the EAPF planner intervenes. EAPF uses Equation (22) to create a variable rotational field encompassing clockwise and counterclockwise directions (
Figure 3b). The algorithm then evaluates the cost function based on the shortest distance and selects the corresponding rotational field direction. This selected rotational field is added to the total potential field
. A viable path solution is ultimately generated by applying a gradient optimizer within this modified potential field (
Figure 3c).
The expression of the execution process of EAPF can be simplified as follows:
3.4. PSNet- EAPF
The previous section provides a detailed description of each component of the proposed method. Algorithm 1 outlines the complete execution process of the proposed PSNet-EAPF method for robotic motion planning in semi-structured environments. The algorithm takes as input the robot’s initial configuration , goal configuration , and environment information representing obstacles. The algorithm first initializes the path with just the start configuration (line 1). Then it enters a loop that continues to recursively call PSNet until the termination condition is satisfied, which is either finding a valid path to the goal or reaching a maximum iteration threshold (lines 2–23).
Inside the loop, the MPM of PSNet takes the current configuration , goal , and encoded environment information to generate multimodal distribution parameters for the next configuration (lines 4 and 5). If the sampled is collision-free (lines 6 and 7), it is added to the path (line 9). Otherwise, the Gaussian resampling function is invoked to resample the collision set using the provided distribution parameters to try to resolve the collision (lines 12–14).
If PSNet fails to resolve the collision, the EAPF method is used to replan a valid local path segment from
to a new
that avoids obstacles (line 21). The process repeats with the new
until the goal
is reached within the iteration limit, returning the planned collision-free path
(line 23), or the iteration limit is exceeded, returning failure. The PSNet-EAPF enhances the planner’s ability to produce high-quality trajectories by hybrid leveraging both learned experience and classical planning techniques.
Algorithm 1: PSNet-EAPF |
Input: environmental information , initial state goal state |
Output: Collision-free path solution |
1: 2: 3: 4: 5: 6: 7: 8: 9: 10: 11: 12: 13: 14: 15: 16: 17: 18: 19: 20: 21: 22: 23: | Initialize (), while Termination Condition Unsatified do ; ; ; Check Motion: ); If then
; else for to Iterations Number do ; ; ; Check Motion: ); If then ;
break;
If then ; return ; |
4. Experiments
In this section, the effectiveness of the proposed PSNet-EAPF method is evaluated through comparative analysis against two established planners: MPNet, a state-of-the-art network planner that has proven its worth over numerous sampling-based optimization techniques in terms of execution time, and RRT-connect, a classical planner renowned for its efficiency. For the evaluation, two distinct planning challenges have been selected: one involving a 2-D point-mass robot and another encompassing a 6-degree-of-freedom (DOF) universal robot manipulator. These diverse scenarios serve to underscore the versatility and robustness of the PSNet-EAPF approach across a range of robotic platforms and environments. The system used for experiments has an Intel Core i5 processor with 32-GB RAM and a GeForce GTX 2060 GPU. A detailed specification is presented in
Table 1.
Considering that implementation details and engineering optimizations may interfere with the measurement of execution time, this paper uses the iteration count as a stable efficiency metric in the overall evaluation. In the high-dimensional (6-DOF) multi-joint robot experiments, however, since the time consumption of different methods differs by several orders of magnitude, the execution time is additionally reported to directly reflect the performance difference, even though no efficiency optimization has been applied to our code framework.
4.1. 2-D Point-Mass Robot
In planning a 2-D point-mass robot, three workspaces with varying complexity levels (simple, mid-complex, and complex) containing 2280 trajectories were used to train both PSNet-EAPF and MPNet. The trained models were then evaluated on separate test sample sets, each with thirty new and unseen start and goal configurations per scenario. Additionally, evaluation tests were conducted multiple times on the test sample sets for all methods compared in our experiments, and the mean values of their results were reported.
Figure 4 illustrates the paths planned by PSNet-EAPF (red), its competitor MPNet (blue), and RRT-connect (green). It can be observed that PSNet-EAPF planned paths similar to MPNet and outperformed RRT-connect in terms of robustness and shorter path lengths.
Figure 5 shows that the average number and dispersion of iterations in log-scale for PSNet-EAPF and MPNet are significantly better than for RRT-connect across all test environments. While PSNet-EAPF and MPNet exhibit similar performance in the simple environment, MPNet’s performance progressively worsens with increasing environmental complexity. This decline is most pronounced in the complex environment, where MPNet suffers a dramatic loss in both iteration efficiency and stability.
Figure 6 provides a detailed analysis of the percentage of iteration cost associated with ten randomly sampled beacon pairs from thirty new and unseen start and goal configurations for each module of the evaluated methods. Across all tested samples, PSNet and the equivalent MPNet module within MPNet-Oracle both consistently require 20–50 iterations for global path generation, showing no statistically significant difference in base planning capability. If collisions occur in the planned global path, the path repair functions of Gaussian resampling (PSNet-FSM) and the dropout layer (MPNet-Drop), integrated within PSNet and MPNet, respectively, are activated to address these local collision paths. MPNet-Drop has a lower repair success rate than PSNet-FSM and frequently fails in worst-case scenarios. Particularly in complex scenarios, as shown in
Figure 6c, the former has a 30% probability of failure, while the latter has only 10%. In such cases, the MPNet requires invoking an additional Oracle planner (OP), a sampling algorithm that consumes additional computational resources. In contrast, PSNet-FSM can efficiently help PSNet repair these collision paths. Even in extreme cases like certain collision points shown in the P08 group in
Figure 6c, EAPF can quickly find a collision-free path through dozens of iterations, avoiding the need for thousands of iterations required by sampling algorithms.
Furthermore, a new obstacle was temporarily added to each test environment to evaluate the algorithms’ ability to handle unexpected changes, specifically designed to disrupt the planned global paths (
Figure 4d–f). We have still randomly chosen some sets of results from all experimental groups, as shown in
Figure 7. The results show that PSNet can rely on its structure (PSNet-FSM) to avoid almost all temporary obstacle blockages within 50–100 iterations. In contrast, MPNet performs poorly when using its dropout structure to find obstacle-free paths, failing to avoid obstacles in almost all samples even after 100 dropout operations, and instead requiring the invocation of the Op for 800–1800 iterations of computation. Its computational cost in this environment, as tested, is significantly higher than our method, similar to RRT-connect.
4.2. 6-DOF Universal Robot Manipulator
In the high-dimensional motion planning problem for a 6-DOF universal robot manipulator, the planning task is formulated as finding collision-free joint-space trajectories (i.e., sequences of 6-dimensional joint angle configurations) from an initial configuration to a goal configuration in the configuration space (
). The goal configurations are defined such that the end-effector can successfully grasp the red workpiece from the gray storage rack and place it onto the operation platform. The planner generates a trajectory
, where each
represents a 6-DOF joint angle vector. The robot training dataset contains three industrial environments for pick-and-place tasks with two hundred trajectories each. The robot test dataset includes the same two work environments as in training but with thirty new and unseen start and goal configurations per scenario. The evaluation tests were conducted five times on the test sample sets for all methods compared in our experiments, and the mean values of their results were reported. The planning task environment contains a brown ‘L’ shaped workbench with a grey storage rack, a green protector plate, a grey operation platform, and a blue pillar randomly added and placed within the environment. The planner’s job is to find a collision-free path for the robot to pick up a red workpiece from the grey storage rack and place it on the operation platform while avoiding environmental obstacles.
Figure 8 illustrates the two test environment settings. The first test environment (
Figure 8a) presents a moderately cluttered scenario, while the second test environment (
Figure 8b) introduces a higher degree of complexity by incorporating additional obstacles. In addition, the robot manipulator must navigate through a constricted space within the grey storage rack to successfully grasp and retrieve the red workpiece in the second environment.
Figure 9 illustrates the interquartile results of iterations, in log-scale, over the entire 6-DOF robot test dataset. The proposed method maintains lower median levels than other competitors, with values of 2.89 and 3.638, and interquartile ranges (IQR) of approximately 0.054 and 0.264, respectively. MPNet-Oracle exhibits sensitivity to changes in the planning environment. Its median and IQR in environment one are comparable to ours but significantly higher in environment two, increasing from 2.89 (IQR 0.212) to 5.084 (IQR 3.028). In all test environments, the medians and IQR of RRT-connect are higher than those of our method and MPNet-Oracle. These values increase exponentially as the planning environment degrades.
We analyzed the computational overhead associated with each method’s modules for a more detailed evaluation of the proposed PSNet-EAPF algorithm and its competitors, MPNet and RRT-Connect.
Figure 10 illustrates the percentage of iteration cost attributed to each module of the evaluated methods for ten sets of randomly sampled beacon pairs. In environment one, PSNet-EAPF, on average, finds global paths in 20 iterations by invoking only the MF module. MPNet-Oracle also found global paths with similar iterations as PSNet-EAPF, except for sample P06, but its collision avoidance ability was weak. MPNet-Oracle struggled to rely on its drop module to repair local paths and avoid collisions. MPNet-Oracle appears to struggle in slightly complex environments when relying on its drop module to repair local paths and avoid obstacles, as only one out of 10 tests in
Figure 10b successfully used the drop module to resolve collision paths. Even minor ones that brush against the edge of the shelf in
Figure 8b require additional OP for resampling planning, resulting in higher computational overhead and reduced consistency.
In the multi-joint robot experiments, this study compared the iterative test results of various method modules while recording their computation time and corresponding standard deviations (SD). As shown in
Table 2, in the relatively simple obstacle environment (environment one), both PSNet-EAPF and MPNet-Oracle were able to complete path planning mainly through the network planner within approximately 0.3 s (only MPNet-Oracle invoked the Drop module in a few test cases). Although MPNet-Oracle demonstrated comparable performance in simple environments, its time cost and variance increased significantly in complex dynamic scenarios due to its dependence on the computationally expensive OP. RRT-Connect, as a pure sampling-based method, exhibits the highest time consumption, often exceeding 6 s (≈21.6 × faster), and shows the largest standard deviation, indicating unpredictable performance.
Table 3 reports the planning time of each method in environment two. In this more complex setting, the advantage of PSNet-EAPF is even more pronounced, with an average planning time of only 5.354 s and a standard deviation of 1.137 s. By contrast, MPNet-Oracle’s average computation time soars to 228.818 s (≈42.7 × faster), accompanied by a large std 203.714 s (≈55.3 × faster). This dramatic increase arises because the Drop module in MPNet struggles in high-dimensional, complex environments, prompting frequent calls to the computationally expensive Oracle Planner. As a result, the planning process transitions from rapid, network-based inference to slow random sampling and collision detection, escalating the runtime from seconds to minutes. At the same time, RRT-Connect continues to exhibit the highest computational burden among all methods.
To further evaluate each method’s capability to escape collisions, an unknown obstacle was added into the robot manipulator’s path, obstructing the continuous global path, as shown in
Figure 11. This scenario created a serious collision situation, testing the algorithms’ ability to react and replan in unexpected obstacles. The statistical results of this experiment, involving five start and goal pair configurations in both the first (P01–P05) and the more challenging second (P06–P10) test environments, were presented in
Figure 12. While both MPNet-Oracle and PSNet-EAPF face difficulties in handling such critical collision scenarios, the PSNet-EAPF method leverages the EAPF to efficiently identify near-optimal collision-free paths (Only a few dozen iterations are needed.), avoiding the disorderly search behavior exhibited by sampling-based algorithms. In contrast, the MPNet-Oracle and RRT-Connect algorithms exhibit high computational costs and significant performance variability across similar planning problems. (It requires hundreds or even tens of thousands of iterative calculations.) This highlights the effectiveness of EAPF in enhancing collision avoidance and path replanning capabilities, particularly in complex environments with unexpected obstacles.
Table 4 further illustrates the runtime of different methods in an experimental environment with newly added obstacles. The average computation time of PSNet-EAPF increases from 0.304 s and 5.354 s in environments one and two, respectively, to 11.284 s, with a corresponding rise in standard deviation. Nevertheless, compared with MPNet-Oracle and RRT-Connect, whose average runtimes reach approximately 300 s (≈30.6× and ≈26.9× faster) with standard deviations exceeding 200 s, PSNet-EAPF still demonstrates a distinct advantage.
Based on the results of the above experiments, we realized that network accuracy issues became more pronounced in high-dimensional motion planning, potentially causing oscillations in the global path near the target point. These oscillations could lead to planning failures and increase the likelihood of encountering worst-case scenarios. To quantify the robustness of the evaluated methods in addressing this challenge,
Table 5 counted the global planning success rates achieved by each algorithm across all test tasks. PSNet showed a much higher success rate than MPNet in planning collision-free paths to new target points in the original environment. This suggests that PSNet effectively addresses most minor collision issues stemming from network inaccuracy without encountering the worst-case scenarios that can trap MPNet due to its lack of necessary randomness from dropout. However, for severe collisions caused by new unknown obstacles that appear to surpass the level of network inaccuracy-induced collisions, both PSNet and MPNet exhibit unsatisfactory performance, necessitating the intervention of their respective local planners for correction. This discrepancy in performance highlights the effectiveness of PSNet in mitigating the adverse effects of network accuracy errors on global planning, thereby enhancing the overall reliability and robustness of the motion planning system in high-dimensional scenarios.
6. Conclusions
This study presents a pioneering approach that synergistically combines a probabilistic sampling network with an enhanced artificial potential field to efficiently resolve intricate planning challenges within semi-structured environments. By incorporating the innovative IMDN model into the conventional motion planning network, a multitude of Gaussian distributions are generated, effectively mitigating network accuracy errors. Furthermore, we embed a Gaussian resampling function within our PSNet framework to bolster its local collision evasion prowess. Complementing these advancements, an enhanced artificial potential field, addressing local minima and repulsive field limitations, is included as an independent module to handle worst-case scenarios. The experimental results provide quantitative validation of our method’s advantages. PSNet-EAPF achieved a reduction in median iteration count by an order of magnitude, from thousands to tens or low hundreds, in high-complexity environments. This iteration efficiency directly translated into a drastic reduction in wall-clock planning time. Our method consistently generated paths in seconds (e.g., 5.354 s in a complex 6-DOF task), while baseline methods frequently required minute-scale times (e.g., 228.818/295.849 s for MPNet-Oracle) under the same conditions.
Future work will focus on refining the algorithm and extending its applicability to broader robotic platforms. The current design primarily emphasizes functional realization, leaving substantial room for improving computational efficiency. Therefore, improving the overall algorithm performance by optimizing redundant results in modules such as Gaussian sampling and refining the code framework will become the core objective of the next phase. Although the current validation has been conducted on a high-degree-of-freedom manipulator system, the proposed algorithmic framework possesses inherent transferability to other robotic platforms. We plan to explore its application in mobile robot navigation and investigate cooperative planning problems in mobile manipulator systems, particularly addressing the complex challenges arising from coordinating chassis motion with arm configuration.