1. Introduction
Trajectory planning is fundamental to robotic applications, as it ensures the successful execution of assigned tasks and governs the efficiency and accuracy of task completion. The rapid development of robotic arm technology in recent years has led to progressively stricter demands in their operational environments. Consequently, the performance demands on robotic arms have also grown, and the performance requirements for robotic arms vary under different working conditions. Trajectory optimization for robotic arms has always been the goal pursued by researchers.
Trajectory planning investigates the time-dependent relationship of a robot’s pose during motion, aiming to describe the state of joint parameters at specific time points. Currently, there exist numerous trajectory planning methods for robotic arm joint space, which can generally be described through two distinct phases: The first phase involves joint trajectory interpolation, where joint paths are processed through interpolation techniques. Common interpolation methods include polynomial interpolation, B-spline curves, NURBS, and so on. The second phase focuses on trajectory optimization. Following on from the interpolation process, optimization algorithms are employed to refine time nodes, ensuring trajectory smoothness and enhancing the robotic arm’s operational efficiency. Through these two phases of processing, precise and efficient robotic arm trajectory planning can be achieved.
The joint trajectory interpolation of robotic arms involves constructing joint trajectories using different interpolation curves based on specific task requirements and the posing of information at the start and end points. Regarding this, scholars worldwide have conducted extensive discussions. For instance, Wang et al. [
1] established a 3-5-3 polynomial interpolation function as the foundation for trajectory planning and achieved the time-optimal optimization of robotic arm joint trajectories by incorporating an improved cuckoo search algorithm. Dinçer et al. [
2] integrated cubic polynomials with Bézier curves using quadratic Bernstein basis functions to develop a new hybrid polynomial. The experimental results demonstrated that this composite polynomial could achieve smoother motion trajectories while significantly reducing mechanical vibrations. Boryga [
3] proposed a smooth trajectory generation approach based on high-degree polynomial functions. The research formulated acceleration polynomials and determined exact analytical relationships for time and coefficients while satisfying constraints. Liu et al. [
4] employed cubic B-splines for trajectory interpolation and the integrated particle swarm optimization algorithm to optimize motion time. The experimental results demonstrated that this approach provides an optimal trajectory solution for robotic arm joint space applications. Li et al. [
5] proposed a smooth trajectory planning method that decomposes the path into two orthogonal coordinate axes and subsequently employs quintic B-spline curves to generate motion profiles. This approach achieves continuous and stable joint trajectories while significantly reducing end-effector residual vibrations. Lan et al. [
6] employed seventh-degree B-spline curves to generate smooth joint motion profiles, ensuring continuity in collaborative robot joint curves. Fang et al. [
7] proposed an S-curve-based trajectory planning method that constructs joint trajectories using piecewise S-functions. This approach achieves more efficient and computationally simpler trajectory planning compared to conventional polynomial methods while allowing adjustable trade-offs between efficiency and smoothness through limit value modulation. Wang [
8] developed an innovative NURBS-based contour control strategy for robotic systems. This method utilizes contour errors as a key performance indicator and proposes a tangent-approximation contour error estimation technique, significantly enhancing precision in contour tracking applications.
Trajectory optimization can be classified into single-objective and multi-objective methods. The former concentrates on optimizing one particular performance criterion while neglecting other factors, whereas the latter simultaneously handles multiple competing objectives. Multi-objective optimization typically involves obtaining Pareto optimal solutions, from which the most suitable compromise solution is chosen based on practical application requirements and constraints. Han et al. [
9] proposed an enhanced Particle Swarm Optimization (PSO) algorithm that effectively addresses the local optima entrapment problem inherent in conventional approaches. The improved algorithm achieves superior optimization outcomes through better equilibrium between exploration and exploitation capabilities. Du et al. [
10] proposed a novel locally chaotic particle swarm optimization algorithm that effectively resolves the premature convergence issue inherent in conventional algorithms. Zhang et al. [
11] employed hybrid inverse kinematics to compute critical sequence points for joint trajectories, subsequently proposing an adaptive cuckoo search algorithm with enhanced convergence properties. This methodology successfully generated smooth, time-optimal trajectories while satisfying dynamic constraints. Zhang et al. [
12] partitioned the joint space of the robotic arm and subsequently proposed an enhanced sparrow search algorithm. The algorithm enhanced both convergence speed and search capability through the integration of chaotic mapping and adaptive step-size adjustment, ultimately generating smooth operational trajectories. Xu et al. [
13] obtained a trajectory with smooth angular velocity and no acceleration jump by combining the multi-population search strategy of the Harris Hawks algorithm with the quintic polynomial interpolation trajectory. Li et al. [
14] proposed a stability-constrained trajectory planning method that incorporates an enhanced version of Non-Dominated Sorting Genetic Algorithm-II (NSGA-II) to simultaneously optimize both execution time and energy consumption for robotic arm trajectories, thereby improving motion stability. Shi et al. [
15] applied multi-objective optimization to address trajectory planning. They considered both kinematic and dynamic constraints. The researchers developed a normalized weighted objective function. This function helped select optimal solutions from the Pareto front. This method successfully generated high-order continuous optimal trajectories. Yang et al. [
16] integrated multiple improvement strategies into the Whale Optimization Algorithm (WOA). The resulting Multi-Strategy Whale Optimization Algorithm (MSWOA) showed substantially better search performance. This algorithm proves particularly effective for global optimization challenges. Liu et al. [
17] developed an improved multiverse algorithm by enhancing the wormhole probability curve, adaptively adjusting parameters, and incorporating population mutation fusion. The proposed algorithm improves convergence speed and global search capability, thereby enhancing the comprehensive performance of robotic arms. Liu et al. [
18] proposed an improved multi-objective grasshopper optimization algorithm that effectively addresses multi-objective optimization in orthogonal robots, significantly mitigating operational unevenness issues. Xu et al. [
19] developed a multi-objective trajectory planning approach for spraying robots. This method combines the “7-5-7” hybrid interpolation technique with the hybrid multi-objective NSGA-II algorithm. Tests show that optimized paths achieve faster operation, less energy use, and have minimized mechanical impact.
In summary, numerous scholars have conducted in-depth research on interpolation and optimization methods for robotic trajectory planning. These studies mainly use interpolation methods such as polynomials, B-splines, and NURBS to construct joint trajectories and then employ optimization algorithms to perform trade-off optimization among multiple objectives, including time, energy consumption, and jerk. However, due to the inherent randomness of optimization algorithms, they often face issues during optimization, such as an insufficient global search capability leading to local optima and inadequate local convergence. How to better balance the relationship between global search and local convergence to reduce any impact on the optimization results remains a key research direction in the field of robotic trajectory planning.
This research proposes a multi-objective trajectory planning method for robotic arms based on the MOPO algorithm to address three key performance metrics during operation: time, energy, and impact. This method utilizes quintic NURBS to construct joint trajectories for the robotic arm, ensuring the continuity and smoothness of the motion paths. Subsequently, the joint trajectories are optimized using an improved MOPO algorithm, which incorporates enhanced parrot optimization techniques and introduces multi-objective strategies. Finally, experimental results verify the performance advantages of the algorithm and demonstrate the effectiveness of the proposed method for solving multi-objective robotic trajectory planning problems.
The preceding sections are arranged as follows:
Section 2 presents the trajectory planning problem and selects quintic NURBS curves to construct joint trajectories.
Section 3 elaborates on the enhanced parrot optimization algorithm and presents the developed MOPO algorithm through the incorporation of multi-objective strategies.
Section 4 presents the analytical results of algorithmic performance comparisons along with comparative evaluations of the optimization outcomes obtained through multi-objective trajectory planning for the robotic arm. Finally,
Section 5 outlines the main conclusions.
3. Multi-Objective Parrot Optimization Algorithm
3.1. Parrot Optimization Algorithm
PO is an efficient metaheuristic algorithm inspired by four key behaviors observed in parrot populations: foraging, staying, communicating, and stranger avoidance [
20]. These behaviors are mathematically encapsulated through four corresponding computational models in the algorithm:
Unlike traditional metaheuristic algorithms, during each iteration, every individual in the PO population randomly exhibits one of these four behaviors—that is, each individual randomly selects one strategy to update its search process rather than following a fixed update pattern. This approach offers a more biologically faithful representation of behavioral stochasticity observed in parrots while substantially improving population diversity. The randomized architecture of PO distinguishes it from conventional algorithms, effectively mitigating local optima entrapment risks while preserving solution quality.
In the PO algorithm, an initial population of random solutions is first generated. These candidate solutions are then iteratively updated using the current best solution and predefined update rules to search for the global optimum. During optimization, each individual continuously adjusts its position dynamically until specified termination criteria are met.
Figure 1 shows the complete flowchart of the PO algorithm.
3.2. Multi-Objective Parrot Optimization Algorithm
In the PO algorithm, the random selection of an update strategy during each iteration introduces a high degree of randomness. This may lead to insufficient exploration depth or susceptibility to local optima. To address these issues, this research introduces several modifications to the original algorithm.
In the original algorithm, the selection of one of the four strategies during each iteration is determined by a random number, resulting in a high degree of randomness. To address this, this research introduces the parameter , representing the ratio of the current iteration count to the total iteration count. During each iteration, the ratio is compared with a constant, a, to assess the progress of the algorithm and adjust the probabilities of selecting each strategy. Among the four update strategy formulations in the PO algorithm, Equations (14) and (16) correspond to the algorithm’s exploration (breadth search), while Equations (15) and (17) are responsible for exploitation (depth search). Therefore, in the improved algorithm, by judging the iterative process of the algorithm and adjusting the probability of selecting each strategy dynamically, it is ensured that there is a greater probability of selecting the breadth search strategy formula in the early stages of the algorithm, so as to effectively prevent premature convergence to local optima. Conversely, in the later stages, the probability of selecting exploitation strategies is increased to enhance the depth-searching capability, thereby improving the accuracy of convergence towards the optimal solution.
Currently, solving multi-objective optimization problems using similar optimization algorithms generally follows two approaches: One is to deal with multiple objective function problems through weighted summation, gray correlation analysis, the analytic hierarchy process, and other methods, transform multi-objective problems into a single-objective optimization problem, and then apply single-objective optimization algorithm for optimization. The second is to establish mathematical functions that can evaluate and select the most favorable solutions for individuals and the overall population during the iterative process. A multi-objective optimization algorithm is then employed to iteratively generate an optimal Pare-to solution set, from which the most suitable solutions are selected according to task requirements. Multi-objective optimization refers to solving two or more mixed objective functions under certain constraints. In multi-objective trajectory optimization problems, it is often challenging to determine appropriate weights for the various objective functions, and these objectives may sometimes come into conflict with one another. As a result, traditional single-objective optimization methods are often inadequate for solving such problems. This research adopts the second approach. On the basis of improving the PO algorithm, strategies such as dominance relationships, external archive sets, grid partitioning, and roulette wheel selection are introduced to adapt the algorithm for multi-objective optimization. The resulting MOPO algorithm is designed to comprehensively solve problems involving multiple objective functions.
First, the dominance relationship is defined as follows: A solution p is said to dominate solution q if and only if both of the following conditions are satisfied: (1) for all objective functions, p is not worse than q, i.e., , (k = 1, 2, …, r); (2) there exists at least one objective function where p is strictly better than q, i.e., such that . In addition, in single-objective optimization, where only one objective function exists, selecting the global best individual is straightforward by comparing the value of the objective function. However, in multi-objective optimization, where multiple objective functions are typically involved, the direct comparison of individual objective function values is not applicable. In the MOPO proposed in this research, the global best individual is selected as follows: First, an external archive set is defined to store non-dominated individuals discovered during the search process. During each iteration, grids are generated and divided based on all individuals in the external archive. The grid boundaries are determined by the maximum and minimum values. The number of individuals in each grid is then used as a fitness value for roulette wheel selection. The fitness values are inverted, so smaller fitness values have a higher probability of being selected. This approach ensures diversity among the solutions. Once a grid is selected, a random individual within that grid is chosen as the global best individual for updating the algorithm in the current iteration.
The main difference between MOPO and the original single-objective algorithm during iteration lies in the following processes: initializing the state information of n particles in the population and setting the maximum size of the external archive set to N; dividing the entire population into a dominant set W and a non-dominant set Q based on fitness dominance relationships, and placing the particles in Q into the external archive set Ψ; during updates, the dominant set, W, is first updated, calculating the fitness of the new dominant set W, and then processing the dominance relationships with the non-dominant set Q to update W and Q. A new solution can only be incorporated into the external archive set if it is non-dominated with respect to all existing solutions within the archive. Furthermore, any archived solution that becomes dominated by newly introduced solutions during this process will be systematically removed. The population is iteratively updated, and when the termination condition is met, the population optimization is completed, resulting in the Pareto solution set, which represents a set of optimal solutions.
At this point, the MOPO algorithm is obtained. The flowchart of the algorithm is illustrated in
Figure 2.
The pseudocode of the MOPO algorithm (Algorithm 1) is presented below, where Pop represents the population, Obj_fun denotes the function for calculating objective function values, Const_fun indicates the function for evaluating constraint conditions, Dominate_fun represents the function for determining dominance relationships, Grid_fun and Roulette_fun represent the functions used to divide the grid and the function of selecting the global best solution, respectively; finally, St_fun represents the function of selecting the update strategy.
Algorithm 1 MOPO Algorithm |
1: Initialize the MOPO parameters 2: Initialize the Pop’ positions randomly 3: Obj_fun(Pop) and Const_fun(Pop) 4: Initialize individual best solution Pbest 5: Archive ← Dominate_fun(Pop)
21: Update Archive ← Dominate_fun (Archive, Pop) 22: End 23: End |
4. Simulation Verification and Analysis
After completing the derivation of the MOPO algorithm, two experimental subsections are designed for comparative verification to ensure the proposed method can be effectively applied to multi-objective trajectory planning for robotic arms. First, the obtained MOPO algorithm is validated using standard multi-objective test functions, with comparative analysis conducted to examine its effectiveness in solving multi-objective optimization problems and its performance advantages. Subsequently, by integrating interpolation curves, the multi-objective optimization of joint trajectories is performed under robotic kinematic constraints based on time–energy–jerk criteria, verifying the effectiveness of the proposed method for solving robotic multi-objective trajectory planning problems.
4.1. Testing Environment
The simulation was conducted on a system equipped with a 12th Gen Intel® Core™ i7-12700 20-core processor and 16 GB of RAM (Lenovo, Tianjin, China). Computational experiments were conducted on a Windows 11 operating system using MATLAB R2020a as the simulation platform.
4.2. Performance Verification of MOPO Algorithm
In multi-objective optimization problems, evaluation metrics can generally be categorized into convergence metrics and diversity metrics based on the different emphases of an assessment, which measure the convergence and distribution characteristics of the obtained solution sets, respectively. Among them, the Generational Distance (GD) [
21] is one of the most classical convergence metrics, the value of which represents the distance between the non-dominated solution set obtained by the algorithm and the ideal reference Pareto front. A smaller GD value indicates the better convergence of the solution set, meaning that the algorithm exhibits stronger convergence capability, thereby yielding solutions closer to the true Pareto front. This metric reflects the algorithm’s local convergence ability and is defined as follows:
where
n represents the number of non-dominated solutions obtained by the algorithm, and
di denotes the Euclidean distance between the solution
xi and Pareto front.
Among diversity metrics, the Spacing (SP) [
22] metric serves as a representative indicator. Its value is derived from the standard deviation of the minimum distances between each solution and its nearest neighboring solution in the obtained non-dominated set. A smaller SP value indicates the more uniform spatial distribution of solutions within the solution set, demonstrating the algorithm’s stronger exploration capability, yielding a better diversity for non-dominated solutions. This metric quantitatively evaluates the distribution characteristics of the solution set, thereby reflecting the algorithm’s global search performance, formally defined as follows:
where
doi represents the Euclidean distance between solution
xi and its nearest neighboring solution in the obtained non-dominated set, and
d denotes the arithmetic mean of all
doi values.
When evaluating the algorithm’s performance using the aforementioned metrics, the true Pareto front of the optimization problem must be precisely defined. The ZDT test functions serve as classical benchmark problems in multi-objective optimization, featuring well-defined Pareto fronts and distinct optimization characteristics that enable an accurate and objective performance assessment. Among them, the Pareto front of the ZDT1 function is continuous and convex; the Pareto front of the ZDT2 function is continuous and concave; and the Pareto front of the ZDT3 function is discrete and non-continuous. These three functions cover optimization problems with different characteristics, such as convexity, concavity, and discreteness, which can more comprehensively evaluate algorithm performance and ensure the completeness of the algorithm’s evaluation. Therefore, the standard test functions ZDT1, ZDT2, and ZDT3 were selected to validate the effectiveness of the MOPO algorithm in solving multi-objective optimization problems, with the representative multi-objective optimization algorithms NSGA-II and MOPSO being chosen as control groups for comparative performance analysis to verify the performance advantages of the proposed MOPO algorithm.
To ensure the fairness and objectivity of the experiments, uniform parameter configurations were set for all algorithms: a population size of 200, an external archive size of 200, and a maximum iteration count of 200. The optimization results of each algorithm on the standard test functions ZDT1, ZDT2, and ZDT3, represented by the obtained Pareto solution sets, are shown in
Figure 3,
Figure 4, and
Figure 5, respectively.
To ensure the reliability of the experimental results, all three algorithms were independently executed 50 times for each standard test function. The average values of both GD and SP evaluation metrics were then calculated, with the detailed results presented in
Table 1.
A comprehensive analysis of
Figure 3,
Figure 4 and
Figure 5 shows that to solve these three standard test functions, while both NSGA-II and MOPSO algorithms can converge to the true Pareto front, their solution sets exhibit uneven and discontinuous distributions with some missing solutions. This indicates that these algorithms are prone to the problem of falling to a local optimum due to an insufficient global search ability in the optimization process. In contrast, the Pareto-optimal solution set obtained by the MOPO algorithm demonstrates a more accurate and uniform continuous coverage of the true Pareto front of the test functions, indicating a superior balance between global exploration and local exploitation. Further analysis of the data in
Table 1 shows that the MOPO algorithm achieves lower average values for both GD and SP metrics compared to the other two algorithms, indicating better convergence and diversity for the optimal solution set. This more precisely and intuitively reflects its stronger global search capability and local convergence performance, further demonstrating the effectiveness and performance advantages of the proposed MOPO algorithm in solving multi-objective optimization problems.
4.3. Validation of Multi-Objective Trajectory Planning for Robotic Arms
This section adopts the UR5 6-DOF serial robotic arm as a case study. The planned task-space trajectory is transformed into joint-space motion paths through inverse kinematics, yielding the corresponding joint trajectory curves. The characteristic sampling of these joint-space paths generates the discrete path sequence, as presented in
Table 2.
Based on the path sampling point sequence derived from
Table 2, the joint-space waypoints for the robotic arm are determined. Under specified constraints, the MOPO algorithm is applied to conduct the multi-objective optimization of time–energy–jerk for the robotic arm, thereby verifying the effectiveness of the proposed algorithm in solving multi-objective trajectory planning problems in robotics. The algorithmic parameter configuration is detailed in
Table 3.
Among these parameters, N denotes the population size; Maxiter represents the number of iterations; and Ψ indicates the size of the external archive set. Reference to the empirical parameter settings of other multi-objective optimization algorithms in existing studies and preliminary algorithm validation experiments demonstrate that a population size set at around 200 can effectively balance population diversity and computational complexity. The algorithm typically converges to the Pareto front after 100 iterations, and considering optimization efficiency, setting it to 200 iterations is therefore sufficient. The size of the external archive set affects the distribution density of the final Pareto optimal solution set obtained and is generally set to 100 to ensure uniform distribution along the Pareto front. Additionally, dim represents the number of optimization variables, which in this trajectory optimization problem is determined by the number of trajectory sampling points. ub and lb represent the upper and lower bounds of the search space for the optimization variables, respectively, which are appropriately sized for the temporal optimization variables considered in this research. β and γ are parameters in the update formula, maintained consistently with the original algorithm.
The motion constraints for each joint of the UR5 robotic arm are specified in
Table 4, corresponding, respectively, to
,
, and
in constraint conditions in Equation (13).
Using Equation (12) as the objective function, the MOPO algorithm was applied to optimize the robotic arm’s trajectory. Quintic NURBS curves were employed to construct the joint trajectories, with the time intervals
serving as optimization variables. The algorithm identified solutions under the specified objectives, yielding the Pareto solution set distribution shown in
Figure 6.
Subsequently, based on the distribution characteristics of the Pareto solution set, three representative solutions were selected to exemplify optimal solutions at distinct regions of the Pareto front, as illustrated in
Figure 7.
In
Figure 7, the objective function values corresponding to points A, B, and C are used to discuss the impacts of selecting different solutions from the Pareto front on the multiple predefined objectives of the robotic arm. As shown in the figure, the Pareto front can be regarded approximately as a curved line. When the selected solution approaches point A, objective function 1 (time) decreases while objectives 2 (energy) and 3 (impact) increase. This indicates that shorter execution times lead to higher energy consumption and greater mechanical shock in the robotic arm, as higher velocities are required to achieve the reduced duration. Conversely, when the selected solution approaches point C, objective function 1 (time) increases while objectives 2 (energy) and 3 (impact) decrease. This means that relatively longer operation times result in significantly reduced energy consumption and mechanical shock. In practical applications, appropriate solutions should be selected from the Pareto front according to specific operational conditions and different task requirements for the robotic arm.
To verify the effectiveness and optimization performance of the proposed algorithm for solving the multi-objective trajectory planning problem of robotic arms, Solution B is taken as an example to present the optimization results and comparisons. In the pre-optimization phase, uniform time sampling was applied to interpolate between adjacent joint trajectory nodes. Based on the given path nodes and constraints, the robotic arm’s joint motion curves—including position, angular velocity, angular acceleration, and jerk profiles—were generated using quintic NURBS interpolation, as shown in
Figure 8.
As shown in
Figure 8, the joint angle curves in subplot (a) demonstrate excellent fitting accuracy to the original joint trajectory sequence, ensuring precise trajectory planning. Subplots (b) and (c) reveal that all joint velocities and accelerations achieved zero values at both start and end points, guaranteeing smooth motion transitions. Furthermore, the overall trends of velocity, acceleration, and jerk curves exhibit continuous and smooth characteristics without abrupt discontinuities, confirming that the quintic NURBS-based interpolation method effectively maintains trajectory smoothness.
However, the unoptimized time intervals between trajectory sequence points result in significant fluctuations in the motion curves of each joint. These fluctuations may affect the motion stability of the robotic arm. Therefore, further optimization is required to adjust the distribution of path time parameters, thereby reducing trajectory oscillations and improving both the smoothness and execution stability of the trajectory.
For Solution B, the optimal time vector
was applied to interpolate the robotic arm’s joint trajectories using the optimized time node sequence. The resulting performance curves for each joint after optimization are shown in
Figure 9. The values of the three objective functions (time, energy, and impact) before and after optimization are presented in
Table 5.
As can be seen from
Figure 9, the joint angle curves maintain good continuity and smoothness while conforming to the target planned path. Firstly, the velocity, acceleration, and jerk curves of all joints showed no abrupt changes, ensuring smoothness throughout the motion process. Furthermore, both velocity and acceleration were zero at the start and stop moments, satisfying the kinematic constraints of trajectory planning and avoiding sudden impacts or instability factors. A further comparison with the pre-optimized joint curves in
Figure 8 demonstrates that the smoothness of the joint trajectories was significantly improved. The variations in velocity, acceleration, and jerk became more uniform, with substantially reduced fluctuations, resulting in the smoother and more stable motion of the robotic arm.
Further analysis of the data in
Table 5 indicates that taking Solution B as an example, the optimized results show the following improvements compared to the pre-optimized state: the time performance metric was improved by 8.52%, the energy performance metric was improved by 32.04%, and the impact performance metric was improved by 34.86%. While all joint motion curves satisfied the kinematic constraints of the robotic arm, the three predefined objectives were optimized simultaneously, further verifying the effectiveness of the method and providing a novel multi-objective trajectory planning approach for robotic arm applications.