UAV Path Planning Using a State Transition Simulated Annealing Algorithm Based on Integrated Destruction Operators and Backward Learning Strategies

Liu, Jianping; Han, Xiaoxia; Liu, Fengyi; Wu, Jinde; Zhang, Wenjie

doi:10.3390/app15116064

Open AccessArticle

UAV Path Planning Using a State Transition Simulated Annealing Algorithm Based on Integrated Destruction Operators and Backward Learning Strategies

by

Jianping Liu

,

Xiaoxia Han

^*,

Fengyi Liu

,

Jinde Wu

and

Wenjie Zhang

School of Electrical and Power Engineering, Taiyuan University of Technology, Taiyuan 030024, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(11), 6064; https://doi.org/10.3390/app15116064

Submission received: 22 March 2025 / Revised: 3 May 2025 / Accepted: 8 May 2025 / Published: 28 May 2025

Download

Browse Figures

Versions Notes

Abstract

:

This study introduces a state transition simulated annealing algorithm that incorporates integrated destruction operators and backward learning strategies (DRSTASA) to address complex challenges in UAV path planning within multidimensional environments. UAV path planning is a critical optimization problem that requires smooth flight paths, obstacle avoidance, moderate angle changes, and minimized flight distance to conserve fuel and reduce travel time. Traditional algorithms often become trapped in local optima, preventing them from finding globally optimal solutions. DRSTASA improves global search capabilities by initializing the population with Latin hypercube sampling, combined with destruction operators and backward learning strategies. Testing on 23 benchmark functions demonstrates that the algorithm outperforms both traditional and advanced metaheuristic algorithms in solving single and multimodal problems. Furthermore, in eight engineering design optimization scenarios, DRSTASA exhibits superior performance compared to the STASA and SNS algorithms, highlighting the significant advantages of this method. DRSTASA is also successfully applied to UAV path planning, identifying optimal paths and proving the practical value of the algorithm.

Keywords:

STASA; destruction operators; backward learning strategies; UAV path planning

1. Introduction

Unmanned aerial vehicles (UAVs) are widely used in various fields such as military operations, agriculture, disaster relief, environmental monitoring, and communications due to their low cost, high maneuverability, and adaptability [1,2,3]. However, path planning in complex and dynamic environments remains a significant challenge, as it must ensure obstacle avoidance, flight safety, and energy efficiency. Finding the optimal path has become a key issue in UAV applications [4].

Traditional path planning methods such as the A* algorithm [5,6], ant colony optimization (ACO) [7], and particle swarm optimization (PSO) [8] have been widely used but often struggle to balance real-time performance, global optimality, and obstacle avoidance. Therefore, researchers have worked to refine or integrate these methods with new approaches to improve their performance.

For instance, Jie Chen, Fang Ye, and Tao Jiang proposed a UAV trajectory planning strategy that leverages the rapid optimization capabilities of ant colony optimization (ACO), though it did not account for complex factors like flight angles [9]. Luji Guo, Chenbo Zhao, and Jiacheng Li optimized the cost function weights of the A* algorithm and introduced virtual target points in the artificial potential field (APF) method, improving both path smoothness and obstacle avoidance [10]. Hongbo Xiang, Xiaobo Liu, and their team combined enhanced particle swarm optimization (EPSO) with genetic algorithms, adaptively tuning the acceleration coefficients of EPSO based on fitness values, which enhanced global search [11]. Dongcheng Li, Wangping Yin, and W. Eric Wong applied Q-learning to dynamically adjust factors such as step size and cost function in the A* algorithm, creating a hybrid method that effectively balances global and local search [12]. Bo Li, Xiaogang Qi, and Baoguo Yu incorporated the Metropolis criterion from simulated annealing into ACO, reducing the likelihood of getting trapped in local optima. They also employed an inscribed circle smoothing method to optimize trajectories, enhancing both feasibility and effectiveness [13].

In recent years, the rapid development of artificial intelligence and intelligent optimization algorithms has led to the increasing application of advanced algorithms and deep learning methods in UAV path planning. Hui Li, Teng Long, and Guangtong Xu proposed a coupling degree-based heuristic priority planning method (CDH-PP), which enhances the efficiency of cooperative path planning for UAV swarms through coupling degree heuristics [14]. Ronglei Xie, Zhijun Meng, and Lifeng Wang addressed path planning challenges in complex dynamic environments using deep learning, introducing the RQ method and an adaptive sampling mechanism to improve obstacle avoidance capabilities [15]. Liguo Tan, Yaohua Zhang, and Jianwen Huo combined the rapidly-exploring random tree (RRT) algorithm with driver visual behavior to propose an RRT path planning method that simulates driver vision. They optimized the path using a greedy algorithm; however, while this method is effective in complex scenarios, it exhibits high computational complexity [16]. Xing Wang, Jeng-Shyang Pan, and Qingyong Yang introduced an improved mayfly algorithm (modMA), which optimizes UAV layout and reduces overall costs through exponentially decreasing inertia weights (EDIW), adaptive Cauchy mutations, and enhanced crossover operators [17]. Jingfan Tian, Yankai Wang, and Dongdong Yuan improved the global optimization capability of the elastic rope algorithm by enhancing the node update mechanism and introducing paths composed of m-dimensional nodes to increase its applicability [18].

Additionally, other studies have proposed innovative strategies for algorithm improvement. Xiaohui Cui, Yu Wang, and Shijie Yang effectively addressed two-point and multi-point path planning problems by employing a chaotic initialization method combined with genetic algorithms (GA) and simulated annealing (SA) [19]. Xiaobing Yu, Chenliang Li, and Jiafang Zhou introduced an adaptive selection mutation-constrained differential evolution algorithm, which shows promising application prospects in disaster scenarios [20]. Xiangyin Zhang, Shuang Xia, and Xiuzhi Li developed a quantum theory-improved firefly algorithm (QFOA), utilizing wave functions to replace random search, thereby enhancing population diversity and outperforming the traditional firefly algorithm (FQA) in terms of search capability, stability, and robustness [21]. The combination of simplified grey wolf optimization (SGWO) and the mutual symbiotic organism search (MSOS) algorithm has also demonstrated potential in UAV path planning [22]. To improve path planning efficiency and ensure the smoothness and safety of UAV operations, Yaqing Chen, Qizhou Yu, and Dan Han proposed a hybrid algorithm that integrates grey wolf optimization (GWO) with the artificial potential field method (APF), significantly enhancing path planning capabilities, particularly in scenarios with short distances and high safety requirements [23]. Despite the widespread application of intelligent algorithms and neural networks in UAV path planning, most algorithms remain limited to two-dimensional path planning, failing to adequately consider factors such as obstacle height and flight slope, while some methods exhibit high computational complexity.

The state transition simulated annealing algorithm (STASA), a hybrid intelligent optimization algorithm proposed by Han et al. [24]. It combines the strengths of the state transition algorithm (STA) [25] and simulated annealing (SA) [26], achieving significant results in various application areas, such as PM2.5 prediction [27] and the optimization of methane production conditions in coal-to-gas processes [28]. Compared to traditional algorithms such as genetic algorithm (GA) [29], particle swarm optimization (PSO) [30], and grey wolf optimizer (GWO) [31], STASA demonstrates greater efficiency and flexibility. STASA offers greater efficiency and flexibility. However, its performance declines when addressing complex, high-dimensional problems.

To address the shortcomings of the state transition simulated annealing algorithm (STASA), this paper introduces the state transition simulated annealing algorithm with destructive perturbation operators and reverse learning strategies (DRSTASA). This algorithm enhances population diversity using Latin hypercube sampling and incorporates destructive perturbation and reverse learning strategies to improve global search, speed up convergence, and avoid local optima. Experimental results show that DRSTASA outperforms other algorithms on 23 benchmark test functions and 8 engineering optimization problems. It has also been successfully applied to three-dimensional UAV path planning, demonstrating strong practical feasibility [32].

The rest of this paper is structured as follows: Section 2 covers the objective function for UAV path planning, Section 3 explains the original STASA algorithm, Section 4 describes the enhancements made by DRSTASA, and Section 5 presents a comparison with other algorithms and experimental results. Finally, a summary is provided.

2. UAV Three-Dimensional Path Planning

With the rapid advancement of unmanned aerial vehicle (UAV) technology, these systems have found widespread applications in military operations, agriculture, logistics, and environmental monitoring. To ensure that UAVs can safely and effectively execute tasks in complex environments, path planning has emerged as a critical issue. The objective of UAV path planning is to identify an optimal route in three-dimensional space that enables the UAV to travel from a starting point to a destination while avoiding obstacles and adhering to safety and energy consumption requirements. The UAV three-dimensional path planning problem aims to find an optimal trajectory that satisfies the following objectives:

The Shortest Path: The total flight distance from the starting point to the endpoint should be minimized.
Obstacle Avoidance: The flight path must navigate around all obstacles to ensure safety.
Flight Altitude: The flight path must comply with the specified altitude restrictions.
Path Smoothness: The trajectory should be as smooth as possible, avoiding sharp turns and steep inclines.

Therefore, this problem primarily considers four key factors. The specific cost function and objective function for UAV path planning are defined in the following subsection.

2.1. Path Optimality

Path planning must achieve optimality under specific criteria to ensure the efficient operation of UAVs. For applications such as aerial photography, surveying, and surface inspection, minimizing the path length is a primary objective.

The cost function for path length

F_{1}

can be expressed as follows:

F (X_{i}) = \sum_{j = 1}^{n - 1} ‖ \vec{P_{l j} P_{l, j + 1}} ‖

(1)

X_{i}

represents the list of n waypoints that the flight path needs to traverse, with the coordinates of each waypoint denoted as

P_{i j} = (\begin{matrix} X_{i j}, Y_{i j}, Z_{i j} \end{matrix})

.

2.2. Safety and Feasibility Constraints

In path planning, ensuring the safe operation of the UAV is crucial. This involves avoiding obstacles in the environment while maintaining flight within a specified altitude range. Furthermore, the flight path should be as smooth as possible to prevent abrupt ascents and descents. Consequently, the remaining cost functions include threat cost, flight altitude cost, and path smoothness cost.

Threat Cost:

F_{2} (X_{i}) = \sum_{j = 1}^{n - 1} \sum_{K = 1}^{K} T_{k} (\vec{P_{ι j} P_{ι, j + 1}})

(2)

where

T_{k} (\vec{P_{i j} P_{i, j + 1}}) \{\begin{array}{l} 0 & i f d_{k} > S + D + R_{k} \\ (S + D + R_{k}) - d_{k} & i f D + R_{k} < d_{k} \leq S + D + R_{k} \\ \infty & i f d_{k} \leq D + R_{k} \end{array}

(3)

In the problem, it is assumed that the set K contains several cylindrical obstacles, each defined by a center coordinate

C_{k}

and a radius

R_{k}

. The diameter of the UAV is represented by D. The vertical distance from adjacent path nodes to the origin is denoted as

d_{k}

, and S represents the danger zone of the obstacles. The experimental parameters were set as k = 6,

R_{k}

= 80,

P_{start}

= [200,100,150], and

P_{end}

= [800;800;150]. The obstacle layout is illustrated in Figure 1.

In practice, there are usually specific altitude requirements for UAVs, which stipulate that the UAV must operate within a defined range between a minimum and maximum height. The cost associated with flight altitude can be calculated using the following formula:

H_{i j} = \{\begin{array}{l} |h_{i j} - \frac{h_{m a x} + h_{m i n}}{2}| & i f h_{m i n} \leq h_{i j} \leq h_{m a x} \\ \infty & o t h e r w i s e \end{array}

(4)

The total altitude cost

F_{3}

can be calculated using the following formula:

F_{3} (X_{i}) = \sum_{j = 1}^{n} H_{i j}

(5)

where

h_{i j}

represents the current altitude of the UAV, and

h_{m a x}

and

h_{m i n}

denote the maximum and minimum allowable flight altitudes, respectively. The experimental parameters were configured as

h_{\max}

= 200;

h_{\min}

= 100.

Smoothness Cost:

The smoothness cost evaluates the turning and ascent angles of the path. The turning angle

ϕ_{i j}

and the ascent angle

ψ_{i j}

are calculated using the following formulas:

Turning Angle:

\begin{array}{l} ϕ_{i j} = \arctan (\frac{‖\vec{P_{l J}^{'} P_{l, j + 1}^{'}} \times \vec{P_{l, j + 1}^{'} P_{l, j + 2}^{'}}‖}{‖\vec{P_{l J}^{'} P_{l, j + 1}^{'}}‖ \cdot ‖\vec{P_{l, j + 1}^{'} P_{l, j + 2}^{'}}‖}) \end{array}

(6)

Ascent Angle:

ψ_{i j} = \arctan (\frac{z_{i, j + 1} - z_{i j}}{‖\vec{P_{l j}^{'} P_{l, j + 1}^{'}}‖})

(7)

The total smoothness cost can be calculated using the following formula:

F_{4} (X_{i}) = a_{1} \sum_{j = 1}^{n - 2} ϕ_{i j} + a_{2} \sum_{j = 1}^{n - 1} | ψ_{i j} - ψ_{i, j - 1} |

(8)

a_{1}

and

a_{2}

are the penalty coefficients for the horizontal turning angle and the vertical pitch angle, respectively. The experimental parameters were set as

a_{1}

= 1;

a_{2}

= 1.

3. STASA

The state transition simulated annealing algorithm (STASA) integrates discrete and continuous state transition operators derived from the state transition algorithm (STA), thereby improving its search capabilities. In both STA and STASA, the solution to an optimization problem is represented as a “state”, and the process of refining the solution is analogous to a “state transition”. This refinement occurs through local and global searches using update operators, followed by the application of a criterion to determine whether the solution has improved.

The general framework for generating candidate solutions using the state transition algorithm is as follows:

\{\begin{array}{l} x_{k + 1} = A_{k} x_{k} + B_{k} u_{k} \\ y_{k + 1} = f (x_{k + 1}) \end{array}

(9)

where

x_{k}, x_{k + 1}

represent the current and the new generation solution, respectively,

u_{k}

is a function that relates the current state to historical states, and

A_{k}

and

B_{k}

are matrices for random transformations.

f (\cdot)

denotes the fitness function for the problem being solved, and

y_{k}

represents the fitness value of

x_{k}

.

3.1. State Transition Operators

In continuous optimization problems, STASA employs four types of state transition operators: rotation, translation, scaling, and axis transformation. Each operator plays a distinct role in improving the search process.

(a): Rotation Operator

x_{k + 1} {= x}_{k} + ε_{α} \frac{1}{n ‖ x_{k} ‖_{2}} R_{r} x_{k}

(10)

where

ε_{α}

is a positive constant known as the rotation factor, and

R_{r} \in R^{n \times n}

is an n-dimensional random matrix with elements uniformly distributed in the range of [−1, 1]. Additionally,

{‖\cdot‖}_{2}

denotes the Euclidean norm (L2 norm). The rotation operator allows the generated candidate solutions to fall within a hypersphere of radius

ε_{α}

.

(b): Translation Operator

x_{k + 1} = x_{k} + ε_{β} R_{t} \frac{x_{k} - x_{k - 1}}{{‖x_{k} - x_{k - 1}‖}_{2}}

(11)

where

ε_{β}

is a positive constant known as the scaling factor, and

R_{t}

is a random variable defined within the range of [0, 1]. The translation operator starts from the initial point

x_{k}

and searches in the direction from point

x_{k - 1}

to point

x_{k}

, with a maximum search length of

ε_{β}

. This operator possesses local search capabilities.

(c): Scaling Operator

x_{k + 1} {= x}_{k} + ε_{γ} R_{e} x_{k}

(12)

where

ε_{γ}

is a positive constant known as the scaling factor, and

R_{e} \in R^{n \times n}

is an n-dimensional random diagonal matrix following a Gaussian distribution with a mean of 0 and a variance of 1. The scaling operator has global search capabilities.

(d): Axis Transformation Operator

x_{k + 1} {= x}_{k} + ε_{δ} R_{α} x_{k}

(13)

where

ε_{δ}

is a positive constant known as the axis transformation factor, and

R_{a} \in R^{n \times n}

is an n-dimensional random diagonal matrix following a Gaussian distribution in the range [0, 1], with only one non-zero element. The axis transformation operator enhances the search capability in a single dimension.

3.2. Update Strategy of the STASA

STASA does not rely on a greedy update criterion. Instead, it uses the Metropolis criterion, similar to the simulated annealing algorithm, to refine solutions. The update process is as follows:

x_{k} = \{\begin{cases} x_{k + 1}, i f f (x_{k + 1}) < f (x_{k}) \\ x_{k + 1}, i f f (x_{k + 1}) \geq f (x_{k}) a n d \\ \exp (- ((f (x_{k + 1}) - f (x_{k})) / T_{k}) \geq η \\ x_{k}, i f f (x_{k + 1}) \geq f (x_{k}) a n d \\ \exp (- ((f (x_{k + 1}) - f (x_{k})) / T_{k}) \leq η \end{cases}

(14)

where η is a random number in the range [0, 1]. When the fitness value of the next-generation solution is less than that of the current solution, the new solution is accepted. When the next-generation solution is not better than the current solution, the simulated annealing temperature is calculated using formula:

\exp (- ((f (x_{K + 1}) - f (x_{k})) / T_{k}))

. If the temperature is greater than η, the new solution is accepted; otherwise, the current solution is retained.

4. DRSTASA

4.1. Population Initialization Based on Latin Hypercube Sampling

In intelligent algorithms, the initialization of the population plays a crucial role in determining the performance and convergence speed of the algorithm. A well-initialized population ensures diversity, which facilitates a more efficient exploration of the search space and accelerates convergence towards the global optimal solution. As a result, this improves the algorithm’s stability and efficiency. Traditional approaches, such as STA and STASA, typically rely on random initialization functions. However, this randomness can hinder the uniform distribution of solutions within the search space, ultimately reducing search efficiency.

To overcome this issue, the Latin hypercube sampling (LHS) method [33] is employed. LHS divides the multidimensional parameter space into non-overlapping intervals, randomly selecting samples from each interval to ensure uniform distribution and better coverage of the parameter space. The key steps of LHS involve determining the number of samples, dividing the parameter range for each dimension into NNN intervals, selecting values from each interval for every dimension, mapping these values to the desired distribution, and shuffling the order to maintain randomness.

Figure 2 illustrates a comparison between the population distributions generated using random initialization and Latin hypercube sampling (LHS) for two- and three-dimensional populations. The comparison demonstrates that unlike random initialization, LHS uniformly partitions each dimension and independently selects sample points from each partition, ensuring an even distribution of points throughout the sample space while maintaining uniqueness across all dimensions. This approach enhances initial population coverage, effectively mitigates the “curse of dimensionality” commonly encountered in high-dimensional spaces with conventional methods, and increases initial population diversity. As a result, it significantly improves the algorithm’s global exploration capability within the solution space.

4.2. Disruption Operator

Inspired by the gravitational destruction phenomenon observed in astrophysics, the interference operator (DO) was first introduced in the gravitational search algorithm (GSA) [34,35]. Its main objective is to improve exploration and exploitation capabilities that will prevent premature convergence and maintain population diversity. Over time, DO has been widely used in heuristic algorithms, such as elite particle swarm optimization algorithms [36].

The DO simulates the disruption of smaller-mass particles as they approach a larger-mass object. Here, the current optimal solution is treated as the large-mass object, and the remaining solutions are considered smaller-mass particles. When specific conditions are met, disruptive perturbations are applied to these solutions. Formula (15) describes the activation of the DO, which occurs when the Euclidean distance between neighboring solutions exceeds a predefined threshold C. The distance between a candidate solution and the current optimal solution is used to determine the degree of disruption.

\frac{R_{i, j}}{R_{i, best}} \leq C

(15)

The algorithm activates the disruption operator to update the current solution only when the condition in Formula (15) is satisfied, where

R_{i, j} = ‖X_{i} - X_{j}‖

represents the Euclidean distance between neighboring candidate solutions i and j, and

R_{i, b e s t} = ‖X_{i} - X_{b e s t}‖

is the Euclidean distance between the particle and the current optimal solution. C is a preset threshold.

To improve the efficacy of the DO, the threshold C is dynamically adjusted throughout the algorithm’s iterations. A larger initial C value facilitates broad exploration early in the process, while a gradually decreasing C encourages convergence during the later stages. Formula (16) details this dynamic threshold adjustment.

C = C_{0} * (1 - i t e r / T)

(16)

C_{0}

is the manually defined initial threshold, typically set between 1 and 3. Iter represents the current iteration count, and T denotes the total number of iterations set for the algorithm. It is important to note that, aside from the current optimal solution, all other solutions will be evaluated according to Formula (16). Only the solutions that meet the criteria of this formula will activate the disruption operator, which is defined as follows:

D = \{\begin{array}{l} r a n d (- \frac{R_{i, j}}{2}, \frac{R_{i, j}}{2}) i f R_{i, b e s t} \geq 1 \\ R_{i, j} + (r a n d (- \frac{R_{i, j}}{2}, \frac{R_{i, j}}{2})) o t h e r w i s e \end{array}

(17)

Here,

r a n d (- \frac{R_{i, j}}{2}, \frac{R_{i, j}}{2})

is a randomly distributed number, with its range defined based on the Euclidean distance between neighboring candidate solutions i and j. When

R_{i, b e s t} \geq 1

, it indicates that the solution is far from the optimal solution, and the disruption operator explores a broader range. In other cases, when the distance is closer, the disruption operator conducts a more localized search around the current solution.

Finally, for solutions that meet the conditions, a disruptive perturbation is applied, and the solution is updated according to the following formula:

x_{i, j}^{t + 1} = \frac{t}{T} x_{i, j}^{t} + (1 - \frac{t}{T}) x_{i, j}^{t} * D

(18)

In Formula (18),

x_{i, j}^{t + 1}

represents the solution after applying the disruptive perturbation. The first part of the formula retains some information from the current solution, while the second part incorporates the influence of the disruption operator. The impact of the disruption operator gradually diminishes as the algorithm progresses, allowing for broader exploration of the search space in the early stages to avoid local optima, and accelerating convergence in the later stages.

In STASA, the disruption operator is executed after the application of the four update operators. This sequence allows the algorithm to conduct extensive exploration during the early stages and prevent falling into local optima in later stages, ultimately increasing the chances of finding the global optimal solution.

4.3. Reverse Learning Strategy

To enhance optimization performance and accelerate convergence, a dynamic reverse learning strategy is introduced. Reverse learning compares the current solution

X = [x_{1}, x_{2} \dots, x_{D}]

with its reverse point

\bar{X} = [\bar{x_{1}}, \bar{x_{2}} \dots \bar{x_{D}}]

, selecting the better option to improve the algorithm’s search capability and exploration range [37]. Here,

x \in [u b, l b]

,

u b

and

l b

represent the upper and lower bounds of the variables, respectively. The formula for calculating the reverse point is as follows:

\bar{x_{i, j}} = l b + u b - x_{i, j}

(19)

Reverse learning expands the search space of the algorithm, improving both convergence accuracy and speed. To enhance flexibility, a dynamic reverse learning strategy is employed, where the upper and lower bounds of the reverse point change with each iteration. The formula for calculating the dynamic reverse point is as follows:

\bar{{x_{ι, J}}^{t}} = r_{5} ({\bar{a}}_{ι} (t) + {\bar{b}}_{ι} (t) - {x_{i, j}}^{t}

(20)

In the equation,

x_{i, j} t

represents the position of the j-th solution in the i-th dimension during the t-th iteration.

{\bar{a}}_{t} (t)

and

{\bar{b}}_{ι} (t)

are the minimum and maximum values of the current population in the i-th dimension, respectively, and

r_{5}

is a random number in the range [0, 1]. The decision to perform reverse learning is made by comparing with the control parameter

p

, and the specific form is as follows:

x_{i, j}^{t + 1} = (1 - r_{6}) x_{i, j} {}^{t}+ r_{6} \bar{{x_{i, j}}^{t}} i f r a n d \geq p

(21)

where t represents the number of iterations,

x_{i, j} t

is the current solution, and

\bar{x_{i, j} t}

is the corresponding reverse solution. Both

r_{6}

and rand are random numbers in the range [0, 1], and

x_{i, j}^{t + 1}

is the population in the t+1th iteration. In the subsequent experiments, the control parameter

p

is set to 0.5. The opposition-based learning strategy activates when meeting predefined stochastic conditions, utilizing opposition solutions to expand search space diversity and balance exploration versus exploitation capabilities. Through fitness comparison between original and opposition solutions, this approach accelerates convergence while enhancing solution quality. Moreover, the opposition mechanism assists in escaping premature stagnation, thereby markedly improving algorithm robustness and optimization efficiency.

4.4. Basic Process of DRSTASA

The overall structure and pseudocode for DRSTASA are presented in Figure 3 and Algorithm 1. The algorithm begins by initializing the population using the Latin hypercube sampling method and evaluating the initial solutions. The best population members and optimal objective values are selected. During each iteration, four transformation operators are applied first to update the population, followed by the Metropolis criterion from simulated annealing to determine whether to replace the current solution. After assessing the distance between current solutions, the disruption operator is applied as needed, and the reverse learning strategy is probabilistically executed to further refine the search process.

Algorithm 1: The Proposed Algorithm DRSTASA

1: Set the initial parameters
2: LHS generate the initial population u, Evaluate the fitness value: f, the current optimal solution: Best
3: while the set stop temperature value is not reached
4: Update the x_k through four transformation operators by Equations (2)–(5) and obtain x_k+1.
5: Calculate the fitness value of the current solution and new solution: f(x_k), f(x_x+1).
6: Update Best and fBest with Metropolis Criteria by Equation (6).
7: Calculate threshold C using Equation (16).
8: Judging by Equation (15), the condition is updated using Equation (17).
9: if rand > p
Use Equation (21) to update.
10: T = α*T, k = k + 1 and return to step 4.

5. Experimental Testing and Analysis

To evaluate the performance of DRSTASA, a total of 23 standard benchmark functions were selected for experimentation. The results were compared with those of nine other recent algorithms, including STA, STASA, AOA, SCA, AHA, FDA, AVOA, GWO, and the improved ASTASA, to analyze the testing performance of each algorithm. The experiments were conducted on a standard PC running Windows 10 64-bit, equipped with a 12th Gen Intel^® Core™ i5-12490F processor operating at 3.00 GHz and 16 GB of RAM. The implementation was based on MATLAB 2020a. For all algorithms, the population size and search intensity were set to 50, with parameter configurations either consistent with the original publications or based on classical settings, as detailed in Table 1.

5.1. Benchmark Test Functions

In this section, 23 fundamental benchmark functions are used to evaluate DRSTASA and eight comparison algorithms. The test functions cover both unimodal and multimodal problems. Specifically,

F_{1} ~ F_{7}

functions are unimodal, and

F_{8} ~ F_{23}

are bimodal. The dimensionality of the functions is variable; for these tests, it is set to 30. Detailed definitions can be found in Table 2 and Table 3.

Each algorithm was independently executed 30 times, during which the optimal values, mean values, and standard deviations (std) were recorded. The optimal values of each algorithm were compared: if an algorithm outperformed the comparison algorithm, it was assigned a score of 1; if the results were equal, it received a score of 0; and if it underperformed, it was assigned a score of −1. In cases where the optimal values were identical, the mean values and standard deviations were sequentially compared. Ultimately, the total scores were tallied for each algorithm across all test functions, clearly demonstrating the performance of DRSTASA in relation to other algorithms.

The results of DRSTASA, compared to nine other algorithms, are presented in Table 4. Apart from its weaker performance on the multimodal functions F21, F22, and F23—where the mean values were suboptimal and stability was lacking—DRSTASA exhibited stable performance on the other test functions, with minimal differences between the minimum and mean values. In functions F6, F12, F13, F16, F21, F22, and F23, DRSTASA did not achieve the optimal value; however, it reached the theoretical optimal values in other test functions, demonstrating adequate solution accuracy.

Overall, DRSTASA shows advantages over other algorithms in most tests, with results surpassing those of recent algorithms. Specifically, when compared to STASA and ASTASA, which scored 13 and 11 points, respectively, it is evident that DRSTASA achieved superior results in the majority of test cases. Although DRSTASA successfully escaped local optima in the multimodal test functions F21, F22, and F23, there remains a significant gap when compared to more robust algorithms such as AHA, FDA, and AVOA. While DRSTASA performed exceptionally well in unimodal tests, it slightly underperformed in multimodal tests relative to these stronger algorithms.

5.1.1. Convergence Analysis

Due to space limitations, Figure 4 illustrates the convergence behavior of various algorithms on selected test functions, highlighting the best results from each algorithm. The blue curve represents the convergence trajectory of the DRSTASA algorithm, with the horizontal axis indicating the number of iterations and the vertical axis representing the optimal objective function value. In the F3 and F4 tests, although DRSTASA achieved the theoretical optimal value, the convergence speed was slower compared to other algorithms, especially in unimodal test functions where the AVOA algorithm demonstrated the fastest convergence. In the F7 and F9 tests, DRSTASA also reached the minimum objective function value; however, significant oscillations were observed during the early iterations. This suggests that the algorithm likely employed techniques such as simulated annealing and perturbation to escape local optima and search for the global optimum. In the F12 and F13 tests, while DRSTASA did not surpass the FDA algorithm in finding the optimal value, it still exhibited a faster convergence speed compared to the other algorithms.

5.1.2. Time Complexity Analysis

Time complexity reflects the efficiency of an algorithm, and as the problem size increases, it directly influences both time and resource costs. The time complexity of the proposed algorithm is primarily determined by the population size, the number of iterations, the dimensionality of the problem, and the time required to update the operators. Let N represent the population size, T denote the number of iterations, and D signify the problem dimensionality. Ignoring the time complexity of the operators, the overall time complexity of the algorithm is as follows:

O (D R S T A S A) = O (N \cdot D \cdot T)

(22)

In a single iteration, let the time complexity of evaluating the objective function for a single solution be

t_{1}

, and the time complexity of updating a solution using the four operators be

t_{2}

. Without incorporating any improvement strategies, the time complexity of the algorithm is as follows:

O (D R S T A S A) = O (N \cdot D \cdot T \cdot (t_{1} + t_{2}))

(23)

When introducing destruction operators and opposition-based learning strategies, let the time complexities of the destruction operator and opposition-based learning be denoted as

t_{3}

and

t_{4}

, respectively. Consequently, the time complexity of the algorithm incorporating these improvements becomes the following:

O (D R S T A S A) = O (N \cdot D \cdot T \cdot (t_{1} + t_{2} + t_{3} + t_{4}))

(24)

From the analysis, it can be concluded that the time complexity of DRSTASA does not increase by an order of magnitude compared to the original STASA. Additionally, the destruction operator and opposition-based learning strategies enhance the efficiency of the algorithm, resulting in faster convergence.

Figure 5 shows the average runtime of nine algorithms across 23 test functions. It is observed that the SCA, AOA, and AHA algorithms have the shortest runtimes, but they exhibit slightly lower accuracy compared to the others. AVOA and GWO fall into the second tier, characterized by higher time costs but with improved accuracy. STA, STASA, and DRSTASA are in the third tier, where DRSTASA has a slightly longer runtime than STASA but achieves significantly higher accuracy, making the additional time acceptable. The FDA requires the longest runtime but offers greater stability, albeit at a considerable time.

To evaluate the effectiveness and stability of DRSTASA, eight engineering design optimization problems were selected for testing. The DRSTASA, STASA, and social network search (SNS) algorithms were employed to solve each problem, with 30 independent runs and 2000 iterations per run. The performance of each algorithm was assessed by comparing the optimal values, mean values, and standard deviations. Table 5 provides details of the eight engineering problems, while Table 6 presents the comprehensive results of each algorithm.

From Table 6, it is clear that the performance of DRSTASA in the string design problem is not as strong as that of the other two algorithms; however, the algorithm excels in the remaining engineering challenges. In the cantilever beam, tubular column, and welded beam design problems, although DRSTASA achieves optimal values, the average performance and standard deviation still show a gap compared to the other algorithms, suggesting that further improvements in stability are needed. The comparison results indicate that DRSTASA demonstrates significant improvements over STASA in various engineering problems and surpasses the SNS algorithm, highlighting the potential of this approach for solving real-world engineering challenges.

5.2. Experimental Results

By considering path optimality, safety, and feasibility constraints, specific weights are assigned to each cost function to determine the overall objective function value for the problem.

F (X_{i}) = \sum_{k = 1}^{4} b_{k} F_{k} (X_{i})

(25)

The variables

b_{1} ~ b_{4}

represent the weight coefficients, while

F_{1} (X_{i})

to

F_{4} (X_{i})

denote the values of the objective function corresponding to the aforementioned constraints. In the experiment, the parameters were configured as

b_{1}

= 5,

b_{2}

= 1,

b_{3}

= 1, and

b_{4}

= 5, while the remaining parameters are given in Section 2.

In this study, the DRSTASA, STASA, SNS, and PSO algorithms were each executed 30 times within the hardware system environment previously described, using a population size of 500 and a maximum of 2000 iterations. Table 7 presents the optimal, worst, and average fitness values obtained from these 30 runs. Figure 6 shows the three-dimensional view, top view, and side view of the optimal path planning results achieved by the algorithms, while Figure 7 illustrates the fitness iteration curves.

From Table 7, it is clear that this algorithm demonstrates relatively high stability across multiple runs, with both the optimal and worst fitness values being lower than those achieved by other algorithms. This suggests a strong capability for escaping local optima. Figure 5 shows that the optimal path generated by the algorithm successfully avoids obstacles, ensuring a smooth transition from the starting point to the destination. Compared to the other three algorithms, it identifies a safer and faster flight route. Figure 6 illustrates the rapid convergence of the algorithm during iterations, allowing it to find the optimal solution efficiently.

6. Summary

This paper presents an enhancement to the STASA algorithm by introducing a state transition simulated annealing algorithm (DRSTASA) that incorporates the use of destruction perturbation operators and opposition-based learning strategies. During the initialization phase, the super-Latin hypercube sampling method is employed to ensure a more uniform distribution of initial solutions. Subsequently, the integration of destruction operators and opposition-based learning strategies helps to accelerate the convergence process and improve the algorithm’s capacity for escaping local optima. The effectiveness of these enhancements is validated through comparisons with eight other algorithms across 23 standard test functions and eight classical engineering design problems.

DRSTASA is further applied to the UAV path planning problem, demonstrating greater stability in the solutions compared to three other algorithms. The paths produced by DRSTASA are more feasible, safer, and more efficient, thus showcasing the practical value of the algorithm. However, the comparative results indicate that there is room for improvement in terms of reducing time costs for complex problems and enhancing stability in multimodal scenarios. Future research will focus on increasing the convergence speed of the algorithm and reducing computational time.

Author Contributions

Conceptualization, J.L. and X.H.; methodology, J.L.; software, J.L.; validation, F.L., J.W. and W.Z.; formal analysis, J.L.; investigation, F.L.; resources, J.W.; data curation, W.Z.; writing—original draft preparation, J.L.; writing—review and editing, X.H.; visualization, F.L.; supervision, J.W.; project administration, W.Z.; funding acquisition, X.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China Funding Project grant number (62176176).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author, Xiaoxia Han, upon reasonable request.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

Fang, Z.; Savkin, A.V. Strategies for Optimized UAV Surveillance in Various Tasks and Scenarios: A Review. Drones 2024, 8, 193. [Google Scholar] [CrossRef]
Torrero, L.; Seoli, L.; Molino, A.; Giordan, D.; Manconi, A.; Allasia, P.; Baldo, M. The Use of Micro-UAV to Monitor Active Landslide Scenarios. In Engineering Geology for Society and Territory—Volume 5; Lollino, G., Manconi, A., Guzzetti, F., Culshaw, M., Bobrowsky, P., Luino, F., Eds.; Springer: Cham, Switzerland, 2015. [Google Scholar] [CrossRef]
Yan, C.; Fu, L.; Zhang, J.; Wang, J. A Comprehensive Survey on UAV Communication Channel Modeling. IEEE Access 2019, 7, 107769–107792. [Google Scholar] [CrossRef]
Shakhatreh, H.; Sawalmeh, A.H.; Al-Fuqaha, A.; Dou, Z.; Almaita, E.; Khalil, I.; Othman, N.S.; Khreishah, A.; Guizani, M. Unmanned Aerial Vehicles (UAVs): A Survey on Civil Applications and Key Research Challenges. IEEE Access 2019, 7, 48572–48634. [Google Scholar] [CrossRef]
Hart, P.E.; Nilsson, N.J.; Raphael, B. A Formal Basis for the Heuristic Determination of Minimum Cost Paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
Korf, R.E. Depth-First Iterative-Deepening: An Optimal Admissible Tree Search. Artif. Intell. 1985, 27, 97–109. [Google Scholar] [CrossRef]
Cen, Y.; Song, C.; Xie, N.; Wang, L. Path planning method for mobile robot based on ant colony optimization algorithm. In Proceedings of the 2008 3rd IEEE Conference on Industrial Electronics and Applications, Singapore, 3–5 June 2008; pp. 298–301. [Google Scholar] [CrossRef]
Qin, Y.-Q.; Sun, D.-B.; Li, N.; Cen, Y.-G. Path planning for mobile robot using the particle swarm optimization with mutation operator. In Proceedings of the 2004 International Conference on Machine Learning and Cybernetics (IEEECat. No. 04EX826), Shanghai, China, 26–29 August 2004; Volume 4, pp. 2473–2478. [Google Scholar] [CrossRef]
Chen, J.; Ye, F.; Jiang, T. Path planning under obstacle-avoidance constraints based on ant colony optimization algorithm. In Proceedings of the 2017 IEEE 17th International Conference on Communication Technology (ICCT), Chengdu, China, 27–30 October 2017; pp. 1434–1438. [Google Scholar] [CrossRef]
Guo, L.; Zhao, C.; Li, J.; Yan, Q.; Li, W.; Chen, P. UAV path planning based on improved A * and artificial potential field algorithm. In Proceedings of the 2024 36th Chinese Control and Decision Conference (CCDC), Xi’an, China, 25–27 May 2024; pp. 5471–5476. [Google Scholar] [CrossRef]
Xiang, H.; Liu, X.; Song, X.; Zhou, W. UAV Path Planning Based on Enhanced PSO-GA. In Artificial Intelligence; Fang, L., Pei, J., Zhai, G., Wang, R., Eds.; CICAI 2023; Lecture Notes in Computer Science; Springer: Singapore, 2024; Volume 14474. [Google Scholar] [CrossRef]
Li, D.; Yin, W.; Wong, W.E.; Jian, M.; Chau, M. Quality-Oriented Hybrid Path Planning Based on A* and Q-Learning for Unmanned Aerial Vehicle. IEEE Access 2022, 10, 7664–7674. [Google Scholar] [CrossRef]
Li, B.; Qi, X.; Yu, B.; Liu, L. Trajectory Planning for UAV Based on Improved ACO Algorithm. IEEE Access 2020, 8, 2995–3006. [Google Scholar] [CrossRef]
Li, H.; Long, T.; Xu, G.; Wang, Y. Coupling-Degree-Based Heuristic Prioritized Planning Method for UAV Swarm Path Generation. In Proceedings of the 2019 Chinese Automation Congress (CAC), Hangzhou, China, 22–24 November 2019; pp. 3636–3641. [Google Scholar] [CrossRef]
Xie, R.; Meng, Z.; Wang, L.; Li, H.; Wang, K.; Wu, Z. Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments. IEEE Access 2021, 9, 24884–24900. [Google Scholar] [CrossRef]
Tan, L.; Zhang, Y.; Huo, J.; Song, S. UAV Path Planning Simulating Driver’s Visual Behavior with RRT algorithm. In Proceedings of the 2019 Chinese Automation Congress (CAC), Hangzhou, China, 22–24 November 2019; pp. 219–223. [Google Scholar] [CrossRef]
Wang, X.; Pan, J.-S.; Yang, Q.; Kong, L.; Snášel, V.; Chu, S.-C. Modified Mayfly Algorithm for UAV Path Planning. Drones 2022, 6, 134. [Google Scholar] [CrossRef]
Tian, J.; Wang, Y.; Yuan, D. An Unmanned Aerial Vehicle Path Planning Method Based on the Elastic Rope Algorithm. In Proceedings of the 2019 IEEE 10th International Conference on Mechanical and Aerospace Engineering (ICMAE), Brussels, Belgium, 22–25 July 2019; pp. 137–141. [Google Scholar] [CrossRef]
Cui, X.; Wang, Y.; Yang, S.; Liu, H.; Mou, C. UAV path planning method for data collection of fixed-point equipment in complex forest environment. Front. Neurorobot. 2022, 16, 1105177. [Google Scholar] [CrossRef]
Yu, X.; Li, C.; Zhou, J. A constrained differential evolution algorithm to solve UAV path planning in disaster scenarios. Knowl.-Based Syst. 2020, 204, 106209. [Google Scholar] [CrossRef]
Zhang, X.; Xia, S.; Li, X. Quantum Behavior-Based Enhanced Fruit Fly Optimization Algorithm with Application to UAV Path Planning. Int. J. Comput. Intell. Syst. 2020, 13, 1315–1331. [Google Scholar] [CrossRef]
Qu, C.; Gai, W.; Zhang, J.; Zhong, M. A novel hybrid grey wolf optimizer algorithm for unmanned aerial vehicle (UAV) path planning. Knowl.-Based Syst. 2020, 194, 105530. [Google Scholar] [CrossRef]
Chen, Y.; Yu, Q.; Han, D.; Jiang, H. UAV path planning: Integration of grey wolf algorithm and artificial potential field. Concurr. Comput. Pract. Exp. 2024, 36, e8120. [Google Scholar] [CrossRef]
Han, X.; Dong, Y.; Yue, L.; Xu, Q. State Transition Simulated Annealing Algorithm for Discrete-Continuous Optimization Problems. IEEE Access 2019, 7, 44391–44403. [Google Scholar] [CrossRef]
Zhou, X.; Yang, C.; Gui, W. State Transition Algorithm. J. Ind. Manag. Optim. 2012, 8, 1039–1056. [Google Scholar] [CrossRef]
Aarts, E.H.L. Simulated Annealing: Theory and Applications; Springer Nature: Dordrecht, The Netherlands, 1987. [Google Scholar]
Chu, J.; Dong, Y.; Han, X.; Xie, J.; Xu, X.; Xie, G. Short-term prediction of urban PM2. 5 based on a hybrid modified variational mode decomposition and support vector regression model. Environ. Sci. Pollut. Res. 2021, 28, 56–72. [Google Scholar] [CrossRef]
Shen, Y.; Dong, Y.; Han, X.; Wu, J.; Xue, K.; Jin, M.; Xie, G.; Xu, X. Prediction model for methanation reaction conditions based on a state transition simulated annealing algorithm optimized extreme learning machine. Int. J. Hydrog. Energy 2023, 48, 24560–24573. [Google Scholar] [CrossRef]
Alhijawi, B.; Awajan, A. Genetic algorithms: Theory, genetic operators, solutions, and applications. Evol. Intel. 2024, 17, 1245–1256. [Google Scholar] [CrossRef]
Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61, ISSN 0965-9978. [Google Scholar] [CrossRef]
Phung, M.D.; Ha, Q.P. Safety-enhanced UAV path planning with spherical vector-based particle swarm optimization. Appl. Soft Comput. 2021, 107, 107376. [Google Scholar] [CrossRef]
Dige, N.; Diwekar, U. Efficient sampling algorithm for large-scale optimization under uncertainty problems. Comput. Chem. Eng. 2018, 115, 431–454. [Google Scholar] [CrossRef]
Rashedi, E.; Nezamabadi-pour, H.; Saryazdi, S. GSA: A Gravitational Search Algorithm. Inf. Sci. 2009, 179, 2232–2248. [Google Scholar] [CrossRef]
Sarafrazi, S.; Nezamabadi-pour, H.; Saryazdi, S. Disruption: A new operator in gravitational search algorithm. Sci. Iran. 2011, 18, 539–548. [Google Scholar] [CrossRef]
Liu, H.; Ding, G.; Wang, B. Bare-bones particle swarm optimization with disruption operator. Appl. Math. Comput. 2014, 238, 106–122. [Google Scholar] [CrossRef]
Shekhawat, S.; Saxena, A. Development and applications of an intelligent crow search algorithm based on opposition based learning. ISA Trans. 2020, 99, 210–230. [Google Scholar] [CrossRef]
Gupta, S.; Deep, K. A hybrid self-adaptive sine cosine algorithm with opposition based learning. Expert Syst. Appl. 2019, 119, 210–230. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of obstacles.

Figure 2. Initialize population comparisons.

Figure 3. DRSTASA algorithm flowchart.

Figure 4. Convergence curves for each algorithm.

Figure 5. Algorithmic average time.

Figure 6. Optimal path planning diagram.

Figure 7. Fitness iteration curve.

Table 1. Compare algorithm parameter settings.

Algorithm	Name and Year of Publication	Parameter Settings
STA	State Transition Algorithm/2012	SE = 50, T = 1010, beta = 1, gamma = 1, delta = 1;
STASA	State Transition Simulated Annealing Algorithm/2019	SE = 50, T = 1010, beta = 1, gamma = 1, delta = 1, a = 0.93;
AOA	Arithmetic Optimization Algorithm/2021	Max_iteration = 2000, PopSize = 50;
SCA	Sine Cosine Algorithm/2016	Pop_size = 50, Max_iter = 2000, a = 2;
AHA	Artificial Hummingbird Algorithm/2021	Max_iteration = 2000, Pop_size = 50;
FDA	Flow Direction Algorithm/2021	alpha = 50, beta = 1, Max_iteration = 2000;
AVOA	African Vultures Optimization Algorithm/2021	Pop_size = 50, Max_iter = 2000, p1 = 0.6, p2 = 0.4, p3 = 0.6, alpha = 0.8, betha = 0.2, gamma = 2.5;
GWO	Grey Wolf Optimizer Algorithm/2014	SearchAgents_no = 50, Max_iteration = 2000; $a_{m a x} = 2 {, a}_{m i n} = 0$ ;
ASTASA	Adaptive State Transition Simulated Annealing Algorithm/2023	SE = 50,Tp = 10, a1 = a2 = 0.5, T = 1010, a = 0.93, Ω = {0, 0.5, 0.1, 0.1, 1 × 10⁻³, 1 × 10⁻⁶, 1 × 10⁻⁹};

Table 2. Unimodal benchmarking function definition.

Function	Value Range	Best
$f_{1} = \sum_{i = 1}^{n} x_{i}^{2}$	[−100, 100]	0
$f_{2} = \sum_{i = 1}^{n} \| x_{i} \| + \prod_{i = 1}^{n} \| x_{i} \|$	[−10, 10]	0
$f_{3} (x) = \sum_{i = 1}^{n} {(\sum_{j = 1}^{i} x_{j})}^{2}$	[−100, 100]	0
$f_{4} (x) = \max_{i} \{\begin{matrix} \|x_{i}\|, 1 ⩽ i ⩽ n \end{matrix}\}$	[−100, 100]	0
$f_{5} (x) = \sum_{i = 1}^{n - 1} [100 {(x_{i + 1} - x_{i}^{2})}^{2} + {(x_{i} - 1)}^{2}]$	[−30, 30]	0
$f_{6} (x) = \sum_{i = 1}^{n} {([x_{i} + 0.5])}^{2}$	[−100, 100]	0
$f_{7} (x) = \sum_{i = 1}^{n} i x_{i}^{4} + rand [0, 1)$	[−1.28, 1.28]	0

Table 3. Multimodal benchmarking function definition.

Function	Dimension	Value Range	Best
$f_{8} (x) = \sum_{i = 1}^{n} - x_{i} \sin (\sqrt{\| x_{i} \|})$	d	[−500, 500]	$- 418.9829 \times d$
$f_{9} (x) = \sum_{i = 1}^{n} [x_{i}^{2} - 10 \cos (2 π x_{i}) + 10]$	d	[−5.12, 5.12]	0
$f_{10} (x) = - 20 \exp (- 0.2 \sqrt{\frac{1}{n} \sum_{i = 1}^{n} x_{i}^{2}}) - \exp (\frac{1}{n} \sum_{i = 1}^{n} \cos (2 π x_{i})) + 20 + e^{\leftarrow}$	d	[−32, 32]	0
$f_{11} (x) = \frac{1}{4000} \sum_{i = 1}^{n} x_{i}^{2} - \prod_{i = 1}^{n} \cos (\frac{x_{i}}{\sqrt{i}}) + 1$	d	[−600, 600]	0
$f_{12} (x) = \frac{π}{n} \{10 \sin (π y_{1}) + \sum_{i = 1}^{n - 1} {(y_{i} - 1)}^{2} [1 + 10 \sin^{2} (π y_{i + 1})] + {(y_{n} - 1)}^{2}\} + \sum_{i = 1}^{n} u (x_{i}, 10, 100, 4) y_{i} = 1 + \frac{x_{i} + 1}{4}$ $u (x_{i}, a, k, m) = \{\begin{array}{l} k {(x_{i} - a)}^{m} & x_{i} > a \\ 0 & - a < x_{i} < a \\ k {(- x_{i} - a)}^{m} & x_{i} < - a \end{array}$	d	[−50, 50]	0
$\begin{array}{l} f_{13} (x) = 0.1 \{\sin^{2} (3 π x_{1}) + \sum_{i = 1}^{n} {(x_{i} - 1)}^{2} [1 + \sin^{2} (3 π x_{i} + 1)] + \\ {(x_{n} - 1)}^{2} [1 + \sin^{2} (2 π x_{n})]\} + \sum_{i = 1}^{n} u (x_{i}, 5, 100, 4) \end{array}$	d	[−50, 50]	0
$f_{14} (x) = {(\frac{1}{500} + \sum_{j = 1}^{25} \frac{1}{j + \sum_{i = 1}^{2} {(x_{i} - a_{i j})}^{6}})}^{- 1}$	2	[−65.53, 65.53]	1
$f_{15} = \sum_{i = 1}^{11} {[a_{i} - \frac{x_{1} (b_{i}^{2} + b_{i} x_{2})}{b_{i}^{2} + b_{i} x_{3} + x_{4}}]}^{2}$	4	[−5, 5]	0.00030
$f_{16} (x) = 4 x_{1}^{2} - 2.1 x_{1}^{4} + x_{1}^{6} / 3 + x_{1} x_{2} - 4 x_{2}^{2} + 4 x_{2}^{4}$	2	[−5, 5]	−1.0316
$f_{17} (x) = {(x_{2} - \frac{5.1}{4 π^{2}} x_{1}^{2} + \frac{5}{π} x_{1} - 6)}^{2} + 10 (1 - \frac{1}{8 π}) \cos x_{1} + 10$	2	[−5, 5]	0.398
$\begin{array}{l} f_{18} (x) = [1 + {(x_{1} + x_{2} + 1)}^{2} (19 - 14 x_{1} + 3 x_{1}^{2} - 14 x_{2} + 6 x_{1} x_{2} + 3 x_{2}^{2})] \\ \times [30 + {(2 x_{1} - 3 x_{2})}^{2} \times (18 - 32 x_{1} + 12 x_{1}^{2} + 48 x_{2} - 36 x_{1} x_{2} + 27 x_{2}^{2})] \end{array}$	2	[−2, 2]	3
$f_{19} (x) = - \sum_{i = 1}^{4} c_{i} \exp (- \sum_{j = 1}^{3} a_{i j} {(x_{j} - p_{i j})}^{2})$	3	[1, 3]	−3.86
$f_{20} (x) = - \sum_{i = 1}^{4} c_{i} \exp (- \sum_{j = 1}^{6} a_{i j} {(x_{j} - p_{i j})}^{2})$	6	[0, 1]	−3.32
$f_{21} = - \sum_{i = 1}^{5} {[(X - a_{i}) {(X - a_{i})}^{T} + c_{i}]}^{- 1}$	4	[0, 10]	−10.1532
$f_{22} = - \sum_{i = 1}^{7} {[(X - a_{i}) {(X - a_{i})}^{T} + c_{i}]}^{- 1}$	4	[0, 10]	−10.4028
$f_{23} = - \sum_{i = 1}^{10} {[(X - a_{i}) {(X - a_{i})}^{T} + c_{i}]}^{- 1}$	4	[0, 10]	−10.5363

Table 4. Test results of nine algorithms on benchmark functions (Among them, the values in bold are the optimal values).

		DSTASA	STASA	SCA	AOA	STA	AHA	FDA	AVOA	GWO	ASTASA
F1	min	0	0	2.23826 × 10⁻⁵	0	0	0	2.49859 × 10⁻²⁷	0	6.545097 × 10⁻¹²⁷	0
	mean	0	0	2.22950 × 10⁻⁵	0	0	0	8.20562 × 10⁻²⁵	0	1.11579 × 10⁻¹²¹	0
	std	0	0	4.41827 × 10⁻⁵	0	0	0	1.69896 × 10⁻²⁴	0	2.81725 × 10⁻¹²¹	0
	R		0	1	0	0	0	1	0	1	0
F2	min	0	0	1.36292 × 10⁻⁸	0	0	0	5.12704 × 10⁻²⁰	0	3.19172 × 10⁻⁷²	2.73 × 10⁻²⁹⁸
	mean	0	0	2.18929 × 10⁻³	0	0	9.75350 × 10⁻³⁰⁵	5.80501 × 10⁻¹⁹	0	9.43916 × 10⁻⁷¹	5.78 × 10⁻²⁸⁵
	std	0	0	5.07143 × 10⁻³	0	0	0	9.87080 × 10⁻¹⁹	0	1.50523 × 10⁻⁷⁰	0
	R		0	1	0	0	1	1	0	1	1
F3	min	0	0	5.34185 × 10⁻²	0	1.67556 × 10⁻¹⁶⁵	0	8.44382 × 10⁻⁵	0	1.66451 × 10⁻⁴⁰	2.45 × 10⁻³⁹
	mean	0	6.39864 × 10⁻¹⁰	1.64976 × 10²	0	2.11171 × 10⁻¹⁰	0	1.19773 × 10⁻³	0	2.20731 × 10⁻³²	8.25 × 10⁻¹³
	std	0	2.32239 × 10⁻⁹	4.05739 × 10²	0	4.67669 × 10⁻¹⁰	0	1.54680 × 10⁻³	0	8.16378 × 10⁻³²	2.68 × 10⁻¹²
	R		1	1	0	1	0	1	0	1	1
F4	min	0	1.40860 × 10⁻³²⁰	0.012679128	0	2.75719 × 10⁻²⁹⁴	2.76011 × 10⁻²⁹¹	5.858143094	0	9.51328 × 10⁻³³	5.47 × 10⁻¹²⁵
	mean	0	2.78906 × 10⁻³⁰⁵	3.541937518	4.11638 × 10⁻³	3.20126 × 10⁻²⁵²	3.20126 × 10⁻²⁵²	11.58330534	0	9.71144 × 10⁻³⁰	9.77 × 10⁻¹⁰
	std	0	0	5.806930164	1.25609 × 10⁻³	0	0	2.472639669	0	2.56898 × 10⁻²⁹	3.66 × 10⁻⁹
	R	-	1	1	1	1	1	1	0	1	1
F5	min	1.68646 × 10⁻⁸	21.08270450	39.98909064	25.80156310	20.72509086	23.25891369	9.56426 × 10⁻⁵	4.08905 × 10⁻⁸	25.19991104	19.22748051
	mean	17.8889	21.66482354	415349.1403	27.18829167	23.30892552	23.76828972	13.80940547	7.48172 × 10⁻⁷	26.33910544	19.60826434
	std	8.1378	0.273833566	7.79634 × 10⁵	0.684053143	9.635097572	0.307795962	16.79242368	8.34461 × 10⁻⁷	0.724028375	0.185495286
	R	-	1	1	1	1	1	1	1	1	1
F6	min	6.39082 × 10⁻¹⁴	6.56884 × 10⁻¹⁴	5.261395082	1.308848294	4.734708 × 10⁻¹⁴	2.29367 × 10⁻¹⁰	4.61674 × 10⁻²⁸	3.31762 × 10⁻¹²	0.248994178	8.02 × 10⁻²²
	mean	1.13264 × 10⁻¹³	1.10163 × 10⁻¹³	149.8606905	1.815537219	9.077552 × 10⁻¹⁴	2.91864 × 10⁻⁸	3.53723 × 10⁻²⁵	2.03156 × 10⁻¹¹	0.707049993	4.52 × 10⁻¹⁹
	std	3.30438 × 10⁻¹⁴	3.42395 × 10⁻¹⁴	595.3156887	0.243828926	3.397547 × 10⁻¹⁴	6.42005 × 10⁻⁸	1.20000 × 10⁻²⁴	1.36005 × 10⁻¹¹	0.352646793	9.58 × 10⁻¹⁹
	R	-	1	1	1	−1	1	−1	1	1	−1
F7	min	9.09943 × 10⁻⁸	5.78305 × 10⁻⁴	0.040713560	3.95305 × 10⁻⁷	4.29126 × 10⁻⁵	2.52943 × 10⁻⁶	9.43934 × 10⁻³	1.50881 × 10⁻⁶	3.73270 × 10⁻⁵	0.000105144
	mean	1.73036 × 10⁻⁶	6.17465 × 10⁻⁴	0.801106982	5.62328 × 10⁻⁶	5.52640 × 10⁻⁴	2.64510 × 10⁻⁵	9.43934 × 10⁻³	3.52845 × 10⁻⁵	3.39785 × 10⁻⁴	0.010289159
	std	1.54743 × 10⁻⁶	6.61753 × 10⁻⁴	1.605338652	4.83192 × 10⁻⁶	4.07464 × 10⁻⁴	2.16516 × 10⁻⁵	1.01640 × 10⁻²	3.79440 × 10⁻⁵	1.85178 × 10⁻⁴	0.008127468
	R	-	1	1	1	1	1	1	1	1	1
F8	min	−12,569.4866	−12,569.4866	−5476.55146	−8318.39435	−12,569.4866	−12,569.4865	−10,316.1050	−12,569.4866	−7925.33057	−12,569.4866
	mean	−12,569.4866	−12,464.6926	−4321.69164	−7716.85829	−12,569.4866	−12,541.8054	−8984.9918	−12,538.5609	−6031.63602	−12,290.9614
	std	3.46119 × 10⁻¹²	197.5076886	297.4265754	315.7564352	6.47089 × 10⁻¹²	105.2953070	677.695949	80.25593103	709.6064865	291.5008075
	R	-	1	1	1	1	1	1	1	1	1
F9	min	0	0	6.728109115	0	0	0	27.85883344	0	0	0
	mean	0	0	82.76259996	0	0	0	51.67162761	0	0.1419434548	0
	std	0	0	51.50064571	0	0	0	14.98471084	0	0.7774563208	0
	R	-	0	1	0	0	0	1	0	1	0
F10	min	8.88178 × 10⁻¹⁶	8.88178 × 10⁻¹⁶	0.215951010	8.88178 × 10⁻¹⁶	4.440892 × 10⁻¹⁵	8.88178 × 10⁻¹⁶	2.120053361	8.88178 ×10⁻¹⁶	7.99360 × 10⁻¹⁵	8.88178 × 10⁻¹⁶
	mean	8.88178 × 10⁻¹⁶	8.88178 × 10⁻¹⁶	17.68353321	8.88178 × 10⁻¹⁶	4.440892 × 10⁻¹⁵	8.88178 × 10⁻¹⁶	3.522161581	8.88178 × 10⁻¹⁶	8.82257 × 10⁻¹⁵	8.88178 × 10⁻¹⁶
	std	0	0	5.866102058	0	0	0	1.081526236	0	2.01908 × 10⁻¹⁵	0
	R	-	0	1	0	1	0	1	0	1	0
F11	min	0	0	0.291675643	0	0	0	0	0	0	0
	mean	0	0	2.078403009	0.031828867	0	0	0.098300745	0	6.82138 × 10⁻⁴	0
	std	0	0	2.803796744	0.031945879	0	0	0.028645710	0	0.002679238	0
	R	-	0	1	1	0	0	1	0	1	0
F12	min	3.21421 × 10⁻¹⁵	1.47667 × 10⁻¹⁵	0.847785404	0.080714235	2.36383 × 10⁻¹⁵	1.88023 × 10⁻¹¹	6.24752 × 10⁻²⁴	2.99681 × 10⁻¹³	0.011988789	7.06 × 10⁻²⁴
	mean	5.81410 × 10⁻¹⁵	5.67626 × 10⁻¹⁵	3.49168 × 10⁷	0.123240492	5.75915 × 10⁻¹⁵	1.25750 × 10⁻⁹	0.411870057	1.38250 × 10⁻¹²	0.030024875	1.07 × 10⁻²⁰
	std	1.77067 × 10⁻¹⁵	8.06953 × 10⁻¹⁵	1.18813 × 10⁸	0.043268406	1.39051 × 10⁻¹⁵	4.33242 × 10⁻⁹	0.545587377	1.04906 × 10⁻¹²	0.011955839	2.70 × 10⁻²⁰
	R	-	-1	1	1	-1	1	-1	1	1	-1
F13	min	3.18192 × 10⁻¹⁴	3.67286 × 10⁻¹⁴	7.123647255	2.408009950	4.22109 × 10⁻¹⁴	3.76686 × 10⁻⁹	3.37417 × 10⁻²⁴	1.61800 × 10⁻¹¹	0.100047883	1.96 × 10⁻²²
	mean	7.94253 × 10⁻¹⁴	7.95419 × 10⁻¹⁴	5.48897 × 10⁷	2.696393591	8.33771 × 10⁻¹⁴	0.305832578	0.013540213	1.51531 × 10⁻¹⁰	0.445347041	0.045889875
	std	2.43030 × 10⁻¹⁴	1.46694 × 10⁻¹³	1.27678 × 10⁸	0.143791836	2.36018 × 10⁻¹⁴	0.235506620	0.028354695	1.14064 × 10⁻¹⁰	0.183978024	0.102558824
	R	-	1	1	1	1	1	−1	1	1	−1
F14	min	0.998003837	2.982105156	0.998003837	0.998003837	0.998003837	0.998003837	0.998003837	0.998003837	0.998003837	0.998003838
	mean	4.499754429	5.438523655	0.998037194	7.770896036	4.779597260	0.998003837	0.998003837	0.998003837	3.678543020	5.692655933
	std	5.440455117	4.165793216	0.000061096	4.433286361	4.522047715	0	0	1.93398 × 10⁻¹⁶	3.713426710	4.557993224
	R	-	1	−1	1	1	−1	−1	−1	−1	1
F15	min	0.000307485	0.000307485	0.000357134	0.000320376	0.000307485	0.000307485	0.0003.07485	0.000307486	0.000307486	0.000307486
	mean	0.000307485	0.000439713	0.000884466	0.005313377	0.000379456	0.000307485	5.51669 × 10⁻⁴	0.000307504	0.007661302	0.000514668
	std	3.62847 × 10⁻¹⁴	0.000315020	0.000308607	0.012872782	0.000237006	1.66935 × 10⁻¹³	4.11854 × 10⁻⁴	5.88603 × 10⁻⁸	0.009830023	0.00039831
	R	-	1	1	1	1	1	1	1	1	1
F16	min	−1.03162845	−1.03162845	−1.03162807	−1.03162845	−1.03162845	−1.03162845	−1.03162845	−1.03162845	−1.03162845	−1.03162845
	mean	−1.03162845	−1.03162845	−1.03160669	−1.03162842	−1.03162845	−1.03162845	−1.03162845	−1.03162845	−1.03162845	−1.03162845
	std	4.38309 × 10⁻¹⁶	6.15734 × 10⁻¹⁶	2.45274 × 10⁻⁵	2.16194 × 10⁻⁸	4.75518 × 10⁻¹⁶	0.397887357	6.77521 × 10⁻¹⁶	5.13342 × 10⁻¹⁶	1.12689 × 10⁻⁹	4.52 × 10⁻¹⁶
	R	-	1	−1	1	1	1	−1	1	1	1
F17	min	0.397887357	0.397887357	0.397895813	0.397887357	0.397887357	0.397887357	0.397887357	0.397887357	0.397887358	0.397887358
	mean	0.397887357	0.397887357	0.398317876	0.397887373	0.397887357	0.397887357	0.397887357	0.397887357	0.397887418	0.397887358
	std	0	0	9.39071 × 10⁻¹³	1.39785 × 10⁻⁸	0	0	0	0	7.21333 × 10⁻⁸	0
	R	-	0	1	1	0	0	0	0	1	1
F18	min	3	3	3	3	3	3	3	3	3	3
	mean	3	3	3	24.22626137	3	3	3	3	8.400002379	3
	std	1.90209 × 10⁻¹⁴	2.00653 × 10⁻¹⁴	1.40481 × 10⁻⁵	29.41664391	2.48436 × 10⁻¹⁴	1.27488× 10⁻¹⁵	2.01660 × 10⁻¹⁵	6.61248 × 10⁻⁸	20.55035868	4.95 × 10⁻¹⁶
	R	-	1	1	1	1	−1	−1	1	1	1
F19	min	−3.86278214	−3.86278214	−3.86151460	−3.86111628	−3.86278214	−3.86278214	−3.86278214	−3.86278214	−3.86278214	−3.86278215
	mean	−3.86278214	−3.86278214	−3.85507954	−3.85687242	−3.86278214	−3.86278214	−3.86278214	−3.86278214	−3.86184716	−3.86278215
	std	1.45290 × 10⁻¹⁴	3.78218 × 10⁻¹⁴	0.001985413	2.89580 × 10⁻³	5.20156 × 10⁻¹⁴	2.71008 × 10⁻¹⁵	2.71008 × 10⁻¹⁵	2.14885 × 10⁻¹⁵	0.002259703	2.31 × 10⁻¹⁵
	R	-	1	1	1	1	1	1	1	1	1
F20	min	−3.32199517	−3.32199517	−3.15647379	−3.25828015	−3.32199517	−3.32199517	−3.32199517	−3.32199517	−3.321994517	−3.32199517
	mean	−3.22688067	−3.22291757	−2.98017193	−3.17153577	−3.26254861	−3.32199517	−3.30614275	−3.28632723	−3.26180981	−3.24669620
	std	0.048370251	0.045066321	0.166303220	0.035538315	0.060462814	1.34243 × 10⁻¹⁵	0.041106809	0.055415085	0.068484869	0.058273385
	R		−1	1	1	1	−1	1	1	1	1
F21	min	−10.1531996	−5.05519772	−5.71194546	−8.30382836	−10.15319967	−10.1531996	−10.1531996	−10.1531996	−10.1531868	−10.1531997
	mean	−5.90486472	−5.05519772	−3.33330583	−4.75221370	−7.032031215	−9.90418836	−9.90418836	−10.1531996	−9.47487572	−8.04555502
	std	1.932392652	6.53716 × 10⁻¹⁵	1.649205881	1.248257971	2.6280332184	1.363891112	1.363891112	4.51078 × 10⁻¹⁵	1.758656268	2.659975707
	R	-	1	1	1	−1	−1	−1	−1	−1	1
F22	min	−10.4029405	−5.08767182	−5.86881454	−9.03871310	−10.40294056	−10.4029405	−10.4029405	−10.4029405	−10.4029243	−10.4029406
	mean	−6.32790119	−5.08767182	−3.47931164	−5.64706181	−8.915907821	−10.4029405	−9.12752888	−10.4029405	−10.4028323	−9.1468558
	std	2.286538610	4.49568 × 10⁻¹⁵	1.754731464	1.594763783	2.5417202149	1.58195 × 10⁻¹⁵	2.622221264	1.27754 × 10⁻¹⁵	7.26464 × 10⁻⁵	2.37780875
	R	-	1	1	1	−1	−1	−1	−1	−1	1
F23	min	−10.5364098	−5.12848078	−5.69960702	−8.60971414	−10.53640981	−10.5364098	−10.5364098	−10.5364098	−10.5364073	−10.5364098
	mean	−6.39033089	−5.30874508	−3.84238656	−5.16251028	−9.815352612	−10.5364098	−9.95408731	−10.5364098	−10.5363015	−9.1018401
	std	2.326399497	0.987348239	1.165562137	1.850734616	1.8697693093	1.89490 × 10⁻¹⁵	1.788022077	3.55271 × 10⁻¹⁵	5.50112 × 10⁻⁵	2.41893659
	R	-	1	1	1	−1	−1	−1	−1	−1	−1
$R_{总}$		-	13	19	18	8	5	4	7	15	11

Table 5. Eight classic engineering design problems.

Engineering Issues	Attribute
1. Compression spring design	3 variables, 4 constraints
2. I-beam design	4 variables, 2 constraints
3. Welded Beam Design	4 variables, 7 constraints
4. Cantilever design issues	5 variables, 1 constraint
5. String design issues	2 variables, 6 constraint
6. Three-bar truss design	2 variables, 3 constraint
7. Reducer design issues	7 variables, 11 constraint
8. Piston rod optimization	4 variables, 4 constraint

Table 6. Comparison of the three algorithms.

		DSTSA	STASA	SNS
CSD	min	0.012687807	0.012665254	0.012667240
	mean	0.012721315	0.012724139	0.012751065
	std	1.425476 × 10⁻⁵	3.686603 × 10⁻⁵	1.333570 × 10⁻⁴
	R	-	1	-1
I-BD	min	0.006625958	0.006625958	0.013074118
	mean	0.006625958	0.006625958	0.013074128
	std	2.148971 × 10⁻¹³	2.272516 × 10⁻¹³	3.415683 × 10⁻⁸
	R	-	0	1
WBD	min	1.556976274	1.557539494	1.724852322
	mean	1.5684358785	1.576662593	1.724884703
	std	0.0191236342	0.026083090	9.51866 × 10⁻⁵
	R	-	1	1
CD	Min	1.339956375	1.339956375	1.339959827
	mean	1.339956494	1.339956472	1.340040862
	std	9.837556 × 10⁻⁸	7.401418 × 10⁻⁸	1.221619 × 10⁻⁴
	R	-	1	1
SD	min	26.48636152	26.48636152	26.48636147
	mean	26.48636170	26.48636180	26.48636147
	std	1.808759 × 10⁻⁷	2.758648 × 10⁻⁷	7.226896 × 10⁻¹⁵
	R	-	1	-1
3-BT	min	263.89584341	263.89584347	263.89584345
	mean	263.89584439	263.89587123	263.89587848
	std	7.925477 × 10⁻⁷	4.292542 × 10⁻⁵	5.3584038 × 10⁻⁵
	R	-	1	-1
RD	min	2994.4245780	2994.4248740	2994.4246658
	mean	2994.4245119	2994.4424464	2994.4404264
	std	8.3096763 × 10⁻⁵	1.4695980 × 10⁻²	1.3590637 × 10⁻²
	R	-	1	-1
PRO	min	8.4126982290	8.4126983354	8.4126983231
	mean	8.4126983958	8.4126984148	56.130713531
	std	4.4586517 × 10⁻⁸	1.0172303 × 10⁻⁷	74.136541109
	R	-	1	1

Table 7. Algorithm running results.

Algorithm	Optimal Fitness	Worst Fitness	Average Fitness
DRSTASA	4673.5174	5195.8919	4737.651
STASA	4793.8431	5485.7753	4902.3325
SNS	4811.4688	5934.1597	5005.4432
PSO	4966.2653	5857.1326	5211.4123

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, J.; Han, X.; Liu, F.; Wu, J.; Zhang, W. UAV Path Planning Using a State Transition Simulated Annealing Algorithm Based on Integrated Destruction Operators and Backward Learning Strategies. Appl. Sci. 2025, 15, 6064. https://doi.org/10.3390/app15116064

AMA Style

Liu J, Han X, Liu F, Wu J, Zhang W. UAV Path Planning Using a State Transition Simulated Annealing Algorithm Based on Integrated Destruction Operators and Backward Learning Strategies. Applied Sciences. 2025; 15(11):6064. https://doi.org/10.3390/app15116064

Chicago/Turabian Style

Liu, Jianping, Xiaoxia Han, Fengyi Liu, Jinde Wu, and Wenjie Zhang. 2025. "UAV Path Planning Using a State Transition Simulated Annealing Algorithm Based on Integrated Destruction Operators and Backward Learning Strategies" Applied Sciences 15, no. 11: 6064. https://doi.org/10.3390/app15116064

APA Style

Liu, J., Han, X., Liu, F., Wu, J., & Zhang, W. (2025). UAV Path Planning Using a State Transition Simulated Annealing Algorithm Based on Integrated Destruction Operators and Backward Learning Strategies. Applied Sciences, 15(11), 6064. https://doi.org/10.3390/app15116064

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

UAV Path Planning Using a State Transition Simulated Annealing Algorithm Based on Integrated Destruction Operators and Backward Learning Strategies

Abstract

1. Introduction

2. UAV Three-Dimensional Path Planning

2.1. Path Optimality

2.2. Safety and Feasibility Constraints

3. STASA

3.1. State Transition Operators

3.2. Update Strategy of the STASA

4. DRSTASA

4.1. Population Initialization Based on Latin Hypercube Sampling

4.2. Disruption Operator

4.3. Reverse Learning Strategy

4.4. Basic Process of DRSTASA

5. Experimental Testing and Analysis

5.1. Benchmark Test Functions

5.1.1. Convergence Analysis

5.1.2. Time Complexity Analysis

5.2. Experimental Results

6. Summary

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI