Trajectory Planning of Unmanned Aerial Vehicles in Complex Environments Based on Intelligent Algorithm

Cheng, Zhekun; Yang, Jueying; Sun, Jinfeng; Zhao, Liangyu

doi:10.3390/drones9070468

Open AccessReview

Trajectory Planning of Unmanned Aerial Vehicles in Complex Environments Based on Intelligent Algorithm

¹

School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China

²

School of Materials Science and Engineering, Nanyang Technological University, Singapore 639798, Singapore

³

School of Medical Technology, Beijing Institute of Technology, Beijing 100081, China

⁴

Zhengzhou Research Institute, Beijing Institute of Technology, Zhengzhou 450000, China

⁵

National Key Laboratory of Land and Air Based Information Perception and Control, Beijing Institute of Technology, Beijing 100081, China

^*

Author to whom correspondence should be addressed.

Drones 2025, 9(7), 468; https://doi.org/10.3390/drones9070468

Submission received: 11 May 2025 / Revised: 16 June 2025 / Accepted: 27 June 2025 / Published: 1 July 2025

Download

Browse Figures

Versions Notes

Abstract

In recent years, effective trajectory planning has been developed to promote the extensive application of unmanned aerial vehicles (UAVs) in various domains. However, the actual operation of UAVs in complex environments presents significant challenges to their trajectory planning, particularly in maintaining task reliability and ensuring safety. To overcome these challenges, this review presents a comprehensive summary of various trajectory planning techniques currently applied to UAVs based on the emergence of intelligent algorithms, which enhance the adaptability and learning ability of UAVs and offer innovative solutions for their application in complex environments. Firstly, the characteristics of different UAV types, including fixed-wing, multi-rotor UAV, single-rotor UAV, and tilt-rotor UAV, are introduced. Secondly, the key constraints of trajectory planning in complex environments are summarized. Thirdly, the research trend from 2010 to 2024, together with the implementation, advantages, and existing problems of machine learning, evolutionary algorithms, and swarm intelligence, are compared. Based on these algorithms, the related applications of UAVs in complex environments, including transportation, inspection, and other tasks, are summarized. Ultimately, this review provides practical guidance for developing intelligent trajectory planning methods for UAVs to achieve the minimal amount of time spent on computation, efficient dynamic collision avoidance, and superior task completion ability.

Keywords:

unmanned aerial vehicle; trajectory planning; intelligent algorithm; complex environments

1. Introduction

As an aircraft that carries a certain load and performs specific tasks autonomously, an unmanned aerial vehicle (UAV) is remotely controlled by ground operators or even requires no human supervision [1,2,3]. Compared with manned aircraft, UAVs demonstrate higher operational flexibility, smaller size, longer endurance, and better invisibility. As the implementation of UAVs becomes more intelligent, controllable, and widespread, the importance of trajectory planning of UAVs grows in relevance [4,5].

Trajectory planning intends to obtain one or more feasible trajectories of a single or multiple UAV from the origin to the destination with specific tasks or necessary waypoints [6,7]. These planned trajectories are designed to satisfy corresponding constraints, including UAV dynamics [8], obstacle avoidance [9], distance between swarm members [10], and desired formations [11]. These constraints are usually associated with flight safety, trajectory feasibility, and task requirements. The objectives in trajectory planning can be single or multiple. For example, determining the shortest trajectory between two fixed points is an instance of single-goal trajectory planning. In contrast, multiple-goal trajectory planning involves managing several objectives simultaneously, such as minimizing both flight time and control input. The trajectory planning issues for a single UAV regarding the optimal trajectory [12], specific area search [13], and threat avoidance [6] have been extensively studied. Nowadays, due to limitations including battery life, load capacity, and failure rate, the UAV swarm emerges as crucial to expanding the capabilities of a single UAV through cooperation and completing more difficult tasks, which has become the research emphasis all over the world [14].

In the past several decades, researchers have proposed numerous algorithms for trajectory planning issues of single or multiple UAVs. Classical algorithms include artificial potential field (APF) [15], algorithms based on graph search [16], algorithms based on sampling [3], and algorithms based on mathematical models [17]. The graph search-based trajectory planning algorithm often has better completeness in trajectory planning in low-dimensional spaces [18]. Still, it requires complete modeling of the environment, which often leads to dimensionality disasters in high-dimensional spaces. The sampling-based trajectory planning algorithm replaces completeness with probability completeness, thereby improving search efficiency. However, the trajectory often needs to be post-processed utilizing curve fitting and smoothing, which increases the computational cost [19]. Algorithms based on mathematical models [20] incorporate relevant factors into the trajectory planning problem as inequality or equality constraints to obtain the optimal trajectory. However, these algorithms demand extensive computation to solve constrained multi-objective optimization problems [21]. Consequently, it is difficult to apply the mathematical model-based algorithms in complex environments [22]. It is relatively difficult for these classic algorithms to simultaneously meet the requirements of practicality, safety, and complex environmental adaptability in UAV trajectory planning.

With the continuous evolution of artificial intelligence, intelligent algorithms, including machine learning [23], evolutionary algorithms [24], and swarm intelligence [25], have been integrated into the trajectory planning of UAVs. These methods simplify the environment and UAV modeling process and use powerful search methods instead to converge on the target stably, so these algorithms have great advantages over classical algorithms in unknown environments and dynamic obstacles [26,27]. The fault tolerance and learning abilities of machine learning are beneficial for trajectory planning methods to handle dynamic environments [28]. The evolutionary algorithm-based method has a strong global search capability. The characteristics of swarm intelligence [29,30] make it not require central control, giving trajectory planning methods decentralized computational capability. Attributable to the benefits of intelligent algorithm-based trajectory planning, UAVs are widely used in many domains, including transportation [31,32], inspection [33,34], emergency rescue [35], and others.

Currently, there are several review articles on the intelligent algorithms for UAV trajectory planning. For example, Puente-Castro et al. [36] reviewed articles on UAV trajectory planning based on intelligent algorithms. Yahia [37] and Hooshyar [38] systematically introduced UAV trajectory planning using meta-heuristic algorithms, respectively. Poudel et al. [39] summarized bio-inspired algorithms for trajectory planning, highlighting key characteristics, working principles, advantages, and weaknesses. For multiple UAVs, Sharma et al. [40] summarized the UAV swarm trajectory planning techniques for intercepting multiple targets. Tang et al. [41] reviewed the merits and demerits of various swarm intelligence algorithms. However, the existing reviews mentioned above tend to classify and discuss trajectory planning methods from the perspective of intelligent algorithm types, while the constraints of trajectory planning in complex environments are not considered; thus, specific applications based on these intelligent algorithms are not well summarized when analyzing the development of trajectory planning.

To have a more systematic understanding of the characteristics of various trajectory planning methods in complex environments, it is essential to summarize the current progress. This review analyzes trajectory planning characteristics of several typical UAVs and further discusses the challenges of UAV trajectory planning in complex environments. Based on the excellent performance as well as existing problems of intelligent algorithms in these complex conditions, the implementation of UAV trajectory planning through intelligent algorithms is introduced, as shown in Figure 1. Compared with these existing review articles, the main contributions of this work are as follows:

(1): The trajectory planning features and constraints of typical types of UAV in complex environments are classified and introduced.
(2): The growth trends, development processes, and specific applications of different intelligent algorithms in complex environments are reviewed and compared.
(3): The limitations of current UAV trajectory planning algorithms in complex environments and potential future research directions are proposed and discussed.

This review aims to summarize the UAV trajectory planning methods in complex environments within cutting-edge research and provide practical suggestions for the development of intelligent algorithm applications in these scenarios. Differences between this review and existing reviews are shown in Table 1.

2. Trajectory Planning Features

2.1. Trajectory Planning Features Across Various UAV Types

According to different types of wing and rotor configurations, UAVs currently in widespread use can be broadly classified into fixed-wing UAVs, multi-rotor UAVs, single-rotor UAVs, and tilt-rotor UAVs. Each type has inherent differences in its fuselage structures and aerodynamic features, resulting in different requirements for trajectory planning methods, application areas, and technical challenges. Therefore, it is crucial to develop trajectory planning strategies that are customized to the specific features of each UAVs platform. Representative illustrations of UAVs with different wing configurations are shown in Figure 2 [42,43,44,45].

In terms of flight capabilities, multi-rotor UAVs, single-rotor UAVs, and tilt-rotor UAVs all can hover, and although fixed-wing UAVs cannot hover in place, they can cruise at high speed. In terms of structure, fixed-wing UAVs and multi-rotor UAVs have relatively simple structures, while single-rotor UAVs and tilt-rotor UAVs involve complex pitch-changing mechanisms and state switching issues. From the perspective of dynamic characteristics, multi-rotor UAVs have fast response characteristics, while fixed-wing UAVs have larger minimum turning radius restrictions. The comparison of characteristics between various types of UAVs is shown in Table 2. These differences in fuselage structure and flight characteristics also bring different requirements for trajectory planning.

Fixed-wing UAVs represent a class of unmanned vehicles that generate lift via fixed wings and typically use engines or electric motors for propulsion. They offer superior endurance and are particularly suitable for long-duration, heavy-load, or high-altitude tasks. Due to their reliance on continuous forward motion for lift generation, fixed-wing UAVs cannot hover and must sustain a minimum velocity to remain airborne. Therefore, fixed-wing UAVs face notable challenges in trajectory planning, particularly related to stringent dynamic constraints and high requirements for trajectory smoothness, which need to account for factors such as minimum turning radius, pitch rate limit, and climb/descent angle constraints, while avoiding abrupt directional changes [42]. Accordingly, trajectory planning algorithms such as Dubins curves and B-spline trajectories are commonly employed. Recent advancements have also introduced intelligent algorithm-assisted trajectory planning techniques, which enhance flight safety by enabling real-time path risk assessment and trajectory evaluation. For example, Ma et al. [46] developed a strategy that integrated Dubins curve-based trajectory planning for single-UAV coverage tasks and a parallel-interval formation approach for coordinated coverage using fixed-wing UAV swarms to achieve rapid coverage search in complex boundary sea areas. Freitas et al. [47] developed a differential evolution-based algorithm for 3D trajectory planning, using a kth-order non-uniform rational B-spline curve to generate smooth and differentiable paths under climb/dive angle and curvature constraints.

Multi-rotor UAVs rely on multiple propellers to provide lift and attitude control. The most common types include quadrotors and hexacopters. Characterized by their symmetrical structure, high maneuverability, and excellent hovering capabilities, these UAVs offer convenient vertical take-off and landing (VTOL), making them a focal point in the development of advanced trajectory planning methods [48]. Currently, almost all of the most widely used methods, such as sampling, optimization algorithms, intelligent algorithms, and other trajectory planning methods, can be well applied to multi-rotor UAVs because of their high degree of freedom and strong environmental adaptability [49]. When they are applied in complex environments such as urban areas and forested regions, they need to perform rapid dynamic obstacle avoidance and maintain real-time responsiveness. These operational demands pose significant challenges in localization and mapping, especially under the constraints of high-frequency control. Additionally, their relatively small size makes them susceptible to external disturbances such as wind or interference from nearby UAVs, particularly in swarm operations, further complicating trajectory planning and system stability.

As another type of VTOL aircraft, single-rotor UAVs generate lift through a single large main rotor and typically employ a tail rotor or alternative anti-torque mechanisms to maintain directional stability [44]. The most prevalent type of single-rotor UAV is the helicopter, which, with its unique hovering ability and flexible maneuverability, is suitable for tasks that require vertical take-off and landing, low-speed precision operations, or complex terrain access. Due to its main rotor being involved in achieving attitude and lift control, the propulsion is highly coupled with the fuselage attitude, and the trajectory planning of single-rotor UAV requires consideration of the nonlinear and strongly coupled dynamic characteristics [50]. Moreover, their dynamic behavior is highly sensitive to wind disturbances and sudden environmental changes, which complicates planning a stable trajectory. The risks brought by their unique flight characteristics should also be considered. The vertical speed of the planned trajectory needs to be limited to avoid the aircraft entering a vortex ring state, and the landing trajectory should also be smooth to avoid ground resonance. At present, trajectory planning methods for single-rotor UAVs primarily include sampling-based algorithms that leverage large-scale environmental maps for navigation and obstacle avoidance [51], as well as intelligent algorithm-based strategies designed to enhance robustness in disturbed operational conditions [52].

Research on the trajectory planning of tilt-rotor UAVs remains relatively limited. As a novel class of UAV, it combines the structure of fixed-wing UAVs and rotary-wing UAVs. The rotors can be erected to generate vertical lift during take-off and landing and tilted horizontally to provide forward thrust during cruising flight, thereby enabling VTOL capabilities alongside high-speed cruising performance [45]. However, trajectory planning for such UAVs presents significant challenges due to their highly nonlinear dynamics and the strong coupling problem of mode-switching processes. The trajectory needs to consider the smooth transition between the vertical mode and the horizontal mode, and it is difficult to coordinate the trajectory continuity and controllability. In addressing these challenges, trajectory planning methods for tilt-rotor UAVs often employ hybrid algorithms tailored to manage mode-switching dynamics [53].

2.2. Trajectory Planning Features in Complex Environments

In practical application scenarios of complex environments, UAV trajectory planning is influenced by a variety of factors, including obstacles, terrain features, weather conditions, and air defense systems [54]. In this case, the design of trajectory planning methods for UAVs needs to ensure flight stability, maintain task efficiency, and avoid obstacles while conforming to aerodynamic and motion constraints. In multi-UAV systems, additional requirements such as inter-UAV collision avoidance, coordinated communication, and formation control arise, further increasing the dimensionality and complexity of the trajectory planning process [55]. These include higher-dimensional state space and action space, additional constraints, and multi-objective optimization conflicts. The presence of complex and dynamic obstacle environments significantly exacerbates the challenges associated with efficient and reliable trajectory generation.

Dense obstacles in complex environments significantly increase planning difficulty. Typical high-risk scenarios include mountainous terrain, densely built urban areas, and narrow passages in buildings. Dense obstacles will strictly limit the feasible solution space of the UAV and reduce the trajectory tolerance space, requiring the planning method to find a safe trajectory in a limited space, and a large number of obstacle avoidance constraints will greatly increase the computational complexity of trajectory planning methods. Additionally, the narrow environment will amplify the UAV’s dynamic constraints, such as physical limitations, such as maximum acceleration and minimum turning radius, making it more difficult for the trajectory to meet both feasibility and safety requirements. To meet these challenges, current solutions mostly use global coarse planning combined with local optimization methods, constraint optimization methods based on safe flight corridors, and adaptive strategies that integrate machine learning, etc. For complex mountainous environments, Zhang et al. [56] considered terrain and threat areas in the UAV objective function of trajectory planning, as shown in Figure 3a. For complex urban city environments, Lin et al. [57] proposed a stage-by-stage UAV trajectory planning method to plan a collision-free corridor in the offline stage and plan the minimum time trajectory within the corridor in the online stage, enabling rapid generation of feasible UAV trajectories in complex urban landscapes, as illustrated in Figure 3b. For complex, narrow, and constrained environments, Freitas et al. [47] considered the climb/dive angle limitations and optimal turning radius for fixed-wing UAVs and proposed a planning method based on differential evolution to effectively generate viable trajectories through tight spaces, such as tunnels, as displayed in Figure 3c.

In addition to the detected and known obstacles, the presence of unknown and dynamic obstacles introduces additional challenges to UAV trajectory planning in complex environments. First, unknown and dynamic obstacles render offline trajectory planning methods ineffective, while online planning methods face a conflict between real-time requirements and computational complexity in complex environments. Second, it is more difficult to balance safety and trajectory quality, as emergency obstacle avoidance may lead to severe trajectory oscillations or local optima entrapment. The current mainstream solutions include reducing the amount of calculation to increase the planning frequency, enhancing global search capabilities and optimization strategies, or introducing learning algorithms to optimize the prediction accuracy of dynamic obstacles. For instance, Li et al. [58] constructed an online trajectory planning method for UAVs on the basis of the improved deep deterministic policy gradient algorithm for uncertain environments, which introduced a mixed noise consisting of Gaussian noise and Ornstein-Uhlenbeck noise for escaping from local optimality. When the external environment changes, this method does not require remodeling the environment and planning new effective trajectories, saving computing resources. Another method is to use a density-based spatial clustering of applications with noise to extract the position and velocity of dynamic obstacles and generate the collision-free velocity of the UAV via the gradient velocity obstacle. This method retained the original feasible set while ensuring computational efficiency and showed excellent fault tolerance to environmental perception noise, generating collision-free trajectories in real time when facing unknown obstacles [59].

In complex obstacle environments, UAVs are often required to frequently perform maneuvers such as acceleration and deceleration, turning, and attitude adjustment to avoid densely distributed obstacles. Compared with smooth, straight-line flight, such high-frequency attitude changes and detours make the flight trajectory longer and lead to increased energy consumption of the power system, thereby shortening the UAV’s endurance. In addition, frequent maneuvers make energy consumption more uncertain, and the energy budget and duration of the flight task are difficult to accurately predict. To address this limitation, a common approach is to use trajectory smoothing methods to reduce the energy loss caused by drastic velocity changes and attitude changes. Another approach is to incorporate energy consumption into the trajectory optimization objective function and design energy-optimal trajectories. In addition, various auxiliary measures have also been adopted to improve the endurance of UAVs, thereby extending their application scope. For example, Shao et al. [12] developed a support system comprising battery swap and service stations. To optimize the UAVs’ operation, they proposed a trajectory planning method based on an improved Ant Colony Optimization (ACO) algorithm, aiming to optimize the total flight distance and frequency of landings. Additionally, with the development of solar cell technology, solar-powered UAVs have become an effective solution to improve the endurance and reliability of the electric UAVs for extended tasks [60]. Fuel-powered UAVs can be refueled in the air to replenish energy. Chen et al. [61] established a collaborative trajectory planning method for UAV swarms and tankers and realized the refueling of UAVs by tankers in complex environments, as shown in Figure 4.

In general, dense obstacles in complex environments significantly compress the feasible solution space, increase the computational complexity, and amplify the impact of dynamic constraints on trajectory feasibility. In the face of unknown or dynamic obstacles, traditional offline planning methods are difficult to apply, and online planning methods must balance real-time and safety within a limited time. At the same time, frequent maneuvers significantly increase energy consumption and reduce endurance, making energy management more difficult. The above analysis demonstrates that trajectory planning in complex environments must achieve a multi-objective trade-off between computational efficiency, safety, dynamic feasibility, and energy consumption.

3. Intelligent Algorithms for UAV Trajectory Planning

Considering the difficulties faced by UAV trajectory planning in complex environments, traditional trajectory planning methods of UAVs have shown obvious limitations. In recent decades, intelligent algorithms have shown unique advantages in solving high-dimensional nonlinear programming problems due to their good global search capabilities and adaptability and have represented a significant research direction in UAV trajectory planning. Therefore, trajectory planning methods based on intelligent algorithms are summarized and analyzed.

To analyze the development status, the Web of Science search engine was used to search research articles on intelligent algorithm-based UAV trajectory planning from 2010 to 2024, and they were classified according to the types of intelligent algorithms. The methods are divided into three algorithms: machine learning (ML), evolutionary algorithms (EA), and also swarm intelligence (SI) algorithms. The specific classification is shown in Figure 5, where the dotted arrow indicates that deep reinforcement learning is achieved by combining deep learning and reinforcement learning.

In addition, the number of retrieved research articles on intelligent algorithm-based UAV trajectory planning was counted to analyze the development trend. As shown in Figure 6, the number of related studies has gradually increased in the past 15 years. Especially after 2018, the growth trend has accelerated significantly, indicating that UAV trajectory planning methods based on intelligent algorithms have received widespread attention. In this section, ML, EA, and SI and their applications in UAV trajectory planning are introduced.

3.1. Machine Learning

ML involves algorithms to analyze data, learn from patterns, and enable machines to automatically make decisions and predictions about events. As computer technology continues to develop, deep learning (DL), reinforcement learning (RL), and deep reinforcement learning (DRL) have emerged as significant research directions in ML, which are widely used in trajectory planning of UAVs.

3.1.1. Deep Learning

DL originates from the study of neural networks (NN) and primarily employs deep neural networks (DNN) for learning and prediction. As an imitation of the nervous system of organisms, NN is a large-scale structure formed by layers of connected nodes [36]. DNN can be broadly understood as an NN architecture that incorporates multiple hidden layers [62], which extracts features from data through multi-layer nonlinear transformations, suitable for massive data sets and complex tasks [63]. Subsequently, DNN has evolved into many different network topologies, including convolutional neural network (CNN) [64] and recurrent neural network (RNN).

Following an introduction of an NN model with nonlinear analog neurons for trajectory planning by Glasius in 1995 [23], the development of NN in UAV trajectory planning has been revealed. To improve the feasibility of trajectory planning, Liu et al. [65] considered the tracking characteristics of UAVs and introduced the trajectory planning method based on CNN and model predictive control. Due to the limitation of traditional CNNs in using historical data, this method developed a time-series CNN to adapt to time-series prediction problems by adding time-series properties to it. In contrast, the design of RNNs made them naturally capable of processing and memorizing long-term sequence data. Shaffer et al. [66] explored a method for training RNNs to solve multi-agent trajectory planning problems. In addition, this method included a decentralized version, which enhanced the scalability of the swarm. With the development of NN, numerous derived forms have emerged in recent years [67,68]. Li et al. [69] used a graph neural network (GNN) integrated with a mechanism that allows message-dependent attention and a message-aware graph attention network in UAV trajectory planning for multiple agents, which supported the trajectory planning method to maintain high effectiveness in scalable and unseen environments. Another form of NN in UAV is applied in vision-based navigation, especially obstacle avoidance [70,71,72]. Although NN-based trajectory planning methods typically exhibit strong fault tolerance and learning abilities, the opacity of trained neural networks often restricts their practical applications.

In recent years, end-to-end learning methods have been extensively explored for navigation [73,74,75]. Unlike conventional modular frameworks that process perception, trajectory planning, and control as separate tasks, end-to-end methods aim to directly map raw sensor inputs to control outputs, thereby simplifying the overall system architecture and minimizing the accumulation of intermediate errors [76]. The Transformer architecture has emerged as a powerful tool for achieving end-to-end learning due to its strong representation capabilities and parallel computing characteristics, which have two main components: the encoder and the decoder, as shown in Figure 7 [77]. The encoder is composed of several identical layers stacked together; each layer includes a multi-head self-attention layer (MHSA), a feed-forward network, layer normalization, and a residual connection. The basic structure of the decoder is similar to that of the encoder. Still, MHSA is replaced by a multi-head partial self-attention layer (MHPSA), and each layer has an additional masked multi-head attention layer (Mask-MHA) than the encoder. The encoder extracts the characteristics of the input, and the decoder gradually generates the output based on the encoding result and through Mask-MHA. Based on the self-attention mechanism, Transformer has demonstrated exceptional performance in handling sequential data and capturing long-range dependencies and has proven suitable for various sequence modeling tasks in dynamic environments [78]. Wang et al. [79] demonstrated that the proposed decentralized deep reinforcement learning framework for the UAV swarm trajectory planning, integrating a Transformer architecture with an adaptive reward mechanism to enable spatial cooperation and exploration, outperformed all baselines. Similarly, an innovative end-to-end Transformer-based architecture was designed for sequential data processing, enabling mobile robots to autonomously navigate in various unknown environments and surpassing the performance of existing methods [75].

3.1.2. Reinforcement Learning

RL involves algorithms that enable a UAV to develop its optimal strategy to solve problems in interacting with the environment (Figure 8) [80]. Unlike DL, RL operates without the need for pre-collected data. It focuses on decision-making and action selection in the environment for trajectory planning in unknown environments.

Reinforcement learning methods include basic elements such as an agent that executes decision-making, an environment with which the agent interacts, state representations that encapsulate environmental information, actions that the agent can perform, rewards that quantify immediate performance, policy functions that map states to actions, and value functions that estimate long-term return expectations. The policy defines the mapping relationship from state to action, which is the core of reinforcement learning. According to different learning methods, reinforcement learning methods can be divided into three categories: value-based methods indirectly obtain policy by learning state-action value functions, which are suitable for discrete action spaces; policy-based methods directly optimize policy functions, which can handle continuous action spaces, but there are problems such as large variance and slow convergence; and actor-critic methods combine the advantages of both, using a critic to evaluate value functions and guide an actor to optimize policy, which achieves more stable and efficient learning.

Owing to the strength in learning optimal strategy, RL-based UAV trajectory planning methods have been developed. Q-learning is a basic RL algorithm that learns which action to choose in each state to bring the maximum cumulative reward by iteratively updating the Q value (state-action value function). Zhao et al. [81] combined asynchronous methods with the tabular Q-learning algorithm and proposed an asynchronous RL architecture for trajectory planning in discrete space. This method alleviated the issue where classical RL relies on continuous updates from a single agent to learn policies and easily leads to slow convergence. However, this method is specifically designed for the fully known environment. Q-learning cannot use geometric distance information when only part of the information about the environment is available. Therefore, Zhang et al. [82] proposed a trajectory planning method according to geometric reinforcement learning (GRL), in which the reward matrix is adaptively updated by the geometric distance and risk information from detection sensors and other UAVs. Moreover, by adjusting the parameters in GRL, a balance between safety and cost-effectiveness can be achieved. Cui et al. [83] proposed a trajectory planning algorithm to plan a collision-free trajectory based on multi-layer RL, which included a lower-layer RL that led the UAV to the terminal location and a higher-layer RL that avoided dynamic obstacles. Compared with classical Q-learning, this method collects both global and local information to significantly enhance performance. However, for continuous states and large state spaces, traditional RL algorithms may encounter the curse of dimensionality.

3.1.3. Deep Reinforcement Learning

Traditional RL is generally applicable only when the set of states and actions is a finite discrete basis and the number of states and actions is small. The cases in practical applications may be very complex, and it is challenging to define discrete states. To deal with high-dimensional input data, NN is used in RL to fit the policy function and value function [84]. This method is called deep reinforcement learning (DRL), which is a cutting-edge research direction in ML. DRL combines the perception capabilities of DL with the decision-making capabilities of RL. As an environment-independent algorithm, DRL demonstrates greater effectiveness in managing high-dimensional UAV trajectory planning in complex, unknown, uncertain, and dynamic environments compared to earlier conventional algorithms that rely on prior knowledge.

Differing from conventional methods where neural networks are commonly opaque, He et al. [85] introduced a twin delayed deep deterministic policy gradient (TD3) reinforcement learning method for UAV trajectory planning and a model explanation method based on feature attribution to achieve model explainability, which generated some easy-to-interpret text and visual explanations. Yan et al. [86] developed a DRL method using an improved dueling double deep Q-networks (D3QN) algorithm for Q-value prediction and an action selection policy for UAVs in dynamic environments with potential enemy threats. For multi-UAV cooperative trajectory planning in obstacle and narrow environments, Xing et al. [11] introduced a UAV swarm adaptive cooperative trajectory planning method according to improved DRL by innovatively introducing long short-term memory (LSTM) RNN to the multi-agent twin delayed deep deterministic policy gradient (MATD3) network, which improved the optimality of trajectory planning and adaptability to narrow passages. Wang et al. [87] developed a trajectory planning method according to D3QN that operates without the need for any prior knowledge of the environment and other UAVs. However, for DRL-based methods, designing an appropriate reward function that effectively guides the agent toward the desired behavior remains a non-trivial task. If the reward function is not carefully designed, the final result may not align with the original objectives.

3.1.4. Other Machine Learning Methods

DL, RL, and DRL have been extensively applied in UAV trajectory planning. In addition, other types of ML methods have also been tried in this field, including newly proposed methods and traditional machine learning methods. Transfer learning (TL) is a method that transfers the common features or model parameters learned in a source task to a target task to improve the performance of the target task. Bo et al. [88] established the pretrained model, utilizing the TL method to incorporate it into formal training, thereby integrating prior knowledge to enhance training performance and endow UAV with the capability to autonomously avoid obstacles and plan trajectories in unknown environments. For the UAV trajectory planning problem with connectivity outage constraint, Fontanesi et al. [89] proposed a TL method that leverages a teacher policy trained in a source domain to accelerate the learning in a target domain. Moreover, Pérez-Cutiño et al. [90] proposed an algorithm that recursively generates trajectories based on the output of neural networks and random forests (RF) to calculate energy-efficient trajectories for UAVs. Support vector machine (SVM), also a traditional machine learning method, is often used to establish safe flight corridors for UAV in trajectory planning [91,92].

Table 3 provides a summary of the development of various MLs in UAV trajectory planning. Among them, DL can be trained on data, making the UAV trajectory planning method highly adaptable in complex environments. In contrast, insufficient or inappropriate training data may lead to unsatisfactory planned trajectories. RL does not require the collection of training datasets in advance and is better suited for trajectory planning in unknown or dynamic environments. ML-based UAV trajectory planning methods’ fault tolerance and learning abilities demonstrate considerable advantages for dealing with complex and dynamic environments. Additionally, certain challenges persist, including the unpredictability of ML algorithms and the potential for erroneous decisions that may result in safety and reliability issues.

3.2. Evolutionary Algorithms

The core idea of EA is to imitate the evolutionary process of natural populations and continuously optimize the quality of solutions. They are a probabilistic optimization algorithm based on the model of natural evolution [93]. EAs do not require gradient information and therefore do not require the continuity and differentiability of functions, making them suitable for addressing problems that traditional optimization algorithms find difficult to solve in UAV trajectory planning.

A statistical analysis was conducted to quantify the number of research studies utilizing each type of EA for UAV trajectory planning from 2010 to 2024, with the results visualized in Figure 9. The statistical results indicate that the number of research studies based on genetic algorithms (GA) and differential evolution (DE) is significantly higher than other types of EAs; therefore, these two methods are the main discussion content of this section.

3.2.1. Genetic Algorithm

GA involves a random global search algorithm that simulates natural biological evolution and was first proposed by Holland in 1975 [94]. This heuristic algorithm is based on genetics and Darwinian evolution and imposes no limitations on derivation and function continuity.

In UAV trajectory planning, GA provides advantages including adaptability, global optimality, and implicit parallelism [95]. Cao et al. [96] transformed the minimum stay time into the shortest trajectory combinatorial optimization in a complex battlefield environment with multiple bases and used GA to generate reconnaissance trajectories. Ergezer and Leblebicioglu [97] developed new mutation operators tailored to task requirements and constraints to plan a better trajectory for maximizing information collection. For multi-UAV, Sahingoz [98] developed a parallel GA to calculate a feasible trajectory for each UAV in a known environment. Shorakaei et al. [99] implemented trajectory planning for multi-UAV based on probability graphs and parallel GA to optimize the trajectory of each of the UAVs and avoid UAV collisions. However, in complex search spaces, GA also faces challenges of slower convergence speed, longer calculation time, and the risk of prematurely converging to a local optimum. Various improvements have been introduced to enable GA to escape the local optimum, one of which is the vibration genetic algorithm (VGA). VGA is implemented by periodically using vibrational mutation operators on all genes in the entire population to disperse individuals [100]. Pelivanoglu [101] then extended VGA to the multi-frequency vibration genetic algorithm (mVGA), incorporating the fuzzy c-means clustering method and Voronoi diagram to improve the initial population. This method effectively reduced the calculation time in addressing UAV trajectory planning problems. However, the vibration mechanism introduces additional parameters requiring adjustment, and the performance of the VGA may be susceptible to these parameter settings.

3.2.2. Differential Evolution

DE involves a stochastic global search algorithm that was proposed by Storn and Price [102]. The order and method of selection, crossover, and mutation operations in DE differ from those in GA. Compared with traditional GA, DE has lower computational complexity, efficient memory utilization, and faster convergence [103].

In the application process, UAV trajectory planning is usually modeled as an optimization problem and solved using DE. Brintaki and Nikolos [104] designed an offline trajectory planner based on DE for known static environments, demonstrating that DE can effectively find feasible trajectories for UAV under constraints within an acceptable period. However, traditional strategy configuration and parameter tuning methods are cumbersome, so researchers have introduced adaptive mechanisms to improve performance. Adhikari et al. [105] proposed a fuzzy adaptive DE (FA-DE) for the 3D UAV trajectory planning problem, which automatically adjusts parameters through a fuzzy logic controller. This method can better explore and exploit the search space and reduce the possibility of premature convergence. Yu et al. [106] designed an adaptive selection mutation operator for constraint differential evolution (CDE), which determines the probability of an individual entering the mutation phase according to the individual’s fitness value and constraint violation, thereby improving the exploitation and maintaining the exploration. It is worth noting that when UAVs operate in a complex environment, the number of constraints increases, increasing the requirements for trajectory planning methods. In recent years, the ensemble strategy method has been used to design high-efficiency DE algorithms [107,108,109]. Chai et al. [110] developed the Multi-Strategy Fusion Differential Evolution (MSFDE) algorithm, which integrates the multi-population strategy, teaching-learning-based optimization (TLBO)-based adaptive strategy, and interactive mutation strategy to improve the performance of UAV trajectory planning with multi-objective constraints in complex environments.

3.2.3. Other Evolutionary Algorithms

In addition to GA and DE, other types of EAs have been tried to plan UAV trajectories. One type is the genetic programming (GP) proposed by Koza [111], in which the potential solutions are coded in the form of trees instead of the linear chromosomes widely used in GA. Yang et al. [112] used GP for UAV trajectory planning and proposed several specially designed functions and a redesigned decoding system. They pointed out that GP showed better modeling and optimizing ability in trajectory planning. Another method is the evolution strategy (ES), which has a different selection operation compared to GA. Hohmann et al. [113] proposed the hybrid evolutionary strategy (HES) that combines ES with the exact Dijkstra algorithm for multi-objective trajectory planning of UAV. In addition, for the multi-objective trajectory planning problem for UAVs in 3D large-scale urban environments, the same research team used traditional ES and multi-objective covariance matrix adaptation evolution strategy (MO-CMA-ES) as optimizers to solve the established optimization problem, thereby planning trajectories for UAVs [114].

Table 4 provides an overview of the development of various EAs in UAV trajectory planning. The implicit parallelism of EA makes it more efficient when dealing with large-scale problems. Meanwhile, operations such as crossover and mutation can explore multiple possible solutions in the solution space, endowing EAs with strong global search capabilities. The problem of classical EAs prematurely converging to local optima in some cases has also been alleviated by introducing certain mechanisms. However, it is worth noting that the performance of EAs is sensitive to parameters including population and crossover and mutation probabilities. Improper parameter selection may adversely affect the effectiveness of trajectory planning.

3.3. Swarm Intelligence

Swarm intelligence (SI) originated in nature as a bionic intelligence algorithm. In the biological world, individuals within a group often exhibit coordinated and orderly behaviors during their activities [115]. In this way, simple tasks completed by individuals can be superimposed to accomplish complex tasks. SI provides another solution for UAV trajectory planning to deploy relatively basic agents for intricate and goal-based behaviors by emulating the natural group dynamic systems [116].

The number of research studies on UAV trajectory planning according to each SI algorithm is analyzed from 2010 to 2024 (Figure 10). It shows that the research on particle swarm optimization (PSO) and ant colony optimization (ACO) has the fastest-growing trend and the largest cumulative number, which are mainly summarized.

3.3.1. Particle Swarm Optimization

PSO originated from the study of the predatory behavior of bird flocks. It assumes that individuals navigate the search space as ideal particles and utilize information sharing among group members to transition the entire group from a disordered state to an ordered state to obtain the optimal solution [117]. The search space is defined as the set of all probabilistic solutions to the problem. Multiple particles are initialized with random velocities and positions in the search space. At each time step, each particle needs to calculate its fitness value and adjust its velocity and position based on the global optimal solution and individual optimal value. In PSO, the

i

-th particle updates its velocity and position at the

T

-th step according to the following equation [40]:

\begin{array}{l} V_{i}^{T + 1} = W V_{i}^{T} + r_{1} C_{1} (P_{b e s t} - X_{i}^{T}) + r_{2} C_{2} (G_{b e s t} - X_{i}^{T}) \\ X_{i}^{T + 1} = X_{i}^{T} + V_{i}^{T} \end{array}

(1)

where

X_{i}^{T}

and

V_{i}^{T}

denote the position and velocity vectors of the

i

-th particle at

T

-th step,

W

represents the inertia weight that maintains a balance of global and local search capabilities,

C_{1}

and

C_{2}

denote the acceleration constants preset by users,

r_{1}

and

r_{2}

denote the random numbers that are generated in the range of

[0, 1]

,

P_{b e s t}

denotes the optimal position of the

i

-th particle at

T

-th step,

G_{b e s t}

denotes the global optimal position, respectively.

The advantages of PSO lie in its minimal parameter settings and ease of implementation [25]. However, the optimization trajectory is limited by the individual and group particle experience, and over-reliance on individual particles is possible, making the algorithm become trapped in local optima. Numerous enhanced algorithms based on PSO have been proposed to address this challenge of UAV. To plan a trajectory with a balance of the flying distance, altitude, and cost, Liu et al. [118] addressed the issues of local optima and premature convergence by integrating a new adaptive sensitivity decision operator with PSO, achieving global search and fast convergence capabilities. Meanwhile, computation time was reduced by locating the potential position of the particle that was determined with high probability while eliminating other candidate particles. In addition, Phung et al. [119] proposed a new spherical vector-based particle swarm optimization (SPSO) algorithm for UAV trajectory planning in multi-threat complex environments via the correspondence of particle position and UAV velocity, turning angle, and diving angle. SPSO was used to efficiently search the UAV configuration space to determine the optimal trajectory by minimizing the cost function.

3.3.2. Ant Colony Optimization

ACO originated from a heuristic algorithm that simulates the activities of ants [120]. Ants release pheromones and travel in directions with high pheromone concentrations. This cooperative mechanism allows the ant colony to accumulate pheromones faster on shorter trajectories, ultimately finding the shortest trajectory. As an imitation of natural behavior, ACO first models the UAV flight area using nodes and initializes the number and origin location of agents [121]. Then, the agents construct trajectories and deposit pheromones accordingly. The probability distribution of

k

-th agent choosing the next node to visit is as follows:

P_{i j}^{k} (t) = \{\begin{array}{l} \frac{{[τ_{i j} (t)]}^{α} {[η_{i j} (t)]}^{β}}{\sum_{s \in a l l o w_{k}} {[τ_{i s} (t)]}^{α} {[η_{i s} (t)]}^{β}} & s \in a l l o w_{k} \\ 0 & otherwise \end{array}

(2)

where

P_{i j}^{k} (t)

denotes the probability of agent

k

moving from node

i

to node

j

at time

t

,

α

represents the importance coefficient of pheromone concentration,

β

represents the importance coefficient of heuristic information,

τ_{i j}

represents the pheromone concentration accumulated on the link

(i, j)

,

η_{i j}

represents the heuristic information,

a l l o w_{k}

represents the nodes to be visited by agent

k

. Regulating

α

and

β

can controlling the tendency to choose trajectories. When all agents complete an action, the pheromone concentration on each link trajectory is updated based on the following equation:

τ_{i j} (t + 1) = (1 - ρ) τ_{i j} (t) + ρ \sum_{k = 1}^{m} Δ τ_{i j}^{k}

(3)

where

ρ

is the pheromone evaporation coefficient,

m

is the number of agents, and

Δ τ_{i j}^{k}

is the pheromone concentration released by the

k

-th agent on the trajectory link node

i

to node

j

. The agent tends to move in the direction where the pheromone concentration is higher, leading to the global optimal trajectory with the highest pheromone intensity.

ACO has been used to address the requirements of minimizing threat exposure and minimizing the trajectory length required to cover a given area in UAV trajectory planning [122]. For example, Akshya et al. [123] used ACO to find the optimal trajectory covering a given area and compared it with the minimum spanning tree approximation algorithm. The result showed that ACO was better than the latter. In addition, ACO is a distributed algorithm based on individual collaboration and has strong scalability and adaptability. Therefore, it has unique advantages in dealing with the trajectory planning of UAV swarms. Rosalie et al. [124] combined the optimal chaotic migration model with ACO and introduced the chaotic ant colony optimization for coverage (CACOC) algorithm to solve the area coverage problem of UAV swarms. Subsequently, Dentler et al. [7] extended CACOC by a collision avoidance mechanism to compute UAV target waypoints, which effectively completed the coverage task and reduced the average tracking error of UAVs, while increasing the average minimum distance of UAVs.

3.3.3. Other Swarm Intelligence Algorithms

In addition to PSO and ACO, numerous swarm intelligence algorithms are developed, including the firefly algorithm (FA) that imitates the information exchange between fireflies [125,126], bat algorithm (BA) that imitates the echolocation behavior [127], artificial bee colony optimization (ABC) that imitates the intelligent foraging behavior [128], pigeon-inspired optimization (PIO) algorithm that imitates geomagnetic navigation and landmark navigation [129], whale optimization algorithm (WOA) that imitates the hunting characteristics hierarchy of whales [130], grey wolf optimization (GWO) algorithm which imitates the predatory behavior of gray wolves [131,132], dragonfly optimization algorithm (DOA) that imitates the foraging behavior of dragonflies [133]. SI algorithms exhibit substantial large-scale search capabilities and strong scalability, which supports the resolution of complex UAV trajectory planning challenges.

However, SI algorithms also have their limitations, including significant computational complexity, slow convergence, and a tendency to get trapped in local optima. To overcome these shortcomings, hybrid algorithms have been proposed to combine the advantages of individual algorithms to address specific challenges and improve overall performance. To speed up the convergence of bat positions updating in BA, Lin et al. [26] proposed an improved APF method and introduced a chaotic strategy to avoid falling into local optima. Wu et al. [27] introduced an adaptive chaos-Gaussian switching-solving strategy and coordinated decision-making mechanism into the basic WOA, overcoming the shortcoming that WOA is prone to falling into local minima. The combination of methods enhances the global search capability, accelerates the convergence speed, and avoids falling into local optima, which can have better optimization performance and adaptability in trajectory planning.

A summary of the development of various SI algorithms in UAV trajectory planning is shown in Table 5. The SI algorithm can achieve optimization through local interactions between individuals without central control. Therefore, SI-based UAV trajectory planning methods inherently have parallel processing capabilities, enabling them to improve the computational efficiency of trajectory planning. Additionally, the SI algorithm exhibits robustness against individual failures and environmental changes, maintaining effective performance in dynamic and uncertain environments.

4. Trajectory Planning Applications in Complex Environments

According to the systematic analysis of the impact of UAV types and complex environments on UAV trajectory planning and the theoretical progress of intelligent algorithms in trajectory planning, the following content further explores the application of trajectory planning methods according to intelligent algorithms in actual complex task environments. Existing studies have shown that trajectory planning methods based on machine learning, evolutionary algorithms, or swarm intelligence algorithms have demonstrated successful application in typical scenarios such as transportation and inspection. Starting from the characteristics of various tasks, the technical implementation pathways of trajectory planning methods based on intelligent algorithms are summarized across different application scenarios. The adaptability and advantages of these methods in addressing complex factors, such as obstacles, limited communication, and environmental uncertainty, are further analyzed, thus revealing the practicality and development potential of trajectory planning methods based on intelligent algorithms in complex environments.

4.1. Transportation

4.1.1. Slung-Load Transportation

A UAV can transport cargo by rigidly attaching it to the body or by suspending it with cables. The rigid connection method is relatively simple and suitable for delicate operations, while the increased inertia may diminish the attitude response rate of the UAV. The cable suspension method is a common cargo-carrying method for rotary-wing UAVs. This method reduces the impact of the payload on the rotational inertia and avoids the issue of mismatch between the payload and the fuselage shape of the UAV but introduces new constraints to trajectory planning, such as payload swing and cable tension [134]. The swing of the payload increases the instability of the UAV, while excessive cable tension can potentially cause the cable to break. The additional constraints make it more challenging to plan the UAV transportation trajectory in a complex environment.

For the UAV slung-load system, Li et al. [135] proposed a technique for generating the minimum swing trajectory of the UAV suspended load according to Deep Q-Networks (DQN) to control the swing of the suspended load within a certain range during the entire transportation process. Hua et al. [136] proposed a time-optimal trajectory planning strategy based on convex optimization and DRL, which fully considered physical constraints such as velocity, acceleration, and cable tension to ensure the cable tension was within the acceptable range. For time-sensitive cargo delivery tasks in environments with static obstacles, Faust et al. [137] developed a trajectory planning method that combined a sampling-based method and RL to deliver the payload to their destinations promptly without collisions while maintaining bounded load displacement, as shown in Figure 11. The method developed a swing-free strategy constrained to an action subspace, and the RL policy was modified to handle diverse tasks through the use of task-specific action spaces. To optimize the trajectory length, task duration, and energy consumed during the task for a UAV slung-load system, Ergezer [138] developed a multi-objective GA to solve the trajectory planning problem and developed problem-dependent mutation-like operators to avoid obstacles.

For larger or heavier cargo, a single UAV may be insufficient. The collaboration of multiple UAVs is a practical solution to increase payload-carrying capacity. This modular approach offers greater flexibility, with the number of UAVs adjustable based on the task requirements. Different from the trajectory planning of a single UAV, effectively addressing spatial, time, and task coordination among different UAV trajectories is crucial for solving the coordinative trajectory planning problem. The constraints introduced under such tasks mainly include collision avoidance [139], cooperation requirements [140], and formation shape change [141]. The cable suspension method introduces passive degrees of freedom to the UAV, resulting in swing during flight. To solve the problem, Li et al. [142] introduced a trajectory planning method according to the RL algorithm of an approximate value function. In this method, the parameter vector of the approximate value function was acquired via training, and the greedy strategy was used to generate the flight trajectory, aiming to ensure uniform tension on the suspension cables and efficiently transport the load to the target location.

In summary, existing research has fully demonstrated the wide application and significant advantages of trajectory planning methods based on intelligent algorithms in UAV transportation tasks. Such methods generally adopt reinforcement learning or evolutionary algorithms and achieve efficient trajectory planning under multiple constraints such as cable tension, load swing, obstacle avoidance, and time optimization, and they have good generalization and adaptability. With the increase in task complexity, research has gradually expanded to multi-UAV coordinative transportation tasks, solving problems such as time synchronization, spatial coordination, and load distribution. However, these methods still face some challenges, including unstable convergence, insufficient adaptability to dynamic environments, and high complexity of multi-UAV coordinative strategies. Therefore, although trajectory planning methods of intelligent algorithms show practical value and development potential for transportation tasks, further in-depth research is still needed in terms of algorithm efficiency, safety assurance mechanisms, and system coordinative strategies.

4.1.2. Multi-Destination Transport

In addition to the single-destination case, UAVs can also provide supplies to multiple units per delivery to further reduce the delivery cost in the transport scenario. For this more complex task scenario, the constraints on UAV energy consumption, flight time, and obstacle avoidance in the modeling of single-destination transportation are simple, and the constructed model cannot meet the actual delivery needs. In such cases, factors such as pickup and delivery order, maximum load, and endurance need to be considered to plan the trajectories for the UAV, which is a large-scale planning problem.

Regarding the energy consumption variation of UAVs, mixed time windows of the customers, and simultaneous delivery and pickup, Du et al. [143] used the large neighborhood search algorithm to accelerate the convergence rate and introduced an improved GA-based cooperative trajectory planning algorithm to optimize the trajectory, service UAV allocation jointly, and minimize the total energy cost of the UAV swarm during delivery. The proposed algorithm was verified on the Solomon standard data, indicating that the solution obtained by the proposed algorithm has a lower transportation cost relative to classic GA and PSO.

The flexibility of the UAV enhances the efficiency of material transportation, but it is worth noting that some UAVs, especially multi-rotor UAVs, also have disadvantages such as small payload and short range, which reduces the scope of application of pure UAV transportation solutions. The increase in vehicle load and endurance can reduce the delivery cost to a certain extent [143]. Traditional vehicles have the characteristics of large capacity and high reliability. Therefore, integrating UAVs with traditional delivery vehicles is a promising solution to maximize the benefits of both methods and optimize logistics efficiency. Petr and Kutěj [144] developed a model for cooperating ground vehicles and UAVs to supply units on the battlefield. They proposed a metaheuristic algorithm based on ACO as the solution, which improved the probabilistic strategies for UAV deployment, customer assignment, and local search refinement. The attempt shows that the savings achieved by the cooperation of ground vehicles and UAVs are significant, optimizing the delivery process to achieve minimal total completion time while minimizing the total cost of transportation.

These studies demonstrate that trajectory planning methods based on intelligent algorithms exhibit strong multi-objective optimization capabilities, can globally optimize transportation trajectories and service allocation while considering constraints such as payload limits, endurance, and time windows, and show high adaptability in ground-air coordinative delivery scenarios. Evolutionary algorithms and swarm intelligence algorithms are commonly used in such tasks, and the convergence speed of the algorithm and the quality of the solution are improved by combining strategies proposed according to the characteristics of specific problems. However, the complexity of modeling and solving such multi-objective, multi-constraint problems is considerable. Intelligent algorithms often entail significant computational overhead and stability challenges, particularly in dynamic environments, where optimality is difficult to guarantee.

4.2. Inspection

4.2.1. Predetermined Checkpoints

A typical case of an inspection task involves surveying several locations of interest, which requires a UAV with detection capabilities to plan a trajectory through these locations.

The checkpoint-based trajectory planning is an important component for inspection. In this problem, a UAV trajectory needs to be generated to visit all checkpoints while minimizing a specific objective function under several constraints. In practical applications, where reconnaissance often involves multiple targets, it is crucial to study autonomous strategies for planning a collision-free, shortest, or most economical trajectory. Checkpoint-based trajectory planning is non-deterministic polynomial complete (NP-complete), which has no optimal solution in polynomial time. Many studies have simplified this problem to generating an optimal sequence of checkpoints. The trajectory is identified by connecting these checkpoints with straight-line segments.

However, the generated trajectory may be unflyable because the UAV is not able to follow a non-smooth trajectory [145,146]. To plan the smooth trajectory of the target coverage task with predetermined checkpoints, Pehlivanoglu et al. [147] used a splines parameterization trajectory and proposed a GA-based UAV trajectory planning method. This method used the Voronoi diagram, ACO, and clustering methods to enhance the initial population of GA, thereby accelerating the convergence process and significantly shortening the planning time for feasible trajectories. The example phenotypes of individuals from the initial population before and after the enhancement are shown in Figure 12. Compared with conventional GA, the enhancement of the initial population provides random and controlled diversity, significantly reducing the computational time.

In the reconnaissance task of predetermined checkpoints, the UAV trajectory planning method based on intelligent algorithms usually models the task as a combinatorial optimization problem and obtains the trajectory by optimizing the visit sequence of target points and combining smoothing methods. With the assistance of evolutionary algorithms and swarm intelligence algorithms, such methods demonstrate strong global search abilities and multi-objective optimization capabilities, which can improve solution efficiency and handle complex constraints. However, these methods have problems such as sensitive parameter settings and being prone to getting trapped in local optima when handling large-scale checkpoint scenarios or complex environmental constraints, and their application in dynamic environments is still limited.

4.2.2. Minimum Time Search

In tactical inspection tasks, such as frontier search and target discovery, UAVs need to find one or more unknown targets as quickly as possible. The uncertain flight endpoint and the search time requirements make the trajectory planning of UAVs more complicated. Due to the limitations of endurance and flight speed, it is challenging for a single UAV to achieve such minimum time search (MTS) tasks.

Multiple UAV cooperations can improve reconnaissance efficiency by dividing the task into different parts that can be executed simultaneously. Intending to find lost targets in the minimum possible time, Perez-Carabaza et al. [13] developed the ACO-based trajectory planning method for a UAV swarm, which included a heuristic function designed specifically for MTS, enabling ACO to generate high-quality initial solutions and accelerate algorithm convergence. The method achieved better solutions in considerably less computational time compared to several heuristics or optimization algorithms previously tested in the MTS problem. However, policies for UAVs and the problem complexity often require continuous monitoring of the UAV states, which requires communications to be maintained at all times. To enhance the practicality of the trajectory planning method, this research team refined their previous method by incorporating constraints related to communication and collision avoidance, which reduced the time required to locate unknown targets, ensuring continuous multi-hop communication with the ground control station and collision avoidance among UAVs [120].

These studies have demonstrated the effectiveness of intelligent algorithm-based trajectory planning methods in minimum-time search tasks. The implementation path usually includes task area division, search strategy design, initial trajectory generation, and trajectory optimization. The methods exhibit good extensibility to multi-UAV systems, significantly enhancing parallel search efficiency. However, they lack sufficient real-time and robustness in situations where the environment changes rapidly, which limits their application in some practical scenarios.

4.2.3. Sweep Coverage

Another type of inspection task focuses on maximizing information acquisition from target regions. In such tasks, UAVs traverse the entire area to collect relevant sensor information, which can be classified as sweep coverage tasks [148]. To perform these tasks autonomously, coverage trajectory planning is a fundamental problem. Cakici et al. [149] introduced a GA-based trajectory planning method for a UAV swarm with the initial population generated from seed trajectories by the pattern search method and the solution of the multiple traveling salesman problem (mTSP). Similarly, Xie and Chen [150] proposed a GA-based method for trajectory planning to cover multiple disjoint regions with the new sets-based chromosome representation and crossover and mutation operators. However, these methods are implemented in a centralized manner, which hampers system autonomy and makes them susceptible to communication interference. Wang et al. [151] proposed an accurate coverage exploration distributed PSO trajectory planning algorithm based on distributed PSO and an online lawn mower algorithm for achieving complete area coverage and reliable reconnaissance of targets, which used geofencing to handle forbidden zones, boundaries, and collision avoidance and also designed the jump-out mechanism and a revisit mechanism to save invalid search work, as shown in Figure 13. In addition, with the development of end-to-end algorithms, Zhu et al. [76] exploited the state-of-the-art Transformer together with the weighted A* to develop a machine learning algorithm aimed at minimizing the total age of information of data gathered by a UAV from a ground IoT network, demonstrating that this trained model can generate solutions without the need to retrain the model.

Some inspection tasks aim for the UAV swarm to collect maximum information while minimizing the time required. Valente et al. [54] proposed a population-based metaheuristic algorithm called harmony search (HS) for the complete coverage trajectory planning task, which reduced the task time by minimizing the number of turns and was applied to the regular or irregularly shaped region. For tasks where UAVs need to fly close to the 3D terrain surface to thoroughly cover all areas for reconnaissance, Mou et al. [152] developed a geometric method to project a 3D irregular terrain surface into multiple weighted patches and proposed a DRL multi-UAV trajectory planning method based on a leader-follower architecture. The high-level leader UAV (LUAV) used the swarm deep Q-learning (SDQN) RL algorithm to select patches, and the low-level follower UAV (FUAV) planned the coverage trajectory using the spiral-zigzag searching pattern. To mitigate the communication constraints of the LUAV, SDQN incorporates an observation history module that leverages a CNN combined with a mean embedding technique. This method enables efficient and complete patch coverage with minimal overlap and outperforms reinforcement learning approaches such as the vibrating particles model and multi-agent spanning tree coverage in terms of total coverage time. In addition, factors such as unpredictable obstacles and emergencies may lead to UAV malfunctions or disruptions in communication links. To address such complex scenarios, Hu et al. [153] proposed a distributed online collaborative deep Q-learning algorithm trajectory planning method for multi-UAV systems, which introduced an efficient environmental information fusion technique to facilitate UAV collaboration and enable global situational awareness and allowed the algorithms to generate real-time coverage paths that minimize task duration while avoiding overlap, omissions, and collisions.

These studies mainly focus on the two goals of maximizing information collection and minimizing task time. They generally adopt environmental discretization modeling combined with machine learning, evolutionary algorithms, or swarm intelligence algorithms, and deal with practical constraints through technologies such as geographic fencing and jump-out mechanisms. This type of method shows significant improvement in coverage efficiency, adaptability, and scalability and can flexibly adapt to diverse terrains, no-fly zones, and dynamic changes in tasks. However, there are still challenges such as real-time bottlenecks and a lack of robustness, especially in environments with dense obstacles. The computational complexity problem is particularly prominent.

4.3. Others

A UAV can also be equipped with a variety of weapons for combat tasks. It is necessary to consider threat sources, including terrain obstacles, enemy radars, and defense forces, alongside the target and task requirements in this situation. One approach is to follow a trajectory that avoids the air defense threat area. For example, Aslan [154] considered the cost of enemy threats in the task field and the fuel cost to evaluate the trajectory determined for the UAV based on a modified immune plasma algorithm (IPA), called centrifuge IPA (centIPA). Duan et al. [155] proposed an NN trained by the imperialist competitive algorithm (ICA) and used this hybrid method to solve the problem of global trajectory planning for UAVs avoiding threat areas. In addition, low-altitude flight is also commonly used. Due to the terrain-mask effects, UAVs flying at low altitudes are less likely to be detected by enemy radars and much safer than flying at high altitudes. Duan et al. [156] proposed a UAV 3D trajectory planning method that mixed ACO and DE algorithms to plan a trajectory that both avoided the threat area and minimized fuel consumption. In the process of ant pheromone updating, DE was used to optimize the pheromone trail of the ACO model, which maintained the strong robustness of ACO while accelerating the global convergence speed. Niu et al. [157] proposed an adaptive neighborhood-based search enhanced artificial ecosystem optimizer to plan the minimum flight altitude trajectory for a UAV. It introduced quadratic interpolation to solve the premature convergence problem, thereby improving the capability to find the global optimum in conjunction with the adaptive neighborhood search strategy.

Formation applications of UAVs are also important in swarm shows or carrying out specific tasks. This requires the UAV to maintain a relative distance within a certain range during flight. Due to its non-deterministic polynomial hard (NP-hard) characteristics, collision avoidance constraints, and close formation requirement, UAV formation flight trajectory planning is challenging. For the arrival time coordination requirements of multiple UAVs, Chen et al. [158] used the Dubins trajectory to reduce the dimension of complicated trajectory planning problems and proposed a multi-UAV trajectory planning method based on PSO so that the UAVs in the swarm arrived at the expected location on time. Huang et al. [159] used the Voronoi diagram to model the environment consisting of multiple threat sources and proposed an improved ant colony optimization algorithm (IACO) that redefined the heuristic information function and pheromone update method. This multi-UAV trajectory planning method generated the initial trajectory by IACO and induced the k-degree smoothing method to ensure the smoothness of the trajectories and multi-UAV arrival at the targets simultaneously or with an acceptable time interval. When encountering complex obstacles and narrow passages in unknown environments, the difficulty is greatly increased because it involves formation maintenance and formation changes. This is because the UAV swarm needs to maintain formation, but also needs to change formations when encountering obstacles to pass through narrow passages. To solve this problem, Xing et al. [11] developed a multi-UAV adaptive trajectory planning method by introducing LSTM RNN into the environment perception end of the MATD3 network. The adaptive formation maintaining model and transformation strategy were proposed for the formation cooperation issue and navigating narrow passages, respectively.

In combat tasks, trajectory planning methods usually combine task requirements with threat modeling to build multi-objective optimization models such as target constraints, threat avoidance, and fuel consumption, and mainly generate safe flight trajectories that can avoid enemy defense areas through hybrid intelligent algorithms. Formation flight task research focuses on spatiotemporal coordination issues, solving the problems of coordinated arrival of multiple UAVs, maintaining relative spacing, synchronous formation flight, and obstacle avoidance. These studies have shown that trajectory planning methods based on intelligent algorithms can effectively cope with high-dimensional constraints, multi-source threats, and dynamic environmental changes. However, there are still problems such as strong dependence on perception accuracy and communication stability, and limited scalability of large-scale multi-UAV systems.

5. Trends and Challenges

Trajectory planning is a core scientific problem in the application of UAVs in complex fields. Using intelligent algorithms to plan trajectories for UAVs has been widely studied and has been partially applied. However, UAV trajectory planning in complex environments continues to present several challenges and constraints that require further investigation and refinement.

In many cases, the environmental information used for trajectory planning is incomplete or uncertain, containing many unpredictable dynamic factors that evolve. One common approach is to plan the trajectory for UAVs online, which has become a current research trend. The update rate is one of the important requirements for online trajectory planning, which will affect the quality of the planned trajectory and even flight safety. To address this challenge, finding more efficient intelligent algorithms or representing the trajectory with fewer parameters is necessary, thereby reducing the computational time required for planning the trajectory.
Intelligent algorithms are one of the novel solutions to plan trajectories for UAVs, and they have unique advantages in complex environments. However, significant challenges persist and cannot be overlooked. EAs and SI algorithms have the advantages of strong adaptability, good robustness, and strong parallelism, but they may face the challenge of falling into local optima, which may mislead the trajectory planning of the UAV. Despite numerous attempts to address this issue, further research is required to ensure global convergence throughout the optimization process. The characteristics of DL make it highly adaptable in complex environments, while its unpredictability and the potential for erroneous decisions may result in safety and reliability issues. Furthermore, when applying RL- or DRL-based trajectory planning methods in practical scenarios, the environment rarely provides useful reward feedback, typically indicating either partial or full task completion. The sparse reward problem may significantly slow the convergence of the algorithm or even fail to find a feasible trajectory within an acceptable time.
To cope with complex environments, UAV trajectory planning methods based on intelligent algorithms are often expected to balance coordination, accuracy, collision avoidance, and efficiency. Current research usually focuses on developing UAV trajectory planning methods tailored to specific scenarios or requirements. In transportation applications, factors such as terrain and obstacles are generally considered. Coordinated delivery between UAVs and ground vehicles and multi-UAV cable-suspended load transportation are gradually becoming a research trend. Still, the current related research based on intelligent algorithms usually models the load as a point mass, which lacks general applicability. The main research objectives of UAV trajectory planning in inspection applications include minimizing search time and maximizing information collection. Consequently, developing real-time trajectory planning methods capable of processing large volumes of information is a direction that warrants further research. Existing UAV trajectory planning methods based on intelligent algorithms each demonstrate unique advantages. However, approaches that achieve a balance across multiple scenarios may be more practical for real-world applications.

6. Conclusions

Pursuing efficient, real-time, and scalable trajectory planning methods for complex environments is a prominent issue in UAV research. Recent articles on intelligent algorithm-based UAV trajectory planning were reviewed. The increasing trend in the number of related research proves that intelligent algorithms have become one of the popular research directions in UAV trajectory planning. Current approaches are generally sufficiently practical in avoiding falling into local optima, circumventing obstacles, and minimizing computational costs. The characteristics of ML learning action strategies are well-suited for online trajectory planning. In particular, DRL makes up for the capacity to handle large state-action spaces, is ideal for complex environments, and has been widely used in this area. The mechanism of EA and SI algorithms determines that they are adaptive and easy to parallelize. They are suitable for solving complex, large-scale, nonlinear, and non-differentiable optimization problems in UAV trajectory planning and can search the solution space in parallel in multiple directions. The UAV trajectory planning method based on intelligent algorithms has fully considered factors such as terrain, obstacles, and meteorology in its application. Online trajectory planning leveraging intelligent algorithms adaptable to diverse scenarios represents a critical direction in the development of UAVs. Key contents include addressing the unpredictability of ML and mitigating the problem of EAs and SI algorithms falling into local optima. In applications, multi-agent coordination and the interactions between UAVs and the environment are also research focuses. These advancements will provide effective solutions for the applications of transportation, inspection, and other domains.

Author Contributions

Conceptualization, Z.C.; formal analysis, Z.C.; writing—original draft preparation, Z.C.; writing—review and editing, L.Z.; visualization, J.S. and J.Y.; supervision, L.Z.; funding acquisition, L.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the Natural Science Foundation of China (No. 12072027), the Key Research and Development Program of Henan Province (No. 241111222000), the Henan Key Laboratory of General Aviation Technology (No. ZHKF-230201), and the Funding for the Open Research Project of the Rotor Aerodynamics Key Laboratory (No. RAL20200101).

Conflicts of Interest

The authors declare no conflicts of interest.

References

Nayeem, G.M.; Fan, M.; Daiyan, G.M. Adaptive Q-Learning Grey Wolf Optimizer for UAV Path Planning. Drones 2025, 9, 246. [Google Scholar] [CrossRef]
Li, B.; Liu, H.; Ahn, C.K.; Gong, W. Optimized Intelligent Tracking Control for a Quadrotor Unmanned Aerial Vehicle with Actuator Failures. Aerosp. Sci. Technol. 2024, 144, 108803. [Google Scholar] [CrossRef]
Zhang, S.; Zhang, X.; Dong, Q.; Wang, Z.; Xi, H.; Yuan, J. Fsmp: A Frontier-Sampling-Mixed Planner for Fast Autonomous Exploration of Complex and Large 3-D Environments. IEEE Trans. Instrum. Meas. 2025, 74, 1–14. [Google Scholar] [CrossRef]
Wang, M.; Zhang, D.; Wang, B.; Li, L. Dynamic Trajectory Planning for Multi-UAV Multi-Mission Operations Using a Hybrid Strategy. IEEE Trans. Aerosp. Electron. Syst. 2025, 61, 7369–7386. [Google Scholar] [CrossRef]
Ma, Y.; Zhang, Z.; Yao, M.; Fan, G. A Self-Adaptive Improved Slime Mold Algorithm for Multi-UAV Path Planning. Drones 2025, 9, 219. [Google Scholar] [CrossRef]
Xie, R.; Meng, Z.; Wang, L.; Li, H.; Wang, K.; Wu, Z. Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments. IEEE Access 2021, 9, 24884–24900. [Google Scholar] [CrossRef]
Dentler, J.; Rosalie, M.; Danoy, G.; Bouvry, P.; Kannan, S.; Olivares-Mendez, M.A.; Voos, H. Collision Avoidance Effects on the Mobility of a UAV Swarm Using Chaotic Ant Colony with Model Predictive Control. J. Intell. Robot. Syst. 2019, 93, 227–243. [Google Scholar] [CrossRef]
Sushma, M.B.; Mashhoodi, B.; Tan, W.; Liujiang, K.; Xu, Q. Spatial Drone Path Planning: A Systematic Review of Parameters and Algorithms. J. Transp. Geogr. 2025, 125, 104209. [Google Scholar]
Merei, A.; McHeick, H.; Ghaddar, A.; Rebaine, D. A Survey on Obstacle Detection and Avoidance Methods for UAVs. Drones 2025, 9, 203. [Google Scholar] [CrossRef]
Tang, Y.; Huang, K.; Tan, Z.; Fang, M.; Huang, H. Multi-Subswarm Cooperative Particle Swarm Optimization Algorithm and Its Application. Inf. Sci. 2024, 677, 120887. [Google Scholar] [CrossRef]
Xing, X.; Zhou, Z.; Li, Y.; Xiao, B.; Xun, Y. Multi-UAV Adaptive Cooperative Formation Trajectory Planning Based on an Improved Matd3 Algorithm of Deep Reinforcement Learning. IEEE Trans. Veh. Technol. 2024, 73, 12484–12499. [Google Scholar] [CrossRef]
Shao, J.; Cheng, J.; Xia, B.; Yang, K.; Wei, H. A Novel Service System for Long-Distance Drone Delivery Using the “Ant Colony+ A*” Algorithm. IEEE Syst. J. 2020, 15, 3348–3359. [Google Scholar] [CrossRef]
Perez-Carabaza, S.; Besada-Portas, E.; Lopez-Orozco, J.A.; de la Cruz, J.M. Ant Colony Optimization for Multi-UAV Minimum Time Search in Uncertain Domains. Appl. Soft Comput. 2018, 62, 789–806. [Google Scholar] [CrossRef]
Zhang, H.; Xin, B.; Dou, L.-H.; Chen, J.; Hirota, K. A Review of Cooperative Path Planning of an Unmanned Aerial Vehicle Group. Front. Inf. Technol. Electron. Eng. 2020, 21, 1671–1694. [Google Scholar]
Falomir, E.; Chaumette, S.; Guerrini, G. A Mobility Model Based on Improved Artificial Potential Fields for Swarms of UAVs. In Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, Madrid, Spain, 1–5 October 2018; pp. 8499–8504. [Google Scholar]
Natarajan, R.; Choset, H.; Likhachev, M. Interleaving Graph Search and Trajectory Optimization for Aggressive Quadrotor Flight. IEEE Robot. Autom. Lett. 2021, 6, 5357–5364. [Google Scholar] [CrossRef]
Chamseddine, A.; Zhang, Y.; Rabbath, C.A.; Join, C.; Theilliol, D. Flatness-Based Trajectory Planning/Replanning for a Quadrotor Unmanned Aerial Vehicle. IEEE Trans. Aerosp. Electron. Syst. 2012, 48, 2832–2848. [Google Scholar] [CrossRef]
Tarjan, R. Depth-First Search and Linear Graph Algorithms. SIAM J. Comput. 1972, 1, 146–160. [Google Scholar] [CrossRef]
Mansard, N.; DelPrete, A.; Geisert, M.; Tonneau, S.; Stasse, O. Using a Memory of Motion to Efficiently Warm-Start a Nonlinear Predictive Controller. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation, Brisbane, Australia, 21–25 May 2018; pp. 2986–2993. [Google Scholar]
Kuwata, Y.; How, J.P. Cooperative Distributed Robust Trajectory Optimization Using Receding Horizon Milp. IEEE Trans. Control Syst. Technol. 2010, 19, 423–431. [Google Scholar] [CrossRef]
Maldonado-Romo, J.; Aldape-Pérez, M. Interoperability between Real and Virtual Environments Connected by a Gan for the Path-Planning Problem. Appl. Sci. 2021, 11, 10445. [Google Scholar] [CrossRef]
Zhou, Y.; Rao, B.; Wang, W. UAV Swarm Intelligence: Recent Advances and Future Trends. IEEE Access 2020, 8, 183856–183878. [Google Scholar] [CrossRef]
Glasius, R.; Komoda, A.; Gielen, S.C. Neural Network Dynamics for Path Planning and Obstacle Avoidance. Neural Netw. 1995, 8, 125–133. [Google Scholar] [CrossRef]
Oreski, S.; Oreski, G. Genetic Algorithm-Based Heuristic for Feature Selection in Credit Risk Assessment. Expert Syst. Appl. 2014, 41, 2052–2064. [Google Scholar] [CrossRef]
Xu, X.; Rong, H.; Trovati, M.; Liptrott, M.; Bessis, N. Cs-Pso: Chaotic Particle Swarm Optimization Algorithm for Solving Combinatorial Optimization Problems. Soft Comput. 2018, 22, 783–795. [Google Scholar] [CrossRef]
Lin, N.; Tang, J.; Li, X.; Zhao, L. A Novel Improved Bat Algorithm in UAV Path Planning. Comput. Mater. Contin. 2019, 61, 323–344. [Google Scholar] [CrossRef]
Wu, J.; Wang, H.; Li, N.; Yao, P.; Huang, Y.; Yang, H. Path Planning for Solar-Powered UAV in Urban Environment. Neurocomputing 2018, 275, 2055–2065. [Google Scholar] [CrossRef]
Yang, Y.; Xiong, X.; Yan, Y. UAV Formation Trajectory Planning Algorithms: A Review. Drones 2023, 7, 62. [Google Scholar] [CrossRef]
Chai, S.; Yang, Z.; Huang, J.; Li, X.; Zhao, Y.; Zhou, D. Cooperative UAV Search Strategy Based on Dmpc-Aaco Algorithm in Restricted Communication Scenarios. Def. Technol. 2024, 31, 295–311. [Google Scholar] [CrossRef]
Zhang, J.; Cui, Y.; Ren, J. Dynamic Mission Planning Algorithm for UAV Formation in Battlefield Environment. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 3750–3765. [Google Scholar] [CrossRef]
Liu, X.; Li, G.; Yang, H.; Zhang, N.; Wang, L.; Shao, P. Agricultural UAV Trajectory Planning by Incorporating Multi-Mechanism Improved Grey Wolf Optimization Algorithm. Expert Syst. Appl. 2023, 233, 120946. [Google Scholar] [CrossRef]
Basiri, A.; Mariani, V.; Silano, G.; Aatif, M.; Iannelli, L.; Glielmo, L. A Survey on the Application of Path-Planning Algorithms for Multi-Rotor UAVs in Precision Agriculture. J. Navig. 2022, 75, 364–383. [Google Scholar] [CrossRef]
Luo, Y.; Yu, X.; Yang, D.; Zhou, B. A Survey of Intelligent Transmission Line Inspection Based on Unmanned Aerial Vehicle. Artif. Intell. Rev. 2023, 56, 173–201. [Google Scholar]
Li, X.; Lu, X.; Chen, W.; Ge, D.; Zhu, J. Research on UAVs Reconnaissance Task Allocation Method Based on Communication Preservation. IEEE Trans. Consum. Electron. 2024, 70, 684–695. [Google Scholar] [CrossRef]
Wan, Y.; Zhong, Y.; Ma, A.; Zhang, L. An Accurate UAV 3-D Path Planning Method for Disaster Emergency Response Based on an Improved Multiobjective Swarm Intelligence Algorithm. IEEE Trans. Cybern. 2022, 53, 2658–2671. [Google Scholar] [CrossRef]
Puente-Castro, A.; Rivero, D.; Pazos, A.; Fernandez-Blanco, E. A Review of Artificial Intelligence Applied to Path Planning in UAV Swarms. Neural Comput. Appl. 2022, 34, 153–170. [Google Scholar] [CrossRef]
Yahia, H.S.; Mohammed, A.S. Path Planning Optimization in Unmanned Aerial Vehicles Using Meta-Heuristic Algorithms: A Systematic Review. Environ. Monit. Assess. 2023, 195, 30. [Google Scholar] [CrossRef] [PubMed]
Hooshyar, M.; Huang, Y.-M. Meta-Heuristic Algorithms in UAV Path Planning Optimization: A Systematic Review (2018–2022). Drones 2023, 7, 687. [Google Scholar] [CrossRef]
Poudel, S.; Arafat, M.Y.; Moh, S. Bio-Inspired Optimization-Based Path Planning Algorithms in Unmanned Aerial Vehicles: A Survey. Sensors 2023, 23, 3051. [Google Scholar] [CrossRef]
Sharma, A.; Shoval, S.; Sharma, A.; Pandey, J.K. Path Planning for Multiple Targets Interception by the Swarm of UAVs Based on Swarm Intelligence Algorithms: A Review. IETE Tech. Rev. 2022, 39, 675–697. [Google Scholar] [CrossRef]
Tang, J.; Duan, H.; Lao, S. Swarm Intelligence Algorithms for Multiple Unmanned Aerial Vehicles Collaboration: A Comprehensive Review. Artif. Intell. Rev. 2023, 56, 4295–4327. [Google Scholar] [CrossRef]
Pessanha Santos, N. Fixed-Wing UAV Pose Estimation Using a Self-Organizing Map and Deep Learning. Robotics 2024, 13, 114. [Google Scholar] [CrossRef]
Song, C.; Liu, L.; Wang, G.; Han, J.; Zhang, T.; Lan, Y. Particle Deposition Distribution of Multi-Rotor UAV-Based Fertilizer Spreader under Different Height and Speed Parameters. Drones 2023, 7, 425. [Google Scholar] [CrossRef]
Dudnik, V. Determination of the Tail Unit Parameters of Ultralight Manned and Unmanned Helicopters at the Preliminary Design Stage. Aerospace 2025, 12, 33. [Google Scholar] [CrossRef]
Sanchez-Rivera, L.M.; Lozano, R.; Arias-Montano, A. Development, Modeling and Control of a Dual Tilt-Wing UAV in Vertical Flight. Drones 2020, 4, 71. [Google Scholar] [CrossRef]
Ma, Y.; Luo, R.; Meng, X.; Cao, Y.; Yu, S.; Sun, H.; Ding, W. Path Planning for Searching Submarine with Cooperative Coverage of Fixed-Wing UAVs Cluster in Complex Boundary Sea Area. IEEE Sens. J. 2023, 23, 30070–30083. [Google Scholar] [CrossRef]
Freitas, E.J.R.; Cohen, M.W.; Neto, A.A.; Guimarães, F.G.; Pimenta, L.C.A. DE3D-NURBS: A Differential Evolution-Based 3D Path-Planner Integrating Kinematic Constraints and Obstacle Avoidance. Knowl.-Based Syst. 2024, 300, 112084. [Google Scholar] [CrossRef]
Chung, S.J.; Paranjape, A.A.; Dames, P.; Shen, S.; Kumar, V. A Survey on Aerial Swarm Robotics. IEEE Trans. Robot. 2018, 34, 837–855. [Google Scholar] [CrossRef]
Kanth Hari, S.K.; Rathinam, S.; Darbha, S.; Kalyanam, K.; Manyam, S.G.; Casbeer, D. Optimal UAV Route Planning for Persistent Monitoring Missions. IEEE Trans. Robot. 2021, 37, 550–566. [Google Scholar] [CrossRef]
Xu, J.; Huang, Y.; Li, H.; Fang, R.; Liu, Y.; Zhao, R. Spatio-Temporal Coding-Based Helicopter Trajectory Planning for Pulsed Neural Membrane System. Comput. Intell. Neurosci. 2022, 2022, 1787013. [Google Scholar] [CrossRef]
Choudhury, S.; Scherer, S.; Singh, S. RRT*-AR: Sampling-Based Alternate Routes Planning with Applications to Autonomous Emergency Landing of a Helicopter. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013; pp. 3947–3952. [Google Scholar]
Han, Z.; Chen, M.; Shao, S.; Zhou, T.; Wu, Q. Path Planning of Unmanned Autonomous Helicopter Based on Hybrid Satisficing Decision-Enhanced Swarm Intelligence Algorithm. IEEE Trans. Cogn. Dev. Syst. 2023, 15, 1371–1385. [Google Scholar] [CrossRef]
Ren, B.W.; Wang, B.; Zhang, X.Y.; Zhao, Q.J. Trajectory Planning of Tilt-Rotor UAV Based on Hybrid Algorithm. In Proceedings of the Asia-Pacific International Symposium on Aerospace Technology (APISAT), Jeju, Republic of Korea, 15–17 November 2021; pp. 619–631. [Google Scholar]
Valente, J.; Del Cerro, J.; Barrientos, A.; Sanz, D. Aerial Coverage Optimization in Precision Agriculture Management: A Musical Harmony Inspired Approach. Comput. Electron. Agric. 2013, 99, 153–159. [Google Scholar] [CrossRef]
Tang, J.; Liu, G.; Pan, Q. A Review on Representative Swarm Intelligence Algorithms for Solving Optimization Problems: Applications and Trends. IEEE/CAA J. Autom. Sin. 2021, 8, 1627–1643. [Google Scholar] [CrossRef]
Zhang, J.; Ning, X.; Ma, S.; Tang, R. Research on Global Trajectory Planning for UAV Based on the Information Interaction and Aging Mechanism Wolfpack Algorithm. Expert Syst. Appl. 2025, 273, 126867. [Google Scholar] [CrossRef]
Lin, X.; Wang, C.; Wang, K.; Li, M.; Yu, X. Trajectory Planning for Unmanned Aerial Vehicles in Complicated Urban Environments: A Control Network Approach. Transp. Res. Part C Emerg. Technol. 2021, 128, 103120. [Google Scholar] [CrossRef]
Li, B.; Yang, Z.-p.; Chen, D.-q.; Liang, S.-y.; Ma, H. Maneuvering Target Tracking of UAV Based on Mn-Ddpg and Transfer Learning. Def. Technol. 2021, 17, 457–466. [Google Scholar] [CrossRef]
Li, R.; Xin, B. Autonomous Navigation of Quadrotors in Dynamic Complex Environments. IEEE Trans. Ind. Electron. 2025, 72, 2790–2800. [Google Scholar] [CrossRef]
Huang, Y.; Wang, H.; Yao, P. Energy-Optimal Path Planning for Solar-Powered UAV with Tracking Moving Ground Target. Aerosp. Sci. Technol. 2016, 53, 241–251. [Google Scholar] [CrossRef]
Chen, Q.; Zhao, Q.; Zou, Z. Threat-Oriented Collaborative Path Planning of Unmanned Reconnaissance Mission for the Target Group. Aerospace 2022, 9, 577. [Google Scholar] [CrossRef]
Hinton, G.E.; Salakhutdinov, R.R. Reducing the Dimensionality of Data with Neural Networks. Science 2006, 313, 504–507. [Google Scholar] [CrossRef] [PubMed]
Antsaklis, P.J. Neural Networks for Control Systems. IEEE Trans. Neural Netw. 1990, 1, 242–244. [Google Scholar] [CrossRef]
Pandey, A.; Jain, K. An Intelligent System for Crop Identification and Classification from UAV Images Using Conjugated Dense Convolutional Neural Network. Comput. Electron. Agric. 2022, 192, 106543. [Google Scholar] [CrossRef]
Liu, Y.; Wang, H.; Fan, J.; Wu, J.; Wu, T. Control-Oriented UAV Highly Feasible Trajectory Planning: A Deep Learning Method. Aerosp. Sci. Technol. 2021, 110, 106435. [Google Scholar] [CrossRef]
Shaffer, J.A.; Xu, H. Centralized and Decentralized Application of Neural Networks Learning Optimized Solutions of Distributed Agents. In Proceedings of the Micro- and Nanotechnology Sensors, Systems, and Applications XI, Baltimore, MD, USA, 14–18 April 2019; pp. 395–408. [Google Scholar]
Khan, A.; Kumar, V.; Ribeiro, A. Large Scale Distributed Collaborative Unlabeled Motion Planning with Graph Policy Gradients. IEEE Robot. Autom. Lett. 2021, 6, 5340–5347. [Google Scholar] [CrossRef]
Tsubaki, M.; Tomii, K.; Sese, J. Compound–Protein Interaction Prediction with End-to-End Learning of Neural Networks for Graphs and Sequences. Bioinformatics 2019, 35, 309–318. [Google Scholar] [CrossRef] [PubMed]
Li, Q.; Lin, W.; Liu, Z.; Prorok, A. Message-Aware Graph Attention Networks for Large-Scale Multi-Robot Path Planning. IEEE Robot. Autom. Lett. 2021, 6, 5533–5540. [Google Scholar] [CrossRef]
Palossi, D.; Conti, F.; Benini, L. An Open Source and Open Hardware Deep Learning-Powered Visual Navigation Engine for Autonomous Nano-UAVs. In Proceedings of the 2019 15th International Conference on Distributed Computing in Sensor Systems (DCOSS), Santorini Island, Greece, 29–31 May 2019; pp. 604–611. [Google Scholar]
Dai, X.; Mao, Y.; Huang, T.; Qin, N.; Huang, D.; Li, Y. Automatic Obstacle Avoidance of Quadrotor UAV Via CNN-Based Learning. Neurocomputing 2020, 402, 346–358. [Google Scholar] [CrossRef]
Kalinov, I.; Petrovsky, A.; Ilin, V.; Pristanskiy, E.; Kurenkov, M.; Ramzhaev, V.; Idrisov, I.; Tsetserukou, D. Warevision: CNN Barcode Detection-Based UAV Trajectory Optimization for Autonomous Warehouse Stocktaking. IEEE Robot. Autom. Lett. 2020, 5, 6647–6653. [Google Scholar] [CrossRef]
Chen, N.; Fan, J.; Yuan, J.; Zheng, E. Obtpn: A Vision-Based Network for UAV Geo-Localization in Multi-Altitude Environments. Drones 2025, 9, 33. [Google Scholar] [CrossRef]
Wang, J.; Guan, X.; Sun, Z.; Shen, T.; Huang, D.; Liu, F.; Cui, H. Omega: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments Via State Space Model. IEEE Robot. Autom. Lett. 2025, 10, 1066–1073. [Google Scholar] [CrossRef]
Wang, H.; Tan, A.H.; Nejat, G. Navformer: A Transformer Architecture for Robot Target-Driven Navigation in Unknown and Dynamic Environments. IEEE Robot. Autom. Lett. 2024, 9, 6808–6815. [Google Scholar] [CrossRef]
Zhu, B.; Bedeer, E.; Nguyen, H.H.; Barton, R.; Gao, Z. UAV Trajectory Planning for Aoi-Minimal Data Collection in UAV-Aided Iot Networks by Transformer. IEEE Trans. Wirel. Commun. 2023, 22, 1343–1358. [Google Scholar] [CrossRef]
Yu, C.-H.; Tsai, J.; Chang, Y.-T. Intelligent Path Planning for UAV Patrolling in Dynamic Environments Based on the Transformer Architecture. Electronics 2024, 13, 4716. [Google Scholar] [CrossRef]
Dong, L.; Jiang, F.; Peng, Y. Attention-Based UAV Trajectory Optimization for Wireless Power Transfer-Assisted Iot Systems. IEEE Trans. Ind. Electron. 2025, 72, 8463–8471. [Google Scholar] [CrossRef]
Wang, H.; Liu, C.H.; Yang, H.; Wang, G.; Leung, K.K. Ensuring Threshold Aoi for UAV-Assisted Mobile Crowdsensing by Multi-Agent Deep Reinforcement Learning with Transformer. IEEE/ACM Trans. Netw. 2024, 32, 566–581. [Google Scholar] [CrossRef]
Henderson, P.; Islam, R.; Bachman, P.; Pineau, J.; Precup, D.; Meger, D. Deep Reinforcement Learning That Matters. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
Zhao, X.; Ding, S.; An, Y.; Jia, W. Asynchronous Reinforcement Learning Algorithms for Solving Discrete Space Path Planning Problems. Appl. Intell. 2018, 48, 4889–4904. [Google Scholar] [CrossRef]
Zhang, B.; Mao, Z.; Liu, W.; Liu, J. Geometric Reinforcement Learning for Path Planning of UAVs. J. Intell. Robot. Syst. 2015, 77, 391–409. [Google Scholar] [CrossRef]
Cui, Z.; Wang, Y. UAV Path Planning Based on Multi-Layer Reinforcement Learning Technique. IEEE Access 2021, 9, 59486–59497. [Google Scholar] [CrossRef]
Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G. Human-Level Control through Deep Reinforcement Learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef]
He, L.; Aouf, N.; Song, B. Explainable Deep Reinforcement Learning for UAV Autonomous Path Planning. Aerosp. Sci. Technol. 2021, 118, 107052. [Google Scholar] [CrossRef]
Yan, C.; Xiang, X.; Wang, C. Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments. J. Intell. Robot. Syst. 2020, 98, 297–309. [Google Scholar] [CrossRef]
Wang, X.; Gursoy, M.C.; Erpek, T.; Sagduyu, Y.E. Learning-Based UAV Path Planning for Data Collection with Integrated Collision Avoidance. IEEE Internet Things J. 2022, 9, 16663–16676. [Google Scholar] [CrossRef]
Bo, L.; Zhang, T.; Zhang, H.; Hong, J.; Liu, M.; Zhang, C.; Liu, B. 3D UAV Path Planning in Unknown Environment: A Transfer Reinforcement Learning Method Based on Low-Rank Adaption. Adv. Eng. Inform. 2024, 62, 102920. [Google Scholar] [CrossRef]
Fontanesi, G.; Zhu, A.; Arvaneh, M.; Ahmadi, H. A Transfer Learning Approach for UAV Path Design with Connectivity Outage Constraint. IEEE Internet Things J. 2023, 10, 4998–5012. [Google Scholar] [CrossRef]
Pérez-Cutiño, M.A.; Rodríguez, F.; Pascual, L.D.; Díaz-Báñez, J.M. Ornithopter Trajectory Optimization with Neural Networks and Random Forest. J. Intell. Robot. Syst. 2022, 105, 17. [Google Scholar] [CrossRef]
Hönig, W.; Preiss, J.A.; Kumar, T.K.S.; Sukhatme, G.S.; Ayanian, N. Trajectory Planning for Quadrotor Swarms. IEEE Trans. Robot. 2018, 34, 856–869. [Google Scholar] [CrossRef]
Pan, H.; Zahmatkesh, M.; Rekabi-Bana, F.; Arvin, F.; Hu, J. T-Star: Time-Optimal Swarm Trajectory Planning for Quadrotor Unmanned Aerial Vehicles. IEEE Trans. Intell. Transp. Syst. 2025, 1–16. [Google Scholar] [CrossRef]
Bäck, T.; Schwefel, H.-P. An Overview of Evolutionary Algorithms for Parameter Optimization. Evol. Comput. 1993, 1, 1–23. [Google Scholar] [CrossRef]
Holland, J. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence; University of Michigan Press: Ann Arbor, MI, USA, 1975. [Google Scholar]
Roberge, V.; Tarbouchi, M.; Labonté, G. Comparison of Parallel Genetic Algorithm and Particle Swarm Optimization for Real-Time UAV Path Planning. IEEE Trans. Ind. Inform. 2012, 9, 132–141. [Google Scholar] [CrossRef]
Cao, Y.; Wei, W.; Bai, Y.; Qiao, H. Multi-Base Multi-UAV Cooperative Reconnaissance Path Planning with Genetic Algorithm. Clust. Comput. 2019, 22, 5175–5184. [Google Scholar] [CrossRef]
Ergezer, H.; Leblebicioglu, K. Path Planning for UAVs for Maximum Information Collection. IEEE Trans. Aerosp. Electron. Syst. 2013, 49, 502–520. [Google Scholar] [CrossRef]
Sahingoz, O.K. Generation of Bezier Curve-Based Flyable Trajectories for Multi-UAV Systems with Parallel Genetic Algorithm. J. Intell. Robot. Syst. 2014, 74, 499–511. [Google Scholar] [CrossRef]
Shorakaei, H.; Vahdani, M.; Imani, B.; Gholami, A. Optimal Cooperative Path Planning of Unmanned Aerial Vehicles by a Parallel Genetic Algorithm. Robotica 2016, 34, 823–836. [Google Scholar] [CrossRef]
Volkan Pehlivanoglu, Y.; Baysal, O.; Hacioglu, A. Path Planning for Autonomous UAV Via Vibrational Genetic Algorithm. Aircr. Eng. Aerosp. Technol. 2007, 79, 352–359. [Google Scholar] [CrossRef]
Pehlivanoglu, Y.V. A New Vibrational Genetic Algorithm Enhanced with a Voronoi Diagram for Path Planning of Autonomous UAV. Aerosp. Sci. Technol. 2012, 16, 47–55. [Google Scholar] [CrossRef]
Storn, R.; Price, K. Differential Evolution–a Simple and Efficient Heuristic for Global Optimization over Continuous Spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
Slowik, A. Application of an Adaptive Differential Evolution Algorithm with Multiple Trial Vectors to Artificial Neural Network Training. IEEE Trans. Ind. Electron. 2010, 58, 3160–3167. [Google Scholar] [CrossRef]
Brintaki, A.N.; Nikolos, I.K. Coordinated UAV Path Planning Using Differential Evolution. Oper. Res. 2005, 5, 487–502. [Google Scholar] [CrossRef]
Adhikari, D.; Kim, E.; Reza, H. A Fuzzy Adaptive Differential Evolution for Multi-Objective 3D UAV Path Optimization. In Proceedings of the 2017 IEEE Congress on Evolutionary Computation (CEC), Donostia-San Sebastian, Spain, 5–8 June 2017; pp. 2258–2265. [Google Scholar]
Yu, X.; Li, C.; Zhou, J. A Constrained Differential Evolution Algorithm to Solve UAV Path Planning in Disaster Scenarios. Knowl.-Based Syst. 2020, 204, 106209. [Google Scholar] [CrossRef]
Wu, G.; Mallipeddi, R.; Suganthan, P.N.; Wang, R.; Chen, H. Differential Evolution with Multi-Population Based Ensemble of Mutation Strategies. Inf. Sci. 2016, 329, 329–345. [Google Scholar] [CrossRef]
Wu, G.; Shen, X.; Li, H.; Chen, H.; Lin, A.; Suganthan, P.N. Ensemble of Differential Evolution Variants. Inf. Sci. 2018, 423, 172–186. [Google Scholar] [CrossRef]
Gui, L.; Xia, X.; Yu, F.; Wu, H.; Wu, R.; Wei, B.; Zhang, Y.; Li, X.; He, G. A Multi-Role Based Differential Evolution. Swarm Evol. Comput. 2019, 50, 100508. [Google Scholar] [CrossRef]
Chai, X.; Zheng, Z.; Xiao, J.; Yan, L.; Qu, B.; Wen, P.; Wang, H.; Zhou, Y.; Sun, H. Multi-Strategy Fusion Differential Evolution Algorithm for UAV Path Planning in Complex Environment. Aerosp. Sci. Technol. 2022, 121, 107287. [Google Scholar] [CrossRef]
Koza, J.R. Genetic Programming: On the Programming of Computers by Means of Natural Selection (Complex Adaptive Systems); MIT Press: Cambridge, MA, USA, 1992. [Google Scholar]
Yang, X.; Cai, M.; Li, J. Path Planning for Unmanned Aerial Vehicles Based on Genetic Programming. In Proceedings of the 2016 Chinese Control and Decision Conference (CCDC), Yinchuan, China, 28–30 May 2016; pp. 717–722. [Google Scholar]
Hohmann, N.; Bujny, M.; Adamy, J.; Olhofer, M. Hybrid Evolutionary Approach to Multi-Objective Path Planning for UAVs. In Proceedings of the 2021 IEEE Symposium Series on Computational Intelligence (SSCI), Orlando, FL, USA, 5–7 December 2021; pp. 1–8. [Google Scholar]
Hohmann, N.; Bujny, M.; Adamy, J.; Olhofer, M. Multi-Objective 3D Path Planning for UAVs in Large-Scale Urban Scenarios. In Proceedings of the 2022 IEEE Congress on Evolutionary Computation (CEC), Padua, Italy, 18–23 July 2022; pp. 1–8. [Google Scholar]
Duan, H.; Zhang, X. Phase Transition of Vortexlike Self-Propelled Particles Induced by a Hostile Particle. Phys. Rev. E 2015, 92, 012701. [Google Scholar] [CrossRef]
Long, N.K.; Sammut, K.; Sgarioto, D.; Garratt, M.; Abbass, H.A. A Comprehensive Review of Shepherding as a Bio-Inspired Swarm-Robotics Guidance Approach. IEEE Trans. Emerg. Top. Comput. Intell. 2020, 4, 523–537. [Google Scholar] [CrossRef]
Shi, Y. Particle Swarm Optimization: Developments, Applications and Resources. In Proceedings of the 2001 Congress on Evolutionary Computation, Seoul, Republic of Korea, 27–30 May 2001; pp. 81–86. [Google Scholar]
Liu, Y.; Zhang, X.; Guan, X.; Delahaye, D. Adaptive Sensitivity Decision Based Path Planning Algorithm for Unmanned Aerial Vehicle with Improved Particle Swarm Optimization. Aerosp. Sci. Technol. 2016, 58, 92–102. [Google Scholar] [CrossRef]
Phung, M.D.; Ha, Q.P. Safety-Enhanced UAV Path Planning with Spherical Vector-Based Particle Swarm Optimization. Appl. Soft Comput. 2021, 107, 107376. [Google Scholar] [CrossRef]
Pérez-Carabaza, S.; Scherer, J.; Rinner, B.; López-Orozco, J.A.; Besada-Portas, E. UAV Trajectory Optimization for Minimum Time Search with Communication Constraints and Collision Avoidance. Eng. Appl. Artif. Intell. 2019, 85, 357–371. [Google Scholar] [CrossRef]
Zhang, C.; Hu, C.; Feng, J.; Liu, Z.; Zhou, Y.; Zhang, Z. A Self-Heuristic Ant-Based Method for Path Planning of Unmanned Aerial Vehicle in Complex 3-D Space with Dense U-Type Obstacles. IEEE Access 2019, 7, 150775–150791. [Google Scholar] [CrossRef]
Hunter III, J.E.; Gannon, T.W.; Richardson, R.J.; Yelverton, F.H.; Leon, R.G. Integration of Remote-Weed Mapping and an Autonomous Spraying Unmanned Aerial Vehicle for Site-Specific Weed Management. Pest Manag. Sci. 2020, 76, 1386–1392. [Google Scholar] [CrossRef]
Akshya, J.; Priyadarsini, P. Graph-Based Path Planning for Intelligent UAVs in Area Coverage Applications. J. Intell. Fuzzy Syst. 2020, 39, 8191–8203. [Google Scholar] [CrossRef]
Rosalie, M.; Danoy, G.; Chaumette, S.; Bouvry, P. Chaos-Enhanced Mobility Models for Multilevel Swarms of UAVs. Swarm Evol. Comput. 2018, 41, 36–48. [Google Scholar] [CrossRef]
Li, F.; Fan, X.; Hou, Z. A Firefly Algorithm with Self-Adaptive Population Size for Global Path Planning of Mobile Robot. IEEE Access 2020, 8, 168951–168964. [Google Scholar] [CrossRef]
Cheng, L.; Zhong, L.; Zhang, X.; Xing, J. A Staged Adaptive Firefly Algorithm for UAV Charging Planning in Wireless Sensor Networks. Comput. Commun. 2020, 161, 132–141. [Google Scholar] [CrossRef]
Wang, G.-G.; Chu, H.E.; Mirjalili, S. Three-Dimensional Path Planning for Ucav Using an Improved Bat Algorithm. Aerosp. Sci. Technol. 2016, 49, 231–238. [Google Scholar] [CrossRef]
Han, Z.; Chen, M.; Shao, S.; Wu, Q. Improved Artificial Bee Colony Algorithm-Based Path Planning of Unmanned Autonomous Helicopter Using Multi-Strategy Evolutionary Learning. Aerosp. Sci. Technol. 2022, 122, 107374. [Google Scholar] [CrossRef]
Wang, B.H.; Wang, D.B.; Ali, Z.A. A Cauchy Mutant Pigeon-Inspired Optimization–Based Multi-Unmanned Aerial Vehicle Path Planning Method. Meas. Control 2020, 53, 83–92. [Google Scholar] [CrossRef]
Dai, Y.; Yu, J.; Zhang, C.; Zhan, B.; Zheng, X. A Novel Whale Optimization Algorithm of Path Planning Strategy for Mobile Robots. Appl. Intell. 2023, 53, 10843–10857. [Google Scholar] [CrossRef]
Jamshidi, V.; Nekoukar, V.; Refan, M.H. Real Time UAV Path Planning by Parallel Grey Wolf Optimization with Align Coefficient on Can Bus. Clust. Comput. 2021, 24, 2495–2509. [Google Scholar] [CrossRef]
Dewangan, R.K.; Shukla, A.; Godfrey, W.W. Three Dimensional Path Planning Using Grey Wolf Optimizer for UAVs. Appl. Intell. 2019, 49, 2201–2217. [Google Scholar] [CrossRef]
Marinaki, M.; Taxidou, A.; Marinakis, Y. A Hybrid Dragonfly Algorithm for the Vehicle Routing Problem with Stochastic Demands. Intell. Syst. Appl. 2023, 18, 200225. [Google Scholar] [CrossRef]
Wu, Y.; Zhao, P.; Liang, D.; Dong, J. A Learning-Based Anti-Swing Trajectory Refinement Approach for UAVs with Cable-Suspended Payload without Offline Training. IEEE Trans. Intell. Veh. 2024, 9, 6950–6959. [Google Scholar] [CrossRef]
Li, R.; Yang, F.; Xu, Y.; Yuan, W.; Lu, Q. Deep Reinforcement Learning-Based Swing-Free Trajectories Planning Algorithm for UAV with a Suspended Load. In Proceedings of the 2022 China Automation Congress (CAC), Xiamen, China, 25–27 November 2022; pp. 6149–6154. [Google Scholar]
Hua, H.; Fang, Y.; Zhang, X.; Qian, C. A Time-Optimal Trajectory Planning Strategy for an Aircraft with a Suspended Payload Via Optimization and Learning Approaches. IEEE Trans. Control Syst. Technol. 2022, 30, 2333–2343. [Google Scholar] [CrossRef]
Faust, A.; Palunko, I.; Cruz, P.; Fierro, R.; Tapia, L. Automated Aerial Suspended Cargo Delivery through Reinforcement Learning. Artif. Intell. 2017, 247, 381–398. [Google Scholar] [CrossRef]
Ergezer, H. Multi-Objective Trajectory Planning for Slung-Load Quadrotor System. IEEE Access 2021, 9, 155003–155017. [Google Scholar] [CrossRef]
Hegde, A.; Ghose, D. Multi-UAV Collaborative Transportation of Payloads with Obstacle Avoidance. IEEE Control Syst. Lett. 2021, 6, 926–931. [Google Scholar] [CrossRef]
Jackson, B.E.; Howell, T.A.; Shah, K.; Schwager, M.; Manchester, Z. Scalable Cooperative Transport of Cable-Suspended Loads with UAVs Using Distributed Trajectory Optimization. IEEE Robot. Autom. Lett. 2020, 5, 3368–3374. [Google Scholar] [CrossRef]
Duan, D.; Zu, R.; Yu, T.; Zhang, C.; Li, J. Differential Flatness-Based Real-Time Trajectory Planning for Multihelicopter Cooperative Transportation in Crowded Environments. AIAA J. 2023, 61, 4079–4095. [Google Scholar] [CrossRef]
Li, X.; Zhang, J.; Han, J. Trajectory Planning of Load Transportation with Multi-Quadrotors Based on Reinforcement Learning Algorithm. Aerosp. Sci. Technol. 2021, 116, 106887. [Google Scholar] [CrossRef]
Du, P.; He, X.; Cao, H.; Garg, S.; Kaddoum, G.; Hassan, M.M. AI-Based Energy-Efficient Path Planning of Multiple Logistics UAVs in Intelligent Transportation Systems. Comput. Commun. 2023, 207, 46–55. [Google Scholar] [CrossRef]
Stodola, P.; Kutěj, L. Multi-Depot Vehicle Routing Problem with Drones: Mathematical Formulation, Solution Algorithm and Experiments. Expert Syst. Appl. 2024, 241, 122483. [Google Scholar] [CrossRef]
Tsourdos, A.; White, B.; Shanmugavel, M. Cooperative Path Planning of Unmanned Aerial Vehicles; John Wiley & Sons: Hoboken, NJ, USA, 2010. [Google Scholar]
Cheng, Z.; Yang, J.; Sun, Y.; Zhao, L.; Zhao, L. State Predictor-Based Deep Model Reference Adaptive Control for Quadrotor Trajectory Tracking. Aerosp. Sci. Technol. 2025, 157, 109868. [Google Scholar] [CrossRef]
Pehlivanoglu, Y.V.; Pehlivanoglu, P. An Enhanced Genetic Algorithm for Path Planning of Autonomous UAV in Target Coverage Problems. Appl. Soft Comput. 2021, 112, 107796. [Google Scholar] [CrossRef]
Li, J.; Xiong, Y.; She, J.; Wu, M. A Path Planning Method for Sweep Coverage with Multiple UAVs. IEEE Internet Things J. 2020, 7, 8967–8978. [Google Scholar] [CrossRef]
Çakıcı, F.; Ergezer, H.; Irmak, U.; Leblebicioğlu, M.K. Coordinated Guidance for Multiple UAVs. Trans. Inst. Meas. Control 2016, 38, 593–601. [Google Scholar] [CrossRef]
Xie, J.; Chen, J. Multiregional Coverage Path Planning for Multiple Energy Constrained UAVs. IEEE Trans. Intell. Transp. Syst. 2022, 23, 17366–17381. [Google Scholar] [CrossRef]
Wang, Y.; Bai, P.; Liang, X.; Wang, W.; Zhang, J.; Fu, Q. Reconnaissance Mission Conducted by UAV Swarms Based on Distributed Pso Path Planning Algorithms. IEEE Access 2019, 7, 105086–105099. [Google Scholar] [CrossRef]
Mou, Z.; Zhang, Y.; Gao, F.; Wang, H.; Zhang, T.; Han, Z. Deep Reinforcement Learning Based Three-Dimensional Area Coverage with UAV Swarm. IEEE J. Sel. Areas Commun. 2021, 39, 3160–3176. [Google Scholar] [CrossRef]
Hu, W.; Yu, Y.; Liu, S.; She, C.; Guo, L.; Vucetic, B.; Li, Y. Multi-UAV Coverage Path Planning: A Distributed Online Cooperation Method. IEEE Trans. Veh. Technol. 2023, 72, 11727–11740. [Google Scholar] [CrossRef]
Aslan, S. An Immune Plasma Algorithm with a Modified Treatment Schema for Ucav Path Planning. Eng. Appl. Artif. Intell. 2022, 112, 104789. [Google Scholar] [CrossRef]
Duan, H.; Huang, L. Imperialist Competitive Algorithm Optimized Artificial Neural Networks for Ucav Global Path Planning. Neurocomputing 2014, 125, 166–171. [Google Scholar] [CrossRef]
Duan, H.; Yu, Y.; Zhang, X.; Shao, S. Three-Dimension Path Planning for Ucav Using Hybrid Meta-Heuristic ACO-DE Algorithm. Simul. Model. Pract. Theory 2010, 18, 1104–1115. [Google Scholar] [CrossRef]
Niu, Y.; Yan, X.; Wang, Y.; Niu, Y. An Adaptive Neighborhood-Based Search Enhanced Artificial Ecosystem Optimizer for UCAV Path Planning. Expert Syst. Appl. 2022, 208, 118047. [Google Scholar] [CrossRef]
Chen, Q.-y.; Lu, Y.-f.; Jia, G.-w.; Li, Y.; Zhu, B.-j.; Lin, J.-c. Path Planning for UAVs Formation Reconfiguration Based on Dubins Trajectory. J. Cent. South Univ. 2018, 25, 2664–2676. [Google Scholar] [CrossRef]
Huang, L.; Qu, H.; Ji, P.; Liu, X.; Fan, Z. A Novel Coordinated Path Planning Method Using K-Degree Smoothing for Multi-UAVs. Appl. Soft Comput. 2016, 48, 182–192. [Google Scholar] [CrossRef]

Figure 1. Structure diagram of the review.

Figure 2. Representative illustrations of UAVs with different wing configurations. (a) Fixed-wing UAV [42]; (b) Multi-rotor UAV [43]; (c) Single-rotor UAV [44]; (d) Tilt-rotor UAV [45].

Figure 3. Trajectory planning in typical high-risk scenarios. (a) Trajectory planning in simulated mountainous environments [56]; (b) Trajectory planning in large-scale urban environments [57]; (c) Trajectory planning in narrow and constrained environments, where the green box represents obstacles with a narrow passage [47].

Figure 4. Coordinative trajectory planning of the UAV swarm and tankers [61].

Figure 5. Classification of intelligent algorithms used to plan UAV trajectories in the surveyed articles.

Figure 6. The trend of research on UAV trajectory planning based on intelligent algorithms (2010–2024).

Figure 7. The Transformer model architecture diagram [77].

Figure 8. Basic principles of the agent developing strategies to optimize expected reward outcomes in RL.

Figure 9. Annual statistics on UAV trajectory planning research categorized by different types of EAs (2010–2024).

Figure 10. Annual statistics on UAV trajectory planning research categorized by different types of SI algorithms (2010–2024).

Figure 11. UAV carrying a suspended load and its trajectory in the coffee delivery task in the cafe. (a) Experimental system; (b) Position definition; (c) Geometric model; (d–f) Trajectory in coffee delivery for maximal allowed load displacement of 1° (d), 10° (e), and 25° (f) [137].

Figure 12. Example phenotypes of individuals from initial populations, where the square dots represent checkpoints and the dashed lines represent initial trajectories. (a) An initial population in basic GA; (b) an initial population in ACO-enhanced GA, where the solid line belongs to a baseline trajectory. Adapted with permission [147].

Figure 13. If two or more UAVs detect and explore the same cluster, the UAV that has explored more targets continues to explore the rest of the region, while the others leave to search for new clusters. Adapted with permission [151].

Table 1. Comparison of reviews on intelligent algorithms for UAV trajectory planning.

Publication	Algorithms	Planning Features	Applications
Puente-Castro et al. [36]	Machine learning, evolutionary algorithms, and swarm intelligence algorithms	\	\
Yahia and Mohammed [37]	Metaheuristic algorithms	Complex environment	\
Hooshyar and Huang [38]	Metaheuristic algorithms	Complex environment	\
Poudel et al. [39]	Bio-inspired algorithms	\	\
Sharma et al. [40]	Swarm intelligence algorithms	\	Multiple target interception
Tang et al. [41]	Swarm intelligence algorithms	\	\
This work	Machine learning, evolutionary algorithms, and swarm intelligence algorithms	Complex environment, UAV Types	Transportation, inspection, show, etc.

Table 2. Comparison of characteristics among different UAV types.

Characteristic	Multi-Rotor UAV	Single-Rotor UAV	Tilt-Rotor UAV	Fixed-Wing UAV
Propulsion	Multiple fixed motors with propellers	One main rotor and anti-torque mechanism	Tiltable rotors with fixed wings	Propeller or jet engine with fixed wings
Lift Generation	Rotors	Rotors	Rotors/fixed wings	Fixed wings
Thrust-Attitude Coupling	Strong coupling	Strong coupling	Strong in rotor mode, weak in fixed-wing mode	Weak coupling
Endurance	<1 h	About 1–3 h	About 3–8 h	About 5–30 h
Speed	About 0–20 m/s	About 0–50 m/s	About 0–100 m/s	About 15–100 m/s
Maneuverability	High	High	Medium (weak in transition)	Low (large turn radius)

Table 3. Comparison of UAV trajectory planning methods employing various types of machine learning.

Category	Algorithm	Contributions	Reference
DL	CNN	Adding time-series properties to CNN to adapt to time-series prediction problems	[65]
	RNN	Solving multi-agent trajectory planning problems together with a decentralized version	[66]
	GNN	Combining a mechanism that allows message-dependent attention and a message-aware graph attention network	[69]
	Transformer	Powerful representation capabilities and parallel computing characteristics	[77]
RL	Asynchronous RL	Combining asynchronous methods with the tabular Q-learning algorithm alleviates the slow convergence of classical RL.	[81]
	GRL	Adaptively updating the reward matrix based on geometric distance and risk information when only partial information about the map is available	[82]
	multi-layer RL	Collecting both global and local information	[83]
DRL	TD3	Proposing a model explanation method based on feature attribution to achieve model explainability	[85]
	D3QN	Combining the greedy strategy with heuristic search rules to improve the learning efficiency of the D3QN during the training phase	[86]
	MATD3	Introducing LSTM RNN into the environment perception end of the MATD3 network and developing an improved potential field-based dense reward function	[11]
Others	TL	Integrating prior knowledge, improving the effect of formal training	[88]
	RF	Improving accuracy and stability in the case of a few samples	[90]
	SVM	Establishing safe flight corridors for trajectory planning	[91]

Table 4. Comparison of UAV trajectory planning methods employing various types of EAs.

Category	Algorithm	Contributions	Reference
GA	GA	Solving complex combinatorial optimization problems	[96]
	GA	Proposing new mutation operators for specific problems	[97]
	parallel GA	Developing a parallel implementation of GA to find a solution faster.	[98]
	parallel GA	Trajectory planning based on probability graph and parallel GA	[99]
	mVGA	Using the vibrational mutation operators to solve the premature convergence of GA	[101]
DE	DE	Designing an offline UAV trajectory planner based on DE	[104]
	FA-DE	Introducing a fuzzy logic controller to automatically adjust parameters	[105]
	CDE	Designing an adaptive selection mutation operator	[106]
	MSFDE	Integrating multi-population strategy, adaptive strategy, and interactive mutation strategy	[110]
Others	GP	Designing special function and symbol operators for GP	[112]
	ES	Combining ES with the exact Dijkstra algorithm	[113]
	MO-CMA-ES	Proposing an energy model and a noise model for a UAV	[114]

Table 5. Comparison of UAV trajectory planning methods employing various types of SI algorithms.

Category	Algorithm	Contributions	Reference
PSO	PSO	Proposing an adaptive sensitivity decision operator to address the limitations of local optimal and premature	[118]
PSO	SPSO	SPSO searches for solutions within the configuration space rather than the Cartesian space to improve the chances of identifying quality solutions.	[119]
ACO	ACO	Using ACO to provide an approximate solution	[123]
	CACOC	Using chaotic dynamics to replace the random part of ACO to improve performance on the coverage problem	[124]
	CACOC	Extending CACOC by a collision avoidance mechanism	[7]
Others	BA	Using the artificial potential field method to speed up the convergence of bat position updates	[26]
Others	WOA	Introducing an adaptive chaos-Gaussian switching method coupled with a coordinated decision-making framework to overcome the defect of local minima	[27]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cheng, Z.; Yang, J.; Sun, J.; Zhao, L. Trajectory Planning of Unmanned Aerial Vehicles in Complex Environments Based on Intelligent Algorithm. Drones 2025, 9, 468. https://doi.org/10.3390/drones9070468

AMA Style

Cheng Z, Yang J, Sun J, Zhao L. Trajectory Planning of Unmanned Aerial Vehicles in Complex Environments Based on Intelligent Algorithm. Drones. 2025; 9(7):468. https://doi.org/10.3390/drones9070468

Chicago/Turabian Style

Cheng, Zhekun, Jueying Yang, Jinfeng Sun, and Liangyu Zhao. 2025. "Trajectory Planning of Unmanned Aerial Vehicles in Complex Environments Based on Intelligent Algorithm" Drones 9, no. 7: 468. https://doi.org/10.3390/drones9070468

APA Style

Cheng, Z., Yang, J., Sun, J., & Zhao, L. (2025). Trajectory Planning of Unmanned Aerial Vehicles in Complex Environments Based on Intelligent Algorithm. Drones, 9(7), 468. https://doi.org/10.3390/drones9070468

Article Menu

Trajectory Planning of Unmanned Aerial Vehicles in Complex Environments Based on Intelligent Algorithm

Abstract

1. Introduction

2. Trajectory Planning Features

2.1. Trajectory Planning Features Across Various UAV Types

2.2. Trajectory Planning Features in Complex Environments

3. Intelligent Algorithms for UAV Trajectory Planning

3.1. Machine Learning

3.1.1. Deep Learning

3.1.2. Reinforcement Learning

3.1.3. Deep Reinforcement Learning

3.1.4. Other Machine Learning Methods

3.2. Evolutionary Algorithms

3.2.1. Genetic Algorithm

3.2.2. Differential Evolution

3.2.3. Other Evolutionary Algorithms

3.3. Swarm Intelligence

3.3.1. Particle Swarm Optimization

3.3.2. Ant Colony Optimization

3.3.3. Other Swarm Intelligence Algorithms

4. Trajectory Planning Applications in Complex Environments

4.1. Transportation

4.1.1. Slung-Load Transportation

4.1.2. Multi-Destination Transport

4.2. Inspection

4.2.1. Predetermined Checkpoints

4.2.2. Minimum Time Search

4.2.3. Sweep Coverage

4.3. Others

5. Trends and Challenges

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI