A Survey of Risk-Calibrated Certifiably Safe and Resource-Aware (RCSR) Path Planning for Unmanned Aerial Vehicles

Johnson, Nathan; Shafaei, Sima; Karem, Andrew; Sarkar, Sayani

doi:10.3390/drones10050351

Open AccessReview

A Survey of Risk-Calibrated Certifiably Safe and Resource-Aware (RCSR) Path Planning for Unmanned Aerial Vehicles

Department of Computer Science, Bellarmine University, Louisville, KY 40205, USA

^*

Author to whom correspondence should be addressed.

Drones 2026, 10(5), 351; https://doi.org/10.3390/drones10050351

Submission received: 20 March 2026 / Revised: 17 April 2026 / Accepted: 25 April 2026 / Published: 7 May 2026

(This article belongs to the Special Issue Advances in Cartography, Mission Planning, Path Search, and Path Following for Drones: 2nd Edition)

Download

Browse Figures

Review Reports Versions Notes

Highlights

What are the main findings?

Existing UAV path planning methods remain fragmented and often do not jointly address uncertainty, safety assurance, regulatory compliance, and onboard resource constraints.
This review introduces the Risk-Calibrated, Certifiably Safe, Resource-Aware (RCSR) framework to unify classical, learning-based, and hybrid UAV planning approaches for real-world autonomy.

What are the implications of the main findings?

Future UAV planning systems should integrate risk awareness, certifiable safety, resource efficiency, and operational constraints within a single planning framework.
Progress in deployable UAV autonomy requires stronger benchmarking and validation across simulation, hardware-in-the-loop, and real-world flight environments.

Abstract

Effective mission planning, path search, and path following are critical for unmanned aerial vehicles (UAVs) operating in complex, dynamic, and resource-constrained environments. Classical path planning approaches, including graph-based search, sampling-based methods, and trajectory optimization, provide structured solutions with performance guarantees but often exhibit limited adaptability to uncertainty, environmental disturbances, and evolving mission constraints. Reinforcement learning (RL) offers a complementary capability by enabling adaptive decision-making and online response to dynamic obstacles and partial observability. This paper examines UAV path planning and navigation within a Risk-Calibrated, Certifiably Safe, and Resource-Aware (RCSR) framework, with emphasis on its implications for mission planning, path search, and path following. Classical planning techniques are reviewed alongside recent advances in RL-based navigation for single-UAV and multi-UAV systems. Particular attention is given to safe reinforcement learning, constrained optimization, and runtime assurance mechanisms that address safety, regulatory compliance, and resource limitations in real-world deployments. Through a comparative analysis of classical, learning-based, and hybrid planning architectures, this work highlights key trade-offs among adaptability, safety, computational cost, and energy efficiency. The paper concludes by identifying hybrid learning–planning approaches as a practical direction for scalable, reliable, and deployable UAV mission planning systems.

Keywords:

UAV mission planning; path search; path following; reinforcement learning; safe autonomy; hybrid planning

1. Introduction

A small pilotless drone navigates between skyscrapers in a windy urban corridor, buffeted by gusts, intermittently losing and regaining GPS lock, and maneuvering to avoid traffic, pedestrian bridges, and other airborne vehicles. Its mission is to deliver urgent medication to an elderly resident while complying with aviation regulations and safety requirements. This scenario illustrates a central challenge in modern uncrewed aerial vehicle (UAV) research: transitioning path planning algorithms from controlled laboratory environments to the uncertainty, complexity, and safety-critical demands of real-world deployment.

Over the past two decades, substantial progress has been made in UAV pathfinding under simplified assumptions. Classical planning approaches have demonstrated strong performance in static or partially known environments. Notably, Koenig and Likhachev introduced D* Lite, enabling efficient replanning in response to dynamic changes in traversal cost [1]. Karaman and Frazzoli later established a theoretical foundation for optimal motion planning with respect to user-defined cost functions, providing asymptotic optimality guarantees while accommodating penalties associated with distance, risk, or energy consumption [2]. These algorithms remain foundational components of contemporary UAV navigation research.

As UAV applications continue to expand, recent surveys have emphasized the increasing complexity of real-world operating environments. Meng et al. provided a comprehensive review of artificial intelligence and machine learning–based approaches for UAV path planning, with particular attention to three-dimensional urban environments and real-time operational constraints [3]. Ghambari et al. analyzed simulation platforms and evaluation methodologies, highlighting persistent gaps between algorithmic performance in simulated environments and operational deployability in real-world scenarios [4]. Constraint-based techniques, such as those proposed by Tayal et al., address collision avoidance in dynamic settings but remain sensitive to perception delays, modeling uncertainty, and computational limitations [5]. Despite these advances, fully autonomous and reliable UAV operation in complex real-world environments remains an open research challenge.

This paper advances a unifying research direction for UAV pathfinding termed Risk-Calibrated, Certifiably Safe, Resource-Aware (RCSR) pathfinding. Rather than introducing a new planning algorithm, RCSR serves as a conceptual framework that integrates fragmented research efforts into a coherent research agenda for deployable autonomy. More concretely, RCSR can function as a design pipeline directing planners to explicitly address each dimension in sequence, an evaluation rubric for determining which RCSR pillars a given method satisfies, or a research taxonomy for identifying deployment-critical gaps in current planning approaches. The framework synthesizes four interdependent dimensions: uncertainty-aware planning under imperfect sensing and dynamic environments; formal safety assurance and runtime verification; coordination among multiple UAVs operating in shared airspace under regulatory constraints; and resource-aware implementation subject to strict latency, energy, and computational limits.

For clarity, this survey uses three related but distinct terms throughout. Pathfinding refers to the algorithmic computation of a feasible or optimal route from a start state to a goal state, often at the graph-search or motion-planning level. Path planning is used in a broader sense to include route generation together with relevant objectives and constraints such as safety, risk, energy, dynamics, and regulatory feasibility. Navigation refers to the execution-level problem of following or adapting the planned path online using perception, state estimation, control, and replanning in response to environmental changes. Because these concepts are tightly coupled in real UAV systems, all three terms remain relevant in this manuscript, but path planning is used as the broadest umbrella term unless a narrower distinction is specifically intended.

Three conceptually related but operationally distinct modifiers appear throughout this survey and in the broader literature. Risk-aware planning refers to general recognition of environmental hazards—such as collision zones, terrain, or air traffic—without necessarily quantifying their probability or magnitude. Risk-calibrated planning involves explicit probabilistic or quantitative models of uncertainty distributions and threat exposure that are traded directly against path efficiency within the planning objective. Certifiably safe planning goes further by providing formal, verifiable guarantees on safety metrics—such as provable bounds on collision probability or constraint-satisfaction certificates—that hold under specified operating assumptions. Understanding these distinctions is essential for interpreting the algorithmic trade-offs surveyed in subsequent sections and for situating each method within the RCSR framework.

Reliable UAV path planning is critical across a wide range of current and emerging applications. Existing civilian uses include aerial imaging, precision agriculture, surveying and mapping, infrastructure inspection, public safety operations, and environmental monitoring [3,4,6]. Emerging applications envision urban air mobility, medical and humanitarian delivery, coordinated multi-UAV systems, and high-altitude communication platforms [7]. Many of these scenarios require beyond visual line of sight (BVLOS) operation in dense and regulated airspace, imposing stringent demands on robustness, safety, and efficiency.

Current planning paradigms address these requirements only partially. Classical search-based methods such as A* and D* variants support fast replanning but typically rely on deterministic models and limited representations of uncertainty [1,8,9]. Sampling-based planners, including RRT* and BIT*, explore continuous spaces effectively but often struggle to provide formal safety guarantees within strict real-time constraints [2]. Optimization-based approaches, such as model predictive control and mixed-integer formulations, explicitly encode vehicle dynamics and operational constraints but frequently suffer from computational fragility under sensing delays and model mismatch [10,11]. Across these paradigms, risk calibration is rarely modeled explicitly, safety is frequently assumed rather than formally certified, and resource constraints are often treated as secondary considerations [5,12].

RCSR pathfinding reframes these limitations by explicitly coupling uncertainty modeling, certifiable safety, coordination, and resource awareness within a unified research perspective. By doing so, it emphasizes the requirements necessary for UAV systems that are not only effective in simulation but also reliable in deployment. The contributions of this survey are as follows:

1.: An integrated review of UAV path planning algorithms, simulation and testing methodologies, uncertainty-aware planning, and formal safety assurance, with explicit focus on real-world operational constraints.
2.: A forward-looking research agenda based on the RCSR framework, identifying concrete research directions aimed at bridging the gap between laboratory prototypes and deployable UAV autonomy.

The remainder of this paper is organized as follows. Section 2 reviews prior survey studies and highlights the gaps that motivate the proposed synthesis. Section 3 reviews classical and modern UAV path planning paradigms. Section 4 surveys simulation platforms and experimental testbeds. Section 5 examines uncertainty-aware planning and formal safety assurance. Section 6 discusses environmental, computational, and energy constraints encountered in real-world deployment. Section 7 outlines future research directions through the RCSR framework, and Section 8 concludes the paper. A detailed taxonomy illustrating the overall structure of this paper and the surveyed research domains is presented in Figure 1.

2. Related Studies

Research on UAV path planning has expanded rapidly over the past decade, resulting in a growing body of survey papers that analyze planning algorithms, mission requirements, and environmental constraints. Most existing reviews, however, primarily emphasize algorithmic taxonomies and performance characteristics, while treating uncertainty, safety assurance, airspace regulation, and real-world deployment constraints as secondary or isolated concerns.

Jones et al. present one of the most comprehensive recent taxonomies of UAV path-planning methods, emphasizing the influence of environmental complexity and map representation on algorithmic performance [13]. Meng et al. provided a broad review spanning deterministic, sampling-based, evolutionary, learning-based, and hybrid planners—identifying scalability, real-time feasibility, and robustness as persistent challenges for operational deployment [3]. Debnath et al. focused on remote-sensing and agricultural missions, reviewing planning and obstacle-avoidance techniques with emphasis on environmental constraints and sensing payloads [14]. Gugan and Haque analyzed the limitations of widely used planners, including poor adaptability in cluttered three-dimensional environments, weak integration between perception and planning modules, and limited robustness to uncertainty [15]. A complementary mission-oriented classification by Luo et al. organized UAV planners by operational context, highlighting trade-offs between computational cost, optimality, and adaptability [16]. While these surveys provide valuable algorithmic insight, they largely stop short of integrating safety guarantees, regulatory considerations, and deployment realism into a unified planning perspective.

Several studies narrow their focus to specific operational subdomains, including multi-UAV coordination, autonomous swarms, and urban airspace integration. Puente-Castro et al. surveyed artificial intelligence–based approaches for multi-UAV path planning, covering reinforcement learning, swarm intelligence, and evolutionary algorithms, with emphasis on distributed optimization, communication constraints, and energy balancing across fleets [17]. Davidović and Urošević examined planning constraints arising from UAV integration into dense urban airspace, surveying separation requirements, airspace structure, and regulatory barriers that directly influence feasible trajectory design [18]. Ghambari et al. provided a mission-centric taxonomy of UAV planning problems, highlighting the growing importance of multimodal sensing, energy constraints, and autonomous operation in dynamic environments [4]. Collectively, these studies identify environmental uncertainty, coordination complexity, and mission-specific constraints as central limitations of current planning frameworks. However, these challenges are often addressed in isolation rather than within an integrated deployment-oriented framework.

A growing body of research focuses explicitly on risk-aware UAV path planning. Primatesta et al. proposed a risk-aware planning strategy for dense urban environments by integrating static and dynamic risk maps into a modified A* algorithm, demonstrating improved safety performance in populated areas [19]. Tang et al. extended this line of work by introducing a third-party risk model that quantifies obstacle risk, fatality risk, and infrastructure vulnerability, embedding these metrics within a multi-objective A* planner designed for urban missions [20]. Zhou et al. incorporated crash probability estimates and economic risk into a unified cost formulation that adapts to complex air–ground environments, demonstrating substantial reductions in expected operational risk [21]. While these approaches successfully integrate risk into planning objectives, they are primarily evaluated offline and typically lack formal safety guarantees or runtime enforcement mechanisms.

Parallel advances in formal safety assurance offer complementary mechanisms for enabling reliable UAV autonomy. Ames et al. surveyed control barrier functions (CBFs), demonstrating how safety constraints can be encoded as forward-invariant sets within optimization-based controllers [22]. Hobbs et al. reviewed runtime assurance (RTA) architectures, including Simplex-style supervisory control and optimization-based safety filters capable of enforcing safety constraints during online operation, even when primary planners are learning-based or unverified [23]. Sciancalepore et al. introduced ORION, a framework that leverages Remote ID broadcasts to verify UAV trajectories for regulatory compliance, bridging trajectory verification with emerging airspace management requirements [24]. Despite their importance, these safety-focused studies typically treat planning as an upstream component and therefore do not provide a unified perspective linking risk modeling, uncertainty, regulatory constraints, and real-time computational limitations.

Table 1 summarizes representative studies across key dimensions relevant to deployable UAV path planning. The comparison highlights that while existing surveys and frameworks address individual aspects such as algorithmic design, risk modeling, coordination, or safety assurance, none provide an integrated treatment spanning uncertainty-aware planning, certifiable safety, regulatory compliance, and real-world resource constraints.

Overall, the literature demonstrates substantial progress across individual research directions, including algorithm design, risk assessment, swarm coordination, energy-aware planning, and igureormal safety mechanisms. However, these themes are often developed independently, with limited consideration of their interaction in real-world deployment scenarios. This survey addresses this gap by adopting a constraint-centric perspective and synthesizing prior work through the RCSR framework, unifying risk modeling, safety assurance, regulatory considerations, and resource-aware implementation into a coherent research agenda for next-generation UAV autonomy.

3. Pathfinding Algorithms for Real-World UAVs

Many pathfinding algorithms widely used in UAV navigation were originally developed for abstract graph search problems or ground-based robotic systems. Despite their origins, several algorithmic families have proven highly effective for aerial navigation in complex three-dimensional environments. In particular, algorithms that support explicit constraints, dynamic replanning, and partial environmental knowledge have become central to modern UAV autonomy.

Rather than presenting a purely historical survey, this section focuses on algorithmic families that serve as foundational building blocks for RCSR pathfinding in real-world UAV deployments. These approaches underpin many contemporary navigation systems and continue to influence how uncertainty, safety guarantees, and resource constraints are incorporated into practical UAV planning frameworks. Readers should note that the text shifts between two analytical modes: a survey perspective that documents algorithmic properties, capabilities, and representative implementations, and a framework perspective that evaluates how each family aligns with or falls short of the RCSR pillars. Subsections labeled ‘Implications for RCSR’ or ‘RCSR Relevance’ explicitly signal these analytical transitions.

Classical path-planning algorithms remain particularly important because they provide well-understood guarantees regarding optimality, completeness, and computational behavior. Many modern planners, including learning-based approaches, still rely on these classical methods either directly or as underlying components. Reinforcement learning–based path-planning approaches represent a complementary and more recent paradigm and are discussed later in the paper (Section 3.2.)

3.1. Classical Pathfinding Algorithms for UAV Navigation

Classical pathfinding algorithms provide the conceptual and algorithmic foundation for many UAV navigation systems. These methods typically operate on graph representations of the environment or continuous configuration spaces and are designed to compute collision-free paths while optimizing cost functions such as distance, time, or energy consumption.

Figure 2 illustrates a taxonomy of classical pathfinding algorithms relevant to UAV navigation. The taxonomy highlights four major categories: deterministic grid-based search, incremental graph search methods designed for dynamic environments, sampling-based planners operating in continuous spaces, and trajectory generation approaches that combine search with motion constraints and optimization techniques.

3.1.1. Deterministic Grid-Based Pathfinding

Classical grid-based path planning models the environment as a weighted graph

G = (V, E, w),

(1)

where V denotes discrete states (e.g., grid cells),

E \subseteq V \times V

represents feasible transitions, and

w (e) \geq 0

encodes traversal costs such as distance, time, or energy consumption. A path

π = 〈 v_{0}, \dots, v_{k} 〉

has a total cost

C (π) = \sum_{i = 0}^{k - 1} w (v_{i}, v_{i + 1}),

(2)

and the canonical planning problem is to determine a minimum-cost path between a start state s and a goal state g in a static, fully known environment.

Dijkstra and A*

Dijkstra’s algorithm computes optimal shortest paths on graphs with non-negative edge costs by iteratively expanding the node with the smallest cost-to-come value

g (v)

[25]. While optimal, the algorithm performs an exhaustive exploration of the search space, which becomes computationally expensive on large three-dimensional grids typical of urban UAV environments.

A* improves efficiency by introducing a heuristic estimate

h (v)

of the remaining cost-to-go and expanding nodes according to

f (v) = g (v) + h (v) .

(3)

When h is admissible and consistent, A* guarantees optimality while significantly reducing the number of explored nodes [8]. In UAV applications, heuristics commonly incorporate Euclidean distance, altitude change penalties, or approximate energy expenditure. From an RCSR perspective, heuristics provide a natural mechanism for encoding prior knowledge about cost structure. However, classical A* assumes deterministic edge costs and does not explicitly represent uncertainty or probabilistic risk.

Weighted and Bounded-Suboptimal Variants

Weighted A* and related bounded-suboptimal algorithms trade optimality for reduced computational effort by inflating the heuristic:

f_{ε} (v) = g (v) + ε h (v), ε > 1 .

(4)

For admissible heuristics, the resulting path cost satisfies

C (π_{ε}) \leq ε C (π^{★}),

(5)

where

π^{★}

denotes the optimal path available, meaning the resulting path cost given by the Weighted A* is guaranteed not to exceed the ideal cost by more than a factor of

ε

. This guarantee of an explicit bound on solution quality is particularly useful for real-time UAV navigation, where computational resources are limited, and rapid responses are required. This property aligns naturally with the RCSR emphasis on resource-aware planning, where bounded deviations from optimality can be justified by strict latency or energy constraints. Nevertheless, these planners remain fundamentally deterministic and static, requiring extensions to handle dynamic obstacles, environmental uncertainty, or strict real-time constraints.

3.1.2. Incremental Graph Search with Temporal and Environmental Constraints

Real-world UAV operations rarely permit single-shot planning. Environmental conditions, obstacle locations, and sensing information often evolve during flight. Incremental and anytime search algorithms are therefore particularly relevant, as they reuse prior computation and adapt efficiently to changing environments.

LPA* and D* Lite

Lifelong Planning A* (LPA*) and D* Lite update the shortest paths incrementally when edge costs change, avoiding the need for full replanning. Both algorithms maintain a cost-to-come value

g (v)

and a one-step lookahead estimate

rhs (v) = min_{(u, v) \in E} (g (u) + w (u, v)),

(6)

and enforce consistency by driving

g (v)

toward

rhs (v)

. Nodes are prioritized using a lexicographically ordered key

k (v) = (min (g (v), rhs (v)) + h (v), min (g (v), rhs (v))),

(7)

enabling efficient updates when obstacles, weather conditions, or energy costs change during flight [1,26].

From an RCSR perspective, incremental planners provide three key advantages: reduced computational overhead through reuse of prior search effort, online adaptability to newly sensed environmental information, and formal guarantees of optimality under admissible heuristics. However, these algorithms still treat costs as deterministic, and risk must therefore be incorporated indirectly through expected cost models or external runtime safety mechanisms.

Anytime Variants (ARA, AD)

Anytime Repairing A* (ARA*) rapidly computes a bounded-suboptimal solution using an inflated heuristic and incrementally improves solution quality as time permits [27]. Anytime Dynamic A* (AD*) extends this capability to dynamic environments by combining bounded-suboptimal search with incremental replanning [28]. At any point, the current solution is guaranteed to lie within a known factor

ε

of optimality. These guarantees are particularly attractive for UAVs operating under strict real-time constraints since planners can produce feasible trajectories quickly and refine them as additional computation becomes available.

Safe Interval Path Planning (SIPP)

Dynamic obstacle avoidance can be addressed by augmenting spatial states with temporal constraints. Naïve time-expanded grid formulations quickly become computationally intractable, motivating the Safe Interval Path Planning (SIPP) framework. SIPP represents each state as

(v, I)

, where

I = [t_{min}, t_{max}]

denotes a contiguous time interval during which state v is guaranteed to remain collision-free. Transitions are permitted only if the arrival time

t^{'} = g (v, I) + Δ t (v, u)

(8)

lies within the successor state’s safe interval [9]. Subsequent extensions generalize SIPP to any-angle motion and real-time search settings suitable for dynamic robotic environments [29,30]. While effective for temporal collision avoidance, SIPP often requires preprocessing of obstacle trajectories and can remain computationally demanding in dense environments.

Summary and Limitations

Grid-based planners provide a flexible mechanism for encoding constraints relevant to real-world UAV operations. Regulatory restrictions and geofenced regions can be represented as forbidden vertices or edges, while energy and time budgets naturally appear as edge costs. Dynamic obstacles may be incorporated through incremental cost updates or temporal safe intervals. Despite these strengths, classical grid-based methods scale poorly to high-dimensional state spaces and dense three-dimensional maps. These limitations motivate the transition toward planners operating in continuous spaces, which are discussed in subsequent sections.

3.1.3. Sampling-Based Planning in Continuous 3D Airspace

Sampling-based motion planners are well suited for UAV navigation in cluttered three-dimensional environments, where grid discretizations can become prohibitively large or too coarse to capture feasible kinodynamic motion. Let the continuous state space be

X \subset R^{d}

, with obstacle region

X_{obs}

and collision-free region

X_{free} = X ∖ X_{obs} .

(9)

UAV motion is typically modeled by continuous-time dynamics

\dot{x} (t) = f (x (t), u (t)), x (t) \in X, u (t) \in U,

(10)

and a trajectory

σ

over

t \in [0, T]

is evaluated by a cost functional

J (σ) = \int_{0}^{T} L (x (t), u (t)) d t,

(11)

where L may encode travel time, energy consumption, trajectory smoothness, or multi-objective penalties. Exact optimal planning for realistic UAV dynamics is generally computationally intractable, which motivates randomized planners that seek feasible, and in some cases asymptotically optimal, solutions through sampling of the configuration space.

RRT and RRT*

The rapidly exploring random tree (RRT) algorithm constructs a search tree rooted at the start state by repeatedly sampling

x_{rand} \in X_{free}

, selecting the nearest tree node

x_{near}

, and extending toward

x_{rand}

using a steering operator that respects system constraints while performing local collision checking [31]. RRT is probabilistically complete: if a feasible path exists, the probability of finding one approaches one as the number of samples increases. However, the solutions produced by RRT are typically suboptimal.

RRT* extends the original RRT algorithm by introducing a rewiring step that selects parents and reconnects nearby nodes to improve the cost-to-come of the tree. Under standard assumptions, RRT* is asymptotically optimal, meaning that

P (J (σ_{n}) \to J^{★}) = 1, n \to \infty,

(12)

where

J^{★}

denotes the optimal cost [2]. From an RCSR perspective, RRT* provides a principled mechanism to trade computational effort (e.g., number of samples, neighbor radius, and rewiring budget) for solution quality while retaining convergence guarantees. In practice, however, finite-sample performance can vary significantly in large three-dimensional environments, and classical collision checking typically assumes deterministic obstacle representations without explicit modeling of uncertainty or risk.

Informed Sampling (Informed RRT*)

Informed RRT* accelerates convergence by restricting sampling to the subset of states that could potentially improve the current best solution. For problems that minimize path length, or more generally metric costs with admissible heuristics, samples are drawn from an ellipsoidal subset whose foci correspond to the start and goal states:

X_{informed} = \{x \in X_{free} | ∥ x - x_{start} ∥ + ∥ x - x_{goal} ∥ \leq c_{best}\},

(13)

where

c_{best}

denotes the cost of the best solution found thus far [32]. Restricting the sampling region in this manner focuses computational effort on areas of the search space that can reduce the current solution bound, which is particularly beneficial for high-dimensional UAV planning problems.

Batch Informed Trees (BIT*)

Batch Informed Trees (BIT*) combine heuristic graph search with sampling-based planning by operating on batches of randomly generated samples and searching an implicit random geometric graph. The algorithm maintains cost-to-come estimates

g (x)

and heuristic cost-to-go values

h (x)

, prioritizing candidate edges using an A*-like ordering:

f (e) = g (x_{from}) + \hat{c} (e) + h (x_{to}),

(14)

where

\hat{c} (e)

represents an estimate of edge cost [33,34]. BIT* incrementally tightens upper bounds on the solution cost and prunes samples or edges that cannot improve the current best solution, yielding an informed, anytime, and asymptotically optimal planner.

Implications for RCSR

Sampling-based planners offer several properties that are directly relevant for real-world UAV pathfinding: (i) scalability to continuous three-dimensional airspace, (ii) flexibility in handling geometric constraints through collision checking and steering functions, and (iii) compatibility with multi-objective cost formulations through the running cost

L (x, u)

. Within an RCSR framework, these planners can be extended by shaping sampling distributions using risk or turbulence maps, incorporating risk-sensitive objective functions, and pruning candidate solutions using probabilistic safety constraints rather than purely deterministic collision checks.

Despite these advantages, classical sampling-based planners provide limited hard real-time guarantees and typically rely on simplified dynamic models during planning. Producing aerodynamically feasible and trackable UAV trajectories therefore often requires an additional smoothing or optimization stage, and safety under perception uncertainty is frequently enforced through downstream safety filters or runtime assurance mechanisms discussed later in this survey.

3.2. Reinforcement Learning–Based Path Planning

While classical graph search, sampling-based planning, and trajectory optimization provide strong algorithmic foundations for UAV pathfinding, they typically rely on explicit environmental models and carefully engineered cost functions. Reinforcement learning (RL) offers a complementary paradigm in which navigation policies are learned directly through interaction with the environment. By optimizing behavior through trial-and-error experience, RL-based planners can adapt to complex disturbances, partial observability, and mission objectives that may be difficult to encode analytically [35,36].

Recent studies demonstrate that RL-based approaches can achieve effective online navigation, energy-aware flight behavior, and cooperative multi-UAV coordination, particularly when trained in high-fidelity simulation environments [3,12,37,38,39]. Figure 3 summarizes the major categories of reinforcement learning–based pathfinding approaches relevant to the RCSR framework.

A common formulation models UAV decision-making as a Markov decision process (MDP),

M = (S, A, P, r, γ),

(15)

where

S

denotes the state (e.g., position, velocity, battery state, and local map features),

A

denotes actions (e.g., heading commands or continuous thrust/attitude setpoints),

P (s^{'} ∣ s, a)

is the transition model,

r (s, a)

is the reward, and

γ \in (0, 1)

is a discount factor. The objective is to learn a policy

π (a ∣ s)

that maximizes the expected discounted return

J (π) = E [\sum_{t = 0}^{\infty} γ^{t} r (s_{t}, a_{t})] .

(16)

3.2.1. Single-UAV Navigation and Obstacle Avoidance

RL has been widely explored for single-UAV local navigation and obstacle avoidance in partially known environments. Typical observations include occupancy grids, distance-field features, or onboard camera/LiDAR inputs combined with UAV kinematics [3]. For discrete actions, deep Q-learning updates

Q (s, a)

using

Q_{k + 1} (s_{t}, a_{t}) \leftarrow (1 - α) Q_{k} (s_{t}, a_{t}) + α (r_{t + 1} + γ max_{a^{'}} Q_{k} (s_{t + 1}, a^{'})),

(17)

while continuous-control settings commonly use actor–critic algorithms such as DDPG, TD3, SAC, and PPO [40,41,42,43]. These methods can learn reactive behaviors that avoid obstacles, improve smoothness, and reduce energy consumption, and they can be trained under randomized disturbances (e.g., wind fields and sensor noise) to improve robustness in deployment. Attention-based and recurrent architectures further improve performance under partial observability and perception latency by leveraging temporal context [44].

From an RCSR standpoint, RL is most compelling as a fast local policy that can adapt to uncertainty and changing conditions with modest onboard inference cost. However, most approaches still rely on hand-designed rewards and do not provide formal guarantees of collision avoidance or constraint satisfaction, limiting their direct use in safety-critical UAV operations [37,39].

3.2.2. Multi-UAV Cooperation and Deep Multi-Agent RL

RL has also been extended to cooperative multi-UAV settings, where coordination, task allocation, and collision avoidance must be addressed jointly. Such problems are often modeled as decentralized partially observable MDPs (Dec-POMDPs), in which each UAV acts based on local observations while the team optimizes a shared objective [45,46]. A widely used paradigm is centralized training with decentralized execution (CTDE), where a centralized critic has access to joint state/action information during training, but each UAV executes a local policy at deployment. Multi-agent DDPG (MADDPG) is a canonical CTDE approach that learns decentralized actors with centralized critics [47]. Related methods have been applied to cooperative monitoring, tracking, and data-harvesting missions, improving coverage and robustness relative to heuristic baselines [12,45]. Value-factorization approaches (e.g., QMIX) and multi-agent actor–critic variants have also been used to couple task assignment with collision-aware path planning in dynamic environments [48,49,50,51].

These approaches relate directly to the RCSR agenda because they can incorporate resource limits (battery, bandwidth, team size) and adapt to changing mission geometry or agent failures. However, training is computationally expensive and typically performed offline. Ensuring that learned coordination policies remain safe, interpretable, and certifiable under realistic sensing and airspace constraints remains an open challenge [38,52].

3.2.3. Safe RL and Runtime Assurance

Standard RL optimizes expected return and does not inherently satisfy hard safety or resource constraints. Safe RL addresses this limitation by introducing constraint costs

c_{i} (s, a)

and enforcing bounds on their expected discounted sums. A constrained MDP can be written as

max_{π} J (π) s . t . J_{c_{i}} (π) = E [\sum_{t = 0}^{\infty} γ^{t} c_{i} (s_{t}, a_{t})] \leq d_{i}, \forall i,

(18)

where constraints may represent collision probability, minimum separation, or energy budget. Constrained policy optimization (CPO) and Lagrangian actor–critic approaches enforce these constraints approximately during training by optimizing a Lagrangian objective with dual variables [53,54].

A complementary and often more deployable strategy is to combine RL with shielding or runtime assurance: a safety filter monitors proposed actions and intervenes whenever the learned policy would violate certified constraints [55]. For UAVs, shields can be derived from control barrier functions or reachability analysis and wrapped around PPO- or SAC-trained controllers. More broadly, recent hybrid architectures combine otherwise unverified planners, especially learning-based policies, with certified supervisory layers that enforce hard safety constraints only when necessary. In such systems, the nominal planner provides adaptive or high-performance behavior, while runtime assurance, shielding, control barrier function filters, or reachability-based safety monitors ensure collision avoidance, safe separation, and control feasibility during execution. These approaches directly strengthen the “certifiably safe” pillar of RCSR by preserving the adaptability of RL-based planners while moving safety enforcement into a certifiable supervisory layer. However, systematic evaluation on real UAV platforms and integration with certification workflows remain limited [52,56].

A deeper challenge concerns the gap between academic safe RL methods and formal aviation certification standards such as DO-178C. [57].The methods discussed in this section—including PPO- and SAC-trained controllers—are deep reinforcement learning approaches that represent their learned policies as neural networks (typically actor and critic networks). These standards require traceable, deterministic software artifacts with verified coverage and requirements traceability—properties that are difficult to establish for such learned policies, even when augmented with shielding or control barrier functions. Shielding and CBF-based filters can reduce the frequency of unsafe actions during deployment, but they do not by themselves produce a certifiable safety case for the underlying learned policy. The internal representations of these neural network components are not amenable to exhaustive formal verification under current tools, making it difficult to provide the level of assurance demanded by aviation regulators. Bridging this gap will require closer engagement with certification authorities, development of interpretable or formally verifiable representations of learned policies, and integration of verification methods capable of reasoning about neural network components within broader certified system architectures. In the near term, the most credible path to deployment in safety-critical airspace remains a hybrid architecture in which a fully certified supervisory layer retains authority to override any unverified learned policy, and whose behavior can be independently validated to applicable aviation standards.

3.2.4. Evolutionary and Bio-Inspired Methods

Evolutionary and bio-inspired optimizers—including genetic algorithms (GA), particle swarm optimization (PSO), and ant colony optimization (ACO)—have also been explored for UAV path planning. These approaches evolve candidate paths to optimize multi-objective criteria such as path length, threat exposure, and fuel or energy consumption [58]. They are well-suited to complex nonconvex objective landscapes and are frequently used for offline route design or to tune parameters of other planners. However, their computational cost and limited online reactivity generally restrict their role in RCSR settings to offline optimization or hybrid warm-starting rather than standalone onboard planning [3].

3.2.5. Hybrid Learning–Planning Architectures and RCSR Alignment

Given challenges in sample efficiency, safety during exploration, and hard constraint enforcement, RL is increasingly deployed as part of hybrid architectures that combine learning with classical planning or optimization [3,52]. A common structure is to use a global planner (e.g., A*, RRT*, or mixed-integer optimization) to generate a constraint-respecting route, while a learned local policy adapts online to wind, moving obstacles, or model mismatch. RL has also been used to bias sampling in motion planners, propose warm-start trajectories for local optimization, or adjust waypoints and velocity profiles to reduce energy consumption [37,52,59,60]. In multi-UAV settings, learning can be combined with distributed optimization and edge/cloud offloading to handle computation-heavy policies under tight onboard resource budgets [38].

From an RCSR perspective, RL-based techniques are most promising when embedded within a broader safety and certification architecture. Learned policies provide adaptive behavior under uncertainty and resource variability, but they typically require additional layers—safe RL formulations, runtime assurance, control barrier functions, or reachability-based shields—to provide certifiable guarantees. Bridging high-performing RL policies with risk-calibrated, certifiably safe, resource-aware UAV operation therefore remains a central research challenge and a key opportunity for RCSR-oriented pathfinding.

3.3. Summary and RCSR Relevance

The algorithm families reviewed in this section each offer distinct advantages and limitations, and their suitability depends on the demands of the target application. Table 2 summarizes this perspective by providing a scenario-oriented selection guide that relates common UAV operating conditions to appropriate planning families.

Overall, classical algorithm families provide well-understood building blocks for RCSR by offering structure, heuristics, and—in some cases—formal optimality or bounded suboptimality guarantees. Their main gaps for deployable autonomy, addressed in later sections, include (i) calibrated models of uncertainty and risk, (ii) formal runtime safety layers, and (iii) systematic treatment of compute and energy budgets. Consequently, the most promising practical direction is toward hybrid planners that combine search or sampling with dynamically feasible trajectory generation and safety-enforcing runtime mechanisms, evaluated under realistic environmental and regulatory constraints. Taken together, the reviewed algorithmic families lay the groundwork for RCSR-compliant planning but individually satisfy only subsets of its three core pillars—risk calibration, certifiable safety, and resource awareness—highlighting the need for the integrated architectures and formal safety mechanisms discussed in subsequent sections.

4. Simulation Frameworks, Testbeds and Datasets

This section provides an integrated view of the experimental ecosystem supporting RCSR Path Planning for UAVs. The ecosystem spans simulation platforms, physical testbeds, and datasets, each addressing complementary aspects of deployable UAV autonomy. No single tool captures all dimensions of risk calibration, safety assurance, and resource constraints, making a combined evaluation pipeline essential.

Simulation enables controlled experimentation under uncertainty and resource limits, testbeds expose real-world system effects, and datasets support perception-driven risk estimation and benchmarking. Together, these components form the foundation for validating end-to-end RCSR pipelines.

4.1. Simulation Frameworks Supporting RCSR Path Planning

Simulation platforms vary in physical realism, sensor fidelity, uncertainty modeling capabilities, autopilot integration, scalability, and suitability for safety-critical benchmarking. Rather than relying on a single standard tool, the field uses a layered simulation ecosystem in which different platforms support different aspects of RCSR path planning.

Algorithm-centric simulators, such as MATLAB/Simulink (The MathWorks, Inc., Natick, MA, USA) [61], enable controlled evaluation of risk-aware cost functions, optimization strategies, and resource trade-offs. They are widely used in studies involving 2D/3D grid maps, meta-heuristic optimization, and comparative evaluation of planners such as A*, RRT [62], TSO [63], PSO [64], and related methods under risk-aware and resource-constrained objectives [65,66,67]. These environments typically employ simplified kinematic or dynamic UAV models, with trajectory outcomes visualized in 2D or 3D. Their primary advantage lies in rapid prototyping and fine-grained control over risk-weighted cost functions, safety constraints, and resource budgets. However, they provide limited realism in aerodynamics, sensing, and vehicle physics. As a result, MATLAB/Simulink is best suited for algorithmic benchmarking, convergence analysis, ablation studies, and exploration of risk-calibrated objective functions, but lacks realistic sensing and dynamics.

Physics-based robotics simulators, such as Gazebo [68], PX4 [69], and ArduPilot SITL [70], form a robotics-grade simulation stack for RCSR UAV path-planning research in safety-critical and resource-constrained 3D environments. Gazebo supports physics modeling, 3D environments, wind fields, and sensor plugins, while SITL executes the actual autopilot firmware. Together, these tools support end-to-end validation from planner to autopilot to dynamics, enabling evaluation under realistic flight dynamics, sensor noise, autopilot behavior, and environmental disturbances such as wind, while incorporating explicit safety and risk constraints [71,72,73,74].

Photorealistic simulators, such as AirSim [75], enable perception-driven RCSR path planning, particularly for vision-based navigation and deep reinforcement learning, by exposing planners to realistic sensing uncertainty and complex environments. AirSim provides APIs for Python and C++ and optional PX4-SITL integration, making it a useful platform for risk-aware vision-based navigation and for evaluating perception–planning pipelines, obstacle avoidance, and high-level autonomy under realistic camera and LiDAR conditions with explicit risk and safety constraints [76,77,78].

Flexible and RL-oriented environments, including Unity/Unity3D [79], Flightmare [80], and gym-pybullet-drones [81], support learning-based and multi-agent planning, emphasizing scalability and rapid experimentation under constrained resources. Unity/Unity3D [79] is a popular engine for building custom UAV simulation environments due to its flexibility, high-quality 3D rendering, and support for ML-Agents. It is especially attractive when custom and visually rich environments are required for RCSR experimentation. Unity/Unity3D is often used for swarm-based, heuristic, and learning-based RCSR path-planning research under safety-constrained objectives [82,83,84]. RL-oriented simulators like Flightmare [85] and gym-pybullet-drones are optimized for deep RL and multi-agent training with fast physics, simple APIs, and parallel environments. They enable efficient learning of safe, high-speed navigation and risk-aware path planning under safety constraints in RCSR settings [86,87,88,89].

Lightweight and autopilot-integrated tools, including jMAVSim [90] and Paparazzi NPS [91], enable efficient validation of control behavior, waypoint tracking, and onboard resource constraints. Although jMAVSim is less physically detailed than Gazebo or AirSim, it is widely used for validating controller performance, waypoint-following, and motion-planning algorithms under realistic onboard limitations [92,93]. Paparazzi NPS supports fixed-wing and rotorcraft platforms with realistic sensor noise, making it well suited for evaluating plan-level logic and waypoint-generation strategies for RCSR deployments under calibrated risk [94,95,96].

Multi-UAV and cloud-enabled platforms, including OpenUAV [97,98], UavSim [99], and SkyRover [100,101], enable large-scale coordination, mission-level evaluation, and resource-aware planning under shared environments. OpenUAV emphasizes scalability and system-level integration through containerized 3D simulation with PX4/ArduPilot, supporting swarm deployment and emerging capabilities such as vision-language navigation and risk-aware planning [102,103]. In contrast, UavSim focuses on algorithmic flexibility, offering plug-and-play planning modules for cooperative missions, with strengths in small-object detection and comparative evaluation of path-planning performance [99]. SkyRover targets cross-domain coordination by integrating UAVs and AGVs in ROS2–Gazebo environments, enabling standardized MAPF benchmarking with explicit modeling of constraints and collision dynamics [100].

Grid-based and custom simulators support algorithmic benchmarking, multi-agent conflict resolution, and controlled safety analysis by abstracting away full system complexity while preserving key planning constraints. Platforms such as V-REP/CoppeliaSim [104] provide moderate physics realism and motion-planning libraries for benchmarking coverage and indoor navigation performance [105,106], while MORSE [107] emphasizes sensor-driven simulation and conceptual design for perception and multi-UAV missions [108,109,110,111]. Aviones [112,113] focuses on fixed-wing dynamics and energy-aware planning with hardware-in-the-loop extensions. In contrast, grid-based simulators [114,115] prioritize discrete, structured environments to evaluate efficiency and conflict resolution in multi-agent settings. Additionally, other custom research simulators [82,116,117,118] target specific RCSR problems, ranging from swarm navigation and heuristic routing to RL-based collision avoidance and large-scale task allocation under explicit safety and resource constraints.

Overall, Table 3 highlights that no single simulator satisfies all RCSR requirements, reflecting an inherent trade-off between fidelity and scalability. High-fidelity platforms support safety validation under realistic conditions, whereas lightweight environments enable large-scale benchmarking and rapid experimentation. Consequently, the literature adopts a layered and complementary simulation ecosystem, where platforms are selected based on the maturity of the planning approach and the specific RCSR dimension being evaluated. Within this pipeline, simulators can be broadly categorized into three roles: (i) rapid algorithm prototyping environments (e.g., MATLAB), (ii) high-fidelity physics and perception simulators (e.g., AirSim, Gazebo), and (iii) scalable multi-agent or reinforcement learning (RL) training environments.

Qualitative ratings in Table 3 reflect the level of physical and sensing realism. Realism/Physics refers to how accurately the simulator models UAV dynamics, environmental interactions, and physical constraints. It ranges from low (simplified or discrete motion without aerodynamic effects), to moderate (basic rigid-body dynamics and collision handling), to high (physically grounded dynamics with environmental effects and controller integration). Sensor Fidelity reflects the realism and diversity of sensor outputs available for perception-driven planning. The values include minimal (no or abstract sensing), low (idealized outputs), moderate (configurable sensors with partial realism), high (realistic multi-modal sensing with noise and environmental interaction), and very high (photorealistic, physically consistent sensing suitable for perception learning and sim-to-real transfer).

4.2. UAV Testbeds Supporting RCSR Validation

While simulation is indispensable for scalable experimentation, physical UAV testbeds are critical for validating RCSR path-planning algorithms under real sensing, actuation, timing, communication, and resource constraints. Hardware testbeds expose planners to unmodeled dynamics, latency, packet loss, and sensor noise that are difficult to capture faithfully in simulation, and therefore play a key role in demonstrating robustness, risk calibration, certifiable safety, and deployability.

A large class of experimental platforms relies on indoor motion-capture systems that provide high-precision ground-truth pose estimates for small multirotors. Early influential examples include the GRASP multi-UAV testbed, which enabled coordinated flight, formation control, and cooperative transport under centralized planning. The Crazyswarm framework extends this paradigm using nano-quadrotors, supporting dense indoor swarms and rapid prototyping of multi-agent risk-aware and safety-constrained planning algorithms in safe, repeatable conditions. Such testbeds are particularly valuable for evaluating collision avoidance, formation changes, and coverage planning under bounded energy and communication budgets without the regulatory and weather constraints of outdoor flight.

Dedicated indoor facilities further extend these capabilities. TU Delft’s Cyberzoo and Swarming Lab provide configurable obstacle-rich environments for long-duration swarm autonomy, resource-aware coordination, and persistent coverage experiments. Related work on swarms of miniature drones highlights the importance of physical testbeds for validating exploration- and mapping-driven planning under severe sensing, computation, and risk-bound constraints.

Outdoor multi-UAV testbeds form a second major category, supporting search-and-rescue, inspection, and large-area coordination tasks. Platforms such as RISCuer and MUAVET enable evaluation of cooperative planning, task allocation, and path execution under realistic constraints including limited battery capacity, communication range, GNSS uncertainty, and airspace restrictions. These testbeds bridge the gap between laboratory-scale validation and safety-critical RCSR deployment.

4.3. Datasets for Risk-Aware and Safety-Critical Path Planning

Publicly available datasets play a central role in evaluating RCSR UAV path-planning, risk-aware navigation, safety-critical SLAM, and obstacle-avoidance algorithms under realistic sensing conditions. These datasets differ in sensing modality, environment type, and intended application, supporting different stages of the risk-calibrated and resource-aware planning pipeline.

As illustrated in Figure 4, we group influential datasets into five functional categories: (i) geospatial and aerial mapping datasets, (ii) vision-based perception datasets (with red boxes denoting object detection), (iii) SLAM and visual–inertial navigation datasets, (iv) disaster and forest inspection datasets, and (v) high-speed indoor navigation datasets. Sample images extracted from representative real-world UAV datasets are shown in Figure 5.

4.3.1. Geospatial and Aerial Image Datasets for Global Risk-Aware Path Planning

OpenStreetMap (OSM) [119] provides open geospatial data (e.g., roads, buildings, land use) widely used to construct realistic urban environments for high-level UAV path planning. Although not a UAV benchmark, it supports generation of georeferenced scenarios for urban routing, multi-UAV coordination, and safety-constrained missions [120,121,122,123,124]. Benchmarks such as SAREnv derive standardized search-and-rescue environments directly from OSM layers [125], enabling risk-aware planning under structured urban constraints. Massachusetts Road Dataset (MRD) is a key aerial-imagery dataset for road extraction and traversability segmentation, supporting perception-driven RCSR path planning. Beyond mapping, it is widely used to train segmentation models (e.g., U-Net, SegNet, LinkNet, D-LinkNet) that produce semantic cost maps feeding the perception layer of risk-calibrated planning pipelines [126]. These representations enable planners (e.g., URA* [127], multi-objective D* Lite [128], and A*/RRT*-based methods [127,129]) to define feasible corridors, encode safety margins, and incorporate environmental risk directly into path selection. pNEUMA/pNEUMA Vision provides large-scale drone-recorded urban traffic trajectories with high-resolution motion data for vehicles, pedestrians, and cyclists [130]. It is widely used for traffic-aware path planning, dynamic obstacle prediction, and risk-sensitive routing in dense urban environments [131,132,133]. Extensions such as pNEUMA Vision [134] further support integrated perception–planning pipelines for safety-critical RCSR evaluation.

4.3.2. Vision-Based Perception Datasets Supporting Navigation and Obstacle Avoidance

Vision-based datasets play a critical role in developing perception modules that support navigation and obstacle avoidance in RCSR UAV systems. VisDrone [135] provides large-scale urban aerial imagery for object detection and tracking under diverse conditions and is widely used to generate perception outputs (e.g., detection and tracking) that inform cost maps, dynamic obstacle prediction, and safety-aware trajectory generation. Similarly, UAVDT [136] offers annotated vehicle-centric aerial data with environmental attributes such as altitude, view angle, and weather, enabling perception-driven traffic monitoring and coordination pipelines that support risk-calibrated planning and multi-UAV routing [137,138,139]. The Stanford Drone Dataset (SDD) [140] provides detailed trajectories of humans and objects in complex outdoor environments, supporting research in dynamic-scene navigation, social-aware path planning, and prediction-based obstacle avoidance under explicit safety constraints [141,142]. In addition, KITTI [143], although originally collected from ground vehicles, is widely reused for UAV perception tasks such as depth estimation, obstacle detection, and scene understanding, with outputs frequently integrated into navigation, collision avoidance, and local planning modules [144,145,146].

4.3.3. Visual–Inertial and SLAM-Focused Datasets Used in UAV Navigation

Visual–inertial and SLAM-focused datasets provide essential perception and localization inputs for risk-aware path planning in RCSR UAV systems. EuRoC MAV [147,148] offers synchronized stereo imagery, IMU data, and precise motion-capture ground truth, making it a standard benchmark for evaluating VIO, SLAM, and pose estimation modules that support safe trajectory generation under uncertainty. The Zurich Urban MAV Dataset [149] captures UAV flights in dense urban environments, including narrow streets and GPS-denied areas, enabling research in global 3D path planning, skyline-based localization, and safety-constrained navigation in complex outdoor settings. PennCOSYVIO [150] provides visual–inertial sequences across challenging indoor–outdoor environments with complex geometries and varying lighting, supporting evaluation of VIO-guided navigation, global risk-aware planning, and reactive obstacle avoidance under safety constraints.

4.3.4. Synthetic Datasets and Simulation-Based Benchmarks

Synthetic datasets enable controlled, large-scale evaluation of planning algorithms in RCSR settings, particularly for DRL and multi-UAV systems where real-world experimentation is costly or unsafe. AirSim synthetic worlds [151] provide photorealistic environments for DRL navigation, obstacle avoidance, and risk-aware policy validation, while MATLAB/Simulink-generated environments support optimization-based planning through geometric safety constraints such as desired and forbidden regions [66,67]. Unity-generated environments enable flexible simulation of swarm navigation and collision avoidance under adjustable conditions [82], whereas Gazebo-based environments [100] supports safety- and resource-aware validation pipelines in structured indoor/outdoor scenarios. OpenUAV [97] and CityNav-style [152] datasets provide scalable synthetic environments and trajectory datasets, including vision-language navigation under operational constraints. Additionally, custom synthetic scenarios support specialized evaluations such as spatiotemporal coordination, stochastic risk modeling, and large-scale multi-agent task allocation under safety and resource constraints [115,117,118].

4.3.5. High-Speed, Agile, and Indoor Navigation Datasets

High-speed and indoor navigation datasets support evaluation of agile flight, time-critical planning, and collision avoidance under constrained RCSR conditions. The UZH-FPV Drone Racing Dataset [153] provides high-speed FPV flights with precise ground truth in cluttered indoor tracks, enabling research in time-optimal planning and DRL-based collision avoidance. The MIT Blackbird Dataset [154] offers high-frequency multi-camera and IMU data from aggressive quadrotor maneuvers, supporting perception-aware trajectory optimization and high-speed control with calibrated safety margins. Mini-drone indoor datasets [155,156] capture navigation in tight indoor spaces such as corridors and warehouses, enabling evaluation of DRL-based obstacle avoidance, waypoint tracking, and constrained navigation.

4.3.6. Disaster, Forest, and Inspection Datasets

Disaster, forest, and inspection datasets support environment-specific evaluation of risk-aware planning in RCSR UAV applications. Disaster and SAR aerial datasets [157,158] capture scenarios such as wildfires, collapsed structures, and victim search, enabling research in coverage-based search, uncertainty-aware replanning, and DRL-based search strategies. Forest navigation datasets [159,160] provide sequences in dense vegetation, supporting low-altitude obstacle avoidance, navigation through narrow gaps, and environment-aware replanning under perception and safety constraints. DroneDeploy mapping datasets [161] offer high-resolution orthomosaics and elevation models from real-world environments, enabling evaluation of coverage planning, inspection trajectory design, and altitude-constrained navigation under resource-aware conditions.

To summarize the above, the diversity of publicly available UAV datasets can be seen clearly in Table 4, which consolidates representative datasets by data content, indoor/outdoor coverage, and capture method. Taken together, Figure 4 and Figure 5, and Table 4 illustrate how datasets support different layers of the RCSR autonomy pipeline—global routing and map priors, perception modules for detection/tracking, localization/mapping prerequisites, and high-speed reactive navigation under explicit safety and risk constraints.

4.4. Evaluation Methods and Metrics

Evaluation in RCSR UAV path-finding research varies widely depending on each study’s focus—from multi-agent coordination and optimization to autonomy, risk calibration, efficiency, or realism under resource constraints. The reviewed works demonstrate a blend of algorithmic performance metrics, mission-specific indicators, and, in some cases, real-world validation to assess scalability, reliability, and computational feasibility. As summarized in Figure 6, RCSR UAV path-planning metrics can be categorized into five major groups based on what aspect of performance they measure, while Table 5 provides concise definitions to support consistent reporting and comparison.

1.: Path Optimality Metrics: assess how efficiently UAVs navigate from start to goal, considering distance, trajectory quality, and energy efficiency. Examples include path quality [82], path length [66,67,116,118], and goal distance/distance to goal [66,67].
2.: Computational Performance Metrics: measure how efficiently algorithms compute paths and handle system resources. Examples include computation time/runtime [82,100,116,118], computation efficiency [117], memory use [82], and convergence [67,117].
3.: Safety and Collision Metrics: evaluate whether UAVs maintain safe separation and avoid obstacles or restricted regions. Examples include collision rate/collision avoidance [100,117], conflict rate [118], and forbidden region (FR) penalty [66,67].
4.: Information- and Mission-Based Metrics: evaluate how well UAVs accomplish domain-related goals beyond path geometry. Examples include information gain/collected information [66,67], average waiting distance (AWD) [116], charging count [116], total delay [115], and number of rejections [115].
5.: System-Level and Scalability Metrics: capture overall robustness, fairness, adaptability, and ability to scale across environments or agents. Examples include scalability [82,100,118], fairness [115], success rate [100], and transferability [117].

Figure 6. Categories of Metrics Used in RCSR Path-Finding Simulation.

Table 5. Definition of key metrics.

Metric	Definition/Purpose
Path Quality	Total cost or efficiency of the planned path (often shortest or smoothest).
Path Length	Sum of distances between all consecutive waypoints, reflecting total travel distance.
Goal Distance	Euclidean distance between UAV’s final position and target endpoint, indicating path accuracy
Computation Time	Time required to generate a valid or optimal solution.
Computation Efficiency	Inverse of average computation time; measures algorithmic responsiveness in dynamic environments
Memory Use	Total memory consumed during the path computation process.
Convergence	Number of iterations needed for optimization or learning algorithms to stabilize.
Collision Avoidance	Number or rate of UAV collisions in simulation or physical tests.
Conflict Rate	Fraction of conflicting or overlapping trajectories among UAV pairs.
Forbidden Region Penalty	Quantifies penalties or violations for entering restricted or unsafe zones.
Average Waiting Distance	Average distance between UAV and assigned service target during waiting periods (e.g., rescue or delivery missions).
Charging Count	Number of visits to charging stations during a mission, indicating operational endurance.
Total Delay	Cumulative delay experienced by UAVs due to traffic conflicts or rescheduling.
Number of Rejections	Count of denied flight operations resulting from airspace congestion.
Scalability	Evaluates system performance as the number of UAVs or tasks increases.
Fairness	Assesses the equitable distribution of costs or delays across service providers or agents.
Information Gain	Quantifies the amount of sensor data or imagery collected during the mission
Success Rate	Percentage of successful missions or path completions without collisions.
Transferability	Measures how well a trained simulation model performs in real-world conditions.

4.5. Summary and RCSR Relevance

Across simulation frameworks, testbeds, datasets, and evaluation metrics, the RCSR ecosystem reflects a layered validation pipeline that spans controlled abstraction to real-world deployment. Simulation platforms enable systematic evaluation of risk calibration, safety constraints, and resource trade-offs under controlled and repeatable conditions, while datasets support perception, localization, and uncertainty modeling that directly influence planning decisions. In contrast, physical testbeds expose planners to tightly coupled system effects—including sensing noise, control limitations, communication delays, and environmental disturbances—that are often abstracted in simulation. Evaluation metrics further provide a unified framework for quantifying performance across path optimality, safety, robustness, computational efficiency, and transferability.

From an RCSR perspective, these components collectively support the four core pillars of deployable UAV autonomy. Simulation and synthetic environments enable controlled risk modeling and scalable benchmarking, while high-fidelity and SITL/HIL platforms allow validation of certifiable safety under realistic dynamics and control. Datasets provide measurable uncertainty in perception and mapping that propagates into risk-aware planning, and testbeds verify whether these objectives remain valid under real operational constraints. However, across the reviewed literature, validation remains predominantly simulation-based, with relatively few studies demonstrating comprehensive real-world flight experiments. Many works rely on simulation or intermediate validation stages (e.g., SIL/HIL or indoor testbeds), which, while valuable, do not fully capture operational conditions.

Despite strong performance in simulation, several studies report challenges in real-world deployment, including failures due to perception noise, GPS drift, and unmodeled environmental disturbances. In particular, RL-based planners often struggle under real-world uncertainty and distribution shifts, where assumptions made during training (e.g., ideal sensing or full state observability) no longer hold. Prior work shows that policies trained in simulation can degrade significantly when deployed with real sensor inputs or under out-of-distribution conditions, highlighting a fundamental sim-to-real gap [162,163].

This imbalance highlights a critical gap between algorithmic performance and deployable autonomy, underscoring the need for more systematic real-world validation to ensure that risk-aware, certifiably safe, and resource-aware planning objectives hold under practical flight conditions.

5. Uncertainty-Aware Planning and Formal Safety Assurance

Real-world UAV operations face uncertainty that directly impacts both safety and mission performance. Three major sources dominate practical deployments. First, imperfect perception means that onboard sensors (vision, LiDAR, GPS/IMU, radar) provide incomplete, noisy, or delayed information about the vehicle state and surrounding obstacles due to sensing limitations (e.g., field of view, range) and adverse environments (e.g., occlusion, low texture, lighting variation). Second, dynamic environments introduce time-varying obstacles and traversability: people, vehicles, animals, and other UAVs move, while environmental elements such as vegetation and water can change the effective free space over time. Third, stochastic disturbances (wind, turbulence, temperature and humidity effects, precipitation) perturb the UAV dynamics and can also degrade sensing, creating coupled uncertainty sources.

These factors are rarely independent. For example, wind can deflect the UAV state while simultaneously causing motion blur and localization drift, increasing perception uncertainty. As a result, deterministic planners that assume static, fully known maps and disturbance-free dynamics can be unreliable in real deployments, especially when uncertainty sources compound. Modern UAV path-planning therefore integrates probabilistic state estimation, prediction, and constraint-handling mechanisms, and often pairs planning with runtime safety layers to maintain safety under model mismatch. The following subsections summarize the dominant technical approaches used to address each uncertainty source. Figure 7 represents these categories and the principal methods used to address them.

5.1. Imperfect Perception

A core question in safe planning is where the UAV and surrounding obstacles actually are at any instant. In practice, localization and mapping must be inferred from noisy sensors, motivating probabilistic state estimation pipelines [164,165,166]. Rather than relying solely on a point estimate, many systems maintain a distribution (or at least a covariance) over the vehicle state and propagate this uncertainty into planning decisions.

Accounting for uncertainty magnitude is critical. Ignoring sensor noise, mapping error, and disturbance-induced drift can produce trajectories that appear safe under nominal estimates but are unsafe in the true state [167]. Accordingly, modern planners incorporate uncertainty through mechanisms such as inflating obstacles, enforcing chance constraints (e.g., bounding collision probability), and planning in belief space [167,168], where actions are chosen to both advance toward objectives and reduce future uncertainty.

A widely used backbone for state estimation is Kalman-style filtering, including the extended Kalman filter (EKF) and unscented Kalman filter (UKF) [169]. A standard negative log-likelihood objective for the measurement update can be written as

J_{k} = \frac{1}{2} {(z_{k} - {\hat{z}}_{k})}^{⊤} S_{k}^{- 1} (z_{k} - {\hat{z}}_{k}) + \frac{1}{2} log det S_{k},

(19)

where

z_{k}

is the measurement at time step k,

{\hat{z}}_{k}

is the predicted measurement mean, and

S_{k}

is the residual covariance. EKF uses Jacobian linearization and is typically appropriate when local linear approximations are adequate, while UKF uses sigma points to better capture nonlinearities, often at higher computational cost.

Uncertainty-aware planning also depends on estimating the environment. Probabilistic mapping methods such as occupancy grid mapping [170] have been extended with multi-sensor fusion (IMU, GPS, LiDAR, cameras) to improve robustness under partial observability [171]. These maps then support downstream cost fields and collision checking for planning.

Belief-space planning extends classical planning by optimizing over distributions. A common objective minimizes a scalar function of covariance across a horizon:

min_{u_{0 : T - 1}} J = \sum_{k = 0}^{T} ϕ (P_{k}), ϕ (P_{k}) = log det (P_{k}),

(20)

where

P_{k}

is the predicted state covariance at time k, T is the planning horizon, and

u_{0 : T - 1}

is the control sequence. More recent belief-space formulations explicitly incorporate measurement informativeness and sensing conditions [172]. One representative form replaces covariance-only penalties with residual-based costs:

min_{u_{0 : T - 1}} J = \sum_{k = 0}^{T} [\frac{1}{2} ν_{k}^{⊤} S_{k}^{- 1} ν_{k} + \frac{1}{2} log det (S_{k})],

(21)

with

\begin{matrix} ν_{k} & = z_{k} - {\hat{z}}_{k}, \end{matrix}

(22)

\begin{matrix} S_{k} & = H_{k} P_{k | k - 1} H_{k}^{⊤} + R_{k} (EKF), \end{matrix}

(23)

\begin{matrix} S_{k} & = \sum_{i} W_{i}^{(c)} (γ_{i, k} - {\hat{z}}_{k}) {(γ_{i, k} - {\hat{z}}_{k})}^{⊤} + R_{k} (UKF), \end{matrix}

(24)

where

ν_{k}

is the measurement residual and

S_{k}

is the residual covariance. In this view, routes that preserve measurement quality (e.g., maintaining informative visual features rather than flying through low-light or texture-poor regions) may be preferred over purely shortest-distance paths.

5.2. Dynamic Environments

Even with accurate state estimation, real-world UAVs operate in environments where obstacles and traversability change over time. Humans, vehicles, animals, and other UAVs can enter or exit the flight corridor, while environmental elements such as foliage and rocks can alter the effective free space. Modern approaches commonly address dynamic environments using a combination of (i) belief-space reasoning over obstacle states, (ii) receding-horizon re-planning via model predictive control (MPC), (iii) explicit obstacle prediction, and (iv) chance-constrained safety envelopes.

Belief-space methods extend naturally to dynamic obstacles by maintaining uncertainty over obstacle states (position, velocity, intent) and penalizing proximity according to uncertainty and predicted motion [168]. In practice, predictable entities are assigned tighter safety margins than uncertain or erratic ones, reflecting different risk levels in the planning objective.

A second widely used mechanism is model predictive control (MPC), which repeatedly solves a short-horizon planning or trajectory-optimization problem using the latest state estimate and environment model, executes only the first control action, then re-plans after receiving new measurements [173,174]. This receding-horizon structure enables rapid response to changes but requires balancing re-planning frequency against onboard compute limits and sensing latency. MPC is particularly effective in cluttered or densely populated scenes, where stale plans quickly become invalid.

Explicit prediction of dynamic obstacles produces forecasts of obstacle occupancy over a time horizon. A common pipeline first classifies an observed entity (e.g., pedestrian, vehicle, UAV) and then applies an appropriate motion model (e.g., constant-velocity/acceleration) with filtered state estimates. Kalman-style predictors are frequently used for this purpose by leveraging the same residual/covariance machinery as in (19). More advanced predictors use learned models (e.g., recurrent networks or Gaussian processes) to capture nonlinear or context-dependent motion patterns, often improving empirical performance in crowded scenes.

Other strategies used to deal with dynamic obstacles include sensor-aware planning, which focuses on keeping dynamic obstacles in sensor view in order to monitor their movement [175], as well as sensor training optimization, which focuses on streamlining object recognition for specific sensors so they are both effective and efficient in computation for real-time aware systems [176].

Finally, chance-constrained planning encodes safety requirements probabilistically, enforcing constraints such as collision probability below a threshold rather than purely geometric clearance. Chance constraints are most commonly applied as an added safety layer on top of belief-space planning or MPC, where spatial margins adapt based on obstacle uncertainty, classification, and predicted motion. This probabilistic framing provides a principled way to trade path efficiency against risk in time-varying environments.

5.3. Stochastic Disturbances

A third category of elements that impact the certainty of path planning approaches are stochastic disturbances. Broadly speaking, stochastic disturbances are defined to be relatively random and/or unpredictable effects that can alter the UAV’s motion, sensing, or surroundings. “Stochastic” here implies that these effects are probabilistic; for that reason, they are best represented using probability distributions. Because UAVs must effectively react to any and all challenges in real-time, they must be able to adjust to stochastic disturbances. The most common examples of disturbances, along with popular ways to mitigate them, are addressed below.

The most traditional (from a conceptual standpoint) form of disturbance is one that directly impacts the UAV’s motion in flight, meaning it may affect the trajectory of the UAV to its destination. Wind gusts and turbulence provide the most typical and critical forms of motion disturbance, as a sudden gust can completely change not only the position of the UAV but its heading as well [177]. Even lesser but constant turbulence can slowly distort the actual trajectory from the planned one. Other less common, but still important motion disturbances include motor and sensor noise. As with turbulence, incorrect positional or bearing readings can cause deviation from initially planned paths.

There are several measures taken to mitigate motion disturbances, but the most common approach is to introduce additional parameters into belief-space calculation to account for the potential impact of each motion effect. Under specific circumstances, these parameters may be computed using sophisticated approaches that take into account the characteristics of the UAV and the expected wind direction/magnitude. However, when this computation is too expensive or unavailable, the more general-purpose approach is to employ a standard distribution to model the disturbance. The most common of these relies on a traditional Gaussian process [178,179], which is normally modeled as

p (w) = \frac{1}{\sqrt{2 π σ^{2}}} exp (- \frac{w^{2}}{2 σ^{2}})

(25)

where w and

σ

are preset values or computed using basic information about the UAV and environment (UAV mass, expected wind gust speeds, etc.)

A second category of disturbances is one that involves UAV sensors. In this context, it is assumed that the ability of the UAV to reliably collect information about its position and environment will be negatively impacted as disturbances render the sensors less reliable. Examples of these disturbances include atmospheric effects, such as humidity, which can negatively impact virtually any sensor type (be it visual, electronic, pressure-based, etc.). In addition, effects that impact individual sensors are a concern, with these including electronic noise—which disproportionally affects LiDAR and GPS-based sensors, as well as photonic or light-based noise, which disproportionally affects sensors reliant on conventional imaging of the environment [179].

As sensor noise effects are highly varied and have disparate effects on the different types of sensors used, there is no singular solution to dealing with these types of disturbances. Attempts at mitigation may be preventative in nature—deployed during the planning phase of flight, or ameliorative—deployed in real-time to account for diminished reliability of sensors. The former approach might include additional parameters in belief-space planning that incorporate sensor disturbances. For example, a path that involves travel through a region with known electronic noise or interference would carry a heavy objective function penalty. Real-time mitigation might instead reduce the weight attached to sensors known to be under diminished capacity. For example, bright light or glare may reduce the weight given to traditional visual sensors mid-flight in comparison to GPS-based sensors.

The last category of disturbances to consider are those that affect the surroundings around the UAV, as opposed to the UAV or its sensors. These are most often referred to as environmental stochastic disturbances and are among the trickiest to deal with in practice. Sample environmental disturbances include variable surfaces, such as bodies of water that shift in response to external factors like wind, as well as terrain that may shift in response to stimuli, such as tree branches that move in response to a nearby entity.

While environmental disturbances overlap with the previously addressed subject of dynamic environments, the term disturbance is generally provided to indicate a phenomenon unpredictable to model using a projected path of motion. For that reason, when scientists or engineers attempt to incorporate these disturbances in path planning under the belief-space model, they are far more likely to incorporate parameters with uncertainty, such as those in (25), or to incorporate a technique known as stochastic model predictive control (SMPC) [180]. In a nutshell, SMPC is an approach to real-time UAV control that both incorporates calculations using parameters with uncertainity into the belief space, while also adjusting the interval for re-planning flight paths to account for proximity to disturbances. For example, a UAV traveling close to a water surface may re-plan its flight path much more frequently mid-control sequence than one traveling in an open space.

Another recent approach to dealing with environmental disturbances employs a corrective strategy during real-time planning. Prominent among these is the backstepping sliding mode method [181] seeks to approximate unknown parameters attached to environmental disturbances and produce a so-called anti-interference link to counteract the impact of the external environment on the UAV’s project flight path. The goal is to, over time, produce an adaptive correction to effectively stabilize UAV flight without needing to know prior parameters or effects.

5.4. Summary and RCSR Relevance

There are three principal uncertainty sources in UAV path planning: imperfect perception, dynamic environments, and stochastic disturbances. Imperfect perception arises from noisy and limited sensors, requiring probabilistic state estimation methods such as EKF and UKF, together with belief-space planning, to propagate uncertainty into planning decisions. Dynamic environments, where obstacles and traversability change over time, can invalidate otherwise effective flight paths and are commonly addressed through receding-horizon replanning via MPC, obstacle prediction models, and chance-constrained safety envelopes. Stochastic disturbances include wind and turbulence affecting motion, atmospheric and electronic effects degrading sensors, and unpredictable environmental changes such as shifting water surfaces. These are often mitigated through Gaussian disturbance models, real-time sensor re-weighting, and stochastic MPC that adapts replanning rates to nearby disturbance sources.

Across all three categories, a recurring trade-off is that making the planner more uncertainty-aware generally improves safety and robustness but also increases computational and sensing burden on platforms with limited onboard resources. In addition, not all solutions apply equally well to every uncertainty source, particularly in the case of stochastic disturbances, which are often too broad and varied to be addressed by a single universal approach. Table 6 provides a consolidated summary of the principal uncertainty sources, the trade-offs they introduce, and representative mitigation strategies.

These uncertainty sources correspond directly to the core pillars of the RCSR framework. Risk calibration depends on probabilistic state estimation and belief-space planning, since without quantified uncertainty, risk cannot be meaningfully computed or enforced. Certifiable safety is supported by chance-constrained planning and runtime safety layers that help manage dynamic environments and stochastic disturbances through formal probabilistic guarantees such as bounded collision probability. Resource awareness is relevant across all three uncertainty categories, because every improvement in uncertainty handling, including richer state distributions, more frequent MPC replanning, or disturbance parameter estimation, tends to increase computational and sensing demands on limited onboard platforms. Finally, the interdependence of these uncertainty sources reinforces the central premise of the RCSR framework: uncertainty, safety, and resource constraints must be addressed jointly within an integrated planning architecture.

6. Real-World Environment Constraints

We previously examined how path planners can address uncertainty through probabilistic state estimates to counter imperfect perception, dynamic obstacle prediction, and stochastic disturbance modeling. It’s key to note that these techniques do not operate in isolation. Each uncertainty-aware method must ultimately contend with the practical realities of deployment—that is, the sensors that feed state estimators have finite range and resolution, the onboard processors running belief-space optimization have limited compute budgets, energy constraints bound how long and how aggressively a UAV can maneuver, and airspace regulations limit where a UAV can legally fly. In other words, the counters to uncertainty developed previously are only as strong as the real-world resources and operational boundaries that support them. These relationships are often circular: unreliable perception by sensors degrades the accuracy of maps being produced during flight, and lack of map quality in turn increases our reliance on sensors in real-time to mitigate unreliable information.

For the reasons outlined above, we must therefore consider not just the mathematical models underpinning planning for uncertainty but also the engineering and regulatory constraints that shape what planners can actually execute in practice. This section focuses on the latter concern, wherein we consider that UAVs must operate with mapping constraints and limits, time-varying disturbances, strict energy and onboard compute budgets, and evolving airspace rules. These constraints are interdependent: mapping limitations can increase risk margins and energy use; wind and turbulence can amplify localization error; and regulatory compliance can restrict feasible corridors, thereby increasing planning complexity. This section reviews four practical constraint dimensions: (i) perception and mapping, (ii) environmental effects and resource limits, (iii) robustness and adaptation, and (iv) airspace and regulatory compliance, together with mitigation strategies reported in the literature.

6.1. Perception and Mapping Constraints

Perception and mapping underpin real-world UAV planning because representation fidelity and update rates directly influence feasibility, optimality, and safety. Accurate maps support obstacle avoidance and enable energy-aware and certifiably safe trajectory generation under limited onboard resources.

6.1.1. Occupancy Grids and Their Limits

Occupancy grid mapping remains widely used because of its probabilistic formulation and computational tractability. The environment is discretized into voxels, each assigned an occupancy probability

p (o_{i})

updated via Bayesian filtering:

p (o_{i} ∣ z_{1 : t}) = \frac{p (z_{t} ∣ o_{i}) p (o_{i} ∣ z_{1 : t - 1})}{p (z_{t} ∣ o_{i}) p (o_{i} ∣ z_{1 : t - 1}) + p (z_{t} ∣ \neg o_{i}) (1 - p (o_{i} ∣ z_{1 : t - 1}))},

(26)

where

z_{t}

is the observation at time t. Such representations have been used in applications ranging from autonomous pollination [182] to cooperative UAV–UGV navigation under degraded perception [183]. However, occupancy grids impose a resolution trade-off: coarse grids can yield overly conservative routes, whereas fine grids increase memory and computational demands that may exceed the capabilities of small UAV platforms.

6.1.2. Distance Fields and Local Replanning

To address the limitations of grid resolution, continuous distance-field representations are commonly used for local planning and trajectory optimization. Truncated signed distance fields (TSDFs) and Euclidean signed distance fields (ESDFs) encode the distance from a query point x to the nearest obstacle:

d (x) = min_{o \in O} {∥ x - o ∥}_{2},

(27)

where

O

is the obstacle set. ESDFs provide smooth gradients that can be directly exploited by optimization-based planners to generate dynamically feasible, collision-free trajectories. These fields are especially valuable in cluttered or dynamic environments, where fast local replanning must respond to evolving obstacle boundaries. Recent work has also integrated semantic segmentation into mapping pipelines to distinguish traversable regions from semantically meaningful obstacles such as vegetation, buildings, and restricted structures [184].

In many practical planning systems, obstacles are still represented using simplified geometric abstractions such as spheres, boxes, or occupancy voxels because these models are computationally efficient and integrate naturally with collision checking. However, real-world environments often contain obstacles with irregular geometry and semantic meaning, such as trees, buildings, poles, or restricted infrastructure, which may imply different levels of risk and different operational constraints. Incorporating semantic information into planning allows obstacle classes to be associated with class-specific safety margins, traversal penalties, or exclusion zones, while richer geometric representations such as meshes, signed distance fields, and semantic occupancy maps can better capture complex obstacle boundaries. These capabilities are especially important in cluttered urban and natural environments, where safe and deployable planning depends not only on obstacle location but also on obstacle type, structural complexity, and operational meaning. Hybrid pipelines, such as B-spline trajectory refinement layered over occupancy maps, further improve smoothness and robustness in the presence of dynamic obstacles [185].

6.1.3. Sensor Modality and Compute Constraints

Mapping choices are shaped by sensor payload, power availability, and onboard compute capacity. LiDAR provides accurate 3D structure but is heavier and more power intensive than camera-only configurations. Visual–inertial odometry and monocular depth estimation offer lightweight alternatives, but they are more sensitive to lighting, texture, and motion blur. Event-based cameras can provide low-latency perception in high-dynamic-range or low-light conditions, thereby supporting agile flight. Multi-modal fusion of LiDAR, camera, and inertial sensing is increasingly used to mitigate single-sensor failures in complex terrain [186].

6.1.4. Scalability and Adaptive Mapping

Scalable mapping remains a bottleneck for long-duration missions and dense 3D environments. Adaptive mapping strategies adjust voxel resolution, update frequency, or region-of-interest updates to balance representation fidelity against limited CPU/GPU budgets while preserving real-time feasibility on embedded platforms [13]. Recent surveys emphasize the need to evaluate perception constraints jointly with environment complexity, resource limits, and certification requirements [14]. Distributed mapping across multiple UAVs has also been explored, although bandwidth, synchronization, and regulatory acceptance remain open challenges.

6.2. Environmental Effects and Resource Limits

Real-world UAV deployments are constrained by both environmental disturbances and limited onboard resources. Wind, turbulence, and adverse weather perturb nominal trajectories and typically increase energy expenditure, motivating disturbance-aware models and compensation strategies. For example, Fan et al. [59] incorporated dynamic wind and extreme weather effects into swarm planning to reduce mission failures, while learning-based approaches can predict disturbance patterns and adapt trajectories online [37].

Energy limitations are a dominant constraint because UAV endurance depends on payload, climb rate, and maneuver aggressiveness. Energy-aware planning often incorporates explicit consumption models to ensure safe return-to-launch or diversion to a contingency landing site. A common formulation expresses energy use as

E = \int_{0}^{Δ t} P (t) d t, P (t) \approx T (t) v (t),

(28)

where

P (t)

is instantaneous power,

T (t)

is thrust, and

v (t)

is velocity. Surveys emphasize embedding such models into planning costs to maintain energy-efficient operation in complex environments [187,188]. This consideration is especially relevant for delivery missions, where recharge or battery-swap stops may need to be integrated into long-range routing [187].

Compute constraints further limit algorithmic choices. Many UAVs rely on embedded processors, such as Jetson-class platforms, which cannot support frequent global replanning using heavy nonlinear optimization or large-scale reinforcement learning. Consequently, hierarchical architectures are common: lightweight global planners provide coarse, constraint-respecting routes, while local modules refine trajectories in real time [3,15]. Offloading and distributed strategies allow swarms to share compute resources or use edge/cloud servers for updates [38], but they introduce latency, bandwidth dependence, and additional safety concerns, particularly in urban or contested airspace [189].

6.3. Robustness and Adaptation Strategies

Robust path planning requires mechanisms that adapt to uncertainty, disturbances, and unmodeled changes during flight. Offline plans can become unsafe or inefficient when exposed to real-world variability, thereby motivating online replanning and adaptation strategies that explicitly respect safety and resource constraints.

A major direction combines adaptive mapping with hierarchical planning. Frontier-based exploration can adjust voxel-map resolution and update rates in response to compute load, maintaining real-time feasibility without sacrificing critical environmental detail [190]. Similarly, hybrid planners decompose the problem into a global planner and a local replanner that reacts to dynamic regions [3].

Disturbance-aware planning further improves robustness. Real-time wind estimation integrated into optimization reduces deviation and energy use [59]. Reinforcement learning can complement model-based methods by learning policies that compensate for disturbances during execution [37]. At the swarm level, adaptive coordination is used to maintain safety and mission progress under multiple threats, including weather, interference, and adversarial conditions [49].

Regulatory and safety compliance during adaptation is often enforced through runtime monitors that impose constraints such as no-fly zones, Remote ID requirements, minimum battery reserves, and safe separation. Runtime assurance architectures enable switching to certified fallback controllers when the nominal planner fails or resource limits are exceeded [191].

Machine learning also supports robust adaptation through multi-agent learning and meta-heuristic strategies that generalize across environments and reallocate tasks under uncertainty [38,192]. Recent approaches such as RAPID incorporate robust reward design and inverse reinforcement learning to improve safety under distribution shift [56]. Overall, robustness emerges from combining online replanning, disturbance compensation, compliance monitoring, and learning-based adaptation in a unified architecture.

6.4. Airspace and Regulatory Constraints

UAV path planning must satisfy airspace regulations that constrain altitude, geography, access permissions, and allowable modes of operation. No-fly zones (NFZs), geofencing boundaries, and controlled airspace classifications impose hard feasibility constraints that planners must respect to ensure both legality and operational safety. Earlier approaches often modeled these restrictions as static forbidden regions, whereas more recent work has emphasized regulation-aware planning strategies that update constraints during flight and incorporate them directly into decision-making [193].

Regulatory requirements also vary substantially across jurisdictions, which complicates standardized planning, validation, and certification [194]. In some settings, the primary emphasis remains on altitude limits, visual line-of-sight rules, and restricted-zone avoidance. In others, growing attention is being directed toward emerging operational requirements such as UAS Traffic Management (UTM) and the integration of UAVs into urban air mobility corridors [195]. These developments are especially important for large-scale and shared-airspace operations, where planners must account not only for static restrictions but also for coordinated traffic management and evolving operational policies.

Digital low-altitude airspace infrastructures increasingly combine geofencing, obstacle mapping, traffic coordination, and policy enforcement to support scalable UAV deployment in urban environments [18,196]. Within this context, Remote ID and UTM-related mechanisms are becoming central to practical UAV operations because they support identification, traceability, conflict management, and compliance monitoring during flight. As a result, planners must be capable of responding to dynamic regulatory updates in near real time rather than relying solely on precomputed constraint maps. Palmerius et al. [197] illustrated this shift through route planning in flexible airspace designs where regulatory constraints are embedded directly into mission planning.

Risk-aware planning models further extend this perspective by incorporating regulatory compliance into optimization objectives through estimates of collision risk, communication reliability, and violation probability [198]. Urban airspace monitoring and infrastructure-aware planning likewise support routine UAV operations under these constraints [185]. In parallel, safety architectures that maximize throughput while enforcing regulatory limits suggest that efficiency and compliance can be optimized jointly rather than treated as competing objectives [199]. Overall, these developments show that deployable UAV path planning must address not only collision-free navigation but also real-time compliance, traceability, and scalable coordination under evolving airspace management requirements.

6.5. Summary and RCSR Relevance

Real-world UAV path planning is shaped not only by algorithmic capability but also by sensing fidelity, environmental disturbances, resource availability, adaptive robustness, and regulatory compliance. Perception and mapping determine how accurately the environment can be represented, while wind, energy, and compute limitations constrain what can be executed safely in practice. Robustness and adaptation mechanisms help planners remain effective under uncertainty, and airspace regulations define the legal and operational boundaries of deployment.

These practical considerations introduce recurring trade-offs across the literature. Higher-fidelity perception and richer environmental awareness can improve safety and planning accuracy, but they often increase memory, energy, and computational demands. Similarly, more adaptive and learning-enabled methods can enhance responsiveness under uncertainty, yet they are typically harder to validate and certify for safety-critical deployment. Table 7 provides a consolidated summary of the principal real-world constraints affecting UAV path planning, the trade-offs they introduce, and the representative mitigation strategies adopted in prior work.

Together, these constraints correspond directly to the core pillars of the RCSR framework. Risk calibration depends on accurate perception, environment modeling, and disturbance awareness, since uncertainty in sensing and operating conditions directly affects how risk can be estimated and managed. Certifiable safety is reflected in the need for robust adaptation, runtime monitoring, fallback mechanisms, and compliance with operational constraints such as no-fly zones, Remote ID, and UTM-related requirements. Resource awareness remains central because improvements in mapping fidelity, replanning frequency, and adaptive capability often increase onboard computational load, sensing demands, and energy consumption. Finally, the interaction among perception limits, environmental effects, adaptation needs, and regulatory constraints reinforces the central premise of the RCSR framework: deployable UAV planners must be designed not only for nominal optimality but also for resilience, safety, efficiency, and regulatory compatibility in real operational environments.

7. Future Research Directions

Despite substantial progress in UAV path planning, a persistent gap remains between algorithmic advances and reliable real-world deployment. Much of the literature optimizes individual objectives such as path optimality, collision avoidance, or computational speed without jointly addressing uncertainty, safety guarantees, multi-agent coordination, regulatory compliance, and onboard resource limits [3,4]. Consequently, methods that perform well in controlled simulations often degrade when exposed to real-world disturbances, sensing errors, and operational constraints.

The proposed Risk-Calibrated, Certifiably Safe, Resource-Aware (RCSR) framework provides a structured perspective for identifying research directions that prioritize deployable autonomy rather than purely benchmark-oriented performance improvements. Table 8 summarizes the primary limitations observed in current UAV planning systems and outlines key research directions required to enable safe, scalable, and deployable UAV autonomy.

7.1. Risk Calibration Under Uncertainty

Many path planners still assume deterministic or fully known environments and compensate for uncertainty using heuristic safety margins [1,2]. In operational deployments, however, uncertainty arises from perception errors, localization drift, wind disturbances, actuator saturation, and intermittent communications. These uncertainties propagate through the planning pipeline and can lead to unsafe trajectories if not explicitly modeled.

Future research should therefore emphasize quantitative risk calibration, where uncertainty is explicitly modeled and translated into interpretable and enforceable risk metrics. Belief-space planning and probabilistic roadmaps with uncertainty propagation provide promising foundations for such formulations [201]. Similarly, chance-constrained and distributionally robust optimization methods can enforce probabilistic safety guarantees under bounded uncertainty. Another important direction is the incorporation of semantic obstacle understanding into risk calibration, so that planners can distinguish among obstacle classes with different geometric complexity, operational meaning, and risk profiles rather than relying only on simplified geometric abstractions.

A key open problem is the development of lightweight uncertainty-aware planners that remain tractable under strict real-time and onboard compute constraints. Another important research direction is the integration of perception quality into planning risk models so that risk budgets adapt dynamically to sensing reliability and environmental conditions.

7.2. Certifiable Safety and Runtime Assurance

Safe UAV operation in shared airspace requires verifiable guarantees that trajectories satisfy collision avoidance, separation, and regulatory constraints. In many current systems, safety is enforced indirectly through conservative planning heuristics or large safety margins [10]. While such approaches reduce risk, they often degrade efficiency and fail to provide formal guarantees.

Recent work in control theory provides promising mechanisms for certifiable safety, including control barrier functions, reachability analysis, and invariant set methods [5,11]. However, these techniques remain insufficiently integrated with high-level mission planners and learning-based navigation systems.

Future research should therefore investigate compositional safety architectures that combine high-level planning, formally constrained local motion generation, and runtime assurance layers that detect violations and switch to certified fallback behaviors. Lightweight runtime verification and monitor synthesis will be particularly important for embedded deployment where latency constraints are strict. Cybersecurity should also be treated as a core requirement for certifiable safety in real-world UAV deployment. Threats such as GPS spoofing, signal jamming, and communication interference can corrupt localization, invalidate environment assumptions, and undermine formal safety guarantees even when the nominal planner is correct. Future runtime assurance layers should therefore incorporate integrity monitoring and attack-aware detection mechanisms that trigger certified fallback behaviors when sensing, localization, or communication reliability is compromised.

7.3. Multi-UAV Coordination and Airspace Compliance

As UAV deployments scale toward multi-vehicle operations, coordination becomes increasingly challenging. Many existing approaches assume centralized coordination or reliable communication channels [3,200]. In real deployments, communication bandwidth may be limited, latency may vary, and connectivity may be intermittent.

Future work should prioritize decentralized coordination strategies that operate under partial observability and unreliable communications. Mechanisms such as intent sharing, decentralized conflict resolution, and negotiation-based coordination can enable cooperative planning while maintaining scalability.

Another important direction involves integrating policy and regulatory constraints directly into planning algorithms. Rather than representing regulatory restrictions as static obstacles, planners should incorporate dynamic airspace rules such as geofencing updates, traffic corridors, and priority regulations associated with UAS traffic management systems [7]. This requires tight coupling between planning algorithms and compliance monitoring systems.

7.4. Resource-Aware Planning and Implementation

Real UAV platforms operate under strict resource limitations including limited compute, constrained memory, finite battery capacity, and communication bandwidth restrictions [10]. Algorithms that assume abundant computational resources often fail to scale to embedded systems used in operational UAV platforms.

Future research should therefore develop resource-aware planning frameworks that explicitly incorporate computational cost, energy consumption, and communication requirements into planning objectives. Anytime planning algorithms with bounded suboptimality guarantees can provide useful tradeoffs between planning quality and computational cost.

Hardware-aware planning implementations also represent an important research direction. Leveraging heterogeneous computing architectures, embedded GPUs, and specialized accelerators may enable complex algorithms to operate within practical energy and latency budgets.

7.5. Integrated Evaluation and Benchmarking

A persistent limitation in UAV planning research is the lack of standardized evaluation frameworks that capture real-world operational complexity. Many studies rely heavily on simplified simulation environments that fail to represent real disturbances, sensing errors, and regulatory constraints [3].

Future research should focus on integrated evaluation pipelines that combine simulation, software-in-the-loop, hardware-in-the-loop, and real-world testing. Benchmark scenarios should incorporate uncertainty sources such as sensor noise, wind disturbances, GPS denial, and evolving airspace constraints.

Evaluation metrics should also extend beyond geometric path optimality to include safety violations, risk exposure over time, energy consumption, and computational resource usage. Establishing such benchmarks will enable meaningful comparison of UAV planning algorithms based on deployability rather than purely algorithmic performance.

8. Conclusions

This survey examined UAV pathfinding from the perspective of real-world deployment. Although decades of research have produced strong algorithmic results under controlled assumptions, achieving reliable autonomy in complex, dynamic, and regulated environments remains challenging. As illustrated by the motivating real-world vignette introduced at the beginning of this paper, practical UAV operation requires planning systems that reason under uncertainty, maintain verifiable safety guarantees, coordinate with other airspace users, and operate within strict computational and energy constraints.

Across major planning paradigms, a recurring pattern emerges: many methods demonstrate strong performance when evaluated in isolation but encounter limitations when integrated into complete autonomy stacks. Classical graph-based planners can efficiently replan in changing environments but typically lack explicit uncertainty modeling. Sampling-based planners provide desirable asymptotic properties yet may struggle under strict latency constraints and complex operational restrictions. Optimization-based planners can incorporate rich objectives and constraints but are often sensitive to model mismatch and limited onboard computational resources. In many existing studies, risk is not quantitatively calibrated, safety guarantees are assumed rather than formally verified, and resource constraints receive limited attention during evaluation.

To organize research around deployable UAV autonomy, this paper introduced the Risk-Calibrated, Certifiably Safe, Resource-Aware (RCSR) framework. Instead of proposing a single algorithmic solution, the RCSR perspective emphasizes four complementary requirements: (i) calibrated risk reasoning under uncertainty, (ii) formal safety assurance supported by runtime verification mechanisms, (iii) scalable multi-UAV coordination with explicit regulatory compliance, and (iv) resource-aware algorithm design and evaluation. Viewing existing work through these dimensions clarifies both the progress achieved in the field and the remaining challenges that must be addressed for trustworthy real-world deployment.

Looking forward, meaningful progress will require integration across traditionally separate research areas. Planning systems must tightly couple perception reliability, uncertainty propagation, risk calibration, safety assurance mechanisms, coordination strategies, and resource constraints. In addition, evaluation methodologies must extend beyond simulation to include software-in-the-loop testing, hardware-in-the-loop experimentation, and real-world flight trials that capture operational complexity.

In summary, bridging the gap between theoretical path planning advances and operational UAV autonomy requires a shift toward integrated, certifiable, and resource-conscious navigation architectures. By framing UAV pathfinding research through the RCSR perspective, this survey provides a structured foundation for future work aimed at enabling safe, reliable, and scalable UAV operations in the environments they are ultimately designed to serve.

Author Contributions

Conceptualization, N.J. and S.S. (Sayani Sarkar); methodology, N.J., S.S. (Sima Shafaei), A.K., S.S. (Sayani Sarkar); formal analysis, N.J. and S.S. (Sima Shafaei); investigation, N.J., S.S. (Sima Shafaei), A.K., and S.S. (Sayani Sarkar); writing—original draft preparation, N.J., S.S. (Sima Shafaei), A.K., S.S. (Sayani Sarkar); writing—review and editing, N.J., S.S. (Sima Shafaei), A.K., and S.S. (Sayani Sarkar); visualization, S.S. (Sima Shafaei); supervision, S.S. (Sayani Sarkar); project administration, S.S. (Sayani Sarkar). All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Koenig, S.; Likhachev, M. D* Lite. In Proceedings of the AAAI Conference on Artificial Intelligence; AAAI Press: Washington, DC, USA, 2002. [Google Scholar]
Karaman, S.; Frazzoli, E. Sampling-Based Algorithms for Optimal Motion Planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
Meng, W.; Zhang, X.; Zhou, L.; Guo, H.; Hu, X. Advances in UAV path planning: A comprehensive review of methods, challenges, and future directions. Drones 2025, 9, 376. [Google Scholar] [CrossRef]
Ghambari, S.; Golabi, M.; Jourdan, L.; Lepagnot, J.; Idoumghar, L. UAV Path Planning Techniques: A Survey. RAIRO-Oper. Res. 2024, 58, 2951–2989. [Google Scholar] [CrossRef]
Tayal, M.; Singh, R.; Keshavan, J.; Kolathaya, S. Control Barrier Functions in Dynamic UAVs for Kinematic Obstacle Avoidance: A Collision Cone Approach. In Proceedings of the 2024 American Control Conference (ACC); IEEE: New York, NY, USA, 2024. [Google Scholar]
Colomina, I.; Molina, P. Unmanned Aerial Systems for Photogrammetry and Remote Sensing: A Review. ISPRS J. Photogramm. Remote Sens. 2014, 92, 79–97. [Google Scholar] [CrossRef]
Federal Aviation Administration. Urban Air Mobility Concept of Operations v1.0; Federal Aviation Administration: Washington, DC, USA, 2020. [Google Scholar]
Hart, P.E.; Nilsson, N.J.; Raphael, B. A Formal Basis for the Heuristic Determination of Minimum Cost Paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
Phillips, M.; Likhachev, M. SIPP: Safe Interval Path Planning for Dynamic Environments. In Proceedings of the International Conference on Automated Planning and Scheduling (ICAPS); AAAI Press: Washington, DC, USA, 2011. [Google Scholar]
Richter, C.; Bry, A.; Roy, N. Polynomial Trajectory Planning for Aggressive Quadrotor Flight in Dense Indoor Environments. Int. J. Robot. Res. 2016, 35, 573–590. [Google Scholar]
Sinhmar, H.; Greiff, M.; Cairano, S.D. Practical and Safe Navigation Function-Based Motion Planning of UAVs; MERL Technical Report TR2024-055; IEEE: New York, NY, USA, 2024. [Google Scholar]
Westheider, J.; Rückin, J.; Popović, M. Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning. In Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2023. [Google Scholar]
Jones, M.; Djahel, S.; Welsh, K. Path-Planning for Unmanned Aerial Vehicles with Environment Complexity Considerations: A Survey. ACM Comput. Surv. 2023, 55, 1–37. [Google Scholar] [CrossRef]
Debnath, D.; Vanegas, F.; Sandino, J.; Hawary, A.F. A Review of UAV Path-Planning Algorithms and Obstacle Avoidance Methods for Remote Sensing Applications. Remote Sens. 2024, 16, 1123. [Google Scholar] [CrossRef]
Gugan, G.; Haque, A. Path Planning for Autonomous Drones: Challenges and Future Directions. Drones 2023, 7, 356. [Google Scholar] [CrossRef]
Luo, J.; Tian, Y.; Wang, Z. Research on Unmanned Aerial Vehicle Path Planning. Drones 2024, 8, 51. [Google Scholar] [CrossRef]
Puente-Castro, A.; Rivero, D.; Pazos, A.; Munteanu, C.R. A Review of Artificial Intelligence Applied to Path Planning in UAV Swarms. Neural Comput. Appl. 2022, 34, 153–170. [Google Scholar] [CrossRef]
Davidović, U.N.; Urošević, D. Unmanned Aerial Vehicles (UAV) Path Planning Techniques and Constraints in Urban Airspace Integration: Literature Review. Math. Inst. Serbian Acad. Sci. Arts 2023, 75, 101–122. [Google Scholar]
Primatesta, S.; Guglieri, G.; Rizzo, A. A Risk-Aware Path Planning Strategy for UAVs in Urban Environments. J. Intell. Robot. Syst. 2019, 95, 629–643. [Google Scholar] [CrossRef]
Tang, H.; Zhu, Q.; Qin, B.; Song, R.; Li, Z. UAV Path Planning Based on Third-Party Risk Modeling. Sci. Rep. 2023, 13, 22259. [Google Scholar] [CrossRef] [PubMed]
Zhou, K.; Wang, K.; Wang, Y.; Wang, Y.; Qu, X. A Risk-Based Unmanned Aerial Vehicle Path Planning Scheme for Complex Air–Ground Environments. Risk Anal. 2024, 44, 1–20. [Google Scholar] [CrossRef]
Ames, A.D.; Coogan, S.; Egerstedt, M.; Notomista, G.; Sreenath, K.; Tabuada, P. Control Barrier Functions: Theory and Applications. In Proceedings of the 18th European Control Conference, Naples, Italy, 25–28 June 2019; pp. 3420–3431. [Google Scholar]
Hobbs, K.L.; Mote, M.L.; Abate, A.; Coogan, S.D.; Feron, E.M. Runtime Assurance for Safety-Critical Systems: An Introduction to Safety Filtering Approaches. IEEE Control Syst. Mag. 2023, 43, 28–65. [Google Scholar] [CrossRef]
Sciancalepore, S.; Davidovic, F.; Oligeri, G. ORION: Verification of Drone Trajectories via Remote Identification Messages. Future Gener. Comput. Syst. 2024, 157, 177–192. [Google Scholar] [CrossRef]
Dijkstra, E.W. A Note on Two Problems in Connexion with Graphs. Numer. Math. 1959, 1, 269–271. [Google Scholar] [CrossRef]
Koenig, S.; Likhachev, M.; Furcy, D. Lifelong Planning A*. Artif. Intell. 2004, 155, 93–146. [Google Scholar] [CrossRef]
Likhachev, M.; Gordon, G.; Thrun, S. ARA*: Anytime A* with Provable Bounds on Sub-Optimality. In Proceedings of the 17th Conference on Neural Information Processing Systems (NIPS), Vancouver, BC, Canada, 13–18 December 2004. [Google Scholar]
Likhachev, M.; Ferguson, D.; Gordon, G.; Stentz, A.; Thrun, S. Anytime Dynamic A*: An Anytime, Replanning Algorithm. In Proceedings of the International Conference on Automated Planning and Scheduling (ICAPS), Monterey, CA, USA, 5–10 June 2005. [Google Scholar]
Narayanan, V.; Phillips, M.; Likhachev, M. Anytime Safe Interval Path Planning for Dynamic Environments. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2012. [Google Scholar]
Thomas, D.W.; Ruml, W.; Shimony, S.E. Real-Time Safe Interval Path Planning. Proc. Int. Symp. Comb. Search 2024, 17, 161–169. [Google Scholar] [CrossRef]
LaValle, S.M.; Kuffner, J.J. Randomized Kinodynamic Planning. Int. J. Robot. Res. 2001, 20, 378–400. [Google Scholar] [CrossRef]
Gammell, J.D.; Srinivasa, S.S.; Barfoot, T.D. Informed RRT*: Optimal Sampling-Based Path Planning Focused via Direct Sampling of an Admissible Ellipsoidal Heuristic. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2014. [Google Scholar]
Gammell, J.D.; Srinivasa, S.S.; Barfoot, T.D. Batch Informed Trees (BIT*): Sampling-Based Optimal Planning via the Heuristically Guided Search of Implicit Random Geometric Graphs. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2015. [Google Scholar]
Gammell, J.D.; Barfoot, T.D.; Srinivasa, S.S. Batch Informed Trees (BIT*): Informed Asymptotically Optimal Anytime Search. Int. J. Robot. Res. 2020, 39, 543–567. [Google Scholar] [CrossRef]
Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction, 2nd ed.; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
Kaelbling, L.P.; Littman, M.L.; Moore, A.W. Reinforcement Learning: A Survey. J. Artif. Intell. Res. 1996, 4, 237–285. [Google Scholar] [CrossRef]
Hong, D.; Lee, S.; Cho, Y.H.; Baek, D.; Kim, J. Energy-Efficient Online Path Planning of Multiple Drones Using Reinforcement Learning. IEEE Trans. Intell. Transp. Syst. 2021, 22, 5482–5494. [Google Scholar] [CrossRef]
Dhuheir, M.; Baccour, E.; Erbad, A.; Al-Obaidi, S.S.; Hamdi, M. Deep Reinforcement Learning for Trajectory Path Planning and Distributed Inference in Resource-Constrained UAV Swarms. arXiv 2022, arXiv:2212.11201. [Google Scholar] [CrossRef]
Cheng, Z.; Wang, L.; Liu, Y. Deep Reinforcement Learning for Autonomous UAV Navigation in Complex Environments. Sensors 2025, 25, 1184. [Google Scholar]
Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous Control with Deep Reinforcement Learning. In Proceedings of the International Conference on Learning Representations (ICLR), San Juan, Puerto Rico, 2–4 May 2016. [Google Scholar]
Fujimoto, S.; van Hoof, H.; Meger, D. Addressing Function Approximation Error in Actor–Critic Methods. In Proceedings of the International Conference on Machine Learning (ICML), Stockholm, Sweden, 10–15 July 2018. [Google Scholar]
Haarnoja, T.; Zhou, A.; Abbeel, P.; Levine, S. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. In Proceedings of the International Conference on Machine Learning (ICML), Stockholm, Sweden, 10–15 July 2018. [Google Scholar]
Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal Policy Optimization Algorithms. arXiv 2017, arXiv:1707.06347. [Google Scholar] [CrossRef]
AlMahamid, F.; Grolinger, K. Agile DQN: Adaptive Deep Recurrent Attention Reinforcement Learning for Autonomous UAV Obstacle Avoidance. Sci. Rep. 2025, 15, 18043. [Google Scholar] [CrossRef]
Bayerlein, H.; Theile, M.; Caccamo, M.; Gesbert, D. Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning. IEEE Open J. Commun. Soc. 2021, 2, 1171–1187. [Google Scholar] [CrossRef]
Puente-Castro, A.; Rivero, D.; Pazos, A.; Fernández-Blanco, E. UAV Swarm Path Planning with Reinforcement Learning for Field Prospecting. Appl. Intell. 2022, 52, 14101–14118. [Google Scholar] [CrossRef]
Lowe, R.; Wu, Y.; Tamar, A.; Harb, J.; Abbeel, P.; Mordatch, I. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
Qie, H.; Shi, D.; Shen, T.; Xu, X.; Li, Y.; Wang, L. Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning. IEEE Access 2019, 7, 146264–146272. [Google Scholar] [CrossRef]
Airlangga, G.; Sukwadi, R.; Basuki, W.W.; Sugianto, L.F.; Nugroho, O.I.A.; Kristian, Y.; Rahmananta, R. Adaptive Path Planning for Multi-UAV Systems in Dynamic 3D Environments: A Multi-Objective Framework. Designs 2024, 8, 136. [Google Scholar] [CrossRef]
Ren, Z.; Rathinam, S.; Choset, H. Multi-Objective Conflict-Based Search for Multi-Agent Path Finding. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2021. [Google Scholar]
van den Berg, J.; Guy, S.J.; Lin, M.C.; Manocha, D. Reciprocal n-Body Collision Avoidance. In Robotics Research; Springer Tracts in Advanced Robotics; Springer: Berlin/Heidelberg, Germany, 2011; Volume 70, pp. 3–19. [Google Scholar]
Rahman, M.; Sarkar, N.I.; Lutui, R. A Survey on Multi-UAV Path Planning: Classification, Algorithms, Open Research Problems, and Future Directions. Drones 2025, 9, 263. [Google Scholar] [CrossRef]
Achiam, J.; Held, D.; Tamar, A.; Abbeel, P. Constrained Policy Optimization. In Proceedings of the 34th International Conference on Machine Learning (ICML); IEEE: New York, NY, USA, 2017. [Google Scholar]
Ray, A.; Achiam, J.; Amodei, D. Benchmarking Safe Exploration in Deep Reinforcement Learning. arXiv 2019, arXiv:1910.01708. [Google Scholar]
Alshiekh, M.; Bloem, R.; Ehlers, R.; Könighofer, B.; Niekum, S.; Topcu, U. Safe Reinforcement Learning via Shielding. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI), New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
Kim, M.; Bae, G.; Lee, J.; Shin, W.; Kim, C.; Choi, M.-Y.; Shin, H.; Oh, H. RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation. In Proceedings of the Robotics: Science and Systems (RSS), Los Angeles, CA, USA, 21–25 June 2025. [Google Scholar]
RTCA. DO-178C: Software Considerations in Airborne Systems and Equipment Certification. Available online: https://my.rtca.org/productdetails?id=a1B36000001IcmqEAC (accessed on 15 February 2026).
Skarka, W.; Ashfaq, R. Hybrid Machine Learning and Reinforcement Learning Framework for Adaptive UAV Obstacle Avoidance. Aerospace 2024, 11, 870. [Google Scholar] [CrossRef]
Fan, X.; Li, H.; Chen, Y.; Dong, D. A Path-Planning Method for UAV Swarm under Multiple Environmental Threats. Drones 2024, 8, 171. [Google Scholar] [CrossRef]
Tao, B.; Kim, J.H. Deep Reinforcement Learning-Based Local Path Planning in Dynamic Environments for Mobile Robot. J. King Saud. Univ.-Comput. Inf. Sci. 2024, 36, 102254. [Google Scholar] [CrossRef]
The MathWorks, Inc. MATLAB and Simulink. Available online: https://www.mathworks.com (accessed on 24 April 2026).
Liu, M.; Liu, J.; Lan, Q.; Lu, Z.; Zhang, W.; Zou, G. MATLAB Simulation of UAV 3D Path Planning Research Based on ACO, A*, and RRT Algorithms. In 2024 3rd International Symposium on Semiconductor and Electronic Technology (ISSET); IEEE: New York, NY, USA, 2024; pp. 634–637. [Google Scholar]
Wang, Q.; Xu, M.; Hu, Z. Path Planning of UAVs Based on Improved Tuna Swarm Optimization. Biomimetics 2024, 9, 388. [Google Scholar] [CrossRef]
Zhang, W.; Sun, Y.; Gao, Y.; Guo, C.; Miao, R. UAV 3D Path Planning Based on Integrated PSO and Artificial Potential Field. Int. J. Comput. Intell. Syst. 2025, 18, 1–36. [Google Scholar] [CrossRef]
Zhou, G.; Lü, S.; Mao, L.; Xu, K.; Bao, T.; Bao, X. Path Planning of UAV Using Lévy Pelican Optimization Algorithm in Mountain Environment. Appl. Artif. Intell. 2024, 38, 2368343. [Google Scholar] [CrossRef]
Ergezer, H.; Leblebicioglu, K. Path Planning for UAVs for Maximum Information Collection. IEEE Trans. Aerosp. Electron. Syst. 2013, 49, 502–520. [Google Scholar] [CrossRef]
Ergezer, H.; Leblebicioğlu, K. 3D Path Planning for Multiple UAVs for Maximum Information Collection. J. Intell. Robot. Syst. 2014, 73, 737–762. [Google Scholar] [CrossRef]
Open Source Robotics Foundation. Gazebo. Available online: https://gazebosim.org (accessed on 24 April 2026).
PX4 Autopilot Community. PX4 Autopilot. Available online: https://px4.io (accessed on 24 April 2026).
ArduPilot Development Team. ArduPilot. Available online: https://ardupilot.org (accessed on 24 April 2026).
Jayaweera, H.M.; Hanoun, S. UAV Path Planning for Reconnaissance and Look-Ahead Coverage Support for Mobile Ground Vehicles. Sensors 2021, 21, 4595. [Google Scholar] [CrossRef]
Cella, M.; d’Apolito, F.; Fanta-Jende, P.; Sulzbachner, C. Fueling Glocal: Optimization-Based Path Planning for Indoor UAVs in an Autonomous Exploration Framework. ISPRS Arch. 2023, 48, 85–91. [Google Scholar] [CrossRef]
Janesh, A. On the Performance of UWB Technology on Indoor UAV Localization. Master’s Theses, Aalto University, Espoo, Finland, 2023. [Google Scholar]
Luna, M.A.; Ale Isaac, M.S.; Ragab, A.R.; Campoy, P.; Flores Peña, P.; Molina, M. Fast Multi-UAV Path Planning for Optimal Area Coverage in Aerial Sensing Applications. Sensors 2022, 22, 2297. [Google Scholar] [CrossRef]
Microsoft Corporation. AirSim. Available online: https://github.com/microsoft/AirSim (accessed on 24 April 2026).
Tu, G.T.; Juang, J.G. UAV Path Planning and Obstacle Avoidance Based on Reinforcement Learning in 3D Environments. Actuators 2023, 12, 57. [Google Scholar] [CrossRef]
Chao, Y.; Augenstein, P.; Roennau, A.; Dillmann, R.; Xiong, Z. Brain Inspired Path Planning Algorithms for Drones. Front. Neurorobotics 2023, 17, 1111861. [Google Scholar] [CrossRef]
Puente-Castro, A.; Rivero, D.; Pedrosa, E.; Pereira, A.; Lau, N.; Fernandez-Blanco, E. Q-Learning Based System for Path Planning with UAV Swarms in Obstacle Environments. Expert Syst. Appl. 2024, 235, 121240. [Google Scholar] [CrossRef]
Unity Technologies. Unity/Unity3D. Available online: https://unity.com (accessed on 24 April 2026).
Robotics and Perception Group. University of Zurich. Flightmare. Available online: https://uzh-rpg.github.io/flightmare/ (accessed on 24 April 2026).
LearnSys Lab. Gym-Pybullet-Drones. Available online: https://github.com/learnsyslab/gym-pybullet-drones (accessed on 24 April 2026).
Pyke, L.; Stark, C. Dynamic Pathfinding for Swarm-Intelligence UAV Control Using Particle Swarm Optimisation. Front. Appl. Math. Stat. 2021, 7, 744955. [Google Scholar] [CrossRef]
Tian, Z.-T.; Ding, Y.; Song, J.-M.; Zhao, L.-J.; Zhang, Y.-T. 3D path planning of UAV based on improved A* algorithm. In ITM Web of Conferences; EDP Sciences: London, UK, 2017; Volume 12, p. 01015. [Google Scholar]
Bacha, A.M.; Zamoum, R.B.; Lachekhab, F. Machine Learning Paradigms for UAV Path Planning: Review and Challenges. J. Robot. Control 2025, 6, 215–233. [Google Scholar] [CrossRef]
Song, Y.; Naji, S.; Kaufmann, E.; Loquercio, A.; Scaramuzza, D. Flightmare: A flexible quadrotor simulator. In Proceedings of the Conference on Robot Learning, London, UK, 8–11 November 2021; pp. 1147–1157. [Google Scholar]
Chan, J.H.; Liu, K.; Chen, Y.; Sagar, A.S.S.; Kim, Y.G. Reinforcement Learning-Based Drone Simulators: Survey, Practice, and Challenge. Artif. Intell. Rev. 2024, 57, 281. [Google Scholar] [CrossRef]
Zhu, Y. Motion Planning Framework for Unmanned Aerial Vehicles in Dynamic Environments. Master’s Thesis, KTH Royal Institute of Technology, Stockholm, Sweden, 2021. Available online: https://www.diva-portal.org/smash/record.jsf?pid=diva2:1633602 (accessed on 1 March 2026).
Panerati, J.; Zheng, H.; Zhou, S.; Xu, J.; Prorok, A.; Schoellig, A.P. Learning to fly: A gym environment with PyBullet physics for multi-agent quadcopter control. In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2021; pp. 7512–7519. [Google Scholar]
Ikli, S.; Quénel, I. Autonomous Drone Takeoff and Navigation Using Reinforcement Learning. In Proceedings of the 16th International Conference on Agents and Artificial Intelligence, Rome, Italy, 24–26 February 2024; pp. 63–71. [Google Scholar]
PX4 Development Team. jMAVSim. Available online: https://docs.px4.io/main/en/simulation/jmavsim.html (accessed on 24 April 2026).
Paparazzi UAV Development Team. Paparazzi NPS. Available online: https://wiki.paparazziuav.org/wiki/Simulation (accessed on 24 April 2026).
Xie, D.; Hu, R.; Wang, C.; Zhu, C.; Xu, H.; Li, Q. A Simulation Framework of UAV Route Planning Design and Validation for Landslide Monitoring. Remote Sens. 2023, 15, 5758. [Google Scholar] [CrossRef]
MathWorks. Onboard Computer Path Planning with PX4 Autopilots; MathWorks: Torrance, CA, USA, 2023. [Google Scholar]
Paparazzi UAV Wiki. NPS. 2017. Available online: https://wiki.paparazziuav.org/wiki/NPS (accessed on 1 February 2026).
Burns, J.H.; Liang, X.; Liu, Y.D. Adaptive variables for declarative UAV planning. In COP ’20: Proceedings of the 12th ACM International Workshop on Context-Oriented Programming and Advanced Modularity; ACM: New York, NY, USA, 2020; pp. 1–7. [Google Scholar]
Czerniejewski, A.; Cosgrove, S.; Yan, Y.; Dantu, K.; Ko, S.Y.; Ziarek, L. JUAV: A Java-based system for unmanned aerial vehicles. In Proceedings of the 14th International Workshop on Java Technologies for Real-Time and Embedded Systems; ACM: New York, NY, USA, 2016; pp. 1–10. [Google Scholar]
Schmittle, M.; Lukina, A.; Vacek, L.; Das, J.; Buskirk, C.; Rees, S.; Sztipanovits, J.; Grosu, R. OpenUAV: A UAV testbed for the CPS and robotics community. In 2018 ACM/IEEE 9th International Conference on Cyber-Physical Systems (ICCPS); IEEE: New York, NY, USA, 2018; pp. 130–139. [Google Scholar]
OpenUAV Project. OpenUAV TurboVNC. Available online: https://github.com/Open-UAV/openuav-turbovnc (accessed on 24 April 2026).
Thompson, K.; Kurfess, F.; Walter, D.; Maksymiuk, R.; Mevorach, R.; Joshi, G. UavSim: An open-source simulator for multiple UAV path planning. In 2022 18th International Conference on Distributed Computing in Sensor Systems (DCOSS); IEEE: New York, NY, USA, 2022; pp. 229–236. [Google Scholar]
Ma, W.; Li, W.; Jin, B.; Lu, C.; Wang, X. SkyRover: A Modular Simulator for Cross-Domain Pathfinding. arXiv 2025, arXiv:2502.08969. [Google Scholar]
SkyRover Project. SkyRover. Available online: https://sites.google.com/view/mapf3d (accessed on 24 April 2026).
Just, G.E.; Pellenz, M.E.; Lima, L.A.P., Jr.; Chang, B.S.; Souza, R.D.; Montejo-Sánchez, S. UAV Path Optimization for Precision Agriculture Wireless Sensor Networks. Sensors 2020, 20, 6098. [Google Scholar] [CrossRef]
Wang, X.; Yang, D.; Wang, Z.; Kwan, H.; Chen, J.; Wu, W.; Li, H.; Liao, Y.; Liu, S. Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology. arXiv 2024, arXiv:2410.07087. [Google Scholar] [CrossRef]
Coppelia Robotics AG. CoppeliaSim. Available online: https://www.coppeliarobotics.com (accessed on 24 April 2026).
Cabreira, T.M.; Brisolara, L.B.; Ferreira, P.R., Jr. Survey on Coverage Path Planning with Unmanned Aerial Vehicles. Drones 2019, 3, 4. [Google Scholar] [CrossRef]
Nascimento, L.B.; Santos, V.G.; Pereira, D.S.; Alsina, P.J. 3D Path Planning Based on Probabilistic Foam for UAV Indoor Applications. In Proceedings of the Simpósio Brasileiro de Automação Inteligente, Campinas, Brazil, 17–20 October 2021; Volume 1. [Google Scholar] [CrossRef]
LAAS-CNRS. MORSE: Modular Open Robots Simulation Engine. Available online: https://morse-simulator.github.io/ (accessed on 24 April 2026).
Jerath, K.; Langelaan, J.W. Simulation framework for UAS conceptual design. In AIAA Modeling and Simulation Technologies Conference; AIAA: Reston, VA, USA, 2016; p. 1186. [Google Scholar]
Vivaldini, K.C.; Martinelli, T.H.; Guizilini, V.C.; Souza, J.R.; Oliveira, M.D.; Ramos, F.T.; Wolf, D.F. UAV Route Planning for Active Disease Classification. Auton. Robot. 2019, 43, 1137–1153. [Google Scholar] [CrossRef]
Wzorek, M.; Berger, C.; Doherty, P. A framework for safe navigation of UAVs in unknown environments. In 2017 25th International Conference on Systems Engineering (ICSEng); IEEE: New York, NY, USA, 2017; pp. 11–20. [Google Scholar]
Amaral, G.; Silva, H.; Lopes, F.; Ribeiro, J.P.; Freitas, S.; Almeida, C.; Martins, A.; Almeida, J.; Silva, E. UAV Cooperative Perception for Target Detection and Tracking in Maritime Environment. In Proceedings of the OCEANS 2017—Aberdeen, New York, NY, USA, 19–22 June 2017; pp. 1–6. [Google Scholar]
Call, B. Random City Generator Technical Report; Brandon Call: Torrance, CA, USA, 2006. [Google Scholar]
Mystkowski, A. Implementation and Investigation of a Robust Control Algorithm for an Unmanned Micro-Aerial Vehicle. Robot. Auton. Syst. 2014, 62, 1187–1196. [Google Scholar] [CrossRef]
Han, B.; Qu, T.; Tong, X.; Jiang, J.; Zlatanova, S.; Wang, H.; Cheng, C. Grid-Optimized UAV Indoor Path Planning Algorithms in a Complex Environment. Int. J. Appl. Earth Obs. Geoinf. 2022, 111, 102857. [Google Scholar] [CrossRef]
Ho, F.; Geraldes, R.; Gonçalves, A.; Rigault, B.; Sportich, B.; Kubo, D.; Cavazza, M.; Prendinger, H. Decentralized Multi-Agent Path Finding for UAV Traffic Management. IEEE Trans. Intell. Transp. Syst. 2020, 23, 997–1008. [Google Scholar] [CrossRef]
Kilic, K.I.; Mostarda, L. Heuristic Drone Pathfinding over Optimized Charging Station Grid. IEEE Access 2021, 9, 164070–164089. [Google Scholar] [CrossRef]
Wu, Q.; Liu, K.; Chen, L.; Lü, J. Multi-Agent Reinforcement Learning-Based UAV Pathfinding in Stochastic Environments. arXiv 2023, arXiv:2310.16659. [Google Scholar]
Zhao, G.; Wang, Y.; Mu, T.; Meng, Z.; Wang, Z. Reinforcement-Learning-Assisted Multi-UAV Task Allocation and Path Planning for IIoT. IEEE Internet Things J. 2024, 11, 26766–26777. [Google Scholar] [CrossRef]
OpenStreetMap Contributors. OpenStreetMap Database. 2025. Available online: https://www.openstreetmap.org/ (accessed on 12 December 2025).
Hohmann, N.; Bujny, M.; Adamy, J.; Olhofer, M. Multi-objective 3D path planning for UAVs in large-scale urban scenarios. In 2022 IEEE Congress on Evolutionary Computation (CEC); IEEE: New York, NY, USA, 2022; pp. 1–8. [Google Scholar]
Jing, W.; Deng, D.; Wu, Y.; Shimada, K. Multi-UAV coverage path planning for inspection of large and complex structures. In 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2020; pp. 1480–1486. [Google Scholar]
Muñoz-Bañón, M.A.; Velasco-Sánchez, E.; Candelas, F.A.; Torres, F. OpenStreetMap-Based Autonomous Navigation with Lidar Naive-Valley-Path Obstacle Avoidance. IEEE Trans. Intell. Transp. Syst. 2022, 23, 24428–24438. [Google Scholar] [CrossRef]
Luo, X.; Zhang, T.; Xu, W.; Fang, C.; Lu, T.; Zhou, J. Multi-Tier 3D Trajectory Planning for Cellular-Connected UAVs in Complex Urban Environments. Symmetry 2023, 15, 1628. [Google Scholar] [CrossRef]
Lin, X.; Wang, C.; Wang, K.; Li, M.; Yu, X. Trajectory Planning for UAVs in Complicated Urban Environments: A Control Network Approach. Transp. Res. Part C Emerg. Technol. 2021, 128, 103120. [Google Scholar] [CrossRef]
Grøntved, K.A.R.; Jarabo-Peñas, A.; Reid, S.; Roll, E.G.A.; Watson, M.; Richards, A.; Bullock, S.; Christensen, A.L. SAREnv: An Open-Source Dataset and Benchmark Tool for Informed Wilderness Search and Rescue Using UAVs. Drones 2025, 9, 628. [Google Scholar] [CrossRef]
Liu, C.; Szirányi, T. UAV Path Planning Based on Road Extraction. In Proceedings of the 2nd International Conference on Image Processing and Vision Engineering, Setúbal, Portugal, 22–24 April 2022; pp. 202–210. [Google Scholar]
Moore, C.; Mitra, S.; Pillai, N.; Moore, M.; Mittal, S.; Bethel, C.; Chen, J. URA*: Uncertainty-Aware Path Planning Using Image-Based Aerial-to-Ground Traversability Estimation. arXiv 2023, arXiv:2309.08814. [Google Scholar]
Ren, Z.; Rathinam, S.; Likhachev, M.; Choset, H. Multi-Objective Path-Based D* Lite. IEEE Robot. Autom. Lett. 2022, 7, 3318–3325. [Google Scholar] [CrossRef]
Liu, C.; Sziranyi, T. Road Condition Detection and Emergency Rescue Using UAV in Wilderness Environments. Remote Sens. 2022, 14, 4355. [Google Scholar] [CrossRef]
Barmpounakis, E.; Geroliminis, N. On the New Era of Urban Traffic Monitoring with Massive Drone Data: The pNEUMA Large-Scale Field Experiment. Transp. Res. Part C Emerg. Technol. 2020, 111, 50–71. [Google Scholar] [CrossRef]
Li, A.; Xu, Z.; Pan, Y.; Gao, B.; Zhang, J.; Chen, Y.; Li, Y. Cell-Trans: A Traffic Prediction Method for Motion Planning of Autonomous Vehicles at Signalized Intersections. J. Transp. Eng. Part A Syst. 2025, 151, 04025102. [Google Scholar] [CrossRef]
Xu, Y.; Wang, Y.; Peeta, S. Leveraging Transformer Model to Predict Vehicle Trajectories in Congested Urban Traffic. Transp. Res. Rec. 2023, 2677, 898–909. [Google Scholar] [CrossRef]
Mahajan, V.; Barmpounakis, E.; Alam, M.R.; Geroliminis, N. Treating Noise and Anomalies in Vehicle Trajectories from a Swarm-of-Drones Experiment. IEEE Trans. Intell. Transp. Syst. 2023, 24, 9055–9067. [Google Scholar] [CrossRef]
Kim, S.; Anagnostopoulos, G.; Barmpounakis, E.; Geroliminis, N. Visual Extensions and Anomaly Detection in the pNEUMA Experiment with a Swarm of Drones. Transp. Res. Part C Emerg. Technol. 2023, 147, 103966. [Google Scholar] [CrossRef]
Zhu, P.; Wen, L.; Du, D.; Bian, X.; Ling, H.; Hu, Q.; Nie, Q.; Cheng, H.; Liu, C.; Liu, X.; et al. VisDrone-DET2018: The Vision Meets Drone Object Detection Challenge. In Proceedings of the European Conference on Computer Vision Workshops; Springer: Cham, Switzerland, 2018; pp. 294–311. [Google Scholar]
Du, D.; Qi, Y.; Yu, H.; Yang, Y.; Duan, K.; Li, G.; Zhang, W.; Huang, Q.; Tian, Q. The UAVDT Benchmark: Object Detection and Tracking. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 370–386. [Google Scholar]
Zwick, M.; Gerdts, M.; Stütz, P. Sensor-Model-Based Trajectory Optimization for UAVs: An Optimal Control Approach. Sensors 2023, 23, 664. [Google Scholar] [CrossRef]
Rao, J.; Xiang, C.; Xi, J.; Chen, J.; Lei, J.; Giernacki, W.; Liu, M. Path Planning for Dual UAVs Cooperative Suspension Transport Based on Artificial Potential Field–A*. Knowl.-Based Syst. 2023, 277, 110797. [Google Scholar] [CrossRef]
Almujally, N.A.; Wu, T.; Alhasson, H.F.; Hanzla, M.; Jalal, A.; Liu, H. UAV-Based Intelligent Traffic Surveillance Using Recurrent Neural Networks and Swin Transformer for Dynamic Environments. Front. Neurorobot. 2025, 19, 1681341. [Google Scholar]
Yonetani, R.; Taniai, T.; Barekatain, M.; Nishimura, M.; Kanezaki, A. Path planning using neural A* search. In Proceedings of the International Conference on Machine Learning, Virtual, 18–24 July 2021; pp. 12029–12039. [Google Scholar]
Li, D.; Lin, Z.; Hu, J. Training-Free Pedestrian Trajectory Prediction via Segmentation-Guided Path Planning. Expert Syst. Appl. 2025, 298, 129770. [Google Scholar] [CrossRef]
Guan, L.; Li, B. Obstacle-Free Robot Path Planning Based on Variational Autoencoder and Generative Networks. Int. J. Inf. Commun. Technol. 2025, 26, 17–31. [Google Scholar] [CrossRef]
Geiger, A.; Lenz, P.; Urtasun, R. The KITTI Vision Benchmark Suite. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 3354–3361. [Google Scholar]
Sombekke, N. Monocular Depth Estimation for Lightweight Real-Time Obstacle Avoidance. 2022. Available online: https://staff.fnwi.uva.nl/a.visser/education/bachelorAI/Bachelor_Thesis_Niels_Sombekke_Final.pdf (accessed on 1 February 2026).
Dadrass, A. Dynamic Object Detection Using Moving Cameras for UAV Collision Prevention. Ph.D. Thesis, University of Victoria, Victoria, BC, Canada, 2024. [Google Scholar]
Bai, Y.; Miao, Z.; Wang, X.; Liu, Y.; Wang, H.; Wang, Y. Vdbblox: Accurate and efficient distance fields for path planning and mesh reconstruction. In 2023 IEEE/RSJ International Conference on Intelligent. Robots and Systems (IROS); IEEE: New York, NY, USA, 2023; pp. 7187–7194. [Google Scholar]
Burri, M.; Nikolic, J.; Gohl, P.; Schneider, T.; Rehder, J.; Omari, S.; Achtelik, M.; Siegwart, R. The EuRoC Micro Aerial Vehicle Datasets. Int. J. Robot. Res. 2016, 35, 1157–1163. [Google Scholar] [CrossRef]
Campos, C.; Elvira, R.; Rodríguez, J.J.G.; Montiel, J.M.; Tardós, J.D. ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual–Inertial, and Multimap SLAM. IEEE Trans. Robot. 2021, 37, 1874–1890. [Google Scholar] [CrossRef]
Majdik, A.L.; Till, C.; Scaramuzza, D. The Zurich Urban Micro Aerial Vehicle Dataset. Int. J. Robot. Res. 2017, 36, 269–273. [Google Scholar] [CrossRef]
Pfrommer, B.; Sanket, N.; Daniilidis, K.; Clevel, J. PennCOSYVIO: A challenging visual inertial odometry benchmark. In 2017 IEEE International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2017; pp. 3847–3854. [Google Scholar]
Shah, S.; Dey, D.; Lovett, C.; Kapoor, A. AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles. In Field and Service Robotics: Results of the 11th International Conference; Springer: Cham, Switzerland, 2018. [Google Scholar]
Lee, J.; Miyanishi, T.; Kurita, S.; Sakamoto, K.; Azuma, D.; Matsuo, Y.; Inoue, N. CityNav: Language-Goal Aerial Navigation Dataset Using Geographic Information. Preprint. Available online: https://openreview.net/forum?id=LjvIJFCa5J (accessed on 1 January 2026).
Delmerico, J.; Cieslewski, T.; Rebecq, H.; Faessler, M.; Scaramuzza, D. Are we ready for autonomous drone racing? The UZH-FPV dataset. In 2019 International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2019; pp. 6713–6719. [Google Scholar]
Antonini, A.; Guerra, W.L.; McGill, S.; Sayre-McCord, T.; Karaman, S. The Blackbird Dataset: A Large-Scale Dataset for UAV Perception in Aggressive Flight. Int. J. Robot. Res. 2020, 39, 1230–1255. [Google Scholar]
Perera, A.; Chamikara, M.; Chahl, J. UAV-GESTURE: A Dataset for UAV Control and Gesture Recognition. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops; Springer: Cham, Switzerland, 2018. [Google Scholar]
Imperial College London. Indoor Navigation UAV Dataset. Available online: https://www.imperial.ac.uk/a-z-research/intelligent-digital-systems/indoor-uav-data/ (accessed on 15 January 2026).
Chen, X.; Hopkins, B.; Wang, H.; O’Neill, L.; Afghah, F.; Razi, A.; Fulé, P.; Coen, J.; Rowell, E.; Watts, A. Wildland Fire Detection and Monitoring Using a Drone-Collected RGB/IR Image Dataset. IEEE Access 2022, 10, 121301–121317. [Google Scholar] [CrossRef]
Rahnemoonfar, M.; Chowdhury, T.; Murphy, R. RescueNet: A High-Resolution UAV Dataset for Search and Rescue. Sci. Data 2023, 10, 18. [Google Scholar]
Mowla, M.N.; Asadi, D.; Tekeoglu, K.N.; Masum, S.; Rabie, K. UAVs-FFDB: A High-Resolution Dataset for Advancing Forest Fire Detection and Monitoring Using Unmanned Aerial Vehicles (UAVs). Data Brief 2024, 55, 110706. [Google Scholar] [CrossRef]
Liu, Y.; Fu, Y.; Qin, M.; Xu, Y.; Xu, B.; Chen, F.; Goossens, B.; Sun, P.Z.; Yu, H.; Liu, C.; et al. BotanicGarden: A High-Quality Dataset for Robot Navigation in Unstructured Natural Environments. IEEE Robot. Autom. Lett. 2023, 9, 2798–2805. [Google Scholar] [CrossRef]
DroneDeploy. DroneDeploy: Unified Reality Capture Platform. Available online: https://www.dronedeploy.com/ (accessed on 27 November 2025).
Lin, F.; Wei, C.; Grech, R.; Ji, Z. VO-safe reinforcement learning for drone navigation. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2024; pp. 279–285. [Google Scholar]
Joshi, B.; Kapur, D.; Kandath, H. Sim-to-real deep reinforcement learning based obstacle avoidance for UAVs under measurement uncertainty. In Proceedings of the International Conference on Automation, Robotics and Applications (ICARA); IEEE: New York, NY, USA, 2024; pp. 278–284. [Google Scholar]
Luders, B.; Kothari, M.; How, J. Chance Constrained RRT for Probabilistic Robustness to Environmental Uncertainty. In AIAA Guidance, Navigation, and Control Conference; AIAA: Reston, VA, USA, 2010; p. 8160. [Google Scholar]
Shetty, A.; Gao, G.X. Predicting State Uncertainty for GNSS-Based UAV Path Planning Using Stochastic Reachability. In Proceedings of the 32nd International Technical Meeting of the Satellite Division of The Institute of Navigation (ION GNSS+ 2019); The Institute of Navigation: Manassas, VA, USA, 2019; pp. 131–139. [Google Scholar]
Liu, T.; Zhang, F.; Gao, F.; Pan, J. Tight Collision Probability for UAV Motion Planning in Uncertain Environment. In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2023; pp. 1055–1062. [Google Scholar]
Yao, P.; Wang, H.; Su, Z. Real-Time Path Planning of Unmanned Aerial Vehicle for Target Tracking and Obstacle Avoidance in Complex Dynamic Environment. Aerosp. Sci. Technol. 2015, 47, 269–279. [Google Scholar] [CrossRef]
Indelman, V.; Carlone, L.; Dellaert, F. Planning in the Continuous Domain: A Generalized Belief Space Approach for Autonomous Navigation in Unknown Environments. Int. J. Robot. Res. 2015, 34, 849–882. [Google Scholar] [CrossRef]
Rigatos, G.G. Nonlinear Kalman Filters and Particle Filters for Integrated Navigation of Unmanned Aerial Vehicles. Robot. Auton. Syst. 2012, 60, 978–995. [Google Scholar] [CrossRef]
Dryanovski, I.; Morris, W.; Xiao, J. Multi-Volume Occupancy Grids: An Efficient Probabilistic 3D Mapping Model for Micro Aerial Vehicles. In 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems; IEEE: New York, NY, USA, 2010; pp. 1553–1559. [Google Scholar]
Ye, X.; Song, F.; Zhang, Z.; Zeng, Q. A Review of Small UAV Navigation System Based on Multisource Sensor Fusion. IEEE Sens. J. 2023, 23, 18926–18948. [Google Scholar] [CrossRef]
Pedram, A.R.; Funada, R.; Tanaka, T. Gaussian Belief Space Path Planning for Minimum Sensing Navigation. IEEE Trans. Robot. 2022, 39, 2040–2059. [Google Scholar] [CrossRef]
Castillo-Lopez, M.; Sajadi-Alamdari, S.A.; Sanchez-Lopez, J.L.; Olivares-Mendez, M.A.; Voos, H. Model predictive control for aerial collision avoidance in dynamic environments. In Proceedings of the 26th Mediterranean Conference on Control and Automation (MED); IEEE: New York, NY, USA, 2018; pp. 1–6. [Google Scholar]
Xu, Z.; Jin, H.; Han, X.; Shen, H.; Shimada, K. Intent prediction-driven model predictive control for UAV planning and navigation in dynamic environments. IEEE Robot. Autom. Lett. 2025, 10, 4946–4953. [Google Scholar] [CrossRef]
Tordesillas, J.; How, J.P. PANTHER: Perception-Aware Trajectory Planner in Dynamic Environments. IEEE Access 2022, 10, 22662–22677. [Google Scholar] [CrossRef]
Wu, H.; Liu, W.; Ren, Y.; Liu, Z.; Wei, H.; Zhu, F.; Li, H.; Zhang, F. Flying through Cluttered and Dynamic Environments with LiDAR. arXiv 2025, arXiv:2504.17569. [Google Scholar] [CrossRef]
Hervas, J.R.; Reyhanoglu, M.; Tang, H.; Kayacan, E. Nonlinear Control of Fixed-Wing UAVs in Presence of Stochastic Winds. Commun. Nonlinear Sci. Numer. Simul. 2016, 33, 57–69. [Google Scholar] [CrossRef]
Aoude, G.S.; Luders, B.D.; Joseph, J.M.; Roy, N.; How, J.P. Probabilistically Safe Motion Planning to Avoid Dynamic Obstacles with Uncertain Motion Patterns. Auton. Robot. 2013, 35, 51–76. [Google Scholar] [CrossRef]
Mendez, A.P.; Whidborne, J.F.; Chen, L. Wind Preview-Based Model Predictive Control of Multi-Rotor UAVs Using LiDAR. Sensors 2023, 23, 3711. [Google Scholar] [CrossRef] [PubMed]
Mesbah, A. Stochastic Model Predictive Control: An Overview and Perspectives for Future Research. IEEE Control Syst. Mag. 2016, 36, 30–44. [Google Scholar]
Hou, Y.; Chen, D.; Yang, S. Adaptive Robust Trajectory Tracking Controller for a Quadrotor UAV with Uncertain Environment Parameters Based on Backstepping Sliding Mode Method. IEEE Trans. Autom. Sci. Eng. 2023, 22, 4446–4456. [Google Scholar] [CrossRef]
Rice, C.R.; McDonald, S.T.; Shi, Y.; Gan, H.; Lee, W.S.; Chen, Y. Perception, Path Planning, and Flight Control for a Drone-Enabled Autonomous Pollination System. Robotics 2022, 11, 117. [Google Scholar] [CrossRef]
Yang, P.; Li, Z.; Yan, H.; Rao, K. Guidance Drone: Navigating Perception-Failure UGV with UAV Assistance in Cluttered Environments. In Proceedings of the 14th Asian Control Conference, Dalian, China, 5–8 July 2024. [Google Scholar]
Bartolomei, L.; Teixeira, L.; Chli, M. Perception-Aware Path Planning for UAVs Using Semantic Segmentation. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 25–29 October 2020; pp. 2555–2562. [Google Scholar]
Liu, D.; Xu, L. Robot Path Planning and Obstacle Avoidance Algorithm Based on Visual Perception. Neural Comput. Appl. 2025, 37, 28495–28512. [Google Scholar] [CrossRef]
Gómez Arnaldo, C.; Zamarreño Suárez, M.; Pérez Moreno, F.; Delgado-Aguilera Jurado, R. Path Planning for Unmanned Aerial Vehicles in Complex Environments. Drones 2024, 8, 288. [Google Scholar] [CrossRef]
Taheri, A.; Ghodousian, A.; Abedian, R. Review of Path Planning Models, Environmental Constraints, and Application Domains in Drone Delivery Systems. J. Algorithms Comput. Technol. 2024, 18, 1748–1765. [Google Scholar]
Ahmed, G.; Sheltami, T.; Ghaleb, M.; Hamdan, M.; Shuaib, K. Energy-Efficient Internet of Drones Path-Planning Study Using Meta-Heuristic Algorithms. Appl. Sci. 2024, 14, 3772. [Google Scholar]
Wu, Y.; Low, K.H.; Pang, B.; Tan, Q. Swarm-Based 4D Path Planning for Drone Operations in Urban Environments. IEEE Trans. Veh. Technol. 2021, 70, 8389–8401. [Google Scholar] [CrossRef]
Shi, L.; Jiang, W.; Luo, Z.; Yang, L. Enhancing Adaptability: Hierarchical Frontier-Based Path Planning for Navigation in Challenging Environments. IEEE Robot. Autom. Lett. 2024, 9, 4551–4558. [Google Scholar] [CrossRef]
Suanpang, P.; Jamjuntr, P. Optimizing Autonomous UAV Navigation with D* Algorithm for Sustainable Development. Sustainability 2024, 16, 6225. [Google Scholar] [CrossRef]
Saeed, R.A.; Omri, M.; Abdel-Khalek, S.; Ali, E.S. Optimal Path Planning for Drones Based on Swarm Intelligence Algorithm. Neural Comput. Appl. 2022, 34, 12837–12856. [Google Scholar] [CrossRef]
Ortlieb, M.; Adolf, F.M. Rule-Based Path Planning for Unmanned Aerial Vehicles in Non-Segregated Air Space over Congested Areas. In Proceedings of the 39th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, 11–15 October 2020; pp. 1–7. [Google Scholar]
McTegg, S.J.; Tarsha Kurdi, F.; Simmons, S.; Gharineiat, Z. Comparative Approach of Unmanned Aerial Vehicle Restrictions in Controlled Airspaces. Remote Sens. 2022, 14, 5891. [Google Scholar] [CrossRef]
Causa, F.; Franzone, A.; Fasano, G. Strategic and Tactical Path Planning for Urban Air Mobility: Overview and Application to Real-World Use Cases. Drones 2022, 6, 350. [Google Scholar] [CrossRef]
Feng, O.; Zhang, H.; Tang, W.; Wang, F.; Feng, D.; Zhong, G. Digital Low-Altitude Airspace Unmanned Aerial Vehicle Path Planning and Operational Capacity Assessment in Urban Risk Environments. Drones 2025, 9, 312. [Google Scholar] [CrossRef]
Palmerius, K.L.; Uggla, A.; Fylkner, G.; Uggla, A.C. End-to-End Drone Route Planning in Flexible Airspace Design. Transp. Res. Part C Emerg. Technol. 2024, 158, 104428. [Google Scholar] [CrossRef]
Xu, C.; Liao, X.; Tan, J.; Ye, H.; Lu, H. Recent Research Progress of Unmanned Aerial Vehicle Regulation Policies and Technologies in Urban Low Altitude. IEEE Access 2020, 8, 74583–74599. [Google Scholar] [CrossRef]
Ahmed, G.; Sheltami, T.R. A Safety System for Maximizing Operated UAVs Capacity under Regulation Constraints. IEEE Access 2023, 11, 55321–55334. [Google Scholar] [CrossRef]
Sharon, G.; Stern, R.; Felner, A.; Sturtevant, N.R. Conflict-Based Search for Optimal Multi-Agent Pathfinding. Artif. Intell. 2015, 219, 40–66. [Google Scholar] [CrossRef]
Kavraki, L.E.; Svestka, P.; Latombe, J.C.; Overmars, M.H. Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces. IEEE Trans. Robot. Autom. 1996, 12, 566–580. [Google Scholar] [CrossRef]

Figure 1. Overview taxonomy illustrating the structure of the paper and categorized research areas.

Figure 2. Classical RCSR pathfinding algorithms relevant to UAV navigation. (Blue and green markers in each image indicate the source and destination points, respectively. The red lines represent the paths detected by the proposed algorithms.)

Figure 3. Taxonomy of reinforcement learning–based RCSR pathfinding approaches.

Figure 4. Categories of UAV Datasets for Pathfinding pipeline purposes.

Figure 5. Sample Images from Real-World Datasets. Each image represents a data category, and the labels below indicate example benchmark datasets. For clarity: pNEUMA is a large-scale traffic trajectory dataset; EuRoC MAV is the European Robotics Challenge Micro Aerial Vehicle dataset; UZH-FPV is the University of Zurich First-Person View drone dataset; and ICARUS refers to a disaster response and wildfire monitoring project dataset.

Figure 7. Uncertainty categories and corresponding planning methods in UAV navigation.

Table 1. Comparison of representative UAV path planning surveys and frameworks across key deployment-relevant dimensions.

Study	Year	Primary Focus	Planning Algorithms	Simulation and Testbeds	Risk and Safety	Real-World Constraints
Jones et al. [13]	2023	Environmental complexity and planning taxonomy	✓	✗	✗	✓
Meng et al. [3]	2025	Comprehensive survey of UAV planning methods	✓	✓	✗	✓
Debnath et al. [14]	2024	Remote-sensing missions and obstacle avoidance	✓	✗	✗	✓
Gugan and Haque [15]	2023	Limitations of autonomous UAV planners	✓	✗	✗	✓
Luo et al. [16]	2024	Mission- and environment-based planner categorization	✓	✗	✗	✓
Puente-Castro et al. [17]	2022	AI-based multi-UAV coordination and swarms	✓	✗	✗	✓
Davidović and Urošević [18]	2023	Urban airspace integration and regulatory constraints	✗	✗	✗	✓
Ghambari et al. [4]	2024	Mission-centric planning taxonomy	✓	✗	✗	✓
Primatesta et al. [19]	2019	Risk-aware urban UAV path planning	✗	✗	✓	✓
Tang et al. [20]	2023	Third-party risk modeling for UAV planning	✗	✗	✓	✓
Zhou et al. [21]	2024	Crash probability and economic risk integration	✗	✗	✓	✓
Ames et al. [22]	2019	Control barrier functions for safety-critical control	✗	✗	✓	✗
Hobbs et al. [23]	2023	Runtime assurance architectures	✗	✗	✓	✗
Sciancalepore et al. [24]	2024	Remote ID–based trajectory verification	✗	✗	✓	✓
This Survey	2025	RCSR framework integrating planning, risk, safety, and constraints	✓	✓	✓	✓

Legend: ✓ Covered; ✗ Not explicitly addressed.

Table 2. Scenario-oriented selection guide for UAV path planning algorithm families.

Application Scenario	Environment Characteristics	Recommended Algorithm Family	Rationale and Caution
Indoor single-UAV navigation	Cluttered, short-range, partially known, tight spaces	Trajectory optimization and motion-primitive search; incremental graph search for local updates	Produces smooth, dynamically feasible motion and supports frequent replanning in constrained spaces, but is sensitive to model mismatch and onboard compute limits.
Outdoor single-UAV navigation in large spaces	Large-scale, open, continuous 3D airspace, moderate obstacle density	Sampling-based planners (RRT, Informed RRT, BIT*)	Scales well to continuous spaces and supports flexible cost and constraint design but may require smoothing and lacks strong hard real-time guarantees.
Dynamic obstacle avoidance	Time-varying obstacles, frequent map or cost updates, changing traversability	Incremental graph search (D* Lite, AD*, SIPP); local trajectory replanning	Reuses previous computation and adapts quickly to environmental changes, but still depends on simplified state representations and may require downstream safety filters.
Resource-constrained embedded UAVs	Limited onboard compute, memory, and battery budget	Bounded-suboptimal search and lightweight hierarchical planners	Provides feasible solutions quickly while controlling computational effort but may sacrifice optimality or rely on coarse environment models.
Multi-UAV coordination	Shared airspace, conflict resolution, communication dependence, joint task execution	Decentralized multi-agent planning; MAPF-inspired methods; multi-agent RL	Supports cooperation, conflict management, and distributed decision making, but safety certification and communication robustness remain difficult.
Safety-critical or high-uncertainty missions	Dense urban airspace, imperfect sensing, regulatory constraints, elevated risk	Hybrid planners combining classical planning with runtime assurance or safe RL layers	Balances adaptability with explicit safety supervision and constraint enforcement, but system integration is complex and certification remains challenging.
Regulation-constrained urban operations	Geofencing, no-fly zones, Remote ID, UTM, dynamic policy updates	Regulation-aware planners with incremental or hybrid replanning	Incorporates evolving compliance constraints directly into route generation but reduces routing flexibility and increases planning complexity.

Table 3. Comparison of major UAV simulators used in RCSR path-planning research.

Simulator	Realism/Physics	Sensor Fidelity	Autopilot Integration	Key Path-Planning Use Cases
MATLAB/Simulink	Low–Moderate (kinematic/simplified dynamics)	Minimal (custom models)	No autopilot stack	Algorithm benchmarking, meta-heuristics, 2D/3D grid search, convergence studies
Gazebo	High (realistic physics, wind, collisions)	High (camera, LiDAR, IMU, GPS)	Yes (PX4 or ArduPilot SITL)	End-to-end navigation, inspection, coverage, flight feasibility under noise
AirSim	High (photorealistic Unreal Engine)	Very High (RGB, depth, LiDAR)	Optional PX4 SITL or custom controller	Vision-based navigation, RL, dynamic obstacles, perception-planning integration
V-REP CoppeliaSim	Moderate (OMPL-based motion planning)	Moderate (customizable)	Basic (no native PX4)	Coverage path planning, geometric planners, indoor 3D navigation
Unity3D	Moderate (game engine physics)	Moderate (scripted sensors)	No autopilot stack	Swarm planning, PSO/flocking, heuristic navigation, ML-Agents environments
RL-Focused Simulators (Flightmare, gym-pybullet-drones)	High-speed lightweight physics	Low–Moderate (synthetic sensors)	No autopilot stack	Reinforcement learning, agile flight, multi-agent RL
jMAVSim + PX4 SITL	Low–Moderate (light 3D physics)	Low (basic)	Yes (PX4 SITL)	Waypoint following, controller validation, low-overhead autopilot testing
OpenUAV	Moderate– High (UE4-based scenes)	High (vision-heavy)	Yes (PX4/ArduPilot)	Cloud-based multi-UAV experiments, VLN, large-scale evaluation
MORSE	Moderate– High (Blender physics)	High (configurable sensors)	Partial (ROS-based control)	Sensor-driven planning, cooperative perception, conceptual UAV design
Paparazzi NPS	High for fixed-wing (JSBSim dynamics)	Moderate–High (sensor noise models)	Yes (Paparazzi)	Waypoint planning, autopilot evaluation, fixed-wing research
UavSim	Moderate	Low–Moderate	No autopilot stack	Multi-UAV cooperation, small-object detection, MAPF-like tasks
Aviones	Moderate-High (fixed-wing 6-DoF)	Moderate	Yes (HIL autopilot libraries)	Fixed-wing path planning, terrain coverage, energy-aware navigation
SkyRover	High (Gazebo + ROS2)	High	Yes (ROS2 controllers)	MAPF benchmarking, UAV–AGV coordination, multi-agent planning
Grid-Based Simulators	None (discrete environment)	None	No autopilot stack	Indoor grid pathfinding, conflict resolution, UTM MAPF, benchmark datasets

Table 4. Overview of real-world UAV-related datasets, their contents, indoor/outdoor coverage, and their capture methods.

Dataset	Data Contents	Indoor/Outdoor	Capture Method
OpenStreetMap (OSM)	Vector map data: roads, buildings, land use and terrain	Outdoor	Crowdsourced GIS data; collected via GPS traces, manual digitization, satellite mapping.
Massachusetts Road Dataset (MRD)	High-resolution aerial orthoimagery with pixel-level road annotations.	Outdoor	Aerial imagery captured from fixed-wing aircraft commissioned by MIT CSAIL.
pNEUMA/pNEUMA Vision	100 k+ multimodal trajectories (vehicles, cyclists, pedestrians); Vision includes bounding boxes and object tracks.	Outdoor	Captured by a swarm of synchronized DJI drones flying over Athens at 50–120 m altitude.
VisDrone	261 k video frames + 10 k still images with bounding boxes, classes, occlusion attributes.	Outdoor	Multiple UAV models capturing videos over 14 cities under various weather and lighting conditions.
UAVDT	80 k annotated frames; vehicle classes, weather, altitude, and camera-view labels.	Outdoor	UAV flights at 15–80 m altitude recording traffic scenes in cities and highways.
Stanford Drone Dataset (SDD)	Human and vehicle trajectories across 8 campus scenes with full scene semantics.	Outdoor	Static top-down cameras mounted on campus buildings.
KITTI	Stereo images, LiDAR scans, IMU, GPS odometry, and object labels.	Outdoor	Captured from a car equipped with stereo cameras and Velodyne HDL-64E LiDAR.
EuRoC MAV	Stereo images, IMU data, motion-capture ground truth; machine hall + Vicon rooms.	Indoor	Asctec Firefly UAV with stereo camera + IMU; ground truth from Vicon and Leica systems.
Zurich Urban MAV	High-resolution UAV flights in narrow urban streets, plazas, and GPS-denied regions.	Outdoor	Quadrotor with downward-facing camera flying through urban canyons.
PennCOSYVIO	Visual–inertial sequences across complex indoor–outdoor transitions.	Both	Hand-carried and UAV-mounted sensor rig (camera + IMU) moved through turbine hall and outdoor areas.
UZH-FPV Drone Racing Dataset	High-speed quadrotor flights with precise ground truth in cluttered tracks.	Indoor	FPV racing drones flown in a motion-capture arena with multi-camera setup.
MIT Blackbird Dataset	Massachusetts Road Dataset High-frequency (up to 10 kHz IMU) aggressive quadrotor maneuvers in varied scenes.	Indoor	High-speed drones recorded using a dense motion-capture camera network.
Mini-Drone Indoor Navigation Datasets	Narrow corridors, corners, warehouse-like indoor UAV navigation sequences.	Indoor	Small quadrotors equipped with RGB/IMU flown in controlled indoor environments.
Disaster and SAR Aerial Datasets	Wildfire imagery, collapsed buildings, search areas, victim annotations.	Outdoor	UAV and manned aircraft using EO/IR sensors during disaster-response missions.
Forest Navigation Datasets (TUM/ETH)	Dense forest imagery with depth/trajectory labels for natural-environment navigation.	Outdoor	UAVs flying under forest canopy with RGB or RGB-D cameras; sometimes handheld for ground truth.
DroneDeploy Mapping Dataset	Orthomosaics, DEMs, 3D reconstructions of industrial/agricultural sites.	Outdoor	Photogrammetry flights using DJI UAVs at multiple altitudes.

Table 6. Summary of uncertainty sources, impacts, associated trade-offs, and representative mitigation strategies.

Uncertainty Source	Impact on Path Planning	Key Trade-Off	Mitigation/Strategies (with References)
Imperfect Perception	Onboard sensors (vision, LiDAR, GPS/IMU, radar) provide noisy, incomplete, or delayed information; localization and mapping must be inferred rather than known exactly, causing trajectories that appear safe under nominal estimates to be unsafe in the true state.	Maintaining richer state distributions and higher-resolution maps improves obstacle awareness and local safety, but tends to increase memory use, computational demand, and latency on resource-constrained platforms, which may not be viable.	Kalman-style filtering (EKF, UKF) for probabilistic state estimation [169]; belief-space planning that optimizes over state distributions to reduce future uncertainty [167,168]; probabilistic occupancy grid mapping with multi-sensor fusion [170,171].
Dynamic Environments	Time-varying obstacles (people, vehicles, animals, other UAVs) and changing traversability (vegetation, water) can invalidate planned paths during execution; stale plans quickly become unsafe in cluttered or populated scenes.	More adaptive and frequent replanning improves responsiveness to environmental changes but increases onboard compute requirements and must be balanced against sensing latency and available processing power.	Belief-space reasoning over obstacle states with uncertainty-scaled safety margins [168]; model predictive control (MPC) with receding-horizon replanning [173,174]; explicit dynamic obstacle prediction using Kalman filters, Gaussian processes, or recurrent networks [169]; chance-constrained planning that bounds collision probability; sensor-aware planning to maintain obstacle visibility [175].
Stochastic Disturbances	Motion disturbances (wind gusts, turbulence, motor noise) physically deflect the UAV from its planned trajectory. Sensor disturbances (humidity, electronic noise, photonic interference) degrade the reliability of LiDAR, GPS, and camera inputs. Environmental disturbances (shifting water surfaces, reactive vegetation) alter the effective free space unpredictably.	Accounting for disturbances improves trajectory robustness and estimation quality but adds computational overhead for parameter estimation and sensor management; different disturbance types affect different sensors in distinct ways, making universal solutions extremely difficult to implement.	Gaussian process models for motion disturbances [178,179]; disturbance-aware trajectory optimization [177]; backstepping sliding mode method to calculate unknown parameters in real time [181]; stochastic model predictive control (SMPC) that adjusts replanning frequency based on proximity to disturbance sources [180].

Table 7. Summary of real-world UAV constraints, associated trade-offs, and representative mitigation strategies.

Constraint	Impact on Path Planning	Key Trade-Off	Mitigation/Strategies (with References)
Perception and mapping	Resolution limits in occupancy grids can yield conservative paths, while higher-resolution maps increase memory and compute demands; sensing degradation also reduces planning reliability. Real-world obstacles may also have irregular geometry and semantic meaning, which affect risk interpretation and safe traversal.	Improving map fidelity, sensing richness, and semantic obstacle understanding generally enhances obstacle awareness and local safety, but it also increases memory use, payload burden, latency, and onboard computational demand.	TSDF/ESDF representations for continuous geometry [182,184]; hybrid global occupancy with local ESDF refinement [185]; multi-modal fusion (LiDAR/vision/inertial) [186]; semantic segmentation and class-aware mapping for obstacle-aware planning [184]; adaptive resolution and region-of-interest updates [13,14].
Environmental effects and resource limits	Wind and weather increase uncertainty and energy use, while finite batteries and limited onboard compute restrict endurance, replanning frequency, and model complexity.	Accounting for disturbances and optimizing energy use can improve mission safety and endurance, but these gains often require additional sensing, prediction, and computation that small UAV platforms may not sustain continuously.	Disturbance-aware optimization and online wind estimation [59]; learning-based compensation [37]; energy-aware cost functions and recharge-aware routing [187,188]; hierarchical global–local planners [3,15]; distributed/offloaded computation in swarms [38].
Robustness and adaptation	Offline plans degrade under dynamics and model mismatch, which can reduce safety and mission reliability in unseen conditions.	More adaptive planners respond better to dynamic and uncertain environments, but they are often harder to interpret, validate, and certify than more structured model-based methods.	Frontier-based adaptive mapping [190]; runtime assurance and fallback controllers [191]; deep RL and meta-heuristic adaptation [192]; multi-objective adaptive replanning in swarms [49].
Airspace and regulatory constraints	NFZs, geofencing, Remote ID, and UTM compliance impose hard operational constraints, while policy differences across jurisdictions can complicate deployment.	Stronger regulatory compliance improves legal and operational safety, but it can reduce routing flexibility and increase planning complexity, especially in dynamic or cross-jurisdictional airspace.	Regulation-aware and rule-based planning [193]; cross-jurisdiction analyses [194]; low-altitude digital airspace and UTM infrastructures [18,196,197]; risk-aware regulatory models [195,198]; throughput-maximizing safety architectures [199]; Remote ID-based trajectory verification [24].

Table 8. Future research directions for UAV pathfinding under the Risk-Calibrated, Certifiably Safe, Resource-Aware (RCSR) framework.

RCSR Dimension	Current Limitations	Future Research Directions
Risk Calibration	Uncertainty handled using heuristic buffers and weak coupling between perception quality and planning risk [1,2]	Explicit uncertainty propagation, calibrated risk metrics, and chance-constrained or distributionally robust planning that dynamically adapts risk budgets based on sensing and localization confidence [3,12]
Certifiable Safety	Safety often assumed offline with limited runtime guarantees and weak integration between formal methods and mission planning [10]	Compositional safety architectures combining planners, formally constrained motion generation, and runtime assurance layers with lightweight online verification and safety shields [5,11]
Multi-UAV Coordination	Centralized coordination assumptions and reliance on reliable communication links [200]	Decentralized coordination under partial observability, intent sharing, and communication-aware conflict resolution integrated with regulatory policy constraints [3,7]
Resource Awareness	Limited treatment of onboard compute limits, energy budgets, and communication constraints [10]	Resource-aware planning algorithms, anytime optimization with bounded suboptimality, and hardware-aware implementations optimized for embedded platforms [4]
Evaluation and Benchmarking	Simulation-heavy validation with limited realism and poor cross-study comparability [3]	Integrated evaluation pipelines combining simulation, SIL, HIL, and field-testing with standardized metrics for safety violations, risk exposure, and resource consumption [4]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Johnson, N.; Shafaei, S.; Karem, A.; Sarkar, S. A Survey of Risk-Calibrated Certifiably Safe and Resource-Aware (RCSR) Path Planning for Unmanned Aerial Vehicles. Drones 2026, 10, 351. https://doi.org/10.3390/drones10050351

AMA Style

Johnson N, Shafaei S, Karem A, Sarkar S. A Survey of Risk-Calibrated Certifiably Safe and Resource-Aware (RCSR) Path Planning for Unmanned Aerial Vehicles. Drones. 2026; 10(5):351. https://doi.org/10.3390/drones10050351

Chicago/Turabian Style

Johnson, Nathan, Sima Shafaei, Andrew Karem, and Sayani Sarkar. 2026. "A Survey of Risk-Calibrated Certifiably Safe and Resource-Aware (RCSR) Path Planning for Unmanned Aerial Vehicles" Drones 10, no. 5: 351. https://doi.org/10.3390/drones10050351

APA Style

Johnson, N., Shafaei, S., Karem, A., & Sarkar, S. (2026). A Survey of Risk-Calibrated Certifiably Safe and Resource-Aware (RCSR) Path Planning for Unmanned Aerial Vehicles. Drones, 10(5), 351. https://doi.org/10.3390/drones10050351

Article Menu

A Survey of Risk-Calibrated Certifiably Safe and Resource-Aware (RCSR) Path Planning for Unmanned Aerial Vehicles

Highlights

Abstract

1. Introduction

2. Related Studies

3. Pathfinding Algorithms for Real-World UAVs

3.1. Classical Pathfinding Algorithms for UAV Navigation

3.1.1. Deterministic Grid-Based Pathfinding

Dijkstra and A*

Weighted and Bounded-Suboptimal Variants

3.1.2. Incremental Graph Search with Temporal and Environmental Constraints

LPA* and D* Lite

Anytime Variants (ARA*, AD*)

Safe Interval Path Planning (SIPP)

Summary and Limitations

3.1.3. Sampling-Based Planning in Continuous 3D Airspace

RRT and RRT*

Informed Sampling (Informed RRT*)

Batch Informed Trees (BIT*)

Implications for RCSR

3.2. Reinforcement Learning–Based Path Planning

3.2.1. Single-UAV Navigation and Obstacle Avoidance

3.2.2. Multi-UAV Cooperation and Deep Multi-Agent RL

3.2.3. Safe RL and Runtime Assurance

3.2.4. Evolutionary and Bio-Inspired Methods

3.2.5. Hybrid Learning–Planning Architectures and RCSR Alignment

3.3. Summary and RCSR Relevance

4. Simulation Frameworks, Testbeds and Datasets

4.1. Simulation Frameworks Supporting RCSR Path Planning

4.2. UAV Testbeds Supporting RCSR Validation

4.3. Datasets for Risk-Aware and Safety-Critical Path Planning

4.3.1. Geospatial and Aerial Image Datasets for Global Risk-Aware Path Planning

4.3.2. Vision-Based Perception Datasets Supporting Navigation and Obstacle Avoidance

4.3.3. Visual–Inertial and SLAM-Focused Datasets Used in UAV Navigation

4.3.4. Synthetic Datasets and Simulation-Based Benchmarks

4.3.5. High-Speed, Agile, and Indoor Navigation Datasets

4.3.6. Disaster, Forest, and Inspection Datasets

4.4. Evaluation Methods and Metrics

4.5. Summary and RCSR Relevance

5. Uncertainty-Aware Planning and Formal Safety Assurance

5.1. Imperfect Perception

5.2. Dynamic Environments

5.3. Stochastic Disturbances

5.4. Summary and RCSR Relevance

6. Real-World Environment Constraints

6.1. Perception and Mapping Constraints

6.1.1. Occupancy Grids and Their Limits

6.1.2. Distance Fields and Local Replanning

6.1.3. Sensor Modality and Compute Constraints

6.1.4. Scalability and Adaptive Mapping

6.2. Environmental Effects and Resource Limits

6.3. Robustness and Adaptation Strategies

6.4. Airspace and Regulatory Constraints

6.5. Summary and RCSR Relevance

7. Future Research Directions

7.1. Risk Calibration Under Uncertainty

7.2. Certifiable Safety and Runtime Assurance

7.3. Multi-UAV Coordination and Airspace Compliance

7.4. Resource-Aware Planning and Implementation

7.5. Integrated Evaluation and Benchmarking

8. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Anytime Variants (ARA, AD)