Next Article in Journal
System Integration and Cost Evaluation for Full-Stack and Full-Domain Intelligentization of Complete Vehicles
Previous Article in Journal
An Efficient and Lightweight Model for Traffic Object Detection in Autonomous Vehicles Under Nighttime Conditions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

MEMOBWO: A Novel Multi-Objective Optimization Algorithm for UAV Path Planning in Complex Urban Environments

College of Information and Artificial Intelligence, Yangzhou University, Yangzhou 225127, China
*
Author to whom correspondence should be addressed.
Actuators 2026, 15(6), 314; https://doi.org/10.3390/act15060314
Submission received: 3 April 2026 / Revised: 27 May 2026 / Accepted: 29 May 2026 / Published: 2 June 2026
(This article belongs to the Section Aerospace Actuators)

Abstract

Path planning for Unmanned Aerial Vehicles (UAV) in complex urban environments poses significant challenges for autonomous systems. This paper proposes a Multi-strategy Enhanced Multi-Objective Beluga Whale Optimization algorithm, termed MEMOBWO, to address these problems. The proposed MEMOBWO adopts a multi-objective optimization framework to overcome the limitations of traditional single-objective approaches while simultaneously enhancing exploration and exploitation capabilities through three complementary strategies. Firstly, a Chaotic Quasi-Opposition-Based Learning (CQOBL) strategy is introduced to enhance initial population diversity and quality. Secondly, a Hybrid Adaptive Position Update (HAPU) strategy is designed to dynamically balance global exploration and local exploitation. Finally, a Multi-Objective Thinking Innovation (MOTI) strategy is proposed as a targeted repair operator to overcome specific performance deficiencies of whale agents in weaker objectives. To evaluate its performance, the MEMOBWO was comprehensively tested through 20 standard multi-objective benchmark functions, as well as three-dimensional (3D) UAV path planning experiments in simulated urban environments with varying obstacle configurations, and was compared against a series of classical and recently proposed multi-objective optimization algorithms. Moreover, the overall performance of the algorithms was assessed using Hypervolume (HV) and Inverted Generational Distance (IGD) metrics and further tested using the Friedman test and Wilcoxon rank-sum test. Experimental results demonstrated that MEMOBWO achieved competitive performance across benchmark functions, and showed favorable overall performance against comparison algorithms in path planning tasks, attaining the lowest average Friedman rank as 1.14 and HV improvements of 15.24% to 30.86%. This study provides a promising optimization framework for multi-objective UAV path planning problems in urban environments, thereby lowering the tracking burden of downstream UAV flight-control and trajectory tracking.

1. Introduction

Unmanned Aerial Vehicles (UAVs) are aircraft operated via autonomous flight systems [1,2]. Currently, technologies for UAV are widely used in fields such as military reconnaissance [3], aerial photography [4], material transport [5], agricultural spraying [6], and more recently, in Internet of Things networks for energy-efficient data collection [7], etc. As one of the core issues of UAV technologies, the UAV path planning problems [8,9] have always been extensively studied. From the perspective of UAV trajectory tracking and flight-control execution, the quality of the planned reference path directly affects the downstream control burden. Paths with abrupt heading changes, excessive climbing-angle variations, or high curvature may increase tracking difficulty and impose larger demands on the flight-control system.
Discussions of path planning problems tend to be regarded as optimization problems. After the objective functions are defined, typically multiple, the selection of optimal paths requires the simultaneous minimization of all the objective functions. Since the minimum points of each function do not coincide, some are even conflicting, it means that a reduction in one single function may lead to an increase in another. To solve this issue, most existing studies choose to combine the functions into a single objective function by using a weighted sum approach [10], in which different objective functions are calculated with corresponding weights. Alternatively, through the main goal method, the constraint method is transformed into selecting the most important sub-goal from a single goal as the optimization goal, and the rest of the sub-goals as constraints. Each sub-goal is constrained by setting an upper bound. Although the methods mentioned above are straightforward to implement, it is still difficult to determine appropriate weights for each function in complex optimization problems while maintaining the optimality of the solution. Therefore, this paper employs the multi-objective optimization [11] approach to solve the UAV path planning problems.
As UAV technologies evolve towards more intelligent and interconnected paradigms, such as Digital Twin-assisted autonomous navigation [12] and IoT-enabled cooperative sensing [13], the operational environments for UAVs are becoming increasingly dynamic and complex. Depending on the differences of spatial dimension, the path planning problems can be categorized into two-dimensional (2D) and three-dimensional (3D) path planning [14]. The dimensionality of the environments significantly influences path planning approaches. In traditional 2D space, the simplified environmental modeling cannot intuitively reflect the complex terrain encountered by the UAVs during the urban missions, often leading to inaccurate optimization results. Three-dimensional spatial models can incorporate real terrain data to establish terrain models for the UAV flight, which are used to test the optimization performances of path planning algorithms, thereby enhancing the path planning capabilities of UAV and improving its efficiency in exploration and transport tasks [15]. Moreover, traditional path planning algorithms, including Artificial Potential Field Method [16], Rapidly-exploring Random Tree (RRT) [17], A* [18] and Dijkstra [19], are suitable for solving relatively simple path planning problems, such as operating obstacle avoidance on 2D maps. However, due to the difficulties handling dynamic obstacles, susceptibility to falling into local optima and high computational complexity, these methods are often difficult to apply in problems with complex 3D environments or diverse obstacle types. In contrast, meta-heuristic algorithms [20] are intelligent optimization methods based on the cooperative mechanisms of biological populations. These algorithms treat the feasible paths as candidate solutions and improve them via swarm intelligence methods [21], thus possessing advantages of high efficiency, flexibility and robustness when solving complex problems. Consequently, they are widely used for solving complex math problems [22]. Typical meta-heuristic algorithms include Genetic Algorithm (GA) [23], Particle Swarm Optimization (PSO) [24], Simulated Annealing (SA) [25] and Ant Colony Optimization (ACO) [26], etc. These meta-heuristic algorithms provide new approaches for solving UAV path planning problems [27], especially in the rather complex 3D urban environments.
In recent years, with the gradual development of related research, researchers worldwide have conducted further exploration into UAV path planning, proposing many improved path planning algorithms. Nature-based optimization techniques have been widely applied in relevant research. Among them, the Beluga Whale Optimization (BWO) algorithm [28] is a relatively novel swarm-based meta-heuristic algorithm inspired from the behaviors of beluga whales, which was proposed by Changting Zhong, et al. in 2022. It features a simple structure, ease of implementation and insensitivity to initial values, making it widely applicable to various kinds of optimization problems like feature selection [29], energy capacity allocation [30] and transportation scheduling [31] after its publication. At the same time, several variants of the algorithm have been introduced by relevant researchers. For example, SC Horng et al. have introduced an improved Beluga Whale Optimization algorithm for solving the simulation optimization problems with stochastic constraints [32]. In 2024, a multi-strategies improved Beluga Whale Optimization algorithm was proposed by Zhaoyong Fan et al. to optimize feature selection process [33]. Moreover, multiple Beluga Whale Optimization algorithms improved by hybrid strategies were introduced to solve engineering optimization problems [34,35]. However, few studies applied the BWO to UAV path planning problems, particularly in 3D urban environments. This research gap motivates our work to adapt the MEMOBWO for multi-objective path planning in 3D urban environments.
This paper proposes a novel algorithm called Multi-strategy Enhanced Multi-Objective Beluga Whale Optimization (MEMOBWO). The standard BWO is enhanced by three complementary improvement strategies. Chaotic Quasi-Opposition-Based Learning (CQOBL) is used to improve the quality and diversity of the initial population. Hybrid Adaptive Position Update (HAPU) dynamically balances global exploration and local exploitation through nonlinear energy factor. Multi-Objective Thinking Innovation (MOTI) is used as a targeted repair operator to overcome the performance defects of the solution on the weaker objective dimension. Our contributions in this work include: (i) the establishment of kinematic model of UAV, the objective function system of optimization model and the simulated complex 3D urban environments, (ii) proposal of three collaborative improvement strategies to systematically improve the performance of the algorithm in convergence, diversity and solution quality, covering the deficiency of standard BWO when dealing with more complex multi-dimensional optimization problems, (iii) verification of the effectiveness of MEMOBWO in dealing with multiple conflicting targets in simulated 3D urban environments.
To evaluate the performance of MEMOBWO in addressing UAV path planning problems, extensive experiments were conducted in simulated three-dimensional urban environments with varying obstacle configurations. The algorithm was tested against a series of multi-objective optimization algorithms, including MORBMO [36], NMOPSO [37], MOHHO, NSGA-II and standard MOBWO. NSGA-II is a classical optimization algorithm while the others are all recently published algorithms. The experimental results demonstrated that the MEMOBWO consistently generated flight paths with better performance across multiple conflicting objectives, and exhibited enhanced convergence characteristics toward the Pareto front while maintaining greater solution diversity. This underscores the effectiveness of MEMOBWO in balancing the exploration and exploitation behaviors, validating its advantages on offline path planning.
The remainder of this paper is organized as follows: Section 2 focuses on three-dimensional environmental modeling and objective function design for the path planning problem, Section 3 proposes the MEMOBWO algorithm based on the standard Beluga Whale Optimization algorithm and analyzes its main innovative components, and Section 4 involves comparative experiments conducted on a simulation platform using the proposed algorithm to verify its effectiveness, Section 5 summarizes the paper and outlines potential areas for further improvement in future research.

2. Problem Formulation

The UAV path planning problem is described as follows: under the physical and kinematic constraints of UAV, as well as the certain terrain obstacles and non-flying zones of the environments, a series of optimal flight paths of the UAV from a start point to a destination are planned. Efficient solutions to this question by optimization algorithms require three key formulations, including simulated environment modeling, kinematic modeling and the defining of objective functions.

2.1. Simulated Environment Modeling

Assuming the map constraints are known, one of the challenges in three-dimensional environment modeling is acquiring ground elevation parameters. This can be addressed by creating a 3D matrix to include the terrain size and resolution, where the value of each element indicates the elevation above ground at its corresponding position. The constraints in UAV path planning problems can be specifically categorized into physical constraints and environmental constraints. The first category includes constraints on the geometric properties of the generated paths, such as heading angle, climbing angle and curvature. The second category encompasses terrain obstacles and electromagnetic interference in urban environments. Currently, the main safety threats to flight in urban environments come from tall buildings, trees and street lights, as well as electromagnetic field interference generated by signal towers or power plants.
To represent typical urban flight constraints in a computationally tractable form, building-like obstacles are modeled as cuboids, small urban structures such as trees, street lamps, and poles are modeled as cylinders, and electromagnetic interference (EMI) risk regions are modeled as hemispherical soft-threat regions. These geometric primitives are used as planning-level obstacle and risk proxies for controllable and repeatable algorithm evaluation.

2.2. UAV Kinematic Model

On the premise that a certain safe distance is left between the UAV and the environmental obstacles, and the obstacle avoidance distance is much larger than the body size, the structure of UAV can be regarded as a particle, indicating that most of the influence of the physical conditions is ignored. However, due to the limited inertia and motor thrust of the real UAV, the generated paths still necessitates constructing mathematical models that accurately reflect the motion characteristics of UAV and fully accounting for its inherent physical performance limitations.
A point-mass kinematic model is adopted at the path geometric planning level. Its core state variables are the position P = x , y , z T of UAV in three-dimensional space and its velocity v. The motion is determined by the heading angle  ψ  and climbing angle  θ , with the fundamental kinematic equations given by:
x ˙ = v cos θ cos ψ y ˙ = v cos θ sin ψ z ˙ = v sin θ
where v denotes the velocity scalar along the forward axis of the UAV body. Equation (1) is defined to describe the geometric relationship between the position change rate, velocity and orientation angle of the UAV. The climbing angle θ , heading-angle change Δ ψ , climbing-angle change Δ θ and curvature of the generated path are constrained as:
| θ | θ max | Δ ψ | Δ ψ max | Δ θ | Δ θ max κ κ max
After defining the kinematic parameters of the UAV, the minimum turning radius R min of UAV is derived from the maximum roll angle ϕ max . Accordingly, the curvature κ ( s ) at any point on the paths must satisfy the constraint of R min to ensure that the UAV has the ability to perform the turning maneuver required by the path.
R min = v 2 g · tan ϕ max κ ( s ) 1 R min
where g = 9.8 m/s2. denotes the gravitational acceleration, variable s should be valued between 0 and the total length of the flight path.
Additionally, the initial paths produced by discrete algorithms are polylines, which contain multiple sharp and angular turns that would be mechanically impossible for the UAV to execute at a certain speed. The MEMOBWO algorithm utilizes the continuous Bezier smoothing technique to improve geometric continuity of generated paths and reduce abrupt local direction changes. The satisfaction of the constraints above is quantitatively evaluated before and after smoothing in the experimental section.

2.3. Objective Functions Definition

To evaluate the performance of the path planning algorithm, considering the simulated environments in 3D space and the constraints mentioned in the previous sections, the following four objective functions are designed: path spatial length, path vertical maneuver cost, path smoothness and path threat degree. Let the path P be represented by a sequence of N waypoints, the objective vector is defined as F = f 1 , f 2 , f 3 , f 4 T . Each component is detailed below. All the objective functions and constraints are designed to guide the algorithm to generate collision-free and geometrically executable paths.

2.3.1. Path Spatial Length Function

Assuming the UAV flies at a constant speed during the flight, generated paths with shorter length implies reduced flight time and energy consumption. Therefore, the spatial length of a feasible path is one of the crucial factors for assessing the algorithm’s performance.
The path spatial length function aims to minimize the total Euclidean distance of the flight path to ensure the UAV reaches the target in a possibly minimum time, to increase its operation efficiency. Let N be the number of waypoints and P i be the coordinate of the ith waypoint. The cost function is defined as:
f 1 = i = 1 N 1 P i + 1 P i 2
where P i + 1 P i 2 denotes the Euclidean norm of two contiguous waypoints in the generated path. Minimizing f 1 ensures the shortest geometrical route between the start and goal configurations. By minimizing the f 1 function, the algorithm can gradually find a shorter path from the starting point to the end point, reducing the task consumption time.

2.3.2. Path Vertical Maneuver Cost Function

The power consumption of the UAV is non-linearly related to changes in height, where ascending actions require higher motor torque to overcome gravity compared to level flight. During UAV flight, frequent altitude variations may increase the energy demand and impose additional burden on vertical maneuvering.
Let h i = z i + 1 z i represent the altitude difference between two consecutive waypoints, the objective function models the asymmetrical energy cost of vertical maneuvers as:
f 2 = i = 1 N 1 ε ( Δ h i )
The path vertical maneuver cost function is formulated to penalize climbing or descending:
ε ( Δ h i ) = μ climb · Δ h i , if Δ h i > 0 μ desc · | Δ h i | , Otherwise
where μ climb and μ desc are weighting coefficients. The parameters are set as μ climb > μ desc , typically ratio 3:1, like μ climb = 0.3 while μ desc = 0.1, to impose a higher penalty on climbing than on descending due to the higher energy consumption of climbing maneuvers.
The explanation of f 2 function is as Figure 1. Minimizing f 2 naturally encourages the algorithm to find paths similar to the contour type that could avoid unnecessary or sudden altitude changes in the flight.

2.3.3. Path Smoothness Function

During the flight of UAV, excessive turns and pitch maneuvers, as well as excessively large turning angles, can affect travel speed and increase energy consumption and the risk of damage. Therefore, a feasible path should minimize the number of sharp turns and reduce the maximum turning angle. The path smoothness objective function is expressed as:
f 3 = α 1 i = 1 N 2 σ yaw 2 ( i ) + α 2 i = 1 N 2 σ pitch 2 ( i )
where σ yaw and σ pitch represent the horizontal heading angle and vertical climbing angle of the UAV at the ith waypoint, respectively, and α 1 and α 2 are the weight coefficients of turn and pitch actions of UAV, typically set as α 1 = α 2 = 0.5.
σ yaw is defined as:
σ yaw = arccos V i x y · V i + 1 x y V i x y · V i + 1 x y
where V i x y is the projection vector of each flight segment on the horizontal plane O x y , · represents the modulus length of the vector. To ensure numerical stability, | | V i x y | | is defined to be greater than 0.
σ pitch is defined as:
σ pitch = arctan Δ z ( Δ x ) 2 + ( Δ y ) 2 2
where Δ x , Δ y and Δ z represent the changes of 3D coordinates of each flight segment in the path. This angle is the absolute value of the difference between the elevation angles of the two preceding and following flight segments, which is used to measure the changes of altitude in the vertical direction during the flight. By minimizing f 3 , the flight path generated by the algorithm will be smoother, and the burden of UAV performing turning and pitching actions will be reduced, generating smooth flight paths that are more suitable as reference paths for subsequent trajectory tracking.

2.3.4. Path Threat Degree Function

A feasible path must guarantee avoidance of physical collision while simultaneously mitigating the risk of control link loss due to the electromagnetic interference (EMI). In this work, the urban environment contains physical obstacles and spectral threats. Let C p h y s be the physical collision risk and C s p e c be the interference risk, the composite safety objective is expressed as:
f 4 = 1 N k = 1 N C p h y s ( p k ) + C s p e c ( p k )
where p k denotes the UAV position at sample k, and E n represents the nth EMI source with center C n and interference radius R n .
The physical collision cost C p h y s is calculated based on d v , S u a v and S s a f e , which means the distance to the nearest structure, the diameter of UAV and the safety distance between it and the obstacles, respectively. C p h y s ( p k ) is then defined as:
C p h y s ( p k ) = , d v S u a v 1 d v S u a v S s a f e , S u a v < d v < S u a v + S s a f e 0 , d v S u a v + S s a f e
A hemisphere interference is defined to simulate the potential field threat. Let D n = p k C n be the Euclidean distance to the EMI center, the spectral cost C s p e c is defined as:
C s p e c ( p k ) = μ · exp α · 1.5 R n D n R n , if D n < 1.5 R n 0 , Otherwise
C s p e c ( p k ) implies that the UAV perceives EMI sources as soft repulsive fields that extend 1.5 times beyond their physical radius R n , hence forcing the path planner to bend the path away from high-interference zones to preserve communication integrity. The explanation of the EMI interference zone is shown in Figure 2. Minimizing f 4 encourages the algorithm to generate flight paths with higher safety in complex urban environment, thus ensuring the reliability and the operational safety of the UAV.

3. The Proposed Approach

3.1. Standard BWO

The Beluga Whale Optimization algorithm simulates the swimming, foraging, and the ’Whale Fall’ behaviors of a specific kind of animals called beluga whales. Belugas are highly social animals, usually gathering in groups of 2 to 25 members. When summer comes, many creatures gather in some estuaries, making the whales gather to forage. Belugas with the coordination group attack and feed the fish by guiding them into shallow water. In addition, some whales may die and fall into the deep ocean during the migration or escaping from predators, which is also known as the ’Whale Fall’ behaviors, providing plenty of food for creatures without sunlight or oxygen. The BWO algorithm process contains three certain phases, including exploration phase, exploitation phase and ’Whale Fall’ phase. The exploration phase of BWO is established by considering the swimming behavior of beluga whales. Instead of a search agent moving randomly on its own, it updates its position based on the position of a randomly selected partner whale from the population. Therefore, the positions for beluga whales are updated as:
X i , j T + 1 = X i , P j T + ( X r , P j T X l , P j T ) · ( 1 + r 1 ) · sin ( 2 π r 2 ) , j = even X i , P j T + ( X r , P j T X l , P j T ) · ( 1 + r 1 ) · cos ( 2 π r 2 ) , j = odd
where T is the current iteration, X i , j T + 1 is the new position for the ith beluga whale on the jth dimension, P j ( j = 1 , 2 , , d ) is a random number selected from d-dimension, X i , P j T is the position of the ith beluga whale on P j dimension, X r , P j T and X l , P j T are the current positions for rth and lth beluga whale, which are both randomly chosen, r 1 and r 2 are random numbers, sin ( 2 π r 2 ) and cos ( 2 π r 2 ) mean fins of the mirrored beluga whales are toward the surface. According to the dimension chosen by odd and even number, the updated position reflects the synchronous or mirror behaviors of pair swimming or diving.
The exploitation phase of BWO is inspired by the preying behavior of beluga whales. They prey by sharing the information of positions for each other, considering the best candidate and others. The Levy flight strategy is introduced in the exploitative phase of BWO to enhance the convergence. The mathematical model of the exploitation phase is expressed as:
X i T + 1 = r 3 X b e s t T r 4 X i T + C 1 L F ( X r T X i T )
where T is the current iteration, X i T and X r T are current position for the ith beluga whale and a random beluga whale, X i T + 1 is the position of new position of the ith beluga whale, X b e s t T is the best position among beluga whales, r 3 and r 4 are random numbers between ( 0 , 1 ) , C 1 = 2 r d 1 t T m a x is the random jump strength that measuring the intensity of Levy flight. LF is the Levy flight function, calculated as:
L F = 0.05 · u · σ | v | 1 β
σ = Γ ( 1 + β ) · sin ( π 2 · β ) Γ ( 1 + β 2 ) · β · 2 β 1 2 1 β
where u and v are normally distributed random numbers, β is the default constant equal to 1.5 .
The beluga whales either migrate elsewhere or die. In order to keep the population size constant, the positions of beluga whales and step size of ’whale fall’ phase are used to establish the updated position. The formula is as:
X l T + 1 = r 5 X l T r 6 X r T + r 7 X s t e p
where r 5 , r 6 , r 7 ( 0 , 1 ) are random numbers, X s t e p is the step size of ’whale fall’, which is established as:
X s t e p = ( u b l b ) exp ( C 2 T T max )
where C 2 = 2 W f · n is the step factor which is related to the probability of ’whale fall’ and population size, u b and l b are upper and lower boundary of variables, respectively. It can be seen that the step size is affected by the boundaries of design variables, iteration and maximum iterative number. The probability of ’whale fall’ W f is calculated as:
W f = 0.1 0.05 · T T max

3.2. The Proposed MEMOBWO

The original BWO algorithm, due to its own limitations, will face problems of insufficient diversity of the solution set and be easy to fall into local optima when solving some more complex optimization problems. In order to apply BWO algorithm to multi-objective optimization problems and make it more adaptable in path planning problems, a series of corresponding improvement approaches is proposed in this section.

3.2.1. Chaotic Quasi-Opposition-Based Learning

The diversity and quality of the initial population are important for constructing a well-distributed Pareto archive in the early stage of the search.Traditional initialization methods [38] based on random distribution often lead to an uneven exploration of the search space, potentially trapping the algorithm in local optima. To overcome these limitations, this paper proposes a Chaotic Quasi-Opposition-Based Learning (CQOBL) strategy. This strategy introduces Quasi-Opposition-Based Learning [39] (QOBL) to evaluate both the current solution and its opposite solution simultaneously. It also introduces a Piecewise chaotic sequence to control the generation of quasi-opposite candidates between the center of the search space and the opposite point. After generating both the random population and the chaotic quasi-opposition population, non-dominated sorting and objective evaluation are used to select higher-quality individuals for the initial population. This design provides a more diverse sampling pattern while keeping the generated candidates within the feasible variable bounds.
Let X i j be the jth dimension of the ith individual in the population, bounded by [ l b j , u b j ] . The standard opposition solution X ˜ i j is defined as:
X ˜ i , j = l b j + u b j X i , j
The center of the search space, M j , is calculated as M j = ( l b j + u b j ) / 2 . The proposed CQOBL strategy generates a candidate solution X { i , j } new dynamically within the interval [ M j , X ˜ { i , j } ] , governed by a chaotic sequence. The formulation is defined as:
X i , j new = M j + γ k · ( X ˜ i , j M j )
where γ k , valued between ( 0 , 1 ) , represents the chaotic value generated by the Piecewise chaotic map at iteration k. The Piecewise map is selected for its superior ergodicity and uniform distribution properties compared to the Logistic or Tent maps. The mathematical expression of the Piecewise map is defined as:
γ k + 1 = γ k P , 0 γ k < P γ k P 0.5 P , P γ k < 0.5 1 γ k P 0.5 P , 0.5 γ k < 1 P 1 γ k P , 1 P γ k < 1
By utilizing γ k , the algorithm can perform a fine-grained, non-linear scan of the potential optimal region between the geometric center and the opposition point. The operation principle of CQOBL is shown in Figure 3. This mechanism enhances the algorithm’s ability to jump out of local optima and improves the diversity of the initial population and subsequent generations.

3.2.2. Hybrid Adaptive Position Update

Existing meta-heuristic algorithms tend to introduce Lévy flight to enhance global exploration or spiral search to strengthen local exploitation. In contrast, HAPU is designed to improve the position update process of MEMOBWO by coordinating long-range exploration and local exploitation in a stage-adaptive manner. It combines three components in a unified update mechanism, including a nonlinear energy factor, an adaptive switching probability and a hybrid stochastic exploration operator.
The behavioral patterns of the beluga population are influenced by their energy states. To simulate this biological characteristic and to quantitatively describe the temporal evolution of the search process, a nonlinear adaptive energy factor B t is introduced to the algorithm. Unlike linear attenuation strategies, B t reflects energy fluctuations and nonlinear decay in complex environments. The formulation of B t is defined as:
B t = B 0 1 t T max α cos ( 2 π · rand )
where B 0 represents the initial energy state, typically set as constant 2, t and T max denote the current iteration index and the maximum number of iterations, respectively, and α is a nonlinear exponent controlling the decay rate.
The introduction of B t provides the algorithm with a global evolutionary mark, whose oscillatory decay trend characterizes the overall transition of the population from broad exploration to fine-grained exploitation. Under the energy evolution governed by B t , an adaptive switching probability P a ( t ) is introduced as the core control parameter to explicitly guide the behavioral decisions of individuals at different evolutionary stages. To achieve a smooth transition from exploration-dominated to exploitation-dominated behavior, the switching threshold P a ( t ) is designed as a cosine-based monotonically decreasing function of the iteration index t defined as:
P a ( t ) = μ + ( 1 μ ) · cos π · t 2 · T max
where μ is a constant coefficient that controls the lower bound of the switching probability. With the introduction of B t and P a ( t ) , the mechanism of dynamic cooperation between Lévy flight and spiral search is defined as:
X t + 1 = L ( X t ) , if B t < P a ( t ) S ( X t ) , Otherwise
where L ( X t ) and S ( X t ) denote the Lévy flight operator and the spiral search operator, respectively. During the early stages of iteration, P a ( t ) remains at a relatively high level, leading to a high probability of triggering the Lévy flight operator L ( X t ) . Leveraging its heavy-tailed distribution, Lévy flight drives individuals to perform long-distance jumps, thereby maximizing the coverage of the search space. As the iteration proceeds, P a ( t ) gradually decreases, causing the algorithm to favor the spiral search operator S ( X t ) , which guides individuals to approach the current best solution via logarithmic spiral trajectories for fine-grained exploitation.
The mathematical definition of cooperative operators is as follows. Focusing on global exploration, the position update rule is defined as:
L c ( λ , γ , t ) = 1 ω ( t ) · L F + ω ( t ) · Cauchy ( 0 , γ )
where Cauchy ( 0 , γ ) represents the standard Cauchy distribution with location parameter zero and scale parameter γ . The time-varying mixing weight ω ( t ) is given by:
ω ( t ) = 1 exp ( κ · t T max )
where κ is usually set to 3 to govern the transition speed of the weight. The scale parameter γ is scaled according to the problem dimension as γ = 0.1 · u b l b d , where u b and l b are the upper and lower bounds of the search space and d is the dimensionality. The innovation lies in combining the heavy-tail property of the Lévy distribution, which facilitates large-step global exploration, with the broad-tail characteristic of the Cauchy distribution, which promotes fine-grained local search. Through ω ( t ) , the HAPU strategy forms a multi-level adaptive optimization framework.
Focusing on deep exploitation, this rule is defined as:
S ( X t ) = D · e b l · cos ( 2 π t ) + X best
where D denotes the distance between the individual and the current best solution, b is a constant defining the shape of the logarithmic spiral, and l [ 1 ,   1 ] is a random number. This equation enables search agents to scan the target region locally with a smoothly contracting path.
The proposed HAPU strategy provides a staged and probabilistic update mechanism that adjusts both the search mode and perturbation structure during the iteration process. By overcoming the limitations of single-mechanism strategies in terms of search breadth and depth, it enhances the global optimization efficiency of the algorithm.

3.2.3. Multi-Objective Thinking Innovation Strategy

The search operators in optimization algorithms usually apply uniform evolutionary pressure on all decision variables, which lacks the flexibility to repair specific performance deficiencies. Inspired by the concept of “Thinking and Innovation” [40], the Multi-Objective Thinking Innovation (MOTI) strategy uses different elite guidance for different individuals according to their weakest objectives. The repair direction for the population is no longer uniform, but is adaptively determined by the objective-level deficiency of each individual.
Given the disparate units and magnitudes of the objective functions, direct comparisons of objective values is infeasible. At each generation t, the objective values of the population are dynamically normalized. Let f m ( x i ) denote the mth objective value of the ith whale, the normalized value f ^ m ( x i ) is calculated as:
f ^ m ( x i ) = f m ( x i ) f m min f m max f m min + ε
where f m max and f m min denote the maximum and minimum values of the population on the mth objective dimension, respectively, and ε is a small constant introduced to avoid division by zero. Through this linear mapping, all objective functions are projected onto a dimensionless interval [ 0 ,   1 ] , thereby eliminating the scale discrepancies among different physical quantities of the objective functions. This normalization provides a unified measurement basis for subsequent performance evaluation.
After the normalization of dynamic objectives, the algorithm identifies the performance deficiency of each individual based on the normalized objective matrix. For minimization problems, a larger normalized objective value indicates poorer performance on that dimension. Accordingly, the weakest objective index k i of individual X i is determined as:
k i = arg max m { 1 , , M } f ^ m ( X i )
This step identifies the primary performance deficiency of the current solution, providing explicit directional guidance for subsequent targeted search.
When the weakest dimension k i is determined, the algorithm performs targeted refinement. Specifically, the individual X i learns from the elite solution X best , k , which exhibits the best performance on objective k i within the current population. The position update rule is defined as:
X i new = X i + φ · ( X best , k X i ) + γ · ( R 1 R 2 )
where X best , k denotes the individual with the minimum fitness value on the objective, φ [ 0 ,   1 ] is a random learning rate that controls the degree of attraction toward the elite solution, and γ is a disturbance coefficient. Term ( R 1 R 2 ) represents a random difference vector, which is introduced to preserve population diversity and prevent premature convergence.
The flow chart of the MOTI strategy is shown as Figure 4. This proposed strategy acts as a supplementary local refinement mechanism that helps individuals improve their most deficient objective while maintaining the multi-objective search framework.

3.3. Time Complexity Analysis

The time complexity of the MEMOBWO algorithm is mainly determined by the population size N, number of objectives M, the maximum number of iterations T, the problem dimension D and the objective evaluation cost of one candidate path C f . The original BWO algorithm has a complexity of O ( T × N × D ) .
The proposed MEMOBWO introduces three improvement strategies. Their complexities are analyzed by examining the mathematical formulas involved:
CQOBL: This strategy is applied only once during initialization. It generates a chaotic sequence and computes a quasi-oppositional solution for each individual. Each operation involves basic arithmetic and logic per dimension, resulting in a complexity of O ( N × D ) . Since it is not repeated iteratively, its contribution does not multiply by T.
HAPU: For each iteration, this strategy updates every individual’s position using the energy factor B t , the adaptive switching probability P a ( t ) , and either the Lévy flight or spiral search operator. All calculations are performed per dimension per individual, with a complexity of O ( N × D ) per iteration. Over T iterations, the total complexity is O ( T × N × D ) .
MOTI: In each iteration, this strategy normalizes the objective values (Equation (29), O ( N × M ) , where M is a small constant), identifies the weakest objective dimension (Equation (30), O ( N × M ) ), and performs a targeted position update (Equation (31), O ( N × D ) ). All operations are linear in N and D, leading to a per-iteration complexity of O ( N × D ) . Thus, the total cost is O ( T × N × D ) .
The objective evaluation of all individuals requires O ( N × C f ) per iteration. For UAV path planning, C f includes the computation of path spatial length, vertical maneuver cost, smoothness, and safety-related threat cost, and is therefore reported separately. Archive management involves dominance comparison and crowding-distance-based pruning. During archive updating, the dominance comparison cost is O ( M ( A + N ) 2 ) , and the crowding-distance pruning cost is approximately O ( M × A l o g A ) . Thus, the per-iteration complexity can be expressed as:
O iter = O ( N × D ) + O ( N × M ) + O ( N × C f ) + O ( M ( A + N ) 2 ) + O ( M × A log A )
Therefore, the overall time complexity of MEMOBWO is described as:
O ( MEMOBWO ) = O ( N × D ) + T O iter
The analysis confirms that the proposed improvement strategies of MEMOBWO remain linear with respect to N and D, while the complete multi-objective implementation also includes objective evaluation and archive management costs. Since M and A are fixed in the experiments, and each run of path planning experiments can be completed within 1 min, the computational cost remains acceptable for offline mission planning scenarios.

3.4. Implementation of MEMOBWO

The flowchart of MEMOBWO is described as Figure 5. Furthermore, Figure 6 shows a schematic diagram of the MEMOBWO algorithm system model, where Figure 5 constitutes the core layer of the algorithm.
The detailed path planning steps of the proposed MEMOBWO algorithm are operated as Algorithm 1:
Algorithm 1 Implementation of MEMOBWO
  1:
Initialize parameters: population size N, maximum iterations T max , archive size A s i z e
  2:
Begin
  3:
Initialize chaotic sequence γ using Piecewise map
  4:
Initialize population P rand with random positions within [ l b , u b ]
  5:
Generate quasi-opposition population P quasi using CQOBL strategy by Equation (21).
  6:
Select top N individuals from { P rand P quasi } to form initial population X 0
  7:
Evaluate objective functions F ( X 0 ) = [ f 1 , f 2 , f 3 , f 4 ] T
  8:
Initialize archive A with non-dominated solutions from X 0
  9:
Set iteration counter t = 0
10:
while  t < T max  do
11:
      Update nonlinear energy factor B t using Equation (23)
12:
      Update adaptive switching probability P a ( t ) using Equation (24)
13:
      for each whale X i in population do
14:
    if  B t < P a ( t )  then
15:
       Update position using Lévy Flight operator L ( X , t ) by Equation (26)
16:
    else
17:
       Update position using Spiral Search operator S ( X , t ) by Equation (28)
18:
    end if
19:
    Amend violated boundaries
20:
      end for
21:
      Calculate normalized objective values f m ( X ) by Equation (29)
22:
      for each whale X i in population do
23:
    Identify weakest dimension index k , i = arg max { f m ( X i ) }
24:
    Identify dimension elite X best , k
25:
    Update position using MOTI by Equation (31)
26:
      end for
27:
      Evaluate fitness of updated population X t + 1
28:
      Update archive A by adding non-dominated solutions from X t + 1
29:
      Remove dominated solutions from A
30:
      if  | A | > A s i z e  then
31:
    Apply crowding distance sorting
32:
    Prune excess solutions to maintain diversity
33:
      end if
34:
       t = t + 1
35:
end while
36:
Return Pareto optimal solution set A
37:
End

4. Experiments and Comparisons

Relevant simulated urban environments, as well as a series of benchmark functions, are prepared on the certain data simulation platform in order to test the feasibility and optimization performance of MEMOBWO algorithm in face of UAV path planning problem in complex urban environment.

4.1. Performance Metrics

4.1.1. Simulation Scenarios Setup

The data simulation platform is MATLAB R2023b. Due to the restriction that the flight height of UAV should not be higher than 120 m in the certain countries, the simulated city flight area established on the simulation platform is set as a three-dimensional space with dimensions of 1000 m in length, 1000 m in width and 120 m in height. The main threat sources are cubic building obstacles in the city, cylindrical obstacles such as trees and light poles, and electromagnetic interference from signal towers in the city. The discrete flight paths are generated by randomly defining a number of way points in the three-dimensional space, and then combined with the starting point and ending point. All the discrete paths are smoothed by Bezier interpolation method to obtain the final flight path. In the three-dimensional simulated urban environment, the MEMOBWO algorithm is used to search and optimize the navigation points in the path, so as to obtain better path results. Each simulated environment is randomly changed to verify the generality of the algorithm in urban environment: scenario 1 is the case of building obstacles only, scenario 2 to add the elongated cylindrical obstacles such as telephone poles, trees and building roofs, and scenario 3 the urban environment with electromagnetic interference areas, in addition to the obstacles above. The algorithm parameters in the experiments were configured as population size N = 100, maximum iterations T m a x = 500, archive size A s i z e = 50.

4.1.2. Normalized Objective Functions

It is necessary to note that during the iteration process of MEMOBWO algorithm, the algorithm generates a comprehensive Pareto front to preserve solution diversity and accommodate decision-makers with different preferences in practical use. The weighted-sum method was employed solely for the objective comparison of the final selected solution across different algorithms in our simulation studies. This approach provides a single, scalar performance metric to clearly illustrate the comprehensive optimization ability of each algorithm. The evaluation function is defined as:
F nor = i = 1 4 w i f i
where f i represents the four objective functions mentioned in Section 2.3, from f 1 to f 4 are path spatial length, path vertical maneuver cost, path smoothness and path threat degree. w i is the weighted value of each objective function, hence the sum of all w i in the formula should be 1. In our experiments, the weighted values of all the problems and scenarios were set as w 1 = w 2 = w 3 = w 4 = 0.25 based on fairness of experiment and preferences of users in practical problems. In addition to the weighting coefficients in different cases to be balanced, all objective functions need to be normalized to facilitate data comparisons.

4.1.3. Hypervolume Metric

Hypervolume (HV) metric is one of the core indicators to evaluate the quality of the solution set of multi-objective optimization algorithms. It is able to measure the convergence and diversity of the solution set at the same time, and is frequently used to evaluate the comprehensive operation ability of the algorithm. For minimization problems, a larger HV value indicates better convergence and diversity of the solution set. The formula of HV calculation is defined as:
H V ( P ) = Vol x P [ f 1 ( x ) , r 1 ] × × [ f m ( x ) , r m ]
where P is the set of non-dominated solutions, r is the reference point selected for HV index calculation. All objective values are normalized to [ 0 ,   1 ] before HV calculation. The reference point is defined based on the worst objective values among all algorithms and remains identical for all experiments, defined as r = ( 1.1 f 1 max , , 1.1 f m max ) , which is set slightly worse than the worst objective values obtained among all algorithms.

4.1.4. Inverted Generational Distance Metric

Inverted Generational Distance (IGD) is a convergence metric that assesses how well the approximations align with the Pareto Front achieved by the algorithms. The metric is calculated as follows:
I G D = i = 1 n d i 2 n
where n represents the total quantity of elements in the True Pareto Front, and d i denotes the Euclidean distance between the ith solution in Pareto Front and its closest corresponding point within the True Pareto Front. A zero value of this metric signifies that all generated solutions exactly coincide with the True Pareto Front.

4.1.5. Friedman Test

To statistically validate the performance differences among the algorithms, the Friedman test [41] was employed. This non-parametric test is appropriate for comparing multiple algorithms across different test scenarios. For each scenario, the comparison algorithms were ranked based on their final HV values from multiple independent runs. The Friedman test statistic is calculated as:
χ F 2 = 12 N k ( k + 1 ) j = 1 k R j 2 k ( k + 1 ) 2 4
where k is the number of comparison algorithms, N is the number of simulation scenarios, and R j represents the average rank of the jth algorithm. This test reveals statistically significant differences among the comparison algorithms. The algorithm with lower rank shows better performance.

4.1.6. Wilcoxon Rank Sum Test

As a non-parametric test method, the core advantage of Wilcoxon rank sum test [42] is that the sample data does not depend on specific distributions, which makes it particularly suitable for dealing with the non-normality and heteroscedasticity problems that may exist in the performance metrics of the algorithms.

4.2. Performance on Multi-Objective Benchmark Suites

Experiments of benchmark suites are tested on the proposed MEMOBWO algorithm to evaluate its performance. The rationale for conducting experiments across such a diverse range of benchmark problem sets is to establish a testing framework that ensures fairness, consistency, and neutrality in the evaluation of the algorithms. Twenty standard multi-objective test problems were selected from three benchmark suites. The performance of MEMOBWO is evaluated on the benchmark problems. For all the experiments, 100 search agents were employed, with a maximum number of iterations set as 500. The statistical results are shown in Table 1 and Table 2.
As shown in the table, MEMOBWO demonstrates competitive performance across the majority of benchmark functions. Notably, on DTLZ2, DTLZ5 and DTLZ6, the IGD values of MEMOBWO closely approach those of the best-performing algorithms, while on DTLZ1, MEMOBWO achieves the highest HV value. On functions with more complex landscapes such as ZDT4 and DTLZ7, MEMOBWO exhibits comparatively higher IGD values, indicating that its domain-adaptive design is effectively leveraged in the multi-objective path planning context.
To validate the statistical significance of the observed differences, the Friedman test and Wilcoxon rank sum test were conducted. For the HV metric in Table 3, MEMOBWO achieves statistically better improvements, while showing no significant difference when compared with NMOPSO algorithm. For the IGD metric in Table 4, MEMOBWO significantly outperformed the majority of the comparison algorithms. However, it did not quite come first in both of the tests.
It should be emphasized that the primary strength of MEMOBWO is demonstrated through the comprehensive path planning experiments in Section 4.5. The benchmark experiments serve as a supplementary reference to establish the general optimization capability of MEMOBWO, confirming its competitive performance across diverse problem types.

4.3. MEMOBWO Path Planning Results

Figure 7 showed the paths generated by the MEMOBWO algorithm in the simulated urban environment of different scenarios. The results showed that the selected paths had no collision with the obstacles in the environment, and successfully reached the target position.
The paths shown in Figure 7 were merely part of the numerous non-dominated solutions in the Pareto front obtained by MEMOBWO algorithm. Each non-dominated solution has relative superiority in particular objectives, depending on the application when solving practical problems, thus demonstrating the intelligent balance of the MEMOBWO’s preference between energy efficiency and risk avoidance.

4.4. Ablation Studies

Ablation studies were conducted to systematically verify the effectiveness of each specific strategies introduced in this paper. MEMOBWO1, MEMOBWO2 and MEMOBWO3 represent that the CQOBL, HAPU and MOTI strategies, respectively, are independently selected and used in the standard MOBWO algorithm.
Figure 8 illustrated the optimal paths generated by the algorithms above in the simulated environments. It can be observed that the paths generated by the standard MOBWO algorithm passes dangerously close to the obstacle boundary, resulting in a high safety cost. In contrast, the MEMOBWO path proactively adjusted its heading angle well before approaching the threats. It maintains a safe clearance distance while minimizing the total spatial length. Figure 9 showed the Pareto front generated by the algorithms. Figure 10 and Figure 11 reveals the convergence trend of objective functions and Hypervolume values during the iteration process of each algorithm, respectively. The MEMOBWO and MEMOBWO1 began with a significantly lower fitness value compared to standard MOBWO and other algorithms, verifying that the CQOBL strategy successfully distributes the initial population more evenly across the search space. Moreover, the MEMOBWO3 tended to have better outcome Hypervolume and objective functions values compared to other algorithms except the MEMOBWO algorithm. This proved the specific effects of the MOTI strategy. Although the MEMOBWO2 lacked advantages in the initialization and outcome phase, its non-dominated solution set still performed better diversity and homogeneity, which was shown in Figure 9. Table 5 detailed the optimal solutions of objective values in each scenario.
The ablation experiments confirmed that the MEMOBWO has advantages over the standard MOBWO, as well as the single strategy versions of itself.

4.5. Comparison with Other Optimization Algorithms

The operating results of MEMOBWO are compared with other classical swarm intelligence algorithms. The comparison algorithms include MORBMO algorithm, standard MOBWO algorithm, MOHHO algorithm, NSGA-II algorithm and NMOPSO algorithm. To ensure the fairness and reproducibility of the comparative experiments, all algorithms were executed under the same population size, maximum number of iterations, archive size, and number of independent runs. The parameter settings were shown in Table 6.

4.5.1. Path Constraints Violation Analysis

To quantitatively examine whether the obtained Pareto paths satisfy the kinematic constraints, all non-dominated solutions stored in the Pareto archive were evaluated before and after the order-preserving Bézier smoothing procedure. A path was regarded as constraint-violating if it violated at least one of the heading angle, climbing angle change or curvature constraints. The constraint violation rate (CVR) was calculated as:
C V R = N vio N arc × 100 %
where N arc denotes the number of non-dominated paths in the final archive, and N vio denotes the number of archived paths that violate at least one constraint. In addition, a normalized constraint violation severity (CVS), defined as the sum of the normalized excess values of the four considered constraints, was used to measure the degree of violation.
For the final Pareto archive containing 50 non-dominated paths, all raw discrete waypoint paths violated at least one constraint before smoothing, resulting in an C V R pre of 100.00%. The violations were mainly caused by excessive heading-angle changes between adjacent waypoint segments, indicating that the raw waypoint sequences may still contain abrupt local directional changes. After Bézier smoothing, the constraint violation rate decreased to 4%. The mean CVS was reduced significantly after smoothing, while the maximum CVS reduced into the reasonable range.
A further inspection of individual constraint types shows that all 50 pre-smoothed archive paths violated the heading-change constraint, whereas only 2 smoothed paths still exhibited minor heading-change violations. No climbing angle, climbing-angle change or curvature violations were observed after smoothing. The average smoothing time was 7.9 × 10 5 s per archived path. Therefore, the adopted constraint-aware evaluation and Bézier smoothing procedure substantially reduces local geometric constraint violations with negligible additional computational cost.

4.5.2. Path Planning Results in the 2D Test Scenario

To further evaluate the general applicability of MEMOBWO across different planning complexities, supplementary experiments were conducted in 2D environments. Accordingly, the objective functions were reduced to three dimensions: path spatial length ( f 1 , considering only X and Y coordinates), path smoothness ( f 3 , considering only horizontal turning angles), and path threat degree ( f 4 , keeping the same logic). The path vertical maneuver cost objective function ( f 2 ) was omitted as altitude remains constant. All comparison algorithms were tested under identical parameter settings of population size N = 100 and maximum iterations T max = 500. Figure 12 illustrates the planned paths generated by each algorithm in this 2D environment. It can be observed that MEMOBWO successfully navigated through the obstacle-dense region while maintaining a smooth trajectory, achieving a balanced compromise between the length and safety of paths.
Table 7 presents the quantitative comparison of the objective values and the weighted-sum score. As shown, MEMOBWO achieved the lowest overall weighted sum, outperforming all comparison algorithms. Specifically, MEMOBWO demonstrated notable advantages in path smoothness and safety, while maintaining competitive path length. In contrast, algorithms such as MOHHO and MOBWO obtained significantly higher weighted-sum scores, primarily due to their inferior performance in safety or smoothness dimensions.

4.5.3. Path Planning Results in 3D Scenarios

In the comparison experiments, the parameter can be adjusted according to the emphasis of the specific problem on different objective functions to further verify the performance of MEMOBWO algorithm.
Figure 13 illustrated that all the algorithms can generate complete non-collision paths. Figure 14 and Figure 15 showed the final Pareto non-dominated solution set and the specific optimization trend on each objective function obtained by each experiment algorithm, respectively. Figure 16 revealed the normalized objective functions values of the comparison algorithms. During the iteration process, while algorithms like NMOPSO achieve competitive results in path vertical maneuver cost, they still showed a sharp spike in the path spatial length, indicating the fact that they easily sacrifice a shorter distance to maintain a lower altitude. In contrast, MEMOBWO algorithm maintained a balanced profile with no extreme spikes, suggesting a more safe compromise solution. The MEMOBWO algorithm outperformed other algorithms in objectives of path spatial length and path threat degree. Algorithms like NMOPSO and NSGA-II are relatively good at the results of vertical maneuver cost. However, by flying in a low altitude, they are forced to take longer routes or make sharp turns to avoid building obstacles, failing significantly in the path spatial length or path threat degree. Contrarily, MEMOBWO algorithm accepted a slightly higher altitude in exchange for a better global compromise, which meant a series of straighter, smoother, and safer paths.
According to data revealed on Table 8, the superior performance of MEMOBWO can be attributed to the adaptive exploration-exploitation balance mechanism, which enables better diversity preservation in the early stage and faster convergence in the later stage.

4.5.4. Hypervolume Metric Comparisons

In addition to the comparisons on the ability to deal with the path planning problems, this paper also set up the Hypervolume metric test functions to test the performance of the proposed MEMOBWO algorithm and compare it with other classical optimization algorithms. Figure 17 detailed the iterative convergence process of the Hypervolume metric of all the algorithms. It can be seen that although other algorithms converged rapidly in the initial phase, they plateau prematurely. The distinct characteristics of the MEMOBWO curve were the staircase-like rise in the iteration process and the slight oscillation during the initial iterations. Unlike the smooth, rapid saturation seen in the MORBMO curve, the MEMOBWO curve exhibited a step-wise improvement pattern and actively maintained population diversity to avoid the premature convergence observed in the MORBMO and NMOPSO curves. The MEMOBWO algorithm flattened out later than others but at a higher level, demonstrating that the adaptive exploration-exploitation balance allowed it to refine the solution set when others have stagnated.
The innovative strategies of the MEMOBWO required it to constantly jump out of the optimal solution, so it did not have advantages over other comparison algorithms in HV convergence speed, but exceeds all other experiment functions in the final score. As the algorithm progressed, the exploitation ability is strengthened, leading to a more stable final HV value. Minor fluctuations in the later stage indicated continuous diversity maintenance rather than premature convergence. This phenomenon can be attributed to the HAPU strategy. Moreover, MEMOBWO experienced a sudden jump in performance in the particular stage, allowing the algorithm to explore more promising regions of the search space while other algorithms stagnate in local optima. Ultimately, this led to a final Hypervolume value that is 15.24% to 30.86% higher than that of the competitors.

4.5.5. Statistical Stability Experiments

To comprehensively evaluate the statistical stability of each algorithm under random initialization, 30 independent runs and the mean and standard deviation of each objective function were carried out for all the comparison algorithms. The results were shown in Table 9, where the mean value represented the optimization ability of the algorithm, and the standard deviation indicated the degree of dispersion of the results. From the statistical perspective, MEMOBWO algorithm showed smaller standard deviation. Although algorithms like NSGA-II had similar performance with MEMOBWO in individual experiments, they were still not dominant in comprehensive statistical evaluation.
The Friedman test was applied to assess the statistical significance of performance differences among the comparison algorithms. The test statistic χ F 2 approximately followed a χ 2 distribution with 6 degrees of freedom, and the significance level was set to α = 0.05 . Average ranks are presented in Table 10. The results indicated that MEMOBWO achieved the lowest rank, thus having the best performance.
To further investigate the performance differences, the Wilcoxon rank sum test was conducted on the HV metric obtained from 30 independent runs across all test scenarios. The results are presented in Table 11. As shown in the table, MEMOBWO achieves statistically significant improvements over the majority of comparison algorithms. The only exception is the comparison with MORBMO, where no statistically significant difference is observed. This can be attributed to the fact that both MEMOBWO and MORBMO employ adaptive balance mechanisms between exploration and exploitation, leading to comparable solution diversity in terms of the comprehensive HV metric.

4.5.6. Parameters Sensitivity Analysis

To examine the stability of MEMOBWO against parameter variations, a sensitivity analysis was conducted on five ZDT benchmark functions, including ZDT1, ZDT2, ZDT3, ZDT4, and ZDT6. Three representative parameters were considered: the nonlinear decay exponent α in HAPU, the learning coefficient φ in MOTI, and the disturbance coefficient γ in MOTI. For each parameter setting, 30 independent runs were performed, while the remaining parameters were fixed to their default values.
For α { 0.5 , 1.0 , 1.5 , 2.0 , 2.5 } , the average HV and IGD values over the ZDT functions remained almost unchanged. The average HV stayed around 0.3589, and the average IGD stayed around 0.0207. This indicates that MEMOBWO is relatively insensitive to moderate variations of the nonlinear decay exponent under the current benchmark setting.
For the MOTI-related parameters, φ { 0.1 , 0.2 , 0.3 , 0.4 , 0.5 } and γ { 0 , 0.05 , 0.1 , 0.2 , 0.5 } were further tested. The results show that these two parameters have moderate effects on the search behavior, but no severe performance degradation was observed within the tested ranges. When φ varied from 0.1 to 0.5, the average HV remained within a narrow range. A similar pattern was observed when γ varied from 0 to 0.5. These results suggest that the MOTI strategy maintains stable performance under moderate variations of its learning and disturbance coefficients.
The Piecewise-map parameter P in CQOBL and the auxiliary constants in HAPU, such as B 0 , μ , κ , and b, were fixed according to the implemented algorithm configuration. Since CQOBL mainly affects initialization and these constants serve as auxiliary control factors, this study focuses on the representative parameters that directly affect the exploration–exploitation transition and the objective-specific refinement process. Overall, the default settings α = 1.5 , φ = 0.3 , and γ = 0.1 are adopted as a stable and balanced configuration rather than a universally optimal parameter combination.

5. Conclusions

This paper proposed a novel improved algorithm called MEMOBWO, which contains a series of optimization strategies to improve its performance on the problem of the UAV path planning in urban environment. The subsequent simulation results show that MEMOBWO can effectively improve the comprehensive performance of UAV path planning algorithm while maintaining stable performance across independent runs. It can play a certain auxiliary role in UAV flight control and trajectory tracking.
However, the current MEMOBWO algorithm still has room for further improvement in terms of solution accuracy and optimization speed. In future research, more sophisticated decision-making models will be explored to solve the existing and potential problems. Exploration of hybrid frameworks, such as integrating deep learning with meta-heuristic optimization, could be a promising research direction. Combination of these complementary paradigms could sufficiently leverage the strengths of both approaches, thereby accelerating the convergence speed of population-based algorithms like MEMOBWO.
Moreover, while the experimental results demonstrate the effectiveness of MEMOBWO in solving UAV path planning problems in complex urban environments, we acknowledge that the testing scope has certain limitations that the experimental validation in this study is limited to simulated urban environments with basic geometric primitives like cubes, cylinders and hemispheres. To further strengthen the practical relevance of the results, future work should validate the proposed MEMOBWO on more realistic 3D city models based on real-world environments, or directly tested in real-world experiment fields. Such validation would provide a more comprehensive assessment of the algorithm’s strengths and limitations under realistic deployment conditions.

Author Contributions

E.Z.: Conceptualization, supervision, funding acquisition, validation, writing–review and editing. S.W.: Methodology, software, formal analysis, data reduction, visualization, writing—original draft. Y.Y.: Methodology, validation, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China under Grant 62203381.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Kontogiannis, S.G.; Ekaterinaris, J.A. Design, performance evaluation and optimization of a UAV. Aerosp. Sci. Technol. 2013, 29, 339–350. [Google Scholar] [CrossRef]
  2. Mohsan, S.A.H.; Khan, M.A.; Noor, F.; Ullah, I.; Alsharif, M.H. Towards the Unmanned Aerial Vehicles (UAVs): A Comprehensive Review. Drones 2022, 6, 147. [Google Scholar] [CrossRef]
  3. Alexan, W.; Aly, L.; Korayem, Y.; Gabr, M.; El-Damak, D.; Fathy, A.; Mansour, H.A.A. Secure Communication of Military Reconnaissance Images Over UAV-Assisted Relay Networks. IEEE Access 2024, 12, 78589–78610. [Google Scholar] [CrossRef]
  4. Guo, J.; Liu, X.; Bi, L.; Liu, H.; Lou, H. UN-YOLOv5s: A UAV-Based Aerial Photography Detection Algorithm. Sensors 2023, 23, 5907. [Google Scholar] [CrossRef]
  5. Dhote, J.; Limbourg, S. Designing unmanned aerial vehicle networks for biological material transportation. Comput. Ind. Eng. 2020, 148, 106652. [Google Scholar] [CrossRef]
  6. Chen, H.; Lan, Y.; Fritz, B.K.; Hoffmann, W.C.; Liu, S. Review of agricultural spraying technologies for plant protection using unmanned aerial vehicle (UAV). Int. J. Agric. Biol. Eng. 2021, 14, 38–49. [Google Scholar] [CrossRef]
  7. Mondal, A.; Mishra, D.; Prasad, G.; Hossain, A. Joint Optimization Framework for Minimization of Device Energy Consumption in Transmission Rate Constrained UAV-Assisted IoT Network. IEEE Internet Things J. 2022, 9, 9591–9607. [Google Scholar] [CrossRef]
  8. Aggarwal, S.; Kumar, N. Path planning techniques for unmanned aerial vehicles: A review, solutions, and challenges. Comput. Commun. 2020, 149, 270–299. [Google Scholar] [CrossRef]
  9. Semsch, E.; Jakob, M.; Pavlicek, D.; Pechoucek, M. Autonomous UAV Surveillance in Complex Urban Environments. In Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, Milan, Italy, 15–18 September 2009; Volume 2, pp. 82–85. [Google Scholar]
  10. Marler, R.T.; Arora, J.S. The weighted sum method for multi-objective optimization: New insights. Struct. Multidiscip. Optim. 2010, 41, 853–862. [Google Scholar] [CrossRef]
  11. Pereira, J.L.J.; Oliver, G.A.; Francisco, M.B.; Cunha, S.S., Jr.; Gomes, G.F. A review of multi-objective optimization: Methods and algorithms in mechanical engineering problems. Arch. Comput. Methods Eng. 2022, 29, 2285–2308. [Google Scholar] [CrossRef]
  12. Zhou, L.; Leng, S.; Xiong, Z.; Niyato, D.; Han, Z.; Quek, T.Q.S. Cooperative Digital Twin-Enhanced UAV Topology Optimization for Multi-Target Tracking. IEEE Trans. Commun. 2025, 73, 15926–15941. [Google Scholar] [CrossRef]
  13. Zhou, L.; Leng, S.; Liu, Y.; Xiong, Z.; Quek, T.Q.S. Digital Twins for Low-Altitude UAV Networks–Cooperation and Learning. IEEE Trans. Mob. Comput. 2026, 25, 4839–4856. [Google Scholar] [CrossRef]
  14. Yang, L.; Qi, J.; Xiao, J.; Yong, X. A literature review of UAV 3D path planning. In Proceedings of the 11th World Congress on Intelligent Control and Automation, Shenyang, China, 29 June–4 July 2014; pp. 2376–2381. [Google Scholar]
  15. Liang, X.; Meng, G.; Xu, Y.; Luo, H. A geometrical path planning method for unmanned aerial vehicle in 2D/3D complex environment. Intell. Serv. Robot. 2018, 11, 301–312. [Google Scholar] [CrossRef]
  16. Hao, G.; Lv, Q.; Huang, Z.; Zhao, H.; Chen, W. UAV Path Planning Based on Improved Artificial Potential Field Method. Aerospace 2023, 10, 562. [Google Scholar] [CrossRef]
  17. Shi, Y.; Li, Q.; Bu, S.; Yang, J.; Zhu, L. Research on Intelligent Vehicle Path Planning Based on Rapidly-Exploring Random Tree. Math. Probl. Eng. 2020, 2020, 5910503. [Google Scholar] [CrossRef]
  18. Erke, S.; Bin, D.; Yiming, N.; Qi, Z.; Liang, X.; Dawei, Z. An improved A-Star based path planning algorithm for autonomous land vehicles. Int. J. Adv. Robot. Syst. 2020, 17, 1729881420962263. [Google Scholar] [CrossRef]
  19. Wang, H.; Yu, Y.; Yuan, Q. Application of Dijkstra algorithm in robot path-planning. In Proceedings of the 2011 Second International Conference on Mechanic Automation and Control Engineering, Inner Mongolia, China, 15–17 July 2011; pp. 1067–1069. [Google Scholar]
  20. Abdel-Basset, M.; Abdel-Fatah, L.; Sangaiah, A.K. Chapter 10 - Metaheuristic Algorithms: A Comprehensive Review. In Computational Intelligence for Multimedia Big Data on the Cloud with Engineering Applications; Sangaiah, A.K., Sheng, M., Zhang, Z., Eds.; Intelligent Data-Centric Systems; Academic Press: Cambridge, MA, USA, 2018; pp. 185–231. [Google Scholar]
  21. Janga Reddy, M.; Nagesh Kumar, D. Evolutionary algorithms, swarm intelligence methods, and their applications in water resources engineering: A state-of-the-art review. H2Open J. 2020, 3, 135–188. [Google Scholar] [CrossRef]
  22. Konak, A.; Coit, D.W.; Smith, A.E. Multi-objective optimization using genetic algorithms: A tutorial. Reliab. Eng. Syst. Saf. 2006, 91, 992–1007. [Google Scholar] [CrossRef]
  23. Lamini, C.; Benhlima, S.; Elbekri, A. Genetic Algorithm Based Approach for Autonomous Mobile Robot Path Planning. Procedia Comput. Sci. 2018, 127, 180–189. [Google Scholar] [CrossRef]
  24. Shin, J.J.; Bang, H. UAV Path Planning under Dynamic Threats Using an Improved PSO Algorithm. Int. J. Aerosp. Eng. 2020, 2020, 8820284. [Google Scholar] [CrossRef]
  25. Behnck, L.P.; Doering, D.; Pereira, C.E.; Rettberg, A. A Modified Simulated Annealing Algorithm for SUAVs Path Planning. IFAC-PapersOnLine 2015, 48, 63–68. [Google Scholar] [CrossRef]
  26. Konatowski, S.; Pawłowski, P. Ant colony optimization algorithm for UAV path planning. In Proceedings of the 2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET), Lviv–Slavske, Ukraine, 20–24 February 2018; pp. 177–182. [Google Scholar]
  27. Gasparetto, A.; Boscariol, P.; Lanzutti, A.; Vidoni, R. Path Planning and Trajectory Planning Algorithms: A General Overview. In Motion and Operation Planning of Robotic Systems: Background and Practical Approaches; Carbone, G., Gomez-Bravo, F., Eds.; Springer International Publishing: Cham, Switzerland, 2015; pp. 3–27. [Google Scholar]
  28. Zhong, C.; Li, G.; Meng, Z. Beluga whale optimization: A novel nature-inspired metaheuristic algorithm. Knowl.-Based Syst. 2022, 251, 109215. [Google Scholar] [CrossRef]
  29. Huang, J.; Hu, H. Hybrid beluga whale optimization algorithm with multi-strategy for functions and engineering optimization problems. J. Big Data 2024, 11, 3. [Google Scholar] [CrossRef]
  30. Wang, Z.; Wang, T.; Niu, Q.; Wu, J.; Li, M.; Zhu, S. Research on multiobjective capacity configuration optimization of grid-connected wind–solar–storage microgrid system based on improved BWO algorithm. Energy Sci. Eng. 2024, 12, 2365–2384. [Google Scholar] [CrossRef]
  31. Zhang, P.; Liu, X.; Yi, Z.; He, Q. Improved Multi-Objective Beluga Whale Optimization Algorithm for Truck Scheduling in Open-Pit Mines. Sustainability 2024, 16, 6939. [Google Scholar] [CrossRef]
  32. Horng, S.C.; Lin, S.S. Improved Beluga Whale Optimization for Solving the Simulation Optimization Problems with Stochastic Constraints. Mathematics 2023, 11, 1854. [Google Scholar] [CrossRef]
  33. Fan, Z.; Xiao, Z.; Li, X.; Huang, Z.; Zhang, C. MSBWO: A Multi-Strategies Improved Beluga Whale Optimization Algorithm for Feature Selection. Biomimetics 2024, 9, 572. [Google Scholar] [CrossRef]
  34. Jia, H.; Wen, Q.; Wu, D.; Wang, Z.; Wang, Y.; Wen, C.; Abualigah, L. Modified beluga whale optimization with multi-strategies for solving engineering problems. J. Comput. Des. Eng. 2023, 10, 2065–2093. [Google Scholar] [CrossRef]
  35. Punia, P.; Raj, A.; Kumar, P. An enhanced beluga whale optimization algorithm for engineering optimization problems. J. Syst. Sci. Syst. Eng. 2024, 1–38. [Google Scholar] [CrossRef]
  36. Ouyang, K.; Wei, D.; Fu, S.; Gu, S.; Sha, X.; Yu, J.; Yu, J.; Heidar, A.A.; Cai, Z.; Chen, H. Multi-objective red-billed blue magpie optimizer: A novel algorithm for multi-objective UAV path planning. Results Eng. 2025, 27, 106785. [Google Scholar] [CrossRef]
  37. Duong, T.T.N.; Bui, D.N.; Phung, M.D. Navigation variable-based multi-objective particle swarm optimization for UAV path planning with kinematic constraints. Neural Comput. Appl. 2025, 37, 5683–5697. [Google Scholar] [CrossRef]
  38. Mahdavi, S.; Rahnamayan, S.; Deb, K. Opposition based learning: A literature review. Swarm Evol. Comput. 2018, 39, 1–23. [Google Scholar] [CrossRef]
  39. Lenin, K. Quasi Opposition-Based Quantum Pieris Rapae and Parametric Curve Search Optimization for Real Power Loss Reduction and Stability Enhancement. IEEE Trans. Ind. Appl. 2023, 59, 3077–3085. [Google Scholar] [CrossRef]
  40. Jia, H.; Zhou, X.; Zhang, J. Thinking Innovation Strategy (TIS): A novel mechanism for metaheuristic algorithm design and evolutionary update. Appl. Soft Comput. 2025, 175, 113071. [Google Scholar] [CrossRef]
  41. Sheldon, M.R.; Fillyaw, M.J.; Thompson, W.D. The use and interpretation of the Friedman test in the analysis of ordinal-scale data in repeated measures designs. Physiother. Res. Int. 1996, 1, 221–228. [Google Scholar] [CrossRef] [PubMed]
  42. Rosner, B.; Glynn, R.J.; Ting Lee, M.L. Incorporation of Clustering Effects for the Wilcoxon Rank Sum Test: A Large-Sample Approach. Biometrics 2003, 59, 1089–1098. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Illustration of the vertical maneuver cost.
Figure 1. Illustration of the vertical maneuver cost.
Actuators 15 00314 g001
Figure 2. Illustration of the EMI interference zone.
Figure 2. Illustration of the EMI interference zone.
Actuators 15 00314 g002
Figure 3. Illustration of CQOBL strategy.
Figure 3. Illustration of CQOBL strategy.
Actuators 15 00314 g003
Figure 4. Illustration of MOTI strategy.
Figure 4. Illustration of MOTI strategy.
Actuators 15 00314 g004
Figure 5. Implementation flowchart of MEMOBWO.
Figure 5. Implementation flowchart of MEMOBWO.
Actuators 15 00314 g005
Figure 6. System model illustration of MEMOBWO.
Figure 6. System model illustration of MEMOBWO.
Actuators 15 00314 g006
Figure 7. Optimal paths generated by the MEMOBWO.
Figure 7. Optimal paths generated by the MEMOBWO.
Actuators 15 00314 g007
Figure 8. Optimal paths generated by the ablation studies of MEMOBWO.
Figure 8. Optimal paths generated by the ablation studies of MEMOBWO.
Actuators 15 00314 g008
Figure 9. Results of Pareto solutions generated by the ablation versions of MEMOBWO.
Figure 9. Results of Pareto solutions generated by the ablation versions of MEMOBWO.
Actuators 15 00314 g009
Figure 10. Objective functions convergence comparison of the ablation studies of MEMOBWO.
Figure 10. Objective functions convergence comparison of the ablation studies of MEMOBWO.
Actuators 15 00314 g010
Figure 11. Hypervolume convergence comparison of the ablation studies of MEMOBWO.
Figure 11. Hypervolume convergence comparison of the ablation studies of MEMOBWO.
Actuators 15 00314 g011
Figure 12. Path planning results of comparison algorithms in the 2D test scenario.
Figure 12. Path planning results of comparison algorithms in the 2D test scenario.
Actuators 15 00314 g012
Figure 13. Optimal paths generated by the comparison algorithms in 3D environment scenarios 1, 2 and 3.
Figure 13. Optimal paths generated by the comparison algorithms in 3D environment scenarios 1, 2 and 3.
Actuators 15 00314 g013
Figure 14. Pareto front solutions of the comparison algorithms in 3D environment scenarios 1, 2 and 3. The objective functions are divided into 2 groups to show Pareto results due to axis limitation.
Figure 14. Pareto front solutions of the comparison algorithms in 3D environment scenarios 1, 2 and 3. The objective functions are divided into 2 groups to show Pareto results due to axis limitation.
Actuators 15 00314 g014
Figure 15. Value convergence curves of each objective function by comparison algorithms in 3D environment scenarios 1, 2 and 3.
Figure 15. Value convergence curves of each objective function by comparison algorithms in 3D environment scenarios 1, 2 and 3.
Actuators 15 00314 g015
Figure 16. Normalized objective values of comparison algorithms in 3D environment scenarios 1, 2 and 3.
Figure 16. Normalized objective values of comparison algorithms in 3D environment scenarios 1, 2 and 3.
Actuators 15 00314 g016
Figure 17. HV values and variation trend of comparison algorithms in 3D environment scenarios 1, 2 and 3.
Figure 17. HV values and variation trend of comparison algorithms in 3D environment scenarios 1, 2 and 3.
Actuators 15 00314 g017
Table 1. Statistical results for the IGD metric on ZDT, DTLZ and WFG benchmark functions generated by the comparison algorithms.
Table 1. Statistical results for the IGD metric on ZDT, DTLZ and WFG benchmark functions generated by the comparison algorithms.
FunctionIndexAlgorithms
NSGA2MEMOBWOMOHHONMOPSOMOBWOMORBMO
ZDT1Mean5.68 × 10−31.93 × 10−32.98 × 10−27.91 × 10−22.01 × 10−32.07 × 10−3
Std8.19 × 10−31.31 × 10−41.81 × 10−25.24 × 10−21.42 × 10−41.74 × 10−3
ZDT2Mean1.79 × 10−11.92 × 10−34.50 × 10−22.87 × 10−11.93 × 10−31.14 × 10−1
Std1.02 × 10−18.37 × 10−57.06 × 10−24.67 × 10−21.17 × 10−41.21 × 10−1
ZDT3Mean1.14 × 10−21.43 × 10−32.64 × 10−27.48 × 10−21.46 × 10−31.59 × 10−3
Std1.15 × 10−26.03 × 10−51.88 × 10−25.82 × 10−29.50 × 10−59.29 × 10−5
ZDT4Mean2.13 × 1001.91 × 1001.91 × 1002.55 × 1001.50 × 1001.18 × 100
Std4.52 × 10−164.47 × 10−34.52 × 10−168.75 × 10−16.41 × 10−17.84 × 10−1
ZDT6Mean1.34 × 10−21.45 × 10−34.64 × 10−27.78 × 10−11.51 × 10−37.64 × 10−1
Std3.82 × 10−27.57 × 10−55.55 × 10−24.89 × 10−18.94 × 10−51.61 × 10−1
DTLZ1Mean5.79 × 10−11.89 × 10−22.20 × 1007.58 × 1003.75 × 1005.17 × 100
Std7.26 × 10−16.68 × 10−41.46 × 1004.83 × 1001.02 × 1001.04 × 100
DTLZ2Mean5.15 × 10−24.33 × 10−26.09 × 10−21.22 × 10−14.77 × 10−25.47 × 10−2
Std2.50 × 10−31.51 × 10−37.47 × 10−34.89 × 10−23.79 × 10−34.32 × 10−3
DTLZ4Mean5.34 × 10−26.48 × 10−29.00 × 10−25.69 × 10−15.02 × 10−21.37 × 10−1
Std1.67 × 10−21.18 × 10−16.57 × 10−21.11 × 10−15.35 × 10−36.04 × 10−2
DTLZ5Mean4.72 × 10−33.23 × 10−38.67 × 10−34.28 × 10−24.49 × 10−31.07 × 10−2
Std9.09 × 10−41.84 × 10−45.78 × 10−32.60 × 10−26.92 × 10−42.46 × 10−3
DTLZ6Mean3.11 × 10−33.01 × 10−33.46 × 10−24.22 × 10−13.10 × 10−36.46 × 10−1
Std1.43 × 10−41.16 × 10−43.07 × 10−24.22 × 10−11.47 × 10−41.61 × 10−1
DTLZ7Mean1.91 × 10−11.73 × 10−12.49 × 10−11.29 × 10−14.08 × 10−25.31 × 10−2
Std1.37 × 10−11.27 × 10−16.68 × 10−21.29 × 10−11.07 × 10−23.25 × 10−2
WFG1Mean1.75 × 10−13.73 × 10−21.83 × 10−21.33 × 10−14.46 × 10−12.35 × 10−1
Std3.69 × 10−23.03 × 10−31.89 × 10−25.66 × 10−23.94 × 10−41.05 × 10−2
WFG2Mean1.09 × 10−23.30 × 10−32.74 × 10−23.28 × 10−23.92 × 10−31.37 × 10−2
Std2.14 × 10−32.17 × 10−49.46 × 10−31.12 × 10−22.96 × 10−49.91 × 10−4
WFG3Mean1.30 × 10−11.25 × 10−11.41 × 10−11.29 × 10−11.25 × 10−11.34 × 10−1
Std3.03 × 10−34.95 × 10−56.72 × 10−38.92 × 10−32.22 × 10−41.13 × 10−3
WFG4Mean1.64 × 10−25.07 × 10−32.21 × 10−21.85 × 10−29.68 × 10−32.20 × 10−2
Std2.97 × 10−32.89 × 10−44.56 × 10−34.81 × 10−31.35 × 10−31.59 × 10−3
WFG5Mean2.43 × 10−22.23 × 10−22.63 × 10−23.36 × 10−22.32 × 10−22.59 × 10−2
Std1.57 × 10−38.63 × 10−52.69 × 10−31.42 × 10−21.47 × 10−31.65 × 10−3
WFG6Mean2.42 × 10−21.92 × 10−26.77 × 10−26.03 × 10−29.53 × 10−21.95 × 10−2
Std2.11 × 10−28.27 × 10−33.78 × 10−22.86 × 10−29.10 × 10−42.52 × 10−3
WFG7Mean1.37 × 10−25.16 × 10−32.71 × 10−24.28 × 10−25.32 × 10−31.71 × 10−2
Std4.00 × 10−32.42 × 10−48.23 × 10−32.66 × 10−22.92 × 10−41.97 × 10−3
WFG8Mean5.68 × 10−24.37 × 10−21.52 × 10−22.16 × 10−25.02 × 10−26.28 × 10−2
Std3.81 × 10−37.95 × 10−43.30 × 10−32.16 × 10−28.03 × 10−43.01 × 10−3
WFG9Mean6.68 × 10−35.93 × 10−32.62 × 10−23.98 × 10−27.27 × 10−31.67 × 10−2
Std1.62 × 10−35.35 × 10−42.30 × 10−34.07 × 10−23.43 × 10−42.32 × 10−3
Table 2. Statistical results for the HV metric on ZDT, DTLZ and WFG benchmark functions.
Table 2. Statistical results for the HV metric on ZDT, DTLZ and WFG benchmark functions.
FunctionIndexAlgorithms
NSGA2MEMOBWOMOHHONMOPSOMOBWOMORBMO
ZDT1Mean4.18 × 10−14.21 × 10−12.95 × 10−13.58 × 10−14.21 × 10−14.21 × 10−1
Std8.83 × 10−34.21 × 10−36.91 × 10−23.77 × 10−24.79 × 10−27.74 × 10−3
ZDT2Mean4.41 × 10−13.86 × 10−13.72 × 10−13.05 × 10−13.86 × 10−13.86 × 10−1
Std1.08 × 10−23.87 × 10−25.45 × 10−20.00 × 1003.45 × 10−23.87 × 10−2
ZDT3Mean3.71 × 10−13.77 × 10−12.76 × 10−12.98 × 10−13.80 × 10−13.80 × 10−1
Std2.13 × 10−27.66 × 10−22.82 × 10−18.24 × 10−21.83 × 10−21.83 × 10−2
ZDT4Mean8.05 × 10−21.95 × 10−41.69 × 10−31.52 × 10−41.52 × 10−41.52 × 10−4
Std9.01 × 10−28.73 × 10−40.00 × 1008.73 × 10−48.73 × 10−48.73 × 10−4
ZDT6Mean2.88 × 10−12.71 × 10−12.59 × 10−12.55 × 10−12.59 × 10−12.59 × 10−1
Std5.74 × 10−22.71 × 10−28.04 × 10−23.43 × 10−28.04 × 10−28.04 × 10−2
DTLZ1Mean5.59 × 10−19.96 × 10−12.38 × 10−30.00 × 1000.00 × 1000.00 × 100
Std2.80 × 10−27.94 × 10−33.52 × 10−30.00 × 1000.00 × 1000.00 × 100
DTLZ2Mean9.16 × 10−19.17 × 10−19.13 × 10−18.81 × 10−19.18 × 10−19.18 × 10−1
Std1.16 × 10−21.77 × 10−21.29 × 10−21.77 × 10−23.11 × 10−23.11 × 10−2
DTLZ4Mean9.20 × 10−19.17 × 10−19.23 × 10−13.24 × 10−19.17 × 10−19.17 × 10−1
Std1.99 × 10−23.29 × 10−21.03 × 10−29.59 × 10−13.29 × 10−23.29 × 10−2
DTLZ5Mean9.20 × 10−18.88 × 10−18.86 × 10−18.70 × 10−18.87 × 10−18.82 × 10−1
Std2.10 × 10−28.88 × 10−11.15 × 10−22.95 × 10−28.88 × 10−18.88 × 10−1
DTLZ6Mean8.76 × 10−18.58 × 10−18.88 × 10−15.53 × 10−18.58 × 10−18.69 × 10−1
Std2.10 × 10−28.25 × 10−11.15 × 10−22.95 × 10−28.37 × 10−18.88 × 10−1
DTLZ7Mean2.74 × 10−12.24 × 10−12.18 × 10−11.07 × 10−12.24 × 10−12.24 × 10−1
Std5.75 × 10−22.24 × 10−12.18 × 10−11.07 × 10−12.24 × 10−12.24 × 10−1
WFG1Mean6.50 × 10−16.37 × 10−15.86 × 10−17.05 × 10−16.37 × 10−16.37 × 10−1
Std8.79 × 10−26.37 × 10−15.86 × 10−17.05 × 10−16.37 × 10−16.37 × 10−1
WFG2Mean5.41 × 10−15.46 × 10−15.47 × 10−17.42 × 10−15.46 × 10−15.46 × 10−1
Std9.61 × 10−35.46 × 10−15.47 × 10−17.42 × 10−15.46 × 10−15.46 × 10−1
WFG3Mean5.76 × 10−15.77 × 10−15.83 × 10−17.05 × 10−15.77 × 10−15.77 × 10−1
Std2.99 × 10−35.77 × 10−15.83 × 10−17.05 × 10−15.77 × 10−15.77 × 10−1
WFG4Mean2.54 × 10−12.64 × 10−12.45 × 10−12.51 × 10−12.64 × 10−12.64 × 10−1
Std4.10 × 10−32.64 × 10−12.45 × 10−12.51 × 10−12.64 × 10−12.64 × 10−1
WFG5Mean2.56 × 10−12.56 × 10−12.71 × 10−12.74 × 10−12.56 × 10−12.56 × 10−1
Std1.07 × 10−22.56 × 10−12.71 × 10−12.74 × 10−12.56 × 10−12.56 × 10−1
WFG6Mean2.67 × 10−12.57 × 10−12.45 × 10−12.24 × 10−12.57 × 10−12.57 × 10−1
Std1.75 × 10−22.57 × 10−12.45 × 10−12.24 × 10−12.57 × 10−12.57 × 10−1
WFG7Mean2.56 × 10−12.63 × 10−12.61 × 10−12.24 × 10−12.63 × 10−12.63 × 10−1
Std3.79 × 10−32.63 × 10−12.61 × 10−12.24 × 10−12.63 × 10−12.63 × 10−1
WFG8Mean2.46 × 10−12.62 × 10−12.63 × 10−12.51 × 10−12.62 × 10−12.62 × 10−1
Std4.82 × 10−32.62 × 10−12.63 × 10−12.51 × 10−12.62 × 10−12.62 × 10−1
WFG9Mean2.60 × 10−12.63 × 10−12.55 × 10−12.51 × 10−12.63 × 10−12.63 × 10−1
Std6.03 × 10−32.63 × 10−12.55 × 10−12.51 × 10−12.63 × 10−12.63 × 10−1
Table 3. Friedman Average Rank and Wilcoxon rank sum test comparisons of HV metric, where ≈ means no significant difference, + means better and + means significantly better, / means not applicable.
Table 3. Friedman Average Rank and Wilcoxon rank sum test comparisons of HV metric, where ≈ means no significant difference, + means better and + means significantly better, / means not applicable.
AlgorithmsAverage RankSignificance
NMOPSO2.42
MEMOBWO2.80/
MORBMO3.48+
MOHHO3.61+
MOBWO3.91 +
NSGA24.50 +
Table 4. Friedman Average Rank and Wilcoxon rank sum test comparisons of IGD metric, where ≈ means no significant difference, + means better and + means significantly better, / means not applicable.
Table 4. Friedman Average Rank and Wilcoxon rank sum test comparisons of IGD metric, where ≈ means no significant difference, + means better and + means significantly better, / means not applicable.
AlgorithmsAverage RankSignificance
MORBMO1.66
MEMOBWO2.41/
NMOPSO3.26+
NSGA24.10+
MOHHO4.28+
MOBWO5.20 +
Table 5. Path planning results of MEMOBWO and its ablation versions.
Table 5. Path planning results of MEMOBWO and its ablation versions.
Scenarios &
Objectives
Algorithms
MOBWOMEMOBWO1MEMOBWO2MEMOBWO3MEMOBWO
1 f 1 1233.881286.851214.831166.471171.81
f 2 15.7326.9224.5430.3419.10
f 3 2.01 × 10−33.35 × 10−32.16 × 10−32.01 × 10−32.01 × 10−3
f 4 1.30 × 10−31.68 × 10−31.82 × 10−31.68 × 10−31.68 × 10−3
2 f 1 1226.361207.371195.381178.361172.34
f 2 13.0521.9424.5915.848.96
f 3 7.03 × 10−34.26 × 10−32.54 × 10−33.01 × 10−32.42 × 10−3
f 4 1.71 × 10−31.68 × 10−31.71 × 10−31.69 × 10−31.68 × 10−3
3 f 1 1224.381190.601196.371215.621185.24
f 2 18.2322.3139.5921.849.95
f 3 9.40 × 10−33.84 × 10−33.73 × 10−34.68 × 10−34.25 × 10−3
f 4 1.72 × 10−31.80 × 10−31.68 × 10−31.76 × 10−31.68 × 10−3
Table 6. Parameter settings of the compared algorithms. The common experimental settings, including population size N, maximum iteration number T max , and archive size A, were kept identical for all algorithms in the same experiment and are therefore not repeatedly listed in the table. Here, D denotes the decision-variable dimension.
Table 6. Parameter settings of the compared algorithms. The common experimental settings, including population size N, maximum iteration number T max , and archive size A, were kept identical for all algorithms in the same experiment and are therefore not repeatedly listed in the table. Here, D denotes the decision-variable dimension.
AlgorithmsSpecific Settings
NSGA-II p c = 0.9 , p m = 1 / D , η c = 20 , η m = 20
NMOPSO w 0 = 1 , w damp = 0.98 , c 1 = 1.5 , c 2 = 1.5 , V max = 0.2 ( u b l b ) , μ = 0.5 , δ = 20
MOHHO E = 2 E 0 ( 1 t / T max ) , E 0 [ 1 ,   1 ] , Lévy flight parameter β = 1.5 , Lévy scale factor = 0.01
MORBMO n grid = 5 , C F = ( 1 t / T max ) 2 t / T max , subgroup sizes n 1 [ 2 ,   5 ] , n 2 [ 10 ,   N ]
MOBWO B f = 1 t / T max , W f = 0.1 0.05 ( t / T max )
MEMOBWO α = 1.5 , B 0 = 2 , μ = 0.1 , κ = 3 , b = 1 , φ = 0.3 , γ MOTI = 0.1 , P = 0.4
Table 7. Comparison of algorithms on path planning Objectives. The weighted sum is used to compare the overall performance of the paths generated by each algorithm.
Table 7. Comparison of algorithms on path planning Objectives. The weighted sum is used to compare the overall performance of the paths generated by each algorithm.
Algorithms f 1 (Length) f 3 (Smoothness) f 4 (Safety)Weighted Sum
MEMOBWO1231.670.00030.000.0109
MORBMO1229.030.00040.000.0277
NMOPSO1256.840.00030.070.4016
NSGA-II1260.320.00150.000.4102
MOHHO1364.700.00110.000.5564
MOBWO1247.900.00130.050.5720
Table 8. Path planning results of the comparison algorithms in 3D environment scenarios 1, 2 and 3.
Table 8. Path planning results of the comparison algorithms in 3D environment scenarios 1, 2 and 3.
Scenarios &
Objectives
Algorithms
MEMOBWOMORBMONMOPSOMOHHOMOBWONSGA2
1 f 1 1178.511231.681229.181217.951312.491198.41
f 2 8.8827.1313.5825.7211.4314.94
f 3 2.70 × 10−32.14 × 10−37.70 × 10−37.60 × 10−35.64 × 10−33.37 × 10−3
f 4 1.30 × 10−31.41 × 10−31.30 × 10−31.30 × 10−31.30 × 10−31.80 × 10−3
2 f 1 1169.421195.501179.621195.951185.561241.68
f 2 9.9724.2222.619.308.099.56
f 3 2.1 × 10−36.00 × 10−34.9 × 10−35.6 × 10−33.89 × 10−32.80 × 10−3
f 4 1.68 × 10−31.84 × 10−31.80 × 10−31.83 × 10−31.84 × 10−31.82 × 10−3
3 f 1 1172.001290.931277.151247.961232.621308.17
f 2 19.6021.7218.9325.1318.9814.91
f 3 3.27 × 10−35.11 × 10−37.15 × 10−37.50 × 10−37.24 × 10−36.06 × 10−3
f 4 1.68 × 10−32.07 × 10−31.68 × 10−31.83 × 10−31.82 × 10−31.68 × 10−3
Table 9. Statistical stability results of MEMOBWO, its ablation variants, and the comparison algorithms over 30 independent runs.
Table 9. Statistical stability results of MEMOBWO, its ablation variants, and the comparison algorithms over 30 independent runs.
AlgorithmsObjectivesAveStd
MEMOBWO f 1 1177.0812.56
f 2 7.452.79
f 3 4.13 × 10−35.91 × 10−4
f 4 1.68 × 10−3<1 × 10−12
MOBWO f 1 1241.0534.71
f 2 28.074.67
f 3 6.44 × 10−31.56 × 10−3
f 4 1.68 × 10−3<1 × 10−12
MEMOBWO1 f 1 1459.8420.08
f 2 20.836.27 × 10−3
f 3 9.12 × 10−31.75 × 10−3
f 4 1.73 × 10−3<1 × 10−12
MEMOBWO2 f 1 1193.0518.78
f 2 15.945.26
f 3 5.18 × 10−38.70 × 10−4
f 4 1.75 × 10−3<1 × 10−12
MEMOBWO3 f 1 1166.0430.92
f 2 28.482.7 × 10−3
f 3 8.75 × 10−32.22 × 10−3
f 4 1.76 × 10−39.25 × 10−4
MORBMO f 1 1200.7120.12
f 2 11.763.57
f 3 5.40 × 10−36.08 × 10−4
f 4 1.68 × 10−3<1 × 10−12
NMOPSO f 1 1185.9521.23
f 2 25.273.51
f 3 3.94 × 10−38.08 × 10−4
f 4 1.79 × 10−35.98 × 10−5
MOHHO f 1 1437.28594.86
f 2 38.0511.67
f 3 3.41 × 10−27.38 × 10−2
f 4 2.03 × 10−35.34 × 10−4
NSGA2 f 1 1191.5729.51
f 2 13.904.73
f 3 3.68 × 10−31.05 × 10−3
f 4 1.78 × 10−36.17 × 10−5
Table 10. Friedman test results of comparison algorithms for the HV metric, where the lower rank represents the better overall performance.
Table 10. Friedman test results of comparison algorithms for the HV metric, where the lower rank represents the better overall performance.
AlgorithmsPerformance
Average Rank
MEMOBWO1.14
MORBMO2.86
NMOPSO3.29
NSGA-II3.86
MOHHO4.71
MOBWO5.43
Table 11. Wilcoxon rank sum test results for the HV metric, where ≈ means no significant difference, + means better and +* means significantly better, / means not applicable.
Table 11. Wilcoxon rank sum test results for the HV metric, where ≈ means no significant difference, + means better and +* means significantly better, / means not applicable.
Pair (MEMOBWO vs. Algorithm)p-ValueSignificance
MEMOBWO//
MORBMO2.45 × 10−1
NMOPSO1.57 × 10−2+ *
MOBWO<1.00 × 10−3+ *
MOHHO6.40 × 10−3+
NSGA2<1.00 × 10−3+ *
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, E.; Wu, S.; Yi, Y. MEMOBWO: A Novel Multi-Objective Optimization Algorithm for UAV Path Planning in Complex Urban Environments. Actuators 2026, 15, 314. https://doi.org/10.3390/act15060314

AMA Style

Zhang E, Wu S, Yi Y. MEMOBWO: A Novel Multi-Objective Optimization Algorithm for UAV Path Planning in Complex Urban Environments. Actuators. 2026; 15(6):314. https://doi.org/10.3390/act15060314

Chicago/Turabian Style

Zhang, Enze, Sining Wu, and Yang Yi. 2026. "MEMOBWO: A Novel Multi-Objective Optimization Algorithm for UAV Path Planning in Complex Urban Environments" Actuators 15, no. 6: 314. https://doi.org/10.3390/act15060314

APA Style

Zhang, E., Wu, S., & Yi, Y. (2026). MEMOBWO: A Novel Multi-Objective Optimization Algorithm for UAV Path Planning in Complex Urban Environments. Actuators, 15(6), 314. https://doi.org/10.3390/act15060314

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop