1. Introduction
As a critical component of modern intelligent unmanned systems, UAVs are profoundly reshaping combat paradigms and technological frameworks in the military domain. In modern warfare, UAVs demonstrate remarkable advantages in key missions such as target reconnaissance [
1], battlefield surveillance [
2,
3], and formation control through their highly coordinated operations. This collaborative combat system transcends limitations of traditional fixed infrastructure, resolves challenges like communication latency [
4], and significantly enhances the timeliness of battlefield information acquisition and operational effectiveness [
5]. Particularly in complex and dynamic combat environments, UAVs have emerged as critical penetration forces against enemy defense systems due to their high mobility [
6,
7], strong survivability, and autonomous coordination capabilities [
8]. However, most existing studies assume uniform motion for path optimization, which poses major challenges in real-world variable-speed scenarios.
In various combat applications of UAVs, passive target localization technology holds unique tactical value [
9,
10]. Compared to active localization systems, passive localization systems achieve precise localization of emitter targets without exposing their own location by passively receiving electromagnetic signals from radiation sources [
11], playing a crucial role in electronic reconnaissance, anti-stealth operations, and battlefield situational awareness. Tua A. Tamba et al. developed a distributed Kalman filter-based cooperative tracking algorithm for multiple UAVs, which enhances target state estimation accuracy in maritime moving vessel monitoring scenarios. However, their constant-velocity motion model demonstrates limited adaptability to abruptly maneuvering targets [
12]. The DQN-ALNS framework proposed by Lingjie Zhou et al. significantly improves the solution efficiency of UAV path optimization through the collaborative optimization of deep reinforcement learning and adaptive large neighborhood search. However, its state space design based on the uniform speed assumption limits its application in dynamic environments [
13]. Therefore, how to construct a dynamic geometric constraint model that accommodates variable-speed motion has become the core issue in improving passive localization accuracy.
1.1. Overview
For the application of target passive localization based on UAVs, the optimization of data quality [
14,
15], formation configuration [
16] and flight paths significantly impacts the performance of the localization accuracy. By elevating data quality and selecting optimal measurements, the precision and robustness of the localization system can be effectively improved. The bearing-only passive localization model proposed by Ruiqing Yang et al. improves localization accuracy through the fusion of triangulation and greedy strategies, but fails to account for the impact of variable-speed motion on the observation geometric configuration [
17]. Through optimized formation geometry, the UAVs can maintain the most effective configuration during target localization, enabling continuous optimal localization geometry that significantly enhances both accuracy and continuity of the localization performance. Meanwhile, UAV optimization involves dynamically adjusting the flight paths and spatial configurations of the UAVs to optimize the observation conditions, ultimately achieving high-precision localization accuracy. These studies ensure that the observational data provided by the UAVs consistently approximates the minimum error conditions, thereby substantially improving the localization performance.
At the path optimization algorithm level, swarm intelligence algorithms have made significant progress, but several critical challenges remain. Zhanghan Li et al. proposed the APF-DDQN model, which integrates artificial potential fields with deep reinforcement learning to effectively address path optimization in complex obstacle-rich environments. However, its state space design does not account for dynamic velocity variations [
18]. The improved ACO algorithm proposed by Dong et al. [
19] introduces roulette wheel selection and elitist strategies, significantly enhancing task execution efficiency and survival probability in multi-task scenarios while resolving the poor convergence issues of traditional methods in handling multi-constraint, nonlinear optimization problems. The PSO algorithm modified by Hao et al. [
20] features nonlinear inertia weight adjustments and incorporates logarithmic function control mechanisms with Gaussian random adjustment strategies, substantially improving convergence speed and local search capabilities in UAV path optimization. Han et al. [
21] enhanced multi-UAV cooperative mission optimization by integrating Dubins curve constraints with multi-objective optimization strategies, improving convergence rates and global optimization capabilities while resolving the challenge of simultaneously satisfying trajectory length, mission payoff, and flight constraints. Shengxi Li et al. proposed a hybrid path optimization framework integrating Model Predictive Control (MPC) with Control Barrier Functions (CBFs). While this approach enhances dynamic obstacle avoidance through safety constraints, its prediction model still relies on the uniform motion assumption, resulting in path tracking errors during sudden maneuvering scenarios [
22]. However, the modified PSO algorithm only adopts the fixed speed updates [
20], which cannot accommodate variable-speed UAV movements. When combined with Dubins curves, the system remains constrained by curvature limitations, confining the search space to a fixed circle, restricting the improvement of the localization accuracy.
For UAV path optimization in complex obstacle-laden environments where computational demands are high and efficiency is low, several algorithms have made breakthroughs. Wang et al.’s priority-based Artificial Potential Field (APF) algorithm [
23], Zhang et al.’s improved target-biased Rapidly-exploring Random Tree (RRT) path optimization algorithm [
24], Lu et al.’s RRT-APF hybrid path optimization algorithm [
25], and Xu et al.’s Three-phase Alternating Optimization Algorithm (TAOA) [
26] all overcome these limitations and resolve the issue of excessive computational burdens in complex environments. However, these methods suffer from poor adaptability and are unsuitable for variable-speed environments.
In dynamic environments, Hou et al. proposed a dynamic path optimization algorithm integrating improved Q-Learning with the Artificial Potential Field (APF) method [
27], resolving the issue where traditional methods fail to generate effective paths between path nodes. Zhou et al. developed a UAV TDOA localization path optimization algorithm based on the D-optimality criterion [
28], significantly enhancing localization accuracy and tracking stability for emitter targets. Li et al. presented a hybrid TDOA/FDOA-based UAV path optimization algorithm [
29], substantially improving position and velocity estimation accuracy for moving targets. However, these methods exhibit poor real-time performance and do not consider the situation where the UAVs have variable speed.
To improve the UAV path optimization efficiency, Pan et al. resolved the issues of traditional APF methods easily falling into local minima and formation instability caused by introducing rotation [
30]. Sun et al. proposed a UAV path optimization algorithm based on semantic maps and multi-scale feature fusion [
31], which notably improves path optimization efficiency for large-scale formations. Additionally, Zhou et al. significantly improved the efficiency and quality of image acquisition for 3D reconstruction through intelligent viewpoint selection and optimization strategies [
32]. Chen et al. transformed task allocation problems into a Vehicle Routing Problem (VRP) model and optimized observation time [
33], notably increasing the efficiency of multi-UAV cooperative remote sensing missions. Li et al. addressed the low efficiency of traditional swarm collision avoidance optimization by establishing a coupling matrix and employing the AR-SAS path optimization method [
34]. Li et al. also studied how to optimize UAV observation geometry configurations [
35], significantly reducing the flight path lengths and computational loads while resolving the inefficiency of conventional passive localization methods.
However, the above approaches are all under the assumption that the UAVs have a fixed speed and this will set strict constraints on the UAV path optimization problem, restricting the feasible region to circles with the UAVs as their centers. However, in the real situation, UAVs generally have variable speed; thus, the feasible region of the UAV path optimization problem is an annular region which extends the feasible region, leading to more optimal UAV position solutions.
1.2. Original Contribution
Evidently, the existing works mostly focus on UAV path optimization with fixed speed, which is mismatched with the real situation and will restrict the improvement of passive target localization accuracy. Thus, in this paper, we focus on the problem of UAV path optimization, considering UAVs with variable speed. Firstly, the signal model of the passive target localization method is established. Secondly, a UAV path optimization method is proposed based on the CRLB of the TDOA localization method, considering UAVs with variable speed. Thirdly, the PSO algorithm is applied to accelerate the optimization process and improve the efficiency of the path optimization algorithm. Simulation results illustrate that the proposals in this paper could achieve better localization accuracy with less computational complexity. The original contributions of this paper are as follows:
- (1)
A novel UAV path optimization method based on the CRLB of the TDOA localization method is proposed, where the trace of the CRLB is set as the objective function and UAV moving constraints are set as the constraint conditions. By optimally configuring the positions of the UAVs for each time step, the localization accuracy is improved.
- (2)
Different from the existing works where the UAVs are assumed to have fixed speed, in this paper, we consider the UAVs to have variable speed; thus, the feasible region of the UAV positions is extended from a circle to an annular region and the localization accuracy is improved.
- (3)
To accelerate the UAV path optimization process and improve the efficiency of the algorithm, the PSO algorithm is applied to search for the optimal positions of the UAVs at the next time step.
2. Signal Model of Passive Target Localization
In this section, the TDOA algorithm is applied for target localization. Since the TDOA algorithm generally has high computation complexity, the Taylor series expansion algorithm is introduced to accelerate the localization algorithm, and the modified TDOA–Taylor algorithm could achieve better computation efficiency. Although the modified TDOA–Taylor algorithm benefits from the smaller computation load, it is affected by the initial value of the target state error at a large scale. If the initial error of the target state is large, the accuracy of the target localization would be poor. Thus, the Chan algorithm is also introduced to guarantee the accuracy of the initial value of the target state, and the modified TDOA–Chan–Taylor algorithm could achieve better localization accuracy with low computation complexity.
2.1. Signal Model of the TDOA Algorithm
TDOA refers to a localization technique that determines the location of a target by measuring the time differences of the target’s arrival at different base stations. It is a multi-sensor localization technology which requires at least three receivers for localization.
The distance
from the target to any base station
j is
where the target position
and the coordinates of the base station
j are
.
The distance difference
between the target of the secondary station
j and the primary station is
Then, we can get the following nonlinear hyperbolic equations:
Thus, the TDOA localization problem reduces to solving for the intersection point of the aforementioned hyperbolic equations.
2.2. Modified TDOA–Chan Algorithm
The basic idea of the Chan algorithm is to calculate the position of the target using a two-step weighted least squares algorithm. Instead of the iterative process of the TDOA algorithm, the TDOA–Chan algorithm could obtain the analytical expression solution of the target position, achieving lower computation complexity and a good initial value of the target state.
By squaring the two sides of Equation (
1), we get
where
.
By arranging and squaring the two sides of Equation (
2), we obtain
Let
; we get
Subtracting Equation (
6) from Equation (
5), we have
According to the above derivations, when the total number of the base stations in a two-dimensional space is 3, the relationship between the target position
and
can be obtained as follows:
where
.
When the number of the base stations , a two-step weighted least squares method needs to be applied to obtain the position of the target.
Let the unknown variable matrix
and the time difference noise vector
be
where
Then the covariance matrix
of the time difference noise vector
is
where
and
R is the covariance matrix of the time difference measurements.
When the distance is large, the maximum likelihood estimation of
can be written as
Then we can get
Next, we conduct the calculation of the second weighted least squares algorithm. When there is random noise,
D,
, and
can be expressed as
where
represents the ideal design matrix under noiseless conditions,
denotes the true observation vector without measurement errors,
is the true 3D vector containing target coordinates (x, y) and the distance between the target and the reference station,
is the design matrix deviation from base station coordinate measurement errors,
is the observation vector error propagated from time difference measurement noise, and
is the estimation error of the first weighted least squares step solution.
Then the covariance matrix of
is
Then the expression of the time difference noise vector
can be obtained as
where
,
,
,
.
Following an analogous procedure to Equation (
12), the covariance matrix
of the time difference noise vector
for the second weighted least squares step can be derived. When the measurement noise and estimation residuals are small, since
obeys the Gaussian distribution, the maximum likelihood estimation of
can be obtained as
. When the distance between the base station and the target is large, the covariance matrix of
can be simplified as
According to the above derivations, the unambiguous solution of the unknown quantity can be obtained through two weighted least squares methods. Finally the position of the target can be obtained.
2.3. Modified TDOA–Chan–Taylor Algorithm
The Taylor expansion algorithm is significantly affected by the initial target position estimate. Thus, this subsection introduces the modified TDOA–Chan–Taylor algorithm. We utilize the target position estimate
of the TDOA–Chan algorithm as the initial value for Taylor expansion. By neglecting second-order and higher terms, Equation (
1) is transformed into
where
,
,
is the residual vector, and
represents the distance from the
j-th base station to the initial value.
Using the weighted least squares algorithm, the least squares estimation of the target position can be obtained as follows:
where
Q is the covariance matrix of the TDOA measurement.
3. CRLB Derivation and Problem Formulation
In this section, the CRLB for the TDOA–Chan–Taylor localization algorithm is derived, giving the theoretical lower bound of the target localization accuracy. Then, the trace of the CRLB is set as the objective function with the UAV motion constraints set as the constraint conditions, and the UAV path optimization problem is formulated.
3.1. CRLB Derivation for the TDOA Localization Algorithm
For an unbiased estimator, the minimum error of the estimator can be characterized by the CRLB of the estimator. Thus, in this paper, the CRLB of the TDOA algorithm is set as the objective function of the UAV path optimization problem, and through searching for the optimal UAV positions, the accuracy of the target localization performance can be improved.
Based on the TDOA
between the base station
j and the reference station, we establish the measurement equation as
where
, the measurement error
satisfies the normal distribution
, and
is the actual observed value.
is a zero-mean vector of dimension
, and
is the measurement noise covariance matrix of dimension
.
According to
, we can get the Jacobi matrix
.
Then, the Fisher matrix can be obtained as
Taking the inverse of the above equation, the CRLB
is obtained:
3.2. Problem Formulation
In this subsection, the UAV path optimization problem is formulated. Firstly, the minimum trace of the CRLB for the TDOA–Chan–Taylor localization algorithm is set as the objective function of the UAV path optimization problem, which would guarantee the accuracy of the target localization performance. The objective function is shown as follows:
where
represents the positions of the
ith UAV.
Meanwhile, to ensure the feasibility and stability of the UAV flight operations, the following constraint conditions need to be satisfied:
- (1)
Minimum approach distance constraint: At any given time
t, the distance
from the
ith UAV to the target needs to satisfy
where
is the minimum allowable distance between the UAVs and the target. This constraint prevents the UAVs from approaching the target too closely and being detected, thus ensuring the stealth and security of the mission.
- (2)
Heading angle deviation constraint: For the
ith UAV at time
t, the heading angle deviation
between two successive time steps must satisfy
where
is the minimum value of the heading angle deviation for the UAVs, and
is the maximum value of the heading angle deviation for the UAVs.
- (3)
Heading angle constraint: The heading angle
between the heading angle of the
ith UAV at time
t and the reference direction must satisfy
where
is the minimum value of the heading angle for the UAVs, and
is the maximum value of the heading angle for the UAVs. For target localization application, drastic changes in heading deviation angles
and heading angles
are detrimental to the UAVs’ successive target observation. Simultaneously, these angular constraints effectively prevent invalid solutions and enhance the efficiency of the path optimization.
- (4)
Velocity constraint: The velocity
of the
ith UAV at time
t satisfies
where
is the minimum velocity of the
ith UAV, and
is the maximum velocity of the
ith UAV.
- (5)
Energy Constraint: At any given time
t, the distance
from the
ith UAV to the target needs to satisfy
where
is the maximum allowable maneuver distance of the
ith UAV and
is the effective battery endurance time. This constraint limits the maximum maneuverable distance of UAVs to ensure that their path planning remains within the battery’s endurance range, thereby preventing mission failure due to power depletion.
Different from the existing works, the UAVs in this paper are considered with variable speed, and the velocity of the UAVs is limited to the range of [, ]. Thus, the feasible region of the optimization problem is extended from the conventional fixed-radius circumferences to annulars centered at the UAVs’ current positions with the inner radius of and the outer radius of .
By considering the UAVs with variable speed, the proposals in this paper could be more in accordance with the actual situation, significantly improving the UAVs’ flexibility and environmental adaptability.
4. PSO Algorithm
In this paper, the PSO algorithm is applied to search for the optimal UAV positions at each time step to improve the target localization accuracy. Through optimizing the UAV positions for successive time steps, the optimal UAV path would be obtained.
Assume that at each iteration there are
S particles, and each particle
s represents a candidate UAV heading angle deviations strategy
where
represents the heading angle deviation of the
sth particle at the
kth iteration of time step
t for the
ith UAV. Through the heading angle deviation
, the position of the
sth particle at the
kth iteration of time step
t,
, can be obtained.
At the
kth iteration of time step
t, the velocity of the
sth particle is denoted as
and its heading angle deviation is
. Then at the
th iteration of time step
t, the velocity
and heading angle deviation
of the
sth particle can be updated as
where
represents the individual historical best position of the
sth particle at the
kth iteration of time step
t,
represents the global best position of the entire particle swarm at the
kth iteration of time step
t, and
and
represent the uniformly distributed random variables in the range of
, which are used to introduce randomness and avoid getting trapped in local optima.
denotes the individual learning factor, which can adjust the step size of the particle moving towards its own historical best position
, and
denotes the social learning factor, which can adjust the step size of the particle moving towards the global best position
.
To select the optimal solution, the performance of these
S particles is evaluated using the fitness function. At the
kth iteration of time step
t, the fitness function for the
sth particle is as follows:
where
is the fitness value when the heading angle deviation of the
sth particle is
and the related position of the
sth particle is
,
is the CRLB when the heading angle deviation of the
sth particle is
and the related position of the
sth particle is
, and
denotes the trace operation. Then, the individual historical best position
and the global best position
can be updated as
When the iteration reaches the maximum number, the iteration stops and outputs the optimal solution; thus, the optimal UAV positions for the next time step
can be obtained. The flowchart of the PSO algorithm is shown as follows, see
Figure 1.
5. Numerical Simulations and Results
In this section, numerical simulations are performed to verify the validity and effectiveness of the proposals in this paper. Firstly, the basic parameters and the initial topology of the UAVs and the target are given. Secondly, five methods are presented and compared to verify the advantages of the proposed method in this paper, and the five methods include the following: (1) the PSO algorithm considering the UAVs to have variable speed; (2) the PSO algorithm considering the UAVs to have a fixed speed; (3) the exhaustive algorithm considering the UAVs to have variable speed; (4) the exhaustive algorithm considering the UAVs to have a fixed speed; (5) the algorithm considering the UAVs to have a fixed formation.
5.1. Parameters Configuration
Consider that there are four UAVs and two stationary targets in a two-dimensional plane. The initial coordinates of the UAVs are [0, 0] m, [8660, 0] m, [4330, 7500] m, and [0, 8660] m, and the targets are located at [20,000, 10,000] m and [15,000, 15,000] m. The initial topology of the UAVs and the targets is given in
Figure 2, where the green squares represent the initial positions of the UAVs, the blue and red five-pointed stars indicate the two target positions, and the blue arrows indicate the moving directions of the UAVs.
Through systematic parameter sensitivity analysis, we have determined the optimal parameter configuration scheme in this paper. The speed range for the variable-speed UAVs is set to [200, 400] m/s, and the speed value for the fixed-speed UAVs is set to 300 m/s. Through systematic experiments, it is determined that the global optimal fitness value stabilizes when the number of iterations reaches 110. To ensure algorithmic robustness, the maximum number of iterations is ultimately set to 150. The cognitive component and the social component are both set to 2 to effectively balance the influence of individual particle experience and swarm knowledge on the search process, and the inertia weight w is set to 0.9 to maintain strong global search capability during the initial phase of the algorithm. Since the feasible region for the PSO algorithm considering the UAVs with variable speed is larger than that for the PSO algorithm considering the UAVs with fixed speed, the number of the particles is 1000 for the PSO algorithm considering the UAVs with variable speed and the number of the particles is 100 for the PSO algorithm considering the UAVs with fixed speed, which can balance computational efficiency with precision requirements. To mitigate collision risks, the minimum approach distance is set to 8000 m, the heading angle range is set to [, ], and the heading deflection angle range is set to [].
5.2. Simulation Results Based on the Exhaustive Algorithm
In this part, we conduct the simulation experiments for different UAV path optimization algorithms, which include (1) UAVs with a fixed formation; (2) the UAV path optimization algorithm based on CRLB considering UAVs with fixed speed; and (3) the UAV path optimization algorithm based on CRLB considering UAVs with variable speed. The simulation results of the UAV path based on different UAV path optimization methods are given in
Figure 3 and
Figure 4, where
Figure 3 shows the UAV path with a fixed formation and
Figure 4 shows the UAV paths optimized by the CRLB algorithm where (a) considers UAVs with a fixed speed and (b) considers the UAVs with variable speed. Also, in
Section 5.2, we consider the UAV path optimized based on the exhaustive algorithm.
To evaluate the localization accuracy of different methods, we employ the Root Mean Square Error (RMSE) as the accuracy metric and conduct a comparative analysis of the localization accuracy between different methods.
where
N denotes the number of Monte Carlo experiment trials,
represents the true position of the target at time step
k and
represents the estimated position of the target at time step
k. A total of 100 independent Monte Carlo trials are conducted to complete the comparison.
From
Figure 5, the simulation results of the RMSE and CRLB between different UAV path optimization algorithms in the dual-target scenario are shown and compared, including the fixed formation, the exhaustive algorithm considering UAVs with fixed speed, and the exhaustive algorithm considering UAVs with variable speed. (a) presents the comparison between the RMSE and CRLB for the target located at [20,000, 10,000] m, while (b) shows the comparison between the RMSE and CRLB for the target located at [15,000, 15,000] m. From the results, we can see that for the target located at [20,000, 10,000] m, the RMSE of the UAVs with fixed formation converges to 12.23 m and the CRLB converges to 0.90 m. The RMSE of the UAV path optimized based on the exhaustive algorithm considering the UAVs with fixed speed converges to 14.47 m and the CRLB converges to 0.89 m. The RMSE of the UAV path optimized based on the exhaustive algorithm considering the UAVs with variable speed converges to 9.85 m and the CRLB converges to 0.41 m. For the target located at [15,000, 15,000] m, the RMSE of the UAVs with fixed formation converges to 11.65 m and the CRLB converges to 0.47 m. The RMSE of the UAV path optimized based on the exhaustive algorithm considering the UAVs with fixed speed converges to 15.21 m and the CRLB converges to 0.98 m. The RMSE of the UAV path optimized based on the exhaustive algorithm considering the UAVs with variable speed converges to 9.65 m and the CRLB converges to 0.40 m. Thus, compared to the UAVs with fixed formation, the UAV path optimized based on the exhaustive algorithm considering the UAVs with variable speed reduces the average RMSE of the two targets by about 18.35% and reduces the average CRLB by about 50.16%. Then, compared to the UAV path optimized based on the exhaustive algorithm considering the UAVs with fixed speed, the UAV path optimized based on the exhaustive algorithm considering the UAVs with variable speed reduces the average RMSE by about 34.27% and reduces the average CRLB by about 56.62%. The results indicate that by considering the UAVs with variable speed, the UAV path optimization algorithm could significantly improve the target localization accuracy in the dual-target scenario.
5.3. Simulation Results Based on the PSO Algorithm
In
Section 5.2, the simulation results for the UAV path optimization algorithms, which include the fixed formation, the UAV path optimization algorithm considering the UAVs with fixed speed, and the UAV path optimization algorithm considering the UAVs with variable speed, are all obtained based on the exhaustive algorithm. To improve the effectiveness of the path optimization algorithm, in this paper, the PSO algorithm is deployed to search for the optimal positions of the UAVs at the next time step, reducing the computation complexity and improving the real-time performance.
As can be seen from
Figure 6, the UAV paths optimized based on the PSO algorithm and the exhaustive algorithm considering the UAVs with fixed speed are shown and compared, where the red squares represent the UAV path optimized based on the PSO algorithm, and the blue squares represent the UAV path optimized based on the exhaustive algorithm. Obviously, the UAV paths optimized based on the PSO algorithm and the exhaustive algorithm considering the UAVs with fixed speed are almost the same because they are both optimized based on the CRLB criterion but by different optimization algorithms.
As shown in
Figure 7, the UAV paths optimized based on the PSO algorithm and the exhaustive algorithm considering the UAVs with fixed speed are shown and compared, where the red squares represent the UAV path optimized by the PSO algorithm considering the UAVs with variable speed, and the blue squares represent the UAV path optimized by the exhaustive algorithm considering the UAVs with variable speed. Obviously, the UAV paths optimized based on the PSO algorithm and the exhaustive algorithm considering the UAVs with variable speed are almost the same, because they are both optimized based on the CRLB criterion but by different optimization algorithms.
As shown in
Figure 8, the RMSE and the CRLB for different UAV paths optimized by the PSO algorithm and the exhaustive algorithm considering UAVs with fixed speed are given and compared. (a) presents the comparison between the RMSE and CRLB for the target located at [20,000, 10,000] m, while (b) shows the comparison between the RMSE and CRLB for the target located at [15,000, 15,000] m. From the simulation results, we can see that for the target located at [20,000, 10,000] m, the RMSE for the UAV path optimized by the exhaustive algorithm considering the UAVs with fixed speed converges to 14.47 m and the CRLB converges to 0.90 m. The RMSE for the UAV path optimized by the PSO algorithm considering the UAVs with fixed speed converges to 10.47 m and the CRLB converges to 0.46 m. For the target located at [15,000, 15,000] m, the RMSE for the UAV path optimized by the exhaustive algorithm considering the UAVs with fixed speed converges to 15.21 m and the CRLB converges to 0.98 m. The RMSE for the UAV path optimized by the PSO algorithm considering the UAVs with fixed speed converges to 13.05 m and the CRLB converges to 0.72 m. Thus, compared to the UAV path optimized based on the exhaustive algorithm considering the UAVs with fixed speed, the UAV path optimized based on the PSO algorithm considering the UAVs with fixed speed reduces the average RMSE of the two targets by about 20.93% and reduces the average CRLB by about 37.40%. The results indicate that according to the PSO algorithm considering the UAVs with fixed speed, the UAV path optimization algorithm could significantly improve the target localization accuracy in the dual-target scenario.
The situation where the UAVs are considered with variable speed is almost the same as the situation where the UAVs are considered with fixed speed. As shown in
Figure 9a, the RMSE for the UAV path optimized by the PSO algorithm converges to 5.54 m and the CRLB converges to 0.13 m. The RMSE for the UAV path optimized by the exhaustive algorithm converges to 9.85 m and the CRLB converges to 0.41 m. As shown in
Figure 9b, the RMSE for the UAV path optimized by the PSO algorithm converges to 5.62 m and the CRLB converges to 0.13 m. The RMSE for the UAV path optimized by the exhaustive algorithm converges to 9.65 m and the CRLB converges to 0.40 m. Thus, the RMSE and the CRLB obtained by the PSO algorithm are significantly lower than those obtained by the exhaustive algorithm. Compared to the UAV path optimized based on the exhaustive algorithm, the UAV path optimized based on the PSO algorithm considering the UAVs with variable speed reduces the average RMSE of the two targets by about 42.76% and reduces the average CRLB by about 67.58%. The results indicate that according to the PSO algorithm, the UAV path optimization algorithm could significantly improve the target localization accuracy in the dual-target scenario.
As shown in
Figure 10, the simulation results of the RMSE and the CRLB between different UAV path optimization algorithms are shown and compared, including the fixed formation, the PSO algorithm considering the UAVs with variable speed and the PSO algorithm considering the UAVs with fixed speed. (a) presents the comparison between the RMSE and CRLB for the target located at [20,000, 10,000] m, while (b) shows the comparison between the RMSE and CRLB for the target located at [15,000, 15,000] m. From the results, we can see that for the target located at [20,000, 10,000] m, the RMSE of the UAVs with fixed formation converges to 12.23 m and the CRLB converges to 0.90 m. The RMSE of the UAVs with fixed speed converges to 10.47 m and the CRLB converges to 0.46 m. The RMSE of the UAVs with variable speed converges to 5.54 m and the CRLB converges to 0.13 m. For the target located at [15,000, 15,000] m, the RMSE of the UAVs with fixed formation converges to 11.65 m and the CRLB converges to 0.74 m. The RMSE of the UAVs with fixed speed converges to 13.05 m and the CRLB converges to 0.72 m. The RMSE of the UAVs with variable speed converges to 5.62 m and the CRLB converges to 0.13 m. Thus, compared to the UAVs with fixed formation, the UAVs with variable speed reduce the average RMSE of the two targets by about 53.25% and the average CRLB by about 83.80%. Then, compared to the UAVs with fixed speed, the UAVs with variable speed reduce the average RMSE by about 52.03% and the average CRLB by about 77.04%. The results indicate that the PSO algorithm considering the UAVs with variable speed could significantly improve the target localization accuracy.
To verify the effectiveness of the PSO algorithm, we compare the average computation time per time step for different UAV path optimization algorithms. From
Figure 11, we can see that 17.50 s is needed for the exhaustive algorithm considering the UAVs with fixed speed, 656.44 s is needed for the exhaustive algorithm considering the UAVs with variable speed, 17.04 s is needed for the PSO algorithm considering the UAVs with fixed speed, and 20.96 s for the PSO algorithm considering the UAVs with variable speed. Thus, the computational efficiency of the PSO algorithm considering the UAVs with variable speed reduces by about 96.81% compared to the exhaustive algorithm considering the UAVs with variable speed. However, the computation time for the PSO algorithm considering the UAVs with variable speed is slightly higher than that for the PSO algorithm considering the UAVs with fixed speed and the exhaustive algorithm considering the UAVs with fixed speed. The localization accuracy of the former algorithm is better than that of the latter algorithms. Thus, from the numerical simulation results, it is illustrated that the proposed method in this paper could achieve better localization accuracy with proper algorithm efficiency.
6. Conclusions
In this paper, we study the problem of UAV path optimization based on the CRLB criterion. To overcome the stringent constraint for the traditional method that the UAVs have a fixed speed, in this paper, we consider that the UAVs have variable speed, which is more suitable to the real situation. However, when the UAVs have variable speed, the feasible region is extended and the computation complexity increases. Thus, in this paper, the PSO algorithm is deployed to accelerate the searching process and reduce the computation time to obtain the optimal UAV positions at the next time step. Firstly, the signal model of the target localization is established. Secondly, the CRLB for the TDOA localization algorithm is derived and the UAV path optimization problem based on the CRLB is formulated. Thirdly, the PSO algorithm is deployed to search for the optimal UAV positions for the next time step. Finally, numerical simulation results show that compared to the existing UAV path optimization algorithm, the method proposed in this paper could achieve higher localization accuracy and lower computation complexity.
7. Future Work
Future research will focus on a comprehensive and in-depth exploration of several key directions.
First, in terms of model extension, current studies primarily concentrate on two-dimensional space. Future work aims to extend the model to three-dimensional domains, constructing novel localization models that incorporate the altitude dimension. This will fully account for UAVs’ unique motion characteristics in 3D space to more accurately describe their positional information. Building upon this, emphasis will be placed on addressing the dynamic decision-making problem of optimal UAV flight altitude. In real-world operations, a UAV’s flight altitude is influenced by multiple factors, including obstacle distribution, signal strength, and meteorological conditions. By integrating advanced optimization algorithms with real-time environmental data, the flight altitude can be dynamically adjusted to balance localization accuracy with flight safety, energy consumption, and other multi-objective requirements. Simultaneously, a combined approach of theoretical analysis and simulation verification will be adopted to quantitatively investigate the nonlinear relationship between flight altitude and localization accuracy. An accurate mathematical model will be established, and extensive simulation experiments will be conducted to analyze the variation patterns of localization accuracy under different flight altitudes. This will provide solid theoretical guidance for 3D path optimization in complex scenarios. Furthermore, as the model incorporates more practical factors, computational complexity will increase significantly. Therefore, a quantitative analysis of computational complexity will be performed to serve as a reference for algorithm optimization and real-world applications.
In terms of algorithmic comparison and validation, we plan to establish a comprehensive multi-dimensional evaluation system. This system is designed to conduct a systematic comparison between the proposed algorithm and mainstream approaches. These mainstream approaches encompass the extended Kalman filter, genetic algorithms, and emerging deep reinforcement learning methods. The comparison will be carried out across key metrics, namely localization accuracy, computational efficiency, energy consumption characteristics, and robustness. To ensure the objectivity and accuracy of the evaluation results, scientifically justified evaluation metrics and well-designed testing scenarios will be developed. Leveraging this framework, a detailed comparative analysis will be conducted to assess algorithm performance under different conditions, thoroughly investigating the strengths, weaknesses, and applicable domains of each method. The findings will provide a theoretical basis for algorithm selection in practical applications. Additionally, targeted improvements will be proposed to address the limitations of the proposed algorithm, further enhancing its performance and competitiveness.
To better evaluate the algorithm’s practical applicability, future work also plans to conduct experiments using real UAVs. A comprehensive experimental platform will be established, integrating UAV hardware systems, sensor equipment, ground control stations, and other components to ensure smooth experiment execution. Additionally, diverse experimental scenarios will be designed, covering varying terrains, meteorological conditions, and mission requirements, to thoroughly validate the algorithm’s performance across different real-world environments. During the experiments, the focus will be on verifying the effectiveness of the proposed algorithm by comparing its performance with theoretical analyses and simulation results. Key evaluation metrics will include localization accuracy and path optimization rationality in practical applications. Furthermore, the algorithm’s computational time for determining the UAV’s next waypoint must meet real-time transmission requirements. Real-time performance serves as a critical metric for UAV applications. Only when ensuring that the algorithm can swiftly compute and transmit results can safe flight operations and mission execution be guaranteed. Experimental data analysis will further optimize the algorithm’s performance, enhancing its real-time capability and reliability.