Next Article in Journal
Image Fusion and Target Detection Based on Dual ResNet for Power Sensing Equipment
Previous Article in Journal
Bridging Domain Gaps in Computational Pathology: A Comparative Study of Adaptation Strategies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Efficient Multi-Target Localization Using Dynamic UAV Clusters

1
Department of Control Science and Engineering, Tongji University, Shanghai 201804, China
2
Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, Shanghai 201210, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(9), 2857; https://doi.org/10.3390/s25092857
Submission received: 27 March 2025 / Revised: 28 April 2025 / Accepted: 29 April 2025 / Published: 30 April 2025
(This article belongs to the Section Sensor Networks)

Abstract

:
This paper proposes a dynamic unmanned aerial vehicle (UAV) clustering model for multi-target localization in complex 3D environments, where mobility-aware cluster formation is integrated to enhance collaborative localization accuracy. We derive the Cramér–Rao lower bound (CRLB) for localization performance analysis under measurement and motion-induced uncertainties. To solve the NP-hard clustering problem, we develop the MDQPSO-ASA algorithm, which combines multi-swarm discrete quantum-inspired particle swarm optimization with adaptive simulated annealing, incorporating a repair mechanism to satisfy spatial and cardinality constraints. Simulation results demonstrate the algorithm’s superiority in localization accuracy, computational efficiency, and adaptability to varying UAV/target scales compared to baseline methods. The developed algorithm provides an effective solution for resource-constrained collaborative localization tasks in practical scenarios.

1. Introduction

With the rapid development of unmanned aerial vehicle (UAV) technology, utilizing dynamic UAV networks for collaborative localization of multiple targets has become an important research topic. This technology demonstrates significant application potential in areas such as disaster rescue [1], wildlife monitoring [2], and infrastructure inspection [3]. However, due to the limited resources of UAV networks, insufficient individual sensing capabilities, and the wide distribution of targets, constructing an effective clustering model to achieve wide-area collaborative localization presents a critical challenge. In this context, it is necessary to design a clustering scheme that comprehensively considers the dynamic movement of UAVs, sensing resource allocation within the cluster, and the requirements of localization tasks.
The problem of UAV clustering for collaborative localization involves using multiple mobile UAVs to locate several potential non-cooperative targets (NCTs) in 3D environments. In this process, the UAVs, which are constantly moving, are divided into several clusters, each acting as a collaborative unit to localize the targets. This dynamic clustering problem aims to achieve optimal overall localization performance while accounting for the continuous motion of UAVs. Although extensive research has been conducted on target localization and wireless sensor network clustering separately, very few studies have systematically integrated UAV clustering strategies with multi-target localization tasks, particularly in scenarios where UAVs are mobile. Therefore, this paper aims to fill this research gap by exploring how to significantly improve the accuracy and efficiency of multi-target localization through adaptive clustering strategies and efficient collaboration mechanisms.
Traditional clustering methods often focus on optimizing specific metrics, such as minimizing energy consumption, reducing communication latency, maximizing coverage, or balancing nodes within clusters. However, in the context of multi-UAV collaborative localization, optimizing these single objectives often fails to meet the demands of real-world tasks. The focus of this paper is on improving the accuracy of multi-target localization through collaboration among multiple UAVs. Specifically, the goal of clustering is not only to achieve rational resource allocation but also to ensure that UAVs within each cluster can collaborate efficiently, fully leveraging their respective sensing and communication capabilities to achieve high-precision localization of potential NCTs in complex 3D environments. Additionally, algorithm design must consider real-time performance and robustness in dynamic environments to address uncertainties such as target movement and communication quality.
In summary, this paper addresses the challenge of multi-target localization using dynamic UAV clusters by proposing a novel clustering model that accounts for UAV mobility and resource constraints. The model integrates mobility-aware cluster formation and communication-efficient quality metrics to enhance localization accuracy in complex 3D environments. The proposed MDQPSO-ASA algorithm combines multi-swarm optimization, discrete quantum-inspired particle swarm optimization, and adaptive simulated annealing to solve the NP-hard clustering problem efficiently. Simulation results validate the superior performance of the proposed approach in terms of solution quality and computational efficiency.
The main contributions of this paper are summarized as follows:
  • We propose a novel dynamic clustering model for multi-target localization using UAV clusters, which accounts for UAV mobility and resource constraints. This model utilizes multiple UAVs in different clusters to collaboratively localize NCTs, enhancing localization accuracy and operational maneuverability in complex 3D environments.
  • We derive the Cramér–Rao lower bound (CRLB) for the proposed clustered localization framework, incorporating measurement uncertainties and motion-induced errors. This theoretical analysis provides critical insights into the achievable localization accuracy under specific cluster configurations.
  • We propose the MDQPSO-ASA algorithm, which integrates multi-swarm optimization, discrete quantum-inspired particle swarm optimization, and adaptive simulated annealing. Additionally, a repair mechanism is introduced to ensure the feasibility of the solutions by enforcing spatial and cardinality constraints. This algorithm effectively addresses the NP-hard clustering problem, ensuring high-quality solutions and computational efficiency.
  • We conduct extensive simulation experiments to validate the performance of the proposed algorithm. Through comparisons with baseline algorithms, we show that MDQPSO-ASA consistently achieves better solution quality and faster convergence. Besides, the results demonstrate the superior performance of the MDQPSO-ASA algorithm in terms of solution quality, computational efficiency, and adaptability to different scales of UAVs and NCTs.
The rest of this paper is organized as follows: Section 2 reviews related work in localization and clustering techniques. Section 3 presents the system model, including the dynamic UAV network architecture, mobility-aware quality metrics, and CRLB derivation. Section 4 details the proposed MDQPSO-ASA algorithm. Section 5 evaluates the performance of the proposed algorithm through simulation experiments. Finally, Section 6 concludes the paper.

2. Related Work

In this section, we review recent advancements in localization and clustering techniques and identify gaps that motivate our work.

2.1. Localization

In terms of target localization, major research and engineering practices typically employ range-based localization algorithms [4]. These algorithms include time of arrival (TOA), time difference of arrival (TDOA), direction of arrival (DOA), and received signal strength (RSS). The performance of these algorithms is typically evaluated using the classical Cramér–Rao lower bound (CRLB) [5], which provides a theoretical lower bound for the mean-square error (MSE) of unbiased estimators. However, as application scenarios become increasingly complex, traditional single-model approaches often struggle to achieve high localization accuracy.
Consequently, recent studies have shifted toward improving localization performance not only by integrating multiple measurement models or optimizing system design but also by leveraging emerging physical-layer technologies. For instance, the exploration of the terahertz (THz) band in 6G networks [6,7] has demonstrated remarkable potential in enhancing localization accuracy. With its ultra-wide bandwidth and high spatial resolution, the THz band surpasses the capabilities of mmWave-based systems [8], paving the way for transformative improvements in localization capabilities. Besides, Panwar et al. [9] proposed a hybrid TOA-RSS localization algorithm that enhances target localization accuracy under non line of sight (NLOS) conditions by solving a weighted least squares problem. Similarly, Wang et al. [10] addressed the issue of NLOS errors in indoor 3D TOA-based localization by proposing an error mitigation method. This approach estimates and corrects ranging errors and calculates the target position using linear least squares, significantly improving localization accuracy.
In parallel, sensor deployment strategy has emerged as a critical factor influencing localization performance. Xu et al. [11] investigated the optimal placement of sensors in 3D space based on the TOA model, applying the A-optimality criterion (i.e., minimizing the trace of the inverse Fisher information matrix (FIM)) to determine the best sensor positions under Gaussian noise conditions, thereby minimizing localization errors. Additionally, Zhao et al. [12] addressed the sensor placement optimization problem for TDOA-based localization in cluttered indoor environments using ultra-wideband signals, employing the block coordinate-wise minimization (BCM) algorithm [13] to solve the problem.
Nevertheless, these studies are largely confined to “single-target, multi-sensor” localization models and do not meet the requirements for cooperative localization in large-scale scenarios. There remain notable gaps in addressing the challenges of complex environments and large-scale cluster-based cooperative localization.

2.2. Clustering

Clustering has been a widely studied area, with significant advancements made in improving the performance of wireless sensor networks (WSNs) and multi-agent systems. For instance, Han et al. [14] introduced a meta-heuristic-based clustering protocol to extend network lifetime and enhance throughput, while Verma et al. [15] focused on securing data transmission in intelligent transportation systems through an innovative clustering method. Similarly, Guo et al. [16] developed the ICRA intelligent clustering routing algorithm to improve routing stability and reduce packet loss during data transmission. Additionally, El Khediri et al. [17] presented an energy-efficient K-means clustering algorithm for WSNs, prioritizing energy consumption as a key metric to prolong network operation under spatial constraints. Beyond WSNs, clustering has also been applied to multi-UAV systems. Wu et al. [18] addressed task allocation for UAV networks in large-scale interception scenarios, leveraging clustering to coordinate UAV groups effectively. Chen et al. [19] proposed an adaptive clustering approach for UAV path planning, optimizing time efficiency and task completion rates in heterogeneous multi-UAV missions.
Despite significant advancements in clustering, most existing studies primarily focus on traditional objectives, such as network formation, energy efficiency, and task allocation. Some research has explored target localization using UAV clusters, with notable contributions in this area. Arafat et al. [20] proposed particle swarm optimization (PSO)-based swarm intelligence localization (SIL) and swarm intelligence clustering (SIC) algorithms to address the localization and clustering challenges in UAV networks. Their work demonstrated improvements in localization accuracy while reducing energy consumption. However, their approach relies on the assumption that UAVs operate in static configurations, thereby overlooking the challenges arising from dynamic mobility To address this gap, recent studies have begun incorporating dynamic models into UAV localization frameworks. For instance, Garraffa et al. [21] proposed a closed-loop control system for mobile targets by integrating robotic methodologies, demonstrating stable tracking in dynamic scenarios. Nevertheless, these methods primarily focus on single-target tracking and lack scalable collaboration mechanisms for UAV clusters operating in complex 3D environments In contrast, our work aims to develop novel clustering strategies and collaboration mechanisms tailored to the dynamic nature of UAV networks, addressing the unique challenges posed by UAV mobility. This enables more accurate and robust localization in complex 3D environments, where both spatial dynamics and multi-agent coordination must be simultaneously considered.

3. System Model

In this section, we establish a cooperative localization model integrating UAV mobility patterns and estimation-theoretic analysis. The proposed system model consists of four fundamental components: (1) a dynamic UAV network-based localization architecture with cluster-based coordination mechanisms, (2) mobility-aware and communication-efficient quality metrics for cluster formation, (3) CRLB analysis incorporating both measurement uncertainties and motion-induced errors, and (4) an optimization formulation for cluster assignment of UAVs.

3.1. System Architecture and Basic Definitions

In this work, we investigate the cooperative localization problem of M NCTs with a UAV network consisting of N UAVs distributed in 3D space, where N M ensures system observability. The UAVs are represented by the set U = n = 1 N { u n } , while the NCTs are denoted by set G = m = 1 M { g m } . The position of UAV u n and NCT g m is represented by 3D vectors p n R 3 and q m R 3 , respectively. Besides, v n R 3 denotes the velocity of  u n .
To enhance localization accuracy, we adopt a cluster-based architecture where the UAVs are partitioned into K distinct clusters { C k } k = 1 K collaboratively estimating the position of a designated NCT g m G with each cluster C k U . The cardinality of each cluster, denoted by | C k | , represents the number of UAVs in C k . Figure 1 illustrates an example scenario of dynamic UAV clusters for multi-target localization. UAVs in the same cluster are expected to localize the same NCT while communicating with each other.
Let x n , k { 0 , 1 } to denote whether UAV u n is clustered into C k , where
x n , k = 1 if u n C k , 0 otherwise .
Then, let set I k denote the index set of UAVs in cluster C k , i.e., I k = { n x n , k = 1 } .
Besides, let y k , m { 0 , 1 } denote whether cluster C k is assigned to localize NCT g m , where
y k , m = 1 if C k g m , 0 otherwise .

3.2. Mobility-Aware Cluster Quality Metrics

Considering the mobility of UAVs and their changing positions, it is essential to maintain cluster nodes within an effective communication range and ensure similar movement trends. This contributes to network topology stability. Moreover, reducing communication overhead and delay within sensing clusters is crucial for preventing frequent topology changes. To achieve these objectives, we introduce three metrics to assess the cooperation quality of UAV clusters: position similarity, velocity similarity, and communication link consistency.

3.2.1. Position Similarity and Velocity Similarity

For any two UAVs, their distance d i , j is measured by the Euclidean distance between their positions
d i , j = p i p j 2 .
Next, we define the position similarity D i , j as follows.
Definition 1
(Position Similarity of a UAV Pair). Position similarity D i , j of u i and u j is defined as
D i , j = 1 d i , j d max ,
where d max is a scale factor constant.
Besides, we introduce the velocity similarity to evaluate the movement consistency of UAVs within a cluster.
Definition 2 
(Velocity Similarity of a UAV Pair). Velocity similarity V i , j of u i and u j is defined as cosine similarity
V i , j = v i · v j v i 2 v j 2 .
It can be observed that D i , j and V i , j are both symmetric, i.e., D i , j = D j , i and V i , j = V j , i .
Next, we consider the metrics for clusters. To begin with, let I k ( 2 ) denote the set of 2-element unordered pairs selected from the index set I k of C k , as defined by Equation (6).
I k ( 2 ) = I k 2 = { ( i , j ) i , j I k , i < j } .
To ensure the validity of the subsequent cluster metrics, we restrict our analysis to clusters with | C k | 2 .
The cluster diameter is introduced to evaluate the spatial distribution of UAVs within a cluster.
Definition 3 
(Cluster Diameter). Cluster diameter d k is defined as the maximum distance between any UAV pairs within the cluster
d k = max ( i , j ) I k ( 2 ) p i p j 2 .
Definition 4 
(Position Similarity of a Cluster). Position similarity of a cluster is the average position similarity of all UAV pairs within the cluster
D k = 1 | I k ( 2 ) | ( i , j ) I k ( 2 ) D i , j .
Remark 1. 
When choosing d max in Equation (4) to compute the position similarity of a UAV pair in the cluster, it is recommended to set d max max ( i , j ) I k ( 2 ) d i , j to ensure it is within the normalized range [ 0 , 1 ] .
Similarly, the velocity similarity of a cluster can be defined as follows.
Definition 5 
(Velocity Similarity of a Cluster). Velocity similarity of a cluster is the average velocity similarity of all UAV pairs within the cluster
V k = 1 | I k ( 2 ) | ( i , j ) I k ( 2 ) V i , j .
Finally, we define the motion similarity Φ k of cluster C k
Definition 6 
(Motion Similarity of a Cluster). Motion similarity Φ k of cluster C k is defined as the weighted sum of position similarity and velocity similarity
Φ k = ϵ D k + ( 1 ϵ ) V k ,
where 0 ϵ 1 is a weight factor.

3.2.2. Communication Link Consistency

The communication link consistency [22] serves as a critical metric for evaluating information propagation performance in sensor networks. To ensure robust, rapid, and efficient intra-cluster communication among UAVs, we introduce a mobility-based model for communication link consistency.
For two mobile UAVs u i and u j with identical communication radius r com , assuming constant relative velocity and direction over a short future time interval, the communication link consistency W i , j is defined as
W i , j = d j , j v i , j 2 T th if d j , j v i , j 2 < T th , 1 if d j , j v i , j 2 T th ,
where v i , j = v j v i is the relative velocity, p j = p j + ( v i , j / v i , j 2 ) · d j , j is the projected position of u j relative to u i when the distance between them exactly reaches r com , d j , j = p j p j 2 is the Euclidean distance between u j and the projected position, and T th is a time threshold for link stability, as illustrated in Figure 2.
Next, we define the communication link consistency of a cluster.
Definition 7 
(Communication Link Consistency of a Cluster). Communication link consistency W k of cluster C k is the average communication link consistency of all UAV pairs within the cluster
W k = 1 | I k ( 2 ) | ( i , j ) I k ( 2 ) W i , j .

3.3. Cramér–Rao Lower Bound Derivation

The CRLB [5] is a fundamental benchmark in estimation theory, establishing a theoretical lower bound on the variance in any unbiased estimator. In the context of clustered UAV-based localization, the CRLB provides critical insights into the achievable localization accuracy under specific cluster configurations and measurement models. This section derives the CRLB for the proposed clustered localization framework, explicitly incorporating key factors such as the geometric distribution and velocity of UAVs within clusters, and the statistical properties of measurement errors.
The CRLB derivation builds upon the following fundamental assumptions.
  • (A1) Measurement independence: Observations from different UAVs are statistically independent, i.e., δ m , n is independent of δ m , n for ( m , n ) ( m , n ) .
  • (A2) Error normality: Both position errors δ m , n loc and motion-induced errors δ n vel follow zero-mean Gaussian distributions.
  • (A3) Spatial-temporal uncorrelation: Measurement errors across different time instants and spatial positions are mutually uncorrelated.
Using the TOA model [23], the real distance between a UAV located at p n and an NCT at q m is given by
r m , n = p n q m 2 .
However, real-world observations from UAV u n for NCT g m are subject to normally distributed errors in sensor time-delay measurements, leading to noisy distance estimates
r ^ m , n = r m , n + e ,
where e represents measurement noise. According to [24], the position error from distance measurement of NCTs follows a zero-mean normal distribution with variance, that is
e N 0 , μ loc r m , n 4 ,
where μ loc is a proportionality constant.
To transform this error into Euclidean coordinates, the estimated position q ^ m , n of g m observed by UAV n is given by
q ^ m , n = q m + δ m , n loc ,
where δ m , n loc represents the equivalent localization noise, which follows a multivariate normal distribution
δ m , n loc N 0 , Σ m , n loc .
where the covariance matrix Σ m , n loc can be computed as
Σ m , n loc = μ loc r m , n 4 u m , n u m , n ,
where u m , n is the unit vector pointing from the UAV’s position p n to the NCT’s q m .
Beyond position noise, UAV mobility introduces an additional source of error. The velocity v n of UAV n affects its self-localization accuracy, as positioning measurements from external sensors (e.g., GPS or infrared systems) experience time-delay errors. Assuming the UAV self-localizes at a fixed clock frequency, the positioning error tends to increase with its velocity magnitude v n 2 . This inherent localization uncertainty propagates to the position estimation of NCTs, particularly in high-speed scenarios. Consequently, the motion-induced localization error δ n vel in Euclidean coordinates follows a multivariate normal distribution
δ n vel N 0 , Σ n vel ,
where Σ n vel is a velocity-dependent covariance matrix computed as
Σ n vel = μ vel v n 2 d n d n ,
where d n = v n / v n 2 is the unit velocity vector, and μ vel is a proportionality constant.
Combining these two sources of error, the total observed localization error for NCT m as measured by UAV u n is
δ m , n = δ m , n loc + δ n vel ,
which, according to assumption (A2), follows a multivariate normal distribution,
δ m , n N 0 , Σ m , n ,
where the total covariance matrix is given by
Σ m , n = Σ m , n loc + Σ n vel .
Figure 3 illustrates the spatial characteristics and superposition effects of these error components. Note that the resulting composite covariance matrix Σ m , n may also be singular or ill-conditioned. To address this, we replace the standard matrix inverse with the Moore–Penrose pseudoinverse, denoted as Σ m , n + , ensuring that the inversion required for subsequent calculations (e.g., in the FIM and CRLB computations) remains well-defined.
Following conventional estimation theory [23,25,26], we then employ the CRLB to quantify localization precision. The FIM, denoted as J , serves as the fundamental construct for CRLB computation. For a single observation q ^ m , n , the conditional probability density function (CPDF) is formulated as
p q ^ m , n q m = 1 ( 2 π ) 3 | Σ m , n | exp 1 2 q ^ m , n q m Σ m , n + q ^ m , n q m .
The corresponding log-likelihood function becomes
L q ^ m , n q m = 1 2 q ^ m , n q m Σ m , n 1 q ^ m , n q m 1 2 log | Σ m , n | 3 2 log ( 2 π ) .
For a UAV cluster C k with the UAV index set I k assigned to localize NCT g m , the joint CPDF under Assumptions (A1) and (A3) is
p q ^ m , k q m = n I k p q ^ m , n q m .
Then, the aggregate log-likelihood function consequently follows
L q ^ m , k q m = n I k L q ^ m , n q m = n I k log p q ^ m , n q m .
In 3D Euclidean space, the FIM adopts the symmetric structure
J = J X X J X Y J X Z J X Y J Y Y J Y Z J X Z J Y Z J Z Z ,
where each element is computed by
J θ 1 θ 2 q ^ m , k = E 2 L q ^ m , k θ 1 θ 2 ,
where θ 1 , θ 2 { X , Y , Z } . Leveraging multivariate normal distribution properties, the FIM simplifies to
J = n I k Σ m , n + .
The additive structure of FIM J = n I k Σ m , n + directly results from the assumption (A1), as cross-correlation terms between different UAV observations vanish.
Then, the CRLB B k , m of NCT m for the configured cluster C k is thereby computed as
B k , m = CRLB C k , m = tr ( J + ) = tr n I k Σ m , n + + ,
where tr ( · ) denotes matrix trace and · + is the Moore–Penrose pseudoinverse of · .

3.4. Problem Formulation for Optimal Cluster Assignment

We first consider the constraints among UAVs, clusters, and targets. Recall that x and y are indicator variables defined by Equations (1) and (2). Here, we assume each UAV can only be assigned to one cluster (The framework can be generalized by allowing each UAV to participate in multiple clusters, defined by k = 1 K x n , k κ n , where κ n specifies the maximum number of clusters UAV u n can belong to. This extension maintains mathematical consistency with our model. For simplicity, we focus on the case that κ n 1 in this paper.), i.e., k = 1 K x n , k 1 , n { 1 , , N } , while each cluster corresponds to one NCT, i.e., m = 1 M y k , m = 1 and k = 1 K y k , m = 1 , k { 1 , , K } , m { 1 , , M } . Besides, each cluster must have at least c UAVs, i.e., n = 1 N x n , k c , k { 1 , , K } , which also implies that the total number of UAVs must be at least c times the number of NCTs. Additionally, each cluster has a maximum radius constraint, i.e., d k d th , k { 1 , , K } .
Next, we define the objective function. For an NCT g m , both motion similarity Φ k ( m ) and communication link consistency W k ( m ) of its corresponding cluster C k ( m ) should be maximized to ensure effective collaboration while its CRLM B k ( m ) , m should be minimized to ensure localization performance. Thus, the objective function considering all NCTs and their corresponding clusters is formulated as
min max m α 1 B k ( m ) , m α 2 Φ k ( m ) α 3 W k ( m ) ,
where k ( m ) indicates that k is the index of the cluster assigned to the NCT g m decided by y, and α 1 , α 2 , α 3 are weight factors.
Thus, the optimization problem is formulated as P1.
P 1 : min x , y max m α 1 B k , m ( k ) α 2 Φ m ( k ) α 3 W m ( k ) s . t . k = 1 K x n , k 1 , n { 1 , , N } , ( C 1 ) m = 1 M y k , m ( k ) = 1 , k = { 1 , , K } , ( C 2 ) k = 1 K y k , m ( k ) = 1 , m { 1 , , M } , ( C 3 ) n = 1 N x n , m ( k ) c , k { 1 , , K } , ( C 4 ) d k d th , k { 1 , , K } . ( C 5 )
The formulated optimization problem is a zero-one programming problem, where the decision variables x and y are binary integers indicating the assignment of UAVs to clusters and the corresponding relationship between NCTs and clusters.
The formulated optimization problem P1 is computationally challenging due to multiple interrelated factors. First, the objective function involves nonlinear matrix operations through the CRLB term B k , m ( k ) = tr ( n I k Σ m , n + ) + . These operations include nested matrix inversions and trace computations, leading to a non-convex structure with many local minima. Second, the discrete constraints (C1)–(C4) introduce an NP-hard combinatorial structure.
While the original formulation P1 captures the full complexity of dynamic UAV clustering, its direct solution remains computationally intractable due to high-dimensional combinatorial search. Given the constraints (C2) and (C3) in the optimization problem P1, the variable y constitutes a permutation matrix, which inherently implies that the equality K = M must hold. This relationship leads to two significant consequences: First, k ( m ) becomes a single-valued function, allowing to use B k ( m ) to represent B k , m ( k ) for simplicity. Second, the correspondence between m and k establishes a bijective mapping. Consequently, each cluster C k can be uniquely associated with a certain NCT m ( k ) , enabling direct parameterization through either cluster indices { 1 , , K } or NCT indices { 1 , , M } . In this paper, we adopt the cluster index notation. Thus, the reformulated problem becomes P2.
P 2 : min x max k α 1 B k α 2 Φ k α 3 W k s . t . k = 1 K x n , k 1 , n { 1 , , N } , ( C 1 ) n = 1 N x n , k c , k { 1 , , K } , ( C 4 ) d k d th , k { 1 , , K } . ( C 5 )
The reformulated P2 is also a zero-one programming problem, where the decision variables x are binary integers. Even after simplification, P2 retains its non-convex and non-linear nature, making it challenging to solve. Therefore, it is necessary to establish an efficient algorithm to find a solution within a reasonable amount of computational time.

4. Methodology

4.1. Algorithm Framework

In this section, we present the proposed Multi-swarm Discrete Quantum-inspired Particle Swarm Optimization with Adaptive Simulated Annealing (MDQPSO-ASA) algorithm, a novel computational framework specifically designed to resolve the NP-hard dynamic UAV clustering problem. As depicted in Figure 4, MDQPSO-ASA integrates three synergistic components through bio-inspired computing principles: a multi-swarm optimization framework, a discrete quantum-inspired PSO module (DQPSO), and an adaptive simulated annealing (ASA) module. The multi-swarm framework enhances the diversity of the search process, increasing the global search capability and preventing premature convergence. The quantum-inspired updates help the algorithm escape local optima and improve the overall search effectiveness. The ASA module enables the algorithm to navigate through different areas of the solution space. Overall, the discrete nature of MDQPSO-ASA is particularly suitable for the zero-one integer programming problem presented in this paper, ensuring that the solutions adhere to the binary constraints. Additionally, a repair mechanism is incorporated to attempt to fix solutions that do not satisfy the cluster radius constraints. These features make MDQPSO-ASA well-suited for the dynamic and complex UAV clustering and multi-target localization scenarios addressed in this work. The overall procedure is detailed in Algorithm 1, and each component is explained in the following subsections based on its corresponding pseudocode.
In Algorithm 1, the multi-swarm framework initializes multiple swarms { F s } s = 1 S with random strategies (Line 1), ensuring a diverse set of starting points. Each swarm consists of P particles, whose positions { ξ s } s = 1 S representing potential solutions to the optimization problem. The temperature T and cooling rate r are initialized (Line 2) to control the simulated annealing process. The temperature adaptation counter is also initialized to track the progress of the temperature adaptation mechanism. The algorithm then initializes the global best ξ and local best ξ s for each swarm (Line 3), which are updated throughout the optimization process, allowing the algorithm to adaptively guide the search. The outer loop (Line 5) iterates until the maximum iteration count i t max is reached. For each iteration, a counter is incremented to track progress for temperature adaptation.
Then, for each swarm F s (Line 7), the DQPSO module is applied (Line 8) to update the positions of all particles using discrete quantum-inspired dynamics. Every τ 1 iterations (Line 9), the ASA module (Line 10) perturbs the current solutions based on a simulated annealing mechanism to escape local minima. Then, a repair mechanism (see Section 4.4) is adopted (Line 12) to ensure the feasibility of the solution. After processing the particles, a migration operation (Line 20) exchanges information among different swarms; that is, at fixed intervals τ 2 , the migration operation compares the best local solutions { ξ s } s = 1 S within each swarm and migrates better solutions from swarms to swarms. This facilitates information exchange across the population and enhances global search capability.
Algorithm 1 Framework of the proposed algorithm MDQPSO-ASA.
Require: 
Cost function C ( ξ ) , acceleration coefficient β , number of swarms S, number of particles per swarm P, initial temperature T 0 , cooling rate r 0 , ASA interval τ 1 , immigration interval τ 2
Ensure: 
Global optimum ξ , global minimum cost C min
1:
Initialize a set of swarms F = s = 1 S { F s } with random strategy where F s = p = 1 P { ξ p }
2:
Set initial temperature T T 0 and initial cooling rate r r 0
3:
Initialize global best ξ of F and local best ξ s of each F s
4:
Initialize temperature adaptation counter c n t 0
5:
for   i t = 0 , 1 , , i t max 1   do
6:
         c n t c n t + 1
7:
     for each swarm F s F  do
8:
            F s DQPSO( F s , β )                         ▷ see Section 4.2 and Algorithm 2
9:
           if  i t mod τ 1 = 0  then
10:
                 F s SA( F s , T )                          ▷ see Section 4.3 and Algorithm 3
11:
        end if
12:
         F s Repair( F s )                               ▷ see Section 4.4 and Algorithm 4
13:
        for each particle ξ p in F s  do
14:
              if  C ( ξ p ) < C ( ξ s )  then
15:
                   Update local best ξ s ξ p
16:
              end if
17:
        end for
18:
    end for
19:
    if  i t mod τ 2 = 0  then
20:
         F Migration( F )
21:
    end if
22:
    for each local best ξ s of swarm F s in F  do
23:
        if  C ( ξ s ) < C ( ξ )  then
24:
              Update global best ξ ξ s
25:
              Reset counter c n t 0
26:
        end if
27:
    end for
28:
     T , r , c n t TempAdapt( T , r , i t , c n t )                                  ▷ see Section 4.3
29:
end for
Finally, the algorithm checks each swarm’s best solution and updates the global best ξ if a better solution is found, while also resetting the temperature adaptation counter (Line 24). Next, the temperature T is updated using an adaptation mechanism (Line 28) that takes into account the cooling rate r and the stagnation counter c n t . This layered structure helps balance exploration and exploitation throughout the iterative process.
Note that in the algorithms introduced in this section, notation ξ represents the position of a particle in PSO, which corresponds to but is not identical to the decision variable x. The position ξ consists of continuous variables used in the optimization processes to represent potential solutions, while x consists of binary decision variables indicating the assignment of UAVs to clusters. The optimization process involves discretizing ξ to obtain feasible values for x, which will be further introduced in the DQPSO module (Section 4.2).

4.2. Discrete Quantum-Inspired PSO (DQPSO) Module

The DQPSO module, which is described in detail as Algorithm 2, is responsible for updating the positions of particles within each swarm by incorporating quantum-inspired principles.
Algorithm 2 DQPSO module.
Require: 
Swarm F s and acceleration coefficient β
Ensure: 
Updated swarm F s
1:
Compute mean position ξ ¯ 1 P p = 1 P ξ p
2:
for each particle ξ p in F s  do
3:
    Compute quantum rotation magnitude Q p β · ξ ¯ ξ p 2
4:
    Generate a set of random numbers w U ( 0 , 1 )
5:
    Update position ξ p ξ p + Q p ln 1 / w
6:
    Discretize ξ p Argmax–Softmax( ξ p )
7:
end for
Initially, the mean position ξ ¯ of all particles is computed (Line 1). Then, for each particle, the quantum rotation magnitude Q p is calculated as the product of the acceleration coefficient β and the Euclidean distance between the particle’s current position and the mean position (Line 3). A set of random numbers w with the same shape as ξ drawn from a uniform distribution in U ( 0 , 1 ) is used to generate a logarithmic perturbation, which is then added to the current position (Line 5). Since the optimization problem is discrete, the updated continuous value is discretized using a softmax-argmax combined function to compute the probability of the UAV switching to each cluster (Line 6). The cluster with the highest probability is then selected using the argmax function, setting its value to 1 and all others to 0. This ensures that the new positions adhere to the binary constraints of the problem.

4.3. Adaptive Simulated Annealing (ASA) Module

The ASA module, which consists of the temperature adaptation process and the SA procedure, ensures that the algorithm can traverse different regions of the solution space. For the temperature adaptation process, given the variables T, r, i t and c n t , the new r and temperature T can be updated by Equations (35) and (36), respectively.
r 0.9 · r , c n t 0 if c n t 5 ,
T T · exp ( r · i t ) .
The adaptive nature of the temperature parameter T, updated externally in the main framework, regulates the acceptance probability, balancing between exploration (high temperature) and exploitation (low temperature).
The SA procedure, as shown in Algorithm 3, is the key to the ASA module, which is designed to introduce thermal perturbations to each particle in the swarm, which helps in escaping local optima. For every particle, a neighboring solution ξ i is generated by applying a perturbation function (Line 2), which randomly selects two UAVs and swaps their cluster assignments. The energy difference Δ E , calculated as the difference between the cost of the new and current solutions, quantifies the change in objective value (Line 3). Using a randomly generated number h from a uniform distribution U ( 0 , 1 ) and the Metropolis acceptance criterion, the algorithm accepts the new solution if it yields a lower cost or with a probability proportional to exp ( Δ E / T ) when the cost is higher (Lines 5 and 6). This controlled acceptance mechanism allows the swarm to occasionally accept worse solutions, thereby maintaining diversity and improving the potential for global exploration.
Algorithm 3 SA procedure.
Require: 
Swarm F s , temperature T
Ensure: 
Modified swarm F s
1:
for each particle ξ p in F s  do
2:
    Generate neighbor ξ p Perturb( ξ p )
3:
    Compute energy difference Δ E C ( ξ p ) C ( ξ p )
4:
    Generate a random number h U ( 0 , 1 )
5:
    if  Δ E < 0 or exp ( Δ E / T ) > h  then
6:
           ξ p ξ p
7:
    end if
8:
end for

4.4. Repair Mechanism

The repair mechanism ensures solution feasibility by enforcing both spatial and cardinality constraints through a three-stage process, as formalized in Algorithm 4. This critical component guarantees that all candidate solutions satisfy conditions (C4) and (C5).
| C k | c , k { 1 , , K } ( C 4 ) max ( i , j ) I k ( 2 ) p i p j 2 d th k { 1 , , K } ( C 5 )
In the first stage, the algorithm enforces the spatial constraints within each cluster. For every cluster C k , it computes the centroid χ k (Line 4) and then identifies the pair of UAVs that are farthest apart (Line 5). If the distance between these two UAVs exceeds the threshold d th (Line 6), the UAV farther from the centroid within the pair is removed from the cluster and placed into a temporary set U (Lines 7 and 8). This process is repeated until the maximum pairwise distance within the cluster is within the allowable range, ensuring spatial compactness and communicational efficiency.
The second stage addresses the cardinality constraints. For any cluster C k with fewer than c members (Line 12), the algorithm creates a candidate set V by gathering UAVs from other clusters that exceed the minimum size (Line 13). These candidate UAVs are randomly shuffled (Line 14), and each candidate is evaluated to check if its addition to the under-populated cluster will keep the spatial constraint satisfied (Line 15). When a candidate UAV meets the condition, it is moved from its original cluster C k to the deficient one C k (Line 16). This reallocation continues until the cluster achieves the minimum required cardinality.
In the third stage, the algorithm attempts to reassign any UAVs that remain unassigned in the set U . Each UAV in this set is considered for reassignment to the cluster whose centroid is closest, provided that its inclusion does not violate the spatial constraint (Line 22). If such a cluster is found, the UAV is added to that cluster and removed from U (Lines 23 and 24). Finally, the algorithm checks whether every cluster meets the cardinality constraint. If any cluster still has fewer than c UAVs, the repair is deemed unsuccessful (Line 27); otherwise, the repaired solution ξ i is reconstructed from the adjusted clusters (Line 28).
This three-stage process systematically adjusts clusters to ensure both spatial compactness and sufficient cardinality, thereby maintaining feasibility for the subsequent optimization process.
Algorithm 4 Repair mechanism
Require: 
Invalid solution ξ i
Ensure: 
Repaired solution ξ i
1:
Initialize U
2:
for each cluster C k , k { 1 , , K } do                                             ▷ first stage
3:
      repeat
4:
            Compute cluster centroid χ k 1 | C k | u n C k p n
5:
             ( u i , u j ) arg max ( i , j ) I k ( 2 ) p i p j 2
6:
            if  p i p j 2 > d th  then
7:
                  u * arg max u * { u i , u j } p * χ k 2
8:
                  C k C k { u * } , U U { u * }
9:
            end if
10:
    until  p i p j 2 d th
11:
end for
12:
for each cluster C k with | C k | < c do                                         ▷ second stage
13:
        V k k { u n C k | C k | > c }
14:
       for  u n Shuffle( V ) (also, u n C k ) do
15:
              if  max i I k p n p i 2 d th and | C k | > c  then
16:
                       C k C k { u n } , C k C k { u n }
17:
                      break if  | C k | c
18:
              end if
19:
        end for
20:
end for
21:
for  u n Shuffle( U ) do                                                             ▷ third stage
22:
      Try to find C k minimizing p n χ k 2 subject to max i I k p n p i 2 d th
23:
      if such C k exists then
24:
         C k C k { u n } , U U { u n }
25:
      end if
26:
end for
27:
return  ξ i  if  C k such that | C k | < c                                           ▷ repair failed
28:
return  ξ i reconstructed from { C k } k = 1 K                                 ▷ repair succeed

5. Performance Evaluation

5.1. Experiment Setup

In this section, we evaluate the performance of the proposed algorithm under a typical scenario where UAVs and NCTs are randomly distributed within a 3D region. We investigate the impact of varying the number of UAVs and NCTs on the algorithm’s performance, measured by the objective function in P2, and assess the algorithm’s runtime. All experiments are executed on a Windows 11 laptop with an Intel Core i9-12900H CPU and 32 GB RAM.
The UAVs and NCTs are randomly and uniformly distributed within a 1000 × 1000 × 1000 3D environment. Table 1 presents the default parameter settings for the simulation experiments.

5.2. Baseline Algorithms

We compare the proposed MDQPSO-ASA algorithm with several baseline algorithms to evaluate its performance in dynamic UAV clustering and multi-target localization. The algorithms in the comparison include the following algorithms:
  • Simulated annealing (SA): A classical optimization algorithm that simulates the annealing process [28] to escape local optima.
  • SA with genetic algorithm (SA-GA): A hybrid optimization algorithm that combines simulated annealing with genetic algorithm [29] to enhance the search capability.
  • Discrete PSO (DPSO): A discrete version of PSO that enforces binary constraints [30] on the particle positions.
  • Quantum-inspired PSO (QPSO): A quantum-inspired optimization algorithm that uses quantum rotation gates [31] to update particle positions.
  • MDQPSO: The simplified MDQPSO-ASA algorithm without the ASA module.
In subsequent experiments, we test each algorithm in different cases 100 times and then take the average values along with the variances for performance evaluation.

5.3. Experiment Results and Analysis

5.3.1. Convergence Analysis

To evaluate the convergence properties of the proposed MDQPSO-ASA algorithm, we compare its performance with several baseline optimization algorithms, including MDQPSO, QPSO, and DPSO. Figure 5 illustrates the convergence curves of these algorithms over 30 iterations. In the first five iterations, MDQPSO-ASA reduces the objective slightly slower than QPSO. Between iterations 5 and 10, both curves of QPSO and MDQPSO-ASA stabilize, yet QPSO remains 1.45 % better than MDQPSO-ASA. From iterations 10 to 20, MDQPSO-ASA resumes lowering the objective, finding better solutions, while QPSO stops improving. In the final stage (iterations 20–30), MDQPSO-ASA continues to refine its solution and achieves the lowest objective value among all methods. In conclusion, MDQPSO-ASA establishes a more balanced exploration-exploitation trade-off through its adaptive mechanisms, leading to enhanced long-term stability and significantly lower final objective values. This improved convergence performance highlights the algorithm’s effectiveness in overcoming local optima and identifying high-quality solutions through dynamic strategy adjustments. Although we did not compare SA-based algorithms directly due to the inherent differences in their iteration structures, our subsequent experiments will include a comparative analysis of PSO-based and SA-based algorithms to provide a comprehensive performance evaluation.

5.3.2. Clustering Results for Different Scales

To illustrate the effectiveness of the proposed clustering algorithm, we present the clustering results for various scales of UAVs and NCTs. Figure 6 includes subfigures (a)–(d), each depicting different configurations with clusters color-coded for clarity, UAVs with the same color belong to the same cluster and collaboratively localize the NCT of the corresponding color: (a) 40 UAVs and 4 NCTs, (b) 60 UAVs and 6 NCTs, (c) 80 UAVs and 8 NCTs, and (d) 100 UAVs and 12 NCTs. These results demonstrate that the MDQPSO-ASA algorithm can efficiently form clusters that adhere to spatial and cardinality constraints, ensuring effective collaboration among UAVs for multi-target localization.

5.3.3. Solution Quality for Different Numbers of UAVs and NCTs

We compare the solution quality of different algorithms under different numbers of UAVs and NCTs. Figure 7 and Figure 8 illustrate the objective values obtained by each algorithm for different configurations.
Figure 7 presents the solution quality of various algorithms for different numbers of UAVs ( N = 40 , 50 , , 110 ) while maintaining a fixed number of NCTs at M = 8 . It is evident that in each configuration, the proposed MDQPSO-ASA method consistently achieves the lowest objective values, indicating its ability to find better feasible solutions compared to other algorithms. This demonstrates the superior solution quality of MDQPSO-ASA. Additionally, the performance hierarchy among the algorithms remains consistent across different scales, with MDQPSO-ASA performing the best and SA performing the worst. Furthermore, the experiments indicate that the ASA module enhances the solution quality of MDQPSO by comparing the performance between MDQPSO and MDQPSO-ASA. As the number of UAVs increases, the objective values of all algorithms tend to decrease, which is expected since more UAVs provide better coverage and collaboration opportunities for localization tasks. However, the rate of improvement varies among the algorithms. MDQPSO-ASA shows a more significant reduction in objective values compared to other algorithms, highlighting its effectiveness in leveraging additional UAVs for improved localization accuracy. Moreover, the stability of MDQPSO-ASA is evident from the relatively smaller variance in objective values across different runs, suggesting that the algorithm consistently finds high-quality solutions. In contrast, other algorithms exhibit larger variances, indicating less reliable performance. This robustness is particularly important in dynamic and uncertain environments where consistent performance is crucial.
Figure 8 illustrates the solution quality of different algorithms for different numbers of NCTs ( M = 4 , 6 , , 16 ) while keeping the number of UAVs fixed at N = 80 . MDQPSO-ASA consistently achieves the lowest objective values, demonstrating its superior performance in finding high-quality solutions. As the number of NCTs increases, the objective values of all algorithms rise, which is expected due to the increased complexity of clustering and localization. However, MDQPSO-ASA shows a more gradual increase, highlighting its robustness and adaptability. The results also indicate that MDQPSO-ASA maintains a smaller variance in objective values across different runs, suggesting stable and reliable performance. In contrast, other algorithms exhibit larger variances, indicating less consistent performance.
Overall, MDQPSO-ASA outperforms other algorithms in solution quality and maintains stable performance across different scales of UAVs or NCTs, making it an efficient algorithm for dynamic UAV clustering and multi-target localization.

5.3.4. Solution Quality for Different Key Threshold Parameters

We compare the solution quality of different algorithms under different key threshold parameters: (1) maximum allowable distance between UAVs in a cluster d th , and (2) minimal UAV number per cluster c. Figure 9 and Figure 10 illustrate the objective values obtained by each algorithm for different configurations.
First, to evaluate the robustness of the proposed algorithm under different clustering constraints, we conducted a sensitivity analysis by varying the maximum allowable distance between UAVs in a cluster d th from 700 to 1300. We recorded the mean value and the variance in the resulting objective function values. Figure 9 presents the solution quality of each algorithm for different settings of maximum allowable distance while all other parameters remain constant. As d th increases, the cluster formation becomes less spatially constrained, which generally allows more flexible UAV grouping. This leads to a gradual decrease in objective values for most algorithms, and the performance improvements vary across methods. Throughout the entire range of d th , MDQPSO-ASA consistently achieves the lowest mean objective values, indicating the highest solution quality. Moreover, its variance remains minimal, demonstrating strong stability and robustness to changes in spatial constraints. These results validate that MDQPSO-ASA not only adapts well to different spatial constraint settings but also provides superior and reliable performance with different d th settings.
Second, to further evaluate the robustness of the proposed algorithm, we conducted a sensitivity analysis with respect to the parameter c of minimal UAV number per cluster c, which defines the minimal UAV number per cluster. We varied c from 3 to 9 and recorded the mean value and the variance in the resulting objective function values. Figure 10 presents the solution quality of each algorithm for different settings of minimal UAV number per cluster, while all other parameters remain constant. As c increases, the algorithm faces stricter constraints in forming feasible clusters, which may reduce clustering flexibility and increase task difficulty. Meanwhile, it may also lead to an increase in the frequency of triggering the repair process and a decrease in the probability of successful repair. Across different values of c, MDQPSO-ASA consistently achieves the lowest mean objective function values, demonstrating its superior optimization ability under varying constraint levels. Notably, it maintains very low variance, ensuring consistent performance and high solution stability even when the system is forced to form larger clusters under stricter constraints. This indicates that the proposed algorithm not only achieves high-quality solutions on average but also exhibits strong reliability, which is critical for practical multi-target localization tasks where consistent outcomes are essential. These results confirm that MDQPSO-ASA is not only effective under normal conditions but also exhibits strong robustness when facing more stringent task constraints, such as requiring more UAVs per cluster. This robustness makes it particularly suitable for real-world multi-target localization missions where minimum resource guarantees per task are critical.

5.3.5. Evaluation of Repair Mechanism

The repair mechanism is an essential part of the proposed MDQPSO-ASA algorithm, so in this section, we provide the quantitative evaluation of this mechanism. Specifically, we consider the following two indicators
  • Triggering frequency of repair operation during an iteration of the optimization process, as denoted by f r .
  • Success rate of repair operation s r , i.e., the proportion of infeasible solutions that are successfully repaired into feasible ones.
First, we evaluated the trend of the triggering frequency and success rate of the repair mechanism with the number of iterations during the optimization process, as shown in Figure 11. As shown in Figure 11a, the initial solutions generated by the random initialization strategy are unable to satisfy the problem constraints, leading to a high frequency of repair operation activation in the early optimization phase. As the iterative process progresses, the clustering results gradually converge toward optimal configurations, accompanied by a steady decline in the frequency of constraint violations requiring repair. Notably, sporadic spikes in the repair frequency curve are observed during later iterations. These transient increases can be attributed to the stochastic nature of the simulated annealing mechanism, which may occasionally perturb existing solutions to generate new candidate solutions that violate constraints, thereby necessitating additional repair operations. As shown in Figure 11b, the repair mechanism consistently maintains a high success rate throughout the iterative optimization process, demonstrating its capability to effectively identify infeasible solutions and restore their constraint satisfaction. This characteristic verifies the enhancement of the repair mechanism on the robustness of the algorithm’s search procedure.
Second, we conducted a quantitative analysis of the repair mechanism’s behavior under different constraint levels. Specifically, two sets of experiments were designed: (1) varying the minimum number of UAVs per cluster c from 3 to 9, and (2) varying the maximum allowable distance between UAVs in a cluster d th from 700 to 1300. For each configuration, we recorded the trigger frequency and success rate of the repair process. The corresponding results are illustrated in Figure 12 and Figure 13. As shown in Figure 12a and Figure 13a, larger values of c and smaller values of d th represent more stringent constraints, under which the repair mechanism is triggered more frequently. Meanwhile, Figure 12b and Figure 13b demonstrate that the repair mechanism consistently maintains a high success rate across different constraint levels, indicating its robustness and effectiveness.

5.3.6. Computational Complexity Analysis

We provide a systematic analysis of the computational complexity of MDQPSO-ASA in terms of the number of UAVs N, NCTs M, swarms S and particle swarm size P.
First, the DQPSO module dominates when updating particle positions. At each iteration, computing the mean position, updating each particle and performing discretization requires O ( P · N ) operations per swarm, leading to an overall cost of O ( S · P · N ) per iteration.
Second, the ASA module introduces additional overhead when refining solutions. Every τ 1 iterations, ASA generates perturbed solutions and evaluates their energy differences in O ( P · N · M ) total operations per swarm, which amortizes to O ( S · P · N · M / τ 1 ) per iteration.
Third, the repair mechanism, which enforces intra-cluster distance constraints via a global greedy strategy, becomes the key bottleneck. Assuming UAVs are uniformly distributed across M clusters, computing and sorting all pairwise distances within each cluster incurs O ( ( N / M ) 2 log ( N / M ) ) per cluster; summing over M clusters and S swarms yields approximately O S · N 2 / M · log ( N / M ) per iteration. Then, the repair stage asymptotically determines the overall per-iteration complexity of MDQPSO-ASA as
O S · N 2 M log N M + N · M .
Fourth, the migration operation, which is invoked every τ 2 iterations to exchange particles among swarms, adds an amortized cost of O ( S · P / τ 2 ) per iteration.
By integrating all these parts, the overall time complexity of the MDQPSO-ASA algorithm is
O S · P · N + S · P · N · M τ 1 + S · N 2 M log N M + N · M + S · P τ 2 .
Given that in typical cooperative localization scenarios the number of UAVs N is significantly larger than the number of NCTs M, i.e., N M , the first stage of the repair mechanism largely determines the overall computational complexity per iteration in practical applications. Therefore, the asymptotic time complexity of the algorithm is given by
O S · N 2 M log N M .

5.3.7. Runtime Analysis for Different Numbers of UAVs and NCTs

To evaluate the efficiency of the proposed algorithm, we compare the runtime of different algorithms under varying scales of UAVs and NCTs. Table 2 presents the runtime results for scenarios with a fixed number of NCTs ( M = 8 ) and different numbers of UAVs.
The results show that while our algorithm (MDQPSO-ASA) is more complex, it achieves a balance between solution quality and runtime. Compared to other PSO-based algorithms, MDQPSO-ASA does not have a significant disadvantage in terms of runtime but consistently finds better solutions, as described in Section 5.3.3. On the other hand, it shows a substantial advantage over SA and SA-GA in both runtime and solution quality. Additionally, we also compared the runtime of the algorithms with a fixed number of UAVs and different numbers of NCTs. The experiments indicate that the runtime does not significantly change with different numbers of NCTs, demonstrating the robustness of our algorithm in different scenarios.

5.4. Discussion on Real-World Applications

Real-world UAV systems face significant challenges, including computational constraints, limited communication bandwidth, and environmental uncertainties. Regarding these issues, the proposed MDQPSO-ASA algorithm offers several key features. First, it ensures computational efficiency through a multi-swarm framework that supports parallelization, leverages hardware acceleration (e.g., via GPUs), and allows early termination with feasible solutions via a repair mechanism. Second, it optimizes communication by employing localized coordination to minimize global communication overhead, focusing on critical metrics (e.g., CRLB and velocity vectors). Third, the algorithm demonstrates robustness to noise by incorporating CRLB derivation that accounts for measurement noise and motion-induced errors. Notably, the MDQPSO-ASA algorithm allows extensions toward heterogeneous UAV clusters by considering diverse attributes (e.g., different error covariances of sensors and time thresholds for link stability) across UAV individuals. By incorporating these heterogeneous attributes into the cluster formation and optimization processes, the algorithm can better exploit the strengths of different UAV individuals, thereby enhancing overall system performance and adaptability in complex environments.
In addition, the MDQPSO-ASA framework can be adapted to semi-centralized scenarios where UAVs are intermittently connected to ground control stations. In such settings, the ground control stations could periodically collect UAV status information and use these data to perform centralized optimization using MDQPSO-ASA. The optimized clustering decisions, including task assignments and communication topologies, could then be redistributed back to the UAVs for execution. This approach not only leverages the computational power of ground stations to solve complex optimization problems but also reduces the onboard computational burden on UAVs, thereby conserving their limited energy resources. Furthermore, by allowing intermittent connectivity, this semi-centralized adaptation effectively balances global coordination and local autonomy, making the system more resilient to dynamic environmental changes, communication disruptions, and scalability challenges. Such a hybrid architecture could be particularly advantageous in large-scale UAV networks, where fully centralized control may become impractical due to communication overhead, while fully decentralized approaches may struggle to achieve optimal performance.

6. Conclusions

This paper establishes a novel dynamic clustering-based optimization model for multi-to-multi target localization in UAV clusters, followed by the development of the MDQPSO-ASA algorithm to solve this complex problem. The proposed MDQPSO-ASA algorithm demonstrates robust performance through its ability to balance exploration and exploitation while adhering to binary constraints. Experiment results demonstrate the effectiveness of the proposed algorithm in enhancing localization accuracy and system performance. Future research could extend this model to incorporate heterogeneous UAVs or explore scalability in larger-scale networks.

Author Contributions

Conceptualization, W.G. and L.D.; methodology, S.L. and Y.H.; software, S.L. and L.D.; validation, S.L., L.D., P.Y. and Y.H.; writing—original draft preparation, W.G. and L.D.; writing—review and editing, L.D., S.L. and P.Y.; project administration, W.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Key Research and Development Program of China under No. 2022YFA1004700 and the National Natural Science Foundation of China under Grant No. 62403360.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
UAVUnmanned Aerial Vehicle
NCTNon-Cooperative Target
CRLBCramér–Rao Lower Bound
CPDFConditional Probability Density Function
FIMFisher Information Matrix
TOATime of Arrival
SASimulated Annealing
ASAAdaptive Simulated Annealing
PSOParticle Swarm Optimization
DPSODiscrete Particle Swarm Optimization
QPSOQuantum-Inspired Particle Swarm Optimization
MDQPSOMulti-Swarm Discrete Quantum-Inspired Particle Swarm Optimization
SymbolsDefinitions
U , G Set of UAVs/NCTs.
p n , q m Position of UAV u n , NCT g m .
v n Velocity of UAV u n .
C k , I k k-th cluster, index set of UAVs in C k .
I k ( 2 ) Set of 2-element unordered pair in I k .
x n , k Indicator if UAV u n is in cluster C k .
y k , m Indicator if cluster C k is assigned to NCT g m .
d i , j Distance between UAV u i and UAV u j .
D i , j , V i , j Position/velocity similarity between UAV u i and UAV u j .
d k Diameter of cluster C k .
D k , V k , Φ k Position/velocity/motion similarity of cluster C k .
W i , j Communication link consistency between UAV u i and UAV u j .
W k Communication link consistency of cluster C k .
r m , n Ideal distance between UAV u n and NCT g m .
r ^ m , n Noisy distance estimate between UAV u n and NCT g m .
δ m , n Total observed localization error for NCT m by UAV n.
Σ m , n Total covariance matrix of observed position error.
J FIM.
B k , m CRLB for NCT m by cluster C k .

References

  1. Wang, Y.; Su, Z.; Xu, Q.; Li, R.; Luan, T.H.; Wang, P. A secure and intelligent data sharing scheme for UAV-assisted disaster rescue. IEEE/ACM Trans. Netw. 2023, 31, 2422–2438. [Google Scholar] [CrossRef]
  2. Li, X.; Huang, H.; Savkin, A.V. Autonomous navigation of an aerial drone to observe a group of wild animals with reduced visual disturbance. IEEE Syst. J. 2022, 16, 3339–3348. [Google Scholar] [CrossRef]
  3. Liu, K.; Chen, B.M. Industrial UAV-based unsupervised domain adaptive crack recognitions: From database towards real-site infrastructural inspections. IEEE Trans. Ind. Electron. 2022, 70, 9410–9420. [Google Scholar] [CrossRef]
  4. Dil, B.; Dulman, S.; Havinga, P. Range-based localization in mobile sensor networks. In Proceedings of the Wireless Sensor Networks (EWSN), Zurich, Switzerland, 13–15 February 2006; pp. 164–179. [Google Scholar]
  5. Cavassila, S.; Deval, S.; Huegen, C.; van Ormondt, D.; Graveron-Demilly, D. Cramér–Rao bounds: An evaluation tool for quantitation. NMR Biomed. 2001, 14, 278–283. [Google Scholar] [CrossRef]
  6. Zhang, R.; Wu, X.; Lou, Y.; Yan, F.G.; Zhou, Z.; Wu, W.; Yuen, C. Channel-training-aided target sensing for terahertz integrated sensing and massive MIMO communications. IEEE Internet Things J. 2025, 12, 3755–3770. [Google Scholar] [CrossRef]
  7. Zhang, R.; Wu, W.; Chen, X.; Gao, Z.; Cai, Y. Terahertz integrated sensing and communication-empowered UAVs in 6G: A transceiver design perspective. IEEE Veh. Technol. Mag. 2025; early access. [Google Scholar] [CrossRef]
  8. Zhang, R.; Cheng, L.; Wang, S.; Lou, Y.; Wu, W.; Ng, D.W.K. Tensor decomposition-based channel estimation for hybrid mmWave massive MIMO in high-mobility scenarios. IEEE Trans. Commun. 2022, 70, 6325–6340. [Google Scholar] [CrossRef]
  9. Panwar, K.; Katwe, M.; Babu, P.; Ghare, P.; Singh, K. A majorization-minimization algorithm for hybrid TOA-RSS based localization in NLOS environment. IEEE Commun. Lett. 2022, 26, 1017–1021. [Google Scholar] [CrossRef]
  10. Wang, W.; Zhang, Y.; Tian, L. TOA-based NLOS error mitigation algorithm for 3D indoor localization. China Commun. 2020, 17, 63–72. [Google Scholar] [CrossRef]
  11. Xu, S.; Ou, Y.; Wu, X. Optimal sensor placement for 3-D time-of-arrival target localization. IEEE Trans. Signal Process. 2019, 67, 5018–5031. [Google Scholar] [CrossRef]
  12. Zhao, W.; Goudar, A.; Schoellig, A. Finding the right place: Sensor placement for UWB time difference of arrival localization in cluttered indoor environments. IEEE Robot. Autom. Lett. 2022, 7, 6075–6082. [Google Scholar] [CrossRef]
  13. Tseng, P. Convergence of a block coordinate descent method for nondifferentiable minimization. J. Optim. Theory Appl. 2001, 109, 475–494. [Google Scholar] [CrossRef]
  14. Han, Y.; Li, G.; Xu, R.; Su, J.; Li, J.; Wen, G. Clustering the wireless sensor networks: A meta-heuristic approach. IEEE Access 2020, 8, 214551–214564. [Google Scholar] [CrossRef]
  15. Verma, S.; Zeadally, S.; Kaur, S.; Sharma, A.K. Intelligent and secure clustering in wireless sensor network (WSN)-based intelligent transportation systems. IEEE Trans. Intell. Transp. Syst. 2021, 23, 13473–13481. [Google Scholar] [CrossRef]
  16. Guo, J.; Gao, H.; Liu, Z.; Huang, F.; Zhang, J.; Li, X.; Ma, J. ICRA: An intelligent clustering routing approach for UAV ad hoc networks. IEEE Trans. Intell. Transp. Syst. 2022, 24, 2447–2460. [Google Scholar] [CrossRef]
  17. El Khediri, S.; Fakhet, W.; Moulahi, T.; Khan, R.; Thaljaoui, A.; Kachouri, A. Improved node localization using K-means clustering for wireless sensor networks. Comput. Sci. Rev. 2020, 37, 100284. [Google Scholar] [CrossRef]
  18. Wu, X.; Zhang, M.; Wang, X.; Zheng, Y.; Yu, H. Hierarchical task assignment for multi-UAV system in large-scale group-to-group interception scenarios. Drones 2023, 7, 560. [Google Scholar] [CrossRef]
  19. Chen, J.; Zhang, Y.; Wu, L.; You, T.; Ning, X. An adaptive clustering-based algorithm for automatic path planning of heterogeneous UAVs. IEEE Trans. Intell. Transp. Syst. 2021, 23, 16842–16853. [Google Scholar] [CrossRef]
  20. Arafat, M.Y.; Moh, S. Localization and clustering based on swarm intelligence in UAV networks for emergency communications. IEEE Internet Things J. 2019, 6, 8958–8976. [Google Scholar] [CrossRef]
  21. Garraffa, G.; Sferlazza, A.; D’Ippolito, F.; Alonge, F. Localization based on parallel robots kinematics as an alternative to trilateration. IEEE Trans. Ind. Electron. 2022, 69, 999–1010. [Google Scholar] [CrossRef]
  22. Zhang, M.; Li, J.; Wu, X.; Wang, X. Coalition game based distributed clustering approach for group oriented unmanned aerial vehicle networks. Drones 2023, 7, 91. [Google Scholar] [CrossRef]
  23. Yin, F.; Fritsche, C.; Gustafsson, F.; Zoubir, A.M. TOA-based robust wireless geolocation and Cramér–Rao lower bound analysis in harsh LOS/NLOS environments. IEEE Trans. Signal Process. 2013, 61, 2243–2255. [Google Scholar] [CrossRef]
  24. Moreno-Salinas, D.; Pascoal, A.; Aranda, J. Optimal sensor placement for acoustic underwater target positioning with range-only measurements. IEEE J. Ocean. Eng. 2016, 41, 620–643. [Google Scholar] [CrossRef]
  25. Xu, S.; Rice, M.; Rice, F. Optimal TOA-sensor placement for two target localization simultaneously using shared sensors. IEEE Commun. Lett. 2021, 25, 2584–2588. [Google Scholar] [CrossRef]
  26. He, J.; Ho, D.K.C.; Xiong, W.; So, H.C.; Chun, Y.J. Cramér–Rao lower bound analysis for elliptic localization with random sensor placements. IEEE Trans. Aerosp. Electron. Syst. 2024, 60, 5587–5595. [Google Scholar] [CrossRef]
  27. Guvenc, I.; Chong, C.C. A survey on TOA based wireless localization and NLOS mitigation techniques. IEEE Commun. Surv. Tutor. 2009, 11, 107–124. [Google Scholar] [CrossRef]
  28. Rutenbar, R. Simulated annealing algorithms: An overview. IEEE Circuits Devices Mag. 1989, 5, 19–26. [Google Scholar] [CrossRef]
  29. Adler, D. Genetic algorithms and simulated annealing: A marriage proposal. In Proceedings of the IEEE International Conference on Neural Networks, San Francisco, CA, USA, 28 March–1 April 1993; Volume 2, pp. 1104–1109. [Google Scholar] [CrossRef]
  30. Wu, H.; Nie, C.; Kuo, F.C.; Leung, H.; Colbourn, C.J. A discrete particle swarm optimization for covering array generation. IEEE Trans. Evol. Comput. 2015, 19, 575–591. [Google Scholar] [CrossRef]
  31. Jeong, Y.W.; Park, J.B.; Jang, S.H.; Lee, K.Y. A new quantum-inspired binary PSO: Application to unit commitment problems for power systems. IEEE Trans. Power Syst. 2010, 25, 1486–1495. [Google Scholar] [CrossRef]
Figure 1. An example scenario of dynamic UAV clusters for multi-target localization, where 9 UAVs are divided into 3 clusters C 1 , C 2 , C 3 , while 3 NCTs g 1 , g 2 , g 3 are assigned to the clusters, respectively.
Figure 1. An example scenario of dynamic UAV clusters for multi-target localization, where 9 UAVs are divided into 3 clusters C 1 , C 2 , C 3 , while 3 NCTs g 1 , g 2 , g 3 are assigned to the clusters, respectively.
Sensors 25 02857 g001
Figure 2. The positions and relative velocity of two UAVs u i and u j in the mobility-based model for defining communication link consistency W i , j .
Figure 2. The positions and relative velocity of two UAVs u i and u j in the mobility-based model for defining communication link consistency W i , j .
Sensors 25 02857 g002
Figure 3. Illustration of the composite covariance matrix Σ m , n from two error components: position error δ n loc and motion-induced error δ n vel (denoted as Σ n loc and Σ n vel in Euclidean space, respectively). The red and blue lines represent error distributions, and σ ( · ) represents the standard deviation of the error.
Figure 3. Illustration of the composite covariance matrix Σ m , n from two error components: position error δ n loc and motion-induced error δ n vel (denoted as Σ n loc and Σ n vel in Euclidean space, respectively). The red and blue lines represent error distributions, and σ ( · ) represents the standard deviation of the error.
Sensors 25 02857 g003
Figure 4. Framework of the proposed MDQPSO-ASA algorithm, which integrates multi-swarm optimization, discrete quantum-inspired PSO, adaptive simulated annealing, and a repair mechanism for solutions.
Figure 4. Framework of the proposed MDQPSO-ASA algorithm, which integrates multi-swarm optimization, discrete quantum-inspired PSO, adaptive simulated annealing, and a repair mechanism for solutions.
Sensors 25 02857 g004
Figure 5. Curves for objective functions of different algorithms over 30 iterations.
Figure 5. Curves for objective functions of different algorithms over 30 iterations.
Sensors 25 02857 g005
Figure 6. Clustering results for different scales of UAVs and NCTs.
Figure 6. Clustering results for different scales of UAVs and NCTs.
Sensors 25 02857 g006
Figure 7. Solution quality for different numbers of UAVs compared to baseline algorithms with M = 8 NCTs.
Figure 7. Solution quality for different numbers of UAVs compared to baseline algorithms with M = 8 NCTs.
Sensors 25 02857 g007
Figure 8. Solution quality for different numbers of NCTs compared to baseline algorithms with N = 80 UAVs.
Figure 8. Solution quality for different numbers of NCTs compared to baseline algorithms with N = 80 UAVs.
Sensors 25 02857 g008
Figure 9. Solution quality for different maximum allowable distances between UAVs in a cluster compared to baseline algorithms.
Figure 9. Solution quality for different maximum allowable distances between UAVs in a cluster compared to baseline algorithms.
Sensors 25 02857 g009
Figure 10. Solution quality for different minimal UAV numbers per cluster compared to baseline algorithms.
Figure 10. Solution quality for different minimal UAV numbers per cluster compared to baseline algorithms.
Sensors 25 02857 g010
Figure 11. Triggering frequency and success rate of repair operation during optimization process.
Figure 11. Triggering frequency and success rate of repair operation during optimization process.
Sensors 25 02857 g011
Figure 12. Triggering frequency and success rate of repair operation for different c.
Figure 12. Triggering frequency and success rate of repair operation for different c.
Sensors 25 02857 g012
Figure 13. Triggering frequency and success rate of repair operation for different d th .
Figure 13. Triggering frequency and success rate of repair operation for different d th .
Sensors 25 02857 g013
Table 1. Default parameter settings for the simulation experiments.
Table 1. Default parameter settings for the simulation experiments.
ParametersValuesDescriptions
c4Minimum number of UAVs per cluster, since TOA localization model in 3D space requires at least 4 sensor nodes [27].
d th 1000Maximum allowable distance between UAVs in a cluster.
α 1 1.0Weight factor for CRLB in the objective function.
α 2 0.5Weight factor for motion similarity in the objective function.
α 3 0.5Weight factor for communication link consistency in the objective function.
β 0.5Acceleration coefficient in DQPSO.
T 0 100Initial temperature for simulated annealing.
r 0 0.01Initial cooling rate for simulated annealing.
τ 1 5Interval for applying simulated annealing (iterations).
τ 2 5Interval for migration operation (iterations).
Table 2. Runtime (s) of different algorithms for different numbers of UAVs with M = 8 NCTs.
Table 2. Runtime (s) of different algorithms for different numbers of UAVs with M = 8 NCTs.
Number of UAVsSASA-GADPSOQPSOMDQPSOMDQPSO-ASA
404.86614.6633.1003.1592.9703.304
5013.63831.0299.1218.8768.5639.362
6018.94653.49512.68212.93512.14412.818
7021.23482.97716.71316.50515.18716.895
8025.69695.77519.27019.21118.91719.715
9027.772122.97722.43122.54621.46123.612
10032.818150.15530.22030.51929.58832.520
11034.325165.01234.99934.83032.03635.032
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gong, W.; Lou, S.; Deng, L.; Yi, P.; Hong, Y. Efficient Multi-Target Localization Using Dynamic UAV Clusters. Sensors 2025, 25, 2857. https://doi.org/10.3390/s25092857

AMA Style

Gong W, Lou S, Deng L, Yi P, Hong Y. Efficient Multi-Target Localization Using Dynamic UAV Clusters. Sensors. 2025; 25(9):2857. https://doi.org/10.3390/s25092857

Chicago/Turabian Style

Gong, Wei, Shuhan Lou, Liyuan Deng, Peng Yi, and Yiguang Hong. 2025. "Efficient Multi-Target Localization Using Dynamic UAV Clusters" Sensors 25, no. 9: 2857. https://doi.org/10.3390/s25092857

APA Style

Gong, W., Lou, S., Deng, L., Yi, P., & Hong, Y. (2025). Efficient Multi-Target Localization Using Dynamic UAV Clusters. Sensors, 25(9), 2857. https://doi.org/10.3390/s25092857

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop