Next Article in Journal
A Vision-Based End-to-End Reinforcement Learning Framework for Drone Target Tracking
Previous Article in Journal
Energy Consumption Performance of a VTOL UAV In and Out of Ground Effect by Flight Test
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Joint Optimization Control Algorithm for Passive Multi-Sensors on Drones for Multi-Target Tracking

Naval Aviation University, Yantai 264001, China
*
Author to whom correspondence should be addressed.
Drones 2024, 8(11), 627; https://doi.org/10.3390/drones8110627
Submission received: 3 September 2024 / Revised: 26 October 2024 / Accepted: 27 October 2024 / Published: 30 October 2024

Abstract

:
A distributed network of multiple unmanned aerial vehicles (UAVs) equipped with airborne passive bistatic radar (APBR) can form a passive detection network through cooperative networking technology, a novel passive early warning detection system. Its multi-target tracking performance has a significant impact on situational awareness of the detection area. This paper proposes a passive multi-sensors joint optimization control algorithm based on task adaptive switching, with the aim of addressing the impact of limited UAV sensors’ field of view (FOV) on multi-target tracking performance in APBR networks. Firstly, for a single UAV node, the Poisson Labeled Multi-Bernoulli (PLMB) filter is selected as the local filter of each node, with the objective of obtaining the local multi-target density independently. Subsequently, the consensus arithmetic average fusion rule is employed to address the multi-sensors density fusion problem in APBR networks. This enables the acquisition of the global multi-target density and multi-target tracks of the network. The task adaptive switching mechanism of the nodes is constructed further based on the partially observable Markov decision process (POMDP), and the objective functions for the UAV to perform the search task and the tracking task are derived based on differential entropy, respectively. Ultimately, a multi-node joint optimization control algorithm is devised. The simulation experiment demonstrates that the proposed algorithm is capable of effective control of multiple nodes to solve the multi-target search and tracking problem when the node FOV is limited. This further improves the multi-target tracking and fusion capability of the distributed APBR network.

1. Introduction

Airborne Passive Bistatic Radar (APBR) is a novel radar system that employs third-party non-cooperative emitters’ signals (including radio and television signals [1,2,3], communication and navigation signals [4], and non-cooperative radar signals [5], among various others) as the emitters and relies on onboard passive sensors to detect and track targets. This radar system does not emit electromagnetic waves actively; as a result, it has significant advantages in anti-stealth, anti-jamming, and low-slow small target detection. The rapid advancement of unmanned aerial vehicle (UAV) technology has enabled the networking of UAV formations equipped with passive sensors, thereby creating a distributed APBR network [6]. This network facilitates the fusion of multi-sensor information within the detection area, enhancing the flexibility and stability of the system’s early warning and detection capabilities. This innovative application has attracted significant research attention in recent years. The passive multi-sensor, multi-target tracking process in distributed APBR networks will continue to face challenges such as missed detections, false alarms, trajectory interruptions, and label switching. These issues stem from inherent limitations in sensor field of view (FOV), signal range, and the strength of non-cooperative radiation sources. However, by optimizing and collectively controlling the multi-sensors, the detection efficiency of the targets can be enhanced, thereby improving the multi-target tracking performance of the network. Therefore, achieving a rational joint optimal control of airborne passive multi-sensors is a crucial issue to address when utilizing distributed APBR networks for multi-target tracking tasks.
In a distributed APBR network, the detection and tracking performance of each sensor must be considered in addition to the network’s information fusion capabilities. The spatial distribution of multi-sensors, resource allocation, network topology, and other factors influence the network’s multi-target tracking performance. The current mainstream multi-target tracking algorithms are based on the theoretical framework of Random Finite Sets (RFS). Examples of such algorithms include Probability Hypothesis Density (PHD) filters [7], Cardinalized PHD (CPHD) filters [8], Multi-Bernoulli (MB) filter [9], and Poisson multi-Bernoulli mixture (PMBM) filter [10,11]. Furthermore, researchers have developed several sophisticated filters for estimating multi-target tracks, including generalized labeled multi-Bernoulli (GLMB) filters [12], LMB filters [13], Gibbs-GLMB filters [14], and Poisson labeled multi-Bernoulli (PLMB) filters [15], among others. To improve the multi-target tracking performance of distributed multi-sensor systems, multi-sensor fusion algorithms based on RFS theory have also been intensively studied. These algorithms are mainly based on two fusion rules, Geometric Average (GA) [16,17,18] and Arithmetic Average (AA) [19,20,21,22], and have emerged as distributed PHD filters [23,24], consensus CPHD filters [24], consensus LMB filters [18,25], and label-matched GA-LMB filters [26], among others. However, when the FOV of each node in a distributed APBR network is limited, it is not possible to obtain information on all interesting targets in the detection area solely through multi-target information fusion. Consequently, it is necessary to study multi-target control management and resource optimization algorithms to improve the target search and tracking capabilities of distributed APBR networks.
In essence, the design of multi-sensor optimal control for multi-target tracking tasks aims to enhance the future performance of the sensor network by adjusting detection resources (such as power and bandwidth) or the motion states (such as position and heading) of each sensor. Given that distributed APBR networks consist of passive sensors and non-cooperative radiation sources, the optimized control objective is focused on the motion state. Once the optimized control objective is established, the control algorithm needs to construct a target function pertaining to the control objective to determine the optimal control command. Consequently, the choice of target function plays a pivotal role in the decision-making process of multi-target sensor joint optimal control algorithms. In [27], a posteriori expected target number error and multi-target state estimation error are employed as objective functions in a proposed sensor control management approach. This was performed to quantify the variation of target tracking errors during the control process. In [28], a multi-target posteriori discretization based on optimal sub-pattern assignment (OSPA) is employed to quantify the confidence of target state estimation accuracy, with the confidence serving as the objective function. However, this approach results in the loss of some target information during estimation, which is suboptimal for decision-making. In light of the aforementioned considerations, R’enyi divergence is employed in [29] as a metric for quantifying the change in multi-target information during the control process, utilizing it as a measure of multi-target a posteriori density information gain. Similarly, Cauchy-Schwarz (CS) divergence is adopted in [30,31] and Kullback–Leibler (KL) divergence is adopted in [32] as measures of multi-target a posteriori density information gain for constructing the objective function of multi-sensor optimal control, respectively. Furthermore, the notion of differential entropy is expanded to encompass RFS theory in [33]. This resulted in the derivation of the value function associated with multi-target density information based on differential entropy. Additionally, the value function was presented as the objective function for target searching and target tracking tasks, respectively. This approach effectively reduced the computational burden and enhanced the efficiency of control decisions.
Considering limited node FOV in a distributed APBR network poses challenges for complete target detection in the specified area. This paper investigates a multi-node joint optimal control algorithm suitable for distributed APBR networks. The goal is to simultaneously achieve target search and tracking tasks to effectively enhance the multi-target tracking fusion performance of the network. Firstly, in terms of the selection of local node filters, this paper models detected and undetected targets using the PLMB filter to estimate the posteriori density of multiple targets and the density information of undetected targets. Secondly, in the design of information fusion algorithms for distributed networks, this paper adopts a flooding communication strategy [34] to ensure that all nodes can perceive target information from other nodes in the network. Additionally, applying the arithmetic mean fusion criterion can reduce computational complexity and promote multi-target density fusion. In the design of optimization control algorithms for multiple sensor nodes, this paper employs a task-adaptive switching mechanism to decompose the multi-target optimization problem into single-target optimization problems. Furthermore, target functions for tracking and search tasks are derived based on the differential entropy function and density function. Finally, the use of the Cubature Kalman Filter (CKF) [35] based on the Gaussian Mixture (GM) method [36] realizes local filtering for nodes and multi-node density fusion. Additionally, using a greedy algorithm [33,37] solves the single-task optimization problem, thereby obtaining optimal control commands and promoting real-time joint optimization and control of multiple nodes, enhancing the multi-target tracking fusion performance of distributed APBR networks. Figure 1 provides an overview of the research that forms the core of this paper.

2. Problem Description and Mathematical Model

Within a distributed APBR network, each sensor node must initially acquire multi-target fusion information. Subsequently, they are required to define a specific optimization objective function based on their individual task requirements. Following this, an optimal control algorithm is deployed to select optimal control instructions from a set of alternatives, which are then executed to enhance multi-target control and tracking performance. Following this process description, the multi-target tracking fusion and multi-sensor optimal control problem can be segmented into two components: the former involves a parameter estimation problem, while the latter centers around an optimization decision problem. The mathematical models for these two issues will be elucidated later.

2.1. Multi-Target Tracking Model for Distributed APBR Networks

A distributed APBR network performing multi-target tracking tasks within a detection area is considered, where each node of the network represents a UAV equipped with passive sensors and will be referred to as a node hereafter. Due to the limited FOV of each node, a single node may not be able to detect all the targets appearing in the network detection area at a certain moment; the local multi-target density needs to be able to describe both the undetected and detected targets in the network. In this paper, we consider the PLMB filter as a local filter for each node and use the Poisson point process (PPP) density [35] to model potential targets that have not yet been detected and the LMB density to model detected targets. Assuming that the multi-target state RFS at time k 1 is denoted as X k 1 , and the targets within the node FOV will either die out or born in, without considering the target spawning phenomenon, then the multi-target state set at time k consists of the targets that survive from time k 1 to time k and the targets that are newly born at time k , then the multi-target state RFS at time can be modeled as follows:
X k = x X k 1 X ˜ S x X B .
In the PLMB filter, the multi-target state RFS X k at the time k is represented by the union of the set X k u of potential targets that have not yet been detected and the set X ˜ k d of detected targets, as in the following:
X k = X k u X ˜ k d .
Equation (2) denotes X k = X k u     X ˜ k d and X k u     X ˜ k d = , then, the PLMB density form of the multi-target state set can be expressed as the following:
π P L M B = X k π P P P X k u π L M B     X ˜ k d .
π P P P X k u and π L M B   X ˜ k d denote the PPP density distribution and LMB density distribution, respectively, with the expressions as follows:
π P P P X k u = e λ x X k u λ p x ,
π L M B   X ˜ k d = Δ X ˜ k d i L \ L ( X ˜ k d ) 1 r i l L ( X ˜ k d ) 1 L l r l p X ˜ k d .
where r l denotes the existence probability of the target l , p x denotes the Probability Density Function (PDF) of the target l , λ denotes the expected number of targets, μ x = λ p x denotes the intensity function; L denotes the label space, l denotes the target identity label. The set of labels of a set X ˜ k d is denoted by Notation: L : X × L L denotes the projection L x ˜ = l , then the set L of labels is denoted as L = L X ˜ k d = l L : x ˜ X ˜ k d   s . t .   L x ˜ = l , Δ X ˜ k d denotes the labeling distinct indicator when X ˜ k d = L X ˜ k d , Δ X ˜ k d = 1 , otherwise Δ X ˜ k d = 0 , · denotes the cardinality of the set to be solved.
The multi-target measurement of the APBR network node is represented by the combination of the target’s Direction of Arrival (DOA), the bistatic range (also known as Time Difference of Arrival (TDOA)), and the range rate (also known as Frequency Difference of Arrival (FDOA)). Assuming that the set of multi-target measurements received by the node i at the time k is Z k , the state of the detected target l is denoted as x ˜ k l = ( p k l , v k l ) , l 1 , 2 , , M , where N = X ˜ k . The state of the emitter is denoted as x k s = ( p k s , v k s ) , s 1 , 2 , , S , and the state of the node itself is denoted as x k i = ( p k i , v k i ) , and the vectors p and v denote the position and velocity components of the above entity, respectively, and the vector dimension is determined by the space in which the target entity is located. As nodes will inevitably result in the generation of false alarm clutter during the detection process, the multi-target measurement set is typically composed of target point measurements and clutter measurements, expressed in the following manner:
Z k = x ˜ X ˜ k z x ˜ Z k c ,
z x ˜ k l = h ( x ˜ k l , x k i , x k s ) + υ k ,   x ˜ k l   detected   in   FOV ,                                             otherwise ,
h x ˜ k l , x k i , x k s = ϕ k l ; ϕ k s ; d k l ; d ˙ k l ,
π Z k c = e < 1 , K k > · K k Z k c .
h x ˜ k l , x k i , x k s contains the return signal DOA ϕ k l of the target l , the DOA ϕ k s of the direct wave signal of the emitter, TDOA d k l , and FDOA d ˙ k l . υ k denotes the independent identically distributed measurement noise whose vector dimension is determined by h x ˜ k l , x k i , x k s . Assume that υ k ~ N 0 , V k , N 0 , V k denotes a Gaussian distribution with mean 0 and covariance V k of the measurement noise. The formula for calculating the angular information in Equation (8) is determined by the spatial dimension of the target entity in the detection scene, and the formula for calculating the bistatic range is as follows:
d k l = d k s l + d k i l d k s i
where, d k s = p k s p k , d k i l = p k i p k l , d k s i = p k s p k i , · denotes the Euclidean norm. The calculation of the bistatic range rate is given by the following:
d ˙ k l = d ˙ k s l + d ˙ k i l d ˙ k s i
where d ˙ k s l = ( p k s p k l ) T · ( v k s v k l ) p k s p k l , d ˙ k i l = ( p k i p k l ) T · ( v k i v k l ) p k i p k l , d ˙ k s i = ( p k s p k i ) T · ( v k s v k i ) p k s p k i .
The clutter is modeled as a Poisson RFS distribution with intensity function K k · = λ k c · , λ k denotes the clutter rate, i.e., the average number of clutters, and c · denotes the spatial PDF of the clutter. Given a multi-target LRFS X ˜ k , the multi-target likelihood function of the measurements RFS Z k can be computed by combining Equations (6)–(11) as follows:
g Z k X ˜ k = e < 1 , K k > · K k Z k c θ Θ L ψ Z k x ˜ ; θ X ˜ k ,
ψ Z k x ˜ ; θ = p d x ˜ g z θ l x ˜ K k z θ l , θ l > 0 q d x ˜ ,                                     θ l = 0 .
where θ : L { 0 , 1 , , | Z | } denotes the association mapping function that satisfies: when and only when i = i , there is θ i = θ ( i ) > 0 , Θ L denotes the set of association mapping functions whose domain of definition is L . p d x ˜ denotes the probability that APBR detects the target l , and the corresponding miss detection probability is denoted as q d x ˜ = 1 p d x ˜ . When the node FOV is finite, the detection probability can be modeled as follows:
p d x ˜ = P D , p k i p k l R FOV 0 , p k i p k l > R FOV
After each node receives the multi-target measurements within its respective FOV, it runs the PLMB filter locally to obtain the local multi-target densities and then interacts with the information using the flooding strategy [31] and selects the optimal fusion rule to perform the multi-sensor density fusion, and finally, the obtained global multi-target densities of the network are used for the subsequent optimal control of the sensors. Considering the nonlinearity of the measurements, the specific implementation of the local PLMB filter chooses CKF as the embedded filtering algorithm, which is implemented using the GM method.

2.2. Multi-Node Optimal Control Model for Distributed APBR Networks

When dealing with multi-node optimal control problems, the nodes are not informed of the measurement results received after the execution of control commands, resulting in a decision-making process that often faces the challenge of incomplete information. In view of this, the distributed passive multi-sensor optimal control problem can be modeled as a Partially Observable Markov Decision Process (POMDP) [11], i.e., the node’s control command at the next moment is determined only by the states of the UAV, multi-target, and non-cooperative emitters at the current moment, independent of the state at the previous moment’s state. The following key elements need to be included when modeling a multi-sensor optimal control problem using POMDP: (1) a multi-target state set, (2) a multi-target measurement data set, (3) a state transfer model and a measurement model for the target, (4) a set of control instructions for the node, and (5) a reward or cost function associated with the control instructions. The first three elements are given in the multi-target tracking model in Section 2.1, and the last two elements are modeled next.
In principle, the space of the set of control instructions should be infinite and continuous, but to ensure the running efficiency of the algorithm, this paper needs to simplify the control process. Assuming that the set of control instructions of all nodes is represented by a finite set, only the control instructions of each node are allowed to be selected within the given finite set. Given the node i R , R is the set composed of all nodes in the network, Let R = R denotes as the number of nodes in the network, the sensor state transfer model under control instruction optimization is expressed as follows:
x k + 1 i = f x k i , c k i .
where c k i denotes the node control command vector consisting of heading φ k i at the time k . f · denotes the state transfer function of the sensor i . The set of all alternative control commands for a node i is denoted by C k i , and the set of joint optimization control commands for all nodes is denoted by C k = C k 1 C k 2 C k R . The optimal joint optimization control instruction at the time is denoted as c k o p t = c 1 , k o p t , c 2 , k o p t , , c R , k o p t C k . The joint optimization control instruction at the time k is c k , and the control horizon length of all nodes is l , and the control horizon length indicates that after the control instruction is determined at the time k , the control instruction remains unchanged for the subsequent time period k , k + l . The Predicted Ideal Measurement Sets (PIMS) received by a node after executing a control command is denoted as Z ˜ k + l i c k i , and the concatenated set of PIMS received by all nodes at the time is denoted as Z ˜ k + l c k = i = 1 : R Z ˜ k + l i c k i . Each node uses the received PIMS to run the PLMB filter locally to update and fuse the multi-target state a posteriori density information, and the global multi-target a posteriori density of the network at time k + l can be obtained, which is denoted as π ¯ k + l c k . The optimal joint optimization control command in the POMDP framework can be solved by building an optimization model with constraints, which is given by the following expression:
c k opt = arg min c k C k / arg max c k C k E θ π ¯ k + l k , Z ˜ k + l c k s . t . x k + l i = f x k i , c k i z ˜ k + l c k = h x k + l i , x s , x k π k + l k i = π P L M B x k + l i , z ˜ k + l π ¯ k + l k = f u s e d i = 1 : R π k + l k i .
where θ π ¯ k + l k , Z ˜ k + l c k denotes the objective optimization function specified by the task demand, which is used to measure the multi-target tracking fusion performance under the optimized control instructions, and E · denotes the expectation. The selection of the objective function is determined by different task requirements, and the specific selection and corresponding theoretical analysis of this paper will be given subsequently. In addition, to ensure that the objective functions calculated by different control instructions have obvious differentiation, it is necessary to set a reasonable length of the control FOV so that the optimization algorithm can quickly converge while minimizing the amount of computation.
In summary, the main process of multi-node joint optimization control is as follows: first, each node calculates the predicted multi-target density π ¯ k + l | k at time k + l based on the global multi-target density of the network at time k . In this process, the newborn and death of multi-targets are not considered, and the predicted multi-target density is called pseudo-predicted density. Then, the multi-target states X ^ k + l at time k + l are extracted from the pseudo-predictive density and combined with the control command alternative set C k of optimized node states to calculate the PIMS Z k + l i C k i | i = 1 : R obtained by all node. Here, the PIMS does not consider any measurement noise, association error, or false alarm clutter, so it is called the pseudo measurement set. Each sensor uses the PIMS to execute the PLMB filter to obtain the updated pseudo-posteriori density π k + l i | i = 1 : R C k , which is fused to obtain the global pseudo-posteriori density π ¯ k + l C k of the network at time k + l . Finally, the pseudo-predicted density π ¯ k + l | k and the pseudo-posteriori density π ¯ k + l C k are jointly used in the computation of the objective function to decide the optimal control commands. The distributed passive multi-node joint optimal control flow is shown in Figure 2.

3. Multi-Target Density Fusion Method Under Limited FOV

This paper emphasizes the utilization of the PLMB filter at the node level to facilitate track initiation for new targets and continuous tracking of existing targets within the field of view during the optimal control process. Detailed implementation specifications of the PLMB filter can be found in [15]. The adoption of the PLMB filter by network nodes leads to labeled multi-target densities; however, this fusion process may exhibit label inconsistencies, resulting in suboptimal fusion effects. To address this issue, a solution for multi-target density fusion in distributed APBR networks is presented in this section.

3.1. AA Fusion Based on LMB Density

Considering that the detection probability of passive sensors is low, and the label-matching process also increases a certain amount of computation, this paper prioritizes the AA fusion rule to avoid an excessive computational load of sensor nodes and reduce the missing of target detection [36].
The multi-target density obtained by any node in the network after locally executing the PLMB filter obeys the PLMB distribution, so the multi-target density can be decomposed into the PPP density and the LMB density. The fusion of the PPP density is mainly carried out with respect to its intensity function and the AA fusion process, which can be referenced to the PHD-AA fusion algorithm [23]. The AA fusion process of the LMB density is relatively complex, and it is necessary to solve the label-matching problem before the AA fusion of Bernoulli density can be executed. For any node i R , the distribution form of the decomposed LMB density of its multi-target density is denoted as the following:
π i X ˜ = Δ X ˜ w i L X ˜ p i X ˜ ,
w i L = m L 1 r i m l L 1 L l r i l 1 r i l .
where r i l denotes the existence probability of target l in the FOV of node i , and p i l denotes the PDF when target l exists. Consider any two network nodes i,j at time k perform AA fusion, and the decomposed LMB densities from their local PLMB densities are represented by their respective corresponding parameter sets π i = r i l , p i l l L i and π j = r j l , p j l l L j , respectively. If the label space of the two nodes is the same, then according to PHD consistency [21], using the PHD-AA fusion principle, we can derive the parameter representation of the fused LMB density as follows:
π f u s e d X = r f u s e d l , p f u s e d l l L ,
r f u s e d l = w i r i l + w j r j l ,
p f u s e d l = 1 r f u s e d l w i r i l p i l + w j r j l p j l .
From Equations (19) and (20), LMB-AA fusion can perform a linear fusion of Bernoulli densities independently and in parallel under each label. This label-by-label fusion has high computational efficiency, but it needs to ensure that the multi-target densities provided by different nodes have the same labeling information, and if the labels are inconsistent, it is not possible to perform LMB-AA fusion directly. Therefore, in this paper, the optimal assignment strategy proposed in [26] is first used to calculate the optimal label assignment, and then the LMB-AA fusion is executed. The specific steps of the optimal label assignment strategy are as follows.
Step 1: Calculate the difference L j c between the label cardinality of the two to-be-fused nodes and construct the assisted label set L j a . For the convenience of discussion, the label sets corresponding to the LMB densities of the two to-be-fused nodes are denoted as L i and L j , assuming that L i L j , then the difference of cardinality L j c between the two label sets can be computed as the following:
L j c = L i L j .
To ensure that the set L i and the set L j have the same cardinality, the set of assisted labels L j a is constructed as follows:
L j a = L j L j c .
where L j c denotes an arbitrary subset of the cardinality L j c in L L j . Then, the optimal label assignment problem can be modeled as the following:
min τ U J ( τ ) = min τ ˜ T L i , L j a log I F L i η ( I , τ ˜ ( I ) ) .
The corresponding optimal assignment can be expressed as follows:
τ ˜ * = arg min τ ˜ T L i , L j a log I F L i η ( I , τ ˜ ( I ) ) .
where τ : L i L j denotes the assignment function of the set label, which is expressed as follows:
τ = ( l , τ ( l ) ) : l L i , τ ( l ) L j .
η · , · denotes the GCI divergence between two label sets corresponding to multi-target densities, with the expression as the following:
η l 1 i , , l n i , l 1 j , , l n j = π i X ˜ i w i π j X ˜ j w j d x 1 , , x n .
Step 2: Construct the corresponding label distribution based on the difference set of two sensor label sets. For all labels l L j c , set their existence probability to be r j l = 0 based on the truncation strategy employed by the local filter.
Step 3: Calculate the cost function matrix. Combining the set of assisted labels and the cost function constructed by Equation (25), the optimal assignment problem is simplified as the following:
J ( τ ˜ ) = l L i C l , τ ˜ ( l ) ,
C l i , l j = log 1 r i ( l i ) w i 1 r j ( l j ) w j + r i ( l i ) w i r j ( l j ) w j c l i , l j ,
c l i , l j = p i l i ( x ) w i p j l j ( x ) w j d x .
At this point, the optimal assignment problem is degraded to the combinatorial optimization problem, and the matrix of the cost function is a L i × L i matrix of the following:
C C 1 i , 1 j C 1 i , L i j C L i i , 1 j C L i i , L i j
where the element in C l i , l j is computed from Equation (29) and represents the cost of the association between node-tracked target l i and node-tracked target l j .
Step 4: Based on the linear assignment function, the Hungarian algorithm is utilized to solve the optimal assignment matrix, and the optimal assignment matrix is obtained to be denoted as C * = c l , τ * ( l ) : l L i , and the detailed computation process is described in [26].
When the number of nodes in the network is more than 2, considering that all nodes need to perform joint optimization control, the flooding strategy is used to make all nodes receive multi-target densities from other nodes in the network before performing density fusion. The fusion process uses a pair-by-pair fusion method to perform LMB-AA fusion, and the specific algorithm flow is shown in Algorithm 1.
Algorithm 1: The flow of the LMB-AA fusion algorithm based on the flooding strategy
Input :   local   LMB   density   of   nodes   with   parameter   set   π i = r i ( l ) , p i ( l ) ( · ) : l L i , i = 1 , , R . Output :   fused   LMB   density   with   parameter   set   π f u s e d = r f u s e d ( l ) , p f u s e d ( l ) ( · ) : l L m a t c h e d .
Function :   LMB - AA   π 1 , π 2 , , π R For   j = 2 : R   do τ * , C * = opt _ matching _ algorithm π 1 - π j                     For   l L 1   do                                 r f u s e d l = w 1 r 1 l + w j r j l                                 p f u s e d l = 1 r f u s e d l w 1 r 1 l p 1 l + w j r j l p j l                     End                     π 1 = r f u s e d ( l ) , p f u s e d ( l ) ( · ) : l L 1 ; End π f u s e d = π 1 Return :   π f u s e d

3.2. AA Fusion Based on PLMB Density

For network nodes with limited FOV, their local multi-target density consists of PPP density and LMB density, where LMB density represents the density information of the detected targets, and PPP density represents the potentially undetected targets and the undetected targets outside the FOV. Therefore, the LMB density is concentrated within the FOV of each node, while the PPP density is mainly concentrated outside the FOV, and there may be a very small portion of PPP density that tends to zero within the FOV. The AA fusion process for LMB density has been given in the previous section, and the AA fusion process for PPP density is described next.
Unlike PHD-AA fusion, the PPP density decomposed from the PLMB density has a different range of values of intensity functions, although it also fuses the intensity functions. When a node with a limited FOV performs PHD filtering and then directly performs PHD-AA fusion, its fused intensity function is only concentrated within the FOV, while the intensity function to be fused in the PPP density is not only distributed within the FOV but also exists outside the FOV. The PPP density describes the distribution of potentially undetected targets that are not yet detected but are theoretically present, and therefore, more reliance should be placed on the information within the node’s FOV when evaluating the distribution of the density of these undetected targets. Specifically, for a region located within a node i ’s FOV, when it is not within the node j ’s FOV, estimation of undetected targets in that region should favor trusting the node i , even though the intensity function corresponding to the node j ’s PPP density may be larger, the node i ’s estimation may be more accurate. The key to the fusion of PPP density lies in how to design reasonable fusion weights based on the information within the node FOV to ensure that the fused PPP density can more accurately reflect the actual distribution of undetected targets. To this end, this paper proposes a PPP density fusion weight design method based on confidence update, which is described as follows.
Firstly, the area covered by the network is denoted as U , the area not covered is denoted as U C , and the covered area is uniformly divided into N sub-regions U = 1 N G i , and each node decomposes the intensity function corresponding to its respective PPP density into sub-intensity functions according to the sub-areas. Then, for any sub-region G i , the nodes corresponding to the sub-intensity functions present in the region form a set R i , and the corresponding number of nodes is denoted as R i , then the AA fusion is performed on different node sub-intensity in the sub-region to obtain the fused intensity of the region is denoted as follows:
D ¯ f u s e d i ( x ) = j R i w j D j ( x ) .
Since the FOV of each node is different and the sub-intensity functions present in each sub-region are also different, the sub-intensity fusion weights from different nodes are updated according to different confidence levels, for example, as follows:
ω ¯ j = ω j · 1 F o V j ( x ) j R i ω j · 1 F o V j ( x ) .
where F o V j denotes the FOV of the node in the sub-region, 1 F o V j ( x ) denotes the indicator function, and the specific expression is as follows:
1 F o V V j ( x ) = 1 , x F o V j   or   x C 0 , otherwise .
This equation shows that a node’s sub-intensity function is trustworthy when it is within that node’s FOV or is not covered by any node’s FOV. If the sub-intensity function is not within its own node FOV but appears in other node FOVs, then the sub-intensity function is not trustworthy. Finally, according to the above method, the intensities of all subregions are fused, and the fusion results are summed to obtain the globally fused PPP density of the network and the corresponding fused intensities are denoted as follows:
D ¯ U ( x ) = i = 1 N j R i w ¯ j D j ( x ) .
In conclusion, by combining Equations (19) and (35), a description of the fused multi-target density represented by the PLMB parameterization can be obtained as follows:
π f u s e d P L M B X = D ¯ U ( x ) , r f u s e d l , p f u s e d l l L m a t c h e d .

4. Multi-Node Joint Optimization Control Algorithm

Due to the limited FOV of the nodes and the stochastic nature of the appearance/disappearance of targets, the distributed APBR network needs to simultaneously search for undetected targets and track detected targets. In this paper, from the perspective of global optimization of the network, the search task and tracking task are decomposed so that all nodes execute only one task independently at any moment. In the network, a single node will perform task adaptive switching based on the local multi-target density to ensure that it performs optimization control based on the objective function of only one task. After completing the task adaptive switching, all nodes will be clustered according to the executed tasks, and nodes executing the same task will perform joint optimization control according to the objective function of the task, thus making the optimization result of a single task optimal and realizing the multi-target joint optimization control. In this way, the multi-objective optimization problem of the network is transformed into a single-objective optimization problem, which not only avoids the competition of detection resources between tasks but also improves the global optimization effect of the network. Next, the multi-node joint optimization control algorithm based on the task adaptive switching mechanism is given in this section.

4.1. Task Adaptive Switching Mechanism

The task adaptive switching means that each node automatically adapts the optimization task to be performed based on the target information within the FOV at the current moment and the global multi-target tracking situation of the network. To maximize the utilization of the detection resources of all nodes within the network, individual nodes must have the real-time switching capability of the tasks to be optimized. In this paper, each node can be defined according to the task type as S node and T node, and the corresponding set of nodes are denoted as R S and R T . The S node is responsible for performing the target search task, and the task goal is to search the unknown region to detect more potential targets; T node is responsible for performing the target tracking task, and the task goal is to continuously track the detected targets. When the node performing the search task detects a target, it needs to stop searching and switch to performing the tracking task; when all targets within the FOV of the node performing the tracking task are dying out, the node needs to switch to performing the search task in time.
At the initial moment, the state, number, and distribution of targets in the network are unknown, and all sensor nodes are set as S nodes. For any S node, when the estimated number of targets within its FOV is greater than 0 for m consecutive moments, its search task is switched to a tracking task and labeled as a T node. For any T node, when the number of targets in its FOV is estimated to be 0 for m consecutive moments, its tracking task is switched to a searching task and labeled as an S node. m is adjusted according to the detection probability of the receiver, and when the detection probability is larger, m can be set smaller, on the contrary, when the detection probability is smaller, m should be set larger.
In addition, assuming that the network multi-target tracking fusion error is set within the threshold range T h l o w e r , T h u p p e r , when the network global multi-target tracking fusion error is higher than the upper bound of the threshold range, it indicates that the multi-target tracking fusion effect is poor and some of the S nodes should be switched to T nodes according to the principle of proximity to the target to perform the tracking task, so as to improve the tracking accuracy of the network to the detected target. When the global tracking fusion error of the network is lower than the upper bound of the threshold range, it indicates that the multi-target tracking fusion effect is better, and some of the T nodes can be switched to S nodes according to the overlap of the FOV coverage to improve the network’s search coverage of the undetected targets. In summary, all nodes in the network will adaptively switch the executed optimization control tasks in accordance with the designed task adaptive switching mechanism, combining their own state and the global multi-target tracking fusion results of the network.

4.2. Optimization Objective Function for Different Tasks

After the nodes in the network perform a fusion update of multi-target density, they need to estimate the target state and the number of targets at the same time, and the multi-target tracking performance will be affected by both the target state estimation and the number estimation. In view of this, the information entropy can be used in the optimal control algorithm to measure the information difference between the pseudo-predicted multi-target densities of the global fusion of the network and the pseudo-posteriori multi-target densities of the fusion of the different control schemes. In this way, the information difference of the multi-target densities can be measured directly, which can consider the effects of the target state estimation and the number estimation at the same time. Considering the computational complexity of the algorithm, this paper adopts differential entropy as the information difference measurement to derive the objective function for the target tracking task. In addition, the fusion intensity is used as the main measurement in the construction of the objective function for the target search task. Next, in this section, the objective functions in the search task and the tracking task are derived, respectively.

4.2.1. Derivation of the Objective Function for Tracking Tasks

The task-switching indicator function for T nodes is first constructed. The global multi-target tracking performance of the network is related to the PLMB density of the detected targets, and the LMB density part of the PLMB density reflects the tracking performance of the detected targets and the PDF of a single detected target can be expressed by the Gaussian component as the following:
p ¯ k + l l c k | l L k + l = N x ; m k + l l c k , P k + l l c k ,
where L k + l reflects the estimation of the number of detected targets, and the covariance of the single-target PDF reflects the state estimation error of the detected targets. On this basis, this section uses the average tracking error function [37] of all detected targets as the task-switching indicator function of the T-node, for example, as follows:
D s w i t c h = l = 1 L k + l t r W P k + l l c k W L k + l .
where L k + l denotes the number of targets, W denotes the extraction matrix of the position component in the covariance matrix P k + l l c k , and the multi-target average tracking error is taken to avoid the overall error becoming larger with the increase of the number of detected targets.
When D s w i t c h < T h l o w e r , the multi-target tracking accuracy of the network is judged to be better, then some of the T nodes can be switched to S nodes to improve the network’s ability to search for undetected targets; the specific switching steps are as follows:
Step 1: Based on the respective tracking errors of the detected targets, find the label corresponding to the target with the smallest tracking error as follows:
l g o o d = min l   tr W P k l W T ,
Step 2: Extract the target position m k l g o o d corresponding to the label and find out all the T-nodes that have detected this target in the FOV and put them into the set R T / S of to-be-switched nodes. At the same time, to ensure that the estimation of the number of targets is not affected, T-nodes that contain other independent targets (which are only tracked by this T-node) in the FOV should be excluded from the set R T / S .
Step 3: Based on the distance of this target from all the T-sensors in the set R T / S , it is sorted according to the size of the distance. Considering that the farther the node is from the target, the larger the detection error is and the smaller the contribution to the target tracking performance. Switch the T node with the farthest distance in the set R T / S to an S node.
Next, the differential entropy is used to construct the optimal control cost function of T nodes. The differential entropy in the framework of RFS theory is defined as follows: for any RFS X , whose probability density is f X , its corresponding differential entropy h X is defined as the following:
h X = E X ln f X = ln f X Y f X Y μ d Y .
Y in Equation (39) denotes any finite subset of the RFS X and μ is a reference measure defined as the following:
μ T i = 0 1 i ! K i 1 T ( y 1 , . . . , y i ) d ( y 1 , . . . , y i ) .
where K is the hyper-volume unit on X , like the volume unit in the Euclidean space. T is a finite subset of X , and 1 T · is the indicator function of T . Observing Equation (39), we can find that the differential entropy is essentially the negative logarithmic expectation of the probability density, which is used to measure the uncertainty of the RFS X , and the smaller the differential entropy, the smaller the uncertainty. The expectation E X ln f X is obtained by taking the logarithm of the probability density f X and multiplying it by itself, and integrating over all possible subsets Y . Let f X | Z denote the conditional probability density of the RFS X on Z , then based on Equation (39), can be expanded to obtain the definition of the conditional differential entropy as follows:
h ( X Z ) = E X , Z ln f X Z .
Further, let the multi-target density of the RFS X be π X , then the differential entropy of the multi-target density can be expressed using the definition of set integral aa follows:
h ( X ) = ln K | Y | π X ( Y ) π X ( Y ) δ Y .
Combining Equations (39) and (41) yields the mutual information between two RFSs X and Z can be expressed as follows:
I ( X ; Z ) = h ( X ) h ( X Z ) .
The mutual information in Equation (44) indicates the amount of information obtained by the RFS X when acquiring the RFS Z , and the larger the mutual information, the smaller the uncertainty of X . Therefore, from the perspective of multi-target optimal control, the smaller the differential entropy of the multi-target state set or the larger the mutual information between the multi-target state set and the PIMS, the higher the accuracy of the multi-target trajectory estimation, and the T-node’s task is accomplished better. In this section, the LMB density part of the multi-target density at the time denotes the detected targets, denoted as π k L M B X . The optimal control of a T-node is essentially to solve for the optimal control scheme that minimizes the information difference between the multi-target predicted density π k + l L M B X at time k + l and the multi-target density π k + l L M B X | Z c k after the PIMS update. For an LMB density with a parameter set of r ( l ) , p ( l ) ( · ) l L , its differential entropy expression is as follows:
h ( X ) = l L r ( l ) ln r ( l ) + r ˜ ( l ) ln r ˜ ( l ) + r ( l ) < p ( l ) , ln K p ( l ) > .
where r ˜ ( l ) = 1 r ( l ) . Therefore, combining the multi-node optimal control model given in Equation (16) and the differential entropy definition of the LMB density in Equation (43), the optimal control function for all T nodes under the control scheme can be constructed as follows:
V t r a c k c k = j = k + 1 k + l h X j | Z j c k .
Equation (46) represents the cumulative differential entropy of the multi-target state given the PIMS in the control horizon l . To facilitate the calculation, Equation (46) is further disassembled as follows in conjunction with the definition of the PIMS value function in [37], as follows:
V t r a c k ( c k ) = j = k + 1 k + l ϱ Z c k , X j π j l m b X j Z c k δ X j ,
ϱ Z c k , X j = ln K | X | π j l m b ( X j Z c k ) .
In summary, Equations (38) and (46) are jointly constructed as the objective function in the tracking task, expressed as follows:
θ t r a c k = V t r a c k , D s w i t c h .
The former represents the mutual information measurement function between the multi-target prediction density and the PIMS update density, which serves as the joint optimization control function of the T nodes in order to obtain more informative multi-target measurements; the latter represents the average tracking error function of all detected targets, which serves as the task-switching indicator function of the T nodes in order to better balance the allocation of the node’s detection resources in the network, and to improve the network’s potential undetected target search capability of the network.

4.2.2. Derivation of the Objective Function for Searching Tasks

The task-switching indicator function of S nodes is decided according to the global multi-target tracking performance of the network, and the task-switching indicator function given in Equation (38) is still used here; only the conditions of task-switching are different. When D s w i t c h > T h u p p e r , the multi-target tracking performance of the network is judged to be poor, then some of the S nodes can be switched to T nodes to improve the tracking ability of the network on the detected targets; the specific switching steps are:
Step 1: Based on the global multi-target density of the network, extract the set of states of the detected targets at the current moment and find the target with the largest state estimation error, whose corresponding label is as follows:
l b a d = max l tr W P k l W T .
Step 2: Extract the location m k l b a d of the target corresponding to the label, calculate the distance from all S nodes to the target, and find the closest S node to the target as the to-be-switched node S T according to the proximity principle as follows:
S T = min i R S m k l b a d m k i .
Step 3: If the target is located within the FOV of the node S T , switch the node S T to T-node directly; if the target is located outside the FOV of the node S T , control the node to move towards the target until the target enters the FOV of the node S T , and then switch it to T-node.
Considering that the multi-target fusion density obtained by Equation (36) is the PLMB density, where the PPP part represents the intensity function of the undetected targets, integrating the PPP intensity function in the search area of the S node can get the expectation of the number of targets in the search area. Notice that the task-switching mechanism can ensure independence between the two tasks, and different methods can be chosen in the construction of the objective function according to the specific needs of the task. Therefore, to make the number of undetected targets in the network coverage area as small as possible, the objective function of the search task does not need to use the differential entropy construction method in the tracking task, and it can be directly integrated over the PPP intensity function in the coverage area of all S nodes as the optimization control function of the search task, for example, as follows:
D ¯ k + l U c k = j = 1 J k α k + l U , j c k N x ; m k + l U , j c k , P k + l U , j c k ,
V s e a r c h c k = D ¯ k + l U c k = j = 1 J k α k + l U , j c k .
The objective function in the search task is constructed by combining Equations (38) and (53) and is expressed as follows:
θ s e a r c h = V s e a r c h , D s w i t c h .
The former serves as an optimization control function for the intensity of undetected targets in the search region, which is used for the optimal control decision of multiple nodes under the search task to search more targets; the latter represents the average tracking error function of all detected targets, which measures the global multi-target tracking fusion effect of the network and serves as a task-switching indicator function for the S node, which is convenient for allocating more node probing resources to the target in the case of a poor tracking effect tracking to ensure the effectiveness of multi-target tracking fusion.

4.3. Joint Optimization Control Algorithm Flow

It is assumed that all the nodes in the network have completed the track initiation for the detected targets in the FOV and have completed the initial task allocation according to the targets in their respective FOVs. In the subsequent multi-target tracking fusion process of the network, all nodes will continue to execute the multi-node joint optimization control algorithm based on the task adaptive switching mechanism, and the specific implementation steps of the algorithm are as follows:
Step 1: All the nodes in the network execute local PLMB filters to obtain the multi-target density at time k according to the multi-target measurement data obtained from their respective FOVs and perform AA fusion on the multi-target density based on the flooding communication strategy to obtain the global fusion multi-target posteriori density of the network. At the same time, the multi-target prediction density at time k + l is calculated and the multi-target prediction state set at time k + l is extracted.
Step 2: Extract the set of multi-target states at time k and calculate D s w i t c h . All nodes determine whether the global multi-target tracking accuracy of the network at the current moment is within the set threshold based on D s w i t c h , and then perform task adaptive switching according to the determination result and the target situation within their respective FOVs. Finally, all nodes are clustered according to the task type.
Step 3: Firstly, according to the given set C k of control parameters, calculate the state set of each node under all control parameters, and calculate the PIMS received by each node under different control parameters according to the multi-target prediction state at time k + l , and then each node updates the multi-target posteriori density related to each control parameter based on the PIMS. Finally, AA fusion is performed on the multi-target posteriori densities of the same type’s nodes to obtain the fused multi-target posteriori density.
Step 4: Solve for the optimal set of control parameters for the T-nodes and S-nodes, respectively, according to the solution method of the single-objective optimization problem. For all T nodes, the control parameters are solved as follows:
c k T = a r g min c k C k V t r a c k c k .
For all S nodes, the control parameters are solved as follows:
c k S = a r g min c k C k V s e a r c h c k .
Considering the intricate nature of the objective function and the stringent real-time constraints, a greedy algorithm is an optimal choice for solving the single-objective optimization problem. The specific details of this algorithm can be found [38]. The control parameters obtained by solving the two objective functions separately constitute the optimal control parameter set c k opt = c k T c k S for multi-target joint optimization control at time k . Subsequently, all nodes adjust their motion heading in accordance with the optimal control parameter set c k opt and maneuver within the control FOV until the next control cycle commences.
For T nodes and S nodes, their numbers are denoted as R T and R S , respectively, and the base of the alternative control parameter set is denoted as C k , then the computational complexity of the algorithm can be expressed as 𝒪 C k R T + C k R S . Because all nodes are clustered according to the task type in advance, and the single-objective optimization problem only needs to be solved independently according to the objective function corresponding to each task, the efficiency of the optimization solution will be greatly improved, and the optimal solution can be obtained in the process of solving the objective function under each task. The following section will present the results of a series of simulation experiments, the objective of which is to verify and analyze the practical effect of the proposed algorithm.

5. Simulation Verification and Result Analysis

5.1. Experimental Scenario Setting

In this section, a multi-target tracking task scenario of a single-transmitter-multi-receiver distributed APBR network is designed to validate the effectiveness of the multi-node joint optimal control algorithm proposed in this paper in improving the multi-target tracking performance of the network. In the experiment, the whole range of the detection area is set as 20000 , 20000   m × 20000 , 20000   m , and the center of the area is set with a fixed position of the non-cooperative emitter, the coordinates of which are 0   m , 0   m , and its signal can cover the whole detection area. Considering the maneuverability of the UAV and the power coverage of the emitter, four UAVs equipped with passive sensors are dispersed in the detection area, constituting a distributed wireless sensor network, and each UAV can adjust its own state in real-time according to the control commands calculated by the algorithm. The detection period of the whole network is set to 100 s, and there are six targets appearing and disappearing at different moments during the detection period, and the altitude difference between the UAV and the targets is ignored. All the targets move in the CT model with an unknown turning rate, ω k l is denoted as the turning rate of the target l at time k , and its state transfer equation is shown as follows:
x ˜ + l = F CT x ˜ k l + W k l
F CT = 1 0 s i n ω k T ω k c o s ω k T 1 ω k 0 1 1 c o s ω k T ω k 1 c o s ω k T ω k 0 0 c o s ω k T s i n ω k T 0 0 s i n ω k T c o s ω k T
W k l = n p , n p , n v , n v
where W k l denotes the additive Gaussian process noise, n p ~ N 0 , σ p 2 , n v ~ N 0 , σ v 2 , σ p = 0.5 σ a · T 2 and σ v = σ a · T denotes the standard deviation of the process noise for position and velocity, σ a = 1   m / s 2 denotes the standard deviation of the acceleration noise and T = 1   s is the sampling interval of the receiver. The initial state of the target and the moment of birth and death, as well as the initial state settings of each UAV in the APBR network, are shown in Table 1.
The target measure received by the UAV is a nonlinear measure combination consisting of DOA-TDOA-FDOA; the measurement model has been given in Section 2.2, described by Formulas (7)–(11), and the measurement noise is set to the following:
V k = n θ l , n θ s n t d , n f d ,
where n θ l ~ N 0 , σ θ l 2 , n θ l ~ N 0 , σ θ s 2 , n t d ~ N 0 , σ t d 2 , n f d ~ N 0 , σ f d 2 are measurement noise distribution for target arrival angle, emitter arrival angle, bistatic range, and bistatic range rate, respectively. σ θ l = σ θ s = 0.05   rad , σtd = 100 m, σ f d = 2   m / s denote the standard deviation of the measurement noise, respectively.
It is assumed that all clutter in the measurement data set received by each UAV node are uniformly distributed in the detection area, and the number of clusters at each sampling moment obeys a Poisson distribution with the parameter λ C . The detection FOV of all UAV nodes in the network is set to be a circular area of radius R FOV , and the probability of detecting a target inside the FOV is P D , and the probability of detecting a target outside the FOV is 0. The basic parameters of the UAV nodes are set, as shown in Table 2.
Under the same experimental parameter settings, this paper will give the multi-target tracking performance when using only the multi-node fusion algorithm and compare the multi-target tracking performance when using the multi-node joint optimization control algorithm proposed in this paper and the multi-task joint optimization algorithm proposed in [39], to validate the necessity and the effectiveness of the proposed multi-target joint optimization control algorithm. The algorithm performance comparison metrics are chosen to be Generalized Optimal Sub-pattern Assignment (GOSPA) [40] and OSPA(2) [41] with the truncation parameter set to c = 200 , the order set to p = 2 , and the window length set to L = 10 . The Monte Carlo count is set to 200. The cruising speed of the UAV nodes is set to v s = 150   m / s . It is assumed that all the UAV nodes to be optimally controlled hover at the initial position and move at a constant cruising speed according to the initial heading after receiving the detection command. When all the UAV nodes in the network execute the joint optimization control algorithm, the control FOV length is set to l = 10   s , and the range of the alternative heading angle control parameters of the UAV is set to C k i = 180 ° , 179 ° , , 0 ° , , 179 ° , 180 ° . That is, during the whole detection cycle of the APBR network, all UAV nodes carry out mission adaptive switching every 10 s and adjust the heading according to the control command parameters given by the joint optimization control algorithm before executing the next stage of the mission.

5.2. Analysis of Experimental Results

First, Figure 3 gives the multi-target tracking scene of the simulation experiment. Each node sets the heading at the initial moment to be clockwise coordinated uniform motion, keeping the same distance from each other, and the patrol tracks set by all the nodes when no control algorithm is used are shown in Figure 3. In Figure 3, all the target tracks are described by the start point ○ and the endpoint Δ, as well as the duration of the tracks, and the shaded portion indicates the area that can be covered by all the nodes when detecting the targets according to the set patrol routes. The nodes are symmetrically distributed around the emitter, and the sum of the FOVs of all nodes cannot cover the entire detection area of interest. In the scene, target 1 and target 2 are in the FOV of the nodes just after birth, target 4 enters the FOV of the nodes sometime after birth, and target 5 and target 6 do not enter the FOV of any of the nodes during the whole detection duration. If the UAV nodes patrol according to the set route, it will inevitably lead to target 5 and target 6 never being detected, and the tracking tracks of some of the targets will also be incomplete.
Figure 4 gives a graph of the motion tracks formed by each node under the command control of the multi-node joint optimization control algorithm proposed in this paper. The motion tracks of each node are described by the start point ○, the control point □, and the endpoint Δ. Since the length of the control, FOV is 10 s, every 10 s, the node carries out the execution of a task-switching decision and makes a control command decision based on the task type to determine the heading angle in the next control cycle. Table 3 gives the task assignments for each node at each control point. Combining Figure 4 and Table 3, target 1 and target 2 appeared in the 1st control moment, but at that moment k < m, the nodes were unable to confirm the target track, so all nodes performed the search task. In the 2nd control moment, targets 1 and 2 are detected by node 1 and node 2, respectively, while no target is detected in node 3 and node 4, which continue to perform the search task. In the 3rd control moment, target 3 and target 4 are born; target 3 is detected by node 3, but target 4 has not been detected yet, so node 3 switches to perform the tracking task, and node 4 continues to search in the direction of target 4. At the 4th control moment, target 4 is detected, and node 4 switches to tracking target 4. At the 5th control moment, target 5 and target 6 are born but not yet detected. At the 6th control moment, all nodes were performing the tracking task, and nodes 1 and node 3 were tracking two targets at the same time because target 5 and target 6 had entered the FOV of node 1 and node 3, respectively. In the 7th control moment, target 1 disappeared, resulting in only target 6 existing within the FOV of node 1, so node 1 continued to track target 6. In the 8th control moment, target 2 disappeared, and target 5 entered the FOV of node 2, so node 2 continued to perform the tracking task. In the 9th control moment, target 3 disappeared, and only target 5 exists within the FOV of node 3, so node 3 continues to track target 5. In the 10th control moment, target 4 has disappeared, and node 4 switches to the searching task, but since there is no undetected target in the scene, node 4 searches in the direction of the nearest target 6; at the same time, node 2 switches to the T-node to track target 6 according to the proximity principle.
Figure 5 gives the GOSPA error plots of the local filtering results of each node and the global fusion results of the APBR network. From the localization error given in Figure 5a, when there is no target in the node’s FOV, the localization error is 0. When there is a target in the node’s FOV, the target localization error will gradually decrease with the optimization of the control algorithm. Specifically, in the first 20 s, only target 1 and target 2 appear, node 1 and node 2 perform the tracking task, and the localization error of the node’s local filter gradually decreases during the execution of the control command moving towards the tracked target. Between 21–30 s, node 3 tracks target 3, but node 4 has not yet tracked target 4, causing the network global fusion localization error to increase further. Between 31–40 s, node 4 tracked target 4, and the localization error gradually decreased during the control process. Between 41–70 s, all nodes perform the tracking task; the localization error is relatively stable and only becomes larger when the number of targets changes. Between 71–95 s, the number of targets changes frequently, and the localization error varies greatly. In the last 5 s, all the detection nodes are used to track target 6, which decreases the target localization error. The false alarm error in Figure 5b is lower throughout the detection period because the probability of detecting clutter is small due to the low clutter density setting and limited node FOV in the scene, so there is almost no target false alarm at each node. Figure 5c reflects the target missing from each node; due to the limited node FOV, each node detects a limited number of targets, the missing is obvious, and the error is large; while the global target missing from the network is relatively good, only in the detection area when the number of targets changes there will be a delay in detection time, resulting in an increase in the missing error at some moments, indicating that the adopted flooding communication strategy and AA fusion criteria are more effective and can effectively fuse the target information of each node. From the GOSPA error given in Figure 5d, as the number of nodes involved in tracking increases, the GOSPA error of the fusion result decreases gradually and is always smaller than the GOSPA error of the local filters of each node. In addition, the main sources of the GOSPA error are still the target localization error and the missing error, which is more obvious in the change curve of the GOSPA error of the fusion result.
Figure 6 gives the OSPA(2) error map of the local filtering results of each node and the fusion results of the APBR network when using the multi-node joint optimal control algorithm, which is used to reflect the multi-target track estimation by each node and the APBR network, and to validate the effect of the algorithm proposed in this paper on the estimation of multi-target tracks when the nodes’ FOV is limited. From Figure 6, the error variation rule of OSPA(2) is close to the GOSPA error in Figure 5, which indicates that the main sources of the error of OSPA(2) are the target state estimation error and the target number estimation error, and the error caused by the interruption of the trajectory is smaller. The PLMB filter used in this paper can estimate the target tracks with good track continuity.
The GOSPA error comparison plot of the global fusion results of the APBR network under different algorithms is given in Figure 7, which is used to evaluate the necessity and effectiveness of using a multi-node joint optimization control algorithm. Since the multi-task joint optimization algorithm in [38] uses a PMBM filter and does not estimate the target tracks, the comparison of the OSPA(2) error is not performed in this paper. From Figure 7, it can be clearly seen that the GOSPA error has been higher when the multi-target joint optimization control algorithm is not used; the reason is that the FOV of each node in the APBR network is limited, and it cannot detect all the targets in the area, there is a serious target missing, and the effect of the target tracking is not satisfactory. The patterns of change in GOSPA error of the multi-task joint optimization algorithm in [39] is close to the algorithm proposed in this paper, but the error value is higher than the algorithm proposed in this paper overall. The reason is that the multi-task joint optimization algorithm adopts a weighted method to allocate the detection resources of the sensor nodes, especially when the number of targets changes. The allocation of detection resources will be unreasonable, resulting in an increase in the error. In addition, the complexity of the algorithm in multi-task joint optimization is high, the computation is large, and the optimization control of each node will not be timely. In contrast, the tracking task objective function used in the algorithm proposed in this paper is more reasonable, and the target localization error will be smaller. In addition, the use of a task adaptive switching mechanism can allocate the detection resources of sensor nodes more reasonably, and the control response to each node is more rapid when the number of targets changes.
The target number estimation results of the global fusion results of the APBR network under different algorithms are given in Figure 8, which is used to compare and verify the accuracy of the target number estimation in the detection area using the joint multi-target optimization control algorithm. From Figure 8, the target number estimation when the control algorithm is not used is poor due to the limited node FOV. This is because even if the multi-target information is fused interactively, there will be some targets that are not within the FOV of all the nodes, and these targets are bound to miss detection. The multi-task joint optimization algorithm proposed in [39] can estimate all the targets appearing in the detection area, but the delay in estimating the number of targets in the network fusion result is large when the number of targets varies since the target search task and the tracking task are performed simultaneously. This is because the weighting method used in the algorithm cannot reasonably allocate the detection resources of the sensor nodes, especially when the number of targets changes. The allocation of detection resources will be more unreasonable, resulting in the target number estimation error becoming larger. In contrast, the multi-node joint optimization control algorithm proposed in this paper is based on the task adaptive switching mechanism, and all nodes are reclassified according to different tasks, which transforms the multi-task optimization problem into a single-task optimization problem. Therefore, the proposed algorithm can not only reasonably allocate the detection resources of the sensor nodes but also reduce the complexity of the algorithm, improve the optimization and control efficiency, and estimate the number of targets in the detection area more accurately.

6. Conclusions

This paper introduces a multi-node joint optimal control algorithm for addressing the multi-target tracking challenge in distributed APBR networks, focusing on incomplete target detection due to limited node FOV. Initially, a PLMB filter is applied at each network node to capture local multi-target density within their respective FOVs. Subsequently, the AA fusion rule, utilizing a flooding strategy, integrates multi-target densities from various sensor nodes. To tackle incomplete target detection resulting from limited FOVs, a task-adaptive switching mechanism classifies sensor nodes into search and tracking nodes, simplifying the optimization problem. Objective functions for tracking and searching tasks are derived based on differential entropy and intensity functions. The GM-CKF method facilitates joint optimal control for multi-sensor nodes by implementing local filtering and density fusion. The greedy algorithm resolves single-task optimization, enhancing multi-target tracking and fusion efficiency. Simulation results show real-time optimal control capability and efficient sensor resource allocation. This algorithm proves crucial for overcoming incomplete target detection and improving distributed APBR network performance. Future studies will incorporate factors like signal resources, network loads, and sensor power consumption to enhance the network’s utility in complex electromagnetic environments.

Author Contributions

Conceptualization, X.G.; methodology, Y.L.; software, Y.L.; validation, Y.L. and L.R.; formal analysis, Y.L.; investigation, L.R.; resources, X.G.; data curation, Y.L.; writing—original draft preparation, Y.L.; writing—review and editing, L.R.; visualization, Y.L.; supervision, X.G.; project administration, X.G.; funding acquisition, X.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to thank the anonymous reviewers for their constructive and detailed comments.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Hu, S.; Yi, J.; Wan, X.; Cheng, F.; Hu, Y.; Hao, C. Illuminator of Opportunity Localization for Digital Broadcast-Based Passive Radar in Moving Platforms. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 3539–3549. [Google Scholar] [CrossRef]
  2. Gabard, B.; Wasik, V.; Rabaste, O.; Deloues, T.; Poullin, D.; Jeuland, H. Airborne Targets Detection by UAV-Embedded Passive Radar. In Proceedings of the 2020 17th European Radar Conference (EuRAD), Utrecht, The Netherlands, 10–15 January 2021; pp. 346–349. [Google Scholar]
  3. Malanowski, M.; Rytel-Andrianik, R.; Kulpa, K.; Stasiak, K.; Ciesielski, M.; Kulpa, J. Geometric Clutter Analysis for Airborne Passive Coherent Location Radar. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–14. [Google Scholar] [CrossRef]
  4. Zhao, Z.; Cao, Y.; Wang, Y.; Zhou, H. Transmitted-Signal-Free Target Information Extraction for OFDM-Based Passive Radar with Time-Varying Sparse Model. IEEE Geosci. Remote Sens. Lett. 2023, 20, 1–5. [Google Scholar] [CrossRef]
  5. Colone, F.; Filippini, F.; Pastina, D. Passive Radar: Past, Present, and Future Challenges. IEEE Aerosp. Electron. Syst. Mag. 2023, 38, 54–69. [Google Scholar] [CrossRef]
  6. Sun, Z.; Wu, J.; Yen, G.G.; Lu, Z.; Yang, J. Performance Analysis and System Implementation for Energy-Efficient Passive UAV Radar Imaging System. IEEE Trans. Veh. Technol. 2023, 72, 9938–9955. [Google Scholar] [CrossRef]
  7. Mahler, R.P.S. Multitarget Bayes filtering via first-order multitarget moments. IEEE Trans. Aerosp. Electron. Syst. 2003, 39, 1152–1178. [Google Scholar] [CrossRef]
  8. Vo, B.; Vo, B.; Cantoni, A. Bayesian Filtering with Random Finite Set Observations. IEEE Trans. Signal Process. 2008, 56, 1313–1326. [Google Scholar] [CrossRef]
  9. Saucan, A.; Coates, M.J.; Rabbat, M. A Multisensor Multi-Bernoulli Filter. IEEE Trans. Signal Process. 2017, 65, 5495–5509. [Google Scholar] [CrossRef]
  10. García-Fernández, Á.F.; Williams, J.L.; Granström, K.; Svensson, L. Poisson Multi-Bernoulli Mixture Filter: Direct Derivation and Implementation. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 1883–1901. [Google Scholar] [CrossRef]
  11. Boström-Rost, P.; Axehill, D.; Hendeby, G. Sensor Management for Search and Track Using the Poisson Multi-Bernoulli Mixture Filter. IEEE Trans. Aerosp. Electron. Syst. 2021, 57, 2771–2783. [Google Scholar] [CrossRef]
  12. Vo, B.; Vo, B.; Phung, D. Labeled Random Finite Sets and the Bayes Multi-Target Tracking Filter. IEEE Trans. Signal Process. 2014, 62, 6554–6567. [Google Scholar] [CrossRef]
  13. Reuter, S.; Vo, B.; Vo, B.; Dietmayer, K. The Labeled Multi-Bernoulli Filter. IEEE Trans. Signal Process. 2014, 62, 3246–3260. [Google Scholar] [CrossRef]
  14. Vo, B.; Vo, B.; Hoang, H.G. An Efficient Implementation of the Generalized Labeled Multi-Bernoulli Filter. IEEE Trans. Signal Process. 2017, 65, 1975–1987. [Google Scholar] [CrossRef]
  15. Cament, L.; Correa, J.; Adams, M.; Pérez, C. The histogram Poisson, labeled multi-Bernoulli multi-target tracking filter. Signal Process. 2020, 176, 107714–107729. [Google Scholar] [CrossRef]
  16. Li, G.; Battistelli, G.; Yi, W.; Kong, L. Distributed multi-sensor multi-view fusion based on generalized covariance intersection. Signal Process. 2020, 166, 107246. [Google Scholar] [CrossRef]
  17. Li, G.; Battistelli, G.; Chisci, L.; Gao, L.; Wei, P. Distributed Joint Detection, Tracking, and Classification via Labeled Multi-Bernoulli Filtering. IEEE Trans. Cybern. 2024, 54, 1429–1441. [Google Scholar] [CrossRef]
  18. Shen, K.; Dong, P.; Jing, Z.; Leung, H. Consensus-Based Labeled Multi-Bernoulli Filter for Multitarget Tracking in Distributed Sensor Network. IEEE Trans. Cybern. 2022, 52, 12722–12733. [Google Scholar] [CrossRef]
  19. Yi, W.; Li, G.; Battistelli, G. Distributed Multi-Sensor Fusion of PHD Filters with Different Sensor Fields of View. IEEE Trans. Signal Process. 2020, 68, 5204–5218. [Google Scholar] [CrossRef]
  20. Li, T.C.; Song, Y.; Song, E.B.; Fan, H.Q. Arithmetic average density fusion—Part I: Some statistic and information-theoretic results. Inf. Fusion 2024, 104, 102199. [Google Scholar] [CrossRef]
  21. Li, T. Arithmetic Average Density Fusion—Part II: Unified Derivation for Unlabeled and Labeled RFS Fusion. IEEE Trans. Aerosp. Electron. Syst. 2024, 60, 3255–3268. [Google Scholar] [CrossRef]
  22. Li, T.; Yan, R.; Da, K.; Fan, H. Arithmetic Average Density Fusion—Part III: Heterogeneous Unlabeled and Labeled RFS Filter Fusion. IEEE Trans. Aerosp. Electron. Syst. 2024, 60, 1023–1034. [Google Scholar] [CrossRef]
  23. Li, T.C.; Hlawatsch, F. A distributed particle-PHD filter using arithmetic-average fusion of Gaussian mixture parameters. Inf. Fusion 2021, 73, 111–124. [Google Scholar] [CrossRef]
  24. Battistelli, G.; Chisci, L.; Fantacci, C.; Farina, A.; Graziano, A. Distributed multitarget tracking for passive multireceiver radar systems. In Proceedings of the 2013 14th International Radar Symposium (IRS), Dresden, Germany, 19–21 June 2013; pp. 337–342. [Google Scholar]
  25. Li, S.; Yi, W.; Hoseinnezhad, R.; Battistelli, G.; Wang, B.; Kong, L. Robust Distributed Fusion with Labeled Random Finite Sets. IEEE Trans. Signal Process. 2018, 66, 278–293. [Google Scholar] [CrossRef]
  26. Li, S.; Battistelli, G.; Chisci, L.; Yi, W.; Wang, B.; Kong, L. Computationally Efficient Multi-Agent Multi-Object Tracking with Labeled Random Finite Sets. IEEE Trans. Signal Process. 2019, 67, 260–275. [Google Scholar] [CrossRef]
  27. Gostar, A.K.; Hoseinnezhad, R.; Bab-Hadiashar, A. Multi-bernoulli sensor control via minimization of expected estimation errors. IEEE Trans. Aerosp. Electron. Syst. 2015, 51, 1762–1773. [Google Scholar] [CrossRef]
  28. Gostar, A.K.; Hoseinnezhad, R.; Bab-Hadiashar, A.; Liu, W. Sensor-Management for Multitarget Filters via Minimization of Posterior Dispersion. IEEE Trans. Aerosp. Electron. Syst. 2017, 53, 2877–2884. [Google Scholar] [CrossRef]
  29. Ristic, B.; Vo, B.; Clark, D. A Note on the Reward Function for PHD Filters with Sensor Control. IEEE Trans. Aerosp. Electron. Syst. 2011, 47, 1521–1529. [Google Scholar] [CrossRef]
  30. Beard, M.; Vo, B.T.; Vo, B.N.; Arulampalam, S. Sensor control for multi-target tracking using Cauchy-Schwarz divergence. In Proceedings of the 2015 18th International Conference on Information Fusion (Fusion), Washington, DC, USA, 6–9 July 2015; pp. 937–944. [Google Scholar]
  31. Zhu, Y.; Liang, S.; Gong, M.; Yan, J. Decomposed POMDP Optimization-Based Sensor Management for Multi-Target Tracking in Passive Multi-Sensor Systems. IEEE Sens. J. 2022, 22, 3565–3578. [Google Scholar] [CrossRef]
  32. LeGrand, K.A.; Zhu, P.; Ferrari, S. A Random Finite Set Sensor Control Approach for Vision-based Multi-object Search-While-Tracking. In Proceedings of the 2021 IEEE 24th International Conference on Information Fusion (FUSION), Sun City, South Africa, 1–4 November 2021; pp. 1–8. [Google Scholar]
  33. Nguyen, H.V.; Vo, B.N.; Vo, B.T.; Rezatofighi, H.; Ranasinghe, D.C. Multi-Objective Multi-Agent Planning for Discovering and Tracking Multiple Mobile Objects. IEEE Trans. Signal Process. 2024, 72, 3669–3685. [Google Scholar] [CrossRef]
  34. Li, T.; Corchado, J.M.; Prieto, J. Convergence of Distributed Flooding and Its Application for Distributed Bayesian Filtering. IEEE Trans. Signal Inf. Process. Netw. 2017, 3, 580–591. [Google Scholar] [CrossRef]
  35. Chen, X.; Tharmarasa, R.; Pelletier, M.; Kirubarajan, T. Integrated Clutter Estimation and Target Tracking Using Poisson Point Processes. IEEE Trans. Aerosp. Electron. Syst. 2012, 48, 1210–1235. [Google Scholar] [CrossRef]
  36. Wei, J.; Luo, F.; Qi, J.; Luo, X. Distributed Fusion of Labeled Multi-Bernoulli Filters Based on Arithmetic Average. IEEE Signal Process. Lett. 2024, 31, 656–660. [Google Scholar] [CrossRef]
  37. Boström-Rost, P.; Axehill, D.; Hendeby, G. PMBM Filter with Partially Grid-Based Birth Model with Applications in Sensor Management. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 530–540. [Google Scholar] [CrossRef]
  38. Verma, J.K.; Chhabra, J.K.; Ranga, V. Track Consensus-Based Labeled Multi-Target Tracking in Mobile Distributed Sensor Network. IEEE Trans. Mob. Comput. 2024, 23, 7351–7362. [Google Scholar] [CrossRef]
  39. Chen, K.; Chai, L.; Yi, W. Multi-Sensor Control for Jointly Searching and Tracking Multi-Target Using the Poisson Multi-Bernoulli Mixture Filter. In Proceedings of the 2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS), Hanoi, Vietnam, 21–24 November 2022; pp. 240–247. [Google Scholar]
  40. Rahmathullah, A.S.; García-Fernández, Á.F.; Svensson, L. Generalized optimal sub-pattern assignment metric. In Proceedings of the 2017 20th International Conference on Information Fusion (Fusion), Xi’an, China, 10–13 July 2017. [Google Scholar]
  41. Beard, M.; Vo, B.T.; Vo, B.N. OSPA(2): Using the OSPA metric to evaluate multi-target tracking performance. In Proceedings of the 2017 International Conference on Control, Automation and Information Sciences (ICCAIS), Chiang Mai, Thailand, 31 October–1 November 2017; pp. 86–91. [Google Scholar]
Figure 1. An overview of the research core.
Figure 1. An overview of the research core.
Drones 08 00627 g001
Figure 2. Passive Multi-node Joint Optimization Control Flow.
Figure 2. Passive Multi-node Joint Optimization Control Flow.
Drones 08 00627 g002
Figure 3. Scenario diagram of multi-target tracking simulation.
Figure 3. Scenario diagram of multi-target tracking simulation.
Drones 08 00627 g003
Figure 4. Motion tracks of each node under control commands.
Figure 4. Motion tracks of each node under control commands.
Drones 08 00627 g004
Figure 5. GOSPA error profile of local filtering results and fusion results.
Figure 5. GOSPA error profile of local filtering results and fusion results.
Drones 08 00627 g005
Figure 6. OSPA(2) error plot for local filtering results and fusion results.
Figure 6. OSPA(2) error plot for local filtering results and fusion results.
Drones 08 00627 g006
Figure 7. Comparison of GOSPA errors of fusion results under different algorithms.
Figure 7. Comparison of GOSPA errors of fusion results under different algorithms.
Drones 08 00627 g007
Figure 8. Estimated target number of fusion results under different algorithms.
Figure 8. Estimated target number of fusion results under different algorithms.
Drones 08 00627 g008
Table 1. Initial state settings of simulated entities in the detection area.
Table 1. Initial state settings of simulated entities in the detection area.
Simulated Entity Initial   State x m ;   x ˙ m / s ;   y m ;   y ˙ m / s Survival   Duration   t b / s , t d / s
Target 1[8800; −80; 9600; −60][1, 55]
Target 2[7500; −100; −8000; 60][1, 65]
Target 3[−6000; −60; −8000; 80][21, 75]
Target 4[−5000; 60; 16,000; −80][21, 85]
Target 5[−4000; 100; −1000; −30][41, 95]
Target 6[4500; −120; 3500; −50][41, 101]
Emitter[0; 0; 0; 0]-
Receiver 1[10,000; 0; 10,000; −150]-
Receiver 2[10,000; −150; −10,000; 0]-
Receiver 3[−10,000; 0; −10,000; 150]-
Receiver 4[−10,000; 150; 10,000; 0]-
Table 2. Basic parameter settings of the drone node.
Table 2. Basic parameter settings of the drone node.
Simulation ParametersSpecific Values
Detection radius of the node FOV R FOV 5000 m
Probability of detection within node FOV P D 0.95
Target Track Confirmation Time m 3
clutter   rate   λ C 5
Target   survival   probability   P S 0.99
Target Extraction Threshold0.5
GM   pruning   threshold   T h GMp 10−5
GM merging threshold ThGMm10
Survival probability of newborn BCs E P n 0.01
BCs   pruning   threshold   T h p r u n e 10−5
BCs   merging   threshold   T h Merge 30
Table 3. Task execution of multiple nodes at different control moments.
Table 3. Task execution of multiple nodes at different control moments.
Control Moments/sReceiver 1Receiver 2Receiver 3Receiver 4
1S SSS
11TTSS
21TTTS
31TTTT
41TTTT
51TTTT
61TTTT
71TTTT
81TTTT
91TTTS
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Guan, X.; Lu, Y.; Ruan, L. Joint Optimization Control Algorithm for Passive Multi-Sensors on Drones for Multi-Target Tracking. Drones 2024, 8, 627. https://doi.org/10.3390/drones8110627

AMA Style

Guan X, Lu Y, Ruan L. Joint Optimization Control Algorithm for Passive Multi-Sensors on Drones for Multi-Target Tracking. Drones. 2024; 8(11):627. https://doi.org/10.3390/drones8110627

Chicago/Turabian Style

Guan, Xin, Yu Lu, and Lang Ruan. 2024. "Joint Optimization Control Algorithm for Passive Multi-Sensors on Drones for Multi-Target Tracking" Drones 8, no. 11: 627. https://doi.org/10.3390/drones8110627

APA Style

Guan, X., Lu, Y., & Ruan, L. (2024). Joint Optimization Control Algorithm for Passive Multi-Sensors on Drones for Multi-Target Tracking. Drones, 8(11), 627. https://doi.org/10.3390/drones8110627

Article Metrics

Back to TopTop