Multi-Objective Optimization Based Multi-Bernoulli Sensor Selection for Multi-Target Tracking

This paper presents a novel multi-objective optimization based sensor selection method for multi-target tracking in sensor networks. The multi-target states are modelled as multi-Bernoulli random finite sets and the multi-Bernoulli filter is used to propagate the multi-target posterior density. The proposed method is designed to select the sensor that provides the most reliable cardinality estimate, since more accurate cardinality estimate indicates more accurate target states. In the multi-Bernoulli filter, the updated multi-target density is a multi-Bernoulli random finite set formed by a union of legacy tracks and measurement-updated tracks. The legacy track and the measurement-updated track have different theoretical and physical meanings, and hence these two kinds of tracks are considered separately in the sensor management problem. Specifically, two objectives are considered: (1) maximizing the mean cardinality of the measurement-updated tracks, (2) minimizing the cardinality variance of the legacy tracks. Considering the conflicting objectives simultaneously is a multi-objective optimization problem. Tradeoff solutions between two conflicting objectives will be derived. Theoretical analysis and examples show that the proposed approach is effective and direct. The performance of the proposed method is demonstrated using two scenarios with different levels of observability of targets in the passive sensor network.


Introduction
Tracking of multi-target in sensor networks is an important but challenging task for radar, sonar and other surveillance systems [1]. Most traditional multi-target tracking formulations involve explicit associations between measurements and targets, based on the assumption that target states and models are known a priori. When it is difficult to know the true underlying model, joint Bayesian parameter estimation and model selection have been extensively studied within the Monte Carlo (MC) methodology [2,3]. However, due to communication and real-time constraints of the sensor network, the system may need to select a subset of sensors to report high quality target-related measurements [4,5]. Sensor selection (also named as sensor management) is inherently an optimal nonlinear control problem. Different from the classical control problem, it is embedded in highly complex multi-target systems. In multi-target systems, the number of targets and their individual states are random and unknown. Besides, the measurements are subject to missed detections and false alarms. Multi-target tracking under these circumstances is nontrivial and the difficulty is further compounded in the sensor selection problem [6].
In general, sensor selection is developed based on Bayesian multi-target filtering. Mahler's Finite set statistics (FISST) provides an elegant Bayesian filtering formulation based on random finite Sensors 2019, 19, 980 2 of 18 set (RFS) theory and has attracted a large amount of attention in recent years [7]. The probability hypothesis density (PHD) [8] and cardinalized PHD (CPHD) [9] filters are two popular approaches within FISST. They propagate moment approximations of the RFS of states in time and avoid the combinatorial problem arising from data association. Another type of FISST is the multi-target multi-Bernoulli (MeMBer) filter whose recursion propagates the approximate posterior multi-target density. However, the MeMBer filter was shown to overestimate the cardinality (number of targets). To solve this problem, a modified filter named Cardinality-Balanced MeMBer (CBMeMBer) was proposed in [10]. For these FISST filters, tractable solution methodologies can be provided by Sequential Monte Carlo (SMC) and Gaussian mixture implementations.
Several sensor-management solutions have been recently formulated within the FISST multi-target filtering. In general, an objective function is developed as the sensor-management criterion and is used to quantify the effect of each admissible command. A common approach is to measure the information gained via updating the predicted density and a typical objective function is the Rényi divergence between the predicted and updated density. In [11], the Rényi reward function was associated with the multi-target Bayes filter. However, this approach is computationally intractable even for a small number of targets. To increase the computation efficiency, the Rényi divergence was investigated within the PHD filtering scheme under the assumption of independent and identically distributed cluster [12]. Then, the work in [13] demonstrated that using the same Rényi reward function, the CBMeMBer filter based strategy performs better than the PHD filter based methods. However, the Rényi divergence function has no analytic solution and its computation needs MC sampling of the updated multi-target multi-Bernoulli distribution, resulting in great computational burden. An alternative information divergence measure is the Cauchy-Schwarz (CS) divergence [14]. An analytic expression for the CS divergence was developed as the reward function in [15,16]. However, the divergence is derived in some abstract mathematical sense but its practical interpretation is unclear. Due to the limitations of the divergence function mentioned above, it is desirable to manage the sensors by optimizing a task-driven objective function. In [17,18], the cost function was defined in terms of the cardinality variance, which can be computed analytically and enables an efficient sensor management. In [19], the authors proposed to minimize the dispersion of the multi-target posterior, quantified based on a multi-target estimation error metric. In presence of numerous sensors, [20] introduced a guided search to solve the multi-dimensional discrete optimization problem embedded in multi-sensor control.
This paper formulates the sensor management problem as a multi-objective optimization problem with two different objective functions, to improve the tracking performance while maintaining low computation cost. The CBMeMBer recursion is used as the multi-target filter because its SMC implementation accommodates non-linear non-Gaussian models and allows reliable and inexpensive extraction of state estimates. In contrast, the PHD and CPHD recursions require clustering to extract state estimates from the particle population, which is expensive and unreliable. Another advantage of the SMC implementation of the CBMeMBer recursion is that it allows the state-dependent probability of detection, which makes the filter applicable to more general detection profiles. At each time step, the multi-Bernoulli parameter set for the updated multi-target density is formed by the union of the multi-Bernoulli parameter sets for the legacy tracks and measurement-updated tracks. Nevertheless, the legacy track and the measurement-updated track are updated in different manners and have different physical meaning. To the best of the authors' knowledge, there is no literature considering the legacy tracks and measurement-updated tracks separately in solving the sensor management problem. In this paper, the objective functions are developed as the mean cardinality of the measurement-updated tracks and the cardinality variance of the legacy tracks, both of which can be computed analytically. Maximizing the mean cardinality of the measurement-updated tracks while minimizing the cardinality variance of the legacy tracks is conflicting and is modelled as a multi-objective optimization problem. The multi-objective problem gives a set of sensor selection solutions that reveal different trade-offs between conflicting objectives. The preferred solution-the one most desirable to the decision maker or the designer-is selected from the Pareto optimal set [21]. In the following, the proposed method is named as the mean-variance optimization (MVO) method. Simulation results for two challenging scenarios validate the advantages of the MVO method over the competitive methods.
The paper is organized as follows. In Section 2, the necessary background on multi-target Bayesian filtering is presented and the CBMeMBer recursion is briefly reviewed. Then, motivations and definitions of the proposed objective functions are presented and details of the multi-objective optimization procedure are given in Section 3, followed by numerical studies in Section 4. Finally, Section 5 concludes the paper.

Multitarget Bayesian Framework
An RFS is intuitively a random (spatial) point pattern, in which the number of points is random and the points themselves are random and unordered. In multi-target systems, not only does the number of targets vary randomly in time, but also the measurements are subject to missed detections and false alarms. This means that the set of target states and of measurements are both inherently RFSs. Suppose that at time k, there are n k target states x taking values in the observation space Z ⊆ R n z . The finite sets of targets and observations are treated as the multi-target state and multi-target observation, respectively where F (X ) and F (Z ) denote the space of finite subsets of X and Z, respectively. The multi-target filtering problem can be formulated in the Bayesian filtering framework. Let π k (X k |Z 1:k ) denote the multi-target posterior density at time k. The multi-target Bayes recursion propagates π k (X k |Z 1:k ) according to two main steps: time prediction and measurement update where f k|k−1 (·|·) is the multi-target transition density that encapsulates the underlying models of target motions, births and deaths; g k (·|·) is the multi-target likelihood that encapsulates the underlying models of detections and false alarms. Note that the integrals in (3) and (4) are not ordinary integrals, but are FISST set integrals, which have no analytic solution and computationally intractable in general.

Cardinality Balanced MeMBer filter
The CBMeMBer recursion is an approximation to the full multi-target Bayes recursion (3) and (4) using multi-Bernoulli RFSs [10]. A Bernoulli RFS X on X has probability 1 − r of being empty, and probability r of being a singleton distributed according to a probability density p (defined on X ). The cardinality distribution of the Bernoulli RFS X is a Bernoulli distribution with parameter r, while its probability density is [7] A multi-Bernoulli RFS is defined as the union of M independent Bernoulli RFSs X = ∪ M i=1 X (i) . The probability density is Thus, the multi-Bernoulli RFS can be completely denoted as . Then, the predicted density at time k is also a multi-Bernoulli and is given by are parameters of births at time k and {(r are used to express the survival ones which are computed as where the inner product p is the survival probability of x and f k|k−1 (·|·) is the single target transition density.
Let us denote the predicted density by the form π k|k−1 = {(r , then the posterior density can be approximated as follows: corresponds to the legacy tracks and {(r U,k (z), p U,k (·; z))} are the measurement-updated tracks. The existence probability and the spatial distribution are given by ψ k,z (x) = g k (z|x)p D,k (x) (16) where g k (·|x) is the single target measurement likelihood function, p D,k (x) is the state dependent probability of detection and κ k (·) is the intensity of the clutter. The SMC implementation of the CBMeMBer recursion is adopted in this paper. At time k, the probability density p (i) k is approximated by a set of particles where δ a (x) is the Dirac delta function concentrated at a, L (i) k−1 is the number of particles for ith Bernoulli component, and w (i,j) is the normalized weight of jth particle from ith Bernoulli component. To reduce the number of particles, at each time step hypothesized tracks with existence probabilities below a threshold P are discarded. For the remaining hypothesized tracks, a maximum of L max and minimum of L min particles are imposed on each hypothesized track. For more details of the SMC implementation, the reader is referred to [10].

Multi-Bernoulli Sensor Selection via Multi-Objective Optimization
As mentioned earlier, this paper focuses on solving the sensor selection problem in the context of choosing the best selection command among a finite number of admissible commands. We formulate this problem in the partially observed Markov decision process (POMDP) framework [22]. In a POMDP, the multi-target dynamics is modeled as a Markov process, but there is no direct access to current states and information can only be obtained from the past measurements [23]. The POMDP used in this paper is represented as where S denotes a finite set of sensors for selection and ϑ(X k−1 , s, X k ) is an objective function that measures a reward or cost for transition from X k−1 to X k given that the sensor s ∈ S is selected. The purpose of sensor selection is to find the optimal sensor * s which optimizes the statistical mean of the objective function ϑ(X k−1 , s, X k ) over all possible measurements [24]: * Note that the general formulation of a POMDP involves a p-step future decision process, whereas, in this paper we only consider a single step ahead (myopic) policy.

Objective Functions Proposal
Recall that the updated multi-target density is formed by two different sets of the multi-Bernoulli parameters that correspond to the legacy tracks [the first term in (11)] and the measurement-updated tracks [the second term in (11)]. Moreover, the legacy tracks and measurement-updated tracks are updated in different manners and have different physical meaning. Thus, the sensor selection in this paper is posed as a multi-objective optimization problem which consists of two objective functions corresponding to the legacy tracks and measurement-updated tracks, respectively. Assume that the updated multi-target density for sensor s at time k is a multi-Bernoulli RFS denoted as (20) where Z k (s) is the set of measurements collected by sensor s at time k. Motivations and definitions of the two objective functions are as follows.
The first objective function is the mean cardinality of the measurement-updated tracks. Maximizing of this function has the effect of maximizing the number of detected targets and also has the side-effect of improving the localization accuracy of such target as time goes on. At time k, the mean cardinality of the measurement-updated tracks for sensor s is defined as Some methods have been proposed for maximizing the number of detected targets in sensor management using the PHD and CPHD filters. In [25,26] the posterior expected number of targets was employed as a reward function, which is obtained by integrating the intensity density of the PHD filter.
Later, [27] shows that this reward function can be extended to the CPHD filter. Since the CPHD filter is more accurate than the PHD filter, the CPHD-based approach to sensor management produces more accurate results than the PHD-based approach. We use the CBMeMBer filter because it outweighs the PHD and CPHD filters in the SMC implementation.
On the other hand, the second objective function, the cardinality variance of the legacy tracks, aims to improve the accuracy of the number of legacy tracks. The cardinality variance of the legacy tracks for sensor s at time k is given by The cardinality variance is a measure of the uncertainty embedded in the estimated tracks. Therefore, we propose to use the statistical cardinality variance of the legacy tracks as the objective function, which tends to minimize the error in the number of the legacy tracks. Both the mean cardinality and the cardinality variance can be computed analytically and cheaply. Therefore, these two objective functions can provide an efficient solution for sensor management.
The objective functions are developed with parameters of the updated multi-Bernoulli distribution. However, the updated distribution depends on the measurement set, which is a function of the chosen sensor-management command. Theoretically, all possible measurement sets should be used to update the distribution, which is computationally expensive. To reduce the computational complexity, the predicted ideal measurement set (PIMS) approach [25] is used. First, the estimated number and states of targets are obtained from the predicted multi-target density. Then, a measurement is generated for each target under ideal conditions of no clutter, no measurement noise and perfect detection. The expectation term in (19) does not appear in the proposed objective functions, as the PIMS approach is used instead of sampling or averaging in measurement space.

Illustrative Examples
To distinguish the proposed objective functions from the mean cardinality of the posterior density and its variance, we also give the definitions and notations for the latter. At time k, the mean cardinality of the posterior density (given by (20)) and its variance for sensor s are defined as where N k|k (s) and σ 2 k|k (s) denote the mean cardinality and the cardinality variance of the posterior density, respectively. In the following, two indicative examples are used to illustrate why the measurement-updated tracks and legacy tracks should be considered separately, but not using the posterior density as a whole. In the first example, the mean cardinality of the posterior density, N k|k (s), is used as the objective function. In the second one, the cardinality variance of the posterior density, σ 2 k|k , is regarded as the objective function. Example 1: For sensors s 1 and s 2 , the existence probabilities of the measurement-updated tracks are (0.7, 0.7) and (0.5, 0.6), respectively. Besides, the existence probabilities of the legacy tracks are 0.1 and 0.5, respectively. For s 1 and s 2 , the mean cardinality of the posterior density is N k|k (s 1 ) = 0.7 + 0.7 + 0.1 = 1.5 and N k|k (s 2 ) = 0.5 + 0.6 + 0.5 = 1.6, respectively. Using the mean cardinality of the posterior density as the objective function, we should select s 2 , since s 2 returns a larger value of the objective function.
For s 1 and s 2 , the mean cardinality of the measurement-updated tracks are N U,k|k (s 1 ) = 0.7 + 0.7 = 1.4 and N U,k|k (s 2 ) = 0.5 + 0.6 = 1.1, respectively. Therefore, s 1 can detect more targets than s 2 . Furthermore, the variance of the legacy tracks for s 1 and s 2 are Thus, s 1 can provide more reliable number of the legacy tracks than s 2 . Based on the above analysis, it would be better to select s 1 . Using the mean cardinality of the posterior density as the objective function may not lead to meaningful choice of actions.
Example 2: For sensors s 1 and s 2 , the existence probabilities of the measurement-updated tracks are (0.1, 0.4) and (0.6, 0.9), respectively. Besides, the existence probabilities of the legacy tracks are (0.3, 0.6) and (0.7, 0.1), respectively. For s 1 and s 2 , the variance of the posterior density is σ 2 respectively. Using the variance of the posterior density as the objective function, we should select s 1 , since it returns a smaller value of the objective function.
For s 1 and s 2 , the mean cardinality of the measurement-updated tracks are N U,k|k (s 1 ) = 0.1 + 0.4 = 0.5 and N U,k|k (s 2 ) = 0.6 + 0.9 = 1.5, respectively. Therefore, s 2 can detect more targets than s 1 . Furthermore, the variance of the legacy tracks for s 1 and s 2 are 3, respectively. Thus, s 2 can provide more reliable number of the legacy tracks than s 1 . Based on the above analysis, it would be better to select s 2 .

Multi-Objective Optimization
This paper intends to account for both the mean cardinality of the measurement-updated tracks and the cardinality variance of the legacy tracks. Maximizing the mean cardinality N U,k|k (s) while minimizing the cardinality variance σ 2 L,k|k (s) is a multi-objective problem. To formulate the problem as a general optimization, we minus the mean cardinality −N U,k|k (s), translating the maximization into the minimization. At time k, the objectives for sensor s ∈ S are as follows Minimize The mathematical description of the multi-objective optimization problem is given as where F(s) is the objective vector and s is a decision variable within a parameter space S. The utopian solution is the solution that is optimal for all objectives. However, it does not generally exist since f 1 (s) and f 2 (s) can be conflicting. Classical approach usually converts the multi-objective problem into a single optimization problem by aggregating all objectives in a weighted function. This approach has several limitations, such as the requirement of a priori knowledge about the relative importance of the objectives and the solution may not be attainable unless the search space is convex [21].
In fact, there is a set of solutions for the multi-objective optimization problem, which represent different trade-offs between the objectives. A solution s 1 is said to Pareto-dominate the solution s 2 in a minimization context, if and only if [21] ∀i A solution is Pareto optimal if and only if there does not exist another solution that dominates it. The set of all Pareto optimal solutions is called the Pareto optimal set. The corresponding objective vectors form the Pareto front.
It is necessary to select one particular solution from the Pareto-front which can yield the desired trade-off between the conflicting objectives. We take a simple strategy and use the Euclidean distance to find the compromise solution [28]. The utopia point F * of the multi-objective problem is defined as where f * 1 and f * 2 are the minima of objectives f 1 (s) and f 2 (s), respectively.
The point in the Pareto optimal set which is closest to the utopia point is regarded as the compromise solution. The shortest Euclidean distance of a point F(s) = f 1 (s), f 2 (s) T from the utopia point can be determined as follows In this paper, the number of sensors to be selected is assumed known a priori and hence the generation of the Pareto front is straightforward. Figure 1 gives an illustration of the multi-objective problem, where the number of all possible solutions is 10 and 3 solutions are Pareto optimal.
A solution is Pareto optimal if and only if there does not exist another solution that dominates it. The set of all Pareto optimal solutions is called the Pareto optimal set. The corresponding objective vectors form the Pareto front.
It is necessary to select one particular solution from the Pareto-front which can yield the desired trade-off between the conflicting objectives. We take a simple strategy and use the Euclidean distance to find the compromise solution [28]. The utopia point * F of the multi-objective problem is defined where In this paper, the number of sensors to be selected is assumed known a priori and hence the generation of the Pareto front is straightforward. Figure 1 gives an illustration of the multi-objective problem, where the number of all possible solutions is 10 and 3 solutions are Pareto optimal.

Simulations
In this section, we demonstrate the performance of the proposed MVO method with two scenarios and compare it with the Rényi divergence based method of [13] and the cardinality variance strategy reported in [17]. Both scenarios are studied in a passive sensor network whose structure is borrowed from [29]. In the network, one transmitter and ten receivers are placed in the x-y plane as shown in Figure 2a

Simulations
In this section, we demonstrate the performance of the proposed MVO method with two scenarios and compare it with the Rényi divergence based method of [13] and the cardinality variance strategy reported in [17]. Both scenarios are studied in a passive sensor network whose structure is borrowed from [29]. In the network, one transmitter and ten receivers are placed in the x-y plane as shown in Figure 2a. During the target tracking process, the receivers are selected automatically at each time step. All receivers j = 1, 2, . . . , 10 are characterized by identical measurement noise and the sampling interval is fixed to T s = 10 s. The probability of detection is modeled as follows [29] p (j) where p k = [x k , y k ] T is the target position, r j is the position of receiver j, d k,j = p k − r j is the distance between the target and the receiver, and φ(d; α, β) = d −∞ N (v; α, β)dv is the Gaussian cumulative distribution function with α = 15 km and β = (3 km) 2 . Figure 2b plots the probability of detection as a function of the distance between the receiver and the target. Note that with the increase of the distance from the receiver, the probability of detection of the passive sensor decreases quickly [30]. The poor performance of the probability of detection makes this a difficult problem of sensor management for target tracking, in which many state-of-the-art techniques would fail.  Figure 2b plots the probability of detection as a function of the distance between the receiver and the target. Note that with the increase of the distance from the receiver, the probability of detection of the passive sensor decreases quickly [30]. The poor performance of the probability of detection makes this a difficult problem of sensor management for target tracking, in which many state-of-the-art techniques would fail. Two challenging scenarios are studied in the passive sensor network. In Scenario 1, a time varying number of targets turn through the scenario and the motion model is complex. The bistatic range (transmitter-target-receiver) and bearing measurements are used to guarantee full observability of the targets. In this case, the measurement is informative and hence only one receiver is selected at each time step. Scenario 2 is used to compare the performance of MVO method with other sensor management methods using the bistatic range only measurement. In this case, more than one receiver is selected at each time step to avoid the "ghost particles" caused by the high nonlinearity of the measurement model. For the selected receives, the posterior density is updated sequentially. For more details of the multi-Bernoulli filter with sequential update scheme, the reader is referred to [31]. The geometry of a single transmitter-target-receiver pair is illustrated in Figure 3 Two challenging scenarios are studied in the passive sensor network. In Scenario 1, a time varying number of targets turn through the scenario and the motion model is complex. The bistatic range (transmitter-target-receiver) and bearing measurements are used to guarantee full observability of the targets. In this case, the measurement is informative and hence only one receiver is selected at each time step. Scenario 2 is used to compare the performance of MVO method with other sensor management methods using the bistatic range only measurement. In this case, more than one receiver is selected at each time step to avoid the "ghost particles" caused by the high nonlinearity of the measurement model. For the selected receives, the posterior density is updated sequentially. For more details of the multi-Bernoulli filter with sequential update scheme, the reader is referred to [31]. The geometry of a single transmitter-target-receiver pair is illustrated in Figure 3, where the locations of the transmitter, the receiver and the target are (x T , y T ), (x R , y R ) and (x, y), respectively.

Scenario 1
In this scenario, the non-linear bearing and bistatic range measurements are used to track a time varying number of targets. A nearly constant turn model is considered. The target state is denoted as comprises the target position and velocity and k ω is the turn rate. The state transition model is  Figure 3. The geometry of a single transmitter-target-receiver pair. Locations of the transmitter and the receiver are shown with the red star and blue square, respectively. The equidistance curve can be plotted using the bistatic range ρ = R t + R r and the target position is further determined using the bearing ϕ r .
At each time step, L max = 1000 and L min = 300 particles per Bernoulli component is imposed. Pruning of Bernoulli component is performed with a threshold of P = 10e − 3. The survival probability is fixed to p s = 0.99. The birth process is a multi-Bernoulli RFS and all probabilities of existence are chosen at r Γ = 0.02. The tracking performance is evaluated using the optimal sub-pattern assignment (OSPA) error distance [32]. If X = {x 1 , x 2 , · · · , x n }, Y = {y 1 , y 2 , · · · , y m }, and n ≥ m, the OSPA distance is defined as where d (c) (x, y) := min(c, x − y ), c > 0 is the cut-off parameter, p ≥ 1 is an order parameter, and π is a permutation function in the set of permutations Π n . If n < m, d OSPA p,c (X, Y) := d OSPA p,c (Y, X). For each scenario, the average error performances are obtained over 100 MC trials. All experiments are tested in Matlab R2010a and implemented on a computer with a 3.40 GHz processor.

Scenario 1
In this scenario, the non-linear bearing and bistatic range measurements are used to track a time varying number of targets. A nearly constant turn model is considered. The target state is denoted as x k , y k , .
y k ] T comprises the target position and velocity and ω k is the turn rate. The state transition model is w k−1 ∼ N (·; 0, P) is a 2 × 1 independent and identically distributed Gaussian process noise vector with covariance P = σ 2 w I 2 , where σ w = 0.01 m/s 2 is the standard deviation of the target acceleration. A total of three targets appear in the surveillance area and the true trajectories of targets are shown in Figure 4, in which Target 1 is born at k = 1, Target 2 is born at k = 10 and Target 3 is born at k = 20.
If the target is detected by the receiver j located at r j = [x j R , y j R ] T , then the measurement is a noisy bearing and bistatic range vector given by , σ ϕ = (π/180) rad, and σ ρ = 5 m.
× independent and identically distributed Gaussian process noise vector with covariance x y = r , then the measurement is a noisy bearing and bistatic range vector given by  (giving an average of 2 clutter points per scan). To evaluate the performance of the MVO method, we compare it with the cardinality variance based method [17] and the Rényi detergency based method [13]. The Rényi detergency measures information gain between two densities and its parameter α tunes the emphasis on each of the two densities. In the simulations, the Rényi detergency based method is evaluated with 1 α → and 0.5 α = , and the Rényi divergence becomes the Kullback-Leibler divergence and the Hellinger affinity, respectively [1]. To make the comparisons more meaningful, the PIMS strategy is used in all of the methods. For the CBMeMBer filter, the birth process is a multi- x and . y. The clutter measurements for each receiver are distributed uniformly over the region [−π, π] rad × [0, 15000] m. The number of clutter measurements per scan is assumed to be Poisson distributed with the clutter intensity λ c = 2 × 10 −5 (radm) −1 (giving an average of 2 clutter points per scan). To evaluate the performance of the MVO method, we compare it with the cardinality variance based method [17] and the Rényi detergency based method [13]. The Rényi detergency measures information gain between two densities and its parameter α tunes the emphasis on each of the two densities. In the simulations, the Rényi detergency based method is evaluated with α → 1 and α = 0.5, and the Rényi divergence becomes the Kullback-Leibler divergence and the Hellinger affinity, respectively [1]. To make the comparisons more meaningful, the PIMS strategy is used in all of the methods.
The MC averages of the OSPA distance (for p = 1, c = 300 m) and the cardinality estimate are given in Figure 5. It can be observed that the MVO method performs better than the Rényi divergence based method. This is mainly because the cost function of the MVO method provides a better estimate of the target number and also tend to improve the localization accuracy of the detected targets. Since the measurements used to select the sensors are created in the "ideal" manner in PIMS, the Rényi detergency with α = 0.5 provides the maximum discrimination between the two similar densities [1]. Thus, in our case, the Rényi detergency based method with α = 0.5 performs than it with α → 1 . Using only the cardinality variance as the cost function, it is more easily to lose targets when the probability of detection is not satisfactory, as shown in Figure 5b. The cost function of the MVO method includes both the mean cardinality and the cardinality variance which can enhance the performance of sensor management. Thus, the MVO method outperforms the method reported in [17] in terms of the OSPA error.
The average run time for a complete MC simulation are 77.73 s (Rényi divergence based method with α → 1 ), 121.46 s (Rényi divergence based method with α = 0.5), 67.33 s (cardinality variance based method), and 70.96 s (MVO method). The Rényi divergence based method is more time-consuming than other methods, because it has no analytic solution in general and requires sampling on the multi-target state space. The MVO method is slower than the cardinality variance based method. Since the mean cardinality is not considered in the cardinality variance based method, the miss-detections occur often when the probability of detection is not satisfactory, as seen in Figure 5b. Therefore, the cardinality variance based method works more efficiently but performs worse than the MVO method. The MC averages of the OSPA distance (for p = 1, c = 300 m) and the cardinality estimate are given in Figure 5. It can be observed that the MVO method performs better than the Rényi divergence based method. This is mainly because the cost function of the MVO method provides a better estimate of the target number and also tend to improve the localization accuracy of the detected targets. Since the measurements used to select the sensors are created in the ''ideal'' manner in PIMS, the Rényi detergency with 0.5 α = provides the maximum discrimination between the two similar densities [1]. Thus, in our case, the Rényi detergency based method with 0.5 α = performs than it with 1 α → . Using only the cardinality variance as the cost function, it is more easily to lose targets when the probability of detection is not satisfactory, as shown in Figure 5b. The cost function of the MVO method includes both the mean cardinality and the cardinality variance which can enhance the performance of sensor management. Thus, the MVO method outperforms the method reported in [17] in terms of the OSPA error.
The average run time for a complete MC simulation are 77.73 s (Rényi divergence based method with 1 α → ), 121.46 s (Rényi divergence based method with 0.5 α = ), 67.33 s (cardinality variance based method), and 70.96 s (MVO method). The Rényi divergence based method is more timeconsuming than other methods, because it has no analytic solution in general and requires sampling on the multi-target state space. The MVO method is slower than the cardinality variance based method. Since the mean cardinality is not considered in the cardinality variance based method, the miss-detections occur often when the probability of detection is not satisfactory, as seen in Figure 5b. Therefore, the cardinality variance based method works more efficiently but performs worse than the MVO method. In Figure 6, the time averaged OSPA distances over the duration of the scenario for the Rényi detergency based method, the cardinality variance based method and the MVO method are shown against clutter intensities from . As expected, the missdistances increase with higher clutter intensities. Moreover, with the increase of the clutter intensity, the performance of the cardinality variance based method degrades more quickly than other methods. This is primarily because the high clutter intensity and undesirable probability of detection together would lead to large cardinality errors of the cardinality variance based method. In contrast, the performance of the MVO method is stable and better than other methods under consideration.

Scenario 2
Instead of bearing and bistatic range measurements, we now consider bistatic range only measurements and assume no target birth or death. The target state variable is a vector of target In Figure 6, the time averaged OSPA distances over the duration of the scenario for the Rényi detergency based method, the cardinality variance based method and the MVO method are shown against clutter intensities from λ c = 0 (radm) −1 to λ c = 8 × 10 −5 (radm) −1 . As expected, the miss-distances increase with higher clutter intensities. Moreover, with the increase of the clutter intensity, the performance of the cardinality variance based method degrades more quickly than other methods. This is primarily because the high clutter intensity and undesirable probability of detection together would lead to large cardinality errors of the cardinality variance based method. In contrast, the performance of the MVO method is stable and better than other methods under consideration. In Figure 6, the time averaged OSPA distances over the duration of the scenario for the Rényi detergency based method, the cardinality variance based method and the MVO method are shown against clutter intensities from . As expected, the missdistances increase with higher clutter intensities. Moreover, with the increase of the clutter intensity, the performance of the cardinality variance based method degrades more quickly than other methods. This is primarily because the high clutter intensity and undesirable probability of detection together would lead to large cardinality errors of the cardinality variance based method. In contrast, the performance of the MVO method is stable and better than other methods under consideration.

Scenario 2
Instead of bearing and bistatic range measurements, we now consider bistatic range only measurements and assume no target birth or death. The target state variable is a vector of target

Scenario 2
Instead of bearing and bistatic range measurements, we now consider bistatic range only measurements and assume no target birth or death. The target state variable is a vector of target position and velocity and is denoted as T . The single-target transition model is linear Gaussian specified by with the transition matrix The process noise u k is zero-mean Gaussian distributed with covariance where σ u = 0.01 m/s 2 is the standard deviation of the process noise. The true trajectories are shown in Figure 7.  The birth multi-Bernoulli RFS is denoted as  . The clutter measurements for each receiver are distributed uniformly over the region [0, 15000] m with the clutter intensity λ c = 1.33 × 10 −4 (m) −1 (giving an average of 2 clutter points per scan). All other parameters are the same as those in the previous scenario. The bistatic range only measurements are much less informative, and result in a lower target "observability'. If only one receiver is selected at each time step and the same receiver is selected for some consecutive time steps, the remaining particles will not only from target but also include "ghost particles" and these "ghost particles" occupy a major part of overall particles until they are severely biased from real target state. As a result, the performances of all sensor management methods can be worse. To solve this problem, more than one receiver is selected at each time step and the posterior densities of the selected receivers are updated sequentially. For simplify, the number of receivers being selected at each time step is denoted by S.
We verify the performance of the MVO method using different S. Figure 8 shows the averaged OSPA distances (with parameters p = 1, c = 100 m) of the MVO method, the cardinality variance based method and the Rényi detergency based method for S = 2, 3. Due to the lack of observability of the full states, it is difficult to resolve the targets. To avoid this problem and track the multiple targets, the birth multi-Bernoulli RFSs are generated using the initial target states. Therefore, it can be seen that the OSPA distances of all methods are on a rising trend. Figure 8 demonstrates that, in accordance with our intuition, using more receivers at each time step k can effectively reduce the OSPA distances.
Moreover, it appears that the MVO method outperforms other methods. Compared with Scenario 1, the cardinality variance based method performs better in Scenario 2. Thus, the "observability" of the targets and the target motions can be crucial factors in determining the performance of the sensor management method. only one receiver is selected at each time step and the same receiver is selected for some consecutive time steps, the remaining particles will not only from target but also include "ghost particles" and these "ghost particles" occupy a major part of overall particles until they are severely biased from real target state. As a result, the performances of all sensor management methods can be worse. To solve this problem, more than one receiver is selected at each time step and the posterior densities of the selected receivers are updated sequentially. For simplify, the number of receivers being selected at each time step is denoted by S. We verify the performance of the MVO method using different S. Figure 8 shows the averaged OSPA distances (with parameters p = 1, c = 100 m) of the MVO method, the cardinality variance based method and the Rényi detergency based method for S = 2, 3. Due to the lack of observability of the full states, it is difficult to resolve the targets. To avoid this problem and track the multiple targets, the birth multi-Bernoulli RFSs are generated using the initial target states. Therefore, it can be seen that the OSPA distances of all methods are on a rising trend. Figure 8 demonstrates that, in accordance with our intuition, using more receivers at each time step k can effectively reduce the OSPA distances. Moreover, it appears that the MVO method outperforms other methods. Compared with Scenario 1, the cardinality variance based method performs better in Scenario 2. Thus, the ''observability'' of the targets and the target motions can be crucial factors in determining the performance of the sensor management method.

Conclusions
A sensor-selection solution has been proposed to be employed within a multi-Bernoulli multitarget filter. At each step, the next sensor-selection command is chosen by solving a multi-objective problem. It's easy to see that each objective function can be computed analytically, enabling a principled and efficient solution for the sensor selection problem. The performance of the proposed method was demonstrated in two challenging scenarios where a passive sensor network with low detection probability is used to track targets with different levels of observability. Simulation results demonstrated that the proposed sensor selection method performs better than the competing methods. Future work will apply the proposed multi-objective methodology to other more advanced filters such as the generalized multi-Bernoulli filter.
Author Contributions: Yun Zhu developed the algorithm, conducted the experiments and participated in writing the paper. Jun Wang supervised the overall work and reviewed the paper. Shuang Liang contributed to the theoretical analysis of the algorithm.

Conclusions
A sensor-selection solution has been proposed to be employed within a multi-Bernoulli multi-target filter. At each step, the next sensor-selection command is chosen by solving a multi-objective problem. It's easy to see that each objective function can be computed analytically, enabling a principled and efficient solution for the sensor selection problem. The performance of the proposed method was demonstrated in two challenging scenarios where a passive sensor network with low detection probability is used to track targets with different levels of observability. Simulation results demonstrated that the proposed sensor selection method performs better than the competing methods. Future work will apply the proposed multi-objective methodology to other more advanced filters such as the generalized multi-Bernoulli filter.
Author Contributions: Y.Z. developed the algorithm, conducted the experiments and participated in writing the paper. J.W. supervised the overall work and reviewed the paper. S.L. contributed to the theoretical analysis of the algorithm.