Transmit Antenna Selection and Power Allocation for Joint Multi-Target Localization and Discrimination in MIMO Radar with Distributed Antennas under Deception Jamming

: In this paper, with the aim of performing joint multi-target localization and discrimination tasks, a performance-driven resource allocation scheme is proposed. In the ﬁrst, by establishing the signal model under deception jamming and utilizing the maximum likelihood (ML) estimator, the estimation information of targets can be obtained. Secondly, the Cramer–Rao lower bound (CRLB) for the transmit antenna selection and power allocation is derived. Then, to fully utilize the difference in spatial distribution between true and false targets, a false target discriminator based on the CRLB of the distance deception parameter is utilized. By introducing the nondimensionalization mechanism, we build an optimal objective function of target localization error and discrimination probability. Subsequently, a joint multi-target localization and discrimination optimization model has been established, which is mathematically a non-smooth and non-convex problem. By introducing an auxiliary variable, we propose a three-step solution strategy for solving this problem. Simulation results demonstrate that the proposed algorithm can improve the performance of joint localization accuracy and discrimination ability (JLADA) by more than 30% compared with the algorithms only for localization or discrimination. Meanwhile, by utilizing the proposed algorithm, the composite indicators of JLADA can decrease more than 70% compared with the uniform allocation scheme.


Introduction
By combining information from multiple nodes, the multiple radar system (MRS) can fully benefit from the advantages of multi-angle observation and increased area coverage, enabling stronger ability in detecting and locating targets for defense purposes [1].As a typical representative of the MRS, based on the "defocused transmit and focused receive" (DTFR) mode [2], the distributed multiple-input multiple-output (MIMO) radar system has high spatial diversity gain, structure diversity gain [3], polarization diversity gain [4] and waveform diversity gain [5].In theory, although the MIMO radar system has superior detection and parameter estimation capabilities, the physical resources and the hardware resources in the system are often finite, which becomes the main obstacle that limits the potential of MIMO radar.Generally speaking, when more antennas and higher transmit power budget are involved in the fixed radar system, the better performance of detection and parameter estimation can be obtained [6].However, too many active antennas require a lot of data transmission and can cause a heavy computational burden to the fusion center.Furthermore, the most radar systems can only provide finite power resource due to hardware limitations.Therefore, in order to increase the potential of the distributed MIMO radar system, the antenna selection or power allocation problem has been studied in [3,[5][6][7][8][9][10][11][12][13].
Remote Sens. 2022, 14, 3904 2 of 17 These studies seem to be fruitful for the resource allocation in MIMO radar, however, to the best of our knowledge, the resource allocation problem of simultaneously multiple tasks application under deception jamming has not been studied.Moreover, the powerful multi-direction observation ability and parameter identification ability of the distributed MIMO radar can bring strong anti-active jamming capability.In this case, it is of practical significance to study the resource allocation problem of the distributed MIMO radar under deception jamming.
So far, the resource-aware design for distributed MIMO radar has been studied in numerous literatures.Aiming at exploiting the available resources to improve radar capability for different tasks, the resource-aware design can be generally classified into four categories: target detection [7,8], target localization [5,[9][10][11], target tracking [3,6,12] and target imaging [13].The first category is aimed at improving the detection performance of the distributed MIMO radar system.Based on the Neyman-Pearson detector [7], studies the joint antenna placement and power allocation problem by using a waterfilling-type algorithm at the output of detector.Moreover, by introducing relative entropy, the joint transmit antenna selection and illumination time allocation have been studied in [8].In the second category, the power allocation problem for target localization in the distributed hybrid noncoherent active-passive radar networks based on the radio frequency stealth background has been studied in [5].The power allocation problem for single target localization has been studied in [9,10], which address the multi-target localization scenarios, and further propose the joint power and bandwidth allocation scheme.To improve resource utilization efficiency for target localization, an optimization scheme integrating power allocation, bandwidth allocation, and radar node selection has been established in [11].In the third category, with the objective of improving the worst tracking accuracy with multiple targets [3], proposes a receive-beams assignment scheme for multi-target tracking.Another study [6] proposes a standard joint subarray selection and power allocation scheme for tracking multiple targets in the large-scale distributed MIMO radar networks under clutter environments.By deriving the predicted conditional Cramer-Rao lower bound (CRLB), a joint node selection and power allocation scheme are developed in [12].In the target imaging category, the heuristic multi-resource allocation algorithm for imaging problems has been proposed in [13].
In the above literature, the optimization models are established under different task scenarios, and corresponding solving strategies are proposed.In general, it can be seen from the above literatures that the resource management problem in the distributed MIMO radar is usually multidimensional and non-convex, and it is difficult to obtain a global optimal solution even in the single task scenarios.Additionally, it should be noted that most of the existing literature focuses on resource allocation in the execution process of a certain task, while the multi-task cases are rarely involved.However, in military operations, the radar system is often required to perform multiple functions simultaneously.Evidently, in the multi-task collaborative scenarios, an increase in task complexity will increase the difficulty of solving the optimization model.In addition, previous studies on resource management have typically been conducted under ideal electromagnetic conditions, while there has been very limited research on jamming environments.Generally speaking, when target echoes are accompanied by deception jamming, the authenticity of the target needs to be discriminated.Since the discriminator must be built before resource allocation tasks can be performed, the previous resource-aware design strategies cannot be directly applicated.However, to the best of our knowledge, studies on resource-aware designs of anti-deception jamming are quite limited, and a resource-aware design in multi-task cases under deception jamming has not been found.
In this paper, we propose a transmit antenna selection and power allocation scheme for joint multi-target localization and discrimination in the MIMO radar with distributed antennas under range deception jamming.In this scheme, we firstly establish the signal model under range deception jamming and derive the Cramer-Rao lower bound (CRLB) [14] of target position and deception distance.Then, by calculating the CRLB of deception distance parameter, we build a false target discriminator based on the Chi-square test.After that, by adopting the nondimensionalization mechanism, we establish a transmit antenna selection and power allocation optimization model for multi-target localization and discrimination.Finally, since the formulated optimization model is non-smooth and non-convex, we propose a three-step solution method based on the convex relaxation technique and the particle swarm optimization (PSO) algorithm to obtain the effective suboptimal solution of the original problem.
The main contributions are summarized as follows.
(1) The optimization model of joint multi-target localization and discrimination in the distributed MIMO radar is established.At first, a false target discriminator based on probability is constructed by using the CRLB of range deceptive parameter estimation.Then, combined with a nondimensionalization mechanism, localization accuracy and discrimination probability (DP) are de-dimensionalized and normalized to simplify the optimization problem.Finally, the optimization model of joint multitarget localization discrimination is established by introducing two task assignment parameters.In this case, the original multi-objective optimization problem is transformed into a single objective optimization problem, which reduces the difficulty of the solving process.(2) An effective three-step solving algorithm which combines the relaxation technique and the sorting algorithm is proposed for solving the optimization model.Since the formulated optimization model is non-convex and non-smooth, it is hard to find a global solution.The proposed solving algorithm relaxes the original problem by taking the product of transmit antenna selection variable and the corresponding power allocation result as an auxiliary variable.Furthermore, by adopting the sorting algorithm and the particle swarm optimization (PSO) algorithm, we obtain the final resource allocation results.(3) A unified resource allocation mechanism in the distributed MIMO radar under deception jamming is developed.Considering the range deception jamming environment in the mission region, we establish the system model under deception jamming and derive the CRLB for range deceptive jamming parameter estimation.In this case, an effective technique for solving radar resource management under deception jamming environment is formulated.
The paper is organized as follows.The data processing mechanism is described in Section 2. The derivation of CRLB is presented in Section 3. In Section 4, we introduce a false target discriminator and subsequently formulate the joint multi-target localization and discrimination model.Then, a three-step-based solving algorithm is given.Moreover, the experiments and results are presented in Section 5. Section 6 concludes this paper.

Data Processing Mechanism
Assume that the entire region of interest (ROI) exists with Q ≥ 2 targets, and each target is widely separated.We consider that a narrowband MIMO radar with distributed antennas consists of M transmit antennas and N receive antennas, and is located in Cartesian 2-D space.Denote the entire sets of transmit antennas and receive antennas by sets M = {1, 2, . . . ,M} and N = {1, 2, . . . ,N}, respectively.The mth transmit antenna and the nth receive antenna are located at (x t m , y t m ) and (x r n , y r n ), where m ∈ M and n ∈ N .Let µ q = [x q , y q ] T ∈ R 2 denote the real position of target q, where q ∈ Q = {1, . . . ,Q}.Then, suppose that the targets are sufficiently dispersed, and only one target exists within each transmit beam.Hence, each transmit antenna can be used to illuminate only one target.

Signal Model
Each antenna transmits the orthogonal frequency-division multiplexing (OFDM) pulse signal [15], with a normalized equivalent s m (t), which satisfies that [9]: where ∀m ∈ M, ∀m ∈ M and the term of (•) H denotes the conjugate transpose operator.
To counter radar detection, the self-defense jammer equipped on the real target implements jamming by delaying and retransmitting transmit signals [16].For the qth target which is illuminated by the mth transmit antenna, the false target can be constructed by introducing the deceptive distance ∆d q .Hence, the baseband representation of the received signal reflected by the qth target via the (m, n) path can be expressed as [17]: where u t m,q is a binary variable and is defined as: 1, if the mth transmit antenna is selected to illuminate the qth target 0, else , P t m denotes the transmit power from the mth transmit antenna.The term of γ m,n,q ∝ 1/(R t m,q R r n,q ) 2 represents the attenuation in the signal strength due to the bistatic path loss effects.R t m,q and R r n,q denote the range from the mth transmit antenna to the qth target and the range from the nth receive antenna to the qth target, respectively.Moreover, R t m,q and R r n,q are given by: Herein, h m,n,q is modeled as a known complex gain of target reflectivity.The term of w m,n (t) is the zero-mean white complex noise, which satisfies that w m,n (t) ∼ CN(0, σ 2 w .τ J m,n,q is the superposition of the real target time-delay and the active deception time-delay, given by: where the term of c represents the speed of light.It should be noted that a real target is detected when ∆d q = 0, while a false target is detected when ∆d q = 0. Therefore, for the same illuminated target, the spatial resolution cells (SRCs) of jamming signals and the real target echoes could be mixed in space, which is demonstrated in Figure 1.In summary, based on the above assumptions and analysis, the intuitive process of multi-target detection in the presence of the self-defense range deception jamming signals can be shown in Figure 2.

Parameter Estimation
After obtaining the target echoes, it is necessary to extract the target measurement information by the parameter estimation method.Herein, we adopt the maximum likelihood (ML) estimation method to estimate the target parameters.Assume that all the targets are sufficiently dispersed in space, and each transmit beam only covers one target.In this case, multi-target detection problem can be converted into a series of independent single target detection problems.After signal processing and matching filtering, we can obtain an MN × 1 sampling matrix of all the receive signal from the qth target, which is given by r q = [r 1,1,q , r 2,1,q , . . ., r m,n,q , . . ., r M,N,q ] T .In summary, based on the above assumptions and analysis, the intuitive process of multi-target detection in the presence of the self-defense range deception jamming signals can be shown in Figure 2.
Intuitive process of multi-target detection under self-defense range deception jamming.

Parameter Estimation
After obtaining the target echoes, it is necessary to extract the target measurement information by the parameter estimation method.Herein, we adopt the maximum likelihood (ML) estimation method to estimate the target parameters.Assume that all the targets are sufficiently dispersed in space, and each transmit beam only covers one target.In this case, multi-target detection problem can be converted into a series of independent single target detection problems.After signal processing and matching filtering, we can obtain an 1 MN × sampling matrix of all the receive signal from the qth target, which is given by ,..., ,..., According to the receive signal model, the conditional probability density function (PDF) ( | )   q q p μ r could be calculated as: Then, the ML estimator for q μ can be calculated by: Remote Sens. 2022, 14, 3904 5 of 18 In summary, based on the above assumptions and analysis, the intuitive process of multi-target detection in the presence of the self-defense range deception jamming signals can be shown in Figure 2.
Intuitive process of multi-target detection under self-defense range deception jamming.

Parameter Estimation
After obtaining the target echoes, it is necessary to extract the target measurement information by the parameter estimation method.Herein, we adopt the maximum likelihood (ML) estimation method to estimate the target parameters.Assume that all the targets are sufficiently dispersed in space, and each transmit beam only covers one target.In this case, multi-target detection problem can be converted into a series of independent single target detection problems.After signal processing and matching filtering, we can obtain an 1 MN × sampling matrix of all the receive signal from the qth target, which is given by ,..., ,..., According to the receive signal model, the conditional probability density function (PDF) ( | )   q q p μ r could be calculated as: Then, the ML estimator for q μ can be calculated by: According to the receive signal model, the conditional probability density function (PDF) p(r q µ q ) could be calculated as: Then, the ML estimator for µ q can be calculated by: Therefore, the location of target q can be estimated by (7).For a MIMO radar system, since a closed-form solution for ( 7) is not available [3], a numerical search method is required.Here, we utilize a low-complexity approximate ML estimator to obtain the exact solution to (7); the details are shown in [18].

Derivation of Estimation Performance Metric
Given any unbiased estimator, the CRLB can provide a tight lower bound, and has been proven to be very close to the target state estimation error on the high signal/noise ratio (SNR) condition [19].In this section, we derive the joint CRLB of the target position integrated with the corresponding deceptive distance parameter.
Even though μq ML can be obtained by the ML estimation method, the estimated position vector μq could be inaccurate in the presence of the self-defense range deception jamming (∆d q = 0).Moreover, in practice, this phenomenon of the mixed SRCs in Figure 1 might be interpreted as the inaccurate DOA estimation problem caused by the radar itself, e.g., receiving beamwidth and angle measuring accuracy error.In this case, it is important to evaluate the existence of deception jamming signals by estimating the deceptive distance parameter ∆d q .According to (2) and ( 5), we define an extended location state as η q = [x q , y q , ∆d q ] T .In this case, the unbiased estimate of η q satisfies that [20]: where J(η q ) denotes the fisher information matrix (FIM), whose inverse is the CRLB, and J(η q ) can be expressed as [11]: where p(r q |η q ) represents the conditional PDF with respect to r q under condition η q .
According to (6), it can be known that p(r q |η q ) is both an explicit function of τ J m,n,q and an implicit function of η q , where m ∈ M and n ∈ N .Then, a vector is defined as Based on the chain rule, J(η q ) can be rewritten as [21]: Since the derive process of J(τ J q ) can be seen in [9], it is not repeated for simplicity.The Jacobian matrix Γ q is given by: where a t m,q = (x t m − x q )/R t m,q , b t m,q = (y t m − y q )/R t m,q , a r n,q = (x r n − x q )/R r n,q , and b r n,q = (y r n − y q )/R r n,q .Combined with ( 8)- (11), the CRLB for the qth target is expressed as: where u t q = [u t 1,q , u t 2,q , . . ., u t M,q ] T , P t = [P t 1 , P t 2 , . . ., P t M ] T , and Ψ m,q is a third-order square matrix.All the elements in matrix Ψ m,q are expressed as Herein, β m denotes the effective bandwidth of the transmit signal s m (t).From ( 12), it should be noted that all the elements in C q CRLB are inversely proportional to transmit power P t .Moreover, since (C x q + σ 2 y q , where σ 2 x q and σ 2 y q represent the mean square errors (MSEs) of target q for the position estimator on the X-direction and the Y-direction.After some additional matrix manipulations, the MSE of target q for locating estimator is bounded below: Herein, Λ q = u t q P t , where the term of denotes the Hadamard product operator.

Optimization Model and Solution Strategy
In general, the better localization accuracy indicates a more reliable radar system in practice.In this case, to attain the higher level of localization accuracy or low probability of intercept (LPI), the radar systems aim to allocate resources in a way that maximizes localization accuracy with the given resource budget [10] or minimizes transmitter power with the constraints of predetermined target detection performance [22].
Nevertheless, in reality, the higher localization accuracy of the radar system is not always better, especially when the detected target is a false one.This phenomenon also appears in the multiple radar system, which can prevent the effectiveness of the spatial diversity gain.The process can be intuitively shown in Figure 3, in which the abbreviations of RTSRC and FTSRC denote the real target SRC and the false target SRC, respectively.As a result, when locating multiple targets, it is necessary to discriminate the authenticity of each target simultaneously in order to improve resource utilization.

False Target Discriminator
In practice, the distance spoofing parameter q d Δ is an effective basis for identifying the real target and the false target in the presence of range deception jamming [14].In theory, with the existence of estimation errors, the estimate result of q d Δ can be seen a random variable, which satisfies that 2 ˆN( , ) , where 2 q d σ Δ denotes the MSE of distance spoofing estimator of the qth target.According to ( 12) and ( 13), the CRLB of 2 q d σ Δ can be computed as: . det q w q q q q q q d q q q c σ σ π Based on the Neyman-Pearson theory, we utilize ˆq d Δ as the statistical discriminator, and the binary hypotheses, 0 q  real target and 1 q  false target, given by [14]: Herein, the term of 2 χ .In this case, the theoretical DP of the active false target can be expressed as: where

Problem Formulation
Due to the effect of the range deception jamming signal, radar should consider both target localization accuracy and the relevant DP.In this case, in the detection process for the qth target, LE q  and FT q  must be taken into account in guiding resource allocation.
Theoretically, the value ranges of LE q  and FT q  are [0, ] +∞ and [0,1] , respectively.Since they have different dimensions, it is difficult to discuss the localization performance and the discrimination performance under the same framework.

False Target Discriminator
In practice, the distance spoofing parameter ∆d q is an effective basis for identifying the real target and the false target in the presence of range deception jamming [14].In theory, with the existence of estimation errors, the estimate result of ∆d q can be seen a random variable, which satisfies that ∆ dq ∼ N(∆d q , σ 2 ∆d q ), where σ 2 ∆d q denotes the MSE of distance spoofing estimator of the qth target.According to ( 12) and ( 13), the CRLB of σ 2 ∆d q can be computed as: Based on the Neyman-Pearson theory, we utilize ∆ dq as the statistical discriminator, and the binary hypotheses, H q 0 real target and H q 1 false target, given by [14]: Herein, the term of χ 2 1 denotes the chi-square distribution with one degree of freedom, and χ 2 1 (∆d 2 q /σ 2 ∆d q ) represents the noncentral chi-square distribution with one degree of freedom.Assume that the expected real target's DP is set as P q RT = P H q 0 H q 0 , and then the identification threshold of the proposed discriminator is η q = F −1 denotes the inverse cumulative distribution function of χ 2  1 .In this case, the theoretical DP of the active false target can be expressed as: where F χ 2 1 (∆d 2 q /σ 2 ∆dq ) (η q ) is the cumulative distribution of χ 2 1 .

Problem Formulation
Due to the effect of the range deception jamming signal, radar should consider both target localization accuracy and the relevant DP.In this case, in the detection process for the qth target, L q LE and P q FT must be taken into account in guiding resource allocation.Theoretically, the value ranges of L q LE and P q FT are [0, +∞] and [0, 1], respectively.Since they have different dimensions, it is difficult to discuss the localization performance and the discrimination performance under the same framework.
To balance the effects of different dimensions of the two performance parameters, we introduce a nondimensionalization mechanism.The specific calculation process is given as follows: Step 1: Let ∀L q LE ∈ [0, L max ], for ∀q ∈ Q. Herein, L max is a given upper bound, which is computed by L max = max q=1,2,...,Q L q LE (1 T M , P t min ) , where the vector P t min = [P t min,1 , P t min,2 , . . ., P t min,M ] T denotes the minimum power required to maintain the essential signal-to-noise ratio (SNR) condition for detecting the target.
Step 2: Let L q LE = L q LE /L max , for ∀q ∈ Q.In this case, the normalized CRLB of the localization estimation satisfies that L q LE ∈ [0, 1], for ∀q ∈ Q.
Step 3: Considering that the localization performance is better when the value of L q LE is smaller, while radar the discrimination performance improves with a larger P q FT .In this case, we reset P q FT is as P q FT = 1 − P q FT , thus P q FT ∈ [0, 1], for ∀q ∈ Q.
After that, the optimization model for localization accuracy and DP can be developed.For the multi-target scenario, the overall performance is considered in this paper, thus the objective function of antenna selection and power allocation can be expressed as: The terms of ς q and q jointly constitute the task assignment result for the qth target, and are defined as: where P FT,max and P FT,min are the preset threshold values of DP in the radar system, i.e., the qth target is judged as a false target when P q FT (u q t , P t ) ≥ P FT,max , and target q is declared true when P q FT (u q t , P t ) ≤ P FT,min .In this case, we can make the radar system simultaneously complete the localization of the real target and the discrimination of the target with a risk of fake, and consequently abandon the false target.Moreover, in (18), U t is given by: where T denotes the results of antenna selection for the mth transmit antenna, for ∀m ∈ M.However, due to the limited data transmission rate and the given bandwidth available for communication [23], the antenna selection problem should be constrained.Moreover, in order to obtain the DP of each target, it is necessary to ensure that each target is illuminated by at least two transmit antennas.In theory, to maintain each transmit antenna can work in a stable working mode and satisfy the power budget, the transmit power results also need to be restricted.According to the aforesaid analysis, the optimization formulation can be expressed as: 1, if P q FT u t q , P t ≤ P FT,min 0, esle q = 1, if P FT,min ≤ P q FT u t q , P t ≤ P FT,max 0, esle ∀q ∈ Q, ∀m ∈ M (22) where the term of L (Q ≤ L ≤ M) represents the maximum of transmit antennas that can be selected to detect one target, and the matrix T denotes the maximum values and the minimum values of transmit power in each transmit antenna, and P total is the total transmit power budget.
The first line constraints in (22) imply that the three constraints on transmit antenna selection, i.e., each target is illuminated by at least one transmitted beam, each transmitted beam covers only one target, and at most L transmit antennas are selected to participate in the detection mission.The second line constraints represent that the transmit power is bounded by a power budget and the antenna selection variable is binary.Moreover, the third line constraints indicate that if u tm = 0, the corresponding transmit power satisfies with P t min,m ≤ P t m ≤ P t max,m , otherwise, P tm = 0, for ∀q ∈ Q, ∀m ∈ M.

Solution Strategy
In mathematics, although the previous multi-objective optimization problem has been transformed into a single-objective optimization problem after the utilization of the nondimensionalization mechanism, (22) is still difficult to solve for the following reasons: (1) Due to the introduction of the task assignment parameters and the hypothesis testing process, the objective function is nonlinear and non-convex; (2) U t is a binary matrix; and (3) U t and P t are coupled and always appear in product form.In this case, (22) is very tricky to solve, and it takes too much time to obtain the global optimal solution by using the exhaustive search algorithm [11], especially in the large-scale antenna systems [6].In order to solve (22), a three-step solver is proposed to find a suboptimal solution.If we suppose that ∀q ∈ Q and ∀m ∈ M, the detailed steps are given as follows: Step 1: Reformulation and relaxation.Since u q t and P t are always coupled as u t m,q P t m in ( 12)-( 15), we introduce an auxiliary variable ξ t m,q = u t m,q P t m .By combining it with the corresponding matrix ξ t = Λ t , we can reformulate (22) as: However, since (23) contains non-linear and non-convex constraints, and it is still difficult to solve.We further relax (23) as: 1, if P FT,min ≤ P q FT u t q , P t ≤ P FT,max 0, esle (24) Similar with [24], ( 24) can be easily solved by the PSO algorithm.For simplicity, the details of the PSO algorithm are omitted and can be seen in [24].In addition, it is worth noting that because the solution of ( 24) is based on the assumption that all targets are illuminated by all transmit antennas, the solution ξ t,opt cannot be directly taken as the result of the original resource allocation problem.In this case, ξ t,opt should be further processed based on the transmit antenna selection constraints in (23).
Step 2: Transmit antenna selection based on the results of (24).We normalize the optimal transmit power of each transmit antenna as ζ t m,q,opt = ξ t m,q,opt /P total , and arrange the normalized transient results ζ t,opt = ζ t m,q,opt ∀m ∈ M, ∀q ∈ Q from the highest to the lowest.Then, sorting out the transmit antenna sequence corresponding to the maximum value of ξ t m,q,opt with the constraints in the second line of (23).The process of the sorting algorithm is shown in Algorithm 1.
Step 3: Power resource optimal allocation for a given budget.Firstly, we optimize the allocation of transmit power based on the transmit antenna selection results obtained by the sorting algorithm.For a fixed optimal antenna selection matrix U t,opt , the rest problem of ( 22) can be expressed as: min F(P t )| U t,opt s.t. 1 T M P t = P total , Hence, similar with (24), by utilizing the PSO algorithm, the optimal power allocation results P t,opt can be achieved through solving (25).Up to now, we have obtained the suboptimal solutions for the joint transmit antenna selection and power allocation in (22).( ) Hence, similar with (24), by utilizing the PSO algorithm, the optimal power allocation results t,opt P can be achieved through solving (25).Up to now, we have obtained the suboptimal solutions for the joint transmit antenna selection and power allocation in (22).
Algorithm 1. Sorting algorithm for the transmit antenna selection.
( U as the solution of (23).

Parameter Designation
In this section, a distributed MIMO radar system with 12 M = transmit antennas and 12 N = receive antennas is chosen for analysis.In the ROI, there are 4 Q = targets widely distributed and the state parameters of each target are shown in Table 1.To demonstrate the performance of the proposed strategy under different antenna deployment, four different antenna topologies are herein taken into consideration.As such, the

Parameter Designation
In this section, a distributed MIMO radar system with M = 12 transmit antennas and N = 12 receive antennas is chosen for analysis.In the ROI, there are Q = 4 targets widely distributed and the state parameters of each target are shown in Table 1.To demonstrate the performance of the proposed strategy under different antenna deployment, four different antenna topologies are herein taken into consideration.As such, the four different geometric relationships between the distributed MIMO radar systems and targets are demonstrated in Figure 4.For the radar system, the effective bandwidth is β m = 1 MHz and the effective time duration is T m = 1 ms for m = 1, 2, . . ., M. Furthermore, the maximum quantity of transmit antennas that can be selected to illuminate one target is set as L = 6.The bounds for transmit power are P t max,m = 0.3P total and P t min,m = 0.05P total for m = 1, 2, . . ., M, and the total transmit power budget P total = 100 kW.The SNR is set as 10 dB at the distance of 20 km, with the baseline measurement error R 0 = diag(50 2 , 0.1 2 .In the PSO algorithm framework, we set the particle number N p = 50, the inertia weight W i = 1, the acceleration factors c 1 = c 2 = 0.8, and the maximum iteration number L max = 50.The upper and lower bounds of the preset threshold of DP are separately set as P FT,max = 0.8 and P FT,min = 0.3.According to (2), it can be seen that the error of a fixed receiver comes from the zero-mean Gaussian white noise in the echo signals.Therefore, in order to possibly eliminate the effect of measurement errors on the validation of the proposed model, the Monte Carlo method is adopted in the numerical experiment in this section.Without loss of generality, the number of Monte Carlo trails is set as N sim = 100.

Effectiveness of the Proposed Solver
The results of transmit antenna selection and power allocation under four different cases are given in Figure 5. Herein, the color in each rectangle represents the ratio of allocated power r power m,q = ξ t m,q /P total , for m = 1, 2, . . ., M, and q = 1, 2, . . ., Q.In particular, the indigo blue color indicates the ratio r power m,q = 0, which means that the mth transmit antenna is not selected for illuminating the qth target.Meanwhile, the crimson color denotes that the ratio achieves the maximum.In addition, the results of task assignment and DP for each target under four different cases are shown in Table 2.
As can be seen from Table 2 that target 3 is assigned as the target to be located in each of the four cases because it has good observation conditions and is not covered by the distance deception signals.In addition, since both target 2 and target 4 transmit distance deceptive jamming signals, one of the targets between target 2 and target 4 in case 1, case 2, and case 3 is defined as a false target, while the other target is assigned to discrimination task.In particular, target 2 and target 4 are defined as false targets in case 4 due to the closer observation distances.As for target 1, since its position is near the center of the radar antennas in case 1 and case 2, better observation conditions are available and the DPs with respect to target 1 are lower.In this case, target 1 is judged to be a true target and the localization task is performed both in case 1 and case 2.Moreover, since target 1 is located far away from the radar antennas in case 3 and case 4, the relevant measurement error increases, resulting in an increase in DP values.As can be seen from Table 2 that target 3 is assigned as the target to be located in each of the four cases because it has good observation conditions and is not covered by the distance deception signals.In addition, since both target 2 and target 4 transmit distance deceptive jamming signals, one of the targets between target 2 and target 4 in case 1, case 2, and case 3 is defined as a false target, while the other target is assigned to discrimination  In order to demonstrate the effectiveness of the proposed algorithm, the following three benchmarks are used for comparison: (1) Multi-start local search [25] antenna selection with uniform power allocation (MSLSA-UP).This algorithm selects active transmit antennas by adopting the multi-start local search algorithm and allocates the transmit power resource to those selected active transmit antennas uniformly.(2) Optimal antenna selection with optimal power allocation for localization task (OA-OP-LT).In this algorithm, we consider the localization task, and the task assignment parameters in (22) are set as ς q = 1 and q = 0, for ∀q ∈ Q.Then, the proposed solving strategy is utilized to solve the modified optimization model, and the optimal transmit antenna selection and power allocation results can be obtained.(3) Optimal antenna selection with optimal power allocation for discrimination task (OA-OP-DT).This algorithm focuses exclusively on discrimination task, and we set ς q = 0 and q = 1, for ∀q ∈ Q.Similar with the OA-OP-LT algorithm, the optimization model is then solved by the proposed solving strategy.For the error analysis of the proposed discriminator, it can be seen from Table 1 that target 2 and target 4 are preset as false targets in the simulation.Since the existence of deception distance in target 2 and target 4, the fixed radar system tends to obtain higher DPs for the two targets in the same observation condition.From the simulation results in Tab 3, by utilizing the resource optimal allocation scheme, both target 2 and target 4 can be accurately identified as false targets in case 4. In addition, one of the two targets can be accurately identified under case 1 to case 3, while a higher DP is prompted for the other false target and the subsequent discrimination task is assigned.Thus, the correctness of the proposed discriminator can be proven from the identification results in a global perspective.

Conclusions and Future Work
In this paper, to deal with deception jamming in the distributed MIMO radar, we formulate an optimization model of transmit antenna selection and power allocation for joint multi-target localization and discrimination.By utilizing the relaxation technique and the PSO algorithm, a three-step solving algorithm is developed for this optimization problem.Numerical simulations demonstrate that the proposed strategy under the joint localization and discrimination task conditions can improve comprehensive performance by more than 30% compared with single task conditions in four different radar layouts cases.In addition, based on the proposed resource optimal algorithm, the composite indicators of JLADA can decrease more than 70% compared with the uniform allocation scheme.The main innovation of the proposed algorithm is the establishment of a unified optimization model of joint multi-target localization and discrimination under deception jamming.However, by artificially transforming the multi-objective optimization problem into the single-objective optimization problem, the model error in this paper is inevitable.Moreover, although the proposed solution strategy is effective, the global optimal solution still cannot be obtained due to the relaxation processing.
In this case, the future research direction will be to directly solve the initial multiobjective optimization problem, and discuss and use more efficient solving algorithms.Moreover, in the future work, we will further add scenarios, including target RCS timevarying, angle scintillation noise, and distance spoofing noise time-varying to verify the proposed algorithm; more quantitative analysis links will also be added.

Figure 1 .
Figure 1.Diagram of the SRCs with target echoes and jamming signals.

Figure 1 .
Figure 1.Diagram of the SRCs with target echoes and jamming signals.

Figure 1 .
Figure 1.Diagram of the SRCs with target echoes and jamming signals.

Figure 2 .
Figure 2. Intuitive process of multi-target detection under self-defense range deception jamming.

Figure 3 .
Figure 3. Target detection mechanism under range deception jamming.

1 χ
denotes the chi-square distribution with one degree of freechi-square distribution with one degree of freedom.Assume that the expected real target's DP is set as RT cumulative distribution function of2   1

Figure 3 .
Figure 3. Target detection mechanism under range deception jamming.

Algorithm 1 .
Sorting algorithm for the transmit antenna selection.
upper and lower bounds of the preset threshold of DP are separately set as FT,max 0.8 =  and FT,min 0.3 = .According to (2), it can be seen that the error of a fixed receiver comes from the zero-mean Gaussian white noise in the echo signals.Therefore, in order to possibly eliminate the effect of measurement errors on the validation of the proposed model, the Monte Carlo method is adopted in the numerical experiment in this section.Without loss of generality, the number of Monte Carlo trails is set as sim 100 N = .

Figure 4 .
Figure 4. Four different multiple radar layouts with multiple target locations.

Figure 4 .
Figure 4. Four different multiple radar layouts with multiple target locations.

Figure 5 .
Figure 5. Transmit antenna selection and power allocation results under four cases.

Figure 5 .
Figure 5. Transmit antenna selection and power allocation results under four cases.

m 2 and
MSE 3 = 103.5 m 2 , respectively.In conclusion, from the perspective of localization errors, the proposed model can effectively improve the target localization accuracy by allocation resources compared to the measurement errors.

Table 1 .
The target parameters of each target.

Table 1 .
The target parameters of each target.

Table 2 .
The results of task assignment and DP for each target under four cases.