2.1. Problem of UAV Detection
First consider a basic UAV detection problem, where a swarm of
n UAVs are used to search for one or multiple targets in a large region. The whole search region is divided into
m sub-regions based on topographic feature, such that different sub-regions have different topographic features, while there is no significant topographic change within one sub-region. Each
i-th sub-region is assigned with a target existence probability
(
), assuming that the holder of UAVs has a prior probability distribution function that describes the initial belief of the target location. Information about the search region and targets are typically collected by means such as satellite remote sensing and early manual exploration, and target existence probabilities can be estimated using empirical methods as proposed in [
4].
According to different flight altitudes and other operational settings, a UAV can use
K different search modes that affect its detection ability. For example, when a UAV flies at a low height and uses a high-resolution camera, the detection ability is high, but the detection time period is relatively long; when it flies at a high height and uses a wide-range sensing equipment, the detection ability is relatively low, but the time period is short, as illustrated in
Figure 1. We assume that a UAV can use only one mode to search in one sub-region; when it uses the
k-th mode to search in the
i-th sub-region, the required time period is
, and the posterior probability of detection is
subject to the precondition of target existence. We are also given the time period
for a UAV to fly from the
i-th sub-region with the
k-th search mode to the
-th sub-region with the
-th search mode (
). Specifically, we use
to denote the time period for a UAV to fly from its starting position to the
i-th sub-region with the
k-th search mode.
The UAV detection problem needs to plan a search path for each j-th UAV (), which consists of two parts: the first is the sequence of sub-regions, where denotes the number of sub-regions to be searched by the j-th UAV; the second is the associated search modes , where denotes the search mode of the j-th UAV in the sub-region ().
Each
j-th UAV takes off at time
and arrives at its first sub-region
at time:
and it finishes the search on
at time:
By analogy, the time at which the
j-th UAV arrives and finishes the search at its
i-th sub-region can be iteratively calculated as follows (
):
Let
T be the predefined time limit of the operation. Whenever the
j-th UAV completes the search of the
ith sub-region in its sequence, it obtains a reward calculated based on the completion time and detection probability in the sub-region as follows:
where the item
encourages to detect the targets as early as possible.
The objective of the problem is to maximize the total reward (i.e., the total time-weighted detection probability) of the UAVs, and the problem is formulated as follows:
where the denominator
T in (
5) is used to normalize the objective value in the range of [0, 1].
2.2. Adversarial Problem of False Target Jamming
The opponent wants to jam UAV detection by placing a set of Q false targets in the search region. Each sub-region can have at most one false target, and false targets can only be placed in sub-regions without real targets. The placement of false targets can have one or more of the following effects on UAV detection:
It will interfere with the observation of the prior probability in the sub-region by means such as satellite remote sensing and early manual exploration.
For a sub-region without a real target, placing a false target will generate a false posterior probability detected by UAV.
For a sub-region with a real target, placing false targets in its neighboring sub-regions will affect (typically decrease) the true posterior probability.
Let
be a solution of false target placement, where
denotes that a false target is placed in the
i-th sub-region and
; otherwise, we use
and
to denote the updated prior probability and posterior probability after placing false targets according to
, respectively. Initially, the UAVs do not have knowledge of the false targets; therefore, from the viewpoint of the holder of UAVs, the objective function (
5) of the UAV detection problem will be updated as:
Ideally, suppose that the holder of UAVs can find the optimal solution
that maximizes the updated objective function (
8) while satisfying all constraints of the original UAV detection problem. From the viewpoint of the opponent, the adversarial problem of false target jamming aims to find a false target placement solution
, such that, among all possible false target placement solutions, the corresponding UAV detection solution
will result in the minimum value of the original objective function (
5). Consequently, the adversarial problem can be formulated as follows:
However, in many practical situations, the holder of UAVs cannot guarantee finding the optimal solution: for a difficult detection problem instance, it is more possible for the holder to obtain one or more near-optimal solutions. Therefore, we consider a set
of possible solutions to the updated UAV detection problem subject to a given false target placement solution
, where each detection solution
is associated a probability
of being adopted by the holder of UAVs. In this way, the objective function (
9) of the false target jamming problem will be updated as: