Multi-UAV Cooperative Path Planning with Monitoring Privacy Preservation

Chen, Yang; Shu, Yifei; Hu, Mian; Zhao, Xingang

doi:10.3390/app122312111

Open AccessArticle

Multi-UAV Cooperative Path Planning with Monitoring Privacy Preservation

by

Yang Chen

^1,2,*

,

Yifei Shu

¹,

Mian Hu

¹ and

Xingang Zhao

²

¹

Engineering Research Center for Metallurgical Automation and Measurement Technology of Ministry of Education, Wuhan University of Science and Technology, Wuhan 430081, China

²

State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(23), 12111; https://doi.org/10.3390/app122312111

Submission received: 7 November 2022 / Revised: 22 November 2022 / Accepted: 23 November 2022 / Published: 26 November 2022

(This article belongs to the Special Issue Fault Detection and State Estimation in Automatic Control)

Download

Browse Figures

Review Reports Versions Notes

Abstract

UAVs have shown great potential application in persistent monitoring, but still have problems such as difficulty in ensuring monitoring frequency and easy leakage of monitoring path information. Therefore, under the premise of covering all monitoring targets by UAVs, it is necessary to improve the monitoring frequency of the target and the privacy protection of the monitoring intention as much as possible. In response to the above problems, this research proposes monitoring overdue time to evaluate the monitoring frequency and monitoring period entropy in order to evaluate the ability to ensure monitoring privacy protection. It then establishes a multi-UAV cooperative persistent monitoring path planning model. In addition, the multi-group ant colony optimization algorithm, called overdue-aware multiple ant colony optimization (OMACO), is improved based on the monitoring overdue time. Finally, an optimal flight path for multi-UAV monitoring with high monitoring frequency and strong privacy preservation of monitoring intention is obtained. The simulation results show that the method proposed in this paper can effectively improve the monitoring frequency of each monitoring node and the privacy preservation of the UAV monitoring path and has great significance for enhancing security monitoring and preventing intrusion.

Keywords:

persistent monitoring; privacy protection; path planning; monitoring frequency; overdue time

1. Introduction

For the purposes of public safety, environmental protection, scientific research, etc., people need to observe, measure and collect information in certain areas over a long time, and then make decisions based on the results of these observations, measurements and collection. This is generally called a persistent monitoring problem [1,2,3]. Monitoring in person or by hand is usually constrained by weather, geography, working hours and labor costs, and intelligent equipment can greatly overcome the above deficiencies of human based monitoring. Unmanned aerial vehicles (UAV) are examples of one of these typical intelligent monitoring devices. Because they are free of human intervention and offer stable flight, a wide range of motion, and low cost, UAVs are often used to perform persistent monitoring tasks [4], target detection and tracking [5], and border patrols [6]. This research mainly studies the UAV path planning problem when they are used in persistent monitoring.

With the emergence of various complex environments and complex tasks, a single UAV will find it hard to meet the requirements of increasingly complex inspection operations. Consequently, there has been extensive research on multi-UAV cooperation. Compared with single-UAV operation, multi-UAV cooperation has demonstrated greater advantages. For example, multi-UAV cooperation [7,8] can obtain more comprehensive and wide information and can realize multi-angle monitoring of the target area. However, such problems as cooperation strategy, inconsistent monitoring frequency, unsynchronized monitoring information, and unsafe monitoring strategies still exists for multi-UAV cooperation. The task decisions of multi-UAV persistent monitoring have become popular issues in the application field of UAVs.

The multi-UAV persistent monitoring problem can be divided into two levels. One level is the monitoring frequency constraint, and the other is the persistent monitoring security, i.e., monitoring privacy preservation. The above two levels correspond to the two so-called modes of UAV persistent monitoring. One is the regular monitoring mode, that is, the route planned for the UAV to minimize the time delay between each adjacent visit of the task nodes and to improve their monitor frequency as much as possible. The other is the adversarial monitoring mode, which is to plan uncertain, unpredictable and non-periodic monitoring paths for UAVs in order to prevent any intelligent intruders from detecting the monitoring regularity [9]. If the monitoring frequency constraint is considered as the only criterion, the monitoring path is usually a certain periodic path. Once an intelligent intrusion appears in the monitoring environment, the privacy of the UAV monitoring intention cannot be protected, and the monitoring task is easily destroyed by intelligent intruders. On the other hand, when only the security of persistent monitoring is considered, it may be difficult to satisfy the monitoring frequency requirements of each node due to excessive consideration of path privacy security. Therefore, it is of great theoretical significance and practical value to study the joint optimization problem of monitoring frequency and privacy protection.

Portugal [10] reviewed the multi-robot cooperative patrol algorithms that has been studied in recent years and pointed out that a distributed, non-deterministic and cooperative strategy represents the future trend. Alamdari [11] studied the persistent monitoring problem of a single robot. The optimization goal is to minimize the revisit duration of the given monitoring tasks. Two approximate algorithms with complexity O(log ρ_G) and O(log n) were proposed, respectively. Elmaliach [12] studied the patrol problem in a closed area and proposed the patrol frequency optimization criterion for the first time, and each point in the area should be repeatedly visited by multiple robots. Smith [3,13] studied persistent monitoring problems in discrete and continuous environments, and established two optimization models, aiming to enhance the monitoring frequency. Wang [14] studied the persistent monitoring problem of multiple UAVs and established a mathematic model based on the optimization of the maximum environmental recognition accuracy, which was then solved by a heuristic algorithm. Kalyanam [15] studied a similar problem, i.e., UAV data collection, allowing UAVs to visit some targeted location with high priority more than once in a single cycle. An optimization by maximizing the average period reward was formulated, and the precise solution combining dynamic programming and mixed integer linear programming was achieved. Subsequently, considering the scalability of the algorithm and improving its efficiency, an approximate solution was proposed for the nodes with specific visiting times [16]. Von [17] also discussed the algorithm scalability where a genetic algorithm was used to obtain the approximate solution that showed better scalability than a precise method through experiments. Scherer [18] studied a multi-UAV cooperative path planning problem with monitoring data transport for the purpose of minimizing the time delay between data being captured by UAVs and the arrival of the data at the base station. Hari [19] considered the monitoring frequency constraint and set the fixed horizon to a given number, k, which assumes that the UAV can only access k nodes in each cycle. However, once there exists an intelligent intrusion in the monitoring environment, the monitoring privacy will have already been destroyed. The above persistent monitoring studies considered monitoring frequency constraints, but only focused on the monitoring performance or coverage rate of the given area [20] and did not consider monitoring security issues in an adversarial environment.

With regard to the concern for monitoring security, one also needs to consider how easy the monitoring strategy can be acquired by intelligent intruders. The privacy of the persistent monitoring process is of great concern, especially in some applications where intelligent adversaries or intruders might occur. At present, there are at least two ideas in the field of monitoring security. One is to improve the existing deterministic strategy for the path planning problem and use random algorithms instead, such as Markov chains, or random walk theory. The other is to establish a game model and a balance scheme between the competing players. Agmon [21] proposed a Markov strategy, which is a polynomial-time algorithm, and their research is motivated by reducing the probability of being invaded at a weak task position as much as possible. Entropy has also been introduced in path planning [22]. For example, George [23] and Duan [24] studied the entropy rate maximization problem based on Markov chains. Stackelberg game theory was used by Basilico [25] to formulate an optimal solution to the path planning problem for a single robot on a security patrol, while assuming only one intruder. Security game theory has been proposed for the study of the persistent monitoring path planning problem in ecological protection [26]. The main motivation for their study on patrol and monitoring strategies is to obtain an unpredictable trajectory, which was finally obtained through maximum entropy.

With the aforementioned observations, some studies on persistent monitoring path planning only concern the complete coverage rate, and some studies consider the monitoring frequency, but the final paths often fall in a fixed monitoring period which makes the monitoring regularity completely exposed to intrusions. The other study considers monitoring security, but they still do not consider monitoring frequency constraints. To bridge the gap between the monitoring frequency and monitoring security, this study will comprehensively consider both sides simultaneously, that is, improving monitoring path privacy while increasing monitoring frequency. The main contributions of this paper are as follows:

Considering monitoring frequency and path privacy, this study shows how to formulate a multi-UAV cooperative persistent monitoring path planning problem with multiple constraints based on the monitoring of overdue time and of monitoring period entropy.
A multi-group ant colony optimization algorithm, called overdue-aware multiple ant colony optimization (OMACO), is proposed to obtain an optimal flight path for UAV cooperation. The heuristic function and pheromone update method are improved based on the monitoring delay time and overdue time. In addition, a target exclusive mechanism and greedy strategy are proposed for ant node selection.
Simulation experiments are carried out in complete and incomplete environments to verify the effectiveness and advantages of the designed algorithm. The simulation results show that the algorithm proposed in this paper can effectively improve both the monitoring frequency and the monitoring privacy protection.

2. Multi-UAV Cooperative Persistent Monitoring Path Planning Model

2.1. Problem Description

As the monitoring environment changes and the node quantity increases, computer resources onboard are often insufficient when performing persistent monitoring tasks in the stand-alone operation mode. As a result, the waiting time of nodes increase, causing some nodes to monitor overdue. Compared with a single drone, a drone group performing persistent monitoring tasks will face huge challenges. For example, each node will maintain a parameter that represents how long it has been waiting since its last monitoring. Once any drone visits a node position and completes that monitoring, the waiting-time parameter maintained by this node will be cleared— demonstrating a rigid nonlinearity. Other difficulties include collision avoidance between multiple drones, information synchronization, and collaborative work between drones.

This study focuses only on the multi-UAV cooperative path planning problem of persistent monitoring. A graph model is used to describe the distribution of the candidate nodes, i.e., G = (V, E), where V = {1, 2, …, N} represents the nodes set, N represents the total number of nodes, and

E = {e_{i j}, \forall i, j \in V}

represents the edges set of G. The UAV set is

M_{U A V} = {1, 2, \dots, M}

, where M is the total number in the given UAV group, M << N. Here are some assumptions about the background of this study.

(1) For safety and efficiency purposes, the same nodes cannot exist for multiple drones at the same time. This means that different UAV are permitted to monitor the same node on different time.

(2) Without loss of generality, all UAVs fly with a constant speed, v.

(3) After a UAV accesses a node, the waiting time of the node is cleared, and all other UAVs need to be notified to ensure information synchronization.

This research tries to find the optimal flight path of a UAV group, so that the path meets the requirements of high monitoring frequency and strong monitoring path privacy.

2.2. Discretization of the Graph

Persistent monitoring needs to consider UAV movement synchronization. In order to solve the problem, a discrete approximation operation is introduced on the graph G. Several virtual nodes are inserted in an approximately uniform way to the edges of G leading to a discretized graph that includes many more edges of equal intervals, denoted by δ. This operation encourages good behavior in which any UAV will certainly move forward from its current node position to its neighbor node in G instead of staying between nodes at time step k. This is called UAV movement synchronization. Consequently, nodes can be divided into two categories, one is the task node set, V, which requires monitoring and the other is the virtual node set, U, which is generated during discrete approximation operation and does not to be monitored. The complete node set, called a generalized node set, is denoted as

V^{'} = V \cup U = {1, 2, \dots, N + | U |}

. It should be emphasized that all virtual nodes in U are not real monitoring tasks, so they do not need to record their monitoring delays. The final adjacency matrix of G is

A \in ℝ^{(N + | U |) \times (N + | U |)}

, where any element a_ij is binary. a_ij = 1 indicates that node i and j are adjacent to each other, otherwise a_ij = 0.

2.3. Multi-UAV Collaborative Monitoring Constraints

Let K denote the maximum length of the monitoring horizon. Let the binary variable matrix

Y^{m} \in ℝ^{K \times (N + | U |)}

denote whether a node is monitored by UAV m,

m \in M_{U A V}

. For

\forall i \in V^{'}

, the element

y_{k, i}^{m} = 1

represents that the node i is monitored by UAV m at time k, and

y_{k, i}^{m} = 0

represents that the node i is not monitored by UAV m at time k.

Y^{m}

represents the monitoring of all nodes by UAV m in the entire monitoring time horizon.

Let the binary variable matrix

X \in ℝ^{K \times (N + | U |)}

represent whether a node is monitored by any UAV in the group, where the element

x_{k, i} = 1

represents that there is at least one UAV monitoring node i at time k, and the element

x_{k, i} = 0

represents that node i is not monitored by any UAV at time k. The matrix X stands for the monitored situation of all nodes in the monitoring time horizon, and can be obtained by combining all Y^m, m = 1, 2, …, M. The relationship between X and Y^m is

X = Y^{1} \cup Y^{2} \dots \cup Y^{M}

. The constraints are as follows:

x_{k, i} = {\begin{cases} 0, if \sum_{m = 1}^{M} y_{k . i}^{m} = 0, \\ 1, otherwise, i . e ., \sum_{m = 1}^{M} y_{k . i}^{m} = 1 \end{cases}

(1)

where

i \in V, k \in {1, 2, \dots, K}

\sum_{m = 1}^{M} y_{k . i}^{m} \leq 1, i \in V^{'}, k \in {1, 2, \dots, K}

(2)

\sum_{k = 1}^{K} x_{k, i} \geq 1, i \in V

(3)

\sum_{i = 1}^{N + | U |} x_{k, i} = M, k \in {1, 2, \dots, K}

(4)

Equation (2) indicates that at any time k, a node is monitored by, at most, one UAV, that is, multiple UAVs cannot appear at the same location at the same time. Equation (3) indicates that within the monitoring horizon K, each node must be visited at least once. Equation (4) indicates that a UAV only has one position at a certain time k.

2.4. UAV Motion Constraints

Assuming that the initial moment k=1, all the UAVs need to start from the same given initial node

S_{m} \in V

. The following constraints are satisfied:

y_{1, S_{m}}^{m} = 1, m \in M_{U A V}

(5)

At the same time, the UAV m cannot visit the same node in two adjacent time steps:

y_{k, i}^{m} + y_{k + 1, i}^{m} \leq 1, i \in V, k \in {1, 2, \dots, K - 1}, m \in M_{U A V}

(6)

2.5. The Waiting Time Constraint of the Task Node

Let

F \in ℝ^{(K - 1) \times N}

represent the whole task nodes’ waiting time, in which the element is

f_{k, i} \geq 0

. In the interval between time step k-1 to k, all UAVs select a candidate node from their individual neighbor according to a certain movement strategy. After that, the waiting time of almost all nodes increases by one unit time except the arrived node i which is exactly a task node. That is,

i \in V

. The waiting time corresponding to the arrived node i will be cleared. Therefore,

f_{k, i} = {\begin{cases} 0, & i \in V, k = 1 \\ (1 - x_{k, i}) (f_{k - 1, i} + c), & i \in V, k \in {2, 3, \dots, K} \end{cases}

(7)

where c is a unit time constant, which represents the time consumed by the UAV when passing through each edge interval. This specific value is related to the accuracy of the discretization operation.

2.6. Min–Max Optimization for Multi-UAV Cooperative Monitoring

2.6.1. UAVs Monitoring Overdue Time Evaluation

Let the maximum monitoring interval of a task node i between two adjacent monitoring events be the expected period of the node, denoted by T_i,

i \in V

. Ideally, for any time k, the waiting time of node i should not exceed its expected period. That is

0 \leq f_{k, i} \leq T_{i}, i \in V, k \in {1, 2, \dots, K}

(8)

However, in practical applications, since the number of UAVs is far less than the quantity of the task nodes, it is inevitable that some nodes’ monitoring will be overdue. The overdue time can be expressed as

f_{k - 1, i} + c - T_{i}

. Define the real monitoring period of the task node as

P \in ℝ^{K \times N}

:

p_{k, i} = {\begin{cases} 0, & i \in V, k = 1 \\ x_{k, i} (f_{k - 1, i} + c), & i \in V, k \in {2, 3, \dots, K} \end{cases}

(9)

The above equation indicates that when the UAV arrives at node i at time step k, i.e.,

x_{k, i} = 1

, the real monitoring period of this node is

f_{k - 1, i} + c

. Otherwise, p_k,i have no definition and it will be assigned to zero. Therefore, the maximum monitoring period of the task node i in the entire monitoring horizon is:

\max_{k \in {1, 2, \dots, K}} p_{k, i}

(10)

Then, the maximum overdue time of the task node i caused by exceeding its expected period T_i can be expressed as:

\max {0, \max_{k \in {1, 2, \dots, K}} (p_{k, i} - T_{i})}

(11)

The following objective, J₁, is proposed for optimization by minimizing the normalized maximum overdue time of all task nodes.

\min_{Y, F} J_{1} = \max_{i \in V} (\frac{1}{T_{i}} \max {0, \max_{k \in {1, 2, \dots, K}} (p_{k, i} - T_{i})})

(12)

2.6.2. UAVs Monitoring Path Privacy Criterion

As long as any UAV accesses a task node, its waiting time will be cleared. Therefore, it is necessary to evaluate the privacy of the monitoring path based on the actual visiting period of all task nodes. Since the uncertainty of the monitoring period indirectly reflects the monitoring privacy, this study proposes the concept of monitoring period entropy (MPE) which refers to the uncertainty when the UAV returns to the task node for monitoring again. The larger the MPE, the higher the randomness of the monitoring period. Define a vector

{\tilde{p}}_{i} = {p_{k, i} | p_{k, i} > 0, k = 1, 2, \dots, K}

to represent the vector composed of all the monitoring cycles of task node i in the entire monitoring horizon. The length of the vector,

{\tilde{p}}_{i}

, is

l_{{\tilde{p}}_{i}} = \sum_{k = 1}^{K} x_{k, i}

. Define the monitoring period entropy of node i as:

H ({\tilde{p}}_{i}) = - \sum_{j = 1}^{l_{{\tilde{p}}_{i}}} P ({\tilde{p}}_{i} (j)) \log P ({\tilde{p}}_{i} (j))

(13)

where

P ({\tilde{p}}_{i} (j))

is the probability that the jth element in vector

{\tilde{p}}_{i}

. One should note that

H ({\tilde{p}}_{i})

is always positive. The minimum monitoring period entropy among all task nodes is:

\min_{i \in V} H ({\tilde{p}}_{i})

(14)

Therefore, in order to improve the randomness of the monitoring period, the optimization objective is designed to maximize the entropy of the smallest monitoring period among all task nodes, namely

\max_{Y, F} (\min_{i \in V} H ({\tilde{p}}_{i}))

. This criterion is also equivalent to the reciprocal of the minimum monitoring period entropy (because

H ({\tilde{p}}_{i})

is a positive number), so the following optimization objectives can be designed:

\min_{Y, F} J_{2} = \frac{1}{\min_{i \in V} H ({\tilde{p}}_{i})}

(15)

The dimension of the multi-UAV path solution Y is K × (N + |U|), and the algorithm time complexity of the calculation for the monitoring of overdue time and the evaluation of path privacy is O(n²).

2.6.3. Multi-UAV Persistent Monitoring Path Planning Model

The optimization problem of multi-UAV cooperative persistent monitoring path planning is expressed as follows:

\begin{array}{l} \min_{Y, F} & J = w J_{1} + (1 - w) J_{2} \\ s . t . & (1) - (8) \end{array}

(16)

where

w \in (0, 1)

represents the weight coefficient, which will balance between the performance of overdue time and path privacy.

3. Improved Multi-Group Ant Colony Optimization Algorithms Based on Monitoring Overdue Time

From the perspective of reducing monitoring overdue time and improving path privacy, this section designs an improved ant colony optimization (ACO) algorithm based on the monitoring of overdue time, called an overdue-aware multiple ant colony optimization algorithm. Major improvements include the aspects:

A greedy strategy for node selection is proposed, in which the ant colony heuristic function is modified using the expected period of the task nodes.
Ant colony pheromone is updated based on monitoring overdue time and monitoring period entropy.
A target exclusion mechanism is proposed to improve the utilization rate of multi-UAV in cooperative monitoring.

3.1. Heuristic Function Based on Monitoring Expectation Period

In order to increase the monitoring frequency and reduce the visiting delay of each task node, the improved heuristic function, η_ij, is as follows:

η_{i j} = \frac{1}{T_{j} d_{i j}}

(17)

where d_ij represents the distance between node i and j. Comparing with the traditional heuristic function in ACO, Equation (17) takes into account the expected period (T_j) of the neighbor task nodes, which is helpful in reducing its monitoring overdue time.

3.2. Target Exclusion Mechanism

When multiple UAVs perform tasks at the same time and do not consider the path privacy issue, multiple UAVs will be evenly distributed on the minimum Hamiltonian cycle of the graph [25]. The ants select generalized nodes (task nodes or virtual nodes are both possible) depending on stochastic probability. Therefore, there is a slim chance that the UAV follows its previous UAV when selecting its next node, which results in some nodes being monitored frequently while other task nodes are missed for a long time. Consequently, monitoring overdue events happen. In order to prevent UAVs from following synchronically, this research proposes a target exclusion mechanism, as shown in Figure 1.

As an example, when UAV1 in Figure 1 selects node n₂ as the candidate task node, UAV1 exclusively occupies node n₂ and the node n₂ will be locked. However, UAV2, which is currently located at node n₄, cannot select the locked node as its candidate. Only one of n₁ and n₃ will be chosen as the UAV1’s next waypoint. The target exclusive mechanism can fundamentally solve the UAV following problem.

3.3. Greedy Strategy for Node Selection

This section proposes a greedy strategy, which can help UAV select the optimal node among its neighbors. The strategy is motivated by the idea that the greater the overdue time of the ant’s adjacent node j is, the greater the probability that node j will be selected by the ants in the next step. First calculate the overdue time of all adjacent nodes. Since some adjacent nodes may not be overdue, the calculated overdue time by

f_{k - 1, j} + c - T_{j}

is possibly negative and inconvenient to compute the transition probability. Therefore, this research constructs a pseudo-overdue time, R_j(t), which is guaranteed to be positive.

R_{j} (t) = f_{k - 1, j} + c - T_{j} + T_{0}, \forall k \in P

(18)

where j represents the adjacent node of the current node. T₀ represents the upper bound of the expected period of all monitoring nodes. Usually, it can be calculated by

T_{0} = \max_{i \in V} {T_{i}}

offline.

The transition probability is not only related to the overdue time of its neighbor node, but also related to the adjacency constraints, exclusive flags, and pheromone distribution of the ants’ current adjacent nodes. The improved ant transition probability

p_{i j}^{z}

is as follows:

p_{i j}^{z} = {\begin{cases} \frac{τ_{i j}^{α} (t) η_{i j}^{β} (t) R_{j} (t) a_{i j} (1 - o_{j})}{\sum_{s \in a l l o w_{z}} τ_{i s}^{α} (t) η_{i s}^{β} (t) R_{s} (t) a_{i s} (1 - o_{s})}, & j \in a l l o w_{z} \\ 0, & other \end{cases}

(19)

where i is the current node of the ant whose adjacent node is denoted by j. α and β stand for the importance factor of the pheromone and the heuristic function, respectively, τ_ij(t) represents the pheromone concentration on the edge e_ij after the optimization of each ant at the t-th iteration. a_ij stands for the adjacency relationship between node i and j, o_j represents the exclusive state of the node j,

z \in {1, 2, 3, \dots, Z}

represents the ant number, z is the ant quantity, and allow_z represents the set of nodes that the ant z can visit next time. After the transition probability of the ants is calculated, the roulette method is used to select the next node according to the maximum probability.

3.4. Pheromone Update Based on Monitoring Overdue Time and Monitoring Period Entropy

The traditional ant colony algorithm updates the pheromone mainly based on the path length that ants travelled. In order to promote the evolution of the ant colony to the direction with the smallest cost function value, this study updates the pheromone according to the optimization objective (16).

τ_{i j} (t + 1) = (1 - ρ) τ_{i j} (t) + \sum_{z = 1}^{Z} Δ τ_{i j}^{z}

(20)

Δ τ_{i j}^{z} = {\begin{cases} \frac{Q}{J_{z}}, ant z from node i to node j \\ 0, other \end{cases}

(21)

where

ρ \in (0, 1)

represents the pheromone volatile factor.

Δ τ_{i j}^{z}

represents the pheromone concentration released by the ant

z

on the edge between node i and j in the current iteration. Q is a constant, representing the total pheromone amount released by the ants at one time, and J_z represents the path cost of the ant

z

calculated according to (16).

To sum up, the scheme of the proposed OMACO algorithm is shown in Figure 2. The steps are as follows in Algorithm 1:

Algorithm 1: Overdue-aware multiple ant colony optimization (OMACO).

      Step 1: Initialization (node quantity N, adjacency matrix A, ant quantity Z, maximum iterations N_c, pheromone importance factor α, heuristic function importance factor β, pheromone volatility factor ρ, pheromone quantity Q, and maximum monitoring horizon K, weight parameter w).
      Step 2: Discretization of the graph.
      Step 3: Calculate the target exclusion set O₀.
      Step 4: Calculate the ant transition probability

p_{i j}^{z}

according to (19).
Step 5: Select the next node according to the roulette method, and update the node waiting time

f_{k, i}

.
Step 6: Update the ant’s taboo table.
Step 7: Update the target exclusive flag

o_{i}

.
      Step 8: Calculate the monitoring overdue time and monitoring period entropy according to (11) and (13).
      Step 9: Update pheromone according to (20) and (21).
      Step 10: Determine whether the iteration reaches the maximum iterations. If so, the procedure ends; otherwise, go to Step 3.

4. Simulation Experiments and Discussions

In this section, simulation experiments are carried out for multi-UAV persistent monitoring tasks in complete and incomplete environments to evaluate the path planning model and solution algorithm proposed in this study.

4.1. Algorithm Feasibility Analysis

Assume that three UAVs perform tasks in a complete environment containing 10 task nodes with known locations to be monitored, which are labeled as numbers in Figure 3. Task nodes and virtual nodes are illustrated by red and green dots, respectively. The blue solid lines represent adjacency relationships within the graph. All the simulation parameters are listed in Table 1. The expected periods of the task nodes are shown in Table 2. All simulation examples in this paper are implemented on a computer with Matlab R2020a installed and the system configuration is Intel Core i7-9750H, 2.59 GHz, 16 GB RAM.

Figure 4 shows the persistent monitoring flight path of the three UAVs obtained by the proposed method in this paper, where the x-axis represents the time, and the y-axis represents the node that the UAV arrived at the corresponding time step. The solid line represents the UAV flight path consisting of passing nodes.

Figure 5 shows the expected period and the actual monitoring period of the task nodes. It can be seen that the actual monitoring period of all task nodes is less than the expected period, which indicates that the monitoring process of the UAV meets the monitoring frequency requirements of all nodes. Figure 5 also shows that each node has been visited multiple times in the monitoring horizon, obtaining multiple actual monitoring periods which are all lower than their expected periods, i.e., meeting the monitoring frequency requirements.

More importantly, the actual monitoring period of each node is different, that is, the waiting time when each node is monitored has a good random distribution. The simulation shows that the method proposed in this paper can cover all monitoring nodes, meet the monitoring frequency requirements, and also improve the privacy protection of the monitoring path.

4.2. Comparative Analysis with Traditional ACO

In order to evaluate the performance of the proposed OMACO algorithm, this section compares the optimization ability of OMACO and the traditional ACO. Figure 6 shows the monitoring path solved by the traditional ACO with the same parameters to Section 3.1. Different from Figure 4, the path sequences (node 6 → 7 → 9) repeat up to eight times in Figure 6, and the UAV3 trajectory (blue) between steps 450 and 500 can be seen following by UAV1 (red). This leads to the same monitoring period and is very harmful to the monitoring privacy protection. However, the UAV path in Figure 4 has no obvious repetitive path or circular trajectory, and there is no UAV following the other. Therefore, compared with the traditional ACO, the proposed OMACO algorithm can obtain better privacy protection performance.

Figure 7 shows the actual monitoring period obtained by using the traditional ACO. There exist many nodes that have been monitored overdue many times, resulting in the waiting time of the task node frequently exceeding the expected period. Therefore, the proposed OMACO algorithm is superior to the traditional ACO in improving the monitoring frequency.

Table 3 shows a detailed comparison between OMACO and ACO on each task node monitoring data. Based on the proposed OMACO algorithm, most task nodes have been visited more times than that of ACO. Therefore, the average visit number is higher than the traditional ACO. Correspondingly, the average actual period will decrease and be less than ACO. Also, it is found that the ACO algorithm is not appropriate for our problem because the node No.10 exceeds its upper bound.

Figure 8 shows the iterative curves of the objective functions obtained by OMACO and ACO, and the related data are shown in Table 4. In the first iteration, the algorithm designed in this research has a lower value of objective function than ACO. This is because the waiting time of the task node has already been considered by OMACO when calculating the transition probability based on the greedy strategy. In fact, the node selection strategy has been optimized before the initial ant path. The traditional ACO only relies on the heuristic function and pheromone to decide the node transition probability. Consequently, the pheromone is equal on all path segments in the initial iteration which leads to a randomly path generated.

The OMACO algorithm gets the optimal solution of 0.433 in the 4th iteration while the traditional ACO only obtains the optimal solution of 0.814 in the 28th iteration. Since the OMACO algorithm introduces the overdue time for optimization, it is significantly better than the ACO in terms of reducing the monitoring overdue time and improving the monitoring path privacy.

4.3. Algorithm Scalability Analysis

This section demonstrates the simulation experiments with three UAVs performing persistent monitoring in an incomplete environment which contains 15 task nodes. Other parameter settings are the same as in Section 3.1. Figure 9 shows the environment topology where 15 task nodes connected incompletely will be persistently monitored by the UAVs. The expected period of 15 nodes is shown in Table 5.

In order to further evaluate the scalability of the OMACO algorithm, the algorithm is tested in the incomplete environment and the results are shown in Figure 10 and Figure 11. It can be seen that the OMACO algorithm can obtain the optimal path of the UAV swarm in an incomplete environment, satisfying the objective that the actual monitoring period of each node be not higher than the expected period. It can be concluded that the OMACO algorithm can solve the problem of UAV flight paths in different monitoring environments, satisfying the requirements for monitoring overdue events and monitoring privacy.

5. Conclusions

This research has studied the problem of multi-UAV persistent monitoring path planning from the perspective of monitoring privacy protection, reducing monitoring overdue events, and improving the privacy protection of the monitoring trajectory. A multi-UAV path planning mathematical model was established based on the monitoring overdue time and monitoring period entropy. Based on the overdue time, the heuristic function, transition probability and pheromone update, the strategy of the traditional ACO is improved. The simulation results show that the proposed OMACO algorithm can solve the optimal UAV flight path efficiently in both complete and incomplete monitoring environments and has better performance than ACO. This study is promising for the prevention of intelligent intrusions while meeting the requirements of regular monitoring.

However, as the complexity of the monitoring environment increases, there may be adversarial targets destroying monitoring tasks, and the privacy protection requirements may be more stringent. Subsequent consideration will be given to localize adversarial objects cooperatively while executing persistent monitoring assignments.

Author Contributions

Conceptualization, Y.C.; methodology, Y.S.; software, Y.C. and Y.S.; formal analysis, Y.C. and Y.S.; investigation, Y.C. and Y.S.; writing—original draft preparation, Y.S.; writing—review and editing, Y.C., M.H. and X.Z.; supervision, Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Joint fund of Science & Technology Department of Liaoning Province and the State Key Laboratory of Robotics with Grant No. 2021-KF-22-14.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Cassandras, C.G.; Lin, X.; Ding, X. An optimal control approach to the multi-agent persistent monitoring problem. IEEE Trans. Autom. Control 2012, 58, 947–961. [Google Scholar] [CrossRef]
Wang, Y.W.; Wei, Y.W.; Liu, X.K.; Zhou, N.; Cassandras, C.G. Optimal persistent monitoring using second-order agents with physical constraints. IEEE Trans. Autom. Control 2018, 64, 3239–3252. [Google Scholar] [CrossRef]
Smith, S.L.; Rus, D. Multi-robot monitoring in dynamic environments with guaranteed currency of observations. In Proceedings of the 49th IEEE Conference on Decision and Control (CDC), Atlanta, GA, USA, 15–17 December 2010; IEEE: Manhattan, NY, USA, 2010; pp. 514–521. [Google Scholar]
Ostertag, M.; Nikolay, A.; Tajana, R. Trajectory planning and optimization for minimizing uncertainty in persistent monitoring applications. J. Intell. Robot. Syst. 2022, 106, 2. [Google Scholar] [CrossRef]
Zhu, S.; Wang, D. Adversarial ground target tracking using UAVs with input constraints. J. Intell. Robot. Syst. 2012, 65, 521–532. [Google Scholar] [CrossRef]
Girard, A.R.; Howell, A.S.; Hedrick, J.K. Border patrol and surveillance missions using multiple unmanned air vehicles. In Proceedings of the 43rd IEEE Conference on Decision and Control, Nassau, Bahamas, 14–17 December 2004; Volume 1, pp. 620–625. [Google Scholar]
Huang, L.; Qu, H.; Ji, P.; Liu, X.; Fan, Z. A novel coordinated path planning method using k-degree smoothing for multi-UAVs. Appl. Soft Comput. 2016, 48, 182–192. [Google Scholar] [CrossRef]
Yun, W.J.; Park, S.; Kim, J.; Shin, M.; Jung, S.; Mohaisen, D.A.; Kim, J.H. Cooperative multiagent deep reinforcement learning for reliable surveillance via autonomous multi-UAV control. IEEE Trans. Ind. Inform. 2022, 18, 7086–7096. [Google Scholar] [CrossRef]
Huang, L.; Zhou, M.; Hao, K.; Hou, E. A survey of multi-robot regular and adversarial patrolling. IEEE/CAA J. Autom. Sin. 2019, 6, 894–903. [Google Scholar] [CrossRef]
Portugal, D.; Rocha, R. A survey on multi-robot patrolling algorithms. In Proceedings of the Doctoral Conference on Computing, Electrical and Industrial Systems, Costa de Caparica, Portugal, 22–24 February 2011; Springer: Berlin/Heidelberg, Germany, 2011; pp. 139–146. [Google Scholar]
Alamdari, S.; Fata, E.; Smith, S.L. Persistent monitoring in discrete environments: Minimizing the maximum weighted latency between observations. Int. J. Robot. Res. 2014, 33, 138–154. [Google Scholar] [CrossRef]
Elmaliach, Y.; Agmon, N.; Kaminka, G.A. Multi-robot area patrol under frequency constraints. Ann. Math. Artif. Intell. 2009, 57, 293–320. [Google Scholar] [CrossRef]
Smith, S.L.; Schwager, M.; Rus, D. Persistent robotic tasks: Monitoring and sweeping in changing environments. IEEE Trans. Robot. 2011, 28, 410–426. [Google Scholar] [CrossRef]
Wang, T.; Huang, P.; Dong, G. Cooperative persistent surveillance on a road network by multi-UGVs with detection ability. IEEE Trans. Ind. Electron. 2022, 69, 11468–11478. [Google Scholar] [CrossRef]
Kalyanam, K.; Pachter, M.; Casbeer, D. Average reward dynamic programming applied to a persistent visitation and data delivery problem. In Proceedings of the Dynamic Systems and Control Conference, Tysons, VA, USA, 11–13 October 2017; American Society of Mechanical Engineers: New York, NY, USA, 2017; Volume 58295, p. V003T39A002. [Google Scholar]
Kalyanam, K.; Manyam, S.; Von Moll, A.; Casbeer, D.; Pachter, M. Scalable and exact MILP methods for UAV persistent visitation problem. In Proceedings of the 2018 IEEE Conference on Control Technology and Applications (CCTA), Copenhagen, Denmark, 21–24 August 2018; IEEE: Manhattan, NY, USA, 2018; pp. 337–342. [Google Scholar]
Von Moll, A.L.; Casbeer, D.W.; Kalyanam, K.; Manyam, S.G. Genetic algorithm approach for UAV persistent visitation problem. In Proceedings of the Dynamic Systems and Control Conference, Atlanta, GA, USA, 30 September–3 October 2018; American Society of Mechanical Engineers: New York, NY, USA, 2018; Volume 51913, p. V003T36A001. [Google Scholar]
Scherer, J.; Rinner, B. Multi-UAV surveillance with minimum information idleness and latency constraints. IEEE Robot. Autom. Lett. 2020, 5, 4812–4819. [Google Scholar] [CrossRef]
Hari, S.K.K.; Rathinam, S.; Darbha, S.; Kalyanam, K.; Manyam, S.G.; Casbeer, D. The generalized persistent monitoring problem. In Proceedings of the 2019 American Control Conference (ACC), Philadelphia, PA, USA, 10–12 July 2019; IEEE: Manhattan, NY, USA, 2019; pp. 2783–2788. [Google Scholar]
Lei, Z.; Chen, X.; Chen, X.; Chai, L. Radial coverage strength for optimization of monocular multicamera deployment. IEEE/ASME Trans. Mechatron. 2021, 26, 3221–3231. [Google Scholar] [CrossRef]
Agmon, N.; Kaminka, G.A.; Kraus, S. Multi-robot adversarial patrolling: Facing a full-knowledge opponent. J. Artif. Intell. Res. 2011, 42, 887–916. [Google Scholar]
Tao, X.; Lang, N.; Li, H.; Xu, D. Path planning in uncertain environment with moving obstacles using warm start cross entropy. IEEE/ASME Trans. Mechatron. 2021, 27, 800–810. [Google Scholar] [CrossRef]
George, M.; Jafarpour, S.; Bullo, F. Markov chains with maximum entropy for robotic surveillance. IEEE Trans. Autom. Control 2018, 64, 1566–1580. [Google Scholar] [CrossRef]
Duan, X.; George, M.; Bullo, F. Markov chains with maximum return time entropy for robotic surveillance. IEEE Trans. Autom. Control 2019, 65, 72–86. [Google Scholar] [CrossRef]
Basilico, N.; Gatti, N.; Amigoni, F. Patrolling security games: Definition and algorithms for solving large instances with single patroller and single intruder. Artif. Intell. 2012, 184, 78–123. [Google Scholar] [CrossRef]
Xu, H.; Ford, B.; Fang, F.; Dilkina, B.; Plumptre, A.; Tambe, M.; Driciru, M.; Wanyama, F.; Rwetsiba, A.; Nsubaga, M.; et al. Optimal patrol planning for green security games with black-box attackers. In Proceedings of the International Conference on Decision and Game Theory for Security, Vienna, Austria, 10–13 October 2017; Springer: Cham, Switzerland, 2017; pp. 458–477. [Google Scholar]

Figure 1. Target exclusion mechanism.

Figure 2. Flowchart of the OMACO algorithm.

Figure 3. Discretization of a completely connected graph.

Figure 4. The persistent monitoring path obtained by the OMACO algorithm.

Figure 5. The actual monitoring period of task nodes obtained by the OMACO algorithm.

Figure 6. The persistent monitoring path obtained by ACO algorithm.

Figure 7. The actual monitoring period of each node obtained by ACO algorithm.

Figure 8. The objective function iteration curves of the two algorithms.

Figure 9. Incomplete environment including 15 task nodes.

Figure 10. The monitoring path in incomplete environment obtained by OMACO algorithm.

Figure 11. The monitoring period in an incomplete environment obtained by OMACO algorithm.

Table 1. Simulation parameters.

Parameters	Value	Notes
v	8 m/s	UAV speed
δ	40 m	interval for discretization
Z	15	ant quantity
c	5 s	constant
N_c	200	maximum iteration
α	1.2	pheromone importance factor
β	4	heuristic function importance factor
ρ	0.3	pheromone volatility factor
Q	10	pheromone quantity
K	500	monitoring Horizon
w	0.6	weight parameter

Table 2. Expected period of 10 task nodes.

Node	1	2	3	4	5	6	7	8	9	10
T_i (s)	370	380	350	375	365	390	380	380	375	360

Table 3. Monitoring results comparison between OMACO and ACO.

Node	Number of Visits		Average of Actual Monitoring Period
Node	OMACO	ACO	OMACO	ACO
1	12	12	198.33	198.75
2	9	7	253.33	350.00
3	14	8	167.14	278.75
4	11	8	232.50	286.25
5	12	9	204.17	242.22
6	9	15	260.00	165.00
7	10	12	220.00	188.33
8	9	7	266.11	335.00
9	11	15	219.09	166.33
10	14	6	170.00	365.00 *
Average	11.1	9.9	219.07	257.56

Table 4. Solution comparison between OMACO and ACO.

	OMACO	ACO
Iterations	4	28
Minimum Cost	0.433	0.814

Table 5. Expected period of 15 task nodes.

Node	1	2	3	4	5	6	7	8	9	10	11	12	13	14	15
T_i (s)	700	750	1050	950	950	850	950	850	700	850	850	750	700	700	750

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, Y.; Shu, Y.; Hu, M.; Zhao, X. Multi-UAV Cooperative Path Planning with Monitoring Privacy Preservation. Appl. Sci. 2022, 12, 12111. https://doi.org/10.3390/app122312111

AMA Style

Chen Y, Shu Y, Hu M, Zhao X. Multi-UAV Cooperative Path Planning with Monitoring Privacy Preservation. Applied Sciences. 2022; 12(23):12111. https://doi.org/10.3390/app122312111

Chicago/Turabian Style

Chen, Yang, Yifei Shu, Mian Hu, and Xingang Zhao. 2022. "Multi-UAV Cooperative Path Planning with Monitoring Privacy Preservation" Applied Sciences 12, no. 23: 12111. https://doi.org/10.3390/app122312111

APA Style

Chen, Y., Shu, Y., Hu, M., & Zhao, X. (2022). Multi-UAV Cooperative Path Planning with Monitoring Privacy Preservation. Applied Sciences, 12(23), 12111. https://doi.org/10.3390/app122312111

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multi-UAV Cooperative Path Planning with Monitoring Privacy Preservation

Abstract

1. Introduction

2. Multi-UAV Cooperative Persistent Monitoring Path Planning Model

2.1. Problem Description

2.2. Discretization of the Graph

2.3. Multi-UAV Collaborative Monitoring Constraints

2.4. UAV Motion Constraints

2.5. The Waiting Time Constraint of the Task Node

2.6. Min–Max Optimization for Multi-UAV Cooperative Monitoring

2.6.1. UAVs Monitoring Overdue Time Evaluation

2.6.2. UAVs Monitoring Path Privacy Criterion

2.6.3. Multi-UAV Persistent Monitoring Path Planning Model

3. Improved Multi-Group Ant Colony Optimization Algorithms Based on Monitoring Overdue Time

3.1. Heuristic Function Based on Monitoring Expectation Period

3.2. Target Exclusion Mechanism

3.3. Greedy Strategy for Node Selection

3.4. Pheromone Update Based on Monitoring Overdue Time and Monitoring Period Entropy

4. Simulation Experiments and Discussions

4.1. Algorithm Feasibility Analysis

4.2. Comparative Analysis with Traditional ACO

4.3. Algorithm Scalability Analysis

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI