1. Introduction
The coordination of autonomous robot teams has been a rising topic of interest during the last few decades. Formation control, surveillance or coverage are some of the most commonly-faced topics. Heterogeneous robot teams are gaining attention for monitoring tasks in dynamic environments. For instance, in [
1], a method is presented for computing the localization of a group with both ground and aerial robots, using measurements that are not associated with identified robots. Considering the low battery life of the aerial robots, in [
2], a group of them is assisted by a team of ground robots so that they can dock and recharge their batteries. Some authors focus on particular applications such as mobile sensory systems for monitoring environmental variables in greenhouses [
3], instead of large sensor networks, or sediment sampling in estuarine mudflats [
4] with drilling ground vehicles and aerial imagery robots.
We face the problem of coordinating a heterogeneous team composed of both ground and aerial robots so that they cooperate to monitor an environment. This way, the different view points of the sensors can be used to build a richer representation of the 3D scene. In these scenarios, the design of the coordination strategy is challenging, since several restrictions should be simultaneously satisfied. Ground and aerial robots should observe the same area to build the representation of the scene. In order to cover a greater area, agents should be deployed far from each other. However, the communication should be kept so that they can exchange data and perform the computations. In this framework, we propose a strategy in which the aerial robots act as leaders of the formation. Ground robots, also referred to as agents, execute different coverage-based algorithms to keep the aerial ones, from now on targets, in their sensing range.
We consider two alternatives for the ground robots, both of them built on classical centroidal Voronoi-based deployment ideas, which is also known as the Lloyd method and has been often used for static deployment. The alternatives proposed in this paper face the adaptation of these ideas for tracking tasks. The first strategy consists of building density functions, also known as importance functions, with focus on the aerial robots. As they move, the density functions are adapted accordingly, making the ground robots track them. All of the ground agents need to know where the density functions are. This strategy has been previously used in the literature as it is immediately extracted from the Lloyd algorithm. In the second strategy, we build virtual boundaries around the regions in which the aerial and ground robots are operating. Ground robots coordinate to evenly deploy over the region enclosed by these time-varying virtual boundaries. This strategy has never been used in the literature, as far as we know. It has the benefit that only the agents next to the boundaries need to be informed about them, and the others will react to their neighbors’ positions. For both strategies, we ensure that the ground robots remain connected by making them keep the links of a Minimum-distance Spanning Tree (MST) in the communication graph. This MST is recomputed at each iteration, so that agents have more freedom to move. Our method is centralized, although some of its steps admit a parallel execution.
We provide a thorough parametric study of the performance of the proposed strategies under several simulated scenarios built with MATLAB. We also evaluate the performance of the methods using realistic robotic simulation environments based on Gazebo and ROS (Robot Operating System), as well as using an experimental setup with six low-cost differential-drive robots.
To sum up, the main contributions of this paper are: (i) the proposal of methods for solving the problem of tracking a formation of targets or aerial robots with a team of agents or ground robots, covering the same areas as the targets and keeping the network connected; (ii) an algorithm based on virtual boundaries applicable to dynamic setups, which makes agents deploy in an equally-spaced configuration; (iii) we present extensive validations of our methods in different simulation scenarios and experiments under different setups.
This paper is organized as follows.
Section 2 gathers a review of related work in coverage, tracking and connectivity maintenance. In
Section 3, we present a base coverage algorithm with connectivity maintenance, using the Lloyd method and MST, respectively. Then, in
Section 4, we propose two strategies to adapt the coverage algorithm to tracking tasks. These strategies are tested with MATLAB scenarios in
Section 5. In
Section 6, we carry out some realistic simulations with Gazebo and ROS. In
Section 7, we show the results of the real experiments. Finally, a set of conclusions and future work are given in
Section 8.
2. Related Work
As previously stated, we consider tasks of environmental monitoring where a team of ground and aerial robots establishes coordination strategies to cover the same planar region with different view points. During their mission, ground agents must simultaneously satisfy several restrictions. They must follow the aerial robots and adapt dynamically to the positions covered by them. They also have to deploy in their coverage task, keeping the links of the communication network connected. Note that in this paper, we will consider that coverage and monitoring tasks are similar and will refer to them indifferently. In this section, we give an overview of the strategies that are the most related to the situation considered in this paper and discuss works on swarm aggregation, formation abstraction, formation control, containment control and leader-follower tracking. We discuss also some applications of Voronoi and Lloyd methods.
Leader-follower methods [
5,
6] are consensus-based control laws that make a team of followers track the position of the leader. The aim is that robots remain and finish close to each other and to the leader. Swarm aggregation methods [
7,
8] make robots move as a single entity through an environment, while avoiding obstacles. Although they do not collide, agents tend to be quite close to each other. In containment control tasks [
9,
10,
11], there are multiple stationary or dynamic leaders, which are aware of the global objectives of the task. The remaining agents execute distributed local rules to follow the leaders, so that they converge to the convex hull spanned by the leaders. Another approach is formation abstraction [
12]. The idea is to make agents remain inside the boundaries of a shape, e.g., polygonal, which can be modified or bent to adapt to the environment. The application is quite interesting, but ensuring collision avoidance or accurate tracking of the abstraction is usually hard to analyze. Here, we consider a similar idea, but associated with the well-known and widely-used Lloyd method. In this context, it is easy to introduce other ideas such as connectivity maintenance and collision avoidance, as well as uniform distribution on the area. There are also other leader-follower behaviors of higher complexity, as [
13], where a team of leaders moves so that they induce a set of periodic trajectories to be followed by the dependent robots. None of these methods is appropriate to deal with the problem we consider in this paper, since they may group too many agents, whereas we would like them to spread as much as possible to cover a larger area as they explore the environment. Moreover, some of the strategies described do not have control of the specific positions of the dependent robots.
Coverage and deployment algorithms are better suited for our purposes. The solutions considering Voronoi partitioning or the Lloyd method are the most common. They give rise to an easy to implement method, which is highly appealing. Voronoi partitioning consists of attaching to every robot the set of points of the working area that are closer to them and is very popular for deployment and coverage tasks. In fact, many researches take advantage of Voronoi ideas. Some examples include [
14], where agents detect evaders within their Voronoi partition and pursue them. In [
15], the goal is to capture targets randomly distributed in an environment. Each agent detects targets inside its Voronoi cell and estimates their position with a Kalman filter. The Lloyd method uses the Voronoi partitions to deploy the robots in the environment, and they are iteratively moved to the centroid of their Voronoi cell, separating them from each other and spreading over the working area. This method is widely used in the literature. The situation where some areas have more importance and robots have unlimited sensing and communication capabilities can be solved [
16,
17,
18] by iteratively moving the robots so that they reach centroidal Voronoi configurations. Several variations of this method have been presented to consider, e.g., robots using outdated information about the positions of the other team members [
19] or agents that can only sporadically send information to a central base station [
20]. The works discussing limited sensing and communication capabilities are more realistic and thus have a higher interest. Circular sensing footprints are included in [
21], while [
22] focuses on how to learn and adjust the regions associated with each robot depending on their sensing and actuating performances. Lloyd methods also allow the inclusion of obstacle-avoidance by the definition of safety regions that ensure there will be no collisions between the agents [
23]. These previous Lloyd-based strategies focus on static environments, while we want to make the agents track the mobile aerial robots. Lloyd coverage methods have been also used to track moving targets. The most commonly-used solution is to assign the density functions describing the importance of the environment to the targets and make them change with time. It has been experimentally validated that the Lloyd method works properly in these cases [
24,
25]. This is one of the strategies we also adopt in our work. However, in the presence of density functions, agents tend to accumulate nearby the targets. For this reason, we explore an alternative strategy that makes the agents spread uniformly along a varying working region.
A limitation of some multi-robot strategies is that often, there are no guarantees that the network will remain connected as the agents operate. This is an important issue, since robots often must be able to exchange data and communicate with each other in order to coordinate and to successfully fulfill their tasks. In [
26], this term is included in the control law governing the global coordination objective. It keeps an accurate estimate of the time-varying Fiedler eigenvector. Depending on the high-level coordination strategy, the connectivity control term will affect its achievement. It is remarkable that they study these effects both theoretically and experimentally. In [
27], there is a control law that enforces connectivity and an additional bounded control term that encapsulates the high-level control law for multi-robot cooperation. In addition, they design a control law that guarantees that the trajectories of the agents remain within a domain. This is done by means of a repulsion vector nearby the boundaries of the domain. A review of other connectivity control methods can be found in [
28]. All these control methods often rely on keeping up to date estimates of data that encapsulate global properties.
An alternative to maintain connectivity is to make the agents keep a set of links in the network. The less restrictive method is to build a tree in the graph. Specifically, the most used is the Minimum Spanning Tree (MST), where the links are selected depending on an assigned weight. Note also that as agents move, the weights of the tree branches change, as well, and the last MST becomes obsolete. Therefore, this MST should be periodically recomputed. Several works adopt this method for connectivity maintenance in different situations. A method for continuous-time systems that preserves a set of links is proposed in [
29]. They mention that the best structure to be kept would be a spanning tree, but no strategy is discussed for adapting the tree-links as robots move. In [
30], a high-level method is presented for managing tree topologies, which can be combined with motion tasks such as coverage, but the connections between the successive trees and the minimum spanning tree of the communication graph have not been established yet. In [
31], the authors present a formation control method for unicycle kinematics robots that keeps the connectivity. A triggered method for readjusting the tree at some time instances during deployment tasks is developed in [
32]. In [
33], the connectivity control takes place in a middle-ware layer to modify the goals of the robots when there is some risk of breaking some constraint. Furthermore, in [
34], the goal is to keep the links of a tree, done in this case in a separate layer at constant time intervals. In this paper, we explore a different approach. We make the agents keep the links of the Minimum-distance Spanning Tree (MST) of the communication graph. Therefore, we consider the re-computation of the MST at every time step, as is done in [
33,
34]. However, in order to alleviate the communication load, a triggering strategy as the one proposed in [
31,
32] could be used.
As a result, our methods are centralized. However, as acknowledged in [
35], some of the assumptions established for graph-based distributed control strategies are difficult to accommodate in realistic systems, and it may be reasonable to relax requirements on distributed methods. Some of the parts of our method are highly parallelizable. For instance, the MST computation could be alleviated by using the distributed algorithm presented in [
36]. We will investigate in the future ways to transfer information on density functions and on the virtual boundaries along the robot networks, to get an even more distributed method. Our aim is to get one step closer to a real implementation of the aerial-ground task, so we use a realistic simulation environment, as well as experiments with low-cost differential drive robots.
4. Simultaneous Deployment and Tracking with Connectivity Maintenance
We introduce an explanation of the connectivity maintenance method, and we give details on the control law. After that, we explain our two alternative methods for simultaneous deployment and tracking.
4.1. Connectivity Maintenance
Regarding the connectivity maintenance problem, we use a Minimum Spanning Tree (MST) [
31,
32,
33,
34]. This method joins the
n agents with
links. The nodes are connected in a communication tree that minimizes the sum of the lengths of the links in order to allow the maximum motion freedom. The MST must be recalculated at each step of the simulation. Otherwise, the links would heavily restrict the motion as agents move along. With all these considerations, the process that the algorithm runs at each iteration to calculate the MST in a centralized version is shown in Algorithm 1. A decentralized version can be found in [
36], and a triggered strategy is shown in [
32].
As mentioned above, the MST is used to ensure that the network remains connected as agents move. Observe Algorithm 2, which includes the main statements in one iteration of the Lloyd algorithm with connectivity maintenance. At every iteration
k, each agent
i computes the centroid
of its Voronoi region and computes its next position as:
If there are no restrictions on the agent motion, then , and the next agent position is its current centroid, .
Algorithm 1 MST calculation. |
- 1:
Distances - 2:
Communicables - 3:
MST_Graph - 4:
MST_Nodes - 5:
while do - 6:
for all MST_Nodes do - 7:
Candidates - 8:
for all Candidates do - 9:
if ∃ MST_Nodes = Candidates then - 10:
MST_Graph - 11:
MST_Graph - 12:
else - 13:
Curr_Distance - 14:
if Curr_Distance < Best_Distance then - 15:
Best_Distance = Curr_Distance - 16:
Best_Candidate = Candidates - 17:
Best_Source = MST_Nodes - 18:
end if - 19:
end if - 20:
end for - 21:
end for - 22:
MST_Nodes - 23:
MST_Graph - 24:
MST_Graph - 25:
end while
|
In order to keep the connectivity of the whole system, each agent must be within the communication radius
r of its MST-neighbors. However, the motion of the ground robots towards the centroids of their coverage regions may break the communication links, if no additional restrictions are included. At every step, the algorithm has to limit the motion of each pair of nodes
that are linked in the MST. When next positions
are restricted to a circle with the center at
and radius
r, their distance remains within
r [
38].
Figure 2 shows the maximum movement (red wide line) available in the worst case, where agents need to move in opposite directions.
Every
link in the MST that must be kept for agent
i can introduce a restriction on its next position
. Agent
i adjust its control input gain
K in Equation (
11) so that
equals the closest point from
to
that satisfies all the restrictions.
Note that agents may temporarily miss MST links as they navigate from to . However, all these MST links are re-established as all robots get to their goal positions , which is when they really need the network to be connected since they perform operations that require the exchange of data.
Algorithm 2 Lloyd_MSTConnCtrl. |
- 1:
Detect [ 37] - 2:
Coverage_regions - 3:
Compute centroid Equation ( 5) - 4:
Compute Algorithm 1 - 5:
Goal_Positions = Limit Section 4.1- 6:
Navigate - 7:
Send
|
Note that Algorithm 2 generates goal positions for each agent
, as if agents had integrator dynamics in a discrete-time setup. Our methods can be used by agents with other kinematic models. The positions
generated by our methods are high-level control laws. Agents are equipped with local motion controllers, specific for their kinematic models, to drive them nearby the high-level goals
. When they reach the goals with enough precision, agents inform each other and synchronize to start a new iteration of our algorithms. A detailed discussion of this control strategy can be found in
Section 7, for a team of six robot with differential drive kinematics.
4.2. Varying Importance Functions
We propose two alternatives to deal with tracking tasks. The first alternative solves the tracking issue through the density function
introduced in the coverage algorithm. This approach has been previously used in the literature to set up a dynamic coverage from the Voronoi tessellation, as it is immediate from the Lloyd static method. Each target is given an importance function defined as:
where
o is the position of the target. These values weight the calculation of the centroids, forcing the agents to reach the center of each function, where the targets are placed. Therefore, all the agents must be informed about the positions of all targets. Varying the position of these functions depending on the targets motion, we can achieve a tracking behavior. The ground agents will deploy, covering the same area as the aerial robots.
Figure 3 shows an example of the behavior of this method under fixed targets, in order to note how the agents deploy. Later, in
Section 5, we will present more examples with moving targets. Agents are represented with small red circles, and their communication tree is the set of gray lines that link them. The big blue circles represent the coverage regions of each agent. Finally, the grayscale concentric circles are the importance functions (with darker gray for higher values) where the targets are placed. In the first steps, the agents start moving towards the area of interest. When they reach it, they group around the first targets they obtain. Then, they deploy to cover all of them.
The process executed at each iteration of the simulation is gathered in Algorithm 3.
We have observed that this method provides an accumulation of ground robots around the targets, which is not the behavior desired for coverage tasks. In addition, the local optimal behavior of this method sometimes implies that some targets may be uncovered. For these reasons, we propose another alternative, explained next.
Algorithm 3 Varying importance functions. |
- 1:
Detect - 2:
Propagate - 3:
Update - 4:
Execute Algorithm 2
|
4.3. Redefining Fictitious Boundaries
We propose another alternative based on redefining the boundaries of the working area. Since in coverage problems with constant density functions, agents tend to evenly deploy over the area, here, the idea is to modify in a virtual way the boundaries of this area to make agents adapt accordingly. The interest in this method lies in the even deployment of the agents over the modified working area. In addition, only the agents next to the virtual limits need to be informed about them. As far as we know, this solution has not been used before in the literature.
We decide to define the new area as the minimum rectangle including both targets and agents. As the area reduces, agents deploy with no different importance until the zone is as small as possible. If there are enough agents, they will spread so as to cover all the targets. Therefore, this method requires the aerial robots, which will act as leaders of the mission, to move in ways that can be covered by the ground robots. This restriction depends on some factors like the number of agents or their sensing radius and will be studied in
Section 5.
Figure 4 shows the same example as the previous alternative. Here, the redefined fictitious boundaries are the green lines, and the targets are represented just with small gray circles. In the first steps, the agents move towards the targets while the rectangle reduces. Once the rectangle is as small as the targets allow, the agents spread over the final area. At the end, they cover a greater zone than the previous alternative, deploying uniformly. One can notice that the agents must have a fixed bearing reference in order to identify the orientation of the rectangle that iteratively defines the area to cover. In this paper, we make the assumption that they have this information. In real implementations, a mechanism for achieving this should be included. For instance, agents can be equipped with a compass or run a localization method, as in [
1].
The process that this alternative runs in a centralized way at each simulation step to follow the targets is shown in Algorithm 4.
Algorithm 4 Redefining fictitious boundaries. |
- 1:
Detect - 2:
- 3:
- 4:
- 5:
- 6:
Area - 7:
Area - 8:
Execute Algorithm 2
|
4.4. Comparison between Both Alternatives
The example studied allows making a comparison between the proposed alternatives. In addition to the previously-discussed differences between the amount of agents that need to be informed of the positions of the moving aerial robots, in this section, we explain some behavioral differences.
On the one hand, the importance function alternatives tend to group the ground robots around the targets. Depending on the initial relative positions, this could make them leave some others uncovered. On the other hand, the method redefining the boundaries achieves a wider formation, covering a greater zone than the first alternative. Therefore, this method focuses on the area around the targets, while the first method focuses on the targets themselves.
Considering this information, the importance function method is better suited to individual objectives whose environment is not important, while the redefining boundaries alternative is more appropriate for concentrations of targets or wide areas of interest.
5. Simulations and Results
Both alternatives presented so far are built on deployment methods, which were originally designed for static setups. Since our aim is to use these methods to dynamically track variations in the region to be covered, a question that arises is how slow the modifications in the environment must be in order for the agents to efficiently track them. This question is hard to answer from an analytical point of view. Moreover, the speed may depend on several facts (number of agents, area to be covered, sensing radius, communication radius, etc.). In this section, we make a thorough parametric study to identify the factors that influence the allowable speed. Here, the simulations are only executed with MATLAB in order to get repeatable results, abstracting from implementation issues carried by 3D simulation or real experiments. Later, in
Section 6 and
Section 7, we carry out 3D simulations and experiments that involve other specific problems (robot kinematics, collision avoidance, noisy data or restrictions on the motions).
We define now the base experiment for the parametric study. We have a group of six ground robots implementing the redefining boundaries algorithm. We will consider the particular case where the targets fly in a rectangular formation composed of twelve aerial robots and move 0.3 m per simulation step. The sensing radius of the ground robots is m, and their communication radius is m.
Starting from this experiment, we are varying four parameters in order to study how they influence the algorithm behavior. These four parameters are the velocity of the formation to track, the sensing radius, the number of agents and the tracking method.
5.1. Formation Velocity
In the first study, we vary the formation velocity in the interval 0.25–0.5 m/step. The formation of targets moves from left to right, and its initial position is described in
Figure 5, as well as the initial position of the ground agents.
The velocity is different in each case, and we run 60 steps of the method in all of the setups. Thus, the final position of the formation will be different. In
Figure 6, we show the final dispositions of all the robots for 0.25 and 0.5 m/step cases. For lower velocities, the algorithm is able to keep the formation covered as it moves. However, if velocity increases, the agents cannot correctly cover the whole formation. Some of them agglomerate in the rear formation, leaving the front targets uncovered.
Now, in
Figure 7a, we represent the cost functions (Equation (
1)) for each velocity along the simulation steps. One can realize that the cost function stabilizes at a constant value for each velocity. This behavior means that in all cases, the agents spread to a constant disposition and reach the speed of the formation, advancing with it. Furthermore, the metric minimizes for the lowest speeds, because the agents cover the whole formation of the targets, as we see in
Figure 6.
The behavior of the algorithm can also be monitored with a variable defined as:
i.e., the sum of the distances of each ground robot to the center of the formation,
O. This variable can describe the quality of the coverage performed.
Figure 7b gathers this sum of distances along the simulation with the velocities studied. We observe the same behavior of the curves as in the cost function. In the case of higher velocities, the sum of distances increases since the agents cannot cover the whole formation. The metric minimizes also for lower velocities. For this reason, from now on, we will show this performance metric (Equation (
13)), since it is easier to understand and its behavior stays similar to the cost function in the experiments remaining.
The results obtained evidence of the existence of a maximum formation speed that the algorithm can correctly track. In [
18], the authors prove that the static coverage task for a one-dimensional case, with
and a relative precision
, is achieved in a time that polynomially depends on the number of agents. Our case is dynamic and two-dimensional, so this study would imply a much higher complexity. We obtain the maximum speed experimentally instead. Taking a look at the graphs, we conclude that this maximum velocity is around 0.3 m/step for the described base setup. That is why the base experiment is designed with this formation velocity, in order to highlight the differences in the following studies.
Note that the velocity units are m/step. This will allow a higher maximum speed of the targets for systems that execute a step of the algorithm more quickly. This includes, e.g., the communication between the agents, the computational load or the motion restrictions of the ground robots.
5.2. Sensing Radius
In this case, we test five scenarios with the sensing radius in the interval 2–4 m. In order to maintain the relation
discussed in
Section 3, the communication radius is changed proportionally in the simulations. Initial positions are the same as in the previous experiments, and the simulations are run with 80 steps, so that every experiment reaches the steady state.
The sensing radius is often one of the parameters that affects the allowable speed of the formation to be tracked. Showing the final positions of the robots for 2 and 3.5 m (
Figure 8), one can conclude that, with a lower sensing radius, the agents cannot perform a wide deployment to cover the targets, and they accumulate at the rear formation.
Figure 9 represents the sum of the distances from the ground agents to the formation center for each sensing radius. As we said before, the smaller radius clearly makes this variable increase because the agents go to the back of the formation. On the other hand, while the radius increases, we can observe that the curves tend to the same value. This behavior is due to the fact that the agents have an equilibrium distribution that balances their advance with the formation speed. Even though the radius is much higher, the balance position would be the same.
From the results, we can observe that as the sensing radius of agents decreases, they have more difficulties following the moving formation. One of the possible reasons for this behavior is that, at every step, agents perform motions that keep them inside the intersection between their own Voronoi and sensing region. When this region is smaller, agents perform motions that may be too short and may prevent them from tracking the aerial formation.
Another possible explanation would be that the communication links were too restrictive for the motion of the agents. Recall that with each variation of the sensing radius, we have changed the communication radius as well in order to keep the relation
s. If the communication radius were higher, one could think that the agents may expand over the region and cover all the targets. For this reason, we have run another set of simulations with a larger communication radius, keeping a sensing radius of 2.5 m, which in the previous experiment could not cover all the formation. In
Figure 10, we can see the results of the sum of distances to the center of the formation varying the communication radius from 5–10 m. We can observe that all the graphs for each communication radius concur in the same curve. Therefore, we can conclude that this is not the reason why the agents cannot follow the formation with a lower sensing radius.
These results also allows us to think that the connectivity maintenance method is not interfering with the coverage and tracking goal, as far as the region can be indeed covered by the agents.
5.3. Number of Agents
This time, we vary the number of ground robots in the system, duplicating them iteratively in order to highlight the differences in the experiments. In this case, we test from 4–16 agents. The initial positions cannot be the same because the number of ground robots is variable. The agents start from positions that are close to the previous experiments and run a 300-step simulation in order to be able to observe the behavior of the curve with the highest number of agents.
Due to the fact that the number of agents is variable, now the sum of the agents’ distances to the formation center is not an appropriate variable to represent. We graph the mean distance instead in
Figure 11. The curves do not start from the same point as in the previous experiments because of this same reason.
Contrary to what one could have expected, we observe that, with a higher amount of agents, the method does not perform better, even though they have greater capabilities to cover an area. The mean distance to the center of the formation increases as the number of agents is higher. Observing
Figure 12, one can see the reason for this behavior. When there are more ground robots than needed, the extra agents tend to accumulate at the rear formation. In case this excess is extreme, the agents are very close and interfere with each other, making their available space to move too short. This prevents them from advance with the formation of targets, and their mean distance to the center of the formation increases.
The agents that lead the motion of the team are those that have direct contact with a boundary perpendicular to the direction of the target formation, i.e., the ones that are at the front and rear. They have a direct modification of their Voronoi and sensing region, implying a quicker reaction than the agents that are in the middle part of the formation. Therefore, the accumulation of unnecessary ground robots increases the number of agents in the center of the formation, obstructing the motion of the whole team. In the case that many ground robots were needed, the exploration strategy of the aerial team should consider a limit speed appropriate for the features of the system.
5.4. Tracking Method
Finally, we compare the behavior of the algorithm on the base experiment varying the particular method for the tracking task between that proposed in this paper. The initial positions of the agents are again the positions shown in
Figure 5 in both cases.
After a 30-step simulation, the final positions of the agents are shown in
Figure 13. The distributions of the agents over the formation are very similar. Both alternatives cover the twelve targets when the steady state is reached.
Representing the sum of the agents’ distances to the center of the formation (
Figure 14), we can see that the value of the importance functions method quickly decreases, so its deployment is faster and its response time lower. On the other hand, the redefining boundaries method achieves a lower value, so the agents in this case are a bit better deployed.
We can conclude then that both methods are suitable for this experiment, but the one redefining the boundaries leads to a better deployment. In addition, as we saw in
Section 4, this alternative is more appropriate for target agglomerations like the ones considered in the scenario discussed in this section.
6. Realistic Simulations
With the aim of getting one step closer to a real implementation of the methods proposed in
Section 4, we have implemented and tested our methods using a realistic 3D physics simulator specialized in robotics: Gazebo. We have used models associated with real robots (Turtlebots) which have a differential drive behavior. They are controlled with ROS through specific packages that include the robot description, the robot controller or the navigation stack. Configuring and connecting the set of packages, we are able to send goal positions to the ground agents once we compute the coverage algorithm. Iteratively, Turtlebots execute Algorithms 3 or 4 described in
Section 4. Between iterations, they navigate to their goal positions using local lower level controllers, taking into account their own motion kinematics restrictions. Since in the higher level methods described in
Section 3 and
Section 4, we have not included any collision-avoidance method, we incorporate a reactive local method in the Gazebo implementation so that agents do not collide as they navigate.
Lloyd methods, as other distributed strategies, make some assumptions that make the problem more tractable. In particular, in our simulations, we have made the following assumptions:
Scenarios are obstacle-free. Robots only need to avoid collisions with other robots, but not with other objects in the environment.
Ground robots can exchange data in one hop with robots within their communication radius.
Ground robots share a common orientation, as required by Algorithm 2.
Targets receive data from the ground robots to know that they have reached their goal destinations. They wait for this confirmation before performing their next motion.
Ground robots can also localize themselves, localize their neighbors and get information of the positions of the targets (here, aerial robots), by direct measurement or by multi-hop messages. We use the noise-free ground-truth data given by Gazebo.
We have designed four obstacle-free scenarios in order to examine the behavior of the algorithm in some common situations and some extreme cases where it might fail. The reader can find in [
39] four videos with the names “
case_i”, where
i is a number from 1–4. The videos represent the simulations carried out in this section, in order of appearance. They contain the simulation in Gazebo and the graphs obtained from MATLAB. The first one is a simple expansion without targets or zones more important than others. This is just a coverage task, and we do not need to implement any of the tracking alternatives proposed.
Figure 15 shows the final layout of the ground agents. We have developed a visualization system that helps to understand the movement of each agent in the Gazebo simulation. The black ground robots are the Turtlebots, which are the ones actually running the algorithms. We add some elements like the targets, implemented as aerial robots that are moved discretely. The colored spheres represent the positions of the ground robots and their goal positions in the current simulation step. Additionally, the gray cylinders represent the communication tree reached at the present step.
The second experiment consists of five agents implementing the redefining boundaries method covering and tracking a formation of ten quadrotors that fly in a triangular formation. The final disposition of both ground and aerial robots is shown in
Figure 16.
The third and fourth experiments implement the method with the importance functions to track the targets. In the third case, the targets start together and separate into opposite directions. The ground team spreads in a straight line until the point that the communication tree allows them.
Figure 17 shows this final straight formation of the agents.
Finally, in the fourth case, two targets describe curve trajectories that cross in the center of the working area. In the first part of the experiment, the ground team covers the aerial targets and tracks them. During the crossing, the ground agents get too close. Their collision avoidance method works well and prevents them from crashing. However, their redeployment can not be correctly performed because of the agglomeration produced, finally leaving one of the targets uncovered.
Figure 18 shows this fact at the end of the simulation.
7. Experimental Results
In addition to the simulations executed in
Section 5 and
Section 6, we present a real robot experiment in order to analyze the algorithm performance in a more realistic environment. We test the redefining boundaries method as it is the major contribution of this article.
The experimental tests were carried out with the setup shown in
Figure 19. Six differential-drive robots act as agents, and each of them includes a Raspberry Pi 2 Model B and a driver controller. The localization of each robot is achieved using aRuCo [
40] markers together with a webcam, Logitech C310. Therefore the source of the localization is noisy instead of the ideally precise ground-truth data used in the Gazebo simulation in
Section 6. Moreover, motions of the robot wheels are affected by real actuation restrictions and by occasional wheel slips. The robots are controlled by a goal-to-goal controller programmed in MATLAB and a wheel velocity controller running inside each Raspberry Pi with a Python code reading the encoders placed on each wheel and controlling the speed. Commands are sent to the robots by a TCP/IP server opened from MATLAB to each Raspberry Pi, and the Python code sends the voltage messages to the driver controller, which powers the motors.
In
Figure 20, a frame of the experiment conducted can be observed. The video recording this experiment can be found at [
39] with the name “
experiment”. The robots perform the role of agents, and the moving targets that should be covered are plotted with red circles. The movement of the robots reaches the green asterisks representing the centroids of each coverage region, which are drawn in blue lines. The sensing radius of the agents is set to
mm. All of these elements are plotted from the algorithm by homography in each frame of the video.
The movement designed for the targets is composed of two straight sections joined by a semicircle, scheduled beforehand in a three by four formation. The formation speed is mm/step, much lower than in the simulations. The movement set to the external targets during the curve is scheduled in order to maintain their linear velocity during the curve trajectory. These external agents perform a 750-mm radius curve, while the radius of the internal is 150 mm. The collision between the agents is avoided by narrowing the Voronoi regions by a safety radius of mm.
The control law scheme used in the experiment is the one described in
Section 4.1: agents receive goal positions, generated according to Equation (
11) (green asterisks). Note the goal positions at each iteration are the result of computing the centroids of the Voronoi-regions and adjusting them so as to keep the communication links, as described in
Section 3 and
Section 4. Note also that the low-cost robots in the experiment have differential-drive kinematics. Thus, agents run their low level controllers to get close to the high-level goal positions. When the agents are near enough to their goal (we use a distance tolerance
mm in our experiments), they stop moving and wait for the remaining agents to finish navigating to their goals. After that, all the agents start a new iteration of Algorithm 4, which includes the re-computation of Voronoi regions.
Figure 21 represents the real trajectories followed by the different agents (continuous line) against the set of consecutive goal positions (Equation (
11)) derived from the algorithm (dotted line).
In
Figure 22, we represent the evolution of the X coordinate of the trajectories and the goal positions along the time of the experiment.
Observe how the desired high-level goal positions (Equation (
11)) remain unchanged for some time. There, the low-level controllers of the agents drive them from their current positions to these new goals. Thus, as can be seen, Algorithms 3 and 4 proposed in this paper can be run by agents with different kinematic models, as long as they are equipped with local controllers that drive them to the high level goal positions generated by Equation (
11).
8. Conclusions
We propose and test two strategies to solve the problem of simultaneous coverage and tracking, taking into account sensing and communication limitations, while ensuring that the network remains connected. Both strategies are based on Lloyd methods for static deployment. The first consists of modifying the positions of density functions according to the movement of the targets. This solution has been commonly used in the literature for adapting the static algorithm to tracking tasks. The second strategy iteratively modifies the boundaries of the working area in order to follow the targets while covering their surroundings. As far as we know, this is a novel solution for the problem addressed in this paper.
The density functions solution results in reacting faster to the targets’ motion. However, depending on the initial disposition of the agents, some targets may end up uncovered, while the agents concentrate around other targets. In addition, all of the agents need to know the current positions of the targets in order to compute the algorithm.
On the other hand, the redefining boundaries solution has a slower reaction to the targets’ motion, but achieves an even deployment over the area of interest. Furthermore, the information about the position of the targets is transformed into the position of the boundaries, and this information is only needed by the agents that have contact with them. Therefore, the communication load is lighter when implementing many agents. A parametric study of the second method shows the existence of a maximum velocity of the targets that it can follow. We have shown that this limit speed depends on the number of agents and their sensing radius and does not depend on the communication radius. We have implemented and validated the system on a realistic simulator. We have carried out real experiments with a team of six low-cost robots. We have shown that the methods can cope with measurement noises and can be combined with several kinematic robot models.