Next Article in Journal
Clustering Analysis of Seismicity in the Anatolian Region with Implications for Seismic Hazard
Next Article in Special Issue
Optimum k-Nearest Neighbors for Heading Synchronization on a Swarm of UAVs under a Time-Evolving Communication Network
Previous Article in Journal
The “Real” Gibbs Paradox and a Composition-Based Resolution
Previous Article in Special Issue
Attitude Synchronization of a Group of Rigid Bodies Using Exponential Coordinates
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Formation and Flocking Control Algorithms for Robot Networks with Double Integrator Dynamics and Time-Varying Formations

by
Carlos Montañez-Molina
1,
Javier Pliego-Jiménez
1,2,* and
Rigoberto Martínez-Clark
3
1
Departamento de Electrónica y Telecomunicaciones, División de Física Aplicada, Centro de Investigación Científica y de Educación Superior de Ensenada, Ensenada 22860, Mexico
2
Programa Investigadores por México, Consejo Nacional de Ciencia y Tecnología, Mexico City 03940, Mexico
3
Facultad de Ingeniería, Administración y Ciencias Sociales, Universidad Autónoma de Baja California, Mexicali 21100, Mexico
*
Author to whom correspondence should be addressed.
Entropy 2023, 25(6), 834; https://doi.org/10.3390/e25060834
Submission received: 16 February 2023 / Revised: 3 April 2023 / Accepted: 5 April 2023 / Published: 23 May 2023
(This article belongs to the Special Issue Synchronization in Time-Evolving Complex Networks)

Abstract

:
In this work, we study the problem of designing control laws that achieve time-varying formation and flocking behaviors in robot networks where each agent or robot presents double integrator dynamics. To design the control laws, we adopt a hierarchical control approach. First, we introduce a virtual velocity, which is used as a virtual control input for the position subsystem (outer loop). The objective of the virtual velocity is to achieve collective behaviors. Then, we design a velocity tracking control law for the velocity subsystem (inner loop). An advantage of the proposed approach is that the robots do not require the velocity of their neighbors. Additionally, we address the case in which the second state of the system is not available for feedback. We include a set of simulation results to show the performance of the proposed control laws.

1. Introduction

Collective behaviors in nature are interesting phenomena that have attracted the attention of scientists from different disciplines. Interesting examples include flocks of birds and schools of fish, social networks, vehicular traffic, etc. In these examples, collective behaviors emerge due to local interactions between the members of the swarm. Such collective behaviors can be used to perform complex tasks; thus, the objective is to emulate them in groups of robots (in this work, the words agent and robot are used interchangeably), such as wheeled mobile robots, aerial and underwater vehicles, or robot manipulators. Controlling multiple agents presents some advantages, such as time reductions and energy savings when performing complex tasks. On the other hand, some of the applications that can be performed when replicating the collective behaviors in robot networks are search and rescue missions, object transportation, agricultural irrigation, and area mapping.
Examples of collective behaviors are consensus, rendezvous, synchronization, formation, and flocking. In this work, we only focus on formation and flocking behaviors in robot networks with double integrator dynamics. Currently, we can find a great variety of works that address these behaviors. A survey on formation control algorithms can be found in [1]. In this work, the authors classify the cases of study in formation control based on position, displacement, and distance measurements. In [2,3], the leader–follower approach is used to carry out the desired formation in multi-agent systems with double integrator dynamics. In [4,5], the leader–follower approach is also used, but these works consider time-varying formation problems. The problem of time-varying formation is more challenging than fixed formation since the geometric pattern of the formation changes over time. The problem of achieving time-varying formation with bounded control inputs is addressed in [5]. The authors present experimental results to validate the performance of the proposed controllers. A bearing-based formation control approach is presented in [6]. The control strategy allows one to track the time-varying leader velocities and to scale the geometry of the formation. A distributed control protocol that achieves time-varying formation for robot networks with linear dynamics is proposed in [7]. The authors introduce an adaptive coupling strength, which is updated according to the relative positions of the agents. A controller that allows one to accomplish time-varying formation robot networks with switching directed topologies is designed in [8]. A fault-tolerant time-varying formation control is proposed in [9]; to achieve the control goal, adaptive control theory is used. In [10], an affine formation maneuver algorithm that makes use of undirected graphs for the communication between agents is presented.
On the other hand, the flocking control problem mainly consists of causing all agents in the group to move to the same velocity while keeping a desired shape. In [11], the author proposes a control law that ensures the flocking behavior in robot networks with double integrator dynamics; it is important to point out that the controller also avoids collisions between agents and static obstacles. In [12], three control algorithms are proposed: the first two replicate the flocking behavior in free space and the third algorithm replicates the collective behavior in a space with obstacles; numerical results are shown to validate the three cases. A bounded control law based on potential fields that maintains a safe distance between the agents is presented in [13]. A flocking algorithm that allows the tracking of a virtual leader is studied in [14]; it is important to point out that in in this work, it is demonstrated that the flocking is achieved even when only some agents know the reference signals. The same problem is addressed in [15], where the authors consider the case wherein only some robots have access to the leader information. In addition, the authors employ adaptive control theory, which is the main difference with respect to [14]; moreover, the authors assume uncertainties and disturbances in the dynamic model of the robots. In [16], the problem of flocking is solved using an adaptive flocking control law, and the adaptive neural network estimates the uncertain nonlinear dynamics of the agents. The flocking problem with time delays is addressed in [17]. An optimal control approach is used to design flocking control in [18]; such an algorithm allows one to address obstacle avoidance and trajectory tracking problems. In [19], the authors propose a dynamic feedback position control law to address the problem. Such a control law does not require the velocity information of the agents to achieve the desired behavior. The flocking problem in discrete time is studied in [20]; the proposed algorithm is decentralized and allows the agents to converge to a formation based on flocking behavior, and simulations for the validation of the algorithm are shown. Unlike the mentioned works above, in [21,22,23], the authors design an observer to estimate the velocity, due to the fact that, in practice, it is not always possible to measure all the states of the system.
This work focuses on the design of control algorithms that achieve time-varying formation and flocking for robot networks where each node is modeled as a double integrator. The main contribution of this work is the decentralized dynamic hierarchical control law where, unlike the controllers reported in the literature, for the agent i, the proposed controller only requires the relative position of its neighbors and its own velocity. Additionally, the algorithm works for undirected and directed graphs. However, the graph must be connected. Furthermore, we consider the case wherein the velocity of the agents is not available from measurements, and, for the flocking case, only a few agents know the desired flocking velocity. To address the aforementioned problems, we propose Luenberger and distributed observers to estimate the unknown states and signals. Finally, we provide numerical simulations to show the effectiveness of the proposed control laws and observers. The remainder of the paper is organized as follows. Some preliminaries and the problem statement are given in Section 2. Formation and flocking control laws using the full state of the robots are developed in Section 3. The problem of time-varying formation and flocking without velocity measurements is studied in Section 4. Section 5 presents a distributed observer for the problem of flocking with partial information. Extensive simulations are reported in Section 6. Finally, the paper ends with some conclusions given in Section 7.

2. Preliminaries

2.1. Problem Statement

Consider a robot network with N identical elements whose dynamics are governed by double integrator dynamics
p ˙ i = v i
v ˙ i = u i
where the states of the i-th robot are the position p i n and velocity v i n ; u i n is the control input and i N = { 1 , 2 , , N } . Next, we describe the collective behaviors under study.
Formation.
The time-varying formation problem consists of steering all the robots of the team to a certain position to form a desired time-varying geometric shape or pattern while the velocity of each robot converges to the rate of change of the formation. The formation control objective can be defined as
lim t p i j ( t ) δ i j ( t ) = 0 and lim t v i ( t ) = δ ˙ i ( t ) i , j N
where p i j ( t ) = p i ( t ) p j ( t ) n is the error of position between the agent i and its neighbor denoted by j. δ i j ( t ) = δ i ( t ) δ j ( t ) n describes the geometry of the formation and δ i ( t ) n , δ j ( t ) n are time-varying offset vectors that are at least twice differentiable.
Flocking.
In this collective behavior, all the robots follow a common reference velocity while maintaining a desired formation; thus, the flocking control objective is to achieve
lim t p i j ( t ) δ i j ( t ) = 0 and lim t v i ( t ) = v d ( t ) + δ ˙ i ( t ) i , j N
where v d ( t ) n is the desired velocity for the team.
From the above definitions, the time-varying formation is a special case of flocking when v d ( t ) = 0 .

2.2. Graph Theory

We consider that the communication topology is fixed and the communication channels can be either unidirectional or bidirectional. Such topologies can be modeled by undirected and directed graphs. A graph G = ( N , E ) consists of a set of nodes N (each node represents a robot in the network) and a set of edges E N × N . If the set of edges are unordered pairs of N , the graph is called undirected. For an undirected graph, the edge ( i , j ) E denotes that robots i and j can obtain information from each other. On the other hand, if the edges are ordered pairs of N , the graph is called a directed or digraph, and information only flows in one direction. The adjacency matrix A = [ a i j ] N × N for an undirected graph is defined as
a i j = 1 if ( i , j ) E 0 otherwise
For a directed graph, we have
a i j = 1 j N i 0 otherwise
where N i is the set of neighbors transmitting information to agent i. The Laplacian matrix L = [ i j ] N × N is defined as
i j = k = 1 N a i k i = j a i j i j .
For an undirected graph, the Laplacian matrix is symmetric and positive semidefinite. However, for a directed graph, L is not necessarily symmetric. By construction, the Laplacian matrix satisfies L 1 N = 0 , meaning that the vector 1 N = 1 1 N is an eigenvector of L and it is associated with the zero eigenvalue λ 1 = 0 . For an undirected graph, all of the nonzero eigenvalues of L are real and positive, whereas, for a directed graph, the rest of the spectrum of L have positive real parts. If the eigenvalue λ 1 = 0 has algebraic multiplicity of one, then the graph is connected [24,25,26]. If there exists a sequence of edges (undirected path) that joins any pair of nodes, we say that the graph is connected [27]. A digraph is strongly connected if there is a directed path that connects every pair of nodes.

3. Formation and Flocking Controllers

In order to design the formation and flocking control laws, we exploit the cascade structure (chain of integrators) of the robot’s dynamics given in (1). To this end, we introduce the virtual input ϑ i n and rewrite (1) as follows:
p ˙ i = v ˜ i + ϑ i
v ˜ ˙ i = ϑ ˙ i + u i
where v ˜ i = v i ϑ i n is the velocity error. The open-loop system (7) can be analyzed as an interconnected system with states ( p i , v ˜ i ) and inputs ( ϑ i , u i ) coupled by the term v ˜ i . Similar to the backstepping approach, the control problem is divided into two particular control problems: the first step consists of designing the control ϑ i that achieves time-varying formation or flocking for the subsystem (7a), and as a second step, we design the control u i that steers the velocity error v ˜ i asymptotically to zero.
The hierarchical control approach simplifies the control design; nevertheless, it imposes the restriction that the virtual input ϑ i must be at least once differentiable; see (7b). Furthermore, it is desirable that the time derivative of control input ϑ i for the i-th robot does not depend on the velocity of its neighbors. To fulfill the control objectives mentioned above, we propose the following dynamic control law:
ϑ i = v d ( t ) + δ ˙ i ( t ) k i φ i
φ ˙ i = k i φ i + c j = 1 N a i j ( p i j δ i j ( t ) φ i j )
u i = ϑ ˙ i γ i v ˜ i
where v d ( t ) n is the desired flocking velocity that the robot network must follow, φ i n is an additional state, k i , γ i are positive gains, c > 0 is the coupling strength, a i j is the i j -th element of the adjacency matrix and φ i j = φ i φ j . In Figure 1, we present in a visual manner the proposed control law, so that readers have a better interpretation. Now, we are in a position to state the first result of this paper, which is summarized in the following proposition.
Proposition 1.
Assume that the graph G is connected and consider the robot network (1) in a closed loop with (8). Then, the dynamic control law (8) guarantees
(i) 
time-varying formation if v d ( t ) = 0 ;
(ii) 
flocking behavior in the sense of (3).
Proof. 
To begin with stability analysis, we introduce the auxiliary state
r i = p i p d ( t ) δ i ( t ) φ i n
where p d ( t ) = 0 t v d ( τ ) d τ . Then, it follows that r ˙ i = p ˙ i v d ( t ) δ ˙ i ( t ) φ ˙ i and r i j = r i r j = p i j δ i j ( t ) φ i j . Substituting the control law (8) in (1) and taking into account (7), the closed-loop dynamics read
r ˙ i = c j = 1 N a i j r i j + v ˜ i
φ ˙ i = k i φ i + c j = 1 N r i j
v ˜ ˙ i = γ i v ˜ i .
Using the properties of the Kronecker product, the closed-loop dynamics can be written in compact form as follows:
φ ˙ = ( K I n ) φ + c ( L I n ) r
r ˙ = c ( L I n ) r + v ˜
v ˜ ˙ = ( Γ I n ) v ˜
where φ = φ 1 φ N n N , r = r 1 r N n N , v ˜ = v ˜ 1 v ˜ N n N are stacked vectors; K = diag { k 1 , , k N } n N × n N , Γ = diag { γ 1 , , γ N } n N × n N are positive definite diagonal matrices; I n n × n is the identity matrix; and the symbol ⊗ denotes the Kronecker product. Using the fact L 1 N = 0 , it can be shown that the closed-loop dynamics (11) have an equilibrium point at ( φ , r , v ˜ ) = ( 0 , 1 N r , 0 ) for some r n . To proceed with the stability analysis, we use the following change in coordinates [26]:
q = ( Q I n ) r n ( N 1 )
where the matrix Q N 1 × N is defined as
Q = 1 + ( N 1 ) ϑ 1 ϑ ϑ ϑ 1 + ( N 1 ) ϑ ϑ 1 ϑ ϑ 1 + ( N 1 ) ϑ ϑ ϑ 1 ϑ
with
ϑ = ( N N ) N ( N 1 ) .
By construction, the matrix Q satisfies [28]
Q 1 N = 0 , Q Q = I N 1 , Q Q = I N 1 N 1 N 1 N .
Using these properties, it can be shown that
q = ( Q I n ) r = 0 r = 1 N r
Finally, by taking into account (12), (14) and
r = ( Q I n ) q + 1 N ( 1 N 1 N I n ) r ,
the closed-loop dynamics (11) become
ξ ˙ = A ξ ξ
where ξ = φ q v ˜ 3 n N n is the extended state and
A ξ = K I n c ( L Q I n ) O n N O n ( N 1 ) × n N c ( L r I n ) Q I n O n N O n N × n ( N 1 ) Γ I n
where L r = Q L Q N 1 × N 1 is the reduced Laplacian matrix; O n N × n N n N × n N and O n ( N 1 ) × n N n N × n N denote zero matrices. Since G is a connected graph by assumption, the real parts of the eigenvalues of L r are positive [26]. Moreover, the eigenvalues of L r are the same as the Laplacian matrix L except for λ 1 = 0 . This implies that the matrix c L r is Hurwitz. Since each matrix on the diagonal of A ξ is a Hurwitz matrix and A ξ is a block upper triangular matrix, it follows that A ξ is also Hurwitz. Therefore, the origin of (16) ( ξ = 0 ) is a globally exponentially stable equilibrium point.
From Equation (15), the exponential convergence of q ( t ) to zero implies r ( t ) 1 N r as t and hence r i ( t ) r , which in turn implies r i j ( t ) 0 . Combining the previous result with the exponential convergence of φ ( t ) to zero yields
lim t p i j ( t ) δ i j ( t ) = r i j ( t ) + φ i j ( t ) = 0 , i , j N .
This implies that the robot network achieves the desired geometry formation. The last part of the proof consists of showing that the robots’ velocities satisfy lim t v i ( t ) = δ ˙ i ( t ) ( i N ) for time-varying formation and lim t v i ( t ) = v d ( t ) + δ ˙ i ( t ) for the flocking behavior. This can be done by noticing that lim t v ˜ i ( t ) = v i ( t ) ϑ i ( t ) = 0 , and from (8a) one has lim t v i ( t ) = ϑ i ( t ) = v d ( t ) + δ ˙ i ( t ) k i φ i ( t ) . Since φ i ( t ) 0 as t , it follows that
lim t v i ( t ) = v d ( t ) + δ ˙ i ( t )
meaning that the flocking behavior is achieved. Finally, for time-varying formation ( v d ( t ) = 0 ), we have lim t v i ( t ) ϑ i ( t ) = v i ( t ) δ ˙ i ( t ) + k i φ i ( t ) = 0 and hence
lim t v i ( t ) = δ ˙ i ( t )
which implies that the time-varying formation is achieved. □

4. Time-Varying Formation and Flocking Controllers without Velocity Measurements

Motivated by the fact that many commercial robots are only equipped with position sensors, in this section, we combine the dynamic control law (8) with a linear observer that estimates the second state for the robot i. Consider the robot network
p ˙ i = v i
v ˙ i = u i
y i = p i , i N
where y i n is the output of the system. To address the lack of velocity measurements, we propose the observer
p ^ ˙ i = v ^ i + Ξ 1 i y ˜ i
v ^ ˙ i = u i + Ξ 2 i y ˜ i
y ˜ i = p i p ^ i
where p ^ i n and v ^ i n denote, respectively, the estimated position and estimated velocity; y ˜ i 2 is the output error and Ξ 1 i n × n , Ξ 2 i n × n are the observer gains. In Figure 2, we show a block diagram that depicts the controller when including the Luenberger observer.
Proposition 2.
Consider the robot network (17) and the observer (18) in a closed loop with the dynamic controller
ϑ i = v d ( t ) + δ ˙ i ( t ) k i φ i
φ ˙ i = k i φ i + c j = 1 N a i j ( p i j δ i j ( t ) φ i j )
u i = ϑ ˙ i γ i ( v ^ i ϑ i )
where v d ( t ) n is the desired flocking velocity, k i , γ i are positive gains, c > 0 is the coupling strength, and v ^ i n is the estimated velocity. Assume that the communication graph G is connected and the observer gain Ξ i = Ξ 1 i Ξ 2 i 2 n × n is chosen such that the matrix A i Ξ i C i is Hurwitz where
A i = O n I n O n O n , C i = I n O n .
Then, the controller (19) in combination with the observer (18) guarantees
(i) 
time-varying formation if v d ( t ) = 0 ;
(ii) 
flocking as defined in (3);
(iii) 
v ^ i ( t ) v i ( t ) as t .
Proof. 
First, we define the observer error x ˜ i = ( p i p ^ i ) ( v i v ^ i ) 2 n . By taking into account (17) and (18), the time derivative of x ˜ i is given by
x ˜ ˙ i = A o i x ˜ i
where A o i = A i Ξ i C i 2 n × 2 n . By taking into account the observer error dynamics (20), the robot network (17) and the control law (19), the overall closed-loop dynamics are given by
φ ˙ i = k i φ i + c j = 1 N a i j r i j
r ˙ i = c j = 1 N a i j r i j + v ˜ i
v ˜ ˙ i = γ i v ˜ i + E i x ˜ i
x ˜ ˙ i = A o i x ˜ i
where E i = O n γ i I n n × 2 n and r i is defined in (9). Notice that the addition of the observer error dynamics does not destroy the cascade structure of the closed-loop dynamics. Therefore, we can follow the steps of the proof of Proposition 1 to obtain
η ˙ = A η η
where η = φ q v ˜ x ˜ 5 n N n is the extended state vector, q n ( N 1 ) is defined in (12), x ˜ = x ˜ 1 x ˜ N 2 n N , and A η is a block upper triangular matrix given by
A η = K I n c ( L Q I n ) O n N O n N × 2 n N O n ( N 1 ) × n N c ( L r I n ) Q I n O n ( N 1 ) × 2 n N O n N O n N × n ( N 1 ) Γ I n E O n N × 2 n N O 2 n N × n ( N 1 ) O 2 n N × n N A o
with A o = blockdiag { A o 1 , , A o N } 2 n N × 2 n N and
E = E 1 O n × 2 n O n × 2 n E N .
Since the observer gain Ξ i guarantees that A o i = A i Ξ i C i is Hurwitz, each matrix on the diagonal of A η is Hurwitz; thus, the equilibrium point η = 0 is exponentially stable. The previous result directly implies that
lim t v ^ i ( t ) = v i ( t ) ,
and hence item ( i i i ) is proven. Due to the fact that φ ( t ) , q ( t ) and v ˜ ( t ) converge exponentially to zero, we can follow the steps of the proof of Proposition 1 to prove items ( i ) and ( i i ) . □

5. Flocking Control with Partial Information

In order to design the controllers (8) and (19), it is assumed that all the robots have access to the desired flocking velocity v d ( t ) . In this section, we study the scenario in which only a portion of the robots have access to the flocking velocity. Let N d N be the subset that contains all the robots that have access to v d ( t ) . Since at least one robot knows v d ( t ) , it is possible to estimate the flocking velocity for the robot i N d by using information about its neighbors.
Proposition 3.
Assume that G is undirected (directed) and connected (strongly connected). Moreover, assume that the desired flocking velocity v d ( t ) n is a bounded continuous function and satisfies that v ˙ d ( t ) 0 as t . Then, the distributed observer
v ^ ˙ i d = μ j = 1 N a i j v ^ i j d + d i ( v ^ i d v d ( t ) )
where μ > 0 is the observer gain, v ^ i d ( t ) n is an estimate of v d ( t ) , v ^ i j d = v ^ i d v ^ j d and
d i = 1 , if   i N d 0 o t h e r w i s e
guarantees that v ^ i d ( t ) v d ( t ) as t .
Proof. 
The flocking estimation error is defined as v ¯ i d v ^ i d v d ( t ) and, from (24), its time derivative is given by
v ¯ ˙ i d = μ j = 1 N a i j v ¯ i j d + d i v ¯ i d v ˙ d ( t )
where we use the fact that v ^ i d v ^ j d = v ¯ i d v ¯ j d . Using again the Kronecker product, (26) becomes
v ¯ ˙ d = μ ( L I n ) v ¯ d μ ( D I n ) v ¯ d 1 N v ˙ d ( t ) = μ ( F I n ) v ¯ d 1 N v ˙ d ( t )
where v ¯ d = ( v ¯ 1 d ) ( v ¯ N d ) n N , D = diag { d 1 , , d N } N × N and F = L + D N × N . Since G is undirected (directed) and connected (strongly connected) by assumption, the eigenvalues of F are strictly positive (see [29] for further details). This implies that the matrix μ ( F I n ) is Hurwitz. Thus, if v ˙ d ( t ) = 0 , the system (27) has an exponentially stable equilibrium point at the origin. The differential Equation (27) can be analyzed as a stable linear system with a continuous bounded input v ˙ d ( t ) that converges asymptotically to zero [30]. Therefore, we can conclude that
lim t v ¯ d ( t ) = 0 .
Therefore, v ^ i d ( t ) v d ( t ) as t for all i N . □
The results of Propositions 2 and 3 can be combined to solve the problem of flocking without velocity measurements and with partial information.
Proposition 4.
Consider the robot network (1) in a closed loop with the flocking control law
ϑ i = v ^ i d ( t ) + δ ˙ i ( t ) k i φ i
φ ˙ i = k i φ i + c j = 1 N a i j ( p i j δ i j ( t ) φ i j ) , φ i ( 0 ) = φ i 0 n
u i = ϑ ˙ i γ i ( v ^ i v i a )
where v ^ i n and v ^ i d n are obtained from the observers (18) and (24), respectively. Assume that G is undirected (directed) and connected (strongly connected). Moreover, assume that the desired flocking velocity v d ( t ) n is a bounded continuous function and satisfies that v ˙ d ( t ) 0 as t . Then, the controller (28) in combination with the observers (18) and (24) achieves flocking in the sense of (3).
Proof. 
By taking into account (17), (7), (18), (24) and (28), the closed-loop dynamics read
φ ˙ i = k i φ i + c j = 1 N a i j r i j
r ˙ i = c j = 1 N a i j r i j + v ˜ i + v ¯ i d
v ˜ ˙ i = γ i v ˜ i + E i x ˜ i
x ˜ ˙ i = A o i x ˜ i
v ¯ ˙ i d = μ j = 1 N ( a i j + d i ) v ¯ i j d v ˙ d ( t ) .
Using (12) and (14), the closed-loop dynamics can be written as
σ ˙ = A σ σ + B σ v ˙ d ( t )
where σ = φ q v ˜ x ˜ ( v ¯ d ) 6 n N n and
A σ = K I n c ( L Q I n ) O n N O n N × 2 n N O n N O n ( N 1 ) × n N c ( L r I n ) Q I n O n ( N 1 ) × 2 n N Q I n O n N O n N Γ I n E O n N O 2 n N × n N O 2 n N × n ( N 1 ) O 2 n N × n N A o O 2 n N × n N O n N O n N × n ( N 1 ) O n N O n N × 2 n N μ ( F I n )
and
B σ = O n N O n N × n ( N 1 ) O n N O n N × 2 n N I n .
It can be verified that A σ is Hurwitz and hence σ = 0 is a globally exponentially stable equilibrium point if v ˙ d ( t ) = 0 . Therefore, the closed-loop dynamics (30) are a stable linear system with input v ˙ d ( t ) that converges to zero. Thus, we conclude that σ ( t ) 0 as t . This in turn implies that v ^ i ( t ) v i and v ^ i d ( t ) v d ( t ) as t . Following the steps of Proposition 1, it can be shown that p i j ( t ) δ i j ( t ) 0 and v i ( t ) v d ( t ) + δ ˙ i ( t ) as t . □

6. Numerical Results

In this section, we present numerical results to show the performance of the proposed control laws and observers. To carry out the simulation, we consider a robot network with six agents represented by (1). The simulations are carried out on Matlab software. In each simulation, it is assumed that only the position of each robot is available from sensor measurements; thus, the observer (18) is used in combination with the formation and flocking controllers.

6.1. Time-Varying Formation Control

First, we validate the performance of the time-varying formation that is obtained from (19) by setting v d ( t ) = 0 . All the robots start at rest, i.e., v i ( 0 ) = 0 , and the initial position of each agent is chosen as
p 1 ( 0 ) = [ 0.36 0.9 ] , p 2 ( 0 ) = [ 1.8 1.8 ] , p 3 ( 0 ) = [ 1.8 2.7 ] , p 4 ( 0 ) = [ 2.7 2.7 ] , p 5 ( 0 ) = [ 2.7 1.8 ] , p 6 ( 0 ) = [ 2.7 0.9 ] .
The initial conditions for the Luenberger observer states were set as p ^ i ( 0 ) = v ^ i ( 0 ) = 0 for all i N . On the other hand, the initial condition for the auxiliary state φ i was set as φ i ( 0 ) = 0 for all i N . The control and observer gains are, respectively, chosen as K = Γ = 5 I 12 × 12 and Ξ i = [ 6 I 5 I ] 4 × 2 , while the coupling strength is set as c = 2 .
To achieve the desired time-varying formation, we use the following
δ ( t ) = δ 0 + δ 1 tanh ( a t 6 )
time-varying offsets, where a = 0.25 and δ 0 n N and δ 1 n N are constant vectors and have been proposed as
δ 0 = 1.2 1.4 2 1.4 3.6 0 3.6 0 2 1.4 1.2 1.4 δ 1 = 0.4 1.4 0.4 1.4 0.4 0 0.4 0 0.4 1.4 0.4 1.4 .
In this simulation, the communication graph is a directed ring and its corresponding Laplacian matrix is given by
L = 1 0 0 0 0 1 1 1 0 0 0 0 0 1 1 0 0 0 0 0 1 1 0 0 0 0 0 1 1 0 0 0 0 0 1 1 .
To obtain a better insight into the performance of the Luenberger observer, we compute the norm of the velocity errors given
| | e v x | | = | | v ^ x v x | | , | | e v y | | = | | v ^ y v y | |
where
v x = v x 1 v x 6 6 , v y = v y 1 v y 6 6 v ^ x = v ^ x 1 v ^ x 6 6 , v ^ y = v ^ y 1 v ^ y 6 6
The time evolution of | | e v x | | and | | e v y | | is shown in Figure 3. Clearly, both quantities converge to zero exponentially; therefore, the estimated velocity converges to the actual velocity of the robots.
In Figure 4, we show that the the time-varying formation is achieved, due to the fact that p x i j ( t ) δ x i j ( t ) and p y i j ( t ) δ y i j ( t ) converge exponentially to zero after the transient response.
The estimated velocity is shown in Figure 5. In the figure, all robots’ estimated velocities converge to the rate of change of the formation given by δ ˙ i ( t ) after a few seconds. From the results of Figure 4 and Figure 5, we can conclude that the formation objective is achieved.
Figure 6 shows the robots’ trajectory in the x y plane. As can be seen for t < 6 , the agents move from the initial position to achieve the formation in blue dotted lines; subsequently, for t > 6 , the agents move to a line formation, in which they remain for the rest of the time.

6.2. Flocking Control

The objective of the second simulation is to validate the performance of the flocking controller (19) together with the observer (18). The desired velocity profile is proposed as
v d ( t ) = 0.6 0.1 ( tanh ( t 10 ) tanh ( t 30 ) ) 0.2 ( tanh ( t 10 ) tanh ( t 30 ) ) .
It is important to point out that the first derivative of the desired velocity profile v ˙ d ( t ) 0 as t ; hence, the condition in Proposition 3 is fulfilled. On the other hand, the initial conditions, the control and observer gains, the time-varying formation δ i ( t ) and the communication graph are the same as in the first simulation.
Before performing the tests to validate the flocking control, we carried out a simulation to assess the performance of the distributed observer (24). Note that the distributed observer (24) does not depend on the states of the robot network. We consider that only robots 1 and 5 know the desired velocity profile; thus, the parameter d i is set as d 1 , d 5 = 1 and d 2 , 3 , 4 , 6 = 0 . This means that the rest of the robots will have to recover the information through the distributed observer. The initial condition and observer gain for the distributed observer were chosen as v ^ i d ( 0 ) = 0 for all i N and μ = 10 .
Figure 7 shows the trajectories obtained with the distributed observer (24). As can be observed, all the estimated states converge to the desired velocity profile in both coordinates; see Figure 7a,b. With this result, we can say that the estimator is working well; therefore, it can be used in combination with the observer-based flocking approach described in Section 4.
According to the definition of flocking given in Section 2, this behavior is achieved if all the robots’ velocities follow the desired flocking velocity plus the rate of change of the formation and the robots maintain a safe distance from one another and form a desired geometric pattern. The velocity matching condition is shown in Figure 8, where all the estimated velocities of the robots converge to v d ( t ) + δ ˙ i ( t ) . The second flocking condition is depicted in Figure 9, where clearly the distance p i j ( t ) δ i j ( t ) converges to zero, meaning that the robots achieve the desired formation. It is important to highlight the good performance of the observer-based flocking algorithm considering that only the position of the robots is obtained from sensor measurements and only two robots know the the desired flocking velocity profile.
Figure 10 shows the trajectory of the robot network in the plane, which is similar to the flocking behavior observed in birds—that is, while maintaining a time-varying formation, all the robots move with the same speed. With this result and the ones presented in Figure 8 and Figure 9, we can conclude that the proposed flocking control law is able to emulate the flocking behavior.

7. Conclusions

In this work, we study the problem of designing control laws that achieve time-varying formation and flocking in robot networks where each robot is modeled as a double integrator system. To solve this problem, we adopt a hierarchical control strategy that allows us to design a decentralized dynamic controller able to achieve both collective behaviors. For instance, if time-varying formation is pursued, we only need to set the desired flocking velocity to zero. An important advantage of the proposed controller is that for the robot i, the control input u i does not require the linear velocities of its neighbors; it only requires the relative position of the neighbors and its own velocity.
Since, in practice, it is not always possible to measure all the states of the system, a Luenberger observer is proposed to estimate the robot’s velocity. The hierarchical control approach allows us to easily combine the formation and flocking control law with the observer, and the stability analysis is very straightforward. In addition to the velocity observer, we design a distributed observer that estimates the desired flocking velocity for those robots that do not have access to the desired velocity profile.
To model the communication between agents, the theory of graphs is employed. The proposed controller works for either undirected or directed graphs. As a result, it is possible to avoid the use of the leader–follower approach, which is most used in these types of works; the only condition is that the graph must be connected and strongly connected for the distributed observer. To validate the presented theory, numerical results are shown that allow us to conclude to that the proposed control law is able to emulate the collective behavior under study; in the case of the distributed observer, the obtained results were good, considering that only two agents knew the desired velocity profile. Finally, as future research, we will study the case of collision avoidance, so as to later implement the method presented in this work in some unmanned vehicles.

Author Contributions

Conceptualization, J.P.-J.; Methodology, C.M.-M. and J.P.-J.; Validation, C.M.-M. and R.M.-C.; Formal analysis, C.M.-M. and J.P.-J.; Investigation, C.M.-M. and R.M.-C.; Writing—original draft, C.M.-M.; Writing—review & editing, J.P.-J. and R.M.-C.; Visualization, C.M.-M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the CONACYT Research Projects "Synchronization of complex systems and its applications" under grant number A1-S-31628 and 1030 “Collective behaviors of unmanned vehicles”.

Acknowledgments

This work was supported by the CONACYT Research Projects “Synchronization of complex systems and its applications” under grant number A1-S-31628 and 1030 “Collective behaviors of unmanned vehicles”. The authors would like to thank the anonymous reviewers for their valuable comments and suggestions. Special thanks to César Cruz Hernández, who helped us during the elaboration of this work.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Oha, K.K.; Park, M.C.; Ahn, H.S. A survey of multi-agent formation control. Automatica 2015, 53, 424–440. [Google Scholar] [CrossRef]
  2. Wu, Z.; Sun, J.; Wang, X. Distance-based formation tracking control of multi-agent systems with double-integrator dynamics. Chin. Phys. B 2018, 27, 060202. [Google Scholar] [CrossRef]
  3. Zhi, H.; Chen, L.; Li, C.; Guo, Y. Leader-follower affine formation control of second-order nonlinear uncertain multi-agent systems. IEEE Trans. Circuits Syst. II Express Briefs 2021, 68, 3547–3551. [Google Scholar] [CrossRef]
  4. Wang, M.; Zhang, T. Leader-following formation control of second-order nonlinear systems with time-varying communication delay. Int. J. Control. Autom. Syst. 2021, 19, 1729–1739. [Google Scholar] [CrossRef]
  5. Lippay, Z.S.; Hoagg, J.B. Formation control with time-varying formations, bounded controls, and local collision avoidance. IEEE Trans. Control. Syst. Technol. 2021, 30, 261–276. [Google Scholar] [CrossRef]
  6. Zhao, S.; Zelazo, D. Translational and scaling formation maneuver control via a bearing-based approach. IEEE Trans. Control. Netw. Syst. 2017, 4, 429–438. [Google Scholar] [CrossRef]
  7. Hu, J.; Bhowmick, P.; Lanzon, A. Distributed adaptive time-varying group formation tracking for multiagent systems with multiple leaders on directed graphs. IEEE Trans. Control. Netw. Syst. 2019, 7, 140–150. [Google Scholar] [CrossRef]
  8. Dong, X.; Li, Q.; Wang, R.; Ren, Z. Time-varying formation control for second-order swarm systems with switching directed topologies. Inf. Sci. 2016, 369, 1–13. [Google Scholar] [CrossRef]
  9. Hua, Y.; Dong, X.; Li, Q.; Ren, Z. Distributed fault-tolerant time-varying formation control for second-order multi-agent systems with actuator failures and directed topologies. IEEE Trans. Circuits Syst. II Express Briefs 2018, 65, 774–778. [Google Scholar] [CrossRef]
  10. Xu, Y.; Zhao, S.; Luo, D.; You, Y. Affine formation maneuver control of linear multi-agent systems with undirected interaction graphs with undirected interaction graphs. In Proceedings of the IEEE Conference on Decision and Control (CDC), Miami, FL, USA, 17–19 December 2018; pp. 502–507. [Google Scholar]
  11. Tanner, H.G. Flocking with obstacle avoidance in switching networks of interconnected vehicles. In Proceedings of the IEEE International Conference on Robotics and Automation, New Orleans, LA, USA, 26 April–1 May 2004. [Google Scholar]
  12. Olfati-Saber, R. Flocking for multi-agent dynamic systems: Algorithms and theory. IEEE Trans. Autom. Control 2006, 51, 401–420. [Google Scholar] [CrossRef]
  13. Liu, B.; Yu, H. Flocking in multi-agent systems with a bounded control input. In Proceedings of the International Workshop on Chaos-Fractals Theories and Applications, Shenyang, China, 6–8 November 2009. [Google Scholar]
  14. Su, H.; Wang, X.; Lin, Z. Flocking of multi-agents with a virtual leader. IEEE Trans. Autom. Control 2009, 54, 293–307. [Google Scholar] [CrossRef]
  15. Yazdani, S.; Su, H. A ully distributed protocol for flocking of time-varying linear systems with dynamic leader and external disturbance. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 1234–1242. [Google Scholar] [CrossRef]
  16. Wu, S.; Pu, Z.; Yi, J.; Sun, J.; Xiong, T.; Qiu, T. Adaptive flocking of multi-agent systems with uncertain nonlinear dynamics and unknown disturbances using neural networks. In Proceedings of the 16th IEEE International Conference on Automation Science and Engineering (CASE), Hong Kong, China, 20–21 August 2020. [Google Scholar]
  17. Qing, Z.; Xiaohao, X.; Zhengquan, Y.; Zengqiang, C. Distance constrained flocking of multi-agents with time delay. In Proceedings of the 32nd Chinese Control Conference, Xi’an, China, 26–28 July 2013. [Google Scholar]
  18. Wang, J.; Xin, M. Flocking of multi-agent systems using a unified optimal control approach. J. Dyn. Syst. Meas. Control 2013, 135, 061005. [Google Scholar] [CrossRef]
  19. Dong, Y.; Chen, J.; Huang, J. Flocking with connectivity preservation of multiple double-integrator systems subject to external disturbances. In Proceedings of the IEEE Conference on Control Applications (CCA), Antibes, France, 8–10 October 2014. [Google Scholar]
  20. Wellman, B.J.; Hoagg, J.B. A discrete-time flocking algorithm for agents with sampled-data double-integrator dynamics. In Proceedings of the American Control Conference (ACC), Seattle, WA, USA, 24–26 May 2017. [Google Scholar]
  21. Luo, X.; Li, X.; Li, X.; Guan, X. Distributed velocity observer based formation control for multi-agent systems. In Proceedings of the 33rd Chinese Control Conference, Nanjing, China, 28–30 July 2014; pp. 1746–1750. [Google Scholar]
  22. Keer, Z. Observer-based formation control of heterogeneous multi-agent systems without velocity measurements. In Proceedings of the 32nd Youth Academic Annual Conference of Chinese Association of Automation (YAC), Hefei, China, 19–21 May 2017; pp. 669–674. [Google Scholar]
  23. Chai, X.; Liu, J.; Yu, Y.; Sun, C. Observer-based self-triggered control for time-varying formation of multi-agent systems. Sci. China Inf. Sci. 2021, 24, 132205. [Google Scholar] [CrossRef]
  24. Fax, J.A.; Murray, R.M. Information flow and cooperative control of vehicle formations. IEEE Trans. Autom. Control 2004, 49, 1465–1476. [Google Scholar] [CrossRef]
  25. Olfati-Saber, R.; Fax, J.A.; Murray, R.M. Consensus and cooperation in networked multi-agent systems. Proc. IEEE 2007, 95, 215–233. [Google Scholar] [CrossRef]
  26. Nuño, E.; Ortega, R.; Basañez, L.; Hill, D. Synchronization of Networks of Nonidentical Euler-Lagrange Systems With Uncertain Parameters and Communication Delays. IEEE Trans. Autom. Control 2011, 56, 935–941. [Google Scholar] [CrossRef]
  27. Ren, W.; Beard, R.W.; Atkins, E.M. Information consensus in multivehicle cooperative control. IEEE Control Syst. Mag. 2007, 27, 71–82. [Google Scholar]
  28. Scardovi, L.; Arcak, M.; Sontang, E. Synchronization of Interconnected Systems With Applications to Biochemical Networks: An Input-Output Approach. IEEE Trans. Autom. Control 2010, 55, 1367–1379. [Google Scholar] [CrossRef]
  29. Hong, Y.; Hu, J.; Gao, L. Tracking control for multi-agent consensus with an active leader and variable topology. Automatica 2006, 42, 1177–1182. [Google Scholar] [CrossRef]
  30. Hale, J.K. Ordinary Diferential Equations; Wiley: New York, NY, USA, 1969. [Google Scholar]
Figure 1. Block diagram of the proposed control for formation and flocking. In the green block, the external controller is included, while the orange block contains the internal controller; the values of the terms a i 1 , a i j and a i N will change depending on the used graph.
Figure 1. Block diagram of the proposed control for formation and flocking. In the green block, the external controller is included, while the orange block contains the internal controller; the values of the terms a i 1 , a i j and a i N will change depending on the used graph.
Entropy 25 00834 g001
Figure 2. Block diagram of the proposed control for formation and flocking using the observer.
Figure 2. Block diagram of the proposed control for formation and flocking using the observer.
Entropy 25 00834 g002
Figure 3. Time evolution of the quantities | | e v x | | and | | e v y | | .
Figure 3. Time evolution of the quantities | | e v x | | and | | e v y | | .
Entropy 25 00834 g003
Figure 4. Time evolution of the formation error p i j ( t ) δ i j ( t ) in the first simulation, (a) x [ m ] and (b) y [ m ].
Figure 4. Time evolution of the formation error p i j ( t ) δ i j ( t ) in the first simulation, (a) x [ m ] and (b) y [ m ].
Entropy 25 00834 g004
Figure 5. Estimated velocity of each robot, (a) v ^ x [ m / s ] and (b) v ^ y [ m / s ].
Figure 5. Estimated velocity of each robot, (a) v ^ x [ m / s ] and (b) v ^ y [ m / s ].
Entropy 25 00834 g005
Figure 6. Trajectory of the robots in the plane during the first simulation; □ denotes the initial position of the robots and ◯ denotes the final position of the robots.
Figure 6. Trajectory of the robots in the plane during the first simulation; □ denotes the initial position of the robots and ◯ denotes the final position of the robots.
Entropy 25 00834 g006
Figure 7. Estimated desired velocities generated by the distributed observer (24), (a) estimated velocity in the x coordinate, (b) estimated velocity in the y-component. The dotted line represents the desired velocity profile.
Figure 7. Estimated desired velocities generated by the distributed observer (24), (a) estimated velocity in the x coordinate, (b) estimated velocity in the y-component. The dotted line represents the desired velocity profile.
Entropy 25 00834 g007
Figure 8. Estimated velocity of each robot in the flocking simulation, (a) v ^ x [ m / s ] and (b) v ^ y [ m / s ].
Figure 8. Estimated velocity of each robot in the flocking simulation, (a) v ^ x [ m / s ] and (b) v ^ y [ m / s ].
Entropy 25 00834 g008
Figure 9. Time evolution of the distance error p i j ( t ) δ i j ( t ) in the flocking simulation, (a) p x [ m ] and (b) p y [ m ].
Figure 9. Time evolution of the distance error p i j ( t ) δ i j ( t ) in the flocking simulation, (a) p x [ m ] and (b) p y [ m ].
Entropy 25 00834 g009
Figure 10. Trajectory of the robots in the plane during the flocking simulation; □ denotes the initial position of the robots and ◯ denotes the final position of the robots.
Figure 10. Trajectory of the robots in the plane during the flocking simulation; □ denotes the initial position of the robots and ◯ denotes the final position of the robots.
Entropy 25 00834 g010
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Montañez-Molina, C.; Pliego-Jiménez, J.; Martínez-Clark, R. Formation and Flocking Control Algorithms for Robot Networks with Double Integrator Dynamics and Time-Varying Formations. Entropy 2023, 25, 834. https://doi.org/10.3390/e25060834

AMA Style

Montañez-Molina C, Pliego-Jiménez J, Martínez-Clark R. Formation and Flocking Control Algorithms for Robot Networks with Double Integrator Dynamics and Time-Varying Formations. Entropy. 2023; 25(6):834. https://doi.org/10.3390/e25060834

Chicago/Turabian Style

Montañez-Molina, Carlos, Javier Pliego-Jiménez, and Rigoberto Martínez-Clark. 2023. "Formation and Flocking Control Algorithms for Robot Networks with Double Integrator Dynamics and Time-Varying Formations" Entropy 25, no. 6: 834. https://doi.org/10.3390/e25060834

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop