Next Article in Journal
An Optimized LSTM Neural Network for Accurate Estimation of Software Development Effort
Next Article in Special Issue
Finite-Time Robust Path-Following Control of Perturbed Autonomous Ground Vehicles Using a Novel Self-Tuning Nonsingular Fast Terminal Sliding Manifold
Previous Article in Journal
Stability Analysis of Anti-Periodic Solutions for Cohen–Grossberg Neural Networks with Inertial Term and Time Delays
Previous Article in Special Issue
Coalitional Control Strategy for a Heterogeneous Platoon Application
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Stability Margin of Data-Driven LQR and Its Application to Consensus Problem

1
Research Center for Electrical and Information Technology, Department of Electrical and Information Engineering, Seoul National University of Science and Technology, Seoul 01811, Republic of Korea
2
School of Robotics, Kwangwoon University, Nowon-gu, Seoul 01897, Republic of Korea
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2024, 12(2), 199; https://doi.org/10.3390/math12020199
Submission received: 6 November 2023 / Revised: 23 December 2023 / Accepted: 4 January 2024 / Published: 8 January 2024
(This article belongs to the Special Issue Control Theory and Applications, 2nd Edition)

Abstract

:
In contrast with traditional control input design techniques based on mathematical models of the system, in data-driven control approaches, which have recently gained substantial attention, the controller is derived directly from the data that are collected from experiments or observations of the target system. In particular, several data-driven optimal control and model predictive control (MPC) techniques have been proposed. In this paper, it is shown that the recently proposed data-driven LQR (Linear Quadratic Regulator) has a stability margin that is the set of the uncertainties in the control input channels maintaining the closed-loop stability. As an application of the proposed stability margin of the data-driven LQR, the consensus problem is considered. Since the control design for the consensus of multi-agent systems can be reformulated into the robust stabilization of a linear system with uncertainty in the input channel, it is demonstrated that the derived stability margin can be used to design a controller for the consensus of multi-agent systems.

1. Introduction

Over the past few decades, research on model-based control design, a control input design technique based on mathematical models of the system under consideration, has dominated the control community. Using mathematical models of these systems, it has been possible to design control techniques that take into account various performance indicators of the system, such as optimality of the control performance expressed by a quadratic cost with respect to the control and state [1,2]. At the same time, it should be noted that the application of these control techniques requires the construction of an accurate mathematical model of the system.
However, constructing exact mathematical models is a difficult, expensive and time-consuming task [3]. At a first glance, the observed data from the running system are inseparable from noise. In the system identification field, researchers have attempted to design methods to address the influence of noise both on input and output data [4,5,6]. However, even though the system may be successfully modeled based on the data, there is no guarantee that the fitting model is the perfect mathematical description for the control task due to uncertainties [7]. After an exhaustive effort to optimally identify the underlying dynamics from the data, the actual plant still requires a robust controller to compensate for the above problems.
In contrast with model-based control design, data-driven control design aims to eliminate the need for system identification and directly tackle control problems when formulating the control input. For instance, one recent study [8] employs Petersen’s lemma through the linear matrix inequality formulation for data generated by linear and polynomial systems. An earlier study [9] constructs a quadratic matrix inequality (QMI) of a linear model with noise that is implied by another QMI that is composed of noisy data through S-Lemma [9]. In another recent study [10], the authors exploit prior knowledge of uncertainty or disturbance to design a non-conservative data-driven control design. These efforts are mainly motivated by the desire to remove the complexity and shorten the required time to build control systems while also dealing with problems that exist in the model-based control method.
The driving force of the transition to the data-driven control method is well outlined by Willems et al. [11]: a persistently excited input and output data pair can represent the behavior of a dynamic system. In light of this, two recent studies have presented a framework to design closed-loop data-driven control [12,13]. The notion of data informativity relaxes the requirement of persistently exciting data for some of the data-driven control problems [14]. The framework is then extended to directly formulate an optimal control method via the observed data such as in a LQR (Linear Quadratic Regulator) [15] and MPC (Model Predictive Control) [16,17].
In order to apply control inputs to real-world systems successfully, it is important to design control inputs that ensure system robustness, such as stability margins. It is well known that a continuous-time LQR provides stability margins such as the gain margin [ 1 2 , ] and the phase margin ( π 3 , π 3 ) [2,18,19]. Such a property is very useful in practice and quite special compared with other model-based control designs, which are usually weak against model uncertainties. In the case of a discrete-time LQR (DLQR), the stability margin is investigated By Kim et al. and Lee et al. [20,21]. Because of these robustness guarantees, LQR and DLQR have received a lot of attention over the years. A generalized stability margin, called the disc margin, has been investigated in the transfer function setting [22]. Furthermore, for the state space, a disc margin is proposed for a DLQR using Lyapunov stability theory [21]. To derive the disc margin of a DLQR, the closed-loop system including the uncertainty in the input channels is represented as x ( k + 1 ) = A x ( k ) + B ψ u ( k ) , where ψ is a complex number denoting the uncertainty and u ( k ) is the DLQR. Then, the region for ψ is derived using Lyapunov stability theory such that closed-loop stability is maintained. Since the region is a disc in the complex plane, it is also called a disc margin. Since ψ is a complex number, the disc margin implies both the gain and phase margins simultaneously.
Recently [23], the idea of a data-driven LQR (DDLQR) has been presented using the closed-form solutions of the Riccati equation and Markov parameters estimation [24]. Given the fact that the LQR and DLQR have received attention for their robustness guarantees, a natural straightforward question is whether the DDLQR provides a stability margin or not. In this paper, as the first contribution, a disc margin (or stability margin) of a DDLQR is derived using Lyapunov stability theory and an explicit solution of the Riccati equation. In other words, when the control u ( t ) is designed according to the DDLQR, a region for ψ is derived in which the closed-loop stability of x ( k + 1 ) = A x ( k ) + B ψ u ( k ) is maintained. The result can be understood as a data-driven version of the result in Lee et al. [21]. In particular, it is shown that the disc margin of the DDLQR becomes almost the same as those of DLQR and LQR [2,21] if the system is asymptotically null controllable [25].
It is known that designing a controller for consensus of a MAS (Multi-Agent System) is equivalent to the robust control design for one agent in the MAS when the agent has uncertainties in its input channels [26,27,28]. Hence, a control design method guaranteeing the stability margin can be used for the controller design for consensus of the MAS [26,29]. Therefore, as the second contribution in this paper, it is shown that the derived stability margin of the DDLQR can be used to design a control for the consensus problem of a multi-agent system. Simulation results show that the DDLQR indeed provides a stability margin and DDLQR-based control leads to consensus of the MAS if the agent in the MAS is asymptotically null controllable.
Notation: I k denotes the k × k identity matrix and x + and x denote x ( k + 1 ) and x ( k ) , respectively, where k is the discrete-time index. σ ¯ ( · ) and σ ̲ ( · ) represent the maximum and minimum singular values, respectively. System x ( k + 1 ) = A x ( k ) + B u ( k ) is said to be stabilizable if there exists a matrix K such that A + B K is Hurwitz stable. Furthermore, system x ( k + 1 ) = A x ( k ) + B u ( k ) , y ( k ) = C x ( k ) is said to be detectable if there exists a matrix F such that A + F C is Hurwitz stable.

2. Preliminaries

In this section, preliminary results required for deriving the main results are presented.

2.1. Guaranteed Stability Margin of Model-Based Discrete-Time LQR

In this subsection, the stability margin of DLQR (Discrete-Time Linear Quadratic Regulator) is briefly reviewed. To this end, consider the discrete-time LTI (Linear Time Invariant) system
x + = A x + B u , y = C x ,
where A R n × n is the system matrix, B R n × p is the input matrix, C R q × n is the output matrix. Furthermore, x R n , u R p , and y R q are the state, control input, and output, respectively. It is assumed that ( A , B ) is stabilizable. Then the DLQR is derived by minimizing the following infinite horizon cost function
J = k = 0 y ( k ) Q y ( k ) + u ( k ) R u ( k ) ,
where output weighting Q and control input weighting R are positive definite. It is assumed that the pair ( A , Q ) is detectable. The minimizing DLQR is given by
u = K x ,
where the optimal feedback gain K is given by
K = ( B P B + R ) 1 B P A .
Note that the positive definite matrix P is the solution to the algebraic Riccati equation
A P A P A P B ( B P B + R ) 1 B P A + C T Q C = 0 .
In contrast with other model-based controls, it is well known that the DLQR provides a stability margin, which is important for practical applications [2,21]. To be specific, if there is input channel uncertainty in the real system such as x + = A x + B ψ u where ψ denotes the input channel uncertainty, the stability of the closed-loop system with a DLQR is maintained to some extent although the DLQR is designed x + = A x + B u . This is formally presented in the following theorem.
Theorem 1 
([21]). Consider the closed-loop system consisting of (1) and (3) with the input channel uncertainty Ψ
x + = A x B Ψ K x ,
where Ψ = diag { ψ 1 , , ψ p } C p × p . The closed-loop system (6) is asymptotically stable if ψ i belongs to the following set, called the disc margin, for all i
Ω i = σ + j ω C : σ 1 + β 1 α 1 β 1 2 + ω 2 < α 1 β 1 + γ 1 ( α 1 β 1 ) ( α 1 β 1 ) 2 ,
where j denotes the complex number, i.e., j : = 1 , α 1 : = σ ¯ ( B P B + R ) , β 1 : = σ ̲ ( R ) , and γ 1 : = σ ̲ ( Q ) / σ ¯ ( K ) 2 . In particular, if the matrix R is diagonal such that R = diag { r 1 , , r p } , the disc margin is given by
Ω i = σ + j ω : σ 1 + r i δ 2 + ω 2 < r i ( r i + δ ) + γ 1 δ δ 2 ,
where δ : = σ ¯ ( B P B ) .
Note that ψ i can be interpreted as the ith control input channel uncertainty and is used only for analysis. In other words, the LQR gain K is computed using the system model ( A , B ) and the applied control input is u = K x . Then, in theory, the closed-loop system becomes x + = A x B K x . However, in this paper, the stability of x + = A x B Ψ K x is analyzed in order to investigate the stability margin. This method is again used later for the stability margin of the data-driven LQR. Since Ω i can be viewed as the set of ψ i such that closed-loop stability of (6) is maintained, it can be interpreted as both the gain and phase margins simultaneously. For instance, the projection of the circle Ω i in the complex plane onto the real axis corresponds to the gain margin. Theorem 1 means that the closed-loop stability is guaranteed with the DLQR if ψ i belongs to the disc margin (i.e., stability margin) in (7). Note that the DLQR is a model-based design, i.e., it needs the system information of ( A , B ) .

2.2. Data-Driven LQR (DDLQR)

This subsection summarizes the idea of the data-driven LQR [23], which is the main reference for the proposed result. In data-driven control design, the most important problem is how to obtain the feedback gain using measured data from the system rather than a system model ( A , B ). The approach [23] relies mainly on a model-based closed-form solution to the DARE (5) derived by Lewis and Furata and Wongsaisuwan [30,31], given by
P = O ( Q N 1 + S R N 1 S ) 1 O ,
where N is sufficiently large and denotes the amount of measured data from the system, and
R N diag ( R , R , R ) R p N × p N , Q N diag ( Q , Q , Q ) R q N × q N , O C ( C A ) ( C A N 1 ) R q N × n , O + ( C A ) ( C A 2 ) ( C A N ) R q N × n , M M 1 M 2 M N R q N × p , M i = C A ( i 1 ) B R q × p , S 0 0 M 1 0 M 2 M 1 M N 1 M N 2 M 1 0 R q N × p N .
Then, considering (4) and (9), the DLQR gain can be equivalently written as
K D L Q R = R + M Q N 1 + S R N 1 S 1 M 1 M Q N 1 + S R N 1 S 1 O + .
The main result of da Silva et al. [23] is to derive a data-driven version of (10), which is explained below. In view of the structure of the gain (10), since R N and Q N are weighting matrices, if M , O , and S can be estimated using the measured data from the system, it is possible to design a data-driven LQR. To this end, consider the Hankel matrix of a signal z ( k ) defined by
H ( z ( k ) ) z ( k ) z ( k + 1 ) z ( k + L D 1 ) z ( k + 1 ) z ( k + 2 ) z ( k + L D ) z ( k + N 1 ) z ( k + N ) z ( k + N + L D 2 ) R a N × b L D , z ( k ) R a × b ,
where L D is a constant satisfying a rank condition L D q ( N + 1 ) [12,32]. Furthermore, let us define matrices comprised of the set of historical input and output data
U p = H ( u ( 0 ) ) R p N × L D , U f = H ( u ( N ) ) R p N × L D , Y p = H ( y ( 0 ) ) R q N × L D , Y f = H ( y ( N ) ) R q N × L D ,
where it is assumed that ( N + 1 ) q n is satisfied [23]. In order to construct matrices U p , U f , Y p , and Y f , arbitrary input signals u are chosen and applied to the sytem and the corresponding outputs y are measured. From the main results in Overschee and de Moor and Lim et al. [33,34], a predictor of the system output can be written as
Y f = O ( D + F S ) O F S W U p Y p U f Φ W Φ ,
where
D = A N B A B B ,
and F is a solution of A N + 1 + F O = 0 . Then, for the data-driven design, S ^ (i.e., the estimate of S ) is the last p N columns of W ^ , which is the least squares solution of Equation (11). In other words,
W ^ = Y f Φ ,
where Φ stands for the pseudo inverse of Φ . Using S ^ , the estimate of O can be obtained by
O ^ = ( Y p S ^ U p ) X ,
where
X [ x ( 0 ) x ( 1 ) x ( L D 1 ) ] .
Therefore, the data-driven LQR gain K D is given by
K D = R + M ^ Q N 1 + S ^ R N 1 S ^ 1 M ^ 1 M ^ Q N 1 + S ^ R N 1 S ^ 1 O ^ q ,
where M ^ denotes the estimate of M , and O ^ q is the matrix obtained by removing the first q rows of O ^ . It should be noted that as more and more data is used, i.e., N , the gain K D given in (15) converges to a gain K D L Q R which is the optimal control gain for the DLQR problem [23,30]. Hence, strictly speaking, there is no guarantee that K D is optimal when N is not sufficiently large. In what follows, we assume that a sufficiently large amount of data are used to compute K D so that the computed gain, called data-driven DLQR gain, is optimal. Note that all variables in (15) are irrespective of the system model, i.e., K D is computed without using model information, which means that the gain K D is a data-driven LQR gain.
In contrast with the conventional LQR, which is based on the system model ( A , B ) , the DDLQR design computes the feedback gain K D relying only on the measured data from the system under the assumption that the amount of data N is sufficiently large.

3. Guaranteed Stability Margin of Data-Driven LQR

Considering the information provided in the previous section, it is a natural to question whether the DDLQR for (1) provides a stability margin or not.
To answer this question, consider the closed-loop system with the DDLQR gain K D in (15)
x + = A x B Ψ K D x ,
where Ψ = diag { ψ 1 , , ψ p } C p × p , and again Ψ denotes the stability margin or the uncertainties in the control input channel. It turns out that the explained DDLQR guarantees a stability margin similarly to the model-based design of Lee et al. [21].
Theorem 2. 
The closed-loop system (16) with input channel uncertainty Ψ = diag ( ψ i ) is asymptotically stable if ψ i belongs to the following set (i.e., disc margin) for all i
Ω ¯ i = σ + j ω C σ α β 2 + ω 2 < α 2 + β ( γ α ) β 2 ,
where
α σ max ( R + M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ ) , β σ max ( M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ ) , γ σ min ( Q ) σ max ( K D K D ) .
In particular, if the matrix R is diagonal, i.e., R = diag { r 1 , , r p } , the disc margin is given by
Ω ¯ i = σ + j ω C σ 1 + r i β 2 + ω 2 < r i 2 + β ( r i + γ ) β 2 .
Proof. 
See Appendix A. □
Since ψ i is a complex number, the disc margin in (17) (or (18)) represents both the gain and phase margins simultaneously. In view of Theorem 2, the DDLQR for system (1) also results in a stability margin.
It is well known that the gain and phase margins of a continuous-time LQR (CLQR) are given by [ 1 2 , ) and ( π 3 , π 3 ) , respectively. In the case of the discrete-time LQR, it is proved that the same gain and phase margins can be derived if the discrete-time system is asymptotically null controllable (ANCBC), which means that the system ( A , B ) is controllable and the eigenvalues of A are inside the unit circle or on the unit circle [21]. The following theorem means that the guaranteed gain and phase margins of the DDLQR are also [ 1 2 , ) and ( π 3 , π 3 ) , respectively, if the system is ANCBC. This means that the stability margin of a DDLQR is the same as that of a DLQR or CLQR if the system is ANCBC.
Theorem 3. 
Suppose that ( A , B ) is ANCBC, and that the DDLQR gain denoted by K D , ϵ is obtained according to (15) with weighting matrices Q = ϵ I and R = diag { r 1 , , r p } , where ϵ is a small positive constant. Consider a set
Ω ¯ i , ϵ = σ + j ω σ 1 + r i β 2 + ω 2 < r i 2 + β r i β 2 , i = 1 , , q ,
which corresponds to the set Ω i in (18) with γ = 0 . Then, as ϵ converges to zero, the set Ω ¯ i , ϵ approaches the set H : = { s C : R e ( s ) > 1 2 } .
Since the proof of the theorem is quite similar to that of the corresponding theorem of Lee et al. [21], it is omitted here. However, it should be noted that the most important points in the proof are that lim ϵ 0 P ϵ = 0 , where P ϵ is the solution of the discrete-time Riccati equation with Q = ϵ I , that δ ϵ = σ ¯ ( B P ϵ B ) converges to zero as ϵ 0 if the system is ANCBC [25], and that the leftmost point of the disc margin is δ ϵ + r i r i 2 + r i δ ϵ δ ϵ , which converges to 1 2 as ϵ 0 . lim ϵ 0 P ϵ = 0 holds in view of the structure of the explicit solution P in (9).
In view of the derivation in this section, the DDLQR also provides a stability margin. In the next section, how to use the stability margin for the control design for a consensus problem is presented.

4. Application of Disc Margin to a Consensus Problem

In this section, how to use the disc margin described in the previous section for controller design for a consensus problem is presented.

4.1. Consensus Problem

Consider a MAS (Multi-Agent System) consisting of N a identical discrete-time ANCBC systems written as
x 1 + = A x 1 + B u 1 , x 2 + = A x 2 + B u 2 , x N a + = A x N a + B u N a ,
where x i R n , and u i R p stand for the state and input of the ith agent in the MAS, A R n × n and B R n × p , the initial conditions of each agent are different, and the agents communicate with each other by exchanging state information between neighbor agents. It is assumed that the communication (i.e., the data exchange) is modeled by a graph Laplacian matrix L R N a × N a [21,27,28,35]. The Laplacian matrix is constructed by the degree of each agent and adjacency matrix, which indicates whether pairs of agents are adjacent or not in the graph. The diagonal element of the Laplacian matrix i i = Σ j i i j expresses the degree of agent i. i j = 1 for all i j if the agent i receives information from agent j. Otherwise, it is defined as i j = 0 .
The following facts about the Laplacian are instrumental in solving the consensus problem of the MAS:
  • Fact 1. The eigenvalues of L are on the closed disc belonging to the complex plane, and its center and radius are ( N a 1 , 0 ) and N a 1 , respectively, [36];
  • Fact 2. At least one eigenvalue of the Laplacian matrix is located at the origin of the complex plane, and the associated eigenvector is 1 N [27];
  • Fact 3. The zero eigenvalue at the origin is simple if the communication graph includes the directed spanning tree, which means that at least one node in the graph has a directed information path to all the other nodes in the graph [27,37].
For such a MAS, the consensus problem is to design a control achieving
lim k x 1 ( k ) = x 2 ( k ) = = x N a ( k ) ,
and the control of the ith agent is given by
u i = K c j N i l i j ( x i x j ) ,
where K c stands for the control gain for consensus, l i j is the ( i , j )th element of the Laplacian matrix L, and N i is the set of the neighbors of the ith agent. Note that the same control gain K c is used for each agent, and that the consensus problem requires only the design of the gain K c achieving (21).
Considering these properties of the algebraic graph theory and the definition of the consensus problem, the following assumption is inevitable in order to solve the consensus problem.
Assumption 1. 
A directed spanning tree is included in the communication graph modeling the information exchange of the MAS under consideration.
In designing K c for consensus of the MAS (20), Theorem 3 can be employed. Hence, only an ANCBC agent is considered in this paper.
Assumption 2. 
The system matrix A in (20) is ANCBC.

4.2. DDLQR-Based Consensus Algorithm

This subsection describes how the derived stability margin of the DDLQR can be effectively applied to the consensus problem for a multi-agent system (MAS).
The design of the control gain K c for consensus of the MAS relies on so-called simultaneous stabilization.
Theorem 4 
([21,26,29,38]). Suppose that Assumptions 1 and 2 hold. The MAS (20) reaches consensus if the control gain K c is designed such that
x + = A x + λ j B K c x ,
is asymptotically stable for all j = 1 , , N a 1 , where λ j denotes the nonzero eigenvalue of the graph Laplacian matrix L modeling the information exchange.
Note that the eigenvalue λ j of the Laplacian matrix is unknown in a distributed control setting. However, although λ j is unknown, if K c is designed according to the DDLQR and if it is possible to make all nonzero λ j belong to the stability margin of K c , consensus is reached [21,26,29].
In order to design the control for the consensus, it is assumed that the real part of the smallest nonzero eigenvalue of the Laplacian matrix L is known. Note that there are existing methods to estimate the second-largest eigenvalue of the Laplacian matrix in the literature [39,40]. Let ρ be the real part. Then, it is obvious that the real part of the smallest nonzero eigenvalue of 1 ρ L = : L is 1. As a result, it follows that the nonzero eigenvalues of L belong to the area Θ in Figure 1 in light of Fact 1, Fact 3, and scaling by ρ .
In view of Theorem 3, the disc margin becomes larger as ϵ gets smaller. Thanks to this tunable disc margin Ω ¯ ϵ , there exists ϵ such that Θ Ω ¯ ϵ for any ϵ < ϵ . Therefore, the control gain K c for consensus can be designed as K c = 1 ρ K D , ϵ . In other words, the control for the consensus is given by
u i = K c j N i l i j ( x i x j ) = 1 ρ K D , ϵ j N i l i j ( x i x j ) .
Then, the entire MAS can be written as
ξ + = [ I N A L ρ 1 B K D , ϵ ] ξ = [ I N A L B K D , ϵ ] ξ = : A L ξ ,
where ξ = x 1 x 2 x N and ⊗ denotes the Kronecker product. Using coordinate transformation v : = ( V 1 I n ) ξ , where V is the matrix whose columns are the eigenvectors of L, it is possible to convert (24) into
v + = A 0 0 0 0 A + λ 1 B K D , ϵ 0 0 0 0 0 0 0 A + λ N 2 B K D , ϵ 0 0 0 0 A + λ N 1 B K D , ϵ v .
For a detailed explanation, see Lee et al. [21]. Furthermore, it can be shown that all nonzero eigenvalues λ 1 , , λ N 1 belong to Θ in Figure 1 due to the scaling constant ρ , and Θ Ω ¯ ϵ is satisfied due to sufficiently small ϵ . In other words, since all λ i , i = 1 , , N 1 belongs to the disc margin of the DDLQR, Theorem 3 holds. Hence, the consensus problems is solved owing to Theorem 3 and the disc margin of the DDLQR. The designed procedure for the DDLQR-based consensus controller can be summarized as follows.
1.
Estimate of the real part of the second smallest eigenvalue of the graph Laplacian L and set the value as ρ ;
2.
Collect input, state, and output data;
3.
Choose sufficiently small ϵ such that the imaginary part of ω Ω ¯ ϵ is smaller than that of ω Θ , which means Θ Ω ¯ ϵ ;
4.
Calculate K c = 1 ρ K D , ϵ , and each agent implements the controller (23).
Note that the proposed DDLQR-based consensus controller is a data-driven distributed controller in the sense that it does not use the agent’s mathematical model nor any global information of the graph.

5. Simulation Study

In this section, a numerical example of the stability margin of the DDLQR is shown and its application to the consensus problem is presented.

5.1. Numerical Example for the Disc Margin of Marginally Stable Systems with DDLQR

Since the DDLQR is heavily dependent on the amount of data, the effect of N on the accuracy of the DDLQR gain is presented first, viewing the DLQR gain as the ground truth. For this purpose, two cases with N = 3 and N = 30 are considered and L D is chosen as L D = 3 q ( N + 1 ) . In addition, the gain margin of the DDLQR is presented by considering several uncertainty levels for an ANCBC system given by
A = 1.0 0.8 0.0 0.2 , B = 0.2 0 0 0.01 , C = I .
To obtain rich data with which the DDLQR gain can be computed, a PRBS (Pseudo Random Binary Sequence) is applied to the system for ( 2 N + L D 1 ) steps, and the resulting state, input, and output data depicted in Figure 2 and Figure 3 are stored. Then, the weighting matrices of the DDLQR are set to Q = 0.8 I n and R = 1.3 I m . The estimate S ^ is obtained by solving (12) using the stored input and output data. In addition, the computed S ^ is employed to calculate O ^ and M ^ together with the stored data. Finally, the DDLQR gain is obtained from (10) with K D 3 = 0.2319 0.2039 0.0046 0.0057 and K D 30 = 0.7247 0.6993 0.0299 0.0311 , where K D i denotes K D with i = N . As a comparison, we compute the model-based DLQR as K D L Q R = 0.7247 0.6993 0.0299 0.0311 . In view of K D 3 , K D 30 , and K D L Q R , the DDLQR K D i gain converges to the model-based DLQR gain K D L Q R as i increases. In other words, the DDLQR gain converges to the DLQR gain as the amount of stored data increases. Hence, with sufficiently large N, the DDLQR gain K D becomes the DLQR gain K D L Q R .
Next, the derived stability margin is validated for various input channel uncertainties with N = 30 . The maximum and minimum real parts of the gain margin (19) for the system are 0.5194 and 13.4183 , respectively. Five input channel uncertainty levels p { 5.0 , 0.6 , 7.0 , 13.0 , 18.0 } in Figure 4 are considered. In other words, input channel uncertainty p is applied to the system as follows
x + = A x + B p u ,
where x i R 2 , u R , A R 2 × 2 , and B R 2 × 1 . Figure 5 shows that the closed-loop system under the DDLQR is asymptotically stable when the input channel uncertainty p belongs to the disc margin depicted by the red circle in Figure 4. Conversely, it can be seen that the closed-loop system becomes unstable when the uncertainty p is not in the stability margin.

5.2. Application of the Disc Margin to the Consensus Problem

To present a DDLQR-based control design for consensus of a MAS consisting of ANCBC agents, consider an ANCBC agent
Σ i : x i + = 1.0 0.8 0.0 0.2 x i + 0.2 0.1 u i , i = 1 , , 5 ,
where x i R 2 and u i R . It is assumed that they communicate with each other using the communication graph in Figure 6. The graph in Figure 6 can be modeled by the Laplacian matrix L. The Laplacian matrix L = [ i j ] of a directed graph in Figure 6 is constructed as defined above. The following is the Laplacian matrix for the graph in Figure 6.
L = 3 1 1 0 1 1 2 0 1 0 0 1 2 1 0 1 0 1 3 1 0 1 1 1 3 .
To collect the data required for calculating the DDLQR gain, a PRBS is applied to an agent for ( 2 N + L D 1 ) steps, then all input, output, and state data are saved. Thereafter, the procedure for calculating the DDLQR gain is similar to the previous example. One important difference is that an arbitrarily large disc margin Ω ¯ ϵ can be obtained by setting the weighting matrices as Q = ϵ I and R = I with sufficiently small ϵ . As can be seen in Figure 7, the disc margin gets larger as ϵ becomes smaller. To satisfy the requirement that the disc margin Ω ¯ ϵ has to include Θ , this simulation set ϵ = 0.01 and ρ = 0.5 for scaling. The resulting set Ω ¯ ϵ and Θ are shown in Figure 1.
Figure 8a shows that the proposed consensus control (23) leads to consensus of the MAS consisting of the nominal ANCBC agents. Since the proposed data-driven consensus control is based on the stability margin, it is expected that robust consensus is achieved. To validate this, we consider the following MAS, in which each agent has different input channel uncertainties, i.e., the proposed consensus control is applied to the following MAS.
Σ i : x i + = 1.0 0.8 0.0 0.2 x i + 0.2 0.1 p i u i , i = 1 , , 5 ,
where p i = 15 14.9 14 13.5 12.5 . Figure 8b shows that the designed consensus control based on the stability margin of the DDLQR results in consensus of the MAS even when there are uncertainties in the input channels. In other words, although the agents have different uncertainties in their input channels, such as (6), the consensus is achieved by the proposed controller as long as the uncertainties are in the disc margin.

6. Conclusions

In this paper, it is shown that the data-driven LQR (Linear Quadratic Regulator) provides a stability margin. To achieve this, the closed-loop system is written such that the input channel is affected by uncertainties. Then, how much the uncertainty can change while closed-loop stability is maintained using Lyapunov stability is investigated. Since the uncertainty is expressed as a complex number, the set of possible uncertainty guaranteeing the closed-loop stability is given by a disc in the complex plane, which is the stability margin of the data-driven LQR. Furthermore, it is shown that the derived disc margin can be used to design a control for consensus of a MAS consisting of asymptotically null controllable linear agents.
Future work includes a similar analysis of data-driven MPC (Model Predictive Control). Since stability analysis of MPC is done posteriorly after computing the control input using the optimal cost in MPC, it might be possible to apply the proposed method if the prediction horizon is short. Furthermore, it would be beneficial to devise data-driven disturbance rejection methods.

Author Contributions

Methodology, K.R.; Investigation, A.A.U.; Supervision, J.B. and J.-S.K. All authors have read and agreed to the published version of the manuscript.

Funding

This study was financially supported by Seoul National University of Science and Technology.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

To present the derivation, first, define a Lyapunov function candidate
V ( x ) = x P ^ x ,
for the closed-loop system x + = A x + B Ψ u = A x B Ψ K D x , where
P ^ = O ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 O ^ .
For the closed-loop stability, it has to be held that
V ( x + ) V ( x ) = x ( A B Ψ K D ) P ^ ( A B Ψ K D ) P ^ x 0 ,
where superscript * denotes the operator for the conjugate transpose of a matrix. Note that the conjugate transpose is used instead of the transpose operator since elements of Ψ are assumed to be complex numbers. Inequality (A1) is equivalent to
( A B Ψ K D ) P ^ ( A B Ψ K D ) P ^ 0 .
Furthermore, the following equivalence holds.
( A B Ψ K D ) P ^ ( A B Ψ K D ) P ^ 0 A P ^ A P ^ K D Ψ B P ^ A A P ^ B Ψ K D + K D Ψ B P ^ B Ψ K D 0 ( using Riccati Equation (5) and let R B R + B P ^ B ) ( A P ^ B ) R B 1 B P ^ A Q K D Ψ B P ^ A A P ^ B Ψ K D + K D Ψ B P ^ B Ψ K D 0 ( using ( R + B T P ^ B ) K D = B T P ^ A ) Q + K D R B K D K D Ψ R B K D K D R B Ψ K D + K D Ψ B P ^ B Ψ K D 0 Q + K D [ R B Ψ R B R B Ψ + Ψ B P ^ B Ψ ] K D 0 .
Using
Q σ min ( Q ) I q σ min ( Q ) σ max ( K D K D ) K D K D γ K D K D ,
(A3) is implied by
γ K D K D + K D [ R B Ψ R B R B Ψ + Ψ B P B Ψ ] K D K D Σ K D 0 .
Therefore, if Σ < 0 (i.e., negative definite), the closed-loop system is asymptotically stable. Note that, in fact, Σ < 0 implies lim k K x ( k ) = 0 as k . However, due to the detectability of ( A , Q ) , that also means lim k x ( k ) = 0 as k .
In order to look into Σ = γ I p + R B Ψ R B R B Ψ + Ψ B P ^ B Ψ < 0 , note that the following reasoning holds
γ I p + R B Ψ R B R B Ψ + Ψ B P ^ B Ψ < 0
γ I p + ( I p Ψ ) R B R B Ψ + Ψ B P ^ B Ψ < 0 ( using the definitions of R B and P ^ ) γ I p + ( I p Ψ ) ( R + M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ )
( R + M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ ) Ψ + Ψ M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ Ψ < 0 .
The last equality is satisfied if inequality (A8) is met
α ( I p Ψ Ψ ) + β Ψ Ψ γ I p < 0 ,
where
R + M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ σ max ( R + M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ ) I p α I p ,
M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ σ max ( M ^ ( Q N 1 + S ^ R N 1 S ^ ) 1 M ^ ) I p β I p .
It has to be noted that all terms on the left hand side of (A8) are diagonal matrices, which means that inequality (A8) can be understood element-wise. Hence, each element of (A8) is
α γ α ( σ i j ω i + σ i + j ω i ) + β ( σ i 2 + ω i 2 ) = β σ i α β 2 + ω i 2 + β ( α γ ) α 2 β 2 < 0 ,
where σ i + j ω i denotes the ith element of Ψ . This proves the first part of the theorem. For the case where R is diagonal, since it is simply an application of the first part, it is omitted here.

References

  1. Antsaklis, P.J.; Michel, A.N. Linear Systems; Birkhauser: Basel, Switzerland, 2006. [Google Scholar]
  2. Anderson, B.D.O.; Moore, J.B. Optimal Control: Linear Quadratic Methods; Prentice Hall: Hoboken, NJ, USA, 1989. [Google Scholar]
  3. Nelles, O.; Nelles, O. Nonlinear Dynamic System Identification; Springer: Berlin/Heidelberg, Germany, 2020. [Google Scholar]
  4. Mauroy, A.; Goncalves, J. Koopman-based lifting techniques for nonlinear systems identification. IEEE Trans. Autom. Control 2019, 65, 2550–2565. [Google Scholar] [CrossRef]
  5. Lin, J.; Michailidis, G. System identification of high-dimensional linear dynamical systems with serially correlated output noise components. IEEE Trans. Signal Process. 2020, 68, 5573–5587. [Google Scholar] [CrossRef]
  6. Xue, Y.; Liu, Y.; Ji, C.; Xue, G.; Huang, S. System identification of ship dynamic model based on Gaussian process regression with input noise. Ocean. Eng. 2020, 216, 107862. [Google Scholar] [CrossRef]
  7. Dörfler, F. Data-Driven Control: Part One of Two: A Special Issue Sampling from a Vast and Dynamic Landscape. IEEE Control Syst. Mag. 2023, 43, 27–31. [Google Scholar] [CrossRef]
  8. Bisoffi, A.; De Persis, C.; Tesi, P. Data-driven control via Petersen’s lemma. Automatica 2022, 145, 110537. [Google Scholar] [CrossRef]
  9. Pólik, I.; Terlaky, T. A Survey of the S-Lemma. SIAM Rev. 2007, 49, 371–418. [Google Scholar] [CrossRef]
  10. Berberich, J.; Scherer, C.W.; Allgöwer, F. Combining prior knowledge and data for robust controller design. IEEE Trans. Autom. Control 2022, 68, 4618–4633. [Google Scholar] [CrossRef]
  11. Willems, J.C.; Rapisarda, P.; Markovsky, I.; De Moor, B.L. A note on persistency of excitation. Syst. Control Lett. 2005, 54, 325–329. [Google Scholar] [CrossRef]
  12. De Persis, C.; Tesi, P. Formulas for data-driven control: Stabilization, optimality, and robustness. IEEE Trans. Autom. Control 2019, 65, 909–924. [Google Scholar] [CrossRef]
  13. Huang, L.; Zhen, J.; Lygeros, J.; Dörfler, F. Robust data-enabled predictive control: Tractable formulations and performance guarantees. IEEE Trans. Autom. Control 2023, 68, 3163–3170. [Google Scholar] [CrossRef]
  14. Van Waarde, H.J.; Eising, J.; Trentelman, H.L.; Camlibel, M.K. Data informativity: A new perspective on data-driven analysis and control. IEEE Trans. Autom. Control 2020, 65, 4753–4768. [Google Scholar] [CrossRef]
  15. Rotulo, M.; De Persis, C.; Tesi, P. Data-driven linear quadratic regulation via semidefinite programming. IFAC-PapersOnLine 2020, 53, 3995–4000. [Google Scholar] [CrossRef]
  16. Berberich, J.; Köhler, J.; Müller, M.A.; Allgöwer, F. Data-driven model predictive control with stability and robustness guarantees. IEEE Trans. Autom. Control 2020, 66, 1702–1717. [Google Scholar] [CrossRef]
  17. Berberich, J.; Köhler, J.; Müller, M.A.; Allgöwer, F. Linear tracking MPC for nonlinear systems—Part II: The data-driven case. IEEE Trans. Autom. Control 2022, 67, 4406–4421. [Google Scholar] [CrossRef]
  18. Shaked, U. Guaranteed stability margins for the discrete-time linear quadratic optimal regulator. IEEE Trans. Autom. Control 1986, 31, 162–165. [Google Scholar] [CrossRef]
  19. Safonov, M.G. Stability and Robustness of Multivariable Feedback Systems, 1st ed.; MIT Press: Cambridge, MA, USA, 1980. [Google Scholar]
  20. Kim, S.; Kwon, W.H. On the improvement of the stability robustness in the discrete-time LQ regulator. J. Control Autom. Syst. Eng. 1995, 1, 83–87. [Google Scholar]
  21. Lee, J.; Kim, J.S.; Shim, H. Disc margins of the discrete-time LQR and its application to consensus problem. Int. J. Syst. Sci. 2012, 43, 1891–1900. [Google Scholar] [CrossRef]
  22. Seiler, P.; Packard, A.; Gahinet, P. An introduction to disk margins (lecture notes). IEEE Control Syst. Mag. 2020, 40, 78–95. [Google Scholar] [CrossRef]
  23. da Silva, G.R.G.; Bazanella, A.S.; Lorenzini, C.; Campestrini, L. Data-driven LQR control design. IEEE Control Syst. Lett. 2018, 3, 180–185. [Google Scholar] [CrossRef]
  24. Furuta, K.; Wongsaisuwan, M. Discrete-time LQG dynamic controller design using plant Markov parameters. Automatica 1995, 31, 1317–1324. [Google Scholar] [CrossRef]
  25. Lin, Z. Low Gain Feedback, 1st ed.; Springer: Berlin/Heidelberg, Germany, 1998. [Google Scholar]
  26. Wieland, P.; Kim, J.S.; Allgöwer, F. On topology and dynamics of consensus among linear high-order agents. Int. J. Syst. Sci. 2011, 42, 1831–1842. [Google Scholar] [CrossRef]
  27. Ren, W.; Beard, R.W.; Atkins, E.M. Information consensus in multivehicle cooperative control. IEEE Control Syst. Mag. 2007, 27, 71–82. [Google Scholar]
  28. Fax, J.A.; Murray, R.M. Information flow and cooperative control of vehicle formations. IEEE Trans. Autom. Control 2004, 49, 1465–1476. [Google Scholar] [CrossRef]
  29. Tuna, S.E. LQR-based coupling gain for synchronization of linear systems. arXiv 2008, arXiv:0801.3390. [Google Scholar]
  30. Lewis, F. A generalized inverse solution to the discrete-time singular Riccati equation. IEEE Trans. Autom. Control 1981, 26, 395–398. [Google Scholar] [CrossRef]
  31. Furuta, K.; Wongsaisuwan, M. Closed-form solutions to discrete-time LQ optimal control and disturbance attenuation. Syst. Control Lett. 1993, 20, 427–437. [Google Scholar] [CrossRef]
  32. Coulson, J.; van Waarde, H.J.; Lygeros, J.; Dörfler, F. A Quantitative Notion of Persistency of Excitation and the Robust Fundamental Lemma. IEEE Control Syst. Lett. 2023, 7, 1243–1248. [Google Scholar] [CrossRef]
  33. Overschee, P.V.; Moor, D. Subspace Identification for Linear Systems: Theory—Implementation—Applications; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
  34. Lim, R.K.; Phan, M.Q.; Longman, R.W. State Estimation with ARMarkov Models; Department of Mechanical and Aerospace Engineering Technical Report; Princeton University: Princeton, NJ, USA, 1998. [Google Scholar]
  35. Olfati-Saber, R.; Fax, J.A.; Murray, R.M. Consensus and cooperation in networked multi-agent systems. Proc. IEEE 2007, 95, 215–233. [Google Scholar] [CrossRef]
  36. Strang, G. Linear Algebra and Its Applications, 4th ed.; Thomson Brooks/Cole: Belmont, CA, USA, 2006. [Google Scholar]
  37. Bullo, F. Lectures on Network Systems; Kindle Direct Publishing: Seattle, WA, USA, 2020; Volume 1. [Google Scholar]
  38. Ren, W.; Beard, R.W. Consensus seeking in multiagent systems under dynamically changing interaction topologies. IEEE Trans. Autom. Control 2005, 50, 655–661. [Google Scholar] [CrossRef]
  39. de Abreu, N.M.M. Old and new results on algebraic connectivity of graphs. Linear Algebra Its Appl. 2007, 423, 53–73. [Google Scholar] [CrossRef]
  40. Franceschelli, M.; Gasparri, A.; Giua, A.; Seatzu, C. Decentralized estimation of Laplacian eigenvalues in multi-agent systems. Automatica 2013, 49, 1031–1036. [Google Scholar] [CrossRef]
Figure 1. Gain margin Ω ¯ ϵ and set Θ .
Figure 1. Gain margin Ω ¯ ϵ and set Θ .
Mathematics 12 00199 g001
Figure 2. Historical input data.
Figure 2. Historical input data.
Mathematics 12 00199 g002
Figure 3. Historical state data.
Figure 3. Historical state data.
Mathematics 12 00199 g003
Figure 4. Gain margin of the DDLQR and uncertainty test points.
Figure 4. Gain margin of the DDLQR and uncertainty test points.
Mathematics 12 00199 g004
Figure 5. DDLQR simulation result with applied input channel uncertainty.
Figure 5. DDLQR simulation result with applied input channel uncertainty.
Mathematics 12 00199 g005
Figure 6. Graph for information exchange.
Figure 6. Graph for information exchange.
Mathematics 12 00199 g006
Figure 7. Sufficiently large disc margin with sufficiently small ϵ .
Figure 7. Sufficiently large disc margin with sufficiently small ϵ .
Mathematics 12 00199 g007
Figure 8. Consensus problem data-driven LQR results.
Figure 8. Consensus problem data-driven LQR results.
Mathematics 12 00199 g008
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Umar, A.A.; Ryu, K.; Back, J.; Kim, J.-S. Stability Margin of Data-Driven LQR and Its Application to Consensus Problem. Mathematics 2024, 12, 199. https://doi.org/10.3390/math12020199

AMA Style

Umar AA, Ryu K, Back J, Kim J-S. Stability Margin of Data-Driven LQR and Its Application to Consensus Problem. Mathematics. 2024; 12(2):199. https://doi.org/10.3390/math12020199

Chicago/Turabian Style

Umar, Abdul Aris, Kunhee Ryu, Juhoon Back, and Jung-Su Kim. 2024. "Stability Margin of Data-Driven LQR and Its Application to Consensus Problem" Mathematics 12, no. 2: 199. https://doi.org/10.3390/math12020199

APA Style

Umar, A. A., Ryu, K., Back, J., & Kim, J.-S. (2024). Stability Margin of Data-Driven LQR and Its Application to Consensus Problem. Mathematics, 12(2), 199. https://doi.org/10.3390/math12020199

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop