Abstract
In the dual-mode model predictive control (MPC) framework, the size of the stabilizable set, which is also the region of attraction, depends on the terminal constraint set. This paper aims to formulate a larger terminal set for enlarging the region of attraction in a nonlinear MPC. Given several control laws and their corresponding terminal invariant sets, a convex combination of the given sets is used to construct a time-varying terminal set. The resulting region of attraction is the union of the regions of attraction from each invariant set. Simulation results show that the proposed MPC has a larger stabilizable initial set than the one obtained when a fixed terminal set is used.
1. Introduction
Model predictive control (MPC) is an optimal controller that minimizes a cost index over a finite horizon implemented in the receding horizon framework. The advantage of MPC over conventional controllers is the ability to handle the state and control constraints. When the state is measured at a sampling time, an optimization problem is solved. However, only the first element of the optimal solution is applied to the system. Then, the whole procedure is repeated at the next sampling time [1].
The concept of a dual-mode MPC is widely used to guarantee the stability of MPC for both linear systems, e.g., [2,3], and nonlinear systems, e.g., [4,5,6]. The terminal penalty function is used as an upper bound of the infinite horizon cost needed to drive the state trajectory to the origin when the initial condition is in the terminal region. Moreover, closed-loop stability is ensured by forcing the terminal state to belong to a feasible and invariant set. Hence, the region of attraction is the set of initial conditions that can be steered to the terminal region in N steps or less, where N is the prediction horizon. In other words, it is a N-steps stabilizable set. Several studies have been devoted to formulate a terminal set that makes the region of attraction as large as possible. By increasing the prediction horizon, the domain of attraction can be enlarged at the expense of more computational effort due to the increasing number of decision variables [7]. Hence, in the literature, various approaches have been proposed to enlarge the region of attraction through a larger terminal set. In [8], it is shown that the saturated local control law is used to yield a considerably larger terminal constraint set. A sequence of sets is used in [9] to replace a single terminal set. The contractive set does not need to be invariant as long as there is an admissible control that eventually steers the states to an invariant set. Hence, a larger domain of attraction can be obtained by extending the sequence with a reachable set. In [10], the terminal constraint is only applied to the unstable states, giving the flexibility to have a larger set. A linear time-varying MPC is mostly used to extend a linear MPC with an enlarged terminal set for a nonlinear system, e.g., in [11,12,13,14].
Interpolation-based MPC for a linear system is studied by [15,16] to achieve a compromise between the size of domain attraction and the optimality. The MPC is designed by selecting several invariant sets and expressing the terminal state as a convex combination of states belonging to the invariant sets. By doing that, the resulting terminal set becomes the convex hull of the predefined-invariant sets. For reducing the number of decision variables, the interpolation method can be implicitly employed, meaning that the terminal state is not explicitly expressed as the convex combination of several states but is still the convex hull of several invariant sets, e.g., in [17,18]. The design procedure starts by designing several stabilizing feedback control gains for a given linear time-invariant system. Then, feasible and invariant ellipsoids , , are defined such that some necessary linear matrix inequalities (LMIs) are satisfied for each matrix . These LMIs are popularly used to show that can be applied as the terminal set for a linear MPC [19]. It is then shown that applying a convex combination to the m given LMIs yields a new LMI. As a result, the set is a feasible and invariant set for any satisfying . In view of this, the method used in [17,18] is highly dependant on the definition of the invariant sets through the use of an LMI form. Although an LMI form has been widely applied for many applications, e.g., in a consensus problem [20,21], the terminal set for a nonlinear MPC is not generally defined in an LMI form; thus, the convex hull may not be an invariant set despite that the ingredient sets are invariant. To deal with this, a linear differential inclusion (LDI) is used in [12,13] to represent the nonlinear system. Specifically, the nonlinear system can be represented as where and with properly chosen . As a result, after computing all ellipsoids using a common LDI, the convex hull of the several invariant sets can be used as the terminal set of a nonlinear MPC [12]. However, LDI representation is generally conservative and is hard to obtain [22]. With this in mind, this paper is interested in applying a more general convex combination to several terminal sets of a nonlinear MPC.
This paper aims to enlarge the domain of attraction of a nonlinear MPC by having a larger terminal region. Given several feasible and invariant sets , this paper proposes a convex combination strategy to define a new set to define a larger terminal set. The proposed strategy is different from the one discussed above as it does not require the help of LDI to make to be used as a terminal set for a nonlinear MPC. Moreover, it is shown that results in the union of . The feasibility and stability of the MPC can be guaranteed with the enlarged terminal set through the use of a common local Lyapunov function defined in all sets . Numerical simulations demonstrate that the proposed MPC has a larger region of attraction than the one obtained by the conventional MPC.
Notation
Given any sets and , the union and the convex hull of these sets are denoted by and , respectively. For a square matrix , denotes a positive definite matrix, and is the eigenvalue of A whose absolute value is smallest and is defined similarly.
2. Preliminary Result: Nonlinear MPC
Consider discrete-time nonlinear systems:
where , is the state and is the input at time instant k. The system is subject to the control input and state constraints
where and are convex and compact sets. It is assumed that the state is measurable and there is neither external disturbance nor model uncertainty. The following assumption is required for system (1).
Assumption 1.
The function f is twice differentiable, and .
Having this assumption, is an equilibrium of system (1). The optimization problem for the finite horizon nonlinear MPC for nonlinear system (1) is formulated as follows:
where and is the current state measurement. Here, and denote input and state predictions at time that are computed at time k. Thus, is the control prediction over the prediction horizon. The function is known as the terminal penalty cost. The matrix and are the weighting matrices for the state and input variables. Set denotes the terminal penalty set where a popular choice for the terminal set is an ellipsoid set , where is a symmetric matrix.
The nonlinear MPC design procedure starts by solving the optimization problem when the state is measured at the kth sampling time. As a result, the optimal control sequence is obtained, and is applied to the plant. Therefore, the (implicit) model predictive control law is . Afterward, the same procedure is repeated at the next sampling time. For details, see [1,5,23].
Stability of MPC
The terminal region and terminal penalty function play a pivotal role in ensuring the stability of the nonlinear MPC (3). The following lemma describes the required condition for stability.
Lemma 1.
[1,3] With , , suppose that the following holds
- A.1
- such that , (feasibility)
- A.2
- , (invariance)
- A.3
- A.4
- .
Then, the origin is asymptotically stable for the closed loop system with a region of attraction , i.e., is the set of states steerable to by an admissible control in N steps or less.
For the purpose of enlarging the domain of attraction , this paper is interested in formulating an MPC in which its terminal set is defined as the convex combination of several invariant sets and is discussed in the next section.
3. MPC Based on the In-Between Terminal Set
In this paper, several sets are used as the ingredients of the time-varying terminal set for a nonlinear MPC. Specifically, given m ellipsoids , let be an ellipsoid given by
where
and . From (5), a convex combination of and are used to define the new ellipsoid . In [24], is called as the in-between ellipsoid. Note that when , . Hence, holds true, where . In view of this, in (4) can be seen as a general way to define all . A question arises on how the set looks like. Note that includes all the possible set for any such that . In the following lemma, it is shown that is a subset of the union of .
Lemma 2.
See Appendix A for the proof.
Since when and (6) holds true, it follows that , meaning that the union of all possible results in the union of Figure 1 demonstrates Lemma 2 when only two ellipsoids and are considered. The property of given by (6) is depicted in Figure 1b with . It is important to note that convex combination (5) does not result in convex hull of as a convex hull is made by applying a convex combination of ellipsoids expressed by with [17,18]. See Table 1. Instead, as can be seen in Figure 1, the convex combination (5) yields the union of . Since, in general, holds true, (6) implies that .Although this shows that the convex combination used in this paper results in a smaller set than the one obtained in [17,18], it is shown that under the following assumption, the in-between ellipsoid can be used as the terminal set of an MPC and requires a relatively simple procedure even for nonlinear systems.
Figure 1.
(a) For given ellipsoids and , (b) .
Assumption 2.
For given and , suppose that there exist m pairs of satisfying A.1-A.3 of Lemma 1, and such that
where is a symmetric and positive definite matrix and .
In the literature, there are several methods to find an invariant set and a terminal penalty cost such that the stability of the quasi-infinite horizon-based MPC (3) can be guaranteed, e.g., [5,25,26]. In these studies, is shown as a local Lyapunov function in the set under a stabilizing control law , meaning that A1-A4 in Lemma 1 are satisfied. Thus, and can be used as the terminal penalty and the terminal region of a nonlinear MPC, respectively. However, Assumption 2 requires the existence of a common local Lyapunov function defined in all the sets [27]. Suppose that satisfying Lemma 1 are given in the first place, the common Lyapunov function can be found by choosing a positive definite matrix such that the following holds true
Note that (8) is equivalent with (7). A method based on the existing method [5,25,26] can also be employed to make Assumption 2 holds true. In fact, the following lemma can be obtained by extending the result in [26].
Lemma 3.
Suppose that Assumption 1 holds true, and that the following linearized system of (1) in the neighborhood of the origin is controllable
Let be a stabilizing feedback control law with feedback gain K and P be a positive definite matrix satisfying
where , . Moreover, suppose that is a positive definite matrix such that
Then, there exists a constant α specifying an ellipsoid in the form of such that , , is an invariant set for nonlinear system (1), and that the following holds true
Having this lemma, the sets satisfying Assumption 2 can be obtained through the use of the linearized system and state feedback control laws. Specifically, is chosen such that (11) holds true for the given , and is chosen such that (12) holds true. The details of the method and the proof of the lemma are given in Appendix B.
Given , the proposed MPC with enlarged terminal region is computed by solving the following optimization problem
where ,
and . In the following, it is shown that the stability of the proposed MPC computed by (13) can be guaranteed.
Theorem 4.
Proof.
Let , and , , be the optimal solution of problem (13) at time k. Then, the optimal vector and are given by
Having this, the optimal cost function at time k is given by
By Lemma 2, there is at least one j, , such that . Note that , , are feasible at since is an invariant set with . Thus, the following is a feasible control sequence at
The resulting state prediction is given by
where due to the invariant set . Using (15) and (16), the cost function at is
Note that is not the optimal cost function at . Moreover, (17) can be rewritten as follows
where is defined in (14). Applying (7) to this inequality yields
As a result, the optimal cost function at time , denoted by , satisfies
Note that optimization problem (13) uses as the fixed penalty cost function but employs a time-varying terminal set . Since can be any non-negative constants such that their sum is one, the union of can be seen as the terminal set of (13), meaning that the proposed method potentially yields larger region of attraction when the region of attraction of each invariant set is not the subset of the others, i.e., .
Although the resulting terminal set is not the convex hull of the predefined sets, the proposed method can be easily employed for a nonlinear system without the need for an LDI argument. Moreover, the ingredient sets can be obtained using a similar procedure as in [5,25], or [26]. In the next section, numerical simulations are used to investigate the performance of the proposed MPC for a nonlinear system.
4. Example
In this section, the optimization problem (13) is employed for two nonlinear systems. At first, numerical simulation on a control-affine system with is presented. Then, a three-dimensional system is used to investigate the proposed method for a general nonlinear system.
4.1. A Two-Dimensional Nonlinear System
Consider the following nonlinear system
where . The system is subjected to constraints , , and . The system is discretized and linearized with sampling time . For given and , suppose that a state feedback controller is used as a stabilizing control law, i.e., . Moreover, consider two pairs of , and satisfying Assumption 2 given by
where , , and is computed using the procedure described in Appendix B. Figure 2 shows the sets and the estimate of region of attraction and for . In this example, the resulting region of attraction from is estimated by applying as a fixed terminal set for different initial conditions. The proposed MPC results in a time-varying terminal set at each sampling time where is depicted in Figure 3. Figure 4 demonstrates that, for any , the states are steered to the origin. Moreover, the input constraint is satisfied as shown in Figure 5.
Figure 2.
The invariant set and an estimate of the corresponding region of attraction with . Since , a larger region of attraction can be obtained, i.e., .
Figure 3.
Variable defining the time-varying terminal set .
Figure 4.
State converges to the origin for all .
Figure 5.
Control input u satisfies input constraint.
As shown in Figure 2 and Table 2, some initial conditions lead to infeasibility when only one set (i.e., either or ) is considered for the terminal region. Meanwhile, thanks to the time-varying terminal set , is the resulting region of attraction. As a result, the proposed MPC is feasible for all six different initial states in Table 2, meaning that a larger region of attraction is obtained. Let be the simulation cost. It is shown in Table 2 and Figure 6 that, compared to the cost obtained by the MPC with as the terminal region, the proposed MPC yields a comparable cost with a larger domain of attraction. In summary, the proposed nonlinear MPC with enlarged terminal set results in not only a larger region of attraction but also acceptable performance.
Table 2.
Cost comparison between various terminal sets for given six different initial states.
Figure 6.
(a) Simulation cost is computed using 22 initial states which are feasible for . (b) The proposed MPC yields a similar cost with the one obtained using .
4.2. A Three-Dimensional Nonlinear System
Consider the following nonlinear system [28,29]
Note that the considered system is not linear in the input variable u. It is assumed that . Similar to the previous example, the weighting matrices Q and R are set to and . The following are two pairs of , and satisfying Assumption 2
and , . The resulting feasible and invariant sets are shown in Figure 7. The set is not a subset of , and vice versa. Figure 8 depicts several initial states when is considered as the terminal set of a nonlinear MPC with . Note that there are several initial states which are only feasible when is employed as the terminal set of a nonlinear MPC. Likewise, there are several initial states which are only feasible when is considered. In view of this, similar as in the previous example, using both and as the ingredient to define the time-varying terminal set yields a larger region of attraction, i.e., . Applying the optimization problem (13) with to few different initial conditions of (18) results in and the state trajectories depicted in Figure 9a,b, respectively. Note that the resulting terminal set can be larger than the one obtained here if we consider more ingredient sets , i.e., .
Figure 7.
The invariant set and , , for system (18) where they are is plotted (a) in a three-dimensional space and (b) in view of and axes. Note that and .
Figure 8.
Few feasible initial states obtained using as the terminal set of a nonlinear MPC with , where they are is plotted (a) in a three-dimensional space and (b) in view of and axes. Note that some states are feasible only for the MPC with , .
Figure 9.
(a) Resulting defining the time-varying terminal set . (b) Trajectories from different initial states which are feasible to either or .
5. Conclusions
This paper proposes a new time-varying terminal set for the nonlinear MPC via a convex combination of given invariant and feasible sets. The resulting terminal set is the union of the predefined sets. Compared to the existing results on enlarging the terminal set through a convex combination strategy, the proposed approach appears to be more general since it does not require linear differential inclusion (LDI) representation of the nonlinear system in order to implement the proposed nonlinear MPC scheme. The existence of a common local Lyapunov function in all the ingredient invariant sets plays a key role in guaranteeing the stability of the proposed MPC. The paper gives a general way to use several sets obtained from different state-feedback gains. Possible future research includes the extension of this work to a tracking MPC problem.
Author Contributions
I.R.F. surveyed the backgrounds of this research, designed the control strategies, and performed the simulations to show the benefits of the proposed method. J.-S.K. supervised and supported this study. All authors have read and agreed to the published version of the manuscript.
Funding
This study was supported by the Advanced Research Project funded by the SeoulTech (Seoul National University of Science and Technology).
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Proof of Lemma 2
By definition, since is a symmetric and positive definite matrix, is an ellipsoid. Next, for the purpose of proving (6), let us consider its contradiction, meaning that there exists such that is not in the union of . For such , there exists satisfying the following conditions at the same time
- C.1
- C.2
- .
From C.1, is obtained, which contradicts C.2. Thus, the claim is proven.
Appendix B. Common Lyapunov Function via Linearized System
Appendix B.1. Proof of Lemma 3
For all , all the eigenvalues of lie in the unit circle since is Hurwitz. Thus, (10) admits a unique positive definite solution P. Since , there exist such that where , and . In light of this, the dynamics (1) in can be seen as an unconstrained nonlinear system, thereby, (1) with can be written as
where . Then, it remains to show that there exists such that (12), i.e., invariance holds true. Consider a candidate Lyapunov function for satisfying (11). It follows that
In view of (11), there exists a positive semidefinite matrix such that
holds true, i.e., . Using (A3) and the definition of , (A2) can be written as
where
Note that when . Recall that, according to the Taylor’s Theorem, for a continuous and twice differentiable function g on an open interval I around a, there is a value between a and b so that
In order to apply this to function , let be some value between and . Then, function yields
owing to and . Denote
Let be defined as follows
Thus, for all . Furthermore,
Let such that
for all . Then, . Note that and . With this and (A7) in mind,
It follows that for all . Substituting this to (A4) and using (10) yield .
Appendix B.2. A Common Lyapunov Function
References
- Mayne, D.Q.; Rawlings, J.B.; Rao, C.V.; Scokaert, P.O.M. Constrained model predictive control: Stability and optimality. Automatica 2000, 36, 789–814. [Google Scholar] [CrossRef]
- Rossiter, J.A. Model-Based Predictive Control: A Practical Approach; CRC Press: Boca Raton, FL, USA, 2013. [Google Scholar]
- Lee, J.W.; Kwon, W.H.; Choi, J. On Stability of Constrained Receding Horizon Control with Finite Terminal Weighting Matrix. Automatica 1998, 34, 1607–1612. [Google Scholar] [CrossRef]
- Mayne, D.Q.; Michalska, H. Receding horizon control of nonlinear systems. IEEE Trans. Autom. Control. 1990, 35, 814–824. [Google Scholar] [CrossRef]
- Chen, H.; AllGöwer, F. A Quasi-Infinite Horizon Nonlinear Model Predictive Control Scheme with Guaranteed Stability. Automatica 1998, 34, 1205–1217. [Google Scholar] [CrossRef]
- Yu, S.; Reble, M.; Chen, H.; Allgöwer, F. Inherent robustness properties of quasi-infinite horizon nonlinear model predictive control. Automatica 2014, 50, 2269–2280. [Google Scholar] [CrossRef]
- Magni, L.; Nicolao, G.; Magnani, L.; Scattolini, R. A stabilizing model-based predictive control algorithm for nonlinear systems. Automatica 2001, 37, 1351–1362. [Google Scholar] [CrossRef]
- De Doná, J.; Seron, M.; Mayne, D.; Goodwin, G. Enlarged terminal sets guaranteeing stability of receding horizon control. Syst. Control. Lett. 2002, 47, 57–63. [Google Scholar] [CrossRef]
- Limon, D.; Alamo, T.; Camacho, E. Enlarging the domain of attraction of MPC controllers. Automatica 2005, 41, 629–635. [Google Scholar] [CrossRef]
- González, A.H.; Odloak, D. Enlarging the domain of attraction of stable MPC controllers, maintaining the output performance. Automatica 2009, 45, 1080–1085. [Google Scholar] [CrossRef]
- Chen, W.H.; O’Reilly, J.; Ballance, D.J. On the terminal region of model predictive control for non-linear systems with input/state constraints. Int. J. Adapt. Control. Signal Process. 2003, 17, 195–207. [Google Scholar] [CrossRef]
- Bacic, M.; Cannon, M.; Kouvaritakis, B. General interpolation for input-affine nonlinear systems. In Proceedings of the 2004 American Control Conference, Boston, MA, USA, 30 June–2 July 2004; Volume 3, pp. 2010–2014. [Google Scholar]
- Zhao, M.; Jiang, C.; Tang, X.; She, M. Interpolation Model Predictive Control of Nonlinear Systems Described by Quasi-LPV Model. Autom. Control. Comput. Sci. 2018, 52, 354–364. [Google Scholar] [CrossRef]
- Yu, S.; Chen, H.; Böhm, C.; Allgöwer, F. Enlarging the terminal region of NMPC with parameter-dependent control law. In Nonlinear Model Predictive Control-Towards New Challenging Applications; Magni, L., Raimondo, D., Allgöwer, F., Eds.; Lecture Notes in Control and Information Sciences; Springer: Berlin/Heidelberg, Germany, 2009; pp. 69–78. [Google Scholar]
- Bacic, M.; Cannon, M.; Lee, Y.I.; Kouvaritakis, B. General interpolation in MPC and its advantages. IEEE Trans. Autom. Control. 2003, 48, 1092–1096. [Google Scholar] [CrossRef]
- Rossiter, J.A.; Kouvaritakis, B.; Bacic, M. Interpolation based computationally efficient predictive control. Int. J. Control. 2004, 77, 290–301. [Google Scholar] [CrossRef]
- Pluymers, B.; Roobrouck, L.; Buijs, J.; Suykens, J.; Moor, B.D. Constrained linear MPC with time-varying terminal cost using convex combinations. Automatica 2005, 41, 831–837. [Google Scholar] [CrossRef]
- Kim, J.S.; Lee, Y.I. An Interpolation Technique for Input Constrained Robust Stabilization. Int. J. Control. Autom. Syst. 2018, 16, 1569–1576. [Google Scholar] [CrossRef]
- Kothare, M.V.; Balakrishnan, V.; Morari, M. Robust constrained model predictive control using linear matrix inequalities. Automatica 1996, 32, 1361–1379. [Google Scholar] [CrossRef]
- Shang, Y. Couple-group consensus of continuous-time multi-agent systems under Markovian switching topologies. J. Frankl. Inst. 2015, 352, 4826–4844. [Google Scholar] [CrossRef]
- Shang, Y. Consensus seeking over Markovian switching networks with time-varying delays and uncertain topologies. Appl. Math. Comput. 2016, 273, 1234–1245. [Google Scholar] [CrossRef]
- Bohm, C.; Raff, T.; Findeisen, R.; Allgower, F. Calculating the terminal region of NMPC for Lure systems via LMIs. In Proceedings of the 2008 the American Control Conference, Seattle, WA, USA, 11–13 June 2008. [Google Scholar]
- Rawlings, J.B.; Mayne, D.Q. Model Predictive Control: Theory and Design; Nob Hill Pub.: Madison, WI, USA, 2009. [Google Scholar]
- Schröcker, H.P. Uniqueness results for minimal enclosing ellipsoids. Comput. Aided Geom. Des. 2008, 25, 756–762. [Google Scholar] [CrossRef]
- Johansen, T.A. Approximate explicit receding horizon control of constrained nonlinear systems. Automatica 2004, 40, 293–300. [Google Scholar] [CrossRef]
- Yu, S.; Qu, T.; Xu, F.; Chen, H.; Hu, Y. Stability of finite horizon model predictive control with incremental input constraints. Automatica 2017, 79, 265–272. [Google Scholar] [CrossRef]
- Liberzon, D. Switching in Systems and Control; Birkhäuser: Boston, MA, USA, 2003. [Google Scholar]
- Garrard, W.L.; Jordan, J.M. Design of nonlinear automatic flight control systems. Automatica 1977, 13, 497–505. [Google Scholar] [CrossRef]
- Kaiser, E.; Kutz, J.N.; Brunton, S.L. Sparse identification of nonlinear dynamics for model predictive control in the low-data limit. Proc. R. Soc. Math. Phys. Eng. Sci. 2018, 474. [Google Scholar] [CrossRef] [PubMed]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).