Global Stabilization of a Reaction Wheel Pendulum: A Discrete-Inverse Optimal Formulation Approach via A Control Lyapunov Function

Montoya, Oscar Danilo; Gil-González, Walter; Dominguez-Jimenez, Juan A.; Molina-Cabrera, Alexander; Giral-Ramírez, Diego A.

doi:10.3390/sym12111771


by Oscar Danilo Montoya ¹,²,*, Walter Gil-González ³,⁴, Juan A. Dominguez-Jimenez ⁵, Alexander Molina-Cabrera ⁴ and Diego A. Giral-Ramírez ⁶

¹ Facultad de Ingeniería, Universidad Distrital Francisco José de Caldas, Bogotá 11021, Colombia
² Laboratorio Inteligente de Energía, Universidad Tecnológica de Bolívar, km 1 vía Turbaco, Cartagena 131001, Colombia
³ Grupo GIIEN, Facultad de Ingeniería, Institución Universitaria Pascual Bravo, Campus Robledo, Medellín 050036, Colombia
⁴ Facultad de Ingenierías, Universidad Tecnológica de Pereira, Pereira 660003, Colombia
⁵ Hydrogen Research Institute, Université du Quebec à Trois-Rivieres, Trois-Rivières, QC 3351, Canada
⁶ Facultad Tecnológica, Universidad Distrital Francisco José de Caldas, Carrera 7 No. 40B-53, Bogotá 11021, Colombia
* Author to whom correspondence should be addressed.

Symmetry 2020, 12(11), 1771; https://doi.org/10.3390/sym12111771

Submission received: 23 September 2020 / Revised: 16 October 2020 / Accepted: 20 October 2020 / Published: 26 October 2020

(This article belongs to the Special Issue Advances in Nonlinear, Discrete, Continuous and Hamiltonian Systems)


Abstract

This paper deals with the global stabilization of the reaction wheel pendulum (RWP) in the discrete-time domain. The discrete-inverse optimal control approach via a control Lyapunov function (CLF) is employed to perform the stabilization task. The main advantages of this control methodology can be summarized as follows: (i) it guarantees exponential stability in closed-loop operation, and (ii) the inverse control law is optimal since it minimizes the cost functional of the system. Numerical simulations demonstrate that the RWP is stabilized with the discrete-inverse optimal control approach via a CLF, with settling times that depend on the control gains. Furthermore, tests under parametric uncertainties and comparisons with nonlinear controllers developed in the continuous-time domain, such as passivity-based and Lyapunov-based approaches, demonstrate the superiority of the proposed discrete control approach. All simulations were implemented in MATLAB.

Keywords:

discrete-inverse optimal control; global exponential stabilization; reaction wheel pendulum; parametric uncertainties; discrete-affine systems; cost functional


1. Introduction

The dynamics of nonlinear systems are a central concern in science and engineering. Test prototypes are a powerful strategy for analyzing them [1], because their nonlinear dynamics give a better grasp of the physical behavior found in several industrial applications, including robotic systems, aerospace systems, and marine vehicles [2,3]. There are multiple versions of the classic pendulum model, the best known being the reaction wheel pendulum (RWP), the pendulum on a cart with linear displacement, the Furuta pendulum with a rotating base, and variants of these with two and three bars, among others. This study considers the well-known educational RWP dynamical system.

The RWP is an inverted pendulum balanced by a flywheel, i.e., an actuated rotating reaction wheel. It poses important challenges in the field of control theory, including robustness, stabilization, and nonlinearities. These features combined make it an attractive and adequate system for research and high-level education. Various engineering problems can be modeled in a similar way to an inverted pendulum, including rocket launch and human-powered vehicles.

In the literature, the control problem for the RWP has been addressed with both linear and nonlinear techniques as well as artificial intelligence methods. The main approaches are: control Lyapunov functions [4], passivity-based control [5], proportional-integral controllers [6], fuzzy logic [7], feedback linearization [8], and deep neural networks [9].

The following items represent the key aspects that make this approach different from similar works:

✓: Global stabilization of the discrete version of the RWP dynamic model system via control Lyapunov functions with global exponential convergence and optimality properties.
✓: Robustness performance against parametric uncertainties while preserving asymptotic convergence properties.
✓: The comparison of the proposed discrete-inverse optimal controllers via control Lyapunov functions with the discrete versions of the passivity-based and Lyapunov-based controllers (from the continuous domain) demonstrating superior performances regarding stabilization of the RWP system with minimum settling times.

After a thorough literature review of the different approaches to modeling and controlling the RWP dynamical system, we identified a gap. Despite the wide variety of methods presented to date, strategies based on discrete-inverse optimal control are scarcely applied to this sort of system [10,11,12]. This is the research gap that this article aims to fill.

To design a discrete inverse optimal controller via a CLF, it is required to obtain a discrete version of the dynamic model of the RWP system [13]. Here, we use the forward difference method to obtain this discrete equivalent. In addition, it is worth mentioning that the proposed controller is based on the discrete control theory presented in [11]. It is mandatory to have the discrete version of the studied plant for developing any simulation using the approach here proposed.

Regarding the design of the discrete inverse optimal controller, it is important to highlight that: (i) it is based on the discrete version of the classical optimal control theory that works with the discrete-time Hamilton–Jacobi–Bellman (DT-HJB) equation [12], and (ii) the solution of this equation is referred to in the literature as the value function associated with a cost functional for a given discrete dynamic system [14]. Here, we use the DT-HJB solutions, which implies that this control law is optimal for the reaction wheel pendulum [11].

The remainder of this study is organized as follows: Section 2 presents the continuous dynamical formulation of the RWP and its discrete version using the forward difference method with discretization time $T_{s}$. Section 3 presents the general theory of discrete-inverse optimal control design via control Lyapunov functions, together with the proofs of its exponential stabilization capabilities and optimality properties. Section 4 presents all the numerical validations, including parametric uncertainties, variations in the control gains, and comparisons with classical nonlinear methodologies developed in the continuous domain. Section 5 presents the main conclusions derived from this research and some guidelines for possible future work.

2. Dynamical Modeling of the Reaction Wheel Pendulum

One of the most classical dynamical systems used in education is the RWP, since it allows verifying multiple linear and nonlinear control strategies that deal with nonlinearities caused by trigonometric relationships between the system variables [10]. The dynamical structure of the RWP system emerges in different applications such as transportation systems, bridge crane models, synchronous machines used in power systems, among others. A schematic bi-dimensional representation of the RWP system is depicted in Figure 1, where the main physical variables have been reported [4,15].

In the physical representation of the RWP system presented in Figure 1, we can observe the following variables: the pendulum angle $φ$ (from the vertical axis) and the angle $α$ between the pendulum and the wheel, which are measured with sensors located at each of the axes of rotation. In addition, this system has a motor coupled to the end opposite the pivot, acting on an inertia wheel, which allows the oscillations of the pendulum to be controlled through the reaction torque $τ$. The dynamic model of this system in the continuous-time domain is derived as follows.

2.1. Dynamical Model in the Continuous Domain

The first step in obtaining a discrete dynamical formulation of the RWP system is to define its continuous formulation. To do so, let us define an auxiliary variable $θ = φ + α$. With this definition, the dynamical model of the RWP system is given by (1), as recommended in [15].

\begin{matrix} \begin{matrix} \ddot{φ} & = a sin (φ) - b u, \\ \ddot{θ} & = c u, \end{matrix} \end{matrix}

(1)

where a, b, and c represent positive constants related to the physical parameters of the system, $φ$ defines the angular position of the pendulum measured from the vertical axis, and $θ$ is the angle of the reaction wheel measured from the same vertical axis.

The control input u couples the electric DC motor variables with the reaction wheel movement variables. Note that the control input represents the voltage applied to the DC motor, which is translated, through the current flowing into it, into a mechanical torque applied directly to the reaction wheel [8]. This torque moves the wheel, which in turn makes the pendulum bar move from the initial point to the desired upright position [13]. In addition, due to the structure of the nonlinear dynamical model (1), the behavior of the variable $θ$ is wholly defined as the double integral of the control input u. Moreover, this variable does not directly affect the angular position of the pendulum, since $θ$ does not appear in the first equation of (1). Consequently, for zero initial conditions, the angular speed $\dot{θ}$ and the angular position $θ$ of the reaction wheel are fully determined by the control input as follows:

\begin{matrix} \dot{θ} (t) = c \int_{0}^{t} u (τ) d τ, \\ θ (t) = c \int_{0}^{t} (\int_{0}^{z} u (τ) d τ) d z . \end{matrix}

Observe that if the control input is of class one, i.e., $u \in C^{1}$, and it is upper and lower bounded, then the angular speed of the reaction wheel, i.e., $\dot{θ}$, will tend to zero when the angular position of the pendulum and its angular speed reach the equilibrium point.

Remark 1.

Note that when the angular speed of the reaction wheel goes to zero, the angular position of this wheel, i.e., $θ$, reaches a constant value, which corresponds to its equilibrium point, since from (1) we know that ${\dot{θ}}^{⋆} = 0$ for $u^{⋆}$, and $θ^{⋆} = k$, where k is a real constant. It is worth mentioning that $x^{⋆}$ must be understood as the equilibrium point of the state variable x.

To transform the dynamical system (1) into a state-space representation, we define the following state variables: $x_{1} = φ$ and $x_{2} = {\dot{x}}_{1}$. Substituting these in (1) yields the following second-order dynamical model:

\begin{matrix} \begin{matrix} {\dot{x}}_{1} & = x_{2}, \\ {\dot{x}}_{2} & = a sin (x_{1}) - b u . \end{matrix} \end{matrix}

(2)

Remark 2.

The RWP system defined in (2) has two main equilibrium points, each of which repeats infinitely many times. These two main points are $P_{1} (x_{1}^{⋆}, x_{2}^{⋆}) = P_{1} (0, 0)$ and $P_{2} (x_{1}^{⋆}, x_{2}^{⋆}) = P_{2} (π, 0)$, which repeat every $2 k π$ in $x_{1}$, where k is an integer. However, the main interest in controlling an RWP is to maintain its upright position from any initial position, i.e., to regulate all the state variables to $P_{1}$.
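As a quick illustration of Remark 2, the open-loop character of the two equilibria can be read from the Jacobian of (2) with $u = 0$. The sketch below is not part of the original study: it assumes the value $a = 78.4$ reported later in Section 4, and the helper name `jacobian` is hypothetical.

```python
import numpy as np

A_PARAM = 78.4  # constant a, value taken from Section 4 (used here for illustration)

def jacobian(x1_eq, a=A_PARAM):
    """Jacobian of model (2) with u = 0, evaluated at the equilibrium (x1_eq, 0)."""
    return np.array([[0.0, 1.0],
                     [a * np.cos(x1_eq), 0.0]])

eig_upright = np.linalg.eigvals(jacobian(0.0))    # P1 = (0, 0)
eig_hanging = np.linalg.eigvals(jacobian(np.pi))  # P2 = (pi, 0)

print("P1 eigenvalues:", eig_upright)
print("P2 eigenvalues:", eig_hanging)
```

At $P_{1}$ the eigenvalues are $\pm \sqrt{a}$, so the upright position is an open-loop saddle and needs feedback; at $P_{2}$ they are purely imaginary, as expected for the frictionless model (2).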

2.2. Dynamical Model in the Discrete Domain

The representation of the RWP system in the discrete domain is obtained by using the classical forward difference, which gives the next step, i.e., $x_{k + 1}$, as a function of the current information of the system, i.e., $x_{k}$ [16]. This forward difference takes the following form [17,18]:

\begin{matrix} \frac{d}{d t} x = \frac{x_{k + 1} - x_{k}}{T_{s}}, \end{matrix}

(3)

where $T_{s}$ represents the discretization time.

Note that if we apply the discretization defined in (3) on (2), then, we reach the following discrete model for the RWP system:

\begin{matrix} \begin{matrix} x_{1_{k + 1}} & = T_{s} x_{2_{k}} + x_{1_{k}}, \\ x_{2_{k + 1}} & = T_{s} a sin (x_{1_{k}}) + x_{2_{k}} - T_{s} b u_{k} . \end{matrix} \end{matrix}

(4)

For the sake of compactness, the discrete dynamical model of the RWP model defined in (4) is rewritten as follows:

\begin{matrix} x_{k + 1} = f (x_{k}) + g (x_{k}) u_{k}, \end{matrix}

(5)

where $f (x_{k}) \in R^{n \times 1}$ corresponds to a vector of nonlinear functions of the state variables, $g (x_{k}) \in R^{n \times m}$ is known as the input matrix, $x_{k} \in R^{n \times 1}$ is the vector of all the state variables, and $u_{k} \in R^{m \times 1}$ is an m-dimensional vector that contains all the control inputs. It is worth mentioning that in the case of the RWP system $n = 2$ and $m = 1$, which implies that the functions in (5) take the following form:

\begin{matrix} f (x_{k}) = [\begin{matrix} T_{s} x_{2_{k}} + x_{1_{k}} \\ T_{s} a sin (x_{1_{k}}) + x_{2_{k}} \end{matrix}], g (x_{k}) = [\begin{matrix} 0 \\ - T_{s} b \end{matrix}] . \end{matrix}

(6)

To develop a discrete-inverse optimal controller via control Lyapunov functions, the compact structure of the RWP system defined in (5) is considered, as will be presented in the next section.
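The discrete model (4)–(6) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' MATLAB implementation: the constants a and b are the values reported in Section 4, the sampling time $T_{s} = 1$ ms is an assumption inferred from the statement in Section 4.1 that 375 samples correspond to 375 ms, and the function names `f`, `g`, and `step` are chosen here for convenience.

```python
import numpy as np

A_PARAM = 78.4   # a, value reported in Section 4
B_PARAM = 1.08   # b, value reported in Section 4
TS = 1e-3        # T_s in seconds; ASSUMED (1 ms), not stated explicitly in the text

def f(x, a=A_PARAM, ts=TS):
    """Drift vector f(x_k) of (6)."""
    x1, x2 = x
    return np.array([x1 + ts * x2,
                     x2 + ts * a * np.sin(x1)])

def g(x, b=B_PARAM, ts=TS):
    """Input vector g(x_k) of (6); constant for the RWP."""
    return np.array([0.0, -ts * b])

def step(x, u):
    """One forward-difference step x_{k+1} = f(x_k) + g(x_k) u_k, as in (5)."""
    return f(x) + g(x) * u

x_next = step(np.array([0.1, 0.2]), 2.0)
print(x_next)
```

Because m = 1, the input matrix is stored as a length-2 vector and the control is a scalar, which keeps the later matrix products as plain dot products.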

3. Discrete-Inverse Optimal Formulation via CLF

Using Lyapunov functions in control theory allows developing robust methodologies for controlling physical systems in continuous or discrete representations. The main advantage of using this approach is the global asymptotic stability performance under well-defined operative conditions of the system [11]. Here, we employ the discrete-inverse optimal control theory to develop a nonlinear controller applicable to the RWP system with optimal and asymptotic properties [19].

Definition 1.

(Inverse optimal control law (taken from [11])). The following control law

\begin{matrix} u_{k}^{⋆} = - \frac{1}{2} R^{- 1} g {(x_{k})}^{T} \frac{\partial V (x_{k + 1})}{\partial x_{k + 1}}, \end{matrix}

(7)

where $R^{- 1}$ is a square $m \times m$ matrix associated with the number of control inputs and $g {(x_{k})}^{T}$ is the transpose of the input matrix $g (x_{k})$, is said to be inverse optimal if:

(i): it achieves global exponential stability of the equilibrium point $x_{k} = 0$ for the discrete system (5), and
(ii): it minimizes the cost functional defined in (8)

$\begin{matrix} V (x_{k}) = \sum_{n = k}^{\infty} ( l (x_{n}) + u_{n}^{T} R u_{n} ), \end{matrix}$

(8)

with $l (x_{k}) := - \overline{V}$, where

$\begin{matrix} \overline{V} := V (x_{k + 1}) - V (x_{k}) + u_{k}^{⋆ T} R u_{k}^{⋆} . \end{matrix}$

(9)

Note that Definition 1 is based on the complete knowledge of the function $V (x_{k})$. Here, we employ a classical hyperparaboloid CLF with the structure presented in (10).

\begin{matrix} V (x_{k}) = \frac{1}{2} x_{k}^{T} P x_{k}, \end{matrix}

(10)

where P is a symmetric positive definite matrix with appropriate dimensions.

To obtain a general inverse optimal control law as defined in (7), we can substitute (10) into it, which yields:

\begin{matrix} u_{k}^{⋆} & = - \frac{1}{2} R^{- 1} g {(x_{k})}^{T} \frac{\partial V (x_{k + 1})}{\partial x_{k + 1}} \\ & = - \frac{1}{2} R^{- 1} g {(x_{k})}^{T} P x_{k + 1} \\ & = - \frac{1}{2} R^{- 1} g {(x_{k})}^{T} (P f (x_{k}) + P g (x_{k}) u_{k}^{⋆}), \\ (I + \frac{1}{2} R^{- 1} g {(x_{k})}^{T} P g (x_{k})) u_{k}^{⋆} & = - \frac{1}{2} R^{- 1} g {(x_{k})}^{T} P f (x_{k}), \end{matrix}

(11)

where I is an identity matrix with appropriate dimensions.

Now, if we define $P_{1} (x_{k}) = g {(x_{k})}^{T} P f (x_{k})$ and $P_{2} (x_{k}) = \frac{1}{2} g {(x_{k})}^{T} P g (x_{k})$, then Equation (11) generates the following inverse optimal control law:

\begin{matrix} u_{k}^{⋆} = - \frac{1}{2} {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}), \end{matrix}

(12)

where both sides of the last line of (11) have been pre-multiplied by R.

Remark 3.

The existence of the inverse optimal control law is ensured by the positive definiteness and symmetry of the $P_{2} (x_{k})$ matrix, which ensure the existence of the inverse in (12) [11].

3.1. Global Stability Test

The inverse optimal control law (12) guarantees global stability of the discrete-affine dynamic system (5) if the following inequality holds [11]:

\begin{matrix} V_{f} (x_{k}) - \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) \leq - λ_{Q} {||x_{k}||}^{2}, \end{matrix}

(13)

for some $P = P^{T} > 0$ and $λ_{Q} > 0$, where $V_{f} (x_{k}) = V (f (x_{k})) - V (x_{k})$ and $V (f (x_{k})) = \frac{1}{2} f^{T} (x_{k}) P f (x_{k})$.

To verify that condition (13) is required, let us return to the closed-loop quantity $\overline{V}$ defined in (9), where we substitute $α (x_{k}) = u_{k}^{⋆}$, which produces:

\begin{matrix} \overline{V} & = V (x_{k + 1}) - V (x_{k}) + α^{T} (x_{k}) R α (x_{k}) \\ & = \frac{f^{T} (x_{k}) P f (x_{k}) + 2 f^{T} (x_{k}) P g (x_{k}) α (x_{k})}{2} + \frac{α^{T} (x_{k}) g^{T} (x_{k}) P g (x_{k}) α (x_{k}) - x_{k}^{T} P x_{k}}{2} + α^{T} (x_{k}) R α (x_{k}) \\ & = V_{f} (x_{k}) - \frac{1}{2} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) + \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) \\ & = V_{f} (x_{k}) - \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) . \end{matrix}

(14)

Note that if P is selected such that $\overline{V} < 0$, then asymptotic stability is ensured around the equilibrium point $x_{k} = 0$. Furthermore, through P, we can impose a desired degree of negativity on the closed-loop function $\overline{V}$ in Expression (14). This can be achieved by defining a positive definite matrix Q such that:

\begin{matrix} \overline{V} & = V_{f} (x_{k}) - \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) \\ & \leq - x_{k}^{T} Q x_{k} \\ & \leq - λ_{min} (Q) {||x_{k}||}^{2} = - λ_{Q} {||x_{k}||}^{2}, \end{matrix}

(15)

where $||\cdot||$ denotes a vector norm (the Euclidean norm, for simplicity) and $λ_{Q} > 0$ is the minimum eigenvalue of the matrix Q, which implies that (15) satisfies condition (13).

Now, if we compare (14) and (15) and note that $α^{T} (x_{k}) R α (x_{k}) \geq 0$ for $R > 0$, then

\begin{matrix} \overline{V} = V (x_{k + 1}) - V (x_{k}) + α^{T} (x_{k}) R α (x_{k}) \leq - λ_{Q} {||x_{k}||}^{2} \Rightarrow V (x_{k + 1}) - V (x_{k}) \leq - λ_{Q} {||x_{k}||}^{2} . \end{matrix}

(16)

Remark 4.

Because $V (x_{k})$ is a radially unbounded function, the solution $x_{k} = 0$ of the discrete-affine system (5) with (12) is globally exponentially stable according to [19,20].
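The closed form obtained in (14) can be checked numerically against the direct definition (9). The sketch below is only an illustration: it assumes $P_{2} = \frac{1}{2} g^{T} P g$, the parameters of Section 4 with $R = 1$, a sampling time of 1 ms, and an arbitrary sampling box for the state; it does not prove the stability condition (13), it only compares the two expressions at random points.

```python
import numpy as np

A_PARAM, B_PARAM, TS = 78.4, 1.08, 1e-3   # TS = 1 ms is an ASSUMPTION
P = np.array([[12e8, 32e6],
              [32e6, 12e5]])
R = 1.0
G = np.array([0.0, -TS * B_PARAM])

def f(x):
    return np.array([x[0] + TS * x[1],
                     x[1] + TS * A_PARAM * np.sin(x[0])])

def V(x):
    """Quadratic CLF (10)."""
    return 0.5 * x @ P @ x

def control(x):
    """Unsaturated control law (12)."""
    return -0.5 * (G @ P @ f(x)) / (R + 0.5 * G @ P @ G)

def v_bar_direct(x):
    """Definition (9): V(x_{k+1}) - V(x_k) + u^T R u with u from (12)."""
    u = control(x)
    return V(f(x) + G * u) - V(x) + R * u**2

def v_bar_closed(x):
    """Last line of (14): V_f - (1/4) P_1^T (R + P_2)^{-1} P_1."""
    p1 = G @ P @ f(x)
    p2 = 0.5 * G @ P @ G
    return V(f(x)) - V(x) - 0.25 * p1**2 / (R + p2)

rng = np.random.default_rng(0)
# Arbitrary illustrative box: |x1| <= 0.2 rad, |x2| <= 2 rad/s
samples = rng.uniform(low=[-0.2, -2.0], high=[0.2, 2.0], size=(200, 2))
direct = np.array([v_bar_direct(x) for x in samples])
closed = np.array([v_bar_closed(x) for x in samples])

print("largest mismatch:", np.max(np.abs(direct - closed)))
print("largest sampled value:", np.max(direct))
```

The second printed number gives a rough indication of the sign of the closed-loop quantity over the sampled region; a negative value is what the choice of P is meant to achieve.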

3.2. Optimality Test

To demonstrate that the discrete-inverse control law $u_{k}^{⋆}$ defined in (7) is optimal, let us consider the function $l (x_{k})$ as follows:

\begin{matrix} l (x_{k}) & := - \overline{V} |_{u_{k}^{⋆} = α (x_{k})} \\ & = - V_{f} (x_{k}) + \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}), \end{matrix}

(17)

where $V (x_{k})$ is the solution of the DT-HJB equation, i.e.,

\begin{matrix} l (x_{k}) + V^{⋆} (x_{k + 1}) - V^{⋆} (x_{k}) + \frac{1}{4} \frac{\partial V^{⋆ T} (x_{k + 1})}{\partial x_{k + 1}} g (x_{k}) R^{- 1} g^{T} (x_{k}) \frac{\partial V^{⋆} (x_{k + 1})}{\partial x_{k + 1}} = 0 . \end{matrix}

To obtain the optimal value of the cost functional defined in (8), let us substitute (17) into it as follows:

\begin{matrix} V (x_{k}) & = \sum_{k = 0}^{\infty} ( l (x_{k}) + u_{k}^{T} R u_{k} ) \\ & = \sum_{k = 0}^{\infty} ( - \overline{V} + u_{k}^{T} R u_{k} ) \\ & = - \sum_{k = 0}^{\infty} [ V_{f} (x_{k}) - \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) ] + \sum_{k = 0}^{\infty} u_{k}^{T} R u_{k} . \end{matrix}

(18)

To simplify Expression (18), we insert the identity matrix in the form $(R + P_{2} (x_{k})) {(R + P_{2} (x_{k}))}^{- 1}$, which allows rewriting it as presented below.

\begin{matrix} V (x_{k}) & = - \sum_{k = 0}^{\infty} [\begin{matrix} V_{f} (x_{k}) - \frac{1}{2} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) \\ + \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{2} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) \\ + \frac{1}{4} P_{1}^{T} (x_{k}) {(R + P_{2} (x_{k}))}^{- 1} R {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k}) \end{matrix}] + \sum_{k = 0}^{\infty} u_{k}^{T} R u_{k}, \end{matrix}

(19)

Now, from (12) we know that $α (x_{k}) = - \frac{1}{2} {(R + P_{2} (x_{k}))}^{- 1} P_{1} (x_{k})$, which implies that (19) can take the following form:

\begin{matrix} V (x_{k}) & = - \sum_{k = 0}^{\infty} [\begin{matrix} V_{f} (x_{k}) + P_{1}^{T} (x_{k}) α (x_{k}) + α^{T} (x_{k}) P_{2} (x_{k}) α (x_{k}) \end{matrix}] + \sum_{k = 0}^{\infty} [\begin{matrix} u_{k}^{T} R u_{k} - α^{T} (x_{k}) R α (x_{k}) \end{matrix}] . \end{matrix}

(20)

Recalling the definitions of $V_{f} (x_{k})$, $P_{1} (x_{k})$, and $P_{2} (x_{k})$, we can simplify (20) as presented below.

\begin{matrix} V (x_{k}) = - \sum_{k = 0}^{\infty} [\begin{matrix} V (x_{k + 1}) - V (x_{k}) \end{matrix}] + \sum_{k = 0}^{\infty} [\begin{matrix} u_{k}^{T} R u_{k} - α^{T} (x_{k}) R α (x_{k}) \end{matrix}] . \end{matrix}

(21)

Note that if the upper bound of the sum is defined as N, then we obtain from (21) the following result:

\begin{matrix} V (x_{k}) = - lim_{N \to \infty} [\begin{matrix} V (x_{N}) - V (x_{0}) \end{matrix}] + \sum_{k = 0}^{\infty} [\begin{matrix} u_{k}^{T} R u_{k} - α^{T} (x_{k}) R α (x_{k}) \end{matrix}] . \end{matrix}

(22)

where we can observe that $V (x_{N}) \to 0$ when $N \to \infty$ due to the global exponential convergence presented in the previous section; this implies that Expression (22) can be written as follows:

\begin{matrix} V (x_{k}) = V (x_{0}) + \sum_{k = 0}^{\infty} [\begin{matrix} u_{k}^{T} R u_{k} - α^{T} (x_{k}) R α (x_{k}) \end{matrix}] . \end{matrix}

(23)

Remark 5.

From (23), it is possible to conclude that the minimum value of the cost functional is reached when $u_{k} = α (x_{k})$, which implies that the control law (12) is indeed optimal, since it minimizes the cost functional (8), where the optimal solution is:

\begin{matrix} V^{⋆} (x_{k}) = V (x_{0}), \end{matrix}

for all $x_{0}$.
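The telescoping argument behind (21)–(23) can be reproduced numerically: along a closed-loop trajectory driven by (12), the accumulated stage cost $l (x_{k}) + u_{k}^{T} R u_{k}$ should equal $V (x_{0}) - V (x_{N})$. The sketch below is an illustration only. It assumes $P_{2} = \frac{1}{2} g^{T} P g$, the parameters of Section 4 with $R = 1$, a sampling time of 1 ms, and it deliberately ignores the actuator limit, because the derivation above is written for the unsaturated law.

```python
import numpy as np

A_PARAM, B_PARAM, TS = 78.4, 1.08, 1e-3   # TS = 1 ms is an ASSUMPTION
P = np.array([[12e8, 32e6],
              [32e6, 12e5]])
R = 1.0
G = np.array([0.0, -TS * B_PARAM])
P2 = 0.5 * G @ P @ G                      # P_2 = (1/2) g^T P g

def f(x):
    return np.array([x[0] + TS * x[1],
                     x[1] + TS * A_PARAM * np.sin(x[0])])

def V(x):
    return 0.5 * x @ P @ x

def control(x):
    """Unsaturated control law (12)."""
    return -0.5 * (G @ P @ f(x)) / (R + P2)

def stage_cost(x, u):
    """l(x_k) + u_k^T R u_k, with l(x_k) taken from the closed form (17)."""
    p1 = G @ P @ f(x)
    l = -(V(f(x)) - V(x)) + 0.25 * p1**2 / (R + P2)
    return l + R * u**2

x = np.array([7 * np.pi / 180, 0.0])      # initial condition of Section 4.1
v0 = V(x)
total = 0.0
for _ in range(1000):                     # N = 1000 samples, as in Section 4.1
    u = control(x)
    total += stage_cost(x, u)
    x = f(x) + G * u
vN = V(x)

print("accumulated cost:", total)
print("V(x0) - V(xN):   ", v0 - vN)
```

Since $V (x_{N})$ becomes negligible after enough samples, the accumulated cost approaches $V (x_{0})$, which is the optimal value stated in Remark 5.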

4. Numerical Validations

To demonstrate the effectiveness and robustness of the proposed discrete-inverse optimal control design via control Lyapunov functions, we consider that the dynamical model (5) has the following constants: $a = 78.4 {(\frac{rad}{s})}^{2}$ and $b = 1.08 \frac{rad}{s^{2}}$; these values were taken from [7]. It is important to mention that, as recommended in [21], the magnitude of the control input, i.e., $|u_{k}|$, can be at most 10. The control gains in the P matrix have been defined using a heuristic search based on multiple simulations as follows:

\begin{matrix} P = [\begin{matrix} 12 \times 10^{8} & 32 \times 10^{6} \\ 32 \times 10^{6} & 12 \times 10^{5} \end{matrix}] . \end{matrix}
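Since (10) requires P to be symmetric and positive definite, the reported gains can be checked directly. The short sketch below is an illustration rather than part of the original study; it applies Sylvester's criterion (positive leading principal minors) and cross-checks the result with the eigenvalues.

```python
import numpy as np

# Gain matrix P reported above
P = np.array([[12e8, 32e6],
              [32e6, 12e5]])

is_symmetric = np.allclose(P, P.T)

# Sylvester's criterion for a 2x2 matrix: both leading principal minors > 0
minor_1 = P[0, 0]
minor_2 = P[0, 0] * P[1, 1] - P[0, 1] * P[1, 0]

# Cross-check: a symmetric matrix is positive definite iff all eigenvalues > 0
eigenvalues = np.linalg.eigvalsh(P)

print(is_symmetric, minor_1 > 0, minor_2 > 0)
print(eigenvalues)
```

Both minors are positive (the determinant is $1.44 \times 10^{15} - 1.024 \times 10^{15} = 4.16 \times 10^{14}$), so this P is a valid choice for the CLF (10).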

4.1. Evaluation of the Controller for Different Values of the R Gain

The effectiveness of the proposed controller is tested considering the aforementioned parameters; the simulation was run for values of the gain R in the interval between 0.50 and 2.5 in steps of 0.50. This interval was selected based on heuristic evaluations using multiple intervals of analysis, and its values cover most of the possible behaviors reached by the proposed controller when applied to the RWP system. In this case the initial conditions are $(x_{1_{0}}, x_{2_{0}}) = (\frac{7 π}{180}, 0)$. Note that these initial conditions have been selected based on the physical recommendations for the RWP system provided in reference [21], since they avoid over-saturation events in the control input.

Figure 2 reports the behavior of the angular position of the pendulum bar, its velocity, and the control input. From these results, we can observe that:

✓: The gain R highly influences the behavior of the angular position of the RWP system depicted in Figure 2a: small values of R allow the desired operating point to be reached in about 375 samples (i.e., 375 ms) (see curves for $R = 1$ and $R = 10$); however, when this parameter increases, the system exhibits oscillations around the desired point and the settling time is between 400 ms and 700 ms.
✓: The angular speed of the pendulum bar depicted in Figure 2b presents a negative acceleration in the interval from 0 ms to 300 ms. However, when the concave form of the angular position changes to a convex one, this velocity is reduced in magnitude from its negative maximum to zero. In addition, the figure shows the overshoot in the angular position produced by the R gain.
✓: For small values of the R gain, it is possible to observe two main saturations of the control input at about 10 and $- 10$ (see Figure 2). These saturations imply that the motor, coupled with the reaction wheel, is accelerated to its maximum values to reach the desired operating point in minimum settling times. When the R gain increases, the second saturation disappears, which lengthens the settling times of the state variables because the motor is no longer accelerated at its maximum.
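The experiment described above can be approximated with the following closed-loop sketch. It is not the authors' MATLAB code: the sampling time of 1 ms is inferred from the equivalence between samples and milliseconds, the saturation is implemented as a simple clip of the control law (12) to $\pm 10$, $P_{2}$ is taken as $\frac{1}{2} g^{T} P g$, and the names `control` and `simulate` are hypothetical. Results may therefore differ in detail from Figure 2.

```python
import numpy as np

A_PARAM, B_PARAM = 78.4, 1.08       # a and b from Section 4
TS = 1e-3                           # ASSUMED sampling time T_s [s]
U_MAX = 10.0                        # bound on |u_k| recommended in [21]
P = np.array([[12e8, 32e6],
              [32e6, 12e5]])
G = np.array([0.0, -TS * B_PARAM])  # input vector g(x_k) of (6)

def f(x):
    """Drift vector f(x_k) of (6)."""
    return np.array([x[0] + TS * x[1],
                     x[1] + TS * A_PARAM * np.sin(x[0])])

def control(x, r):
    """Control law (12) followed by a clip to the actuator limit (assumed)."""
    p1 = G @ P @ f(x)
    p2 = 0.5 * G @ P @ G
    u = -0.5 * p1 / (r + p2)
    return float(np.clip(u, -U_MAX, U_MAX))

def simulate(x0, r, n_steps=1000):
    """Closed-loop trajectory of (5); returns states and applied inputs."""
    xs = np.empty((n_steps + 1, 2))
    us = np.empty(n_steps)
    xs[0] = x0
    for k in range(n_steps):
        us[k] = control(xs[k], r)
        xs[k + 1] = f(xs[k]) + G * us[k]
    return xs, us

x0 = np.array([7 * np.pi / 180, 0.0])   # initial condition of Section 4.1
for r in (0.5, 1.0, 1.5, 2.0, 2.5):
    xs, us = simulate(x0, r)
    print(f"R = {r}: final x1 = {xs[-1, 0]:.3e} rad, "
          f"max |u| = {np.max(np.abs(us)):.1f}")
```

Plotting `xs[:, 0]`, `xs[:, 1]`, and `us` against the sample index gives curves that can be compared qualitatively with the three panels discussed in the list above.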

Figure 3 presents the phase portrait of the state variables and the evaluation of the Lyapunov function, which has been normalized by $V (x_{0}) = \frac{1}{2} x_{0}^{T} P x_{0}$, its value at the initial state. From Figure 3, we can observe that: (i) the phase portrait of the state variables shows that, for all the tested values of the R gain, the system is globally exponentially stabilized via discrete-inverse optimal control based on a CLF design (see Figure 3a); and (ii) the Lyapunov function presented in Figure 3b confirms that the selected gains in the control matrix P generate a positive definite matrix, which keeps the Lyapunov function always positive or zero. Additionally, it is confirmed that for $N = 1000$ samples, the final value of this function is zero, which implies that the cost functional attains its optimal value $V (x_{0})$, as demonstrated in (23).

4.2. Performance under Parametric Variations

To demonstrate that the proposed discrete-inverse optimal controller via CLF design is efficient under parametric variations, we evaluate variations in the a and b parameters from 50% to 150% of their rated values in steps of 25%. In this simulation, it is assumed that the R gain is set to 1. Note that in this simulation case, the controller has been tuned with the parameters a and b at their nominal values, i.e., 100% of these.

Results in Figure 4 confirm that the proposed discrete-inverse optimal controller reaches global asymptotic stability in the angular position of the pendulum bar regardless of the parametric variations of the constants a and b. However, for values lower than 100% of these, the settling time increases, while for values larger than this bound, this time is reduced. This situation is attributable to the multiplicative effect that the gain b has on the control input, as can be seen in the discrete model (4), which can attenuate or amplify the effect of the control input in the closed-loop dynamics of the RWP system.
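A sweep of this kind can be sketched as follows. The text does not say whether a and b are varied together or one at a time; the sketch ASSUMES that both plant parameters are scaled by the same factor while the controller keeps the nominal values. It further assumes $R = 1$, a sampling time of 1 ms, a clip of the control input to $\pm 10$, and $P_{2} = \frac{1}{2} g^{T} P g$. The function name `simulate_mismatch` is hypothetical, and the numbers it produces are not the data behind Figure 4.

```python
import numpy as np

A_NOM, B_NOM = 78.4, 1.08           # nominal a and b used by the controller
TS = 1e-3                           # ASSUMED sampling time T_s [s]
U_MAX, R = 10.0, 1.0
P = np.array([[12e8, 32e6],
              [32e6, 12e5]])
G_NOM = np.array([0.0, -TS * B_NOM])

def f_nominal(x):
    """Nominal drift vector used inside the control law."""
    return np.array([x[0] + TS * x[1],
                     x[1] + TS * A_NOM * np.sin(x[0])])

def control(x):
    """Law (12) built with nominal parameters, clipped to the actuator limit."""
    p1 = G_NOM @ P @ f_nominal(x)
    p2 = 0.5 * G_NOM @ P @ G_NOM
    return float(np.clip(-0.5 * p1 / (R + p2), -U_MAX, U_MAX))

def simulate_mismatch(scale, n_steps=1000):
    """Plant (4) with a and b both scaled by `scale`; controller unchanged."""
    a_plant, b_plant = scale * A_NOM, scale * B_NOM
    x = np.array([7 * np.pi / 180, 0.0])
    history = [x.copy()]
    for _ in range(n_steps):
        u = control(x)
        x = np.array([x[0] + TS * x[1],
                      x[1] + TS * a_plant * np.sin(x[0]) - TS * b_plant * u])
        history.append(x.copy())
    return np.array(history)

for scale in (0.5, 0.75, 1.0, 1.25, 1.5):
    xs = simulate_mismatch(scale)
    print(f"{int(scale * 100)}%: final x1 = {xs[-1, 0]:.3e} rad")
```

Scaling a alone upward would let the gravity term exceed the saturated torque at the initial angle, which is one reason the joint scaling was chosen for this illustration.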

4.3. Comparison with Nonlinear Controllers

Here, the proposed inverse optimal control via CLF is compared with a nonlinear controller based on the direct Lyapunov control proposed in [4]. The structure of this control law is presented below:

\begin{matrix} u_{k} = \frac{1}{T_{s} b} (k_{1} x_{1_{k}} + k_{2} x_{2_{k}} + 2 a sin (x_{1_{k}})), \end{matrix}

(24)

where $k_{1}$ and $k_{2}$ are set to 3500 and 135, respectively. In addition, the proposed inverse optimal control is also compared with the nonlinear passivity-based controller proposed in [5], which has the following control law:

\begin{matrix} u_{k} = \frac{1}{T_{s} b} (- j_{1} α x_{1_{k}} + r_{2} x_{2_{k}} + a sin (x_{1_{k}})), \end{matrix}

(25)

where $j_{1} = - 1$, $α = 3500$, and $r_{2} = 135$, which are selected to make it comparable with the Lyapunov-based design.

It is important to mention that all three controllers defined in (12), (24), and (25) are based on Lyapunov stability theory, which implies that all of them have global asymptotic stability properties in closed-loop operation. In addition, it is possible to observe that all of them have a very similar control law, which is composed of linear feedback of the states $x_{1_{k}}$ and $x_{2_{k}}$ and the nonlinear effect of the sinusoidal function weighted by a constant [22].

Figure 5 presents the comparison between the proposed inverse optimal controller, the Lyapunov-based design reported in [4], and the passivity-based approach reported in [5].

From Figure 5, we can state that the Lyapunov-based and the passivity-based approaches have the same numerical performance, since the angular positions overlap for both controllers. Furthermore, these controllers take about 470 ms to settle around the reference. In comparison, the proposed inverse optimal control approach outperforms its counterparts, reaching the reference signal in about 370 ms. It is worth mentioning that the comparative approaches overshoot the reference signal, which implies that some oscillations in the vertical position are experienced. In contrast, the proposed method does not present this behavior, which confirms its efficiency with respect to powerful and well-known nonlinear approaches.

5. Conclusions and Future Works

This research addressed the global stabilization of a reaction wheel pendulum using the discrete-affine dynamical model with two state variables via control Lyapunov functions. The main advantages of the proposed control design are the following:

The feedback control law $u_{k}^{⋆}$ guarantees exponentially and asymptotically stable behavior and can work under parametric uncertainties without compromising convergence to the equilibrium point, with settling times lower than 700 ms.
The control function $u_{k}^{⋆}$ is indeed an optimal signal, since it minimizes the cost functional and makes it reach its global optimum value $V^{⋆} (x_{k}) = \frac{1}{2} x_{0}^{T} P x_{0}$, with settling times between 370 ms and 700 ms.

Comparing the proposed controller with classical nonlinear controllers has demonstrated that the studied discrete-inverse optimal controller via control Lyapunov functions can achieve global stabilization of the RWP system with exponential convergence. Depending on the selection of the R gain and the values of the P matrix, this is an advantage over the classical passivity-based and Lyapunov-based controllers, since it makes it possible to reach the equilibrium point with lower settling times.

As future work, the following developments could be considered: (i) to extend the formulation of the discrete-inverse optimal control via control Lyapunov functions to power electronic converters that integrate renewables and battery energy storage systems, and (ii) to apply the proposed controller to reduce frequency oscillations in power systems by improving controllers in synchronous machines.

Author Contributions

Conceptualization, O.D.M., W.G.-G., J.A.D.-J., A.M.-C., and D.A.G.-R.; methodology, O.D.M., W.G.-G., J.A.D.-J., A.M.-C., and D.A.G.-R.; formal analysis, O.D.M., W.G.-G., J.A.D.-J., A.M.-C., and D.A.G.-R.; investigation, O.D.M., W.G.-G., J.A.D.-J., A.M.-C., and D.A.G.-R.; resources, O.D.M., W.G.-G., J.A.D.-J., A.M.-C., and D.A.G.-R.; writing—original draft preparation, O.D.M., W.G.-G., J.A.D.-J., A.M.-C., and D.A.G.-R. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the National Scholarship Program Doctorates of the Administrative Department of Science, Technology, and Innovation of Colombia (COLCIENCIAS), through call 727-2015.

Acknowledgments

The authors thank the Vicerrectoria de Investigación, Innovación y Extensión of Universidad Tecnológica de Pereira for the support given to this investigation.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Isidori, A. Nonlinear Control Systems; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013.
2. Iqbal, J.; Ullah, M.; Khan, S.G.; Khelifa, B.; Ćuković, S. Nonlinear control systems-A brief overview of historical and recent advances. Nonlinear Eng. 2017, 6, 301–312.
3. Lu, Q.; Sun, Y.; Mei, S. Nonlinear Control Systems and Power System Dynamics; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013; Volume 10.
4. Montoya, O.D.; Gil-González, W. Nonlinear analysis and control of a reaction wheel pendulum: Lyapunov-based approach. Eng. Sci. Technol. Int. J. 2020, 23, 21–29.
5. Montoya, O.D.; Garrido, V.M.; Gil-González, W.; Orozco-Henao, C. Passivity-Based Control Applied of a Reaction Wheel Pendulum: An IDA-PBC Approach. In Proceedings of the 2019 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC), Ixtapa, Mexico, 13–15 November 2019; pp. 1–6.
6. Olivares, M.; Albertos, P. Linear control of the flywheel inverted pendulum. ISA Trans. 2014, 53, 1396–1403.
7. Correa-Ramírez, V.D.; Giraldo-Buitrago, D.; Escobar-Mejía, A. Fuzzy control of an inverted pendulum Driven by a reaction wheel using a trajectory tracking scheme. TecnoLogicas 2017, 20, 57–69.
8. Spong, M.W.; Corke, P.; Lozano, R. Nonlinear control of the Reaction Wheel Pendulum. Automatica 2001, 37, 1845–1851.
9. Baimukashev, D.; Sandibay, N.; Rakhim, B.; Varol, H.A.; Rubagotti, M. Deep Learning-Based Approximate Optimal Control of a Reaction-Wheel-Actuated Spherical Inverted Pendulum. In Proceedings of the 2020 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM), Boston, MA, USA, 6–9 July 2020; pp. 1322–1328.
10. Montoya, O.D.; Gil-González, W.; Ramírez-Vanegas, C. Discrete-Inverse Optimal Control Applied to the Ball and Beam Dynamical System: A Passivity-Based Control Approach. Symmetry 2020, 12, 1359.
11. Sanchez, E.N.; Ornelas-Tellez, F. Discrete-Time Inverse Optimal Control for Nonlinear Systems; CRC Press Taylor and Francis Group: Boca Raton, FL, USA, 2017.
12. Ornelas, F.; Sanchez, E.N.; Loukianov, A.G. Discrete-time inverse optimal control for nonlinear systems trajectory tracking. In Proceedings of the 49th IEEE Conference on Decision and Control (CDC), Atlanta, GA, USA, 15–17 December 2010.
13. Montoya, O.D.; Gil-González, W.; Serra, F.M. Discrete-time inverse optimal control for a reaction wheel pendulum: A passivity-based control approach. Rev. UIS Ing. 2020, 19, 123–132.
14. Ohsawa, T.; Bloch, A.M.; Leok, M. Discrete Hamilton-Jacobi Theory. SIAM J. Control Optim. 2011, 49, 1829–1856.
15. Block, D.J.; Åström, K.J.; Spong, M.W. The reaction wheel pendulum. Synth. Lect. Control Mechatron. 2007, 1, 1–105.
16. Atkinson, C.; Osseiran, A. Discrete-space time-fractional processes. Fract. Calc. Appl. Anal. 2011, 14.
17. Owolabi, K.M.; Atangana, A. Finite Difference Approximations. In Numerical Methods for Fractional Differentiation; Springer: Singapore, 2019; pp. 83–137.
18. Sun, J.; Cheng, X.L. Iterative methods for a forward-backward heat equation in two-dimension. Appl. Math.-A J. Chin. Univ. 2010, 25, 101–111.
19. Keadnarmol, P.; Rojsiraphisal, T. Globally exponential stability of a certain neutral differential equation with time-varying delays. Adv. Differ. Equ. 2014, 2014.
20. Teel, A.R.; Forni, F.; Zaccarian, L. Lyapunov-Based Sufficient Conditions for Exponential Stability in Hybrid Systems. IEEE Trans. Autom. Control 2013, 58, 1591–1596.
21. Valenzuela, J.G.; Montoya, O.D.; Giraldo-Buitrago, D. Local Control of Reaction Wheel Pendulum Using Fuzzy Logic. Sci. Tech. 2013, 18, 623–632.
22. Sanfelice, R.G. On the Existence of Control Lyapunov Functions and State-Feedback Laws for Hybrid Systems. IEEE Trans. Autom. Control 2013, 58, 3242–3248.

Figure 1. Schematic representation of a reaction wheel pendulum (taken from [4]).

Figure 2. Dynamical behavior of the reaction wheel pendulum (RWP) for different values of the gain R: (a) angular position of the pendulum bar, (b) angular speed of the pendulum bar, and (c) control input.

Figure 3. Phase-portrait and Lyapunov function behaviors of the RWP for different values of the gain R: (a) Phase portrait and (b) Lyapunov function.

Figure 4. Angular position of the pendulum bar for parametric variations in a and b constants.

Figure 5. Behavior of the angle of the pendulum bar when comparing the proposed inverse optimal control with the Lyapunov-based and the passivity-based approaches.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
