1. Introduction
Sliding mode control is a widely recognized and robust control method. For applications in robot manipulators, it has been developed over the past four decades [
1,
2,
3,
4,
5,
6]. In such a design, robustness against parametric uncertainties and unknown disturbances is achieved using a nonlinear switching control law. According to a review of such designs [
7,
8], early sliding mode control designs mostly operate under the assumption that full state information is available. However, because of technical or economic constraints, this assumption does not hold in actual settings; only output measurement is available in real applications. Therefore, developing output feedback sliding mode control designs is necessary.
Conventionally, the sliding surface is constructed as a static function of the output. To guarantee the existence of the control law, geometric conditions were given in [
9] and relaxed in [
10]. In [
11,
12,
13], different sliding surfaces structured using a linear matrix inequality technique were used to synthesize the static output feedback control law. An artificial time delay was introduced in the output feedback control law in [
14] to relax a stability assumption. Recently, the system structure has been reconsidered using the super-twisted algorithm with variable gain, and the first-order approximation filter has been used to estimate the upper bound for the norm of a partially unmeasured state vector [
15]. Most related approaches are hampered by the constraints applied to relative-degree-one systems only.
In output feedback control, a key solution to such problems is the observer-based structure. In this design, the required state information of the control system is estimated by observers or differentiators. For a system with unknown disturbances or uncertainties, constructing a robust observer or differentiator that can provide exact state estimations is the first challenge to be overcome. Related studies aiming to address this matter have reported remarkable results. In [
16], a reduced-order observer whose dynamics were decoupled from unknown disturbances was proposed. A sliding mode observer was proposed in [
17] for the state estimation of systems subject to unknown disturbances. However, the system must still be a relative-degree-one system. The relative degree constraint was relaxed in [
18,
19,
20] by using high-order sliding mode differentiators or high-gain observers. High-gain observer-based sliding mode control [
21,
22] can be used for multivariable nonlinear systems with stable zero dynamics, and a higher order sliding mode differentiator can be used to estimate the required output derivatives [
23,
24,
25].
In the observer-based approach, a design branch has focused on combining sliding mode control with the standard model reference adaptive control formulation [
26]. With this formulation, the design of output feedback sliding mode control for relative-degree-one systems is straightforward [
27,
28]. When the system relative degree is larger than one, the design becomes complicated, with too many filters required [
29,
30]. Some researchers have employed simpler approaches by using small-parameter filtering [
31], a high-gain observer [
32], a lead filter design [
33], or a higher order sliding mode differentiator [
34,
35] to estimate the time derivatives of the system output. With the output derivatives available, the difficulty of the control problem for a system with a relative degree larger than one is reduced to that for a relative-degree-one system.
In this paper, a novel design was proposed for observer-based output feedback sliding mode control. The proposed design relies on a robust observer, termed the loop transfer recovery (LTR) observer. The LTR observer [
36] was originally utilized to recover the robustness properties, such as the phase margin and gain margin, of linear quadratic Gaussian (LQG) control [
37]. This paper exploited the novel advantage of having an LTR observer, namely, the state estimation error of the LTR observer can be arbitrarily small with respect to the state- and input-dependent system uncertainties. The resultant LTR observer-based output feedback sliding mode control drives the system state into a small residual set around the origin, with the size of the residual set controlled by a design parameter in the observer Riccati equation.
Most other output feedback sliding mode control designs constrain unknown uncertainties, thus depending on the system output or time [
9,
11,
12,
13,
14,
16,
17,
18,
19,
20,
21,
22,
23,
24,
28,
29,
30,
32,
33,
34]. However, state-dependent uncertainties are prevalent in mechanical systems [
38], electronic circuits [
39], and robot manipulators [
6]; dealing with such uncertainties has become a pertinent research topic [
15,
35,
38]. First, in this paper, unknown disturbances were further extended to input- and state-dependent disturbances, which are the most widely encountered related research. Such an endeavor is considerably challenging because such uncertainties are potentially unbounded, and the input-dependent uncertainties further complicate analysis when sliding mode control is used. Second, the output feedback sliding mode control method detailed in this paper offered global stability, whereas designs [
23,
24,
25] based on higher order sliding mode differentiators are only locally or semiglobally stable. Third, the proposed sliding mode control design was intuitively and structurally simpler than the variable-structure model reference of adaptive control [
27,
28,
29,
30,
31,
32,
33,
34,
35], high-gain observer sliding mode control [
21,
22], the super-twisted algorithm [
15], and the design based on a higher order sliding mode differentiator [
23,
24,
25]; hence, it can be easily accepted by control engineers with only fundamental control knowledge. In [
15,
34,
35], multiple estimators were employed to guarantee global stability. By contrast, in they present study, only one Luenberger-type observer was used. Fourth, the approach described in this paper required minimum system information. The proposed design did not impose relative-degree-one constraints on the original controlled system structure as in [
9,
10,
11,
13,
14,
15,
16,
17,
20,
28,
29] or require a priori knowledge of the system’s relative degree and upper bound of disturbance as in [
27,
28,
29,
30,
31,
32,
33,
34,
35]. Furthermore, herein, assumptions on disturbance in [
10,
13] and the system structure in [
10,
11,
13,
14,
21,
22,
23,
40] were waived, and the design procedure proposed did not require any state transformation. The most general system structure was considered in this paper; the application of the related approach was therefore straightforward. In the application in a robot manipulator, the proposed design used only joint position and input torque command, whereas the information of system matrices in the dynamic equation [
3,
4] or the estimation of system matrices [
5] were requested in the control design. Moreover, the stability analysis in this paper was independent with the matrices in the governing equation of robot dynamics, while [
6] required the derivative of the state-dependent system matrix to prove the controller stability. In this paper, a given vector
,
denotes the usual Euclidean two-norm vector.
2. State Feedback Sliding Mode Control
Consider a linear multiple-input multiple-output (MIMO) system subject to unknown, but matched system uncertainties.
where
is the system state,
is the control input,
is the system output,
are nominal system matrices with proper dimensions,
is an unknown time-varying disturbance satisfying a matching condition [
41],
contains parametric uncertainty, and
is the uncertainty in the control gain matrix. The uncertainties, which may be arbitrarily time-varying and state- and input-dependent, satisfy the upper bounds:
This condition on
is more stringent than the Hurwitz condition in [
42], but the control in [
42] applies to relative-degree-one systems only. The system matrix
A may be stable or unstable; the matrix pair
was assumed to be controllable;
was assumed to be observable; the number of inputs was equal to that of the outputs; and the nominal system
was assumed to be minimum phase (all finite zeros are stable). System state
x was not accessible for measurement, and the only accessible signals were control input
u and system output
y. The control objective was to drive system output
y to track, in the face of system uncertainty
, desired bounded reference output
as closely as possible.
Remark 1. In most previous research on output feedback sliding mode control, uncertainty was assumed to be a function of disturbance and output, ; by contrast, in the proposed approach, uncertainty was allowed to be a function of state, input, and external disturbance: . This lumped uncertainty was potentially unbounded; the boundedness was uncertain when the stability of the control system (1) was not conclusively demonstrated. The first step of control design is to transform the tracking control problem into a stabilizing control problem. This can be achieved by establishing a reference model for the bounded reference output
as follows:
where the reference state
and the control command
are all bounded but unavailable and the reference output
is available for measurement. The system matrix
is stable for some stabilizing state feedback gain
K (see the control law (
7) below). The aforementioned reference model was temporarily used for analysis purposes. It can be written as:
where
is a bounded signal. Defining the error signals
,
and subtracting (
4) from (
1) yield:
where the uncertainty becomes:
in which
is potentially unbounded prior to obtaining the proof of the stability of the proposed control design, and
is unknown but bounded. If a stabilizing control design
u can be constructed to drive the error state
in (
5) to zero, the tracking mission is achieved. Thus, the tracking control problem for the system (
1) is transformed into a stabilizing control problem for the error system (
5).
In this section, a state feedback sliding mode control approach was reviewed for this error system (
5) (assuming
was accessible for measurement). In the next section, an observer-based output feedback sliding mode control design is presented (assuming only
was accessible for measurement). A state feedback sliding mode control design for the error system was constructed as follows.
where
is the state feedback gain that places the eigenvalues of
in the open left-half plane. As the nominal closed-loop system matrix
is stable, a positive definite matrix
exists, satisfying the Lyapunov equation (Chapter 5.4 in [
43]),
where
can be any small positive parameter and appears in the equation only for the purpose of conducting a theoretical analysis of closed-loop stability. Without loss of generality, it can be assumed that
. In the control law (
7),
is the sliding variable:
where
P is obtained from the Lyapunov Equation (
8) and
is a positive, time-varying control gain:
where
is an upper bound of
. Conducting stability analysis of the aforementioned state feedback sliding mode control approach (
7) based on a Lyapunov function
has become standard (details can be found in Chapter 14.2 in [
44]). State feedback sliding mode control (
7) can asymptotically drive the error state
to zero despite system uncertainties. Thus, the tracking mission of the original system was achieved.
Remark 2. In the reference model of (4), the structure naturally constrains the reference output to be sufficiently smooth, and the signal should have bounded derivatives for at least n-times differentiations. This property can be explicitly addressed if the reference model (4) is constructed in observable canonical form. 3. Robust Loop Transfer Recovery Observer
In the previous section, state feedback tracking sliding mode control was reviewed. In many real-world applications, full state information is unavailable, and the only accessible signal is the system output. It is therefore desirable to devise a sliding mode control approach that relies on output measurement only. First, a robust observer was constructed to obtain an estimate
of the error state
for the error system (
5). Subsequently, all of state
in the state feedback sliding mode control law (
7) was replaced by state estimate
.
The robust observer proposed in this paper was the LTR observer. The original purpose of the LTR observer [
36] was to recover the loop transfer function of the output feedback LQG control to that of the state feedback LQ control. Consequently, favorable and robust properties such as a large gain margin and phase margin were achieved against unstructured and unmodeled dynamics. However, related studies on the LTR observer have not revealed the upper bounds of the estimation errors when system uncertainties are present. In this paper, an upper bound of the state estimation error was explicitly derived. On the basis of this upper bound, the LTR observer can be used for robust sliding mode control of uncertain systems.
Consider the system (
5) with accessible output
but inaccessible state
. The LTR observer had the usual structure of a Luenberger observer and was constructed as follows:
where
is an estimation of the system state
and
is the solution matrix of the observer Riccati equation:
where
, and
are scalar design parameters. The initial condition of the observer,
, can be of any finite value and does not affect the estimation error stability. Normally,
is chosen. In the LTR observer design,
was used to control the convergent speed of the state estimation error, and a value for
that was sufficiently large was selected to ensure a small estimation error with respect to system uncertainty
e.
The following lemma presents a well-known result in the study of LTR observers. Because the system was a minimum-phase system, a positive number existed such that was also a minimum phase system.
Lemma 1. If is the minimum phase, then the solution Q for the Riccati Equation (12) satisfies . On the basis of Lemma 1, the theorem that follows proved that the LTR observer obtained only a small estimation error with respect to system uncertainty
e in (
5). The theorem enabled the derivation of an explicit upper bound of the state estimation error. According to this upper bound, the larger the design parameter
in the observer Riccati Equation (
12) is, the smaller the state estimation error is. This theorem was a novel addition to the literature. Previous research on the LTR observer [
36,
46] has only detailed the loop transfer function recovery property of the LTR observer, without an explicit upper bound of the estimation error.
Theorem 1. The LTR observer (11) asymptotically (as time approached infinity) achieved a small estimation error for the error system (5) in the sense that:where:in which denotes the maximum singular value. Proof. Define the state estimation error
. From (
5) and (
11), the dynamics of
is given as:
Choose the Lyapunov function
, where
Q is the solution matrix from the Riccati Equation (
12). Checking the time derivative of
V along the trajectory of
gives:
In the aforementioned equation, the maximum of the last two terms occurs when
, with the maximum being
. Hence,
The last inequality shows that
V decreases exponentially as long as
; thus, it can be concluded that
as time approaches infinity. The use of
produces:
where:
and the last inequality is obtained by substitution of the definition of
(
10).
In the definition provided for , was fixed in the design. Hence, according to Lemma 1, became arbitrarily small as the selected design parameter became sufficiently large. This proves the theorem. □
Remark 3. In Theorem 1, the estimation error was bounded by the control signal and system state vector ; therefore, it can only be temporarily concluded that the estimation error can be potentially unbounded in this stage. Components u and have the potential to be unbounded because no conclusion on boundedness was made, and the convergency of the state estimation should be examined after stability analysis for the controlled system (5) has been completed. Remark 4. Previously, many researchers used a high-gain observer (see Chapter 14.5 of [44]) to estimate the system state in their sliding mode control design. However, the high-gain observer is subject to the constraint that the system must have no zero dynamics; that is, the system relative degree is equal to the system dimension. As a result, when the high-gain observer is applied to the sliding mode control design for systems with an arbitrary relative degree [21,22,32], the high-gain observer can estimate only the partial state (the external state) of the system, while the LTR observer in this paper can estimate the full state of the system, providing more monitoring information of the system. Remark 5. In (11), the design of the robust observer has the usual structure of the Luenberger observer, which is an intuitive and simple design of a linear state observer. By contrast, for example, the higher order sliding mode differentiator follows the nonlinear recursive algorithm [23]:where is an estimation of the ith-order derivative of , and is a function of i and L, in which L stands for the known Lipschitz constant of the nth derivative of . Comparing (11) and (19), it is seen that a higher order sliding mode differentiator (19) has a more complex structure and requires more computation effort to estimate the desired system information. 4. Output Feedback Sliding Mode Control
As detailed in the previous section, a state estimate
was obtained from the LTR observer (
11). Next, the observer-based output feedback tracking sliding mode control can be established by replacing
in the state feedback control law (
7) with its estimate
. The resultant output feedback sliding mode control equation is as follows:
where the estimated sliding variable is:
and the switching control gain is chosen as:
with:
in which
and
are two positive constants to be defined. The choice of constants
and
in (
22) is different from that in (
10).
Given the uncertainty upper bounds
,
, the norm of state feedback gain
, and a sufficiently small
in Theorem 1, a constant
exists such that:
The aforementioned inequality ensures that the denominator in (
23) is positive. Given
as chosen, define another constant
:
The virtual constants
and
were used only in the following closed-loop stability analysis for the MIMO system (
1). The stability analysis was nontrivial for the MIMO system. In the case that the system is a single-input single-output system, the stability analysis can be considerably simplified, without the need to define the constants
and
.
To simplify the equation derivations in the stability analysis, the following lemma should be constructed in advance.
Lemma 2. The following inequalities hold for the system (1) and sliding mode control (20). - (i)
.
- (ii)
.
- (iii)
for some positive constants and .
- (iv)
for some positive constants and ,
Proof. (i) The control law (
20) suggests that:
One continues to derive:
where the first inequality follows from Theorem 1, the second follows from (
26), the third follows from the inequality
and (
22), and the last is obtained by rearranging the equation;
- (ii)
With the upper bound in (i), an upper bound for
can be derived,
where the second inequality results from (
5) and (
13), the third inequality follows from (
26), the fourth inequality follows from the inequality
and (
22), the fifth inequality results from Part (i) of the lemma, and
is as defined in (
24). The last inequality is derived using the inequalities in (
22) and (
24);
- (iii)
In this section, an alternative upper bound is presented for the estimation error
:
with
In the aforementioned equations, the first inequality follows from Theorem 1, the second follows from (
26), the third follows from the inequality
, and the last is obtained by rearranging the equation;
- (iv)
According to (
22),
where the second inequality is derived from
and the last inequality is obtained from Part (iii) of the lemma. Thus, Part (iv) is proven with
and
. □
Before closed-loop stability was proven, the system uncertainty
in (
13) was potentially unbounded. Hence, the state estimation error
in (
13) was also potentially unbounded. Determining whether the proposed observer-based state feedback tracking sliding mode control (
20) was stabilizing required a rigorous stability analysis. This was accomplished using the main theorem that follows.
Theorem 2. The LTR observer-based output feedback tracking sliding mode control (20) stabilized an uncertain system (1) in that the system state was driven into an arbitrarily small residue set around the origin if the design parameter ξ in the observer Riccati Equation (12) was sufficiently large. Proof. Choose a Lyapunov function candidate
for the controlled error system (
5), where the matrix
P is from the Lyapunov Equation (
8). Taking the time derivative of
W along the trajectory of the error system (
5) controlled by the output feedback sliding mode control (
20) yields:
where
e is the lumped system uncertainty
. With
, two possible cases may exist in the above equation.
Case 1 (reaching mode):
, where
is from (
25). In this case, using the two inequalities
and
can produce:
where
from (
25). The inequality (
29) then becomes:
where the second inequality is obtained from Part (ii) of Lemma 2. It can be concluded that in this case, the Lyapunov function
W in (
30) is strictly decreasing. Moreover, the sliding variable
fulfills the relation:
where
is the maximum singular value of the square matrix
and
is the minimum singular value of matrix
P. Combining (
30) and (
31) revealed that the absolute value of the sliding variable
was exponentially decreasing; in this case, the reaching law can be concluded as:
The statement implies that the sliding variable reaches , the boundary layer of the sliding surface .
Case 2 (sliding mode):
. In this case, the system state was restricted in the boundary layer of the sliding surface, and the following can be derived from (
29):
where the first inequality results from Part (ii) of Lemma 2; the second inequality results from the hypothesis of this case; and
, with
being the maximum eigenvalue of matrix
P. Equation (
32) shows that if
, then
W decays exponentially. Hence, the following is eventually obtained:
It follows from the definitions of sliding variables in (
9) and (
21) that
. With the inequality
, where
is the minimum eigenvalue of
P, the above equation can be written as:
where:
the second inequality results from Parts (iii) and (iv) of Lemma 2, and in the last inequality,
and
. Taking the square root of both sides of the last inequality yields:
where the second inequality is obtained by rearranging the first inequality. Because according to Theorem 1,
, this ensured that the denominator in the above equation was always positive if the design parameter
in the LTR observer was sufficiently large. Because
also appears in the numerator, Equation (
33) proves that, eventually, the error dynamic
converges to a small residual set around the origin, with the size of the residual set controlled by the design parameter
in the LTR observer design; thus,
and
are arbitrarily close. Furthermore, the result of (
33) guaranteed that the estimation error
in Theorem 1 was made arbitrarily small using Part (iii) of Lemma 2, and the state estimation
was bounded and made arbitrarily small by the relation
. Finally, the control signal
u in (
20) remained uniformly bounded because all components in it were bounded. □
The following section illustrates the effectiveness of the proposed output feedback sliding mode control scheme (
20).
5. Application Example
A two-degree-of-freedom robotic manipulator studied in [
4] was used here to illustrate the efficiency of the proposed control design. The dynamic equation of the two-link robot is defined as:
where
is the joint position and
and
are velocity and acceleration vectors;
,
,
, and
F are the inertia matrix, Coriolis matrix, gravity matrix, and frictional matrix with proper dimensions;
is the input torque vector. On the basis of the state-space realization proposed in [
4], system matrices in the error dynamic (
5) are given as:
with
, and the desired reference trajectory was assigned as
. In the output feedback tracking sliding mode control algorithm, the design parameters in the observer Riccati Equation (
12) were
,
, and
. According to (
18) and (
33),
should be sufficiently large to guarantee estimating and tracking performance. The control law (
20) was constructed with a state feedback gain:
and positive constants were assigned as
in (
8) and
in (
22) to achieve the desired convergent speed of (
30) and to satisfy (
23). Herein, the torque command was given as
in (
20) without the knowledge of the system matrices
,
,
, and
F in (
34), and the unknown parameters were considered as the state- and input-dependent disturbances in the error dynamic (
5).
Figure 1 shows the tracking performance of the controlled system. Because the initial condition of state estimation
was set as zero, coinciding with the system error
, the control system reached the residue set (
33) from the very beginning of the simulation. Moreover, the parameters
in (
12) and
in (
22) were properly chosen; therefore, the system state was constrained in a sufficiently small residue set of error (
33), and the two outputs precisely tracked their references; in
Figure 1, it is seen the tracking error was a constant from the very initial time. The control signals are shown in
Figure 2, and
Figure 3 presents the sliding variable. It was seen that the control input
u rapidly peaked in the very initial time interval; the actuators might be saturated when the control design was implemented. Moreover, because a nonlinear switching function was used in the control law, both control signals were subject to a high-frequency oscillation when the sliding variable
in
Figure 3 chattered around
, which is harmful for actuators in the real world.
To avoid the occurrence of the peaking and chattering phenomenon, the control design (
20) was modified as:
where the discontinuous function
was replaced by the boundary layer [
47] control
and the constants in the switching control gain (
22) were scheduled as
. The system performance of the modified control design is depicted in
Figure 4, and
Figure 5 shows the control signals. In
Figure 4, the references were reached more slowly since the switching gain in (
22) was scheduled with time. When the constants
and
in (
22) were small, the switching gain was incapable of completely suppressing the unknown disturbance, and the tracking performance was degraded; when the switching gain was increasing, the robustness of the control design was improved, and the references were therefore reached. With the modified control design, the control signals
u exhibited no peaking phenomenon; the chattering was efficiently removed, thus protecting the actuators. It is worth noting that the boundary layer design [
47] was sensitive to measurement noise [
48], and an alternative approach to chattering reduction is to use dynamic sliding mode control [
49], which can eliminate chattering, especially in noisy real-world environments. Moreover, the dynamic sliding mode control [
49] was capable of alleviating the rapid spikes in control design since the control input was an integrated result.
In order to provide a more complete set of validation experiments, the reference signal was extended as:
and the control design (
35) was applied with the switching gains
and
in (
22). The tracking performance is depicted in
Figure 6, and
Figure 7 presents the two control signals. In
Figure 6, the desired references had discontinuities at
, and system outputs, especially
, had a sudden jump at these moments. Likewise, the control signals in
Figure 7 inevitably peaked at the time of discontinuities even if the smooth boundary layer control design (
35) was implemented. According to (
33),
when
was sufficiently large, and
according to (
13). Consequently,
, and the steady-state of control input (
35) was approximated as:
In
Figure 8, the time history of the sliding variable is depicted, and the detail of control signal
u is shown in
Figure 9. In these figures, the approximation of the control signal (
36) was confirmed with
.
Remark 6. According to (34), it was straightforward to derive the steady-state of input torque signal as:when the desired reference was constant. However, the system matrix was assumed to be uncertain, and a robust control design (35) was proposed to suppress the unknown nonlinearity. In the sequel, the steady-state of control input (36) therefore depended on the switching gain (22) assigned by the user as in (36).