Article

Model Predictive Control for Pneumatic Manipulator via Receding-Horizon-Based Extended State Observers

1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
2 School of Artificial Intelligence, Tiangong University, Tianjin 300387, China
3 Xingyu Electronics (Ningbo) Co., Ltd., Ningbo 315500, China
4 Tianjin Key Laboratory of Intelligent Unmanned Swarm Technology and System, Tianjin 300072, China
* Author to whom correspondence should be addressed.
Actuators 2025, 14(7), 343; https://doi.org/10.3390/act14070343
Submission received: 13 May 2025 / Revised: 4 July 2025 / Accepted: 8 July 2025 / Published: 10 July 2025
(This article belongs to the Special Issue Actuators in Robotic Control—3rd Edition)

Abstract

This paper presents a model predictive control (MPC)-enabled disturbance-rejection control approach for a pneumatic manipulator system subject to complex nonlinear terms. To facilitate their handling, these nonlinear terms are modeled as disturbances. To address these disturbances, a receding-horizon-based extended state observer (RH-ESO) incorporating a decision variable is developed, in which the optimal disturbance estimation error is determined through a receding-horizon optimization procedure that provides the best estimate of the disturbance. Using this optimal estimate, an MPC-enabled disturbance-rejection controller is proposed for the pneumatic manipulator system to achieve angle tracking control. Moreover, the proposed approach ensures both the recursive feasibility of the optimization problem and the uniform boundedness of the closed-loop system. The simulation results further demonstrate the effectiveness and validity of the proposed methodology.

1. Introduction

The research on pneumatic soft robotic arms has accelerated with the recent progress in flexible actuation [1]. Owing to their intrinsic compliance, these arms are now prevalent in soft manipulation [2], bio-inspired robotics [3], and industrial automation [4]. Their primary actuators—pneumatic artificial muscles (PAMs)—provide high power-to-weight ratios [5], large deformations [6], low cost [7], and straightforward fabrication [8]. Nevertheless, severe nonlinearities, rate-dependent hysteresis, input coupling, and sensitivity to disturbances complicate precise control [9]. To mitigate these difficulties, several robust strategies have been investigated, including adaptive control [10], sliding mode control (SMC) [11], and active disturbance-rejection control (ADRC) [12]. An adaptive hysteresis–compensation scheme in [10] achieves high-precision trajectory tracking for PAM joints, whereas a constraint-aware adaptive fuzzy controller in [13] maintains prescribed tracking accuracy under time-varying motion constraints. These advances highlight the growing maturity of PAM control; however, further work is needed to balance tracking accuracy, robustness, and computational efficiency for real-world deployment.
SMC has become a preferred robust strategy for PAM-actuated systems owing to its inherent resilience to parameter variations and external disturbances [11]. The study in [11] combines adaptive SMC with fuzzy-logic approximation on a PAM-driven humanoid arm, achieving asymptotic trajectory tracking despite unknown nonlinearities. More recently, the study in [14] incorporates a disturbance observer into the SMC framework and, with rigorous stability analysis, guarantees uniformly ultimately bounded tracking errors under discontinuous friction.
As a robust anti-disturbance methodology, ADRC has been widely deployed in power systems [15], industrial robots [12], and multi-agent networks [16]. By dynamically estimating and compensating both endogenous uncertainties and exogenous disturbances, ADRC largely decouples closed-loop performance from precise modeling requirements [17]. Its core component—the extended state observer (ESO)—treats disturbances as augmented state variables, permitting simultaneous estimation of the original and disturbance states and thereby yielding a full description of the extended system dynamics [18]. Numerous ESO variants have been proposed. A reduced-order ESO for mobile robots is introduced in [19], while an event-triggered Takagi–Sugeno fuzzy ESO with adaptive parameters for bandwidth-constrained nonlinear systems is reported in [20]. To attenuate measurement noise, a composite observer that couples an ESO with a Kalman filter is presented in [21]. ESO-based solutions have also been tailored to pneumatic artificial muscles: a super-twisting ESO for trajectory regulation is presented in [22], and an ESO-assisted sliding mode controller is detailed in [23]. Despite these advances, designing an ESO that maximizes closed-loop performance remains unresolved. The composite observer in [21], for example, achieves minimum-variance state estimation rather than disturbance estimation explicitly optimized for control objectives. Motivated by this gap, the present work seeks to develop a new ESO that delivers optimal control performance.
In addition to exogenous disturbances, practical systems must respect stringent operational limits, especially state bounds and actuator saturation, which directly influence performance and safety [24,25,26]. Model predictive control (MPC) mitigates these limitations by solving a receding-horizon optimization that enforces state and input constraints, delivering superior closed-loop behavior with explicit safety guarantees [27,28]. The work in [27] introduces a hybrid disturbance-predictive, adaptive, event-triggered MPC that improves prediction accuracy while reducing communication demand, whereas [29] develops a periodic event-triggered MPC for nonlinear systems under bounded disturbances. Despite such progress, applications of MPC to constrained pneumatic manipulators remain scarce, an observation that motivates the present study.
This work addresses the critical challenge of constrained control synthesis for pneumatic manipulator systems operating under concurrent state and input constraints as well as external disturbances. The principal theoretical and methodological advancements are systematically articulated as follows: (1) a novel ESO with an additive term, whose estimation error dynamics are explicitly incorporated into the MPC cost function formulation, is proposed that dynamically adjusts estimation gain through real-time error feedback; (2) an MPC-enabled disturbance-rejection controller is designed for the pneumatic manipulator system to address both constraints and exogenous perturbations in soft actuator applications; and (3) sufficient stability conditions are established in terms of linear matrix inequalities (LMIs), theoretically ensuring the uniform ultimate boundedness of all closed-loop signals under the proposed MPC-based disturbance-rejection controller.

2. Problem Formulation and Preliminaries

2.1. The Model of the Pneumatic Manipulator System

In this paper, the model of the pneumatic manipulator system considered is based on [30], in which the authors established the dynamic model of the pneumatic manipulator for the corresponding experimental platform. In the experimental setup of [30], compressed air is provided by an air compressor and supplied to pressure-proportional valves as the air source. Voltage signals from the industrial computer are transmitted to twelve pressure-proportional valves, which regulate the internal pressures and pulling forces of the twelve PAMs. During the control process, the system operates in three stages: an initial stage, a preloading stage, and a movement stage. When the pneumatic manipulator is not pressurized, the input signals of the twelve pressure-proportional valves are zero, indicating that all PAMs are in a relaxed state without air pressure, and the deflection angle of the manipulator is zero. When the manipulator is required to track a given deflection angle, its motion is driven by antagonistic PAMs: one PAM is inflated while the other is deflated, resulting in an angular deflection of the manipulator. From this process, it can be seen that, although the input signal to the system is a voltage, it is ultimately converted into the internal pressure and pulling force of the PAMs. Therefore, in the dynamic model, the actual control input is the force, and the control objective is angle regulation. The objective of this study is to design an MPC scheme based on a novel ESO, enabling the pneumatic manipulator to track a given deflection angle signal with optimal control cost.
The pneumatic manipulator model is given according to [30] as

$$\ddot{\theta} = \frac{\tilde{R} k_0}{2\pi N^2 m \tilde{l}^2}\left(3L_0^2 - b^2\right)u - \frac{3\tilde{R}^2 L_0 p_0}{\pi N^2 m \tilde{l}^2}\theta - \frac{g\sin\theta}{\tilde{l}} + \frac{3\tilde{R}^3 k_0}{2\pi N^2 m \tilde{l}^2}\theta^2 u, \quad (1)$$

where $\theta$ denotes the deflection angle of the joint mechanism and $u$ is the control input. The system configuration parameters include $\tilde{R}$ (moment arm distance between the PAM actuators and the rotating linkage), $m$ (equivalent total mass incorporating the lower chassis and adjacent joint components), and $\tilde{l}$ (effective length of the driving linkage). The pneumatic actuation characteristics are defined by $k_0$ (pressure–voltage conversion coefficient of the proportional valves), $p_0$ (preloading pressure of the PAMs), and $L_0$ (nominal rest length of the pneumatic artificial muscles). Additional manufacturing parameters specify $N$ (number of synthetic fiber winding turns in the PAM construction) and $b$ (total length of the rayon thread).
Then, system (1) is reformulated in the following state-space form:

$$\dot{s}_1 = s_2, \qquad \dot{s}_2 = b_1 s_1 + b_0 u + w_d, \qquad y = s_1, \quad (2)$$

where $s_1 \triangleq \theta$, $s_2 \triangleq \dot{\theta}$, and

$$b_0 \triangleq \frac{\tilde{R} k_0}{2\pi N^2 m \tilde{l}^2}\left(3L_0^2 - b^2\right), \qquad b_1 \triangleq -\frac{3\tilde{R}^2 L_0 p_0}{\pi N^2 m \tilde{l}^2}, \qquad w_d \triangleq \frac{3\tilde{R}^3 k_0}{2\pi N^2 m \tilde{l}^2}\,s_1^2\,u - \frac{g}{\tilde{l}}\sin(s_1).$$
Then, system (2) can be transformed into the following discrete-time state-space form:

$$\begin{aligned} s_1(i+1) &= s_1(i) + T s_2(i),\\ s_2(i+1) &= T b_1 s_1(i) + s_2(i) + T b_0 u(i) + T w_d(i),\\ y(i) &= s_1(i), \end{aligned} \quad (3)$$

where $T$ is the sampling period and $i$ is the time step.
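As an illustration, the following Python sketch simulates the forward-Euler model (3) with the parameter values later listed in Table 1. The function names (`wd`, `step`) and the structure are ours and are given only to make the discretization concrete; they are not the authors' implementation.

```python
import numpy as np

# Parameters from Table 1 (R stands for R~ and l for l~ in the notation of the text).
R, l, L0, b = 0.1, 0.3, 0.3, 0.3          # geometry (m)
p0, m, g, T = 1.0, 1.0, 9.8, 0.01         # preload pressure, mass, gravity, sampling period
k0, N = 100.0, 8.0                        # valve coefficient, winding turns

# Coefficients b0, b1 as defined under system (2).
b0 = R * k0 / (2 * np.pi * N**2 * m * l**2) * (3 * L0**2 - b**2)
b1 = -3 * R**2 * L0 * p0 / (np.pi * N**2 * m * l**2)

def wd(s1, u):
    """Lumped disturbance w_d in (2): residual nonlinearity plus gravity term."""
    return 3 * R**3 * k0 / (2 * np.pi * N**2 * m * l**2) * s1**2 * u - g / l * np.sin(s1)

def step(s1, s2, u):
    """One step of the discrete-time model (3)."""
    return s1 + T * s2, T * b1 * s1 + s2 + T * b0 * u + T * wd(s1, u)
```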
Considering the physical constraints on $s_1(i)$, $s_2(i)$, and $u(i)$ of the pneumatic manipulator system, this study formulates these limitations as

$$|s_1(i)| \le \bar{s}_1, \quad (4)$$
$$|s_2(i)| \le \bar{s}_2, \quad (5)$$
$$|u(i)| \le \bar{u}, \quad (6)$$

respectively, where $\bar{s}_1$, $\bar{s}_2$, and $\bar{u}$ are known positive constants.
Then, constraints (4)–(6) can be rewritten as

$$s(i) \in \mathbb{S} \triangleq \{ s : b_s^T s(i) \le h_s \}, \quad (7)$$
$$u(i) \in \mathbb{U} \triangleq \{ u : b_u^T u(i) \le h_u \}, \quad (8)$$

where

$$s(i) \triangleq \begin{bmatrix} s_1(i) \\ s_2(i) \end{bmatrix}, \quad h_s \triangleq \begin{bmatrix} \bar{s}_1 \\ \bar{s}_1 \\ \bar{s}_2 \\ \bar{s}_2 \end{bmatrix}, \quad h_u \triangleq \begin{bmatrix} \bar{u} \\ \bar{u} \end{bmatrix}, \quad b_s \triangleq \begin{bmatrix} 1 & -1 & 0 & 0 \\ 0 & 0 & 1 & -1 \end{bmatrix}, \quad b_u \triangleq \begin{bmatrix} 1 & -1 \end{bmatrix}.$$
According to (7) and (8), the constraint on $w_d(i)$ is easily obtained as

$$w_d(i) \in \mathbb{W} \triangleq \{ w_d(i) : b_w^T w_d(i) \le h_w \},$$

where

$$b_w \triangleq \begin{bmatrix} 1 & -1 \end{bmatrix}, \qquad h_w \triangleq \begin{bmatrix} \bar{w}_d \\ \bar{w}_d \end{bmatrix}, \qquad \bar{w}_d \triangleq \frac{3\tilde{R}^3 k_0}{2\pi N^2 m \tilde{l}^2}\,\bar{s}_1^2\,\bar{u} + \frac{g}{\tilde{l}}\sin(\bar{s}_1).$$
Remark 1.
Subject to the mechanical constraints of the pneumatic manipulator's physical structure, its actual deflection angle is practically confined to $\pm 15^\circ$; i.e., $\theta(i) \in [-15^\circ, +15^\circ]$. Consequently, the disturbance bound is defined as $\bar{w}_d$ above.

2.2. The Receding-Horizon-Based ESO

In this subsection, a receding-horizon-based ESO (RH-ESO) is proposed. Decision variables are introduced into the RH-ESO to simultaneously optimize observer performance, suppress high-gain phenomena, and mitigate overshoot in the disturbance estimation errors. The RH-ESO is designed as
$$\hat{s}_1(i+1) = \hat{s}_1(i) + T\hat{s}_2(i) + T\beta_1\varepsilon_1(i) + Tl_1(i), \quad (9)$$
$$\hat{s}_2(i+1) = Tb_1\hat{s}_1(i) + \hat{s}_2(i) + T\hat{s}_3(i) + Tb_0u(i) + T\beta_2\varepsilon_1(i) + Tl_2(i), \quad (10)$$
$$\hat{s}_3(i+1) = \hat{s}_3(i) + T\beta_3\varepsilon_1(i) + Tl_3(i), \quad (11)$$

where $\hat{s}_1(i)$, $\hat{s}_2(i)$, and $\hat{s}_3(i)$ are the estimates of $s_1(i)$, $s_2(i)$, and $w_d(i)$, respectively; $l_m$ ($m = 1,2,3$) are the decision variables obtained subsequently by solving an optimization problem; $\beta_m$ ($m = 1,2,3$) are the adjustable gains; and $\varepsilon_m(i) \triangleq s_m(i) - \hat{s}_m(i)$ ($m = 1,2,3$, with $s_3(i) \triangleq w_d(i)$) are the estimation errors, whose dynamics are obtained as

$$\varepsilon_1(i+1) = (1 - T\beta_1)\varepsilon_1(i) + T\varepsilon_2(i) - Tl_1(i), \quad (12)$$
$$\varepsilon_2(i+1) = (Tb_1 - T\beta_2)\varepsilon_1(i) + \varepsilon_2(i) + T\varepsilon_3(i) - Tl_2(i), \quad (13)$$
$$\varepsilon_3(i+1) = -T\beta_3\varepsilon_1(i) + \varepsilon_3(i) - Tl_3(i) + T\Delta_d(i), \quad (14)$$

where $\Delta_d(i) \triangleq \frac{w_d(i+1) - w_d(i)}{T}$.
From (13), we have

$$\varepsilon_3(i) = \frac{1}{T}\left[\varepsilon_2(i+1) - (Tb_1 - T\beta_2)\varepsilon_1(i) - \varepsilon_2(i) + Tl_2(i)\right]. \quad (15)$$

Substituting (15) into (14), we have

$$\varepsilon_3(i+1) = -T\beta_3\varepsilon_1(i) + \frac{1}{T}\left[\varepsilon_2(i+1) - (Tb_1 - T\beta_2)\varepsilon_1(i) - \varepsilon_2(i) + Tl_2(i)\right] - Tl_3(i) + T\Delta_d(i). \quad (16)$$

From (12), we have

$$\varepsilon_2(i) = \frac{1}{T}\left[\varepsilon_1(i+1) - (1 - T\beta_1)\varepsilon_1(i) + Tl_1(i)\right]. \quad (17)$$

Substituting (17) into (16), we have

$$\begin{aligned} \varepsilon_3(i+1) ={}& -T\beta_3\varepsilon_1(i) + \frac{1}{T}\Big[\varepsilon_2(i+1) - (Tb_1 - T\beta_2)\varepsilon_1(i)\\ &- \frac{1}{T}\big(\varepsilon_1(i+1) - (1 - T\beta_1)\varepsilon_1(i) + Tl_1(i)\big) + Tl_2(i)\Big] - Tl_3(i) + T\Delta_d(i)\\ ={}& -\frac{1}{T^2}\varepsilon_1(i+1) + \Big(\frac{1}{T^2} - b_1 - \frac{\beta_1}{T} + \beta_2 - T\beta_3\Big)\varepsilon_1(i) + \frac{1}{T}\varepsilon_2(i+1)\\ &- \frac{1}{T}l_1(i) + l_2(i) - Tl_3(i) + T\Delta_d(i). \end{aligned} \quad (18)$$

From (18), we have

$$\varepsilon_3(i) = -\frac{1}{T^2}\varepsilon_1(i) + \Big(\frac{1}{T^2} - b_1 - \frac{\beta_1}{T} + \beta_2 - T\beta_3\Big)\varepsilon_1(i-1) + \frac{1}{T}\varepsilon_2(i) - \frac{1}{T}l_1(i-1) + l_2(i-1) - Tl_3(i-1) + T\Delta_d(i-1). \quad (19)$$

Then, substituting (17) and (19) into (13), we have

$$\varepsilon_2(i+1) = \frac{2}{T}\varepsilon_1(i+1) + \Big(Tb_1 - \frac{3}{T} + 2\beta_1 - T\beta_2\Big)\varepsilon_1(i) + \Big(\frac{1}{T} - Tb_1 - \beta_1 + T\beta_2 - T^2\beta_3\Big)\varepsilon_1(i-1) + 2l_1(i) - l_1(i-1) - Tl_2(i) + Tl_2(i-1) - T^2l_3(i-1) + T^2\Delta_d(i-1). \quad (20)$$

From (20), we have

$$\varepsilon_2(i) = \frac{2}{T}\varepsilon_1(i) + \Big(Tb_1 - \frac{3}{T} + 2\beta_1 - T\beta_2\Big)\varepsilon_1(i-1) + \Big(\frac{1}{T} - Tb_1 - \beta_1 + T\beta_2 - T^2\beta_3\Big)\varepsilon_1(i-2) + 2l_1(i-1) - l_1(i-2) - Tl_2(i-1) + Tl_2(i-2) - T^2l_3(i-2) + T^2\Delta_d(i-2). \quad (21)$$

Then, substituting (21) into (12), we have

$$\varepsilon_1(i+1) = (3 - T\beta_1)\varepsilon_1(i) + \big(T^2b_1 - T^2\beta_2 + 2T\beta_1 - 3\big)\varepsilon_1(i-1) + \big(1 - T^2b_1 - T\beta_1 + T^2\beta_2 - T^3\beta_3\big)\varepsilon_1(i-2) - Tl_1(i) + 2Tl_1(i-1) - Tl_1(i-2) - T^2l_2(i-1) + T^2l_2(i-2) - T^3l_3(i-2) + T^3\Delta_d(i-2). \quad (22)$$
Let

$$\begin{aligned} &m_{11} = 3 - T\beta_1, \quad m_{12} = T^2b_1 - T^2\beta_2 + 2T\beta_1 - 3, \quad m_{13} = 1 - T^2b_1 - T\beta_1 + T^2\beta_2 - T^3\beta_3,\\ &m_{14} = -T, \quad m_{15} = 2T, \quad m_{16} = -T, \quad m_{17} = -T^2, \quad m_{18} = T^2, \quad m_{19} = -T^3,\\ &m_{21} = \frac{2}{T}, \quad m_{22} = Tb_1 - \frac{3}{T} + 2\beta_1 - T\beta_2, \quad m_{23} = \frac{1}{T} - Tb_1 - \beta_1 + T\beta_2 - T^2\beta_3,\\ &m_{24} = 2, \quad m_{25} = -1, \quad m_{26} = -T, \quad m_{27} = T, \quad m_{28} = -T^2,\\ &m_{31} = -\frac{1}{T^2}, \quad m_{32} = \frac{1}{T^2} - b_1 - \frac{\beta_1}{T} + \beta_2 - T\beta_3, \quad m_{33} = \frac{1}{T}, \quad m_{34} = -\frac{1}{T}, \quad m_{35} = 1, \quad m_{36} = -T. \end{aligned}$$

Then, (22), (20), and (18) can be rewritten as

$$\begin{aligned} \varepsilon_1(i+1) ={}& m_{11}\varepsilon_1(i) + m_{12}\varepsilon_1(i-1) + m_{13}\varepsilon_1(i-2) + m_{14}l_1(i) + m_{15}l_1(i-1) + m_{16}l_1(i-2)\\ &+ m_{17}l_2(i-1) + m_{18}l_2(i-2) + m_{19}l_3(i-2) + T^3\Delta_d(i-2),\\ \varepsilon_2(i+1) ={}& m_{21}\varepsilon_1(i+1) + m_{22}\varepsilon_1(i) + m_{23}\varepsilon_1(i-1) + m_{24}l_1(i) + m_{25}l_1(i-1) + m_{26}l_2(i)\\ &+ m_{27}l_2(i-1) + m_{28}l_3(i-1) + T^2\Delta_d(i-1),\\ \varepsilon_3(i+1) ={}& m_{31}\varepsilon_1(i+1) + m_{32}\varepsilon_1(i) + m_{33}\varepsilon_2(i+1) + m_{34}l_1(i) + m_{35}l_2(i) + m_{36}l_3(i) + T\Delta_d(i), \end{aligned}$$

respectively.
Denote

$$\begin{aligned} &\varepsilon_s(i) \triangleq [\,\varepsilon_1(i)\;\; \varepsilon_2(i)\;\; \varepsilon_3(i)\;\; \varepsilon_1(i-1)\;\; \varepsilon_1(i-2)\;\; \varepsilon_1(i-3)\,]^T, \qquad l(i) \triangleq [\,l_1(i)\;\; l_2(i)\;\; l_3(i)\,]^T,\\ &\boldsymbol{l}(i) \triangleq [\,l^T(i-1)\;\; l^T(i-2)\,]^T, \qquad J(i) \triangleq [\,T^3\Delta_d(i-2)\;\; T^2\Delta_d(i-1)\;\; T\Delta_d(i)\;\; 0_{1\times 3}\,]^T,\\ &\boldsymbol{l}(1) \triangleq 0_{6\times 1}, \qquad \boldsymbol{l}(2) = [\,l^T(1)\;\; 0_{1\times 3}\,]^T. \end{aligned}$$

The dynamics of $\varepsilon_s(i)$ can be obtained as

$$\varepsilon_s(i+1) = G_\varepsilon \varepsilon_s(i+1) + \tilde{A}_\varepsilon \varepsilon_s(i) + \tilde{B}_\varepsilon l(i) + \tilde{D}_\varepsilon \boldsymbol{l}(i) + J(i),$$

where

$$G_\varepsilon \triangleq \begin{bmatrix} 0 & 0 & 0 & 0 & 0 & 0\\ m_{21} & 0 & 0 & 0 & 0 & 0\\ m_{31} & m_{33} & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0 \end{bmatrix}, \qquad \tilde{A}_\varepsilon \triangleq \begin{bmatrix} m_{11} & 0 & 0 & m_{12} & m_{13} & 0\\ m_{22} & 0 & 0 & m_{23} & 0 & 0\\ m_{32} & 0 & 0 & 0 & 0 & 0\\ 1 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 1 & 0 & 0\\ 0 & 0 & 0 & 0 & 1 & 0 \end{bmatrix},$$

$$\tilde{B}_\varepsilon \triangleq \begin{bmatrix} m_{14} & 0 & 0\\ m_{24} & 0 & 0\\ m_{34} & m_{35} & m_{36}\\ 0 & 0 & 0\\ 0 & 0 & 0\\ 0 & 0 & 0 \end{bmatrix}, \qquad \tilde{D}_\varepsilon \triangleq \begin{bmatrix} m_{15} & m_{17} & 0 & m_{16} & m_{18} & m_{19}\\ m_{25} & m_{27} & m_{28} & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0 \end{bmatrix}.$$

Then, it yields

$$\varepsilon_s(i+1) = (I - G_\varepsilon)^{-1}\tilde{A}_\varepsilon \varepsilon_s(i) + (I - G_\varepsilon)^{-1}\tilde{B}_\varepsilon l(i) + (I - G_\varepsilon)^{-1}\tilde{D}_\varepsilon \boldsymbol{l}(i) + (I - G_\varepsilon)^{-1}J(i).$$

Denote

$$A_\varepsilon \triangleq (I - G_\varepsilon)^{-1}\tilde{A}_\varepsilon, \qquad B_\varepsilon \triangleq (I - G_\varepsilon)^{-1}\tilde{B}_\varepsilon, \qquad D_\varepsilon \triangleq (I - G_\varepsilon)^{-1}\tilde{D}_\varepsilon, \qquad F_e \triangleq (I - G_\varepsilon)^{-1}.$$

Thus, there exists

$$\varepsilon_s(i+1) = A_\varepsilon \varepsilon_s(i) + B_\varepsilon l(i) + D_\varepsilon \boldsymbol{l}(i) + F_e J(i).$$
To mitigate abrupt changes in the observed values induced by overshoot during observation error convergence, this study imposes constraints on the observation error to enforce smooth convergence characteristics. This restriction ensures gradual variations in both state and disturbance estimates, thereby enhancing the closed-loop system performance through stabilized estimation dynamics. The constraints on $\varepsilon_m(i)$ ($m = 1,2,3$) are given as

$$|\varepsilon_m(i)| \le h_{\varepsilon_m}, \quad (23)$$

where $h_{\varepsilon_m}$ are given positive constants. Then, it is easy to obtain

$$b_{\varepsilon_s}^T \varepsilon_s(i) \le h_{\varepsilon_s}, \quad (24)$$

where $b_{\varepsilon_s}$ and $h_{\varepsilon_s}$ are obtained from constraint (23).
Remark 2.
Constraint (24) is designed to bound the disturbance estimation error, which effectively mitigates the occurrence of high-gain phenomena in the RH-ESO and reduces the overshoot in disturbance estimation. By incorporating this constraint, the disturbance estimation error can achieve smooth convergence characteristics, thereby facilitating the subsequent derivation of control inputs with enhanced stability and reduced transient fluctuations. This systematic constraint implementation ensures coordinated performance improvements throughout the closed-loop control system.
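To make the observer update concrete, the following sketch (continuing the model code above, so `T`, `b0`, and `b1` are reused) implements one step of the RH-ESO (9)–(11). The interface is hypothetical: the decision variables `l` are assumed to be produced by the receding-horizon optimization of Section 2.5, and passing `l = (0, 0, 0)` recovers a conventional linear ESO.

```python
def rh_eso_step(s_hat, y, u, l, beta=(300.0, 7000.0, 1750.0)):
    """One update of the RH-ESO (9)-(11).

    s_hat : (s1_hat, s2_hat, s3_hat), current estimates of s1, s2, and w_d
    y     : measured output s1(i)
    u     : control input u(i)
    l     : (l1, l2, l3), decision variables from the receding-horizon optimizer
    beta  : adjustable observer gains (values from Table 1)
    """
    s1h, s2h, s3h = s_hat
    e1 = y - s1h  # output estimation error eps1(i)
    return (
        s1h + T * s2h + T * beta[0] * e1 + T * l[0],
        T * b1 * s1h + s2h + T * s3h + T * b0 * u + T * beta[1] * e1 + T * l[1],
        s3h + T * beta[2] * e1 + T * l[2],
    )
```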

2.3. The Tracking Differentiator

In this subsection, the tracking differentiator (TD) is used to handle the transient process of the system state. Denote $v_0$ as the given input, $v_1(i)$ as the output tracking $v_0$, and $v_2(i)$ as the differential signal of $v_1(i)$. In this paper, $v_0 = \theta_0$ is the deflection angle to be tracked. The TD is designed as

$$v_1(i+1) = v_1(i) + Tv_2(i), \qquad v_2(i+1) = v_2(i) + T\,\mathrm{fhan}(i),$$

where

$$\begin{aligned} \mathrm{fhan}(i) &= \begin{cases} -r_s\,\mathrm{sign}(a(i)), & |a(i)| > d,\\ -r_s\,\dfrac{a(i)}{d}, & |a(i)| \le d, \end{cases}\\ a(i) &= \begin{cases} v_2(i) + \dfrac{a_0(i) - d}{2}\,\mathrm{sign}(d_s(i)), & |d_s(i)| > d_0,\\ v_2(i) + \dfrac{d_s(i)}{T}, & |d_s(i)| \le d_0, \end{cases}\\ d_s(i) &= v_1(i) - v_0 + Tv_2(i), \qquad d = r_sT, \qquad d_0 = r_sT^2,\\ a_0(i) &= \sqrt{d^2 + 8r_s|d_s(i)|}, \end{aligned}$$

with $r_s > 0$ and $d > 0$ as given constants.
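The fhan nonlinearity above translates directly into code. The following sketch implements the TD as reconstructed from the definitions above (the square root in $a_0(i)$ and the negative signs in fhan follow the standard discrete tracking differentiator; `rs` denotes $r_s$):

```python
def fhan(v1, v2, v0, rs, T):
    """Time-optimal synthesis function fhan of the tracking differentiator."""
    d = rs * T
    d0 = rs * T**2
    ds = v1 - v0 + T * v2
    a0 = np.sqrt(d**2 + 8 * rs * abs(ds))
    if abs(ds) > d0:
        a = v2 + 0.5 * (a0 - d) * np.sign(ds)
    else:
        a = v2 + ds / T
    return -rs * np.sign(a) if abs(a) > d else -rs * a / d

def td_step(v1, v2, v0, rs, T):
    """One TD step: v1 tracks the reference v0 and v2 approximates its derivative."""
    return v1 + T * v2, v2 + T * fhan(v1, v2, v0, rs, T)
```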

2.4. The MPC-Enabled Disturbance-Rejection Controller

To design the controller, the state tracking errors are denoted as $r_1(i) \triangleq v_1(i) - \hat{s}_1(i)$ and $r_2(i) \triangleq v_2(i) - \hat{s}_2(i)$. Then, the control input is designed as

$$u(i) = k_1r_1(i) + k_2r_2(i) + c(i) - \frac{1}{b_0}\hat{s}_3(i), \quad (25)$$

where $k_1$ and $k_2$ are feedback gains to be designed and $c(i)$ is the decision variable of the MPC scheme to be obtained subsequently.
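In code, the control law (25) is a single line; the sketch below uses the Table 1 gains as illustrative defaults and shows how the disturbance estimate $\hat{s}_3(i)$ enters as a feedforward compensation term (again assuming the `b0` defined in the earlier sketch):

```python
def control_input(r1, r2, c, s3_hat, k1=100.0, k2=30.0):
    """Control law (25): error feedback, MPC decision variable c(i), and
    compensation of the estimated disturbance s3_hat = s3^(i)."""
    return k1 * r1 + k2 * r2 + c - s3_hat / b0
```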
According to $r_1(i)$, $r_2(i)$, and (25), there exists

$$\begin{aligned} r_1(i+1) &= r_1(i) + Tr_2(i) - T\beta_1\varepsilon_1(i) - Tl_1(i),\\ r_2(i+1) &= r_2(i) - Tb_0u(i) + T\,\mathrm{fhan}(i) - Tb_1\hat{s}_1(i) - T\hat{s}_3(i) - T\beta_2\varepsilon_1(i) - Tl_2(i). \end{aligned}$$

Then, it is obtained that

$$\begin{aligned} r_1(i+1) &= r_1(i) + Tr_2(i) - T\beta_1\varepsilon_1(i) - Tl_1(i),\\ r_2(i+1) &= -Tb_0k_1r_1(i) + (1 - Tb_0k_2)r_2(i) - Tb_0c(i) - T\beta_2\varepsilon_1(i) - Tl_2(i) + q(i), \end{aligned}$$

where $q(i) = T\,\mathrm{fhan}(i) - Tb_1\hat{s}_1(i)$.
Denote $r(i) \triangleq [\,r_1(i)\;\; r_2(i)\,]^T$. Then, it follows that

$$r(i+1) = A_rr(i) + B_r\varepsilon_s(i) + C_rc(i) + D_rl(i) + E_rq(i),$$

where

$$A_r \triangleq \begin{bmatrix} 1 & T\\ -Tb_0k_1 & 1 - Tb_0k_2 \end{bmatrix}, \quad B_r \triangleq \begin{bmatrix} -T\beta_1 & 0_{1\times 5}\\ -T\beta_2 & 0_{1\times 5} \end{bmatrix}, \quad C_r \triangleq \begin{bmatrix} 0\\ -Tb_0 \end{bmatrix}, \quad D_r \triangleq \begin{bmatrix} -T & 0 & 0\\ 0 & -T & 0 \end{bmatrix}, \quad E_r \triangleq \begin{bmatrix} 0\\ 1 \end{bmatrix}.$$
Then, $r(i)$ is constrained by $r(i) \in \mathcal{R} \triangleq \{ r(i) : b_r^T r(i) \le h_r \}$, where

$$b_r \triangleq \begin{bmatrix} 1 & -1 & 0 & 0\\ 0 & 0 & 1 & -1 \end{bmatrix}, \qquad h_r \triangleq [\,\bar{r}_1\;\; \bar{r}_1\;\; \bar{r}_2\;\; \bar{r}_2\,]^T,$$

with $\bar{r}_1$ and $\bar{r}_2$ being given positive constants. Then, the constraint on $q(i)$ is obtained as

$$q(i) \in \mathbb{Q} \triangleq \{ q(i) : b_q^T q(i) \le \bar{q} \},$$

with $\bar{q} \triangleq Tr_s + Tb_1(\bar{s}_1 + \bar{r}_1)$. Denote $\zeta(i) \triangleq [\,r^T(i)\;\; \varepsilon_s^T(i)\,]^T$ and $h(i) \triangleq [\,c^T(i)\;\; l^T(i)\,]^T$. Then, we have

$$\zeta(i+1) = A_\zeta\zeta(i) + B_\zeta h(i) + C_\zeta\boldsymbol{l}(i) + D_\zeta q(i),$$

where

$$A_\zeta \triangleq \begin{bmatrix} A_r & B_r\\ 0_{6\times 2} & A_\varepsilon \end{bmatrix}, \quad B_\zeta \triangleq \begin{bmatrix} C_r & D_r\\ 0_{6\times 1} & B_\varepsilon \end{bmatrix}, \quad C_\zeta \triangleq \begin{bmatrix} 0_{2\times 6}\\ D_\varepsilon \end{bmatrix}, \quad D_\zeta \triangleq \begin{bmatrix} E_r\\ 0_{6\times 1} \end{bmatrix}.$$
The nominal system of $\zeta(i)$ is obtained as

$$\bar{\zeta}(i+1) = A_\zeta\bar{\zeta}(i) + B_\zeta h(i) + C_\zeta\boldsymbol{l}(i),$$

where $\bar{\zeta}(i) \triangleq [\,\bar{r}^T(i)\;\; \bar{\varepsilon}_s^T(i)\,]^T$ with

$$\bar{r}(i+1) = A_r\bar{r}(i) + B_r\bar{\varepsilon}_s(i) + C_rc(i), \qquad \bar{\varepsilon}_s(i+1) = A_\varepsilon\bar{\varepsilon}_s(i) + B_\varepsilon l(i) + D_\varepsilon\boldsymbol{l}(i).$$
Accordingly, the constraints on $\bar{\zeta}(i)$ and $h(i)$ are obtained as

$$\bar{\zeta}(i) \in \mathbb{M} \triangleq \{\bar{\zeta}(i) : b_\zeta^T\bar{\zeta}(i) \le h_\zeta\}, \qquad h(i) \in \mathbb{N} \triangleq \{h(i) : \tilde{b}_u^T\bar{\zeta}(i) + b_h^Th(i) \le h_h + h_q(i)\},$$

respectively, where

$$b_\zeta \triangleq \begin{bmatrix} b_r & 0\\ 0 & b_{\varepsilon_s} \end{bmatrix}, \quad h_\zeta \triangleq \begin{bmatrix} h_r\\ h_{\varepsilon_s} \end{bmatrix}, \quad h_q(i) \triangleq \begin{bmatrix} b_u^T\hat{s}_3(i)\\ 0_{6\times 1} \end{bmatrix}, \quad \tilde{b}_u \triangleq \begin{bmatrix} [\,k_1\;\; k_2\,]^Tb_u & 0_{2\times 6}\\ 0_{6\times 2} & 0_{6\times 6} \end{bmatrix}, \quad b_h \triangleq \begin{bmatrix} b_u & 0_{1\times 3}\\ 0_{1\times 1} & 0_{1\times 3} \end{bmatrix}, \quad h_h \triangleq \begin{bmatrix} h_u\\ 0_{6\times 1} \end{bmatrix}.$$

2.5. The MPC Scheme

In this paper, the MPC scheme is proposed to deal with the state and input constraints of the pneumatic manipulator as well as to generate part of the control input signal. To establish the MPC scheme, a cost function is designed as

$$J(\hat{\boldsymbol{l}}(i), \hat{h}(i)) \triangleq \sum_{s=0}^{T_s-1}\left(\|\hat{\zeta}(s,i)\|_Q^2 + \|\hat{h}(s,i)\|_R^2\right) + \|\hat{\zeta}(T_s,i)\|_P^2,$$

where $\hat{\zeta}(s,i)$ is the predictive state with $\hat{\zeta}(0,i) = \bar{\zeta}(i) = \zeta(i)$; $\hat{h}(s,i)$ is the predictive decision variable; $\hat{\zeta}(i) \triangleq [\,\hat{\zeta}^T(0,i)\;\; \hat{\zeta}^T(1,i)\;\; \cdots\;\; \hat{\zeta}^T(T_s,i)\,]^T$ is the predictive state sequence; $\hat{h}(i) \triangleq [\,\hat{h}^T(0,i)\;\; \hat{h}^T(1,i)\;\; \cdots\;\; \hat{h}^T(T_s-1,i)\,]^T$ is the predictive decision variable sequence with $\hat{h}(0,i) = h(i)$; $Q$ and $R$ denote the positive definite weighting matrices; $P > 0$ represents the terminal penalty matrix requiring design; and $T_s$ denotes the prediction horizon.
Next, the optimization problem of the MPC scheme can be formulated as
Prob 1:

$$(\hat{\boldsymbol{l}}^*(i), \hat{h}^*(i)) \triangleq \arg\min_{\hat{h}(i)} J(\hat{\boldsymbol{l}}(i), \hat{h}(i))$$

subject to

$$\hat{\zeta}(s+1,i) = A_\zeta\hat{\zeta}(s,i) + B_\zeta\hat{h}(s,i) + C_\zeta\hat{\boldsymbol{l}}(s,i), \quad (26)$$
$$b_\zeta^T\hat{\zeta}(s,i) \le h_\zeta - \phi(s,i), \quad s \in \mathbb{Z}[0, T_s-1], \quad (27)$$
$$\tilde{b}_u^T\hat{\zeta}(s,i) + b_h^T\hat{h}(s,i) \le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(s,i), \quad s \in \mathbb{Z}[0, T_s-1], \quad (28)$$
$$\hat{\zeta}(T_s,i) \in \zeta_T(T_s,i), \quad (29)$$

where

$$\begin{aligned} &\phi(s,i) \triangleq \sum_{j=0}^{s-1}\max_{q(s-1-j,i)\in\mathbb{Q}} b_\zeta^TA_\zeta^jD_\zeta q(s-1-j,i), \qquad \chi(s,i) \triangleq \sum_{j=0}^{s-1}\max_{q(s-1-j,i)\in\mathbb{Q}} \tilde{b}_u^TA_\zeta^jD_\zeta q(s-1-j,i),\\ &e_\zeta(s+1,i) = A_\zeta e_\zeta(s,i) + D_\zeta q(s,i), \qquad \phi(0,i) \triangleq 0_{10\times 1}, \qquad \chi(0,i) \triangleq 0_{8\times 1}, \qquad e_\zeta(0,i) = 0_{8\times 1},\\ &\zeta_T(T_s,i) \triangleq \mathcal{Z}(T_s,i) \cap \hat{\mathcal{H}}(T_s,i),\\ &\mathcal{Z}(T_s,i) \triangleq \{\hat{\zeta}(T_s,i) : b_\zeta^T\hat{\zeta}(T_s,i) \le h_\zeta - \phi(T_s,i)\},\\ &\hat{\mathcal{H}}(T_s,i) \triangleq \{\hat{\zeta}(T_s,i) : \tilde{b}_u^T\hat{\zeta}(T_s,i) \le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(T_s,i)\},\\ &\hat{\zeta}^*(i) \triangleq [\,\hat{\zeta}^{*T}(0,i)\;\; \hat{\zeta}^{*T}(1,i)\;\; \cdots\;\; \hat{\zeta}^{*T}(T_s,i)\,]^T, \qquad \hat{h}^*(i) \triangleq [\,\hat{h}^{*T}(0,i)\;\; \hat{h}^{*T}(1,i)\;\; \cdots\;\; \hat{h}^{*T}(T_s-1,i)\,]^T,\\ &\hat{\boldsymbol{l}}(s,i) = \begin{cases} \boldsymbol{l}(i), & s = 0,\\ \begin{bmatrix} [\,0_{3\times 1}\;\; I_{3\times 3}\,]\hat{h}(0,i)\\ [\,I_{3\times 3}\;\; 0_{3\times 3}\,]\boldsymbol{l}(i) \end{bmatrix}, & s = 1,\\ \begin{bmatrix} [\,0_{3\times 1}\;\; I_{3\times 3}\,]\hat{h}(s-1,i)\\ [\,0_{3\times 1}\;\; I_{3\times 3}\,]\hat{h}(s-2,i) \end{bmatrix}, & s \in \mathbb{Z}[2, T_s-1]. \end{cases} \end{aligned}$$
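The following sketch illustrates the receding-horizon structure of Prob 1 with cvxpy. It is a deliberately simplified stand-in, not the authors' solver: the constraint-tightening terms $\phi(s,i)$ and $\chi(s,i)$, the past-input channel $C_\zeta\hat{\boldsymbol{l}}(s,i)$, and the terminal set are replaced here by plain box bounds on the decision sequence, so only the cost, the predictive dynamics (26), and the first-move application are faithful to the formulation above.

```python
import cvxpy as cp
import numpy as np

def solve_prob1_simplified(zeta0, A, B, Q, R, P, Ts, h_max):
    """Simplified receding-horizon problem in the spirit of Prob 1.

    zeta0 : current extended state zeta(i) (8-vector)
    A, B  : A_zeta, B_zeta from the text; Q, R, P the weighting/terminal matrices
    Ts    : prediction horizon; h_max : box bound on h = [c; l1; l2; l3]
    """
    n, p = A.shape[0], B.shape[1]
    zeta = cp.Variable((n, Ts + 1))
    h = cp.Variable((p, Ts))
    cost, cons = 0, [zeta[:, 0] == zeta0]
    for s in range(Ts):
        cost += cp.quad_form(zeta[:, s], Q) + cp.quad_form(h[:, s], R)
        cons += [zeta[:, s + 1] == A @ zeta[:, s] + B @ h[:, s],
                 cp.abs(h[:, s]) <= h_max]         # simplified decision bounds
    cost += cp.quad_form(zeta[:, Ts], P)           # terminal penalty
    cp.Problem(cp.Minimize(cost), cons).solve()
    return h.value[:, 0]                           # apply only the first decision h(i)
```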
Remark 3.
The optimality of the RH-ESO proposed in this paper is fundamentally different from that in [21]. In [21], the system under consideration includes a slowly time-varying disturbance in the state equations and Gaussian white noise in the output equations. It should be noted that the optimality in [21] is defined in terms of a minimum-variance performance index for estimating the states s 1 ( i ) and s 2 ( i ) and does not address an optimal estimate of the disturbance w d ( i ) . By contrast, the focus of this paper is on the rolling optimization of the disturbance estimation error ε s ( i ) to obtain the optimal estimate of the slowly varying disturbance w d ( i ) at the current instant. This “optimal” estimate is understood as the one that minimizes the system control performance index—that is, the quadratic tracking error of the system state.
The structure diagram of the pneumatic manipulator system is presented in Figure 1.
Definition 1
([31]). System (3) is said to be uniformly bounded if, for all $\Xi_1 > 0$, there exists a constant $\Xi_2(\Xi_1)$ such that

$$\|s(0)\| \le \Xi_1 \;\Longrightarrow\; \|s(i)\| \le \Xi_2(\Xi_1)$$

holds for all $s(0) \in \mathbb{R}^n$ and $i \in \mathbb{Z}_{\ge 0}$.
Following the above discussion, this paper aims to achieve the following two objectives: (1) establish sufficient conditions to ensure that the proposed MPC scheme is recursively feasible, and (2) establish easily verifiable sufficient conditions to ensure that system (3) under control input (25) is uniformly bounded.

3. Results

This section establishes the recursive feasibility of Prob 1 and provides rigorous Lyapunov-based stability guarantees for the closed-loop system.
Firstly, the following lemma is given to prove the feasibility of Prob 1.
Lemma 1.
If Prob 1 is feasible at time instant $i$, then there exists at least one feasible solution for Prob 1 at time instant $i+1$.
Proof. 
Denote the candidate decision variable sequences $\tilde{h}(i+1)$ and $\tilde{\boldsymbol{l}}(i+1)$ as

$$\begin{aligned} \tilde{h}(i+1) &\triangleq [\,\tilde{h}^T(0,i+1)\;\; \cdots\;\; \tilde{h}^T(T_s,i+1)\,]^T = [\,\hat{h}^{*T}(1,i)\;\; \cdots\;\; \hat{h}^{*T}(T_s-1,i)\;\; 0_{1\times 4}\,]^T,\\ \tilde{\boldsymbol{l}}(i+1) &\triangleq [\,\tilde{\boldsymbol{l}}^T(0,i+1)\;\; \cdots\;\; \tilde{\boldsymbol{l}}^T(T_s,i+1)\,]^T = [\,\hat{\boldsymbol{l}}^T(1,i)\;\; \cdots\;\; \hat{\boldsymbol{l}}^T(T_s-1,i)\;\; 0_{1\times 6}\,]^T, \end{aligned}$$

where $\tilde{h}(s,i+1)$ is a candidate decision variable for $s \in \mathbb{Z}[0, T_s]$, and $\tilde{\boldsymbol{l}}(s,i+1)$ can be obtained from $\tilde{h}(s,i+1)$ for $s \in \mathbb{Z}[0, T_s]$. Then, we have

$$\hat{\zeta}(0,i+1) = A_\zeta\hat{\zeta}(0,i) + B_\zeta\hat{h}^*(0,i) + C_\zeta\hat{\boldsymbol{l}}^*(0,i) + D_\zeta q(i). \quad (30)$$

By considering the predictive dynamics of $\zeta(i+1)$ based on $\tilde{h}(i+1)$, it follows that the corresponding state predictions can be derived from the system's predictive dynamics:

$$\tilde{\zeta}(s+1,i+1) = A_\zeta\tilde{\zeta}(s,i+1) + B_\zeta\tilde{h}(s,i+1) + C_\zeta\tilde{\boldsymbol{l}}(s,i+1), \quad (31)$$

where $\tilde{\zeta}(s+1,i+1)$ is the feasible state for $s \in \mathbb{Z}[0, T_s]$.
Combining (26), (30), and (31), it is obtained that

$$\tilde{\zeta}(s,i+1) = \hat{\zeta}^*(s,i) + A_\zeta^{s-1}D_\zeta q(i). \quad (32)$$

Denote $\tilde{\zeta}(i+1) \triangleq [\,\tilde{\zeta}^T(0,i+1)\;\; \cdots\;\; \tilde{\zeta}^T(T_s,i+1)\,]^T$ as the feasible predictive state sequence.
In order to ensure the existence of at least one solution to Prob 1 at the subsequent time step $i+1$, it is necessary that the following conditions are satisfied:

$$b_\zeta^T\tilde{\zeta}(s,i+1) \le h_\zeta - \phi(s,i+1), \quad s \in \mathbb{Z}[0, T_s], \quad (33)$$
$$\tilde{b}_u^T\tilde{\zeta}(s,i+1) + b_h^T\tilde{h}(s,i+1) \le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(s,i+1), \quad s \in \mathbb{Z}[0, T_s-1], \quad (34)$$
$$\tilde{\zeta}(T_s,i+1) \in \zeta_T(T_s,i+1). \quad (35)$$
In the following analysis, conditions (33)–(35) will be demonstrated one by one. To begin with, we first provide the proof of condition (33). Assuming that Prob 1 has a feasible solution at time instant $i$, it directly follows that constraint (27) will be satisfied, which in turn implies

$$b_\zeta^T\hat{\zeta}^*(s,i) \le h_\zeta - \phi(s,i), \quad (36)$$

for $s \in \mathbb{Z}[0, T_s-2]$. Based on (32) and (36), we have $b_\zeta^T(\tilde{\zeta}(s,i+1) - A_\zeta^{s-1}D_\zeta q(i)) \le h_\zeta - \phi(s,i)$, which yields

$$\begin{aligned} b_\zeta^T\tilde{\zeta}(s,i+1) &\le h_\zeta - \phi(s,i) + b_\zeta^TA_\zeta^{s-1}D_\zeta q(i)\\ &= h_\zeta - \sum_{j=0}^{s-1}\max_{q(s-1-j,i)\in\mathbb{Q}} b_\zeta^TA_\zeta^jD_\zeta q(s-1-j,i) + b_\zeta^TA_\zeta^{s-1}D_\zeta q(i)\\ &\le h_\zeta - \sum_{j=0}^{s-1}\max_{q(s-1-j,i+1)\in\mathbb{Q}} b_\zeta^TA_\zeta^jD_\zeta q(s-1-j,i+1)\\ &= h_\zeta - \phi(s,i+1), \end{aligned}$$

for $s \in \mathbb{Z}[0, T_s]$. Thus, condition (33) holds true for all $s$ within the interval $\mathbb{Z}[0, T_s]$.
Next, condition (34) is verified for time instant $i+1$. By utilizing the sequences $\tilde{h}(i+1)$ and $\tilde{\boldsymbol{l}}(i+1)$, we have $\tilde{h}(s,i+1) = \hat{h}^*(s,i)$ and $\tilde{\boldsymbol{l}}(s,i+1) = \hat{\boldsymbol{l}}(s,i)$. Based on (28) and (32), it follows that

$$\tilde{b}_u^T\big(\tilde{\zeta}(s,i+1) - A_\zeta^{s-1}D_\zeta q(i)\big) + b_h^T\hat{h}^*(s,i) \le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(s,i),$$

which indicates

$$\begin{aligned} \tilde{b}_u^T\tilde{\zeta}(s,i+1) + b_h^T\tilde{h}(s,i+1) &\le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(s,i) + \tilde{b}_u^TA_\zeta^{s-1}D_\zeta q(i)\\ &= h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \sum_{j=0}^{s-1}\max_{q(s-1-j,i)\in\mathbb{Q}} \tilde{b}_u^TA_\zeta^jD_\zeta q(s-1-j,i) + \tilde{b}_u^TA_\zeta^{s-1}D_\zeta q(i)\\ &\le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \sum_{j=0}^{s-1}\max_{q(s-1-j,i+1)\in\mathbb{Q}} \tilde{b}_u^TA_\zeta^jD_\zeta q(s-1-j,i+1)\\ &= h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(s,i+1) \end{aligned}$$

for $s \in \mathbb{Z}[0, T_s-2]$. For $s = T_s-1$, there exist $\tilde{h}(T_s,i+1) = 0_{4\times 1}$ and $\tilde{\boldsymbol{l}}(T_s,i+1) = 0_{6\times 1}$. Based on (29) and (32), together with $\tilde{h}(T_s,i+1)$ and $\tilde{\boldsymbol{l}}(T_s,i+1)$, it is easy to obtain

$$\tilde{b}_u^T\big(\tilde{\zeta}(T_s,i+1) - A_\zeta^{T_s-1}D_\zeta q(i)\big) \le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(T_s,i),$$

which means

$$\begin{aligned} \tilde{b}_u^T\tilde{\zeta}(T_s,i+1) &\le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(T_s,i) + \tilde{b}_u^TA_\zeta^{T_s-1}D_\zeta q(i)\\ &\le h_h + (\bar{w}_d + h_{\varepsilon_3})[\,1\;\; 1\;\; 0_{1\times 6}\,]^T - \chi(T_s,i+1). \end{aligned}$$

Thus, condition (34) holds true for all $s$ within the interval $\mathbb{Z}[0, T_s-1]$.
Finally, the satisfaction of condition (35) is established. Recall that $\zeta_T(T_s,i+1) \triangleq \mathcal{Z}(T_s,i+1) \cap \hat{\mathcal{H}}(T_s,i+1)$. From (33), it follows that $\tilde{\zeta}(T_s,i+1)$ belongs to the set $\mathcal{Z}(T_s,i+1)$. Additionally, based on (29), it can be readily shown that $\tilde{\zeta}(T_s,i+1)$ also lies within $\hat{\mathcal{H}}(T_s,i+1)$.
Therefore, since both membership conditions are satisfied, we conclude that $\tilde{\zeta}(T_s,i+1) \in \zeta_T(T_s,i+1)$ holds true, thereby ensuring the validity of condition (35). □
Theorem 1.
If there exist appropriate parameters $k_1$, $k_2$, $\beta_1$, $\beta_2$, $\beta_3$ and a positive definite matrix $P$ such that the linear matrix inequality (LMI)

$$A_\zeta^TPA_\zeta - P + Q < 0$$

holds, then $e_s(i) \triangleq [\,e_{s1}(i)\;\; e_{s2}(i)\,]^T$ is uniformly bounded under the control input (25), where $e_{s1}(i) \triangleq v_1(i) - s_1(i)$ and $e_{s2}(i) \triangleq v_2(i) - s_2(i)$ denote the tracking errors. Moreover, $e_s(i)$ converges to the set $\mathbb{E} \triangleq \{e_s(i) : \|e_s(i)\| \le \|M\|\,\underline{\lambda}^{-\frac{1}{2}}(Q)\,\sigma^{\frac{1}{2}}\}$ with

$$\begin{aligned} \sigma &\triangleq 2\bar{\lambda}(P)\Theta(T_s)\Omega(T_s) + \bar{\lambda}(P)\Omega^2(T_s) + \sum_{s=1}^{T_s-1}\left(2\bar{\lambda}(Q)\Theta(s)\Omega(s) + \bar{\lambda}(Q)\Omega^2(s)\right),\\ \Theta(s) &\triangleq \max_{b_\zeta^T\hat{\zeta}^*(s,i) \le h_\zeta - \phi(s,i)} \|\hat{\zeta}^*(s,i)\|, \qquad \Omega(s) \triangleq \max_{b_q^Tq(i) \le \bar{q}} \|A_\zeta^{s-1}D_\zeta q(i)\|,\\ M &\triangleq \begin{bmatrix} 1 & 0 & -1 & 0 & 0 & 0 & 0 & 0\\ 0 & 1 & 0 & -1 & 0 & 0 & 0 & 0 \end{bmatrix}. \end{aligned}$$
Proof. 
Considering $e_{s1}(i) = r_1(i) - \varepsilon_1(i)$ and $e_{s2}(i) = r_2(i) - \varepsilon_2(i)$, it can be obtained that $|e_{s1}(i)| \le |r_1(i)| + |\varepsilon_1(i)|$ and $|e_{s2}(i)| \le |r_2(i)| + |\varepsilon_2(i)|$. Subsequently, the proof of uniform boundedness of $e_{s1}(i)$ and $e_{s2}(i)$ is given first.
Denote the optimal cost of Prob 1 at time instant $i$ as

$$J(\hat{\boldsymbol{l}}^*(i), \hat{h}^*(i)) \triangleq \sum_{s=0}^{T_s-1}\left(\|\hat{\zeta}^*(s,i)\|_Q^2 + \|\hat{h}^*(s,i)\|_R^2\right) + \|\hat{\zeta}^*(T_s,i)\|_P^2.$$

Then, a Lyapunov candidate function is denoted as $V(i) \triangleq J(\hat{\boldsymbol{l}}^*(i), \hat{h}^*(i))$. Consequently, there exists

$$\Delta V(i) \triangleq J(\hat{\boldsymbol{l}}^*(i+1), \hat{h}^*(i+1)) - J(\hat{\boldsymbol{l}}^*(i), \hat{h}^*(i)).$$

Given that the optimal solution of Prob 1 is obtained at time instant $i$, Prob 1 has at least one feasible solution at $i+1$ by Lemma 1. Therefore, the feasible cost of Prob 1 at time instant $i+1$ can be denoted as

$$J(\tilde{\boldsymbol{l}}(i+1), \tilde{h}(i+1)) \triangleq \sum_{s=0}^{T_s-1}\left(\|\tilde{\zeta}(s,i+1)\|_Q^2 + \|\tilde{h}(s,i+1)\|_R^2\right) + \|\tilde{\zeta}(T_s,i+1)\|_P^2.$$

Denote

$$\Delta\tilde{V}(i) \triangleq J(\tilde{\boldsymbol{l}}(i+1), \tilde{h}(i+1)) - J(\hat{\boldsymbol{l}}^*(i), \hat{h}^*(i)).$$

Split $\Delta\tilde{V}(i) \triangleq \Delta_1(i) + \Delta_2(i) + \Delta_3(i)$, where

$$\begin{aligned} \Delta_1(i) &\triangleq \sum_{s=1}^{T_s-1}\left(\|\tilde{\zeta}(s,i+1)\|_Q^2 - \|\hat{\zeta}^*(s,i)\|_Q^2\right) + \sum_{s=1}^{T_s-1}\left(\|\tilde{h}(s,i+1)\|_R^2 - \|\hat{h}^*(s,i)\|_R^2\right),\\ \Delta_2(i) &\triangleq \|\tilde{\zeta}(T_s,i+1)\|_Q^2 + \|\tilde{h}(T_s,i+1)\|_R^2 + \|\tilde{\zeta}(T_s+1,i+1)\|_P^2 - \|\hat{\zeta}^*(T_s,i)\|_P^2,\\ \Delta_3(i) &\triangleq -\|\hat{\zeta}^*(0,i)\|_Q^2 - \|\hat{h}^*(0,i)\|_R^2. \end{aligned}$$
For $\Delta_1(i)$, there exists $\tilde{h}(s,i+1) = \hat{h}^*(s,i)$ for $s \in \mathbb{Z}[1, T_s-1]$. Therefore, it is obtained that

$$\begin{aligned} \Delta_1(i) &= \sum_{s=1}^{T_s-1}\|\hat{\zeta}^*(s,i) + A_\zeta^{s-1}D_\zeta q(i)\|_Q^2 - \sum_{s=1}^{T_s-1}\|\hat{\zeta}^*(s,i)\|_Q^2\\ &= \sum_{s=1}^{T_s-1}\left(\|\hat{\zeta}^*(s,i) + A_\zeta^{s-1}D_\zeta q(i)\|_Q + \|\hat{\zeta}^*(s,i)\|_Q\right)\left(\|\hat{\zeta}^*(s,i) + A_\zeta^{s-1}D_\zeta q(i)\|_Q - \|\hat{\zeta}^*(s,i)\|_Q\right)\\ &\le \sum_{s=1}^{T_s-1}\left(2\|\hat{\zeta}^*(s,i)\|_Q + \|A_\zeta^{s-1}D_\zeta q(i)\|_Q\right)\|A_\zeta^{s-1}D_\zeta q(i)\|_Q\\ &\le \sum_{s=1}^{T_s-1}\left(2\bar{\lambda}(Q)\Theta(s)\Omega(s) + \bar{\lambda}(Q)\Omega^2(s)\right), \end{aligned}$$

where

$$\Theta(s) \triangleq \max_{b_\zeta^T\hat{\zeta}^*(s,i) \le h_\zeta - \phi(s,i)} \|\hat{\zeta}^*(s,i)\|, \qquad \Omega(s) \triangleq \max_{b_q^Tq(i) \le \bar{q}} \|A_\zeta^{s-1}D_\zeta q(i)\|.$$
For $\Delta_2(i)$, there exists $\tilde{h}(T_s,i+1) = 0_{4\times 1}$, so that $\|\tilde{h}(T_s,i+1)\|_R^2 = 0$ and $\tilde{\zeta}(T_s+1,i+1) = A_\zeta\tilde{\zeta}(T_s,i+1)$. Then, it follows that

$$\begin{aligned} \Delta_2(i) &= \|\tilde{\zeta}(T_s,i+1)\|_Q^2 + \|A_\zeta\tilde{\zeta}(T_s,i+1)\|_P^2 - \|\hat{\zeta}^*(T_s,i)\|_P^2\\ &= \|\tilde{\zeta}(T_s,i+1)\|_P^2 - \|\hat{\zeta}^*(T_s,i)\|_P^2 - \|\tilde{\zeta}(T_s,i+1)\|_{\bar{Q}}^2\\ &\le \left(\|\hat{\zeta}^*(T_s,i) + A_\zeta^{T_s-1}D_\zeta q(i)\|_P + \|\hat{\zeta}^*(T_s,i)\|_P\right)\|A_\zeta^{T_s-1}D_\zeta q(i)\|_P - \|\tilde{\zeta}(T_s,i+1)\|_{\bar{Q}}^2\\ &\le 2\bar{\lambda}(P)\Theta(T_s)\Omega(T_s) + \bar{\lambda}(P)\Omega^2(T_s) - \|\tilde{\zeta}(T_s,i+1)\|_{\bar{Q}}^2, \end{aligned}$$

where $\bar{Q} \triangleq P - Q - A_\zeta^TPA_\zeta$. If there exist appropriate parameters $k_1$, $k_2$, $\beta_1$, $\beta_2$, $\beta_3$ and a positive definite matrix $P$ such that the LMI

$$A_\zeta^TPA_\zeta - P + Q < 0$$

holds, then $\|\tilde{\zeta}(T_s,i+1)\|_{\bar{Q}}^2 \ge 0$. In this case, it follows that

$$\Delta_2(i) \le 2\bar{\lambda}(P)\Theta(T_s)\Omega(T_s) + \bar{\lambda}(P)\Omega^2(T_s).$$
For $\Delta_3(i)$, it yields $\Delta_3(i) \le -\|\hat{\zeta}^*(0,i)\|_Q^2$.
Combining $\Delta_1(i)$, $\Delta_2(i)$, and $\Delta_3(i)$, we have

$$\Delta\tilde{V}(i) = \Delta_1(i) + \Delta_2(i) + \Delta_3(i) \le -\|\hat{\zeta}^*(0,i)\|_Q^2 + 2\bar{\lambda}(P)\Theta(T_s)\Omega(T_s) + \bar{\lambda}(P)\Omega^2(T_s) + \sum_{s=1}^{T_s-1}\left(2\bar{\lambda}(Q)\Theta(s)\Omega(s) + \bar{\lambda}(Q)\Omega^2(s)\right).$$

According to the principle of optimality, the feasible cost is no smaller than the optimal cost, which indicates $\Delta V(i) \le \Delta\tilde{V}(i)$. Then, there exists

$$\Delta V(i) \le -\|\hat{\zeta}^*(0,i)\|_Q^2 + 2\bar{\lambda}(P)\Theta(T_s)\Omega(T_s) + \bar{\lambda}(P)\Omega^2(T_s) + \sum_{s=1}^{T_s-1}\left(2\bar{\lambda}(Q)\Theta(s)\Omega(s) + \bar{\lambda}(Q)\Omega^2(s)\right).$$

It follows that $\Delta V(i) < 0$ whenever

$$\|\hat{\zeta}^*(0,i)\|_Q^2 > 2\bar{\lambda}(P)\Theta(T_s)\Omega(T_s) + \bar{\lambda}(P)\Omega^2(T_s) + \sum_{s=1}^{T_s-1}\left(2\bar{\lambda}(Q)\Theta(s)\Omega(s) + \bar{\lambda}(Q)\Omega^2(s)\right).$$

This indicates that $\zeta(i) = \hat{\zeta}^*(0,i)$ will converge to the set $\tilde{\mathbb{E}} \triangleq \{\zeta(i) : \|\zeta(i)\|_Q^2 \le \sigma\}$ under $\hat{h}^*(0,i)$ as $i \to +\infty$, with

$$\sigma \triangleq 2\bar{\lambda}(P)\Theta(T_s)\Omega(T_s) + \bar{\lambda}(P)\Omega^2(T_s) + \sum_{s=1}^{T_s-1}\left(2\bar{\lambda}(Q)\Theta(s)\Omega(s) + \bar{\lambda}(Q)\Omega^2(s)\right).$$
Next, the proof that $e_s(i)$ is uniformly bounded is given. It is easy to get $e_s(i) = M\zeta(i)$, where

$$M = \begin{bmatrix} 1 & 0 & -1 & 0 & 0 & 0 & 0 & 0\\ 0 & 1 & 0 & -1 & 0 & 0 & 0 & 0 \end{bmatrix}.$$

Then, as $i \to +\infty$, there exists

$$\|e_s(+\infty)\| = \|M\zeta(+\infty)\| \le \|M\|\,\|\zeta(+\infty)\| \le \bar{\sigma},$$

with $\bar{\sigma} \triangleq \|M\|\,\underline{\lambda}^{-\frac{1}{2}}(Q)\,\sigma^{\frac{1}{2}}$, which suggests that $e_s(i)$ will converge to the set

$$\mathbb{E} \triangleq \{e_s(i) : \|e_s(i)\| \le \|M\|\,\underline{\lambda}^{-\frac{1}{2}}(Q)\,\sigma^{\frac{1}{2}}\}$$

as $i \to +\infty$. That is, the tracking error $e_s(i)$ is uniformly bounded under the control input (25). This completes the proof. □
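The LMI of Theorem 1 can be checked numerically. A minimal sketch with cvxpy is given below; it searches for a positive definite $P$ satisfying $A_\zeta^TPA_\zeta - P + Q < 0$ as a semidefinite feasibility problem (such a $P$ exists only if the chosen gains render $A_\zeta$ Schur stable).

```python
import cvxpy as cp
import numpy as np

def theorem1_lmi_feasible(A_zeta, Q, margin=1e-8):
    """Search for P > 0 with A_zeta^T P A_zeta - P + Q < 0 (Theorem 1)."""
    n = A_zeta.shape[0]
    P = cp.Variable((n, n), symmetric=True)
    cons = [P >> margin * np.eye(n),
            A_zeta.T @ P @ A_zeta - P + Q << -margin * np.eye(n)]
    prob = cp.Problem(cp.Minimize(0), cons)
    prob.solve()
    feasible = prob.status in (cp.OPTIMAL, cp.OPTIMAL_INACCURATE)
    return feasible, (P.value if feasible else None)
```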
Remark 4.
The main advantages of the proposed methodology are twofold: (1) a novel RH-ESO that significantly enhances disturbance estimation accuracy is proposed. By reformulating the disturbance estimation error as prediction states and recursively solving a constrained optimization problem at each sampling instant, this innovative structure guarantees optimal observer gain selection with proven convergence properties. (2) The MPC-enabled disturbance-rejection controller is proposed with disturbance compensation, which enables the closed-loop system to simultaneously achieve prescribed optimal control performance and maintain active disturbance-rejection capabilities.

4. Numerical Example

4.1. Time-Invariant Reference Input Signal Tracking

In this subsection, a numerical example of a pneumatic manipulator system is employed to validate the method and demonstrate its effectiveness. The simulation specifies an initial deflection angle of $0^\circ$ and a target deflection angle of $10^\circ$; i.e., $v_0(t) = 10^\circ$. In this part, system (1) is used to demonstrate the effectiveness of the proposed method.
Parameters of the pneumatic manipulator system used in this subsection are presented in Table 1.
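Before discussing the results, the following sketch shows a minimal closed-loop wiring of the components coded earlier (TD, RH-ESO, and control law (25)). The receding-horizon optimizer is stubbed out for brevity, with `c = 0` and `l = (0, 0, 0)`, which degenerates to ADRC-like behavior; the full method would obtain `(c, l)` from Prob 1 at every step. The TD speed factor follows from Table 1 as $r_s = d/T = 100$.

```python
v0, rs = 10.0, 100.0                      # constant reference (deg), TD speed factor
s1 = s2 = v1 = v2 = 0.0
s_hat = (0.0, 0.0, 0.0)
for _ in range(int(50 / T)):              # 50 s simulation horizon
    v1, v2 = td_step(v1, v2, v0, rs, T)   # transient-shaped reference
    r1, r2 = v1 - s_hat[0], v2 - s_hat[1]
    u = control_input(r1, r2, c=0.0, s3_hat=s_hat[2])
    s_hat = rh_eso_step(s_hat, y=s1, u=u, l=(0.0, 0.0, 0.0))
    s1, s2 = step(s1, s2, u)              # plant update via model (3)
```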
In the following, the simulation results of the pneumatic manipulator system under the proposed MPC-enabled disturbance-rejection controller and under the ADRC are compared in Figure 2 to show the effectiveness of the proposed method.
Figure 2a plots the deflection angle $s_1(i)$ of the pneumatic manipulator system under the proposed MPC-enabled disturbance-rejection controller and the ADRC, respectively. From Figure 2a, it is evident that the proposed method drives the deflection angle closer to the $10^\circ$ setpoint. Figure 2b plots the angular velocity $s_2(i)$ of the pneumatic manipulator system.
Figure 2c–e plot the estimation errors $\varepsilon_m(i)$ ($m = 1,2,3$), respectively. From Figure 2c–e, it is evident that the estimation errors under the proposed method exhibit smaller steady-state magnitudes than those of the traditional ADRC approach. Figure 2f plots the control input of the pneumatic manipulator system. In Figure 2f, it can be found that the proposed method effectively mitigates excessive control signal magnitudes, preventing potential actuator saturation and enhancing operational stability. As a result, from Figure 2a–f, the effectiveness of the proposed method is verified.

4.2. Time-Varying Reference Input Signal Tracking

In this subsection, the simulation specifies an initial deflection angle of $0^\circ$ and a time-varying target deflection angle of $v_0(t) = 10\sin(0.06 \times 2\pi t + \pi/2)$. Furthermore, two simulation comparison groups are considered: (i) the back-stepping control (BSC) method described in [30], and (ii) a classical proportional–derivative (PD) controller. Note that system (37) is used in this subsection to demonstrate the effectiveness of the proposed method:

$$\ddot{\theta} = \frac{\tilde{R}k_0}{2\pi N^2m\tilde{l}^2}\left(3L_0^2 - b^2\right)u - \frac{3\tilde{R}^2L_0p_0}{\pi N^2m\tilde{l}^2}\theta - \frac{g\sin\theta}{\tilde{l}} + \frac{3\tilde{R}^3k_0}{2\pi N^2m\tilde{l}^2}\theta^2u + d + f, \quad (37)$$

where $d = \frac{2\sin t}{20}$ is an external disturbance term and $f = \frac{\sin\theta}{2}$ is a model uncertainty term.
The parameters of the pneumatic manipulator system and the MPC scheme are the same as in Table 1. For the given reference input signal $v_0(t) = 10\sin(0.06 \times 2\pi t + \pi/2)$, the parameters of the RH-ESO, ADRC, PD, and BSC controllers are presented in Table 2, and the results are given in Figure 3.
Figure 3a plots the deflection angle of the pneumatic manipulator system (37). Figure 3b plots the deflection angle tracking error of the pneumatic manipulator system (37). Figure 3c plots the decision variables $l_m(t)$ ($m = 1,2,3$) of the RH-ESO (9)–(11). Figure 3d plots the total costs throughout the entire simulation time of the pneumatic manipulator system (37) under the controller of this paper, the ADRC, the PD controller, and the BSC, respectively. Note that the cost throughout the entire simulation time is calculated by the following formula:

$$J(t) = \int_0^t\left(\|s_1(\tau) - v_0(\tau)\|_Q^2 + \|u(\tau)\|_R^2\right)d\tau,$$

where $t = 50$, $Q = I$, and $R = 0.01I$.
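For reproducibility, this comparison cost can be approximated from logged trajectories by a Riemann sum; the helper below is a sketch under the stated weights $Q = I$ and $R = 0.01I$ (scalar `q` and `r` suffice here because the tracking error and the input are scalar signals):

```python
def total_cost(s1, v0, u, T=0.01, q=1.0, r=0.01):
    """Discrete approximation of J(t) over logged trajectories s1, v0, u."""
    s1, v0, u = map(np.asarray, (s1, v0, u))
    return float(np.sum((q * (s1 - v0)**2 + r * u**2) * T))
```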
Under the same input signal, i.e., $v_0(t) = 10\sin(0.06 \times 2\pi t + \pi/2)$, the controller gain $k_1$ is increased from 70 to 130 (see Table 3). The corresponding simulation results are shown in Figure 4.
Figure 4. Simulation results for reference signal $v_0(t) = 10\sin(0.06 \times 2\pi t + \pi/2)$ of the pneumatic manipulator system using the parameters given in Table 3. (a) Deflection angle of the pneumatic manipulator system. (b) Deflection angle tracking error of the pneumatic manipulator system. (c) Decision variables of the RH-ESO. (d) Costs of the pneumatic manipulator system under these methods.
From Figure 3a and Figure 4a, it can be clearly seen that the method of [30] exhibits overshoot and vibration phenomena in the early stages of the simulation because its control signal fluctuates strongly. Nevertheless, during the early stage of state convergence, the system still experiences considerable overshoot accompanied by minor residual oscillations. This behavior is primarily attributable to the controller structure adopted in [30], which is given as
$$u(t) = \frac{1}{k_pb_0}\left[k_pb_1(t)s_1(t) - k_i\dot{e}(t) + \alpha_2(t)\int_0^t\left(\xi_3\varsigma_3(\tau) + \varsigma_2(\tau)\right)d\tau + k_p\ddot{v}_1(t) - k_pb_0\hat{s}_3(t)\right].$$
In this controller, an acceleration-level variable v ¨ 1 ( t ) and a velocity-level variable e ˙ ( t ) are employed. On the one hand, v ¨ 1 ( t ) is driven only by the reference signal, yet it exhibits large variations during the initial stage of the tracking differentiator, which seeks to follow the reference rapidly. On the other hand, the rate of change of e ˙ ( t ) in the control law u ( t ) is much higher than that of e ( t ) . When e ( t ) is large, e ˙ ( t ) oscillates frequently with considerable amplitude. These rapid oscillations further induce high-frequency large-magnitude oscillations in u ( t ) , leading to overshoot and oscillations in the state s 1 ( t ) at the outset. As e ( t ) gradually converges, its amplitude diminishes, the amplitude of e ˙ ( t ) likewise decreases, and the oscillations in u ( t ) become weaker and smaller. Consequently, during the subsequent tracking process, s 1 ( t ) no longer exhibits frequent oscillations or large excursions.
From Figure 3a and Figure 4a, it can also be clearly seen that the estimated value $\hat{s}_1(t)$ oscillates during the simulation. The oscillations in the angle estimate $\hat{s}_1(i)$ are directly caused by the decision vector $l^*(i) = [\,l_1^*(i)\;\; l_2^*(i)\;\; l_3^*(i)\,]^T$. In the receding-horizon optimization problem proposed here, the variable to be optimized is defined as $h(i) \triangleq [\,c^T(i)\;\; l^T(i)\,]^T$. Because the receding-horizon strategy minimizes a cost that explicitly anticipates the evolution of states and decision variables over a future horizon, solving the optimization at time step $i$ immediately yields the optimal decision vector $l^*(i)$. At the next instant, however, the optimal vector $l^*(i+1)$ is obtained by re-solving the problem with the updated initial state $\bar{\zeta}(i+1) \triangleq [\,\bar{r}^T(i+1)\;\; \bar{\varepsilon}_s^T(i+1)\,]^T$ and therefore bears no explicit relation to $l^*(i)$. Consequently, the value of $l^*(i)$ is determined solely by the optimization outcome. The oscillatory behavior of $l^*(i)$ thus corroborates the role of the receding-horizon mechanism: at each sampling instant, it predicts the state trajectory over the horizon from the current initial state and returns the decision vector that is optimal for that instant.
From Figure 3 and Figure 4, it can be seen that the proposed method achieves smoother deflection angle tracking and the lowest overall control cost. Moreover, by comparing the simulation results, it is evident that the proposed method is insensitive to the controller gain $k_1$, whereas the traditional ADRC method is more sensitive to this gain. This further demonstrates the effectiveness of the proposed method.

4.3. Analysis of Estimation Error Under Different Observer Initial Values

In this subsection, we investigate the evolution of the estimation error when the RH-ESO initial states differ from those of the system. Table 4 lists three initial states of the RH-ESO together with the common initial state of the system.
Firstly, the case in which the reference signal is $v_0(t) = 10^\circ$ is shown. The parameters of the proposed RH-ESO used in this case are those in Table 1. The corresponding results are given in Figure 5, Figure 6 and Figure 7.
From Figure 6 and Figure 7, it is easy to observe that the estimation errors $\varepsilon_2(t)$ and $\varepsilon_3(t)$ take large values initially. According to (12)–(14), when the same error signal $\varepsilon_1(i)$ is used, the magnitudes of the terms $\beta_2\varepsilon_1(i)$ and $\beta_3\varepsilon_1(i)$ greatly exceed that of $\beta_1\varepsilon_1(i)$. This results in relatively large values of $\varepsilon_2(i)$ and $\varepsilon_3(i)$ during the initial phase of the simulation. In addition, from Figure 5, Figure 6 and Figure 7, it is easy to see that, the closer the initial value of the observer is to the reference signal, the higher the final observation accuracy when the reference is a constant signal.
Secondly, the case in which the reference signal is v 0 ( t ) = 10 sin ( 0.06 × 2 π t + π / 2 ) is shown. The parameters of the proposed RH-ESO in this case are those in Table 2.
The corresponding results are given in Figure 8, Figure 9 and Figure 10.
From Figure 9 and Figure 10, we can find that $\varepsilon_2(t)$ and $\varepsilon_3(t)$ remain relatively large at the beginning of the simulation. However, they are nevertheless much smaller than those obtained in Figure 6 and Figure 7, owing to the significantly smaller RH-ESO gains used for sinusoidal tracking. Therefore, when the observation error takes a large value in the initial stage, the observer gain can be reduced appropriately to suppress this initial peak.
Furthermore, under these settings, the estimation accuracy is not as good as that achieved with a constant reference, and changing the initial state of the observer does not result in significant differences in the final observation error. This behavior occurs because the disturbance depends on the time-varying states and control inputs, and the disturbance estimate lags the disturbance itself, which in turn delays the estimation of the system state. Therefore, the tracking error cannot converge precisely to zero but instead exhibits a sinusoid-like profile. In general, a high observer gain effectively reduces the steady-state error; however, it also causes excessive overshoot at the beginning of the simulation. Therefore, the observer gain, high or low, should be selected flexibly according to the specific control requirements. In this simulation, the angle tracking error ultimately stays within $\pm 0.05^\circ$, and the observation error during the initial phase is small and thus acceptable. Accordingly, the selected RH-ESO parameters are deemed appropriate.

5. Conclusions

This paper has proposed an MPC-enabled disturbance-rejection controller approach for a pneumatic manipulator system subjected to external disturbances. An RH-ESO has been designed that incorporates a decision variable to deal with the disturbances modeled from the complex nonlinear terms within the system. The optimal disturbance estimation error has been determined through a receding-horizon optimization procedure to obtain the best estimate of the disturbance. Using this optimal estimate, the MPC-enabled disturbance-rejection controller has been developed to enable precise angular trajectory tracking in the pneumatic manipulator system. Moreover, the recursive feasibility of Prob 1 has been guaranteed through rigorous proofs. Additionally, the uniform boundedness of the closed-loop system is mathematically guaranteed. Finally, the simulation results demonstrate the superior performance and effectiveness of the proposed approach.

Author Contributions

Conceptualization, Y.X.; methodology, X.H.; software, Y.X.; validation, X.H., D.Z. and L.W.; formal analysis, P.L.; investigation, Y.X.; resources, X.H.; data curation, D.Z.; writing—original draft preparation, Y.X.; writing—review and editing, D.Z.; supervision, D.Z.; project administration, L.W.; funding acquisition, Y.X. and X.H. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to thank the anonymous reviewers for their detailed comments, which helped to improve the quality of the paper. This work was supported in part by the National Natural Science Foundation of China (Grant Nos. 62303061 and 62403353) and the National Key Laboratory of Science and Technology on Space-Born Intelligent Information Processing (Grant No. TJ-02-22-03).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data used in this paper are provided in Section 4.

Conflicts of Interest

The authors Dongjie Zhu and Liangchao Wu were employed by the company Xingyu Electronics (Ningbo) Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
MPC: Model predictive control
RH-ESO: Receding-horizon-based extended state observer
SMC: Sliding mode control
ADRC: Active disturbance-rejection control
PAMs: Pneumatic artificial muscles
LMIs: Linear matrix inequalities
TD: Tracking differentiator

Nomenclature
$\mathbb{R}$: The real number set
$\mathbb{Z}_{\ge 0}$: The non-negative integer set
$\mathbb{Z}[a,b]$: The set of integers $\{a, a+1, \ldots, b\}$
$A \in \mathbb{R}^{m\times n}$: The dimension of $A$ is $m \times n$
$A > 0$ ($A < 0$): Matrix $A$ is positive definite (negative definite)
$\bar{\lambda}(A)$: The maximum eigenvalue of $A$
$I$: The unit matrix with appropriate dimensions
$\zeta(s,i)$: The $s$-step-ahead prediction of state $\zeta$ conditioned on measurements available at time instant $i$

References

1. Xie, Z.; Mohanakrishnan, M.; Wang, P.; Liu, J.; Xin, W.; Tang, Z.; Wen, L.; Laschi, C. Soft robotic arm with extensible stiffening layer. IEEE Robot. Autom. Lett. 2023, 8, 3597–3604.
2. Low, J.H.; Lee, W.W.; Khin, P.M.; Thakor, N.V.; Kukreja, S.L.; Ren, H.L.; Yeow, C.H. Hybrid tele-manipulation system using a sensorized 3-D-printed soft robotic gripper and a soft fabric-based haptic glove. IEEE Robot. Autom. Lett. 2017, 2, 880–887.
3. Yu, N.; Zhai, Y.; Yuan, Y.; Wang, Z. A bionic robot navigation algorithm based on cognitive mechanism of hippocampus. IEEE Trans. Autom. Sci. Eng. 2019, 16, 1640–1652.
4. Chi, H.R.; Radwan, A.; Huang, N.F.; Tsang, K.F. Guest editorial: Next-generation network automation for industrial internet-of-things in Industry 5.0. IEEE Trans. Ind. Inform. 2022, 19, 2062–2064.
5. Miron, G.; Plante, J.S. Design principles for improved fatigue life of high-strain pneumatic artificial muscles. Soft Robot. 2016, 3, 177–185.
6. Kim, W.; Park, H.; Kim, J. Compact flat fabric pneumatic artificial muscle (ffPAM) for soft wearable robotic devices. IEEE Robot. Autom. Lett. 2021, 6, 2603–2610.
7. Tsai, T.C.; Chiang, M.H. A lower limb rehabilitation assistance training robot system driven by an innovative pneumatic artificial muscle system. Soft Robot. 2023, 10, 1–16.
8. Cho, Y.; Kim, W.; Park, H.; Kim, J.; Na, Y. Bidirectional double-spring pneumatic artificial muscle with inductive self-sensing. IEEE Robot. Autom. Lett. 2023, 8, 8160–8167.
9. Liu, G.; Diao, S.; Liu, Z.; Zhang, X.; Xiao, X.; Men, S.; Sun, N. Practical finite-time compliant control for horizontal pneumatic artificial muscle systems under force-sensorless reflecting. IEEE Trans. Autom. Sci. Eng. 2024, 22, 9515–9527.
10. Wang, Q.; Yang, T.; Liu, G.; Qin, Y.; Fang, Y.; Sun, N. Adaptive compensation tracking control for parallel robots actuated by pneumatic artificial muscles with error constraints. IEEE Trans. Ind. Inform. 2023, 20, 1585–1595.
11. Liang, D.; Sun, N.; Wu, Y.; Liu, G.; Fang, Y. Fuzzy-sliding mode control for humanoid arm robots actuated by pneumatic artificial muscles with unidirectional inputs, saturations, and dead zones. IEEE Trans. Ind. Inform. 2021, 18, 3011–3021.
12. Khaled, T.A.; Akhrif, O.; Bonev, I.A. Dynamic path correction of an industrial robot using a distance sensor and an ADRC controller. IEEE/ASME Trans. Mechatron. 2020, 26, 1646–1656.
13. Diao, S.; Liu, G.; Liu, Z.; Zhou, L.; Sun, W.; Wang, Y.; Sun, N. Prescribed-time adaptive fuzzy control for pneumatic artificial muscle-actuated parallel robots with input constraints. IEEE Trans. Fuzzy Syst. 2023, 32, 2039–2051.
14. Qin, Y.; Zhang, H.; Wang, X.; Sun, N.; Han, J. Adaptive set-membership filter based discrete sliding mode control for pneumatic artificial muscle systems with hardware experiments. IEEE Trans. Autom. Sci. Eng. 2023, 21, 1682–1694.
15. Hosseini, S.A.; Toulabi, M.; Dobakhshari, A.S.; Ashouri-Zadeh, A.; Ranjbar, A.M. Delay compensation of demand response and adaptive disturbance rejection applied to power system frequency control. IEEE Trans. Power Syst. 2019, 35, 2037–2046.
16. Zeng, Y.; Liang, G.; Liu, Q.; Rodriguez, E.; Pou, J.; Jie, H.; Liu, X.; Zhang, X.; Kotturu, J.; Gupta, A. Multiagent soft actor-critic aided active disturbance rejection control of DC solid-state transformer. IEEE Trans. Ind. Electron. 2024, 72, 492–503.
17. Fu, C.; Tan, W. Tuning of linear ADRC with known plant information. ISA Trans. 2016, 65, 384–393.
18. Guo, B.Z.; Zhao, Z.L. On the convergence of an extended state observer for nonlinear systems with uncertainty. Syst. Control Lett. 2011, 60, 420–430.
19. Qin, B.; Yan, H.; Zhang, H.; Wang, Y.; Yang, S.X. Enhanced reduced-order extended state observer for motion control of differential driven mobile robot. IEEE Trans. Cybern. 2021, 53, 1299–1310.
20. Li, Z.; Yan, H.; Zhang, H.; Yang, S.X.; Chen, M. Novel extended state observer design for uncertain nonlinear systems via refined dynamic event-triggered communication protocol. IEEE Trans. Cybern. 2022, 53, 1856–1867.
21. Sun, H.; Madonski, R.; Li, S.; Zhang, Y.; Xue, W. Composite control design for systems with uncertainties and noise using combined extended state observer and Kalman filter. IEEE Trans. Ind. Electron. 2021, 69, 4119–4128.
22. Zhao, L.; Li, Q.; Liu, B.; Cheng, H. Trajectory tracking control of a one degree of freedom manipulator based on a switched sliding mode controller with a novel extended state observer framework. IEEE Trans. Syst. Man Cybern. Syst. 2017, 49, 1110–1118.
23. Zhao, L.; Cheng, H.; Wang, T. Sliding mode control for a two-joint coupling nonlinear system based on extended state observer. ISA Trans. 2018, 73, 130–140.
24. Mayne, D.Q.; Rawlings, J.B.; Rao, C.V.; Scokaert, P.O. Constrained model predictive control: Stability and optimality. Automatica 2000, 36, 789–814.
25. Zeilinger, M.N.; Morari, M.; Jones, C.N. Soft constrained model predictive control with robust stability guarantees. IEEE Trans. Autom. Control 2014, 59, 1190–1202.
26. Li, T.; Sun, X.; Lei, G.; Guo, Y.; Yang, Z.; Zhu, J. Finite-control-set model predictive control of permanent magnet synchronous motor drive systems—An overview. IEEE/CAA J. Autom. Sin. 2022, 9, 2087–2105.
27. Li, P.; Kang, Y.; Wang, T.; Zhao, Y.B. Disturbance prediction-based adaptive event-triggered model predictive control for perturbed nonlinear systems. IEEE Trans. Autom. Control 2022, 68, 2422–2429.
28. Li, N.; Zhang, K.; Li, Z.; Srivastava, V.; Yin, X. Cloud-assisted nonlinear model predictive control for finite-duration tasks. IEEE Trans. Autom. Control 2022, 68, 5287–5300.
29. Wang, M.; Cheng, P.; Zhang, Z.; Wang, M.; Chen, J. Periodic event-triggered MPC for continuous-time nonlinear systems with bounded disturbances. IEEE Trans. Autom. Control 2023, 68, 8036–8043.
30. Zhao, L.; Li, Z.; Li, H.; Liu, B. Backstepping integral sliding mode control for pneumatic manipulators via adaptive extended state observers. ISA Trans. 2024, 144, 374–384.
31. Peuteman, J.; Aeyels, D.; Sepulchre, R. Boundedness properties for time-varying nonlinear systems. SIAM J. Control Optim. 2001, 39, 1408–1422.
Figure 1. Structure diagram of the pneumatic manipulator system.
Figure 2. Simulation results for reference signal $v_0(t) = 10^\circ$ of the pneumatic manipulator system. (a) The state $s_1(i)$ under the proposed method and the ADRC. (b) The state $s_2(i)$ under the proposed method and the ADRC. (c) The estimation error $\varepsilon_1(i)$ under the method in this paper and the ADRC. (d) The estimation error $\varepsilon_2(i)$ under the method in this paper and the ADRC. (e) The estimation error $\varepsilon_3(i)$ under the method in this paper and the ADRC. (f) The control input $u(i)$ under the method in this paper and the ADRC.
Figure 3. Simulation results for reference signal $v_0(t) = 10\sin(0.06 \times 2\pi t + \pi/2)$ of the pneumatic manipulator system using the parameters given in Table 2. (a) Deflection angle of the pneumatic manipulator system. (b) Deflection angle tracking error of the pneumatic manipulator system. (c) Decision variables of the RH-ESO. (d) Costs of the pneumatic manipulator system under these methods.
Figure 5. The estimation error $\varepsilon_1(t)$ under different initial values of the proposed RH-ESO.
Figure 6. The estimation error $\varepsilon_2(t)$ under different initial values of the proposed RH-ESO.
Figure 7. The estimation error $\varepsilon_3(t)$ under different initial values of the proposed RH-ESO.
Figure 8. The estimation error $\varepsilon_1(t)$ under different initial values of the proposed RH-ESO.
Figure 9. The estimation error $\varepsilon_2(t)$ under different initial values of the proposed RH-ESO.
Figure 10. The estimation error $\varepsilon_3(t)$ under different initial values of the proposed RH-ESO.
Table 1. Parameters used in the simulation.

| Parameter | $\tilde{R}$ | $\tilde{l}$ | $L_0$ | $b$ |
| Value | 0.1 (m) | 0.3 (m) | 0.3 (m) | 0.3 (m) |
| Parameter | $p_0$ | $m$ | $g$ | $T$ |
| Value | 1 (kPa) | 1 (kg) | 9.8 (m/s$^2$) | 0.01 (s) |
| Parameter | $k_0$ | $N$ | $\bar{s}_1$ | $\bar{s}_2$ |
| Value | 100 | 8 | 15 | 30 |
| Parameter | $\bar{u}$ | $\bar{w}_d$ | $h_{\varepsilon_1}$ | $h_{\varepsilon_2}$ |
| Value | 1000 | 10 | 1 | 10 |
| Parameter | $h_{\varepsilon_3}$ | $\beta_1$ | $\beta_2$ | $\beta_3$ |
| Value | 100 | 300 | 7000 | 1750 |
| Parameter | $k_1$ | $k_2$ | $\bar{r}_1$ | $\bar{r}_2$ |
| Value | 100 | 30 | 20 | 50 |
| Parameter | $d$ | $T_s$ | $Q$ | $R$ |
| Value | 1 | 5 | $I_{8\times 8}$ | $0.5I_{4\times 4}$ |
Table 2. Controller parameters for the reference signal $v_0(t) = 10\sin(0.06 \times 2\pi t + \pi/2)$.

| Parameters of RH-ESO and ADRC | $\beta_1$ | $\beta_2$ | $\beta_3$ | $k_1$ | $k_2$ |
| Value | 150 | 550 | 100 | 70 | 10 |
| Parameters of PD | $k_p$ | $k_d$ |
| Value | 100 | 2 |
| Parameters of BSC | $\xi_1$ | $\xi_2$ | $\xi_3$ | $\beta_1$ | $\beta_2$ | $\beta_3$ | $k_p$ | $k_i$ |
| Value | 2 | 2 | 2 | 150 | 1000 | 100 | 50 | 21.7 |
Table 3. Controller parameters for the reference signal $v_0(t) = 10\sin(0.06 \times 2\pi t + \pi/2)$ with increased gain $k_1$.

| Parameters of RH-ESO and ADRC | $\beta_1$ | $\beta_2$ | $\beta_3$ | $k_1$ | $k_2$ |
| Value | 150 | 550 | 100 | 130 | 10 |
| Parameters of PD | $k_p$ | $k_d$ |
| Value | 100 | 2 |
| Parameters of BSC | $\xi_1$ | $\xi_2$ | $\xi_3$ | $\beta_1$ | $\beta_2$ | $\beta_3$ | $k_p$ | $k_i$ |
| Value | 2 | 2 | 2 | 150 | 1000 | 100 | 50 | 21.7 |
Table 4. Initial states of the system and the RH-ESO.

| Initial states of the system | $s_1(0)$ | $s_2(0)$ |
| Value | 0 | 0 |
| Case 1: initial states of the RH-ESO | $\hat{s}_1(0)$ | $\hat{s}_2(0)$ |
| Value | 0 | 0 |
| Case 2: initial states of the RH-ESO | $\hat{s}_1(0)$ | $\hat{s}_2(0)$ |
| Value | 5 | 0 |
| Case 3: initial states of the RH-ESO | $\hat{s}_1(0)$ | $\hat{s}_2(0)$ |
| Value | 8 | 0 |