Safety-Constrained Disturbance-Compensated Model Predictive Control for Flexible-Joint Robots

Shiqi Cao; Fan Wang; Xin Li; Dalei Yao; Meilin Xie

doi:10.3390/app152413238

,

and

¹

Xi’an Institute of Optics and Precision Mechanics of CAS, Xi’an 710119, China

²

School of Optoelectronics, University of Chinese Academy of Sciences, Beijing 100049, China

³

Key Laboratory of Space Precision Measurement Technology of CAS, Xi’an 710119, China

^*

Author to whom correspondence should be addressed.

Appl. Sci.2025, 15(24), 13238;https://doi.org/10.3390/app152413238

Version Notes

Order Reprints

Abstract

Flexible-joint robots (FJRs) offer safety and energy efficiency in collaborative tasks, yet achieving high-precision tracking remains challenging under strict state and safety constraints due to elastic coupling, model mismatch, and external disturbances. To address this issue, this paper proposes a safe and disturbance-compensated model predictive control (SDC-MPC) method that integrates model predictive control (MPC) with a disturbance observer (DOB) to estimate and compensate lumped uncertainties and disturbances in real time. To enforce safety, a control barrier function (CBF) is incorporated as an online inequality to maintain forward-invariance safety constraints. The method adapts safety margins to disturbances and allows soft relaxations of constraints when necessary, thereby ensuring feasibility under strong disturbances. A discrete-time implementation makes the approach suitable for real-time applications. Experiments on a single-joint platform demonstrate improved tracking performance and robustness.

Keywords:

flexible-joint robots (FJRs); model predictive control (MPC); disturbance observer (DOB); control barrier functions (CBFs); safety-constrained control

1. Introduction

With the continuous development of robotics technology, the application of flexible-joint robots (FJRs) in collaborative manufacturing and human–robot interaction scenarios has gradually increased. Compared with rigid-joint counterparts, FJRs offer significant advantages in terms of safety and energy efficiency due to their low mass, low output impedance, and inherent compliance [1,2]. These properties mitigate impact forces during physical contact, enable safer power-assisted operation, and reduce actuator effort in long-duration tasks where thermal and energy budgets are tight. However, the elastic transmission simultaneously introduces non-collocation between sensing and actuation, additional elastic modes, and pronounced motor–link coupling, rendering the closed-loop behavior higher-order and potentially non-minimum-phase [3]. Furthermore, practical effects—such as Coulomb/viscous friction, gear backlash, parameter drift caused by temperature and wear, and unknown external loads—can excite resonant peaks and trigger oscillations, thereby undermining both tracking performance and damping effectiveness [4]. Consequently, achieving high-precision position tracking while rigorously enforcing state and safety constraints (e.g., joint/deflection limits, torque/velocity bounds, and human-proximity envelopes) has become a central and persistent challenge in FJR control.

To address these challenges, a variety of control strategies have been proposed, including robust control [5,6], sliding-mode control [7,8], adaptive control [9,10], backstepping control [11], and LQR control [12]. Among these, model predictive control (MPC) stands out as a classical optimization-based framework that predicts the system’s future evolution and computes control inputs via online finite-horizon optimization [13]. MPC naturally incorporates multivariable couplings and explicit state/input constraints and has been widely applied to FJR trajectory tracking and performance trade-offs (e.g., balancing accuracy against actuation effort and deflection suppression) [14,15,16]. Nevertheless, in practice, MPC faces two persistent limitations: (1) sensitivity to model mismatch and disturbances, where even small unmodeled elastic dynamics or load variations propagate into prediction errors, shrinking the feasible set and jeopardizing recursive feasibility and constraint satisfaction; and (2) fragility in constraint handling, where hard constraints may become infeasible under perturbations, while straightforward softening or conservative tightening strategies either weaken safety semantics or induce excessive conservatism without guarantees of inter-sample invariance [17]. These issues are exacerbated in FJRs, as lightly damped flexible modes amplify modeling errors across the prediction horizon and degrade the reliability of terminal ingredients (terminal set/cost) typically used to ensure stability.

To address the first issue, researchers have combined disturbance observers (DOBs) with MPC, yielding DOB-based strategies that improve steady-state accuracy, disturbance rejection, and oscillation suppression for FJRs and flexible actuators, as verified in both simulations and hardware [18,19,20]. Integrating disturbance estimates into the predictive model enables offset-free tracking and reduces bias within the optimizer; alternatively, disturbance-aware constraint tightening can preserve feasibility under bounded perturbations [21,22]. In practice, however, DOB design introduces a trade-off between bandwidth and robustness: a high observer bandwidth accelerates disturbance cancellation but risks amplifying measurement noise and exciting elastic modes, whereas a conservative design may leave residual bias that the MPC must accommodate. Moreover, even with DOB compensation, vanilla MPC lacks a principled mechanism to certify state-space safety under fast, large transients or parameter drifts, and transient infeasibility can still arise when the optimization and the safety filter are not explicitly coordinated.

As safety considerations become paramount in robotic control, control barrier functions (CBFs) have emerged as a unified and computationally tractable formalism to encode forward-invariant “no-go” sets [23]. CBF constraints minimally modify a nominal controller to prioritize safety while preserving performance whenever possible. In robotics, CBFs have been extended to higher-order (relative-degree) formulations suitable for mechanical systems, to measurement- and parameter-robust variants that tolerate estimation error, and to discrete-time implementations compatible with digital controllers; they have been applied to joint-limit enforcement, velocity/force constraints, singularity avoidance, and human–robot safety envelopes [24,25,26]. Yet, directly imposing CBF constraints in the presence of strong disturbances or model mismatch can still cause conservatism (over-tight safety margins that unnecessarily restrict motion) or transient infeasibility (when estimated margins are insufficient). Although existing decoupled architectures have made progress in either disturbance rejection (e.g., MPC + DOB [27,28]) or safety enforcement (e.g., MPC + CBF [29,30]), these two mechanisms often conflict: aggressive compensation from a DOB may push the system towards safety boundaries, while a disturbance-unaware CBF may impose overly conservative constraints, thereby negating the performance gains intended by the disturbance rejection. This disconnect motivates a tighter integration in which safety margins adapt online to estimation quality and predicted transients.

To address these issues, and to fill the gap in FJR control regarding a unified framework that effectively coordinates disturbance rejection with formal safety guarantees—thereby overcoming the difficulty of balancing high-precision tracking and strict safety constraints under significant model uncertainty—this paper proposes a safe and disturbance-compensated MPC (SDC-MPC) method that integrates DOB and CBF within a single, coherent framework. The approach fuses predictive optimization, disturbance compensation, and formal safety enforcement to realize efficient, robust, and safe control for strongly coupled, strictly constrained FJR systems. The key innovations are: (1) an MPC architecture enhanced with a disturbance observer to mitigate model mismatch and unknown inputs, enabling offset-free tracking and improved modal damping without sacrificing noise robustness; (2) a CBF-based safety enforcement mechanism tailored to the predictive model so that safety constraints remain satisfied even under strong disturbances and uncertainties; (3) estimation-driven safety margins and judicious soft-constraint mechanisms that adapt to observer confidence and predicted transient severity, alleviating infeasibility while preserving clear safety semantics; and (4) a discrete-time realization designed for sampled-data implementation with real-time feasibility on embedded targets. Through simulations and hardware experiments, the method demonstrably improves disturbance rejection and tracking accuracy while rigorously maintaining safety constraints, including during fast contact transients and under parameter drift.

To facilitate a clear understanding of the paper’s structure, the remainder is organized as follows. Section 2 introduces the single-joint FJR dynamic model that forms the basis for controller design and analysis, including the motor–link coupling and joint elasticity that drive the need for safety-aware prediction. Section 3 presents the proposed SDC-MPC scheme, detailing the DOB design, the CBF construction, and their integration within the predictive optimization, together with the discrete-time implementation aspects required for real-time deployment. Section 4 validates the approach on a single-joint platform with experiments targeting tracking accuracy, disturbance rejection, oscillation damping, and constraint satisfaction under representative perturbations. Section 5 provides a critical analysis of the mechanisms driving the performance enhancements and discusses the limitations of the proposed method, including scalability and modeling assumptions. Finally, Section 6 concludes the paper and outlines potential extensions to multi-joint systems and interaction-rich tasks.

2. Flexible Joint Robotic Arm Dynamic Model

This section develops a physics-based dynamic model for a single-link flexible-joint robotic arm. Coordinates and gear–spring relations are specified together with standard modeling assumptions. Based on the kinetic/potential energies and Lagrange’s equations, the true-parameter equations of motion are obtained; a controller-oriented nominal form is then produced by introducing the nominal motor inertia

J_{0}

and decomposing disturbances into a motor-channel matched term

d_{1}

and a link-channel unmatched term

d_{2}

. An equivalent continuous-time state-space representation is finally presented to support subsequent estimation and control designs (e.g., DOB, MPC, and CBF).

2.1. Coordinates, Gear Ratio, and Modeling Assumptions

A single-link flexible-joint robotic arm is considered (the physical setup is depicted in Figure 2), where a rotary motor drives the link through a reduction gear and a torsional spring. Denoting the motor-side angle (pre-gear) by

θ

, the link-side angle by q, and the gear ratio by

N_{m} > 0

, the post-gear motor output angle equals

θ / N_{m}

. The spring deflection is

δ ≜ \frac{θ}{N_{m}} - q

(1)

The following assumptions are adopted: (i) the motor rotor and the link are rigid with inertias

J_{m}

and

I_{m}

; (ii) the joint elasticity is linear with stiffness

K_{m}

; (iii) the gear is ideal and power invariance holds; (iv) gravity, friction, and external torques are modeled as generalized forces or lumped into disturbances; (v) the operating range ensures small elastic deflection validating linearization.

2.2. Energy Functions and Lagrange Equations

With

θ

and q as generalized coordinates, the kinetic and potential energies are:

\begin{matrix} T & = \frac{1}{2} J_{m} {\dot{θ}}^{2} + \frac{1}{2} I_{m} {\dot{q}}^{2} \\ V & = \frac{1}{2} K_{m} {(\frac{θ}{N_{m}} - q)}^{2} \end{matrix}

(2)

For an ideal gear, the elastic torque on the link and its reflection to the motor side satisfy

τ_{ℓ} = K_{m} (\frac{θ}{N_{m}} - q)

(3)

τ_{m} = \frac{1}{N_{m}} τ_{ℓ} = \frac{K_{m}}{N_{m}^{2}} θ - \frac{K_{m}}{N_{m}} q

(4)

Lagrange’s equations with nonconservative generalized forces

Q_{θ}

and

Q_{q}

read

\frac{d}{d t} (\frac{\partial T}{\partial \dot{ξ}}) - \frac{\partial T}{\partial ξ} + \frac{\partial V}{\partial ξ} = Q_{ξ}, ξ \in {θ, q}

(5)

Specifying the generalized forces by

Q_{θ} = u + Δ, Q_{q} = Γ_{q}

(6)

and noting

\begin{matrix} \frac{\partial V}{\partial θ} & = \frac{K_{m}}{N_{m}^{2}} θ - \frac{K_{m}}{N_{m}} q \\ \frac{\partial V}{\partial q} & = - \frac{K_{m}}{N_{m}} θ + K_{m} q \end{matrix}

(7)

one obtains the true-parameter equations of motion:

J_{m} \ddot{θ} + \frac{K_{m}}{N_{m}^{2}} θ - \frac{K_{m}}{N_{m}} q = u + Δ

(8)

I_{m} \ddot{q} - \frac{K_{m}}{N_{m}} θ + K_{m} q = Γ_{q}

(9)

which are equivalently written as

\ddot{θ} = - \frac{K_{m}}{J_{m} N_{m}^{2}} θ + \frac{K_{m}}{J_{m} N_{m}} q + \frac{1}{J_{m}} u + \frac{1}{J_{m}} Δ

(10)

\ddot{q} = \frac{K_{m}}{I_{m} N_{m}} θ - \frac{K_{m}}{I_{m}} q + \frac{1}{I_{m}} Γ_{q}

(11)

2.3. Nominal-Inertia Rewriting and Disturbance Decomposition

Introducing a nominal motor inertia

J_{0}

and adding/subtracting

\frac{1}{J_{0}} u

in (10), while keeping the stiffness gains in terms of

J_{0}

, yields

\ddot{θ} = - \frac{K_{m}}{J_{0} N_{m}^{2}} θ + \frac{K_{m}}{J_{0} N_{m}} q + \frac{1}{J_{0}} u + \underset{d_{1}}{\underset{︸}{(\frac{1}{J_{m}} - \frac{1}{J_{0}}) u + \frac{1}{J_{m}} Δ}}

(12)

By defining the link-side unmatched disturbance

d_{2} ≜ \frac{1}{I_{m}} Γ_{q}

, the nominal two-channel disturbance model is

\{\begin{matrix} \ddot{θ} & = - \frac{K_{m}}{J_{0} N_{m}^{2}} θ + \frac{K_{m}}{J_{0} N_{m}} q + \frac{1}{J_{0}} u + d_{1} \\ \ddot{q} & = \frac{K_{m}}{I_{m} N_{m}} θ - \frac{K_{m}}{I_{m}} q + d_{2} \end{matrix}

(13)

with the matched disturbance

d_{1} = (\frac{1}{J_{m}} - \frac{1}{J_{0}}) u + \frac{1}{J_{m}} Δ

(14)

where

d_{1}

enters the motor channel whereas

d_{2}

acts on the link dynamics.

2.4. State-Space Representation

Defining

x_{1} = θ

,

x_{2} = \dot{θ}

,

x_{3} = q

,

x_{4} = \dot{q}

, and

x = {[x_{1}, x_{2}, x_{3}, x_{4}]}^{⊤}

, the continuous-time model reads

\{\begin{matrix} \dot{x} & = A_{c} x + B_{c} u + B_{d c} d \\ y & = C_{c} x \end{matrix}

(15)

where

d = {[\begin{matrix} d_{1}, d_{2} \end{matrix}]}^{T}

and the system matrices are given without numbering as

A_{c} = [\begin{matrix} 0 & 1 & 0 & 0 \\ - \frac{K_{m}}{J_{0} N_{m}^{2}} & 0 & \frac{K_{m}}{J_{0} N_{m}} & 0 \\ 0 & 0 & 0 & 1 \\ \frac{K_{m}}{I_{m} N_{m}} & 0 & - \frac{K_{m}}{I_{m}} & 0 \end{matrix}], B_{c} = [\begin{matrix} 0 \\ \frac{1}{J_{0}} \\ 0 \\ 0 \end{matrix}], B_{d c} = [\begin{matrix} 0 & 0 \\ 1 & 0 \\ 0 & 0 \\ 0 & 1 \end{matrix}], C_{c} = [\begin{matrix} 1 & 0 & 0 & 0 \end{matrix}]

3. Safe and Disturbance-Compensated MPC Design

3.1. Disturbance Observer Design

This subsection presents a disturbance observer (DOB) based on the forward Euler discrete-time model derived from Equation (15):

\begin{matrix} x (t + 1) & = A x (t) + B u (t) + B_{d} d (t) \\ y (t) & = C x (t) \end{matrix}

(16)

where t denotes the sampling instant and

T_{s}

is the sample time. The matrices are given as

A = [\begin{matrix} 1 & T_{s} & 0 & 0 \\ - \frac{K_{m} T_{s}}{J_{m} N_{m}^{2}} & 1 & \frac{K_{m} T_{s}}{J_{m} N_{m}} & 0 \\ 0 & 0 & 1 & T_{s} \\ \frac{K_{m} T_{s}}{I_{m} N_{m}} & 0 & - \frac{K_{m} T_{s}}{I_{m}} & 1 \end{matrix}], B = [\begin{matrix} 0 \\ \frac{T_{s}}{J_{m}} \\ 0 \\ 0 \end{matrix}], B_{d} = [\begin{matrix} 0 & 0 \\ T_{s} & 0 \\ 0 & 0 \\ 0 & T_{s} \end{matrix}], C = [\begin{matrix} 1 & 0 & 0 & 0 \end{matrix}]

To encode the slow variation of

d (t)

, the augmented state

x_{a} (t) ≜ [\begin{matrix} x (t) \\ d (t) \end{matrix}]

and the augmented model are considered

\begin{matrix} x_{a} (t + 1) & = A_{a} x_{a} (t) + B_{a} u (t) \\ y (t) & = C_{a} x_{a} (t) \end{matrix}

(17)

where

A_{a} ≜ [\begin{matrix} A & B_{d} \\ 0 & I \end{matrix}], B_{a} ≜ [\begin{matrix} B \\ 0 \end{matrix}], C_{a} ≜ [\begin{matrix} C & 0 \end{matrix}]

and the disturbance is modeled as

d (t + 1) = d (t)

A Luenberger-type observer for (17) is designed as

{\hat{x}}_{a} (t + 1) = A_{a} {\hat{x}}_{a} (t) + B_{a} u (t) + L_{a} (y (t) - C_{a} {\hat{x}}_{a} (t))

(18)

with

{\hat{x}}_{a} (t) = [\begin{matrix} \hat{x} (t) \\ \hat{d} (t) \end{matrix}]

and a structured gain

L_{a} ≜ [\begin{matrix} 0 \\ L \end{matrix}]

. Expanding Equation (18) leads to

\begin{matrix} \hat{x} (t + 1) & = A \hat{x} (t) + B u (t) + B_{d} \hat{d} (t) \\ \hat{d} (t + 1) & = \hat{d} (t) + L (y (t) - C \hat{x} (t)) \end{matrix}

(19)

Using

y (t) = C x (t)

and Equation (19), the DOB takes the compact form

\{\begin{matrix} z (t + 1) & = (I - L B_{d}) [z (t) + L x (t)] - L [A x (t) + B u (t)] \\ \hat{d} (t) & = z (t) + L x (t) \end{matrix}

(20)

where

z (t)

is an auxiliary state and L is the observer gain matrix

Define the observer error

e_{o b} (t) = d (t) - \hat{d} (t)

. The dynamics of the disturbance observer error can then be expressed as

\begin{matrix} e_{o b} (t + 1) & = d (t + 1) - \hat{d} (t + 1) \\ = d (t + 1) - z (t + 1) - L x (t + 1) \\ = d (t + 1) + L B_{d} (z (t) + L x (t)) + L (A x (t) + B u (t)) - L (A x (t) + B u (t)) \\ - L B_{d} d (t) - z (t) - L x (t) \\ = d (t + 1) + L B_{d} (z (t) + L x (t) - d (t)) - z (t) - L x (t) \\ = d (t) - \hat{d} (t) + L B_{d} (\hat{d} (t) - d (t)) \\ = (L B_{d} - I) e_{o b} (t) \end{matrix}

Therefore, as long as an appropriate L is chosen, the observer error

\hat{d} - d

will asymptotically converge to zero.

3.2. Control Barrier Function Design

In this section, a design of a control barrier function (CBF) is presented to enforce the state constraint objective. The CBF is designed to ensure that the robot’s joint angles remain within safe operating limits, preventing collisions or mechanical damage. The CBF is defined as follows:

h = \bar{y} - y = \bar{y} - C x

(21)

where

\bar{y}

represents the upper bound of the system output. Based on this CBF, the safe set

S

is defined as

S = \{x ∣ h (x) \geq 0\}

(22)

which ensures that the system stays within the desired safe operating region.

To guarantee that the safe set

S

is forward invariant, and thus the safety of the system is preserved, it is required that the time derivative of h satisfies

\dot{h} = \dot{\bar{y}} - C \dot{x} = \dot{\bar{y}} - C (A_{c} x + B_{c} u + B_{d c} \hat{d}) \geq - α (h)

(23)

where

α (h)

belongs to the

K

-class. For simplicity, let

α (h) = γ h

with

γ > 0

. Since the controller is implemented on the discrete-time model, the derivative condition (23) is enforced via the forward difference:

Δ h (x (t), u (t)) = h (x (t + 1)) - h (x (t)) \geq - γ h (x (t))

(24)

Using

h (x) = \bar{y} - C x

and the discrete model

x (t + 1) = A x (t) + B u (t) + B_{d} \hat{d} (t)

, the one–step increment of h is

\begin{matrix} h (x (t + 1)) - h (x (t)) & = \bar{y} (t + 1) - C [A x (t) + B u (t) \\ + B_{d} \hat{d} (t)] - [\bar{y} (t) - C x (t)] \end{matrix}

(25)

For the present output choice

C = [1 0 0 0]

, one has

C B = 0

, so the one–step discrete CBF does not directly constrain

u (t)

. A discrete high–order CBF with relative degree two is therefore adopted.

Define the first and second forward differences

h_{1} (t) ≜ h (x (t + 1)) + ρ_{1} h (x (t))

(26)

h_{2} (t) ≜ h (x (t + 2)) + ρ_{2} h_{1} (t)

(27)

with design parameters

ρ_{1}, ρ_{2} \in (0, 1]

. Enforcing

h_{2} (t) \geq 0

yields an affine inequality in the decision variables. Using

x (t + 2) = A^{2} x (t) + A B u (t) + B u (t + 1) + (A B_{d} + B_{d}) \hat{d} (t)

together with

h = \bar{y} - C x

, one obtains

\begin{matrix} \bar{y} (t + 2) - C A^{2} x (t) - C (A B_{d} + B_{d}) \hat{d} (t) \\ + ρ_{2} [\bar{y} (t + 1) - C A x (t) - C B_{d} \hat{d} (t) \\ + ρ_{1} (\bar{y} (t) - C x (t))] \geq C A B u (t) + C B u (t + 1) \end{matrix}

(28)

Since

C B = 0

, Equation (28) reduces to a scalar bound on

u (t)

:

\begin{matrix} \bar{y} (t + 2) - C A^{2} x (t) - C (A B_{d} + B_{d}) \hat{d} (t) \\ + ρ_{2} [\bar{y} (t + 1) - C A x (t) - C B_{d} \hat{d} (t) + ρ_{1} (\bar{y} (t) - C x (t))] \geq C A B u (t) \end{matrix}

(29)

For the present model in (16),

C A B = T_{s}^{2} / J_{m} > 0

. Hence it is convenient to define the explicit input bound:

\begin{matrix} β_{C B F} ≜ \frac{1}{C A B} { & \bar{y} (t + 2) - C A^{2} x (t) - C (A B_{d} + B_{d}) \hat{d} (t) \\ + ρ_{2} [\bar{y} (t + 1) - C A x (t) - C B_{d} \hat{d} (t) + ρ_{1} (\bar{y} (t) - C x (t))]} \end{matrix}

(30)

and impose

u (t) \leq β_{C B F}

(31)

3.3. Model Predictive Controller Design

This subsection formulates a model predictive controller (MPC) designed to achieve offset-free tracking, which ensures that the steady-state error between the system’s output and the reference trajectory converges to zero [13]. In offset-free MPC, the controller compensates for disturbances and modeling errors, ensuring that the system output stays on target despite these factors. Over the prediction horizon, system parameters and the disturbance are assumed constant. The prediction is initialized with the current measurements, i.e.,

x_{0} = x (t)

and

d_{0} = \hat{d} (t)

. The prediction model is defined as:

\{\begin{matrix} x_{k + 1} = A x_{k} + B u_{k} + B_{d} {\hat{d}}_{k} \\ y_{k} = C x_{k} \end{matrix}

(32)

where k denotes the prediction step.

The first-step prediction reads:

\{\begin{matrix} x_{1} = A x_{0} + B u_{0} + B_{d} {\hat{d}}_{0} \\ y_{1} = C x_{1} \end{matrix}

(33)

and the N-th step prediction is

\{\begin{matrix} x_{N} = A^{N} x_{0} + A^{N - 1} B u_{0} + A^{N - 2} B u_{1} + \dots + B u_{N - 1} + \sum_{i = 1}^{N} A^{i - 1} B_{d} {\hat{d}}_{0} \\ y_{N} = C x_{N} \end{matrix}

(34)

Stacking the predicted outputs over the horizon yields the compact relation

Y = M x_{0} + W U + Φ {\hat{d}}_{0}

(35)

with

Y = {[y_{1}^{T} y_{2}^{T} \dots y_{N}^{T}]}^{T}

and

U = {[u_{0}^{T} u_{1}^{T} \dots u_{N - 1}^{T}]}^{T}

, where

M = [\begin{matrix} C A \\ C A^{2} \\ ⋮ \\ C A^{N} \end{matrix}], W = [\begin{matrix} C B & 0 & \dots & 0 \\ C A B & C B & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ C A^{N - 1} B & C A^{N - 2} B & \dots & C B \end{matrix}], Φ = [\begin{matrix} \sum_{i = 1}^{1} C A^{i - 1} B_{d} \\ \sum_{i = 1}^{2} C A^{i - 1} B_{d} \\ ⋮ \\ \sum_{i = 1}^{N} C A^{i - 1} B_{d} \end{matrix}] .

The offset-free MPC objective penalizes the output tracking error and the deviation of the control sequence from a steady-state sequence constructed for the target

(x_{r}, y_{r})

and the current disturbance estimate:

J = {[Y_{r} - Y]}^{T} \bar{Q} [Y_{r} - Y] + {[U_{s} - U]}^{T} \bar{R} [U_{s} - U]

(36)

where

Y_{r} = {[y_{r} y_{r} \dots y_{r}]}^{T}

and

U_{s} = W^{+} [Y_{r} - M x_{r} - Φ {\hat{d}}_{0}] .

(37)

The block-diagonal weights are

\bar{Q} = [\begin{matrix} Q \\ Q \\ ⋱ \\ F \end{matrix}], \bar{R} = [\begin{matrix} R \\ ⋱ \\ R \end{matrix}],

with

Q, R

denoting stage weights and F the terminal weight obtained from

F = A^{T} F A - A^{T} F B {(B^{T} F B + R)}^{- 1} B^{T} F A + Q .

(38)

Substituting Equation (35) into (36) and discarding constant terms that do not affect the optimizer gives the quadratic form:

J = G^{T} U + \frac{1}{2} U^{T} H U

(39)

where

\begin{matrix} G = - 2 \{W^{T} \bar{Q} [Y_{r} - M x_{0} - Φ {\hat{d}}_{0}] + {\bar{R}}^{T} U_{s}\}, \\ H = 2 (W^{T} \bar{Q} W + \bar{R}) . \end{matrix}

Together with actuator bounds and the CBF-induced input bound enforced at each step,

\{\begin{matrix} u_{min} \leq u_{k} \leq u_{max} \\ u_{k} \leq β_{C B F} \end{matrix},

the resulting MPC problem becomes

\begin{matrix} min_{U} J = G^{T} U + \frac{1}{2} U^{T} H U \\ s . t . [\begin{matrix} I \\ ⋮ \\ I \end{matrix}] u_{min} \leq U \leq [\begin{matrix} I \\ ⋮ \\ I \end{matrix}] min {u_{max}, β_{C B F}} \end{matrix}

(40)

which is a convex quadratic program under

Q ⪰ 0

and

R ≻ 0

. The schematic of the proposed disturbance-compensated model predictive control (SDC-MPC) method is shown in Figure 1.

Figure 1. Control block Diagram of SDC-MPC.

To establish closed-loop stability and offset-free tracking, Equation (40) is equivalently written as:

\begin{matrix} min_{u_{0}, \dots, u_{N - 1}} & {∥x_{N} - {\bar{x}}_{t}∥}_{F}^{2} + \sum_{k = 0}^{N - 1} {∥x_{k} - {\bar{x}}_{t}∥}_{Q}^{2} + {∥u_{k} - {\bar{u}}_{t}∥}_{R}^{2} \\ s . t . & Γ \leq u_{k} \leq Υ, k = 0, 1, \dots N \\ x_{k + 1} = A x_{k} + B u_{k} + B_{d} d_{k} \\ d_{k + 1} = d_{k} \\ x_{0} = x (t) \\ d_{0} = \hat{d} (t) \end{matrix}

(41)

where

Γ = u_{min}

,

Υ = min {u_{max}, β_{C B F}}

, and

{\bar{x}}_{t}, {\bar{u}}_{t}

are the steady-state targets obtained from

[\begin{matrix} A - I & B \\ C & 0 \end{matrix}] [\begin{matrix} {\bar{x}}_{t} \\ {\bar{u}}_{t} \end{matrix}] = [\begin{matrix} - B_{d} \hat{d} (t) \\ y_{r} \end{matrix}] .

(42)

Theorem 1.

Assuming that the MPC (41) is always feasible and the constraints are continuously satisfied, the steady-state error converges to zero, i.e.,

y_{\infty} = y_{r}

.

Proof.

First, according to Section 3.1, the prediction error satisfies

e_{o b} (t + 1) = Θ_{o b} e_{o b} (t)

(43)

where

Θ_{o b} = (L B_{d} - I)

is Hurwitz under a suitable L.

As

t \to \infty

, the prediction model (32) yields

[\begin{matrix} A - I & B \\ C & 0 \end{matrix}] [\begin{matrix} x_{\infty} \\ u_{\infty} \end{matrix}] = [\begin{matrix} - B_{d} {\hat{d}}_{\infty} \\ y_{\infty} \end{matrix}] .

(44)

Together with Equation (42), this implies

[\begin{matrix} A - I & B \\ C & 0 \end{matrix}] [\begin{matrix} δ x_{\infty} \\ δ u_{\infty} \end{matrix}] = [\begin{matrix} 0 \\ y_{\infty} - y_{r} \end{matrix}]

(45)

where

δ x_{\infty} = x_{\infty} - {\bar{x}}_{\infty}

and

δ u_{\infty} = u_{\infty} - {\bar{u}}_{\infty}

.

Define

δ x_{k} = x_{k} - {\bar{x}}_{t}

and

δ u_{k} = u_{k} - {\bar{u}}_{t}

; then

y_{k} - y_{r} = C δ x_{k}

. Problem (41) becomes

\begin{matrix} min_{δ u_{0}, \dots, δ u_{N - 1}} & {∥δ x_{N}∥}_{F}^{2} + \sum_{k = 0}^{N - 1} {∥δ x_{k}∥}_{Q}^{2} + {∥δ u_{k}∥}_{R}^{2} \\ s . t . & δ x_{k + 1} = A δ x_{k} + B δ u_{k}, δ x_{0} = x (t) - {\bar{x}}_{t} . \end{matrix}

(46)

Solving this problem yields

δ u_{0}^{*} = k_{M P C} δ x_{0} = u (t) - {\bar{u}}_{t}

, hence

u (t) = k_{M P C} δ x_{0} + {\bar{u}}_{t}

(47)

with

k_{M P C} = {(R + B^{T} F B)}^{- 1} B^{T} F A

.

Letting

e_{c} (t) = x (t) - {\bar{x}}_{t}

, one obtains:

\begin{matrix} e_{c} (t + 1) & = x (t + 1) - {\bar{x}}_{t + 1} \\ = A x (t) + B u (t) + B_{d} d (t) - {\bar{x}}_{t + 1} \\ = (A + B k_{M P C}) e_{c} (t) + B_{d} e_{o b} (t) \end{matrix}

(48)

where

Θ_{c} = A + B k_{M P C}

is Hurwitz. Therefore,

[\begin{matrix} e_{c} (t + 1) \\ e_{o b} (t + 1) \end{matrix}] = [\begin{matrix} Θ_{c} & B_{d} \\ 0 & Θ_{o b} \end{matrix}] [\begin{matrix} e_{c} (t) \\ e_{o b} (t) \end{matrix}] .

(49)

It follows that the closed-loop state matrix

[\begin{matrix} Θ_{c} & B_{d} \\ 0 & Θ \end{matrix}]

is Hurwitz, so

e_{c} (t) \to 0

. Since

y (t) - y_{r} = C (x (t) - {\bar{x}}_{t}) = C e_{c} (t)

, the tracking error converges to zero and thus

y_{\infty} = y_{r}

. □

4. Experimental Results

This section evaluates three controllers—traditional MPC, LQR + CBF, and the proposed SDC-MPC —in two scenarios: (i) persistent parametric uncertainty and (ii) external disturbances injected at a known time. Safety constraints are imposed on the joint angle; performance is assessed by angle-tracking accuracy, safety satisfaction, and angular-velocity smoothness.

The experimental platform parameters and nominal MPC weights are as stated previously (Figure 2). The control algorithms were implemented using MATLAB/Simulink (version 2022b, MathWorks, Natick, MA, USA). The relevant parameters of the experimental platform are

J_{m} = 0.54 \times 10^{- 3} kg \cdot m^{2}

,

N_{m} = 1

and

K_{m} = 38.5 Nm / rad

, and the setup is shown in Figure 2. The controller parameters were rigorously tuned based on system dynamics and performance trade-offs. The prediction horizon was set to

N = 10

to balance the capability of capturing dominant transient dynamics (specifically flexible modes) against the computational load required for real-time execution. The weighting matrices were selected as

Q = 10, 000

and

R = 0.001

; this high

Q / R

ratio prioritizes high-precision position tracking while permitting sufficient control authority to actively damp elastic oscillations within actuator limits. Finally, the DOB pole was placed at

p = - 100

following the bandwidth separation principle: this setting ensures the observer converges significantly faster than the outer motion control loop to effectively estimate time-varying disturbances, yet remains limited in bandwidth to prevent the amplification of high-frequency sensor noise. All quantitative results reported in this section represent the average performance metrics derived from three independent experimental trials to ensure statistical reliability.

Figure 2. Experimental setup diagram.

4.1. Tests Under Persistent Parametric Uncertainty

Parametric uncertainty (e.g., inertia mismatch) is present throughout the tests. From a control viewpoint, such mismatch alters the effective inertia ratio and shifts the flexible mode, which tends to magnify boundary approaches when tracking near constraints. Controllers that do not tightly couple prediction, safety enforcement, and bias compensation are prone to either conservative tracking or boundary grazing.

Experiment 1 (angle constraint 0.5 rad). As a baseline, LQR designs a linear state-feedback for a linearized model and is paired with a CBF to enforce safety [12]. SDC-MPC achieves the smallest tracking error while strictly respecting the safety band. Quantitatively, Table 1 presents a comprehensive statistical analysis. In terms of safety, the maximum absolute error (MAE) is 0.051 rad for SDC-MPC, significantly lower than 0.151 rad for MPC and 0.300 rad for LQR + CBF. Moreover, SDC-MPC achieves the lowest root mean square error (RMSE) of 0.019 rad and integral absolute error (IAE) of 0.032 rad·s, indicating that it maintains the highest average tracking precision and minimal cumulative deviation throughout the task. The zoomed view shows visibly smaller excursions near the boundary for SDC-MPC, indicating tighter safety enforcement.

Table 1. Performance Statistics under Parametric Uncertainty (MAE: Max Absolute Error; MVF: Max Velocity Fluctuation).

The advantage comes from the way SDC-MPC embeds the CBF into the predictive optimization: the barrier constraint continuously re-shapes the admissible set so that candidate trajectories avoid unsafe neighborhoods, while the DOB reduces the effective model bias seen by the optimizer. As a result, the controller does not need to “fight” the mismatch with aggressive inputs; instead, it anticipates and moderates the motion, yielding both lower peak error and a clear safety margin in Figure 3.

Figure 3. Parametric uncertainty, angle constraint

0.5 rad

. (a) Overall angle; (b) Zoomed angle.

With the 0.5 rad bound, SDC-MPC balances tracking and safety best, delivering the lowest peak error and the most conservative distance-to-boundary without sacrificing responsiveness.

Experiment 2 (angle constraint 0.45 rad). Under the tighter bound, SDC-MPC continues to enforce the constraint most effectively, with the smallest excursions near the limits and visibly faster recovery in the zoomed view.

Tight constraints accentuate the non-minimum-phase tendency of elastic joints: even modest overshoot can push the deflection towards the boundary. By coordinating prediction, barrier enforcement, and DOB compensation, SDC-MPC shapes the input to avoid exciting the flexible mode when close to the limit, hence the smaller excursions and faster re-convergence observed in Figure 4.

Figure 4. Parametric uncertainty, angle constraint

0.45 rad

. (a) Overall angle; (b) Zoomed angle.

With a 0.45 rad bound, SDC-MPC preserves a clear safety buffer and shortens the transient, maintaining constraint satisfaction despite the reduced feasible set.

Velocity responses (uncertainty). Across both constraints, SDC-MPC yields the smoothest dynamics and the lowest peak velocity, whereas LQR + CBF exhibits the largest fluctuations; MPC is intermediate.

Velocity smoothness reflects how effectively the controller damps the flexible mode while tracking. By compensating lumped bias via DOB and discouraging unsafe motions via CBF, the optimizer avoids high-gain “chasing” behavior, resulting in lower peaks and quicker decay in Figure 5.

Figure 5. Angular-velocity responses under persistent parametric uncertainty. (a) With

0.5 rad

constraint; (b) With

0.45 rad

constraint.

SDC-MPC consistently reduces peak velocities and shortens damping time, indicating stronger elastic-mode suppression under persistent mismatch.

4.2. Tests Under External Disturbances (Injected at $t =$ 1.6 s)

Two groups are conducted with a disturbance injected at

t = 1.6

s (Figure 6). The focus is anti-disturbance capability while maintaining safety. The disturbance excites the flexible mode and challenges both compensation and constraint handling. To rigorously test robustness, a step torque disturbance with a magnitude of

6.66 N \cdot m

was generated by the load motor and injected into the link side. This intentional perturbation emulates a sudden collision or payload change, creating a challenging scenario for stability and constraint enforcement.

Figure 6. External disturbance (Group 1): angle tracking. (a) Overall; (b) Zoomed around

t = 1.6

s.

Group 1. SDC-MPC exhibits the smallest post-disturbance deviation and the fastest resettling in the angle response, while LQR + CBF shows the largest overshoot and oscillations; MPC is intermediate.

Immediately after the disturbance, uncompensated bias and aggressive recovery can amplify oscillations. SDC-MPC jointly limits unsafe excursions (CBF) and compensates the disturbance (DOB) within the horizon, steering the state back without boundary grazing. This explains the reduced peak and faster settling in the zoomed trajectory.

After

t = 1.6

s, SDC-MPC achieves the smallest angle excursion and quickest recovery while preserving the safety margin; MPC is moderate, and LQR + CBF is most sensitive.

Group 2. The same ordering holds: SDC-MPC achieves the smallest excursion immediately after the disturbance and the most rapid recovery; MPC is moderate; LQR + CBF is most sensitive.

Repeating the disturbance under a different operating instance confirms that the advantage is systematic rather than incidental. The combined DOB–CBF mechanism yields smaller peaks regardless of initial phasing, indicating robust rejection and constraint maintenance.

The second run reproduces the same ranking with lower peaks and faster decay for SDC-MPC, consolidating the anti-disturbance evidence.

Velocity responses under disturbances. Plotted together below, the two groups confirm that SDC-MPC suppresses speed spikes most effectively; MPC is moderate; LQR + CBF exhibits the largest peaks.

The peak speed after the disturbance is a sensitive indicator of rejection. By attenuating the lumped disturbance in prediction and constraining unsafe transients, SDC-MPC limits the initial spike and accelerates damping, consistent with the angle-domain findings.

In both groups, SDC-MPC yields the lowest peak speed and the fastest decay, showing the strongest suppression of disturbance-induced oscillations while preserving safety (Figure 7 and Figure 8).

Figure 7. External disturbance (Group 2): angle tracking. (a) Overall; (b) Zoomed around

t = 1.6

s.

Figure 8. Angular-velocity under external disturbances (injected at

t = 1.6

s). (a) Group 1; (b) Group 2.

4.2.1. Quantitative Summary

Peak angle deviation and peak speed fluctuation near

t = 1.6

s are summarized in Table 2. Both groups show a consistent ranking, with SDC-MPC achieving the lowest peaks while ensuring constraint satisfaction. Additionally, the settling time (time to recover to steady state) is introduced to quantify rejection speed. As shown, SDC-MPC settles in approximately 0.15 s, less than half the time of MPC (0.32 s), demonstrating a decisive advantage in restoring equilibrium.

Table 2. Performance Statistics under External Disturbances (MAD: Max Angle Deviation; MSF: Max Speed Fluctuation; ST: Settling Time).

4.2.2. Real-Time Feasibility

To ensure the applicability of the proposed method in real-world scenarios, the computational load was logged during the experiments. The average computation time per MPC iteration was recorded at

6.66 ms

. This duration is well within the system’s sampling interval, confirming that the added complexity of the DOB and CBF constraints does not compromise the real-time performance of the SDC-MPC framework on the target hardware.

4.3. Parameter Sensitivity Analysis

To empirically validate the parameter selection discussed earlier, a sensitivity analysis was conducted regarding the prediction horizon N and the DOB bandwidth p. Table 3 summarizes the performance trade-offs based on the average of three experimental trials. It is observed that increasing N from 5 to 10 significantly reduces the tracking RMSE by capturing slower flexible modes, whereas further increasing N to 20 yields diminishing returns in accuracy but nearly doubles the computation time. Similarly, a moderate DOB pole (

p = - 100

) provides the best balance between disturbance rejection (low MAE) and noise amplification (low velocity fluctuation), validating the chosen tuning.

Table 3. Sensitivity Analysis of Controller Parameters (Average of 3 Trials).

In summary, the experiments demonstrate that the proposed SDC-MPC consistently strengthens safety-constraint satisfaction and robust tracking under both persistent parametric uncertainty and injected external disturbances. Compared with the baselines (MPC and LQR + CBF), it exhibits smaller boundary excursions, smoother velocity transients, and faster recovery without compromising stability, highlighting its suitability for flexible-joint robots operating in interaction-rich, safety-critical scenarios.

5. Discussion

The experimental results presented in Section 4 demonstrate that the proposed SDC-MPC framework significantly outperforms standard MPC and baseline safety schemes in terms of tracking accuracy, disturbance rejection, and constraint satisfaction. This section analyzes the underlying mechanisms driving these improvements, compares the findings with existing literature, and critically discusses the limitations of the current study.

5.1. Comparative Analysis with Recent Frameworks

To explicitly position the proposed method within the recent literature, Table 4 summarizes the key structural differences between SDC-MPC and existing decoupled architectures. We evaluate four critical architectural attributes: disturbance compensation (DC), which indicates the presence of an active observer; formal safety (FS), denoting the use of control barrier functions; disturbance-aware safety (DAS), representing the adaptive coupling between estimation and constraints; and feasibility assurance (FA), referring to mechanisms that prevent solver failure. While recent DOB-based approaches [19,20] focus on DC and CBF-based methods [25,26] prioritize FS, they typically treat these objectives in isolation (DAS: No). The proposed SDC-MPC distinguishes itself by introducing a DAS mechanism, where the estimated disturbance directly modulates the CBF constraints. This unique coupling allows the system to maintain strict safety under varying loads without the excessive conservatism found in robust MPC or the feasibility issues common in hard-constrained schemes.

Table 4. Comparison of Control Frameworks for FJRs (DC: Disturbance Compensation; FS: Formal Safety; DAS: Disturbance-Aware Safety; FA: Feasibility Assurance).

5.2. Mechanism of Performance Enhancement

The superior performance of SDC-MPC can be attributed to the tight coupling between disturbance estimation and safety enforcement. In conventional decoupled approaches (e.g., MPC + DOB without disturbance-aware safety, or MPC+CBF without disturbance compensation), a conflict often arises: the DOB attempts to cancel external forces by demanding aggressive torques, which may drive the system towards state limits, subsequently triggering the safety filter to abruptly cut off inputs. This “fighting” effect leads to the oscillations observed in the LQR + CBF baseline.

In contrast, SDC-MPC integrates the estimated disturbance

\hat{d}

directly into the predictive model used by the CBF mechanism (via the

β_{C B F}

constraint). This allows the controller to anticipate the trajectory deviation caused by the load and the necessary safety margin adjustment simultaneously. Consequently, the solver finds an optimal path that compensates for the disturbance within the safe set, rather than reacting to safety violations after they occur. This structural advantage explains the reduced settling time and lower MAE observed in Table 2.

5.3. Limitations and Future Work

While the proposed method shows promise, several limitations inherent to the current study must be acknowledged to provide a balanced perspective for future research.

5.3.1. Scalability to Multi-Joint Systems

The current validation is conducted on a single-link flexible-joint manipulator. While this setup effectively isolates the fundamental challenges of elasticity and motor-link coupling, extending the method to multi-degree-of-freedom (multi-DOF) robots introduces complex nonlinearities, such as Coriolis and centrifugal forces, which are coupled across joints. Although the theoretical formulation of DOB and CBF can be extended to multi-variable systems, the computational load for the MPC solver will increase quadratically. Future work will investigate efficient solvers (e.g., explicit MPC or warm-started SQP) to maintain real-time feasibility for multi-joint platforms.

5.3.2. Modeling Simplifications

The dynamic model derived in Section 2 assumes linear joint elasticity and ignores nonlinear friction phenomena (e.g., the Stribeck effect) or hysteresis in the harmonic drive. In practice, gear transmission often exhibits nonlinear stiffness and backlash. Although the DOB effectively lumps these unmodeled dynamics into the disturbance term

d_{1}

, relying entirely on the observer for significant structural nonlinearities may degrade the prediction horizon’s accuracy. Integrating data-driven friction models or nonlinear stiffness curves into the prediction model could further enhance performance.

5.3.3. Noise Sensitivity and Bandwidth Trade-Off

The efficacy of SDC-MPC relies heavily on the quality of the disturbance estimate. As noted in the parameter tuning, a high-bandwidth DOB (

p = - 100

) allows for fast disturbance tracking but risks amplifying measurement noise. In the experiments, high-precision encoders were used; however, in industrial settings with lower-resolution sensors or significant electromagnetic interference, the noise amplified by the differentiation in the observer could lead to chattering in the control input. This necessitates a careful trade-off between disturbance rejection speed and noise robustness, or the adoption of advanced filtering techniques (e.g., Kalman Filter-based DOB) in noise-prone environments.

6. Conclusions

This paper addressed the conflict between high-precision tracking and strict safety enforcement in flexible-joint robots (FJRs) by proposing the SDC-MPC framework, which unifies disturbance observation and control barrier functions within predictive optimization. Comprehensive experiments on a hardware platform demonstrated that the proposed method significantly enhances robustness and safety compared to decoupled architectures. Quantitatively, SDC-MPC reduced tracking RMSE by over 65% compared to standard MPC and halved the settling time under external shock loads to approximately 0.15 s, all while rigorously maintaining safety constraints during fast transients. Practically, these results imply that FJRs can operate closer to their physical limits with higher reliability, which is critical for efficiency in collaborative manufacturing and interaction-rich scenarios.

Despite these advancements, the current study is limited to a single-joint configuration with linearized elasticity assumptions, which may not fully capture the complex multi-body couplings and nonlinear transmission effects (e.g., hysteresis) present in full-scale manipulators. Consequently, future research will focus on three key directions: (1) extending the framework to multi-degree-of-freedom systems with coupled nonlinear dynamics; (2) incorporating data-driven friction and stiffness models to further reduce prediction bias; and (3) investigating efficient numerical solvers, such as explicit MPC or warm-started SQP, to ensure real-time feasibility as the system dimension increases.

Author Contributions

Writing—original draft, S.C.; Supervision, F.W., X.L., D.Y. and M.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Science and Technology Major Project, Grant No. 2022ZD0117301.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to the fact that they were collected using the experimental equipment specified in this paper.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Albu-Schffer, A.; Ott, C.; Hirzinger, G. A Unified Passivity-Based Control Framework for Position, Torque and Impedance Control of Flexible Joint Robots. Int. J. Robot. Res. 2007, 26, 23–39. [Google Scholar] [CrossRef]
Ott, C.; Albu-Schffer, A.; Kugi, A.; Hirzinger, G. On the Passivity-Based Impedance Control of Flexible Joint Robots. IEEE Trans. Robot. 2008, 24, 416–429. [Google Scholar] [CrossRef]
Mansfeld, N.; Keppler, M.; Haddadin, S. Speed Gain in Elastic Joint Robots: An Energy Conversion-Based Approach. IEEE Robot. Autom. Lett. 2021, 6, 4600–4607. [Google Scholar] [CrossRef]
Spyrakos, P.E.; Dai, J.S. Stable Flexible-Joint Floating-Base Robot Balancing and Locomotion via Variable Impedance Control. IEEE Trans. Ind. Electron. 2023, 70, 2748–2758. [Google Scholar] [CrossRef]
Jin, M.; Lee, J.; Tsagarakis, N.G. Model-Free Robust Adaptive Control of Humanoid Robots With Flexible Joints. IEEE Trans. Ind. Electron. 2017, 64, 1706–1715. [Google Scholar] [CrossRef]
Zhihua, Q. Input-output robust tracking control design for flexible joint robots. IEEE Trans. Autom. Control 1995, 40, 78–83. [Google Scholar] [CrossRef]
An-Chyau, H.; Yuan-Chih, C. Adaptive sliding control for single-link flexible-joint robot with mismatched uncertainties. IEEE Trans. Control Syst. Technol. 2004, 12, 770–775. [Google Scholar]
Rsetam, K.; Cao, Z.; Man, Z. Design of Robust Terminal Sliding Mode Control for Underactuated Flexible Joint Robot. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 4272–4285. [Google Scholar] [CrossRef]
Spong, M.W. Adaptive Control of Flexible Joint Manipulators: Comments on Two Papers. Automatica 1995, 31, 585–590. [Google Scholar] [CrossRef]
Jiang, H.; Yang, Y.; Hua, C.; Li, X.; Lu, F. Predefined-Time Composite Fuzzy Adaptive Control for Flexible-Joint Manipulator System With High-Order Fully Actuated Control Approach. IEEE Trans. Ind. Electron. 2025, 72, 7191–7199. [Google Scholar] [CrossRef]
Yang, K.; Zhao, L. Command-Filtered Backstepping Control for Flexible Joint Manipulator Systems with Full-State Constraints. Int. J. Control Autom. Syst. 2022, 20, 2231–2238. [Google Scholar] [CrossRef]
Doina, Z.M. LQG/LQR optimal control for flexible joint manipulator. In Proceedings of the 2012 International Conference and Exposition on Electrical and Power Engineering, Iasi, Romania, 25–27 October 2012; pp. 35–40. [Google Scholar]
Maeder, U.; Borrelli, F.; Morari, M. Linear offset-free Model Predictive Control. Automatica 2009, 45, 2214–2222. [Google Scholar] [CrossRef]
Hasan, M.N.; Haris, M.; Pirzada, S.J.H.; Qin, S. Adaptive Model Predictive Control for a Class of Nonlinear Single-Link Flexible Joint Manipulator. In Proceedings of the 2019 Second International Conference on Latest Trends in Electrical Engineering and Computing Technologies (INTELLECT), Karachi, Pakistan, 13–14 November 2019; pp. 1–6. [Google Scholar]
Sun, J.; Peng, C.; Zou, J.; Zhou, Q.; Wu, Y. A novel MPC-NNSMC composite control method for robotic manipulators considering uncertainties and constraints. Control Eng. Pract. 2025, 156, 106238. [Google Scholar] [CrossRef]
Pradhan, S.K.; Subudhi, B. Nonlinear Adaptive Model Predictive Controller for a Flexible Manipulator: An Experimental Study. IEEE Trans. Control Syst. Technol. 2014, 22, 1754–1768. [Google Scholar] [CrossRef]
Mayne, D.Q.; Rawlings, J.B.; Rao, C.V.; Scokaert, P.O.M. Constrained Model Predictive Control: Stability and Optimality. Automatica 2000, 36, 789–814. [Google Scholar] [CrossRef]
Kim, M.J.; Chung, W.K. Disturbance-Observer-Based PD Control of Flexible Joint Robots for Asymptotic Convergence. IEEE Trans. Robot. 2015, 31, 1508–1516. [Google Scholar] [CrossRef]
Tian, L.; Gao, Y.; Zhang, Z. Disturbance-observer-based tracking controller for a flexible-joint robotic manipulator with full-state constraints. Proc. Inst. Mech. Eng. Part I J. Syst. Control Eng. 2022, 236, 1166–1175. [Google Scholar] [CrossRef]
Lee, D.; Back, J.; Oh, S. Workspace Nonlinear Disturbance Observer for Robust Position Control of Flexible Joint Robots. IEEE Robot. Autom. Lett. 2024, 9, 4495–4502. [Google Scholar] [CrossRef]
Yang, J.; Zheng, W.X.; Li, S.; Wu, B.; Cheng, M. Design of a Prediction-Accuracy-Enhanced Continuous-Time MPC for Disturbed Systems via a Disturbance Observer. IEEE Trans. Ind. Electron. 2015, 62, 5807–5816. [Google Scholar] [CrossRef]
Yan, Y.; Yang, J.; Sun, Z.; Li, S.; Yu, H. Non-linear-disturbance-observer-enhanced MPC for motion control systems with multiple disturbances. IET Control Theory Appl. 2020, 14, 63–72. [Google Scholar] [CrossRef]
Ames, A.D.; Coogan, S.; Egerstedt, M.; Notomista, G.; Sreenath, K.; Tabuada, P. Control Barrier Functions: Theory and Applications. In Proceedings of the 2019 18th European Control Conference (ECC), Naples, Italy, 25–28 June 2019. [Google Scholar]
Kurtz, V.; Wensing, P.M.; Lin, H. Control Barrier Functions for Singularity Avoidance in Passivity-Based Manipulator Control. In Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), Austin, TX, USA, 14–17 December 2021; pp. 6125–6130. [Google Scholar]
Heshmati-Alamdari, S.; Sharifi, M.; Karras, G.C.; Fourlas, G.K. Control barrier function based visual servoing for Mobile Manipulator Systems under functional limitations. Robot. Auton. Syst. 2024, 182, 104813. [Google Scholar] [CrossRef]
Liu, J.; Yang, J.; Mao, J.; Zhu, T.; Xie, Q.; Li, Y.; Wang, X.; Li, S. Flexible Active Safety Motion Control for Robotic Obstacle Avoidance: A CBF-Guided MPC Approach. IEEE Robot. Autom. Lett. 2025, 10, 2686–2693. [Google Scholar] [CrossRef]
Ott, C.; Roa, M.A.; Albu-Schäffer, A. An Experimental Study on MPC-Based Joint Torque Control for Flexible Joint Robots. IEEE Robot. Autom. Lett. 2019, 4, 4201–4208. [Google Scholar] [CrossRef]
Chen, W.H.; Yang, J.; Guo, L.; Li, S. A general nonlinear MPC framework for disturbances rejection. In Proceedings of the 19th World Congress the International Federation of Automatic Control, Cape Town, South Africa, 24–29 August 2014; Volume 47, pp. 4640–4645. [Google Scholar]
Zeng, J.; Zhang, B.; Sreenath, K. Safety-Critical Model Predictive Control with Discrete-Time Control Barrier Functions. In Proceedings of the 2021 American Control Conference (ACC), New Orleans, LA, USA, 25–28 May 2021; pp. 3882–3889. [Google Scholar]
Singletary, A.; Ames, A.D. Comparative Analysis of Control Barrier Functions and Artificial Potential Fields for Obstacle Avoidance. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 27 September–1 October 2021; pp. 8129–8136. [Google Scholar]

Figure 1. Control block Diagram of SDC-MPC.

Figure 2. Experimental setup diagram.

Figure 3. Parametric uncertainty, angle constraint

0.5 rad

. (a) Overall angle; (b) Zoomed angle.

Figure 4. Parametric uncertainty, angle constraint

0.45 rad

. (a) Overall angle; (b) Zoomed angle.

Figure 5. Angular-velocity responses under persistent parametric uncertainty. (a) With

0.5 rad

constraint; (b) With

0.45 rad

constraint.

Figure 6. External disturbance (Group 1): angle tracking. (a) Overall; (b) Zoomed around

t = 1.6

s.

Figure 7. External disturbance (Group 2): angle tracking. (a) Overall; (b) Zoomed around

t = 1.6

s.

Figure 8. Angular-velocity under external disturbances (injected at

t = 1.6

s). (a) Group 1; (b) Group 2.

Table 1. Performance Statistics under Parametric Uncertainty (MAE: Max Absolute Error; MVF: Max Velocity Fluctuation).

Method	Tracking Error			MVF (rad/s)
Method	RMSE (rad)	MAE (rad)	IAE (rad·s)	MVF (rad/s)
MPC	0.055	0.151	0.118	14.019
LQR + CBF	0.112	0.300	0.245	21.177
SDC-MPC	0.019	0.051	0.032	8.732

Table 2. Performance Statistics under External Disturbances (MAD: Max Angle Deviation; MSF: Max Speed Fluctuation; ST: Settling Time).

Metric	MPC	LQR + CBF	SDC-MPC
Group 1
MAD (rad)	0.165	0.300	0.055
MSF (rad/s)	13.5	19.5	8.5
ST (s)	0.32	0.55	0.15
Group 2
MAD (rad)	0.170	0.305	0.056
MSF (rad/s)	13.7	19.8	8.6
ST (s)	0.33	0.56	0.16

Table 3. Sensitivity Analysis of Controller Parameters (Average of 3 Trials).

Parameter Setting	RMSE (rad)	MVF (rad/s)	Comp. Time (ms)
Impact of Prediction Horizon (N)
$N = 5$ (Short)	0.035	9.2	3.1
$N = 10$ (Nominal)	0.019	8.7	6.7
$N = 20$ (Long)	0.018	8.6	14.2
Impact of DOB Pole (p)
$p = - 50$ (Slow)	0.028	5.1	6.7
$p = - 100$ (Nominal)	0.019	8.7	6.7
$p = - 200$ (Fast)	0.017	15.3	6.7

Table 4. Comparison of Control Frameworks for FJRs (DC: Disturbance Compensation; FS: Formal Safety; DAS: Disturbance-Aware Safety; FA: Feasibility Assurance).

Framework	DC	FS	DAS	FA
MPC + DOB [19,20]	Yes	No	No	Hard
MPC + CBF [25,26]	No	Yes	No	Filter-Dep.
Robust MPC	Worst-Case	Yes	Fixed	Conservative
SDC-MPC (Ours)	Yes	Yes	Yes	Soft

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Safety-Constrained Disturbance-Compensated Model Predictive Control for Flexible-Joint Robots

Abstract

1. Introduction

2. Flexible Joint Robotic Arm Dynamic Model

2.1. Coordinates, Gear Ratio, and Modeling Assumptions

2.2. Energy Functions and Lagrange Equations

2.3. Nominal-Inertia Rewriting and Disturbance Decomposition

2.4. State-Space Representation

3. Safe and Disturbance-Compensated MPC Design

3.1. Disturbance Observer Design

3.2. Control Barrier Function Design

3.3. Model Predictive Controller Design

4. Experimental Results

4.1. Tests Under Persistent Parametric Uncertainty

4.2. Tests Under External Disturbances (Injected at $t =$ 1.6 s)

4.2.1. Quantitative Summary

4.2.2. Real-Time Feasibility

4.3. Parameter Sensitivity Analysis

5. Discussion

5.1. Comparative Analysis with Recent Frameworks

5.2. Mechanism of Performance Enhancement

5.3. Limitations and Future Work

5.3.1. Scalability to Multi-Joint Systems

5.3.2. Modeling Simplifications

5.3.3. Noise Sensitivity and Bandwidth Trade-Off

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

Safety-Constrained Disturbance-Compensated Model Predictive Control for Flexible-Joint Robots

Abstract

1. Introduction

2. Flexible Joint Robotic Arm Dynamic Model

2.1. Coordinates, Gear Ratio, and Modeling Assumptions

2.2. Energy Functions and Lagrange Equations

2.3. Nominal-Inertia Rewriting and Disturbance Decomposition

2.4. State-Space Representation

3. Safe and Disturbance-Compensated MPC Design

3.1. Disturbance Observer Design

3.2. Control Barrier Function Design

3.3. Model Predictive Controller Design

4. Experimental Results

4.1. Tests Under Persistent Parametric Uncertainty

4.2. Tests Under External Disturbances (Injected at t = 1.6 s)

4.2.1. Quantitative Summary

4.2.2. Real-Time Feasibility

4.3. Parameter Sensitivity Analysis

5. Discussion

5.1. Comparative Analysis with Recent Frameworks

5.2. Mechanism of Performance Enhancement

5.3. Limitations and Future Work

5.3.1. Scalability to Multi-Joint Systems

5.3.2. Modeling Simplifications

5.3.3. Noise Sensitivity and Bandwidth Trade-Off

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

4.2. Tests Under External Disturbances (Injected at $t =$ 1.6 s)