Model Predictive Control for an SMA Actuator Based on an SGPI Model

Liu, Wei; Wei, Houzhen; Pang, Yan; Tang, Xudong; Wang, Kai; Zhou, Wenya

doi:10.3390/aerospace13020112

Open AccessArticle

Model Predictive Control for an SMA Actuator Based on an SGPI Model

by

Wei Liu

¹

,

Houzhen Wei

²,

Yan Pang

¹

,

Xudong Tang

²,

Kai Wang

² and

Wenya Zhou

^1,*

¹

School of Aeronautics and Astronautics, Dalian University of Technology, Dalian 116024, China

²

Beijing Institute of Machinery Equipment, The Second Institute of China Aerospace Science and Technology Corporation, Beijing 100854, China

^*

Author to whom correspondence should be addressed.

Aerospace 2026, 13(2), 112; https://doi.org/10.3390/aerospace13020112

Submission received: 21 December 2025 / Revised: 17 January 2026 / Accepted: 22 January 2026 / Published: 23 January 2026

Download

Browse Figures

Versions Notes

Abstract

Shape memory alloy (SMA) actuators possess unique advantages for aerospace applications, including significant deformation, a high work-to-weight ratio, and structural simplicity. However, SMA actuators exhibit inherently strongly saturated and asymmetric hysteresis characteristics, which cause significant hysteresis in the output response. These hysteresis nonlinearities, compounded by process and measurement noise, severely degrade control precision. To overcome these issues, this study proposes a Smoothed Generalized Prandtl–Ishlinskii (SGPI) model to characterize such hysteresis behavior. Based on the SGPI model, we developed a state-space representation for the SMA actuator. Furthermore, an Extended Kalman Filter (EKF) is employed to estimate unmeasurable internal hysteresis states, and these estimates are subsequently utilized as input states for Model Predictive Control (MPC). The simulation results demonstrate that the proposed EKF-MPC approach achieves both rapid dynamic response and high-precision tracking control, effectively compensating for hysteresis nonlinearity while rejecting noise disturbances.

Keywords:

shape memory alloy actuator; hysteresis model; Extended Kalman Filter; Model Predictive Control

1. Introduction

Shape memory alloy (SMA) actuators possess unique advantages for aerospace applications, including large deformation, a high work-to-weight ratio, and structural simplicity, leading to broad application prospects in areas such as space-deployable antenna structures [1], spacecraft attitude and thermal control [2,3], morphing wings [4], and aero-engine nozzle control [5]. However, their inherent strongly saturated and asymmetric hysteresis characteristics lead to significant nonlinearity, output oscillations, and closed-loop instability, which limits their application in high-precision and high-reliability aerospace actuation systems.

Researchers have developed numerous models to characterize SMA hysteresis based on micromechanics [6], constitutive modeling [7,8,9], and phenomenological approaches [10,11,12,13,14]. Current phenomenological hysteresis modeling approaches can be categorized into three distinct classes. The first class consists of operator-based models, which describe hysteresis via the weighted superposition of elementary operators. Common models include the Preisach [15,16,17,18], Prandtl–Ishlinskii (PI) [19,20], and Krasnosel’skii–Pokrovskii [21,22] models. The GPI model further characterizes the asymmetric saturation of SMAs by incorporating envelope functions. However, a common limitation of these models is the non-differentiability at switching points due to their piecewise-linear characteristics. This lack of smoothness complicates the computation of system Jacobians, which are essential for many modern control and estimation frameworks, thereby hindering the development of a unified and continuous mathematical representation for closed-loop analysis. The second class involves differential equation-based models, such as the Bouc-Wen [23,24] and Duhem [25,26] models. Although they offer the smoothness required for control design, these models have limitations in describing strong saturation and asymmetric hysteresis, making them less suitable for the complex characteristics of SMA actuators. The third class leverages data-driven approaches, utilizing deep learning architectures such as recurrent neural networks [27] or neural operators [28] to achieve high-precision nonlinear mapping. However, despite their high accuracy, data-driven models impose heavy computational burdens and are difficult to deploy in resource-constrained embedded systems, facing challenges in real-time control. Given the respective trade-offs between mathematical smoothness, structural flexibility, and computational cost in existing approaches, there is an urgent need for a modeling approach that preserves the structural flexibility of operator models while providing analytical differentiability. Based on this, the present study establishes a continuously differentiable model suitable for direct state-space analysis by smoothing the operators of the GPI model.

Most existing hysteresis models primarily focus on characterizing the relationship between temperature and displacement or output force [29]. However, such direct mapping is highly dependent on external load conditions. Models identified under specific loads may suffer from significant degradation in accuracy when the load changes, thereby limiting their practical applicability. As a temperature-driven material, the SMA spring undergoes variations in martensite volume fraction during temperature changes. According to the Tanaka model [7], the Young’s modulus of SMA exhibits a linear relationship with the martensite volume fraction. Consequently, the phase transformation process directly influences the mechanical properties of the SMA spring, resulting in different stiffness coefficients for SMA in different phases. Therefore, the relationship between temperature and stiffness coefficient—an intrinsic material property governed by phase transformation—demonstrates stronger independence from external loads. By developing a hysteresis model for the temperature–stiffness relationship, the resulting model achieves enhanced generalization capability under varying operational conditions.

Research on hysteresis compensation strategies is extensive in the literature. Quintanar-Guzmán et al. [30] evaluated multiple controllers for the position tracking of an SMA-driven robotic arm, while Bil et al. [31] compared the performance of different compensators for SMA morphing wings through wind tunnel tests. Schmidt et al. [32] proposed a predictive model for antagonistic SMA spring actuators to optimize bistable designs. Currently, mainstream architectures predominantly rely on feedforward compensation based on inverse hysteresis models. For piezoelectric actuators or specific SMA wires where the response is directly driven by an electric field or input current, such input–output mapping remains feasible [33,34]. However, stiffness regulation in SMA springs is primarily temperature-dependent rather than a direct mapping of current. In practical applications, Joule heating via current alters the temperature, but the heating and cooling processes are constrained by ambient convective heat transfer, exhibiting significant time constants and thermal hysteresis. Existing inverse compensation schemes often employ a cascaded structure [29] that calculates a reference temperature via a static inverse model. Such open-loop feedforward approaches typically ignore thermal inertia, leading to severe dynamic phase lags and a lack of robustness against environmental disturbances.

To address the limitations of open-loop inverse compensation, it is necessary to employ a predictive control framework capable of explicitly handling system delays and unobservable nonlinearities. Model Predictive Control (MPC) [35,36,37] has emerged as an effective method for such complex systems, as its multi-step optimization over a prediction horizon can effectively compensate for the phase lags induced by thermal inertia. Furthermore, since the internal hysteresis states of SMA are not directly observable, the integration of an Extended Kalman Filter (EKF) [38,39], a recursive estimation method, is essential for providing accurate state feedback under noisy environments. By combining the predictive characteristics of MPC with the state estimation capabilities of EKF, a closed-loop state-feedback control structure can be established to handle the complex hysteresis nonlinearities of SMA actuators.

In this study, we propose a Smoothed Generalized Prandtl–Ishlinskii (SGPI) model by modifying the hysteresis operators of the GPI model to be smooth differentiable functions. This preserves the ability to characterize the strongly saturated and asymmetric hysteresis characteristics of SMA actuators while enabling the design of state estimation-based controllers. Specifically, the state-space representation based on the SGPI model allows the design of an EKF for real-time estimation of unmeasurable hysteresis states under process and measurement noise. These state estimates serve as essential inputs to a MPC algorithm, which performs multi-step prediction and rolling optimization to compute optimal control signals that compensate for hysteresis effects, achieving high precision stiffness coefficients tracking.

The rest of this paper is organized as follows. Section 2 elaborates on the mathematical modeling of the SMA actuator, including thermodynamic modeling, SGPI hysteresis modeling, and model parameter identification. Section 3 presents the design of the EKF and MPC based on the SGPI model. Section 4 presents the simulation results. Finally, Section 5 concludes the paper.

2. Mathematical Modeling of SMA Actuator

As shown in Figure 1, SMAs are widely used in aerospace applications, such as motion control of space soft-bodied crawling robot and adaptive wings [40]. The core actuation of these applications relies on the reliable performance of SMA actuators. These applications fundamentally require an accurate description and control of the dynamic relationship among the input current, temperature state, and output performance of the SMA actuator. Therefore, the focus of this study is to establish the current–temperature–stiffness coefficient correlation model of the SMA actuator and achieve stiffness control of the SMA-driven system. As shown in Figure 2, the mathematical model of the SMA actuator system consists of two components: a thermal model and an SGPI phenomenological mathematical model. The thermal model characterizes the relationship between the input current and the spring temperature, while the SGPI model describes the relationship between the spring temperature and the stiffness coefficient of the SMA actuator.

2.1. SMA Spring Thermodynamic Model

The thermodynamic model of the SMA spring primarily considers three factors [41]: thermal absorption during heating, convective heat exchange with ambient air, and resistive heating governed by Joule’s law:

c_{p} m \frac{d T}{d t} = I^{2} R - h A (T - T_{\infty}),

(1)

where

c_{p}

is the specific heat capacity coefficient of the SMA spring; m is the mass of the spring; I is the current applied to the spring; R is the electrical resistance of the spring; h is the heat convection coefficient; A is the circumferential surface area of the spring; T and

T_{\infty}

represent the temperature of the spring and ambient temperature, respectively. Dividing both sides of the thermodynamic model equation by

c_{p} m

:

\frac{d T}{d t} = - \frac{h A}{c_{p} m} T + \frac{I^{2} R}{c_{p} m} + \frac{h A}{c_{p} m} T_{\infty} .

(2)

Assuming

a = - \frac{h A}{c_{p} m}

,

b = \frac{R}{c_{p} m}

,

c = \frac{h A}{c_{p} m} T_{\infty}

, and

u = I^{2}

, the thermodynamic model can be expressed by the following state equation:

\dot{T} = a T + b u + c .

(3)

2.2. GPI Hysteresis Model

The PI hysteresis model is a nonlinear modeling approach based on hysteresis operator superposition and is widely employed for characterizing hysteresis behavior in smart materials, such as shape memory alloy actuators and piezoelectric actuators. However, although the PI model exhibits symmetric loading or unloading paths, the actual SMA springs demonstrate complex asymmetric hysteresis with output saturation characteristics during thermal cycling. To address these limitations, an envelope function is incorporated into the PI framework, resulting in a Generalized Prandtl–Ishlinskii (GPI) model. The GPI approach fundamentally applies input-signal nonlinearization and characterizes complex hysteresis phenomena through the weighted superposition of multiple play operators. The architecture primarily consists of three cascaded components: envelope function, play hysteresis operators, and density function, as shown in Figure 3a.

For a piecewise monotonic input function

u (t) \in C [0, T]

, where

u (t) \in C [0, T]

denotes the space of continuous functions in the interval

C [0, T]

. The output

y (t)

of the GPI model can be expressed as

y (t) = G P I [u] (t) = P I [v] (t) = P I \circ γ (t),

(4)

where

γ

is the envelope function,

v (t)

is the system output of the envelope function, and

P I

denotes the PI hysteresis model.

The PI hysteresis model is characterized by multiple hysteresis play operators. For a piecewise monotonic envelope input function

v (t) \in C [0, T]

, a play hysteresis, as shown in Figure 3b, operator with a fixed threshold

r_{i}

, denoted as

Γ_{i}

, is defined as follows.

Let,

0 = t_{0} < t_{1} < \dots < t_{k} = T

be a partition of the time domain

[0, T]

such that

v (t)

is monotonic on each subinterval

[t_{m - 1}, t_{m}]

, where

m = 0, 1, \dots, k

. Then, the play hysteresis operator

Γ_{i} [v] (t)

and its corresponding hysteresis state

w_{i} (t)

can be given by when,

t = t_{0} = 0

:

\begin{matrix} w_{i} (0) & = Γ_{i} [v] (0) \\ = max (γ_{f} [u] (0) - r_{i}, min (γ_{l} [u] (0) + r_{i}, 0)), \end{matrix}

(5)

when,

t_{m - 1} \leq t \leq t_{m}

:

\begin{matrix} w_{i} (t) & = Γ_{i} [v] (t) \\ = max (γ_{f} [u] (t) - r_{i}, min (γ_{l} [u] (t) + r_{i}, w (t_{m - 1}))) . \end{matrix}

(6)

To characterize the saturation and asymmetric hysteresis properties of the SMA, the envelope function

γ

can be represented using various methods such as dead-zone functions, hyperbolic tangent functions, or polynomials. This study employs a hyperbolic tangent function to describe output saturation and asymmetry under specific inputs. The hyperbolic tangent function, as shown in Figure 3c, is expressed as follows:

v (t) = \{\begin{matrix} γ_{f} [u] (t) = a_{0} tanh (a_{1} u (t) + a_{2}) + a_{3} & \dot{u} (t) \geq 0 \\ γ_{l} [u] (t) = b_{0} tanh (b_{1} u (t) + b_{2}) + b_{3} & \dot{u} (t) < 0 \end{matrix}

(7)

where

a_{0}, a_{1}, a_{2}, a_{3}, b_{0}, b_{1}, b_{2}, b_{3}

are parameters to be identified.

The output of the GPI hysteresis model consists of a weighted linear combination of multiple play operators with distinct fixed thresholds

r_{i}

:

y (t) = G P I [v] (t) = \sum_{i = 1}^{n} p_{i} w_{i} (t),

(8)

where n is the number of play operators and

p_{i}

are positive weighting coefficients (also referred to the density function). The thresholds

r_{i}

and weighting coefficients

p_{i}

are defined as follows:

r_{i} = c i,

(9)

p_{i} = ρ e^{- τ r_{i}},

(10)

where

c, ρ, τ

are parameters to be identified.

2.3. SGPI Hysteresis Model

Although the GPI model can characterize the strongly saturated and asymmetric hysteresis behaviors of SMA actuators, its hysteresis play operators are constructed using non-smooth max()/min() functions with piecewise linearity. These operators induce non-differentiable dynamics, posing significant challenges for direct application in model-based controller synthesis.

To leverage the representational capabilities of the GPI model while enabling controller design, we propose the Smoothed Generalized Prandtl–Ishlinskii (SGPI) model. Specifically, the non-smooth

max () / min ()

operators are approximated by differentiable functions

smax () / smin ()

as follows:

smax (a, b) = \frac{a + b + \sqrt{{(a - b)}^{2} + ε}}{2},

(11)

smin (a, b) = \frac{a + b - \sqrt{{(a - b)}^{2} + ε}}{2} .

(12)

The smoothing parameter

ε > 0

introduces a trade-off between model fidelity and numerical differentiability. As

ε \to 0

, the SGPI operators converge to the standard non-smooth max/min operators, preserving the structural characteristics of the GPI model. However, an excessively small

ε

results in abrupt gradient changes, known as numerical stiffness, which can compromise the stability of gradient-based optimization and state estimation frameworks. Conversely, a larger

ε

ensures smoother derivatives that are beneficial for algorithmic convergence but may lead to deviations from the desired hysteresis envelope, potentially distorting the model’s output characteristics. In this work,

ε = 10^{- 3}

is selected to balance approximation accuracy with the numerical robustness required for subsequent computational tasks. The resulting smoothed play hysteresis operator is depicted in Figure 4. In contrast to the conventional piecewise-linear operator shown in Figure 3b, the SGPI formulation ensures smooth dynamics over the entire domain. This transition from piecewise-linear to smooth dynamics establishes a mathematically tractable foundation for model-based control and observation schemes. Consequently, the transformed hysteresis state

w_{i} (t)

is described as follows:

when, $t = t_{0} = 0$ :

$\begin{matrix} w_{i} (0) & = Γ_{i} [v] (0) \\ = smax (γ_{f} [u] (0) - r_{i}, smin (γ_{l} [u] (0) + r_{i}, 0)), \end{matrix}$

(13)
when, $t_{m - 1} \leq t \leq t_{m}$ :

$\begin{matrix} w_{i} (t) & = Γ_{i} [v] (t) \\ = smax (γ_{f} [u] (t) - r_{i}, smin (γ_{l} [u] (t) + r_{i}, w (t_{m - 1}))), \end{matrix}$

(14)

where the hyperbolic tangent envelope function is still selected for $γ_{f}$ and $γ_{l}$ .

Figure 4. The smoothed play hysteresis operator.

The SGPI model retains the capability of the GPI model to characterize the strongly saturated and asymmetric hysteresis in SMA actuators by establishing differentiable dynamics. This enables the formulation of state-space feedback control laws.

2.4. Identification the SGPI Model Parameters

The SMA actuator system is shown in Figure 5, with Figure 5a illustrating the structural schematic of the SMA actuator and Figure 5b showing the experimental setup of the SMA actuator. The system comprises six components: a SMA spring actuator, clamping slider, programmable power supply, computer, laser displacement sensor, and a thermocouple temperature module. One end of the SMA spring was clamped, whereas the opposite end was connected to the slider via a hook weight. The slider was moved vertically along the linear guide. The laser displacement sensor emitted a beam onto the slider surface, continuously measuring the distance between the slider and overhead beam to compute the real-time SMA spring length. A programmable DC power supply applies voltage across both SMA spring terminals. The thermocouple probe was affixed to the SMA spring to measure real-time temperature variations.

To comprehensively account for the relationship between the SMA temperature and output stiffness coefficients, and to effectively identify the parameters of the SGPI hysteresis model, the system was actuated with a constant current of 1 A under a constant load of 250 g. After sustained heating, the SMA spring underwent thermal expansion and contraction. When the SMA spring temperature reached the austenite finish temperature, the spring contracted to its minimum length. At this point, the constant-current power supply was shut off to allow natural cooling, restoring the initial length, while the experimental displacement data were collected.

The measured temperature and displacement data were subsequently processed to determine the stiffness coefficient. Based on the mechanical definition, the stiffness coefficient

y (T)

was calculated as follows:

y_{o} (T) = \frac{F_{l}}{l_{s} (T) - l_{s 0}},

(15)

where,

F_{l}

represents the constant load force applied to the spring,

l_{s} (T)

denotes the real-time length of the spring at temperature T, and

l_{s 0}

is the original length of the spring at room temperature before loading.

The parameters of the SGPI hysteresis model were identified through an optimization algorithm with the following mean square error (MSE) objective function:

J (X) = \frac{1}{N} \sum_{i = 1}^{N} {(y_{o} (T_{i}) - y (T_{i}))}^{2},

(16)

where N is the number of sampling points,

y_{o} (T)

is the experimentally measured output,

y (T)

is the output of the SGPI model, and the parameter vector X is defined as

X = (a_{0}, a_{1}, a_{2}, a_{3}, b_{0}, b_{1}, b_{2}, b_{3}, ρ, τ, c) .

Particle Swarm Optimization (PSO) and its advanced variants (e.g., Q-learning based PSO [42]) are widely recognized as efficient approaches for nonlinear system identification. In this study, parameter identification was performed using the PSO algorithm as shown in Figure 6. The identification results are presented as follows.

Table 1 presents the parameter identification results for the SGPI model. Figure 7a compares the identification results of both the SGPI and GPI models against the experimental data, while Figure 7b illustrates the corresponding tracking errors. The results demonstrate that the SGPI model effectively characterizes the hysteresis behavior of the SMA actuator, exhibiting smoother transitions at phase transition points and higher accuracy than the GPI model. To evaluate the modeling precision, three error metrics are defined in Table 2: Root Mean Square Error (RMSE), Average Absolute Error (AAE), and Maximum Relative Displacement Error (MRDE). The proposed SGPI model achieved reductions of 18.9% in RMSE, 15.9% in AAE, and 36.0% in MRDE compared to the GPI model. These quantitative metrics further validate the effectiveness of the proposed SGPI model.

3. Extended Kalman Filter and Model Predictive Control Design

SMA actuators exhibit strongly saturated and asymmetric hysteresis behaviors with a path dependency. Specifically, their current stiffness coefficients depends not only on current input temperature but also on historical stiffness coefficients. Such complex hysteretic nonlinearity hinders precise control using conventional linear controllers. In contrast, Model Predictive Control (MPC) offers a promising approach through receding-horizon optimization and future-state prediction capabilities. However, MPC implementation requires full-system states as prediction initial conditions, whereas only the temperature and stiffness coefficients outputs are measurable in SMA actuators. The virtual hysteresis state, which represents the core hysteretic behavior, is unmeasurable. To address this limitation, this study proposes an Extended Kalman Filter (EKF) state observer that estimates the complete system states in real time using limited measurements, thereby providing reliable initialization for MPC.

3.1. Electro-Thermo-Mechanical State-Space Model

To establish a unified framework for state estimation and control, the continuous-time thermodynamic model (Equation (3)) and SGPI hysteresis model are integrated into a discrete-time state-space representation. The thermodynamic model is discretized using forward Euler method with sampling period

T_{s}

:

T (k + 1) = (1 + a T_{s}) T (k) + (b T_{s}) u (k) + c T_{s},

(17)

Defining coefficients

a_{c} = 1 + a T_{s}

,

b_{c} = b T_{s}

,

c_{c} = c T_{s}

, the discrete thermodynamic equation for the SMA actuator is obtained:

T (k + 1) = a_{c} T (k) + b_{c} u (k) + c_{c} .

(18)

Concurrently, the hysteresis nonlinearity between temperature and stiffness coefficients is characterized by the SGPI model

\{\begin{matrix} w_{i} (k + 1) = smax (γ_{f} [T] (k) - r_{1}, smin (γ_{l} [T] (k) + r_{1}, w_{i} (k))) \\ y (k) = \sum_{i = 1}^{n} p_{i} w_{i} (k) \end{matrix}

(19)

Let,

x_{k} = {[T (k), w_{1} (k), \dots, w_{n} (k)]}^{T} \in R^{n + 1}

comprises the temperature state variable

T (k)

and n virtual hysteresis state variables

w_{i}

. The electro-thermo-mechanical nonlinear state-space representation of the SMA actuator is expressed as:

\begin{matrix} x_{k + 1} = f (x_{k}) + B u_{k} + ω_{k} \\ = [\begin{matrix} f_{T} (T (k)) \\ f_{w} (w_{1} (k)) \\ ⋮ \\ f_{w} (w_{n} (k)) \end{matrix}] + B u_{k} + ω_{k} \\ = [\begin{matrix} a_{c} T (k) + b_{c} u (k) + c_{c} + ω_{0} (k) \\ smax (γ_{f} [T] (k) - r_{1}, smin (γ_{l} [T] (k) + r_{1}, w_{1} (k))) + ω_{1} (k) \\ ⋮ \\ smax (γ_{f} [T] (k) - r_{n}, smin (γ_{l} [T] (k) + r_{n}, w_{n} (k))) + ω_{n} (k) \end{matrix}], \end{matrix}

(20)

where

u_{k} \in R

is the system voltage input and

B = {[b_{c}, 0, \dots, 0]}^{T} \in R^{(n + 1) \times 1}

is the input matrix. The process noise vectors

ω_{k} \sim N (0, Q_{a}) \in R^{n + 1}

characterize the external disturbances and SGPI model mismatch.

In SMA actuator system, the measurable system outputs

z_{k} = {[T_{a}, y_{a}]}^{T} \in R^{2}

, where

T_{a}

and

y_{a}

denote the measured temperature and measured stiffness coefficients, can be expressed as

\begin{matrix} z_{k} & = [\begin{matrix} 1 & 0 & 0 & \dots & 0 \\ 0 & ρ e^{- τ r_{1}} & ρ e^{- τ r_{2}} & \dots & ρ e^{- τ r_{n}} \end{matrix}] x_{k} + [\begin{matrix} υ_{1} (k) \\ υ_{2} (k) \end{matrix}] \\ = H_{m} x_{k} + υ_{k} \end{matrix}

(21)

where

H \in R^{2 \times (n + 1)}

is the observation matrix, and

υ_{k} \sim N (0, R_{a}) \in R^{2}

represents the measurement noise vector.

The electro-thermo-mechanical nonlinear state-space representation of the SMA actuator is expressed as

\{\begin{matrix} x_{k + 1} & = f (x_{k}) + B u_{k} + ω_{k} \\ y_{k} & = H_{m} x_{k} \\ z_{k} & = y_{k} + υ_{k} \end{matrix}

(22)

This unified representation provides the mathematical foundation for subsequent EKF state estimation and MPC controller design.

3.2. Extended Kalman Filter Design

Because of the inherent hysteresis nonlinearity of the system, the standard Kalman filter is inapplicable. Consequently, an Extended Kalman Filter (EKF) was employed to linearize the hysteresis nonlinearity and estimate the augmented state vector, which comprises the temperature and multiple virtual hysteresis states. For the nonlinear state function given in (22), linearization is performed via Taylor series expansion around the posterior state estimate

{\hat{x}}_{k - 1}

:

\begin{matrix} x_{k} & = f (x_{k - 1}) + B u_{k - 1} + ω_{k - 1} \\ \approx f ({\hat{x}}_{k - 1}) + F_{k - 1} (x_{k - 1} - {\hat{x}}_{k - 1}) + B u_{k - 1} + ω_{k - 1}, \end{matrix}

(23)

where

F_{k - 1}

is the Jacobian matrix of the system evaluated at the posterior state estimate

{\hat{x}}_{k - 1}

:

\begin{matrix} F_{k - 1} & = \frac{\partial f}{\partial x} |_{{\hat{x}}_{k - 1}} \\ = {[\begin{matrix} \frac{\partial f_{T}}{\partial T} & 0 & \dots & 0 \\ \frac{\partial f_{w_{1}}}{\partial T} & \frac{\partial f_{w_{1}}}{\partial w_{1}} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ \frac{\partial f_{w_{n}}}{\partial T} & 0 & \dots & \frac{\partial f_{w_{n}}}{\partial w_{n}} \end{matrix}]}_{{\hat{x}}_{k - 1}} . \end{matrix}

(24)

The explicit gradients for the thermodynamic function

f_{T}

and the hysteresis state function

f_{w}

are derived based on the SGPI model definitions. Specifically, the diagonal elements

λ_{i, k} = \partial f_{w_{i}} / \partial w_{i}

represent the instantaneous stiffness retention rate of each operator. Due to the smoothing parameter

ε

in the smax() and smin() functions, these derivatives remain continuous, ensuring numerical stability during the Jacobian update.

A key issue in this observer design is the structural observability of the high-dimensional hysteresis state vector

w = {[w_{1}, \dots, w_{n}]}^{⊤}

when using a single scalar stiffness measurement

y_{k}

. As proven in Lemma A1 (see Appendix A), the augmented state vector is locally observable if the input temperature signal satisfies the Persistently Exciting (PE) condition. This observability stems from the diverse distribution of thresholds

r_{i}

assigned to each operator. As the input evolves, each operator undergoes the transition from the elastic regime (derivative

\approx 1

) to the saturated regime (derivative

\approx 0

) at a unique input magnitude. This staggered activation ensures that the operators do not saturate simultaneously, maintaining the linear independence of the columns in the local observability matrix over time. Consequently, the individual contribution of each hysteresis state can be distinguished from the aggregate measurement, ensuring the EKF estimation is well-posed.

The EKF algorithm is implemented in the following recursive steps.

Step 1: State Prediction. Predict the a priori state estimate

{\hat{x}}_{k | k - 1}

using the posterior estimate

{\hat{x}}_{k - 1}

and the control input

u_{k - 1}

:

{\hat{x}}_{k | k - 1} = f ({\hat{x}}_{k - 1}) + B u_{k - 1} .

(25)

Step 2: Covariance Prediction. Define the a priori state estimation error as

e_{k | k - 1} = x_{k} - {\hat{x}}_{k | k - 1}

. The a priori error covariance matrix

P_{k | k - 1}

is defined as the expectation

E [e_{k | k - 1} e_{k | k - 1}^{⊤}]

. It is computed using the Jacobian

F_{k - 1}

:

P_{k | k - 1} = F_{k - 1} P_{k - 1} F_{k - 1}^{T} + Q_{a},

(26)

where

Q_{a}

is the process noise covariance matrix representing model uncertainties.

Step 3: Gain Computation. Compute the optimal Kalman gain

K^{*}

that minimizes the trace of the posterior covariance:

K^{*} = P_{k | k - 1} H_{m}^{T} {(H_{m} P_{k | k - 1} H_{m}^{T} + R_{a})}^{- 1},

(27)

where

H_{m}

is the linearized measurement matrix and

R_{a}

is the measurement noise covariance matrix.

Step 4: Measurement Update. Fuse the prediction with the actual sensor measurement

z_{k}

to obtain the a posteriori state estimate:

{\hat{x}}_{k} = {\hat{x}}_{k | k - 1} + K^{*} (z_{k} - H_{m} {\hat{x}}_{k | k - 1}) .

(28)

Step 5: Covariance Update. Define the a posteriori estimation error as

e_{k} = x_{k} - {\hat{x}}_{k}

. Update the posterior error covariance matrix

P_{k} = E [e_{k} e_{k}^{⊤}]

:

P_{k} = (I - K^{*} H_{m}) P_{k | k - 1} .

(29)

The EKF algorithm comprises two core stages: time update (Step 1–3) and measurement update (Step 4–6), as illustrated in Figure 8. Initialization requires setting the initial state estimate

{\hat{x}}_{0}

and initial error covariance matrix

P_{0}

. The initial state estimate

{\hat{x}}_{0}

should be as close as possible to the true system state, whereas the initial error covariance matrix

P_{0}

may be initialized as a sufficiently large matrix to accelerate convergence.

3.3. Model Predictive Control Design

Based on the real-time state estimates provided by the EKF state observer in Section 3.2, we propose a Model Predictive Controller (MPC) for the trajectory tracking of the SMA actuator. Utilizing the discrete state-space model of the SMA actuator defined in (22), we adopt the control increment

Δ u (k)

as the MPC optimization variable:

Δ u_{k + i} = u_{k + i} - u_{k + i - 1} .

Adopting the control increment rather than the absolute input as the decision variable effectively introduces an inherent integral action into the closed-loop system. This mechanism effectively compensates for model mismatches and parameter drifts, significantly enhancing the robustness of the control system against uncertainties by eliminating steady-state errors.

The optimization variable vector is defined as

Δ U = {[Δ u_{k}, Δ u_{k + 1}, \dots, Δ u_{k + N - 1}]}^{T}

. The prediction horizon of the MPC controller is

N_{p}

, and the control horizon is

N_{c}

. Combined with the current EKF state estimate

{\hat{x}}_{k | k}

, the future state trajectory is predicted as:

x_{k + i | k} = f (x_{k + i - 1 | k}) + B u_{k + i - 1}, i = 1, 2, \dots, N,

(30)

where the predicted input accumulates as

u_{k + i | k} = u_{k - 1} + \sum_{j = 0}^{i} Δ u_{k + j}

.

The control objective is to ensure that the stiffness coefficient output

y_{a}

of the SMA actuator tracks the reference trajectory

y_{r e f}

, while satisfying the following constraints:

\begin{matrix} Δ u_{min} & \leq Δ u_{k} \leq Δ u_{max} \\ u_{min} & \leq u_{k} \leq u_{max} \\ y_{min} & \leq y_{k} \leq y_{max}, \end{matrix}

These hard constraints introduced in the optimization problem strictly confine the system states within the physically safe region. Combined with the intrinsic dissipative nature of the SMA system, where states naturally decay in the absence of input, this design ensures system stability in the Bounded-Input–Bounded-Output (BIBO) sense.

In MATLAB, the symbolic optimization framework CasADi [43] is employed to formulate the MPC problem as a Nonlinear Programming (NLP) problem:

\begin{matrix} \min_{Δ U} J & = \sum_{i = 1}^{N} ∥ y_{k + i | k} - y_{ref} ∥_{Q}^{2} + \sum_{i = 0}^{N - 1} {∥ Δ u_{k + i} ∥}_{R}^{2}, & i = 1, 2, \dots, N \\ subj . to \\ x_{k + i | k} & = f (x_{k + i - 1 | k}) + B u_{k + i - 1 | k} \\ y_{k + i | k} & = H_{m} x_{k + i | k} \\ u_{k + i | k} & = u_{k - 1} + \sum_{j = 0}^{i} Δ u_{k + j} \\ Δ u_{min} & \leq Δ u_{k + i | k} \leq Δ u_{max} \\ u_{min} & \leq u_{k + i | k} \leq u_{max} \\ y_{min} & \leq y_{k + i | k} \leq y_{max}, \end{matrix}

(31)

where

Q \in R^{p \times p}

and

R \in R^{m \times m}

are symmetric positive-definite weighting matrices. Leveraging the state estimates provided by the EKF, the NLP is solved in real time using the IPOPT solver to compute the optimal control inputs, thereby achieving precise tracking of the SMA actuator’s stiffness coefficients.

As illustrated in Figure 9, the EKF-MPC control architecture adopts a cascaded configuration. The EKF generates real-time state estimates

{\hat{x}}_{k}

based on the measurable system output

z_{k}

, while the incremental MPC utilizes these estimates as initial conditions to iteratively solve the finite-horizon optimal control problem. Based on the local observability established in Lemma A1, the estimation error covariance of the EKF remains bounded under the persistent excitation condition. This implies that the prediction initialization of the MPC is always situated within a bounded neighborhood of the true state. Since the cost function of the NLP optimization problem is continuous, this bounded initial error results only in a bounded sub-optimality of the control inputs, without compromising the stability of the control loop.

4. Simulation

To validate the effectiveness of the proposed EKF and MPC frameworks, numerical simulations were conducted using an SMA actuator model identified from the experimental tests. The numerical simulations were performed on a personal computer equipped with an Intel Core i7-14650H processor (2.20 GHz) and 16 GB of RAM. The algorithms were implemented using MATLAB R2023b, and the nonlinear programming problems were solved using the CasADi framework version 3.5.5. Under this computational setup, the average execution time per time step was approximately 17 ms.

The simulation environment was configured with a sampling time of

T_{s} = 0.1

s and initialized with the SMA thermodynamic parameters

a_{c} = - 0.0602

,

b_{c} = 5.0830

, and

c_{c} = 1.2040

. The MPC horizons were set to

N_{p} = 15

and

N_{c} = 5

. This selection provides a 1.5 s look-ahead window, which is sufficient to cover the dominant thermal time constant of the SMA for phase lag compensation, while the smaller

N_{c}

ensures that the optimization can be completed within the sampling interval. The initial conditions consisted of a spring temperature

T_{0} = 20 ° C

and the initial stiffness coefficients

y_{0} = 80.6190

N/m. The hysteresis virtual state vector

w \in R^{11}

was initialized using the operator definition in (13):

w_{0} = {\{0.8675, 0.8423, 0.8171, 0.7919, 0.7667, 0.7415, 0.7163, 0.6911, 0.6659, 0.6408, 0.6156\}}^{T}

Realistic perturbations were incorporated by injecting zero-mean Gaussian process noise

ω_{k} \sim N (0, Q_{c})

and measurement noise

υ_{k} \sim N (0, R_{c})

with diagonal covariance matrices:

Q_{c} = diag ([0 . 1^{2}, 0 . 01^{2}, \dots, 0 . 01^{2}]) \in R^{12 \times 12},

R_{c} = diag ([0 . 25^{2}, 0 . 25^{2}]) \in R^{2 \times 2} .

To validate the estimation capability of the EKF for both measurable outputs and virtual hysteresis states under process and measurement noise, a low-frequency sinusoidal excitation signal

u (t) = 0.5 (sin (0.005 t) + 1)

is applied to the system for the first 50 s, followed by a period of zero input for the subsequent 50 s. The process noise covariance matrix

Q_{a}

and the measurement noise covariance matrix

R_{a}

in the EKF were configured to match the true noise covariance matrices

Q_{c}

and

R_{c}

of the simulation environment. The estimation performance and quantitative analysis results are shown in Figure 10 and Table 3.

Figure 10a shows the estimated temperature profile of the actuator, Figure 10b shows the estimated stiffness coefficients trajectory against the measured and true values, and Figure 10c presents the Euclidean norm evolution of the virtual hysteresis state estimate. Table 3 presents quantitative performance metrics. The simulation results demonstrate that under measurement noise, the EKF data fusion mechanism achieves a 44.66% reduction in temperature estimation standard deviation, a 14.16% reduction in stiffness coefficients estimation standard deviation, and a virtual hysteresis state estimation error norm that is consistently below 0.08. This results validate the effectiveness of the EKF in estimating both the measurable outputs and unmeasurable hysteretic states under noisy conditions.

To analyze the sensitivity of the proposed Extended Kalman Filter (EKF) to initial state errors and noise parameter tuning, two sets of comparative simulation experiments were conducted. First, to assess the convergence performance under uncertain initialization, three experimental cases were established with the noise covariance matrices held constant. These cases corresponded to initial temperature estimates of 20 °C, 40 °C, and 60 °C, respectively. As illustrated in Figure 11, despite the significant discrepancies in the initial states, the state estimates in all cases rapidly converged to the true trajectory within approximately 0.7 s. This demonstrates that the algorithm possesses strong robustness to initial errors and exhibits fast convergence capabilities. Second, to evaluate the sensitivity of the filter to process noise parameters, three different filter configurations were selected while maintaining accurate initial states. Specifically, the process noise covariance matrix

Q_{a}

was scaled by factors of 0.1, 1, and 10, respectively. For these three scenarios, the Mean Squared Error (MSE) of the estimation errors for temperature, stiffness, and the Euclidean norm of the virtual hysteresis state vector

w

were calculated. The quantitative comparisons are summarized in Table 4. The results indicate that while the baseline parameter configuration yields the lowest MSEs, the estimation errors remain within a low and bounded range even under significant parameter mismatch. This confirms that the proposed algorithm maintains stable tracking performance across different noise parameter configurations.

To validate the effectiveness of the MPC algorithm, simulation studies were conducted for multi-setpoint tracking under disturbance conditions. Comparative analyses were performed against the traditional PID control and feedforward compensation control. The PID parameters were initially determined using the Ziegler–Nichols frequency response method and subsequently fine-tuned to achieve an optimal balance between transient response speed and overshoot suppression. The feedforward compensation strategy, developed in reference to [29], consists of a static inverse-GPI model to cancel hysteresis nonlinearities, integrated with a Model Reference Adaptive Control (MRAC) for thermal regulation and a PID loop to compensate for the modeling errors of the inverse-GPI model. Figure 12 presents the modeled stiffness coefficients tracking and error curves of the three control strategies, and Table 5 provides quantitative comparison results of the key performance metrics.

The simulation results indicate that the proposed EKF-MPC controller exhibits superior overall performance across all evaluated metrics. In terms of overshoot suppression, the EKF-MPC demonstrates a significant advantage. Simulation data from Steps 3 to 5 show that its overshoot is maintained within 2%, outperforming both the PID and feedforward–feedback control schemes. Notably, EKF-MPC achieves a faster response during the stiffness reduction phases (Steps 4–5). Fundamentally, both PID and feedforward–feedback control are reactive. They require the perception of a tracking error or a discrete change in the reference stiffness to trigger a reduction in the heating current. Specifically, the PID controller must wait for an error signal to emerge before adjusting its output, while the feedforward component remains static until the reference stiffness trajectory shifts. Consequently, these methods rely on a slow integration process to compensate for model mismatches. In contrast, by leveraging the receding horizon optimization mechanism, the MPC is capable of pre-adjusting the control input based on stiffness variations within the prediction horizon. This proactive control action allows for the early modulation of the input current, effectively compensating for the thermal inertia and passive cooling limitations that hinder rapid stiffness adjustment in SMA actuators. Regarding settling time, the EKF-MPC exhibits the fastest response across all steps, with an average settling time of 6.38 s, representing a substantial improvement over the 11.1 s recorded for the PID control. Furthermore, the EKF-MPC achieves the lowest average steady-state error (0.1355 N/m) among the three methods. These findings confirm that the integration of EKF state estimation with the MPC disturbance rejection mechanism effectively enhances the control precision and stability of the system under complex hysteric nonlinear conditions.

5. Conclusions

In this study, a SGPI model was proposed by reconstructing hysteresis operators within the GPI framework to characterize the strongly saturated and asymmetric hysteresis in SMA actuators. The model parameters were identified using the particle swarm optimization method. Owing to the differentiability of the SGPI model, state-space functions were established to implement an EKF-MPC strategy, achieving real-time estimation of unmeasurable virtual hysteresis states and precise positioning control. Simulations demonstrated rapid and effective reference tracking under process and measurement noise disturbances. While the EKF-MPC framework is designed to robustly compensate for parametric uncertainties via integral action, the current validation was limited to a nominal load. Future work will focus on rigorous experimental validation under varying dynamic loads.

Author Contributions

Conceptualization, W.L., H.W. and Y.P.; Data curation, W.L.; Formal analysis, W.L.; Funding acquisition, H.W., X.T. and K.W.; Investigation, W.L.; Methodology, W.L.; Project administration, H.W., X.T. and K.W.; Resources, W.Z.; Software, W.L.; Supervision, W.Z.; Visualization, W.L.; Validation, W.L. and H.W.; Writing—original draft preparation, W.L.; Writing—review and editing, Y.P. and W.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data used to support this study are available from the authors upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Lemma A1

(Local Observability of Hysteresis States). Consider the hysteresis system described by the Smoothed Generalized Prandtl-Ishlinskii (SGPI) model, defined as follows:

\{\begin{matrix} w_{i} (k + 1) = smax (u_{k} - r_{i}, smin (u_{k} + r_{i}, w_{i} (k))) \\ y (k) = \sum_{i = 1}^{n} p_{i} w_{i} (k) \end{matrix}

(A1)

where

w_{i}

represents the state of the i-th operator with threshold

r_{i}

, and

p_{i}

are the weights. Assume the input signal

u_{k}

satisfies the Persistently Exciting (PE) condition, implying that the variation range of the input signal is sufficient to cover the threshold ranges of all hysteresis operators. Then, the hysteresis state vector

w_{k} = {[w_{1, k}, \dots, w_{n, k}]}^{⊤}

satisfies the local observability condition when observed through the stiffness measurement

y_{k}

.

Proof.

The local observability of the system depends on the full rank condition of the observability matrix derived from the linearized error dynamics. According to the measurement equation presented in the manuscript, the linearized observation matrix

H_{w} \in R^{1 \times n}

is given by

H_{w} = [p_{1}, \dots, p_{n}]

, where

p_{i} = ρ e^{- τ r_{i}}

denote non-zero and distinct weighting coefficients.

Consider the Jacobian matrix of the hysteresis subsystem, denoted as

Φ_{k} \in R^{n \times n}

. Based on the definitions of the smax and smin functions in the SGPI model,

Φ_{k}

is a diagonal matrix:

Φ_{k} = diag (λ_{1, k}, λ_{2, k}, \dots, λ_{n, k})

(A2)

where the diagonal element

λ_{i, k}

is defined as the partial derivative of the i-th SGPI operator with respect to its own state at time step k. Considering the loading (

Δ u_{k} > 0

) and unloading (

Δ u_{k} < 0

) phases, the explicit expression for

λ_{i, k}

is derived as

λ_{i, k} = \{\begin{matrix} \frac{1}{2} (1 + \frac{w_{i, k} - (u_{k} - r_{i})}{\sqrt{{(w_{i, k} - (u_{k} - r_{i}))}^{2} + ε}}), & if Δ u_{k} > 0 \\ \frac{1}{2} (1 - \frac{w_{i, k} - (u_{k} + r_{i})}{\sqrt{{(w_{i, k} - (u_{k} + r_{i}))}^{2} + ε}}), & if Δ u_{k} < 0 \end{matrix}

(A3)

As indicated by the equation above,

λ_{i, k}

varies continuously within the interval

(0, 1)

. Specifically,

Elastic Region: When the effective input $u_{k} \pm r_{i}$ is far from the current state $w_{i, k}$ , $λ_{i, k} \approx 1$ .
Saturated Region: When the operator is fully driven by the input, $λ_{i, k} \approx 0$ .
Transition Region: When $w_{i} (k) \approx u_{k} \pm r_{i}$ , $λ_{i, k}$ transitions smoothly between 0 and 1 due to the smoothing parameter $ε$ .

Construct the local observability matrix

O_{w} \in R^{n \times n}

within an n-step time window:

O_{w} = [\begin{matrix} H_{w} \\ H_{w} Φ_{k} \\ ⋮ \\ H_{w} \prod_{j = 0}^{n - 2} Φ_{k + j} \end{matrix}] = [\begin{matrix} p_{1} & p_{2} & \dots & p_{n} \\ p_{1} λ_{1, k} & p_{2} λ_{2, k} & \dots & p_{n} λ_{n, k} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ p_{1} \prod_{j = 0}^{n - 2} λ_{1, k + j} & p_{2} \prod_{j = 0}^{n - 2} λ_{2, k + j} & \dots & p_{n} \prod_{j = 0}^{n - 2} λ_{n, k + j} \end{matrix}]

(A4)

To prove the system is observable, we must demonstrate that

O_{w}

has full rank. This is equivalent to proving that all column vectors

v_{i}

of

O_{w}

are linearly independent.

We proceed by contradiction. Assume the column vectors are linearly dependent; then, there exists a set of constants

c_{1}, c_{2}, \dots, c_{n}

, not all zero, such that for any input state, the following linear combination vanishes:

\sum_{i = 1}^{n} c_{i} v_{i} = 0

(A5)

We examine the row in this vector equation containing first-order derivative information (i.e., the second row of

O_{w}

). This implies that the weighted sum of the derivatives of each operator must be zero for any arbitrary input u:

\sum_{i = 1}^{n} {\tilde{c}}_{i} λ_{i, k} (u) = 0, \forall u

(A6)

where

{\tilde{c}}_{i} = c_{i} p_{i}

are the modified coefficients. Differentiating the above equation with respect to input u, and defining the input sensitivity function

ϕ_{i} (u)

for the i-th operator:

ϕ_{i, k} (u) = \frac{\partial λ_{i, k} (u)}{\partial u} = \frac{ε}{{[{(w_{i, k} - (u \pm r_{i}))}^{2} + ε]}^{3 / 2}}

(A7)

The function

ϕ_{i} (u)

behaves as a bell-shaped function with local compact support characteristics. It achieves a strict maximum if and only if the input u satisfies

w_{i, k} - (u \pm r_{i}) = 0

. This indicates that the peak sensitivity corresponds precisely to the state switching point of the operator.

Since the SGPI model assigns a unique threshold

r_{i}

to each operator, such that

r_{1} < r_{2} < \dots < r_{n}

, for any two distinct operators i and j, the peak positions of their sensitivity functions

ϕ_{i} (u)

and

ϕ_{j} (u)

are strictly separated along the input axis. We select a specific test input

u^{*}

located at the switching point of the j-th operator, i.e.,

u^{*} = w_{j, k} \pm r_{j}

. Substituting this into the derived linear combination equation

\sum {\tilde{c}}_{i} ϕ_{i} (u) = 0

yields

{\tilde{c}}_{j} \cdot ϕ_{j} (u^{*}) + \sum_{i \neq j} {\tilde{c}}_{i} \cdot ϕ_{i} (u^{*}) = 0

(A8)

At this specific input,

ϕ_{j} (u^{*})

attains a significant non-zero maximum value. Conversely, for other terms

i \neq j

, due to the threshold difference

r_{i} - r_{j} \neq 0

, the input

u^{*}

is far from the switching point of the i-th operator. Given a sufficiently small smoothing parameter

ε

, it follows that

ϕ_{i} (u^{*}) \approx 0

. Therefore, to satisfy the equilibrium, we must have

{\tilde{c}}_{j} = 0

.

Iterating this logic through all

j \in {1, \dots, n}

, it is proven that all coefficients

{\tilde{c}}_{j}

must be zero. Since the weights

p_{i} \neq 0

, the original coefficients

c_{i}

must all be zero. This confirms that the column vectors of the observability matrix

O_{w}

are linearly independent in the real domain, i.e.,

rank (O_{w}) = n

.

Consequently, the hysteresis state vector satisfies the local observability condition. □

References

Choi, J.S.; Park, T.Y.; Chae, B.G.; Oh, H.U. Development of Lightweight 6 m Deployable Mesh Reflector Antenna Mechanisms Based on a Superelastic Shape Memory Alloy. Aerospace 2024, 11, 738. [Google Scholar] [CrossRef]
Carnier, F.; Villa, F.; Rigamonti, D.; Villa, E.; Di Landro, L.A.; Grande, A.M.; Bettini, P. Shape Memory Alloy Torsional Actuators Enabling Autonomous Thermal Control in Small Satellites. Aerospace 2025, 12, 1029. [Google Scholar] [CrossRef]
Liu, M.; Wang, Z.; Ikeuchi, D.; Fu, J.; Wu, X. Design and Simulation of a Flexible Bending Actuator for Solar Sail Attitude Control. Aerospace 2021, 8, 372. [Google Scholar] [CrossRef]
Azid, N.; Abdullah, E.J.; Harmin, M.Y. Self-Sensing Morphing Wing Using Shape Memory Alloy Actuator. J. Aeronaut. Astronaut. Aviat. 2025, 57, 785–800. [Google Scholar]
Turner, T.L.; Cabell, R.; Cano, R.J.; Silcox, R.J. Development of a Preliminary Model-Scale Adaptive Jet Engine Chevron. AIAA J. 2008, 46, 2545–2557. [Google Scholar] [CrossRef]
Uchil, J.; Mohanchandra, K.P.; Ganesh Kumara, K.; Mahesh, K.K.; Murali, T.P. Thermal Expansion in Various Phases of Nitinol Using TMA. Phys. B 1999, 270, 289–297. [Google Scholar] [CrossRef]
Tanaka, K. A Thermomechanical Sketch of Shape Memory Effect: One-Dimensional Tensile Behavior. Res. Mech. 1986, 2, 59–72. [Google Scholar]
Liang, C.; Rogers, C.A. One-Dimensional Thermomechanical Constitutive Relations for Shape Memory Materials. J. Intell. Mater. Syst. Struct. 1990, 1, 207–234. [Google Scholar] [CrossRef]
Brinson, L.C. One-Dimensional Constitutive Behavior of Shape Memory Alloys: Thermomechanical Derivation with Non-Constant Material Functions and Redefined Martensite Internal Variable. J. Intell. Mater. Syst. Struct. 1993, 4, 229–242. [Google Scholar] [CrossRef]
Banks, H.T.; Kurdila, A.J. Hysteretic Control Influence Operators Representing Smart Material Actuators: Identification and Approximation. In Proceedings of the 35th IEEE Conference on Decision and Control, Kobe, Japan, 13 December 1996; IEEE: Piscataway, NJ, USA, 1996; Volume 4, pp. 3711–3716. [Google Scholar]
Al Janaideh, M.; Al Saaideh, M.; Tan, X. The Prandtl–Ishlinskii Hysteresis Model: Fundamentals of the Model and Its Inverse Compensator. IEEE Control Syst. Mag. 2023, 43, 66–84. [Google Scholar] [CrossRef]
Oh, J.; Bernstein, D.S. Piecewise Linear Identification for the Rate-Independent and Rate-Dependent Duhem Hysteresis Models. IEEE Trans. Autom. Control 2007, 52, 576–582. [Google Scholar] [CrossRef]
Zhu, W.; Rui, X.T. Hysteresis Modeling and Displacement Control of Piezoelectric Actuators with the Frequency-Dependent Behavior Using a Generalized Bouc–Wen Model. Precis. Eng. 2016, 43, 299–307. [Google Scholar] [CrossRef]
Jiles, D.C.; Atherton, D.L. Theory of Ferromagnetic Hysteresis. J. Appl. Phys. 1984, 55, 2115–2120. [Google Scholar] [CrossRef]
Gu, G.Y.; Zhu, L.M.; Su, C.Y.; Ding, H.; Fatikow, S. A dynamic hysteresis model of piezoelectric ceramic actuators. IEEE/ASME Trans. Mechatron. 2024, 21, 2449–2459. [Google Scholar]
Li, Z.; Liu, Y. Data based constitutive modelling of rate independent inelastic effects in composite cables using Preisach hysteresis operators. Compos. Struct. 2024, 330, 117825. [Google Scholar]
Edelen, A.; Scheinker, A.; Ferme, G.; Mayes, B. Differentiable preisach modeling for characterization and optimization of particle accelerator systems with hysteresis. Phys. Rev. Accel. Beams 2024, 27, 034601. [Google Scholar]
Zhang, X.; Yan, P. Modeling of Dynamic Systems with Hysteresis Using Predictive Gradient-Based Method. IEEE Trans. Ind. Electron. 2024, 71, 8540–8549. [Google Scholar]
Chen, X.; Song, J. Temperature-dependent asymmetric Prandtl-Ishlinskii hysteresis model for piezoelectric actuators. Mech. Syst. Signal Process. 2024, 208, 111054. [Google Scholar]
Wang, L.; Li, J.; Zhao, X. Improved PI hysteresis model with one-sided dead-zone operator for soft joint actuator. Sens. Actuators A Phys. 2024, 365, 114890. [Google Scholar]
Li, M.; Feng, Y. A rate-dependent KP modeling and direct compensation control technique for hysteresis in piezo-nanopositioning stages. IEEE Trans. Control Syst. Technol. 2024, 32, 1245–1256. [Google Scholar]
Yang, Z.; Luo, Y. Development of a butterfly fractional-order backlash-like hysteresis model for dielectric elastomer actuators. Nonlinear Dyn. 2024, 112, 567–584. [Google Scholar]
Zhao, J.; Li, Y.; Cao, Y.; Zhang, F.; Cui, M.; Xu, R. High-Precision Position Tracking Control with a Hysteresis Observer Based on the Bouc–Wen Model for Smart Material-Actuated Systems. Actuators 2024, 13, 105. [Google Scholar] [CrossRef]
Zhou, H.; Zhang, D. Characterizing the electric field-and rate-dependent hysteresis of piezoelectric ceramics shear motion with the Bouc-Wen model. J. Intell. Mater. Syst. Struct. 2024, 35, 412–425. [Google Scholar]
Gan, J.; Mei, Z.; Chen, X.; Zhou, Y.; Ge, M.-F. A Modified Duhem Model for Rate-Dependent Hysteresis Behaviors. Micromachines 2019, 10, 680. [Google Scholar] [CrossRef] [PubMed]
Liu, Y.; Li, X. Review on the nonlinear modeling of hysteresis in piezoelectric ceramic actuators. Measurement 2024, 224, 113910. [Google Scholar]
Ma, X.; Wang, H.; Chen, G. Hysteresis identification of SMA actuators using recurrent neural networks. Materials 2024, 17, 3311. [Google Scholar]
Chandra, A.; Daniels, B.; Curti, M.; Tiels, K.; Lomonova, E.A. Magnetic Hysteresis Modeling with Neural Operators. IEEE Trans. Magn. 2025, 61, 1–11. [Google Scholar] [CrossRef]
Zakerzadeh, M.R.; Sayyaadi, H. Precise Position Control of Shape Memory Alloy Actuator Using Inverse Hysteresis Model and Model Reference Adaptive Control System. Mechatronics 2013, 23, 1150–1162. [Google Scholar] [CrossRef]
Quintanar-Guzmán, S.; Kannan, S.; Aguilera-González, A.; Olivares-Mendez, M.A.; Voos, H. Operational Space Control of a Lightweight Robotic Arm Actuated by Shape Memory Alloy Wires: A Comparative Study. J. Intell. Mater. Syst. Struct. 2019, 30, 1368–1384. [Google Scholar] [CrossRef]
Bil, C.; Massey, K.; Abdullah, E.J. Wing Morphing Control with Shape Memory Alloy Actuators. J. Intell. Mater. Syst. Struct. 2013, 24, 879–898. [Google Scholar] [CrossRef]
Schmidt, J.; Sakovsky, M. Model-Based Design of Antagonistic Shape Memory Alloy Actuators for Bistable Structures. AIAA J. 2025; published online. [Google Scholar]
Zhang, X.; Li, B.; Chen, X.; Li, Z.; Peng, Y.; Su, C.Y. Adaptive Implicit Inverse Control for a Class of Discrete-Time Hysteretic Nonlinear Systems and Its Application. IEEE/ASME Trans. Mechatron. 2020, 25, 2112–2122. [Google Scholar] [CrossRef]
Xiang, C.; Yang, H.; Sun, Z.; Xue, B.; Hao, L.; Rahoman, M.D.A.; Davis, S. The Design, Hysteresis Modeling and Control of a Novel SMA-Fishing-Line Actuator. Smart Mater. Struct. 2017, 26, 037004. [Google Scholar] [CrossRef]
Tai, N.T.; Ahn, K.K. A Hysteresis Functional Link Artificial Neural Network for Identification and Model Predictive Control of SMA Actuator. J. Process Control 2012, 22, 766–777. [Google Scholar] [CrossRef]
Bekhiti, B.; Al-Sabur, R.; Roudane, M.; Younis, J.A.; Sharkawy, A.N. Intelligent Neuro-Fuzzy Adaptive MIMO Control for a Self-Balancing Two Wheeled Autonomous Robot via Recursive Resolution of the Matrix Diophantine Equation. Discov. Robot. 2025, 1, 11. [Google Scholar] [CrossRef]
Dong, F.; Xie, H.; Hu, Q. Deep-Hammerstein-Model-Based Predictive Control for Piezoelectric Actuators in Trajectory Tracking Applications. In Proceedings of the 2023 China Automation Congress (CAC), Chongqing, China, 17–19 November 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 5425–5430. [Google Scholar]
Lee, J.W. Adaptive Sensorless Control of High Speed PMSM with Back EMF Constant Variation. In Proceedings of the 2015 9th International Conference on Power Electronics and ECCE Asia (ICPE-ECCE Asia), Seoul, Republic of Korea, 1–5 June 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1400–1404. [Google Scholar]
Huang, Z.; Best, M.; Knowles, J.; Fly, A. Adaptive Piecewise Equivalent Circuit Model with SOC/SOH Estimation Based on Extended Kalman Filter. IEEE Trans. Energy Convers. 2023, 38, 959–970. [Google Scholar] [CrossRef]
Icardi, U.; Ferrero, L. SMA Actuated Mechanism for an Adaptive Wing. J. Aerosp. Eng. 2011, 24, 140–143. [Google Scholar] [CrossRef]
Bhattacharyya, A.; Lagoudas, D.C.; Wang, Y.; Kinra, V.K. On the Role of Thermoelectric Heat Transfer in the Design of SMA Actuators: Theoretical Modeling and Experiment. Smart Mater. Struct. 1995, 4, 252–263. [Google Scholar] [CrossRef]
Xiao, J.; Zhang, Z.; Terzi, S.; Anwer, N.; Eynard, B. Dynamic Task Allocations with Q-Learning Based Particle Swarm Optimization for Human-Robot Collaboration Disassembly of Electric Vehicle Battery Recycling. Comput. Ind. Eng. 2025, 204, 111133. [Google Scholar] [CrossRef]
Andersson, J.A.E.; Gillis, J.; Horn, G.; Rawlings, J.B.; Diehl, M. CasADi: A Software Framework for Nonlinear Optimization and Optimal Control. Math. Program. Comput. 2018, 11, 1–36. [Google Scholar] [CrossRef]

Figure 1. SMA actuators in aerospace applications: (a) Architecture of the adaptive wing structure and SMA actuators [40]; (b) SMA space soft-bodied crawling robot.

Figure 2. Mathematical modeling of SMA actuator.

Figure 3. Generalized Prandtl–Ishlinskii model: (a) Cascaded Structure of the GPI hysteresis model; (b) the play hysteresis operator; (c) hyperbolic tangent envelope function.

Figure 5. SMA actuator: (a) SMA actuator structure schematic; (b) SMA actuator experimental setup.

Figure 6. The parameter identification process of the SGPI model based on the particle swarm optimization algorithm.

Figure 7. Comparison of output characteristics between SGPI/GPI models and SMA actuator experimental data: (a) Model-experimental output characteristics; (b) model prediction error.

Figure 8. The structure of the Extended Kalman Filter.

Figure 9. The structure of the Model Predictive Control.

Figure 10. Simulation results of the Extended Kalman Filter: (a) temperature estimation; (b) stiffness coefficients estimation; (c) Euclidean norm of the estimation error in the virtual hysteresis state.

Figure 11. Estimation errors under different initial state guesses: (a) temperature estimation; (b) stiffness coefficients estimation; (c) Euclidean norm of the estimation error in the virtual hysteresis state.

Figure 12. Simulation results of the Model Predictive Control: (a) stiffness coefficients tracking performance under different controllers; (b) tracking error comparison.

Table 1. Model parameter identification results.

Parameters	Value
$a_{0}$	0.3301
$a_{1}$	0.1034
$a_{2}$	−7.2809
$a_{3}$	1.1976
$b_{0}$	1.2183
$b_{1}$	0.0509
$b_{2}$	−3.5333
$b_{3}$	1.6089
$ρ$	10.0000
$τ$	0.1000
c	0.0252

Table 2. Definition of error functions and identification error indices.

Error Function	Definition	SGPI Model	GPI Model
RMSE	$\sqrt{\frac{\sum_{i = 1}^{N} {(y_{0} (t_{i}) - y (t_{i}))}^{2}}{N}}$	0.8200	1.3690
AAE	$\frac{\sum_{i = 1}^{N} \|y_{0} (t_{i}) - y (t_{i})\|}{N}$	0.6270	1.0328
MRDE	$\frac{max (\|y (t_{i}) - y_{0} (t_{i})\|)}{max (y_{0}) - min (y_{0})} \times 100 %$	3.27%	6.12%

Table 3. Mean squared error comparison of measured and estimated values.

Variable	Measured MSE	Estimated MSE
Temperature	0.2499	0.1383
Stiffness coefficients	0.2409	0.2068

Table 4. MSE of estimation errors for temperature, stiffness, and virtual hysteresis state under different process noise parameter settings.

Parameter Settings	MSE of Temp. (°C)	MSE of Stiff. (N/m)	MSE of $∥ \tilde{w} ∥_{2}$
Under-estimated $Q_{a} = 0.1 Q_{c}$	0.3943	0.6102	0.0206
Baseline Tuning $Q_{a} = Q_{c}$	0.1383	0.2068	0.0115
Over-estimated $Q_{a} = 10 Q_{c}$	0.2352	0.2396	0.0116

Table 5. Performance comparison of different controllers across step transitions.

Controller	Overshoot (%)	Settling Time (s)	RMSE	Steady Error (N/m)
Step 1: 80 → 90 N/m
EKF-MPC	11.31	8.3	1.2239	0.0915
PID	18.01	10.1	1.2740	0.1697
Feedforward	91.99	25.8	2.3078	0.3165
Step 2: 90 → 110 N/m
EKF-MPC	4.53	1.0	0.9123	0.1990
PID	31.41	13.5	2.1161	0.3413
Feedforward	42.30	13.6	2.3681	0.1602
Step 3: 110 → 150 N/m
EKF-MPC	1.83	1.6	1.9744	0.1774
PID	1.39	2.8	3.5985	0.6002
Feedforward	6.05	20.8	3.3020	0.3649
Step 4: 150 → 120 N/m
EKF-MPC	2.41	8.5	4.2379	0.1047
PID	3.24	13.3	6.3494	0.3675
Feedforward	4.07	13.6	6.5506	0.1047
Step 5: 120 → 90 N/m
EKF-MPC	1.75	12.5	4.3793	0.1047
PID	2.40	15.8	5.9443	0.2449
Feedforward	1.36	13.1	4.8037	0.3242
Overall Average Performance
EKF-MPC	4.37	6.38	2.5455	0.1355
PID	11.29	11.10	3.8564	0.3447
Feedforward	29.15	17.38	3.8664	0.2541

Note: Steady Error represents the mean absolute error after the system reaches a stable state; RMSE evaluates the dynamic tracking precision throughout each step period.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, W.; Wei, H.; Pang, Y.; Tang, X.; Wang, K.; Zhou, W. Model Predictive Control for an SMA Actuator Based on an SGPI Model. Aerospace 2026, 13, 112. https://doi.org/10.3390/aerospace13020112

AMA Style

Liu W, Wei H, Pang Y, Tang X, Wang K, Zhou W. Model Predictive Control for an SMA Actuator Based on an SGPI Model. Aerospace. 2026; 13(2):112. https://doi.org/10.3390/aerospace13020112

Chicago/Turabian Style

Liu, Wei, Houzhen Wei, Yan Pang, Xudong Tang, Kai Wang, and Wenya Zhou. 2026. "Model Predictive Control for an SMA Actuator Based on an SGPI Model" Aerospace 13, no. 2: 112. https://doi.org/10.3390/aerospace13020112

APA Style

Liu, W., Wei, H., Pang, Y., Tang, X., Wang, K., & Zhou, W. (2026). Model Predictive Control for an SMA Actuator Based on an SGPI Model. Aerospace, 13(2), 112. https://doi.org/10.3390/aerospace13020112

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Model Predictive Control for an SMA Actuator Based on an SGPI Model

Abstract

1. Introduction

2. Mathematical Modeling of SMA Actuator

2.1. SMA Spring Thermodynamic Model

2.2. GPI Hysteresis Model

2.3. SGPI Hysteresis Model

2.4. Identification the SGPI Model Parameters

3. Extended Kalman Filter and Model Predictive Control Design

3.1. Electro-Thermo-Mechanical State-Space Model

3.2. Extended Kalman Filter Design

3.3. Model Predictive Control Design

4. Simulation

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI