Actuator Selection and Control of an Array of Electromagnetic Soft Actuators

Zolfaghari, Hussein; Ebrahimi, Nafiseh; Pitkow, Xaq; Davoodi, Mohammadreza

doi:10.3390/electronics14183682

¹ Department of Electrical and Computer Engineering, The University of Memphis, Memphis, TN 38152, USA

² AI Institute for Artificial and Natural Intelligence (ARNI), New York, NY 10027, USA

³ Department of Applied Engineering Technology, Virginia State University, Petersburg, VA 23806, USA

⁴ Neuroscience Institute and Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA 15213, USA

* Author to whom correspondence should be addressed.

Electronics 2025, 14(18), 3682; https://doi.org/10.3390/electronics14183682

Submission received: 15 August 2025 / Revised: 10 September 2025 / Accepted: 11 September 2025 / Published: 17 September 2025

(This article belongs to the Special Issue Advances in Intelligent Control Systems)

Abstract

Electromagnetic soft actuator arrays (ESAAs) combine compliance with fast, controllable actuation and scalability, providing a promising foundation for the development of interconnected soft actuator arrays inspired by the structure and function of biological muscles. In this work, we present a control framework and an actuator selection strategy for an artificial soft muscle composed of ESAAs to enable accurate reference tracking. Since directly measuring the states of each ESA is often impractical in real-world applications, we first design a Kalman filter-based observer to estimate all system states from available observations. Using these estimates, we develop a Linear Quadratic Gaussian (LQG) controller to achieve reference tracking. Since thermal buildup from constant use can damage the actuators, we consider whether switching between different subsets of active actuators could offer thermal relief. While actuator switching intuitively suggests reduced heating by providing resting periods, our investigation reveals that this strategy can lead to higher thermal accumulation compared to the continuous mode. This is because having fewer active actuators in the switching mode requires substantially larger control effort, and, in the absence of effective active cooling, the rest periods fail to provide sufficient heat dissipation during operation. Simulation results are presented to demonstrate the effectiveness of the proposed method in achieving the trajectory tracking objective and to explore how switching affects the system’s thermal profile, revealing a trade-off between tracking performance and heat generation.

Keywords: control systems; electromagnetic soft actuator; electromagnetic soft actuator array; actuator selection; Kalman filter

1. Introduction

Soft robots are a new class of robots made from flexible and compliant materials, enabling them to move and adapt more like living organisms [1]. Unlike traditional rigid robots, soft robots can continuously deform, allowing them to safely interact with humans and adapt to unstructured or dynamic environments. These features make soft robots highly suitable for a variety of applications, including wearable assistive devices, rehabilitation technologies, minimally invasive surgical tools, search and rescue operations, and bio-inspired locomotion [2,3,4]. A core component of such robots is the use of bio-inspired actuators, which are responsible for producing motion through deformation in response to external stimuli. Various types have been developed, including series elastic actuators (SEAs) [5], shape memory alloys (SMAs) [6], and pneumatic artificial muscles (PAMs) [7]. While SEAs and SMAs offer certain advantages, they suffer from bulkiness, slow response, or poor energy efficiency [8,9,10,11,12]. PAMs can generate large forces but require external air sources, limiting portability [13]. To address these limitations, electromagnetic soft actuators (ESAs) have emerged as a compact, lightweight, and electrically driven alternative. ESAs offer fast, controllable motion and are well-suited for wearable and mobile soft robotic applications [14,15].

It has been both analytically and experimentally demonstrated that scaling down the size of an ESA increases its force per unit cross-sectional area (F/CSA) [16,17,18], making it more efficient in generating force within a compact footprint. Leveraging this property, recent studies have focused on optimizing the size and structure of ESAs and integrating them into a coordinated array, i.e., an electromagnetic soft actuator array (ESAA), that mimics the function of biological muscles [2]. This array configuration enables the generation of greater force in limited spaces. It is inspired by biological muscles, which achieve this through bundles of sarcomeres, the fundamental units of muscle contraction. Each sarcomere consists of repeating structures of myosin and actin filaments that interact to produce force and motion. This biological architecture informs the design of the actuator array developed in our work. In this work, our primary focus is on developing effective control strategies for the ESAAs to enable precise tracking of a desired trajectory.

In recent years, there has been growing interest and several compelling results in the control of soft robots. For example, model predictive control (MPC) has been applied to a six-degree-of-freedom pneumatic robot with compliant plastic joints and rigid links [19]. Rus and Tolley [20] introduced a dynamic curvature controller and a Cartesian impedance controller for continuous soft robots. These controllers enabled closed-loop control by approximating the robot’s behavior with piecewise constant curvature assumptions. In [21], a data-driven method based on Koopman operator theory was used to derive a linear model for a soft pneumatic arm, and a model predictive controller was designed on top of it. In [22], the authors provide an overview of actuator mechanisms and control strategies, including open-loop, closed-loop, and autonomous control, and discuss their implementation from various perspectives. However, these control strategies generally target isolated actuators or particular robot architectures and thus do not easily extend to the control of interconnected soft actuator arrays. To the best of our knowledge, research on actuator arrays, particularly those based on networked ESAs, remains very limited. Reference [2] is among the few studies that have explored an array of bio-inspired actuators; however, important aspects such as the presence of noise in practical systems and partial state measurability due to limited sensing have not been adequately addressed. Our current work aims to design an optimal control strategy for a series of actuators that addresses these issues.

Since ESAs are prone to overheating, which can cause magnet degradation, increased risk of thermal damage and actuator failure, we are motivated to explore switching strategies that aim to provide rest periods for overheated actuators. This objective leads to the formulation of the actuator selection problem: determining, at each time step, the optimal subset of actuators that preserves required performance while minimizing hardware stress and energy use. Actuator and sensor selection has been widely studied across domains. For example, Taha et al. [23] investigated actuator selection in cyber-physical power systems using mixed-integer semidefinite and bilinear matrix inequality formulations and proposed greedy and branch-and-bound algorithms to address non-submodular objectives. Zare et al. [24] introduced a scalable proximal framework with structured regularization for large-scale stochastic systems, including aerospace applications. Despite these efforts, actuator selection remains a challenging problem, particularly in high-dimensional systems [2,23]. Motivated by real-time control requirements and system constraints, this research aims to develop a real-time actuator selection strategy capable of dynamically switching between different subsets of actuators.

This work presents a linear-quadratic-Gaussian (LQG)-based control framework for ESAAs, addressing key challenges such as sparse state measurements, system noise, and reference tracking. Although the ESAA is inherently nonlinear, we construct a simplified linear model that captures its essential dynamics while remaining tractable for control design. Given the physical constraints and limited space within artificial muscles, which make embedding numerous sensors infeasible, we employ a Kalman filter to estimate the full system state from noisy, partial observations. To enable accurate trajectory tracking, we augment the system with a reference trajectory generator, converting the tracking problem into a more manageable regulation problem [25,26]. This approach yields an LQG tracker that jointly performs optimal state estimation and control under uncertainty [27,28,29,30,31]. Furthermore, to prevent overheating and provide rest time for the actuators within the array, we implement an actuator selection strategy that dynamically switches between subsets of actuators in real time. Since identifying the globally optimal actuator subset is a computationally intractable problem due to its combinatorial complexity [32,33], we adopt a greedy algorithm that provides a practical balance between performance and computational speed, enabling real-time control of the ESAA.

2. System Description and Problem Formulation

In this section, we introduce the actuator array model and define the problem under consideration. Figure 1a illustrates an array of six ESAs arranged into three parallel strands, each containing two actuators connected in series. The parallel strands share the total generated output force, while the actuators in series contribute to the overall deflection of the structure. Figure 1b provides a detailed view of an individual ESA. Each actuator is made primarily of biocompatible soft silicone, which encases conductive coils and a semi-soft magnetic core. An ESAA was tested in [34], reporting a force of approximately 2.5 N, which corresponds to an axial stress of about 10.6 kPa when normalized by cross section, with a strain of 15%. The ESAA shows potential for operation at around 10 Hz bandwidth. More details about the actuator structure and array can be found in [16,17,34].

2.1. Mathematical Modeling and Control-Oriented Formulation

In this section, a mathematical model of the ESA array (such as the one in Figure 1) is developed to support the design of the control framework and actuator selection strategy. Each deformable actuator can be modeled using two masses representing the conductive coils, shown in brown in Figure 1b. The two coils are connected by a soft, springy linkage shown in white, made of silicone, and modeled as a spring and a damper to represent its elastic and damping behavior. The connection between neighboring actuators is similarly modeled using springs and dampers, representing the mechanical interconnections of the array. The resulting system, consisting of $n$ identical actuators in series and $\alpha$ actuators in parallel, can thus be represented as a mass-spring-damper array, as illustrated in Figure 2. Each mass is denoted by $m$, with internal springy linkages modeled by stiffness $k_{2}$ and damping $c_{2}$. The external connections between adjacent actuators are defined by stiffness $k_{1}$ and damping $c_{1}$, which generally have significantly higher values than $k_{2}$ and $c_{2}$. Furthermore, parallel actuators and their corresponding masses are assumed to be connected via rigid elements to ensure synchronized motion across the parallel strands, guaranteeing that all strands experience the same deflection.

As shown in Figure 2, the variable $y(t)$ denotes the displacement of the last mass relative to its initial position at the fully extended system length $L$. The absolute position $p(t)$ of the last mass along the system is therefore given by

$$p(t) = L - y(t), \tag{1}$$

which reflects how the position of the last mass changes dynamically over time as a result of its displacement $y(t)$. This relationship, together with the displacements $x_{i}(t)$ of the intermediate masses, provides a basis for analyzing the evolution of positions throughout the series-connected array. Equation (1) is key to relating relative displacements to absolute spatial positions within the system.

This work focuses on controlling the position of the end point of the actuator array. Due to the rigid interconnection of the parallel actuators, the complex physical structure can be simplified to a single row of $n$ actuators connected in series. Figure 3 depicts this equivalent series connection of ESAs. In this structure, the total equivalent mass in each column is $\tilde{m} = \alpha m$, where $m$ is the mass of each unit and $\alpha$ represents the number of parallel actuators in a column. Correspondingly, the stiffness and damping coefficients aggregate linearly across each column, yielding $\tilde{k}_{1} = \alpha k_{1}$, $\tilde{k}_{2} = \alpha k_{2}$, $\tilde{c}_{1} = \alpha c_{1}$, and $\tilde{c}_{2} = \alpha c_{2}$. This simplification preserves the fundamental dynamic characteristics of the original actuator array while making the analysis and control design more tractable.

Remark 1.

While not all masses may experience identical dynamics in general, the model in Figure 3 assumes identical input forces across parallel actuators to focus on the primary objective of this work, accurate displacement tracking at the system output. This simplification enables tractable analysis and control design without compromising the fidelity needed for tracking performance.

Let $x_{i}(t)$ denote the displacement of the $i$-th mass. The motion of each mass follows Newton’s second law, accounting for the net forces from neighboring spring and damper elements. The dynamic equation for mass $i$ is

$$\tilde{m}\,\ddot{x}_{i}(t) = -\tilde{k}_{L}(x_{i} - x_{i-1}) - \tilde{k}_{R}(x_{i} - x_{i+1}) - \tilde{c}_{L}(\dot{x}_{i} - \dot{x}_{i-1}) - \tilde{c}_{R}(\dot{x}_{i} - \dot{x}_{i+1}) + f_{i}(t), \tag{2}$$

where $\tilde{k}_{L}$ and $\tilde{k}_{R}$ denote the stiffness coefficients and $\tilde{c}_{L}$ and $\tilde{c}_{R}$ denote the damping coefficients; the subscripts $L$ and $R$ refer to the left and right sides of the $i$-th mass. These coefficients take values from $\{\tilde{k}_{1}, \tilde{k}_{2}\}$ and $\{\tilde{c}_{1}, \tilde{c}_{2}\}$, respectively, depending on whether the connection is within one actuator ($\tilde{k}_{2}, \tilde{c}_{2}$) or between two adjacent actuators ($\tilde{k}_{1}, \tilde{c}_{1}$). The term $f_{i}(t)$ denotes the control effort of actuator $i$, which is applied only when the actuator is active and receives a signal from the controller.

To obtain a state-space model representation of the behavior of the entire system, we define the state vector:

$$z(t) = \begin{bmatrix} x_{1}(t), & x_{2}(t), & \dots, & x_{n}(t), & \dot{x}_{1}(t), & \dot{x}_{2}(t), & \dots, & \dot{x}_{n}(t) \end{bmatrix}^{\top} \in \mathbb{R}^{2n}. \tag{3}$$

We assume that due to the small size of the actuators, it is not feasible to add sensors to each one individually. Instead, a single position sensor is attached to the entire array to measure the overall deflection, i.e., the change in position induced by the entire array. The position measured by this sensor corresponds to the displacement of the last mass in the array with respect to a fixed reference, i.e., the wall. Therefore, we assume that the position of the last mass is measurable.

By writing the equations of motion for all the masses and combining them, the state-space dynamics of the system can be expressed as:

$$\dot{z}(t) = A z(t) + B u(t) + \zeta v(t), \qquad y(t) = C z(t) + w(t), \tag{4}$$

where $z(t) \in \mathbb{R}^{2n}$ denotes the state vector, $u(t) \in \mathbb{R}^{n}$ is the control effort, and $v(t)$, $w(t)$ represent the process and measurement noise vectors, respectively. The system matrices are defined as follows: $A \in \mathbb{R}^{2n \times 2n}$ is the dynamics matrix, $B \in \mathbb{R}^{2n \times n}$ is the input gain matrix, and $C \in \mathbb{R}^{1 \times 2n}$ is the observation matrix. The process noise $v(t)$ is scaled by the gain $\zeta \in \mathbb{R}$, and the measurement noise $w(t)$ is modeled as a white Gaussian noise process with covariance $W$. We also assume that the system is both fully observable and fully controllable.

$$A = \begin{bmatrix} 0 & I \\ A_{K} & A_{C} \end{bmatrix}, \qquad B = \begin{bmatrix} 0 \\ B_{act} \end{bmatrix}, \qquad C = \begin{bmatrix} 0 & \dots & 1 & 0 & \dots & 0 \end{bmatrix}, \tag{5}$$

where $A_{K}$, $A_{C}$, and $B_{act}$ are defined as follows:

$$A_{K} = \begin{bmatrix} \tilde{k}_{1} & -\tilde{k}_{1} & 0 & \dots & 0 \\ -\tilde{k}_{1} & \tilde{k}_{1} + \tilde{k}_{2} & -\tilde{k}_{2} & \dots & 0 \\ 0 & -\tilde{k}_{2} & \tilde{k}_{2} + \tilde{k}_{1} & \dots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \dots & \tilde{k}_{1} \end{bmatrix}, \qquad A_{C} = \begin{bmatrix} \tilde{c}_{1} & -\tilde{c}_{1} & 0 & \dots & 0 \\ -\tilde{c}_{1} & \tilde{c}_{1} + \tilde{c}_{2} & -\tilde{c}_{2} & \dots & 0 \\ 0 & -\tilde{c}_{2} & \tilde{c}_{2} + \tilde{c}_{1} & \dots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \dots & \tilde{c}_{1} \end{bmatrix},$$

$$B_{act} = \begin{bmatrix} 1 & 0 & 0 & \dots & 0 \\ -1 & 0 & 0 & \dots & 0 \\ 0 & 1 & 0 & \dots & 0 \\ 0 & -1 & 0 & \dots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \dots & 1 \\ 0 & 0 & 0 & \dots & -1 \end{bmatrix}. \tag{6}$$

To account for uncertainties and sensor limitations, we incorporate Gaussian process and measurement noise. Both are modeled as zero-mean Gaussian random variables, where $v(t) \sim \mathcal{N}(0, V)$ represents the process noise and $w(t) \sim \mathcal{N}(0, W)$ denotes the measurement noise. This model will be used for the control and actuator selection strategies discussed in the subsequent sections.
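The assembly of the state-space matrices can be sketched numerically. The following is a minimal illustration built directly from Equation (2), not the authors' implementation. It rests on several assumptions that the text does not state: each actuator contributes two masses, so the chain has twice as many masses as actuators; the first mass is tied to the wall by an external link; the last mass has no link to its right; and the lower blocks of the dynamics matrix and the input matrix are divided by the column mass, as Equation (2) implies. The function name `build_esaa_model` and all numerical values in a call are hypothetical.

```python
import numpy as np


def build_esaa_model(n_act, m, k1, c1, k2, c2, alpha=1):
    """Assemble (A, B, C) for a wall-anchored chain of 2 * n_act masses.

    Assumption: link j joins mass j-1 (or the wall when j == 0) to mass j.
    Even links are external (k1, c1); odd links are internal (k2, c2).
    """
    n_mass = 2 * n_act
    m_t = alpha * m                       # equivalent column mass
    k_links = (alpha * k1, alpha * k2)    # aggregated stiffness values
    c_links = (alpha * c1, alpha * c2)    # aggregated damping values

    K = np.zeros((n_mass, n_mass))        # stiffness matrix
    D = np.zeros((n_mass, n_mass))        # damping matrix
    for j in range(n_mass):
        kj, cj = k_links[j % 2], c_links[j % 2]
        K[j, j] += kj
        D[j, j] += cj
        if j > 0:                         # the link also acts on the left mass
            K[j - 1, j - 1] += kj
            K[j - 1, j] -= kj
            K[j, j - 1] -= kj
            D[j - 1, j - 1] += cj
            D[j - 1, j] -= cj
            D[j, j - 1] -= cj

    # Equation (2) divided by the mass gives the lower block row of A.
    A = np.block([
        [np.zeros((n_mass, n_mass)), np.eye(n_mass)],
        [-K / m_t, -D / m_t],
    ])

    # Each actuator applies equal and opposite forces to its two masses,
    # following the +1 / -1 column pattern of Equation (6).
    B_act = np.zeros((n_mass, n_act))
    for a in range(n_act):
        B_act[2 * a, a] = 1.0
        B_act[2 * a + 1, a] = -1.0
    B = np.vstack([np.zeros((n_mass, n_act)), B_act / m_t])

    # Only the displacement of the last mass is measured.
    C = np.zeros((1, 2 * n_mass))
    C[0, n_mass - 1] = 1.0
    return A, B, C
```

With these boundary assumptions both the stiffness and damping matrices are positive definite, so the open-loop model is asymptotically stable for any positive parameter values.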

2.2. Problem Formulation

In this work, we address the problem of accurately tracking time-varying trajectories using an ESAA. The goal is to combine a Linear Quadratic Gaussian (LQG) controller with an actuator selection strategy to enable the array to follow desired reference trajectories while ensuring that only a subset of actuators is activated at any time. This subset is periodically updated to distribute usage across actuators and mitigate the risk of overheating.

To this end, our task is initially framed as a tracking problem and then converted to a standard regulation problem by augmenting the state variables to include the reference (target) trajectory. The solution to this classical LQG problem for a pre-specified set of selected actuators uses a Kalman filter to estimate the ESAA states in the presence of noise, followed by a linear quadratic regulator (LQR). Finally, to perform the actuator selection, we use a greedy algorithm to minimize the total cost, including the classical LQG task plus an additional selection cost proportional to the number of selected actuators. Optimization with this selection cost leads to balancing tracking performance against energy efficiency and thermal safety.

3. Main Results

This section presents the main results of the paper, including the Kalman filter design, LQG control, and actuator selection.

3.1. Kalman-Bucy Filter Design

In linear Gaussian systems where only partial measurements of the states are available, the Kalman filter offers an optimal approach to estimating the full system state [35,36,37].

The estimate update equation, which describes the evolution of the optimal state estimate $\hat{z}(t)$, is given by

$$\dot{\hat{z}}(t) = A \hat{z}(t) + B u(t) + L(t)\left(y(t) - C \hat{z}(t)\right), \tag{7}$$

where $L(t)$ is the Kalman gain matrix, and $y(t) - C \hat{z}(t)$ is the innovation or residual, representing the discrepancy between the actual measurement and the predicted measurement.

The Kalman gain $L(t)$ is computed using the solution of the continuous-time differential Riccati equation, which describes the propagation of the error covariance $P(t)$:

$$\dot{P}(t) = A P(t) + P(t) A^{\top} + \zeta V \zeta^{\top} - P(t) C^{\top} W^{-1} C P(t), \tag{8}$$

where the Kalman gain $L(t)$ is then given by

$$L(t) = P(t) C^{\top} W^{-1}. \tag{9}$$

The term $-P(t) C^{\top} W^{-1} C P(t)$ in the error covariance update represents the decrease in state uncertainty due to the measurements.

This recursive estimator dynamically balances the trust between the model prediction and the actual measurements, providing optimal state estimation in the presence of noise. The resulting estimate $\hat{z}(t)$ can subsequently be used in the control design.
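For long horizons the covariance in Equation (8) typically settles to a constant, and the gain in Equation (9) can then be computed once. The sketch below illustrates that steady-state case only; it is an assumption made for brevity and not the time-varying filter described above. It uses SciPy's `solve_continuous_are` on the dual system, treats $\zeta$ as a scalar as stated in the model, and the helper name `steady_state_kalman_gain` is hypothetical.

```python
import numpy as np
from scipy.linalg import solve_continuous_are


def steady_state_kalman_gain(A, C, zeta, V, W):
    """Return (L, P) where P solves Equation (8) with dP/dt = 0.

    solve_continuous_are(a, b, q, r) solves
        a^T X + X a - X b r^{-1} b^T X + q = 0,
    so passing a = A^T and b = C^T yields the filter Riccati equation
        A P + P A^T - P C^T W^{-1} C P + zeta V zeta^T = 0.
    """
    process_cov = (zeta ** 2) * V            # zeta is a scalar gain
    P = solve_continuous_are(A.T, C.T, process_cov, W)
    L = P @ C.T @ np.linalg.inv(W)           # Equation (9)
    return L, P
```

The estimator in Equation (7) can then be integrated with this constant gain, with the estimation error dynamics governed by $A - LC$.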

3.2. Reference Tracking Control Design

The control objective is to ensure that the position of the end of the ESA array follows a desired trajectory. The change in this position corresponds to the deflection of the last mass in the array. Let $\tilde{y}_{d}$ denote the desired trajectory for the array’s end-effector, and $l$ be the total length of the array. Then, $y_{d} = \tilde{y}_{d} - l$ represents the displacement of the last mass. We define the tracking error as

$$e(t) = C \hat{z}(t) - y_{d}(t), \tag{10}$$

which quantifies the deviation between the estimated output and the desired trajectory. To model the desired trajectory’s evolution, we assume it follows a known autonomous linear dynamic system, namely

$$\dot{y}_{d}(t) = F y_{d}(t), \tag{11}$$

where $F \in \mathbb{R}^{p \times p}$ is a known, stable matrix that governs the reference trajectory dynamics.

To simultaneously penalize tracking error and control effort, we define the following infinite-horizon discounted quadratic cost:

$$J = \frac{1}{2}\left[\int_{0}^{\infty} e^{-\gamma t}\left(e(t)^{\top} Q\, e(t) + u(t)^{\top} R\, u(t)\right) dt\right], \tag{12}$$

where the matrix $Q \succeq 0$ is the cost sensitivity to state errors, the matrix $R \succ 0$ is the cost sensitivity to actions, and the scalar $\gamma > 0$ is a discount factor emphasizing near-term performance.

We define the augmented state and output matrices as

$$\hat{Z}_{aug}(t) = \begin{bmatrix} \hat{z}(t) \\ y_{d}(t) \end{bmatrix}, \qquad C_{aug} = \begin{bmatrix} C & -I \end{bmatrix}, \tag{13}$$

where $\hat{Z}_{aug}(t) \in \mathbb{R}^{2n+p}$ is the augmented state vector, combining the estimated system state $\hat{z}(t) \in \mathbb{R}^{2n}$ and the reference trajectory $y_{d}(t) \in \mathbb{R}^{p}$, and $C_{aug} \in \mathbb{R}^{p \times (2n+p)}$ is the output matrix of the augmented system.

Together these definitions lead to the augmented dynamics:

$$\dot{\hat{Z}}_{aug}(t) = \underbrace{\begin{bmatrix} A & 0 \\ 0 & F \end{bmatrix}}_{A_{aug}} \hat{Z}_{aug}(t) + \underbrace{\begin{bmatrix} B \\ 0 \end{bmatrix}}_{B_{aug}} u(t), \qquad e(t) = C_{aug} \hat{Z}_{aug}(t). \tag{14}$$

In this formulation, $A_{aug} \in \mathbb{R}^{(2n+p) \times (2n+p)}$ combines the original system’s dynamics $A$ and the reference trajectory’s dynamics $F$, while $B_{aug} \in \mathbb{R}^{(2n+p) \times n}$ extends the input matrix $B$ with zeros to align with the augmented state dimension. This construction reformulates the tracking problem as a regulation problem in the augmented state space. To simplify the resulting discounted cost, we now apply a change of variables to remove the exponential discount factor from the cost function:

$$\tilde{Z}_{aug}(t) = e^{-\frac{\gamma t}{2}} \hat{Z}_{aug}(t), \qquad \tilde{u}(t) = e^{-\frac{\gamma t}{2}} u(t). \tag{15}$$

From these definitions, the error term $e(t)$ and the control effort $u(t)$ can be expressed in terms of the transformed variables as

$$e(t) = C_{aug} \hat{Z}_{aug}(t) = C_{aug}\left(e^{\frac{\gamma t}{2}} \tilde{Z}_{aug}(t)\right) = e^{\frac{\gamma t}{2}} C_{aug} \tilde{Z}_{aug}(t), \tag{16}$$

$$u(t) = e^{\frac{\gamma t}{2}} \tilde{u}(t). \tag{17}$$

Substituting these into the system dynamics, the transformed state $\tilde{Z}_{aug}(t)$ evolves according to

$$\dot{\tilde{Z}}_{aug}(t) = \left(A_{aug} - \frac{\gamma}{2} I\right) \tilde{Z}_{aug}(t) + B_{aug} \tilde{u}(t). \tag{18}$$
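The shifted dynamics follow from the product rule applied to the change of variables in (15) together with the augmented dynamics in (14). Written out step by step, using only symbols already defined:

```latex
\begin{aligned}
\dot{\tilde{Z}}_{aug}(t)
  &= \frac{d}{dt}\left( e^{-\frac{\gamma t}{2}} \hat{Z}_{aug}(t) \right) \\
  &= -\frac{\gamma}{2}\, e^{-\frac{\gamma t}{2}} \hat{Z}_{aug}(t)
     + e^{-\frac{\gamma t}{2}} \left( A_{aug} \hat{Z}_{aug}(t) + B_{aug} u(t) \right) \\
  &= \left( A_{aug} - \frac{\gamma}{2} I \right) \tilde{Z}_{aug}(t) + B_{aug} \tilde{u}(t).
\end{aligned}
```

The discount therefore acts as a uniform shift of the eigenvalues of $A_{aug}$ to the left by $\gamma/2$.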

By substituting these expressions into the original cost function (12), we obtain

$$\begin{aligned} J &= \frac{1}{2}\left[\int_{0}^{\infty} e^{-\gamma t}\left( \left(e^{\frac{\gamma t}{2}} C_{aug} \tilde{Z}_{aug}(t)\right)^{\top} Q \left(e^{\frac{\gamma t}{2}} C_{aug} \tilde{Z}_{aug}(t)\right) + \left(e^{\frac{\gamma t}{2}} \tilde{u}(t)\right)^{\top} R \left(e^{\frac{\gamma t}{2}} \tilde{u}(t)\right) \right) dt\right] \\ &= \frac{1}{2}\left[\int_{0}^{\infty} \left( \tilde{Z}_{aug}^{\top}(t)\, C_{aug}^{\top} Q\, C_{aug}\, \tilde{Z}_{aug}(t) + \tilde{u}^{\top}(t) R\, \tilde{u}(t) \right) dt\right]. \end{aligned} \tag{19}$$

Now, define

$$\tilde{Q} := C_{aug}^{\top} Q\, C_{aug}, \tag{20}$$

so the cost function becomes

$$J = \frac{1}{2}\left[\int_{0}^{\infty} \left( \tilde{Z}_{aug}^{\top}(t)\, \tilde{Q}\, \tilde{Z}_{aug}(t) + \tilde{u}^{\top}(t) R\, \tilde{u}(t) \right) dt\right]. \tag{21}$$

To determine the optimal state-feedback gain that minimizes the cost function (21), we apply the corresponding Algebraic Riccati Equation (ARE). This equation arises from minimizing the quadratic cost subject to the transformed linear system dynamics, and its solution specifies a symmetric positive semidefinite matrix $P$ used to construct the optimal controller. The corresponding discounted ARE is

$$\left(A_{aug} - \frac{\gamma}{2} I\right)^{\top} P + P \left(A_{aug} - \frac{\gamma}{2} I\right) - P B_{aug} R^{-1} B_{aug}^{\top} P + \tilde{Q} = 0. \tag{22}$$

Assumption 1.

The triple ($A_{aug} - \frac{\gamma}{2} I$, $B_{aug}$, $\sqrt{Q}$) is stabilizable and detectable.

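Under Assumption 1, the ARE (22) has a unique positive semidefinite solution $P \succeq 0$. The optimal feedback controller that minimizes the cost function is

$$\tilde{u}(t) = K \tilde{Z}_{aug}(t), \qquad K = -R^{-1} B_{aug}^{\top} P, \tag{23}$$

which, by the change of variables (15), is equivalent to $u(t) = K \hat{Z}_{aug}(t)$ in the original variables.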

This control law guarantees system stability. More specifically, under Assumption 1, the closed-loop system matrix $A_{cl} = A_{aug} - \frac{\gamma}{2} I + B_{aug} K$ is Hurwitz (i.e., a stability matrix). Therefore, for any bounded desired reference input, the closed-loop output remains bounded. For more details on the proof, see the discussions provided in [25,38].
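The construction in this subsection can be illustrated with a short numerical sketch. It is a minimal example, not the authors' code: it builds the augmented matrices of (13) and (14), shifts the dynamics by $\gamma/2$ as in (18), and solves the ARE (22) with SciPy. The gain is returned with the sign of the standard LQR feedback, $-R^{-1} B_{aug}^{\top} P$, so that the closed-loop matrix is the shifted matrix plus $B_{aug} K$. The function name `discounted_tracking_gain` is hypothetical.

```python
import numpy as np
from scipy.linalg import solve_continuous_are


def discounted_tracking_gain(A, B, C, F, Q, R, gamma):
    """Return (K, P, A_cl) for the discounted LQ tracking problem."""
    n, p = A.shape[0], F.shape[0]

    # Augmented system: plant state stacked with the reference state.
    A_aug = np.block([
        [A, np.zeros((n, p))],
        [np.zeros((p, n)), F],
    ])
    B_aug = np.vstack([B, np.zeros((p, B.shape[1]))])
    C_aug = np.hstack([C, -np.eye(p)])

    # The discount shifts all eigenvalues of A_aug left by gamma / 2.
    A_shift = A_aug - 0.5 * gamma * np.eye(n + p)
    Q_tilde = C_aug.T @ Q @ C_aug

    # Stabilizing solution of the discounted ARE.
    P = solve_continuous_are(A_shift, B_aug, Q_tilde, R)

    # Sign convention assumed here: u = K Z_aug with K = -R^{-1} B_aug^T P.
    K = -np.linalg.solve(R, B_aug.T @ P)
    A_cl = A_shift + B_aug @ K
    return K, P, A_cl
```

For a constant reference one may take `F` as a zero matrix; the reference mode is not controllable, but the shift by $\gamma/2$ places it in the open left half-plane, which is why the discounted problem remains well posed.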

3.3. Actuator Selection Strategy

To study the impact of switching the actuators on the operation of the artificial muscle, an actuator selection strategy is integrated into the continuous-time LQG tracking framework. Instead of using the full set of actuators continuously, the strategy activates only a subset for short periods. Here, actuator activation (or selection) refers to a binary decision indicating whether an actuator is enabled at a given time. By sequentially activating different subsets of actuators for defined durations, the method balances accurate trajectory tracking with reduced simultaneous actuator usage.

Actuator selection is represented by introducing a diagonal matrix $G = \mathrm{diag}(g_{1}, \dots, g_{n})$ in the system dynamics (4), in which each binary variable $g_{i} \in \{0, 1\}$ indicates whether the $i$-th actuator is active ($g_{i} = 1$) or inactive ($g_{i} = 0$). This matrix modulates the input matrix $B$, such that only the selected actuators contribute to the system’s control effort. This formulation provides a convenient and compact way to encode the selection logic directly into the system dynamics. This leads to the following model for the array of actuators:

$$\dot{z}(t) = A z(t) + B_{new} u(t) + \zeta v(t), \tag{24}$$

where $B_{new} = B G$ defines the modified input matrix based on the selected actuators.

The objective for selection and control is to design a system that reduces the number of active actuators while still achieving accurate reference tracking. One motivation for this goal is to balance the improved control performance expected from using more actuators against potential degradation due to overuse. This objective does not optimize scheduling of selected actuators; for that we develop a heuristic schedule for switching between sets of active actuators, as described below. This schedule must ensure sufficient rest time before re-selecting actuators to prevent long-term mechanical fatigue due to continuous usage. To achieve this in a scalable and usage-aware manner, we use a two-phase selection and switching algorithm. In the first phase, we determine a reduced number of actuators required to satisfy the control objectives while balancing performance with actuator usage efficiency. In the second phase, we implement a switching-based strategy that cycles between different subsets of actuators over time to avoid prolonged usage.

Phase 1: Determining the Optimal Number of Actuators. This phase seeks to determine an actuator configuration that achieves accurate motion tracking while minimizing the number of actuators used. To this end, a numerical optimization procedure is performed by incrementally increasing the number of active actuators from one to the total available. For each case, the binary selection matrix $G$ is updated, and the associated cost $J(G)$, comprising both performance and operation terms, is evaluated using the following formulation:

$$\begin{aligned} J(G) &= \arg\min_{G} \left\{ J^{*}(G) + \beta\, \mathrm{Tr}(G) \right\} \\ &= \arg\min_{G} \Big\{ \overbrace{\underbrace{\mathrm{Tr}(\tilde{Q} P) + \mathrm{Tr}\left(P \tilde{B}_{aug} R^{-1} \tilde{B}_{aug}^{\top} P\right)}_{\text{Performance Cost}} + \underbrace{\beta\, \mathrm{Tr}(G)}_{\text{Operation Cost}}}^{\text{Total Cost}} \Big\}, \\ &\quad \text{subject to } \mathrm{Tr}(G) \leq \left\lfloor \frac{N}{2} \right\rfloor, \end{aligned} \tag{25}$$

where $\mathrm{Tr}(G)$ represents the number of active actuators. The trade-off between tracking performance and actuator usage is governed by the parameter $\beta > 0$, which penalizes activation through the trace term. However, this regularization alone is not always sufficient to effectively limit actuator usage. Therefore, a hard constraint $\mathrm{Tr}(G) \leq \lfloor \frac{N}{2} \rfloor$ is imposed to explicitly restrict the number of simultaneously active actuators to at most half of the total. This constraint enforces the need to define at least two distinct sets of actuators, enabling feasible switching between them during operation. The matrix $\tilde{B}_{aug}$ is the augmented input matrix, defined as

$$\tilde{B}_{aug} = \begin{bmatrix} B_{new} \\ 0 \end{bmatrix}. \tag{26}$$

In (25), $J^{*}(G)$ represents the analytical solution of the LQG cost (21), while the matrix $P$ is determined by solving the discounted continuous-time Algebraic Riccati Equation (ARE):

$$\left(A_{aug} - \frac{\gamma}{2} I\right)^{\top} P + P \left(A_{aug} - \frac{\gamma}{2} I\right) - P \tilde{B}_{aug} R^{-1} \tilde{B}_{aug}^{\top} P + \tilde{Q} = 0. \tag{27}$$

Among all the evaluated configurations, the one yielding the minimum total cost determines the optimal number of actuators, denoted by $K^{*}$. In (25), two competing costs are considered: the performance cost, which reflects the system’s ability to track the reference trajectory, and the operation cost, which accounts for the number of actuators used. Minimizing the number of active actuators is essential to reduce prolonged usage of individual components and prevent performance degradation due to overheating over time. Additionally, activating fewer actuators per set enables the creation of more distinct actuator groups, which increases opportunities for changing between active actuators and ensures each set has sufficient resting time between activations to support long-term operational reliability.

To ensure the actuator switching mechanism remains feasible, we impose the constraint $K^{*} \leq \frac{N}{2}$, meaning that at least two distinct actuator sets must be constructible. Without this constraint, switching becomes ineffective because there would not be enough unselected actuators to alternate. This would limit the opportunity to distribute usage evenly and avoid excessive wear on any single subset.
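One way to carry out the Phase 1 sweep is sketched below. It is an illustrative reading of (25) to (27), not the authors' procedure: the text does not say which actuators are activated at each cardinality, so this sketch assumes a forward greedy rule that adds, at every step, the actuator giving the lowest total cost. The inputs `A_shift` (the matrix $A_{aug} - \frac{\gamma}{2} I$), `B_aug`, and `Q_tilde` are assumed to be precomputed, and the function names are hypothetical.

```python
import numpy as np
from scipy.linalg import solve_continuous_are


def total_cost(g, A_shift, B_aug, Q_tilde, R, beta):
    """Objective of (25) for a 0/1 activation mask g."""
    B_sel = B_aug @ np.diag(g)                       # B_aug with G applied
    P = solve_continuous_are(A_shift, B_sel, Q_tilde, R)   # ARE (27)
    performance = np.trace(Q_tilde @ P) + np.trace(
        P @ B_sel @ np.linalg.solve(R, B_sel.T) @ P
    )
    return performance + beta * np.sum(g)            # Tr(G) = number active


def greedy_phase1(A_shift, B_aug, Q_tilde, R, beta):
    """Grow the active set one actuator at a time, up to floor(N / 2).

    Returns (k_star, mask, costs) where costs[k - 1] is the total cost
    reached with k active actuators.
    """
    n_act = B_aug.shape[1]
    g = np.zeros(n_act)
    history = []
    for _ in range(n_act // 2):                      # hard constraint in (25)
        candidates = [i for i in range(n_act) if g[i] == 0.0]
        trial_costs = []
        for i in candidates:
            trial = g.copy()
            trial[i] = 1.0
            trial_costs.append(
                total_cost(trial, A_shift, B_aug, Q_tilde, R, beta)
            )
        best = int(np.argmin(trial_costs))
        g[candidates[best]] = 1.0
        history.append((trial_costs[best], g.copy()))
    costs = [c for c, _ in history]
    k_star = int(np.argmin(costs)) + 1
    return k_star, history[k_star - 1][1], costs
```

Because the discount shift makes `A_shift` Hurwitz whenever the open-loop model is stable, the Riccati equation remains solvable for every mask, including masks that leave some modes unactuated. A greedy sweep needs on the order of $N^{2}$ Riccati solves, compared with the exponential number required by exhaustive search.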

Phase 2: Switching Strategy. After determining the optimal number of actuators $K^{*}$ out of the set $\mathcal{A} = \{1, \dots, N\}$ of all possible actuators, the control system switches between multiple distinct sets with that optimal number. To implement switching, each actuator is allowed to remain active for a fixed duration $T$, after which it must be deactivated and undergo a rest period of at least $\tilde{T}$ (Figure 4). We assume that $\tilde{T} < T$, to provide ample rest time for each actuator before reactivating.

We define switching times t, after which the actuator selection process aims to minimize a cost function that balances setpoint tracking performance and the cumulative usage history of the actuators. This is formulated as the following constrained optimization problem:

\begin{matrix} G_{t} = \underset{G}{\arg \min} & J^{*} (G) + β \sum_{i = 1}^{N} {\tilde{W}}_{i} g_{i} \\ \text{s.t.} & Tr (G) = K^{*}, \\ & g_{i} = 0 \text{ if } τ_{i} (t) < \tilde{T}, \forall i \in \{1, \dots, N\}, \end{matrix}

(28)

where

J^{*} (G)

denotes the closed-loop tracking cost,

{\tilde{W}}_{i}

represents the cumulative usage of actuator i, and

τ_{i} (t)

is the elapsed resting time since its last activation. The regularization factor

β \geq 0

is set to scale the operation cost so that it becomes comparable in magnitude to the performance cost. The admissible actuators are those which have already rested enough,

A_{valid} = {i ∣ τ_{i} (t) \geq \tilde{T}}

. The constraint on

Tr (G)

ensures that exactly

K^{*}

actuators are selected at each switching step.
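The selection step in (28) can be sketched for small arrays by brute-force enumeration. This is a minimal illustration, not the authors' implementation: `select_actuators`, `toy_cost`, and all numerical values are hypothetical, and `tracking_cost` is a stand-in for the closed-loop cost J*(G), which in the paper comes from solving the ARE.

```python
from itertools import combinations


def select_actuators(n, k_star, rest, usage, beta, t_rest, tracking_cost):
    """Brute-force version of problem (28) for small arrays.

    rest[i]       : elapsed rest time tau_i(t) of actuator i
    usage[i]      : cumulative usage W_i of actuator i
    tracking_cost : callable mapping a tuple of indices to a scalar
                    (hypothetical stand-in for J*(G))
    """
    # Admissible actuators are those that have rested for at least t_rest.
    valid = [i for i in range(n) if rest[i] >= t_rest]
    if len(valid) < k_star:
        raise ValueError("not enough rested actuators to form a set")

    best_set, best_cost = None, float("inf")
    for subset in combinations(valid, k_star):
        cost = tracking_cost(subset) + beta * sum(usage[i] for i in subset)
        if cost < best_cost:
            best_set, best_cost = subset, cost
    return best_set, best_cost


def toy_cost(subset):
    """Arbitrary additive cost used only to exercise the selection logic."""
    return sum(1.0 / (1 + i) for i in subset)


rest = [2.5, 0.0, 2.5, 2.5, 0.0, 2.5]   # actuators 1 and 4 have not rested
usage = [3, 0, 1, 0, 0, 2]
chosen, cost = select_actuators(6, 2, rest, usage, beta=0.1,
                                t_rest=2.5, tracking_cost=toy_cost)
print(chosen, round(cost, 4))
```

Enumeration grows combinatorially with the array size, so it is only meant to make the constraints concrete: the rest-time condition filters the candidates, and the usage term biases the choice away from heavily used actuators.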

To enable smooth switching between different sets, we allow multiple sets of actuators to be selected for a brief overlap duration

\hat{T}

around the switching time t:

\begin{matrix} \hat{T} = \frac{T - \tilde{T}}{2}, \end{matrix}

(29)

where this constraint ensures seamless coordination between consecutive actuator sets during switching, helping maintain continuity in both the control effort and the tracking performance.
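The timing implied by Equation (29) can be checked with a short sketch. The schedule below is an assumption made for illustration: it supposes exactly two alternating sets, each repeating with period T + T̃, with the second set shifted by half a period. The function names are hypothetical.

```python
def overlap_duration(t_active, t_rest):
    """Overlap duration from Equation (29)."""
    if t_rest >= t_active:
        raise ValueError("expected t_rest < t_active")
    return (t_active - t_rest) / 2.0


def activation_windows(t_active, t_rest, n_cycles):
    """Activation windows (start, end) for two alternating sets.

    Assumed schedule: each set repeats with period t_active + t_rest and
    the second set is shifted by half a period, so that it covers the
    rest gap of the first set.
    """
    period = t_active + t_rest
    windows = {1: [], 2: []}
    for c in range(n_cycles):
        start1 = c * period
        start2 = start1 + period / 2.0
        windows[1].append((start1, start1 + t_active))
        windows[2].append((start2, start2 + t_active))
    return windows


T_ACTIVE, T_REST = 7.5, 2.5
overlap = overlap_duration(T_ACTIVE, T_REST)
windows = activation_windows(T_ACTIVE, T_REST, 2)
print(overlap)
print(windows)
```

With this assumed schedule, the time during which both sets are active at each hand-over equals the value given by (29), and each set rests for exactly T̃ between activations.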

In Figure 4, the timing of actuator switching is illustrated over two consecutive intervals. Blue shades correspond to Set 1 actuators and green shades correspond to Set 2 actuators. The dark blue and dark green segments represent periods when each set is active without overlap. The light blue and light green segments indicate the overlap period

\hat{T}

, during which both sets are simultaneously active to ensure a smooth transition. The mandatory rest period

\tilde{T}

for each set is represented by the white gaps that occur between its deactivation and the next activation. For Set 1, the rest period is positioned directly above the active time of Set 2, coinciding with it in the timeline, and vice versa. As the sets alternate roles, the set that was active in one interval becomes the resting set in the next.

This illustration assumes that the total actuator pool is divided into exactly two sets, which alternate roles across switching intervals.

The general structure of the proposed controller and the actuator selection strategy is illustrated in Figure 5, while the actuator selection process itself is carried out using the procedure detailed in Algorithm 1.

Algorithm 1 Two-Phase Actuator Selection and Tracking

Inputs: System matrices

A, B, C

; weighting matrices

Q, R

; noise covariance

\tilde{V}

; regularization parameter

β

; actuator set

A = {1, \dots, n}

; actuation interval T; resting time

\tilde{T}

; overlap time

\hat{T}

Outputs: Optimal actuator count

K^{*}

, actuator selections

G_{t}

, and control effort

u (t)

Phase 1: Determining the Optimal Number of Actuators.
for each actuator count

k = 1

to

⌊ n / 2 ⌋

do
Initialize actuator mask vector

{\bar{m}}_{k} \leftarrow zeros (n, 1)

while

Tr ({\bar{m}}_{k}) < k

do
for each actuator

i \in A

such that

{\bar{m}}_{k} (i) = 0

do
Set trial vector

{\bar{m}}_{trial} \leftarrow {\bar{m}}_{k}

, then set

{\bar{m}}_{trial} (i) \leftarrow 1

Compute

B_{new} = B \cdot diag ({\bar{m}}_{trial})

Form augmented matrix (26)
Solve ARE (27) to obtain matrix P
Compute control gain matrix

K = R^{- 1} B_{new}^{⊤} P

for the current actuator subset
Evaluate cost

J_{i}

using (25)
end for
Choose the actuator

i^{*}

with the lowest cost

J_{i}

Add actuator

i^{*}

to the current set by setting

{\bar{m}}_{k} (i^{*}) \leftarrow 1

end while
Save the total cost

J_{k}^{*}

and the corresponding actuator set

{\bar{m}}_{k}

end for
Select optimal actuator count:

K^{*} = \arg \min_{k} J_{k}^{*}

Phase 2: Switching Strategy with Smooth Transitions
Initialize rest timer vector

\bar{τ} (t) \leftarrow \tilde{T} \cdot ones (n, 1)

; each actuator starts fully rested, with

\tilde{T}

representing the required rest time between activations
if

\hat{T} \geq \min (T, \tilde{T})

then
Raise error: overlap duration too long for given actuation and rest times
end if
for each switching interval

t \in {0, T, 2 T, \dots}

do
Solve optimization problem in Equation (28) to determine actuator set

G_{t}

Generate time-varying selection matrix

G (t)

that enables simultaneous activation of current and new actuator sets during the overlap interval

\hat{T}

Compute control effort

u (t)

using

G (t)

Update actuator usage:

{\bar{\tilde{W}}}_{i} \leftarrow {\bar{\tilde{W}}}_{i} + 1 \forall i \in G_{t}

Reset rest timers:

{\bar{τ}}_{i} (t) \leftarrow 0 \forall i \in G_{t}

Increment rest timers:

{\bar{τ}}_{i} (t) \leftarrow {\bar{τ}}_{i} (t) + T \forall i \notin G_{t}

end for
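The greedy structure of Phase 1 in Algorithm 1 can be sketched as follows. This is a simplified illustration under stated assumptions: `cost` is a hypothetical callable standing in for the evaluation of (25) after solving the ARE (27), and `toy_cost` with its weights is invented purely to exercise the loop.

```python
def greedy_subset(n, k, cost):
    """Greedy forward selection of k actuators out of n.

    cost : callable mapping a list of selected indices to a scalar
           (stand-in for evaluating (25) for the trial subset).
    Returns the binary mask and the cost of the final subset.
    """
    mask = [0] * n
    best_cost = float("inf")
    while sum(mask) < k:
        best_i, best_cost = None, float("inf")
        for i in range(n):
            if mask[i]:
                continue
            trial = mask.copy()
            trial[i] = 1
            c = cost([j for j in range(n) if trial[j]])
            if c < best_cost:
                best_i, best_cost = i, c
        mask[best_i] = 1  # keep the actuator that lowered the cost most
    return mask, best_cost


def optimal_count(n, cost):
    """Repeat the greedy search for k = 1..floor(n/2); keep the cheapest."""
    results = {k: greedy_subset(n, k, cost) for k in range(1, n // 2 + 1)}
    k_star = min(results, key=lambda k: results[k][1])
    return k_star, results[k_star][0]


def toy_cost(selected):
    """Invented cost: a performance term that falls as more 'authority'
    is selected, plus an operation term that grows with the set size."""
    authority = sum(i + 1 for i in selected)
    return 20.0 / authority + 0.5 * len(selected)


k_star, mask = optimal_count(6, toy_cost)
print(k_star, mask)
```

The sketch restarts the greedy search from an empty mask for every candidate count, mirroring the initialization step inside the outer loop of Algorithm 1.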

4. Simulation Setup and Results

In this section, we present simulation results to evaluate the effectiveness of the proposed methodology. The main objective is to compare the full-actuator case with the switching case in terms of trajectory tracking performance and thermal management. The analyses were performed in MATLAB R2024a (The MathWorks, Inc., Natick, MA, USA). All simulations and results reported in the manuscript were generated on a Dell Precision 3680 workstation equipped with an Intel Core i9-14900 processor (2.0 GHz, 24 cores, 32 logical processors) and 32 GB of RAM. The array comprises ten actuators (

n = 10

) connected in series via springs and dampers, as illustrated in Figure 3. Modeling the connections between neighboring actuators as springs and dampers is motivated by the physical behavior of soft structures, where elastic and dissipative interactions arise naturally from material compliance and viscoelastic effects. Such a representation captures the dominant coupling dynamics while keeping the overall model tractable for control and estimation purposes. Intra-actuator springs and dampers are modeled as less stiff and less damped, while inter-actuator components are made stiffer and more heavily damped. The physical parameters are set as follows:

k_{1} = 4.0

N/m,

k_{2} = 0.343

N/m,

c_{1} = 0.318

Ns/m,

c_{2} = 0.053

Ns/m, and

m = 2.94 \times 10^{- 3}

kg. The stiffness of the silicone linkages can be estimated from their 100% modulus, average cross-sectional area, and length using Hooke’s law, while the damping coefficient is assumed to be very small [17,39]. The state and input matrices

A \in R^{40 \times 40}

and

B \in R^{40 \times 10}

corresponding to the 10 actuators are obtained by assuming that each actuator applies equal and opposite forces to its two internal masses, resulting in contraction. A single scalar input per actuator is defined, giving 10 independent inputs in total. Gaussian disturbances are added to make the simulation more realistic: process noise

w (t) \sim N (0, 0.002)

and measurement noise

v (t) \sim N (0, 0.01)

.
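For the stiffness estimate of the silicone linkages mentioned above, one plausible reading of the Hooke's law calculation is the axial stiffness of a uniform bar. The symbols M_100 (stress at 100% strain), A_c (average cross-sectional area), and L (linkage length) are introduced here for illustration and do not appear in the original text:

```latex
k \approx \frac{E \, A_{c}}{L}, \qquad E \approx M_{100} .
```

This treats the 100% modulus as an effective elastic modulus, which is only a rough approximation for a hyperelastic material such as silicone.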

In the following, two different scenarios are considered to study various aspects of the proposed methodology. In Scenario 1, we evaluate the effectiveness of the proposed overlapping interval during switching between different actuator sets, addressing a constant reference tracking problem. In Scenario 2, we evaluate motion tracking for a more complex sinusoidal reference, and examine the impact of the switching strategy on the thermal performance of the actuator.

4.1. Scenario 1: Switching Configuration with and Without Overlap Interval

The control objective is to track a step reference modeled by

{\dot{y}}_{d} = 0

, corresponding to a fixed target position for the final mass. Recall that the term C projects the vector of actuator positions onto the one dimension that should track the target. Thus, the tracking error is defined as

e (t) = C \hat{x} (t) - y_{d} (t)

, where

\hat{x} (t)

is the Kalman-filtered state estimate for all actuators. We refer to the formulations in Section 3 for details on the augmented dynamics, discounted LQG controller, and state estimation strategy. The simulation parameters are set as follows:

Q = 60 \cdot I_{40 \times 40}

,

R = 0.01 \cdot I_{10 \times 10}

,

γ = 0.5

, with initial conditions

\hat{x} (0) = zeros (40, 1)

and

P (0) = 0.01 \cdot I_{40 \times 40}

. The actuator selection follows the framework described in Section 3.3, using the formulations in Equations (25), (27) and (28).

We assume that each actuator can remain active for up to

T = 7.5

s, after which it must rest for at least

\tilde{T} = 2.5

s. This constraint defines an allowable overlap time of

\hat{T} = \frac{T - \tilde{T}}{2} = 2.5

s to enable smooth transitions. To determine the optimal number of actuators

K^{*}

, Phase 1 of the algorithm performs an incremental search over

k \in {1, \dots, ⌊ n / 2 ⌋}

, where

n = 10

. Based on this analysis, the optimal number of actuators was found to be

K^{*} = 3

.

To evaluate the proposed switching strategy for this optimal number of actuators, we consider two simulation cases: In Case 1, we assume no overlap between switching instances; each set is fully deactivated before the next is activated. In Case 2, an overlap of

\hat{T} = 2.5 s

is introduced between consecutive sets to ensure smoother transitions.

Case 1: Figure 6 shows the system output (a), control effort (b), and activation timeline (c) of the actuators under the non-overlapping switching scenario. As depicted in Figure 6b, abrupt transitions between actuator sets cause sharp discontinuities in the control effort, which lead to noticeable spikes in the system response (Figure 6a). Although the reference trajectory is ultimately tracked, these transients highlight the need for actuator overlap, whose benefits are demonstrated in Case 2.

Case 2: Figure 7 shows the system output (a), control effort (b), and activation timeline (c) of the actuators under the overlapping switching scenario. Figure 7b shows that introducing a brief overlap between actuator sets effectively eliminates the abrupt transients in the control effort. As a result, Figure 7a demonstrates smooth and accurate tracking of the reference trajectory, without the spikes observed in the non-overlapping case in Figure 6a. These results underscore the importance of the 2.5-s overlap, during which two sets of actuators are active simultaneously, as indicated by the dashed segments in Figure 7c. This overlap promotes stable and reliable system behavior.

Robustness Against Switching and Noise: Robustness, in this context, refers to the system’s ability to maintain stable and accurate tracking performance despite disturbances from abrupt switching and noise. In both cases, the Kalman filter provides accurate state estimates despite the stochastic disturbances, so tracking is maintained under process and measurement noise. In the non-overlapping case (Figure 6), however, abrupt control transitions produce noticeable displacement spikes, which, combined with process and measurement noise, degrade tracking performance and lead to a higher mean squared tracking error (

MSE = \frac{1}{T} \sum_{t = 1}^{T} {| e (t) |}^{2}

) of 0.01 cm². Conversely, in the overlapping case (Figure 7), the system exhibits smooth displacement trajectories with no visible transients, despite process noise, measurement noise, and switching. This configuration achieves a lower mean squared tracking error of 0.0084 cm², highlighting the benefit of overlapped switching. The overlap mitigates switching-induced spikes by ensuring smooth transitions between actuator sets, which adds robustness against switching-related disturbances on top of the robustness against process and measurement noise.
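The mean squared tracking error used above can be computed from sampled signals as in the following sketch. The sample values are hypothetical and serve only to show the calculation; note that the sample count in the MSE formula is unrelated to the actuation interval T.

```python
def mean_squared_error(outputs, references):
    """Mean of squared tracking errors e = y_hat - y_d over all samples."""
    if len(outputs) != len(references) or len(outputs) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((y - r) ** 2 for y, r in zip(outputs, references)) / len(outputs)


# Hypothetical estimated outputs C x_hat(t) for a constant reference of 1.0.
y_hat = [0.9, 1.1, 1.0, 1.0]
y_ref = [1.0] * 4
mse = mean_squared_error(y_hat, y_ref)
print(mse)
```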

4.2. Scenario 2: Switching Strategy for Tracking and Thermal Management

In this scenario, we compare the tracking performance and thermal behavior of the system under both full actuation and the proposed switching strategy. To this end, we use the same 10-actuator array from the previous scenario to track a sinusoidal reference trajectory. We evaluate three cases: (3) control with full actuation, (4) control using the overlapping switching strategy, and (5) control using the overlapping switching strategy with increased priority on minimizing control effort. Case 3 serves as the baseline for comparison. In Case 4, we investigate the effectiveness of the switching strategy in achieving motion tracking, as well as its adverse effect on actuator heating, compared to Case 3. In Case 5, we strengthen the soft constraint on control effort to more accurately reflect practical actuator limitations, and analyze the resulting trade-off between tracking performance and thermal management compared to the full actuation scenario.

Case 3: In the full actuation strategy, all actuators are continuously active; we run the simulation and solve Equation (23) to find the control effort required from each actuator for reference tracking. Figure 8 illustrates the system’s output (a), control effort (b), and activation timeline (c). As shown in Figure 8a, the system output accurately follows the desired trajectory, while Figure 8b shows the corresponding control effort. Overall, the results show that using all actuators enables accurate motion tracking by distributing control effort uniformly across the array.

Case 4: This case evaluates the system’s performance under the proposed switching control strategy, which balances tracking accuracy with actuator activation timing. In the switching strategy, actuator sets are active for a total duration of

7.5 s

, with a

2.5 s

overlap period between the outgoing and incoming actuator sets.

The selection process follows a two-phase approach. In Phase 1, the algorithm identifies a set of four actuators that can achieve effective tracking while distributing control among fewer actuators. In Phase 2, different sets are used to independently handle the control task. Using the same formulation as in Case 3, the control efforts are then generated accordingly.

Figure 9 presents the system’s displacement tracking (a), control effort (b), and activation timeline (c). As shown in Figure 9a, the output under LQG control (red) closely follows the sinusoidal reference trajectory (blue), despite measurement noise (black) and dynamic switching among actuator groups, confirming that the system successfully performs tracking. Figure 9b illustrates the corresponding control effort.

Figure 10 provides a detailed comparison between the squared magnitude of the control signal,

{| u |}^{2}

, for the full actuation and actuator selection strategies. As shown in this figure, the full actuation approach yields a lower overall control effort by evenly distributing the control effort across all actuators, requiring less power from each. In contrast, the switching strategy results in disproportionately higher control effort magnitudes, as fewer actuators are engaged to achieve the system’s tracking objectives.

Comparing the Thermal Profiles of Full Actuation and Switching Strategy. One problem with using actuators continuously is heat generation, which can degrade the actuators’ performance and potentially damage the device. Here, we assess whether the switching strategy mitigates this problem.

To compare the thermal impact of continuous full activation and the proposed switching strategy, we model the actuator temperatures in both scenarios. Heat generation is proportional to the square of the electrical current, but current is not explicitly modeled as a state in the ESA array model. Therefore, we use the control effort as a surrogate for current, with its square serving as a proxy for the heat generated by each actuator. To model the temperature of each actuator over time, we use a first-order linear differential equation for the accumulation of heat, namely:

\frac{dT (t)}{dt} = - a T (t) + b u {(t)}^{2},

(30)

where

T (t)

denotes the actuator temperature,

u_{i} {(t)}^{2}

represents the squared control effort for each actuator,

a > 0

describes the passive temperature decay rate during rest periods, and

b > 0

scales the heat generated by the actuator control effort.

The analytical solution to Equation (30), assuming an initial temperature of

T (t_{0})

, is

T (t) = T (t_{0}) e^{- a (t - t_{0})} + \int_{t_{0}}^{t} e^{- a (t - τ)} b u {(τ)}^{2} d τ,

(31)

which indicates that the actuator temperature is an exponentially weighted integral of past control effort.
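The thermal model in Equation (30) can be integrated numerically and compared against its closed-form solution for a constant input. The sketch below uses forward Euler integration; the values of `a`, `b`, the step size, and the input are illustrative assumptions, not parameters from the paper.

```python
import math


def simulate_temperature(u, a, b, dt, temp0=0.0):
    """Forward-Euler integration of dT/dt = -a*T + b*u(t)**2.

    u is a list of control-effort samples spaced dt apart.
    """
    temps = [temp0]
    for u_k in u:
        temps.append(temps[-1] + dt * (-a * temps[-1] + b * u_k ** 2))
    return temps


def constant_input_temperature(t, u0, a, b, temp0=0.0):
    """Closed-form solution for a constant input u0, starting at t0 = 0."""
    decay = math.exp(-a * t)
    return temp0 * decay + (b * u0 ** 2 / a) * (1.0 - decay)


A_DECAY, B_HEAT, DT = 0.5, 2.0, 1e-3   # illustrative values only
STEPS = 10000                          # 10 s of simulated time
temps = simulate_temperature([1.0] * STEPS, A_DECAY, B_HEAT, DT)
exact = constant_input_temperature(STEPS * DT, 1.0, A_DECAY, B_HEAT)
print(temps[-1], exact)
```

For a constant input the temperature approaches the steady-state value b u² / a, so doubling the control effort quadruples the steady-state temperature rise in this model.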

Figure 11 illustrates the control effort of a single actuator and the resulting actuator temperature computed from this model for both the full actuation and switching strategies. As is clear from Figure 11b, the switching strategy leads to significantly higher actuator temperatures than the continuous mode. In the switching mode, the active actuators must generate a larger control effort to achieve accurate motion tracking. This larger effort directly increases heat generation, since heat production is proportional to the dissipated energy, which in turn scales with the square of the control effort.
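A back-of-the-envelope calculation illustrates why concentrating the effort raises temperatures. It rests on simplifying assumptions that are not part of the paper's simulation: a total effort F is shared equally among the active actuators, n actuators are active under full actuation, k are active under switching, and each switched actuator is on for a fraction d of the time. Because the thermal model is linear, the ratio of time-averaged temperatures then equals the ratio of time-averaged heating rates:

```latex
\frac{\text{mean heating per actuator (switching)}}
     {\text{mean heating per actuator (full actuation)}}
\approx \frac{d \, b \, (F/k)^{2}}{b \, (F/n)^{2}}
= d \left( \frac{n}{k} \right)^{2},
\qquad
n = 10,\; k = 4: \quad d \left( \frac{n}{k} \right)^{2} = 6.25 \, d .
```

Under these assumptions the ratio exceeds one whenever d > (k/n)² = 0.16. Two alternating sets must each be active at least half the time (d ≥ 0.5), so resting cannot compensate for the quadratic growth in heating.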

Case 5: As shown in Case 4 and supported by the thermal model results in Figure 11, the switching strategy, while effective for reference tracking, results in significantly higher actuator temperatures due to the larger control effort required from the active actuators. The smaller the fraction of actuators selected, the more extra heat each active actuator generates. In practice, however, actuators are subject to physical constraints such as current limits, beyond which the device may fail. To account for these limitations in our control design, we replace the hard constraint on control effort with a soft constraint by increasing the control penalty matrix

R

in the cost function (12). This modification discourages excessive control effort and helps keep

u

within a safe range. Rather than abruptly truncating the control effort for violating a hard constraint, this stronger soft constraint gradually reduces the overall control level.
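The effect of a larger control penalty can be illustrated with a scalar analogue of the discounted Riccati equation, for which the feedback gain has a closed form. This is a sketch under stated assumptions: the scalar plant values `a = -1` and `b = 1` are invented, while `q = 60`, `r = 0.01`, and `gamma = 0.5` borrow the magnitudes of the weights used in Scenario 1.

```python
import math


def scalar_discounted_lqr(a, b, q, r, gamma):
    """Solve 2*(a - gamma/2)*p - (b**2 / r)*p**2 + q = 0 for p > 0.

    Returns the Riccati solution p and the feedback gain b*p/r.
    """
    a_bar = a - gamma / 2.0
    p = r * (a_bar + math.sqrt(a_bar ** 2 + q * b ** 2 / r)) / b ** 2
    return p, b * p / r


penalties = (0.01, 0.1, 1.0)
gains = []
for r in penalties:
    p, gain = scalar_discounted_lqr(a=-1.0, b=1.0, q=60.0, r=r, gamma=0.5)
    gains.append(gain)
    print(r, round(gain, 3))
```

In this scalar case the gain decreases monotonically as r grows, so the same tracking error commands a smaller control effort. This is the sense in which a larger R acts as a soft limit rather than a hard saturation.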

Figure 12a illustrates the control effort

u_{1} (t)

applied to actuator 1. The corresponding actuator temperature, denoted by

T (t)

and computed from the thermal model in Equation (30), is shown in Figure 12b. As shown, a higher penalty

R

results in lower peak temperatures than for the continuous case.

This confirms that increasing

R

effectively regulates actuator heating under switching, which otherwise would result in significantly higher heat. Figure 13 presents the corresponding position tracking,

x_{track} (t)

, where we observe that increasing

R

, while thermally beneficial, reduces the controller’s incentive to follow the desired trajectory. This highlights a key trade-off: limiting control effort reduces temperature but compromises tracking performance. Thus, Case 5 offers a practically motivated refinement to the switching control strategy explored in earlier cases, especially Case 4, by embedding physical actuator limitations via cost function tuning.

To clarify the differences between our approach and existing studies, we compare our work with that of Ebrahimi et al. [2], which also investigates actuator selection for trajectory tracking. Their framework relies on offline actuator selection, assumes that all system states are directly measurable, and does not consider thermal effects. By contrast, our framework implements a real-time actuator selection strategy with explicit switching between sets of activated actuators and soft switching, where an overlap is introduced between consecutive sets during transitions. We perform trajectory tracking using a methodology that is robust against process and measurement noise, and we incorporate thermal profile analysis to balance tracking performance with thermal management. These extensions make the proposed framework more representative of practical soft artificial muscle systems.

5. Conclusions

In this work, we proposed a control framework and actuator selection strategy for an artificial muscle composed of an array of ESAAs, aiming to achieve effective motion tracking while exploring the trade-offs between tracking performance and thermal management. Recognizing the challenges in directly measuring all actuator states, we implemented a Kalman filter-based observer to estimate system states in real-time. These estimates enabled the design of a linear-quadratic-Gaussian (LQG) controller to ensure effective tracking of the reference. A key innovation in our approach is a dynamic actuator selection algorithm that alternates between optimal subsets of actuators over time. While this strategy does not necessarily reduce overall heat due to higher control demands on fewer actuators, it enables effective management of actuator usage. The inclusion of a brief overlap during switching helps ensure smooth reference tracking performance, particularly in scenarios with uneven control effort distribution across the actuator array.

To comprehensively evaluate the performance of the proposed framework, we considered two simulation scenarios, each including multiple cases to clarify the analysis. Scenario 1 addressed constant reference tracking with two cases: (i) non-overlapping switching and (ii) overlapping switching. Scenario 2 evaluated sinusoidal reference tracking with three cases: (iii) full actuation, (iv) overlapping switching, and (v) overlapping switching with a stronger soft constraint on control effort. Collectively, these scenarios provided insight into different trade-offs between tracking performance, control effort, heat, and actuator use. For example, strengthening the soft constraint reduces heat by limiting control effort but can compromise tracking performance, whereas overlapping intervals improve tracking smoothness and reduce abrupt changes in control effort, even under challenging actuation conditions. Simulation results validated the proposed framework’s ability to manage the trade-off between trajectory tracking and actuator control effort. Switching strategies may introduce some tracking degradation under certain conditions. However, because only a subset of actuators is enabled at a time, they avoid continuous activation of the entire actuator set. This highlights the importance of designing smarter switching and control methods for future actuator systems.

An important finding from our study is that, under the current system scale and operating conditions, actuator switching can actually increase overall temperatures compared to full actuation. The smaller the subset of selected actuators, the more heat each one generates. This occurs because switching concentrates the control effort on fewer actuators at a time, increasing their instantaneous control effort and, consequently, their heat output.

In future work, we plan to enhance the realism and applicability of our model by incorporating constraints on the maximum displacement of each actuator. Currently, the proposed approach allows actuators to deflect without physical limits, whereas each real ESA has a finite range of motion. Accounting for this limitation would align the model more closely with actual system behavior. We also plan to conduct more extensive evaluations across a wider range of loading conditions and actuator configurations to further investigate the effectiveness and limitations of switching strategies. Furthermore, we will extend the study to experimental validation on real electromagnetic soft actuator arrays, taking into account long-term operation, electromagnetic interference, and other practical implementation challenges.

Author Contributions

Conceptualization, H.Z., N.E., X.P. and M.D.; methodology, H.Z., N.E. and X.P.; software, H.Z. and M.D.; writing—original draft preparation, H.Z., M.D. and N.E.; writing—review and editing, H.Z., N.E., X.P. and M.D.; visualization, H.Z.; supervision, N.E., X.P. and M.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the funds provided by the National Science Foundation and by DoD OUSD (R & E) under Cooperative Agreement PHY-2229929 (The NSF AI Institute for Artificial and Natural Intelligence, ARNI).

Data Availability Statement

Data are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

Appiah, C.; Arndt, C.; Siemsen, K.; Heitmann, A.; Staubitz, A.; Selhuber-Unkel, C. Living materials herald a new era in soft robotics. Adv. Mater. 2019, 31, 1807747. [Google Scholar] [CrossRef]
Ebrahimi, N.; Nugroho, S.; Taha, A.F.; Gatsis, N.; Gao, W.; Jafari, A. Dynamic actuator selection and robust state-feedback control of networked soft actuators. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia, 21–25 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 2857–2864. [Google Scholar] [CrossRef]
Milana, E. Soft robotics for infrastructure protection. Front. Robot. 2022, 9, 1026891. [Google Scholar] [CrossRef]
der Maur, P.A.; Djambazi, B.; Haberthür, Y.; Hörmann, P.; Kübler, A.; Lustenberger, M.; Sigrist, S.; Vigen, O.; Förster, J.; Achermann, F.; et al. Roboa: Construction and evaluation of a steerable vine robot for search and rescue applications. In Proceedings of the 2021 IEEE 4th International Conference on Soft Robotics (RoboSoft), New Haven, CT, USA, 12–16 April 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 15–20. [Google Scholar] [CrossRef]
Agarwal, P.; Deshpande, A.D. Series elastic actuators for small-scale robotic applications. J. Mech. Robot. 2017, 9, 031016. [Google Scholar] [CrossRef]
Jin, H.; Dong, E.; Xu, M.; Liu, C.; Alici, G.; Jie, Y. Soft and smart modular structures actuated by shape memory alloy (SMA) wires as tentacles of soft robots. Smart Mater. Struct. 2016, 25, 085026. [Google Scholar] [CrossRef]
Andrikopoulos, G.; Nikolakopoulos, G.; Manesis, S. A survey on applications of pneumatic artificial muscles. In Proceedings of the 2011 19th Mediterranean Conference on Control & Automation (MED), Corfu, Greece, 20–23 June 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 1439–1446. [Google Scholar] [CrossRef]
Grosu, V.; Rodriguez-Guerrero, C.; Grosu, S.; Vanderborght, B.; Lefeber, D. Design of smart modular variable stiffness actuators for robotic-assistive devices. IEEE/ASME Trans. Mechatron. 2017, 22, 1777–1785. [Google Scholar] [CrossRef]
Borboni, A.; Faglia, R.; Palpacelli, M. Shape memory actuator with slider and slot layout and single fan cooling. In Proceedings of the 2014 IEEE/ASME 10th International Conference on Mechatronic and Embedded Systems and Applications (MESA), Senigallia, Italy, 10–12 September 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–6. [Google Scholar] [CrossRef]
Dang, Y.; Cheng, L.K.; Stommel, M.; Xu, W. Technical requirements and conceptualization of a soft pneumatic actuator inspired by human gastric motility. In Proceedings of the 2016 23rd International Conference on Mechatronics and Machine Vision in Practice (M2VIP), Nanjing, China, 28–30 November 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–6. [Google Scholar] [CrossRef]
Zolfaghari, A.; Aminian, E.; Saffari, H. Numerical investigation on entropy generation in the dropwise condensation inside an inclined pipe. Heat Transf. 2022, 51, 551–577. [Google Scholar] [CrossRef]
Zolfaghari, H.; Momeni, H.; Karimi, H. Multilevel Inverter Real-Time Simulation and Optimization Through Hybrid GA/PSO Algorithm. arXiv 2021, arXiv:2110.13817. [Google Scholar] [CrossRef]
Park, Y.L.; Wood, R.J. Smart pneumatic artificial muscle actuator with embedded microfluidic sensing. In Proceedings of the SENSORS, 2013 IEEE, Baltimore, MD, USA, 3–6 November 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 1–4. [Google Scholar] [CrossRef]
Raman, R.; Laschi, C. Soft robotics for human health. Device 2024, 2, 100432. [Google Scholar] [CrossRef]
Shin, G.; Choi, Y.; Jeon, B.; Choi, I.; Song, S.; Park, Y.L. Soft Electromagnetic Artificial Muscles Using High-Density Liquid-Metal Solenoid Coils and Bistable Stretchable Magnetic Housings. Adv. Funct. Mater. 2023, 34, 2302895. [Google Scholar] [CrossRef]
Zolfaghari, H.; Ebrahimi, N.; Ji, Y.; Pitkow, X.; Davoodi, M. Integrated Analytical Modeling and Numerical Simulation Framework for Design Optimization of Electromagnetic Soft Actuators. Actuators 2025, 14, 128. [Google Scholar] [CrossRef]
Ebrahimi, N.; Schimpf, P.; Jafari, A. Design optimization of a solenoid-based electromagnetic soft actuator with permanent magnet core. Sens. Actuators A Phys. 2018, 284, 276–285. [Google Scholar] [CrossRef]
Song, C.W.; Lee, S.Y. Design of a solenoid actuator with a magnetic plunger for miniaturized segment robots. Appl. Sci. 2015, 5, 595–607. [Google Scholar] [CrossRef]
Hyatt, P.; Wingate, D.; Killpack, M.D. Model-based control of soft actuators using learned non-linear discrete-time models. Front. Robot. 2019, 6, 22. [Google Scholar] [CrossRef] [PubMed]
Rus, D.; Tolley, M.T. Design, fabrication and control of soft robots. Nature 2015, 521, 467–475. [Google Scholar] [CrossRef] [PubMed]
Bruder, D.; Gillespie, B.; Remy, C.D.; Vasudevan, R. Modeling and control of soft robots using the koopman operator and model predictive control. arXiv 2019, arXiv:1902.02827. [Google Scholar] [CrossRef]
Wang, J.; Chortos, A. Control strategies for soft robot systems. Adv. Intell. Syst. 2022, 4, 2100165. [Google Scholar] [CrossRef]
Taha, A.F.; Gatsis, N.; Summers, T.; Nugroho, S.A. Time-varying sensor and actuator selection for uncertain cyber-physical systems. IEEE Trans. Control Netw. Syst. 2018, 6, 750–762. [Google Scholar] [CrossRef]
Zare, A.; Mohammadi, H.; Dhingra, N.K.; Georgiou, T.T.; Jovanović, M.R. Proximal algorithms for large-scale statistical modeling and sensor/actuator selection. IEEE Trans. Autom. Control 2019, 65, 3441–3456. [Google Scholar] [CrossRef]
Modares, H.; Lewis, F.L. Linear quadratic tracking control of partially-unknown continuous-time systems using reinforcement learning. IEEE Trans. Autom. Control 2014, 59, 3051–3056. [Google Scholar] [CrossRef]
Milam, M.B. Real-Time Optimal Trajectory Generation for Constrained Dynamical Systems; California Institute of Technology: Pasadena, CA, USA, 2003; Available online: https://ezproxy.memphis.edu:3443/login?url=https://www.proquest.com/dissertations-theses/real-time-optimal-trajectory-generation/docview/305343031/se-2?accountid=14582 (accessed on 10 September 2025).
Lavretsky, E.; Wise, K.A. Output Feedback Control. In Robust and Adaptive Control: With Aerospace Applications; Springer: Berlin/Heidelberg, Germany, 2012; pp. 161–208.
Lavretsky, E. Adaptive output feedback design using asymptotic properties of LQG/LTR controllers. IEEE Trans. Autom. Control 2011, 57, 1587–1591.
Athans, M. The role and use of the stochastic linear-quadratic-Gaussian problem in control system design. IEEE Trans. Autom. Control 1971, 16, 529–552.
George Thuruthel, T.; Renda, F.; Iida, F. First-order dynamic modeling and control of soft robots. Front. Robot. AI 2020, 7, 95.
Zolfaghari, H.; Karimi, H.; Ramezani, A.; Davoodi, M. Minimizing voltage ripple of a DC microgrid via a particle-swarm-optimization-based fuzzy controller. Algorithms 2024, 17, 140.
Summers, T.H.; Lygeros, J. Optimal sensor and actuator placement in complex dynamical networks. IFAC Proc. Vol. 2014, 47, 3784–3789.
Summers, T. Actuator placement in networks using optimal control performance metrics. In Proceedings of the 2016 IEEE 55th Conference on Decision and Control (CDC), Las Vegas, NV, USA, 12–14 December 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 2703–2708.
Ebrahimi, N.; Guda, T.; Alamaniotis, M.; Miserlis, D.; Jafari, A. Design optimization of a novel networked electromagnetic soft actuators system based on branch and bound algorithm. IEEE Access 2020, 8, 119324–119335.
Lewis, F.L.; Xie, L.; Popa, D. Optimal and Robust Estimation: With an Introduction to Stochastic Control Theory; CRC Press: Boca Raton, FL, USA, 2017.
Sarkka, S. On unscented Kalman filtering for state estimation of continuous-time nonlinear systems. IEEE Trans. Autom. Control 2007, 52, 1631–1641.
Bishop, A.N.; Del Moral, P. On the mathematical theory of ensemble (linear-Gaussian) Kalman–Bucy filtering. Math. Control Signals Syst. 2023, 35, 835–903.
Hespanha, J.P. Lecture notes on LQR/LQG controller design. Knowl. Creat. Diffus. Util. 2005. Available online: https://www.academia.edu/6945404/Undergraduate_Lecture_Notes_on_LQG_LQR_controller_design (accessed on 10 September 2025).
Smooth-On Inc. Ecoflex Series Technical Bulletin, 2017. Available online: https://www.smooth-on.com/tb/files/ECOFLEX_SERIES_TB.pdf (accessed on 14 September 2017).

Figure 1. (a) Schematic of the array of six ESAs; (b) A single soft electromagnetic actuator composed of two conductive coils on either side of a silicone spring linkage, with a soft silicone-ferromagnetic core housed within the coils.

Figure 2. An array of identical ESAs. To differentiate the springs and dampers in different components, red represents the external connections between adjacent actuators arranged in series, while black indicates the internal linkages connecting the masses within each actuator. Linkages between parallel strands are assumed to be rigid.

Figure 3. Series-connected ESA array. Each actuator comprises two masses linked via an internal spring-damper pair $(\tilde{k}_{2}, \tilde{c}_{2})$, and adjacent actuators are connected through external spring-damper pairs $(\tilde{k}_{1}, \tilde{c}_{1})$. A control force $f(t)$ is applied to each mass to drive the contraction behavior of the actuators.

Figure 4. Timing diagram of actuator switching with overlap and rest periods. $\tilde{T}$ denotes the rest time, $T$ is the total activation time including overlap, and $\hat{T}$ is the overlap duration.

Figure 5. Closed-loop control structure of the array of soft actuators.

Figure 6. Trajectory tracking with constant reference under non-overlapping switching: (a) output trajectory, (b) control effort, and (c) actuator activation timeline.

Figure 7. Reference tracking with constant reference under overlapping switching: (a) output reference tracking, (b) corresponding control effort, and (c) actuator activation timeline.

Figure 8. Trajectory tracking with sinusoidal reference under full actuation strategy: (a) output reference tracking, (b) corresponding control effort, and (c) actuator activation timeline.

Figure 9. Sinusoidal reference tracking under overlapping switching: (a) output trajectory, (b) control effort, and (c) actuator activation timeline.

Figure 10. $\ell_{2}$-norm of control effort for constant reference with full actuation (red) vs. overlapping switching (blue).

Figure 11. Temperature profile under overlapping switching strategy before limiting control effort. (a) Control effort for actuator 1. (b) Thermal model temperature profile, showing higher temperatures compared to the full actuation strategy.

Figure 12. Temperature reduction under overlapping switching after limiting control effort: (a) Control effort for actuator 1. (b) Thermal model temperature profile, showing lower temperatures during rest periods for a higher weight $R$ in $u^{\top} R u$ compared to the full actuation strategy.

Figure 13. Trajectory tracking with sinusoidal reference under overlapping switching after limiting control effort: (a) output trajectory, (b) control effort, and (c) actuator activation timeline.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

