Article

Probabilistic Estimation and Control of Dynamical Systems Using Particle Filter with Adaptive Backward Sampling

1 Department of Electrical and Electronic Engineering, Graduate School of Engineering, Kobe University, 1-1 Rokkodai-cho, Nada-ku, Kobe 657-8501, Japan
2 Center for Mathematical and Data Sciences, Kobe University, 1-1 Rokkodai-cho, Nada-ku, Kobe 657-8501, Japan
* Author to whom correspondence should be addressed.
Entropy 2024, 26(8), 653; https://doi.org/10.3390/e26080653
Submission received: 26 May 2024 / Revised: 5 July 2024 / Accepted: 17 July 2024 / Published: 30 July 2024
(This article belongs to the Special Issue Probabilistic Models for Dynamical Systems)

Abstract:
Estimating and controlling dynamical systems from observable time-series data are essential for understanding and manipulating nonlinear dynamics. This paper proposes a probabilistic framework for simultaneously estimating and controlling nonlinear dynamics under noisy observation conditions. Our proposed method utilizes the particle filter not only as a state estimator and a prior estimator for the dynamics but also as a controller. This approach allows us to handle the nonlinearity of the dynamics and uncertainty of the latent state. We apply two distinct dynamics to verify the effectiveness of our proposed framework: a chaotic system defined by the Lorenz equation and a nonlinear neuronal system defined by the Morris–Lecar neuron model. The results indicate that our proposed framework can simultaneously estimate and control complex nonlinear dynamical systems.

1. Introduction

Estimating dynamical systems from noisy time-series data is vital to understanding complex systems [1]. Furthermore, controlling dynamical systems has attracted significant attention in various fields for manipulating known complex systems. Controlling unknown complex systems requires simultaneously estimating and controlling dynamical systems from time-series data.
The state-space model has been utilized in various fields to estimate latent variables from noisy observational data [2,3,4,5,6,7,8,9,10]. The state-space model (SSM), also called the general hidden Markov model, is a probabilistic framework used to represent the dynamical system governing the latent variable and the probabilistic observation producing the observation variable. Under the assumption that the parameters governing the entire dynamics are known, the latent variable can be estimated efficiently using a sequential Bayesian filter such as a particle filter [11,12,13]. Nonetheless, in practice, it is unrealistic that the parameter values are completely known. If the parameters are unknown, it is necessary to estimate the latent variable and the parameters simultaneously. Therefore, many parameter estimation methods based on the particle filter have been proposed to estimate the latent variable and the parameters of the SSM (see [14] and the references therein). Some offline methods have been proposed, where every recursive parameter update requires a batch of observational data [15]. Other online methods have been proposed, where the parameter values can be estimated at the same time that new observational data become available [16]. However, previous online parameter estimation methods require multiple samples for each particle drawn from backward ancestor sampling every time, which leads to high computational costs.
In contrast, model predictive control has been applied in various fields to control nonlinear dynamical systems [17]. Model predictive control is a control strategy that predicts a known model’s behavior in a future interval and calculates an optimal control input; however, to apply model predictive control to a nonlinear model, it is necessary to formulate an optimal control problem, depicted by the calculus of variations or the Euler–Lagrange equation. Solving the optimal control problem, including nonlinearity, usually requires high computational cost and is not always feasible. In particular, particle filter-based model predictive control (PF-MPC) has been proposed [18]. PF-MPC can directly handle known nonlinear models by using a particle filter; however, in previous studies on control, the model dynamics are assumed to be known [18]. For considering real-world applications, the dynamics of complex systems are often unknown.
In this paper, we propose a framework for simultaneously estimating the latent variable and the parameters of the state-space model, as well as controlling the latent variable, fully based on the particle filter. The proposed method realizes a general method for estimating and controlling dynamical systems described by the state-space model by integrating and adapting the framework of the particle filter (PF), the online expectation-maximization (EM) algorithm, and model predictive control (MPC). With the PF, we estimate the latent variable of the nonlinear dynamics online. In particular, to fully realize the online algorithm for simultaneous estimation and control, we derive a novel online parameter estimation method for the state-space model by applying an adaptive smoothing (AdaSmooth) algorithm, which has recently been proposed for smoothing statistics based on the PF [19]. By combining AdaSmooth with the online EM algorithm, we efficiently approximate the sufficient statistics and estimate the parameter values based on the smoothed sufficient statistics. Furthermore, we consider combining PF-based model predictive control (PF-MPC) [18] with our AdaSmooth-based online EM algorithm to form the feedback control law. By introducing the PF not only as a state estimator and a prior estimator for the dynamics but also as a controller, we can directly handle the nonlinearity of the dynamics and uncertainty of the latent state.
This paper is organized as follows. Section 2 introduces the state-space model and the particle filter. We then propose the AdaSmooth-based online EM algorithm for estimating the parameters of the state-space model. Finally, by combining PF-MPC with the state estimation PF and the AdaSmooth-based online EM algorithm, we propose a framework for estimating and controlling dynamical systems. In Section 3, we verify the effectiveness of our proposed method using simulation environments. We apply our proposed framework to two distinct dynamical systems: a chaotic system defined by the Lorenz equation and a nonlinear neuronal model. In each subsection, we first formulate the state-space model for each dynamical system. We then derive the parameter estimation procedure based on the AdaSmooth-based online EM algorithm. Next, we derive the concrete control law based on PF-MPC before applying our proposed framework to each complex nonlinear dynamical system. Furthermore, we present the results for estimating and controlling the chaotic system and the neuronal system. Finally, Section 4 presents our concluding remarks.

2. Methods

This study proposes a probabilistic framework for concurrent data assimilation-based control of dynamical systems. To handle a realistic situation, we assume that we can only observe noisy data via some observation model, i.e., we cannot directly obtain the true latent state. Figure 1 shows the overview of our proposed framework. Our proposed framework comprises three parts: (i) estimation of the posterior distribution of the latent state from noisy observational time-series data, (ii) online estimation of the model parameters governing the dynamical system, and (iii) solving the optimal control problem using a control PF with the estimated state and parameters.

2.1. Formulation of State-Space Model

First, we formulate the SSM. Figure 2 shows an overview of the SSM. The SSM comprises the system model and the observation model. The system model represents the dynamics of the latent variable $x_t$, whereas the observation model represents how we obtain the observation variable $y_t$ in a realistic situation where noise exists.
We consider the system model with the Markov property, including the state $x_t$, the control input $u_t$, and the system noise $z_t$:
$x_t = f(x_{t-1}, u_{t-1}, z_{t-1}; \theta)$,  (1)
where $t$ denotes a discrete time index, $\theta$ is an unknown parameter governing the entire dynamics, and $f$ is a function determining the dynamics, which can include nonlinearity. Corresponding to Equation (1), we also define the probabilistic representation of the system model for the SSM as follows:
$x_t \sim p(x_t \mid x_{t-1}, u_{t-1}; \theta)$.  (2)
We then formulate the observation model for the SSM. To reflect a realistic scenario, we assume that we can only obtain the observation variable $y_t$ at time $t$ and not the latent variable $x_t$ directly. Hence, we formulate the following observation model for the SSM:
$y_t = g(x_t, \eta_t; \theta)$,  (3)
where $g$ is a function determining the information we obtain via the observation and $\eta_t$ represents observation noise. Corresponding to Equation (3), we also define the probabilistic representation of the observation model for the SSM:
$y_t \sim p(y_t \mid x_t; \theta)$.  (4)
For simplicity, we assume that the joint probability $p(x_{1:t}, y_{1:t}; \theta)$ belongs to the exponential family, which is a common assumption. Here, $x_{1:t}$ and $y_{1:t}$ represent the time series of the latent variable and the observation variable, $x_{1:t} = \{x_1, \ldots, x_t\}$ and $y_{1:t} = \{y_1, \ldots, y_t\}$, respectively. For example, when both the latent state and the observation variable are one-dimensional real numbers and the model is linear Gaussian, the state-space model is described as follows:
$x_t = a x_{t-1} + b z_{t-1}, \qquad y_t = c x_t + d \eta_t$,  (5)
where $x_t$ and $y_t \in \mathbb{R}$ are the latent state and observation variable, respectively; $a, b, c$, and $d \in \mathbb{R}$ are parameters; and $z_{t-1}$ and $\eta_t$ are noise terms, both of which obey the standard Gaussian distribution $\mathcal{N}(0, 1)$. In this linear Gaussian case, the probabilistic representation of the SSM is expressed as follows:
$x_t \sim \mathcal{N}(x_t \mid a x_{t-1}, b^2)$,  (6)
$y_t \sim \mathcal{N}(y_t \mid c x_t, d^2)$,  (7)
where $\mathcal{N}(x \mid \mu, \sigma^2)$ denotes the Gaussian distribution with mean $\mu$ and variance $\sigma^2$.
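As a concrete illustration, the one-dimensional linear Gaussian SSM above can be simulated in a few lines. This is a minimal sketch: the parameter values, the zero initial state, and the function name `simulate_lgssm` are illustrative choices, not part of the original formulation.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_lgssm(a, b, c, d, T):
    """Simulate x_t = a*x_{t-1} + b*z_{t-1}, y_t = c*x_t + d*eta_t
    with standard Gaussian noise terms z and eta."""
    x = np.zeros(T)
    y = np.zeros(T)
    x_prev = 0.0  # assumed initial state (the text leaves p_0 unspecified here)
    for t in range(T):
        x[t] = a * x_prev + b * rng.standard_normal()  # system model, Eq. (5)
        y[t] = c * x[t] + d * rng.standard_normal()    # observation model, Eq. (5)
        x_prev = x[t]
    return x, y

x, y = simulate_lgssm(a=0.9, b=0.5, c=1.0, d=0.2, T=100)
```

With $|a| < 1$ the latent trajectory is stationary; the observation sequence is simply a noisy reading of it.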

2.2. Estimation of the Hidden State

To estimate the latent state $x_t$ from noisy time-series observational data $y_{1:t} = \{y_1, y_2, \ldots, y_t\}$ online, we apply the particle filter (PF) [11,12,13,20].
Here, we estimate the latent variable using the filtering distribution $p(x_t \mid y_{1:t})$. In the PF, the filtering distribution is approximated online using particles $\{x_t^{(1)}, \ldots, x_t^{(N)}\}$ and their associated weights $\{w_t^{(1)}, \ldots, w_t^{(N)}\}$, where $N$ is the number of particles. We calculate these particles by alternately performing two steps: a resampling step and a prediction step.
In the resampling step, we resample the particles obtained previously based on their weights. Although many resampling schemes have been proposed, we use multinomial resampling [21,22]. Using the multinomial distribution $\mathcal{M}(w_{t-1}^{(1)}, \ldots, w_{t-1}^{(N)})$ defined by the weights, the ancestor index $A_t^{(i)}$ for the $i$-th particle at time $t$ is sampled as follows:
$A_t^{(i)} \sim \mathcal{M}\big(w_{t-1}^{(1)}, \ldots, w_{t-1}^{(N)}\big)$,  (8)
where $i \in \{1, 2, \ldots, N\}$ is the index of the particle and $\mathcal{M}(w^{(1)}, \ldots, w^{(N)})$ denotes the multinomial distribution supported on $\{1, 2, \ldots, N\}$. Using the ancestor index $A_t^{(i)}$, the $i$-th particle $\bar{x}_t^{(i)}$ at time $t$ is resampled as follows:
$\bar{x}_t^{(i)} = x_t^{(A_t^{(i)})}$.  (9)
We apply this resampling step only when the estimated effective sample size $\hat{N}_{\mathrm{eff}}$ satisfies $\hat{N}_{\mathrm{eff}}(t) \le \alpha N$, where $0 \le \alpha \le 1$ is a hyperparameter that determines the threshold of the effective sample size [23]. Here, the estimated effective sample size $\hat{N}_{\mathrm{eff}}$ is calculated as follows:
$\hat{N}_{\mathrm{eff}}(t) = 1 \Big/ \sum_{i=1}^{N} \big(w_t^{(i)}\big)^2$.  (10)
When $\hat{N}_{\mathrm{eff}}(t) > \alpha N$, instead of performing the probabilistic resampling described in Equation (8), we simply set $A_t^{(i)} = i$ for all particles.
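The ESS-based resampling rule above can be sketched as follows; the function names and the example weight vectors are illustrative.

```python
import numpy as np

def effective_sample_size(weights):
    """Estimated ESS: 1 / sum_i (w_t(i))^2 for normalized weights."""
    w = np.asarray(weights)
    return 1.0 / np.sum(w ** 2)

def should_resample(weights, alpha):
    """Apply probabilistic resampling only when ESS <= alpha * N."""
    n = len(weights)
    return effective_sample_size(weights) <= alpha * n

# Uniform weights give the maximum ESS = N; a degenerate weight
# vector concentrated on one particle gives ESS close to 1.
uniform = np.full(4, 0.25)
degenerate = np.array([0.97, 0.01, 0.01, 0.01])
```

With $\alpha = 0.5$, the degenerate weights trigger resampling while the uniform weights do not.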
In the prediction step, we sample particles that approximate the filtering distribution when new observational data $y_t$ are obtained. First, we sample each particle using the system model:
$x_t^{(i)} \sim p\big(x_t \mid x_{t-1}^{(A_t^{(i)})}\big)$.  (11)
We then calculate the weight of each particle using the observation model:
$\tilde{w}_t^{(i)} = w_{t-1}^{(A_t^{(i)})}\, p\big(y_t \mid x_t^{(i)}\big)$,  (12)
where $\tilde{w}_t^{(i)}$ denotes an unnormalized weight, and we assume that the weights from the previous time step and the ancestors are accessible. Note that the weight update given by Equation (12) is derived from sequential importance sampling [20], where the importance distribution is taken to be the system model itself. Finally, we normalize the weights as follows:
$w_t^{(i)} = \tilde{w}_t^{(i)} \Big/ \sum_{l=1}^{N} \tilde{w}_t^{(l)}$.  (13)
Using the sampled particles and their weights $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^N$ allows us to obtain the filtering distribution of the latent variable online as follows:
$p(x_t \mid y_{1:t}) \approx \sum_{i=1}^{N} w_t^{(i)}\, \delta\big(x_t - x_t^{(i)}\big)$,  (14)
where $\delta(\cdot)$ is Dirac's delta function. Here, the particles and their weights at time $t = 0$ are generated as follows:
$x_0^{(i)} \sim p_0(x), \qquad w_0^{(i)} = \frac{1}{N}$,  (15)
where $p_0(x)$ is the initial distribution.
We estimate the latent state online by alternately applying the resampling and prediction steps as new observational data arrive. Although the PF can suffer from the path-degeneracy problem, this can be mitigated by the resampling step. In this study, we apply multinomial resampling; however, other resampling methods can also be used in our proposed framework. The particles and their associated weights $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^N$ are used to estimate the latent state and the parameters, and serve as a prior for the state in the controller, within our proposed framework.
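Putting the resampling and prediction steps together, a minimal bootstrap PF for the linear Gaussian example of Section 2.1 might look as follows. All parameter values and names are illustrative, and the likelihood is evaluated in log space before normalization (an implementation detail not spelled out in the text).

```python
import numpy as np

rng = np.random.default_rng(1)

def particle_filter(y, a, b, c, d, n_particles=500, alpha=0.5):
    """Bootstrap PF for x_t = a*x_{t-1} + b*z_t, y_t = c*x_t + d*eta_t.

    Resampling is triggered only when the estimated ESS drops to
    alpha * N or below; otherwise the ancestor of particle i is i itself.
    """
    x = rng.standard_normal(n_particles)          # assumed N(0,1) initial distribution
    w = np.full(n_particles, 1.0 / n_particles)   # uniform initial weights, Eq. (15)
    means = []
    for y_t in y:
        # resampling step (multinomial), applied adaptively
        if 1.0 / np.sum(w ** 2) <= alpha * n_particles:
            anc = rng.choice(n_particles, size=n_particles, p=w)
            x = x[anc]
            w = np.full(n_particles, 1.0 / n_particles)
        # prediction step: propagate each particle through the system model
        x = a * x + b * rng.standard_normal(n_particles)
        # weight update from the observation likelihood N(y_t | c*x_t, d^2)
        log_lik = -0.5 * ((y_t - c * x) / d) ** 2
        w = w * np.exp(log_lik - log_lik.max())
        w = w / w.sum()
        means.append(np.sum(w * x))               # filtering-mean point estimate
    return np.array(means)
```

Run on data generated from the same model, the filtering mean should track the latent state more closely than the raw observations do.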

2.3. Estimation of the Parameter

We utilize a particle filter-based online parameter estimation strategy to estimate the parameters governing the dynamics online. This subsection briefly introduces the original expectation-maximization (EM) algorithm [15] and the particle-based online EM algorithm [14,16]. We then derive an efficient parameter estimation procedure using adaptive smoothing (AdaSmooth) [19].

2.3.1. EM Algorithm

The EM algorithm [15] is a widely used offline parameter estimation method. In the EM algorithm, when we have a batch of observational data $\{y_1, \ldots, y_T\}$, we estimate the parameter sequence $\{\hat{\theta}_1, \ldots, \hat{\theta}_k, \ldots\}$ by alternately conducting an E-step and an M-step, where $T$ is the terminal time and $k$ is the iteration number.
In the E-step, we calculate the expectation of the log joint probability $\log p(x_{1:T}, y_{1:T}; \theta)$ given the parameter $\hat{\theta}_{k-1}$ from the previous iteration:
$\Phi_{1:T; \hat{\theta}_{k-1}}[L_{1:T}(\theta)] = \mathbb{E}_{p(x_{1:T} \mid y_{1:T}, \hat{\theta}_{k-1})}\big[\log p(x_{1:T}, y_{1:T}; \theta)\big]$,  (16)
where $\mathbb{E}_{p(x_{1:T} \mid y_{1:T}, \hat{\theta}_{k-1})}[\cdot]$ denotes the expectation with respect to $x_{1:T}$ under the joint smoothing distribution given the observational data $y_{1:T}$ and the parameter $\hat{\theta}_{k-1}$. $L_{1:T}(\theta)$ corresponds to the log-likelihood of the joint probability $p(x_{1:T}, y_{1:T}; \theta)$. In the M-step, we update the parameter estimate at this iteration, $\hat{\theta}_k$, as follows:
$\hat{\theta}_k = \arg\max_{\theta} \Phi_{1:T; \hat{\theta}_{k-1}}[L_{1:T}(\theta)]$,  (17)
where arg max θ denotes the maximizer with respect to the parameter θ .
The EM algorithm is an appealing parameter estimation method; however, it requires a batch of observational data, and the computational cost of repeatedly alternating the two steps grows with the size of the observational data and cannot be ignored. Furthermore, the expectation in the E-step is difficult to calculate in typical cases. Moreover, an online parameter estimation algorithm is needed to realize a simultaneous method for estimating and controlling nonlinear dynamics.

2.3.2. AdaSmooth-Based Online EM Algorithm

In this subsection, we propose the AdaSmooth-based online EM algorithm for estimating the parameters of the SSM. To overcome the problems in the original EM algorithm, some online adaptations have been proposed [14,24,25]. In particular, a particle-based online EM algorithm for time-series state-space models has been proposed [16]. However, a previous parameter estimation method based on PaRIS [26] requires multiple samples for each particle drawn from backward ancestor sampling every time, which leads to high computational costs.
Figure 3 shows the flow of our proposed online EM algorithm for estimating the parameter $\hat{\theta}_t$ online. We consider the parameter $\hat{\theta}_t$ at time $t$ rather than at the iteration number $k$ used in the offline EM algorithm. Note that the model parameter $\theta$ in the true dynamics is not time-varying, whereas the model parameter used for estimating and controlling the dynamical system should be treated as a time-varying estimate $\hat{\theta}_t$ updated in an online manner. Furthermore, we approximate the smoothed sufficient statistics $\Phi_{1:t; \hat{\theta}_{t-1}}[S_t]$ using the particle sets at consecutive times, $\{(x_{t-1}^{(i)}, w_{t-1}^{(i)})\}_{i=1}^N$ and $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^N$, obtained online via the PF [16]. To estimate the parameters efficiently, we apply AdaSmooth [19] to smooth the sufficient statistics. AdaSmooth requires only one sample per particle drawn from backward ancestor sampling. In addition, AdaSmooth decides adaptively whether backward ancestor sampling should be conducted; hence, we can obtain the approximated smoothed sufficient statistics $\Phi_{1:t; \hat{\theta}_{t-1}}[S_t]$ and estimate the parameter $\hat{\theta}_t$ online.
First, we rewrite the M-step in the EM algorithm. Assuming that the joint probability p ( x 1 : t , y 1 : t ; θ ) belongs to the exponential family, a suitable function Λ ( · ) [14] exists as a maximizer of the log-likelihood in a specific form. That is, we can rewrite the M-step in the context of online estimation as follows:
$\hat{\theta}_t = \Lambda\left(\frac{1}{t}\, \Phi_{1:t; \hat{\theta}_{t-1}}[S_t]\right)$,  (18)
where $\Phi_{1:t; \hat{\theta}_{t-1}}$ denotes the expectation with respect to the joint probability $p(x_{1:t} \mid y_{1:t}, \hat{\theta}_{t-1})$. The parameter index is now the time $t$, in contrast to the iteration count $k$ used in Equation (17) in the offline context. $S_t$ denotes the sufficient statistics, defined as follows:
$S_t(x_{0:t}) = \sum_{\tau=1}^{t} \tilde{s}_\tau(x_{\tau-1:\tau})$.  (19)
Here, $\tilde{s}_\tau(x_{\tau-1:\tau}) = \tilde{s}_\tau(x_{\tau-1}, x_\tau)$ is a statistic given by the states $x_{\tau-1}$ and $x_\tau$ at the consecutive times $\tau-1$ and $\tau$.
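For intuition, consider estimating only the transition coefficient $a$ of the linear Gaussian example from Section 2.1. A natural sufficient statistic is $\tilde{s}_\tau = (x_{\tau-1} x_\tau,\, x_{\tau-1}^2)$, and the maximizer is the regression-style ratio $\Lambda(s) = s_1 / s_2$. This particular choice is our illustrative example, not a construction taken from the paper.

```python
import numpy as np

def s_tilde(x_prev, x_curr):
    """Sufficient statistics for the coefficient a in x_t = a*x_{t-1} + b*z_t."""
    return np.array([x_prev * x_curr, x_prev ** 2])

def lam(avg_stats):
    """Maximizer Lambda: a_hat = E[x_{t-1} x_t] / E[x_{t-1}^2]."""
    return avg_stats[0] / avg_stats[1]

# On a noiseless trajectory x_t = 0.9 * x_{t-1}, Lambda recovers a.
x = 0.9 ** np.arange(50)
stats = np.mean([s_tilde(x[t - 1], x[t]) for t in range(1, 50)], axis=0)
```

Averaging $\tilde{s}_\tau$ over the trajectory and applying $\Lambda$ yields $\hat{a} = 0.9$ on this example.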
Next, we estimate $\Phi_{1:t; \hat{\theta}_{t-1}}[S_t]$ online. At time $t$, we calculate the approximated smoothed sufficient statistics in the E-step [14,16] as follows:
$\Phi_{0:t; \hat{\theta}_{t-1}}[S_t(x_{0:t})] = \int \kappa_t(x_t)\, p(x_t \mid y_{1:t})\, \mathrm{d}x_t$,  (20)
where $\kappa_t(x_t)$ is a statistic updated at each time step via the following procedure:
$\kappa_t(x_t) = \int \big\{ \gamma_t\, \tilde{s}_t(x_{t-1}, x_t) + (1 - \gamma_t)\, \kappa_{t-1}(x_{t-1}) \big\}\, p(x_{t-1} \mid y_{1:t-1}, \hat{\theta}_{1:t-1})\, \mathrm{d}x_{t-1}$,  (21)
where $\kappa_0(x_0) = 0$ and $\gamma_t$ is a decay rate that satisfies $\sum_{t=1}^{\infty} \gamma_t = \infty$ and $\sum_{t=1}^{\infty} \gamma_t^2 < \infty$.
Here, we estimate $\kappa_t(x_t)$ using particles. We prepare a new set of particles $\{\kappa_t^{(i)}\}_{i=1}^N$. Then, we derive the update dynamics of these particles based on AdaSmooth. Essentially, we update each particle $\kappa_t^{(i)}$, corresponding to Equation (21), as follows:
$\kappa_t^{(i)} = (1 - \gamma_t)\, \kappa_{t-1}^{(i)} + \gamma_t\, \tilde{s}_t\big(x_{t-1}^{(A_t^{(i)})}, x_t^{(i)}\big)$,  (22)
where $A_t^{(i)}$ is the ancestor index obtained in the PF resampling step. The first term in Equation (22) carries over the value from the previous time step. The second term in Equation (22) incorporates the sufficient statistics from the information obtained at time $t$. Nonetheless, updating $\kappa_t^{(i)}$ using only Equation (22) can lead to the particle path-degeneracy phenomenon and usually results in instability. To avoid this problem, we introduce adaptive backward ancestor resampling [19]. During resampling in the PF, when the diversity of the ancestors $\hat{N}_{\mathrm{anc}}(t)$ satisfies $\hat{N}_{\mathrm{anc}}(t) \le \beta N$, we conduct backward ancestor resampling to sample $B_t^{(i)}$. We then update the statistics using the following instead of Equation (22):
$\kappa_t^{(i)} = \frac{1}{2} \big\{ (1 - \gamma_t)\, \kappa_{t-1}^{(i)} + \gamma_t\, \tilde{s}_t\big(x_{t-1}^{(A_t^{(i)})}, x_t^{(i)}\big) \big\} + \frac{1}{2} \big\{ (1 - \gamma_t)\, \kappa_{t-1}^{(i)} + \gamma_t\, \tilde{s}_t\big(x_{t-1}^{(B_t^{(i)})}, x_t^{(i)}\big) \big\}$.  (23)
The first term in Equation (23) is the same as in Equation (22). The second term in Equation (23) represents the correction of the sufficient statistics using a new ancestor drawn from backward ancestor sampling. Here, $0 \le \beta \le 1$ is a hyperparameter that determines the threshold of ancestor diversity. $\hat{N}_{\mathrm{anc}}(t)$ is the number of unique Enoch indices $\{E_t^{(i)}\}$, defined as follows [19]:
$\hat{N}_{\mathrm{anc}}(t) = \big| \{ E_t^{(1)}, \ldots, E_t^{(N)} \} \big|$.  (24)
Here, the Enoch index $E_t^{(i)}$ is updated via the following procedure:
$E_t^{(i)} = \begin{cases} i & (t = t_0 + 1) \\ E_{t-1}^{(A_t^{(i)})} & (t > t_0 + 1) \end{cases}$,  (25)
where $t_0$ represents the last time at which backward ancestor sampling was conducted. The definition of the Enoch index shows that each particle retains the index of its ancestor at the last time backward ancestor sampling was conducted. Hence, a decrease in $\hat{N}_{\mathrm{anc}}(t)$ corresponds to a decrease in the diversity of the ancestors. The value of $\hat{N}_{\mathrm{anc}}(t)$ is used to determine whether backward ancestor sampling should be conducted.
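The Enoch index bookkeeping can be sketched as follows; the function names are illustrative, and `reset=True` corresponds to the step immediately after backward ancestor sampling ($t = t_0 + 1$).

```python
import numpy as np

def update_enoch(enoch, ancestors, reset):
    """Update the Enoch indices: reset to the identity right after
    backward ancestor sampling, otherwise inherit the ancestor's index."""
    n = len(ancestors)
    if reset:                   # t = t0 + 1
        return np.arange(n)
    return enoch[ancestors]     # t > t0 + 1

def ancestor_diversity(enoch):
    """Number of unique Enoch indices, used as the diversity measure."""
    return len(np.unique(enoch))
```

Repeated resampling without backward correction collapses the diversity: starting from five distinct indices, one resampling round with ancestors `[0, 0, 1, 1, 2]` leaves only three.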
Here, we show the procedure used in backward ancestor sampling to obtain the backward ancestor index $B_t^{(i)}$. According to Bayes' theorem, the backward ancestor index is sampled as follows:
$B_t^{(i)} \sim \mathcal{M}\Big( \big\{ w_{t-1}^{(l)}\, p\big(x_t^{(i)} \mid x_{t-1}^{(l)}\big) \big\}_{l=1}^{N} \Big)$.  (26)
We implement this sampling scheme efficiently using acceptance-rejection sampling [19,26].
Once $\kappa_t(x_t)$ is approximated, the smoothed sufficient statistics are estimated as follows:
$\Phi_{0:t; \hat{\theta}_{t-1}}[S_t(x_{0:t})] \approx \sum_{i=1}^{N} w_t^{(i)} \kappa_t^{(i)}$.  (27)
Finally, we estimate the parameter using Equation (18). The parameter θ ^ t , estimated using our AdaSmooth-based online EM algorithm, is also used in the controller described in Section 2.4.
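A sketch of the per-particle statistic update and the final weighted estimate is given below. Indexing the previous statistics by the forward ancestors is our reading of how the resampling step propagates them (an assumption); `s_tilde` is a user-supplied sufficient statistic.

```python
import numpy as np

def update_kappa(kappa_prev, x_prev, x_curr, anc, gamma, s_tilde, b_anc=None):
    """One AdaSmooth-style update of the per-particle smoothed statistics.

    Without backward sampling: (1-g)*kappa + g*s(x_{t-1}(A_i), x_t(i)).
    With backward indices B_i, average the A_i and B_i contributions,
    keeping the shared (1-g)*kappa term in both halves."""
    fwd = (1 - gamma) * kappa_prev[anc] + gamma * s_tilde(x_prev[anc], x_curr)
    if b_anc is None:
        return fwd
    bwd = (1 - gamma) * kappa_prev[anc] + gamma * s_tilde(x_prev[b_anc], x_curr)
    return 0.5 * (fwd + bwd)

def smoothed_statistic(weights, kappa):
    """Weighted estimate of the smoothed sufficient statistic."""
    return np.sum(weights * kappa)
```

The backward correction only changes which previous-state particle enters `s_tilde`, exactly mirroring the structure of the two-term average above.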
Here, we briefly compare our proposed AdaSmooth-based online EM algorithm with the offline EM algorithm and their variants without considering control contexts. The offline EM algorithm requires many iterations with a batch of data. As a result, it has high computational costs. On the other hand, our AdaSmooth-based online EM algorithm requires no iteration for each step. Hence, our proposed framework realizes efficient online estimation of model parameters.

2.4. Control Strategy for the Dynamics

To design an efficient controller for the general state-space model, PF-based model predictive control (PF-MPC) [18] is integrated with our dynamics estimator in the proposed method. Model predictive control (MPC) is a theoretical framework for feedback control of dynamical systems. In conventional MPC, an optimal control problem is formulated at time $t$ over a future interval called the horizon, using the model $x_{t+1} = f(x_t, u_t)$. Next, the problem is numerically solved to obtain the optimal control input series $\{u_\tau^*\}$ over the horizon. Finally, the first control input of this series is adopted as the actual input. Specifically, in conventional MPC, the optimal control problem over the horizon is solved numerically via the Euler–Lagrange equation, which is computationally expensive because it involves the derivatives of the model with respect to the state $x_t$ and the control input $u_t$. In contrast, in PF-MPC, the optimal control problem is treated as a filtering problem for the control input $u_t$ based on an augmented state-space model within the horizon.
Figure 4 shows the overview of PF-MPC. Our framework has two distinct PF modules: one (the state PF) acts as a state estimator, as described in Section 2.2, and the other (the control PF) serves as a solver for the control-filtering problem defined below. First, the latent state particles and their associated weights $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^N$ are obtained via the state PF. Then, the control input particles $\{\bar{u}_t^{(i)}\}$ are sampled from the proposal distribution $p(\bar{u}_t \mid u_{t-1})$. Subsequently, by regarding the input variable as part of the latent state, we form augmented state particles $\{(x_t^{(i)}, \bar{u}_t^{(i)}, w_t^{(i)})\}_{i=1}^N$ over the horizon. Given the reference trajectory $\{r_t, \ldots, r_{t+T_H}\}$, we solve the control-filtering problem by applying the control PF within the horizon and obtain the filtering distribution of the control input $p(\bar{u}_t \mid r_t, \ldots, r_{t+T_H})$, where $T_H$ is the duration of the horizon. Finally, we adopt $u_t$ as the actual input based on the filtering distribution of the initial control input.
Here, we discuss the details of PF-MPC. To formulate the control-filtering problem, we define an augmented state-space model over the horizon. The augmented state-space model consists of the augmented system model and the augmented observation model.
First, we formulate the augmented system model over the horizon. Within the horizon, we consider the control input $u_t$ as part of the augmented state $\zeta_t = (\bar{x}_t, \bar{u}_t, \tilde{u}_t)$, where $\tilde{u}_t$ serves as an auxiliary variable that preserves the initial input throughout the horizon. Here, $\bar{x}_t$ and $\bar{u}_t$ denote the predictive state and control input within the horizon, respectively. Note that the notations $\bar{x}_t$ and $\bar{u}_t$ are required to distinguish the predicted state and control input transitions over the horizon, $\{(\bar{x}_t, \bar{u}_t), \ldots, (\bar{x}_{t+T_H}, \bar{u}_{t+T_H})\}$, from the actual state and input time series $\{(x_1, u_1), \ldots, (x_t, u_t), \ldots\}$. Therefore, we define the following augmented system model for the augmented state $\zeta_\tau$ within the horizon:
$\zeta_{\tau+1} \sim p(\zeta_{\tau+1} \mid \zeta_\tau; \hat{\theta}_t) = p(\bar{x}_{\tau+1} \mid \bar{x}_\tau, \bar{u}_\tau; \hat{\theta}_t)\, p(\bar{u}_{\tau+1} \mid \zeta_\tau)\, \delta(\tilde{u}_{\tau+1} - \tilde{u}_\tau)$,  (28)
where $\tau \in \{t, t+1, \ldots, t+T_H\}$ is the time index within the horizon, $p(\bar{x}_{\tau+1} \mid \bar{x}_\tau, \bar{u}_\tau; \hat{\theta}_t)$ is defined by the original system model of the state-space model described in Section 2.1, $p(\bar{u}_{\tau+1} \mid \zeta_\tau)$ is the transition probability of the control input, and $\delta(\tilde{u}_{\tau+1} - \tilde{u}_\tau)$ is the probability corresponding to the deterministic transition $\tilde{u}_{\tau+1} = \tilde{u}_\tau$. The parameter $\hat{\theta}_t$ is estimated using our AdaSmooth-based online EM algorithm and is fixed throughout the horizon.
Next, we formulate the augmented observation model. We consider controlling the latent state $x_t$ toward the given reference trajectory $\{r_1, \ldots, r_t, \ldots\}$. We define the augmented observation model as follows:
$r_\tau \sim p(r_\tau \mid \zeta_\tau)$.  (29)
For example, a Gaussian distribution can be used as the observation model:
$r_\tau \sim \mathcal{N}(r_\tau \mid R_\tau \zeta_\tau, \Sigma_r)$,  (30)
where $R_\tau$ is an appropriately sized matrix and $\Sigma_r$ is a diagonal covariance matrix that adjusts the error between the state and the reference trajectory. Note that the dimension of the reference does not have to match that of the latent variable, i.e., $R_\tau$ can be a non-square matrix. Furthermore, the augmented observation model is very flexible, but its details are omitted here (see [18]).
We then solve the control-filtering problem given by the augmented state-space model [Equations (28) and (29)] and the given reference trajectory $\{r_\tau\}_{\tau=t}^{t+T_H}$. Using the control PF over the horizon $\{t, t+1, \ldots, t+T_H\}$, we solve the control-filtering problem efficiently to estimate the augmented state $\zeta_t = (\bar{x}_t, \bar{u}_t, \tilde{u}_t)$. Here, the initial augmented particles $\{\zeta_t^{(i)} = (\bar{x}_t^{(i)}, \bar{u}_t^{(i)}, \tilde{u}_t^{(i)})\}_{i=1}^N$ and their associated weights $\{\bar{w}_t^{(i)}\}_{i=1}^N$ in the horizon are initialized for each particle using information obtained from the state PF as follows:
$\bar{x}_t^{(i)} = x_t^{(i)}, \qquad \bar{u}_t^{(i)} \sim p(\bar{u}_t \mid u_{t-1}), \qquad \tilde{u}_t^{(i)} = \bar{u}_t^{(i)}, \qquad \bar{w}_t^{(i)} = w_t^{(i)}$,  (31)
where $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^N$ are the particles obtained from the state PF for latent state estimation and $p(\bar{u}_t \mid u_{t-1})$ is the initial proposal distribution for the control input. To avoid confusion, we note here that the aim of the control PF is to estimate the optimal initial control input, which is explored via the proposal distribution. For filtering the initial control input $\bar{u}_t$, the auxiliary variable $\tilde{u}_t$ is defined to preserve the initial input $\bar{u}_t$. Although $\tilde{u}_t^{(i)} = \bar{u}_t^{(i)}$ at time $t$, as defined in Equation (31), the predictive control inputs $\bar{u}_{t+1}^{(i)}, \bar{u}_{t+2}^{(i)}, \ldots$ in the horizon take values different from $\tilde{u}_t^{(i)}$. Therefore, we need both notations $\bar{u}_\tau$ and $\tilde{u}_\tau$ for the control PF in the horizon.
Finally, we estimate the optimal control input based on the filtered particles. By extracting the auxiliary variables that retain the initial control input and their associated weights $\{(\tilde{u}_{t+T_H}^{(i)}, \bar{w}_{t+T_H}^{(i)})\}_{i=1}^N$, we compute the actual control input at time $t$ via point estimation of the filtering distribution as follows:
$u_t = \sum_{i=1}^{N} \bar{w}_{t+T_H}^{(i)}\, \tilde{u}_{t+T_H}^{(i)}$.  (32)
It should be noted that the auxiliary variable $\tilde{u}_{t+T_H}$ inherits the information from times $t, \ldots, t+T_H-1$. Hence, to obtain the estimated optimal control input, we rely solely on these particles at the end of the horizon.
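To make the control-filtering loop concrete, the following sketch runs a control PF over the horizon for a toy scalar linear model tracking a constant reference. The model, noise scales, Gaussian proposal for the input transition, and per-step resampling are all illustrative choices, not the paper's exact scheme.

```python
import numpy as np

rng = np.random.default_rng(2)

def pf_mpc_input(x_particles, w, u_prev, ref, horizon, a=0.9, b_u=1.0,
                 sigma_x=0.1, sigma_u=0.3, sigma_r=0.2):
    """Control PF over the horizon for the toy model
    x_{tau+1} = a*x_tau + b_u*u_tau + sigma_x*z, with pseudo-observation
    r_tau ~ N(x_tau, sigma_r^2) playing the role of the reference."""
    n = len(x_particles)
    x = x_particles.copy()
    u_bar = u_prev + sigma_u * rng.standard_normal(n)   # proposal for the initial input
    u_tilde = u_bar.copy()                              # auxiliary: preserves the initial input
    w_bar = w.copy()
    for tau in range(horizon):
        # propagate the augmented state through the system model
        x = a * x + b_u * u_bar + sigma_x * rng.standard_normal(n)
        # weight by closeness to the reference (augmented observation model)
        log_lik = -0.5 * ((ref[tau] - x) / sigma_r) ** 2
        w_bar = w_bar * np.exp(log_lik - log_lik.max())
        w_bar = w_bar / w_bar.sum()
        # multinomial resampling of the whole augmented particle
        anc = rng.choice(n, size=n, p=w_bar)
        x, u_bar, u_tilde = x[anc], u_bar[anc], u_tilde[anc]
        w_bar = np.full(n, 1.0 / n)
        # input transition p(u_{tau+1} | u_tau): re-perturb the running input only
        u_bar = u_bar + sigma_u * rng.standard_normal(n)
    # point estimate of the initial input from the surviving auxiliary variables
    return np.sum(w_bar * u_tilde)
```

With particles at the origin and a positive constant reference, the surviving auxiliary variables are those whose initial inputs pushed the state toward the reference, so the returned input is positive.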
Here, we briefly discuss the differences between conventional MPC and PF-MPC. MPC requires derivatives of the model concerning the state and the control input. Furthermore, exact point estimation of the latent state is required. Conversely, PF-MPC does not require model derivatives and only relies on particles representing the latent variable. The only requirement of PF-MPC is that the model can be simulated forward in time, which is satisfied in typical cases. Nonetheless, under certain assumptions, the control input obtained using PF-MPC is equivalent to that of conventional MPC [18]. Moreover, to clarify the advantage of our proposed method compared with previous work, we briefly touch on the differences between the previous work [27] and our proposed framework. Three significant differences exist between the previous work and our proposed framework. First, the previous work updates parameters at fixed intervals. In contrast, our proposed framework updates parameters continuously, with the controller referencing these updated parameters; hence, our proposed framework is more adaptive. Second, the previous work uses the offline EM algorithm for parameter estimation. Due to the structure of the offline EM algorithm, the previous framework maintains a history of particles within a window. In contrast, our proposed method is based on the online EM algorithm. Using the online EM algorithm as the fundamental parameter estimation strategy, we retain only the preceding and succeeding particle sets; therefore, our framework is more memory-efficient. Third, in the previous work, there are complex hyperparameters, e.g., the parameter update interval and window length. These complex hyperparameters require sensitive hand-tuning and domain-specific knowledge. Conversely, the hyperparameters of our proposed framework are defined in a simpler form. For further details, the typical values of the hyperparameters for AdaSmooth are discussed in [19]. 
Therefore, our proposed framework is easier to tune compared with the previous work.
We have proposed a fully PF-based framework for simultaneously estimating the latent state and the parameters governing the dynamics, and controlling the dynamics against a general state-space model. Although conventional MPC can handle the model’s nonlinearity directly and PF-MPC can address the model’s uncertainty, the latent variable and dynamics are assumed to be known. The previous method [27] attempts to resolve this problem; however, it requires difficult hyperparameter tuning and applies only to neuronal models. Our proposed method establishes a feedback control law for general state-space models by combining a PF and an AdaSmooth-based online EM algorithm.

3. Experiments

We verify the effectiveness of our proposed method in a simulation environment. We apply our proposed method to two complex systems: the Lorenz system and the Morris–Lecar neuron model. For each distinct model, we first formulate its complex dynamics as a general SSM. Next, we derive the sufficient statistics necessary for our AdaSmooth-based online EM algorithm to estimate the parameters. Then, we formulate the augmented SSM over the horizon to control the dynamics. Finally, the experimental settings and results for each experiment are shown.

3.1. Application to Chaotic Lorenz System

The Lorenz system, introduced in [28], is a well-known chaotic system. Because chaotic behavior is not always desirable, eliminating it is essential for practical applications. Delayed feedback control has been proposed to stabilize chaotic models [29,30]. In contrast, a control strategy for chaotic behavior with simultaneous parameter estimation using statistical machine learning has not been studied. By applying our proposed method to the Lorenz system, we aim to confirm its ability to estimate the latent state and parameters of the chaotic Lorenz system and to stabilize its chaotic behavior.
The Lorenz system is given as follows:
$$\frac{dx}{dt} = \sigma(y - x), \qquad \frac{dy}{dt} = rx - y - xz + u, \qquad \frac{dz}{dt} = xy - bz,$$
where $t$ represents continuous time; $(x, y, z) \in \mathbb{R}^3$ represents the three-dimensional latent state; $u$ is the control input; and $\sigma, r, b$ are parameters. Based on a previous study on suppressing chaotic behavior [30], we assume the control input affects only one element, $y$. To avoid confusion, the variables used in the Lorenz system are shown in Table 1. When we fix $u = 0$, the Lorenz system with $\sigma = 10$, $b = 8/3$, $r > r_H$ exhibits a subcritical Hopf bifurcation, where $r_H \approx 24.73684211$. We aim to control the $y$-element toward the unstable fixed point $y_f = \sqrt{b(r-1)}$ under noisy conditions where only noisy observations of the latent state are available. To this end, we first derive the SSM for the Lorenz system. We then derive the E-step and the M-step of our online EM algorithm. Finally, we formulate the augmented SSM for the Lorenz system to control the latent state. With these formulations, we apply the proposed method to the Lorenz system.
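As a quick sanity check of these quantities, the following sketch (our own illustration, not code from the paper) verifies the Hopf threshold $r_H = \sigma(\sigma + b + 3)/(\sigma - b - 1)$ quoted above and confirms that $(y_f, y_f, r-1)$ with $y_f = \sqrt{b(r-1)}$ is indeed a fixed point of the uncontrolled vector field:

```python
import numpy as np

sigma, r, b = 10.0, 28.0, 8.0 / 3.0

def lorenz(state, u=0.0):
    """Lorenz vector field with input u on the y-element."""
    x, y, z = state
    return np.array([sigma * (y - x),
                     r * x - y - x * z + u,
                     x * y - b * z])

# Subcritical Hopf threshold of the Lorenz system
r_H = sigma * (sigma + b + 3.0) / (sigma - b - 1.0)

# Unstable fixed point targeted by the controller
y_f = np.sqrt(b * (r - 1.0))
fixed_point = np.array([y_f, y_f, r - 1.0])

assert abs(r_H - 24.73684211) < 1e-6          # matches the value quoted above
assert np.allclose(lorenz(fixed_point), 0.0)  # the vector field vanishes here
```

The assertions confirm that the controller's target is an equilibrium of the uncontrolled dynamics and that the quoted $r_H$ equals $470/19$.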

3.1.1. Formulation of State-Space Model

Here, we derive the SSM for the Lorenz system. We first derive the system model of the SSM. We discretize the continuous-time model given by Equation (36) with a time step Δ t .
$$\mathbf{x}_{t+1} = \mathbf{x}_t + f(\mathbf{x}_t, u_t; \theta)\Delta t,$$
where $\mathbf{x}_t = [x_t, y_t, z_t]^\top \in \mathbb{R}^3$ corresponds to the latent state at time $t\Delta t$, $u_t \in \mathbb{R}$ is the control input at time $t\Delta t$, and $\theta = [\sigma, r, b]^\top \in \mathbb{R}^3$ corresponds to the unknown parameters of the system. $f(\mathbf{x}_t, u_t; \theta)$ is a function that describes the dynamics of the Lorenz system as follows:
$$f(\mathbf{x}_t, u_t; \theta) = \begin{bmatrix} \sigma(y_t - x_t) \\ r x_t - y_t - x_t z_t + u_t \\ x_t y_t - b z_t \end{bmatrix}.$$
Next, we introduce system noise governed by additive Gaussian noise with zero mean and covariance matrix $\Sigma_x = \mathrm{diag}(\sigma_{x_1}^2, \sigma_{x_2}^2, \sigma_{x_3}^2) \in \mathbb{R}^{3 \times 3}$. Here, $\mathrm{diag}(a_1, a_2, a_3)$ denotes the diagonal matrix with diagonal elements $a_1, a_2, a_3$. The system model of the SSM for the Lorenz system is formulated as follows:
$$p(\mathbf{x}_{t+1} \mid \mathbf{x}_t, u_t, \theta) = \mathcal{N}\bigl(\mathbf{x}_{t+1} \bigm| \mathbf{x}_t + f(\mathbf{x}_t, u_t; \theta)\Delta t,\ \Sigma_x \Delta t\bigr).$$
Next, we derive the observation model of the SSM. We assume that the latent state with additive Gaussian noise is obtained. Hence, we formulate the observation model as follows:
$$\mathbf{y}_t \sim \mathcal{N}(\mathbf{y}_t \mid \mathbf{x}_t, \Sigma_y),$$
where $\Sigma_y \in \mathbb{R}^{3 \times 3}$ is the covariance matrix corresponding to the observation noise.
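To make this generative model concrete, the following sketch (our own; the noise levels are the illustrative unit covariances used later in the settings) draws one state transition and one observation from the two conditional distributions above:

```python
import numpy as np

rng = np.random.default_rng(0)
dt = 0.01
theta = (10.0, 28.0, 8.0 / 3.0)          # (sigma, r, b)
Sigma_x = np.diag([1.0, 1.0, 1.0])       # system noise covariance
Sigma_y = np.diag([1.0, 1.0, 1.0])       # observation noise covariance

def f(x, u, theta):
    """Lorenz drift with control input u on the y-element."""
    sigma, r, b = theta
    return np.array([sigma * (x[1] - x[0]),
                     r * x[0] - x[1] - x[0] * x[2] + u,
                     x[0] * x[1] - b * x[2]])

def sample_transition(x, u, theta):
    """Draw x_{t+1} ~ N(x_t + f(x_t, u_t; theta) dt, Sigma_x dt)."""
    return rng.multivariate_normal(x + f(x, u, theta) * dt, Sigma_x * dt)

def sample_observation(x):
    """Draw y_t ~ N(x_t, Sigma_y)."""
    return rng.multivariate_normal(x, Sigma_y)

x_next = sample_transition(np.array([1.0, 1.0, 1.0]), 0.0, theta)
y_obs = sample_observation(x_next)
```

Being able to simulate these two steps forward is the only requirement PF-MPC places on the model.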

3.1.2. Derivation of Online EM Algorithm

Here, we derive the sufficient statistics $S_t(\mathbf{x}_{0:t})$ and the update function $\Lambda(\cdot)$ for the Lorenz SSM, given by Equations (39) and (40), in our AdaSmooth-based online EM algorithm. After calculation, we obtain the maximizer of the expected log-likelihood, $\Lambda(\cdot)$, as follows:
$$\Lambda(S_t = (A_t, b_t)) = A_t^{-1} b_t,$$
where the sufficient statistics $S_t = (A_t, b_t)$ comprise two additive-form statistics $A_t \in \mathbb{R}^{3 \times 3}$ and $b_t \in \mathbb{R}^3$, given as follows:
$$A_t(\mathbf{x}_{0:t}) = \sum_{\tau=1}^{t} \tilde{A}_\tau(\mathbf{x}_{\tau-1:\tau}),$$
$$b_t(\mathbf{x}_{0:t}) = \sum_{\tau=1}^{t} \tilde{b}_\tau(\mathbf{x}_{\tau-1:\tau}),$$
where $\tilde{A}_\tau$ and $\tilde{b}_\tau$ are defined as follows:
$$\tilde{A}_\tau(\mathbf{x}_{\tau-1:\tau}) = \mathrm{diag}\!\left(\frac{(y_{\tau-1} - x_{\tau-1})^2}{\sigma_{x_1}^2},\ \frac{x_{\tau-1}^2}{\sigma_{x_2}^2},\ \frac{z_{\tau-1}^2}{\sigma_{x_3}^2}\right)\Delta t,$$
$$\tilde{b}_\tau(\mathbf{x}_{\tau-1:\tau}) = \begin{bmatrix} \dfrac{y_{\tau-1} - x_{\tau-1}}{\sigma_{x_1}^2}\,(x_\tau - x_{\tau-1}) \\[4pt] \dfrac{x_{\tau-1}}{\sigma_{x_2}^2}\,\bigl\{ y_\tau - y_{\tau-1} + (y_{\tau-1} + x_{\tau-1} z_{\tau-1} - u_{\tau-1})\Delta t \bigr\} \\[4pt] -\dfrac{z_{\tau-1}}{\sigma_{x_3}^2}\,(z_\tau - z_{\tau-1} - x_{\tau-1} y_{\tau-1}\Delta t) \end{bmatrix}.$$
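To make the M-step concrete: because the Lorenz drift is linear in $\theta = (\sigma, r, b)$, accumulating $\tilde{A}_\tau$ and $\tilde{b}_\tau$ and solving $A_t \theta = b_t$ is a weighted least-squares fit. The following sketch (ours, not from the paper) checks this on a noise-free Euler trajectory, where the solve recovers the true parameters exactly up to floating-point error:

```python
import numpy as np

dt = 0.01
theta_true = np.array([10.0, 28.0, 8.0 / 3.0])   # true (sigma, r, b)
svar = np.array([1.0, 1.0, 1.0])                  # sigma_{x_1..3}^2

def f(x, u, th):
    """Lorenz drift, linear in the parameters th = (sigma, r, b)."""
    s, r, b_ = th
    return np.array([s * (x[1] - x[0]),
                     r * x[0] - x[1] - x[0] * x[2] + u,
                     x[0] * x[1] - b_ * x[2]])

# Noise-free Euler trajectory with zero input
T = 500
traj = np.zeros((T, 3))
traj[0] = [1.0, 2.0, 3.0]
u = np.zeros(T)
for t in range(T - 1):
    traj[t + 1] = traj[t] + f(traj[t], u[t], theta_true) * dt

# Accumulate the additive sufficient statistics A_t and b_t
A = np.zeros((3, 3))
b = np.zeros(3)
for tau in range(1, T):
    x0, y0, z0 = traj[tau - 1]
    x1, y1, z1 = traj[tau]
    A += np.diag([(y0 - x0) ** 2 / svar[0],
                  x0 ** 2 / svar[1],
                  z0 ** 2 / svar[2]]) * dt
    b += np.array([(y0 - x0) / svar[0] * (x1 - x0),
                   x0 / svar[1] * (y1 - y0 + (y0 + x0 * z0 - u[tau - 1]) * dt),
                   -z0 / svar[2] * (z1 - z0 - x0 * y0 * dt)])

theta_hat = np.linalg.solve(A, b)   # M-step: Lambda(S_t) = A_t^{-1} b_t
```

In the full algorithm the same additive statistics are averaged over the smoothed particle trajectories instead of a single known trajectory.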

3.1.3. Formulation of the Augmented State-Space Model for Control

We aim to suppress the chaotic behavior and control the $y$-element of the latent state toward the fixed point $y_f = \sqrt{b(r-1)}$ [30]. First, we define the control transition over the horizon. To consider realistic scenarios, we impose a constraint on the control input at every time $t$ as follows:
$$-u_{\lim} \le u_t \le u_{\lim},$$
where u lim > 0 is the limit of the control input. Therefore, we define the augmented system model for the control input transition as follows:
$$\bar{u}_{\tau+1} = \mathrm{clamp}_{u_{\lim}}(\bar{u}_\tau + z_{\tau+1}),$$
where $z_{\tau+1} \sim \mathcal{N}(0, \sigma_u^2)$ is Gaussian noise and $\mathrm{clamp}_{u_{\lim}}(\cdot)$ is a clamp function defined as follows:
$$\mathrm{clamp}_{u_{\lim}}(u) = \frac{|u + u_{\lim}| - |u - u_{\lim}|}{2}.$$
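This absolute-value expression is algebraically identical to hard clipping to $[-u_{\lim}, u_{\lim}]$, which a one-line check (ours) confirms:

```python
def clamp(u, u_lim):
    """(|u + u_lim| - |u - u_lim|) / 2, i.e., saturation at +/- u_lim."""
    return (abs(u + u_lim) - abs(u - u_lim)) / 2.0

u_lim = 10.0
for u in (-30.0, -10.0, -1.5, 0.0, 7.25, 10.0, 30.0):
    assert clamp(u, u_lim) == max(-u_lim, min(u_lim, u))
```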
To initialize the augmented particles, we define the following initialization procedure corresponding to the initial proposal distribution $p(\bar{u}_t^{(i)} \mid u_t^{(i)})$:
$$\bar{u}_t^{(i)} = \mathrm{clamp}_{u_{\lim}}(u_t + z_t^{(i)}),$$
where $z_t^{(i)} \sim \mathcal{N}(0, \sigma_{u_0}^2)$ is Gaussian noise. Note that we do not need a probabilistic form of the augmented SSM.
We then define the augmented observation model of the augmented SSM over the horizon. We aim to control the y-element of the latent state toward the fixed point y f ; hence, we define the augmented observation model for the PF-MPC controller as follows:
$$r_\tau \sim \mathcal{N}(r_\tau \mid y_\tau, \sigma_r^2),$$
where $y_\tau$ is the $y$-element of the predicted latent state within the horizon and $\sigma_r^2 \in \mathbb{R}$ is a variance that adjusts the acceptable error against the reference trajectory. The reference value is always set to the fixed point, i.e., $r_t = y_f$ for $t = 1, 2, \ldots$.
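In the control PF, this augmented observation model acts as a likelihood that reweights candidate input sequences: particles whose predicted $y$-element lies close to the reference receive exponentially larger weights. A minimal sketch (ours; the numbers are illustrative):

```python
import math

def reference_weight(y_pred, r_ref, sigma_r=1.0):
    """Unnormalized weight N(r_ref | y_pred, sigma_r^2) of a control particle."""
    return math.exp(-0.5 * ((r_ref - y_pred) / sigma_r) ** 2)

r_ref = math.sqrt((8.0 / 3.0) * (28.0 - 1.0))   # y_f for r = 28, b = 8/3
w_near = reference_weight(r_ref + 0.2, r_ref)
w_far = reference_weight(r_ref + 3.0, r_ref)
assert w_near > w_far    # inputs predicted to track the reference dominate
```

Shrinking $\sigma_r$ makes this weighting, and hence the controller, less tolerant of tracking error.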

3.1.4. Settings

The true dynamics are assumed to be the Lorenz system defined by Equation (36) and are solved in the simulation using the Runge–Kutta method. Here, the true parameters are $\sigma = 10$, $r = 28$, and $b = 8/3$. We apply our proposed method to the Lorenz system to suppress its chaotic behavior toward the unstable fixed point $y_f = \sqrt{b(r-1)}$. Note that we can only observe a noisy time series of the latent variable, not its true value.
The hyperparameters are set as follows. For simulation, the time step is $\Delta t = 0.01$. The covariance matrices of the system noise and the observation noise are $\Sigma_x = \mathrm{diag}(\sigma_{x_1}^2, \sigma_{x_2}^2, \sigma_{x_3}^2) = \mathrm{diag}(1^2, 1^2, 1^2)$ and $\Sigma_y = \mathrm{diag}(1^2, 1^2, 1^2)$, respectively. For the PF, the number of particles is $N = 1000$, and the threshold rate of resampling is $\alpha = 0.8$. The initial particles $\{(\mathbf{x}_t^{(i)}, w_t^{(i)})\}_{i=1}^N$ are sampled from the initial distribution, defined as follows:
$$p_0(\mathbf{x}) = \mathcal{N}(\mathbf{x} \mid \mathbf{x}_0, \Sigma_{x_0}),$$
where $\Sigma_{x_0} = \mathrm{diag}(10^2, 10^2, 10^2)$. For parameter estimation, we use the threshold rate of backward resampling $\beta = 0.7$ and the decay rate $\gamma_t = t^{-1}$. Following [16], we start updating the parameters after $T_{\mathrm{burnIn}}$ steps; in this experiment, we set $T_{\mathrm{burnIn}} = 100$. To control the Lorenz system, we set the number of steps in the horizon to $T_H = 10$. As hyperparameters in the augmented SSM, the acceptable error variance and the control input limit are $\sigma_r^2 = 1^2$ and $u_{\lim} = 10$, respectively. The variances of the augmented SSM's control transitions are $\sigma_u^2 = 1^2$ and $\sigma_{u_0}^2 = 10^2$.

3.1.5. Results

Figure 5 shows the simulation results after applying our proposed framework to the Lorenz system. We confirmed that our method can simultaneously estimate and control the Lorenz dynamics. To verify whether our proposed method can suppress chaotic behavior, the control input was not applied until $t = 5$.
The graphs on the left in Figure 5 show the $x$, $y$, and $z$ elements of the latent state from top to bottom, respectively. The state estimated by the PF represents the filtering distribution of the latent state. Hence, to confirm that we can estimate the latent variable, the estimated value was calculated as the weighted mean of the particles as follows:
$$\mathbf{x}_{\mathrm{est},t} = \sum_{i=1}^{N} w_t^{(i)} \mathbf{x}_t^{(i)}.$$
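This posterior-mean estimator is a single weighted average over the particle set; as a sketch (ours, with synthetic particles and weights):

```python
import numpy as np

rng = np.random.default_rng(1)
N = 1000
particles = rng.normal(loc=5.0, scale=1.0, size=(N, 3))   # {x_t^(i)}
weights = rng.random(N)
weights /= weights.sum()                                   # normalized {w_t^(i)}

# sum_i w_t^(i) x_t^(i): one weighted mean per state dimension
x_est = weights @ particles
assert x_est.shape == (3,)
```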
The three graphs on the left in Figure 5 show the estimated values $x_{\mathrm{est}}, y_{\mathrm{est}}, z_{\mathrm{est}}$ (blue solid lines) and the true values $x_{\mathrm{true}}, y_{\mathrm{true}}, z_{\mathrm{true}}$ (green dashed lines), respectively. The estimated values accurately tracked the true values over time. The middle graph in Figure 5 also shows the target unstable fixed point $y_f$ (blue dotted line), toward which the state was successfully manipulated. The red dash-dotted lines in the graphs on the left in Figure 5 represent the simulation results when no input was applied. Without our proposed method, the Lorenz system still exhibited chaotic behavior; conversely, after applying our proposed method, the Lorenz system stabilized and converged to the fixed point $y_f$.
To confirm whether our proposed method could suppress chaotic behavior, we visualized the trajectory of the latent variable $(x_t, y_t, z_t)$ for the non-controlled and controlled Lorenz systems in a three-dimensional graph. Figure 6 shows the trajectories of the non-controlled Lorenz system and of the Lorenz system controlled by our proposed automatic control law. The left side of Figure 6 shows the simulation results of the true dynamics when the control input was fixed at zero, $u_t = 0$. Without any control input, the Lorenz system exhibited chaotic behavior around its two fixed points. Conversely, by applying our proposed framework, as shown on the right in Figure 6, the Lorenz system stabilized around the fixed point determined in advance.
Next, we clarified whether our proposed method could estimate the unknown parameters. The graphs on the right in Figure 5 show the parameters governing the Lorenz system. Each of the three estimated parameter values $(\sigma_t, r_t, b_t)$ converged to the true values over time, although the estimate of $\sigma_t$ oscillated slightly. In particular, the bifurcation parameter $r_t$, which influences the dynamics of $y$, was accurately estimated by our AdaSmooth-based online EM algorithm. Additionally, $b_t$, governing the dynamics of $z$, was also estimated accurately.
Finally, we clarified whether our proposed method could estimate and control the Lorenz dynamics using a control input $u_t$ with a limited range of strengths. Figure 7 shows the control input $u_t$ (blue line) and the limit value (blue dash-dotted line). We maintained $u_t = 0$ during the time period marked with a gray background. The control input $u_t$ always satisfied the norm constraint $|u_t| \le u_{\lim}$. Hence, we can simultaneously estimate and control the Lorenz dynamics even under input constraints.

3.2. Application to Morris–Lecar Neuron Model

It is important to estimate and control the nonlinear dynamics of neurons to understand nervous systems and brain functions [31,32]. Nonetheless, it is difficult to control neuronal dynamics due to their complexity and partial observability; typically, only noisy membrane potentials are observable [33,34,35].
Following previous work [27], this section aims to control the time series of membrane potentials toward a given reference trajectory under realistic conditions, where only a noisy time series of membrane potentials can be observed. To this end, we first derive the SSM for the Morris–Lecar neuron model. We then derive the sufficient statistics and the update function for our AdaSmooth-based online EM algorithm for online parameter estimation. Finally, we formulate the augmented SSM for the Morris–Lecar neuron model to control the membrane potential. We apply the proposed method to the Morris–Lecar neuron model based on these SSM formulations.

3.2.1. Formulation of State-Space Model

In the Morris–Lecar neuron model [36], the nonlinear dynamics of the membrane potential $v$ and the channel variable $n$ are described by the following continuous-time model:
$$C_m \frac{dv}{dt} = -g_L(v - E_L) - g_{Ca}\, m_\infty(v)(v - E_{Ca}) - g_K\, n(v - E_K) + I,$$
$$\tau(v)\frac{dn}{dt} = -n + n_\infty(v),$$
where $v$ is the membrane potential, $n$ is the channel variable, and $I$ is the external input current. Here, $\tau(v) = 1 / \bigl(\phi \cosh\bigl(\frac{v - V_3}{2V_4}\bigr)\bigr)$ is a function that determines the speed of the dynamics. $m_\infty(v)$ and $n_\infty(v)$ are nonlinear functions that describe the calcium and potassium channel dynamics; $E_L$, $E_{Ca}$, and $E_K$ are the reversal potentials; and $C_m$ is the membrane capacitance [27]. To avoid confusion, the variables used in the Morris–Lecar neuron model are shown in Table 2.
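The gating functions take the standard Morris–Lecar sigmoidal forms [36,37], $m_\infty(v) = \frac{1}{2}[1 + \tanh((v - V_1)/V_2)]$ and $n_\infty(v) = \frac{1}{2}[1 + \tanh((v - V_3)/V_4)]$. The sketch below (ours; the constants $V_1$ to $V_4$ and $\phi$ are illustrative placeholders, not the paper's settings) implements them together with $\tau(v)$:

```python
import numpy as np

# Illustrative gating constants (placeholders, not the paper's values)
V1, V2, V3, V4 = -1.2, 18.0, 12.0, 17.4
phi = 0.23

def m_inf(v):
    """Steady-state calcium activation."""
    return 0.5 * (1.0 + np.tanh((v - V1) / V2))

def n_inf(v):
    """Steady-state potassium activation."""
    return 0.5 * (1.0 + np.tanh((v - V3) / V4))

def tau(v):
    """Channel time scale: 1 / (phi * cosh((v - V3) / (2 V4)))."""
    return 1.0 / (phi * np.cosh((v - V3) / (2.0 * V4)))

v = np.linspace(-80.0, 60.0, 200)
assert np.all((m_inf(v) >= 0.0) & (m_inf(v) <= 1.0))   # gating stays in [0, 1]
assert np.all(tau(v) > 0.0)                            # time scale is positive
assert np.argmax(tau(v)) == np.argmin(np.abs(v - V3))  # slowest near V3
```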
First, we derive the system model for the SSM. The continuous-time model given by Equation (53) is discretized with a time step $\Delta t$:
$$\mathbf{x}_{t+1} = \mathbf{x}_t + f(\mathbf{x}_t, u_t; \theta)\Delta t,$$
where $\mathbf{x}_t = [v_t, n_t]^\top \in \mathbb{R}^2$ corresponds to the latent state at time $t\Delta t$, $u_t \in \mathbb{R}$ corresponds to the control input at time $t\Delta t$, and $\theta = [g_L, g_{Ca}, g_K]^\top \in \mathbb{R}^3$ corresponds to the unknown parameters. Following previous work [27], we separate the net input $I_t$ into the controllable input $u_t$ and the known streaming currents from other neurons $I_t^{\mathrm{inj}}$, i.e., $I_t = u_t + I_t^{\mathrm{inj}}$. $f(\mathbf{x}_t, u_t; \theta)$ is a function representing the dynamics of the Morris–Lecar neuron model as follows:
$$f(\mathbf{x}_t, u_t; \theta) = \begin{bmatrix} -\dfrac{1}{C_m}\bigl\{ g_L(v_t - E_L) + g_{Ca}\, m_\infty(v_t)(v_t - E_{Ca}) + g_K\, n_t(v_t - E_K) \bigr\} + \dfrac{1}{C_m} I_t \\[6pt] -\phi \cosh\!\left(\dfrac{v_t - V_3}{2V_4}\right)\bigl(n_t - n_\infty(v_t)\bigr) \end{bmatrix}.$$
We then introduce system noise governed by additive white Gaussian noise with zero mean and covariance matrix $\Sigma_x = \mathrm{diag}(\sigma_v^2, \sigma_n^2) \in \mathbb{R}^{2 \times 2}$. The system model of the SSM for the neuron model is formulated as follows:
$$p(\mathbf{x}_{t+1} \mid \mathbf{x}_t, u_t, \theta) = \mathcal{N}\bigl(\mathbf{x}_{t+1} \bigm| \mathbf{x}_t + f(\mathbf{x}_t, u_t; \theta)\Delta t,\ \Sigma_x \Delta t\bigr).$$
Next, we formulate the observation model for the SSM. We consider that only noisy membrane potentials $y_t \in \mathbb{R}$ can be observed. Introducing additive Gaussian noise as observation noise, we formulate the observation model as follows:
$$y_t \sim \mathcal{N}(y_t \mid v_t, \sigma_y^2),$$
where $\sigma_y^2 \in \mathbb{R}$ is the variance corresponding to the observation noise.

3.2.2. Derivation of Online EM Algorithm

Here, we derive the sufficient statistics $S_t(\mathbf{x}_{0:t})$ and the update function $\Lambda(\cdot)$ for the neuron model described by Equations (56) and (57), which are used in our AdaSmooth-based online EM algorithm. After calculation, we obtain the maximizer of the expected log-likelihood, $\Lambda(\cdot)$, as follows:
$$\Lambda(S_t = (A_t, b_t)) = A_t^{-1} b_t,$$
where the sufficient statistics $S_t = (A_t, b_t)$ include two additive-form statistics $A_t(\mathbf{x}_{0:t}) \in \mathbb{R}^{3 \times 3}$ and $b_t(\mathbf{x}_{0:t}) \in \mathbb{R}^3$, given as follows:
$$A_t(\mathbf{x}_{0:t}) = \sum_{\tau=1}^{t} \left(\frac{\Delta t}{C_m}\right)^2 V_{\mathrm{ion}}(\mathbf{x}_{\tau-1})\, V_{\mathrm{ion}}(\mathbf{x}_{\tau-1})^\top,$$
$$b_t(\mathbf{x}_{0:t}) = \sum_{\tau=1}^{t} \tilde{b}_\tau(\mathbf{x}_{\tau-1:\tau}), \qquad \tilde{b}_\tau(\mathbf{x}_{\tau-1:\tau}) = -\frac{\Delta t}{C_m}\left(v_\tau - v_{\tau-1} - \frac{\Delta t}{C_m} I_{\tau-1}\right) V_{\mathrm{ion}}(\mathbf{x}_{\tau-1}).$$
Here, $V_{\mathrm{ion}}(\mathbf{x}_t) \in \mathbb{R}^3$ represents the driving potentials of each ion channel, given as follows:
$$V_{\mathrm{ion}}(\mathbf{x}_t) = \begin{bmatrix} v_t - E_L \\ m_\infty(v_t)(v_t - E_{Ca}) \\ n_t(v_t - E_K) \end{bmatrix}.$$
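As with the Lorenz case, the voltage drift is linear in the conductances, so $\Lambda(S_t) = A_t^{-1} b_t$ amounts to a least-squares estimate of $(g_L, g_{Ca}, g_K)$. The sketch below (ours; the reversal potentials and gating constants are illustrative placeholders, not the paper's settings) applies the per-transition statistics to noise-free Euler voltage transitions at randomly drawn latent states and recovers the true conductances:

```python
import numpy as np

rng = np.random.default_rng(1)
C_m, dt = 20.0, 0.1
E_L, E_Ca, E_K = -60.0, 120.0, -84.0    # illustrative reversal potentials
V1, V2 = -1.2, 18.0                     # illustrative gating constants
g_true = np.array([2.0, 4.0, 8.0])      # true (g_L, g_Ca, g_K) to recover

def m_inf(v):
    """Standard Morris-Lecar calcium activation curve."""
    return 0.5 * (1.0 + np.tanh((v - V1) / V2))

def V_ion(v, n):
    """Driving potentials [v - E_L, m_inf(v)(v - E_Ca), n(v - E_K)]."""
    return np.array([v - E_L, m_inf(v) * (v - E_Ca), n * (v - E_K)])

A = np.zeros((3, 3))
b = np.zeros(3)
for _ in range(200):
    v0 = rng.uniform(-80.0, 40.0)       # randomly drawn latent state (v, n)
    n0 = rng.uniform(0.0, 1.0)
    I = rng.uniform(-50.0, 50.0)
    V = V_ion(v0, n0)
    # One noise-free Euler voltage transition of the model
    v1 = v0 + dt / C_m * (-(g_true @ V) + I)
    # Per-transition sufficient statistics
    A += (dt / C_m) ** 2 * np.outer(V, V)
    b += -(dt / C_m) * (v1 - v0 - (dt / C_m) * I) * V

g_hat = np.linalg.solve(A, b)           # M-step: Lambda(S_t) = A_t^{-1} b_t
```

In the full algorithm the same statistics are averaged over the smoothed particle approximation of the latent trajectory rather than known states.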

3.2.3. Formulation of the Augmented State-Space Model for Control

Here, we formulate the augmented SSM over the horizon. First, we derive the control transition of the augmented system model $p(\bar{u}_{\tau+1} \mid \zeta_\tau)$. In realistic scenarios, it is essential to guarantee that the net input current $I_t = u_t + I_t^{\mathrm{inj}}$ always stays within a specific range. Therefore, we impose a norm constraint on the control input at every time $t$ as follows:
$$|u_t + I_t^{\mathrm{inj}}| \le I_{\lim},$$
where $I_t^{\mathrm{inj}}$ is the current injected from other neurons and $I_{\lim}$ is the limit current. To satisfy this constraint, the control transition within the horizon is described as follows:
$$\bar{u}_{\tau+1} = \mathrm{clip}_{I_{\lim}}\bigl(\bar{u}_\tau + \bar{I}_{\tau+1}^{\mathrm{inj}} + z_{\tau+1}\bigr) - \bar{I}_{\tau+1}^{\mathrm{inj}},$$
where $z_{\tau+1} \sim \mathcal{N}(0, \sigma_I^2)$ is Gaussian noise. The proposed framework fixes the prediction of the external currents $\bar{I}_{\tau+1}^{\mathrm{inj}}$ at the value observed at time $t$, i.e., $\bar{I}_{\tau+1}^{\mathrm{inj}} = I_t^{\mathrm{inj}}$ for $\tau \in \{t, t+1, \ldots, t+T_H-1\}$. Furthermore, the initialization of the control input particles is described as follows:
$$\bar{u}_t^{(i)} = \mathrm{clip}_{I_{\lim}}\bigl(u_{t-1}^{(i)} + \bar{I}_t^{\mathrm{inj}} + z_t^{(i)}\bigr) - \bar{I}_t^{\mathrm{inj}},$$
where $z_t^{(i)} \sim \mathcal{N}(0, \sigma_{I_0}^2)$ is Gaussian noise.
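Adding the known injected current before clipping and subtracting it afterwards guarantees that the net current, rather than $u$ alone, respects the limit. A small check (ours; the values are illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)
I_lim, sigma_I = 150.0, 1.0

def clip(x, lim):
    """Hard saturation to [-lim, lim]."""
    return max(-lim, min(lim, x))

def input_transition(u_bar, I_inj):
    """u_bar_{tau+1} = clip_{I_lim}(u_bar_tau + I_inj + z) - I_inj."""
    z = rng.normal(0.0, sigma_I)
    return clip(u_bar + I_inj + z, I_lim) - I_inj

u_bar, I_inj = 140.0, 30.0
for _ in range(100):
    u_bar = input_transition(u_bar, I_inj)
    assert abs(u_bar + I_inj) <= I_lim   # net current constraint holds every step
```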
Next, we derive the augmented observation model for the augmented SSM over the horizon. We define the following augmented observation model to control the time series of membrane potentials toward a given reference trajectory $r_t \in \mathbb{R}$:
$$r_t \sim \mathcal{N}(r_t \mid v_t, \sigma_r^2),$$
where $\sigma_r^2 \in \mathbb{R}$ is a variance that adjusts the acceptable error between the predicted state trajectory and the reference trajectory within the horizon.

3.2.4. Settings

The true dynamics are assumed to follow the Morris–Lecar neuron model defined by Equation (53) and are solved in the simulation using the Euler method. Here, the true parameters are those of the homoclinic regime [37]. We apply our proposed method to the Morris–Lecar neuron model to control the membrane potentials $v_t$ toward the desired reference membrane potentials $r_t$. The reference trajectory $r_t$ is generated by the true model in advance. Note that we can only observe noisy membrane potentials; that is, we assume a partial observation setting.
The hyperparameters are set as follows. For simulation, the time step is $\Delta t = 0.1$. The covariance matrix of the system noise and the variance of the observation noise are $\Sigma_x = \mathrm{diag}(\sigma_v^2, \sigma_n^2) = \mathrm{diag}(0.1^2, 0.001^2)$ and $\sigma_y^2 = 0.1^2$, respectively. For the PF, the number of particles is $N = 1000$, and the threshold rate of resampling is $\alpha = 0.8$. The initial distribution for the initial particles is $p_0(\mathbf{x}) = \mathcal{N}(v \mid \frac{2}{3}E_L, 10^2)\,\mathcal{U}(n \mid 0, 1)$, where $\mathcal{U}(n \mid 0, 1)$ is a uniform distribution on $[0, 1]$. For parameter estimation, we use the threshold rate of backward resampling $\beta = 0.7$ and the decay rate $\gamma_t = t^{-1}$. We update the parameters after $T_{\mathrm{burnIn}} = 1000$ steps. To control the Morris–Lecar neuron model, we set the number of steps in the horizon to $T_H = 10$. As hyperparameters in the augmented SSM, the acceptable error variance and the control input limit are set to $\sigma_r^2 = 0.1^2$ and $I_{\lim} = 150$, respectively. The variances of the augmented SSM's control transitions are $\sigma_I^2 = 1^2$ and $\sigma_{I_0}^2 = 10^2$.

3.2.5. Results

Figure 8 shows the simulation results after applying our proposed framework to the Morris–Lecar neuronal system. Here, we controlled the neuronal dynamics toward the reference trajectory, which represents homoclinic firing during the time intervals $0 \le t \le 250$ and $500 \le t \le 750$, and the resting state during the intervals $250 \le t \le 500$ and $750 \le t \le 1000$. First, we found that our method could simultaneously estimate and control the Morris–Lecar neuron model. The middle-left graph shows the true membrane potential $v_{\mathrm{true}}$ (blue solid line), the estimated value $v_{\mathrm{est}}$ (green dashed line), and the target membrane potential $v_{\mathrm{ref}}$ (red dotted line). The estimated value tracked the true value, as shown in the middle-left graph in Figure 8. In particular, the pale purple line represents the simulation results without any control input. Without our proposed method, the Morris–Lecar neuron model converged to a specific value. By applying our proposed method, the Morris–Lecar neuron model was controlled toward the desired trajectory, allowing us to switch between neuronal firing and resting states. The bottom-left graph shows the true value $n_{\mathrm{true}}$ (blue solid line) and the estimated value $n_{\mathrm{est}}$ (green dashed line); the estimated value also closely tracked the true value.
Next, we verified whether our proposed method could estimate the parameters governing the dynamics of the neuron model. The right side of Figure 8 shows the parameters $(g_L, g_{Ca}, g_K)$ governing the Morris–Lecar neuron model. As shown in the three graphs on the right, each estimated parameter value converged to the true value over time, although the estimates spiked around $t = 500$ to $600$.
Finally, we verified whether the proposed framework could estimate and control the neuronal dynamics under varying strengths of the control input $u_t$. The top-left graph shows the controllable input current $u$ (blue solid line), the injected current $I^{\mathrm{inj}}$ (green dashed line), the actual net input $I = u + I^{\mathrm{inj}}$, and the limit value (light-blue dash-dotted line). The net input $I = u + I^{\mathrm{inj}}$ always satisfied the norm constraint $|u + I^{\mathrm{inj}}| \le I_{\lim}$. Hence, we can simultaneously estimate and control the nonlinear neuronal dynamics while adhering to the constraint on the control signal.

4. Conclusions

In this paper, we have proposed a probabilistic framework for simultaneously estimating and controlling general dynamics entirely based on the particle filter. By introducing the particle filter not only as a state estimator and a prior estimator for the dynamics but also as a controller, we can directly handle the nonlinearity of the dynamics and the uncertainty of the latent state effectively and efficiently. With adaptive backward resampling, the proposed framework can suppress the degeneracy of the particles with respect to the sufficient statistics and realize accurate parameter estimation. Through experiments on two general models, including nonlinearity in distinct fields, we verified the effectiveness of our proposed framework. By applying our proposed framework to the Lorenz system in a simulation environment, we found that the proposed framework can not only estimate the latent variables from noisy observational data and the parameters governing chaotic behavior but also control the latent state toward the fixed point. Furthermore, by applying our proposed framework to the Morris–Lecar neuron model, we showed that even under noisy partial observation conditions, it can estimate the latent state and parameters and control the latent state toward the desired trajectory. Although our proposed method can simultaneously estimate the latent state and parameters and control nonlinear complex dynamics, our framework cannot handle semi-parametric or non-parametric models directly. In future work, we plan to extend our proposed framework to handle such semi-parametric or non-parametric models.

Author Contributions

Conceptualization, T.O. (Toshiaki Omori); Methodology, T.O. (Taketo Omi) and T.O. (Toshiaki Omori); Software, T.O. (Taketo Omi); Formal analysis, T.O. (Taketo Omi) and T.O. (Toshiaki Omori); Investigation, T.O. (Taketo Omi) and T.O. (Toshiaki Omori); Writing—original draft, T.O. (Taketo Omi); Writing—review & editing, T.O. (Toshiaki Omori); Supervision, T.O. (Toshiaki Omori); Project administration, T.O. (Toshiaki Omori); Funding acquisition, T.O. (Toshiaki Omori). All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by Grants-in-Aid for Scientific Research (B) (Nos. JP21H03509 and JP23K21702); the Fund for the Promotion of Joint International Research (International Collaborative Research) (No. JP23KK0184); MEXT, Japan; CREST (No. JPMJCR1914), JST, Japan; and AMED (No. 23wm052517h0003), Japan.

Data Availability Statement

The original contributions presented in this study are included in the article; further inquiries can be directed to the corresponding author (Toshiaki Omori).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations were used in this manuscript:
SSM: State-space model
PF: Particle filter
EM algorithm: Expectation-maximization algorithm
AdaSmooth: Adaptive smoothing
MPC: Model predictive control
PF-MPC: Particle filter-based model predictive control

References

  1. Cheng, S.; Quilodrán-Casas, C.; Ouala, S.; Farchi, A.; Liu, C.; Tandeo, P.; Fablet, R.; Lucor, D.; Iooss, B.; Brajard, J.; et al. Machine learning with data assimilation and uncertainty quantification for dynamical systems: A review. IEEE/CAA J. Autom. Sin. 2023, 10, 1361–1387. [Google Scholar] [CrossRef]
  2. Inoue, H.; Hukushima, K.; Omori, T. Estimating distributions of parameters in nonlinear state space models with replica exchange particle marginal Metropolis–Hastings method. Entropy 2022, 24, 115. [Google Scholar] [CrossRef] [PubMed]
  3. Ito, M.; Kuwatani, T.; Oyanagi, R.; Omori, T. Data-driven analysis of nonlinear heterogeneous reactions through sparse modeling and Bayesian statistical approaches. Entropy 2021, 23, 824. [Google Scholar] [CrossRef] [PubMed]
  4. Omori, T.; Kuwatani, T.; Okamoto, A.; Hukushima, K. Bayesian inversion analysis of nonlinear dynamics in surface heterogeneous reactions. Phys. Rev. E 2016, 94, 033305. [Google Scholar] [CrossRef] [PubMed]
  5. Ditlevsen, S.; Samson, A. Estimation in the partially observed stochastic Morris–Lecar neuronal model with particle filter and stochastic approximation methods. Ann. Appl. Stat. 2014, 8, 674–702. [Google Scholar] [CrossRef]
  6. Azza, L.J.; Crompton, D.; D’Eleuterio, G.M.T.; Skinner, F.; Lankarany, M. Adaptive unscented Kalman filter for neuronal state and parameter estimation. J. Comput. Neurosci. 2023, 51, 223–237. [Google Scholar] [CrossRef]
  7. Chan, J.C.; Strachan, R.W. Bayesian state space models in macroeconometrics. J. Econ. Surv. 2023, 37, 58–75. [Google Scholar] [CrossRef]
  8. Newman, K.; King, R.; Elvira, V.; de Valpine, P.; McCrea, R.; Morgan, B.J.T. State-space models for ecological time-series data: Practical model-fitting. Methods Ecol. Evol. 2023, 14, 26–42. [Google Scholar] [CrossRef]
  9. Ahwiadi, M.; Wang, W. An enhanced particle filter technology for battery system state estimation and RUL prediction. Measurement 2022, 191, 110817. [Google Scholar] [CrossRef]
  10. El-Dalahmeh, M.; Al-Greer, M.; El-Dalahmeh, M.; Bashir, I. Physics-based model informed smooth particle filter for remaining useful life prediction of lithium-ion battery. Measurement 2023, 214, 112838. [Google Scholar] [CrossRef]
  11. Kitagawa, G. A Monte Carlo filtering and smoothing method for non-Gaussian nonlinear state space models. In Proceedings of the 2nd U.S.-Japan Joint Seminar on Statistical Time Series, Honolulu, HI, USA, 25–29 January 1993. [Google Scholar]
  12. Doucet, A.; de Freitas, N.; Gordon, N. Sequential Monte Carlo Methods in Practice; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar]
  13. Wills, A.G.; Schön, T.B. Sequential Monte Carlo: A unified review. Annu. Rev. Control Robot. Auton. Syst. 2023, 6, 159–182. [Google Scholar] [CrossRef]
  14. Kantas, N.; Doucet, A.; Singh, S.S.; Maciejowski, J.; Chopin, N. On particle methods for parameter estimation in state-space models. Stat. Sci. 2015, 30, 328–351. [Google Scholar] [CrossRef]
  15. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 1977, 39, 1–22. [Google Scholar] [CrossRef]
  16. Olsson, J.; Westerborn, J. An efficient particle-based online EM algorithm for general state-space models. IFAC-Pap. 2015, 48, 963–968. [Google Scholar] [CrossRef]
  17. Schwenzer, M.; Ay, M.; Bergs, T.; Abel, D. Review on model predictive control: An engineering perspective. Int. J. Adv. Manuf. Technol. 2021, 117, 1327–1349. [Google Scholar] [CrossRef]
  18. Stahl, D.; Hauth, J. PF-MPC: Particle filter-model predictive control. Syst. Control Lett. 2011, 60, 632–643. [Google Scholar] [CrossRef]
  19. Mastrototaro, A.; Olsson, J.; Alenlöv, J. Fast and numerically stable particle-based online additive smoothing: The AdaSmooth algorithm. J. Am. Stat. Assoc. 2024, 119, 356–367. [Google Scholar] [CrossRef]
  20. Särkkä, S. Bayesian Filtering and Smoothing; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
  21. Douc, R.; Cappe, O. Comparison of resampling schemes for particle filtering. In Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis 2005, Zagreb, Croatia, 15–17 September 2005; pp. 64–69. [Google Scholar] [CrossRef]
  22. Hol, J.D.; Schon, T.B.; Gustafsson, F. On resampling algorithms for particle filters. In Proceedings of the 2006 IEEE Nonlinear Statistical Signal Processing Workshop 2006, Cambridge, UK, 13–15 September 2006; pp. 79–82. [Google Scholar] [CrossRef]
  23. Doucet, A.; Godsill, S.; Andrieu, C. On sequential Monte Carlo sampling methods for Bayesian filtering. Stat. Comput. 2000, 10, 197–208. [Google Scholar] [CrossRef]
  24. Neal, R.M.; Hinton, G.E. A View of the EM Algorithm That Justifies Incremental, Sparse, and Other Variants; Springer: Dordrecht, The Netherlands, 1998; pp. 355–368. [Google Scholar] [CrossRef]
  25. Cappé, O.; Moulines, E. Online EM algorithm for latent data models. J. R. Stat. Soc. Ser. B Stat. Methodol. 2007, 71, 593–613. [Google Scholar] [CrossRef]
  26. Olsson, J.; Westerborn, J. Efficient particle-based online smoothing in general hidden Markov models: The PaRIS algorithm. Bernoulli 2017, 23, 1951–1996. [Google Scholar] [CrossRef]
  27. Omi, T.; Omori, T. Simultaneously estimating and controlling nonlinear neuronal dynamics based on sequential Monte Carlo framework. Nonlinear Theory Appl. 2024, 15, 237–248. [Google Scholar] [CrossRef]
  28. Lorenz, E.N. Deterministic nonperiodic flow. J. Atmos. Sci. 1963, 20, 130–141. [Google Scholar] [CrossRef]
  29. Pyragas, K. Continuous control of chaos by self-controlling feedback. Phys. Lett. A 1992, 170, 421–428. [Google Scholar] [CrossRef]
  30. Pyragas, V.; Pyragas, K. Delayed feedback control of the Lorenz system: An analytical treatment at a subcritical Hopf bifurcation. Phys. Rev. E 2006, 73, 036215. [Google Scholar] [CrossRef] [PubMed]
  31. Shine, J.M.; Müller, E.J.; Munn, B.; Cabral, J.; Moran, R.J.; Breakspear, M. Computational models link cellular mechanisms of neuromodulation to large-scale neural dynamics. Nat. Neurosci. 2021, 24, 765–776. [Google Scholar] [CrossRef] [PubMed]
  32. Chialvo, D.R. Emergent complex neural dynamics. Nat. Phys. 2010, 6, 744–750. [Google Scholar] [CrossRef]
  33. Vogt, N. Voltage imaging in vivo. Nat. Rev. Neurosci. 2019, 16, 573. [Google Scholar] [CrossRef] [PubMed]
  34. Thomas, K.; Chenchen, S. Optical voltage imaging in neurons: Moving from technology development to practical tool. Nat. Rev. Neurosci. 2019, 20, 719–727. [Google Scholar]
  35. Peterka, D.S.; Takahashi, H.; Yuste, R. Imaging voltage in neurons. Neuron 2011, 69, 9–21. [Google Scholar] [CrossRef]
  36. Morris, C.; Lecar, H. Voltage oscillations in the barnacle giant muscle fiber. Biophys. J. 1981, 35, 193–213. [Google Scholar] [CrossRef]
  37. Ermentrout, G.B.; Terman, D.H. Mathematical Foundations of Neuroscience; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar]
Figure 1. Overview of the proposed framework for probabilistic estimation and control of dynamical systems. (i) We apply a particle filter to estimate the latent state $\mathbf{x}_t$ from noisy observational data $\mathbf{y}_t$ online using data assimilation. (ii) We apply our adaptive smoothing (AdaSmooth)-based online expectation-maximization (EM) algorithm to estimate the dynamics parameter $\theta$ using the particles obtained from the state estimator. (iii) We employ another particle filter based on model predictive control to determine the approximate feedback control signal $u_t$ for controlling the target dynamics.
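The state estimator in panel (i) is a standard bootstrap particle filter. A minimal sketch of one predict-weight-resample step, applied to a toy 1D random-walk model observed in Gaussian noise (all names and parameter values here are illustrative, not the paper's implementation):

```python
import numpy as np

def particle_filter_step(particles, weights, transition, log_likelihood, y_t, rng):
    """One predict-weight-resample step of a bootstrap particle filter."""
    N = len(particles)
    # Resample according to the current weights (multinomial resampling for
    # brevity; systematic resampling has lower variance).
    idx = rng.choice(N, size=N, p=weights)
    particles = transition(particles[idx], rng)   # predict via p(x_t | x_{t-1})
    logw = log_likelihood(y_t, particles)         # weight by p(y_t | x_t)
    w = np.exp(logw - logw.max())                 # numerically stable normalization
    return particles, w / w.sum()

# Toy usage: 1D random-walk state, Gaussian observation noise (std 0.2).
rng = np.random.default_rng(0)
N = 2000
particles = rng.normal(size=(N, 1))
weights = np.full(N, 1.0 / N)
transition = lambda x, rng: x + rng.normal(scale=0.1, size=x.shape)
loglik = lambda y, x: -0.5 * ((y - x[:, 0]) / 0.2) ** 2
particles, weights = particle_filter_step(particles, weights, transition, loglik, 0.5, rng)
estimate = float(weights @ particles[:, 0])       # posterior-mean state estimate
```

The weighted particle set approximates the filtering distribution $p(x_t \mid y_{1:t})$; the posterior mean here is pulled from the prior toward the observation $y_t = 0.5$.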
Figure 2. The overview of the state-space model (SSM).
Figure 3. The flow of our proposed AdaSmooth-based online EM algorithm. (E-step) When the latent-state particles $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^{N}$ at time $t$ are obtained by the particle filter (PF) with the new observation $y_t$ and the current parameter value $\hat{\theta}_{t-1}$, we update the sufficient statistics $S_t(x_{0:t})$ using AdaSmooth, which incorporates the particles at two subsequent times $t-1$ and $t$: $\{(x_{t-1}^{(i)}, w_{t-1}^{(i)})\}_{i=1}^{N}$ and $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^{N}$. (M-step) Based on the updated sufficient statistics $S_t(x_{0:t})$, we estimate the parameter value $\hat{\theta}_t$ online. By repeating the E-step and M-step as new observational data are obtained, we can efficiently estimate the dynamics online.
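The E-step/M-step cycle above can be sketched for a toy model. AdaSmooth itself is the paper's adaptive backward-sampling smoother; the sketch below substitutes the simplest pairwise approximation, combining weighted particle pairs from times $t-1$ and $t$ with a decaying step size, for a scalar AR(1) model whose M-step is closed-form (all names and values are illustrative assumptions):

```python
import numpy as np

def online_em_step(S, pairs, weights, t, gamma_exp=0.6):
    """One E/M iteration of an online EM update for a scalar AR(1) model
    x_t = theta * x_{t-1} + noise. `pairs` holds particle pairs
    (x_{t-1}^{(i)}, x_t^{(i)}) at two subsequent times, `weights` their
    normalized weights, and S the running sufficient statistics."""
    gamma = (t + 1) ** (-gamma_exp)                   # decaying step size
    # E-step: weighted expectation of the complete-data sufficient statistics.
    stat = np.array([
        np.sum(weights * pairs[:, 0] * pairs[:, 1]),  # E[x_{t-1} x_t]
        np.sum(weights * pairs[:, 0] ** 2),           # E[x_{t-1}^2]
    ])
    S = (1.0 - gamma) * S + gamma * stat              # stochastic-approximation update
    # M-step: closed-form maximizer of the expected complete-data likelihood.
    theta_hat = S[0] / S[1]
    return S, theta_hat

# Toy usage: feed true consecutive states as degenerate "particles"
# (true theta = 0.8); theta_hat should approach 0.8 over time.
rng = np.random.default_rng(1)
x, S = 1.0, np.array([0.0, 1.0])
for t in range(5000):
    x_new = 0.8 * x + rng.normal(scale=0.5)
    S, theta_hat = online_em_step(S, np.array([[x, x_new]]), np.array([1.0]), t)
    x = x_new
```

The decaying step size trades off tracking speed against estimator variance, which is why a well-chosen (adaptive) smoother matters for the real algorithm.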
Figure 4. Overview of particle filter-based model predictive control (PF-MPC) integrated with our AdaSmooth-based online EM algorithm. The particle filter for state estimation (state PF) provides the distribution of latent variables $p(x_t \mid y_{1:t})$ as particles and their corresponding weights $\{(x_t^{(i)}, w_t^{(i)})\}_{i=1}^{N}$. These particles and weights, obtained via the state PF, are directly used as initial augmented particles by the PF for control over the horizon (control PF). The input particles $\{u_t^{(i)}\}_{i=1}^{N}$ are sampled from the proposal distribution $p(\bar{u}_t \mid u_{t-1})$. Using the control PF, we obtain the filtering distribution of the optimal control $p(\bar{u}_t \mid r_{t:t+T_H})$ given the reference trajectory; the optimal control input is then calculated using a specific point estimator such as the posterior mean.
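The control PF can be sketched as follows: sample candidate inputs from the proposal around the previous input, roll each particle forward over the horizon, and weight each candidate by a Gaussian pseudo-likelihood of its tracking error against the reference trajectory. This is a simplified single-input variant under assumed noise scales, not the paper's exact algorithm:

```python
import numpy as np

def pf_mpc_control(x0, w0, dynamics, reference, u_prev, u_lim,
                   sigma_u=0.5, sigma_r=0.3, rng=None):
    """Sketch of PF-MPC: the control input is treated as a latent variable and
    the reference trajectory as pseudo-observations. Returns the weighted-mean
    first input, clipped to the constraint |u| <= u_lim."""
    rng = rng or np.random.default_rng()
    N = len(w0)
    # Proposal p(u | u_prev): perturb the previous input, respect the limit.
    u = np.clip(u_prev + sigma_u * rng.normal(size=N), -u_lim, u_lim)
    x = x0.copy()
    logw = np.log(w0)
    for r_k in reference:                                  # roll out the horizon
        x = dynamics(x, u, rng)
        logw += -0.5 * ((x[:, 0] - r_k) / sigma_r) ** 2    # pseudo-likelihood
    w = np.exp(logw - logw.max())
    w /= w.sum()
    return float(np.clip(w @ u, -u_lim, u_lim))

# Toy usage: steer a stable scalar linear system toward a constant reference.
rng = np.random.default_rng(2)
N = 4000
x0 = np.zeros((N, 1))
w0 = np.full(N, 1.0 / N)
dynamics = lambda x, u, rng: 0.9 * x + u[:, None] + 0.01 * rng.normal(size=x.shape)
u_star = pf_mpc_control(x0, w0, dynamics, reference=[1.0] * 5,
                        u_prev=0.0, u_lim=2.0, rng=rng)
```

Because the state PF's particles seed the rollout, the controller automatically accounts for the remaining uncertainty in the latent state.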
Figure 5. Results for simultaneously estimating and controlling the Lorenz system. The dynamical behavior of the state variables $x_t = [x_t, y_t, z_t]$ and the model parameters $\theta = [\sigma, r, b]$ is shown. By applying the reference signal [the fixed point $y_f = \sqrt{b(r-1)}$], all three variables $(x, y, z)$ approach the steady point $(x_f, y_f, z_f) = (\sqrt{b(r-1)}, \sqrt{b(r-1)}, r-1)$. The latent state $(x_t, y_t, z_t)$ is estimated by our proposed method over time. In addition, all of the estimated parameters converge toward their true values from the initial values, although $\sigma$ oscillates slightly. To first confirm that the model exhibits chaotic behavior, the control input is applied only after time $t = 5$ (gray dashed line in the middle-left graph).
Figure 6. Three-dimensional visualization of trajectories of the non-controlled and controlled Lorenz systems. Left: the non-controlled trajectory obtained through simulation, starting from the same initial state used in the experiment. Right: the controlled trajectory obtained through simultaneous estimation and control of the dynamics using our proposed method. The arrows on each trajectory indicate the direction of the transitions. By applying our proposed framework, the Lorenz system stabilizes around the desired fixed point $(x, y, z) = (\sqrt{b(r-1)}, \sqrt{b(r-1)}, r-1)$.
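For reference, the nontrivial fixed points of the Lorenz equations $\dot{x} = \sigma(y-x)$, $\dot{y} = x(r-z)-y$, $\dot{z} = xy-bz$ lie at $x_f = y_f = \pm\sqrt{b(r-1)}$, $z_f = r-1$, which is the steady point targeted above. A minimal check, assuming the classic parameter values $\sigma = 10$, $r = 28$, $b = 8/3$ (the experiment's exact values are not restated here):

```python
import numpy as np

def lorenz_rhs(state, sigma=10.0, r=28.0, b=8.0 / 3.0):
    """Right-hand side of the Lorenz equations (classic parameter values
    assumed for illustration)."""
    x, y, z = state
    return np.array([sigma * (y - x), x * (r - z) - y, x * y - b * z])

# Nontrivial fixed point: x_f = y_f = sqrt(b(r-1)), z_f = r - 1.
r, b = 28.0, 8.0 / 3.0
xf = np.sqrt(b * (r - 1))
fixed_point = np.array([xf, xf, r - 1])
residual = lorenz_rhs(fixed_point)  # the vector field vanishes here
```

Each component cancels term by term: $\sigma(y_f - x_f) = 0$ since $x_f = y_f$; $x_f(r - z_f) - y_f = x_f - x_f = 0$; and $x_f y_f - b z_f = b(r-1) - b(r-1) = 0$.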
Figure 7. The history of the control input $u_t$ obtained automatically by our proposed method. Note that the control input always satisfies the absolute-value constraint $|u_t| \le u_{\mathrm{lim}}$. Nonetheless, Figure 5 shows that we can simultaneously estimate and control chaotic nonlinear dynamical systems even under this constraint.
Figure 8. Results for simultaneously estimating and controlling the Morris–Lecar neuronal system. The dynamics of the state variables $x_t = [v_t, n_t]$, the control input $u$, and the model parameters $\theta = [g_L, g_{Ca}, g_K]$ are shown. Top-left graph: control current $u$, injected current $I_{\mathrm{inj}}$, and net input $u + I_{\mathrm{inj}}$ to the neuron, along with the control limit value. Middle-left graph: true membrane potential $v_{\mathrm{true}}$, estimated membrane potential $v_{\mathrm{est}}$, and reference trajectory $v_{\mathrm{ref}}$, along with the non-controlled trajectory. Bottom-left graph: true channel variable $n_{\mathrm{true}}$ and estimated channel variable $n_{\mathrm{est}}$. Right three graphs: estimated values of the parameters $g_L$, $g_{Ca}$, $g_K$, with true values indicated by dashed lines. We can simultaneously estimate and control the Morris–Lecar neuronal dynamics by applying our proposed method.
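The Morris–Lecar dynamics referenced in the caption can be sketched with a simple Euler integrator. The conductances $g_L$, $g_{Ca}$, $g_K$ are the parameters estimated in Figure 8; the remaining constants below are typical textbook values (the paper's exact settings may differ):

```python
import numpy as np

def morris_lecar_step(v, n, I, dt=0.05,
                      g_L=2.0, g_Ca=4.4, g_K=8.0,        # conductances (mS/cm^2)
                      E_L=-60.0, E_Ca=120.0, E_K=-84.0,  # reversal potentials (mV)
                      C=20.0, phi=0.04,
                      V1=-1.2, V2=18.0, V3=2.0, V4=30.0):
    """One Euler step of the Morris-Lecar model (typical textbook parameter
    values assumed; not the paper's exact settings)."""
    m_inf = 0.5 * (1.0 + np.tanh((v - V1) / V2))   # instantaneous Ca activation
    n_inf = 0.5 * (1.0 + np.tanh((v - V3) / V4))   # K activation steady state
    tau_n = 1.0 / np.cosh((v - V3) / (2.0 * V4))   # K activation time scale
    dv = (I - g_L * (v - E_L) - g_Ca * m_inf * (v - E_Ca)
          - g_K * n * (v - E_K)) / C
    dn = phi * (n_inf - n) / tau_n
    return v + dt * dv, n + dt * dn

# Toy usage: integrate 1000 ms under a constant injected current.
v, n = -60.0, 0.0
vs = []
for _ in range(20000):
    v, n = morris_lecar_step(v, n, I=90.0)
    vs.append(v)
```

In the estimation setting, only a noisy version of $v$ is observed, while $n$ and the conductances must be inferred, which is exactly the role of the state PF and the online EM algorithm above.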
Table 1. Input and output variables used in the Lorenz system.

         Variable   Description
Input    u          Control input
Output   x, y, z    Latent state
Output   σ, r, b    Parameters
Table 2. Input and output variables used in the Morris–Lecar neuron model.

         Variable         Description
Input    I                External input current
Output   v, n             Latent state
Output   g_L, g_Ca, g_K   Parameters