How Not to Make the Joint Extended Kalman Filter Fail with Unstructured Mechanistic Models

Iglesias, Cristovão Freitas; Bolic, Miodrag

doi:10.3390/s24020653

Open AccessArticle

How Not to Make the Joint Extended Kalman Filter Fail with Unstructured Mechanistic Models

by

Cristovão Freitas Iglesias, Jr.

^*

and

Miodrag Bolic

^*

School of Electrical Engineering and Computer Science (EECS), University of Ottawa, Ottawa, ON K1N 6N5, Canada

^*

Authors to whom correspondence should be addressed.

Sensors 2024, 24(2), 653; https://doi.org/10.3390/s24020653

Submission received: 21 November 2023 / Revised: 22 December 2023 / Accepted: 6 January 2024 / Published: 19 January 2024

(This article belongs to the Special Issue Soft Sensors and Sensing Techniques)

Download

Browse Figures

Versions Notes

Abstract

The unstructured mechanistic model (UMM) allows for modeling the macro-scale of a phenomenon without known mechanisms. This is extremely useful in biomanufacturing because using the UMM for the joint estimation of states and parameters with an extended Kalman filter (JEKF) can enable the real-time monitoring of bioprocesses with unknown mechanisms. However, the UMM commonly used in biomanufacturing contains ordinary differential equations (ODEs) with unshared parameters, weak variables, and weak terms. When such a UMM is coupled with an initial state error covariance matrix

P (t = 0)

and a process error covariance matrix

Q

with uncorrelated elements, along with just one measured state variable, the joint extended Kalman filter (JEKF) fails to estimate the unshared parameters and state simultaneously. This is because the Kalman gain corresponding to the unshared parameter remains constant and equal to zero. In this work, we formally describe this failure case, present the proof of JEKF failure, and propose an approach called SANTO to side-step this failure case. The SANTO approach consists of adding a quantity to the state error covariance between the measured state variable and unshared parameter in the initial P(t = 0) of the matrix Ricatti differential equation to compute the predicted error covariance matrix of the state and prevent the Kalman gain from being zero. Our empirical evaluations using synthetic and real datasets reveal significant improvements: SANTO achieved a reduction in root-mean-square percentage error (RMSPE) of up to approximately 17% compared to the classical JEKF, indicating a substantial enhancement in estimation accuracy.

Keywords:

joint extended Kalman filter; unstructured mechanistic model; bioprocess monitoring

1. Introduction

The extended Kalman filter (EKF) is a recursive Bayesian filter [1,2]. This nonlinear state estimator (NSE) is a commonly used technique for estimating the state of a nonlinear system using a state-space model, first-order linearization, and linear estimation theory. It is composed of a process model and a measurement model along with error covariance matrices of the process (Q), measurement (R), and state (P) [3,4]. The EKF, beyond state estimation, is also used for the parameter estimation (parameter evolution [5]) of nonlinear systems (process models) considering a single joint state variable vector, which includes both the states and parameters of the process model [6,7,8]. This approach is called the joint estimation of states and parameters with an extended Kalman filter (JEKF). The joint estimation problem is motivated by the need to correct the prediction of a process model regarding state variables and to update the process model by evolving its parameters based on the corrections made [8]. A process model should be estimated (evolved) for different conditions of the same application. For example, in biomanufacturing, the parameters of a process model for monitoring a cell culture should change for each new condition. We can use a general set of parameters at the beginning of the process, but we need to evolve them during the process to improve the predictions of the states of the cell culture. Thus, JEKF uses each measurement as soon as it becomes available to correct both the predictions and parameters of a process model [8]. The first discussions and applications of the JEKF approach started in the 1960s for the estimation of linear systems (in which there is a bilinear relation between the states and parameters) [6,7,8,9,10]. However, the JEKF is still very popular, with several new applications in different areas [5,11,12,13,14,15,16,17,18,19,20,21], and with unsolved problems [22,23]. Furthermore, the JEKF has been established as the least expensive nonlinear estimator for moderate-size systems in terms of computational cost because the practical implementation of adaptive controllers using microcontrollers (and/or minicomputers and/or microprocessors) requires numerically economical and robust algorithms, such as the JEKF [11,24]. An important area of application of the JEKF is biomanufacturing, that is, the production of biological products from living cells [20,25,26]. The reason for this is that the JEKF with the mechanistic model (MM) as a process model effectively serves as a soft sensor in biomanufacturing. This combination can enable the real-time monitoring of critical process parameters (CPPs) or critical quality attributes (CQAs) that are difficult to measure directly or that can only be measured at low sampling frequencies in a bioprocess [20,27]. There are two types of MM: structured mechanistic models (SMMs) and unstructured mechanistic models (UMMs) [28]. When we have knowledge about a bioprocess, we can use an SMM with the JEKF. On the other hand, when we do not have knowledge about a bioprocess, we can use a UMM with the JEKF because the UMM allows us to model the macro-scale of a phenomenon. It is a mass-balance equation system with few parameters and variables and less complexity than SMMs [29,30].

The UMM used in biomanufacturing typically consists of ODEs with unshared parameters, weak variables, and weak terms. However, these characteristics of UMM in biomanufacturing, together with the use of P(t = 0) and Q with uncorrelated elements and the presence of a single measured state variable, represent a failure case that occurs when the JEKF cannot estimate the unshared parameters and the state simultaneously. There are many new bioprocesses for which the literature contains no prior knowledge that the biopharmaceutical industry aims to monitor, such as recombinant adeno-associated virus (rAAV) production [31]. Therefore, enabling the JEKF to side-step the failure case described above may help the industry perform biomanufacturing with the real-time monitoring of bioprocesses with unknown mechanisms. Consequently, this skill can support the biopharmaceutical industry in achieving biomanufacturing 4.0 by becoming more agile and intelligent, thus enhancing product quality, optimizing operations, and reducing costs [25,26,32,33]. Although the biopharmaceutical industry was valued at USD 239.8 billion in 2019 and is estimated to grow at an annual rate of over 13%, it faces significant challenges in achieving the desired productivity and product quality consistently [34].

In this work, we present the common conditions in biomanufacturing that represent a failure case where the JEKF fails to perform the unshared parameter evolution of a UMM, and we propose a solution to side-step this failure case, called SANTO, which consists of a Specific initiAl coNdiTiOn (SANTO) for the matrix Ricatti differential equation (MRDE). Our solution is inspired by the regularization technique to avoid singularity issues in EKF. However, instead of adding a small quantity to the diagonal elements of the state error covariance matrix P [35], we only add a quantity to the state error covariance between the measured state variable (MSV) and an unshared parameter (UP) in P(t = 0) for the MRDE. The proposed approach can avoid JEKF failure by preventing the Kalman gain from being zero throughout the entire process, which is an unrealistic situation that would mean that the predictions of the UMM (used as a process model) are perfect. Our theoretical and empirical results demonstrate the effectiveness of SANTO, which was assessed using synthetic and real datasets. The code and data used in this work are available in the data availability section of this paper to facilitate reproducibility. Our contributions can be summarized as follows:

We provide proof of JEKF failure when acting as an unshared parameter estimator under specific biomanufacturing conditions that represent a failure case. To our knowledge, this is the first work to formally report this failure case regarding the JEKF.
An approach to avoid the JEKF failure that enables using JEKF with UMM for real-time bioprocess monitoring. This is helpful in the macro-scale modeling of a phenomenon with UMM where the underlying process mechanism is not fully understood.

2. Related Work

In contrast to JEKF, the dual extended Kalman filter (DEKF) employs two consecutive EKFs, separating the estimation of system states and parameters [36]. This separation can be advantageous in certain scenarios, but JEKF offers three important benefits, particularly in the context of the practical implementation of adaptive controllers using microcontrollers in biomanufacturing that requires numerically economical and robust algorithms such as JEKF [11,24]. First, JEKF avoids the computational overhead associated with running two separate filters, as in DEKF, enhancing computational efficiency [37]. Second, it can provide more accurate and robust estimates in scenarios, such as nonlinear biochemical systems, that commonly occur in biomanufacturing processes [36]. Lastly, the single-filter structure of JEKF is simpler to implement and tune compared to the dual-filter approach of DEKF [8]. The main limitation of JEKF is not guaranteed convergence in some cases, as reported by [6,24,38]. A solution to deal with the convergence problems of JEKF is to use recurrent derivatives [6,38]. However, a theoretical justification for that was not provided [8]. On the other hand, it was reported that the cause of divergence in JEKF is linked to the linearization of the coupled system and not due to the lack of recurrent derivatives [24]. Furthermore, there are certain cases where the JEKF may be unable to estimate the parameters and the state simultaneously, such as singularity issues [35]. However, until now, the failure case (biomanufacturing conditions) where JEKF fails as an unshared parameter estimator has not been formally reported. Recently, the JEKF was applied for monitoring rAAV production [19]. In developing this application, the authors dealt with a situation that resembles the failure case reported here. Because they reported the use of a simple UMM, P(t = 0), and Q with uncorrelated elements and a second linear operator as an approach to enable Kalman gain (K) and P to be updated with prior error covariances with regard to the UMM parameters, their results showed the unshared parameter evolution with convergence. However, the authors did not describe the problem in detail. They did not present a theoretical justification for the approach used (second linear operator). They clearly stated that the work is an initial study and reported the need for future validation. We named this approach KPH2 because the authors used a second linear operator to enable K and P to be updated, and we used this approach in our experimental evaluation for comparison purposes with our proposed approach. A description of KPH2 and a possible interpretation can be found in Section S6 of the Supplementary Material.

3. Background

3.1. Unstructured Mechanistic Model (UMM)

Unstructured Mechanistic Models (or Unstructured Mechanistic Kinetic Models) are models of the temporal evolution of a bioprocess [39]. They are based on first-principle mechanisms that drive the bioprocess under consideration [34]. Examples of bioprocesses are (i) the production of therapeutic monoclonal antibodies (mAbs), which is projected to bring in USD 300 billion by 2025 [34], and (ii) the rAAV production that is a viral vector technology for gene therapy considered the safest and most effective way to repair single-gene abnormalities in non-dividing cells [19,31]. It is essential to point out that despite UMM being the most suitable option to describe the dynamic behavior of bioprocesses and being considered a crucial foundation for soft sensors in DT development, its industrial use is still in its early stages [28,39,40]. The UMMs are important because they allow for the macro-scale modeling of the bioreactor’s functionality and can provide insight into the upstream process’s underlying macro-scale phenomena. For example, this kind of model can be used to depict the dynamics of the cell density, viability, nutrient/metabolite concentrations, and product titer [41,42,43]. Therefore, UMMs are the most suitable option for explaining observed phenomena, predicting process behavior, and analyzing intrinsic bioprocess characteristics such as controllability [34].

The main difference between UMM and SMM is that SMM is more complex than UMM because it provides details about the intracellular environment of a homogenous cell population. Therefore, the development of SMM for a specific bioprocess requires extensive domain knowledge and substantial effort [34,41]. SMM is unsuitable for the dynamic control of bioprocess in bioreactors used commonly in biomanufacturing because many of the variables used in SMM cannot be manipulated directly [34]. SMM is most suited for cell-line development, in which a cells’ genome-level properties are changed to produce the desired process behavior [34].

It is essential to point out that a simple UMM has limited predictive power and is insufficient to process state estimation. Moreover, it is improbable that a single set of parameter values enables a kinetic model to satisfy several datasets collected under distinct operating circumstances [44]. The Kalman filter approach is commonly implemented with UMM [45] to improve prediction accuracy and generate predictions between sampling instances. Among several data analysis methods, the Kalman filter and its nonlinear extensions, such as the extended Kalman filter, are effective tools for predicting the values of unobserved states. Examples of UMM used in biomanufacturing can be found in Section S1 of Supplementary Material.

3.2. Continuous-Discrete Extended Kalman Filter

This section gives an overview of the continuous-discrete EKF (CD-EKF) algorithm. A detailed description of CD-EKF can be found in Section S2 of the Supplementary Material. The EKF requires a state-space model to perform an estimation on the state variables of a process (nonlinear system) present in a state variable vector

ψ (t)

[1,36,44]. A state-space model consists of process and measurement (observation) models [46]. EKF linearizes the nonlinear system (state-space model) by calculating the Jacobians of the nonlinear process and measurement models based on the first-order Taylor series expansion in order to analytically propagate the Gaussian random-variable representation [8,20,44].

A UMM can be used as the process model of EKF. The state variables vector to be used by the EKF is composed of the state variables of the UMM (observed and unobserved), and the state variables vector is defined as:

ψ (t) = {[x_{1}, x_{2}, \dots, x_{n}]}^{T} .

(1)

Subsequently, the process model is represented as

\frac{d ψ (t)}{d t} = ϕ (ψ (t), t, θ) + ω (t),

(2)

where

ϕ

denotes nonlinear functions of the state variables in

ψ (t)

, which corresponds to a UMM. The process model is formulated in a continuous time t, and the white process noise vector is represented by

ω \sim N (0, Q)

with the zero mean and the error covariance matrix of process model represented by

Q

.

The measurement model is treated as a discrete system and defined as

Z_{k} = h (ψ (t_{k})) + v .

(3)

The nonlinear function h in the measurement model relates the current state variables to the measurements

Z_{k}

. The white measurement noise vector is represented by

v \sim N (0, R)

with zero mean and measurement noise variance represented by

R

. When some state variables can be measured directly, we have a simple case and h can be a linear model. If h is linear, we have

h (ψ (t_{k})) = H ψ (t_{k})

[20,36,47] where the matrix

H

is a linear operator (row vector) that matches the states variables of

ψ (t_{k})

to the measured variables

Z_{k}

that are obtained at a discrete instance k [20,47]. Consequently, the measurement model (3) can be rewritten as

Z_{k} = H ψ (t_{k}) + v .

(4)

The EKF algorithm is implemented through a state variables vector

ψ (t)

, initial condition, prediction step (time update) and correction step (measurement update) [1,20,21,36,47].

Initialization step: The initial condition is composed of the initial mean

{\hat{ψ}}_{0} = E [ψ_{0}]

and initial error covariance matrix

P_{0} = P (t = 0) = E [(ψ_{0} - {\hat{ψ}}_{0}) {(ψ_{0} - {\hat{ψ}}_{0})}^{T}]

of the state variables vector in addition to the error covariance matrices of the process Q and measurement R [8].

Prediction step: In this step, the a priori predictions represented by the predicted mean

\hat{ψ} (t_{k / k - 1})

and predicted error covariance matrix

P (t_{k | k - 1})

of state variables vector

ψ (t)

are obtained. This is completed by numerically integrating

ϕ (ψ (t), t, θ)

from discrete time

t_{k - 1}

to

t_{k}

the following equation

\hat{ψ} (t_{k / k - 1}) = {\hat{ψ} (t_{k - 1}) + \int_{t_{k - 1}}^{t_{k}} ϕ (\hat{ψ} (t)) d t|}_{\hat{ψ} (t_{k - 1})}

(5)

and solving the MRDE to predict the state error covariance matrix [4,48]

\frac{d P (t)}{d t} = J_{t}^{ϕ} P (t) + P (t) J_{t}^{ϕ T} + Q

(6)

from

t_{k - 1}

to

t_{k}

, where a new measurement is obtained at time k [4,49], and

J_{t}^{ϕ}

is the Jacobian matrix of

ϕ

evaluated at the prior mode [50,51],

J_{t}^{ϕ} = {\frac{\partial ϕ (ψ (t))}{\partial ψ_{i}}|}_{ψ (t) = \hat{ψ} (t - 1)} .

(7)

Equation (6) is basically a matrix of ODEs, and the matrix of ODEs solutions obtained from

t_{k - 1}

to

t_{k}

represent each error covariance of the system state.

Correction step: In this step, the results of the prediction step (

\hat{ψ} (t_{k / k - 1})

and

P (t_{k | k - 1})

) are combined with the measured value

Z_{k}

and the Kalman gain (

K_{k}

) to provide the estimated mean

\hat{ψ} (t_{k / k})

and estimated error covariance matrix

P (t_{k | k})

of state variables using the following equations:

(i) innovation equations

e_{Z, k} = Z_{k} - H \hat{x} (t_{k / k - 1})

(8)

S_{k} = H P (t_{k | k - 1}) H^{T} + R

(9)

and (ii) update step equations

K_{k} = P (t_{k | k - 1}) H^{T} S_{k}^{- 1}

(10)

\hat{x} (t_{k / k}) = \hat{x} (t_{k / k - 1}) + K_{k} e_{Z, k}

(11)

P (t_{k | k}) = (I - K_{k} H) P (t_{k | k - 1})

(12)

where

e_{Z, k}

and

S_{k}

represent, respectively, the innovation error and innovation covariance.

The Kalman gain is a scaling factor (ratio) to estimate the state variables by setting a value between the predicted state and measured state [4,50]. The

K_{k}

chooses a value along the residual range (

Z_{k}

-

H \hat{ψ} (t_{k / k - 1})

) [8,50].

K_{k}

enables to set a value for

\hat{ψ} (t_{k / k})

between the

\hat{ψ} (t_{k / k - 1})

(prediction) and

Z_{k}

(measurement) using Equation (11) and update the belief regarding the state variables based on how certain we are regarding the measurement using Equation (12) [50]. The Kalman gain is computed as a ratio of prior and measurement uncertainty available; see Equation (10). The one-dimensional form of Equation (10) is the following

K = P / (P + R)

[50]. It is important to point out that linear operator

H

matches the states variables of

ψ (t_{k})

to the measured variables

Z_{k}

that are obtained at a discrete instance.

Using the estimated mean

\hat{ψ} (t_{k / k})

and the estimated error covariance matrix

P (t_{k | k})

of the vector of the state variables as an initial condition, we can return to the prediction step until the next measurement is obtained and everything repeated again.

3.3. JEKF

JEKF is a Bayesian filter-based joint estimation approach where the states

x_{i}

and parameters

θ

of a process model are concatenated into a single joint state vector [52]. Then, the state variables vector (

ψ (t) = {[x_{1}, x_{2}, \dots, x_{n}]}^{T}

) is considered as extended/augmented as following,

ψ (t) = {[x_{1}, x_{2}, \dots, x_{n}, θ_{1}, \dots, θ_{n}]}^{T} .

(13)

To be more specific, we consider the problem of learning both the states

x_{i}

and parameters

θ_{i}

of a discrete-time nonlinear dynamical system (such as the UMM described in Section S1 of Supplementary Material) that is used as a process model. In JEKF, the system states

x_{i}

and the set of model parameters

θ_{i}

for the dynamical system are simultaneously corrected based only on the observed noisy signal

Z_{k}

. It is essential to point out that we consider JEKF as an approach for parameter evolution [5], because it cannot guarantee convergence in some cases [6]. However, it can guarantee the evolution of the parameters based on the following equation [5]

θ (t_{k}) = θ (t_{k - 1}) + n o i s e,

(14)

where the parameters are defined as random variables with perturbation (noise) added at each time step. This parameter evolution can be enough to update the process model parameters when we are near the optimal parameters regarding a specific condition. In this paper, when we say parameter estimation, we are referring to parameter evolution.

4. Theoretical Analysis

This section presents the theoretical analysis of the JEKF failure to perform unshared parameter evolution with a UMM and SANTO, which is the proposed solution for this problem.

4.1. JEKF Failure

First, we present the conditions where JEKF fails to estimate (parameter evolution) the unshared parameters of a UMM. Next, we present the theoretical proof of the failure. However, before starting the analysis, we formally define unshared parameters and weak and strong terms/variables of an ODE as follows:

Unshared parameters: They are parameters used only in one term of an ODE and not used by other ODEs of the same UMM. See the example in Section S3.1 of the Supplementary Material.
Weak and Strong term of an ODE: A weak term is a term of an ODE with a low percentage of variables of the state variable vector, and a "strong term" is one with a high percentage of variables of the state variable vector. See the example in Section S3.2 of the Supplementary Material.
Weak and Strong variable of an ODE: A weak variable is a variable used only in the first member of an ODE in UMM, and a strong variable is a variable used in the first member and different terms of the second member of an ODE. Furthermore, it is used in the second member of other ODEs of the same UMM. See the example in Section S3.3 of the Supplementary Material.

4.1.1. Failure Case: Biomanufacturing Conditions

The following conditions are prevalent in biomanufacturing and should be taken into consideration while developing JEKF applications for this area:

ODEs of UMM with unshared parameters. This parameter type is commonly used in ODE to model the dynamic of product formation in biomanufacturing [53,54,55]. See the example in Section S3.1 of the Supplementary Material.
P and Q with uncorrelated elements. In case of the limited amount of data, it is very common to assume P and Q with uncorrelated elements in EKF applications [19,20,21,47]. This assumption means that the error covariance matrices P and Q are diagonal, with the diagonal elements being the noise variances (P $_{i, i} \neq 0$ and Q $_{i, i} \neq 0$ ) and off-diagonal elements equal to zero (P $_{i, j} = 0$ and Q $_{i, j} = 0$ ). The Q constant and with uncorrelated elements is used only to build the MRDE, and the P with uncorrelated elements can be used to build an MRDE and as an initial condition of MRDE (the initial predicted state error covariance P(t = 0)).
This assumption raises two scenarios:
- The use of P with uncorrelated elements to build the MRDE (Equation (6)) and P(t = 0) with uncorrelated elements as the initial condition. When P with uncorrelated elements is used to build the MRDE, the ODEs of MRDE are based only on noise variance of P $_{i, i}$ and Q $_{i, i}$ and elements of Jacobian $J_{t}^{ϕ}$ . See the example in Section S3.4 of the Supplementary Material. It is important to point out that depending on the partial derivative, the ODE to predict a state error covariance can be time-invariant $\frac{d P_{i, j} (t_{k | k - 1})}{d t} = 0$ . See Section S3.2 of the Supplementary Material.
- The use of P with correlated elements to build the MRDE (Equation (6)) and P(t = 0) with uncorrelated elements as the initial condition. This means that the ODE of MRDE can be composed of off-diagonal elements of P, and it can reduce the number of the time-invariant ODE to predict a state error covariance between two state variables.
ODEs of UMM with weak terms. A strong term contributes more than a weak term to compute the predicted state error covariance $P (t_{k | k - 1})$ . Many elements of Jacobian $J_{t}^{ϕ}$ result from the partial derivation of a strong term. See the example in Section S3.2 of the Supplementary Material.
ODEs of UMM with weak variables. In the Jacobian $J_{t}^{ϕ}$ , the first-order partial derivatives of all functions with respect to a weak variable are equal to zero. Consequently, this variable type does not contribute to the calculations of predicted error covariance $P (t_{k | k - 1})$ since it will not be part of any element of MRDE to predict the state error covariance matrix $P (t_{k | k - 1})$ . On the other hand, a strong variable contributes to the calculations of predicted error covariance $P (t_{k | k - 1})$ . See the example in Section S3.3 of Supplementary Material.
Only one measured state variable. In some cases (JEKF application), measuring only one state variable is possible. This measured state variable determines which column of the predicted state error covariance $P (t_{k | k - 1})$ is used to compute the Kalman gain through $P (t_{k | k - 1}) H^{T}$ in Equation (10). If this column has a row with a value equal to zero (no covariance between the measured variable and state variable represented by the row), the Kalman gain cannot be computed to the state variable defined by the row. See the example in Section S3.5 of the Supplementary Material.

4.1.2. Lemma: Inability to Update Kalman Gain for Unshared Parameters based P(t = 0) and Q with Uncorrelated Elements

Given the conditions described above, we have the following Lemma:

Lemma 1.

The Kalman gain cannot be updated (by Equation (10)) for an unshared parameter that is part of a state variable vector and part of a weak term in a UMM if the initial state error covariance matrix P(t = 0) and Q are formed by uncorrelated elements and there is only one state variable measured.

The proof of this lemma is in the following, and an example can be found in Section S4 of the Supplementary Material.

Proof of Lemma 1.

Let us consider the following:

A general UMM with an unshared parameter in a weak term represented by a system of nonlinear differential equations of the form:

$\begin{matrix} \frac{d x_{m s v}}{d t} & = f_{1} (x_{m s v}, x_{2}, \dots, x_{n - 1}, θ_{1}, θ_{2}, \dots, θ_{m}) \end{matrix}$

(15)

$\begin{matrix} \frac{d x_{2}}{d t} & = f_{2} (x_{m s v}, x_{2}, \dots, x_{n - 1}, θ_{1}, θ_{2}, \dots, θ_{m}) \end{matrix}$

(16)

$\begin{matrix} ⋮ \end{matrix}$

(17)

$\begin{matrix} \frac{d x_{n}}{d t} & = f_{n} (x_{m s v}, θ_{u p}) \end{matrix}$

(18)

where $x_{m s v}$ and $x_{2}, \dots, x_{n}$ are the variables of the system, $f_{1}, f_{2}, \dots, f_{n}$ are the functions defining the system, and $θ_{1}, θ_{2}, \dots, θ_{m}$ are the parameters of the system, and $θ_{u p}$ is an unshared parameter.
A joint state variables vector defined as

$ψ {(t)}_{g e n e r a l} = [x_{m s v}, x_{2}, \dots, x_{n}, θ_{u p}] .$

(19)
A process model defined as

$\frac{d ψ {(t)}_{g e n e r a l}}{d t} = ϕ (ψ {(t)}_{g e n e r a l}, t) + ω (t) = \frac{d}{d t} [\begin{matrix} x_{m s v} \\ x_{2} \\ ⋮ \\ x_{n} \\ θ_{u p} \end{matrix}] = [\begin{matrix} f_{1} \\ f_{2} \\ ⋮ \\ f_{n} \\ 0 \end{matrix}] + ω (t) .$

(20)
$x_{m s v}$ as the unique measured state variable (MSV) and $H$ = [1 0 … 0 0].
R as measurement noise variance of $x_{m s v}$ .
$θ_{u p}$ as the unshared parameter (UP) to be evolved (estimated) and presented in only one weak term.
P and Q with uncorrelated elements for the $ψ {(t)}_{g e n e r a l}$ (Equation (19)),

$P = [\begin{matrix} P_{x_{m s v}, x_{m s v}} & 0 & \dots & 0 & 0 \\ 0 & P_{x_{2}, x_{2}} & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & P_{n, n} & 0 \\ 0 & 0 & \dots & 0 & P_{θ_{u p}, θ_{u p}} \end{matrix}],$

(21)

$Q = [\begin{matrix} Q_{x_{m s v}, x_{m s v}} & 0 & \dots & 0 & 0 \\ 0 & Q_{x_{2}, x_{2}} & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & Q_{n, n} & 0 \\ 0 & 0 & \dots & 0 & Q_{θ_{u p}, θ_{u p}} \end{matrix}] .$

(22)
The Jacobian $J_{t}^{ϕ}$ (Equation (7)), with the $ψ {(t)}_{g e n e r a l}$ (Equation (19)),

$J_{t}^{ϕ} (ϕ (ψ {(t)}_{g e n e r a l}, t)) = [\begin{matrix} \frac{\partial f_{1}}{\partial x_{m s v}} & \frac{\partial f_{1}}{\partial x_{2}} & \dots & \frac{\partial f_{1}}{\partial x_{n}} & 0 \\ \frac{\partial f_{2}}{\partial x_{m s v}} & \frac{\partial f_{2}}{\partial x_{2}} & \dots & \frac{\partial f_{2}}{\partial x_{n}} & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ \frac{\partial f_{n}}{\partial x_{m s v}} & \frac{\partial f_{n}}{\partial x_{2}} & \dots & \frac{\partial f_{n}}{\partial x_{n}} & \frac{\partial f_{n}}{\partial θ_{u p}} \\ 0 & 0 & \dots & 0 & 0 \end{matrix}] .$

(23)

Given these conditions and Equation (6), we have the following MRDE (based on P uncorrelated)

\frac{d P (t)}{d t} = [\begin{matrix} \frac{d P_{x_{m s v}, x_{m s v}} (t)}{d t} = Q_{1, 1} + 2 P_{1, 1} \frac{\partial f_{1}}{\partial x_{m s v}} & \frac{d P_{x_{2}, x_{m s v}} (t)}{d t} = (P_{1, 1} + P_{2, 2}) \frac{\partial f_{1}}{\partial x_{2}} & \dots & \frac{d P_{x_{n}, x_{m s v}} (t)}{d t} = (P_{1, 1} + P_{3, 3}) \frac{\partial f_{1}}{\partial x_{n}} & \frac{d P_{θ_{u p}, x_{m s v}} (t)}{d t} 0 \\ \frac{d P_{x_{m s v}, x_{2}} (t)}{d t} = (P_{1, 1} + P_{2, 2}) \frac{\partial f_{2}}{\partial x_{m s v}} & \frac{d P_{x_{2}, x_{2}} (t)}{d t} = Q_{2, 2} + 2 P_{2, 2} \frac{\partial f_{2}}{\partial x_{2}} & \dots & \frac{d P_{x_{n}, x_{2}} (t)}{d t} = (P_{3, 3} + P_{2, 2}) \frac{\partial f_{2}}{\partial x_{n}} & \frac{d P_{θ_{u p}, x_{2}} (t)}{d t} = 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ \frac{d P_{x_{m s v}, x_{n}} (t)}{d t} = (P_{1, 1} + P_{n, n}) \frac{\partial f_{n}}{\partial x_{m s v}} & \frac{d P_{x_{2}, x_{n}} (t)}{d t} = (P_{2, 2} + P_{n, n}) \frac{\partial f_{n}}{\partial x_{2}} & \dots & \frac{d P_{x_{n}, x_{n}} (t)}{d t} = Q_{n, n} + 2 P_{n, n} \frac{\partial f_{n}}{\partial x_{n}} & \frac{d P_{θ_{u p}, x_{n}} (t)}{d t} = 0 \\ \frac{d P_{x_{m s v}, θ_{u p}} (t)}{d t} = 0 & \frac{d P_{x_{2}, θ_{u p}} (t)}{d t} = 0 & \dots & \frac{d P_{x_{n}, θ_{u p}} (t)}{d t} = 0 & \frac{d P_{θ_{u p}, θ_{u p}} (t)}{d t} = 0 \end{matrix}] .

(24)

Now, using this Equation (24) to compute the predicted state error covariance matrix

P (t_{k / k - 1})

from

t_{k - 1}

to

t_{k}

with an initial predicted state error covariance matrix

P (t_{k - 1}) = P_{0} = P_{i n i t} (t = 0)

with uncorrelated elements as the following

P_{i n i t} (t = 0) = [\begin{matrix} P_{x_{m s v}, x_{m s v}} (t = 0) & 0 & \dots & 0 & 0 \\ 0 & P_{x_{2}, x_{2}} (t = 0) & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & P_{n, n} (t = 0) & 0 \\ 0 & 0 & \dots & 0 & P_{θ_{u p}, θ_{u p}} (t = 0) \end{matrix}],

(25)

we have

P (t_{k / k - 1}) = [\begin{matrix} P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) & P_{x_{2}, x_{m s v}} (t_{k / k - 1}) & \dots & P_{n, x_{m s v}} (t_{k / k - 1}) & P_{θ_{u p}, x_{m s v}} (t_{k / k - 1}) \\ P_{x_{m s v}, x_{2}} (t_{k / k - 1}) & P_{x_{2}, x_{2}} (t_{k / k - 1}) & \dots & P_{n, x_{2}} (t_{k / k - 1}) & P_{θ_{u p}, x_{2}} (t_{k / k - 1}) \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ P_{x_{m s v}, n} (t_{k / k - 1}) & P_{x_{2}, n} (t_{k / k - 1}) & \dots & P_{n, n} (t_{k / k - 1}) & P_{θ_{u p}, n} (t_{k / k - 1}) \\ P_{x_{m s v}, θ_{u p}} (t_{k / k - 1}) = 0 & P_{x_{2}, θ_{u p}} (t_{k / k - 1}) & \dots & P_{n, θ_{u p}} (t_{k / k - 1}) & P_{θ_{u p}, θ_{u p}} (t_{k / k - 1}) \end{matrix}] .

(26)

Now, using

P (t_{k / k - 1})

, H and R to compute the Kalman gain for all variables in the state variable vector

ψ {(t)}_{g e n e r a l}

(Equation (19)), we have

K_{k} = P (t_{k | k - 1}) H^{T} {(H P (t_{k | k - 1}) H^{T} + R)}^{- 1} = [\begin{matrix} K_{x_{m s v}} \\ K_{x_{2}} \\ ⋮ \\ K_{x_{n}} \\ K_{θ_{u p}} \end{matrix}] = [\begin{matrix} \frac{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ \frac{P_{x_{m s v}, x_{2}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ ⋮ \\ \frac{P_{x_{m s v}, n} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ \frac{P_{x_{m s v}, θ_{u p}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \end{matrix}] = [\begin{matrix} \frac{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ \frac{P_{x_{m s v}, x_{2}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ ⋮ \\ \frac{P_{x_{m s v}, n} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ \frac{0}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \end{matrix}] = [\begin{matrix} \frac{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ \frac{P_{x_{m s v}, x_{2}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ ⋮ \\ \frac{P_{x_{m s v}, n} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \\ 0 \end{matrix}] .

(27)

H selected the first column of

P (t_{k / k - 1})

, since it is related to the measured value

x_{m s v}

. However, in this column, we have that the predicted state error covariance between

x_{m s v}

and

θ_{u p}

is zero,

P_{x_{m s v}, θ_{u p}} (t_{k / k - 1}) = C o v (x_{m s v}, θ_{u p}) = 0

. The solution of

\frac{d P_{x_{m s v}, θ_{u p}} (t)}{d t} = 0

obtained from

t_{k - 1}

to

t_{k}

is equal to the initial condition that is zero due to P(t = 0) with uncorrelated elements, and we have

C o v (x_{m s v}, θ_{u p}) = P_{x_{m s v}, θ_{u p}} (t_{k - 1}) = P_{x_{m s v}, θ_{u p}} (t = 0) = 0

. Then, the Kalman gain value for the unshared parameter is zero,

K_{θ_{u p}}

= 0, and consequently, the predicted state error covariance

P_{x_{m s v}, θ_{u p}} (t_{k / k - 1})

cannot be updated (by Equation (12)). Since

\begin{matrix} P (t_{k | k}) = (I - K_{k} H) P (t_{k | k - 1}) = [\begin{matrix} ⋮ \\ P_{x_{m s v}, θ_{u p}} (t_{k / k - 1}) - K_{θ_{u p}} . P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) & \dots \end{matrix}] = \\ [\begin{matrix} ⋮ \\ 0 - 0 . P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) & \dots \end{matrix}] . \end{matrix}

(28)

Therefore, we have that

P_{x_{m s v}, θ_{u p}} (t_{k / k}) = P_{x_{m s v}, θ_{u p}} (t_{k / k - 1}) = 0

, and as

P_{x_{m s v}, θ_{u p}} (t_{k / k}) = 0

has to be used as a new initial condition for MRDE (Equation (24)), we have

K_{θ_{u p}}

= 0 for all

P_{x_{m s v}, θ_{u p}} (t_{k / k - 1})

obtained from

t_{k - 1}

to

t_{k}

using Equation (24) and consequently

K_{θ_{u p}}

and

P_{x_{m s v}, θ_{u p}} (t_{k / k}) = P_{x_{m s v}, θ_{u p}} (t_{k / k - 1}) = 0

are always zero and cannot be updated. □

4.1.3. Theorem: JEKF Failure

The consequence of Lemma 1 (Section 4.1.2) is the following theorem:

Theorem 1.

The JEKF (Section 3.3) fails to estimate an unshared parameter (parameter evolution) that is part of a state variable vector and part of a weak term in a UMM if the initial state error covariance matrix P(t = 0) and Q are composed of uncorrelated elements, and there is only one state variable measured. This is because the Kalman gain value for the unshared parameter is equal to zero for all steps of execution of the JEKF algorithm.

The proof of Theorem 1 is in the following, and an example of this theorem can be found in Section S5 of the Supplementary Material.

Proof of Theorem 1.

This proof can be completed using the conditions and results described previously in the proof of Lemma 1 (Section 4.1.2).

Then, let us consider the following:

H = [1 0 … 0 0] and $K_{k} = {[K_{x_{m s v}}, K_{x_{2}}, \dots, K_{x_{n}}, K_{θ_{u p}}]}^{T}$ as obtained in the proof of Lemma 1 in Section 4.1.2, where $K_{θ_{u p}}$ = 0.
$Z_{k}$ as a measured value of $x_{m s v}$ .
Predicted mean of the state variable vector $\hat{ψ} {(t_{k / k - 1})}_{g e n e r a l} = {[{\hat{x}}_{m s v}, {\hat{x}}_{2}, \dots, {\hat{x}}_{n}, {\hat{θ}}_{u p}]}^{T}$ with regard to the general UMM used in the proof of Lemma 1 in Section 4.1.2.

Now, using Equation (11) to compute the estimated mean of the state variable vector

\hat{ψ} {(t_{k / k})}_{g e n e r a l}

, we have

\hat{ψ} {(t_{k / k})}_{g e n e r a l} = \hat{ψ} {(t_{k / k - 1})}_{g e n e r a l} + K_{k} (Z_{k} - H \hat{ψ} {(t_{k / k - 1})}_{g e n e r a l})

(29)

\hat{ψ} {(t_{k / k})}_{g e n e r a l} = [\begin{matrix} {\hat{x}}_{m s v} \\ {\hat{x}}_{2} \\ ⋮ \\ {\hat{x}}_{n} \\ {\hat{θ}}_{u p} \end{matrix}] + [\begin{matrix} K_{x_{m s v}} \\ K_{x_{2}} \\ ⋮ \\ K_{x_{n}} \\ K_{θ_{u p}} \end{matrix}] . (Z_{k} - {\hat{x}}_{m s v}) = [\begin{matrix} {\hat{x}}_{m s v} + K_{x_{m s v}} . (Z_{k} - {\hat{x}}_{m s v}) \\ {\hat{x}}_{2} + K_{x_{2}} . (Z_{k} - {\hat{x}}_{m s v}) \\ ⋮ \\ {\hat{x}}_{n} + K_{x_{n}} . (Z_{k} - {\hat{x}}_{m s v}) \\ {\hat{θ}}_{u p} + 0 \end{matrix}]

(30)

Then, we have that the estimated mean of the unshared parameter

{\hat{θ}}_{u p} (t_{k / k})

(composing the

\hat{ψ} {(t_{k / k})}_{g e n e r a l}

) is equal to the predicted mean of unshared parameter

{\hat{θ}}_{u p} (t_{k / k - 1})

(composing the

\hat{ψ} {(t_{k / k - 1})}_{g e n e r a l}

) for all steps from

t_{k - 1}

to

t_{k}

. In other words, the JEKF fails to perform the parameter evolution, since it does not have a noise component to evolve the parameter as described in the

θ (t_{k}) = θ (t_{k - 1}) + n o i s e

(Equation (14)); then,

{\hat{θ}}_{u p} (t_{k / k}) = {\hat{θ}}_{u p} (t_{k / k - 1})

for all steps from

t_{k - 1}

to

t_{k}

. □

4.2. SANTO: Specific Initial Condition for MRDE ( $P_{M S V, U P} (t = 0) \neq 0 i n$ $P_{0}$ )

This section presents the SANTO approach to avoid the JEKF failure described in Theorem 1. The initial condition of MRDE is the initial state error covariance matrix

P_{0} = P (t = 0)

. When it is composed of uncorrelated elements (P

​_{i, j} = 0

), some initial conditions of time-invariant ODEs (

\frac{d P_{i, j} (t_{k | k - 1})}{d t} = 0

) in the MRDE are zero, and consequently, the obtained solutions from

t_{k - 1}

to

t_{k}

for some of these time-invariant ODEs are zero, too. Furthermore, in the presence of the biomanufacturing conditions (failure case presented in Section 4.1.1), we have that the Kalman gain value regarding the unshared parameter (

K_{U P}

) and the predicted state error covariance between the unique measured state variable and the unshared parameter (

P_{M S V, U P} (t_{k | k - 1})

), are zero too,

K_{U P} = 0

and

P_{M S V, U P} (t_{k | k - 1}) = 0

. Then, the

K_{U P}

and

P_{M S V, U P} (t_{k | k - 1})

that compose

P (t_{k | k - 1})

cannot be updated with regard to the unshared parameter (see Lemma 1), and they are constant and equal to zero during the entire process execution of JEKF. It is worth noting that

P_{M S V, U P} (t_{k | k - 1})

is an element of

P (t_{k | k - 1})

such as

P_{M S V, U P} (t = 0)

is an element of

P (t = 0)

. Furthermore, that

K_{U P} = 0

during the entire JEKF execution reflects an unrealistic situation. This would mean that the prediction regarding the unshared parameter is perfect and does not need the influence of the measurement in the correction step of JEKF since there is no uncertainty in the prediction regarding the unshared parameter. This reflects the second intuition behind Kalman gain described in Section S2 of the Supplementary Material. However, based on prior knowledge, we know that the process model predictions regarding the unshared parameter are imperfect since we need to perform the evolution of the unshared parameter; otherwise, they would be the same during the entire process. Therefore, we need

K_{U P} \neq 0

and

P_{M S V, U P} (t_{k | k - 1}) \neq 0

.

In general, the initial condition of MRDE is P

(t = 0)

with uncorrelated elements (P

​_{i, j} = 0

) due to the difficulty of estimating all covariances with a limited dataset. However, instead, considering all off-diagonal elements of

P (t = 0)

equal zero (P

​_{i, j} = 0

), we can consider only the key off-diagonal element (that is P

​_{M S V, U P} (t = 0)

) with an initial value different of zero (

P_{M S V, U P} (t = 0) \neq 0

) to avoid the failure case. This value could be a positive quantity,

λ

, since the off-diagonal elements of

P (t = 0)

can show a positive covariance between two variables, indicating that they tend to increase or decrease together. Furthermore, the value of

λ

should be different from zero and small enough to not significantly affect the filter’s estimates but large enough to prevent the failure case. Then, with this consideration, we can have a value for the initial state error covariance between the MSV and an UP (P

​_{M S V, U P} (t = 0)

). If we add it to the initial state error covariance matrix

P (t = 0)

with the other uncorrelated elements, we have a specific initial condition for MRDE that enables us to update the

K_{U P}

and

P_{M S V, U P} (t_{k | k - 1})

present in

P (t_{k | k - 1})

and, consequently, avoids the JEKF failure.

Theorem 2

(SANTO—Proposed approach to avoid the JEKF failure). The addition of a positive quantity (λ) to the

P_{M S V, U P} (t = 0)

in

P (t = 0)

to initialize the MRDE with a specific initial condition can prevent the Kalman gain being zero in the entire execution of JEKF and prevent the JEKF failure (Section 4.1).

Proof.

The proof of the SANTO approach can be completed using the conditions described previously in the proof of Lemma 1 (Section 4.1.2) and Theorem 1 (Section 4.1.3).

Then, let us consider the following:

A positive quantity $λ$ .
$x_{m s v}$ as the unique measured state variable (MSV) and $H$ = [1 0 … 0 0].
R as measurement noise variance of $x_{m s v}$ .
$θ_{u p}$ as the unshared parameter (UP) to be evolved (estimated) and presented in only one weak term.
A specific initial predicted state error covariance matrix $P (t_{k - 1}) = P_{0} = P_{s a n t o} (t = 0)$ with uncorrelated elements and $P_{M S V, U P} (t = 0) = P_{x_{m s v}, θ_{u p}} (t = 0) = λ$ as following

$P_{s a n t o} (t = 0) = [\begin{matrix} P_{x_{m s v}, x_{m s v}} (t = 0) & 0 & \dots & 0 & 0 \\ 0 & P_{x_{2}, x_{2}} (t = 0) & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & P_{n, n} (t = 0) & 0 \\ P_{x_{m s v}, θ_{u p}} (t = 0) = λ & 0 & \dots & 0 & P_{θ_{u p}, θ_{u p}} (t = 0) \end{matrix}],$

(31)

Now, using this Equation (24) to compute the predicted state error covariance matrix

P (t_{k / k - 1})

from

t_{k - 1}

to

t_{k}

with the specific initial predicted state error covariance matrix

P (t_{k - 1}) = P_{0} = P_{s a n t o} (t = 0)

, we have

P (t_{k / k - 1}) = [\begin{matrix} ⋮ & ⋮ \\ P_{x_{m s v}, θ_{u p}} (t_{k / k - 1}) = λ & P_{x_{2}, θ_{u p}} (t_{k / k - 1}) & \dots \end{matrix}] .

(32)

where

P_{x_{m s v}, θ_{u p}} (t_{k / k - 1}) = λ

because the solution of

\frac{d P_{x_{m s v}, θ_{u p}} (t)}{d t} = 0

obtained from

t_{k - 1}

to

t_{k}

is equal to the initial condition that is

λ

in

P_{0}

. Now, using

P (t_{k / k - 1})

, H and R to compute the Kalman gain for all variables in the state variable vector

ψ (t)

(Equation (19)), we have

K_{k} = P (t_{k | k - 1}) H^{T} {(H P (t_{k | k - 1}) H^{T} + R)}^{- 1} = [\begin{matrix} ⋮ \\ K_{θ_{u p}} \end{matrix}] = [\begin{matrix} ⋮ \\ \frac{P_{x_{m s v}, θ_{u p}} (t_{k / k - 1})}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \end{matrix}] = [\begin{matrix} ⋮ \\ \frac{λ}{P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R} \end{matrix}] .

(33)

Then, we have the Kalman gain value for the unshared parameter as

K_{θ_{u p}} = λ {(P_{x_{m s v}, x_{m s v}} (t_{k / k - 1}) + R)}^{-} \neq 0,

(34)

and consequently, the predicted state error covariance

P_{x_{m s v}, θ_{u p}} (t_{k / k - 1})

can be updated by Equation (12) and predicted mean of the state variable vector with regard to UP,

{\hat{θ}}_{u p} (t_{k / k - 1})

can be updated as Equation (11). Therefore, we have

P_{x_{m s v}, θ_{u p}} (t_{k / k}) \neq P_{x_{m s v}, θ_{u p}} (t_{k / k - 1})

and

{\hat{θ}}_{u p} (t_{k / k}) \neq {\hat{θ}}_{u p} (t_{k / k - 1})

during the entire execution of JEKF. □

It is essential to point out that the SANTO is inspired by the idea of a regularization technique used to avoid the singularity problem in the state error covariance matrix [35,56]. However, instead of adding a small quantity to the diagonal elements of the state error covariance matrix P, such as the perturbed-P algorithm [35], we only add a positive quantity (

λ

) to the

P_{M S V, U P} (t = 0)

in

P (t = 0)

to initialize the MRDE. Furthermore, a positive quantity to the P

​_{M S V, U P} (t = 0)

can be defined by empirical tuning. One of the most common ways to define a quantity is by trial and error. This involves running the filter with different values of

λ

and choosing the value that results in the best performance [57].

Figure 1 shows the steps to develop a soft sensor for bioprocess monitoring based on JEKF-SANTO.

Step 1: Data Collection and Preprocessing. The first step in developing a soft sensor for bioprocess monitoring using the JEKF-SANTO approach involves comprehensive data collection and preprocessing. Once collected, these data must be meticulously cleaned and preprocessed to remove outliers and address any missing values. This preprocessing is crucial to ensure the quality and reliability of the data, which forms the foundation for accurate modeling and estimation in subsequent steps.
Step 2: Analyze the Biomanufacturing Conditions. This step involves a comprehensive analysis of the biomanufacturing conditions where JEKF fails to estimate an unshared parameter that is part of a state variable vector and part of a weak term in a UMM if the initial state error covariance matrix P(t = 0) and Q are composed of uncorrelated elements, and there is only one state variable measured.
Step 3: Implement JEKF with the SANTO approach. Implement the JEKF algorithm, defining the process model and the measurement model. Modify the initial state error covariance matrix $P (t = 0)$ as per the SANTO approach, adding a specific positive quantity $λ$ to the covariance between the measured state variable and the unshared parameter.
Step 4: JEKF-SANTO calibration. Tune the R and Q of JEKF-SANTO based on consistency tests, and adjust the $λ$ parameter model based on the estimates obtained from JEKF-SANTO related to the unshared parameter and the associated weak variable.
Step 5: Deployment and Monitoring. Integrate the JEKF-SANTO as a soft sensor into the biomanufacturing process control system to monitor critical quality attributes (CQAs) and critical process parameters (CPPs) in real time.

5. Empirical Evaluation

In our evaluation, we have the two goals (G1 and G2) that are addressed by answering three Research Questions (RQs) comparing three NSEs: JEKF-Classic, JEKF-SANTO and JEKF-KPH2. First, the goals are the following:

(G1) Experimentally test Theorem 1 (JEKF Failure ) in Section 4.1.3;
(G2) Test whether SANTO can avoid the JEKF failure and compare its performance with KPH2.

Lastly, the research questions are the following:

(RQ1-G1) Is there any variation in the unshared parameter estimation completed by JEKF-Classic with the biomanufacturing conditions (failure case), or are the estimations constant in the entire process?
(RQ2-G2) Is there any variation in the unshared parameter estimation completed by SANTO and KPH2 with the biomanufacturing conditions (failure case), and which one has the best estimations (performance)?
(RQ3-G2) Can the SANTO simultaneously estimate more than one unshared parameter, performing better than KPH2?

5.1. Experimental Setup

5.1.1. Synthetic Dataset—mAb Production

The synthetic dataset (SD) has data regarding Monoclonal Antibody (mAb) productions that represent the biomanufacturing of a protein widely used as diagnostic reagents and for therapeutic purposes [58]. The SD comprises two runs (A-SD and B-SD) with different cell expansions and maximums of the mAb (titer) production. The runs of SD can be seen in Figure 2, and the runs have a sample rate of 7.5 minutes during 103 hours of the process. The runs were generated using the UMM proposed by [59] with small variations in parameters

μ_{m a x}

(maximum growth rate) and QmAb (mAb specific production rate) (see Table S1 of the Supplementary Material) but with the same initial concentrations of states variables (viable cell density (Xv), glucose (GLC), glutamine (GLN), lactate (LAC), ammonium (AMM) and mAb) and with different conditions of pH and temperature as completed in the synthetic dataset of [55]. The run A-SD (red lines in plots of Figure 2) was generated using the original parameters proposed by [59], which are the parameters

μ_{m a x}

=

5.8 \times 10^{- 9} (

h

​^{-})

and QmAb = 7.21 (

\times 10^{- 9}

mg cells

​^{- 1}

h

​^{- 1}

). Run B-SD (blue lines in plots of Figure 2) has the maximum cell expansions and a maximum of mAb (titer) production of SD, and they were obtained with the parameters

μ_{m a x}

=

7.5 \times 10^{- 9} (

h

​^{-})

and QmAb = 9.21 (

\times 10^{- 9}

mg cells

​^{- 1}

h

​^{- 1}

). Furthermore, the run B-SD has samples regarding X

​_{V}

(cell/L) with Gaussian white noise, and they were created by adding the Gaussian white noise with a standard deviation of 20

\times 10^{7}

to the data represented in blue and green lines. The

X_{v}

of B-SD with noise is highlighted in light blue in the first plot. It is essential to point out that X

​_{V}

samples with Gaussian white noise represent a possible online measurement with a sensor that includes noises. This noise is used to evaluate the performance of the NSEs (JEKF-Classic, JEKF-SANTO, and JEKF-KPH2) to estimate mAb and QmAb.

5.1.2. Real Dataset: AAV Production

The real dataset (RD) contains data regarding rAAV productions, which are described and available in [19]. rAAV is a viral vector technology for gene therapy that is considered the safest and most effective way to repair single-gene abnormalities in non-dividing cells [60]. The RD has two runs with online and offline measurements of the state variables viable cell density (Xv), glucose (GLC), glutamine (GLN), lactate (LAC), ammonium (AMM), and rAAV (titer) regarding the rAAV production in shake-flasks and in bioreactors. The run A-RD (production in shake-flasks) has only offline measurements, and the run B-RD (production in bioreactor) has online measurements of Xv and offline measurements of GLC, LAC, and rAAV (titer). The samples of the runs add up to 2902 with a sample rate of 1 minute during 48.3 hours of the process. The details of the real dataset development can be seen in [19].

5.1.3. NSEs Assessment with Synthetic Dataset to Address RQ1-G1 and RQ2-G2

All NSEs (JEKF-Classic, JEKF-SANTO, and JEKF-KPH2) used the UMM described in Section S1.4 of the Supplementary Material as a process model and the same initial concentration regarding the state variables; see Table S2 of the Supplementary Material. The NSEs were used to correct (estimate) the predictions regarding state variables (Xv and mAb) and to evolve the unshared parameter (QmAb) of the process model. This was accomplished using the Xv samples with the noise of the run B-SD as the unique measured state variable and the parameters used to generate the run A-SD as initial parameters of the process model (see Tables S1 and S2 of Supplementary Material). This situation represents a joint estimation problem where the prediction and parameter of the process model should be corrected by the NSEs based on measured state variable Xv with noise. For example, the initial value used for QmAb is the value of run A-SD (QmAb = 7.21

\times 10^{- 9}

mg cells

​^{- 1}

h

​^{- 1}

), and it should be evolved to the value of run B-SD (9.21

\times 10^{- 9}

mg cells

​^{- 1}

h

​^{- 1}

) based on Xv with the noise of run B-SD. Furthermore, the Xv (without noise) and mAb samples of run B-SD were used as ground truth, too. It is important to point out that the estimations were made with MRDE formed by P with correlated elements (MRDE-PC) and uncorrelated elements (MRDE-PU). In addition, MRDE-PC and MRDE-PU were combined with standard and specific P(t = 0) to check the sensitivity of SANTO (with regard to P

​_{M S V, U P} (t = 0)

) and KPH2 (with regard to P

​_{U P, U P} (t = 0)

). The standard P(t = 0) means that all NSEs used the same P(t = 0). On the other hand, the specific P(t = 0) means that each NSE used a different P(t = 0) that enables its best performance. For example, the specific P(t = 0) for SANTO contains a specific value of P

​_{M S V, U P} (t = 0)

, and the specific P(t = 0) for KPH2 includes specific value of P

​_{U P, U P} (t = 0)

. The specific P(t = 0) was obtained by trial and error, and a standard Q was used for all NSEs. For example, the specific P(t = 0) for SANTO contains a specific value of P

​_{M S V, U P} (t = 0) = λ

, and the specific P(t = 0) for KPH2 includes a specific value of P

​_{U P, U P} (t = 0)

. The values of specific P(t = 0) (including

λ

) were obtained by trial and error. Furthermore, a standard and specific Q were also used for all NSEs. In addition, the root-mean-square percentage error (RMSPE) was used as a metric to assess the similarity between NSEs estimations and the ground truth of run B-SD. The details about the design of NSEs with SD can be found in the Section S7.3 of the Supplementary Material.

5.1.4. NSEs Assessment with Real Dataset to Address RQ3-G2

The NSEs (JEKF-Classic, JEKF-SANTO, and JEKF-KPH2) used the UMM described in Section S1.5 of the Supplementary Material as a process model and the same initial concentration regarding the state variables; see Table S8 of the Supplementary Material. These three NSEs were used to correct (estimate) the predictions regarding Xv, GLC, LAC, and rAAV (titer) and to evolve the unshared parameters (

μ_{L a c}

,

μ_{G L C}

and

μ_{r A A V}

) of the process model. This was accomplished using the Xv samples with the noise of the run B-RD as the unique measured state variable and the parameters obtained with the run A-SD as initial parameters (see Table S9 of the Supplementary Material). This situation also represents a joint estimation problem where the predictions and parameters of the process model should be corrected simultaneously by the NSEs based on measured state variable Xv with noise. However, in this case, the NSEs have to correct three different unshared parameters simultaneously based on Xv with the noise of run B-RD. Furthermore, the RMSPE was used as a metric to assess the similarity between NSEs estimations and the ground truth of run B-SD, which are the offline measurements of GLC, LAC and rAAV (titer) of run B-RD. It is essential to point out that the estimations were also completed with MRDE-PC and with specific P(t = 0). The details about the design of NSEs with RD can be found in Section S7.4 of the Supplementary Material.

5.1.5. Checking Consistency and Efficiency

The calibration of standard and specific Q were based on consistency tests, specifically the innovation magnitude bound (IMB) test and the normalized innovations squared (NIS) Chi-square test [61]. These two tests are used to check that the NSEs are performing correctly with Q and R selected [50,62].

IMB Test. It checks that the innovation is consistent with its covariance by verifying that the magnitude of the innovation is bounded by

\pm 2 \sqrt{S_{k}}

. A positive result in this test occurs when at least 95% of the values of the innovation lie within the

\pm 2 \sqrt{S_{k}}

. Figure 3 presents the innovation error sequence for the NSEs configured with MRDE-PC, utilizing specific Q and P(0) settings as detailed in Tables S4–S6 of the Supplementary Material using run B of the synthetic dataset. This figure demonstrates that the innovation errors are unbiased with approximately 95.14% of the values falling within the

\pm 2 \sqrt{S_{k}}

bounds as required. Similar outcomes were observed for NSEs configured with both MRDE-PC and MRDE-PU, and irrespective of whether standard or specific Q and P(0) settings were employed, as shown in Figures S1–S3 using Q and P(0) as detailed in Tables S4, S5 and S7 of the Supplementary Material. Each of these configurations yielded similar innovation error characteristics, underscoring the robustness of the NSEs under varying conditions. Furthermore, similar results were also obtained with run B of the real dataset. Figure 4 presents the innovation error sequence for the NSEs configured with MRDE-PC, utilizing specific Q and P(0) settings as detailed in Tables S10 and S11 of the Supplementary Material. This figure demonstrates that the innovation errors are unbiased with approximately 95.9% of the values falling within the

\pm 2 \sqrt{S_{k}}

bounds as required.

Complementarily, standard error (SE) plots, based on the P matrix’s diagonal, demonstrate the changing uncertainty in state estimates. Filter stability and consistency are indicated by SEs, related to the measured state variable, converging to a stable value. This convergence signifies the adaptability and equilibrium of a filter in making accurate predictions. The alignment of positive innovation test results with this convergent SE (of measured state variable) trend substantiates the overall stability and consistency of a filter. Figure 5 depicts the SE over time of

X_{V}

(measured state variable) estimated by NSEs with a synthetic dataset using MRDE-PC and specific P(0). Initially, these errors exhibited an increase, reflecting a period of adaptation as the filter assimilated the initial data. However, after this initial phase, the standard errors converged around a stable value. This convergence signifies the increasing reliability of the filter in estimating the state of

X_{V}

as it processed more data. The initial increase followed by a steady convergence of the standard errors, in tandem with the favorable innovation test results, compellingly demonstrates the robustness of the NSEs. Similar results were obtained with NSEs with the synthetic dataset using MRDE-PU and specific P(0) (Figure S4 of the Supplementary Material) and with NSEs with the real dataset using MRDE-PC and specific P(0); see Figure 6. It is important to point out that Figures S5–S8 in the Supplementary Material show the normal behavior of standard errors for the state variables (QmAb and mAb) estimated by JEKF-SANTO and JEKF-KPH2 with the synthetic dataset. Similarly, Figures S10–S14 in the Supplementary Material show the standard errors for the state variables (GLC, LAC, rAAV,

μ_{GLC}

,

μ_{LAC}

, and

μ_{rAAV}

) estimated by JEKF-SANTO and JEKF-KPH2 with the synthetic dataset.

NIS Chi-square Test. It verifies that the innovation is unbiased and white by using hypothesis testing (

χ^{2}

test) [50,62]. The NIS is defined as

N I S_{k} = e_{Z, k} S_{k}^{-} e_{Z, k}

, and the mean of NIS is defined as

μ (N I S) = \frac{1}{N} \sum_{k = 1}^{N} e_{Z, k} S_{k}^{-} e_{Z, k}

from a single run of a JEKF. Therefore, the NIS test involves verifying that

μ (N I S)

lies in the confidence interval [r1, r2] defined by the hypothesis

H_{0}

that

N \times μ (N I S)

is

χ_{N m}^{2}

distributed with probability 1 −

α

, such that

P (N \times μ (N I S) \in [r 1, r 2] | H_{0}) = 1 - α

where m is the number of measured state variables and N is the number of samples from the measured state variables. In our case, m = 1 because we have only one measure state variable, and N = 824 for SD and N = 2901 for RD. Furthermore, for the case of a two-sided 95% confidence region, we have

[r 1, r 2] = [χ_{N m}^{2} (0.025), χ_{N m}^{2} (0.975)]

. Therefore, the NIS test of NSEs with the synthetic dataset is concerned with answering the following question: Is

N \times μ (N I S)

inside of

[χ_{824}^{2} (0.025), χ_{824}^{2} (0.975)] = [745.39, 904.39]

where N = 824, such that

P (N \times μ (N I S) \in [745.39, 904.39] | H_{0}) = 1 - α

? All the NSEs designed with the synthetic dataset using the Q, R and P(0) defined in Tables S3–S7 of the Supplementary Material presented the

N \times μ (N I S)

falling inside of the confidence bound defined by the

χ

² test. The NSEs with MRDE-PU had an

N \times μ (N I S) = 835.29

and NSEs with MRDE-PC presented a

N \times μ (N I S) = 830.35

. Furthermore, the NSEs with the real dataset had a positive result in the

χ^{2}

test with the following question: Is

N \times μ (N I S)

inside of

[χ_{2901}^{2} (0.025), χ_{2901}^{2} (0.975)] = [2752.63, 3051.15]

where N = 2901? NSEs with MRDE-PC and, Q, R and P(0) defined in Tables S10 and S11 of the Supplementary Material presented an

N \times μ (N I S) = 2840.55

.

Normalized estimation error squared (NEES) test. It is the metric used to evaluate the efficiency of the JEKF-SANTO as an estimator. This involves verifying that the actual estimation errors (

e_{x, k}

) appropriately match the predictions made by the P(t

​_{k / k}

) [62]. Essentially, if the P(t

​_{k / k}

) predicts a certain degree of uncertainty, it is expected for the real-world errors

e_{x, k}

to match this prediction. This match is crucial for the estimator to be considered accurate and reliable. NEES is calculated as

NEES (k) = e_{x, k}^{⊤} P {(t_{k / k})}^{- 1} e_{x, k}

where

e_{x, k}

is the estimation error at time step k, defined as

e_{x, k} = x (t_{k}) - \hat{x} (t_{k / k})

, with

x (t_{k})

being the true state and

\hat{x} (t_{k / k})

being the estimated state. Then, for the case of a single run, the

NEES (k)

is Chi-square distributed with n

​_{x}

degrees of freedom. In our case, we have n

​_{x} = 3

because we are concerned with evaluating the performance of JEKF-SANTO to estimate the states

X_{V}

, QmAb and mAb of the synthetic dataset. Therefore, we consider a one-sided 95% probability region as seen in Bar-Shalom et al. in [62] for single-run simulation tests with small degrees of freedom. We have the hypothesis H

​_{0}

that JEKF-SANTO’s efficiency (

e_{x, k}

matches P(t

​_{k / k}

)), and H

​_{0}

is accepted if

P (N E E S (k) \leq χ_{3}^{2} (0.95) = 7.815 | H_{0}) = 1 - α

. Figure 7 depicts the result of NEES for the JEKF-SANTO with the synthetic dataset using MRDE-PC and MRDE-PU. The designated upper threshold for the acceptance region is set at 7.815. The majority of the

NEES (k)

values are observed to fall within the defined confidence interval [0,

χ_{3}^{2} (0.95) = 7.815

], which means the estimation error and the covariance are compatible with each other, and the estimation of the JEKF-SANTO is reliable and credible. Moreover, these findings are in alignment with those reported by Bar-Shalom et al. in [62], particularly in the context of single-run simulation tests with a small number of degrees of freedom.

6. Results

The results are organized by research questions RQ1-G1, RQ2-G2 and RQ3-G2.

Answer to RQ1-G1.The results of the experimental test of Theorem 1 (JEKF failure) can be seen in Figure 8 and Figure 9. We also reported the estimations made using JEKF-SANTO and JEKF-KPH2 in regard to Xv, mAb, and QmAb of mAb production (run B-SD) using MRDE-PC and MRDE-PU with specific P(t = 0). In plot A of Figure 8 and Figure 9, we can see that all NSEs estimated the Xv close to the ground truth. However, the JEKF-Classic (purple line) was not able to evolve (update) the unshared parameter QmAb, because the estimations about QmAb were constant and equal to the initial value of 7.21

\times 10^{- 9}

mg cells

​^{- 1}

h

​^{- 1}

during the entire process. Consequently, the JEKF-Classic estimation regarding mAb was far from the ground truth (red dash line) of run B-SD (see plots B and C in Figure 8), and it had a high RMSPE value of 18.65%; see Table 1. The same results regarding the JEKF-Classic were obtained using MRDE-PU; see Figure 9. It is important to point out that the Kalman gain over time obtained by JEKF-Classic with SD is constant and equal to zero using MRDE-PU or MRDE-PC (see Figure 10). Furthermore, the Kalman gain values obtained by JEKF-SANTO with MRDE-PC were more stable than those obtained by JEKF-KPH2.

Answer to RQ2-G2. The results of JEKF-SANTO avoiding the JEKF failure (using runs B-SD ) can be seen in the plots B and C of Figure 8 and Figure 9. In these plots, we can see that JEKF-SANTO (blue line) evolved the QmAb from the initial value to the ground truth (red dash line) and consequently estimated the mAb close to the ground truth of run B-SD (red dash line) with MRDE-PU and MRDE-PC. These results are the opposite of the ones obtained with JEKF-Classic. In addition, JEKF-SANTO had the smallest RMSPE values between the NSEs in all cases; see Table 1. On the other hand, the JEKF-KPH2 did not perform similarly to JEKF-SANTO. The unique case where JEKF-KPH2 (green line) had a good performance was in run B-SD with MRDE-PU with specific P(t = 0). In that case, JEKF-KPH2 estimations were near to the ground truth (red dash line); see plots B and C of Figure 9. However, JEKF-KPH2 did not present stability, and the estimation converged to values far from the ground truth in run B-SD (with MRDE-PC with specific P(t = 0)). The best performances of JEKF-SANTO and JEKF-KPH2 were obtained by the use of specific P(t = 0) because when we used a standard P(t = 0) for JEKF-SANTO and JEKF-KPH2, their estimations are worse with runs B-SD. The results using standard P(t = 0) with runs B-SD can be found in Figures S15 and S16 and Table S12 of the Supplementary Material. These results (with standard and specific P(t = 0)) show that JEKF-KPH2 is sensitive to the initial P

​_{Q m A b, Q m A b} (t = 0)

, and JEKF-SANTO is sensitive to P

​_{X_{V}, Q m A b} (t = 0)

, since their better results were obtained with their specific P(t = 0). Table S4 of the Supplementary Material shows the specific P(t = 0) used in JEKF-KPH2, and Table S5 of the Supplementary Material shows the specific P(t = 0) used in JEKF-SANTO. It is important to point out that the best results of JEKF-SANTO were obtained with P

​_{X_{V}, Q m A b} (t = 0)

with positive values in case of run B-SD; see Table S5 of the Supplementary Material.

Answer to RQ3-G2. In Figure 11, we show the estimations made by JEKF-SANTO and JEKF-KPH2 with regard to Xv, GLC, LAC, rAAV and the three unshared parameters (

μ_{G L C}

,

μ_{L A C}

, and

μ_{r A A V}

) of rAAV production (real dataset) using the MRDE-PC and the specific P(t = 0) and standard Q. In plot A of Figure 11, we can see that JEKF-SANTO and JEKF-KPH2 estimated the Xv inside of the noise range of the real online measurement of Xv by the capacitance probe. The following plots, B, C, and D, show the estimation obtained for the variables GLC, LAC, and rAAV. JEKF-SANTO (blue line) and JEKF-KPH2 (green line) were able to evolve the three unshared parameters simultaneously converging to values that enabled the estimation of GLC, LAC, and rAAV near the ground truth (red points in plots B, C and D). In these plots of Figure 11, and the RMSPE in Table 2, we can see that JEKF-SANTO and JEKF-KPH2 made similar estimations. Nevertheless, JEKF-SANTO had a slightly better performance than JEKF-KPH2 estimating GLC, LAC, and rAAV (titer); see the RMSPE Table 2. It is important to point out that the Kalman gain over time obtained by JEKF-Classic with RD is constant and equal to zero. See Figure 12. Consequently, JEKF-Classic had the worst performance and could not evolve the three unshared parameters simultaneously; see plots of Figure 11, and the RMSPE Table 2.

7. Discussion

Our theoretical and empirical results showed the JEKF failure with biomanufacturing conditions. These results showed that JEKF-Classic could not estimate the unshared parameters and the state simultaneously, since the Kalman gain related to the unshared parameter was constant and equal to zero from the beginning to the end of the processes tested. On the other hand, the results showed that the JEKF-SANTO and JEKF-KPH2 approaches can avoid the JEKF failure. However, the JEKF-SANTO had a more accurate estimation than JEKF-KPH2 while having faster and stable unshared parameters evolution to values that allowed the best performance of the process model tested. It is essential to point out that JEKF-SANTO performed best in two different situations, which were represented by run B-SD with MRDE-PC and MRDE-PC. The best performance of JEKF-KPH2 was only with run B-SD. Furthermore, the results showed that both approaches are sensitive to P(t = 0). JEKF-KPH2 is sensible to the P

​_{U P, U P} (t = 0)

, and JEKF-SANTO is sensible to P

​_{M S V, U P} (t = 0)

. It is essential to point out that the JEKF-SANTO approach did not change the probabilistic view of JEKF, and the minimization cost function in JEKF remained the same. Therefore, the JEKF-SANTO approach can be viewed as an artifact that prevents the Kalman gain from becoming zero with the biomanufacturing conditions (failure case). In addition, the JEKF-SANTO approach only addresses the failure case. It does not solve other issues, such as nonlinearity or high dimensionality, and should be used as a complementary approach. Beyond the SANTO approach, several methods have been established to tackle singularities and convergence issues in EKF. Rank reduction techniques address ill-conditioned covariance matrices by reducing their dimensionality, thus preventing singularities [63]. Time-correlated noise analysis allows for a more accurate state estimation by adjusting noise covariance matrices based on observed temporal correlations in system noise, providing a more realistic noise model [64]. These methods, used in conjunction with JEKF-SANTO, offer a comprehensive approach to EKF optimization in challenging scenarios like biomanufacturing. It is important to note that our analysis did not explicitly consider observability and stability conditions. However, this omission does not invalidate our study. The focus of our research was on addressing a specific failure case of JEKF under certain biomanufacturing conditions. Our proposed solution, SANTO, was developed to specifically address this issue based on experiments with JEKF that are consistent. Therefore, in our study, while we did not explicitly detail the observability and stability analysis in the traditional sense, we implicitly addressed these aspects through empirical evaluation methods. Regarding observability, our approach primarily focused on the empirical performance of the JEKF in the given case study rather than a formal observability analysis.

8. Conclusions

In this work, firstly, we presented the common conditions in biomanufacturing that represent a failure case for the classical JEKF. Secondly, we proved that the classical JEKF, with these conditions, cannot estimate the unshared parameters and the state simultaneously since the Kalman gain related to the unshared parameter is constant and equal to zero in the entire process. Lastly, we presented an approach called SANTO, which is a simple and effective way to address the JEKF failure case by adding a positive quantity (

λ

) regarding the initial state error covariance between a measured state variable and an unshared parameter (P

​_{M S V, U P} (t = 0)

) in P(t = 0). Our empirical evaluation demonstrated that the SANTO approach effectively estimates unshared parameters and states simultaneously, aligning closely with ground truth values in the tested datasets. SANTO notably outperformed both JEKF-Classic and JEKF-KPH2 in accuracy. In a rigorously controlled test using a synthetic dataset, JEKF-SANTO, whether paired with MRDE-PC or MRDE-PU, exhibited a substantial improvement in RMSPE, achieving up to approximately 17% enhancement compared to JEKF-Classic. Meanwhile, JEKF-KPH2 showed an improvement of around 8.7% in RMSPE, but this was limited to its execution with MRDE-PU. This highlights the effectiveness of SANTO in overcoming the limitations of classical JEKF in biomanufacturing applications. Our future works will focus on the development of an auto-tuning mechanism based on an objective function to systematically calibrate Q, R and

λ

, as seen in [65], but also investigate the potential of the Unscented Kalman Filter (UKF) to estimate the unshared parameters and the state simultaneously with the biomanufacturing conditions.

Supplementary Materials

The following supporting information can be downloaded at: www.mdpi.com/xxx/s1, Figure S1: Innovation Magnitude Bound Test using the run B of Synthetic dataset for the NSEs with MRDE-PU and specific Q and P(0); Figure S2: Innovation Magnitude Bound Test using the run B of Synthetic dataset for the NSEs with MRDE-PC and standard Q and P(0); Figure S3: Innovation Magnitude Bound Test using the run B of Synthetic dataset for the NSEs with MRDE-PU and standard Q and P(0); Figure S4: Standard Error of

X_{V}

at each k estimated by NSEs with Synthetic Dataset using MRDE-PU and specific P(0); Figure S5: Standard Error of

Q m A b

at each k estimated by NSEs with Synthetic Dataset using MRDE-PC and specific P(0); Figure S6: Standard Error of

Q m A b

at each k estimated by NSEs with Synthetic Dataset using MRDE-PU and specific P(0); Figure S7: Standard Error of

m A b

at each k estimated by NSEs with Synthetic Dataset using MRDE-PC and specific P(0); Figure S8: Standard Error of

m A b

at each k estimated by NSEs with Synthetic Dataset using MRDE-PU and specific P(0); Figure S9: Standard Error of

G L C

at each k estimated by NSEs with Real Dataset using MRDE-PC and specific P(0); Figure S10: Standard Error of

L A C

at each k estimated by NSEs with Real Dataset using MRDE-PC and specific P(0); Figure S11: Standard Error of

r A A V

at each k estimated by NSEs with Real Dataset using MRDE-PC and specific P(0); Figure S12: Standard Error of

μ G L C

at each k estimated by NSEs with Real Dataset using MRDE-PC and specific P(0); Figure S13: Standard Error of

μ L A C

at each k estimated by NSEs with Real Dataset using MRDE-PC and specific P(0); Figure S14: Standard Error of

μ r A A V

at each k estimated by NSEs with Real Dataset using MRDE-PC and specific P(0); Figure S15: The JEKF-SANTO and JEKF-KPH2 avoid the JEKF failure in B-SD, but they need an specific

P (t = 0)

. First, plot A shows the estimations regards Xv, and all estimations were close the ground truth. The plots B and C show the estimations regards the unshared parameter QmAb and mAb (titer) far from the ground truth, respectively. The NSEs were executed with MRDE-PC and standard $P (t = 0)$ ; Figure S16: The JEKF-SANTO and JEKF-KPH2 avoid the JEKF failure in B-SD, but they need an specific

P (t = 0)

. First, plot A shows the estimations regards Xv, and all estimations were close the ground truth. The plots B and C show the estimations regards the unshared parameter QmAb and mAb (titer) far from the ground truth, respectively. The NSEs were executed with MRDE-PU and standard $P (t = 0)$ .; Table S1: Parameters used in UMM case 1.4 to generate the runs A-SD, and B-SD of Synthetic Dataset (SD); Table S2: Initial conditions of state variables of UMM case 4 for the JEKF test with Synthetic Dataset; Table S3: Standard initial state error covariance matrix (standard P(t = 0)) for JEKF-Classic, JEKF-KPH2 and JEKF-SANTO with run B of Synthetic Dataset; Table S4: Specific initial state error covariance matrix (specific P(t = 0)) for JEKF-KPH2 with run B of Synthetic Dataset; Table S5: Specific initial state error covariance matrix (specific P(t = 0)) for JEKF-SANTO with run B of Synthetic Dataset; Table S6: Measurement noise variance R and error covariance matrix of process model (Q) for the JEKF-Classic, JEKF-SANTO and JEKF-KPH2 with run B of Synthetic Dataset using MRDE-PC; Table S7: Measurement noise variance R and error covariance matrix of process model (Q) for the JEKF-Classic, JEKF-SANTO and JEKF-KPH2 with run B of Synthetic Dataset using MRDE-PU; Table S8: Initial conditions of state variables of UMM case 5 for the JEKF-SANTO and JEKF-KPH2 test with run B-RD (Source [19]); Table S9: Initial parameters obtained with A-RD for the JEKF-SANTO and JEKF-KPH2 test with run B-RD (Source [19]); Table S10: Specific initial state error covariance matrix (specific P(t = 0)) for for the JEKF-Classic, JEKF-SANTO and JEKF-KPH2 with Real Dataset (run B) using MRDE-PC; Table S11: Measurement noise variance R, and error covariance matrix of process model

Q_{i, i}

for the JEKF-Classic, JEKF-SANTO and JEKF-KPH2 with Real Dataset (run B) using MRDE-PC; Table S12: RMSPE between NSEs estimations about mAb and ground truth of run B in synthetic dataset with standardP(t = 0).

Author Contributions

Conceptualization, C.F.I.J.; Implemented algorithms and conducted the experiments, C.F.I.J.; Performed analysis on experimental results and wrote the manuscript, C.F.I.J.; Provided insightful discussions, reviewed the results and revised the manuscript, M.B.; Supervision, M.B.; Project administration, M.B.; funding acquisition, M.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Research Council through AI for Design Challenge Program (operating grants AI4D-103-1).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data and code used in this study are available in github https://github.com/cristovaoiglesias/JEKF-SANTO (accessed on 20 November 2023).

Acknowledgments

We want to acknowledge the support provided by National Research Council Canada through the AI4D grant. We are grateful to Amine A. Kamen and his group at McGill University for providing the data and for providing expert domain knowledge during our meetings and discussion. We would also like to thank Nabil Belacel from National Research Council Canada for his constructive comments and discussions.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

JEKF	Joint estimation of states and parameters with Extended Kalman Filter
NSE	Nonlinear State Estimator
UMM	Unstructured Mechanistic Model
SMM	Structured Mechanistic Model
MRDE	Matrix Ricatti Differential Equation
MSV	Measured State Variable
UP	Unshared parameter
SANTO	Specific initiAl coNdiTiOn
CD-EKF	Continuous-Discrete EKF
RD	Real dataset
SD	Synthetic dataset
rAAV	Recombinant Adeno-Associated Virus
mAb	Monoclonal Antibody

References

Jin, X.B.; Robert Jeremiah, R.J.; Su, T.L.; Bai, Y.T.; Kong, J.L. The new trend of state estimation: From model-driven to hybrid-driven methods. Sensors 2021, 21, 2085. [Google Scholar] [CrossRef] [PubMed]
Alexander, R.; Campani, G.; Dinh, S.; Lima, F.V. Challenges and opportunities on nonlinear state estimation of chemical and biochemical processes. Processes 2020, 8, 1462. [Google Scholar] [CrossRef]
Kalman, R.E. A new approach to linear filtering and prediction problems. J. Basic Eng. Mar 1960, 82, 35–45. [Google Scholar] [CrossRef]
Jazwinski, A. Stochastic Processes and Filtering Theory; Mathematics in Science and Engineering; Academic Press: Cambridge, MA, USA, 1970; Available online: https://books.google.ch/books?id=nGlSNvKyY2MC (accessed on 20 November 2023).
Aswal, N.; Bhattacharya, B.; Sen, S. Joint and Dual Estimation of States and Parameters with Extended and Unscented Kalman Filters. In Recent Developments in Structural Health Monitoring and Assessment–Opportunities and Challenges: Bridges, Buildings and Other Infrastructures; World Scientific: Singapore, 2022; pp. 223–252. [Google Scholar]
Ljung, L. Asymptotic behavior of the extended Kalman filter as a parameter estimator for linear systems. IEEE Trans. Autom. Control 1979, 24, 36–50. [Google Scholar] [CrossRef]
Kopp, R.E.; Orford, R.J. Linear regression applied to system identification for adaptive control systems. Aiaa J. 1963, 1, 2300–2306. [Google Scholar] [CrossRef]
Haykin, S.S.; Haykin, S.S. Kalman Filtering and Neural Networks; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2001; Volume 284. [Google Scholar] [CrossRef]
Cox, H. On the estimation of state variables and parameters for noisy dynamic systems. IEEE Trans. Autom. Control 1964, 9, 5–12. [Google Scholar] [CrossRef]
Urrea, C.; Agramonte, R. Kalman filter: Historical overview and review of its use in robotics 60 years after its creation. J. Sensors 2021, 2021, 1–21. [Google Scholar] [CrossRef]
Aswal, N.; Sen, S.; Mevel, L. Switching Kalman filter for damage estimation in the presence of sensor faults. Mech. Syst. Signal Process. 2022, 175, 109116. [Google Scholar] [CrossRef]
Stojanovic, V.; He, S.; Zhang, B. State and parameter joint estimation of linear stochastic systems in presence of faults and non-Gaussian noises. Int. J. Robust Nonlinear Control 2020, 30, 6683–6700. [Google Scholar] [CrossRef]
Beelen, H.; Bergveld, H.J.; Donkers, M. Joint estimation of battery parameters and state of charge using an extended Kalman filter: A single-parameter tuning approach. IEEE Trans. Control Syst. Technol. 2020, 29, 1087–1101. [Google Scholar] [CrossRef]
Dhanalakshmi, R.; Bhavani, N.; Raju, S.S.; Shaker Reddy, P.C.; Marvaluru, D.; Singh, D.P.; Batu, A. Onboard Pointing Error Detection and Estimation of Observation Satellite Data Using Extended Kalman Filter. Comput. Intell. Neurosci. 2022, 2022, 4340897. [Google Scholar] [CrossRef] [PubMed]
Huang, K.; Yuen, K.V.; Wang, L. Real-time simultaneous input-state-parameter estimation with modulated colored noise excitation. Mech. Syst. Signal Process. 2022, 165, 108378. [Google Scholar] [CrossRef]
Huang, K.; Yuen, K.V. Online dual-rate decentralized structural identification for wireless sensor networks. Struct. Control Health Monit. 2019, 26, e2453. [Google Scholar] [CrossRef]
Yuen, K.V.; Huang, K. Real-time substructural identification by boundary force modeling. Struct. Control Health Monit. 2018, 25, e2151. [Google Scholar] [CrossRef]
Kleyman, V.; Schaller, M.; Mordmuller, M.; Wilson, M.; Brinkmann, R.; Worthmann, K.; Muller, M.A. State and parameter estimation for retinal laser treatment. arXiv 2022, arXiv:2203.12452. [Google Scholar] [CrossRef]
Iglesias, C.F., Jr.; Xu, X.; Mehta, V.; Akassou, M.; Venereo-Sanchez, A.; Belacel, N.; Kamen, A.; Bolic, M. Monitoring the Recombinant Adeno-Associated Virus Production using Extended Kalman Filter. Processes 2022, 10, 2180. [Google Scholar] [CrossRef]
Yousefi-Darani, A.; Paquet-Durand, O.; Hitzmann, B. The Kalman filter for the supervision of cultivation processes. Digital Twins 2020, 177, 95–125. [Google Scholar]
Paquet-Durand, O.; Zettel, V.; Yousefi-Darani, A.; Hitzmann, B. The supervision of dough fermentation using image analysis complemented by a continuous discrete extended Kalman filter. Processes 2020, 8, 1669. [Google Scholar] [CrossRef]
Song, H.; Hu, S. Open Problems in Applications of the Kalman Filtering Algorithm. In Proceedings of the 2019 International Conference on Mathematics, Big Data Analysis and Simulation and Modelling (MBDASM 2019), Changsha, China, 30–31 August 2019; Atlantis Press: Amsterdam, The Netherlands, 2019; pp. 185–190. [Google Scholar] [CrossRef]
Khodarahmi, M.; Maihami, V. A Review on Kalman Filter Models. Arch. Comput. Methods Eng. 2022, 30, 727–747. [Google Scholar] [CrossRef]
Nelson, L.; Stear, E. The simultaneous on-line estimation of parameters and states in linear systems. IEEE Trans. Autom. Control 1976, 21, 94–98. [Google Scholar] [CrossRef]
Herwig, C.; Pörtner, R.; Möller, J. Digital Twins: Tools and Concepts for Smart Biomanufacturing; Springer Nature: Berlin/Heidelberg, Germany, 2021; Volume 176. [Google Scholar] [CrossRef]
Herwig, C.; Pörtner, R.; Möller, J. Digital Twins: Applications to the Design and Optimization of Bioprocesses; Springer Nature: Berlin/Heidelberg, Germany, 2021; Volume 177. [Google Scholar] [CrossRef]
Sinner, P.; Daume, S.; Herwig, C.; Kager, J. Usage of Digital Twins Along a Typical Process Development Cycle. In Digital Twins: Tools and Concepts for Smart Biomanufacturing; Herwig, C., Pörtner, R., Möller, J., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 71–96. [Google Scholar] [CrossRef]
Moser, A.; Appl, C.; Brüning, S.; Hass, V.C. Mechanistic mathematical models as a basis for digital twins. Digit. Twins 2020, 176, 133–180. [Google Scholar]
Narayanan, H.; Sokolov, M.; Morbidelli, M.; Butté, A. A new generation of predictive models: The added value of hybrid models for manufacturing processes of therapeutic proteins. Biotechnol. Bioeng. 2019, 116, 2540–2549. [Google Scholar] [CrossRef] [PubMed]
Fernandes-Platzgummer, A.; Badenes, S.M.; da Silva, C.L.; Cabral, J.M. Bioreactors for Stem Cell and Mammalian Cell Cultivation. Bioprocess. Technol. Prod. Biopharm. Bioprod. 2018, 4, 131–173. [Google Scholar]
Iglesias, C.F., Jr.; Ristovski, M.; Bolic, M.; Cuperlovic-Culf, M. rAAV Manufacturing: The Challenges of Soft Sensing during Upstream Processing. Bioengineering 2023, 10, 229. [Google Scholar] [CrossRef] [PubMed]
Gargalo, C.L.; Udugama, I.; Pontius, K.; Lopez, P.C.; Nielsen, R.F.; Hasanzadeh, A.; Mansouri, S.S.; Bayer, C.; Junicke, H.; Gernaey, K.V. Towards smart biomanufacturing: A perspective on recent developments in industrial measurement and monitoring technologies for bio-based production processes. J. Ind. Microbiol. Biotechnol. Off. J. Soc. Ind. Microbiol. Biotechnol. 2020, 47, 947–964. [Google Scholar] [CrossRef] [PubMed]
Udugama, A.; Öner, M.; Lopez, P.C.; Beenfeldt, C.; Bayer, C.; Huusom, J.K.; Gernaey, K.V.; Sin, G. Towards Digitalization in Bio-Manufacturing Operations: A Survey on Application of Big Data and Digital Twin Concepts in Denmark. Front. Chem. Eng. 2021, 3, 727152. [Google Scholar] [CrossRef]
Luo, Y.; Kurian, V.; Ogunnaike, B.A. Bioprocess systems analysis, modeling, estimation, and control. Curr. Opin. Chem. Eng. 2021, 33, 100705. [Google Scholar] [CrossRef]
Wang, K.; Li, Y.; Rizos, C. Practical approaches to Kalman filtering with time-correlated measurement errors. IEEE Trans. Aerosp. Electron. Syst. 2012, 48, 1669–1681. [Google Scholar] [CrossRef]
Ji, Z.; Brown, M. Joint state and parameter estimation for biochemical dynamic pathways with iterative extended Kalman filter: Comparison with dual state and parameter estimation. Open Autom. Control Syst. J. 2009, 2, 69–77. [Google Scholar] [CrossRef]
Mariani, S.; Corigliano, A. Impact induced composite delamination: State and parameter identification via joint and dual extended Kalman filters. Comput. Methods Appl. Mech. Eng. 2005, 194, 5242–5272. [Google Scholar] [CrossRef]
Ljung, L.; Söderström, T. Theory and Practice of Recursive Identification; MIT Press: Cambridge, MA, USA, 1983; Volume 4, pp. 136–249. [Google Scholar]
Kyriakopoulos, S.; Ang, K.S.; Lakshmanan, M.; Huang, Z.; Yoon, S.; Gunawan, R.; Lee, D.Y. Kinetic modeling of mammalian cell culture bioprocessing: The quest to advance biomanufacturing. Biotechnol. J. 2018, 13, 1700229. [Google Scholar] [CrossRef] [PubMed]
Park, S.Y.; Park, C.H.; Choi, D.H.; Hong, J.K.; Lee, D.Y. Bioprocess digital twins of mammalian cell culture for advanced biomanufacturing. Curr. Opin. Chem. Eng. 2021, 33, 100702. [Google Scholar] [CrossRef]
Tsopanoglou, A.; del Val, I.J. Moving towards an era of hybrid modelling: Advantages and challenges of coupling mechanistic and data-driven models for upstream pharmaceutical bioprocesses. Curr. Opin. Chem. Eng. 2021, 32, 100691. [Google Scholar] [CrossRef]
Mears, L.; Stocks, S.M.; Albaek, M.O.; Sin, G.; Gernaey, K.V. Mechanistic fermentation models for process design, monitoring, and control. Trends Biotechnol. 2017, 35, 914–924. [Google Scholar] [CrossRef] [PubMed]
Reyes, S.J.; Durocher, Y.; Pham, P.L.; Henry, O. Modern Sensor Tools and Techniques for Monitoring, Controlling, and Improving Cell Culture Processes. Processes 2022, 10, 189. [Google Scholar] [CrossRef]
Zhang, D.; Del Rio-Chanona, E.A.; Petsagkourakis, P.; Wagner, J. Hybrid physics-based and data-driven modeling for bioprocess online simulation and optimization. Biotechnol. Bioeng. 2019, 116, 2919–2930. [Google Scholar] [CrossRef]
Kourti, T. Multivariate Statistical Process Control and Process Control, Using Latent Variables; Elsevier: Amsterdam, The Netherlands, 2020. [Google Scholar] [CrossRef]
Brockwell, P. Time series analysis. Encycl. Stat. Behav. Sci. 2005. [Google Scholar] [CrossRef]
Ohadi, K.; Legge, R.L.; Budman, H.M. Development of a soft-sensor based on multi-wavelength fluorescence spectroscopy and a dynamic metabolic model for monitoring mammalian cell cultures. Biotechnol. Bioeng. 2015, 112, 197–208. [Google Scholar] [CrossRef]
Assimakis, N.; Adam, M. Kalman filter Riccati equation for the prediction, estimation, and smoothing error covariance matrices. Int. Sch. Res. Not. 2013, 2013, 249594. [Google Scholar] [CrossRef]
Kulikova, M.V.; Kulikov, G.Y. Adaptive ODE solvers in extended Kalman filtering algorithms. J. Comput. Appl. Math. 2014, 262, 205–216. [Google Scholar] [CrossRef]
Särkkä, S.; Svensson, L. Bayesian Filtering and Smoothing; Cambridge University Press: Cambridge, UK, 2023; Volume 17. [Google Scholar] [CrossRef]
Murphy, K.P. Machine Learning: A Probabilistic Perspective; MIT Press: Cambridge, MA, USA, 2012; Volume 1, pp. 631–660. [Google Scholar]
Haldar, A.; Al-hussein, A.A.A. Recent Developments in Structural Health Monitoring and Assessment-Opportunities and Challenges: Bridges, Buildings and Other Infrastructures; World Scientific: Singapore, 2022. [Google Scholar] [CrossRef]
Goudar, C.T. Computer programs for modeling mammalian cell batch and fed-batch cultures using logistic equations. Cytotechnology 2012, 64, 465–475. [Google Scholar] [CrossRef]
Kornecki, M.; Strube, J. Accelerating biologics manufacturing by upstream process modelling. Processes 2019, 7, 166. [Google Scholar] [CrossRef]
Narayanan, H.; Behle, L.; Luna, M.F.; Sokolov, M.; Guillén-Gosálbez, G.; Morbidelli, M.; Butté, A. Hybrid-EKF: Hybrid Model coupled with Extended Kalman Filter for real-time monitoring and control of mammalian cell culture. Biotechnol. Bioeng. 2020, 117, 2703–2714. [Google Scholar] [CrossRef] [PubMed]
Ueno, G.; Nakamura, N. Bayesian estimation of the observation-error covariance matrix in ensemble-based filters. Q. J. R. Meteorol. Soc. 2016, 142, 2055–2080. [Google Scholar] [CrossRef]
Michel, V.; Gramfort, A.; Varoquaux, G.; Eger, E.; Thirion, B. Total variation regularization for fMRI-based prediction of behavior. IEEE Trans. Med Imaging 2011, 30, 1328–1340. [Google Scholar] [CrossRef] [PubMed]
Jyothilekshmi, I.; Jayaprakash, N. Trends in monoclonal antibody production using various bioreactor systems. J. Microbiol. Biotechnol. 2021, 31, 349–357. [Google Scholar] [CrossRef]
Liu, Y.; Gunawan, R. Bioprocess optimization under uncertainty using ensemble modeling. J. Biotechnol. 2017, 244, 34–44. [Google Scholar] [CrossRef]
Bulcha, J.T.; Wang, Y.; Ma, H.; Tai, P.W.; Gao, G. Viral vector platforms within the gene therapy landscape. Signal Transduct. Target. Ther. 2021, 6, 53. [Google Scholar] [CrossRef]
Evangelidis, A.; Parker, D. Quantitative verification of Kalman filters. Form. Asp. Comput. 2021, 33, 669–693. [Google Scholar] [CrossRef]
Bar-Shalom, Y.; Li, X.R.; Kirubarajan, T. Estimation with Applications to Tracking and Navigation: Theory Algorithms and Software; John Wiley & Sons: Hoboken, NJ, USA, 2001; Volume 5, pp. 199–265. [Google Scholar]
Simon, D. Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches; John Wiley & Sons: Hoboken, NJ, USA, 2006. [Google Scholar] [CrossRef]
Maybeck, P.S. Stochastic Models, Estimation, and Control; Academic Press: Cambridge, MA, USA, 1982; Volume 3, pp. 126–132. [Google Scholar]
Boulkroune, B.; Geebelen, K.; Wan, J.; van Nunen, E. Auto-tuning extended Kalman filters to improve state estimation. In Proceedings of the 2023 IEEE Intelligent Vehicles Symposium (IV), Anchorage, AK, USA, 4–7 June 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6. [Google Scholar] [CrossRef]

Figure 1. The basic steps to develop a soft sensor for bioprocess monitoring based on JEKF-SANTO.

Figure 2. Synthetic dataset regarding mAb production. The run A-SD (red lines) was generated using the original parameters proposed by [59]. Run B-SD (blue lines) has the maximum cell expansions and the maximum mAb (titer) production of SD. The

X_{v}

of B-SD with noise is highlighted in light blue in the first plot. This noise is used to evaluate the performance of the NSE to estimate mAb and QmAb.

Figure 2. Synthetic dataset regarding mAb production. The run A-SD (red lines) was generated using the original parameters proposed by [59]. Run B-SD (blue lines) has the maximum cell expansions and the maximum mAb (titer) production of SD. The

X_{v}

of B-SD with noise is highlighted in light blue in the first plot. This noise is used to evaluate the performance of the NSE to estimate mAb and QmAb.

Figure 3. Innovation magnitude bound test using the run B of synthetic dataset for the NSEs with MRDE-PC and specific Q and P(0).

Figure 4. Innovation magnitude bound test using the run B of real dataset for the NSEs with MRDE-PC and specific Q and P(0).

Figure 5. Standard error of

X_{V}

at each k estimated by NSEs with synthetic dataset (run B) using MRDE-PC and specific P(0).

Figure 5. Standard error of

X_{V}

at each k estimated by NSEs with synthetic dataset (run B) using MRDE-PC and specific P(0).

Figure 6. Standard error of

X_{V}

at each k estimated by NSEs with real dataset using MRDE-PC and specific P(0).

Figure 6. Standard error of

X_{V}

at each k estimated by NSEs with real dataset using MRDE-PC and specific P(0).

Figure 7. NEES test of JEKF-SANTO with synthetic dataset. In case of JEKF-SANTO with MRDE-PC with specific P(0), we have that 100% of all NEES computed are found inside the one-sided 95% probability region where the 5% tail is

χ_{3}^{2} (0.95) = 7.815

(upper limit). In case of JEKF-SANTO with MRDE-PU with specific P(0), we have 98.4% of the NEES inside of confident interval [0,

χ_{3}^{2} (0.95) = 7.815

].

Figure 7. NEES test of JEKF-SANTO with synthetic dataset. In case of JEKF-SANTO with MRDE-PC with specific P(0), we have that 100% of all NEES computed are found inside the one-sided 95% probability region where the 5% tail is

χ_{3}^{2} (0.95) = 7.815

(upper limit). In case of JEKF-SANTO with MRDE-PU with specific P(0), we have 98.4% of the NEES inside of confident interval [0,

χ_{3}^{2} (0.95) = 7.815

].

Figure 8. Experimental test of the theorem (JEKF failure) and the JEKF-SANTO to avoid the JEKF failure with the biomanufacturing conditions (failure case). This experiment used run B of the synthetic dataset, and plot (A) shows that all estimations with regard to Xv were close to the ground truth. Plots (B,C) show the estimations with regard to the unshared parameter QmAb and mAb (titer), respectively. The JEKF-SANTO was able to evolve QmAb with convergence to the ground truth value, but JEKF-KPH2 and JEKF-Classic failed. They were not able to evolve the mAb. All NSEs were executed with MRDE-PC and specific P(t = 0).

Figure 9. Experimental test that JEKF-Classic cannot avoid the JEKF failure with run B of the synthetic dataset. First, plot (A) shows the estimations regarding Xv, and all estimations were close to the ground truth. The plots (B,C) show the estimations regarding the unshared parameter QmAb and mAb (titer), respectively. All NSEs evolved QmAb with convergence to the ground truth value except JEKF-Classic. All NSEs were executed with MRDE-PU and specific P

​_{U P, U P} (t = 0)

.

Figure 9. Experimental test that JEKF-Classic cannot avoid the JEKF failure with run B of the synthetic dataset. First, plot (A) shows the estimations regarding Xv, and all estimations were close to the ground truth. The plots (B,C) show the estimations regarding the unshared parameter QmAb and mAb (titer), respectively. All NSEs evolved QmAb with convergence to the ground truth value except JEKF-Classic. All NSEs were executed with MRDE-PU and specific P

​_{U P, U P} (t = 0)

.

Figure 10. Kalmain gain over time for the NSEs with run B of synthetic dataset. In all cases, JEKF-Classic is constant and equal to zero.

Figure 11. Simultaneous unshared parameters estimation by JEKF-SANTO and JEKF-KPH2 with real dataset (rAAV production). Plot (A) shows the estimations regarding Xv, and all estimations were inside of the noise range of the real online measurement of Xv by the capacitance probe. Plots (B–D) show the estimation obtained regarding the variables GLC, LAC, and rAAV, respectively. In these plots, we can see that JEKF-SANTO and JEKF-KPH2 had similar estimations. They evolved the

μ_{G L C}

,

μ_{L A C}

, and

μ_{r A A V}

(unshared parameters) with convergence, and their estimations related to GLC, LAC, and rAAV were close to the ground truth (red points); see plots (E–G). All NSEs were executed with MRDE-PC and specific P(t = 0).

Figure 11. Simultaneous unshared parameters estimation by JEKF-SANTO and JEKF-KPH2 with real dataset (rAAV production). Plot (A) shows the estimations regarding Xv, and all estimations were inside of the noise range of the real online measurement of Xv by the capacitance probe. Plots (B–D) show the estimation obtained regarding the variables GLC, LAC, and rAAV, respectively. In these plots, we can see that JEKF-SANTO and JEKF-KPH2 had similar estimations. They evolved the

μ_{G L C}

,

μ_{L A C}

, and

μ_{r A A V}

(unshared parameters) with convergence, and their estimations related to GLC, LAC, and rAAV were close to the ground truth (red points); see plots (E–G). All NSEs were executed with MRDE-PC and specific P(t = 0).

Figure 12. Kalmain gain over time for the NSEs with run B of real dataset. In all cases, the JEKF-Classic is constant and equal to zero.

Table 1. RMSPE between NSEs estimations about mAb and ground truth of run B in synthetic dataset with specific P(t = 0).

NSE	RMSPE (MRDE-PU)	RMSPE (MRDE-PC)
JEKF-SANTO	2.06%	1.30%
JEKF-KPH2	10.11%	48.44%
JEKF-Classic	18.80%	18.65%

Table 2. RMSPE between NSEs estimations and ground truth of real dataset with MRDE-PC, specific P(t = 0) and (standard Q).

Ground Truth	JEKF-Classic	JEKF-SANTO	JEKF-KPH2
GLC	11.6%	3.48%	3.58%
LAC	16.66%	1.01%	1.61%
rAAV (titer)	41.41%	2.87%	3.28%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Iglesias, C.F., Jr.; Bolic, M. How Not to Make the Joint Extended Kalman Filter Fail with Unstructured Mechanistic Models. Sensors 2024, 24, 653. https://doi.org/10.3390/s24020653

AMA Style

Iglesias CF Jr., Bolic M. How Not to Make the Joint Extended Kalman Filter Fail with Unstructured Mechanistic Models. Sensors. 2024; 24(2):653. https://doi.org/10.3390/s24020653

Chicago/Turabian Style

Iglesias, Cristovão Freitas, Jr., and Miodrag Bolic. 2024. "How Not to Make the Joint Extended Kalman Filter Fail with Unstructured Mechanistic Models" Sensors 24, no. 2: 653. https://doi.org/10.3390/s24020653

APA Style

Iglesias, C. F., Jr., & Bolic, M. (2024). How Not to Make the Joint Extended Kalman Filter Fail with Unstructured Mechanistic Models. Sensors, 24(2), 653. https://doi.org/10.3390/s24020653

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

How Not to Make the Joint Extended Kalman Filter Fail with Unstructured Mechanistic Models

Abstract

1. Introduction

2. Related Work

3. Background

3.1. Unstructured Mechanistic Model (UMM)

3.2. Continuous-Discrete Extended Kalman Filter

3.3. JEKF

4. Theoretical Analysis

4.1. JEKF Failure

4.1.1. Failure Case: Biomanufacturing Conditions

4.1.2. Lemma: Inability to Update Kalman Gain for Unshared Parameters based P(t = 0) and Q with Uncorrelated Elements

4.1.3. Theorem: JEKF Failure

4.2. SANTO: Specific Initial Condition for MRDE ( P M S V , U P ( t = 0 ) ≠ 0 i n P 0 )

5. Empirical Evaluation

5.1. Experimental Setup

5.1.1. Synthetic Dataset—mAb Production

5.1.2. Real Dataset: AAV Production

5.1.3. NSEs Assessment with Synthetic Dataset to Address RQ1-G1 and RQ2-G2

5.1.4. NSEs Assessment with Real Dataset to Address RQ3-G2

5.1.5. Checking Consistency and Efficiency

6. Results

7. Discussion

8. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.2. SANTO: Specific Initial Condition for MRDE ( $P_{M S V, U P} (t = 0) \neq 0 i n$ $P_{0}$ )