Data-Driven Koopman Based System Identification for Partially Observed Dynamical Systems with Input and Disturbance

Ketthong, Patinya; Samkunta, Jirayu; Mai, Nghia Thi; Kamal, Md Abdus Samad; Murakami, Iwanori; Yamada, Kou

doi:10.3390/sci6040084

Open AccessArticle

Data-Driven Koopman Based System Identification for Partially Observed Dynamical Systems with Input and Disturbance

by

Patinya Ketthong

^*,

Jirayu Samkunta

,

Nghia Thi Mai

,

Md Abdus Samad Kamal

,

Iwanori Murakami

and

Kou Yamada

^*

Graduate School of Science and Technology, Gunma University, Maebashi 376-8515, Japan

^*

Authors to whom correspondence should be addressed.

Sci 2024, 6(4), 84; https://doi.org/10.3390/sci6040084

Submission received: 13 August 2024 / Revised: 31 October 2024 / Accepted: 12 December 2024 / Published: 19 December 2024

Download

Browse Figures

Versions Notes

Abstract

The identification of dynamical systems from data is essential in control theory, enabling the creation of mathematical models that accurately represent the behavior of complex systems. However, real-world applications often present challenges such as the unknown dimensionality of the system and limited access to measurements, particularly in partially observed systems. The Hankel alternative view of Koopman (HAVOK) method offers a data-driven approach to identify linear representations of nonlinear systems, but it often overlooks the influence of external control signals (inputs) and disturbances. This paper introduces a novel input-aware modeling method for unstable linear systems using data-driven Koopman analysis. By explicitly incorporating the impact of inputs and disturbances, our method enhances the accuracy and robustness of system identification, even in the face of incomplete observations. The proposed approach leverages Koopman operator theory on augmented state-input data to capture both the intrinsic dynamics and the system’s sensitivity to external control. Through extensive numerical examples, we demonstrate the effectiveness of our method in accurately identifying and predicting the behavior of various dynamical systems, including real-world nonlinear systems and simulated unstable linear systems with and without disturbances. The results highlight the potential of our approach to advance the field of system identification and control, offering a powerful tool for modeling and analyzing complex systems in diverse applications.

Keywords:

system identification; sparse modeling; HAVOK; koopman; dynamical systems

1. Introduction

In the field of scientific and engineering, it is essential to have an accurate mathematical model of dynamical systems, as it underpins control design, optimization, and performance analysis [1,2,3,4]. However, real-world systems are often intricate, demanding a deep understanding of their dynamics for optimal behavior. A major challenge arises in practical applications where only a subset of the system’s state variables are directly measurable. This limitation, known as the closure problem [5,6], renders direct inference of the complete state vector

(x)

from the available measurements

(y)

impossible.

The closure problem presents a significant hurdle for scientific modeling due to several factors. Limited access to observations can render closed-form models infeasible, making the construction of explicit models for hidden variables impractical [7]. This ultimately hinders the accurate simulation and prediction of the system’s behavior.

Conversely, statistical learning, machine learning, and system identification are interrelated fields that deal with extracting knowledge from data. At its core, this knowledge extraction process aims to transform raw data into actionable insights and predictive models. Statistical learning provides the foundation by analyzing data to reveal hidden patterns and relationships [8,9]. These patterns can range from simple correlations to complex, multi-dimensional structures. Machine learning builds on this, creating algorithms that automatically learn and improve on specific tasks from data, such as classification or prediction [10]. This learning process often involves adjusting internal parameters to minimize errors and optimize performance. System identification, a specialized field within machine learning, focuses on building mathematical models of dynamical systems from input-output data [11,12]. These models can be used to simulate, predict, and control the behavior of complex systems like robots, power grids, and even biological processes. In essence, statistical learning provides the groundwork for understanding data, machine learning allows algorithms to learn and improve on tasks, and system identification leverages this learning to create models of real-world systems.

However, traditional system identification techniques heavily rely on input-output (I/O) data, model structure (order and type), and selection criteria (rank and cost function) [13]. However, this approach presents a significant challenge to balancing model complexity with real-world applicability. While a high-order model can theoretically capture the exact dynamics of a system, regardless of its nature (linear/nonlinear, time-invariant/time-variant), it comes at a cost [14,15,16,17]. Increased model complexity can lead to difficulties in validation, as the model may fit noise alongside the actual system behavior. Additionally, designing control algorithms for such high-order models becomes cumbersome, hindering practical implementation.

On the other hand, Koopman operator theory offers a robust framework for identifying and controlling nonlinear dynamical systems by transforming them into a higher-dimensional linear space [18]. Through the utilization of singular value decomposition (SVD) for approximate the Koopman operator, recent advancements have focused on data-driven methods that leverage input-output data to construct Koopman-based models capable of handling disturbances and inputs effectively [19,20]. For instance, these methods have been applied to control robotic systems with high precision, despite the presence of nonlinearities and external disturbances [21]. Similarly, they have been used to design power system stabilizers that enhance the stability of power networks under various disturbance conditions [22,23,24].

Moreover, the Koopman operator framework has been extended to incorporate control inputs, leading to the development of bilinear models that offer enhanced control capabilities [25,26]. These models allow for the integration of modern control techniques, such as model predictive control (MPC), to manage complex dynamical systems efficiently [27,28]. While Koopman operator theory offers significant advantages, a key challenge lies in its infinite dimensionality. For practical applications, finite-dimensional approximations are necessary. Existing methods like Dynamic Mode Decomposition (DMD) [29,30], Extended DMD (EDMD) [31,32], and Tunable Accuracy Symmetric Subspace Decomposition (T-SSD) [33].

In this paper, the Koopman operator is applied within the Hankel Alternative View of Koopman (HAVOK) method for scenarios involving partially observed systems by proposing a novel-aware modeling method for unstable linear systems. The traditional HAVOK method offers a data-driven approach to identifying nonlinear systems through linear representations that focus on state dynamics, our proposed method comprises the influence of the input and offers robustness to external disturbances. The organization of this work as follows. In Section 2, we represent the problem formulation and the brief reviews of the Koopman operator. Moving on, Section 3 describes the proposed method for identifying an input influences dynamical system through a data-driven Koopman operator, Low rank approximation method also was presented to approximate a given matrix with lower rank based on linear algebra technique. In Section 4, we showcase numerical examples to demonstrate the effectiveness of the proposed method based on 3 cases of dynamical systems (real-world nonlinear system, simulated unstable linear system, and simulated an unstable linear system with input disturbance). Finally, we conclude the proposed approach to the system identification based on the data-driven Koopman Analysis with a novel input-aware modeling technique in Section 5.

2. Background

2.1. Problem Formulation

In system identification scheme, due to the fact that most of real-world dynamical systems are nonlinear, the main objective is to identify the nonlinear dynamical system

\dot{x} (t) = f (x (t), u (t)),

(1)

where

x (t) \in R^{n}

denotes the state variable,

u (t) \in R^{m}

represents the input, and

f (\cdot, \cdot)

is a nonlinear function that describe the dynamics. According to Koopman Operator Theory [34], the system (1) can be linearized and approximate to linear state space model of the form

\{\begin{matrix} \dot{x} (t) = A x (t) + B u (t) \\ y (t) = C x (t) + D u (t) \end{matrix},

(2)

where

y (t) \in R^{l}

signifies the measured output.

A \in R^{n \times n}

,

B \in R^{n \times m}

,

C \in R^{l \times n}

, and

D \in R^{l \times n}

are the matrices with appropriate size to be determined.

This poses a challenge in determining whether the available measurements are sufficient for accurately modeling the system. For instance, the dimension l of the measured output might be smaller than the actual state dimension n of the state space. In this work, we focus specifically on the scenario where

D = 0

and

l = 1

, meaning the system has no feed-forward and we have access to only a one-dimensional time-series measurement for the system.

It’s important to note that the dimensionality of the underlying dynamics is often unknown in practice, and the choice of measurements may be limited by various constraints.

2.2. A Brief Review on Koopman Operator Theory

In 1931, B.O. Koopman established a foundational framework for the analysis of nonlinear dynamical systems, demonstrating that any autonomous nonlinear system can be represented as a linear system of infinite order within a Hilbert space [34].

Consider a continuous-time autonomous nonlinear dynamical system described by the following equation:

\dot{x} (t) = F (x),

(3)

where

F : R^{n} \to R^{n}

defines the system dynamics. We are interested in a set of observables, denoted by

g (x) : R^{n} \to R

, which are scalar-valued functions of the state that represent the system’s output. The Koopman operator, denoted by

K \in B (x) \to B (x)

, where

B (x)

is the space of bounded measurable functions, acts on observable functions

g \in B (x)

according to

K g (x) = g \circ F (x) .

(4)

where ∘ represents the composition operator, which essentially applies the function

F

to the state vector and then evaluates g on the resulting state. Then, a subspace

H \subset B (x)

is said to be Koopman-invariant if

\begin{matrix} K (H) \subseteq H . \end{matrix}

(5)

In other words, a Koopman-invariant subspace is a collection of observable functions such that the Koopman operator, when applied to any function within the subspace, also produces a function within the same subspace. This property allows us to restrict the Koopman operator to the Koopman-invariant subspace, effectively reducing the problem to a finite-dimensional analysis within

H

.

By analyzing the spectral properties of the Koopman operator, valuable insights into the system’s behavior can be obtained. These properties enable the expression of system dynamics as a linear system in a higher-dimensional space as

\dot{x} (t) = K g (x) = A x (t) .

(6)

In this equation, A represents a linear operator derived from the Koopman operator and g. It governs the state propagation over time, despite the original system dynamics being nonlinear.

There are crucial to determine Koopman-invariant subspace and compute the matrix A in (6) as the choice of observable functions significantly impacts the resulting Koopman-invariant subspace and the accuracy of the finite-dimensional approximation. To address this challenge, various methods have been developed to identify Koopman-invariant subspaces and compute the matrix A in (6). One widely used method is Dynamic Mode Decomposition (DMD) [29,30,31,32]. DMD leverages a technique called Singular Value Decomposition (SVD) to factorize a data-driven approximation of the system’s state transition matrix. This factorization reveals the system’s dynamic modes, ordered by their decreasing magnitude.

3. Proposed Method

This section details the proposed methodology for identifying how an input influences a dynamical system through a data-driven Koopman operator. Additionally, we include the technique used for low-rank approximation.

3.1. Data-Driven Koopman Operator

Consider a Least Square Estimation (LSE) problem of the form:

\dot{y} = Θ (Y) Ξ^{T},

(7)

where

\dot{y}

is a time-derivative of vector y, the library matrix

Θ (Y)

is

Θ (Y) = [\begin{matrix} | & | & | & \dots & | \\ y_{1} & y_{2} & y_{3} & \dots & y_{r} \\ | & | & | & \dots & | \end{matrix}],

(8)

where

Ξ \in R^{(r + 1) \times r}

. Each row

ξ_{k}

in

Ξ

is a vector of coefficients determining the active terms in the k-th column in the library matrix (8). Then we construct the Hankel matrix of measured data

y (t)

as

H = [\begin{matrix} y (t_{1}) & y (t_{2}) & \dots & y (t_{p}) \\ y (t_{2}) & y (t_{3}) & \dots & y (t_{p + 1}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ y (t_{q}) & y (t_{q + 1}) & \dots & y (t_{m}) \end{matrix}] .

(9)

This Hankel matrix can be related to the Koopman operator:

\begin{matrix} H = [\begin{matrix} K y (t_{1}) & K^{2} y (t_{1}) & \dots & K^{p} y (t_{1}) \\ K^{2} y (t_{1}) & K^{3} y (t_{1}) & \dots & K^{p + 1} y (t_{1}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ K^{q} y (t_{1}) & K^{q + 1} y (t_{1}) & \dots & K^{m} y (t_{1}) \end{matrix}], \end{matrix}

(10)

by the used of SVD, as it will be described later, the low rank approximation of H is defined as

\hat{H} \approx U_{r} Σ_{r} V_{r}^{T},

(11)

where

\hat{H}

is rank r approximation of H.

Mirroring the structure of the initial LSE problem, we formulate another LSE problem using the matrix

V_{r}

which represents the system dynamics:

{\dot{V}}_{r} = Θ (V_{r}) Ξ^{T} .

(12)

where

{\dot{V}}_{r} = {[\begin{matrix} {\dot{v}}_{1} & {\dot{v}}_{2} & \dots & {\dot{v}}_{r} \end{matrix}]}^{T}

,

Ξ \in R^{r \times r + 1}

each column

ξ_{k}

in

Ξ

is a vector of coefficients determining the active terms in the kth column in

Θ (V_{r})

.

Given that the measurement data

y (t)

is sampled with a period

Δ t

, we approximate the second-order central derivative of the unitary matrix

V_{r} (t)

with approximation error of

O (Δ t^{2})

\begin{matrix} {\dot{V}}_{r} = \frac{d V_{r} (t)}{d t} \approx \frac{V_{r} (t + 1) - V_{r} (t - 1)}{2 Δ t} + O (Δ t^{2}) . \end{matrix}

(13)

To incorporate the input, the input vector

u (t)

has been append to the last column of the library matrix

Θ (V_{r})

as

Θ (V_{r}) = [\begin{matrix} | & | & \dots & | & | \\ v_{1} & v_{2} & \dots & v_{r} & u (t) \\ | & | & \dots & | & | \end{matrix}] .

(14)

We can reinterpret the LSE problem as:

\dot{V_{r}} = Θ (V_{r}) ξ_{1} + Θ (V_{r}) ξ_{2} + \dots + Θ (V_{r}) ξ_{r + 1} .

(15)

Consequently, we find the values in

Ξ

by solving a convex

l_{1}

regularized Least Absolute Deviations (LAD) regression where

ϵ

is an arbitrary small positive number:

\begin{matrix} minimize & ∥ ξ_{k} ∥_{1} \\ subject to & ∥ {\dot{v}}_{k} - Θ (V) ξ_{k} ∥_{1} \leq ϵ . \end{matrix}

(16)

The main algorithm (Algorithm 1) performs minimization of the

L 1

norm with an

L 1

constraint to find an optimal matrix

Ξ

that best represents the relationship between the matrices

\dot{V}

and

Θ .

It begins by initializing

Ξ

and iteratively updating it until convergence is achieved. In each iteration, a subproblem (Algorithm 2) is solved to refine

Ξ

. The subproblem calculates the residual

R = \dot{V} - Θ Ξ

and finds a feasible direction D that aligns with the sign of R, adjusting

Ξ

in a way that minimizes the

L 1

norm. A line search is conducted to determine the optimal step size

α

that minimizes

{∥ Ξ + α D ∥}_{1}

while maintaining the constraint

∥ Θ (Ξ + α D) - \dot{V} ∥_{1} \leq ϵ

. This approach ensures that

Ξ

is iteratively refined under the

L 1

constraint, balancing sparsity and accuracy. The iterative process terminates once

Ξ

stabilizes, achieving a solution that minimizes the

L 1

norm within the given tolerance.

We opt for LAD over LSE due to its robustness against outliers, particularly in the presence of gross errors or heavy-tailed noise. This strengthens the proposed method’s resilience to noise and measurement inaccuracies.

\hat{A}

and

\hat{B}

are given using matrix of coefficients

Ξ

as

\begin{matrix} Ξ & = & [\begin{matrix} ξ_{1, 1} & ξ_{1, 2} & \dots & ξ_{1, r} & ξ_{1, r + 1} \\ ξ_{2, 1} & ξ_{2, 2} & \dots & ξ_{2, r} & ξ_{2, r + 1} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ ξ_{r, 1} & ξ_{r, 2} & \dots & ξ_{r, r} & ξ_{r, r + 1} \end{matrix}] \\ = & [\begin{matrix} \hat{A} & \hat{B} \end{matrix}], \end{matrix}

(17)

where

\begin{matrix} \hat{A} = [\begin{matrix} ξ_{1, 1} & ξ_{1, 2} & \dots & ξ_{1, r} \\ ξ_{2, 1} & ξ_{2, 2} & \dots & ξ_{2, r} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ ξ_{r, 1} & ξ_{r, 2} & \dots & ξ_{r, r} \end{matrix}] \end{matrix}

(18)

and

\begin{matrix} \hat{B} = [\begin{matrix} ξ_{1, r + 1} \\ ξ_{2, r + 1} \\ ⋮ \\ ξ_{r, r + 1} \end{matrix}] . \end{matrix}

(19)

For the matrix

\hat{C}

is determined as the first row of

U_{r} Σ_{r}

.

Algorithm 1 Minimization of L1 Norm with L1 Constraint

Input:: Matrices $\dot{V} \in R^{m \times n}$ , $Θ \in R^{m \times p}$ , tolerance $ϵ$
Output:: Matrix $Ξ \in R^{p \times n}$
1:: Initialize $Ξ$
2:: repeat
3:: $Ξ_{prev} = Ξ$ {Store previous iterate}
4:: Solve subproblem to update $Ξ$
5:: until Convergence

Algorithm 2 Subproblem for Updating

Ξ

Input:: $\dot{V}$ , $Θ$ , $ϵ$ , current $Ξ$
Output:: Updated $Ξ$
1:: $R = \dot{V} - Θ Ξ$ {Residual}
2:: Find a feasible direction D such that
3:: $sign (D_{i j}) = - sign (R_{i j})$ if $| R_{i j} | > 0$
4:: and ${∥ D ∥}_{1} \leq 1$
5:: Perform line search to find step size $α$ :
6:: $α^{*} = arg {min}_{α \geq 0} {∥ Ξ + α D ∥}_{1}$
7:: subject to $∥ Θ (Ξ + α D) - \dot{V} ∥_{1} \leq ϵ$
8:: Update $Ξ$ : $Ξ \leftarrow Ξ + α^{*} D$

3.2. Low Rank Approximation Method

Low-rank approximation is a fundamental technique in linear algebra, where a given matrix is represented by another matrix of reduced rank. This process serves multiple purposes including dimensionality reduction, data compression, and the extraction of salient patterns from the original data. Given a matrix H, the objective is to identify a matrix

\hat{H}

of lower rank that optimally approximates H. The quality of this approximation is typically assessed using a matrix norm.

Figure 1 illustrates the magnitude of singular values across different ranks r, while Figure 2 offers a closer look at the magnitudes from

r = 1

to

r = 20

. It is evident that only a few initial terms are required to capture the essential dynamics of the system. Consequently, the rank r can be determined visually.

In cases involving large matrices, the Partial SVD using Matrix Sketching [35] provides an efficient alternative. This approach constructs a “sketch” that encapsulates the key features of the matrix, thereby enabling the approximation of its singular value decomposition (SVD). A notable advantage of this method is its adaptive rank determination based on a predefined tolerance, eliminating the need to specify the rank a priori. This results in faster computation, making it particularly suitable when a full SVD is computationally expensive or when the appropriate rank is unknown. The accuracy of the approximation is adjustable through the tolerance parameter, allowing for a balance between computational efficiency and the desired level of precision. The detailed implementation steps are outlined in Algorithm 3.

Algorithm 3 Partial SVD using Matrix Sketching

Input:: Matrix $H \in R^{m \times n}$ , tolerance tol
Output:: $U_{r}$ , $S_{r}$ , $V_{r}$ (approximate SVD of H)
1:: Initialize $U = []$ , $V = []$ , $S = []$
2:: $k = 0$ {Rank of the sketch}
3:: while approximation error > tol do
4:: $k = k + 1$
5:: Randomly sample a column c from H
6:: Orthogonalize c against the columns of U to get $u_{k}$
7:: $U = [U, u_{k}]$ {Add $u_{k}$ to the basis}
8:: Randomly sample a row r from H
9:: Orthogonalize r against the rows of V to get $v_{k}$
10:: $V = [V; v_{k}]$ {Add $v_{k}$ to the basis}
11:: end while
12:: $B = U^{T} A V$ {Project H onto the sketch space}
13:: $[U_{B}, S_{B}, V_{B}] = svd (B)$ {Compute SVD of the smaller matrix}
14:: $U_{r} = U U_{B}$
15:: $S_{r} = S_{B}$
16:: $V_{r} = V V_{B}$

4. Numerical Examples

To evaluate the efficacy of the proposed method, we selected three distinct dynamical systems for investigation: a real-world nonlinear system, a simulated unstable linear system, and a simulated unstable linear system subjected to input disturbances. All experiments were conducted using MATLAB R2024a on a MacBook Pro (14-inch, 2021) equipped with an Apple M1 Pro 8-core CPU and 16 GB of unified memory. This hardware configuration was chosen to assess the method’s performance on a readily available computing platform, ensuring its accessibility and applicability to a wide range of researchers and practitioners.

4.1. Nonlinear System

For the nonlinear system evaluation, we utilized the dataset and reference models of the Electro-Mechanical Positioning System (EMPS) [36]. This dataset provides a comprehensive representation of a standard drive system configuration commonly found in prismatic joints of robots and machine tools. The data were acquired using a dSPACE digital control system with a sampling frequency of 1 kHz and a duration of approximately 25 s, capturing the rich dynamics of the EMPS under various operating conditions.

To assess performance, following a typical machine learning training process, the measured data have been spitted into train and test sets. The initial 10 s of data were employed as the training set. The rest of dataset were used as test set. This means 40% of data was used as train set and 60% is used as test set. The Hankel matrix H was constructed with

q = p = 5000

. Our simulation results, as depicted in Figure 3, demonstrate the capability of the proposed method to forecast at least 20 s of system behavior using only 10 s of training data. Furthermore, the prediction error, visualized in Figure 4, remains relatively small, further attesting to the accuracy and effectiveness of the method in this nonlinear system scenario.

4.2. Unstable Linear System

Furthermore, the application of Koopman operator theory proves to be particularly advantageous in the identification process of unstable linear systems, offering a unique perspective and analytical tools to understand their complex dynamics. To illustrate this, let’s examine a representative unstable linear system characterized by the following matrices:

A = [\begin{matrix} 0.99 & 0.01 & 0.18 & - 0.09 & 0 \\ 0 & 0.94 & 0 & 0.29 & 0 \\ 0 & 0.14 & 0.81 & - 0.9 & 0 \\ 0 & - 0.2 & 0 & 0.95 & 0 \\ 0 & 0.09 & 0 & 0 & 0.9 \end{matrix}],

B = [\begin{matrix} 0.01 \\ - 0.14 \\ 0.05 \\ 0.02 \\ - 0.01 \end{matrix}], C = [\begin{matrix} 0 & 1 & 0 & 0 & - 1 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 1 & 0 & 0 & 0 & 0 \end{matrix}] .

In this system, the vector

x (t) = {[\begin{matrix} x_{1} (t) & x_{2} (t) & x_{3} (t) & x_{4} (t) & x_{5} (t) \end{matrix}]}^{T}

represented the state variables, while the outputs are captured by

y (t) = {[\begin{matrix} y_{1} (t) & y_{2} (t) & y_{3} (t) & y_{4} (t) \end{matrix}]}^{T}

. The system is driven by an input signal defined as

u (t) = sin (ω t)

.

For the purpose of this analysis, we make the assumption that only the first output,

y_{1}

, is available for measurement. This constraint reflects a realistic scenario where complete state observation might be impractical or impossible, further highlighting the utility of the Koopman-based approach in extracting meaningful system information from limited measurements.

To evaluate the effectiveness and accuracy of the proposed method, we conducted simulations of the system with parameters

ω = 1

and

Δ t = 0.001

. Time series data

y (t)

was collected from the simulation, spanning from

t = 0

to

t = 5

, resulting in 5000 samples for the measurement data vector, given the chosen simulation time step. The initial 2 s of data were employed as the training set. The rest of dataset were used as test set. This means 40% of data was used as train set and 60% is used as test set. The Hankel matrix H was then constructed with

q = p = 1000

.

The outcomes of these time series prediction efforts are showcased in Figure 5, while Figure 6 provides a visualization of the corresponding prediction error. These figures offer insights into the method’s performance, particularly its ability to accurately predict the behavior of the unstable linear system under investigation.

The identified coefficient matrices are

\hat{A} = [\begin{matrix} 0.6626 & 0.2648 & 0.1426 & - 0.0217 & 0 \\ - 0.0328 & 0.4353 & 0.0342 & - 0.0042 & 0 \\ 0.2844 & - 6.4921 & - 0.0743 & - 0.0099 & 0 \\ 5.8002 & - 7.5986 & 0.1658 & 1.1511 & - 0.0001 \\ 2.5930 & - 0.4246 & 0 & 17.5583 & 0.9088 \end{matrix}],

\hat{B} = [\begin{matrix} 0.0040 \\ - 0.2799 \\ - 0.0047 \\ - 0.0242 \\ - 0.0362 \end{matrix}],

and

\hat{C} = [\begin{matrix} 1.0022 & 0.4550 & 0.0001 & 0.0133 & 0 \end{matrix}] .

4.3. Unstable Linear System with Input Disturbance

In addition to investigating the unstable linear system in Section 4.2, we also explored the influence of disturbances on the identification process. We introduced a disturbance generator into the linear system, described by

\begin{matrix} {\dot{x}}_{d} (t) & = & A_{d} x_{d} (t) \\ y_{d} (t) & = & C_{d} x_{d} (t) \end{matrix} .

(20)

This disturbance generator was then incorporated into the original plant model, leading to an augmented system;

\begin{matrix} {\dot{x}}_{0} (t) & = & A_{0} x_{0} (t) + B_{0} u (t) \\ y_{0} (t) & = & C_{0} x_{0} (t) \end{matrix},

(21)

where

x_{0} = [\begin{matrix} x \\ x_{d} \end{matrix}]

,

A_{0} = [\begin{matrix} A & B C_{d} \\ 0 & A_{d} \end{matrix}]

,

B_{0} = [\begin{matrix} B \\ 0 \end{matrix}]

, and

C_{0} = [\begin{matrix} C & 0 \end{matrix}]

.

For our numerical example, we retained the same parameter matrices A, B, and C from the previous unstable linear system case, and introduced the following matrices for the disturbance generator:

A_{d} = [\begin{matrix} 0 & 1 \\ - 10 & 0 \end{matrix}], C_{d} = [\begin{matrix} 1 & 0 \end{matrix}] .

Similar to the previous example, the Hankel matrix H was constructed with parameters

q = p = 1000

. Applying the proposed method to this augmented system yielded the following identified matrices:

\hat{A} = [\begin{matrix} - 0.0764 & 0.8656 & - 0.0002 & 0.1047 & - 0.0366 & - 0.1479 & 0.0000 \\ - 1.1933 & 1.8177 & - 0.1190 & 0.3936 & - 0.0763 & - 0.2691 & 0.0001 \\ 0.2463 & 0.0093 & - 0.1991 & - 3.1713 & 0.2899 & 0.0625 & 0.0001 \\ - 0.5049 & 1.5165 & 3.1884 & 0.3364 & - 0.0775 & - 0.7332 & 0.0002 \\ - 0.2920 & 0.6841 & 1.1492 & 0.0762 & 0.3508 & - 0.4892 & 0.0001 \\ - 0.0532 & - 0.8513 & 0.1012 & - 1.4484 & 6.1917 & - 0.0324 & 0.0000 \\ - 0.6698 & - 5.8829 & 1.5594 & - 5.4843 & 1.8886 & - 0.0000 & 0.9133 \end{matrix}],

\hat{B} = [\begin{matrix} - 0.0043 \\ - 0.0112 \\ 0.0002 \\ - 0.0464 \\ 0.1669 \\ - 0.0020 \\ - 0.0115 \end{matrix}],

\hat{C} = [\begin{matrix} - 3.7384 & 2.1729 & 1.0037 & 1.3466 & - 0.3981 & - 0.1606 & - 0.0002 \end{matrix}] .

Finally, to recover the original system matrices

A_{0}

,

B_{0}

, and

C_{0}

, a similarity transformation can be applied, utilizing the relationship

A_{0} = T \hat{A} T^{- 1}

,

B_{0} = T \hat{B}

and

C_{0} = \hat{C} T^{- 1}

The outcomes of these time series prediction efforts, even in the presence of disturbances, are illustrated in Figure 7. Correspondingly, Figure 8 visualizes the prediction error associated with these forecasts. These figures offer valuable insights into the robustness and performance of the proposed method, particularly highlighting its capacity to accurately predict the behavior of the unstable linear system even when subjected to external disturbances.

4.4. Comparison to Existing System Identification Methods

This subsection presents a detailed evaluation of the proposed system identification method, comparing its performance against established techniques within the field. A single pendulum system is employed as a benchmark, ensuring a consistent and transparent assessment of each method’s capabilities. The comparative methods include the widely used N4SID algorithm [37], known for its robust performance across various applications; the Derivative-Based Koopman Operators [19], a newer approach with promising advantages in nonlinear system identification; and the proposed method, which seeks to address specific limitations of current techniques. This comparative analysis aims to highlight the strengths and weaknesses of each approach, providing insights into the proposed method’s effectiveness relative to established alternatives.

To generate the training dataset and train the Derivative-Based Koopman Operators model, we utilize Python code from [19], simulating 500 uniformly sampled initial states for 0.04 s with time steps of

Δ t = 0.001

s. The N4SID model is trained using MATLAB’s system identification toolbox. The test dataset was generated for 0.3 s with time steps of

Δ t = 0.001

s. Thus, 13% of data was used as train set and 87% is used as test set.

Figure 9 illustrates the time-series prediction performance across various techniques, where the proposed method and N4SID exhibit better performance in long-term predictions. In Addition, Table 1 presents a detailed comparison of training and testing normalized root mean square error (NRMSE) across the selected system identification methods, highlighting the performance differences in handling the single pendulum system. The N4SID method, utilizing LQ decomposition and singular value decomposition (SVD), achieves a training NRMSE of 0.0486 and a notably lower testing NRMSE of 0.0096, demonstrating reliable predictive performance in linear scenarios but showing some limitations in the training phase. In contrast, the Derivative-Based Koopman Operators, which apply high-order derivatives, excel with a very low training NRMSE of 0.0028, suggesting effective internal learning of system dynamics. However, it exhibits a high testing NRMSE of 8.3228, indicating reduced capability for long term prediction. The proposed method, which also leverages SVD, yields the lowest NRMSE values in both training (0.0010) and testing (0.0011), underscoring its robustness and ability to generalize across varying conditions.

5. Conclusions

This paper introduced a data-driven methodology for the identification of the Koopman operator in dynamical systems, with a particular emphasis on accurately capturing the influence of inputs on the system’s dynamic behavior. The proposed approach effectively leverages the power of low-rank approximations and the inherent robustness of Least Absolute Deviations regression to extract meaningful representations from potentially noisy and incomplete measurements. The efficacy of this method has been rigorously validated through its application to a diverse range of dynamical systems, encompassing both real-world nonlinear systems and simulated linear systems, with and without the presence of external disturbances. The results unequivocally demonstrate the method’s proficiency in both system identification and time-series prediction tasks.

The proposed method’s strong performance in terms of accuracy and generalization suggests that it could be valuable for applications requiring precise system identification in nonlinear environments, such as robotics, control systems, and predictive maintenance. However, as the proposed method is open-loop identification method, the ability in closed-loop identification are still not covered in this study as the input may cannot measured. Despite those limitation, the method’s stability in prediction could lead to improved reliability and performance in these applications, supporting decision-making and system optimization.

Author Contributions

Conceptualization, P.K. and J.S.; methodology, P.K., J.S. and N.T.M.; software, P.K. and J.S.; validation, M.A.S.K. and I.M.; formal analysis, P.K. and K.Y.; investigation, K.Y.; resources, K.Y.; data curation, P.K.; writing—original draft preparation, P.K.; writing—review and editing, J.S. and K.Y.; supervision, K.Y.; project administration, K.Y.; funding acquisition, K.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Xu, L.; Li, X.R.; Liang, Y.; Duan, Z. Modeling and State Estimation of Linear Destination-Constrained Dynamic Systems. IEEE Trans. Signal Process. 2022, 70, 2374–2387. [Google Scholar] [CrossRef]
Wang, K.; Liu, M.; He, W.; Zuo, C.; Wang, F. Koopman Kalman Particle Filter for Dynamic State Estimation of Distribution System. IEEE Access 2022, 10, 111688–111703. [Google Scholar] [CrossRef]
Kang, T.; Peng, H.; Xu, W.; Sun, Y.; Peng, X. Deep Learning-Based State-Dependent ARX Modeling and Predictive Control of Nonlinear Systems. IEEE Access 2023, 11, 32579–32594. [Google Scholar] [CrossRef]
Makki, O.T.; Moosapour, S.S.; Mobayen, S.; Nobari, J.H. Design, Mathematical Modeling, and Control of an Underactuated 3-DOF Experimental Helicopter. IEEE Access 2024, 12, 55568–55586. [Google Scholar] [CrossRef]
Gupta, A.; Lermusiaux, P.F.J. Neural closure models for dynamical systems. R. Soc. Math. Phys. Eng. Sci. 2021, 477, 20201004. [Google Scholar] [CrossRef]
Freeman, D.C.; Giannakis, D.; Slawinska, J. Quantum Mechanics for Closure of Dynamical Systems. Multiscale Model. Simul. 2024, 22, 283–333. [Google Scholar] [CrossRef]
Agrawal, A.; Koutsourelakis, P.-S. A probabilistic, data-driven closure model for RANS simulations with aleatoric, model uncertainty. J. Comput. Phys. 2024, 508, 112982. [Google Scholar] [CrossRef]
Hastie, T.; Tibshirani, R.; Friedman, J. Textbook of The Elements of Statistical Learning Data Mining, Inference, and Prediction, 2nd ed.; Springer: New York, NY, USA, 2009; pp. 1–2. [Google Scholar]
James, G.; Witten, D.; Hastie, T.; Tibshirani, R.; Taylor, J. Textbook of An Introduction to Statistical Learning with Application in Python; Springer: New York, NY, USA, 2023; pp. 1–17. [Google Scholar]
Rainio, O.; Teuho, J.; Klén, R. Evaluation metrics and statistical tests for machine learning. Sci. Rep. 2024, 14, 6086. [Google Scholar] [CrossRef]
Wang, L.Y.; Yin, G.G.; Zhao, Y.; Zhang, J.-F. Identification Input Design for Consistent Parameter Estimation of Linear Systems With Binary-Valued Output Observations. IEEE Trans. Autom. Control 2008, 53, 867–880. [Google Scholar] [CrossRef]
Wigren, A.; Wagberg, J.; Lindsten, F.; Wills, A.G.; Schon, T.B. Nonlinear System Identification: Learning While Respecting Physical Models Using a Sequential Monte Carlo Method. IEEE Control Syst. Mag. 2022, 42, 75–102. [Google Scholar] [CrossRef]
Kwad, A.M.; Hanafi, D.; Omar, R.; Rahman, H.A. Development of system identification from traditional concepts to real-time soft computing based. In IOP Conference Series: Materials Science and Engineering; IOP Publishing: Bristol, UK, 2020; Volume 767. [Google Scholar]
Hjalmarsson, H. System Identification of Complex and Structured Systems. Eur. J. Control 2009, 15, 275–310. [Google Scholar] [CrossRef]
Rojas, C.R.; Barenthin, M.; Welsh, J.S.; Hjalmarsson, H. The cost of complexity in system identification: The Output Error case. Automatica 2011, 47, 1938–1948. [Google Scholar] [CrossRef]
Schoukens, J.; Marconato, A.; Pintelon, R.; Rolain, Y.; Schoukens, M.; Tiels, K.; Vanbeylen, L.; Vandersteen, G.; Mulders, A.V. System identification in a real world. In Proceedings of the 2014 IEEE 13th International Workshop on Advanced Motion Control (AMC), Yokohama, Japan, 14–16 March 2014; pp. 1–9. [Google Scholar]
Kabzinski, T.; Jax, P. A Flexible Framework for Expectation Maximization-Based MIMO System Identification for Time-Variant Linear Acoustic Systems. IEEE Open J. Signal Process. 2024, 5, 112–121. [Google Scholar] [CrossRef]
Brunton, S.L.; Brunton, B.W.; Proctor, J.L.; Kutz, J.N. Koopman Invariant Subspaces and Finite Linear Representations of Nonlinear Dynamical Systems for Control. PLoS ONE 2016, 11, 11–19. [Google Scholar] [CrossRef]
Mamakoukas, G.; Castaño, M.L.; Tan, X.; Murphey, T.D. Derivative-Based Koopman Operators for Real-Time Control of Robotic Systems. IEEE Trans. Robot. 2021, 37, 2173–2192. [Google Scholar] [CrossRef]
Xu, Y.; Wang, Q.; Mili, L.; Zheng, Z.; Gu, W.; Lu, S.; Wu, Z. A Data-Driven Koopman Approach for Power System Nonlinear Dynamic Observability Analysis. IEEE Trans. Power Syst. 2024, 39, 4090–4104. [Google Scholar] [CrossRef]
Jia, J.; Zhang, W.; Guo, K.; Wang, J.; Yu, X.; Shi, Y.; Guo, L. EVOLVER: Online Learning and Prediction of Disturbances for Robot Control. IEEE Trans. Robot. 2024, 40, 382–402. [Google Scholar] [CrossRef]
Choi, H.; Elliott, R.; Byrne, R. Data-Driven Power Flow Estimation with Inverter Interfaced Energy Storage Using Dynamic Injection Shift Factor. In Proceedings of the IEEE Power and Energy Society General Meeting (PESGM), Denver, CO, USA, 17–21 July 2022; pp. 1–5. [Google Scholar]
Zheng, L.; Liu, X.; Xu, Y.; Hu, W.; Liu, C. Data-driven Estimation for a Region of Attraction for Transient Stability Using the Koopman Operator. CSEE J. Power Energy Syst. 2023, 9, 1405–1413. [Google Scholar]
Sarić, A.A.; Transtrum, M.K.; Sarić, A.T.; Stanković, A.M. Integration of Physics- and Data-Driven Power System Models in Transient Analysis After Major Disturbances. IEEE Syst. J. 2023, 17, 479–490. [Google Scholar] [CrossRef]
Bruder, D.; Fu, X.; Vasudevan, R. Advantages of bilinear koopman realizations for the modeling and control of systems with unknown dynamics. IEEE Robot. Autom. Lett. 2021, 6, 4369–4376. [Google Scholar] [CrossRef]
Otto, S.; Peitz, S.; Rowley, C. Learning Bilinear Models of Actuated Koopman Generators from Partially Observed Trajectories. SIAM J. Appl. Dyn. Syst. 2024, 23, 885–923. [Google Scholar] [CrossRef]
Mamakoukas, G.; Cairano, S.D.; Vinod, A.P. Robust model predictive control with data-driven koopman operators. In Proceedings of the American Control Conference (ACC), Atlanta, GA, USA, 8–10 June 2022; pp. 3885–3892. [Google Scholar]
Yu, S.; Shen, C.; Ersal, T. Autonomous driving using linear model predictive control with a koopman operator based bilinear vehicle model. Ifac-Papersonline 2022, 55, 254–259. [Google Scholar] [CrossRef]
Williams, M.O.; Kevrekidis, I.G.; Rowley, C.W. A data-driven approximation of the Koopman operator: Extending dynamic mode decomposition. J. Nonlinear Sci. 2015, 25, 1307–1346. [Google Scholar] [CrossRef]
Arbabi, H.; Mezić, I. Ergodic theory, dynamic mode decomposition, and computation of spectral properties of the Koopman operator. SIAM J. Appl. Dyn. Syst. 2017, 16, 2096–2126. [Google Scholar] [CrossRef]
Netto, M.; Susuki, Y.; Krishnan, V.; Zhang, Y. On Analytical Construction of Observable Functions in Extended Dynamic Mode Decomposition for Nonlinear Estimation and Prediction. IEEE Control. Syst. Lett. 2021, 5, 1868–1873. [Google Scholar] [CrossRef]
Folkestad, C.; Pastor, D.; Mezic, I.; Mohr, R.; Fonoberova, M.; Burdick, J. Extended Dynamic Mode Decomposition with Learned Koopman Eigenfunctions for Prediction and Control. In Proceedings of the American Control Conference (ACC), Denver, CO, USA, 1–3 July 2020; pp. 3906–3913. [Google Scholar]
Haseli, M.; Cortés, J. Data-Driven Approximation of Koopman-Invariant Subspaces with Tunable Accuracy. In Proceedings of the American Control Conference (ACC), New Orleans, LA, USA, 26–28 May 2021; pp. 470–475. [Google Scholar]
Koopman, B.O. Hamiltonian Systems and Transformation in Hilbert Space. Proc. Natl. Acad. Sci. USA 1931, 17, 315–318. [Google Scholar] [CrossRef]
Wenjian, Y.; Gu, Y.; Li, Y. Efficient Randomized Algorithms for the Fixed-Precision Low-Rank Matrix Approximation. SIAM J. Matrix Anal. Appl. 2018, 39, 1339–1359. [Google Scholar]
Janot, A.; Gautier, M.; Brunot, M. Data Set and Reference Models of EMPS. In Proceedings of the Nonlinear System Identification Benchmarks, Eindhoven, The Netherlands, 10–12 April 2019. [Google Scholar]
Overschee, P.V.; Moor, B.D. N4SID: Subspace algorithms for the identification of combined deterministic-stochastic systems. Automatica 1994, 30, 75–93. [Google Scholar] [CrossRef]

Figure 1. Singular values for the training data. The high singular values of the training data are outstanding only the lower rank r (column of the matrix) significantly.

Figure 2. Singular values for the first 20 column of matrix.

Figure 3. Comparison of actual data and predicted data for EMPS dataset during the training period (0 to 10 s) and the predicted period (10 to 24 s).

Figure 4. The error between actual data and predicted data for EMPS dataset during the training period (0 to 10 s) and the predicted period (10 to 24 s).

Figure 5. Comparison of actual data and predicted data for an unstable linear system during the training period (0 to 2 s) and the predicted period (2 to 5 s).

Figure 6. The error between measurement data and predicted data for an unstable linear system during the training period (0 to 2 s) and the predicted period (2 to 5 s).

Figure 7. Comparison of actual data and predicted data for an unstable linear system under the influence of disturbance during the training period (0 to 2 s) and the predicted period (2 to 5 s).

Figure 8. The error between actual data and predicted data for an unstable linear system under the influence of disturbance during the training period (0 to 2 s) and the predicted period (2 to 5 s).

Figure 9. Comparison of the prediction performance of the proposed method, N4SID, and [19] based on simulated trajectories of the single pendulum system.

Table 1. Comparison of some existing system identification methods for the single pendulum system.

Method	Basis Generation	Train NRMSE	Test NRMSE
N4SID	LQ decompostion and SVD	0.0486	0.0096
[19]	High-order derivative	0.0028	8.3228
Proposed	SVD	0.0010	0.0011

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ketthong, P.; Samkunta, J.; Mai, N.T.; Kamal, M.A.S.; Murakami, I.; Yamada, K. Data-Driven Koopman Based System Identification for Partially Observed Dynamical Systems with Input and Disturbance. Sci 2024, 6, 84. https://doi.org/10.3390/sci6040084

AMA Style

Ketthong P, Samkunta J, Mai NT, Kamal MAS, Murakami I, Yamada K. Data-Driven Koopman Based System Identification for Partially Observed Dynamical Systems with Input and Disturbance. Sci. 2024; 6(4):84. https://doi.org/10.3390/sci6040084

Chicago/Turabian Style

Ketthong, Patinya, Jirayu Samkunta, Nghia Thi Mai, Md Abdus Samad Kamal, Iwanori Murakami, and Kou Yamada. 2024. "Data-Driven Koopman Based System Identification for Partially Observed Dynamical Systems with Input and Disturbance" Sci 6, no. 4: 84. https://doi.org/10.3390/sci6040084

APA Style

Ketthong, P., Samkunta, J., Mai, N. T., Kamal, M. A. S., Murakami, I., & Yamada, K. (2024). Data-Driven Koopman Based System Identification for Partially Observed Dynamical Systems with Input and Disturbance. Sci, 6(4), 84. https://doi.org/10.3390/sci6040084

Article Menu

Data-Driven Koopman Based System Identification for Partially Observed Dynamical Systems with Input and Disturbance

Abstract

1. Introduction

2. Background

2.1. Problem Formulation

2.2. A Brief Review on Koopman Operator Theory

3. Proposed Method

3.1. Data-Driven Koopman Operator

3.2. Low Rank Approximation Method

4. Numerical Examples

4.1. Nonlinear System

4.2. Unstable Linear System

4.3. Unstable Linear System with Input Disturbance

4.4. Comparison to Existing System Identification Methods

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI