Abstract
The identification of partially observed continuous nonlinear systems from noisy and incomplete data series is a relevant problem in many branches of science, for example, biology, chemistry, physics, and others. Two stages are needed to reconstruct a partially observed dynamical system. First, one should reconstruct the entire phase space to restore the unobserved state variables. For this purpose, integration or differentiation of the observed data series can be performed. Then, a fast algebraic method can be used to obtain a nonlinear system in the form of a polynomial dynamical system. In this paper, we extend the algebraic method proposed by Kera and Hasegawa to Laurent polynomials, which, unlike ordinary polynomials, contain negative powers of variables. We provide a theoretical basis and experimental evidence that the integration of a data series can give more accurate results than the widely used differentiation. With this technique, we reconstruct the Lorenz attractor from a one-dimensional data series and B. Muthuswamy's circuit equations from a three-dimensional data series.
    1. Introduction
The problem of nonlinear dynamical system reconstruction from data series can be described as follows: for a given data series, a system must be found that produces data series close, in a certain sense, to the initial one. Obtaining system models from a limited set of observations is crucial for the reverse engineering of gene regulatory networks [], the analysis of chemical oscillations [], studies of biological circadian rhythms [], the reconstruction of neuronal models [], etc.
When systems described by ordinary differential equations (ODEs) are considered, the problem statement can be specified as follows: for a given data series and its time derivative, a system of ODEs must be reconstructed as

\dot{x}(t) = f(x(t)),        (1)
where f is a vector function. Usually, some information about f is known a priori, for instance, the possible elementary operations constituting it. These operations usually form a group or, more specifically, a ring. One such ring is a polynomial ring, which includes the set of polynomials in several variables with coefficients belonging to a certain field, such as the field of real or complex numbers [,]. Dynamical systems with polynomial derivative functions are often referred to as polynomial dynamical systems (PDSs). In these systems, f is considered as

f(x) = C \circ F(x),        (2)
where C is the vector of unknown coefficients of the known nonlinear terms (monomials) F(x), and \circ denotes element-wise multiplication. The problem usually consists of finding all the entries of C, taking into account that the system can be sparse and many of these coefficients can equal zero. The set of monomials F(x) is known a priori. Nonlinear dynamical models of Type (2) are used quite often; examples can be found in the classical works of E. Lorenz, O. Rössler, and J.C. Sprott []. System reconstruction in terms of PDSs has been investigated since the early 1980s by many authors, for example, by J. P. Crutchfield and B. S. McNamara [], who proposed one of the classical formulations of the problem, and J. Cremers and A. Hübler, who used Legendre polynomials as a basis for constructing the derivative function []. Later, Taylor-like expansions were introduced by J. L. Breeden et al. []. Many alternatives to PDS models have been proposed, for example, neural networks, whose nonlinearities and coefficients are also found from the experimental data []. A neural network can sometimes give more precise results and is preferable in the case of complex systems. The disadvantage of the neural network technique is that it yields large-scale and computationally costly models. Neural networks and the PDS approach can also be combined, for example, for gene regulatory network identification.
The problem of finding a reliable algorithm for determining the coefficients of PDS (2) is difficult, especially in the case of noisy and incomplete data. Determining the coefficient vector C can be posed as the optimization problem

\hat{C} = \arg\min_{C} \| \dot{X} - f(X) \|,        (3)
where \dot{X} is the set of observed derivatives and f(X) is the set of derivative function values evaluated on the dataset X. The size of the unknown vector C is equal to the product of the system dimension and the number of available monomials.
Problem (3) can be treated as a nonlinear regression problem. Several researchers, for example, H. Iba and D. P. Searson et al. [,], developed specialized versions of genetic algorithms for solving it. The main disadvantage of genetic algorithms is that their run times are, in general, much longer than those of other approaches. Meanwhile, classical optimization methods such as Newton or quasi-Newton methods are inefficient for this problem because the optimization landscape is discontinuous: when one or several coefficients become equal to zero, the nonlinear regression result changes significantly.
Fortunately, Problem (3) can also be considered as a linear regression problem. Indeed, take \|\cdot\| to be the 2-norm and notice that, since the set of monomials F is already known, its values F(X) on the data can be evaluated. Therefore, the least squares method (LSM) can be applied in order to obtain C:

C = (F(X)^T F(X))^{-1} F(X)^T \dot{X},        (4)

where F(X) denotes the matrix of monomial values evaluated on the dataset X.
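For illustration, a minimal MATLAB sketch of this least-squares step is given below; the synthetic data, the exponent matrix E, and the coefficient matrix Ctrue are placeholders invented for this example and are not part of the original method.

% Least-squares estimation of PDS coefficients from a monomial
% evaluation matrix (illustrative sketch with synthetic data).
rng(1);
N = 200;                          % number of samples
X = 0.5 + rand(N, 2);             % states kept away from zero (Laurent terms)
E = [0 0; 1 0; 0 1; -1 0; 1 1];   % exponents of the monomials 1, x, y, 1/x, x*y
L = size(E, 1);
Phi = ones(N, L);                 % evaluation matrix, Phi(i,j) = phi_j(X(i,:))
for j = 1:L
    for k = 1:size(X, 2)
        Phi(:, j) = Phi(:, j) .* X(:, k).^E(j, k);
    end
end
Ctrue = [0 1 0 0 0; -2 0 0 0.5 0]';     % hypothetical sparse coefficients
dX = Phi * Ctrue + 1e-3 * randn(N, 2);  % noisy "observed" derivatives
C = Phi \ dX                            % ordinary least squares, cf. Equation (4)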
The disadvantage of LSM is that Problem (3) is often ill-conditioned, since some of the monomials vanish on the given data series and cause the singularity of the matrix F(X). To avoid this, Laubenbacher and Stigler proposed division by a Gröbner basis to simplify the initial set of monomials used for regression [], but this approach is not applicable to noisy data series.
Even if no monomials vanish, or the monomial set has somehow been simplified to remove all vanishing elementary functions, the matrix F(X) usually has a large condition number, which makes Problem (3) difficult to solve. Recently, D. Yogatama and N. Smith considered the application of regularized LSM to these types of problems, which converges better than the classical version of LSM []. Later, H. Kera and Y. Hasegawa proposed excluding vanishing polynomials using the approximate Buchberger–Möller algorithm and then deleting minor terms from the regression using a specialized procedure []. The latter algorithm, further referred to as the Kera–Hasegawa (KH) method, underlies the reconstruction procedure developed in this paper.
From a physical point of view, some state variables needed to perform the reconstruction procedure can be unknown. The simplest case is when only the system states are known and the derivatives are not, so numerical differentiation should be applied. However, if the system is only partially observed, phase space reconstruction must be performed. For example, when a circuit with a memristive element is considered, one of its state variables, the magnetic flux or the charge in the memristive device, cannot be directly measured on a macroscopic scale []. The problem is more complicated in the domain of physical chemistry, where usually only one variable is observed; for example, in the Belousov-Zhabotinsky reaction this is the Br-ISE electric potential []. Using the well-known Takens embedding theorem [], we can reconstruct the discrete-time system phase space using a shifted time series. For continuous-time reconstruction, we can use differentiation or integration. The latter technique is used quite rarely because it can cause the emergence of linear trends in the integrated data, but we will show how to avoid this and why integration is preferable to differentiation.
The main aim of this paper is to establish an efficient and reliable technique for identifying chaotic circuits, especially ones containing nonlinear elements with unknown parameters. The problem arises from the fact that real circuits, even with simple nonlinearities and known parameters, need certain parameter optimization for simulation, despite the theoretical prediction that the circuit should accurately correspond to the ODE []. When the circuit contains a nonlinear element, the problem becomes more complicated, since the relevant model of this element may also differ from the prediction. For example, it has recently been shown that the original HP models are irrelevant to actual resistance-switching devices with a memory effect []. The relevant model proposed in that work [] contains exponents, hyperbolic trigonometric functions, and negative powers.
This paper is organized as follows. First, we describe how a PDS can be reconstructed from complete data using the extended Kera–Hasegawa method. Second, we explain how these results can be applied to data series lacking some state variables. Third, we carry out computational experiments showing the validity of the approach: we reconstruct the Lorenz system from a one-dimensional data series and a four-dimensional memristive circuit proposed by B. Muthuswamy [] from incomplete three-dimensional data, and we show that the accuracy of numerical integration and differentiation plays an important role in the success of the reconstruction, even though the reconstruction method is noise-tolerant. Finally, we discuss the limitations of the proposed approach and give a conclusion.
2. Algebraic Method for ODE Reconstruction
In this section, we describe an algebraic approach to system reconstruction based on the subsequent application of the approximate Buchberger–Möller method, the least-squares method, and a minor-term deletion procedure. The approach is close to the method proposed by Kera and Hasegawa, but we extend it to Laurent polynomials, which, in contrast to ordinary polynomials with nonnegative powers only, can contain negative powers of variables.
2.1. Polynomial Rings, Ideals, ABM Algorithm
Consider the nonlinear function in Equation (1) as

f_i(x) = \sum_{j=1}^{L} c_{ij} \phi_j(x),   i = 1, \ldots, M,        (5)

where M is the system dimension, L is the number of elementary monomials \phi_j, and c_{ij} are the coefficients of the elementary monomials. Each component of Equation (5), represented in the form of such a sum, is a polynomial. The monomials have the general form

\phi_j(x) = x_1^{\alpha_1} x_2^{\alpha_2} \cdots x_M^{\alpha_M},        (6)

where \alpha_k are the powers of each variable. The polynomial functions

p(x) = \sum_{j} c_j \phi_j(x)

form a polynomial ring over the field of coefficients, where the operations of addition and multiplication are defined as usual. More precisely, the ring is an Abelian group under addition, since addition is associative and commutative, every element has an additive inverse, and there is a zero element; it is a monoid under multiplication, since multiplication is associative and has an identity element, which is exactly the monomial with all powers equal to zero,

x_1^{0} x_2^{0} \cdots x_M^{0} = 1;

in addition, multiplication is distributive over addition. The field over which the ring is defined determines the properties of the coefficients c_j. In this paper, we consider the ring over the field of real numbers R.
When all powers are nonnegative, the functions of Equation (6) form ordinary polynomials, or Taylor-like expressions; e.g., for a two-variable function it can be
        
      
        
      
      
      
      
    
When negative powers are allowed, the functions of Equation (5) are called Laurent polynomials; their powers are conventionally defined within a symmetric interval, but we will not require this property further. The terms in Equation (8) can be ordered in various ways. We will consider a degree-lexicographic order, whose primary key is the total power of the entire monomial, defined as

\deg \phi_j = \sum_{k=1}^{M} |\alpha_k|.

As the secondary key, we consider the powers of the variables taken alphabetically and arranged in ascending order of the absolute value of the power. This may be exemplified for a set of Laurent monomials in two variables of bounded order as
        
      
        
      
      
      
      
    
Originally, the degree-lexicographic order is defined for nonnegative powers only, so we introduce this ordering by analogy; for a discussion, see Section 4.
The order ideal O of the polynomial ring is a finite set of monomials that is closed under divisors, i.e., satisfying the condition that, for each monomial in O, all monomials dividing it also belong to O. The "border" of the order ideal O in an ordinary polynomial ring is the set of monomials

\partial O = (x_1 O \cup x_2 O \cup \cdots \cup x_M O) \setminus O,        (9)
        where “minus” denotes set subtraction. If the monomial set  is an order ideal in an ordinary polynomial ring, its border is
        
      
        
      
      
      
      
    
In a Laurent polynomial ring, the border is defined similarly to Equation (9) but also includes multiplication by the negative powers of the variables:

\partial O = (x_1^{\pm 1} O \cup x_2^{\pm 1} O \cup \cdots \cup x_M^{\pm 1} O) \setminus O.
The vanishing ideal of a set of data points X is the set of all polynomials vanishing (i.e., equal to zero) on X:

I(X) = \{ p : p(x) = 0 for all x \in X \}.
The vanishing ideal I(X) can be spanned by a finite set of polynomials called a basis of I(X). Let O = \{t_1, \ldots, t_\mu\} be an order ideal with a border \partial O = \{b_1, \ldots, b_\nu\}, and let G = \{g_1, \ldots, g_\nu\} be a set of polynomials generating an ideal I. The set G is an O-border prebasis if the polynomials in it have the form

g_j = b_j - \sum_{i=1}^{\mu} a_{ij} t_i,   a_{ij} \in R.

The set G is an O-border basis of I if G generates I and the residue classes of the terms in O constitute a vector space basis of the quotient ring modulo I. Several other bases are known, such as the Gröbner basis and the Macaulay basis. To find a border basis, Buchberger–Möller (BM) algorithms can be used []. A border basis G is called approximate if its polynomials vanish on X only up to a small positive tolerance \varepsilon. A modification of the BM algorithm, known as the approximate Buchberger–Möller (ABM) algorithm, can be used to compute an approximate border basis on a noisy dataset X.
We also define the evaluation map, which transforms a given dataset X = \{x_1, \ldots, x_N\} and a set of monomials \{\phi_1, \ldots, \phi_L\} into the matrix

M(X) = [\phi_j(x_i)],   i = 1, \ldots, N,   j = 1, \ldots, L,

whose entry in row i and column j is the value of the j-th monomial at the i-th data point.
The ABM algorithm can be described as follows. Suppose we have a dataset X, a tolerance \varepsilon, and a set of candidate monomials in degree-lexicographic order; we should obtain a border basis G and an order ideal O. We start with degree 1 and an order ideal containing only the unit monomial, and let the candidate list initially contain all terms of degree 1. The process then continues while the current degree does not exceed the maximal one and the candidate list is not empty. At the current step, all monomials of lower degree have already been processed. We evaluate the next candidate monomial t on the dataset,

v = (t(x_1), t(x_2), \ldots, t(x_N))^T,

and then remove t from the candidate list. Let \lambda be the smallest eigenvalue of the matrix

B = A^T A,   A = [ M(X, O)   v ],

where M(X, O) is the evaluation matrix of the monomials already accepted into O. If it satisfies the condition

\sqrt{\lambda} > \varepsilon,

then t is appended to O, and the corresponding candidate monomials of the next degree are added to the candidate list. Otherwise, a polynomial with coefficients defined by the eigenvector of B corresponding to \lambda is added to G.
Application of the ABM algorithm to the dataset X and the initial set of monomials yields O and ensures that any polynomial

p(x) = \sum_{t_j \in O} c_j t_j(x)

with nonzero coefficients does not vanish on X, even though O, strictly speaking, is not always an order ideal [].
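A simplified MATLAB sketch of this selection loop is given below; it processes a prepared candidate list in a single pass and omits the degree-by-degree bookkeeping, so the helper names (abm_sketch, evalmono, cand) and the data layout are assumptions of this illustration rather than the authors' implementation.

function [O, G] = abm_sketch(X, cand, eps_tol)
% X       : N-by-M data matrix,
% cand    : cell array of exponent row vectors in degree-lexicographic order,
% eps_tol : ABM tolerance.
O = {};                 % accepted (non-vanishing) monomials
G = {};                 % approximately vanishing polynomials
MO = [];                % evaluation matrix of the accepted monomials
for q = 1:numel(cand)
    v = evalmono(X, cand{q});        % evaluation vector of the candidate
    A = [MO, v];
    B = A' * A;
    [V, D] = eig((B + B') / 2);      % symmetrize for numerical safety
    [lam, idx] = min(diag(D));
    if sqrt(max(lam, 0)) > eps_tol
        O{end+1} = cand{q};          % candidate does not vanish on the data
        MO = A;
    else
        G{end+1} = struct('terms', {[O, cand(q)]}, 'coeffs', V(:, idx));
    end
end
end

function v = evalmono(X, e)
% Evaluate the Laurent monomial with exponent vector e on every row of X.
v = ones(size(X, 1), 1);
for k = 1:numel(e)
    v = v .* X(:, k).^e(k);
end
end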
2.2. KH Algorithm for Polynomial Function Reconstruction
Consider a dataset X that can be represented as observation vectors, one for each of the M state variables, where M is the system dimension and each data vector has N observation points. Using the order ideal O as a set of basis monomials, one can evaluate them on the given dataset X and then build an interpolation using the set of derivatives, which is also given or can be derived from X numerically:

\dot{x}_i \approx \sum_{t_j \in O} c_{ij} t_j(x),   i = 1, \ldots, M.        (13)
In each dimension, Equation (13) can be reconstructed using the least squares method. Denote by A the evaluation matrix of the monomials in O on the dataset and transform Equation (13) into a linear equation:

A c_i = \dot{x}_i.        (14)
Equation (14) is usually overdetermined, since the number of estimated points can be arbitrary and, in the presence of noise, a large number of observations is a necessary condition for obtaining a reliable solution. The overdetermined Equation (14) can be solved using the ordinary least squares method, obtaining the minimum of

\| A c_i - \dot{x}_i \|_2^2        (15)
by the analytical Formula (4). Due to the presence of noise, and since the vector c_i is supposed to be sparse, l1-regularization is recommended to obtain better results:

c_i = \arg\min_{c} ( \| A c - \dot{x}_i \|_2^2 + \alpha \| c \|_1 ),        (16)
where \alpha is a regularization parameter; increasing \alpha provides a sparser solution. Problem (16) is also known as the basis pursuit denoising (BPDN) problem, and several special optimization methods have been proposed for solving it, e.g., the Nesterov method [,]. If c_i is assumed to be dense, l2-regularization, or Tikhonov regularization [], can be applied:

c_i = \arg\min_{c} ( \| A c - \dot{x}_i \|_2^2 + \alpha \| c \|_2^2 ),        (17)
with a known solution in terms of matrices:

c_i = (A^T A + \alpha I)^{-1} A^T \dot{x}_i,        (18)

where I is the identity matrix [].
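The two regularized variants can be sketched in MATLAB as follows for a single equation; Phi and dX are assumed to be the evaluation matrix and the derivative matrix from the earlier sketch, alpha is a user-chosen regularization parameter, and the lasso call requires the Statistics and Machine Learning Toolbox (any BPDN solver can be substituted).

% Regularized least squares for one column of the derivative matrix.
alpha = 1e-2;

% l2 (Tikhonov/ridge) regularization, cf. Equation (18):
c_ridge = (Phi' * Phi + alpha * eye(size(Phi, 2))) \ (Phi' * dX(:, 1));

% l1 regularization (BPDN), cf. Equation (16):
c_lasso = lasso(Phi, dX(:, 1), 'Lambda', alpha);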
Kera and Hasegawa [] proposed removing minor terms from the interpolation polynomial

p_i(x) = \sum_{t_j \in O} c_{ij} t_j(x)        (19)
        using a special procedure based on an estimation
        
      
        
      
      
      
      
    
For each term in the polynomial p_i, its weight (20) is estimated, and the terms with minimal weight are successively removed from the interpolation while the approximation error remains within a given tolerance:
      
        
      
      
      
      
    
Following Kera and Hasegawa, we call this procedure delMinorTerms. It ensures that the solution is sparse enough even if the data is contaminated with noise.
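A greatly simplified MATLAB sketch of this idea is shown below; the weight used here, the absolute value of a coefficient multiplied by the norm of the corresponding evaluation column, is only one plausible reading of Equation (20), and the function and variable names are invented for the illustration.

% Sketch of the minor-term deletion step: remove the lightest term while
% the relative residual stays within the tolerance theta.
function [c, keep] = del_minor_terms_sketch(Phi, dx, theta)
keep = 1:size(Phi, 2);
c = Phi \ dx;                                      % initial full least-squares fit
while numel(keep) > 1
    w = abs(c) .* sqrt(sum(Phi(:, keep).^2, 1))';  % term weights (assumed form)
    [~, imin] = min(w);
    trial = keep;
    trial(imin) = [];
    ct = Phi(:, trial) \ dx;                       % refit without the lightest term
    if norm(Phi(:, trial) * ct - dx) / norm(dx) > theta
        break;                                     % deletion would violate the tolerance
    end
    keep = trial;                                  % accept the sparser model
    c = ct;
end
end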
The Kera–Hasegawa (KH) PDS reconstruction method finds a set of interpolating polynomials of type (19), one per state variable, satisfying the accuracy requirement for a given dataset X and its derivatives. First, it performs the ABM algorithm to remove the monomials vanishing on X, producing an order ideal O. Next, it applies regularized LSM to find an interpolation polynomial on O and then runs delMinorTerms to ensure that the interpolated polynomials are sparse while the accuracy criterion (21) is satisfied.
2.3. Phase Space Reconstruction Using Integration Embedding
The KH method can be performed only if the dataset X and its derivatives are complete, i.e., the dimension M of the reconstructed system corresponds to the dimensions of X and its derivative set. In the general case, we can observe only one or several state variables, and thus we need to reconstruct the entire phase space. The well-known theorem of F. Takens [] states that, having a time-delayed series of a single-variable signal u(t), we can reconstruct the M-dimensional phase space due to time delay embedding, and, similarly, N. Packard et al. [] proposed reconstructing the M-dimensional phase space using subsequent derivatives

y(t) = ( u(t), \dot{u}(t), \ddot{u}(t), \ldots, u^{(M-1)}(t) )
        due to differential embedding. Recently, J. Lekscha and R. Donner [] have shown that reconstruction is possible in the case of non-uniformly sampled and noisy time series as well, which is an important result for practical use.
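A minimal MATLAB sketch of differential embedding is given below; the test signal, the stepsize, and the assumed embedding dimension are arbitrary choices for the illustration.

% Differential embedding: build an M-dimensional trajectory from a single
% observed series u by repeated numerical differentiation.
h = 0.01;
t = (0:h:10)';
u = sin(t) + 0.5*sin(3*t);              % example scalar observation
M = 3;                                  % assumed embedding dimension
Y = zeros(numel(u), M);
Y(:, 1) = u;
for k = 2:M
    Y(:, k) = gradient(Y(:, k-1), h);   % k-th column = (k-1)-th derivative
end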
Unlike the differential embedding approach, integral embedding has not become popular in the scientific literature, despite being more useful in several practical cases.
Lemma 1. 
The M-dimensional phase space can be reconstructed by integral embedding, and a trajectory of the system reconstructed from a time series u(t) can be written as:
      
        
      
      
      
      
where r(t) is a residual polynomial whose general form is:
      
        
      
      
      
      
where K is a natural number.
Proof.  
A system of ODEs in the Cauchy normal form is usually organized so that

\dot{x}_1 = x_2,   \dot{x}_2 = x_3,   \ldots,   \dot{x}_{M-1} = x_M,   \dot{x}_M = f(x_1, \ldots, x_M),

where each state variable is related to the variable with the next index through differentiation. This relation can also be written through integration:

x_i(t) = \int x_{i+1}(t) dt + C_i,        (24)

where C_i is a constant depending on the initial condition. Equation (24) can be used for the subsequent integration of state variables to reconstruct the trajectory in the full phase space when the integrand variables are known. After K integrations of a time series u(t) corresponding to one of the state variables, we obtain

\underbrace{\int \cdots \int}_{K} u(t) (dt)^K + C_1 \frac{t^{K-1}}{(K-1)!} + C_2 \frac{t^{K-2}}{(K-2)!} + \cdots + C_K,        (25)
        from which Equations (22) and (23) follow obviously. □ 
Corollary 1. 
Since the residual polynomial in Equation (23) grows very fast if K is large enough, it is undesirable in a computer implementation of the reconstruction algorithm; otherwise, the accuracy of the data representation may suffer. We propose the following procedure for obtaining the integrals. First, we choose a zero initial condition and integrate the observed series on a given time interval:
      
        
      
      
      
      
    
Then, we estimate its mean value and, assuming it approximates the integration constant, subtract it, obtaining the desired solution. Another way is to de-trend the second integral, which contains a linear term, with a specialized procedure, e.g., the detrend function in the MATLAB environment.
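Following this recipe, integral embedding can be sketched in MATLAB as below; cumtrapz and detrend are standard functions, the example signal is arbitrary, and the trapezoidal rule is used only for brevity.

% Integral embedding with removal of the integration constants and trends.
h = 0.01;
t = (0:h:20)';
u = sin(t) + 0.5*sin(3*t);            % example observed series

v = cumtrapz(t, u);                   % first integral, zero initial condition
v = v - mean(v);                      % subtract the mean (the constant C)
w = cumtrapz(t, v);                   % second integral
w = detrend(w);                       % remove the residual linear trend

Y = [w, v, u];                        % reconstructed 3-D trajectory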
The system reconstructed using integral embedding is equivalent to the system reconstructed using differential embedding up to a certain number of differentiations. More strictly, this can be formulated as Lemma 2.
Lemma 2. 
Suppose there is a dynamical system whose trajectory projects onto a phase space that can be obtained via subsequent differentiation of the one-dimensional trajectory corresponding to an observed state variable. Then, a dynamical system exists whose trajectory projects onto a phase space that can be obtained via subsequent integration of the one-dimensional trajectory corresponding to the same state variable.
Proof.  
The following ODE describes a dynamical system  due to the condition that subsequent differentiation of one-dimensional trajectory  generates 
      
        
      
      
      
      
    
Since  the corresponding state variables are equal: . The condition that the projection  of the map  onto the entire phase space can be obtained via subsequent integration of its one-dimensional component  can be expressed in the form of Equation (24). Consequently, the following ODE describes :
      
        
      
      
      
      
    
        where  is a differentiable function. It is related with  in the following way: . □
Corollary 2. 
A dynamical system that can be reconstructed via integral embedding exists if a system that can be reconstructed via differential embedding exists.
The ODE representation of a partially observed system with K observed variables and an M-dimensional full phase space is the following:
      
        
      
      
      
      
    
        where  is a scalar function. Missing derivatives of orders  can be found numerically. Finite-difference-based numerical differentiation is the most common approach, but some other techniques can be used as well, e.g., numerical differentiation based on Legendre polynomials []. If the known state variables are denoted , the system reconstructed via integration reads
        
      
        
      
      
      
      
    
        which is an alternative form of Equation (26) with other variables. □
The number of dimensions M can be selected according to its physical sense or with one of several algorithms, e.g., the false nearest neighbors (FNN) algorithm [,], or by estimating one of the fractal dimensions or entropies [].
The integral embedding approach is more useful in some practical cases due to the following two theorems. Theorem 1 considers a case, appearing in many practical applications, when the differential embedding approach cannot give a valid reconstruction due to the non-uniqueness of the solution corresponding to the reconstructed trajectory.
Theorem 1. 
A function u(t) determined on a time interval exists, satisfying the following conditions:
- (1)
- u(t) produces a time series u that is a one-dimensional projection of an M-dimensional trajectory of a dynamical system;
- (2)
- u(t) is continuous and satisfies the Lipschitz condition with some Lipschitz constant;
- (3)
- an M-dimensional trajectory reconstructed from the time series u using differentiation does not satisfy the condition of uniqueness of the solution;
- (4)
- an M-dimensional trajectory can be reconstructed from the time series u using integration that satisfies the condition of uniqueness of the solution and is thus topologically valid.
Proof.  
  
    
       
      
    
  
  
Consider a continuous function that satisfies the following conditions. First, there exists an infinite set of time points at which the function  and its derivatives equal an arbitrary number :
      
        
      
      
      
      
    
At other time points the function  differs from :
      
        
      
      
      
      
    
Second, for a sufficiently large number :  there exist two time points  and , where the function changes its value relatively to :
      
        
      
      
      
      
. Then, under these conditions, reconstruction by differentiation yields a phase space containing a point at which at least two trajectory loops are connected, and therefore the uniqueness of the solution is violated: starting from this point and having no a priori information about which branch should be selected, we can neither predict the next state of the system nor determine its previous state. Therefore, the trajectory reconstructed from the time series u using differentiation does not satisfy the condition of uniqueness of the solution. Meanwhile, due to Lemmas 1 and 2, it is possible to obtain an M-dimensional trajectory from the time series u using integration. Due to Corollary 1, it is possible to select integration constants such that the integrated series and its integrals do not take the critical value, so the condition of uniqueness of the solution may be satisfied and the trajectory is thus topologically valid. Figure 1 illustrates the phase space geometry. □
 
      
    
    Figure 1.
      Illustration of Theorem 1. The red line corresponds to the double-loop trajectory  obtained by the differential approach, the blue line corresponds to cyclic trajectory  obtained by the integration approach. Here,  and  are phase variables.
  
Time series u satisfying conditions 2–4 of Theorem 1 are rather common in biology, medicine, genetics, and neuroscience.
Theorem 2 is dedicated to the noise-amplifying properties of the differentiation operator. In the presence of additive white Gaussian noise, when the sampling frequency is high enough and the spectral density of the signal is mostly low-frequency, differentiation tends to decrease the signal-to-noise ratio (SNR), while integration does not.
Theorem 2. 
Denote the Fourier transform of a signal as its spectrum. Let the power spectrum of the signal lie almost entirely in a bounded low-frequency interval, and let the signal be contaminated by additive white Gaussian noise (AWGN). Then:
- (1)
- after differentiation, the SNR of the signal will be smaller than the SNR of the original signal;
- (2)
- after integration, the SNR of the signal will be at least not smaller than the SNR of the original signal.
Proof.  
  
    
       
      
    
  
  
Recall that the signal-to-noise ratio is determined as

SNR = A_signal^2 / A_noise^2,

where A_signal is the root-mean-square (RMS) amplitude of the signal, with its square equal to

A_signal^2 = (1/T) \int_0^T s^2(t) dt,

where T is the time interval, s(t) is the signal, and A_noise is the RMS amplitude of the noise.
Denote the sampling frequency as f_s and the Fourier transform of the noise as N(f). Then, the power spectral density of the noise, calculated within the frequency interval [0, f_s/2], is
        
      
        
      
      
      
      
    
According to Parseval’s theorem,
        
      
        
      
      
      
      
    
        and the same is true for the signal. Consequently, from Equation (31) and Equations in (32) SNR may be expressed through power spectra as
        
      
        
      
      
      
      
    
Note that AWGN has a uniform power spectral density, i.e.,
        
      
        
      
      
      
      
    
The condition that the power spectrum of  lie almost entirely in a frequency interval  means that its power spectrum  satisfies the condition:
      
        
      
      
      
      
    
        where  is a positive real number close to 1, and  is a frequency up to which almost the whole spectrum is contained, usually, . The gap between  and  is almost occupied by the noise only, see Figure 2a. Consequently, in this case, a formula for SNR can be rewritten as:
      
        
      
      
      
      
    
 
      
    
    Figure 2.
      Illustration of Theorem 2. Blue plot corresponds to the Fourier transform of the signal  and red plot corresponds to the Fourier transform of the noise . (a) Original signal and noise; (b) after differentiation; and (c) after integration.
  
Consider integration and differentiation acting on the signal. Taking the Fourier transform, we obtain

F[ds/dt](f) = 2\pi i f S(f),   F[\int s dt](f) = S(f) / (2\pi i f),

where i is the imaginary unit and S(f) is the spectrum of the signal. The same is true for the noise. Denote the spectra of the differentiated and integrated signal and noise accordingly. Then, the corresponding spectral densities are:
      
        
      
      
      
      
    
From Equation (31) and Equations in (32), SNR of the differentiated signal is:
      
        
      
      
      
      
    
Due to Hölder’s inequality, and the fact that both  and  are nonnegative,
        
      
        
      
      
      
      
    
Therefore, the SNR of the differentiated signal is never greater than the SNR of the original signal under the assumptions made above.
For the integration operator, from Equation (31) and Equations in (32),
        
      
        
      
      
      
      
    
Again, due to Hölder’s inequality,
        
      
        
      
      
      
      
    
Therefore, while the SNR of the integrated signal is not necessarily greater than the SNR of the original signal, it is also not necessarily smaller than it, in contrast to the case of differentiation. □
Theorem 2 is in good agreement with signal processing theory [], from which it is known that the differentiation operator acts as a high-pass filter and amplifies high-frequency noise, while the integration operator acts as a low-pass filter.
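The effect can be checked numerically with the following MATLAB sketch, in which the test signal, the noise level, and the sampling frequency are arbitrary example choices.

% Numerical illustration of Theorem 2: SNR before and after
% differentiation and integration of a noisy low-frequency signal.
fs = 1000;
t = (0:1/fs:10)';
s = sin(2*pi*1*t) + 0.3*sin(2*pi*3*t);      % low-frequency signal
n = 0.05 * randn(size(t));                  % additive white Gaussian noise

snr0 = mean(s.^2) / mean(n.^2);
sd = gradient(s, 1/fs);  nd = gradient(n, 1/fs);
snr_diff = mean(sd.^2) / mean(nd.^2);       % drops: the noise is amplified
si = cumtrapz(t, s);     ni = cumtrapz(t, n);
snr_int = mean(si.^2) / mean(ni.^2);        % does not drop

fprintf('SNR: original %.1f, differentiated %.3f, integrated %.1f\n', ...
        snr0, snr_diff, snr_int);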
2.4. Description of the Proposed Reconstruction Technique
The overall proposed technique is described as follows:
First, we consider the time series set of a partially observed dynamical system and its derivative. Using either integral or differential embedding, preferring the former due to Theorems 1 and 2, we reconstruct the phase space and the M-dimensional phase space trajectory. In this paper, we suppose that M is already known; if not, the FNN algorithm or other dimension-finding algorithms can be applied. For the reasons why we do not reject differential embedding entirely, see the discussion of limitations in Section 4.
Then, we generate a degree-lexicographically ordered set of monomials, where the total power of each monomial lies within a prescribed range and the powers of the individual variables in a monomial also lie within a prescribed range. From a practical point of view, we assume that both bounds are small. The algorithm for generating this set is given in pseudocode in Listing 1.
The subroutine generateV returns a special indexing matrix V:
      
        
      
      
      
      
    
The subroutine adds produces a column-wise sum of  as a candidate vector  of monomial terms powers:
      
        
      
      
      
      
    
The monomial  is added to  if it is not already contained in . The order of multiplication is neglected, e.g., the term  is considered similar to the term .
The subroutine shiftForward produces the next state of the matrix V, considering it as a D-digit number in the M-ary numeral system, where each digit has M states defined by the position of the non-zero element in the corresponding row, the first row being the least significant digit of the number; the increment denotes summation with a column vector of ones.
Listing 1. A pseudocode for a deglexord routine generating degree-lexicographically ordered Laurent monomials
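For illustration, the following MATLAB sketch generates degree-lexicographically ordered Laurent exponent vectors directly, without the V-matrix and shiftForward mechanics of the pseudocode; its bounds d and D and its tie-breaking rule are only one plausible reading of the ordering described in Section 2.1.

function E = deglexord_sketch(M, d, D)
% Exponent vectors of Laurent monomials in M variables with per-variable
% powers in [-d, d] and total degree sum(|e|) <= D, ordered
% degree-lexicographically (illustrative sketch, not the original Listing 1).
grids = cell(1, M);
[grids{:}] = ndgrid(-d:d);               % all combinations of powers
E = zeros(numel(grids{1}), M);
for k = 1:M
    E(:, k) = grids{k}(:);
end
E = E(sum(abs(E), 2) <= D, :);           % keep total degree within the bound
% primary key: total degree; secondary: absolute powers of the variables
% taken alphabetically; last: sign (positive power before negative one).
[~, idx] = sortrows([sum(abs(E), 2), abs(E), -sign(E)]);
E = E(idx, :);
end

For example, deglexord_sketch(2, 2, 2) returns the exponent vectors of all two-variable Laurent monomials with powers within [-2, 2] and total degree at most 2.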
 
When the degree-lexicographically ordered set of monomials is computed, the Kera–Hasegawa method is applied to it together with the reconstructed dataset and its derivatives. The obtained reconstructed PDS corresponds to the original dynamical system if the noise level in the data is sufficiently low and the system is representable in the form of a PDS in the given set of coordinates.
3. Experimental Results
In this section, we first illustrate the advantages of integral embedding with two examples and then show how the proposed approach can be used for reconstructing the Lorenz attractor from a one-dimensional time series and the equations of a chaotic memristive circuit from a three-dimensional time series.
3.1. Illustrations of the Theorems
The following example illustrates Theorem 1.
Example 1. 
  
    
       
      
    
  
  
Consider the ODE:
      
        
      
      
      
      
    where. The solution of (30) is
      
        
      
      
      
      
    
The function in (31) satisfies the conditions of Theorem 1. Figure 3 shows two variants of the three-dimensional phase space reconstructed from the time series via differentiation and via integration; the unresolvable point is marked in Figure 3b.
 
      
    
    Figure 3.
      Example of phase space reconstruction. (a) Time series  and its two integrals; (b) comparison of an integration-based approach (blue trajectory) and a differentiation-based approach (red trajectory). The red dot at  denotes a singular point unresolvable in sense of FNN where the uniqueness condition is violated.
  
Example 2 shows how the differential and integral embedding approaches can be applied to ECG signals. In this example, the conditions of both theorems are satisfied, i.e., differentiation produces a noisy trajectory reconstruction with a singular point, while integration does not have these drawbacks.
Example 2. 
  
    
       
      
    
  
  
Consider the ECG signal from ECG-ID Database [,], as shown in Figure 4a.
 
      
    
    Figure 4.
      Example of the phase space reconstruction. (a) ECG time series ; (b) comparison of the integration-based approach (left) and the differentiation-based approach (right).
  
After applying the differential embedding approach, we obtain a trajectory that does not satisfy the uniqueness criterion and is contaminated with noise. After applying the integral embedding approach, we obtain a rather smooth trajectory with no singular points.
3.2. Reconstruction of Lorenz Attractor
Consider the classical Lorenz attractor

\dot{x} = \sigma (y - x),   \dot{y} = x (\rho - z) - y,   \dot{z} = x y - \beta z,

with the standard parameter set \sigma = 10, \rho = 28, \beta = 8/3 []. Suppose that only one variable can be observed, so we should transform the Lorenz system to obtain a model in the new coordinates:
      
        
      
      
      
      
    
Equation (37) cannot be reconstructed with ordinary polynomial functions due to the negative powers of one of the variables, but it is possible to reconstruct it with Laurent polynomial functions. The proposed algorithm can be applied using both the differential and the integral approaches.
The deglexord procedure (see Listing 1) generates a set of monomials of the prescribed degrees. To ensure that the system of equations is overdetermined, we should take no fewer than 51 points in the dataset, since we can never be sure that any particular term will vanish on it and be removed by the ABM algorithm. An example of the reconstructed system is:
      
        
      
      
      
      
    
To reach a high quality of reconstruction, we generate data using the function ode113 with prescribed RelTol and AbsTol parameters, a constant stepsize, and a fixed simulation time in the MATLAB 2019b environment (Campus license No. 40502181, ETU “LETI”), and we use a 4th-order integration formula. Note that Equation (37) can be reconstructed from the data obtained from the original Lorenz system using the time series corresponding to the observed variable and its two integrals.
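The data-generation step can be sketched in MATLAB as follows; the observed variable (here the third one), the initial conditions, the tolerances, and the use of the trapezoidal rule instead of a fourth-order formula are assumptions made for this illustration only.

% Simulate the Lorenz system, keep one observed variable, and build the
% remaining coordinates by double integration with de-trending.
sigma = 10; rho = 28; beta = 8/3;
lorenz = @(t, s) [sigma*(s(2) - s(1)); s(1)*(rho - s(3)) - s(2); s(1)*s(2) - beta*s(3)];
h = 0.01;
t = (0:h:50)';
opts = odeset('RelTol', 1e-10, 'AbsTol', 1e-10);
[~, S] = ode113(lorenz, t, [1; 1; 1], opts);

u = S(:, 3);                          % observed scalar series
v = detrend(cumtrapz(t, u));          % first integral, de-trended
w = detrend(cumtrapz(t, v));          % second integral, de-trended
X = [w, v, u];                        % trajectory passed to the KH method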
Two projections of the attractor Equation (37) and its reconstructed version are given in Figure 5. The initial conditions were , the parameter .
 
      
    
    Figure 5.
      The trajectory of the modified Lorenz system Equation (37) is projected onto (a) x-y plane and (b) x-z plane. The green dots denote 95 data points, the blue line depicts the original trajectory, and the orange line depicts a trajectory of the reconstructed system.
  
Truncation and round-off errors can be considered as a sort of noise [], which makes it possible to compare differential and integral embedding in terms of sensitivity to the reconstruction error induced by the stepsize. The results are given in Figure 6.
 
      
    
    Figure 6.
      Results of the reconstruction of system Equation (37) using integration and differentiation of orders 2 and 4 with respect to the stepsize. (a) mean error  by all 3 dimensions; (b) overall number of terms in the reconstruction, where  corresponds to the original system (37). Square markers correspond to the differentiation-based approach; crosses correspond to the integration-based approach. We tested 15 stepsizes, 10 experiments were carried out with each stepsize.
  
In Figure 6, the phase space reconstruction method based on differentiation of the 2nd order of accuracy is denoted as DiffOrd2 (central differences) and that of the 4th order of accuracy as DiffOrd4 (4th-order finite differences) []. The method based on 2nd-order integration is denoted as IntOrd2 (trapezoidal rule) and the 4th-order method as IntOrd4 (Simpson's rule). The two plots summarize the results for the mean error across the three state variables and the overall number of terms.
Both plots show that, for small stepsizes, the integration-based reconstruction method performs well, while the differentiation-based approach suffers from numerical noise, in accordance with the proof of Theorem 2: the higher the sampling frequency, the more the numerical noise is amplified by differentiation and the more likely the algorithm is to fail. Large stepsizes, in turn, are more likely to corrupt the integration-based approach, but this happens only when both methods give inaccurate results even with the 4th-order formulae. The reconstruction techniques based on second-order formulae become inaccurate at smaller stepsizes than those based on fourth-order formulae. By “inaccurate” we mean that Equation (37) was reconstructed with some missing or excessive terms.
3.3. Reconstruction of Muthuswamy’s Memristive Circuit
Replacing the nonlinear negative resistor (Chua diode) in the well-known Chua's circuit with a flux-controlled memristor, one obtains a circuit able to exhibit chaotic behavior []. The circuit proposed by B. Muthuswamy includes five elements (Figure 7): a linear passive inductor, two linear passive capacitors, a linear passive resistor, and a nonlinear active memristor.
 
      
    
    Figure 7.
      Schematics of the five-element chaotic circuit [].
  
After scaling and transformation, we obtain the following system of ODEs:
      
        
      
      
      
      
    
The parameters were set to the values providing chaotic behavior. We simulated the system with the function ode113 using prescribed RelTol and AbsTol parameters, a constant stepsize, and a fixed final time. In this experiment, we used data obtained from the original Equation (39) and reconstructed the unobserved state variable by integrating the observed one. This is motivated by two facts: first, this variable stands for the magnetic flux between the terminals of the memristive device [] and therefore cannot be measured on a macroscopic scale, so it is assumed to be unobservable; second, obtaining it by integration is very natural for system (39) due to the flux-voltage relation and therefore needs no complicated transformations.
The degrees of the terms in the monomials were placed within a prescribed interval, and the tolerance parameter of the ABM algorithm was fixed. The reconstructed equations are
        
      
        
      
      
      
      
    
Note that the Equations in (40) are a rescaled version of the Equations in (39), with one state variable rescaled relative to the original equation.
Two projections of the attractors of the original and reconstructed systems are given in Figure 8. We compared three integration methods by changing the tolerance parameter of the delMinorTerms algorithm within a prescribed range; Figure 9 depicts the results of the experiment. Using higher-order integration formulae allows more precise and more compact solutions to be achieved. The experiment also shows the difficulty of choosing the tolerance: its low values force the algorithm to generate equations with many excessive terms, which makes the least-squares solution more precise. But, as one can see in Figure 9, the error of even a proper reconstruction is rather high due to the high absolute values of the state variables (see Figure 6), and thus the truncation and numerical errors in the reconstructed data are large as well.
 
      
    
    Figure 8.
      The trajectory of the system Equations in (40) projected onto (a) x-y plane and (b) x-z plane. The green Dots denote 53 data points, the blue line depicts the original trajectory, and the orange line depicts a trajectory of the reconstructed system scaled to fit the original trajectory. The equations were reconstructed with high accuracy.
  
 
      
    
    Figure 9.
      Results of the reconstruction of system Equations in (39) using integration of orders 1, 2 and 4 (Euler, Trapezoidal and Simpson’s respectively) in accordance to the delMinorTerms tolerance. (a) Mean error  by 4 dimensions; (b) overall number of terms in the reconstruction, where  corresponds to the original system Equations in (39). We tested 15 values of , 10 experiments were carried out with each value.
  
To summarize, the experiment shows that higher-order integration is preferable, and the tolerance value should be adjusted with respect to the properties of the problem.
4. Discussion and Conclusions
Two main findings are highlighted in this paper. First, we show that the integral embedding approach has several advantages over the differential approach often used in the case of incompletely observed systems. Second, we extend the Kera–Hasegawa method to Laurent polynomials and demonstrate its performance by examples.
Still, some limitations of this method exist. The first, and the most prominent one, is that not every system can be represented as a PDS; furthermore, there are some PDSs that do not allow transformation to the form of Equation (22) or Equation (24) or their combinations.
Consider the following system with an unobservable variable :
      
        
      
      
      
      
    
      where  are parameters. The natural way to reconstruct Equations (41) in this case is to transform it into
      
      
        
      
      
      
      
    
where a new variable is reconstructed from the observable variable through differentiation. Equation (42) cannot be represented in terms of polynomial functions and, moreover, needs a special technique to resolve the sign ambiguity that appears when the unobservable variable is expressed from the second equation of (41).
The second limitation of the proposed reconstruction method can be exemplified by Muthuswamy's memristive circuit. In many cases, the algorithm found a PDS different from the Equations in (40) but which is also correct on the limited data series and has a stable attractor similar to that of the system (40):
      
        
      
      
      
      
    
Moreover, the third Equation in (43) is parametric, so infinitely many variants of it exist. Thus, if similar data can be produced by several PDSs, the method may find any one of them.
The third limitation of the proposed reconstruction method was found in both experiments, with the Lorenz system and with Muthuswamy's memristive circuit: the method is sensitive to the algorithm parameters, such as the stepsize, the order of accuracy of the reconstruction method, its type (differentiation or integration), and the tolerance of the delMinorTerms algorithm. Obviously, sensitivity to the ABM algorithm tolerance also takes place, but we did not investigate its influence.
The fourth limitation is that the dimension of the search space grows dramatically when the number of terms or the system dimension grows. For example, a four-dimensional PDS in complete Laurent polynomials of the considered degrees can be constructed out of 471 unique monomials, a five-dimensional PDS out of 1281 unique monomials, and a six-dimensional PDS out of 3067 unique monomials. Even if the ABM algorithm significantly reduces these numbers, very poorly conditioned large-scale regression problems are likely to arise, and it is not obvious whether the algorithm remains useful in these cases. This question needs further investigation.
The Laurent monomial ordering, the deglexord routine, and the BM algorithm for Laurent monomials also need the development of a stricter theoretical foundation, as in the case of the well-established ordinary monomials []. Moreover, from Equation (4) it is clear that the reconstruction approach based on the LSM can handle not only PDSs but also functions containing other elementary nonlinearities (e.g., trigonometric functions). This also needs further development.
Author Contributions
Conceptualization, D.B.; data curation, E.G.N.; formal analysis, E.G.N.; funding acquisition, D.B.; investigation, A.K. and A.T.; methodology, A.K.; project administration, A.K. and D.B.; resources, A.T.; software, A.K. and A.T.; supervision, E.G.N. and D.B.; validation, D.B.; visualization, A.T.; writing—original draft, A.K.; writing—review & editing, E.G.N. and D.B. All authors have read and agreed to the published version of the manuscript.
Funding
The reported study was supported by RFBR, research project no. 19-07-00496.
Acknowledgments
The authors are grateful to Valerii Ostrovskii for preparing Figure 7 and fruitful comments on the introduction.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Laubenbacher, R.; Stigler, B.A. Computational algebra approach to the reverse engineering of gene regulatory networks. J. Theor. Biol. 2004, 229, 523–537. [Google Scholar] [CrossRef] [PubMed]
- Letellier, C.; Maquet, J.; Labro, H.; Sceller, L.L.; Gouesbet, G.; Argoul, F.; Arnéodo, A. Analyzing Chaotic Behavior in a Belousov− Zhabotinskyi Reaction by Using a Global Vector Field Reconstruction. J. Phys. Chem. A 1998, 102, 10265–10273. [Google Scholar] [CrossRef]
- Akman, O.E.; Watterson, S.; Parton, A.; Binns, N.; Millar, A.J.; Ghazal, P. Digital clocks: Simple Boolean models can quantitatively describe circadian systems. J. R. Soc. Interface 2012, 9, 2365. [Google Scholar] [CrossRef] [PubMed]
- Gerhard, F.; Kispersky, T.; Gutierrez, G.J.; Marder, E.; Kramer, M.; Eden, U. Successful reconstruction of a physiological circuit with known connectivity from spiking activity alone. PLoS Comput. Biol. 2013, 9, e1003138. [Google Scholar] [CrossRef]
- Bansal, M.; Della, G.G.; Bernardo, D.D. Inference of gene regulatory networks and compound mode of action from time course gene expression profiles. Bioinformatics 2006, 22, 815. [Google Scholar] [CrossRef]
- Li, J.; Zhang, X.S. An optimization model for gene regulatory network reconstruction with known biological information. Optim. Syst. Biol. 2007, 7, 35. [Google Scholar]
- Sprott, J.C. Some simple chaotic flows. Phys. Rev. E 1994, 50, R647. [Google Scholar] [CrossRef]
- Crutchfield, J.P.; McNamara, B.S. Equations of motion from a data series. Complex Syst. 1987, 1, 417–452. [Google Scholar]
- Cremers, J.; Hübler, A. Construction of differential equations from experimental data. Z. Für Nat. A 1987, 42, 797–802. [Google Scholar]
- Breeden, J.L.; Dinkelacker, F.; Hübler, A. Noise in the modeling and control of dynamical systems. Phys. Rev. A 1990, 42, 5827–5836. [Google Scholar] [CrossRef]
- Aguirre, L.A.; Letellier, C. Modeling nonlinear dynamics and chaos: A review. Math. Probl. Eng. 2009, 2009, 238960. [Google Scholar] [CrossRef]
- Iba, H. Inference of differential equation models by genetic programming. Inf. Sci. 2008, 178, 4453. [Google Scholar] [CrossRef]
- Searson, D.P.; Leahy, D.E.; Willis, M.J. GPTIPS: An open source genetic programming toolbox for multigene symbolic regression. In Proceedings of the International multiconference of engineers and computer scientists, Hong Kong, 17–19 March 2010; Volume 1, pp. 77–80. [Google Scholar]
- Yogatama, D.; Smith, N. Making the most of bag of words: Sentence regularization with alternating direction method of multipliers. In Proceedings of the 31st International Conference on Machine Learning, Beijing, China, 21–26 June 2014; Volume 32, p. 656. [Google Scholar]
- Kera, H.; Hasegawa, Y. Noise-tolerant algebraic method for reconstruction of nonlinear dynamical systems. Nonlinear Dyn. 2016, 85, 675–692. [Google Scholar] [CrossRef]
- Linn, E.; Siemon, A.; Waser, R.; Menzel, S. Applicability of well-established memristive models for simulations of resistive switching devices. IEEE Trans. Circuits Syst. I Regul. Pap. 2014, 61, 2402–2410. [Google Scholar] [CrossRef]
- Abarbanel, H.D.; Brown, R.; Sidorowich, J.J.; Tsimring, L.S. The analysis of observed chaotic data in physical systems. Rev. Mod. Phys. 1993, 65, 1331. [Google Scholar] [CrossRef]
- Karimov, T.; Butusov, D.; Andreev, V.; Karimov, A.; Tutueva, A. Accurate synchronization of digital and analog chaotic systems by parameters re-identification. Electronics 2018, 7, 123. [Google Scholar] [CrossRef]
- Muthuswamy, B. Implementing memristor based chaotic circuit. Int. J. Bifurc. Chaos 2010, 20, 1335–1350. [Google Scholar] [CrossRef]
- Nesterov, Y. A Method of Solving a Convex Programming Problem with Convergence Rate O(1/k2). In Sov. Math. Dokl; USSR: Moscow, Russia, 1983; Volume 27, pp. 372–376. [Google Scholar]
- Shi, Y.; Zhu, X.X.; Yin, W.; Bamler, R. A fast and accurate basis pursuit denoising algorithm with application to super-resolving tomographic SAR. IEEE Trans. Geosci. Remote Sens. 2018, 56, 6148–6158. [Google Scholar] [CrossRef]
- Tikhonov, A.N.; Arsenin, V.Y. Solution of Ill-posed Problems; Winston & Sons: Hoboken, NJ, USA, 1977. [Google Scholar]
- Takens, F. Detecting Strange Attractors in Turbulence. Lecture Notes in Mathematics; Springer Science and Business Media: Berlin/Heidelberg, Germany, 1980; pp. 366–381. [Google Scholar]
- Packard, N.H.; Crutchfield, J.P.; Farmer, J.D.; Shaw, R.S. Geometry from a time series. Phys. Rev. Lett. 1980, 45, 712–716. [Google Scholar] [CrossRef]
- Lekscha, J.; Donner, R.V. Phase space reconstruction for non-uniformly sampled noisy time series. Chaos: Interdiscip. J. Nonlinear Sci. 2018, 28, 085702. [Google Scholar] [CrossRef]
- Kennel, M.B.; Brown, R.; Abarbanel, H.D. Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys. Rev. A 1992, 45, 3403. [Google Scholar] [CrossRef] [PubMed]
- Rhodes, C.; Morari, M. The false nearest neighbors algorithm: An overview. Comput. Chem. Eng. 1997, 21, S1149–S1154. [Google Scholar] [CrossRef]
- Bradley, E.; Kantz, H. Nonlinear time-series analysis revisited. Chaos: Interdiscip. J. Nonlinear Sci. 2015, 25, 097610. [Google Scholar] [CrossRef] [PubMed]
- Oppenheim, A.V.; Buck, J.R.; Schafer, R.W. Discrete-Time Signal Processing; Prentice Hall: Upper Saddle River, NJ, USA, 2001; Volume 2. [Google Scholar]
- Goldberger, A.L.; Amaral, L.A.N.; Glass, L.; Hausdorff, J.M.; Ivanov, P.Ch.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.-K.; Stanley, H.E. PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals. Circulation 2003, 101, e215–e220. [Google Scholar] [CrossRef]
- Lugovaya, T.S. Biometric Human Identification Based on Electrocardiogram. Master’s Thesis, Faculty of Computing Technologies and Informatics, Electrotechnical University “LETI”, Saint-Petersburg, Russia, June 2005. [Google Scholar]
- Leonov, G.A.; Pogromsky, A.Y.; Starkov, K.E. The dimension formula for the Lorenz attractor. Phys. Lett. A 2011, 375, 1179–1182. [Google Scholar] [CrossRef]
- Hinamoto, T.; Lu, W.S. Digital Filter Design and Realization. River Publ. 2017, 384. [Google Scholar]
- Jordan, C.; Jordán, K. Calculus of finite differences. Am. Math. Soc. 1965, 33, 15. [Google Scholar] [CrossRef]
- Rahkooy, H.; Zafeirakopoulos, Z. Using resultants for inductive Gröbner bases computation. Acm Comm. Comput. Algebra 2011, 45, 135–136. [Google Scholar] [CrossRef]
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
