State Estimation of Memristor Neural Networks with Model Uncertainties

Ma, Libin; Wang, Mao

doi:10.3390/machines10121228

Open AccessArticle

State Estimation of Memristor Neural Networks with Model Uncertainties

by

Libin Ma

and

Mao Wang

^*

Space Control and Inertial Technology Research Center, Harbin Institute of Technology, Harbin 150001, China

^*

Author to whom correspondence should be addressed.

Machines 2022, 10(12), 1228; https://doi.org/10.3390/machines10121228

Submission received: 24 October 2022 / Revised: 7 December 2022 / Accepted: 12 December 2022 / Published: 15 December 2022

(This article belongs to the Section Automation and Control Systems)

Download

Browse Figures

Versions Notes

Abstract

This paper is concerned with the problem of state estimation of memristor neural networks with model uncertainties. Considering the model uncertainties are composed of time-varying delays, floating parameters and unknown functions, an improved method based on long short term memory neural networks (LSTMs) is used to deal with the model uncertainties. It is proved that the improved LSTMs can approximate any nonlinear model with any error. On this basis, adaptive updating laws of the weights of improved LSTMs are proposed by using Lyapunov method. Furthermore, for the problem of state estimation of memristor neural networks, a new full-order state observer is proposed to achieve the reconstruction of states based on the measurement output of the system. The error of state estimation is proved to be asymptotically stable by using Lyapunov method and linear matrix inequalities. Finally, two numerical examples are given, and simulation results demonstrate the effectiveness of the scheme, especially when the memristor neural networks with model uncertainties.

Keywords:

memristor neural networks; model uncertainties; long short term memory neural networks (LSTMs); adaptive updating laws; full-order state observer

1. Introduction

Since the prototype of memristor was born in 1971 by Chua [1], memristor have been widely used in all walks of life. The vector–matrix multiplication is realized by the crossbar array structure of memristor, and a neural network can be realized by the corresponding coding scheme based on it. Various neural networks based on memristor hardware have been developed rapidly. Because of an incomparable advantage that memristor neural networks can reflect the memorized information, the memristor neural networks are particularly suitable for self-adaptability, nonlinear systems, self-learning, and associative storage, so memristor neural networks are widely used in brain simulation, pattern recognition, neural morphologic computation, knowledge acquisition, and various hardware applications involving neural networks [2,3,4,5,6,7,8,9,10,11,12,13,14,15]. To list a few, the experimental implementation of transistor-free metal-oxide memristor crossbars, with device variability sufficiently low to allow operation of integrated neural networks, in a simple network: a single-layer perceptron (an algorithm for linear classification) was shown in [16]. In [17], a structure suppressing the overshoot current was investigated to approach the conditions required as an ideal synapse of a neuromorphic system. In [18], fully memristive artificial neural networks were built by using diffusive memristors based on silver nanoparticles in a dielectric film. The electrical properties and conduction mechanism of the fabricated IGZO-based memristor device in a 10 × 10 crossbar array were analyzed in [19]. Operation of one-hidden layer perceptron classifier entirely in the mixed-signal integrated hardware was demonstrated in [20]. Therefore, the research on memristor neural networks is very necessary and meaningful. Although many papers have extended the memristor neural networks and solved some problems, there are still problems in the memristor neural networks. Therefore, memristor neural networks including their various kinds of deformation have broad market prospects. Especially, the research on memristor neural networks with model uncertainties has become a hot topic.

In recent decades, scholars have carried out a great amount of research and analysis on memristor neural networks. The results can be broadly divided into four categories: (1) Stability analysis of memristor neural networks [21,22,23,24,25]; (2) State estimation of memristor neural networks [26,27,28,29]; (3) Synchronization problem of memristor neural networks [30,31,32]; (4) Control problem of memristor neural networks [33,34,35]. In practice, time-varying delays must exist in the hardware implementation of memristor neural networks. Due to the existence of time-varying delays, the future states of the system are affected by the previous states, which leads to instability of the system and poor control performance. Consequently, state estimation of memristor neural networks is of great research value and a large part of the research has focused on state estimation of memristor neural networks. Note that the above results are generally based on the known structures and parameters of memristor neural networks without model uncertainties. In practice, the hardware implementation of memristor neural networks usually fails to attain the ideal design values, and there are design deviations. In particular, model uncertainties often exist in the hardware implementation of memristor neural networks. Therefore, model uncertainties and model errors are common in hardware memristor neural networks. Similarly, affected by model uncertainties, state estimation of memristor neural networks is also a challenging problem. Considering the above analysis, it is needed to study state estimation of memristor neural networks with model uncertainties.

A great amount of valuable research on state estimation of memristor neural networks with model uncertainties can be found in [26,27,28,29,36,37,38]. In [26], it used passivity theory to deal with the state estimation problem of memristor-based recurrent neural networks with time-varying delays. By using Lyapunov–Krasovskii function (LKF), convex combination technique and reciprocal convexity technique, a delay-dependent state estimation matrix was established, and the expected estimator gain matrix was obtained by solving linear matrix inequalities (LMIs). It is a pity that the model of the system must be determined and the functions in the system must be known. In [27], for memristor neural networks with randomness, the random system was transformed into an interval parameter system by Filippov, and the H∞ state observer was designed on this basis. One of the problems in the paper is that it is a random interference that affects the system rather than model uncertainty. The random interference is regular and limited. In [28], for memristor-based bidirectional associative memory neural networks with additive time-varying delays, a state estimation matrix was constructed by selecting an appropriate LKF and using the Cauchy-Schwartz-based summation inequality, and the gain matrix was obtained by the LMIs. The paper also has the problems mentioned above. In [29], for a class of memristor neural networks with different types of inductance functions and uncertain time-varying delays, a state estimation matrix was constructed by selecting a suitable LKF, and the gain matrix was solved by using the LMIs and Wirtinger-type inequality. Model uncertainty is involved in the paper, but it is only for the uncertainty of the time-varying delays. In [36], an extended dissipative state observer was proposed by using nonsmooth analysis and a new LKF. In [37], based on the basic properties of quaternion-valued, a state observer was designed for quaternion-valued memristor neural networks, and algebraic conditions were given to ensure global dissipation. The methods proposed in [36,37] are not suitable for memristor neural networks with model uncertainties. In [38], for memristor neural networks with random sampling, the randomness was represented by two different sampling periods, which satisfied a Bernoulli distribution. The random sampling system was transformed into a system with random parameters by using an input delay method. On this basis, a state observer was designed based on the LMIs and a LKF. Through the above discussion, it is not difficult to find that a similar method is used to estimate the states of memristor neural networks. By selecting an appropriate LKF, the state observation matrix is constructed based on the structure of the system, and the gain matrix is solved by utilizing the LMIs. It can be seen from the above analysis that most studies on state estimation of memristor neural networks have the same problem, which requires that the system cannot contain the model uncertainties. Some studies include model uncertainties, which are only for time-varying delays. Other studies also include model uncertainties, which are only about the fluctuation of parameters. There are few studies on the state estimation of memristor neural networks whose model uncertainties include time-varying delays, floating parameters and unknown functions. It has a huge research potential to tap.

When the memristor neural networks are designed and translated into hardware by the designer, the model uncertainties of the system only include the time-varying delays and floating parameters. In practice, the situation is not unique. Sometimes it is necessary to analyze the memristor neural networks designed by other designers. At this time, the model uncertainties of memristor neural networks include time-varying delays, floating parameters and unknown functions. The model of the memristor neural networks can be designed as in Figure 1 [28]. Motivated by the above discussion, the main concern of this paper is to design a state observer for memristor neural networks with model uncertainties, which include time-varying delays, floating parameters and unknown functions. Model uncertainties are composed of current states, past states and unknown functions. In order to approach the model uncertainties that contain memory information, improved long short term memory neural networks (LSTMs) are proposed. It is theoretically proved that the improved LSTMs can approach the model uncertainties with arbitrary error. Memristor neural networks with model uncertainties can be transformed into a new system with an improved LSTMs. On this basis, a full-order state observer is designed according to the output of the system. An error matrix of the states is constructed by a designed LKF, and the gain matrix is solved by the LMIs. In order to make the new system more accurate, a new error matrix of the states is constructed by using Young’s inequality based on a LKF. On this basis, adaptive updating laws of the weights of improved LSTMs are designed to reduce the errors of the states. The main contributions of this paper are as follows.

Improved LSTMs are proposed for memristor neural networks with model uncertainties. It is proved that the improved LSTMs can well approach the model uncertainties in memristor neural networks. Model uncertainties include time-varying delays, floating parameters and unknown functions. It has not been seen in other studies.
By utilizing the LMIs and a LKF, a full-order observer based on the output of the system is presented to obtain state information and solve the problem of state estimation.
By using Young’s inequality and a designed LKF, adaptive updating laws of the weights of improved LSTMs are given to obtain the new system with improved LSTMs precisely.

This paper is organized as follows. In Section 2, the problem is formulated, and several essential assumptions and lemmas are listed. Section 3 presents the primary theorems, including improved LSTMs, observer design for memristor neural networks with model uncertainties, and adaptive updating laws of the weights of improved LSTMs. In Section 4, the effectiveness of the proposed scheme is demonstrated through numerical examples. Finally, the conclusions are drawn in Section 5.

Notation:

R^{n}

denotes the n dimensional Euclidean space. For a given matrix

A

or vector

B

,

A^{T}

and

B^{T}

denote their transpose, and

t r {A}

denotes its trace.

A < 0

indicate a negative definite matrix.

2. Preliminaries

Considering the memristor neural networks as follows, the same model can be found in [26,27,28,36,37],

\begin{matrix} \begin{matrix} \{\begin{matrix} {\dot{x}}_{1} (t) = - a_{1} x_{1} (t) + \sum_{j = 1}^{n} b_{1 j} (x_{1} (t)) f_{j} (x_{j} (t)) + \sum_{j = 1}^{n} c_{1 j} (x_{1} (t)) g_{j} (x_{j} (t - τ_{j} (t))) + U_{1} \\ {\dot{x}}_{2} (t) = - a_{2} x_{2} (t) + \sum_{j = 1}^{n} b_{2 j} (x_{2} (t)) f_{j} (x_{j} (t)) + \sum_{j = 1}^{n} c_{2 j} (x_{2} (t)) g_{j} (x_{j} (t - τ_{j} (t))) + U_{2} \\ ⋮ \\ {\dot{x}}_{n} (t) = - a_{n} x_{n} (t) + \sum_{j = 1}^{n} b_{n j} (x_{n} (t)) f_{j} (x_{j} (t)) + \sum_{j = 1}^{n} c_{n j} (x_{n} (t)) g_{j} (x_{j} (t - τ_{j} (t))) + U_{n} \\ y_{1} (t) = \sum_{j = 1}^{n} h_{1 j} x_{j} (t) \\ y_{2} (t) = \sum_{j = 1}^{n} h_{2 j} x_{j} (t) \\ ⋮ \\ y_{m} (t) = \sum_{j = 1}^{n} h_{n j} x_{j} (t) \end{matrix}, \end{matrix} \end{matrix}

(1)

where

x_{i} (t) (i = 1, \dots, n)

represents the state variable of the memristor neural networks, and n is the system dimension;

a_{i} (i = 1, \dots, n)

is the self-feedback coefficient, which satisfies

a_{i} > 0

;

f_{j} (x_{j} (t))

and

g_{j} (x_{j} (t - τ_{j} (t)))

(j = 1, \dots, n)

represent the activation functions of states

x_{j} (t)

and

x_{j} (t - τ_{j} (t)

respectively;

b_{i j} (x_{i} (t))

represents the memristive synaptic connection weight between states

x_{i} (t)

and

x_{j} (t)

, and

c_{i j} (x_{i} (t))

represents the memristive synaptic connection weight between states

x_{i} (t)

and

x_{j} (t - τ_{j} (t))

;

τ_{j} (j = 1, \dots, n)

denotes the time-varying delay which satisfies

0 ⩽ τ_{j} ⩽ τ_{m a x}

, and

τ_{m a x}

is the upper bound constant;

U_{i} (i = 1, \dots, n)

denotes the input of the system, and

y_{i} (t) (i = 1, \dots, m)

represents the measurement output of the system;

h_{i j} (i = 1, \dots, m; j = 1, \dots, n)

is the measurement constant from state

x_{j} (t)

to output

y_{i} (t)

, and m is the output dimension.

The system (1) can be represented in vector form,

\begin{matrix} \begin{matrix} \{\begin{matrix} \dot{x} (t) = - A x (t) + B f (x (t)) + C g (x (t - τ (t))) + U \\ y (t) = H x (t) \end{matrix}, \end{matrix} \end{matrix}

(2)

where

A = [\begin{matrix} a_{1} & 0 & \dots & 0 \\ 0 & a_{2} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & a_{n} \end{matrix}]

,

B = [\begin{matrix} b_{11} & b_{12} & \dots & b_{1 n} \\ b_{21} & b_{22} & \dots & b_{2 n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ b_{n 1} & b_{n 2} & \dots & b_{n n} \end{matrix}]

,

U = [\begin{matrix} U_{1} \\ U_{2} \\ ⋮ \\ U_{n} \end{matrix}]

,

f (x (t)) = [\begin{matrix} f_{1} (x_{1} (t)) \\ f_{2} (x_{2} (t)) \\ ⋮ \\ f_{n} (x_{n} (t)) \end{matrix}]

,

g (x (t - τ (t))) = [\begin{matrix} g_{1} (x_{1} (t - τ_{1} (t))) \\ g_{2} (x_{2} (t - τ_{2} (t))) \\ ⋮ \\ g_{n} (x_{n} (t - τ_{n} (t))) \end{matrix}]

,

y (t) = [\begin{matrix} y_{1} (t) \\ y_{2} (t) \\ ⋮ \\ y_{m} (t) \end{matrix}]

,

C = [\begin{matrix} c_{11} & c_{12} & \dots & c_{1 n} \\ c_{21} & c_{22} & \dots & c_{2 n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ c_{n 1} & c_{n 2} & \dots & c_{n n} \end{matrix}]

,

H = [\begin{matrix} h_{11} & h_{12} & \dots & h_{1 n} \\ h_{21} & h_{22} & \dots & h_{2 n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ h_{m 1} & c_{m 2} & \dots & h_{m n} \end{matrix}]

,

x (t) = [\begin{matrix} x_{1} (t) \\ x_{2} (t) \\ ⋮ \\ x_{n} (t) \end{matrix}]

.

As mentioned in the introduction, most studies involve model uncertainties that only include floating parameters. In the process of neural network hardware implementation as memristor neural networks, the memristive synaptic connection weights

b_{i j}

and

c_{i j}

will produce deviations [28]. The fluctuation of parameters

b_{i j}

and

c_{i j}

is regarded as model uncertainty. This is the starting point of much research on state estimation of memristor neural networks, such as [26,27,28,36,37,38]. Some studies regard time-varying delay

τ_{j} (t)

as model uncertainty and study state estimation of memristor neural networks based on it, for example [29]. It should be noted that model uncertainties in all the above studies do not include

f_{i} (x_{i} (t))

and

g_{i} (x_{i} (t - τ_{i}))

. Both

f_{i} (x_{i} (t))

and

g_{i} (x_{i} (t - τ_{i}))

must be known, and

b_{i j}

and

c_{i j}

float within the ideal range. If

f_{i} (x_{i} (t))

and

g_{i} (x_{i} (t - τ_{i}))

are unknown, and the ideal values of

b_{i j}

and

c_{i j}

are unknown, the model uncertainties include floating parameters

b_{i j}

and

c_{i j}

, time-varying delay

τ_{j} (t)

and unknown functions

f_{i} (x_{i} (t))

and

g_{i} (x_{i} (t - τ_{i}))

, and all the above studies are not applicable. The state estimation of memristor neural networks with model uncertainties including floating parameters, time-varying delays and unknown functions is the main concern in this paper.

Remark 1.

In other studies, model uncertainties only include floating parameters

b_{i j}

and

c_{i j}

or time-varying delay

τ_{j} (t)

. Functions

f_{i} (x_{i} (t))

and

g_{i} (x_{i} (t - τ_{i}))

must be known. In this paper, model uncertainties include floating parameters

b_{i j}

and

c_{i j}

, time-varying delay

τ_{j} (t)

and unknown functions

f_{i} (x_{i} (t))

and

g_{i} (x_{i} (t - τ_{i}))

.

As shown in system (2), the model uncertainties contain memory information

x_{i} (t - τ_{i})

. The LSTMs are the most suitable to deal with the model uncertainties. LSTMs are networks of basic LSTMs cells, and the architecture of a conventional LSTMs cell is illustrated in Figure 2. A memory cell, an input gate, an output gate and a forgetting gate make up a LSTMs cell. The forgetting gate, input gate, and output gate respectively determine whether historical information, input information, and output information are retained [39]. The specific computation is shown in Equation (3).

\begin{matrix} \begin{matrix} (\begin{matrix} f_{t} \\ i_{t} \\ o_{t} \\ g_{t} \end{matrix}) & = (\begin{matrix} σ (W_{f} [x_{t}, h_{t - 1}] + b_{f}) \\ σ (W_{i} [x_{t}, h_{t - 1}] + b_{i}) \\ σ (W_{o} [x_{t}, h_{t - 1}] + b_{o}) \\ t a n h (W_{g} [x_{t}, h_{t - 1}] + b_{g}) \end{matrix}) \\ c_{t} & = f_{t} \otimes c_{t - 1} \oplus i_{t} \otimes g_{t} \\ h_{t} & = o_{t} \otimes t a n h (c_{t}), \end{matrix} \end{matrix}

(3)

where

f_{t}

denotes the forgetting gate;

i_{t}

and

o_{t}

represent input gate and output gate, respectively;

g_{t}

is the updating vector of the LSTM cell;

h_{t}

is the hidden state vector;

h_{t - 1}

is the hidden state vector at step

t - 1

;

x_{t}

is the input vector of the LSTM cell;

c_{t}

is the state vector of the cell;

c_{t - 1}

is the state vector of the cell at step

t - 1

;

W

is the weight matrix and

b

refers to the bias vector;

σ

and

t a n h (\cdot)

are the sigmoid and tanh activation functions, respectively; ⊗ and ⊕ represent elementwise multiplication and addition, respectively.

Remark 2.

LSTMs cell is not completely suitable for estimating the states of memristor neural networks with model uncertainties. LSTMs cell needs to be improved to save computation and be more suitable for state estimation.

Moreover, in order to improve the LSTMs, design the state observer of memristor neural networks with model uncertainties, and derive the updating laws of the weights of the improved LSTMs, some assumptions and lemmas need to be introduced for the following proof.

Assumption 1.

The functions

f_{j} (\cdot)

and

g_{j} (\cdot)

satisfy local Lipschitz conditions. For all

k, p \in R

, have

|f_{j} (k) - f_{j} (p)| ⩽ K_{f} |k - p|

and

|g_{j} (k) - g_{j} (p)| ⩽ K_{g} |k - p|

, where

K_{f}

and

K_{g}

are Lipschitz constants, and satisfy

f_{j} (0) = g_{j} (0) = 0

.

f_{j} (\cdot)

and

g_{j} (\cdot)

are the activation functions of memristor neural networks, so Assumption 1 is generally tenable.

Lemma 1

([40]).

k (\cdot)

is a continuous function defined on a set Ω. Multilayer neural networks can be defined as,

\begin{matrix} \begin{matrix} \bar{k} = W^{T} S (VI), \end{matrix} \end{matrix}

where

W

and

V

are the second weight matrix and the first weight vector of the Multilayer neural networks, respectively;

I

is the input vector of Multilayer neural networks, and

S (\cdot)

is the activation function of Multilayer neural networks.

Then, for a given desired level of accuracy

ε > 0

, there exist the ideal weights

\bar{W}

and

\bar{V}

to satisfy the following inequality,

\begin{matrix} \begin{matrix} sup_{I \in Ω} ∥ k (\cdot) - \bar{k} ∥ ⩽ ε . \end{matrix} \end{matrix}

Lemma 2.

(Young’s inequality) For all

x, y \in R

, the following inequality holds,

\begin{matrix} \begin{matrix} x y ⩽ \frac{ε^{p}}{p} {∥ x ∥}^{p} + \frac{1}{q ε^{p}} {∥ y ∥}^{q}, \end{matrix} \end{matrix}

where

ε > 0

,

p > 1

,

q > 1

, and

(p - 1) (q - 1) = 1

.

3. Main Result

In this part, improved LSTMs, state observer design for memristor neural networks with model uncertainties, and adaptive updating laws of the weights of improved LSTMs will be discussed.

To begin with, the system (2) can be redefined as follows,

\begin{matrix} \begin{matrix} \{\begin{matrix} \dot{x} (t) = - A x (t) + K (x (t), x (t - τ (t)))) + U \\ y (t) = H x (t) \end{matrix}, \end{matrix} \end{matrix}

(4)

where

K (\cdot)

is a vector of functions, which can be defined as

[K_{1} (x (t), x (t - τ (t))))

,

K_{2} (x (t), x (t - τ (t)))), \dots, K_{n} (x (t), x (t - τ (t))) {)]}^{T}

.

As mentioned in Remark 1,

K (\cdot)

is the function vector of model uncertainties formed by floating parameters

b_{i j}

and

c_{i j}

, time-varying delay

τ_{j} (t)

and unknown functions

f_{i} (x_{i} (t))

and

g_{i} (x_{i} (t - τ_{i}))

. In order to approximate the unknown function vector

K (x (t), x (t - τ (t))))

, improved LSTMs are proposed, and an improved LSTMs cell is shown in Figure 3.

Comparing Figure 2 and Figure 3, it can be seen that the input gate

i_{t}

and the hidden state vector

h_{t - 1}

at step

t - 1

have been removed. Since

x (t)

is part of

K (\cdot)

in the form of a function vector, the input gate can be removed.

x (t)

should be part of LSTMs cell in the form of tanh function. The reason why

h_{t - 1}

is removed is that

K (\cdot)

contains

x (t - τ (t))

, so the functions of

h_{t - 1}

can be combined into

c_{t - 1}

to save computation. Remove the output gate

o_{t}

and use

h_{t}

as the output of LSTMs cell to simplify the structure of LSTMs cell. Therefore, the improved LSTMs cell is made up of the following parts: (1) The state vector

x (t)

of the system at time t and the state vector with weights

c_{t - 1}

of the system at time

t - 1

constitute the input of the improved LSTMs cell; (2)

c_{t}

is the vector that holds the state of the improved LSTMs cell at time t; (3)

h_{t}

is the output vector of the improved LSTMs cell at time t; (4)

σ (x (t))

is the forgetting function at time t, which is used to control whether the memory information stored by the improved LSTMs cell at time

t - 1

is added to the improved LSTMs cell calculation at time t. The specific computation of a simplified and improved LSTMs cell can be expressed as follows,

\begin{matrix} \begin{matrix} \begin{matrix} c_{t} = W_{t, i} x_{t} \oplus c_{t - 1} \otimes σ (x_{t}) \\ h_{t} = t a n h (W_{t, i} x_{t} \oplus c_{t - 1} \otimes σ (x_{t}) \oplus b_{t, i}), \end{matrix} \end{matrix} \end{matrix}

(5)

where

W_{t, i} = [W_{t, i, 1}, W_{t, i, 2}, \dots, W_{t, i, n}]

denotes the weight vector of the ith cell and

b_{t, i}

is a bias constant of the ith cell;

x_{t} = {[x_{1} (t), x_{2} (t), \dots, x_{n} (t)]}^{T}

represents the state vector at time t.

Based on the simplified and improved LSTMs cell, the improved LSTMs are illustrated in Figure 4. In Figure 4, each column represents a neural network composed of p improved LSTMs cells at time j. The outputs of the p improved LSTMs cells pass through the weight matrix

V_{j}

to obtain the output vector of the neural network at time j, which is used to approximate

K (\cdot)

. The neural network at each time can be connected through

c_{j}

and

c_{j - 1}

to form neural networks at all times.

x_{j}

represents the state vector at time j, and

j \in [1, t]

.

c_{j}^{i}

denotes the output of the hidden states of the ith

(i \in [1, p])

LSTMs cell at time j, and p is the number of LSTMs cells.

W_{j, i}

represents the weight vector of the ith LSTMs cell at time j.

b_{j, i}

is the bias of the ith LSTMs cell at time j.

h_{j}^{i}

denotes the output of the states of the ith LSTMs cell at time j.

V_{j, i, l}

represents the weight coefficient from the output of the ith LSTMs cell to the lth system output at time j, and

l \in [1, m]

.

y_{j, l}

denotes the lth system output at time j. The improved LSTMs can approximate any nonlinear function by the following theorem.

Theorem 1.

k (\cdot)

is a continuous nonlinear function defined on a set Ω. Improved LSTMs are shown in Figure 3.

\bar{k}

is an approximate function of

k (\cdot)

based on the improved LSTMs. Then, for a given desired level of accuracy

ε > 0

, there exist the ideal weights

{\bar{W}}_{j, i} (j \in [1, t], i \in [1, p])

and

{\bar{V}}_{j, i, l} (j \in [1, t], i \in [1, p], l \in [1, m])

to satisfy the following inequality,

\begin{matrix} \begin{matrix} sup_{x_{j} \in Ω} ∥ k (\cdot) - \bar{k} ∥ ⩽ ε . \end{matrix} \end{matrix}

(6)

The proof of Theorem 1 can be found in Appendix A.

Based on Theorem 1, the estimation system for the system (4) can be defined as the following formula,

\begin{matrix} \begin{matrix} \{\begin{matrix} \dot{\hat{x}} (t) = - A \hat{x} (t) + \bar{K} + L \cdot [y (t) - H \cdot \hat{x} (t)] + U \\ \hat{y} (t) = H \hat{x} (t) \end{matrix}, \end{matrix} \end{matrix}

(7)

where

L \in R^{n \times m}

denotes the observer gain matrix;

\bar{K} \in R^{n}

is an estimated function vector of

K (x (t), x (t - τ (t))))

based on the improved LSTMs, which satisfies Theorem 1.

\bar{K}

is given in Equation (8),

\begin{matrix} \begin{matrix} \bar{K} = {\bar{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\}, \end{matrix} \end{matrix}

(8)

where

{\bar{V}}_{t} \in R^{p \times n}

and

{\bar{W}}_{i} \in R^{p \times n}

denote the ideal weight matrices, and

{\bar{b}}_{t} \in R^{p}

is the ideal bias vector.

The function

σ (x_{j})

is determined by the time-varying delay

τ_{j} (t)

which satisfies

0 ⩽ τ_{j} (t) ⩽ τ_{m a x}

.

σ (x_{j})

is 1 in the range of

[t - τ_{m a x}, t]

, and

σ (x_{j})

is 0 in the rest of the range. This ensures that all the data in the interval

t - τ_{m a x}

to t will be included in the calculation. Considering the system (4) and the estimation system (7), the error system can be obtained as follows

\begin{matrix} \begin{matrix} e (t) & = \hat{x} (t) - x (t), \\ \dot{e} (t) & = \dot{\hat{x}} (t) - \dot{x} (t) \\ = [- A \hat{x} (t) + \bar{K} + L \cdot [y (t) - H \cdot \hat{x} (t)] + U] \\ - [- A x (t) + K (x (t), x (t - τ (t)))) + U] \\ = - (A + LH) e (t) + [\bar{K} - K (x (t), x (t - τ (t)))] . \end{matrix} \end{matrix}

(9)

Assumption 2.

For the unknown function

K_{i} (x (t), x (t - τ (t))))

and the estimated function

{\bar{K}}_{i}

(i = 1, 2, \dots, n)

, there exist Lipschitz constant vectors

K_{L 1}

and

K_{L 2}

, which satisfy the following inequality,

\begin{matrix} \begin{matrix} |K_{i} (x (t), x (t - τ (t)))) - {\bar{K}}_{i}| ⩽ K_{L 1}^{T} | x (t) - \hat{x} (t) | + K_{L 2}^{T} | x (t - τ (t)) - \hat{x} (t - τ (t)) | . \end{matrix} \end{matrix}

(10)

Considering Theorem 1,

{\bar{K}}_{i}

is an estimated function of the finite error of

K_{i} (x (t), x (t - τ (t)))

. Similarly,

{\bar{K}}_{i}

is a function of

\hat{x} (t)

and

\hat{x} (t - τ (t))

. On this basis, considering Assumption 1, Assumption 2 is tenable.

Theorem 2.

Suppose that Assumption 2 holds for the system (4) and the estimation system (7), if there exist symmetric positive definite matrices

P

,

Q

,

M

, a diagonal matrix

F

, a matrix

G \in R^{n \times p}

and a real constant

δ > 0

such that inequality (11) holds,

\begin{matrix} \begin{matrix} [\begin{matrix} Ω_{1} & P & M & M \\ P & - F & 0 & 0 \\ M & 0 & Ω_{2} & - 2 M \\ M & 0 & - 2 M & - 2 M \end{matrix}] < 0, \end{matrix} \end{matrix}

(11)

where

Ω_{1} = - A^{T} P - H^{T} G^{T} - P A - GH + Q + 2 K_{L 1} t r {F} K_{L 1}^{T}

, and

Ω_{2} = 2 K_{L 2} t r {F} K_{L 2}^{T} - (1 - δ) Q - 2 M

.

Then, the error system (9) is asymptotically stable with observer gain matrix calculated by

L = P^{- 1} G

. The proof of Theorem 2 can be found in Appendix B.

Based on Theorem 2, the observer gain matrix

L

can be obtained. Considering the function vector

\bar{K}

in system (7), the weight matrices

{\bar{W}}_{i}

and

{\bar{V}}_{i}

are ideal. In fact, the ideal weights are hard to select, and the estimated weights need to be adjusted by adaptive laws to be close to the ideal weights. With reference to the system (7), the estimated system can be redefined as follows

\begin{matrix} \begin{matrix} \{\begin{matrix} \dot{\hat{x}} (t) = - A \hat{x} (t) + \hat{K} + L \cdot [y (t) - H \cdot \hat{x} (t)] + U \\ \hat{y} (t) = H \hat{x} (t) \end{matrix}, \end{matrix} \end{matrix}

(12)

where

\hat{K}

is an estimated function vector of

\bar{K}

.

\hat{K}

is given in Equation (13),

\begin{matrix} \begin{matrix} \hat{K} = {\hat{V}}_{t}^{T} \cdot t a n h \{{\hat{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\hat{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\hat{b}}_{t}\}, \end{matrix} \end{matrix}

(13)

where

{\hat{V}}_{t}

and

{\hat{W}}_{i}

are estimated weight matrices;

{\hat{b}}_{t}

is a estimated bias vector.

With reference to the error system (9), the error system can be obtained as follows by using the Equation (6)

\begin{matrix} \begin{matrix} \dot{e} (t) = - (A + LH) e (t) + (\hat{K} - \bar{K} - ϵ_{1}), \end{matrix} \end{matrix}

(14)

where

ϵ_{1}

is an error vector.

For error weight matrices

{\tilde{V}}_{t}

and

{\tilde{W}}_{i}

and a error weight vector

{\tilde{b}}_{t}

, we have

\begin{matrix} \begin{matrix} {\tilde{V}}_{t} & = {\hat{V}}_{t} - {\bar{V}}_{t} \\ {\tilde{W}}_{i} & = {\hat{W}}_{i} - {\bar{W}}_{i} \\ {\tilde{b}}_{t} & = {\hat{b}}_{t} - {\bar{b}}_{t} . \end{matrix} \end{matrix}

(15)

Theorem 3.

For the error system (14), the design parameters

N_{w_i} \in R^{p}

,

N_{v_t} \in R^{p}

and

N_{b_t} \in R^{p \times n}

satisfy following inequality

\begin{matrix} \begin{matrix} [\begin{matrix} Ω_{3} & - P & \frac{1}{2} Ω_{w v b}^{T} \\ - P & 0 & 0 \\ \frac{1}{2} Ω_{w v b} & 0 & 0 \end{matrix}] ⩽ 0, \end{matrix} \end{matrix}

where

Ω_{3} = - {(A + LH)}^{T} P - P (A + LH)

,

Ω_{w v b} = [\begin{matrix} Ω_{w_t} \\ Ω_{w_t - 1} \cdot σ (x_{t}) \\ ⋮ \\ Ω_{w_1} \cdot \prod_{j = 2}^{t} σ (x_{j}) \\ Ω_{v_t} \\ N_{b_t} \end{matrix}]

,

Ω_{w_i} = [\begin{matrix} N_{w_i} & 0 & \dots & 0 \\ 0 & N_{w_i} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & N_{w_i} \end{matrix}]

,

Ω_{v_t} = ([\begin{matrix} N_{v_t} & 0 & \dots & 0 \\ 0 & N_{v_t} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & N_{v_t} \end{matrix}] - 2 [\begin{matrix} ϵ_{2} \cdot P_{1 •} \\ ϵ_{2} \cdot P_{2 •} \\ ⋮ \\ ϵ_{2} \cdot P_{n •} \end{matrix}])

.

The adaptive updating laws of the weights can be given as follows

\begin{matrix} \begin{matrix} {\dot{\hat{W}}}_{i} & = N_{w_i} \cdot e^{T} (t) - 4 {\hat{V}}_{t} P e (t) x_{i}^{T} (i = 1, 2, \dots, t) \\ {\dot{\hat{V}}}_{t} & = N_{v_t} \cdot e^{T} (t) - 2 {\hat{S}}_{t} e^{T} (t) P \\ {\dot{\hat{b}}}_{t} & = N_{b_t} \cdot e (t) - 4 {\hat{V}}_{t} P e (t), \end{matrix} \end{matrix}

(16)

then the error system (14) is asymptotically stable. The proof of Theorem 3 can be found in Appendix C.

Considering (16), the adaptive updating laws of the weights are determined by

e (t)

. Hence, it requires that

e (t)

is a n-dimensional vector. According to (12), we have

\begin{matrix} \begin{matrix} \hat{y} (t) - y (t) = H \cdot (\hat{x} (t) - x (t)) = H \cdot e (t) . \end{matrix} \end{matrix}

(17)

If there exists

H^{- 1}

, the

e (t)

can be obtained as follows by using (17)

\begin{matrix} \begin{matrix} e (t) = H^{- 1} \cdot (\hat{y} (t) - y (t)) . \end{matrix} \end{matrix}

(18)

In general, m is not equal to n, and

H^{- 1}

does not exist. Hence, (18) does not hold. To solve this problem, the following assumption is given.

Assumption 3.

\hat{y} (t)

and

y (t)

are continuously differentiable functions and the first derivative of

\hat{y} (t)

and

y (t)

are bounded and measurable.

Theorem 4.

Based on Assumption 3, if the given matrix

G

is left invertible, the

e (t)

can be obtained as follows

\begin{matrix} \begin{matrix} e (t) = G^{- 1} \cdot Y, \end{matrix} \end{matrix}

(19)

where

G = [\begin{matrix} H \\ - H (A + LH) \end{matrix}]

and

Y = [\begin{matrix} \hat{y} (t) - y (t) \\ \dot{\hat{y}} (t) - \dot{y} (t) \end{matrix}]

. The proof of Theorem 4 can be found in Appendix D.

Remark 3.

Based on Theorem 1, the estimated system (12) can be given. By using Theorem 2, the observer gain matrix

L

can be obtained. By using Theorems 3 and 4, the adaptive updating laws of the weights can be obtained.

4. Simulation Analysis

In this section, two numerical cases are presented to verify the rationality of the above results.

4.1. Examples

Example 1. 2-dimensional memristor neural networks are considered, and the parameters of the system (2) are given as follows,

\begin{matrix} \begin{matrix} A = [\begin{matrix} 2.3 & 0 \\ 0 & 2 \end{matrix}], B = [\begin{matrix} 0.31 & 0.38 \\ 0.49 & 0.32 \end{matrix}], C = [\begin{matrix} 0.32 & 0.19 \\ 0.39 & 0.25 \end{matrix}], U = [\begin{matrix} 0.2 \\ 0.3 \end{matrix}], \\ H = [\begin{matrix} 1 & 0.5 \end{matrix}], f (x (t)) = [\begin{matrix} \frac{| x_{1} + 1 | - | x_{1} - 1 |}{2} \\ \frac{| x_{2} + 1 | - | x_{2} - 1 |}{2} \end{matrix}], g (x (t - τ (t))) = [\begin{matrix} x_{1} (t - τ_{1} (t)) \\ x_{2} (t - τ_{2} (t)) \end{matrix}], \\ τ_{1} (t) = τ_{2} (t) = \frac{0.05 t}{1 + t}, x (0) = [\begin{matrix} 1 \\ 1 \end{matrix}] . \end{matrix} \end{matrix}

Based on the system (2), the estimated system (17) can be designed as follows,

\begin{matrix} \begin{matrix} \hat{x} (0) = [\begin{matrix} 0.2 \\ 0.3 \end{matrix}], K_{L 1} = K_{L 2} = [\begin{matrix} 1 \\ 1 \end{matrix}], δ = 0.1 . \end{matrix} \end{matrix}

By using Theorem 2 and LMIs tools, the parameters of the estimated system (17) can be obtained,

\begin{matrix} \begin{matrix} P = [\begin{matrix} 0.03553 & - 0.00345 \\ - 0.00345 & 0.03629 \end{matrix}], Q = [\begin{matrix} 0.03545 & - 0.01504 \\ - 0.01504 & 0.04070 \end{matrix}], F = [\begin{matrix} 0.00363 & 0 \\ 0 & 0.00379 \end{matrix}], \\ M = [\begin{matrix} 0.00494 & - 68.44222 \\ 68.44222 & 0.00505 \end{matrix}], L = [\begin{matrix} - 0.88667 \\ - 0.75623 \end{matrix}] . \end{matrix} \end{matrix}

Set sampling time

T = 30 s

and sampling period

▵ T = 0.001 s

. Considering (18), set

x_{i} = {\hat{x}}_{i \cdot ▵ T} (i = 1, 2, \dots, t / ▵ T)

and

σ (x_{i}) = 0 (i < t / ▵ T - 30)

. Based on Theorem 3 and Theorem 4, set

N_{w_i}

and

N_{v_t}

and

N_{b_t}

are negative unit vectors and matrix.

The state trajectories of the state

x (t)

and the state observer

\hat{x} (t)

are drawn in Figure 5. Figure 6 is drawn for the estimated error between the state

x (t)

and the state observer

\hat{x} (t)

. In Figure 7, the trajectories of the derivative of the state

\dot{x} (t)

and the derivative of the state observer

\dot{\hat{x}} (t)

are depicted. The trajectories of the error between

\dot{x} (t)

and

\dot{\hat{x}} (t)

are given in Figure 8. In Figure 9, the output curve

y (t)

and the estimated output curve

\hat{y} (t)

are given. Figure 10 shows the estimated error curve between

y (t)

and

\hat{y} (t)

.

In order to verify the accuracy of the estimated structure, a test system is designed based on the gain observation matrix

L

. Under the same simulation conditions as above, the effects of the adjusted weights and the random weights on the system are compared. In Figure 11, the state trajectories of the real system and the estimated system with the adjusted weights and the system with the random weights are given. Figure 12 shows the real output curve and the estimated output with the adjusted weights and the output curve with the random weights.

Example 2. 3-dimensional memristor neural networks are considered, and the parameters of the system (2) are given as follows,

\begin{matrix} \begin{matrix} A = [\begin{matrix} 2.5 & 0 & 0 \\ 0 & 2.5 & 0 \\ 0 & 0 & 3.3 \end{matrix}], B = [\begin{matrix} 0.2 & 0.15 & 0.35 \\ 0.15 & 0.35 & 0.08 \\ 0.1 & 0.15 & 0.3 \end{matrix}], C = [\begin{matrix} 0.125 & 0.15 & 0.136 \\ 0.35 & 0.18 & 0.3 \\ 0.2 & 0.04 & 0.05 \end{matrix}], \\ H = [\begin{matrix} 1 & 1 & 1 \\ 1 & 0 & 1 \end{matrix}], f (x (t)) = [\begin{matrix} \frac{| x_{1} + 1 | - | x_{1} - 1 |}{2} \\ \frac{| x_{2} + 1 | - | x_{2} - 1 |}{2} \\ \frac{| x_{3} + 1 | - | x_{3} - 1 |}{2} \end{matrix}], g (x (t - τ (t))) = [\begin{matrix} x_{1} (t - τ_{1} (t)) \\ x_{2} (t - τ_{2} (t)) \\ x_{3} (t - τ_{3} (t)) \end{matrix}], \\ U = [\begin{matrix} 0.2 \\ 0.3 \\ 0.1 \end{matrix}], x (0) = [\begin{matrix} 1 \\ 1 \\ 1 \end{matrix}], τ_{1} (t) = τ_{2} (t) = τ_{3} (t) = \frac{0.05 t}{1 + t} . \end{matrix} \end{matrix}

Based on the system (2), the estimated system (17) can be designed as follows,

\begin{matrix} \begin{matrix} \hat{x} (0) = [\begin{matrix} 0.4 \\ 0.5 \\ 0.6 \end{matrix}], K_{L 1} = K_{L 2} = [\begin{matrix} 1 \\ 1 \\ 1 \end{matrix}], δ = 0.1 . \end{matrix} \end{matrix}

By using Theorem 2 and LMIs tools, the parameters of the estimated system (17) can be obtained,

\begin{matrix} \begin{matrix} P = [\begin{matrix} 0.51349 & 0.03364 & 0.10199 \\ 0.03364 & 0.71344 & - 0.02472 \\ 0.10199 & - 0.02472 & 0.39999 \end{matrix}], Q = [\begin{matrix} - 0.30425 & - 0.32097 & - 0.34837 \\ - 0.32097 & - 0.51236 & - 0.19432 \\ - 0.34837 & - 0.19432 & 0.00547 \end{matrix}], \\ F = [\begin{matrix} 0.10879 & 0 & 0 \\ 0 & 0.17792 & 0 \\ 0 & 0 & 0.20825 \end{matrix}], M = [\begin{matrix} 0.05344 & - 47.67758 & - 16.83918 \\ 47.67685 & 0.04188 & - 39.81878 \\ 16.83445 & 39.83217 & 0.07044 \end{matrix}], \\ L = [\begin{matrix} - 0.29468 & - 0.06180 \\ - 0.99352 & 0.98883 \\ 0.32407 & - 0.98883 \end{matrix}] . \end{matrix} \end{matrix}

Set sampling time

T = 30 s

and sampling period

▵ T = 0.001 s

. Considering (18), set

x_{i} = {\hat{x}}_{i \cdot ▵ T} (i = 1, 2, \dots, t / ▵ T)

and

σ (x_{i}) = 0 (i < t / ▵ T - 30)

. Based on Theorems 3 and 4, set

N_{w_i}

and

N_{v_t}

and

N_{b_t}

are negative unit vectors and matrix.

The state trajectories of the state

x (t)

and the state observer

\hat{x} (t)

are drawn in Figure 13. Figure 14 is drawn for the estimated error between the state

x (t)

and the state observer

\hat{x} (t)

. In Figure 15, the trajectories of the derivative of the state

\dot{x} (t)

and the derivative of the state observer

\dot{\hat{x}} (t)

are depicted. The trajectories of the error between

\dot{x} (t)

and

\dot{\hat{x}} (t)

are given in Figure 16. In Figure 17, the output curve

y (t)

and the estimated output curve

\hat{y} (t)

are given. Figure 18 shows the estimated error curve between

y (t)

and

\hat{y} (t)

.

In order to verify the accuracy of the estimated structure, a test system is designed based on the gain observation matrix

L

. Under the same simulation conditions as above, the effects of the adjusted weights and the random weights on the system are compared. In Figure 19, the state trajectories of the real system and the estimated system with the adjusted weights and the system with the random weights are given. Figure 20 shows the real output curve and the estimated output with the adjusted weights and the output curve with the random weights.

4.2. Description of Simulation Results

Figure 5 and Figure 13 show that the estimated state vector is a good approximation of the real state vector. On the other hand, Figure 6 and Figure 14 verify that the estimation error vector of states is going to zero. Figure 7 and Figure 15 show that the derivative vector of estimated states is a good approximation of the derivative vector of real states. On the other hand, Figure 8 and Figure 16 verify that the estimation error vector of derivative of states is going to zero. Figure 9 and Figure 17 show that the estimated output vector is a good approximation of the real output vector. On the other hand, Figure 10 and Figure 18 verify that the estimation error vector of outputs is going to zero. Figure 11 and Figure 19 show that the estimated state vector with adaptive weights is better than that with random weights. Figure 12 and Figure 20 show that the output vector with adaptive weights is better than that with random weights. Simulation results indicate that the state observer proposed in this paper has stronger adaptability and more accurate estimation results for memristor neural networks with model uncertainties.

5. Conclusions

The state estimation of memristor neural networks with model uncertainties is discussed in this paper. In particular, model uncertainties include time-varying delays, floating parameters and unknown functions. An improved approach based on LSTMs is proposed to deal with model uncertainties. This paper proves that the improved neural networks can approximate any nonlinear function with any error. On this basis, a full-order state observer is proposed to achieve the reconstruction of states based on the measurement output of the system. The adaptive updating laws of the weights of improved neural networks are proposed based on a LKF. By using LKF and LMIs tools, this paper obtains the asymptotic stability conditions for the error systems. The simulation results show that by using the full-order state observer and the adaptive updating laws of the weights, an accurate estimate of the solution can be obtained. The test results show that the model uncertainties can be approximated accurately. As mentioned in the introduction, the improved LSTMs designed in this paper can also be realized by crossbar array of memristor, which will be our next work.

Author Contributions

Conceptualization, methodology, software, validation, formal analysis, investigation, resources, data curation, writing—original draft preparation, writing—review and editing, L.M.; visualization, supervision, M.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated during and analysed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declared no potential conflict of interest with respect to the research, authorship, and/or publication of this article.

Appendix A. The Proof of Theorem 1

Proof.

Considering the Equation (5), the improved neural networks are derived by using the recursive method.

Take

j = 1

, we have

\begin{matrix} \begin{matrix} c_{1}^{1} = {\bar{W}}_{1, 1} x_{1}, h_{1}^{1} = t a n h ({\bar{W}}_{1, 1} x_{1} + {\bar{b}}_{1, 1}), \\ c_{1}^{2} = {\bar{W}}_{1, 2} x_{1}, h_{1}^{2} = t a n h ({\bar{W}}_{1, 2} x_{1} + {\bar{b}}_{1, 2}), \\ ⋮ \\ c_{1}^{p} = {\bar{W}}_{1, p} x_{1}, h_{1}^{p} = t a n h ({\bar{W}}_{1, p} x_{1} + {\bar{b}}_{1, p}) . \end{matrix} \end{matrix}

Then,

\begin{matrix} \begin{matrix} c_{1} = {\bar{W}}_{1} x_{1}, h_{1} = t a n h ({\bar{W}}_{1} x_{1} + {\bar{b}}_{1}), \end{matrix} \end{matrix}

where

c_{1} = [\begin{matrix} c_{1}^{1} \\ c_{1}^{2} \\ ⋮ \\ c_{1}^{p} \end{matrix}]

,

{\bar{W}}_{1} = [\begin{matrix} {\bar{W}}_{1, 1} \\ {\bar{W}}_{1, 2} \\ ⋮ \\ {\bar{W}}_{1, p} \end{matrix}] = [\begin{matrix} {\bar{W}}_{1, 1, 1} & {\bar{W}}_{1, 1, 2} & \dots & {\bar{W}}_{1, 1, n} \\ {\bar{W}}_{1, 2, 1} & {\bar{W}}_{1, 2, 2} & \dots & {\bar{W}}_{1, 2, n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\bar{W}}_{1, p, 1} & {\bar{W}}_{1, p, 2} & \dots & {\bar{W}}_{1, p, n} \end{matrix}]

,

h_{1} = [\begin{matrix} h_{1}^{1} \\ h_{1}^{2} \\ ⋮ \\ h_{1}^{p} \end{matrix}]

,

{\bar{b}}_{1} = [\begin{matrix} {\bar{b}}_{1, 1} \\ {\bar{b}}_{1, 2} \\ ⋮ \\ {\bar{b}}_{1, p} \end{matrix}]

.

And we have

\begin{matrix} \begin{matrix} y_{1, 1} = {\bar{V}}_{1, 1, 1} \cdot t a n h ({\bar{W}}_{1, 1} x_{1} + {\bar{b}}_{1, 1}) + {\bar{V}}_{1, 2, 1} \cdot t a n h ({\bar{W}}_{1, 2} x_{1} + {\bar{b}}_{1, 2}) + \dots \\ + {\bar{V}}_{1, p, 1} \cdot t a n h ({\bar{W}}_{1, p} x_{1} + {\bar{b}}_{1, p}), \\ y_{1, 2} = {\bar{V}}_{1, 1, 2} \cdot t a n h ({\bar{W}}_{1, 1} x_{1} + {\bar{b}}_{1, 1}) + {\bar{V}}_{1, 2, 2} \cdot t a n h ({\bar{W}}_{1, 2} x_{1} + {\bar{b}}_{1, 2}) + \dots \\ + {\bar{V}}_{1, p, 2} \cdot t a n h ({\bar{W}}_{1, p} x_{1} + {\bar{b}}_{1, p}), \\ ⋮ \\ y_{1, m} = {\bar{V}}_{1, 1, m} \cdot t a n h ({\bar{W}}_{1, 1} x_{1} + {\bar{b}}_{1, 1}) + {\bar{V}}_{1, 2, m} \cdot t a n h ({\bar{W}}_{1, 2} x_{1} + {\bar{b}}_{1, 2}) + \dots \\ + {\bar{V}}_{1, p, m} \cdot t a n h ({\bar{W}}_{1, p} x_{1} + {\bar{b}}_{1, p}) . \end{matrix} \end{matrix}

Then,

\begin{matrix} \begin{matrix} y_{1} = {\bar{V}}_{1}^{T} \cdot t a n h ({\bar{W}}_{1} x_{1} + {\bar{b}}_{1}), \end{matrix} \end{matrix}

where

y_{1} = [\begin{matrix} y_{1, 1} \\ y_{1, 2} \\ ⋮ \\ y_{1, m} \end{matrix}]

,

{\bar{V}}_{1} = [\begin{matrix} {\bar{V}}_{1, 1, 1} & {\bar{V}}_{1, 1, 2} & \dots & {\bar{V}}_{1, 1, m} \\ {\bar{V}}_{1, 2, 1} & {\bar{V}}_{1, 2, 2} & \dots & {\bar{V}}_{1, 2, m} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\bar{V}}_{1, p, 1} & {\bar{V}}_{1, p, 2} & \dots & {\bar{V}}_{1, p, m} \end{matrix}]

.

Take

j = 2

, we have

\begin{matrix} \begin{matrix} c_{2}^{1} = {\bar{W}}_{2, 1} x_{2} + c_{1}^{1} \cdot σ (x_{2}) = {\bar{W}}_{2, 1} x_{2} + {\bar{W}}_{1, 1} x_{1} \cdot σ (x_{2}), \\ h_{2}^{1} = t a n h ({\bar{W}}_{2, 1} x_{2} + {\bar{W}}_{1, 1} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 1}), \\ c_{2}^{2} = {\bar{W}}_{2, 2} x_{2} + c_{1}^{2} \cdot σ (x_{2}) = {\bar{W}}_{2, 2} x_{2} + {\bar{W}}_{1, 2} x_{1} \cdot σ (x_{2}), \\ h_{2}^{2} = t a n h ({\bar{W}}_{2, 2} x_{2} + {\bar{W}}_{1, 2} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 2}), \\ ⋮ \\ c_{2}^{p} = {\bar{W}}_{2, p} x_{2} + c_{1}^{p} \cdot σ (x_{2}) = {\bar{W}}_{2, p} x_{2} + {\bar{W}}_{1, p} x_{1} \cdot σ (x_{2}), \\ h_{2}^{p} = t a n h ({\bar{W}}_{2, p} x_{2} + {\bar{W}}_{1, p} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, p}) . \end{matrix} \end{matrix}

Then,

\begin{matrix} \begin{matrix} c_{2} = {\bar{W}}_{2} x_{2} + {\bar{W}}_{1} x_{1} \cdot σ (x_{2}), h_{2} = t a n h ({\bar{W}}_{2} x_{2} + {\bar{W}}_{1} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2}), \end{matrix} \end{matrix}

where

c_{2} = [\begin{matrix} c_{2}^{1} \\ c_{2}^{2} \\ ⋮ \\ c_{2}^{p} \end{matrix}]

,

{\bar{W}}_{2} = [\begin{matrix} {\bar{W}}_{2, 1} \\ {\bar{W}}_{2, 2} \\ ⋮ \\ {\bar{W}}_{2, p} \end{matrix}] = [\begin{matrix} {\bar{W}}_{2, 1, 1} & {\bar{W}}_{2, 1, 2} & \dots & {\bar{W}}_{2, 1, n} \\ {\bar{W}}_{2, 2, 1} & {\bar{W}}_{2, 2, 2} & \dots & {\bar{W}}_{2, 2, n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\bar{W}}_{2, p, 1} & {\bar{W}}_{2, p, 2} & \dots & {\bar{W}}_{2, p, n} \end{matrix}]

,

h_{2} = [\begin{matrix} h_{2}^{1} \\ h_{2}^{2} \\ ⋮ \\ h_{2}^{p} \end{matrix}]

,

{\bar{b}}_{2} = [\begin{matrix} {\bar{b}}_{2, 1} \\ {\bar{b}}_{2, 2} \\ ⋮ \\ {\bar{b}}_{2, p} \end{matrix}]

.

And we have

\begin{matrix} \begin{matrix} y_{2, 1} = {\bar{V}}_{2, 1, 1} \cdot t a n h ({\bar{W}}_{2, 1} x_{2} + {\bar{W}}_{1, 1} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 1}) + {\bar{V}}_{2, 2, 1} \cdot t a n h ({\bar{W}}_{2, 2} x_{2} \\ + {\bar{W}}_{1, 2} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 2}) + \dots + {\bar{V}}_{2, p, 1} \cdot t a n h ({\bar{W}}_{2, p} x_{2} + {\bar{W}}_{1, p} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, p}), \\ y_{2, 2} = {\bar{V}}_{2, 1, 2} \cdot t a n h ({\bar{W}}_{2, 1} x_{2} + {\bar{W}}_{1, 1} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 1}) + {\bar{V}}_{2, 2, 2} \cdot t a n h ({\bar{W}}_{2, 2} x_{2} \\ + {\bar{W}}_{1, 2} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 2}) + \dots + {\bar{V}}_{2, p, 2} \cdot t a n h ({\bar{W}}_{2, p} x_{2} + {\bar{W}}_{1, p} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, p}), \\ ⋮ \\ y_{2, m} = {\bar{V}}_{2, 1, m} \cdot t a n h ({\bar{W}}_{2, 1} x_{2} + {\bar{W}}_{1, 1} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 1}) + {\bar{V}}_{2, 2, m} \cdot t a n h ({\bar{W}}_{2, 2} x_{2} \\ + {\bar{W}}_{1, 2} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, 2}) + \dots + {\bar{V}}_{2, p, m} \cdot \cdot t a n h ({\bar{W}}_{2, p} x_{2} + {\bar{W}}_{1, p} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2, p}) . \end{matrix} \end{matrix}

Then,

\begin{matrix} \begin{matrix} y_{2} = {\bar{V}}_{2}^{T} \cdot t a n h ({\bar{W}}_{2} x_{2} + {\bar{W}}_{1} x_{1} \cdot σ (x_{2}) + {\bar{b}}_{2}), \end{matrix} \end{matrix}

where

y_{2} = [\begin{matrix} y_{2, 1} \\ y_{2, 2} \\ ⋮ \\ y_{2, m} \end{matrix}]

,

{\bar{V}}_{2} = [\begin{matrix} {\bar{V}}_{2, 1, 1} & {\bar{V}}_{2, 1, 2} & \dots & {\bar{V}}_{2, 1, m} \\ {\bar{V}}_{2, 2, 1} & {\bar{V}}_{2, 2, 2} & \dots & {\bar{V}}_{2, 2, m} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\bar{V}}_{2, p, 1} & {\bar{V}}_{2, p, 2} & \dots & {\bar{V}}_{2, p, m} \end{matrix}]

.

Take

j = t

, we have

\begin{matrix} \begin{matrix} c_{t}^{1} = {\bar{W}}_{t, 1} x_{t} + c_{t - 1}^{1} \cdot σ (x_{t}) \\ = {\bar{W}}_{t, 1} x_{t} + [{\bar{W}}_{t - 1, 1} x_{t - 1} + c_{t - 2}^{1} \cdot σ (x_{t - 1})] \cdot σ (x_{t}) \\ = {\bar{W}}_{t, 1} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 1} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})], \\ h_{t}^{1} = t a n h \{{\bar{W}}_{t, 1} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 1} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 1}\}, \\ c_{t}^{2} = {\bar{W}}_{t, 2} x_{t} + c_{t - 1}^{2} \cdot σ (x_{t}) \\ = {\bar{W}}_{t, 2} x_{t} + [{\bar{W}}_{t - 1, 2} x_{t - 1} + c_{t - 2}^{2} \cdot σ (x_{t - 1})] \cdot σ (x_{t}) \\ = {\bar{W}}_{t, 2} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 2} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})], \\ h_{t}^{2} = t a n h \{{\bar{W}}_{t, 2} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 2} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 2}\}, \end{matrix} \end{matrix}

\begin{matrix} \begin{matrix} ⋮ \\ c_{t}^{p} = {\bar{W}}_{t, p} x_{t} + c_{t - 1}^{p} \cdot σ (x_{t}) \\ = {\bar{W}}_{t, p} x_{t} + [{\bar{W}}_{t - 1, p} x_{t - 1} + c_{t - 2}^{p} \cdot σ (x_{t - 1})] \cdot σ (x_{t}) \\ = {\bar{W}}_{t, p} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, p} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})], \\ h_{t}^{p} = t a n h \{{\bar{W}}_{t, p} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, p} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, p}\} . \end{matrix} \end{matrix}

Then,

\begin{matrix} \begin{matrix} c_{t} = {\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})], \\ h_{t} = t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\}, \end{matrix} \end{matrix}

where

c_{t} = [\begin{matrix} c_{t}^{1} \\ c_{t}^{2} \\ ⋮ \\ c_{t}^{p} \end{matrix}]

,

{\bar{W}}_{t} = [\begin{matrix} {\bar{W}}_{t, 1} \\ {\bar{W}}_{t, 2} \\ ⋮ \\ {\bar{W}}_{t, p} \end{matrix}] = [\begin{matrix} {\bar{W}}_{t, 1, 1} & {\bar{W}}_{t, 1, 2} & \dots & {\bar{W}}_{t, 1, n} \\ {\bar{W}}_{t, 2, 1} & {\bar{W}}_{t, 2, 2} & \dots & {\bar{W}}_{t, 2, n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\bar{W}}_{t, p, 1} & {\bar{W}}_{t, p, 2} & \dots & {\bar{W}}_{t, p, n} \end{matrix}]

,

h_{t} = [\begin{matrix} h_{t}^{1} \\ h_{t}^{2} \\ ⋮ \\ h_{t}^{p} \end{matrix}]

,

{\bar{b}}_{t} = [\begin{matrix} {\bar{b}}_{t, 1} \\ {\bar{b}}_{t, 2} \\ ⋮ \\ {\bar{b}}_{t, p} \end{matrix}]

.

And we have

\begin{matrix} \begin{matrix} y_{t, 1} = {\bar{V}}_{t, 1, 1} \cdot t a n h \{{\bar{W}}_{t, 1} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 1} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 1}\} \\ + {\bar{V}}_{t, 2, 1} \cdot t a n h \{{\bar{W}}_{t, 2} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 2} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 2}\} + \dots \\ + {\bar{V}}_{t, p, 1} \cdot t a n h \{{\bar{W}}_{t, p} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, p} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, p}\}, \\ y_{t, 2} = {\bar{V}}_{t, 1, 2} \cdot t a n h \{{\bar{W}}_{t, 1} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 1} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 1}\} \\ + {\bar{V}}_{t, 2, 2} \cdot t a n h \{{\bar{W}}_{t, 2} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 2} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 2}\} + \dots \\ + {\bar{V}}_{t, p, 2} \cdot t a n h \{{\bar{W}}_{t, p} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, p} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, p}\}, \\ ⋮ \\ y_{t, m} = {\bar{V}}_{t, 1, m} \cdot t a n h \{{\bar{W}}_{t, 1} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 1} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 1}\} \\ + {\bar{V}}_{t, 2, m} \cdot t a n h \{{\bar{W}}_{t, 2} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, 2} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, 2}\} + \dots \\ + {\bar{V}}_{t, p, m} \cdot t a n h \{{\bar{W}}_{t, p} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i, p} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t, p}\} . \end{matrix} \end{matrix}

Then,

\begin{matrix} \begin{matrix} y_{t} = {\bar{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\}, \end{matrix} \end{matrix}

where

y_{t} = [\begin{matrix} y_{t, 1} \\ y_{t, 2} \\ ⋮ \\ y_{t, m} \end{matrix}]

,

{\bar{V}}_{t} = [\begin{matrix} {\bar{V}}_{t, 1, 1} & {\bar{V}}_{t, 1, 2} & \dots & {\bar{V}}_{t, 1, m} \\ {\bar{V}}_{t, 2, 1} & {\bar{V}}_{t, 2, 2} & \dots & {\bar{V}}_{t, 2, m} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\bar{V}}_{t, p, 1} & {\bar{V}}_{t, p, 2} & \dots & {\bar{V}}_{t, p, m} \end{matrix}]

.

Take

σ (x_{j}) = 1, (j = 1, 2, \dots, t)

, and

y_{t}

can be reconstituted as follows,

\begin{matrix} \begin{matrix} y_{t} = {\bar{V}}_{t}^{T} \cdot t a n h \{\bar{W} x + {\bar{b}}_{t}\}, \end{matrix} \end{matrix}

where

\bar{W} = [{\bar{W}}_{t}, {\bar{W}}_{t - 1}, \dots, {\bar{W}}_{1}]

,

x = {[x_{t}, x_{t - 1}, \dots, x_{1}]}^{T}

.

It can be found that the formula is the basic form of multiple neural networks. Therefore, the improved LSTMs can be converted into multiple neural networks. Considering Lemma 1, multiple neural networks can approximate any nonlinear function with any error. Considering the improved LSTMs, since it can be converted into the multiple neural networks, the lemma satisfied by the multiple neural networks can also be satisfied by the improved LSTMs. Therefore, Theorem 1 can be obtained according to the form of Lemma 1. □

Appendix B. The Proof of Theorem 2

Proof.

Take a Lyapunov-Krasovskii function,

\begin{matrix} \begin{matrix} V (t, e (t)) = e^{T} (t) P e (t) + \int_{t - τ (t)}^{t} e^{T} (s) Q e (s), \end{matrix} \end{matrix}

where

P

and

Q

are positive definite matrices.

Based on the error system (9), the derivative of

V (t, e (t))

can be obtained as follows,

\begin{matrix} \dot{V} (t, e (t)) = {\dot{e}}^{T} (t) P e (t) + e^{T} (t) P \dot{e} (t) + e^{T} (t) Q e (t) - e^{T} (t - τ (t)) Q e (t - τ (t)) \\ = {[- (A + LH) e (t) + (\bar{K} - K (x (t), x (t - τ (t))))]}^{T} P e (t) \\ + e^{T} (t) P [- (A + LH) e (t) + (\bar{K} - K (x (t), x (t - τ (t))))] \\ + e^{T} (t) Q e (t) - e^{T} (t - τ (t)) Q e (t - τ (t)) \\ = - e^{T} (t) {(A + LH)}^{T} P e (t) - e^{T} (t) P (A + LH) e (t) + e^{T} (t) Q e (t) \\ + e^{T} (t) P (\bar{K} - K (x (t), x (t - τ (t)))) - e^{T} (t - τ (t)) Q e (t - τ (t)) \\ + {(\bar{K} - K (x (t), x (t - τ (t))))}^{T} P e (t) . \end{matrix}

By using Assumption 2 and Lemma 2, for a positive definite diagonal matrix

F

, we have

\begin{matrix} \begin{matrix} {[\bar{K} - K (x (t), x (t - τ (t)))]}^{T} F [\bar{K} - K (x (t), x (t - τ (t)))] \\ = \sum_{i = 1}^{n} \{F_{i} {[{\bar{K}}_{i} - K_{i} (x (t), x (t - τ (t)))]}^{2}\} \\ ⩽ \sum_{i = 1}^{n} \{F_{i} {[K_{L 1}^{T} | x (t) - \hat{x} (t) | + K_{L 2}^{T} | x (t - τ (t)) - \hat{x} (t - τ (t)) |]}^{2}\} \\ ⩽ \sum_{i = 1}^{n} \{F_{i} [e^{T} (t) K_{L 1} K_{L 1}^{T} e (t) + e^{T} (t - τ (t)) K_{L 2} K_{L 2}^{T} e (t - τ (t))]\} \\ + \sum_{i = 1}^{n} \{F_{i} [2 | e^{T} (t) | K_{L 1} K_{L 2}^{T} | e (t - τ (t)) |]\} \\ ⩽ 2 e^{T} (t) K_{L 1} t r {F} K_{L 1}^{T} e (t) + 2 e^{T} (t - τ (t)) K_{L 2} t r {F} K_{L 2}^{T} e (t - τ (t)) . \end{matrix} \end{matrix}

And there exists a positive definite matrix

M

to satisfy the following equation

\begin{matrix} \begin{matrix} 2 [\int_{t - τ (t)}^{t} \dot{e} (s) d s - e (t) + e (t - τ (t))] M [- \int_{t - τ (t)}^{t} {\dot{e}}^{T} (s) d s - e^{T} (t - τ (t))] = 0 . \end{matrix} \end{matrix}

For a real constant

δ > 0

, combining with the above formulas, we have

\begin{matrix} \dot{V} (t, e (t)) ⩽ - e^{T} (t) {(A + LH)}^{T} P e (t) - e^{T} (t) P (A + LH) e (t) + e^{T} (t) Q e (t) \\ + e^{T} (t) P (\bar{K} - K (x (t), x (t - τ (t)))) - e^{T} (t - τ (t)) (1 - δ) Q e (t - τ (t)) \\ + {(\bar{K} - K (x (t), x (t - τ (t))))}^{T} P e (t) + 2 e^{T} (t - τ (t)) K_{L 2} t r {F} K_{L 2}^{T} e (t - τ (t)) \\ + 2 e^{T} (t) K_{L 1} t r {F} K_{L 1}^{T} e (t) - {[\bar{K} - K (x (t), x (t - τ (t)))]}^{T} F [\bar{K} - K (x (t), x (t - τ (t)))] \\ + 2 [\int_{t - τ (t)}^{t} \dot{e} (s) d s - e (t) + e (t - τ (t))] M [- \int_{t - τ (t)}^{t} {\dot{e}}^{T} (s) d s - e^{T} (t - τ (t))] \\ ⩽ e^{T} (t) [- A^{T} P - H^{T} L^{T} P - P A - PLH + Q + 2 K_{L 1} t r {F} K_{L 1}^{T}] e (t) \\ + e^{T} (t) P (\bar{K} - K (x (t), x (t - τ (t)))) + {(\bar{K} - K (x (t), x (t - τ (t))))}^{T} P e (t) \\ - {[\bar{K} - K (x (t), x (t - τ (t)))]}^{T} F [\bar{K} - K (x (t), x (t - τ (t)))] \\ + e^{T} (t - τ (t)) [2 K_{L 2} t r {F} K_{L 2}^{T} - (1 - δ) Q - 2 M] e (t - τ (t)) \\ + 2 e^{T} (t - τ (t) M e (t) + 2 \int_{t - τ (t)}^{t} {\dot{e}}^{T} (s) d s M e (t) - 2 \int_{t - τ (t)}^{t} {\dot{e}}^{T} (s) d s M e (t - τ (t)) \\ - 2 e^{T} (t - τ (t)) M \int_{t - τ (t)}^{t} \dot{e} (s) d s - 2 \int_{t - τ (t)}^{t} {\dot{e}}^{T} (s) d s M \int_{t - τ (t)}^{t} \dot{e} (s) d s \\ ⩽ ȷ^{T} \cdot [\begin{matrix} Ω_{1} & P & M & M \\ P & - F & 0 & 0 \\ M & 0 & Ω_{2} & - 2 M \\ M & 0 & - 2 M & - 2 M \end{matrix}] \cdot ȷ, \end{matrix}

where

ȷ = {[e (t), (\bar{K} - K (x (t), x (t - τ (t)))), e (t - τ (t)), \int_{t - τ (t)}^{t} \dot{e} (s) d s]}^{T}

,

Ω_{1} = - A^{T} P - H^{T} G^{T} - P A - GH + Q + 2 K_{L 1} t r {F} K_{L 1}^{T}

,

Ω_{2} = 2 K_{L 2} t r {F} K_{L 2}^{T} - (1 - δ) Q - 2 M

, and

G = P L

.

Considering the above inequality, take

\begin{matrix} \begin{matrix} [\begin{matrix} Ω_{1} & P & M & M \\ P & - F & 0 & 0 \\ M & 0 & Ω_{2} & - 2 M \\ M & 0 & - 2 M & - 2 M \end{matrix}] < 0, \end{matrix} \end{matrix}

then the error system (9) is asymptotically stable, and the proof of Theorem 2 is completed. □

Appendix C. The Proof of Theorem 3

Proof.

Based on the Equation (15), take a Lyapunov–Krasovskii function(LKF),

\begin{matrix} \begin{matrix} V (t, e (t)) = e^{T} (t) P e (t) + t r {{\tilde{V}}_{t}^{T} \cdot {\tilde{V}}_{t}} + t r {{\tilde{W}}_{t}^{T} \cdot {\tilde{W}}_{t}} + {\tilde{b}}_{t}^{T} \cdot {\tilde{b}}_{t} \\ + \sum_{i = 1}^{t - 1} [t r {{\tilde{W}}_{i}^{T} \cdot {\tilde{W}}_{i}} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] . \end{matrix} \end{matrix}

By using the error system (14) and the Equation (15), the derivative of V (t; e(t)) can be obtained as follows,

\begin{matrix} \begin{matrix} \dot{V} (t, e (t)) = {\dot{e}}^{T} (t) P e (t) + e^{T} (t) P \dot{e} (t) + t r {{\tilde{V}}_{t}^{T} \cdot {\dot{\hat{V}}}_{t}} + t r {{\tilde{W}}_{t}^{T} \cdot {\dot{\hat{W}}}_{t}} + {\tilde{b}}_{t}^{T} \cdot {\dot{\hat{b}}}_{t} \\ + \sum_{i = 1}^{t - 1} [t r {{\tilde{W}}_{i}^{T} \cdot {\dot{\hat{W}}}_{i}} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] \\ = e^{T} (t) [- {(A + LH)}^{T} P - P (A + LH)] e (t) + {(\hat{K} - \bar{K})}^{T} P e (t) - ϵ_{1}^{T} P e (t) \\ + e^{T} (t) P (\hat{K} - \bar{K}) - e^{T} (t) P ϵ_{1} + t r {{\tilde{V}}_{t}^{T} \cdot {\dot{\hat{V}}}_{t}} + t r {{\tilde{W}}_{t}^{T} \cdot {\dot{\hat{W}}}_{t}} + {\tilde{b}}_{t}^{T} \cdot {\dot{\hat{b}}}_{t} \\ + \sum_{i = 1}^{t - 1} [t r {{\tilde{W}}_{i}^{T} \cdot {\dot{\hat{W}}}_{i}} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] . \end{matrix} \end{matrix}

Combining (8) and (13), it can be obtained

\begin{matrix} \begin{matrix} \hat{K} - \bar{K} = {\hat{V}}_{t}^{T} \cdot t a n h \{{\hat{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\hat{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\hat{b}}_{t}\} \\ - {\bar{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\} \\ = {\hat{V}}_{t}^{T} \cdot t a n h \{{\hat{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\hat{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\hat{b}}_{t}\} \\ - {\hat{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\hat{b}}_{t}\} \\ + {\hat{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\hat{b}}_{t}\} \\ - {\hat{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\} \\ + {\hat{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\} \\ - {\bar{V}}_{t}^{T} \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\} \\ ⩽ 2 {\hat{V}}_{t}^{T} \cdot \{({\hat{W}}_{t} - {\bar{W}}_{t}) x_{t} + \sum_{i = 1}^{t - 1} [({\hat{W}}_{i} - {\bar{W}}_{i}) x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})]\} + 2 {\hat{V}}_{t}^{T} \cdot ({\hat{b}}_{t} - {\bar{b}}_{t}) \\ + ({\hat{V}}_{t}^{T} - {\bar{V}}_{t}^{T}) \cdot t a n h \{{\bar{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\bar{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\bar{b}}_{t}\} \\ ⩽ 2 {\hat{V}}_{t}^{T} \cdot \{{\tilde{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\tilde{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})]\} + 2 {\hat{V}}_{t}^{T} \cdot {\tilde{b}}_{t} + {\tilde{V}}_{t}^{T} \cdot {\hat{S}}_{t} - {\tilde{V}}_{t}^{T} \cdot ϵ_{2}, \end{matrix} \end{matrix}

where

{\hat{S}}_{t} = t a n h \{{\hat{W}}_{t} x_{t} + \sum_{i = 1}^{t - 1} [{\hat{W}}_{i} x_{i} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + {\hat{b}}_{t}\}

and

ϵ_{2} = {\hat{S}}_{t} - {\bar{S}}_{t}

.

Considering the above equations, we have

\begin{matrix} \begin{matrix} \dot{V} (t, e (t)) ⩽ e^{T} (t) [- {(A + LH)}^{T} P - P (A + LH)] e (t) - e^{T} (t) P ϵ_{1} - ϵ_{1}^{T} P e (t) \\ + 4 {[{\hat{V}}_{t}^{T} {\tilde{W}}_{t} x_{t}]}^{T} P e (t) + 4 \sum_{i = 1}^{t - 1} [{({\hat{V}}_{t}^{T} {\tilde{W}}_{i} x_{i})}^{T} P e (t) \cdot \prod_{j = i + 1}^{t} σ (x_{j})] \\ + 4 {[{\hat{V}}_{t}^{T} \cdot {\tilde{b}}_{t}]}^{T} P e (t) + 2 {[{\tilde{V}}_{t}^{T} \cdot {\hat{S}}_{t}]}^{T} P e (t) - 2 {[{\tilde{V}}_{t}^{T} \cdot ϵ_{2}]}^{T} P e (t) + t r {{\tilde{V}}_{t}^{T} \cdot {\dot{\hat{V}}}_{t}} \\ + t r {{\tilde{W}}_{t}^{T} \cdot {\dot{\hat{W}}}_{t}} + {\tilde{b}}_{t}^{T} \cdot {\dot{\hat{b}}}_{t} + \sum_{i = 1}^{t - 1} [t r {{\tilde{W}}_{i}^{T} \cdot {\dot{\hat{W}}}_{i}} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] \\ ⩽ e^{T} (t) [- {(A + LH)}^{T} P - P (A + LH)] e (t) - e^{T} (t) P ϵ_{1} - ϵ_{1}^{T} P e (t) \\ + t r \{{\tilde{W}}_{t}^{T} \cdot [4 {\hat{V}}_{t} P e (t) x_{t}^{T} + {\dot{\hat{W}}}_{t}]\} + t r \{{\tilde{V}}_{t}^{T} \cdot [2 {\hat{S}}_{t} e^{T} (t) P + {\dot{\hat{V}}}_{t}]\} \\ + \sum_{i = 1}^{t - 1} [t r \{{\tilde{W}}_{i}^{T} \cdot [4 {\hat{V}}_{t} P e (t) x_{i}^{T} + {\dot{\hat{W}}}_{i}]\} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] - 2 ϵ_{2}^{T} {\tilde{V}}_{t} P e (t) \\ + {\tilde{b}}_{t}^{T} \cdot [4 {\hat{V}}_{t} P e (t) + {\dot{\hat{b}}}_{t}] . \end{matrix} \end{matrix}

Then, adaptive updating laws of the weights are given as follows

\begin{matrix} \begin{matrix} {\dot{\hat{W}}}_{i} & = N_{w_i} \cdot e^{T} (t) - 4 {\hat{V}}_{t} P e (t) x_{i}^{T} (i = 1, 2, \dots, t) \\ {\dot{\hat{V}}}_{t} & = N_{v_t} \cdot e^{T} (t) - 2 {\hat{S}}_{t} e^{T} (t) P \\ {\dot{\hat{b}}}_{t} & = N_{b_t} \cdot e (t) - 4 {\hat{V}}_{t} P e (t), \end{matrix} \end{matrix}

where

N_{w_i} = [\begin{matrix} N_{w_i, 1} \\ N_{w_i, 2} \\ ⋮ \\ N_{w_i, p} \end{matrix}]

and

N_{v_t} = [\begin{matrix} N_{v_t, 1} \\ N_{v_t, 2} \\ ⋮ \\ N_{v_t, p} \end{matrix}]

denote the design vectors;

N_{b_t} = [\begin{matrix} N_{b_t, 1, 1} & N_{b_t, 1, 2} & \dots & N_{b_t, 1, n} \\ N_{b_t, 2, 1} & N_{b_t, 2, 2} & \dots & N_{b_t, 2, n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ N_{b_t, p, 1} & N_{b_t, p, 2} & \dots & N_{b_t, p, n} \end{matrix}]

denotes the design matrix.

And we have

\begin{matrix} \begin{matrix} t r \{{\tilde{W}}_{t}^{T} \cdot [4 {\hat{V}}_{t} P e (t) x_{t}^{T} + {\dot{\hat{W}}}_{t}]\} = t r \{{\tilde{W}}_{t}^{T} N_{w_t} e^{T} (t)\} \\ = t r \{{[\begin{matrix} {\tilde{W}}_{t, 1, 1} & {\tilde{W}}_{t, 1, 2} & \dots & {\tilde{W}}_{t, 1, n} \\ {\tilde{W}}_{t, 2, 1} & {\tilde{W}}_{t, 2, 2} & \dots & {\tilde{W}}_{t, 2, n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\tilde{W}}_{t, p, 1} & {\tilde{W}}_{t, p, 2} & \dots & {\tilde{W}}_{t, p, n} \end{matrix}]}^{T} \cdot [\begin{matrix} N_{w_t, 1} \\ N_{w_t, 2} \\ ⋮ \\ N_{w_t, p} \end{matrix}] \cdot {[\begin{matrix} e_{1} (t) \\ e_{2} (t) \\ ⋮ \\ e_{n} (t) \end{matrix}]}^{T}\} \\ = \sum_{i = 1}^{n} [{\tilde{W}}_{t, •, i}^{T} \cdot N_{w_t} \cdot e_{i} (t)] \end{matrix} \end{matrix}

\begin{matrix} \begin{matrix} = {[\begin{matrix} {\tilde{W}}_{t, •, 1} \\ {\tilde{W}}_{t, •, 2} \\ ⋮ \\ {\tilde{W}}_{t, •, n} \end{matrix}]}^{T} \cdot [\begin{matrix} N_{w_t} & 0 & \dots & 0 \\ 0 & N_{w_t} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & N_{w_t} \end{matrix}] \cdot [\begin{matrix} e_{1} (t) \\ e_{2} (t) \\ ⋮ \\ e_{n} (t) \end{matrix}] \\ = ȷ_{w_t}^{T} \cdot Ω_{w_t} \cdot e (t), \\ \sum_{i = 1}^{t - 1} [t r \{{\tilde{W}}_{i}^{T} \cdot [4 {\hat{V}}_{t} P e (t) x_{i}^{T} + {\dot{\hat{W}}}_{i}]\} \cdot \prod_{j = i + 1}^{t} σ (x_{j})] \\ = \sum_{i = 1}^{t - 1} [ȷ_{w_i}^{T} \cdot Ω_{w_i} \cdot e (t) \cdot \prod_{j = i + 1}^{t} σ (x_{j})], \\ t r \{{\tilde{V}}_{t}^{T} \cdot [2 {\hat{S}}_{t} e^{T} (t) P + {\dot{\hat{V}}}_{t}]\} - 2 ϵ_{2}^{T} {\tilde{V}}_{t} P e (t) = t r \{{\tilde{V}}_{t}^{T} N_{v_t} e^{T} (t)\} - 2 ϵ_{2}^{T} {\tilde{V}}_{t} P e (t) \\ = \sum_{i = 1}^{n} [{\tilde{V}}_{t, •, i}^{T} \cdot N_{v_t} \cdot e_{i} (t)] - 2 \sum_{i = 1}^{n} [{\tilde{V}}_{t, •, i}^{T} \cdot ϵ_{2} \cdot P_{i •} \cdot e_{(} t)] \\ = {[\begin{matrix} {\tilde{V}}_{t, •, 1} \\ {\tilde{V}}_{t, •, 2} \\ ⋮ \\ {\tilde{V}}_{t, •, n} \end{matrix}]}^{T} \cdot ([\begin{matrix} N_{v_t} & 0 & \dots & 0 \\ 0 & N_{v_t} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & N_{v_t} \end{matrix}] - 2 [\begin{matrix} ϵ_{2} \cdot P_{1 •} \\ ϵ_{2} \cdot P_{2 •} \\ ⋮ \\ ϵ_{2} \cdot P_{n •} \end{matrix}]) \cdot [\begin{matrix} e_{1} (t) \\ e_{2} (t) \\ ⋮ \\ e_{n} (t) \end{matrix}] \\ = ȷ_{v_t}^{T} \cdot Ω_{v_t} \cdot e (t), \\ {\tilde{b}}_{t}^{T} \cdot [4 {\hat{V}}_{t} P e (t) + {\dot{\hat{b}}}_{t}] = {\tilde{b}}_{t}^{T} \cdot N_{b_t} \cdot e (t) . \end{matrix} \end{matrix}

The derivative of V (t; e(t)) can be obtained as follows

\begin{matrix} \begin{matrix} \dot{V} (t, e (t)) ⩽ e^{T} (t) [- {(A + LH)}^{T} P - P (A + LH)] e (t) - e^{T} (t) P ϵ_{1} - ϵ_{1}^{T} P e (t) \\ + ȷ_{w_t}^{T} \cdot Ω_{w_t} \cdot e (t) + \sum_{i = 1}^{t - 1} [ȷ_{w_i}^{T} \cdot Ω_{w_i} \cdot e (t) \cdot \prod_{j = i + 1}^{t} σ (x_{j})] + ȷ_{v_t}^{T} \cdot Ω_{v_t} \cdot e (t) \\ + {\tilde{b}}_{t}^{T} \cdot N_{b_t} \cdot e (t) \\ ⩽ e^{T} (t) [- {(A + LH)}^{T} P - P (A + LH)] e (t) - e^{T} (t) P ϵ_{1} - ϵ_{1}^{T} P e (t) \\ + {[\begin{matrix} ȷ_{w_t} \\ ȷ_{w_t - 1} \\ ⋮ \\ ȷ_{w_1} \\ ȷ_{v_t} \\ {\tilde{b}}_{t} \end{matrix}]}^{T} \cdot [\begin{matrix} Ω_{w_t} \\ Ω_{w_t - 1} \cdot σ (x_{t}) \\ ⋮ \\ Ω_{w_1} \cdot \prod_{j = 2}^{t} σ (x_{j}) \\ Ω_{v_t} \\ N_{b_t} \end{matrix}] \cdot e (t) \\ ⩽ e^{T} (t) [- {(A + LH)}^{T} P - P (A + LH)] e (t) - e^{T} (t) P ϵ_{1} - ϵ_{1}^{T} P e (t) \\ + ȷ_{w v b}^{T} \cdot Ω_{w v b} \cdot e (t) \\ ⩽ {[\begin{matrix} e (t) \\ ϵ_{1} \\ ȷ_{w v b} \end{matrix}]}^{T} \cdot [\begin{matrix} Ω_{3} & - P & \frac{1}{2} Ω_{w v b}^{T} \\ - P & 0 & 0 \\ \frac{1}{2} Ω_{w v b} & 0 & 0 \end{matrix}] \cdot [\begin{matrix} e (t) \\ ϵ_{1} \\ ȷ_{w v b} \end{matrix}] \\ ⩽ ȷ_{1}^{T} \cdot [\begin{matrix} Ω_{3} & - P & \frac{1}{2} Ω_{w v b}^{T} \\ - P & 0 & 0 \\ \frac{1}{2} Ω_{w v b} & 0 & 0 \end{matrix}] \cdot ȷ_{1} . \end{matrix} \end{matrix}

Select appropriate

N_{w_i}

,

N_{v_t}

and

N_{b_t}

to satisfy

\begin{matrix} \begin{matrix} [\begin{matrix} Ω_{3} & - P & \frac{1}{2} Ω_{w v b}^{T} \\ - P & 0 & 0 \\ \frac{1}{2} Ω_{w v b} & 0 & 0 \end{matrix}] ⩽ 0, \end{matrix} \end{matrix}

then the error system (14) is asymptotically stable, and the proof of Theorem 3 is completed. □

Appendix D. The Proof of Theorem 4

Proof.

By using Assumption 3 and (14), we have

\begin{matrix} \begin{matrix} \dot{\hat{y}} (t) - \dot{y} (t) = H (\dot{\hat{x}} (t) - \dot{x} (t)) = H [- (A + LH) e (t) + (\hat{K} - \bar{K} - ϵ_{1})] . \end{matrix} \end{matrix}

By using (17), Theorems 2 and 3, we have

\begin{matrix} \begin{matrix} [\begin{matrix} \hat{y} (t) - y (t) \\ \dot{\hat{y}} (t) - \dot{y} (t) \end{matrix}] = [\begin{matrix} H \\ - H (A + LH) \end{matrix}] e (t) . \end{matrix} \end{matrix}

And we have

\begin{matrix} \begin{matrix} Y = G \cdot e (t), \end{matrix} \end{matrix}

where

G = [\begin{matrix} H \\ - H (A + LH) \end{matrix}]

and

Y = [\begin{matrix} \hat{y} (t) - y (t) \\ \dot{\hat{y}} (t) - \dot{y} (t) \end{matrix}]

.

If

G

is left invertible, there exists

G^{- 1}

such that

e (t) = G^{- 1} \cdot Y

, and the proof of Theorem 4 is completed. □

References

Chua, L. Memristor-the missing circuit element. IEEE Trans. Circuit Theory 1971, 18, 507–519. [Google Scholar] [CrossRef]
Wu, A.; Wen, S.; Zeng, Z. Synchronization control of a class of memristor-based recurrent neural networks. Inf. Sci. 2012, 183, 106–116. [Google Scholar] [CrossRef]
Wen, S.; Zeng, Z.; Huang, T. Exponential stability analysis of memristor-based recurrent neural networks with time-varying delays. Neurocomputing 2012, 97, 233–240. [Google Scholar] [CrossRef]
Chen, J.; Zeng, Z.; Jiang, P. Global mittag-leffler stability and synchronization of memristor-based fractional-order neural networks. Neural Netw. 2014, 51, 1–8. [Google Scholar] [CrossRef] [PubMed]
Wang, F.; Na, H.; Wu, S.; Yang, X.; Guo, Y.; Lim, G.; Rashid, M.M. Delayed switching applied to memristor neural networks. J. Appl. Phys. 2012, 111, 507–511. [Google Scholar] [CrossRef]
Jiang, M.; Mei, J.; Hu, J. New results on exponential synchronization of memristor-based chaotic neural networks. Neurocomputing 2015, 156, 60–67. [Google Scholar] [CrossRef]
Wu, H.; Li, R.; Ding, S.; Zhang, X.; Yao, R. Complete periodic adaptive antisynchronization of memristor-based neural networks with mixed time-varying delays. Can. J. Phys. 2014, 92, 1337–1349. [Google Scholar] [CrossRef]
Hu, M.; Chen, Y.; Yang, J.; Wang, Y.; Li, H.H. A compact memristor-based dynamic synapse for spiking neural networks. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 2016, 36, 1353–1366. [Google Scholar] [CrossRef]
Zheng, M.; Li, L.; Xiao, J.; Yang, Y.; Zhao, H. Finite-time projective synchronization of memristor-based delay fractional-order neural networks. Nonlinear Dyn. 2017, 89, 2641–2655. [Google Scholar] [CrossRef]
Negrov, D.; Karandashev, I.; Shakirov, V.; Matveyev, Y.A. A plausible memristor implementation of deep learning neural networks. Neurocomputing 2015, 237, 193–199. [Google Scholar] [CrossRef]
Wang, X.; Ju, H.; Yang, H.; Zhong, S. A new settling-time estimation protocol to finite-time synchronization of impulsive memristor-based neural networks. IEEE Trans. Cybern. 2020, 52, 4312–4322. [Google Scholar] [CrossRef] [PubMed]
Chen, B.; Yang, H.; Zhuge, F.; Li, Y.; Chang, T.-C.; He, Y.-H.; Yang, W.; Xu, N.; Miao, X.-S. Optimal tuning of memristor conductance variation in spiking neural networks for online unsupervised learning. IEEE Trans. Electron Devices 2019, 66, 2844–2849. [Google Scholar] [CrossRef]
Liu, X.Y.; Zeng, Z.G.; Wunsch, D.C. Memristor-based LSTM network with in situ training and its applications. Neural Netw. 2020, 131, 300–311. [Google Scholar] [CrossRef] [PubMed]
Ning, L.; Wei, X. Bipartite synchronization for inertia memristor-based neural networks on coopetition networks. Neural Netw. 2020, 124, 39–49. [Google Scholar]
Xu, C.; Wang, C.; Sun, Y.; Hong, Q.; Deng, Q.; Chen, H. Memristor-based neural network circuit with weighted sum simultaneous perturbation training and its applications. Neurocomputing 2021, 462, 581–590. [Google Scholar] [CrossRef]
Prezioso, M.; Bayat, F.M.; Hoskins, B.D.; Likharev, K.K.; Strukov, D.B. Training andoperation of an integrated neuromorphic network based on metal-oxide memristors. Nature 2015, 521, 61–64. [Google Scholar] [CrossRef] [PubMed]
Kim, S.; Park, J.; Kim, T.H.; Hong, K.; Hwang, Y.; Park, B.g.; Kim, H. 4-bit multilevel operation in overshoot suppressed Al2O3/TiOx resistive random-access memory crossbar array. Adv. Intell. Syst. 2022, 4, 2100273. [Google Scholar] [CrossRef]
Wang, Z.R.; Joshi, S.; Savel’ev, S.; Song, W.; Midya, R.; Li, Y.; Rao, M.; Yan, P.; Asapu, S.; Zhuo, Y.; et al. Fully memristive neural networks for pattern classification with unsupervised learning. Nat. Electron. 2018, 1, 137–145. [Google Scholar] [CrossRef]
Choi, W.S.; Jang, J.T.; Kim, D.; Yang, T.J.; Kim, C.; Kim, H.; Kim, D.H. Influence of Al2O3 layer on InGaZnO memristor crossbar array for neuromorphic applications. Chaos Solitons Fractals 2022, 156, 111813. [Google Scholar] [CrossRef]
Bayat, F.M.; Prezioso, M.; Chakrabarti, B.; Nili, H.; Kataeva, I.; Strukov, D.B. Implementation of multilayer perceptron network with highly uniform passive memristive crossbar circuits. Nat. Commun. 2018, 9, 2331. [Google Scholar] [CrossRef]
Chen, Y.; Chen, G. Stability analysis of delayed neural networks based on a relaxed delay-product-type lyapunov functional. Neurocomputing 2021, 439, 340–347. [Google Scholar] [CrossRef]
Wang, L.; Song, Q.; Liu, Y.; Zhao, Z.; Alsaadi, F.E. Finite-time stability analysis of fractional-order complex-valued memristor-based neural networks with both leakage and time-varying delays. Neurocomputing 2017, 245, 86–101. [Google Scholar] [CrossRef]
Meng, Z.; Xiang, Z. Stability analysis of stochastic memristor-based recurrent neural networks with mixed time-varying delays. Neural Comput. Appl. 2017, 28, 1787–1799. [Google Scholar] [CrossRef]
Chen, C.; Zhu, S.; Wei, Y. Finite-time stability of delayed memristor-based fractional-order neural networks. IEEE Trans. Cybern. 2020, 50, 1607–1616. [Google Scholar] [CrossRef]
Du, F.; Lu, J. New criteria for finite-time stability of fractional order memristor-based neural networks with time delays. Neurocomputing 2021, 421, 349–359. [Google Scholar] [CrossRef]
Rakkiyappan, R.; Chandrasekar, A.; Laksmanan, S.; Park, J.H. State estimation of memristor-based recurrent neural networks with time-varying delays based on passivity theory. Complexity 2014, 19, 32–43. [Google Scholar] [CrossRef]
Bao, H.; Cao, J.; Kruths, J.; Alsaedi, A.; Ahmad, B. H∞ state estimation of stochastic memristor-based neural networks with time-varying delays. Neural Netw. 2018, 99, 79–91. [Google Scholar] [CrossRef]
Nagamani, G.; Rajan, G.S.; Zhu, Q. Exponential state estimation for memristor-based discrete-time bam neural networks with additive delay components. IEEE Trans. Cybern. 2020, 5, 4281–4292. [Google Scholar] [CrossRef]
Sakthivel, R.; Anbuvithya, R.; Mathiyalagan, K.; Prakash, P. Combined h∞ and passivity state estimation of memristive neural networks with random gain fluctuations. Neurocomputing 2015, 168, 1111–1120. [Google Scholar] [CrossRef]
Wei, F.; Chen, G.; Wang, W. Finite-time synchronization of memristor neural networks via interval matrix method. Neural Netw. 2020, 127, 7–18. [Google Scholar] [CrossRef]
Wang, J.; Wang, Z.; Chen, X.; Qiu, J. Synchronization criteria of delayed inertial neural networks with generally markovian jumping. Neural Netw. 2021, 139, 64–76. [Google Scholar] [CrossRef] [PubMed]
Li, L.; Xu, R.; Gan, Q.; Lin, J. Synchronization of neural networks with memristor-resistor bridge synapses and lévy noise. Neurocomputing 2021, 432, 262–274. [Google Scholar] [CrossRef]
Ren, H.; Peng, Z.; Gu, Y. Fixed-time synchronization of stochastic memristor-based neural networks with adaptive control. Neural Netw. 2020, 130, 165–175. [Google Scholar] [CrossRef] [PubMed]
Zheng, C.D.; Zhang, L. On synchronization of competitive memristor-based neural networks by nonlinear control. Neurocomputing 2020, 410, 151–160. [Google Scholar] [CrossRef]
Pan, C.; Bao, H. Exponential synchronization of complex-valued memristor-based delayed neural networks via quantized intermittent control. Neurocomputing 2020, 404, 317–328. [Google Scholar] [CrossRef]
Xiao, J.; Li, Y.; Zhong, S.; Xu, F. Extended dissipative state estimation for memristive neural networks with time-varying delay. ISA Trans. 2016, 64, 113–128. [Google Scholar] [CrossRef] [PubMed]
Li, R.; Gao, X.; Cao, J.; Zhang, K. Dissipativity and exponential state estimation for quaternion-valued memristive neural networks. Neurocomputing 2019, 363, 236–245. [Google Scholar] [CrossRef]
Li, H. Sampled-data state estimation for complex dynamical networks with time-varying delay and stochastic sampling. Neurocomputing 2014, 138, 78–85. [Google Scholar] [CrossRef]
Dai, X.; Yin, H.; Jha, N. Grow and prune compact, fast, and accurate lstms. IEEE Trans. Comput. 2020, 69, 441–452. [Google Scholar] [CrossRef]
Hornik, K.; Stinchcombe, M.; White, H. Multilayer feedforward networks are universal approximators. Neural Netw. 1989, 2, 359–366. [Google Scholar] [CrossRef]

Figure 1. Circuit of memristor neural networks [28].

Figure 2. Schematic diagram of a basic LSTMs cell.

Figure 3. Schematic diagram of an improved LSTMs cell.

Figure 4. Schematic diagram of the improved LSTMs.

Figure 5. The state and estimated state curves of the 2-dimensional memristor neural networks.

Figure 6. The estimated error curves of the states.

Figure 7. The derivative curves of the states and estimated states.

Figure 8. The error curves of the derivative of the states.

Figure 9. The output and estimated output curves.

Figure 10. The error curves of the output.

Figure 11. The state and estimated state with the adjusted weights and state with the random weights curves.

Figure 12. The output and estimated output with the adjusted weights and output with the random weights curves.

Figure 13. The state and estimated state curves of the 3-dimensional memristor neural networks.

Figure 14. The estimated error curves of the states.

Figure 15. The derivative curves of the states and estimated states.

Figure 16. The error curves of the derivative of the states.

Figure 17. The output and estimated output curves.

Figure 18. The error curves of the output.

Figure 19. The state and estimated state with the adjusted weights and state with the random weights curves.

Figure 20. The output and estimated output with the adjusted weights and output with the random weights curves.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ma, L.; Wang, M. State Estimation of Memristor Neural Networks with Model Uncertainties. Machines 2022, 10, 1228. https://doi.org/10.3390/machines10121228

AMA Style

Ma L, Wang M. State Estimation of Memristor Neural Networks with Model Uncertainties. Machines. 2022; 10(12):1228. https://doi.org/10.3390/machines10121228

Chicago/Turabian Style

Ma, Libin, and Mao Wang. 2022. "State Estimation of Memristor Neural Networks with Model Uncertainties" Machines 10, no. 12: 1228. https://doi.org/10.3390/machines10121228

APA Style

Ma, L., & Wang, M. (2022). State Estimation of Memristor Neural Networks with Model Uncertainties. Machines, 10(12), 1228. https://doi.org/10.3390/machines10121228

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

State Estimation of Memristor Neural Networks with Model Uncertainties

Abstract

1. Introduction

2. Preliminaries

3. Main Result

4. Simulation Analysis

4.1. Examples

4.2. Description of Simulation Results

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. The Proof of Theorem 1

Appendix B. The Proof of Theorem 2

Appendix C. The Proof of Theorem 3

Appendix D. The Proof of Theorem 4

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI