Article

An Overview of the Euler-Type Universal Numerical Integrator (E-TUNI): Applications in Non-Linear Dynamics and Predictive Control

by Paulo M. Tasinaffo *, Gildárcio S. Gonçalves, Johnny C. Marques, Luiz A. V. Dias and Adilson M. da Cunha
Instituto Tecnológico de Aeronáutica (ITA), São José dos Campos 12228-900, SP, Brazil
* Author to whom correspondence should be addressed.
Algorithms 2025, 18(9), 562; https://doi.org/10.3390/a18090562
Submission received: 7 August 2025 / Revised: 26 August 2025 / Accepted: 31 August 2025 / Published: 4 September 2025

Abstract

A Universal Numerical Integrator (UNI) is a computational framework that combines a classical numerical integration method, such as Euler, Runge–Kutta, or Adams–Bashforth, with a universal approximator of functions, such as a feed-forward neural network (including MLP, SVM, RBF, among others) or a fuzzy inference system. The Euler-Type Universal Numerical Integrator (E–TUNI) is a particular case of UNI based on the first-order Euler integrator and is designed to model non-linear dynamic systems observed in real-world scenarios accurately. The UNI framework can be organized into three primary methodologies: the NARMAX model (Non-linear AutoRegressive Moving Average with eXogenous input), the mean derivatives approach (which characterizes E–TUNI), and the instantaneous derivatives approach. The E–TUNI methodology relies exclusively on mean derivative functions, distinguishing it from techniques that employ instantaneous derivatives. Although it is based on a first-order scheme, the E–TUNI achieves an accuracy level comparable to that of higher-order integrators. This performance is made possible by the incorporation of a neural network acting as a universal approximator, which significantly reduces the approximation error. This article provides a comprehensive overview of the E–TUNI methodology, focusing on its application to the modeling of non-linear autonomous dynamic systems and its use in predictive control. Several computational experiments are presented to illustrate and validate the effectiveness of the proposed method.

1. Introduction

The numerical integration of non-linear dynamic systems is a central problem in computational modeling and control engineering. Classical methods such as Euler, Runge–Kutta, and predictor–corrector are widely used, but they often present limitations in terms of accuracy, stability, and adaptability, especially when applied to highly non-linear systems or model-based predictive control tasks. This scenario motivates the development of methodologies that combine simplicity, robustness, and generality while maintaining computational efficiency.
In this context, the Euler-Type Universal Numerical Integrator (E–TUNI) is proposed as a methodology that preserves the structural simplicity of the Euler scheme but extends its formulation to achieve better performance and greater applicability through its coupling with an Artificial Neural Network (ANN). By coupling the concept of a universal approximator of functions to the mean-derivative formulation, E-TUNI offers a flexible and practical framework for the simulation of non-linear systems and for applications in model-based predictive control.
Therefore, the main contributions of this paper are as follows: (i) presenting a basic mathematical formulation of E–TUNI, (ii) demonstrating four representative technological applications of E-TUNI in non-linear dynamics and/or predictive control, including the non-linear pendulum and the orbital transfer problem, (iii) illustrating a procedure for obtaining approximate continuous solutions from the naturally discrete scheme of E–TUNI (see Example 3 in Section 6), (iv) critically discussing the advantages and limitations of E–TUNI in comparison with other universal numerical integrators, pointing out potential directions for future research (see Example 4 in Section 6), (v) evaluating the computational cost of E–TUNI in different scenarios, showing that the method can be competitive or even superior to traditional methods in large-scale problems (see again Example 4), and (vi) pointing out how E–TUNI can be incorporated into hybrid frameworks, bringing the method closer to current research lines in deep learning applied to differential equations.
The remainder of this article is organized as follows: Section 2 presents a literature review on E–TUNI and other types of Universal Numerical Integrators (UNIs). Section 3 briefly defines the type of neural predictive control problem that is addressed in this article. Section 4 presents the notation and variables used in this formulation. Section 5 provides a theoretical overview of E–TUNI in a predictive control structure. Section 6 describes the numerical simulations and applications. Finally, Section 7 concludes the article with a summary of the results and prospects for future work.

2. Related Work

One of the earliest works on artificial neural networks is attributed to McCulloch and Pitts in [1], dated 1943. In the eighty years of development of artificial intelligence and neural networks, many significant advances have emerged. For example, in [2], Kolmogorov (1957) demonstrated that any n-dimensional function can be represented as a linear combination of non-linear one-dimensional functions. This result was later recognized in [3], where the concept of Kolmogorov neural networks was introduced. The classical back-propagation algorithm for training MLP networks was proposed in [4], and subsequent work [5,6] demonstrated that MLPs are universal approximators of functions. A detailed summary of the advances in neural networks in the 20th century can be found in [7].
One of the most relevant applications of neural networks is the modeling of non-linear dynamical systems. In [8], the concept of a Universal Numerical Integrator (UNI) was introduced, defined as the coupling of a conventional numerical integrator (e.g., Euler, Runge–Kutta, predictor–corrector, among others) to a universal approximator of functions (e.g., MLP, SVM, RBF, wavelet networks, fuzzy inference systems, paraconsistent inference systems, among others). The classification proposed in [8] divides UNIs into three main categories: (i) the NARMAX model, (ii) the mean derivative methodology (e.g., E–TUNI), and (iii) the instantaneous derivative methodology (e.g., Runge–Kutta neural network, Adams–Bashforth neural network, predictor–corrector neural network, among others).
Classic references on numerical integrators can be found in [9,10,11]. In [12,13], the NARMAX model is discussed in detail using neural networks. Euler’s original work from 1768 is presented in [14], in which the first-order scheme based on instantaneous derivatives was proposed. Although simple, Euler’s method is relatively imprecise. However, when mean derivatives replace instantaneous derivatives, the process becomes as accurate as higher-order integrators, which motivates the structure of the Euler-Type Universal Numerical Integrator (E–TUNI).
It should be noted that in [8], the E–TUNI was called “Euler Neural Network,” while in [15] the same term appeared in another context. To avoid ambiguity, in this work we adopt the designation E-TUNI. References [16,17,18,19] discuss the mean derivative methodology in different contexts, including predictive control [16], qualitative design analysis [17], and quantitative mathematical analysis [18]. In [19], the E-TUNI is applied technologically and mathematically in backward integration processes.
The first work identified in the literature on UNIs is attributed to Wang and Lin (1998) [20], who introduced the Runge–Kutta Neural Network (RKNN). Later work explored its application in control [21,22], while others addressed neural differential equations more broadly [23]. It is also important to note that, to our knowledge, the Predictor–Corrector Neural Network (PCNN) has not yet been formally introduced in the literature.
Finally, both the NARMAX model and the mean derivative methodology (E–TUNI) are intrinsically linked to a fixed integration step $\Delta t$. Changing the integration step requires a new neural network training process. In contrast, methodologies based on instantaneous derivatives (RKNN, ABNN, PCNN, among others) allow variations of $\Delta t$, within certain stability limits, without the need for retraining [8,17,18,20]. For validation and testing purposes, simple theoretical models of non-linear differential equations can be found in [24].

3. Neural Predictive Control Problem Formulation

Neural predictive control is an advanced strategy that combines prediction of future system behavior and optimization of control actions. Unlike traditional methods, which rely on complex mathematical models, neural predictive control uses artificial neural networks to learn directly from historical data, adaptively capturing non-linearities and complex dynamics.
The process begins with the collection of relevant variables, such as past system states, previous control actions, and external disturbances. A neural network (often an MLP) is trained to act as a predictive model, estimating future states based on these inputs. An optimizer then calculates the optimal sequence of control actions that minimizes a performance criterion, such as the error between the prediction and a desired target. The first action in this sequence is applied to the system, and the cycle repeats in real time, continually readjusting itself with new data.
The advantage of this method is that the neural network eliminates the need for explicit analytical equations, adapts to operational changes, and robustly handles noise. Typical applications include temperature control in industrial environments, flow management in watersheds, and chemical process automation. To implement it, the architecture of the artificial neural network is defined, representative data is collected, and the predictive control algorithm is integrated into the system. In short, neural predictive control offers a powerful and flexible alternative for controlling dynamic systems, combining machine learning and control theory in a cohesive, future-oriented framework.
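To make the predict-optimize-apply cycle described above concrete, the following Python sketch outlines a minimal receding-horizon loop. It assumes a hypothetical one-step neural predictor `predict_next(y, u)` (a previously trained model such as an E-TUNI) and uses a naive random search over candidate control sequences purely for illustration; it is not the Kalman-filter-based optimization adopted later in this article.

```python
import numpy as np

def receding_horizon_step(y_now, y_ref, predict_next, horizon=10,
                          n_candidates=200, u_bounds=(-1.0, 1.0), rng=None):
    """One cycle of a minimal neural predictive controller.

    predict_next(y, u) is assumed to be a previously trained one-step
    neural model; y_ref is the reference trajectory over the horizon
    (shape: horizon x n_states). Returns the first control action only.
    """
    rng = rng or np.random.default_rng()
    best_cost, best_seq = np.inf, None
    for _ in range(n_candidates):
        # Sample a candidate control sequence over the prediction horizon.
        u_seq = rng.uniform(*u_bounds, size=horizon)
        y, cost, u_prev = np.asarray(y_now, dtype=float), 0.0, u_seq[0]
        for j in range(horizon):
            y = predict_next(y, u_seq[j])              # model-based prediction
            cost += np.sum((y_ref[j] - y) ** 2)        # tracking-error term
            cost += 0.1 * (u_seq[j] - u_prev) ** 2     # control-smoothness term
            u_prev = u_seq[j]
        if cost < best_cost:
            best_cost, best_seq = cost, u_seq
    # Only the first action of the best sequence is applied (receding horizon),
    # and the whole cycle is repeated at the next sampling instant.
    return best_seq[0]
```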
In this article, a first-order Euler integrator is coupled to a neural network with a Multi-Layer Perceptron (MLP) architecture. This structure is the representative dynamic model of our real plant, which we call E–TUNI. The E–TUNI architecture is then coupled to a predictive control structure that minimizes the Mean Squared Error (MSE) of a quadratic functional composed of two parts to be optimized: (i) a smooth control policy and (ii) the tracking of a reference trajectory. In aerospace engineering problems, as discussed at the end of this article, achieving autonomous controllability of satellites and rockets is of fundamental importance, eliminating the need for human intervention.

4. Preliminaries and Symbols Used

To facilitate a deeper understanding of the theoretical framework developed in this paper, we present a complete definition of all symbols and variables used throughout the work. The proposed methodology is named Euler-Type Universal Numerical Integrator (E–TUNI), and it is based on a forward integration scheme that operates in a discrete-time setting. This approach is designed to approximate the behavior of continuous-time autonomous ordinary differential equations by means of a discrete formulation. For clarity, all notations used in this context are listed below and classified into two main categories: (i) variables defined in continuous time and (ii) variables defined in discrete time.
(i) Continuous Variables
  • $\dot{y} = f(y)$ ⋯ System of continuous differential equations.
  • $y = [y_1 \;\; y_2 \;\; \cdots \;\; y_n]^T$ ⋯ State variables.
  • $f(y) = [f_1(y) \;\; f_2(y) \;\; \cdots \;\; f_n(y)]^T$ ⋯ Instantaneous derivative functions.
  • $y_j^i(t) = g_j^i(t)$ ⋯ Particular continuous and differentiable curve of a family of solution curves of the dynamical system $\dot{y} = f(y)$.
  • $\dot{y}_j^i(t) = \dot{g}_j^i(t)$ ⋯ First derivative of $y_j^i(t)$.
(ii) Discrete Variables
  • $y^{ik} = y^i(t_0 + k \cdot \Delta t)$ ⋯ Vector of state variables at time $t_k$.
  • $y_j^{ik}$ ⋯ Scalar state variable for $j = 1, 2, \ldots, n$ at time $t_k$. It is a generic discretization point of the state variables generated by the integers $i$, $j$, and $k$.
  • $n = n_y$ ⋯ Total number of state variables.
  • $u^k = u(t_0 + k \cdot \Delta t)$ ⋯ Vector of control variables at time $t_k$.
  • $u_j^k$ ⋯ Scalar control variable for $j = 1, 2, \ldots, n_u$ at time $t_k$.
  • $n_u$ ⋯ Total number of control variables.
  • $y^{i,k+1} = y^i[t_0 + (k+1) \cdot \Delta t]$ ⋯ Exact vector of state variables at time $t_{k+1}$.
  • $y_j^{i,k+1}$ ⋯ Exact scalar state variable for $j = 1, 2, \ldots, n$ at time $t_{k+1}$.
  • $\hat{y}^{i,k+1} = \hat{y}^i[t_0 + (k+1) \cdot \Delta t]$ ⋯ Vector of state variables estimated by the UNI or E-TUNI at time $t_{k+1}$.
  • $\hat{y}_j^{i,k+1}$ ⋯ Scalar state variable estimated by the UNI or E-TUNI for $j = 1, 2, \ldots, n$ at time $t_{k+1}$.
  • $\tilde{y}^{i,k+1}$ ⋯ Vector of state variables estimated at time $t_{k+1}$ when using only the integrator, without the neural network.
  • $\tan_{\Delta t}\alpha^{ik} = [\tan_{\Delta t}\alpha_1^{ik} \;\; \tan_{\Delta t}\alpha_2^{ik} \;\; \cdots \;\; \tan_{\Delta t}\alpha_n^{ik}]^T$ ⋯ Exact vector of positive mean derivative functions at time $t_k$.
  • $\tan_{\Delta t}\alpha_j^{ik} = \dfrac{y_j^{i,k+1} - y_j^{ik}}{\Delta t}$ ⋯ Scalar positive mean derivative function for $j = 1, 2, \ldots, n$ at time $t_k$.
  • $\tan_{\Delta t}\hat{\alpha}^{ik} = [\tan_{\Delta t}\hat{\alpha}_1^{ik} \;\; \tan_{\Delta t}\hat{\alpha}_2^{ik} \;\; \cdots \;\; \tan_{\Delta t}\hat{\alpha}_n^{ik}]^T$ ⋯ Vector of positive mean derivative functions estimated by the E-TUNI at time $t_k$.
  • $\tan^k\theta^i = [\tan^k\theta_1^i \;\; \tan^k\theta_2^i \;\; \cdots \;\; \tan^k\theta_n^i]^T$ ⋯ Vector of positive instantaneous derivatives at time $t_k$.
  • $\tan^k\theta_j^i = \lim_{\Delta t \to 0} \dfrac{y_j^{i,k+1} - y_j^{ik}}{\Delta t}$ ⋯ Scalar positive instantaneous derivative for $j = 1, 2, \ldots, n$ at instant $t_k$.
  • $t_k$ ⋯ Time instant $t_k = t_0 + k \cdot \Delta t$.
  • $t_{k+1}$ ⋯ Time instant $t_{k+1} = t_0 + (k+1) \cdot \Delta t$.
  • $\Delta t$ ⋯ Integration step.
  • $i$ ⋯ Over-index that enumerates a particular curve from the family of curves of the dynamical system to be modeled ($i = 1, 2, \ldots, q$).
  • $j$ ⋯ Under-index that enumerates the state and control variables.
  • $k$ ⋯ Over-index that enumerates the discrete time instants ($k = 1, 2, \ldots, r$).
  • $r$ ⋯ Total number of horizons of the time variable.
  • $q$ ⋯ Total number of curves from the family of curves of the dynamic system to be modeled.
  • $t_k^*$ ⋯ Instant of time within the interval $[t_k, t_{k+1}]$ resulting from the Differential Mean Value Theorem (see Theorem 1).
  • $t_k^x$ ⋯ Instant of time within the interval $[t_k, t_{k+1}]$ resulting from the Integral Mean Value Theorem (see Theorem 2).
  • $m$ ⋯ Total number of horizons in a predictive control structure.
To interpret the notation used in this work, it is essential to distinguish between the variables $y^{i,k+1}$, $\tilde{y}^{i,k+1}$, and $\hat{y}^{i,k+1}$. The vector variable $y^{i,k+1}$ is the exact value of the state variables at time $t_{k+1} = t_0 + (k+1) \cdot \Delta t$. The vector variable $\tilde{y}^{i,k+1}$ is the value of the state variables estimated using only the numerical integrator. Finally, the vector variable $\hat{y}^{i,k+1}$ is the value of the state variables estimated using the numerical integrator coupled to an artificial neural network. Figure 1 follows this convention and helps to better understand the notations adopted here. Additionally, it is important to understand the meaning of the auxiliary variables $i$, $j$, and $k$; this is addressed in the next paragraph.
A key point in this notation is the interpretation of the over-indexes $k$ and $i$ and the under-index $j$ in the variables $y_j^{ik}$ and $\tan_{\Delta t}\alpha_j^{ik}$. When the auxiliary variables $i$, $j$, and $k$ are used simultaneously, they uniquely identify the secant ($\tan_{\Delta t}\alpha_j^{ik}$) at the point $y_j^{ik}$. Note that the over-index $k$ indicates the time instant $t_k = t_0 + k \cdot \Delta t$ of the secant, the under-index $j$ indicates the state variable in question, where $j = 1, 2, \ldots, n$, and the over-index $i$ ($i = 1, 2, \ldots, q$) indicates the particular curve on which the respective secant is located, from the family of possible curves of the system of differential equations considered. The value of $q$ can be as large as desired: the larger the value of $q$, the more distinct curves the neural network trains on, and the better its generalization. This convention is quite useful for fully understanding the formal proof of the general expression of E–TUNI carried out in Section 5.3.
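As an illustration of this indexing, the sketch below builds the training pairs used by the mean-derivative approach: for each curve $i$ of the family and each instant $k$, the input is the state $y^{ik}$ and the target is the secant $\tan_{\Delta t}\alpha^{ik} = (y^{i,k+1} - y^{ik})/\Delta t$. The right-hand side `f`, the list of initial conditions, and the use of SciPy's integrator are illustrative assumptions, not the article's MATLAB implementation.

```python
import numpy as np
from scipy.integrate import solve_ivp

def mean_derivative_patterns(f, y0_list, dt, r):
    """Build E-TUNI training patterns from q = len(y0_list) solution curves.

    f(t, y)  : instantaneous derivative functions (right-hand side of y' = f(y))
    y0_list  : list of initial conditions, one per curve i = 1..q
    dt       : fixed integration step Delta t
    r        : number of discrete instants per curve (k = 0..r)
    Returns (inputs, targets): each row of `inputs` is a state y^{ik} and the
    matching row of `targets` is the associated secant tan_{dt} alpha^{ik}.
    """
    inputs, targets = [], []
    t_eval = np.arange(r + 1) * dt
    for y0 in y0_list:                                   # loop over curves i
        sol = solve_ivp(f, (0.0, r * dt), y0, t_eval=t_eval, rtol=1e-9)
        y = sol.y.T                                      # shape (r+1, n): rows are y^{ik}
        secants = (y[1:] - y[:-1]) / dt                  # tan_{dt} alpha^{ik}, k = 0..r-1
        inputs.append(y[:-1])
        targets.append(secants)
    return np.vstack(inputs), np.vstack(targets)
```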

5. Mathematical Development

In this section, we present a concise and complete description of the Euler-Type Universal Numerical Integrator (E–TUNI). To this end, we present a formal mathematical proof of the general expression that governs E–TUNI’s operation in generating discrete solutions for autonomous non-linear dynamical systems governed by ordinary differential equations. We also present the correct way to use E–TUNI in a predictive control framework. We conclude this section by obtaining an approximate mathematical expression that allows E-TUNI to provide a continuous solution, rather than a discrete one, for autonomous dynamical systems.

5.1. Basic Mathematical Development of E-TUNI

We provide a brief mathematical description of E–TUNI below. Then, in the following sub-section, we perform a formal mathematical demonstration of the general expression of the first-order Euler integrator designed with mean derivative functions. By definition, the secants or mean derivatives $\tan_{\Delta t}\alpha_j^{ik}$ for $j = 1, 2, \ldots, n$ between the points $y_j^{i,k+1}$ and $y_j^{ik}$ are given by:
$\tan_{\Delta t}\alpha_j^{ik} = \dfrac{y_j^{i,k+1} - y_j^{ik}}{\Delta t}$    (1)
where $y_j^{i,k+1} = y_j^i[t + (k+1) \cdot \Delta t]$ is the forward state of the dynamic system, $y_j^{ik} = y_j^i[t + k \cdot \Delta t]$ is the present state of the dynamical system, the over-index $k$ indicates the instant $k$, the over-index $i$ suggests a discretization of the continuum, the under-index $j$ indicates the $j$-th state variable, $n$ is the total number of state variables, and $\Delta t$ is the integration step. We say more about the over-index $i$ later in this article.
A geometric and intuitive difference between the mean and the instantaneous derivative functions is shown in Figure 1. In line with Figure 1 and Figure 2, the E-TUNI is entirely based on the concept of mean derivative functions and not on the idea of instantaneous derivative functions. As will be seen throughout this article, this change is quite significant when incorporated adequately into the Euler-type first-order integrator. Furthermore, the mean derivative functions can also be obtained from supervised training with input/output patterns using a neural network with any feed-forward architecture (MLP, RBF, SVM, among others).
It is also essential to note that it is possible to train E–TUNI in two different ways, namely, (a) through the direct approach or (b) through the indirect or empirical approach. In the direct approach, the neural network is trained decoupled from the Euler-type integrator, while in the indirect or empirical approach the neural network is trained coupled to the structure of the first-order Euler integrator. For this reason, in the indirect approach the back-propagation algorithm needs to be modified slightly. In the references [8,17,18], this is explained in more detail.
In this way, we have a non-linear dynamic system of $n$ simultaneous first-order equations with dependent variables $y_1, y_2, \ldots, y_n$. If each of these variables satisfies a given initial condition for the same value $a$ of $t$, then we have an initial value problem for a first-order system, and we can write:
$\dot{y}_1 = f_1(y_1, y_2, \ldots, y_n), \quad y_1(a) = \eta_1$
$\dot{y}_2 = f_2(y_1, y_2, \ldots, y_n), \quad y_2(a) = \eta_2$
$\vdots$
$\dot{y}_n = f_n(y_1, y_2, \ldots, y_n), \quad y_n(a) = \eta_n$    (2)
In [14], the mathematician Leonhard Euler proposed in 1768 the first numerical integrator in the history of mathematics to approximately solve non-linear dynamical systems governed by Equation (2). This well-known solution is given by:
$y_1^{i,k+1} \approx \tan^k\theta_1^i \cdot \Delta t + y_1^{ik}$
$y_2^{i,k+1} \approx \tan^k\theta_2^i \cdot \Delta t + y_2^{ik}$
$\vdots$
$y_n^{i,k+1} \approx \tan^k\theta_n^i \cdot \Delta t + y_n^{ik}$    (3)
where $\tan^k\theta_j^i$ for $j = 1, 2, \ldots, n$ are instantaneous derivative functions, according to the graphical notation shown in Figure 1. On the other hand, there is an attempt in [19] to prove mathematically that if the instantaneous derivatives in (3) are exchanged for the mean derivatives, then the solution proposed by Leonhard Euler becomes exact, i.e.,
$y_1^{i,k+1} = \tan_{\Delta t}\alpha_1^{ik} \cdot \Delta t + y_1^{ik}$
$y_2^{i,k+1} = \tan_{\Delta t}\alpha_2^{ik} \cdot \Delta t + y_2^{ik}$
$\vdots$
$y_n^{i,k+1} = \tan_{\Delta t}\alpha_n^{ik} \cdot \Delta t + y_n^{ik}$    (4)
Comparing Equations (3) and (4), it is observed that the instantaneous derivative functions do not depend on the integration step, but the mean derivative functions do. Additionally, the equations in (4) can also be expressed in a more compact notation, given by:
$y^{i,k+1} = \tan_{\Delta t}\alpha^{ik} \cdot \Delta t + y^{ik}$    (5)
where $y^{i,k+1} = [y_1^{i,k+1} \;\; y_2^{i,k+1} \;\; \cdots \;\; y_n^{i,k+1}]^T$, $\tan_{\Delta t}\alpha^{ik} = [\tan_{\Delta t}\alpha_1^{ik} \;\; \tan_{\Delta t}\alpha_2^{ik} \;\; \cdots \;\; \tan_{\Delta t}\alpha_n^{ik}]^T$, and $y^{ik} = [y_1^{ik} \;\; y_2^{ik} \;\; \cdots \;\; y_n^{ik}]^T$. The generalization of the mean derivatives methodology to multiple backward inputs and/or multiple forward outputs is relatively easy. This expression can be obtained by the following equation:
$y_j^{i,k+p} = \displaystyle\sum_{m=0}^{p-1} \tan_{\Delta t}\alpha_j^{i,k+m} \cdot \Delta t + y_j^{ik}$    (6)
where $p$ is the number of backward and/or forward instants and $j = 1, 2, \ldots, n$. For example, if $p = p_1 + p_2$ then it is possible to design an E–TUNI with $p_1$ inputs in the role of backward mean derivatives and $p_2$ outputs in the role of forward mean derivatives. In this way, the reader is free to choose the values of $p_1$ and $p_2$ as long as the previous equality is satisfied. However, it should be noted that the first input of the neural network must be an absolute value, not a relative one.
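Once a network has been trained to return the mean derivatives, the discrete solution of Equation (5) is obtained by the simple recursion sketched below. The callable `net(y)` is a hypothetical stand-in for the trained approximator returning $\tan_{\Delta t}\hat{\alpha}^{ik}$ for the fixed step `dt` it was trained with.

```python
import numpy as np

def etuni_rollout(net, y0, dt, steps):
    """Propagate an autonomous system with the E-TUNI recursion
    y^{k+1} = tan_{dt} alpha^k * dt + y^k  (Equation (5)).

    net(y) must return the mean derivatives learned for this same dt;
    changing dt would require retraining the network.
    """
    y = np.asarray(y0, dtype=float)
    trajectory = [y.copy()]
    for _ in range(steps):
        y = net(y) * dt + y          # first-order update with MEAN derivatives
        trajectory.append(y.copy())
    return np.array(trajectory)      # shape (steps + 1, n_states)
```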
On the other hand, notice that if the reader tries to compare the NARMAX model with the E–TUNI structure, one can see that the former is very similar to the latter. In the NARMAX model, for example, the output of the universal approximator of functions is the forward instant y j ( t + Δ t ) . However, in the E-TUNI model, the output of the universal approximator of functions is the mean derivative function t a n Δ t α j i k at the present instant. This description is the only practical difference between these two methodologies. However, the E-TUNI may be a little more computationally efficient than the NARMAX model, as explained in the following paragraph.
Considering the direct approach to training the mean derivative functions required by the E–TUNI structure, it is possible to perform a theoretical analysis of the local error committed by this first-order universal numerical integrator. Thus, let the exact value $\bar{y}_j^{i,k+1}$ and the estimated value $\hat{y}_j^{i,k+1}$ of a given solution of a generic dynamical system be obtained, respectively, by Equations (7) and (8).
$\bar{y}_j^{i,k+1} = \tan_{\Delta t}\alpha_j^{ik} \cdot \Delta t + y_j^{ik}$    (7)
$\hat{y}_j^{i,k+1} = \left( \tan_{\Delta t}\alpha_j^{ik} \pm e_m \right) \cdot \Delta t + y_j^{ik}$    (8)
where $e_m$ is the mean absolute error of the output variables of the universal approximator of functions used to learn the mean derivative functions. Thus, if Equation (7) is subtracted from Equation (8) and the result of this subtraction is squared, we have:
$\left( \bar{y}_j^{i,k+1} - \hat{y}_j^{i,k+1} \right)^2 = \Delta t^2 \cdot e_m^2$    (9)
Equation (9) states that the local squared error made by E–TUNI dampens the squared training error of the universal approximator of functions if $0 < \Delta t < 1$; if $\Delta t > 1$, the local error is amplified. However, for a more accurate analysis of the global training error, further studies are needed.
Equation (9) is fundamental, as it partially explains why training the E–TUNI with a mean squared error $e_m^2$ greater than that obtained by training the NARMAX model can still yield good estimation results and potentially surpass those obtained with the NARMAX methodology. However, as stated earlier, this is true only if $\Delta t < 1$. Finally, an E–TUNI neural integrator can be used for the supervised training of a generic plant.
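As a worked illustration of Equation (9) (the numbers here are chosen only for illustration): with a network output error of $e_m = 10^{-3}$ and a step $\Delta t = 0.1$, the local error of one E–TUNI update is $\Delta t \cdot e_m = 10^{-4}$, one order of magnitude below the network error; with $\Delta t = 10$, the same network error is amplified to $10^{-2}$. This is the damping/amplification effect referred to above.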

5.2. Predictive Control Designed with E–TUNI

In a typical neural predictive control scheme, a feed-forward neural network can be trained to learn a discrete E-TUNI model. This discrete model can then be used as an internal response model to obtain smooth control actions that are capable of tracking a reference trajectory by minimizing a quadratic functional of the following form [16,25]:
$J = \dfrac{1}{2}\left\{ \displaystyle\sum_{j=1}^{m} \left[ y_r(t_j) - \hat{y}(t_j) \right]^T \cdot r_y^{-1}(t_j) \cdot \left[ y_r(t_j) - \hat{y}(t_j) \right] + \displaystyle\sum_{j=0}^{m-1} \left[ u(t_j) - u(t_{j-1}) \right]^T \cdot r_u^{-1}(t_j) \cdot \left[ u(t_j) - u(t_{j-1}) \right] \right\}$    (10)
where $y_r(t_j)$ is the reference trajectory at instant $t_j$, $m$ is the number of horizons ahead, $r_y^{-1}(t_j)$ and $r_u^{-1}(t_j)$ are positive definite weight matrices, and $\hat{y}(t_j)$ is the output of the previously trained E–TUNI. Thus, in [25] it is shown that the partial derivatives $\partial y^{i,k+q} / \partial u^{k}$ must be known in order to solve the optimization problem of Equation (10) iteratively through the Kalman filter. Also following [25], these partial derivatives can be calculated as follows:
$\dfrac{\partial y^{i,k+q}}{\partial u^{k}} = \left[ \dfrac{\partial \tan_{\Delta t}\alpha^{i,k+q-1}}{\partial y^{i,k+q-1}} \cdot \Delta t + I \right] \cdot \dfrac{\partial y^{i,k+q-1}}{\partial u^{k}}$    (11)
for $q = 2, 3, \ldots, m$,
where
$\left[ \dfrac{\partial y^{i,k+q-1}}{\partial u^{k}} \right]_{n_y \times n_u} = \Delta t \cdot \begin{bmatrix} \dfrac{\partial \tan_{\Delta t}\alpha_1^{i,k+q-2}}{\partial u_1^{k}} & \dfrac{\partial \tan_{\Delta t}\alpha_1^{i,k+q-2}}{\partial u_2^{k}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_1^{i,k+q-2}}{\partial u_{n_u}^{k}} \\ \dfrac{\partial \tan_{\Delta t}\alpha_2^{i,k+q-2}}{\partial u_1^{k}} & \dfrac{\partial \tan_{\Delta t}\alpha_2^{i,k+q-2}}{\partial u_2^{k}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_2^{i,k+q-2}}{\partial u_{n_u}^{k}} \\ \vdots & \vdots & \ddots & \vdots \\ \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{i,k+q-2}}{\partial u_1^{k}} & \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{i,k+q-2}}{\partial u_2^{k}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{i,k+q-2}}{\partial u_{n_u}^{k}} \end{bmatrix}$    (12)
$\left[ \dfrac{\partial \tan_{\Delta t}\alpha^{i,k+q-1}}{\partial y^{i,k+q-1}} \right]_{n_y \times n_y} = \begin{bmatrix} \dfrac{\partial \tan_{\Delta t}\alpha_1^{i,k+q-1}}{\partial y_1^{i,k+q-1}} & \dfrac{\partial \tan_{\Delta t}\alpha_1^{i,k+q-1}}{\partial y_2^{i,k+q-1}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_1^{i,k+q-1}}{\partial y_{n_y}^{i,k+q-1}} \\ \dfrac{\partial \tan_{\Delta t}\alpha_2^{i,k+q-1}}{\partial y_1^{i,k+q-1}} & \dfrac{\partial \tan_{\Delta t}\alpha_2^{i,k+q-1}}{\partial y_2^{i,k+q-1}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_2^{i,k+q-1}}{\partial y_{n_y}^{i,k+q-1}} \\ \vdots & \vdots & \ddots & \vdots \\ \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{i,k+q-1}}{\partial y_1^{i,k+q-1}} & \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{i,k+q-1}}{\partial y_2^{i,k+q-1}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{i,k+q-1}}{\partial y_{n_y}^{i,k+q-1}} \end{bmatrix}$    (13)
$\left[ \dfrac{\partial y^{i,k+1}}{\partial u^{k}} \right]_{n_y \times n_u} = \Delta t \cdot \begin{bmatrix} \dfrac{\partial \tan_{\Delta t}\alpha_1^{ik}}{\partial u_1^{k}} & \dfrac{\partial \tan_{\Delta t}\alpha_1^{ik}}{\partial u_2^{k}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_1^{ik}}{\partial u_{n_u}^{k}} \\ \dfrac{\partial \tan_{\Delta t}\alpha_2^{ik}}{\partial u_1^{k}} & \dfrac{\partial \tan_{\Delta t}\alpha_2^{ik}}{\partial u_2^{k}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_2^{ik}}{\partial u_{n_u}^{k}} \\ \vdots & \vdots & \ddots & \vdots \\ \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{ik}}{\partial u_1^{k}} & \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{ik}}{\partial u_2^{k}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{ik}}{\partial u_{n_u}^{k}} \end{bmatrix}$    (14)
In Equations (11)–(14), we have that n y = n is the total number of state variables and n u is the total number of control variables. Furthermore, to derive these equations, it was assumed that the neural network, which learns the mean derivative functions, was designed with only one backward input for the state and control variables, and also only one forward output for the mean derivatives.
So, for example, if a neural network with an MLP architecture is used, the calculation of the partial derivatives, necessary for the use of the gradient training algorithm, can be obtained as follows [25]:
$\left[ \tan_{\Delta t}\alpha_1^{ik}, \; \tan_{\Delta t}\alpha_2^{ik}, \; \ldots, \; \tan_{\Delta t}\alpha_{n_y}^{ik} \right]^T = f_{NN}^{MLP}\!\left( \left[ y_1(t), \ldots, y_{n_y}(t), u_1(t), \ldots, u_{n_u}(t) \right]^T \right)$    (15)
where
$\dfrac{\partial y^{l}}{\partial \bar{y}^{k}} = \dfrac{\partial y^{l}}{\partial \bar{y}^{k+1}} \cdot W^{k+1} \cdot I_{f'}(\bar{y}^{k}), \quad \text{for } k = l-1, l-2, \ldots, 1$
$\dfrac{\partial y^{l}}{\partial \bar{y}^{l}} = I_{f'}(\bar{y}^{l})$
$I_{f'}(\bar{y}^{l}) = \begin{bmatrix} f'(\bar{y}_1^{l}) & 0 & \cdots & 0 \\ 0 & f'(\bar{y}_2^{l}) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & f'(\bar{y}_{n_l}^{l}) \end{bmatrix}$
It is essential to note that $l$ is a generic layer of the MLP network. So $y^{l}$ denotes the output values of layer $l$, and when $l$ is the last layer these outputs are necessarily the mean derivative functions. The $f(\cdot)$ functions can be any sigmoid function. Furthermore, if another neural architecture is used, for example RBF networks or wavelets, only Equation (15) changes; Equations (11)–(14) remain unchanged. The reason for this is that Equations (11)–(14) refer exclusively to the type of integrator used, which, in this case, is necessarily the E–TUNI.
On the other hand, Equation (15) refers exclusively to the type of feed-forward neural network used, which, in this case, is necessarily the MLP network. Thus, Equation (15) is nothing more than an iterative version of the back-propagation algorithm, which calculates the partial derivatives of the output of the MLP network with respect to its inputs rather than with respect to the synaptic weights.
It is also worth noticing that, to fully understand Equations (11)–(14), it is convenient to consult reference [25], as it is necessary to chain several first-order Euler integrators in time, all working exclusively with mean derivative functions. This is because the horizon $m$ generally involves temporal advances greater in number than the total number of delayed inputs of the E–TUNI used.
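A minimal sketch of the recursion in Equations (11)–(14) is given below. It assumes a hypothetical one-step model `tan_alpha(y, u)` returning the mean derivatives and approximates the Jacobians $\partial \tan_{\Delta t}\alpha / \partial y$ and $\partial \tan_{\Delta t}\alpha / \partial u$ by central finite differences instead of the analytical back-propagation of Equation (15); for simplicity the future controls are held at $u^k$, and the Kalman-filter optimization of [25] is not reproduced.

```python
import numpy as np

def jacobian(fun, x, eps=1e-6):
    """Central finite-difference Jacobian of fun with respect to the 1-D array x."""
    x = np.asarray(x, dtype=float)
    f0 = np.asarray(fun(x))
    J = np.zeros((f0.size, x.size))
    for i in range(x.size):
        dx = np.zeros_like(x)
        dx[i] = eps
        J[:, i] = (np.asarray(fun(x + dx)) - np.asarray(fun(x - dx))) / (2 * eps)
    return J

def prediction_sensitivities(tan_alpha, y_k, u_k, dt, m):
    """Return d y^{k+q} / d u^k for q = 1..m following Equations (11)-(14).

    tan_alpha(y, u) is the trained mean-derivative model used in the
    E-TUNI recursion y^{k+1} = tan_alpha(y^k, u^k) * dt + y^k.
    """
    sensitivities = []
    # Base case, Equation (14): d y^{k+1} / d u^k = dt * d tan_alpha / d u.
    S = dt * jacobian(lambda u: tan_alpha(y_k, u), u_k)
    y = tan_alpha(y_k, u_k) * dt + y_k
    sensitivities.append(S)
    for _ in range(2, m + 1):
        # Recursion, Equation (11): S <- (d tan_alpha/d y * dt + I) @ S.
        A = jacobian(lambda yy: tan_alpha(yy, u_k), y)
        S = (A * dt + np.eye(A.shape[0])) @ S
        y = tan_alpha(y, u_k) * dt + y
        sensitivities.append(S)
    return sensitivities
```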
Thus, the determination of predictive control actions can be treated as a stochastic parameter estimation problem, if the minimization of the functional of Equation (10) is viewed as the following stochastic problem:
$y_r(t_j) = \hat{y}(t_j) + v_y(t_j)$    (16)
$0 = u(t_{j-1}) - u(t_{j-2}) + v_u(t_{j-1})$    (17)
$E[v_y(t_j)] = 0, \quad E[v_y(t_j) \cdot v_y^T(t_j)] = r_y(t_j)$    (18)
$E[v_u(t_j)] = 0, \quad E[v_u(t_j) \cdot v_u^T(t_j)] = r_u(t_j)$    (19)
with $j = 1, 2, \ldots, m$, where $\hat{y}(t_j)$ is the dynamics obtained through E-TUNI. To solve Equations (16)–(19) stochastically, an iterative approach is necessary, in which at each iteration a perturbation is made to obtain a linear approximation of (16) and (17) through the Kalman filter. Thus, to solve the problem of Equation (16), a perturbation is performed to obtain a linear approximation of this same equation, as follows:
$\alpha(i) \cdot \left[ y_r(t_j) - \bar{y}(t_j; i) \right] = \displaystyle\sum_{k=0}^{j-1} \left[ \dfrac{\partial \hat{y}(t_j)}{\partial u(t_k)} \right]_{\bar{u}(t_k; i)} \cdot \left[ u(t_k; i) - \bar{u}(t_k; i) \right] + v_y(t_j)$    (20)
For a detailed knowledge of the equations of the Kalman filter, which solves this problem iteratively, see [25].

5.3. Correct Mathematical Demonstration of the E-TUNI General Expression

In this section, we formally demonstrate the general mathematical expression for E-TUNI, which is nothing more than a first-order Euler-type integrator that works with mean derivative functions. This development is done here because, as mentioned previously, there is a proof error in the general expression for E–TUNI presented in reference [16]. Furthermore, this proof will be summarized, as it has already been extensively detailed in [19]. Therefore, the starting point is Figure 3.
To help the reader understand this figure, we explain it starting from the point $y_j^{ik}$. The point $y_j^{ik}$ is an instant of the solution of the considered autonomous dynamic system. The variable $j$ denotes the $j$-th state variable of the considered dynamic system, where $j = 1, 2, \ldots, n$; that is, there are a total of $n$ state variables. The variable $i$ represents the $i$-th curve of the family of curves confined to the interval $[y_j^{\min}(t_0), y_j^{\max}(t_0)]$ that solves the considered dynamic system. By the continuum hypothesis, one can have infinitely many curves ($i = 1, 2, \ldots, \infty$). The variable $k$ represents the $k$-th instant of time, that is, $t_k = t_0 + k \cdot \Delta t$. In the case of the variable $k$, if the solution of the dynamical system does not leave the region of interest $[y_j^{\min}(t_0), y_j^{\max}(t_0)]$ and the system is autonomous, then the variable $k$ can also go to infinity ($k = 1, 2, \ldots, \infty$). Also, when we write $y_j^i(t_k^x)$ or $y_j^i(t_k^*)$, it means that $t_k < t_k^x < t_{k+1}$ and $t_k < t_k^* < t_{k+1}$.
This notation for $y_j^{ik}$ is sufficient to uniquely map it to its respective region of interest, where its associated characteristic secant $\tan_{\Delta t}\alpha_j^{ik}$ is confined. So, in this case, for example, we can say that for the state $y_j^{ik} = y_j^i(t_0 + k \cdot \Delta t)$ the associated characteristic secant is given by $\tan_{\Delta t}\alpha_j^{ik}$.
Thus, the secant $\tan_{\Delta t}\alpha_j^{ik}$ is associated with the state variable $y_j$, with a specific curve $i$ of the family of solutions, in this case $y_j^i(t)$, and with the instant $t_k$. The fact that the secant is associated with the instant $t_k$ means that it is confined to the closed interval $[t_k, t_{k+1}]$. Thus, the secant $\tan_{\Delta t}\alpha_j^{ik}$ starts at $t_k$ and ends at $t_{k+1}$.
Furthermore, the states $y_j^i(t_k^x)$ and $y_j^i(t_k^*)$ are special states, where both $t_k^x$ and $t_k^*$ are also confined to the closed interval $[t_k, t_{k+1}]$. The instants $t_k^x$ and $t_k^*$ are directly associated, respectively, with the integral mean value theorem and the differential mean value theorem [26]. Having made these preliminary considerations, we now begin the demonstration in a precise manner. Thus, let the following autonomous system of non-linear differential equations be given by:
$\dot{y} = f(y)$    (21)
$y = \left[ y_1 \;\; y_2 \;\; \cdots \;\; y_n \right]^T$    (22)
$f(y) = \left[ f_1(y) \;\; f_2(y) \;\; \cdots \;\; f_n(y) \right]^T$    (23)
Consider also, by definition, $y_j^i = y_j^i(t)$ for $j = 1, 2, \ldots, n$ a particular trajectory of a family of solutions to the system of differential equations $\dot{y} = f(y)$ passing through $y_j^i(t_0)$ at instant $t_0$, i.e., initialized from a domain of interest $[y_j^{\min}(t_0), y_j^{\max}(t_0)]$ for each of the $n$ components, where $y_j^{\min}(t_0)$ and $y_j^{\max}(t_0)$ are finite. It is also appropriate to introduce the following vector notation over (21):
$y^{i0} = y^i(t_0) = \left[ y_1^i(t_0) \;\; y_2^i(t_0) \;\; \cdots \;\; y_n^i(t_0) \right]^T$    (24)
$y^{i} = y^i(t) = \left[ y_1^i(t) \;\; y_2^i(t) \;\; \cdots \;\; y_n^i(t) \right]^T$    (25)
In [26], by definition, the secant between two points $y_j^{ik}$ and $y_j^{i,k+1}$ belonging to the curve $y_j^i(t)$ for $j = 1, 2, \ldots, n$ is the line segment joining these two points. Thus, the tangents of the secants between the points $y_1^{ik}$ and $y_1^{i,k+1}$, $y_2^{ik}$ and $y_2^{i,k+1}$, ⋯, $y_n^{ik}$ and $y_n^{i,k+1}$ are defined as:
$\tan_{\Delta t}\alpha^{i}(t + k \cdot \Delta t) = \tan_{\Delta t}\alpha^{ik} = \left[ \tan_{\Delta t}\alpha_1^{ik} \;\; \tan_{\Delta t}\alpha_2^{ik} \;\; \cdots \;\; \tan_{\Delta t}\alpha_n^{ik} \right]^T$    (26)
with
$\tan_{\Delta t}\alpha_j^{ik} = \dfrac{y_j^{i,k+1} - y_j^{ik}}{\Delta t}$    (27)
for $j = 1, 2, \ldots, n$. Thus, we can now state the two fundamental theorems needed to consolidate our proof. These two theorems are the differential mean value theorem and the integral mean value theorem, which are presented below without proof [26].
Theorem 1
(The Differential Mean Value Theorem). If a function $g_j^i(t)$ for $j = 1, 2, \ldots, n$ is continuous on the closed interval $[t_k, t_{k+1}]$ and differentiable on the open interval $(t_k, t_{k+1})$, then there is at least one number $t_k^*$ with $t_k < t_k^* < t_k + \Delta t = t_{k+1}$ such that
$\dot{g}_j^i(t_k^*) = \dfrac{g_j^{i,k+1} - g_j^{ik}}{\Delta t}$    (28)
Theorem 2
(The Integral Mean Value Theorem). If a function $g_j^i(t)$ for $j = 1, 2, \ldots, n$ is continuous on the closed interval $[t_k, t_{k+1}]$, then there is at least one inner point $t_k^x$ in $[t_k, t_{k+1}]$ such that
$g_j^i(t_k^x) \cdot \Delta t = \displaystyle\int_{t_k}^{t_{k+1}} g_j^i(t)\, dt$    (29)
It is important to note that, in general, $t_k^*$ is different from $t_k^x$. Also, the mean value theorems say nothing about how to determine the values of $t_k^*$ and $t_k^x$; they simply state that $t_k^*$ and $t_k^x$ are confined to the closed interval $[t_k, t_{k+1}]$.
Property 1.
Applying Theorem 2 to the curve $\dot{g}_j^i(t)$ is equivalent to applying Theorem 1 to the curve $g_j^i(t)$, both on the same closed interval $[t_k, t_{k+1}]$, that is, $\dot{g}_j^i(t_k^x) = \dot{g}_j^i(t_k^*)$.
Proof. 
It is detailed in reference [19]. □
Principle 1.
Given the non-linear autonomous dynamic system $\dot{y}_j^i = f_j(y_1^i, y_2^i, \ldots, y_n^i) = f_j(y^i)$ and its respective general solutions $y_j^i(t) = g_j^i(t)$ for $j = 1, 2, \ldots, n$ and $i = 1, 2, \ldots, \infty$, the autonomous instantaneous derivative functions $f_j(y^i)$ can be replaced by the exclusively non-autonomous instantaneous derivative functions given by $\dot{g}_j^i(t)$, that is, $f_j(y_1^i, y_2^i, \ldots, y_n^i) = f_j(y^i) = \dot{g}_j^i(t) = h_j^i(t)$ for $j = 1, 2, \ldots, n$ and $i = 1, 2, \ldots, \infty$.
Proof. 
It is detailed in reference [19]. □
Theorem 3.
The discrete and exact general solution of the autonomous system of non-linear ordinary differential equations of the type $\dot{y}^i = f(y^i)$ can be established through the first-order Euler relation $y_j^{i,k+1} = \tan_{\Delta t}\alpha_j^{ik} \cdot \Delta t + y_j^{ik}$, for $y_j^{ik}$ and $\Delta t$ fixed, provided that the general solution of this dynamical system, given by $y_j^i(t) = g_j^i(t)$ and $\dot{y}_j^i(t) = \dot{g}_j^i(t)$, is known in advance for $j = 1, 2, \ldots, n$, $i = 1, 2, \ldots, \infty$ and $t \in [t_0, t_f]$. Furthermore, the solutions $g_j^i(t)$ for $j = 1, 2, \ldots, n$ must all be continuous and differentiable; however, it suffices for $\dot{g}_j^i(t)$, $j = 1, 2, \ldots, n$, to be continuous.
Proof. 
It is detailed in reference [19]. The proof is consolidated using the theorems and principles described earlier in this section. The starting point for fully understanding the demonstration is the mapping described in Figure 3. □
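For the reader's convenience, the chain of reasoning behind Theorem 3 can be summarized as follows; this is a condensed sketch of the argument detailed in [19], not a substitute for the formal proof.

```latex
% Sketch of the argument behind Theorem 3 (full details in [19]).
y_j^{i,k+1} - y_j^{ik}
  = \int_{t_k}^{t_{k+1}} \dot{g}_j^i(t)\,dt            % Principle 1 + Fundamental Theorem of Calculus
  = \dot{g}_j^i(t_k^{x}) \cdot \Delta t                % Theorem 2 applied to the continuous curve \dot{g}_j^i(t)
  = \dot{g}_j^i(t_k^{*}) \cdot \Delta t                % Property 1
  = \frac{g_j^{i,k+1} - g_j^{ik}}{\Delta t} \cdot \Delta t   % Theorem 1
  = \tan_{\Delta t}\alpha_j^{ik} \cdot \Delta t ,      % Equation (27)
\qquad\text{hence}\qquad
y_j^{i,k+1} = \tan_{\Delta t}\alpha_j^{ik} \cdot \Delta t + y_j^{ik}.
```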
Finally, the E–TUNI universal numerical integrator is suitable for solving only ordinary differential equations. However, it should be noted that the current use of neural networks in solving Partial Differential Equations (PDEs) is quite extensive [27,28,29,30,31,32,33,34,35]. Additionally, ref. [36] explains how to use the Runge–Kutta numerical integrator to solve partial differential equations (mainly hyperbolic ones). Therefore, using the E–TUNI to solve partial differential equations may be a good option for future work and should be investigated more carefully.

5.4. Mathematical Relationship Between Mean and Instantaneous Derivatives

A very remarkable fact is that it is possible to obtain an equation relating the instantaneous derivative functions to the mean derivative functions. This equation can be easily obtained using the chain rule for functions of several independent variables [26]. In this way, suppose two previously trained neural networks represent the instantaneous and mean derivative functions, respectively, by:
$\left[ \dot{y}_1(t), \; \dot{y}_2(t), \; \ldots, \; \dot{y}_{n_y}(t) \right]^T = \left[ \tan^k\theta_1^i, \; \tan^k\theta_2^i, \; \ldots, \; \tan^k\theta_{n_y}^i \right]^T = f_{NN}^{id}\!\left( \left[ y_1(t), \; y_2(t), \; \ldots, \; y_{n_y}(t) \right]^T, \; \hat{w} \right)$    (30)
and
$\left[ \tan_{\Delta t}\alpha_1^{ik}, \; \tan_{\Delta t}\alpha_2^{ik}, \; \ldots, \; \tan_{\Delta t}\alpha_{n_y}^{ik} \right]^T = f_{NN}^{md}\!\left( \left[ y_1(t), \; y_2(t), \; \ldots, \; y_{n_y}(t) \right]^T, \; \hat{w} \right)$    (31)
For reasons of simplification, we do not consider the case using control variables in Equations (30) and (31). So, if we use the chain rule [26] in Equation (31) we get:
$\dfrac{d}{dt}\tan_{\Delta t}\alpha_j^{ik} = \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y_1^{ik}} \cdot \dfrac{d y_1^{ik}}{dt} + \cdots + \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y_{n_y}^{ik}} \cdot \dfrac{d y_{n_y}^{ik}}{dt} = \left[ \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y_1^{ik}}, \; \ldots, \; \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y_{n_y}^{ik}} \right] \cdot \left[ \dfrac{d y_1^{ik}}{dt}, \; \ldots, \; \dfrac{d y_{n_y}^{ik}}{dt} \right]^T = \left[ \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y_1^{ik}}, \; \ldots, \; \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y_{n_y}^{ik}} \right] \cdot \left[ \tan^k\theta_1^i, \; \ldots, \; \tan^k\theta_{n_y}^i \right]^T = \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y^{ik}} \circ \tan^k\theta^i, \quad \text{for } j = 1, 2, \ldots, n_y$    (32)
For the sake of simplicity, we can represent the expression (32) as follows:
$\tan_{\Delta t}\Psi_j^{ik} = \dfrac{d}{dt}\tan_{\Delta t}\alpha_j^{ik} = \dfrac{\partial \tan_{\Delta t}\alpha_j^{ik}}{\partial y^{ik}} \circ \tan^k\theta^i, \quad \text{for } j = 1, 2, \ldots, n_y$    (33)
The operator ∘ is just the usual dot product applied over a Euclidean space. So, just for completeness, the vector form of Equation (33) can also be expressed by:
$\dfrac{d}{dt}\tan_{\Delta t}\alpha^{ik} = \dfrac{\partial \tan_{\Delta t}\alpha^{ik}}{\partial y^{ik}} \cdot \tan^k\theta^i$    (34)
where
$\dfrac{d}{dt}\tan_{\Delta t}\alpha^{ik} = \left[ \dfrac{d}{dt}\tan_{\Delta t}\alpha_1^{ik} \;\; \dfrac{d}{dt}\tan_{\Delta t}\alpha_2^{ik} \;\; \cdots \;\; \dfrac{d}{dt}\tan_{\Delta t}\alpha_{n_y}^{ik} \right]^T$    (35)
$\dfrac{\partial \tan_{\Delta t}\alpha^{ik}}{\partial y^{ik}} = \begin{bmatrix} \dfrac{\partial \tan_{\Delta t}\alpha_1^{ik}}{\partial y_1^{ik}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_1^{ik}}{\partial y_{n_y}^{ik}} \\ \dfrac{\partial \tan_{\Delta t}\alpha_2^{ik}}{\partial y_1^{ik}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_2^{ik}}{\partial y_{n_y}^{ik}} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{ik}}{\partial y_1^{ik}} & \cdots & \dfrac{\partial \tan_{\Delta t}\alpha_{n_y}^{ik}}{\partial y_{n_y}^{ik}} \end{bmatrix}$    (36)
$\tan^k\theta^i = \left[ \tan^k\theta_1^i \;\; \tan^k\theta_2^i \;\; \cdots \;\; \tan^k\theta_{n_y}^i \right]^T$    (37)
There is an essential relationship between instantaneous and mean derivative functions. Using these two kinds of derivative functions, that is, $\tan^k\theta^i$ and $\tan_{\Delta t}\alpha^{ik}$, it is possible to interpolate a parabola on the discrete interval $[t_k, t_{k+1}]$ that passes through at least two exact points of the solution of the considered non-linear dynamical system in this same interval. Thus, Table 1 lists the points used to perform this parabolic interpolation, and Equations (39)–(41) give the coefficients of the parabola (38).
$y_j^{ik}(t) = \alpha_k \cdot t^2 + \beta_k \cdot t + \gamma_k$    (38)
$\alpha_k = \dfrac{1}{2} \cdot \tan_{\Delta t}\Psi_j^{ik}$    (39)
$\beta_k = \tan_{\Delta t}\alpha_j^{ik} - \dfrac{1}{2} \cdot (t_{k+1} + t_k) \cdot \tan_{\Delta t}\Psi_j^{ik}$    (40)
$\gamma_k = \dfrac{1}{2} \cdot \left( t_k \cdot t_{k+1} \cdot \tan_{\Delta t}\Psi_j^{ik} - 2 \cdot t_k \cdot \tan_{\Delta t}\alpha_j^{ik} + 2 \cdot y_j^{ik} \right)$    (41)
where $t \in [t_k, t_{k+1}]$.
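The coefficients (39)–(41) are straightforward to evaluate once the two derivative networks are available. The sketch below assumes precomputed scalar values of the mean derivative `tan_alpha`, its time derivative `tan_psi` (Equation (33)/(34)), and the state `y_k` at instant $t_k$ (the numeric values in the sanity check are arbitrary); it returns the interpolating parabola on $[t_k, t_{k+1}]$.

```python
import numpy as np

def interpolating_parabola(y_k, tan_alpha, tan_psi, t_k, dt):
    """Coefficients of Equation (38) built from Equations (39)-(41).

    y_k       : scalar state y_j^{ik} at instant t_k
    tan_alpha : mean derivative tan_{dt} alpha_j^{ik} on [t_k, t_{k+1}]
    tan_psi   : its time derivative tan_{dt} Psi_j^{ik}
    Returns a callable p(t) valid for t in [t_k, t_{k+1}].
    """
    t_k1 = t_k + dt
    a = 0.5 * tan_psi                                                 # Equation (39)
    b = tan_alpha - 0.5 * (t_k1 + t_k) * tan_psi                      # Equation (40)
    c = 0.5 * (t_k * t_k1 * tan_psi - 2 * t_k * tan_alpha + 2 * y_k)  # Equation (41)
    return lambda t: a * t**2 + b * t + c

# Sanity check of the construction: the parabola reproduces y_k at t_k and has
# mean slope tan_alpha over the interval (arbitrary illustrative numbers).
p = interpolating_parabola(y_k=1.0, tan_alpha=-0.8, tan_psi=0.3, t_k=0.0, dt=0.1)
assert np.isclose(p(0.0), 1.0)
assert np.isclose((p(0.1) - p(0.0)) / 0.1, -0.8)
```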

6. Results and Analysis

In this article, we perform a complete computational numerical analysis, studying the training of the neural integration structures discussed in the previous sections, i.e., the E–TUNI. Thus, for all the experiments that are presented below, it was standardized to use MLP neural networks with the traditional back-propagation [4] and Kalman filter [37,38,39] training algorithms in Example 2 and the Levenberg–Marquardt [40] training algorithm in Examples 3 and 4. All experiments were trained using only one inner layer in the MLP network.
Example 1.
Check the validity of Principle 1 for the dynamical system $\dot{y} = f(y) = y^2$.
Proof. 
From $\dot{y} = \dfrac{dy}{dt} = y^2$, we have $\displaystyle\int_{y(t_0)}^{y(t)} y^{-2}\, dy = \displaystyle\int_{t_0}^{t} dt$, as a result of the fundamental theorem of calculus. Solving this integral analytically results in $y(t) = \dfrac{y(t_0)}{-y(t_0)\, t + y(t_0)\, t_0 + 1}$. Differentiating the function $y(t)$ with respect to time $t$, we have $\dot{y} = \dot{g}(t) = y(t_0)\,\dfrac{d}{dt}\left[ -y(t_0)\, t + y(t_0)\, t_0 + 1 \right]^{-1} = \dfrac{y^2(t_0)}{\left[ -y(t_0)\, t + y(t_0)\, t_0 + 1 \right]^2}$. Thus, replacing $y(t)$ in $\dot{y} = y^2$ results in $\dot{y} = f(y) = y^2 = \left[ \dfrac{y(t_0)}{-y(t_0)\, t + y(t_0)\, t_0 + 1} \right]^2 = \dot{g}(t)$. □
The graph in Figure 4 illustrates the function $y(t)$ (upper part) and the function $\dot{y}(t)$ (lower part). For a proper understanding of the graphs in Figure 4, note that $t_0 = 0$ and that there are seven curves representing the solution family of Example 1. For example, $y(t_0) = y(0) = 1$ gives a particular solution of the dynamical system $\dot{y} = y^2$ that has a vertical asymptote at $t = 1$ (upper part of Figure 4). To the left of this asymptote, the solution of the dynamical system is plotted in blue; to the right of this asymptote, the solution is plotted in red. This convention is followed for the remaining solutions that appear in Figure 4. It is also important to note that the blue curves have distinct vertical asymptotes for each of the initial conditions, while all red curves have horizontal asymptotes at $y = 0$ as $t$ tends to infinity.
Note that Principle 1 is valid for discontinuous solutions in $y(t)$. However, using the general expression of E–TUNI to learn this discontinuous solution is not possible according to the theory presented in this article. The reasons for this impossibility are as follows: (i) the integral and differential mean value theorems require the E–TUNI solution to be continuous and differentiable, and (ii) E–TUNI would have to learn a numerical value equal to infinity at the points of the vertical asymptotes shown in Figure 4 (upper part), which is impossible, as conventional computers have finite memory. Using E–TUNI in examples like this requires further study.
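A quick numerical check of Principle 1 for Example 1 can be done as below: on a branch of the solution away from the asymptote, the autonomous derivative $f(y(t)) = y(t)^2$ and the non-autonomous derivative $\dot{g}(t)$ coincide to numerical precision. This is a small sanity sketch, not part of the original experiments.

```python
import numpy as np

y0, t0 = 1.0, 0.0
t = np.linspace(0.0, 0.9, 50)                    # stay left of the vertical asymptote at t = 1

g = y0 / (-y0 * t + y0 * t0 + 1.0)               # analytic solution y(t) = g(t)
g_dot = y0**2 / (-y0 * t + y0 * t0 + 1.0) ** 2   # non-autonomous derivative g'(t)
f_of_y = g**2                                    # autonomous derivative f(y) = y^2 on the curve

assert np.allclose(f_of_y, g_dot)                # Principle 1: f(y(t)) = g'(t) on this branch
```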
Example 2.
Solve the predictive control problem, necessarily using E–TUNI as the model of the non-linear plant, for the system of non-linear ordinary differential equations describing the Earth/Mars orbit transfer dynamics of a rocket of mass $m_r$, as shown in Figure 5. Solve this problem for two different horizons, that is, $m = 1$ and $m = 20$, according to Equations (10)–(20). The set of ordinary differential equations describing this problem is given below [24,25]:
$\dot{m}_r = -0.0749$
$\dot{r} = w$
$\dot{w} = \dfrac{v^2}{r} - \dfrac{\mu}{r^2} + \dfrac{T \cdot \sin\theta}{m_r}$
$\dot{v} = -\dfrac{w \cdot v}{r} + \dfrac{T \cdot \cos\theta}{m_r}$    (42)
An illustrative scheme of this dynamic system can be seen in Figure 5. The state variables for this predictive control problem are $m_r$ (rocket mass), $r$ (orbit radius), $w$ (radial velocity), and $v$ (transverse velocity). The only control variable is the thrust angle $\theta$ of the rocket. The normalized constants of this problem are $\mu = 1.0$ (gravitational constant), $T = 0.1405$ (rocket thrust), $t_0 = 0$ (initial instant), and $t_f = 3.3$ (final instant). In this problem, one unit of the normalized time variable $t$ corresponds to 58.2 days; that is, for example, in Figure 6 each unit of time equals 58.2 days.
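To give a concrete picture of how training trajectories for this plant can be produced, the sketch below integrates Equation (42) with SciPy for a fixed thrust-angle value; in practice the thrust angle would be sampled over its admissible range so that the E-TUNI also learns the influence of the control input. The constants follow the normalized values quoted above, while the initial condition and the fixed thrust angle are illustrative assumptions not stated in the text.

```python
import numpy as np
from scipy.integrate import solve_ivp

MU, T_THRUST = 1.0, 0.1405          # normalized gravitational constant and thrust
M_DOT = -0.0749                     # normalized fuel consumption rate

def orbit_transfer_rhs(t, state, theta):
    """Right-hand side of Equation (42); theta is the thrust angle (control)."""
    m_r, r, w, v = state
    return [M_DOT,
            w,
            v**2 / r - MU / r**2 + T_THRUST * np.sin(theta) / m_r,
            -w * v / r + T_THRUST * np.cos(theta) / m_r]

# One sample trajectory with the thrust angle held at an arbitrary value and an
# assumed initial condition (unit mass, circular orbit of unit radius).
dt, t_f = 0.1, 3.3
t_eval = np.arange(0.0, t_f + dt / 2, dt)
sol = solve_ivp(orbit_transfer_rhs, (0.0, t_f), [1.0, 1.0, 0.0, 1.0],
                t_eval=t_eval, args=(0.2,), rtol=1e-9)

states = sol.y.T                                     # rows are [m_r, r, w, v] at each t_k
mean_derivatives = (states[1:] - states[:-1]) / dt   # E-TUNI training targets for this dt
```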
We trained the E–TUNI using an MLP network with only one inner layer. This inner layer contained 41 neurons and one bias neuron. We applied the hyperbolic tangent activation function in the inner layer, while a purely linear activation function was used at the network output. The training and testing yielded Mean Square Errors (MSEs) of $9.021 \times 10^{-6}$ and $1.067 \times 10^{-5}$, respectively.
In the neural training of the E–TUNI structure, two standard training algorithms were used: the classic back-propagation [4] and the Kalman filter with recursive and parallel processing [39]. The two algorithms were alternated during training. In addition, this training took a considerable amount of time, as a shallow-architecture network was used instead of a deep-architecture network.
Figure 6 presents the trajectory traced by the Kalman filter for a predictive control structure whose plant model was obtained through E–TUNI. The solid line is the reference trajectory, and the dotted line is the trajectory tracked by the Kalman filter. Figure 7 presents the control estimated by the Kalman filter in a predictive control structure designed according to Equations (10)–(20) and associated with the trajectory traced in Figure 6. The Kalman filter equations employed were taken from [25]. The integration step used was $\Delta t = 0.1$ and the horizon was $m = 1$. As can be seen, the estimated control was not very good when only one horizon was used in Equation (10).
On the other hand, Figure 8 and Figure 9 present the same E–TUNI structure as in the previous case, but now with a new control estimate obtained using a horizon of $m = 20$ (see Equation (10)). Note that this small change significantly improved the result obtained. However, while the training time for a horizon of $m = 1$ may take only an hour or two, the training time for a horizon of $m = 20$ can be as long as ten days if the integration step is changed from $\Delta t = 0.1$ to $\Delta t = 0.01$. For a better understanding of E–TUNI coupled to a predictive control structure, see reference [25].
Figure 6, Figure 7, Figure 8 and Figure 9 are an original extension of what can be found in [25]. In [25], the predictive control structure with E–TUNI was not explored for reference trajectories displaced from the initial condition, as presented here. Similar behavior was observed for the other state variables $m_r$, $w$, and $v$; therefore, the corresponding graphics have been omitted due to space constraints.
Figure 10 and Figure 11 were taken from [16] (see page 149 and Figure 6.11). They represent the application of a predictive control structure on the E–TUNI for the case where no deviation between the reference trajectories and the initial condition of the dynamic system was imposed. Note that an integration step of $\Delta t = 0.01$ and a horizon of $m = 10$ were used. Note also that the 10-fold reduction in the integration step, although it makes the algorithm much slower, considerably refines the control policy obtained in Figure 11, compared to Figure 7 and Figure 9.
Example 3.
The non-linear simple pendulum is a second-order autonomous system given by $l \cdot \ddot{\theta} + g \cdot \sin\theta = 0$, where $l$ and $g$ are, respectively, the length of the pendulum and the local acceleration due to gravity. Here, we solve this problem by using E–TUNI for several different integration steps. Furthermore, for each of these integration steps we obtain the interpolating parabolas governed by Equations (38)–(41).
To understand precisely how Example 3 is solved, we follow the basic algorithm below (a code sketch of the first steps is given after the list):
  • Step 1. Generate the E-TUNI input/output training patterns through the Runge–Kutta 4-5 integrator, applied to the non-linear simple pendulum equation. Note that, as this dynamical system is non-linear, its solution in the phase plane is not a perfect circle.
  • Step 2. Use the Levenberg–Marquardt algorithm in the direct approach to train two distinct neural networks, namely: (i) a neural network to learn the instantaneous derivative functions and (ii) another neural network to learn the mean derivative functions.
  • Step 3. Determine the vector $\tan_{\Delta t}\Psi^{ik} = \frac{d}{dt}\tan_{\Delta t}\alpha^{ik}$ using Equations (34)–(37). Note that the partial derivatives $\partial \tan_{\Delta t}\alpha^{ik} / \partial y^{ik}$ were obtained numerically and not analytically; for this, a perturbation $\Delta y^{ik} = 10^{-6}$ was used.
  • Step 4. Determine the interpolating parabolas within a horizon of interest $[t_0, t_f]$ and with a step of $\Delta t$. For this, Equations (38)–(41) were used recursively.
  • Step 5. Compare the results obtained for different $\Delta t$ integration steps. Also, compare the solution of the interpolating parabolas with the mean derivative equations.
  • Step 6. A summary of this algorithm is presented in Figure 12 and Figure 13.
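A minimal sketch of Steps 1 and 2 is given below, using SciPy for the reference integration and scikit-learn's MLPRegressor as a stand-in for the MATLAB Neural Network Toolbox actually used in the article (single hidden layer, tanh activation, linear output). The pendulum constants, number of curves, neuron count and step are illustrative assumptions; the domain bounds follow the values reported in the text.

```python
import numpy as np
from scipy.integrate import solve_ivp
from sklearn.neural_network import MLPRegressor

L_PEND, G = 1.0, 9.81        # assumed pendulum length and gravity

def pendulum_rhs(t, s):
    theta, theta_dot = s
    return [theta_dot, -(G / L_PEND) * np.sin(theta)]

def build_patterns(dt, n_curves=20, steps=20, rng=np.random.default_rng(0)):
    """Step 1: sample initial conditions in the training domain and build
    (state, mean derivative) and (state, instantaneous derivative) patterns."""
    X, Y_mean, Y_inst = [], [], []
    for _ in range(n_curves):
        s0 = [rng.uniform(-np.pi / 2, np.pi / 2), rng.uniform(-2.0, 2.0)]
        t_eval = np.arange(steps + 1) * dt
        sol = solve_ivp(pendulum_rhs, (0.0, steps * dt), s0, t_eval=t_eval, rtol=1e-9)
        s = sol.y.T
        X.append(s[:-1])
        Y_mean.append((s[1:] - s[:-1]) / dt)                                # tan_{dt} alpha
        Y_inst.append(np.array([pendulum_rhs(0.0, si) for si in s[:-1]]))   # tan theta
    return np.vstack(X), np.vstack(Y_mean), np.vstack(Y_inst)

# Step 2 (direct approach): train one network per kind of derivative function.
dt = 0.1
X, Y_mean, Y_inst = build_patterns(dt)
net_mean = MLPRegressor(hidden_layer_sizes=(20,), activation='tanh',
                        max_iter=10000).fit(X, Y_mean)
net_inst = MLPRegressor(hidden_layer_sizes=(20,), activation='tanh',
                        max_iter=10000).fit(X, Y_inst)
```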
To solve this problem, all graphical simulation results were produced using an MLP neural network with only one inner layer (a shallow network). In the inner layer, the hyperbolic tangent sigmoid activation function was employed, while a purely linear function was used at the output of the neural network. The training algorithm used was the traditional Levenberg–Marquardt [40], always employing the direct approach of E–TUNI rather than the indirect approach. For that, MATLAB R2024b Update 1 (24.2.0.2740171) and its Neural Network Toolbox were used.
The dynamical system in question was trained within the finite domains $Dom_{\theta} = [-\pi/2, +\pi/2]$ rad and $Dom_{\dot{\theta}} = [-2, +2]$ rad/s. For all neural networks trained, 400 training patterns were used; 75% of them were used for training, 15% for validation, and 15% for testing. All training runs were standardized to exactly 10,000 epochs. Table 2 summarizes the main results achieved in these training sessions.
Analyzing Table 2, it is observed that the greater the integration step $\Delta t$ used in the E-TUNI training, the greater the training error for the validation patterns. This fact experimentally confirms Equation (9), which states that the integration step $\Delta t$ amplifies the Mean Squared Error (MSE) of E–TUNI training when it is greater than 1.00.
However, the most interesting numerical results concerning Example 3 are shown in Figure 12 and Figure 13. In Figure 12, the integration steps decrease from top to bottom. Notice that when the integration step $\Delta t$ is equal to 10.0, E–TUNI could not accurately learn the angular position prediction $\theta$ (see the blue points in Figure 12). With integration steps $\Delta t$ equal to or less than 1.00, E–TUNI was able to learn them perfectly. Multiple instances of E–TUNI training were carried out for the integration step $\Delta t$ equal to 10.0, but the best result achieved was the one presented in Table 2, that is, $MSE = 1.7520 \times 10^{-3}$, confirming, again, the validity of Equation (9) as a limiting factor for this method. The E–TUNI training runs with integration steps $\Delta t$ equal to or less than 1.00 were easily learned by the MLP neural networks in the first attempts.
Figure 13 also presents several simulation graphs, with the integration steps $\Delta t$ used in the E–TUNI training again decreasing from top to bottom. The dotted pink lines represent the true values of the dynamical system in question. The solid blue lines represent the mean derivatives between two consecutive points. The solid red lines are the interpolating parabolas between two successive points. Note that, for very small $\Delta t$ integration steps, the parabolic interpolation using the mean and instantaneous derivative functions simultaneously is quite accurate and noticeably better than using E–TUNI alone.
Analyzing Figure 13 more closely, observe that the E–TUNI equation is equivalent to consecutive uniform rectilinear motions with constant velocities equal to the mean derivatives (constant from interval to interval). On the other hand, the equations of the interpolating parabolas correspond to consecutive uniformly varied motions with accelerations $\tan_{\Delta t}\Psi_j^{ik}$ (also constant from interval to interval).
Note also that uniformly varied motions (interpolating parabolas) allow at most one change of direction within each interval, whereas uniform rectilinear motion (E–TUNI) allows none. For this reason, the interpolating parabolas appear to be more accurate than the mean derivative segments for very small integration steps. Because neural networks are universal approximators of functions [2,5,6], the physical variables used as inputs to E–TUNI can be of any kind.
Additionally, the Euler integrator designed exclusively with instantaneous derivative functions is never able to achieve a precision equivalent to that of the Euler integrator designed with mean derivative functions. This is mainly the case when large integration steps are used.
Finally, it is only possible to use control variables (in the inputs of E–TUNI) to obtain the parabolic interpolations if the control variable remains constant throughout the entire interval $[t_k, t_{k+1}]$. This property is only practical for control if the E–TUNI integration step is very small, close to zero.
Example 4.
Solve the Lorenz dynamical equation involving deterministic chaos using three algorithms based on Universal Numerical Integrators (UNIs): the NARMAX model, E–TUNI, and RKNN. Perform a comparative study of these three basic UNI methodologies. Equation (43) represents the dynamical system in question. In this simulation, use the default parameters $\sigma = 10$, $\rho = 28$, and $\beta = 8/3$.
$\dot{y}_1 = \sigma \cdot (y_2 - y_1)$
$\dot{y}_2 = y_1 \cdot (\rho - y_3) - y_2$
$\dot{y}_3 = y_1 \cdot y_2 - \beta \cdot y_3$    (43)
The Lorenz system, composed of three non-linear ordinary differential equations, was originally derived by Edward Lorenz in 1963 as a simplified model for studying atmospheric convection. This system exhibits the fundamental properties that define deterministic chaos.
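For reproducibility of the benchmark in Figure 14, a sketch of the reference integration is given below; the initial condition, time span and sampling step are illustrative assumptions, since they are not stated in the text. The sampled trajectory is also the natural source of one-step training patterns for the NARMAX, E–TUNI and RKNN models.

```python
import numpy as np
from scipy.integrate import solve_ivp

SIGMA, RHO, BETA = 10.0, 28.0, 8.0 / 3.0

def lorenz_rhs(t, y):
    """Instantaneous derivative functions of Equation (43)."""
    y1, y2, y3 = y
    return [SIGMA * (y2 - y1), y1 * (RHO - y3) - y2, y1 * y2 - BETA * y3]

dt = 0.01                                            # assumed sampling step
t_eval = np.arange(0.0, 25.0, dt)
sol = solve_ivp(lorenz_rhs, (0.0, 25.0), [1.0, 1.0, 1.0],
                t_eval=t_eval, rtol=1e-10, atol=1e-12)

states = sol.y.T                                     # benchmark trajectory (as in Figure 14)
mean_derivatives = (states[1:] - states[:-1]) / dt   # E-TUNI targets for this fixed dt
```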
Table 3 presents the main training parameters for the experiment related to Example 4. As can be seen, the first three rows of Table 3 present the mean squared errors for training, validation, and testing, respectively. This table also has three columns of training data. The first column presents the training data for the NARMAX model, the second column presents the training data for E–TUNI, and the third column presents the training data for RKNN. All four experiment tables follow this same pattern.
Figure 14 presents the numerical solution of the Lorenz equation using the Runge–Kutta 4-5 algorithm with the instantaneous derivative functions given by Equation (43). This solution in red is used as a benchmark. Figure 15 compares the benchmark solution with the results obtained by the NARMAX, E–TUNI, and RKNN models. Note that all the hybrid models used had difficulty learning this dynamic. However, these models showed great versatility for future improvement using deep learning neural networks. The reason why models utilizing some UNI had trouble learning this dynamic equation is explained in the following paragraphs.
For an Artificial Neural Network (ANN) to learn the inherent dynamics of the Lorenz system and maximize its short-term prediction window, the Mean Squared Error (MSE) of training must be minimized. However, due to the property of extreme sensitivity to initial conditions—a phenomenon popularly known as the “Butterfly Effect”—no MSE value, no matter how small, will guarantee prediction accuracy over an infinite time horizon. Thus, the neural network becomes susceptible to the exact error amplification mechanism identified by Lorenz, which exemplifies the intrinsic limits to the predictability of chaotic systems. The necessity of a low MSE for this dynamic model is crucial, and its reasons and limitations can be elucidated in two main points:
(i) Exponential Error Amplification: In inherently chaotic systems, such as the Lorenz system, any modeling or estimation error, no matter how minute, is amplified exponentially over time. Consequently, to maximize the prediction horizon (defined as the time interval during which the prediction remains useful), it is essential to minimize the initial error to the maximum technically feasible extent. To illustrate, a training MSE on the order of $10^{-1}$ would result in an almost instantaneous divergence of predictions. An MSE of $10^{-3}$ would allow for a small number of predictive steps before divergence, while values on the order of $10^{-6}$ or $10^{-8}$, as achieved in the present study, provide a significantly longer prediction horizon.
(ii) Fundamental Limit of Predictability: Even achieving an exceptionally low training MSE, approaching machine precision, does not overcome the fundamental barrier imposed by the chaotic nature of the system. The inevitable failure of long-term prediction is not a deficiency of the Artificial Neural Network (ANN) architecture, but rather an intrinsic property of the system under study. Therefore, the objective of minimizing the MSE is not to enable perpetual prediction—a theoretical impossibility—but rather to extend the limit of the useful prediction horizon to its maximum.
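As a back-of-the-envelope illustration of point (i), not a computation from the paper, the useful prediction horizon can be related to the largest Lyapunov exponent of the Lorenz attractor, which is approximately $\lambda_{\max} \approx 0.9$ for the classical parameters (a literature value). Assuming the initial state error is on the order of the root-mean-square training error, $\delta_0 \approx \sqrt{\mathrm{MSE}}$, and that the prediction stops being useful once the error reaches a tolerance $\delta_{\max}$:

$$
\delta(t) \approx \delta_0\, e^{\lambda_{\max} t}
\quad\Longrightarrow\quad
T_{\mathrm{pred}} \approx \frac{1}{\lambda_{\max}} \ln\frac{\delta_{\max}}{\delta_0}.
$$

For example, with $\delta_{\max} = 1$, an MSE of $10^{-6}$ ($\delta_0 = 10^{-3}$) gives $T_{\mathrm{pred}} \approx \ln(10^{3})/0.9 \approx 7.7$ time units, whereas an MSE of $10^{-2}$ ($\delta_0 = 10^{-1}$) gives only about 2.6 time units. This illustrates why lowering the MSE extends, but never removes, the predictability limit.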

7. Conclusions

This work presented the formulation of the Euler-Type Universal Numerical Integrator (E–TUNI), highlighting its structural simplicity and applicability to non-linear dynamics and predictive control problems. In addition to demonstrating its feasibility, the results obtained allow us to discuss some relevant implications. In particular, E–TUNI proves to be a competitive integrator in terms of accuracy, even while maintaining a first-order formulation, and can be naturally incorporated into model-based control schemes. This property suggests that the methodology can serve as a practical alternative to classical integrators in scenarios where algorithmic simplicity and low computational cost are decisive factors.
However, the analysis also reveals limitations. The main one is the dependence on a fixed integration step Δt: the neural network must be retrained whenever the step is changed. This characteristic reduces the flexibility of E–TUNI compared with instantaneous-derivative methodologies such as RKNN and PCNN. Furthermore, although the experiments indicated good performance on low- to medium-dimensional problems, the method's scalability to higher-order dynamical systems has not yet been comprehensively evaluated and remains an open question.
These limitations point to possible future extensions. First, adapting E–TUNI to variable-step schemes could broaden its applicability to systems with multiple dynamical regimes. Second, its integration with modern learning techniques, such as Physics-Informed Neural Networks (PINNs) or hybrid networks based on differentiable programming, could expand the method's scope to more complex multidisciplinary problems, including bioengineering, astrodynamics, and quantitative finance. Finally, a formal stability and error analysis, accompanied by comparative studies with benchmark methods, represents an essential step toward consolidating E–TUNI as a tool for widespread use in the scientific community.
In summary, the results presented here demonstrate the potential of E–TUNI, but they also make it clear that its consolidation as a methodological alternative will depend on further studies of stability, scalability, and integration with emerging machine learning techniques. The method should be understood not simply as a variation of the Euler scheme, but rather as a promising and extensible framework capable of inspiring new lines of research in universal numerical integration.

Author Contributions

P.M.T. designed the proposed model and wrote the main part of the mathematical model. J.C.M., L.A.V.D. and A.M.d.C. supervised the writing of the paper and its technical and grammatical review. P.M.T. and G.S.G. (supervised by J.C.M., L.A.V.D. and A.M.d.C.) developed the software and performed several computer simulations to validate the proposed model. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

I would like to thank my great friend Atair Rios Neto for his valuable suggestions for improving this article, and the reviewers of this journal for their constructive comments. The authors would also like to thank God for making all of this possible.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ABNN    Adams–Bashforth Neural Network
CNN     Convolutional Neural Network
DNN     Deep Neural Networks
E-TUNI  Euler-Type Universal Numerical Integrator
MLP     Multi-Layer Perceptron
MSE     Mean Squared Error
NARMAX  Non-linear AutoRegressive Moving Average with eXogenous input
PCNN    Predictor–Corrector Neural Network
RBF     Radial Basis Function
RKNN    Runge–Kutta Neural Network
SVM     Support Vector Machine
UNI     Universal Numerical Integrator

References

  1. McCulloch, W.; Pitts, W. A logical calculus of the ideas immanent in nervous activity. Bull. Math. Biophys. 1943, 5, 115–133. [Google Scholar] [CrossRef]
  2. Kolmogorov, A.N. On the representation of continuous functions of many variables by superposition of continuous functions of one variable and addition. Dokl. Akad. Nauk SSSR 1957, 114, 953–956. [Google Scholar] [CrossRef]
  3. Charpentier, E.; Lesne, A.; Nikolski, N. Kolmogorov’s Heritage in Mathematics, 1st ed.; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 2004. [Google Scholar]
  4. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
  5. Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst. 1989, 2, 303–314. [Google Scholar] [CrossRef]
  6. Hornik, K.; Stinchcombe, M.; White, H. Multilayer feedforward networks are universal approximators. Neural Netw. 1989, 2, 359–366. [Google Scholar] [CrossRef]
  7. Haykin, S. Neural Networks: A Comprehensive Foundation; Prentice-Hall, Inc.: Hoboken, NJ, USA, 1999. [Google Scholar]
  8. Tasinaffo, P.M.; Gonçalves, G.S.; Cunha, A.M.; Dias, L.A.V. An introduction to universal numerical integrators. Int. J. Innov. Comput. Inf. Control 2019, 15, 383–406. [Google Scholar] [CrossRef]
  9. Henrici, P. Elements of Numerical Analysis; John Wiley and Sons: New York, NY, USA, 1964. [Google Scholar]
  10. Vidyasagar, M. Non-linear Systems Analysis; Electrical Engineering Series; Prentice-Hall, Inc.: Hoboken, NJ, USA, 1978. [Google Scholar] [CrossRef]
  11. Rama Rao, K. A Review on Numerical Methods for Initial Value Problems; Internal Report, INPE-3011-RPI/088; Instituto Nacional de Pesquisas Espaciais (INPE): São José dos Campos, SP, Brazil, 1984. [Google Scholar]
  12. Chen, S.; Billings, S.A. Representations of nonlinear systems: The NARMAX model. Int. J. Control 1989, 49, 1013–1032. [Google Scholar] [CrossRef]
  13. Hunt, K.J.; Sbarbaro, D.; Zbikowski, R.; Gawthrop, P.J. Neural networks for control systems—A survey. Automatica 1992, 28, 1083–1112. [Google Scholar] [CrossRef]
  14. Euler, L.P. Institutiones Calculi Integralis; Impensis Academiae Imperialis Scientiarum: St. Petersburg, Russia, 1768. [Google Scholar]
  15. Zhang, Y.; Li, L.; Yang, Y.; Ruan, G. Euler neural network with its weight-direct-determination and structure-automatic-determination algorithms. In Proceedings of the Ninth International Conference on Hybrid Intelligent Systems, Shenyang, China, 12–14 August 2009; IEEE Computer Society: Shenyang, China, 2009; pp. 319–324. [Google Scholar]
  16. Tasinaffo, P.M. Estruturas de Integração Neural Feedforward Testadas em Problemas de Controle Preditivo. Ph.D. Thesis, INPE-10475-TDI/945. Instituto Nacional de Pesquisas Espaciais (INPE), São José dos Campos, SP, Brazil, 2003. [Google Scholar]
  17. Tasinaffo, P.M.; Dias, L.A.V.; da Cunha, A.M. A qualitative approach to universal numerical integrators (UNIs) with computational application. Hum.-Centric Intell. Syst. 2024, 4, 571–598. [Google Scholar] [CrossRef]
  18. Tasinaffo, P.M.; Dias, L.A.V.; da Cunha, A.M. A quantitative approach to universal numerical integrators (UNIs) with computational application. Hum.-Centric Intell. Syst. 2025, 5, 1–20. [Google Scholar] [CrossRef]
  19. Tasinaffo, P.M.; Gonçalves, G.S.; Marques, J.C.; Dias, L.A.V.; da Cunha, A.M. The Euler-type universal numerical integrator (E-TUNI) with backward integration. Algorithms 2025, 18, 153. [Google Scholar] [CrossRef]
  20. Wang, Y.-J.; Lin, C.-T. Runge-Kutta neural network for identification of dynamical systems in high accuracy. IEEE Trans. Neural Netw. 1998, 9, 294–307. [Google Scholar] [CrossRef]
  21. Uçak, K. A Runge-Kutta neural network-based control method for nonlinear MIMO systems. Soft Comput. 2019, 23, 7769–7803. [Google Scholar] [CrossRef]
  22. Uçak, K. A novel model predictive Runge-Kutta neural network controller for nonlinear MIMO systems. Neural Process. Lett. 2020, 51, 1789–1833. [Google Scholar] [CrossRef]
  23. Chen, R.T.Q.; Rubanova, Y.; Bettencourt, J.; Duvenaud, D. Neural ordinary differential equations. In Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS), Montréal, QC, Canada, 2–8 December 2018; pp. 1–19. [Google Scholar] [CrossRef]
  24. Sage, A.P. Optimum Systems Control; Prentice-Hall, Inc.: Englewood Cliffs, NJ, USA, 1968. [Google Scholar]
  25. Tasinaffo, P.M.; Rios Neto, A. Predictive control with mean derivative based neural Euler integrator dynamic model. Rev. Controle Autom. 2006, 18, 94–105. [Google Scholar] [CrossRef]
  26. Munem, M.A.; Foulis, D.J. Calculus with Analytic Geometry (Volumes I and II); Worth Publishers, Inc.: New York, NY, USA, 1978. [Google Scholar]
  27. Lagaris, I.E.; Likas, A.; Fotiadis, D.I. Artificial neural networks for solving ordinary and partial differential equations. IEEE Trans. Neural Netw. 1998, 9, 987–1000. [Google Scholar] [CrossRef]
  28. Hayati, M.; Karami, B. Feedforward neural network for solving partial differential equations. J. Appl. Sci. 2007, 7, 2812–2817. [Google Scholar] [CrossRef]
  29. Lagaris, I.E.; Likas, A.; Fotiadis, D.I. Neural-network methods for boundary value problems with irregular boundaries. IEEE Trans. Neural Netw. 2000, 11, 1041–1049. [Google Scholar] [CrossRef]
  30. Weinan, E.; Han, J.; Jentzen, A. Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations. Commun. Math. Stat. 2017, 5, 349–380. [Google Scholar] [CrossRef]
  31. Han, J.; Jentzen, A.; Weinan, E. Solving high-dimensional partial differential equations using deep learning. Proc. Natl. Acad. Sci. USA 2018, 115, 8505–8510. [Google Scholar] [CrossRef] [PubMed]
  32. Sacchetti, A.; Bachmann, B.; Löffel, K.; Künzi, U.-M.; Paoli, B. Neural networks to solve partial differential equations: A comparison with finite elements. IEEE Access 2022, 10, 32271–32279. [Google Scholar] [CrossRef]
  33. Zhu, Q.; Yang, J. A local deep learning method for solving high order partial differential equations. Numer. Math. Theor. Meth. Appl. 2022, 15, 42–67. [Google Scholar] [CrossRef]
  34. Lu, L.; Meng, X.; Mao, Z.; Karniadakis, G.E. Deepxde: A deep learning library for solving differential equations. SIAM Rev. 2021, 63, 208–228. [Google Scholar] [CrossRef]
  35. Han, J.; Nica, M.; Stinchcombe, A.R. A derivative-free method for solving elliptic partial differential equations with deep neural networks. J. Comput. Phys. 2020, 419, 109672. [Google Scholar] [CrossRef]
  36. van der Houwen, P.J. The development of Runge-Kutta methods for partial differential equations. Appl. Numer. Math. 1996, 20, 261–272. [Google Scholar] [CrossRef]
  37. Kalman, R.E. A new approach to linear filtering and prediction problems. J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]
  38. Singhal, S.; Wu, L. Training multilayer perceptrons with the extended Kalman algorithm. Adv. Neural Inf. Process. Syst. 1989, 1, 133–140. [Google Scholar]
  39. Rios Neto, A. Stochastic optimal linear parameter estimation and neural nets training in systems modeling. J. Braz. Soc. Mech. Sci. 1997, 19, 138–146. [Google Scholar]
  40. Hagan, M.T.; Menhaj, M.B. Training feedforward networks with the Marquardt algorithm. IEEE Trans. Neural Netw. 1994, 5, 989–993. [Google Scholar] [CrossRef]
Figure 1. Difference between mean derivative and instantaneous derivative functions (Source: see [19]).
Figure 2. A feed-forward neural network project with the concept of mean derivative functions.
Figure 3. Mapping scheme used to characterize the discretization of the solution obtained through E–TUNI (Source: see [19]).
Figure 4. Analytical solution of the dynamical system presented in Example 1.
Figure 5. Graphic scheme of the dynamical system associated with Example 2 (Sources: [16,24,25]).
Figure 6. Orbit radius trajectory for m = 1 and Δt = 0.1. The solid line represents the reference trajectory, and the dashed line represents the actual trajectory traveled by the dynamical system.
Figure 7. Rocket angle thrust estimated by predictive control structure designed with E-TUNI and referring to Figure 6.
Figure 8. Orbit radius trajectory for m = 20 and Δt = 0.1. The solid line represents the reference trajectory, and the dashed line represents the actual trajectory traveled by the dynamical system.
Figure 9. Rocket angle thrust estimated by predictive control structure designed with E–TUNI and referring to Figure 8.
Figure 10. Orbit radius trajectory for m = 10 and Δt = 0.01 (Source: see [16]). The solid line represents the reference trajectory, and the dashed line represents the actual trajectory traveled by the dynamical system.
Figure 11. Rocket angle thrust estimated by predictive control structure designed with E–TUNI and referring to Figure 10 (Source: see [16]).
Figure 12. The E–TUNI was designed with several distinct integration steps. The red solid line represents the exact solution of the considered dynamical system, and the blue dots denote the E–TUNI predictions.
Figure 13. The E–TUNI equivalent to the previous figure, but with the mean derivatives and interpolating parabolas present. The red solid line is the exact solution of the considered dynamical system, the blue lines are the E–TUNI predictions using mean derivatives, and the magenta lines are the interpolating parabolas.
Figure 14. Example of numerical solution of the Lorenz dynamic equation.
Figure 15. The Lorenz equation being solved by three distinct types of UNI architecture.
Table 1. Coordinate points within the interval [t_k, t_{k+1}].

| n | Time | $y(t)$, $\dot{y}(t)$, and $\ddot{y}(t)$ | Determining Form |
| 1 | $t_k$ | $y_{ji}^{k}$ | Initial Instant |
| 2 | $t_{k+1}$ | $y_{ji}^{k+1}$ | Given by E-TUNI |
| 3 | $t_{kx}$ | $\dot{y}_{ji}(t_{kx}) = \tan_{\Delta t}\,\alpha_{ji}^{k}$ | Output of net |
| 4 | $t_{kx}$ | $\ddot{y}_{ji}(t_{kx}) = \tan_{\Delta t}\,\Psi_{ji}^{k}$ | From Equation (33) or (34) |
Table 2. Summary of training performed for Example 3.

| Δt | MSE of Validation Patterns | Direct Approach |
| –    | 2.5624 × 10^-11 | Instantaneous Derivatives |
| 0.01 | 2.4351 × 10^-10 | Mean Derivatives |
| 0.25 | 3.5268 × 10^-8  | Mean Derivatives |
| 1.00 | 2.1592 × 10^-6  | Mean Derivatives |
| 10.0 | 1.7520 × 10^-3  | Mean Derivatives |
Table 3. Learning performance comparison between NARMAX, E–TUNI, and RKNN neural training methods (Example 4).

|                           | NARMAX       | E–TUNI       | RKNN           |
| e_m^2 (Training)          | 2.25 × 10^-8 | 8.67 × 10^-6 | 1.80 × 10^-6   |
| e_m^2 (Validation)        | 2.59 × 10^-8 | 1.04 × 10^-5 | 1.71 × 10^-6   |
| e_m^2 (Testing)           | 2.96 × 10^-8 | 1.13 × 10^-5 | 3.77 × 10^-6   |
| Δt (Training) [s]         | 0.01         | 0.01         | Not applicable |
| Δt (Simulation) [s]       | 0.01         | 0.01         | 0.01           |
| Propagation Interval [s]  | [0, 25]      | [0, 25]      | [0, 25]        |
| Training Patterns         | 1600         | 1600         | 1600           |
| Testing Patterns          | 400          | 400          | 400            |
| Training Algorithm        | LM           | LM           | LM             |
| Hidden Layers (HLs)       | 1            | 1            | 1              |
| Neurons                   | 30           | 30           | 30             |
| Activation Function       | tansig       | tansig       | tansig         |
| Learning Rate             | 0.2          | 0.2          | 0.2            |
| Training Domain           | y1 ∈ [−20, 20], y2 ∈ [−20, 20], y3 ∈ [0, 51] | same | same |