Quantifying Chaos by Various Computational Methods. Part 1: Simple Systems

The aim of the paper was to analyze the given nonlinear problem by different methods of computation of the Lyapunov exponents (Wolf method, Rosenstein method, Kantz method, the method based on the modification of a neural network, and the synchronization method) for the classical problems governed by difference and differential equations (Hénon map, hyperchaotic Hénon map, logistic map, Rössler attractor, Lorenz attractor) and with the use of both Fourier spectra and Gauss wavelets. It has been shown that a modification of the neural network method makes it possible to compute a spectrum of Lyapunov exponents, and then to detect a transition of the system regular dynamics into chaos, hyperchaos, and others. The aim of the comparison was to evaluate the considered algorithms, study their convergence, and also identify the most suitable algorithms for specific system types and objectives. Moreover, an algorithm of calculation of the spectrum of Lyapunov exponents based on a trained neural network has been proposed. It has been proven that the developed method yields good results for different types of systems and does not require a priori knowledge of the system equations.


Introduction
The first part of the present work is focused on the numerical investigation of classical dynamical systems to estimate velocity of divergence of the neighborhood trajectories with the help of a measure coupled with the Kolmogorov entropy [1] (or metrics). In reference [1], based on the mathematical results of Oseledec [2] and Pesin [3], it has been shown that the numerically imposed relations can be treated as exact/true values. The method proposed by Wolf [1] is most widely used to verify and study chaotic dynamics. However, also the Rosenstein [4] and Kantz [5] methods are often employed to estimate the largest Lyapunov exponents. The state-of-the-art of papers devoted to the theoretical background of the Lyapunov exponents and methods of their computations has been carried out by Awrejcewicz et al. [6]. In particular, the method of the choice of an embedding dimension has been described. The method of the correlating dimension, the false nearest neighbor method and the method of gamma-test have been presented based on the Hénon and Lorenz attractors. In particular, the occurrence of high computational difficulties has been observed in the case of the Wolf method and its marginally successful employment to small values of the studied data.
To avoid the abovementioned drawbacks, a novel neural network-based algorithm to estimate the largest Lyapunov exponents by considering only one coordinate has been proposed. In reference [6] have reported the neural network algorithm for computation of a full spectrum of Lyapunov exponents. A comparison of the results obtained by Golovko with the exact values of the Lyapunov exponents of the Lorenz and Hénon systems have exhibited small errors.
In References [7,8], the method of largest Lyapunov exponent computation using the synchronization phenomena of identical systems has been proposed. A few types of coupling have been studied, depending on the type of the considered system. It has been pointed out that large computation time is required to achieve full synchronization.
The method proposed in References [9,10] is particularly suitable to study chaotic dynamics of continuous mechanical systems. It should be emphasized that, owing to the results published by the authors of the present paper, the analysis of nonlinear dynamics based on the estimation of the Lyapunov exponents yields a conclusion that the mentioned problems have not been satisfactorily solved yet [1,4,5,9,10].
More recently, Vallejo and Sanjuan [11,12] have studied the predictability of orbits in coupled systems by means of finite-time Lyapunov exponents. This approach has allowed them to estimate how close the computed chaotic orbits are to the real/true orbits, being characterized by the systems shadowing properties.
It is known that the fundamental property of chaos is the existence of strong sensitivity to a change of the initial conditions. The definition of chaos, given first by Devaney in 1989 [22], includes three fundamental parts. In addition to sensitivity to the variation of the initial conditions, a condition of mixing, known also as the transitivity condition and the regularity condition, measured by the density of the periodic points or classical notion of periodicity is also included. In 1992, Banks et al. [23] proved that the condition of sensitivity to the initial condition can be neglected, i.e., conditions of transitivity and periodicity imply the sensitivity condition.
Knudsen [24] has defined chaos as a function given on a bounded metric space which has a dense orbit and essentially depends on initial conditions.
Owing to the definition proposed by Gulick [25], chaos exists when either there is essential dependence on the initial conditions or a chaotic function has positive Lyapunov exponents in each point of the space and which does not eventually tend to a periodic orbit. This definition has been also employed in the present work.

The Largest Lyapunov Exponent
The following dynamical system was considered: . x = f (x) (1) where x stands for the N-dimensional state vector. Two closed phase points x 1 and x 2 were chosen (in the phase space). They stand for the origins of the trajectories (x 1 (t) and x 2 (t)). The change in the distance d between two corresponding points of these trajectories under evolution of system (1) can be monitored by: Entropy 2018, 20, 175 3 of 28 If the dynamics of system (1) is chaotic, d(t) increases exponentially in time, i.e.,: This yields the average velocity of the exponential divergence of the trajectories: or more precisely: The quantity h is known as the Kolmogorov-Sinai entropy (KS-entropy). Employing the KS-entropy, one can define the studied process, i.e., quantify if the process is regular or chaotic. In particular, if the system dynamics is periodic or quasi-periodic, the distance d(t) is not inversed in time and the KS-entropy is equal to zero (h = 0). If the system dynamics tends to a stable fixed point d(t) → 0, then h < 0. Contrarily, KS-entropy is positive (h > 0) if one deals with chaotic dynamics. KS-entropy is the maximum characteristic Lyapunov exponent that enables one to follow velocity of information lost with respect to the initial system state.

Results
The spectrum of Lyapunov exponents makes it possible to qualitatively quantify a local property with respect to the stability of an attractor. Consider a phase trajectory x(t) of the dynamical system (1), starting from the point x(0) as well as its neighborhood trajectory x 1 (t) as follows: The following function can be constructed: which is defined on the vector of initial displacement → ε (0) such that → ε (0) = ε, where ε → 0. All possible rotations of the initial displacements vector with respect to n directions of the N-dimensional phase space of the Function (7) will suffer the jump-like changes in the finite series of the values λ 1 , λ 2 , λ 3 , . . . , λ n . These values of the function λ are called Lyapunov exponents (LEs). Positive/negative values of LEs can be viewed as a measure of the averaged exponential divergence/convergence of the neighborhood trajectories.
A sum of LEs stands for an averaged divergence of the phase trajectories flow. In the case of a dissipative system, i.e., a system possessing an attractor, this sum is always negative. As numerical case studies show, in some dissipative systems the LEs are invariant with respect to the chosen initial conditions. Hence, a spectrum of LEs can be understood as the property of an attractor.
Usually, LEs are presented in a sequence of LE values in decreasing order. For instance, symbols (+, 0, −) mean that for the analyzed attractor, there is one direction in a 3D space, where exponential stretching is exhibited, the second direction indicates neutral stability, and the third one-exponential compression. It should be noted that all attractors different from stable stationary points always have at least one LE equal to zero (in average sense, all points of a trajectory are bounded by a compact manifold and they cannot exhibit divergence or converge). In what follows, relationships between the Lyapunov exponents and the properties and types of attractors are illustrated and discussed: (1) n = 1. In this case only a stable fixed point can be an attractor (node or focus). There exists one negative LE denoted by λ 1 = (−), (2) n = 2. In 2D systems, there are two types of attractors: stable fixed points and limit cycles.
(3) n = 3. In 3D phase space, there exist four types of attractors: stable points, limit cycles, 2D tori and strange attractors. The following set of LEs characterizes possible dynamical situations to be met: In the majority of the studied problems, it is impossible to give an analytical definition of LEs, since the analytical solution to the governing differential equations would have to be known. However, there exist reliable algorithms to find all Lyapunov exponents numerically.

Benettin Method
We began with the numerical investigation of the Kolmogorov entropy of the Hénon-Heiles model. Numerical computations were carried out with accuracy up to 14 digits by means of employing the so-called method of central points. Observe that a similar method has been used in reference [26].
Based on the Lyapunov exponents, the ergodic properties of dissipative dynamical systems with a few degrees of freedom were numerically studied with the Lorenz system. The system exhibited the exponents spectrum of the (+, 0, −) type, and the exponents had the same values for the orbits beginning from an arbitrary point on the attractor. It means that the ergodic property of a general dynamical system can be quantified by a spectrum of the characteristic Lyapunov exponents. Below, a brief description of the used method is presented.
Let a point x 0 belong to the attractor A of a dynamical system. An evolution trajectory of the point x 0 is referred to as a real/true trajectory. A positive quantity ε, being significantly less than the attractor dimension, is chosen. Furthermore, an arbitrary perturbed point x 0 is chosen in a way to satisfy x 0 − x 0 = ε. The evolution of points x 0 and x 0 is considered in a short time interval T, and new points are denoted by x 1 and x 1 , respectively. A vector ∆x 1 = x 1 − x 1 is called the perturbation vector. The first estimate of the exponent is found with the use of the following formula The time interval T is chosen in a way to keep the amplitude of perturbation less than the linear dimensions of the phase space nonhomogenity and the attractor dimension. The normalized perturbation vector ∆x 1 = ε∆x 1 /||∆x 1 || is taken, and a new perturbed point x 1 = x 1 + ∆x 1 is defined. Finally, the so far described procedure is implemented taking into account x 1 and x 1 instead of x 0 and x 0 , respectively.
After repeating this procedure M times, λ is defined as an arithmetic average of the estimates λ l obtained on each computational step: In order to achieve a higher estimate, one can take large M and carry out computations for a different initial point x 0 . This method can be used when the equations governing the system evolution are known. It should be noted, however, that these equations are usually unknown for the experimental data.
To compute the Lyapunov spectrum numerically, one can use another approach generalizing the Benettin's algorithm. In general, it is necessary to follow a few trajectories of the perturbed points instead of only one, fundamental trajectory (the number of perturbed trajectories is equal to the dimension of the phase space). For this purpose, a numerical approach based on derivation of the dynamic equations in variations can be used [27]. Since the largest LE plays a crucial role in the evolution of all perturbed trajectories, it is necessary to carry out orthogonalization of the perturbation vectors on each step of the algorithm. In what follows, a procedure of numerical estimation of the Lyapunov spectrum of a dynamical system is briefly described. To simplify, the considerations are limited to 3D systems.
Let r 0 stand for a point of the chaotic attractor and ε be a fixed positive number, small in comparison to linear dimensions of the attractor. The points x 0 , y 0 and z 0 are chosen so that the perturbation vectors ∆x 0 = x 0 − r 0 , ∆y 0 = y 0 − r 0 , ∆z 0 = z 0 − r 0 have the length ε and are mutually orthogonal. After a certain small time interval T, the points r 0 , x 0 , y 0 and z 0 are transformed into points r 1 , x 1 , y 1 and z 1 , respectively. Then, new perturbation vectors ∆x 1 = x 1 − r 1 , ∆y 1 = y 1 − r 1 , ∆z 1 = z 1 − r 1 are considered. The orthogonlization using the well-known (in linear algebra) Gramm-Schmidt method is carried out. After this step, the obtained vectors of perturbation ∆x 1 , ∆y 1 , ∆z 1 become orthonormalized, i.e., they are mutually orthogonal and have the unit length. Then, the renormalization of the perturbation vectors is carried out again to get lengths of the vectors in terms of the magnitude ε: .
We take the following perturbed points: Next, the process is repeated, i.e., instead of the points r 0 , x 0 , y 0 and z 0 , the points r 1 , x 1 , y 1 and z 1 are taken into account, respectively.
Repeating the so far described procedure M times, one finds: Then, a spectrum Λ = {λ 1 , λ 2 , λ 3 } of LEs can be found by the following formulas: In this method, the choice of time interval T is crucial. If one takes too large time interval T, then all perturbed trajectories are inclined in the direction corresponding to the maximum LE, and hence the obtained results are not reliable.

Wolf Method
In Reference [1], a novel algorithm to find nonnegative Lyapunov exponents by using a time series has been proposed. It has been illustrated that the Lyapunov exponents are associated with either exponential divergence or convergence of the neighborhood orbits in the considered phase space. In general, the method is applicable only when analytical governing equations are known, and it is based on tracing the large time-consuming increase in the number of elements in a small volume of an attractor.
We defined a Lyapunov exponent and a spectrum of Lyapunov exponents, and then illustrated how the system dynamics depends on the number of exponents with different signs in the spectrum. Our approach included reconstruction of an attractor and investigation of orbital divergence on the possibly smallest distances using the approximate Gramm-Schmidt orthogonalization procedure in the reconstructed phase. In order to estimate the largest Lyapunov exponent, a long trace of time evolution of the chosen pair of the neighborhood orbits was carried out. Note that a particular attention should be paid, since the reconstructed attractor may contain points belonging to different attractors.
Two versions of the method are proposed. The first one includes the so-called fixed evolution time, where the time interval associated with the change of the points is fixed.
The main idea of the proposed method is that the largest Lyapunov exponent is computed based on one time series and used when the equations describing the system evolution are unknown and when it is impossible to measure all remaining phase coordinates.
Consider a time series x(t), t = 1, . . . , N of one coordinate of a chaotic process measured in equal time intervals. The method of mutual information allows one to define the time delay τ, whereas the method of false neighbors yields the dimension of the embedded space m. As a result of the reconstruction, one gets a set of points of the space R m : where i = ((m − 1)τ + 1), . . . , N.
We take a point from the series (3) and denote it by x 0 . In the series (3), one can find a point x 0 , where the relation || x 0 − x 0 ||= ε 0 < ε holds, and where ε is a fixed quantity, essentially less than the dimension of the reconstructed attractor. It is required that the points x 0 and x 0 are separated in time. Then, time evolution of these points is observed on the reconstructed attractor until the distance between points achieves ε max . The new points are denoted by x 1 and x 1 , the distance is ε 0 , and the associate interval of time evolution is denoted by T 1 .
After that, we again consider the Sequence (14) the find the point x 1 , located close to x 1 , where || x 1 − x 1 || = ε 1 < ε holds. Vectors x 1 − x 1 and x 1 − x 1 should possibly have the same direction. Then, the procedure is repeated for points x 1 and x 1 .
By repeating the above procedure M times, the largest Lyapunov exponent is estimated: This method has been employed in the present research to test the accuracy of results by using the classical and known spectra of the Lyapunov exponents of the Hénon map, Rössler equations, chaos and hyperchaos exhibited by the Lorenz system, and McKay-Glass equation [28]. In addition, it has been also employed to study the Belousov-Zhabotinsky reaction [29] and the Couette-Taylor flow [30].
Wolf et al. [1] have pointed out certain restrictions on the choice of the embedding dimension and the time required for the attractor reconstruction to achieve the most accurate estimates of the Lyapunov exponents. Using the Rössler attractor [16] and the Belousov-Zhabotynskiy reaction [29], the authors have demonstrated the effects of the time change during the attractor reconstruction, the time of evolution of the system between steps of the time change, the maximum length of the replacement vector and the minimum length of the exchange vector on the values of the estimated largest Lyapunov exponent. Furthermore, it has been shown that variation (between 0.5 and 1.5) of the time of the system evolution leads to reliable estimates of the studied three chaotic attractors. Also, some data requirements that make it possible to obtain the most accurate estimate of the Lyapunov Entropy 2018, 20, 175 7 of 28 exponent, such as the use of small length scale data as well as some restrictions on the presence of noisy perturbations in the data (static and dynamic), have been discussed.
The proposed algorithms can be used to detect chaos as well as to compute its parameters also for the experimental data with a few positive exponents. Furthermore, numerical studies have presented the topological complexity of chaos (the Lorenz attractor) and have shown that the deterministic chaos can be distinguished from white noise (the Belousov-Zhabotinsky reaction).

Rosenstein Method
Despite this method is simple in realization in comparison to the previous ones and it is characterized by high computational speed, it does not directly yield λ 1 , but rather the function: where x j is a given point, and x j denotes its neighbor.
The algorithm is based on the relationship between d j and the Lyapunov exponents: d j (i) ≈ e λ 1 (i∆t) . The largest Lyapunov exponent is computed by estimating the inclination of the most linear part of the function. It should be mentioned, however, that finding this linear part does not belong to easy tasks.

Kantz Method
The algorithm proposed by Kantz [5] computes the LLE by searching all neighbors in vicinity of the reference trajectory and estimates the average distance between neighbors and the reference trajectory as a function of time (or a relative time multiplied by the data sampling frequency). The algorithm is based on the following formula: where x t stands for an arbitrary signal point; U t is a neighborhood of x t ; x i is a neighbor of x t ; τ-relative time multiplied by the sampling frequency; T-sample size; S(τ)-stretching factor in the region of a linear growth indicating a curve whose slope is equal to LE, i.e., e λτ ∝ e S(τ) . However, the assumption of a linear growth introduces new errors. Despite the fact that the method is useful and accurate for systems with known LEs, the choice of parameters and the region where the mentioned linear growth occurs is, in practice, arbitrary.
The method yields correct results if the value of the Lyapunov exponent is known a priori, and hence the space with the tangent equal to that value can be chosen.

Computation of LLE Based on Synchronization of Nonnegative Feedback
In reference [7], the method of LLE computation based on synchronization of coupled identical systems has been proposed. The following k-dimensional discrete system: has been considered, where y ∈ R k , i ∈ (1, 2, . . . , k). The supplemental system has been proposed in the following way: where x, y, ∆y ∈ R k . Evolution of k-dimensional system is governed by k of LLEs. Consequently, synchronization of the perturbed and nonperturbed systems (19) is guaranteed by the following inequality: (20) where λ max stands for LLEs of the studied systems (18). Figure 1 shows synchronization between perturbed (first equation of (19)) and nonperturbed (second equation of (19)) systems for alogistic map. The synchronization starts at p equal to λ, and this value represents the largest Lyapunov exponent of the system. In reference [8], systems with excitations have been studied. The authors have proposed the following way of coupling of identical systems: The application of this approach is limited to the systems with known equations of evolutions, and the way of introducing the coupling of two identical systems depends on the type of the considered system.

Jacobi Method
This method has been proposed in references [31,32]. The main idea is to use an algorithm, the scheme of which is illustrated in Figure 2. A sphere of small radius is taken. After a few iterations m, a certain operator transforms this sphere into an ellipsoid having 1 , … , half-axes. The sphere is stretched along the axes 1 , … , > , where s is the number of positive LEs. For sufficiently small , the operator is close to the sum of the shear operator and the linear operator A. The LLEs are computed as averaged eigenvalues of the operator A on the whole attractor. In reference [8], systems with excitations have been studied. The authors have proposed the following way of coupling of identical systems: .
The application of this approach is limited to the systems with known equations of evolutions, and the way of introducing the coupling of two identical systems depends on the type of the considered system.

Jacobi Method
This method has been proposed in references [31,32]. The main idea is to use an algorithm, the scheme of which is illustrated in Figure 2. A sphere of small radius ε is taken. After a few iterations m, a certain operator T m transforms this sphere into an ellipsoid having a 1 , . . . , a p half-axes. The sphere is stretched along the axes a 1 , . . . , a s > ε, where s is the number of positive LEs. For sufficiently small ε, the operator T m is close to the sum of the shear operator and the linear operator A. The LLEs are computed as averaged eigenvalues of the operator A on the whole attractor.
A vector ς j is chosen, and a set ς k i (i = 1, . . . , N) of i-th neighborhood vectors is found. The following set of vectors y i ≡ ς k i − ς j , where y i ≤ ε, is taken. After m successive iterations, the operator T m transforms the vector ς j into ς j+m , and the vector ς k i into ς k i+m . Eventually, the vectors y i are transformed into Assuming that the radius ε is sufficiently small, one can introduce the operator A j as follows The operator A j describes the system in variations. To estimate the operator A, the least-square method can be employed: This yields the following system of equations of the dimension n × n: where V, C are the matrices of the dimension n × n, y k i stands for the k-th component of vector y i , and y k i + m is the k-th component of the vector y i + m . If A is a solution of the equations, then the LEs can be found in the following way where e j is a set of basic vectors in a tangent space ς j . In reference [8], systems with excitations have been studied. The authors have proposed the following way of coupling of identical systems: The application of this approach is limited to the systems with known equations of evolutions, and the way of introducing the coupling of two identical systems depends on the type of the considered system.

Jacobi Method
This method has been proposed in references [31,32]. The main idea is to use an algorithm, the scheme of which is illustrated in Figure 2. A sphere of small radius is taken. After a few iterations m, a certain operator transforms this sphere into an ellipsoid having 1 , … , half-axes. The sphere is stretched along the axes 1 , … , > , where s is the number of positive LEs. For sufficiently small , the operator is close to the sum of the shear operator and the linear operator A. The LLEs are computed as averaged eigenvalues of the operator A on the whole attractor.  The algorithm can be realized in a way similar to the computation of LEs of the ODEs given analytically.
Let us choose an arbitrary basis {e s } and then follow the changes in the length of the vector A j e s . As the vectors A j e s grow and their orientations change, it is necessary to perform their orthogonalization and normalization by using, for example, the Gramm-Schmidt procedure. The procedure is then repeated for the new basis.
The mentioned method allows one to estimate a spectrum of nonnegative LEs. However, it has a serious disadvantage-it is highly sensitive to noise and errors.

Modification of the Neural Network Method
We have proposed a novel counterpart method to compute LEs based on a modification of the neural network method (see Figure 3).
A single-layer feed forward neural network presented in Figure 3 has multiple input neurons, a layer of hidden neurons and one output neuron. The following notation is employed: a ij -weight of the connection between the i-th input neuron and the j-th hidden neuron; b i -weight of connection between the i-th hidden neuron and the output neuron. To realize the neural network algorithm, the following criteria were taken into account: (i) the network is sensitive to the input information (information is given in the form of real numbers); (ii) the network is self-organizing, i.e., it yields the output space of solutions only based on the inputs; (iii) the neural network is a network of straight distribution (all connections are directed from input neurons to output neurons); (iv) owing to the synapses tuning, the network exhibits dynamic couplings (in the learning process, the tuning of the synaptic coupling takes place (dW/dt = 0), where W stands for the weighted coefficients of the network).  The hidden layer of neurons contains the hyperbolic tangent, which plays a role of an activation function ( Figure 4). A derivative of the hyperbolic tangent is described by a quadratic function, as it is in the case of a logistic function. However, contrarily to the logistic function, the space of the values of the hyperbolic tangent falls within the interval (−1; 1). This results in higher convergence in comparison to the standard logistic function. Prognosis of ̂ of a scalar time series is made by employing the following formula where stands for the number of neurons, is the number of the searched LE, stands for the The hidden layer of neurons contains the hyperbolic tangent, which plays a role of an activation function ( Figure 4). A derivative of the hyperbolic tangent is described by a quadratic function, as it is in the case of a logistic function. However, contrarily to the logistic function, the space of the values of the hyperbolic tangent falls within the interval (−1; 1). This results in higher convergence in comparison to the standard logistic function.  The hidden layer of neurons contains the hyperbolic tangent, which plays a role of an activation function ( Figure 4). A derivative of the hyperbolic tangent is described by a quadratic function, as it is in the case of a logistic function. However, contrarily to the logistic function, the space of the values of the hyperbolic tangent falls within the interval (−1; 1). This results in higher convergence in comparison to the standard logistic function.  Prognosis ofx k of a scalar time series x k is made by employing the following formulâ a ij x k−j (22) where n stands for the number of neurons, d is the number of the searched LE, a ij stands for the n × (d + 1) matrix of coefficients, and b i is the vector of the length n. The matrix a ij contains the coupling forces with respect to the network input, the vector b i is used to control the input of each neuron to the network output, whereas the vector a i0 is used for relatively simple learning based on data with nonzero averaged value. Weights a and b are chosen in a probabilistic way, and the dimension of the searched solution is decreased in the process of learning. The associated Gaussian is chosen in a way to have initial standard distribution 2 −j , centered with respect to zero in order to promote the most recent time delays (small values of j) in the phase space. The coupling forces are chosen in a way to minimize the averaged one step mean square error of a forecast: During the training of the network, sensitivity of the output is defined by computing partial derivatives of all averaged points of the time series in each time step x k−j : In the case of the network given by (22), the partial derivatives have the following form: The largest value j is the optimal embedding dimension, and the key role is played byŜ(j) as in the false nearest neighbors method. The individual values ofŜ(j) yield a quantitative estimate of the importance of each time step using the associated terms of the autocorrelation function or coefficients of the associated linear model.
The weights of the trained neural network are substituted to the matrix of solutions, and the input data are used to define the initial state. The computation of the spectrum is realized by employment of the generalized Benettin's algorithm based on the obtained system of equations.

Gauss Wavelets
In the majority of engineering problems, the Fourier analysis is insufficient, since it deals with the averaged spectrum of the whole studied vibration signal and presents only a general picture of the signal. On the contrary, wavelets play a role of a "microscope" which allows one to observe the spectrum at each time instant, and detect births/deaths of the frequencies in time.
A wavelet transform of a 1D signal is realized with respect to a basis being usually a soliton-like function with given properties. The basis is obtained by displacement and tension/compression of a function called a wavelet.
In the present work, the Gauss wavelets, defined as derivatives of the Gauss function, were used. Higher-order derivatives have many zero moments, and hence they allow one to obtain information about higher-order features hidden in the investigated signal. The 8th order Gauss wavelets of the of the following form were employed:

Analysis of Classical Dynamical Systems by LEs and Gauss Wavelets
In this section, simple classical systems (Figures 5-9) have been studied with emphasis put on a comparison of the LEs (Tables 1-5) obtained using the Wolf, Rosenstein, Kantz and neural network methods. The convergence of the mentioned methods, depending on the number of iteration steps, has been illustrated and discussed (Tables 6-10). The Benettin method has been used as a reference because for most systems, there are no analytically calculated spectra of Lyapunov exponents. Moreover, the Benettin method calculates Lyapunov exponents based on the system equations.

Logistic Map
A logistic map describes how the population changes with respect to time: Here, takes the values from 0 to 1 and presents the population in the n-th year, whereas 0 denotes the initial population (in the year 0); R is a positive parameter characterizing an increase in the population (computations were carried out for R = 4). The first Lyapunov exponent and the Kaplan-Yorke dimension have been estimated by Sprott [33,34]. He has obtained: λ1 = 0.693147181, and the Kaplan-Yorke dimension: 1.0. The power spectrum is noisy and it is not possible to distinguish the dominating frequency. A similar situation is exhibited by the Gauss wavelet, where a large set of frequencies is visible. Dynamics of LLE changing increases for r > 3.
As can be seen in Table 1, all computational methods were compared with Benettin's original results. Good coincidence was exhibited by the neural network method, the Rosenstein method, the Kantz method, and the method of synchronization. The Wolf method gave decreased/increased value of LLE in comparison to the original value.

Logistic Map
A logistic map describes how the population changes with respect to time: Here, X n takes the values from 0 to 1 and presents the population in the n-th year, whereas X 0 denotes the initial population (in the year 0); R is a positive parameter characterizing an increase in the population (computations were carried out for R = 4). The first Lyapunov exponent and the Kaplan-Yorke dimension have been estimated by Sprott [33,34]. He has obtained: λ 1 = 0.693147181, and the Kaplan-Yorke dimension: 1.0.

Logistic Map
A logistic map describes how the population changes with respect to time: Here, takes the values from 0 to 1 and presents the population in the n-th year, whereas 0 denotes the initial population (in the year 0); R is a positive parameter characterizing an increase in the population (computations were carried out for R = 4). The first Lyapunov exponent and the Kaplan-Yorke dimension have been estimated by Sprott [33,34]. He has obtained: λ1 = 0.693147181, and the Kaplan-Yorke dimension: 1.0. The power spectrum is noisy and it is not possible to distinguish the dominating frequency. A similar situation is exhibited by the Gauss wavelet, where a large set of frequencies is visible. Dynamics of LLE changing increases for r > 3.
As can be seen in Table 1, all computational methods were compared with Benettin's original results. Good coincidence was exhibited by the neural network method, the Rosenstein method, the Kantz method, and the method of synchronization. The Wolf method gave decreased/increased value of LLE in comparison to the original value.

Logistic Map
A logistic map describes how the population changes with respect to time: Here, takes the values from 0 to 1 and presents the population in the n-th year, whereas 0 denotes the initial population (in the year 0); R is a positive parameter characterizing an increase in the population (computations were carried out for R = 4). The first Lyapunov exponent and the Kaplan-Yorke dimension have been estimated by Sprott [33,34]. He has obtained: λ1 = 0.693147181, and the Kaplan-Yorke dimension: 1.0. The power spectrum is noisy and it is not possible to distinguish the dominating frequency. A similar situation is exhibited by the Gauss wavelet, where a large set of frequencies is visible. Dynamics of LLE changing increases for r > 3.
As can be seen in Table 1, all computational methods were compared with Benettin's original results. Good coincidence was exhibited by the neural network method, the Rosenstein method, the Kantz method, and the method of synchronization. The Wolf method gave decreased/increased value of LLE in comparison to the original value.

Logistic Map
A logistic map describes how the population changes with respect to time: Here, takes the values from 0 to 1 and presents the population in the n-th year, whereas 0 denotes the initial population (in the year 0); R is a positive parameter characterizing an increase in the population (computations were carried out for R = 4). The first Lyapunov exponent and the Kaplan-Yorke dimension have been estimated by Sprott [33,34]. He has obtained: λ1 = 0.693147181, and the Kaplan-Yorke dimension: 1.0. The power spectrum is noisy and it is not possible to distinguish the dominating frequency. A similar situation is exhibited by the Gauss wavelet, where a large set of frequencies is visible. Dynamics of LLE changing increases for r > 3.
As can be seen in Table 1, all computational methods were compared with Benettin's original results. Good coincidence was exhibited by the neural network method, the Rosenstein method, the Kantz method, and the method of synchronization. The Wolf method gave decreased/increased value of LLE in comparison to the original value.

Logistic Map
A logistic map describes how the population changes with respect to time: Here, takes the values from 0 to 1 and presents the population in the n-th year, whereas 0 denotes the initial population (in the year 0); R is a positive parameter characterizing an increase in the population (computations were carried out for R = 4). The first Lyapunov exponent and the Kaplan-Yorke dimension have been estimated by Sprott [33,34]. He has obtained: λ1 = 0.693147181, and the Kaplan-Yorke dimension: 1.0. The power spectrum is noisy and it is not possible to distinguish the dominating frequency. A similar situation is exhibited by the Gauss wavelet, where a large set of frequencies is visible. Dynamics of LLE changing increases for r > 3.
As can be seen in Table 1, all computational methods were compared with Benettin's original results. Good coincidence was exhibited by the neural network method, the Rosenstein method, the Kantz method, and the method of synchronization. The Wolf method gave decreased/increased value of LLE in comparison to the original value.
The power spectrum is noisy and it is not possible to distinguish the dominating frequency. A similar situation is exhibited by the Gauss wavelet, where a large set of frequencies is visible. Dynamics of LLE changing increases for r > 3.
As can be seen in Table 1, all computational methods were compared with Benettin's original results. Good coincidence was exhibited by the neural network method, the Rosenstein method, the Kantz method, and the method of synchronization. The Wolf method gave decreased/increased value of LLE in comparison to the original value.

Hénon Map
The Hénon map takes a point (X n , Y n ) and maps it into another point by the following formulas: The following parameters are fixed for numerical experiments: a = 1.4, b = 0.3. Since the Equations (28) do not correspond to a real object, the parameters are replaced with fixed values. Sprott [34] has computed the Lyapunov spectrum and the Kaplan-Yorke dimension of the map using the Benettin method [27] by solving (28). He has obtained the following LEs: λ 1 = 0.419217, λ 2 = −1.623190, and the Kaplan-Yorke dimension: 1.258267.
Similarly to the logistic map, the power spectrum exhibits a uniform noisy shape. However, one can distinguish a dominating frequency (ω 1 ≈ 0, 45). This frequency is also visible on the wavelet spectrum as a region of the largest amplitudes along the whole signal (brighter regions in the graph). Plots of the change in the LLE correlate with bifurcation diagrams for the same interval of changes in the parameters a and b. Dynamics of the LLE changes increases with the increase in both control parameters. Starting with the graphs of LEs for a given set of control parameters, the system mainly remains in a periodic regime, but it exhibits chaotic dynamics for large values of the control parameters.
Beginning from the results shown in Table 2, the majority of the employed computational methods yielded good results. However, the most accurate results were obtained by the neural network method (for whole spectrum of LEs), the Rosenstein method, the Kantz method, and the method of synchronization (in the case of LLEs). The Wolf method gave decreased estimated values of the LLEs.   Table 4. Lyapunov exponents spectrum and LLEs computed by different methods (Rössler attractor).  Table 6. Fourier power spectra (a) and Gauss wavelet spectra (b) obtained for ∆t = 1, 2 and the LLEs computed by different methods (logistic map).      Table 7. Fourier power spectra (a) and Gauss wavelet spectra (b) obtained for ∆t = 1, 2 and the computed LLEs by different methods (Hénon map Table 7. Fourier power spectra (a) and Gauss wavelet spectra (b) obtained for ∆ = 1, 2 and the computed LLEs by different methods (Hénon map).

Hyperchaotic Generalised Hénon Map
To obtain the hyperchaotic Hénon map, one needs to take a point (X n , Y n , Z n ) and map it into the following one: The computations were carried out for the following fixed parameters: a = 3.4, b = 0.1. The Lyapunov spectrum reported in reference [14] is: 0.276; 0.257; 4.040.
One can distinguish a large number of frequencies in the power spectrum. Frequencies with the largest amplitude are located in the interval [0.15; 0.3] (frequencies ω 1 − ω 4 ), but the remaining part of the spectrum is noisy. This interval corresponds to the brightest region on the Gauss wavelet, which is correlated with the values of the power spectrum. Changes in LLEs coincide with the bifurcation diagrams constructed for the same intervals of changes in the control parameters a and b. Dynamics of LLEs increases with the increase in the control parameters. As in the case of the Hénon map, the chart of LEs for the selected control parameters exhibits, for a majority of studied parameters, periodic dynamics. It transits into chaos for a ≈ 1.4, and is almost suddenly shifted into hyperchaos (2 positive LEs).
Good results were obtained by the Benettin, Rosenstein and synchronization methods (divergence from the third decimal place). The neural network yielded slightly increased estimates of two first LEs, whereas the third LE was estimated almost exactly. The Kantz method gave a decreased result in comparison to reference data. The Wolf method resulted in the largest error.

Rössler Attractor
The following Rössler system of ODEs was investigated: and the computations were carried out for the following fixed parameters a = b = 0.2 and c = 5.7.
The power spectrum contains the fundamental frequency ω 1 , which is accompanied by damped bursts (frequencies ω 2 − ω 10 ). In the whole time interval, the Gauss wavelet exhibits the brightest region of the fundamental frequency with darker peaks going to zero. Thus, the picture is analogous to the power spectrum. Contrarily to the studied maps, the bifurcation diagrams have a more complex structure. However, there is still correlation with the changes in LLEs for the corresponding control parameters. The parameter b has the smallest influence on the change in LLE. Graphs of LLEs also exhibit a more complex structure. Borders of different vibration kinds have complex forms, which illustrates the increase in the system complexity. Aside from the chaos and hyperchaos zones, there are drops indicating 3 positive LEs. Amabili et al. [35] have suggested to call all chaotic oscillations, for which at least two positive Lyapunov exponents exist, by hyperchaotic.
As far as Table 4 is considered, the best results were yielded by the Benettin and Rosenstein methods. The method of neural networks gave very good results in the case of estimates of two first LEs, but underestimated the third exponent. The Wolf method yielded smaller value of the first exponent compared to the reference data. The most underestimated results were given by the Kantz method.
The carried out numerical experiments showed that changing the sampling frequency did not affect the power spectrum and wavelet spectrum. This was also validated by results obtained by the Benettin, neural networks, and Rosenstein methods, which yielded the results very close to original ones. The Kantz method gave underestimated results for different sampling frequency, correlating with the results obtained for the standard sample size.

Lorenz Attractor
The system is described by the following ODEs: where r stands for the normalized Rayleigh number (nondimensional number defining fluid behavior under gradient): In the above equation, the following notation is used: g-gravity of Earth; L-characteristic dimension of the fluid space; ∆T-temperature difference between fluid walls; ν-kinematic fluid viscosity, χ-thermal conductivity of the fluid; β-coefficient of heat fluid extension; σ-Prandtl number (takes into account heat source property) governed by the following equation where: ν = η/ρ-kinematic viscosity, η-dynamic viscosity, ρ-density, α = ℵ ρC p -temperature transfer coefficient, ℵ-heat transfer coefficient, C p -specific heat capacity under constant pressure; and ρ-information about the geometry of the convective cell.
The power spectrum of the attractor decreases uniformly when approaching a finite frequency, and there are no frequencies with a strongly dominating amplitude. The latter observation has been also verified by the Gauss wavelet spectrum. The bifurcation diagrams, similar to those for the Rössler system, exhibit a complex structure, but the correlation to the LLEs change is conserved. The richest/lowest dynamics of LLE is obtained for changing parameter r/σ. Based on the reported graphs of LEs, one can conclude that the system dynamics is fully chaotic. There are also narrow windows of hyperchaotic dynamics.
A comparison of the results reported in Table 5 with the original results exhibits an excellent coincidence of the Benettin method (original results) and the neural network method (+4.79%). The Wolf and Rosenstein methods yielded the underestimated results of the LLE value. The worst estimation has been obtained by Kantz method.
Changing the sampling frequency did not change Fourier and wavelet power spectra. This was also validated by the Benettin and Rosenstein methods, which yield the results very close to the original values in spite of the arbitrary choice of the sampling frequency.

Concluding Remarks
The analysis of the dynamics of the studied classical system by different methods leads to a conclusion that the most perspective and useful is the modified method of neural networks [9,10]. It gives excellent convergence to the original results and, as the only one (besides of the Benettin method), allows to compute the spectrum of all Lyapunov exponents. In addition, very good results were obtained by the Rosenstein and Kantz methods for all studied systems. However, this method can be used to estimate only the largest Lyapunov exponents.
As far as the convergence is considered, the Wolf method yielded either over-or underestimated values of LEs. The method of synchronization worked reasonably well for the maps, but it was not useful in studying differential equations (the Rössler or Lorenz systems). The mentioned systems require the use of another type of coupling, which is a drawback of the method.
It should be emphasized that this part of the paper serves as a preliminary study of a more complicated nonlinear continuous structural system, which is studied in Part 2. The carried out analysis of the works devoted to feasible methods for computation of Lyapunov exponents shows that there is no universal, verified, and general method to compute the exact (in the sense of numerics) values of the Lyapunov exponents. This observation leads to the conclusion that there is a need to employ qualitatively different methods to check the reliability of "true chaotic results". Furthermore, the carried out analysis is a helping tool to study systems of an infinite dimension. Such an analysis is the subject of the second part of the paper.