Article

An Improved Equilibrium Optimizer Algorithm and Its Application in LSTM Neural Network

School of Electronics and Information Engineering, Hebei University of Technology, Tianjin 300401, China
* Authors to whom correspondence should be addressed.
Symmetry 2021, 13(9), 1706; https://doi.org/10.3390/sym13091706
Submission received: 6 May 2021 / Revised: 5 September 2021 / Accepted: 6 September 2021 / Published: 15 September 2021

Abstract

An improved equilibrium optimizer (EO) algorithm is proposed in this paper to address premature and slow convergence. Firstly, a highly stochastic chaotic mechanism is adopted to initialize the population and expand the search range. Secondly, the ability to escape local optima during the global search is enhanced by assigning adaptive weights and setting adaptive convergence factors. In addition, 25 classical benchmark functions are used to validate the algorithm. As revealed by the analysis of the accuracy, speed, and stability of convergence, the improved EO (IEO) algorithm proposed in this paper significantly outperforms other meta-heuristic algorithms. In practice, the distribution of logging data is asymmetric because most of the data are unlabeled, and traditional classification models have difficulty accurately predicting the location of oil layers. In this paper, the oil layers relevant to oil exploration are predicted using long short-term memory (LSTM) networks. Due to the large amount of data involved, however, it is difficult to tune the parameters manually. For this reason, the IEO algorithm is applied to optimize the parameters of the LSTM for improved performance, and the resulting IEO-LSTM is applied to oil layer prediction. As indicated by the results, the proposed model outperforms currently popular optimization algorithms, including particle swarm optimization (PSO) and the genetic algorithm (GA), in terms of accuracy, absolute error, root mean square error and mean absolute error.

1. Introduction

Most of the optimization problems arising from various engineering settings are highly complex and challenging. Traditional approaches tend to apply specific solutions to specific problems, so they are highly specialized but not universally applicable. Typified by linear programming and convex optimization [1], traditional methods suffer from several drawbacks: the problem-solving process is extremely complicated and prone to local optimal solutions, which makes realistic problems difficult to solve. Consequently, it is now common practice to use meta-heuristic optimization algorithms. Meta-heuristic algorithms are becoming popular because they are easy to implement, can bypass local optima, and can be applied to a wide range of problems across different disciplines. They solve optimization problems by mimicking biological or physical phenomena and can be grouped into three main categories: evolution-based algorithms (EA) [2], physics-based algorithms [3], and swarm intelligence (SI) algorithms [4]. They all perform well in combinatorial optimization [5], feature selection [6], image processing [7], data mining [8], and many other fields. In recent years, new meta-heuristic algorithms based on mimicking natural behavior have been proposed one after another, including particle swarm optimization (PSO) [9,10], grey wolf optimization (GWO) [11], the seagull optimization algorithm (SOA) [12], the whale optimization algorithm (WOA) [13,14], the cuckoo search algorithm (CSA) [15,16], the marine predator algorithm (MPA) [17,18], the coyote optimization algorithm (COA) [19], the carnivorous plant algorithm (CPA) [20], the transient search algorithm (TSA) [21], genetic algorithms (GA) [22] and more.
Inspired by control volume mass balance models used to estimate both dynamic and equilibrium states, the equilibrium optimizer (EO) was proposed by Faramarzi and Heidarinejad in 2019 [23]. A mass balance equation describes the concentration of non-reactive components in a control volume according to the chosen source-sink model until an equilibrium state, corresponding to the optimal result, is reached. However, the EO algorithm suffers from slow convergence and a tendency to fall into local optima. Thus, an improved algorithm is proposed in this paper. Firstly, allowing for the instability caused by traditional random population initialization, chaotic mapping is used to initialize the population so that the search agents are distributed more comprehensively. Secondly, adaptive weights and adaptive convergence factors are introduced to improve the global search ability in the early stage and avoid falling into local optima, while the local search ability in the later stage is also improved. Extensive comparative experiments are conducted to validate the improved algorithm. Herein, six popular swarm optimization algorithms are chosen for comparison, together with twenty-five widely used test functions: five high-dimensional unimodal functions, three high-dimensional multimodal functions, seven fixed-dimensional multimodal functions, and ten composition functions. For each function, 30 independent runs were performed and the results recorded. According to the experimental results, IEO achieves a higher convergence speed and higher search accuracy on most functions.
Like other neural network models, the LSTM neural network requires several parameters, such as the learning rate and the batch size, to be set manually. These parameters have a significant impact on the topology of the network model, and the prediction performance of models trained with different parameters varies significantly. To determine appropriate model parameters, this paper applies the EO algorithm with chaos-based adaptive weights to optimize the internal parameters of the LSTM, and then uses the optimized network for oil layer prediction. Experiments show that the improved EO algorithm can globally optimize the LSTM parameters, reduce randomness, and improve the prediction of oil layers by the LSTM.

2. Equilibrium Optimizer Algorithm and Its Improvement

2.1. Description of Equilibrium Optimizer Algorithm

In the equilibrium optimizer, a solution plays the role of a particle, and its concentration plays the role of the particle position in PSO. A mass balance equation is used to describe the concentration of non-reactive components in a control volume; it reflects the physical process of mass entering, leaving, and being generated within the volume. It is usually described by a first-order differential equation, expressed as follows:
V \frac{dC}{dt} = Q C_{eq} - Q C + G
where V represents the control volume; V dC/dt denotes the rate of change of mass in the control volume; C indicates the concentration in the control volume; Q is the volumetric flow rate into or out of the control volume; C_eq denotes the concentration inside the control volume in the absence of mass generation (i.e., the equilibrium state); G represents the rate of mass generation inside the control volume.
Solving the differential equation described by Equation (1) yields:
C = C_{eq} + (C_0 - C_{eq}) F + \frac{G}{\lambda V} (1 - F)
F = \exp[-\lambda (t - t_0)]
where C_0 represents the initial concentration of the control volume at time t_0; F indicates the exponential term coefficient; λ refers to the turnover rate. For an optimization problem, the concentration C on the left side of the equation represents the newly generated current solution; C_0 denotes the solution obtained in the previous iteration; C_eq represents the best solution currently found by the algorithm; t is defined as a function of the iteration number (Iter), which decreases as the number of iterations increases:
t = \left(1 - \frac{Iter}{Max\_iter}\right)^{\left(a_2 \frac{Iter}{Max\_iter}\right)}
Iter and Max_iter denote the current and maximum number of iterations, respectively; a_2 is a constant used to manage the exploitation ability of the algorithm.
The mathematical model is as follows:
(1)
Initialization: In the initial state of the optimization process, the equilibrium concentration is unknown. Therefore, the algorithm performs random initialization within the upper and lower limits of each optimization variable.
C_i^0 = C_{\min} + r_i (C_{\max} - C_{\min}), \quad i = 1, 2, \ldots, n
where C min and C max represent the lower limit and upper limit vector of the optimized variable, respectively; r i denotes the random number vector of individual i, the dimension of which is consistent with that of the optimized space. The value of each element is a random number ranging from 0 to 1.
(2)
Update of the balance pool: The equilibrium state in Equation (2) is selected from the five current optimal candidate solutions to improve the global search capability of the algorithm and avoid the prospect of falling into low-quality local optimal solutions. The equilibrium state pool formed by these candidate solutions is expressed as follows:
C_{eq,pool} = \{ C_{eq(1)}, C_{eq(2)}, C_{eq(3)}, C_{eq(4)}, C_{eq(ave)} \}
where C_eq(1), C_eq(2), C_eq(3), and C_eq(4) represent the four best solutions found so far in the iteration, and C_eq(ave) refers to the average of these four solutions. Each of the five candidate solutions is selected with the same probability, that is, 0.2.
(3)
Index term F: To better balance the local search and global search of the algorithm, Equation (3) is improved as follows:
F = a_1 \, \mathrm{sign}(r - 0.5) \left[ e^{-\lambda t} - 1 \right]
where a_1 represents the weight constant coefficient of the global search; sign denotes the sign function; r and λ are random vectors whose dimension is consistent with that of the search space, with each element being a random number ranging from 0 to 1.
(4)
Generation rate G: To enhance the local optimization capability of the algorithm, the generation rate is designed as follows:
G = G_0 e^{-\lambda (t - t_0)} = G_0 F
where:
G_0 = GCP (C_{eq} - \lambda C)
GCP = \begin{cases} 0.5\, r_1, & r_2 \geq GP \\ 0, & r_2 < GP \end{cases}
where r_1 and r_2 are random numbers in [0, 1] and GP is the generation probability. When GP = 0.5, the algorithm achieves a balance between global and local optimization.
(5)
Solution update: To address the update optimization problem, the solution of the algorithm is updated as follows according to Equation (2):
C = C_{eq} + (C - C_{eq}) F + \frac{G}{\lambda V} (1 - F)
F is obtained using Equation (9), and V is taken as a unit volume (V = 1). The right side of Equation (10) comprises three terms: the first is the equilibrium concentration, while the second and third represent the changes in concentration.
Although the equilibrium optimizer (EO) performs well in optimization, it is prone to falling into local optima, so its global search capability and convergence are inadequate [24]. Therefore, an inertia-weight optimization algorithm based on chaos theory is proposed in this paper to improve both the global search capability and the local convergence.

2.2. Improved Equilibrium Optimizer (IEO)

2.2.1. Improved Strategies for EO

(1)
Population initialization based on chaos theory
The diversity of the initial population has a considerable impact on the convergence speed and accuracy of a swarm intelligence algorithm. However, the basic equilibrium optimizer cannot ensure population diversity by randomly initializing the population. Featuring regularity, randomness, and ergodicity, chaotic mapping has been widely used in the optimization of intelligent algorithms [25]. In this paper, the information in the solution space is fully extracted and captured through chaotic mapping to enhance the diversity of the initial concentrations. One mapping mechanism widely used in chaos theory research is the logistic map, whose iterative equation is expressed as follows:
\lambda_{t+1} = \mu \lambda_t (1 - \lambda_t), \quad t = 0, 1, 2, \ldots, T
where λ_t represents a random number uniformly distributed in the interval [0, 1], with λ_0 ∉ {0, 0.25, 0.5, 0.75, 1}; T indicates the preset maximum number of chaotic iterations, T ≥ D; μ denotes the chaos control parameter. When μ = 4, the system is in a completely chaotic state.
Equation (8) is first used to generate a set of chaotic variables. Then, the chaotic sequence is mapped into the search interval [f_min, f_max] to obtain the concentration in each dimension according to Equation (9). Finally, the fitness function is used to calculate the fitness value corresponding to each concentration:
X_{ij} = f_{\min} + \lambda_j (f_{\max} - f_{\min})
where X_ij represents the coordinate of the j-th dimension of the i-th search agent, and λ_j refers to the j-th element of the randomly ordered chaotic sequence λ.
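For illustration, a minimal Python sketch of this chaotic (logistic-map) initialization is given below. It is not the authors' implementation; the function and parameter names (chaotic_init, pop_size, dim, lb, ub) are illustrative, and lb and ub correspond to the lower and upper bounds of the search space.
import numpy as np

def chaotic_init(pop_size, dim, lb, ub, mu=4.0, seed=None):
    """Logistic-map initialization: one chaotic sequence per search agent,
    mapped from [0, 1] into the search range [lb, ub]."""
    rng = np.random.default_rng(seed)
    pop = np.empty((pop_size, dim))
    for i in range(pop_size):
        lam = rng.uniform(0.01, 0.99)              # avoid the excluded starting points
        seq = np.empty(dim)
        for j in range(dim):
            lam = mu * lam * (1.0 - lam)           # logistic map: lam_{t+1} = mu * lam_t * (1 - lam_t)
            seq[j] = lam
        pop[i] = lb + seq * (ub - lb)              # map chaotic values into [lb, ub]
    return pop

# example: 30 search agents in a 10-dimensional space bounded by [-100, 100]
population = chaotic_init(pop_size=30, dim=10, lb=-100.0, ub=100.0, seed=1)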
(2)
Adaptive inertia weights based on a nonlinear decreasing strategy
The inertia weight plays an essential role in optimizing the objective function. When the inertia weight is larger, the algorithm performs better in global search and can explore a larger area; conversely, when the inertia weight is smaller, the algorithm has better local search capability and can search the area around the optimal solution. However, the inertia weight is effectively constant at 1 in standard EO, which is not ideal in either the early or the late stage of the algorithm, so appropriate weighting is required to improve its optimization capability. During EO optimization, an inappropriate linear inertia weight adjustment strategy will reduce the convergence speed. Therefore, this paper proposes an approach that adaptively changes the weight with the number of iterations according to a nonlinear decreasing strategy. It is expressed as follows:
w(t) = 2 \times \sin\left( \frac{\pi t}{m \times Max\_t} + \pi \right) + n
where m and n represent tunable parameters; Max_t indicates the maximum number of iterations; t denotes the current iteration number.
By combining Equations (7) and (8), the concentration update formula is expressed as follows:
C = w(t) C_{eq} + (C - C_{eq}) F + \frac{G}{\lambda V} (1 - F)
The fluctuation range of the convergence factor is shown in Figure 1. Extensive experiments show that the effect is best when m and n are both set to 2.
A large inertia weight and rapid decline in the early stage are conducive to improving the global search capability, while a small inertia weight and slow decline in the later stage help to enhance local search and improve the convergence speed. In the later stage of the algorithm, the updated concentration increasingly approaches the optimal concentration to reach an equilibrium state; thus, the small adaptive weight can be used to fine-tune the concentration at this time while improving the local optimization capability.
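As a quick illustration of the weight formula above (not the authors' code), the following Python sketch evaluates the adaptive weight with m = n = 2; the values decrease smoothly from about 2 at the first iteration to 0 at the last.
import numpy as np

def adaptive_weight(t, max_t, m=2, n=2):
    """Nonlinear decreasing inertia weight: large early (global search), small late (local search)."""
    return 2.0 * np.sin(np.pi * t / (m * max_t) + np.pi) + n

print([round(adaptive_weight(t, 1000), 3) for t in (0, 250, 500, 750, 1000)])  # 2.0 ... 0.0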

2.2.2. IEO Algorithm Process

The main process of the IEO algorithm is detailed as follows:
(1)
The parameters are initialized, including a_1, a_2, and GP. Then the maximum number of iterations and the fitness function of the candidate concentrations are set;
(2)
The fitness value of each search agent is calculated according to Equation (5), and C_eq1, C_eq2, C_eq3, C_eq4, and C_eq(ave) are determined;
(3)
C_eq,pool is calculated according to Equation (6), and the concentration of each search agent is updated using Equation (15);
(4)
The output result is saved when the maximum number of iterations is reached.
The pseudo-code of the IEO algorithm is shown in Algorithm 1.
Algorithm 1 Pseudo-code of the IEO algorithm.
  Initialisation:
particle populations i = 1, …., n;
free parameters a1;a2; GP:
a1 = 2;
a2 = 1;
GP = 0.5;
While (Iter < Max_iter) do
    for (i = 1:number of particles(n)) do
    Calculate fitness of i-th particle
          if (fit(Ci) < fit(Ceq1)) then
             Replace Ceq1 with Ci and fit(Ceq1) with fit(Ci)
          else if (fit(Ci) > fit(Ceq1) & fit(Ci) < fit(Ceq2)) then
             Replace Ceq2 with Ci and fit(Ceq2) with fit(Ci)
          else if (fit(Ci) > fit(Ceq1) & fit(Ci) > fit(Ceq2) & fit(Ci) < fit(Ceq3)) then
             Replace Ceq3 with Ci and fit(Ceq3) with fit(Ci)
          else if (fit(Ci) > fit(Ceq1) & fit(Ci) > fit(Ceq2) & fit(Ci) > fit(Ceq3) & fit(Ci) < fit(Ceq4)) then
             Replace Ceq4 with Ci and fit(Ceq4) with fit(Ci)
      end if
    end for
Cave = (Ceq1 + Ceq2 + Ceq3 + Ceq4)/4
Construct the equilibrium pool Ceq,pool = {Ceq1, Ceq2, Ceq3, Ceq4, Ceq(ave)}
Accomplish memory saving (if Iter > 1)
    Assign t = (1 − Iter/Max_iter)^(a2 · Iter/Max_iter)
          for i = 1:number of particles (n)
             Randomly choose one candidate from the equilibrium pool (vector)
             Generate random vectors λ, r
             Construct F = a1 · sign(r − 0.5)·[e^(−λt) − 1]
             Construct GCP = 0.5·r1 if r2 ≥ 0.5, otherwise 0
             Construct G0 = GCP·(Ceq − λC)
             Construct G = G0 · F
             Update concentrations C = w(t)·Ceq + (C − Ceq)·F + (G/(λV))·(1 − F)
      end for
    Iter = Iter+1
End while
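The following Python sketch puts Algorithm 1 together in compact form. It is a hedged reconstruction rather than the authors' code: it reuses the chaotic_init and adaptive_weight helpers sketched in Section 2.2.1, takes a1 = 2, a2 = 1, GP = 0.5 as above, sets V = 1, and keeps only a simple four-slot memory for the equilibrium pool.
import numpy as np

def ieo(obj, dim, lb, ub, pop_size=30, max_iter=1000, a1=2.0, a2=1.0, gp=0.5, seed=0):
    rng = np.random.default_rng(seed)
    C = chaotic_init(pop_size, dim, lb, ub, seed=seed)          # chaotic initialization
    eq_fit = np.full(4, np.inf)                                 # fitness of the four best candidates
    eq_C = np.zeros((4, dim))                                   # the four best candidates
    for it in range(max_iter):
        for i in range(pop_size):                               # update the equilibrium candidates
            f = obj(C[i])
            for k in range(4):
                if f < eq_fit[k]:
                    eq_fit = np.insert(eq_fit, k, f)[:4]
                    eq_C = np.insert(eq_C, k, C[i], axis=0)[:4]
                    break
        pool = np.vstack([eq_C, eq_C.mean(axis=0)])             # equilibrium pool (4 best + average)
        t = (1 - it / max_iter) ** (a2 * it / max_iter)
        w = adaptive_weight(it, max_iter)                       # adaptive inertia weight
        for i in range(pop_size):
            Ceq = pool[rng.integers(5)]                         # candidate chosen with probability 0.2
            lam, r = rng.random(dim), rng.random(dim)
            F = a1 * np.sign(r - 0.5) * (np.exp(-lam * t) - 1)
            gcp = 0.5 * rng.random() if rng.random() >= gp else 0.0
            G = gcp * (Ceq - lam * C[i]) * F
            C[i] = np.clip(w * Ceq + (C[i] - Ceq) * F + (G / lam) * (1 - F), lb, ub)  # V = 1
    return eq_C[0], eq_fit[0]

# example: minimise the sphere function in 10 dimensions
best_x, best_f = ieo(lambda x: np.sum(x ** 2), dim=10, lb=-100.0, ub=100.0, max_iter=200)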

2.3. Numerical Optimization Experiments

In order to validate the IEO algorithm, it is compared with 6 well-known meta-heuristic algorithms, including equilibrium optimizer (EO), whale optimization algorithm (WOA), marine predator algorithm (MPA), grey wolf optimization algorithm (GWO), particle swarm optimization (PSO), and genetic algorithm (GA). Furthermore, a total of 25 standard test functions are used to test its performance. In the following part, the benchmark function, experimental settings, and simulation results will be detailed.

2.3.1. Experimental Settings and Algorithm Parameters

The experiments were performed on Windows 10 (64-bit) with MATLAB R2012a, on an Intel Core i5-3210M CPU (2.50 GHz) with 8 GB of RAM. Since the results of each algorithm are randomized during operation, each comparison algorithm was run independently 30 times on every test function to ensure the objectivity of the comparison. The population size is 30, and the maximum number of iterations is set to 1000. The average and standard deviation are then calculated from the collected data.

2.3.2. Benchmark Test Functions

The parameter settings of these 25 standard test functions have been widely adopted to verify meta-heuristics. In general, it is difficult for one algorithm to perform well on all test functions; therefore, the 25 selected test functions have different characteristics, so that the experimental results better reflect the optimization capability of the algorithms.
The five high-dimensional unimodal test functions (F1(x)–F5(x)) have a single global optimum and no local optima, so they can be used to test the local search capability and convergence speed of an algorithm. In contrast, the three high-dimensional multimodal test functions (F6(x)–F8(x)) have multiple local optima, which makes them harder to solve than unimodal functions; they therefore verify the global search capability of an algorithm. The seven fixed-dimensional multimodal functions (F9(x)–F15(x)) also have multiple local extrema, but their dimensionality is lower than that of the high-dimensional multimodal functions, so the number of local extrema is relatively small; like the high-dimensional multimodal functions, they assess global search performance. To test the comprehensive capability of the algorithms, ten composition functions from CEC2017 are added [26]. In Table 1, Dim indicates the dimension of the standard test function, fmin represents its theoretical optimal value, Range denotes the range of the search space, and N is the number of basic functions.
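To make the distinction concrete, two standard benchmarks of the kind used in such suites are sketched below in Python: the sphere function is a typical high-dimensional unimodal benchmark, while the Rastrigin function is a typical high-dimensional multimodal one. Whether they correspond exactly to particular entries of Table 1 is an assumption made only for illustration.
import numpy as np

def sphere(x):
    """Unimodal benchmark: a single global minimum f(0) = 0 and no local optima."""
    return np.sum(x ** 2)

def rastrigin(x):
    """Multimodal benchmark: many local minima, global minimum f(0) = 0."""
    return np.sum(x ** 2 - 10.0 * np.cos(2.0 * np.pi * x) + 10.0)

x = np.zeros(30)
print(sphere(x), rastrigin(x))   # both evaluate to 0.0 at the global optimum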

2.3.3. Experimental Results and Analysis

In this section, comparative numerical optimization experiments are conducted on EO, WOA, MPA, GWO, PSO, GA, and IEO, respectively. The convergence curve is presented in Figure A1, and the stability curve of each group optimization algorithm is presented in Figure A2. The test results of the unimodal function, high-dimensional multimodal function, fixed-dimensional multimodal function and composition functions are listed in Table A1, where Ave and Std represent the average solution in 30 independent experiments and the standard deviation of the results in 30 runs, respectively.
As shown in Figure A1, the convergence speed of IEO on F1(x)–F5(x) is significantly higher than that of the other algorithms, and it achieves higher convergence accuracy. Although the convergence on each function is relatively slow, the IEO convergence value is the best, suggesting that IEO is less prone to local optima. In general, on the high-dimensional unimodal functions the IEO algorithm demonstrates superior and consistent performance, implying its effectiveness in handling high-dimensional search spaces. On F6(x)–F8(x), the convergence speed of IEO is the highest and the convergence accuracy is improved, suggesting that IEO has a better ability to jump out of local optima than the other algorithms. The average (Ave) and standard deviation (Std) also show that IEO performs best and most stably. Overall, IEO solves the high-dimensional multimodal test functions with higher stability than the other algorithms, which evidences its strong global search capability. On F9(x)–F15(x), IEO not only converges fastest on the first five functions but also achieves higher convergence accuracy; on F15(x), despite its slow convergence, IEO jumps out of the local optimum in the later stage and finds a value closer to the theoretical optimum. In general, the capability of IEO to escape local optima on the fixed-dimensional multimodal functions is better than that of the other algorithms, so the algorithm is verified as effective and stable. On F16(x)–F25(x), the convergence speed of IEO is faster than that of EO and compares favorably with the other algorithms; on F16(x), F18(x), F21(x) and F23(x) it converges fastest and obtains the optimal value. As shown in Table A1, on F1(x)–F5(x) the IEO algorithm outperforms the other algorithms: in these five test functions, IEO produces better averages and standard deviations, and its convergence accuracy is significantly improved. Moreover, in four of the functions IEO achieved the ideal optimal value of 0 in every run, and its variance is considerably smaller than that of the other algorithms, which indicates its stability. On the high-dimensional multimodal test functions F6(x)–F8(x), IEO outperforms the other algorithms to a significant extent, performing more consistently with the smallest average and standard deviation; notably, it achieves the theoretical optimal value of 0 on F6(x) and F8(x). On the fixed-dimensional multimodal functions (F9(x)–F15(x)), comparison of the means and standard deviations shows that IEO performs better on F9(x), F12(x), and F13(x), while MPA performs best on F10(x), followed by IEO. On F11(x), the average and standard deviation of IEO are not as good as those of MPA, GWO, PSO and GA.
On F14(x), EO achieves the best average and standard deviation, followed by IEO. On F15(x), IEO and MPA jointly achieve the best average and standard deviation. As a whole, IEO shows stronger global search capability on the fixed-dimensional multimodal functions. On F16(x)–F25(x), the minimum value is achieved on F16(x), F18(x), F21(x) and F22(x). On the remaining functions, IEO performs better than EO, indicating that the IEO improvement strategy is successful: IEO shows a significant improvement over EO in solving composition functions, has a clear advantage over the other algorithms, and ranks second on most of these functions. Figure A2 shows box plots of the test results for the selected functions; they indicate the distribution of the results of the 30 independent runs of each function and demonstrate the stability of IEO and the other algorithms in a more intuitive way. IEO exhibits outstanding stability on all functions, which allows it to outperform the other algorithms; more specifically, its bottom edge, top edge, and median value for the same function are generally superior to those of the other algorithms.
In general, the IEO optimization algorithm outperforms other algorithms in optimization speed, accuracy and stability.

3. LSTM Model Optimization by Improved IEO Algorithm

Time series data are everywhere, and prediction is an eternal topic for human beings. Gooijer and Hyndman highlight research published by the International Institute of Forecasters (IIF) and key publications in other journals [27], ranging from the early traditional autoregressive integrated moving average (ARIMA) linear model to various nonlinear models (e.g., the SETAR model). Owing to increased computational power and electronic storage capacity, nonlinear forecasting models have grown in a spectacular way and received a great deal of attention during recent decades. The neural network (NN) model is a prominent example of such nonlinear and nonparametric models, which make no assumptions about the parametric form of the functional relationship between the variables. Several studies have applied NNs: Caraka and Chen used vector autoregression (VAR) to improve NNs for space-time pollution data forecasting [28] and then used a VAR-NN-PSO model to predict PM2.5 in air [29]; Suhartono and Prastyo applied the Generalized Space-Time Autoregressive (GSTAR) model to FFNN for forecasting oil production [30]. In recent years, recurrent neural networks (RNN) have frequently been employed for forecasting tasks and have shown promising results, mainly because RNNs can persist information about previous time steps and use that information when processing the current time step. Due to vanishing gradients, however, RNNs are unable to persist long-term dependencies. The long short-term memory (LSTM) network is a type of RNN that was introduced to persist long-term dependencies [31]. By comparing LSTM with a random forest, a standard deep network, and simple logistic regression on a large-scale financial market prediction task, it was shown that LSTM performed better than the others and is naturally suited to this field [32]. A comparison of LSTM with the Prophet model proposed by Facebook showed that LSTM performs better for minimum air temperature prediction [33]. Chih-Hung and Yu-Feng proposed a new forecasting framework with the LSTM model to forecast daily Bitcoin prices, and the results revealed its possible applicability to the prediction of various cryptocurrencies [34]. Helmini and Jihan adopted a special LSTM variant with peephole connections for sales forecasting tasks and showed that both the initial LSTM and the improved LSTM outperform two machine learning models (extreme gradient boosting (XGB) and the random forest regressor (RFR)) [35]. To enhance the performance of the LSTM model, Gers and Schmidhuber proposed a novel adaptive "forget gate" that enables an LSTM cell to learn to reset itself at appropriate times [36], and Karim and Majumdar added an attention model to detect regions of the input sequence that contribute to the class label through the context vector [37]. Saeed and Li proposed a hybrid model called BLSTM, constructing quality intervals based on the high-quality principle, which presented excellent prediction performance [38]. Venskus and Treigys described two methods, LSTM prediction-region learning and wild bootstrapping for the estimation of prediction regions, and the expected results were achieved when they were used for abnormal marine traffic detection [39]. To strengthen the early diagnosis of cardiovascular disease, Zheng and Chen presented an arrhythmia classification method that combines a 2-D CNN and LSTM and uses ECG images as the input data for the model [40].
Currently, LSTM and its variants are among the most mainstream methods for time series forecasting.

3.1. Basic Model of LSTM

The LSTM model introduces gating units into its network topology to control the degree to which previous information influences the current information, thus endowing the model with a longer "memory function". This makes it suitable for solving long-term nonlinear sequence forecasting problems. The LSTM network structure consists of input gates, forget gates, output gates, and a cell state. The basic structure of the network is illustrated in Figure 2.
LSTM is a purpose-built recurrent neural network. Not only does the design of the "gate" structure avoid the vanishing and exploding gradients of traditional recurrent neural networks, it is also effective in learning long-term dependencies. Therefore, the LSTM model, with its memory function, shows a clear advantage in prediction and classification problems on time series.
Let the input sequence be x 1 , x 2 , , x T , and the hidden layer state be h 1 , h 2 , , h T . Then, at time t:
f_t = \sigma(W_f h_{t-1} + U_f x_t + b_f)
i_t = \sigma(W_i h_{t-1} + U_i x_t + b_i)
g_t = \tanh(W_C h_{t-1} + U_C x_t + b_C)
C_t = f_t \odot C_{t-1} + i_t \odot g_t
o_t = \sigma(W_o h_{t-1} + U_o x_t + b_o)
h_t = o_t \odot \tanh(C_t)
where x_t represents the input vector of the LSTM cell; h_t indicates the cell output vector; f_t, i_t, and o_t refer to the forget gate, input gate, and output gate, respectively; C_t denotes the cell state; t stands for the time step; σ and tanh represent the activation functions; W and b denote the weight and bias matrices, respectively; and U stands for the input weight matrix.
The key to LSTM lies in the cell state C t . It maintains the memory of the unit state at time t , and adjusts it through the forget gate f t and input gate i t . The purpose of the forget gate is to make the cell remember or forget its previous state C t 1 . The input gate is purposed to allow incoming signals to update the unit state or prevent it. The output gate aims to control the output of the unit state C t and then transmit it to the next cell. The internal structure of the LSTM unit is comprised of multiple perceptrons. In general, backpropagation algorithms are the most commonly used training method.
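For readers who prefer code to equations, a minimal NumPy sketch of one LSTM time step following the gate equations above is given below; the weight shapes and initialization are illustrative only.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """One LSTM time step following the gate equations above.
    W, U, b are dicts keyed by gate: 'f' (forget), 'i' (input), 'c' (candidate), 'o' (output)."""
    f_t = sigmoid(W['f'] @ h_prev + U['f'] @ x_t + b['f'])   # forget gate
    i_t = sigmoid(W['i'] @ h_prev + U['i'] @ x_t + b['i'])   # input gate
    g_t = np.tanh(W['c'] @ h_prev + U['c'] @ x_t + b['c'])   # candidate cell state
    c_t = f_t * c_prev + i_t * g_t                           # cell state update
    o_t = sigmoid(W['o'] @ h_prev + U['o'] @ x_t + b['o'])   # output gate
    h_t = o_t * np.tanh(c_t)                                 # hidden state
    return h_t, c_t

# illustrative shapes: 4 input features, 8 hidden units
rng = np.random.default_rng(0)
n_in, n_hid = 4, 8
W = {k: rng.standard_normal((n_hid, n_hid)) * 0.1 for k in 'fico'}
U = {k: rng.standard_normal((n_hid, n_in)) * 0.1 for k in 'fico'}
b = {k: np.zeros(n_hid) for k in 'fico'}
h, c = lstm_step(rng.standard_normal(n_in), np.zeros(n_hid), np.zeros(n_hid), W, U, b)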
The LSTM neural network framework used in this paper has one input layer, three hidden layers (64 neurons in the first layer, 16 neurons in the second layer, and 50 neurons in the third, fully connected, layer), and one output layer (two output neurons). Using the Adam optimizer, the learning rate is 0.01, the training loop is executed 30 times, and the batch size is 256.
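A hedged Keras sketch of this architecture is shown below (assuming the tensorflow.keras API). The time window length, feature count, and the ReLU/softmax activations are assumptions made for illustration; they are not specified by the original text.
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense
from tensorflow.keras.optimizers import Adam

time_steps, n_features = 10, 7                  # placeholders: depend on the logging data and window size

model = Sequential([
    LSTM(64, return_sequences=True, input_shape=(time_steps, n_features)),  # first hidden layer
    LSTM(16),                                                               # second hidden layer
    Dense(50, activation='relu'),                                           # third (fully connected) layer
    Dense(2, activation='softmax'),                                         # two-neuron output
])
model.compile(optimizer=Adam(learning_rate=0.01),
              loss='categorical_crossentropy', metrics=['accuracy'])
# model.fit(x_train, y_train, epochs=30, batch_size=256)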

3.2. LSTM Based on Improved EO

As mentioned above, the parameters of the LSTM-based model are set manually, which affects the outcome of the prediction made by the model. Thus, an improved EO-LSTM model is proposed in this paper. Its main purpose is to optimize the relevant parameters of the LSTM through the excellent parameter optimization capability of the IEO algorithm described above, thereby improving the prediction of the LSTM. To ensure the objectivity of the experiment, the LSTM model structure is fixed (the number of nodes in each layer is not modified). The modeling process of the IEO-LSTM model is detailed as follows.
(1)
The relevant parameters are initialized. The parameters of the IEO algorithm are initialized, including the population size, the fitness function, and the free parameter assignments. The parameters of the LSTM algorithm are initialized by setting the time window, the initial learning rate, the initial number of iterations, and the batch size. In this paper, the error rate is used as the fitness function, which is expressed as follows:
fitness = 1 - acc(f; D) = 1 - \frac{1}{m} \sum_{i=1}^{m} \mathbb{I}(f(x_i) = y_i)
where D represents the training set, m denotes the number of samples in the training set, f(x_i) indicates the predicted sample label, y_i is the original sample label, and I(f(x_i) = y_i) is an indicator function: if f(x_i) = y_i, then I(f(x_i) = y_i) = 1; otherwise, I(f(x_i) = y_i) = 0.
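A one-line Python equivalent of this fitness (error rate) is sketched below for clarity; the names are illustrative.
import numpy as np

def fitness(y_pred, y_true):
    """IEO fitness for the LSTM: classification error rate, i.e., 1 - accuracy."""
    return 1.0 - np.mean(np.asarray(y_pred) == np.asarray(y_true))

print(fitness([1, 0, 1, 1], [1, 0, 0, 1]))   # 0.25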
(2)
The LSTM parameters to be optimized form the corresponding particles. The particle structure is (alpha, num_epochs, batch_size), where alpha represents the LSTM learning rate, num_epochs indicates the number of iterations, and batch_size denotes the batch size. These particles are the objects of IEO optimization.
(3)
The concentrations are updated according to Equation (15). Based on the newly obtained concentrations, the fitness values are calculated, and the individual optimal concentration and the global optimal concentration of the particles are updated.
(4)
If the number of iterations reaches the maximum, that is, 30, the LSTM model trained on the optimal particle outputs the prediction; otherwise, return to step (3) and continue iterating.
(5)
The optimal parameters are substituted into the LSTM model to construct the IEO-LSTM model.
The pseudo-code of the IEO-LSTM on oil prediction algorithm is shown in Algorithm 2.
Algorithm 2 Pseudo-code of the IEO-LSTM on oil prediction.
Input:
Training samples set: trainset_input, trainset_output
LSTM initialization parameters
Output:
IEO-LSTM model
1. Initialize the LSTM model:
      assign parameters para = [lr,epoch,batch_size] = [0.01,100,256];
      loss = categorical_crossentropy, optimizer = adam
2. Set IEO equalization parameters and Import trainset to Optimization:
    Parameters:para = [lr,epoch,batch_size]
    ll_para = [0.001, 1, 1];        % Set lower limit for merit search
    ul_para = [0.01, 100, 256];    % Set the upper limit of merit seeking
      IEO_para = [ll, ul, trainset_input, trainset_output, num_epoch = 30];
      Get_fitness_position(IEO) = length(find((trainset_output - LSTM(trainset_input)) ~= 0))/length(trainset_output);
            % Find the best Concentration to minimize the LSTM error rate
return para_best to LSTM model;
save (‘IEO-LSTM.mat’,’LSTM’)
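The sketch below shows, in Python, how the IEO optimizer could be wired to the three LSTM hyper-parameters, following Algorithm 2. It is an illustrative reconstruction, not the authors' code: build_lstm() is an assumed helper returning a compiled model like the one sketched in Section 3.1, ieo() is the sketch given after Algorithm 1, and x_train, y_train, x_val, y_val are placeholders for the (reduced and normalized) logging data.
import numpy as np

def lstm_fitness(params, x_train, y_train, x_val, y_val):
    """Train an LSTM with candidate (alpha, num_epochs, batch_size) and return the error rate."""
    alpha, num_epochs, batch_size = params[0], int(round(params[1])), int(round(params[2]))
    model = build_lstm(learning_rate=alpha)                     # assumed model constructor
    model.fit(x_train, y_train, epochs=num_epochs, batch_size=batch_size, verbose=0)
    y_pred = model.predict(x_val, verbose=0).argmax(axis=1)
    return 1.0 - np.mean(y_pred == y_val.argmax(axis=1))        # 1 - accuracy, as in the fitness above

lb = np.array([0.001, 1, 1])       # lower bounds for [alpha, num_epochs, batch_size]
ub = np.array([0.01, 100, 256])    # upper bounds for [alpha, num_epochs, batch_size]
best_params, best_err = ieo(lambda p: lstm_fitness(p, x_train, y_train, x_val, y_val),
                            dim=3, lb=lb, ub=ub, pop_size=10, max_iter=30)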

3.3. Actual Application in Oil Layer Prediction

3.3.1. Design of Oil Layer Prediction System

In respect of oil logging, reservoir prediction is regarded as crucial for improving the accuracy of oil layer prediction. Herein, we apply IEO-LSTM to oil logging and verify the effectiveness of this algorithm by using oil data provided by three oil fields.
Figure 3 shows the block diagram of the oil layer prediction system based on IEO-LSTM.
The oil layer prediction mainly includes the following processes:
(1)
Data acquisition and preprocessing
In practice, the logging data are classified into two categories after collection. Labeled data obtained through core sampling and laboratory analysis are generally used as the training set, while unlabeled logging data are treated as the test set. Data pre-processing mainly includes denoising, normalization, etc.
Besides, due to the different dimensions and value ranges of the attributes, these data need to be normalized first, so that the sample data lie in [0, 1]. The normalized logging data are then used for training and testing. The formula for sample normalization is expressed as follows:
\bar{x} = \frac{x - x_{\min}}{x_{\max} - x_{\min}}
where x ∈ [x_min, x_max]; x_min represents the minimum value of the attribute in the data sample, and x_max refers to the maximum value of the attribute in the data sample.
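A small Python sketch of this column-wise min-max normalization is given below for illustration.
import numpy as np

def minmax_normalize(x):
    """Scale each logging attribute (column) into [0, 1] using the formula above."""
    x = np.asarray(x, dtype=float)
    x_min, x_max = x.min(axis=0), x.max(axis=0)
    return (x - x_min) / (x_max - x_min)

# example: two attributes measured on very different scales
print(minmax_normalize([[50.0, 2.1], [75.0, 2.5], [100.0, 2.3]]))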
(2)
Selection of sample set and attribute reduction
For the selection of the sample set, the training samples should not only be complete and comprehensive but also closely related to the evaluation of the oil. Moreover, different logging attributes contribute to the prediction of oil to different degrees. In general, there are dozens of logging condition attributes in logging data, but not all of them play a decisive role, which makes attribute reduction necessary. In this paper, a discretization algorithm based on inflection points is applied first, and then a reduction method based on attribute dependence is used to reduce the logging attributes [41].
(3)
IEO-LSTM modeling
First, the LSTM model is established and the training set is inputted after attribute reduction; the IEO algorithm is then used to find the optimal combination of parameters for the LSTM model to obtain a high-precision IEO-LSTM prediction model.
(4)
Prediction
The trained IEO-LSTM model is used to perform prediction and output the results. Before the test set is inputted into the IEO-LSTM model, the reduced test set must be obtained according to the attribute reduction result and normalized according to the same normalization principle used for the training samples.
The pseudo-code of the oil layer prediction system based on IEO-LSTM is shown in Algorithm 3.
Algorithm 3 Pseudo-code of the oil prediction system based on IEO-LSTM.
Input:
Training samples set: trainset_input, trainset_output
Test samples set: testset
Output:
Parameter optimization value
Prediction results of the test samples set
1. Original data initialization, Divide the trainset and testset
2. Initialize the LSTM model:
    assign parameters para = [lr,epoch,batch_size] = [0.01,100,256];
    loss = categorical_crossentropy, optimizer = adam;
3. Set IEO equalization parameters and Import trainset to Optimization:
   Parameters:para = [lr,epoch,batch_size]
    ll =[0.001,1, 1];        % Set lower limit for merit search
    ul = [0.01, 100, 256];      % Set the upper limit of merit seeking
    para = IEO[ll,ul,trainset_input, trainset_output, num_epoch = 30];
    Get_fitness_position(C) = length(find((trainset_output - LSTM(trainset_input)) ~= 0))/length(trainset_output);
        % Find the best Concentration to minimize the LSTM error rate
return para_best to LSTM model;
save (‘IEO-LSTM.mat’,’LSTM’);
4. Import test samples set to IEO-LSTM
load(‘IEO-LSTM.mat’)
predictset = IEO-LSTM(testset)
return predictset

3.3.2. Practical Application

To verify the effectiveness and stability of the proposed IEO-LSTM model, we selected logging data from three actual oil wells for the experiment, and the prediction results are compared with those of the LSTM, PSO-LSTM [42], GA-LSTM [43] and EO-LSTM models.
The logging data of the three selected wells are denoted o1, o2 and o3. Table 2 shows the results obtained after the attribute reduction process for the three wells. It can be seen from the table that there are various redundant condition attributes in the original data; after attribute reduction through the rough set, the significant attributes are selected, which helps to simplify the LSTM model. The data set division of the three wells is shown in Table 3, which gives the distribution ranges of the oil layers and dry layers in the training and test sets.
Table 4 lists the value range of each attribute after attribute reduction for o1. Among them, GR, DT, SP, LLD, LLS, DEN, and K represent natural gamma ray, acoustic time difference, spontaneous potential, deep lateral resistivity, shallow lateral resistivity, compensated density, and potassium, respectively. Table 5 shows the value range of each attribute after attribute reduction for o2, where AC, RT, and RXO denote the acoustic time difference, true formation resistivity, and flushed zone resistivity, respectively. Table 6 lists the value range of each attribute after attribute reduction for o3, where NG and RI represent neutron gamma and invaded zone resistivity, respectively.
The logging curve is obtained after the normalization of the reduced attributes, as shown in Figure 4. The horizontal axis and the vertical axis indicate the depth and the standardized value, respectively.
To conduct better comparative experiments, the LSTM, PSO-LSTM, GA-LSTM, and EO-LSTM models are established for comparison against the IEO-LSTM model. These optimized classification models are then applied to oil layer prediction on the test set.
In addition to the prediction accuracy of the test set, the following performance indicators are defined to assess the performance of the prediction model:
MSE = \frac{1}{m} \sum_{i=1}^{m} (f(x_i) - y_i)^2
RMSE = \sqrt{\frac{1}{m} \sum_{i=1}^{m} (f(x_i) - y_i)^2}
MAE = \frac{1}{m} \sum_{i=1}^{m} |f(x_i) - y_i|
where, f x i and y i represent the predicted output value and the expected output value, respectively.
RMSE measures the deviation between the observed value and the true value, while MAE is the average of the absolute errors. Smaller RMSE and MAE indicate better performance of the algorithm model. Therefore, RMSE is used as a criterion to evaluate the accuracy of each algorithm model, and MAE is often used as the actual measure of forecast error because it better reflects the real magnitude of the prediction error.
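For reference, a short Python sketch of these two indicators is given below; it is illustrative only.
import numpy as np

def rmse(y_pred, y_true):
    """Root mean square error."""
    y_pred, y_true = np.asarray(y_pred, float), np.asarray(y_true, float)
    return np.sqrt(np.mean((y_pred - y_true) ** 2))

def mae(y_pred, y_true):
    """Mean absolute error."""
    y_pred, y_true = np.asarray(y_pred, float), np.asarray(y_true, float)
    return np.mean(np.abs(y_pred - y_true))

print(rmse([1, 0, 1, 1], [1, 0, 0, 1]), mae([1, 0, 1, 1], [1, 0, 0, 1]))   # 0.5 0.25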
For a comprehensive experimental comparison, the number of times of training is set to 30, based on which the average value of the performance indicators is calculated for each model.
Table 7 shows the prediction performance of each model on the test sets of the three wells (o1, o2, o3). On the whole, the prediction accuracy of IEO-LSTM is the most satisfactory: compared with the other models, it performs better on each index, and its accuracy is about 5% higher than that of the original LSTM model on average.

4. Conclusions

In this paper, an improved equilibrium optimizer is proposed, which applies chaos theory to initialize the population and relies on adaptive weights to enhance the global and local search capability. Firstly, 30 independent experiments are conducted on 25 benchmark functions against six popular swarm intelligence algorithms. According to the experimental results, IEO achieves a significant improvement in convergence speed and convergence accuracy compared with the traditional EO, and it also outperforms PSO, GA, MPA, WOA, and GWO in convergence speed, accuracy, and stability.
The poor robustness of LSTM is attributed to the arbitrary setting of parameters such as the learning rate, the number of iterations, and the batch size. To improve the performance of LSTM, IEO and LSTM are combined, with the aforementioned three parameters globally optimized to improve the prediction performance of LSTM.
Finally, IEO-LSTM is validated on well-logging oil layer prediction. The data set includes the logging data collected from three different wells. The prediction accuracy of the proposed model is significantly higher than that of LSTM, and IEO-LSTM also performs better than PSO-LSTM, GA-LSTM, and EO-LSTM. When the number of iterations is 200, IEO-LSTM is superior to the other classification models in terms of classification accuracy, root mean square error, and mean absolute error. In comparison with the actual oil layer distribution, the prediction of IEO-LSTM shows an average classification error of about 5% relative to the original oil layer distribution.
Moreover, this method can also be applied to generic optimization problems in other fields, which suggests promising prospects for practical application. However, LSTM is a special form of time-recursive neural network that was introduced to overcome a fatal defect of RNNs, and it still has its own shortcomings: it only partially solves the gradient problem of RNNs and still struggles with data of very large magnitude, and it is time-consuming and computationally intensive during optimization. These two problems will be addressed in our subsequent research.

Author Contributions

Conceptualization, P.L. and K.X.; methodology, P.L. and K.X.; software, P.L.; validation, P.L., Y.P. and S.F.; formal analysis, P.L. and K.X.; investigation, P.L. and K.X.; resources, P.L., Y.P. and S.F.; data curation, P.L., Y.P. and S.F.; writing—original draft preparation, P.L.; visualization, Y.P. and S.F.; supervision, K.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (No. U1813222, No. 42075129), Hebei Province Natural Science Foundation (No. E2021202179), Key Research and Development Project from Hebei Province (No. 19210404D, No. 20351802D, No. 21351803D).

Data Availability Statement

All data are available upon request from the corresponding author.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A

The experimental results in Section 2 are presented below, including statistics for each algorithm, convergence curves, and box line plots.
Figure A1. Convergence curves for the 25 benchmark functions in Table 1. Panels (a)–(y) show the convergence curves for F1–F25, respectively.
Figure A2. Stability (box plots) of the various swarm optimization algorithms; "+" represents outliers in the calculation results. Panels (a)–(y) show the box plots for F1–F25, respectively.
Table A1. Optimization results and comparison for functions.
Function | Metric | IEO | EO | WOA | MPA | GWO | PSO | GA
Unimodal Benchmark Functions
F1 | Ave | 0.00e+00 | 2.45e−85 | 1.41e−30 | 3.27e−21 | 1.07e−58 | 0.32e+00 | 0.55e+00
F1 | Std | 0.00e+00 | 9.08e−85 | 4.91e−30 | 4.61e−21 | 2.77e−58 | 0.21e+00 | 1.23e+00
F2 | Ave | 0.00e+00 | 2.91e−48 | 1.06e−21 | 0.07e+00 | 1.36e−34 | 1.04e+00 | 0.01e+00
F2 | Std | 0.00e+00 | 2.92e−48 | 2.39e−21 | 0.11e+00 | 1.40e−34 | 0.46e+00 | 0.01e+00
F3 | Ave | 0.00e+00 | 1.77e−21 | 5.39e−07 | 2.78e+02 | 9.25e−16 | 8.14e+01 | 8.46e+02
F3 | Std | 0.00e+00 | 8.45e−21 | 2.93e−06 | 4.00e+02 | 2.44e−15 | 2.13e+01 | 1.62e+02
F4 | Ave | 0.00e+00 | 2.84e−21 | 0.73e−01 | 6.78e+00 | 1.84e−14 | 1.51e+00 | 4.56e+00
F4 | Std | 0.00e+00 | 9.26e−21 | 0.40e+00 | 2.94e+00 | 3.40e−14 | 0.22e+00 | 0.59e+00
F5 | Ave | 6.70e−05 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.09e+00 | 0.07e+00 | 0.11e+00
F5 | Std | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.03e+00 | 0.01e+00 | 0.04e+00
Multimodal Benchmark Functions
F6 | Ave | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 4.84e+01 | 0.00e+00
F6 | Std | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 3.44e+00 | 3.97e+00
F7 | Ave | 8.84e−16 | 5.15e−15 | 7.40e+00 | 9.69e−12 | 0.00e+00 | 1.20e+00 | 0.18e+00
F7 | Std | 0.00e+00 | 1.45e−15 | 9.90e+00 | 6.13e−12 | 0.00e+00 | 0.73e+00 | 0.15e+00
F8 | Ave | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.60e+00 | 0.01e+00 | 0.66e+00
F8 | Std | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.00e+00 | 0.09e+00 | 0.01e+00 | 0.19e+00
Fixed-Dimension Multimodal Benchmark Functions
F9 | Ave | 1.00e+00 | 1.00e+00 | 2.11e+00 | 1.00e+00 | 0.16e+00 | 2.18e+00 | 1.00e+00
F9 | Std | 2.12e−17 | 1.11e−16 | 2.50e+00 | 2.47e−16 | 0.84e+00 | 2.01e+00 | 8.84e−12
F10 | Ave | 3.58e−04 | 3.69e−49 | 0.00e+00 | 3.07e−04 | 1.62e−04 | 5.61e−04 | 2.69e−03
F10 | Std | 1.54e−04 | 2.36e−04 | 0.00e+00 | 4.09e−15 | 3.33e−04 | 4.38e−04 | 4.84e−03
F11 | Ave | −1.03e+00 | −1.03e+00 | −1.03e+00 | −1.03e+00 | −1.03e+00 | −1.03e+00 | −1.03e+00
F11 | Std | 5.63e−06 | 2.04e+16 | 4.2e−07 | 4.46e−01 | 5.46e−09 | 6.64e−16 | 2.39e−08
F12 | Ave | 3.00e+00 | 3.00e+00 | 3.00e+00 | 3.00e+00 | 3.00e+00 | 3.00e+00 | 3.00e+00
F12 | Std | 5.95e−17 | 4.96e−16 | 4.22e−15 | 1.95e−15 | 2.29e−02 | 1.38e−15 | 2.45e−07
F13 | Ave | −3.86e+00 | −3.86e+00 | −3.85e+00 | −3.86e+00 | 0.51e+00 | −3.86e+00 | −3.86e+00
F13 | Std | 3.17e−16 | 2.15e−15 | 0.00e+00 | 2.42e−15 | 2.73e−15 | 2.68e−15 | 2.85e−08
F14 | Ave | −3.32e+00 | −3.28e+00 | −2.98e+00 | −3.32e+00 | −2.84e+00 | −3.26e+00 | −3.27e+00
F14 | Std | 1.59e−15 | 0.06e+00 | 0.38e+00 | 1.14e−11 | 4.68e−02 | 6.05e−02 | 5.99e−02
F15 | Ave | −1.05e+01 | −10.27e+00 | −9.34e+00 | −1.05e+01 | −7.11e+00 | −7.25e+00 | −7.77e+00
F15 | Std | 1.92e−07 | 1.48e+00 | 2.41e+00 | 3.89e−11 | 3.11e+00 | 3.66e+00 | 3.73e+00
Composition Functions
F16 | Ave | 2.20e+03 | 2.30e+03 | 2.33e+03 | 2.20e+03 | 2.31e+03 | 2.32e+03 | 2.28e+03
F16 | Std | 6.54e−07 | 4.13e+01 | 4.34e+01 | 8.66e−06 | 2.28e+01 | 6.34e+01 | 5.84e+01
F17 | Ave | 2.28e+03 | 2.30e+03 | 2.36e+03 | 2.28e−06 | 2.31e+03 | 2.37e+03 | 2.41e+03
F17 | Std | 0.54e+00 | 0.35e+00 | 1.43e+02 | 3.38e+01 | 8.52e+00 | 3.20e+02 | 1.00e+02
F18 | Ave | 2.61e+03 | 2.62e+03 | 2.65e+03 | 2.61e+03 | 2.62e+03 | 2.71e+03 | 2.65e+03
F18 | Std | 3.55e+00 | 8.76e+00 | 2.88e+01 | 5.06e+00 | 1.04e+01 | 4.18e+01 | 1.28e+01
F19 | Ave | 2.63e+02 | 2.74e+03 | 2.76e+03 | 2.51e+03 | 2.75e+03 | 2.77e+02 | 2.79e+03
F19 | Std | 5.13e+00 | 4.30e+00 | 6.11e+01 | 3.58e+01 | 1.40e+01 | 1.16e+02 | 2.31e+01
F20 | Ave | 2.91e+03 | 2.92e+03 | 2.95e+03 | 2.88e+03 | 2.94e+03 | 2.90e+03 | 2.99e+02
F20 | Std | 2.29e+01 | 2.39e+02 | 2.36e+01 | 2.09e+01 | 2.68e+01 | 8.60e+01 | 8.07e+01
F21 | Ave | 2.78e+03 | 3.03e+03 | 3.60e+03 | 2.79e+03 | 3.23e+03 | 3.08e+03 | 3.20e+03
F21 | Std | 5.11e+01 | 3.07e+02 | 6.33e+02 | 1.10e+02 | 4.32e+02 | 3.57e+02 | 2.17e+02
F22 | Ave | 3.08e+03 | 3.09e+03 | 3.13e+03 | 3.09e+03 | 3.10e+03 | 3.16e+03 | 3.14e+03
F22 | Std | 0.33e+00 | 6.52e+00 | 4.27e+01 | 0.22e+00 | 2.11e+01 | 7.44e+01 | 3.53e+01
F23 | Ave | 3.11e+03 | 3.32e+03 | 3.41e+03 | 3.12e+03 | 3.34e+03 | 3.19e+03 | 3.62e+03
F23 | Std | 5.57e−05 | 1.41e+02 | 1.77e+02 | 5.00e−05 | 9.99e+01 | 6.07e+01 | 2.02e+02
F24 | Ave | 3.15e+03 | 3.18e+03 | 3.36e+03 | 3.14e+03 | 3.22e+03 | 3.26e+03 | 3.30e+03
F24 | Std | 3.14e+00 | 3.47e+01 | 1.12e+02 | 8.63e+00 | 5.49e+01 | 8.77e+01 | 8.67e+01
F25 | Ave | 4.73e+03 | 3.35e+05 | 9.75e+05 | 3.40e+03 | 1.03e+06 | 1.08e+04 | 1.39e+06
F25 | Std | 3.51e+01 | 4.11e+05 | 8.97e+05 | 3.51e+01 | 1.09e+06 | 4.96e+03 | 1.10e+06

References

  1. Ma, X.S.; Li, Y.L.; Yan, L. A comparative review of traditional multi-objective optimization methods and multi-objective genetic algorithms. Electr. Drive Autom. 2010, 32, 48–53. [Google Scholar]
  2. Koziel, S.; Michalewicz, Z. Evolutionary Algorithms, homomorphous mappings, and constrained parameter optimization. Evol. Comput. 1999, 7, 19–44. [Google Scholar] [CrossRef]
  3. Hashim, F.A.; Houssein, E.H.; Mabrouk, M.S.; Al-Atabany, W.; Mirjalili, S. Henry gas solubility optimization: A novel physics-based algorithm. Future Gener. Comput. Syst. 2019, 101, 646–667. [Google Scholar] [CrossRef]
  4. Karaboga, D.; Basturk, B. A powerful and efficient algorithm for numerical function optimization: Artificial bee colony (ABC) algorithm. J. Glob. Optim. 2007, 39, 459–471. [Google Scholar] [CrossRef]
  5. Fister, I.; Yang, X.S.; Brest, J.; Fister, D. A Brief Review of Nature-Inspired Algorithms for Optimization. Elektrotehniski Vestn. Electrotech. Rev. 2013, 80, 116–122. [Google Scholar]
  6. Brezočnik, L.; Fister, I.; Podgorelec, V. Swarm Intelligence Algorithms for Feature Selection: A Review. Appl. Sci. 2018, 8, 9. [Google Scholar] [CrossRef] [Green Version]
  7. Omran, M.; Engelbrecht, A.; Salman, A. Particle Swarm Optimization Methods for Pattern Recognition and Image Processing. Ph.D. Thesis, University of Pretoria, Pretoria, South Africa, 2004. [Google Scholar]
  8. Martens, D.; Baesens, B.; Fawcett, T. Editorial survey: Swarm intelligence for data mining. Mach. Learn. 2010, 82, 1–42. [Google Scholar] [CrossRef] [Green Version]
  9. Kennedy, J.; Eberhart, R. Particle Swarm Optimization. In Proceedings of the IEEE International Conference on Neural Networks, Perth, Australia, 27 November–1 December 1995; pp. 1942–1948. [Google Scholar]
  10. Zhang, J.; Xia, K.; He, Z.; Fan, S. Dynamic Multi-Swarm Differential Learning Quantum Bird Swarm Algorithm and Its Application in Random Forest Classification Model. Comput. Intell. Neurosci. 2020, 2020, 6858541. [Google Scholar] [CrossRef] [PubMed]
  11. Alejo-Reyes, A.; Cuevas, E.; Rodríguez, A.; Mendoza, A.; Olivares-Benitez, E. An Improved Grey Wolf Optimizer for a Supplier Selection and Order Quantity Allocation Problem. Mathematics 2020, 8, 1457. [Google Scholar] [CrossRef]
  12. Dhiman, G.; Kumar, V. Seagull optimization algorithm: Theory and its applications for large-scale industrial engineering problems. Knowl.-Based Syst. 2018, 165, 169–196. [Google Scholar] [CrossRef]
  13. Mirjalili, S.; Lewis, A. The Whale Optimization Algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
  14. Pan, Y.-K.; Xia, K.-W.; Niu, W.-J.; He, Z.-P. Semisupervised SVM by Hybrid Whale Optimization Algorithm and Its Application in Oil Layer Recognition. Math. Probl. Eng. 2021, 2021, 5289038. [Google Scholar] [CrossRef]
  15. Yang, X.S.; Deb, S. Engineering Optimisation by Cuckoo Search. Int. J. Math. Model. Numer. Optim. 2010, 1, 330–343. [Google Scholar] [CrossRef]
  16. He, Z.; Xia, K.; Niu, W.; Aslam, N.; Hou, J. Semisupervised SVM Based on Cuckoo Search Algorithm and Its Application. Math. Probl. Eng. 2018, 2018, 8243764. [Google Scholar] [CrossRef] [Green Version]
  17. Faramarzi, A.; Heidarinejad, M.; Mirjalili, S.; Gandomi, A.H. Marine Predators Algorithm: A nature-inspired metaheuristic. Expert Syst. Appl. 2020, 152, 113377. [Google Scholar] [CrossRef]
  18. Yang, W.B.; Xia, K.W.; Li, T.J.; Xie, M.; Song, F. A Multi-Strategy Marine Predator Algorithm and Its Application in Joint Regularization Semi-Supervised ELM. Mathematics 2021, 9, 291. [Google Scholar] [CrossRef]
  19. Pierezan, J.; Coelho, L. Coyote Optimization Algorithm: A New Metaheuristic for Global Optimization Problems. In Proceedings of the 2018 IEEE Congress on Evolutionary Computation (CEC), Rio de Janeiro, Brazil, 8–13 July 2018; IEEE: Piscataway, NJ, USA, 2018. [Google Scholar]
  20. Kiong, S.C.; Ong, P.; Sia, C.K. A carnivorous plant algorithm for solving global optimization problems. Appl. Soft Comput. 2020, 98, 106833. [Google Scholar]
  21. Yang, W.B.; Xia, K.W.; Li, T.J.; Xie, M.; Song, F.; Zhao, Y.L. An Improved Transient Search Optimization with Neighborhood Dimensional Learning for Global Optimization Problems. Symmetry 2021, 13, 244. [Google Scholar] [CrossRef]
  22. Qais, M.H.; Hasanien, H.M.; Alghuwainem, S. Transient search optimization: A new meta-heuristic optimization algorithm. Appl. Intell. 2020, 50, 3926–3941. [Google Scholar] [CrossRef]
  23. Faramarzi, A.; Heidarinejad, M.; Stephens, B.; Mirjalili, S. Equilibrium optimizer: A novel optimization algorithm. Knowl.-Based Syst. 2020, 191, 105190. [Google Scholar]
  24. Yang, L.Q. A novel improved equilibrium global optimization algorithm based on Lévy flight. Astronaut. Meas. Technol. 2020, 40, 66–73. [Google Scholar]
  25. Feng, Y.H.; Liu, J.Q.; He, Y.C. Dynamic population firefly algorithm based on chaos theory. J. Comput. Appl. 2013, 54, 796–799. [Google Scholar]
  26. Wu, G.; Mallipeddi, R.; Suganthan, P.N. Problem Definitions and Evaluation Criteria for the CEC 2017 Competition on Constrained Real-Parameter Optimization; Technical Report; Nanyang Technological University: Singapore, 2016. [Google Scholar]
  27. Gooijer, J.; Hyndman, R.J. 25 years of time series forecasting. In Monash Econometrics & Business Statistics Working Papers; Monash University: Clayton, Australia, 2005; Volume 22, pp. 443–473. [Google Scholar]
  28. Caraka, R.E.; Chen, R.C.; Yasin, H.; Suhartono, S.; Lee, Y.; Pardamean, B. Hybrid Vector Autoregression Feedforward Neural Network with Genetic Algorithm Model for Forecasting Space-Time Pollution Data. Indones. J. Sci. Technol. 2021, 6, 243–268. [Google Scholar] [CrossRef]
  29. Caraka, R.E.; Chen, R.C.; Toharudin, T.; Pardamean, B.; Yasin, H.; Wu, S.H. Prediction of Status Particulate Matter 2.5 using State Markov Chain Stochastic Process and Hybrid VAR-NN-PSO. IEEE Access 2019, 7, 161654–161665. [Google Scholar] [CrossRef]
  30. Suhartono, S.; Prastyo, D.D.; Kuswanto, H.; Lee, M.H. Comparison between VAR, GSTAR, FFNN-VAR and FFNN-GSTAR Models for Forecasting Oil Production. Mat. Malays. J. Ind. Appl. Math. 2018, 34, 103–111. [Google Scholar] [CrossRef]
  31. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  32. Fischer, T.; Krauss, C. Deep learning with long short-term memory networks for financial market predictions. Eur. J. Oper. Res. 2017, 270, 654–669. [Google Scholar] [CrossRef] [Green Version]
  33. Toharudin, T.; Pontoh, R.S.; Caraka, R.E.; Zahroh, S.; Lee, Y.; Chen, R.C. Employing Long Short-Term Memory and Facebook Prophet Model in Air Temperature Forecasting. Commun. Stat. Simul. Comput. 2021. (Early Access). [Google Scholar] [CrossRef]
  34. Wu, C.H.; Lu, C.C.; Ma, Y.-F.; Lu, R.-S. A new forecasting framework for bitcoin price with LSTM. In Proceedings of the IEEE International Conference on Data Mining Workshops, Singapore, 17–20 November 2018; pp. 168–175. [Google Scholar]
  35. Helmini, S.; Jihan, N.; Jayasinghe, M.; Perera, S. Sales Forecasting Using Multivariate Long Short Term Memory Networks. PeerJ 2019, 7, e27712v1. [Google Scholar]
  36. Gers, F.A.; Schmidhuber, J.; Cummins, F. Learning to forget: Continual prediction with LSTM. Neural Comput. 2000, 12, 2451–2471. [Google Scholar] [CrossRef]
  37. Karim, F.; Majumdar, S.; Darabi, H.; Chen, S. LSTM Fully Convolutional Networks for Time Series Classification. IEEE Access 2017, 6, 1662–1669. [Google Scholar] [CrossRef]
  38. Saeed, A.; Li, C.; Danish, M.; Rubaiee, S.; Tang, G.; Gan, Z.; Ahmed, A. Hybrid Bidirectional LSTM Model for Short-Term Wind Speed Interval Prediction. IEEE Access 2020, 8, 182283–182294. [Google Scholar] [CrossRef]
  39. Venskus, J.; Treigys, P.; Markevičiūtė, J. Unsupervised marine vessel trajectory prediction using LSTM network and wild bootstrapping techniques. Nonlinear Anal. Model. Control. 2021, 26, 718–737. [Google Scholar] [CrossRef]
  40. Zheng, Z.; Chen, Z.; Hu, F.; Zhu, J.; Tang, Q.; Liang, Y. An automatic diagnosis of arrhythmias using a combination of CNN and LSTM technology. Electronics 2020, 9, 121. [Google Scholar] [CrossRef] [Green Version]
  41. Bai, J.; Xia, K.; Lin, Y.; Wu, P. Attribute Reduction Based on Consistent Covering Rough Set and Its Application. Complexity 2017, 2017, 8986917. [Google Scholar] [CrossRef] [Green Version]
  42. Wang, P.; Zhao, J.; Gao, Y.; Sotelo, M.A.; Li, Z. Lane Work-schedule of Toll Station Based on Queuing Theory and PSO-LSTM Model. IEEE Access 2020, 8, 84434–84443. [Google Scholar] [CrossRef]
  43. Wen, H.; Zhang, D.; Lu, S.Y. Application of GA-LSTM model in highway traffic flow prediction. J. Harbin Inst. Technol. 2019, 51, 81–95. [Google Scholar]
Figure 1. Fluctuation range of inertia weights.
Figure 2. LSTM network structure.
Figure 3. Block diagram of the oil layer prediction system based on IEO-LSTM.
Figure 4. The normalized curves of attributes: (a,b) attribute normalization of o1; (c,d) attribute normalization of o2; (e) attribute normalization of o3.
Table 1. Description of benchmark functions.
Function | Dim | Range | $f_{\min}$
$F_1(x)=\sum_{i=1}^{n} x_i^{2}$ | 30 | [−100, 100] | 0
$F_2(x)=\sum_{i=1}^{n}\left|x_i\right|+\prod_{i=1}^{n}\left|x_i\right|$ | 30 | [−10, 10] | 0
$F_3(x)=\sum_{i=1}^{n}\left(\sum_{j=1}^{i} x_j\right)^{2}$ | 30 | [−100, 100] | 0
$F_4(x)=\max_{i}\left\{\left|x_i\right|,\ 1 \le i \le n\right\}$ | 30 | [−100, 100] | 0
$F_5(x)=\sum_{i=1}^{n} i x_i^{4}+\mathrm{random}[0,1)$ | 30 | [−1.28, 1.28] | 0
$F_6(x)=\sum_{i=1}^{n}\left[x_i^{2}-10\cos(2\pi x_i)+10\right]$ | 30 | [−5.12, 5.12] | 0
$F_7(x)=-20\exp\left(-0.2\sqrt{\tfrac{1}{n}\sum_{i=1}^{n} x_i^{2}}\right)-\exp\left(\tfrac{1}{n}\sum_{i=1}^{n}\cos(2\pi x_i)\right)+20+e$ | 30 | [−32, 32] | 0
$F_8(x)=\tfrac{1}{4000}\sum_{i=1}^{n} x_i^{2}-\prod_{i=1}^{n}\cos\left(\tfrac{x_i}{\sqrt{i}}\right)+1$ | 30 | [−600, 600] | 0
$F_9(x)=\left(\tfrac{1}{500}+\sum_{j=1}^{25}\tfrac{1}{j+\sum_{i=1}^{2}\left(x_i-a_{ij}\right)^{6}}\right)^{-1}$ | 2 | [−65, 65] | 1
$F_{10}(x)=\sum_{i=1}^{11}\left[a_i-\tfrac{x_1\left(b_i^{2}+b_i x_2\right)}{b_i^{2}+b_i x_3+x_4}\right]^{2}$ | 4 | [−5, 5] | 0.00030
$F_{11}(x)=4x_1^{2}-2.1x_1^{4}+\tfrac{1}{3}x_1^{6}+x_1 x_2-4x_2^{2}+4x_2^{4}$ | 2 | [−5, 5] | −1.0316
$F_{12}(x)=\left[1+\left(x_1+x_2+1\right)^{2}\left(19-14x_1+3x_1^{2}-14x_2+6x_1x_2+3x_2^{2}\right)\right]\times\left[30+\left(2x_1-3x_2\right)^{2}\left(18-32x_1+12x_1^{2}+48x_2-36x_1x_2+27x_2^{2}\right)\right]$ | 2 | [−2, 2] | 3
$F_{13}(x)=-\sum_{i=1}^{4} c_i\exp\left(-\sum_{j=1}^{3} a_{ij}\left(x_j-p_{ij}\right)^{2}\right)$ | 3 | [1, 3] | −3.86
$F_{14}(x)=-\sum_{i=1}^{4} c_i\exp\left(-\sum_{j=1}^{6} a_{ij}\left(x_j-p_{ij}\right)^{2}\right)$ | 6 | [0, 1] | −3.32
$F_{15}(x)=-\sum_{i=1}^{10}\left[\left(X-a_i\right)\left(X-a_i\right)^{T}+c_i\right]^{-1}$ | 4 | [0, 10] | −10.5363
$F_{16}(x)$: Composition Function 1 (N = 3) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{17}(x)$: Composition Function 2 (N = 3) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{18}(x)$: Composition Function 3 (N = 4) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{19}(x)$: Composition Function 4 (N = 4) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{20}(x)$: Composition Function 5 (N = 5) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{21}(x)$: Composition Function 6 (N = 5) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{22}(x)$: Composition Function 7 (N = 6) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{23}(x)$: Composition Function 8 (N = 6) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{24}(x)$: Composition Function 9 (N = 3) | 30 | $[-100,\ 100]^{D}$ | 0
$F_{25}(x)$: Composition Function 10 (N = 3) | 30 | $[-100,\ 100]^{D}$ | 0
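The classical functions above follow their standard definitions, so they can be implemented directly for testing. A minimal NumPy sketch of three of them (F1, F6, F7) is given below; the function names and the zero test point are illustrative only, not part of the benchmark suite.

```python
import numpy as np

def f1_sphere(x):
    """F1: sum of squares; global minimum 0 at x = 0."""
    return np.sum(x ** 2)

def f6_rastrigin(x):
    """F6: sum(x_i^2 - 10*cos(2*pi*x_i) + 10); global minimum 0 at x = 0."""
    return np.sum(x ** 2 - 10.0 * np.cos(2.0 * np.pi * x) + 10.0)

def f7_ackley(x):
    """F7: Ackley function; global minimum 0 at x = 0."""
    n = x.size
    return (-20.0 * np.exp(-0.2 * np.sqrt(np.sum(x ** 2) / n))
            - np.exp(np.sum(np.cos(2.0 * np.pi * x)) / n) + 20.0 + np.e)

x = np.zeros(30)                                     # 30-dimensional test point
print(f1_sphere(x), f6_rastrigin(x), f7_ackley(x))   # all three evaluate to ~0 at the optimum
```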
Table 2. Reduction results of logging data.
Well | Attributes
Original results (o1) | GR, DT, SP, WQ, LLD, LLS, DEN, NPHI, PE, U, TH, K, CALI
Reduction results (o1) | GR, DT, SP, LLD, LLS, DEN, K
Original results (o2) | AC, CNL, DEN, GR, RT, RI, RXO, SP, R2M, R025, BZSP, RA2, C1, C2, CALI, RINC, PORT, VCL, VMA1, VMA6, RHOG, SW, VO, WO, PORE, VXO, VW, so, rnsy, rsly, rny, AC1
Reduction results (o2) | AC, GR, RT, RXO, SP
Original results (o3) | AC, CALI, GR, NG, RA2, RA4, RI, RM, RT, RXO, SP
Reduction results (o3) | AC, NG, RI, SP
Decision attribute | −1 and 1 represent the dry layer and the oil layer, respectively.
Table 3. Data set partition details.
Well | Depth (m) | Training Set (Oil Layer / Dry Layer) | Depth (m) | Test Set (Oil Layer / Dry Layer)
o1 | 3150~3330 | 122 / 1319 | 3330~3460 | 115 / 925
o2 | 1180~1255 | 65 / 496 | 1230~1300 | 40 / 360
o3 | 1150~1250 | 65 / 736 | 1233~1260 | 104 / 112
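The partition in Table 3 is made by depth interval rather than by random sampling. A minimal pandas sketch for well o1 is shown below, assuming the logging data are stored one row per sampling depth with a −1/1 layer label; the file and column names are hypothetical.

```python
import pandas as pd

# One row per sampling depth: the reduced attributes plus a -1/1 layer label.
logs = pd.read_csv("well_o1_logs.csv")   # hypothetical file name

# Split by depth interval, following the Table 3 boundaries for well o1.
train = logs[(logs["DEPTH"] >= 3150) & (logs["DEPTH"] < 3330)]
test = logs[(logs["DEPTH"] >= 3330) & (logs["DEPTH"] <= 3460)]

X_train, y_train = train.drop(columns=["DEPTH", "LABEL"]), train["LABEL"]
X_test, y_test = test.drop(columns=["DEPTH", "LABEL"]), test["LABEL"]
```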
Table 4. Value range of attributes in o1.
Attributes | GR | DT | SP | LLD | LLS | DEN | K
Boundary | [6, 200] | [152, 462] | [−167, −68] | [0, 25000] | [0, 3307] | [1, 4] | [0, 5]
Table 5. Value range of attributes in o2.
Attributes | AC | GR | RT | RXO | SP
Boundary | [54, 140] | [27, 100] | [2, 90] | [1, 328] | [−32, −6]
Table 6. Value range of attributes in o3.
Attributes | AC | NG | RI | SP
Boundary | [0, 571] | [0, 42] | [−1, 81] | [−50, 28]
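Figure 4 plots the attributes after normalization. Assuming a standard min–max mapping of each attribute onto [0, 1] using the boundaries in Tables 4–6 (the paper's exact scaling is not restated here), the transformation can be sketched as follows; the dictionary keys mirror the Table 4 attribute names.

```python
import numpy as np

# Boundaries for well o1 taken from Table 4.
BOUNDS_O1 = {"GR": (6, 200), "DT": (152, 462), "SP": (-167, -68),
             "LLD": (0, 25000), "LLS": (0, 3307), "DEN": (1, 4), "K": (0, 5)}

def min_max_normalize(value, lo, hi):
    """Map a raw logging value onto [0, 1]; out-of-range values are clipped."""
    return float(np.clip((value - lo) / (hi - lo), 0.0, 1.0))

print(min_max_normalize(103.0, *BOUNDS_O1["GR"]))   # -> 0.5
print(min_max_normalize(-100.0, *BOUNDS_O1["SP"]))  # -> ~0.68
```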
Table 7. Prediction performance on o1, o2, o3.
Optimization Algorithm | Oil | Learning Rate | Epochs | Batch Size | ACC | MSE | RMSE | MAE
LSTM | o1 | 0.01 | 100 | 256 | 0.8837 | 0.2049 | 0.4887 | 0.2341
LSTM | o2 | 0.01 | 100 | 256 | 0.8937 | 0.1765 | 0.0519 | 0.0826
LSTM | o3 | 0.01 | 100 | 256 | 0.9095 | 0.1561 | 0.4896 | 0.1971
PSO_LSTM | o1 | 0.0098 | 86 | 103 | 0.9025 | 0.1697 | 0.3925 | 0.1137
PSO_LSTM | o2 | 0.0011 | 42 | 154 | 0.9154 | 0.0937 | 0.035 | 0.0985
PSO_LSTM | o3 | 0.0035 | 27 | 5 | 0.8859 | 0.0952 | 0.3131 | 0.0933
GA_LSTM | o1 | 0.0082 | 20 | 200 | 0.9084 | 0.6732 | 0.8587 | 0.7368
GA_LSTM | o2 | 0.0082 | 54 | 167 | 0.8979 | 0.0753 | 0.2853 | 0.0877
GA_LSTM | o3 | 0.0038 | 29 | 176 | 0.9132 | 0.1368 | 0.3368 | 0.1404
EO_LSTM | o1 | 0.0089 | 28 | 163 | 0.9231 | 0.1481 | 0.2347 | 0.5442
EO_LSTM | o2 | 0.0081 | 45 | 127 | 0.9183 | 0.0605 | 0.0375 | 0.0116
EO_LSTM | o3 | 0.0049 | 56 | 155 | 0.9203 | 0.0795 | 0.2353 | 0.0793
IEO_LSTM | o1 | 0.0030 | 29 | 137 | 0.9433 | 0.1023 | 0.3781 | 0.4544
IEO_LSTM | o2 | 0.0082 | 43 | 109 | 0.9371 | 0.041 | 0.0212 | 0.0107
IEO_LSTM | o3 | 0.0027 | 53 | 128 | 0.9661 | 0.0323 | 0.2016 | 0.0571
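The ACC, MSE, RMSE and MAE columns in Table 7 follow their standard definitions, so they can be recomputed from the predicted and true layer labels. A minimal sketch is given below; the example labels and outputs are hypothetical.

```python
import numpy as np

def evaluate(y_true, y_pred):
    """Return (ACC, MSE, RMSE, MAE); labels are in {-1, 1}, predictions may be continuous."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    acc = np.mean(np.sign(y_pred) == np.sign(y_true))   # fraction of correctly signed predictions
    err = y_true - y_pred
    mse = np.mean(err ** 2)
    return acc, mse, np.sqrt(mse), np.mean(np.abs(err))

# Hypothetical network outputs against true oil (1) / dry (-1) labels.
print(evaluate([1, -1, 1, 1], [0.9, -0.8, 0.7, -0.2]))  # ACC = 0.75
```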
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
