1. Introduction
Optimization involves a series of tasks that employ mathematical or analytical methods to enhance the parameters of a given system or process at reduced cost. This process follows a trajectory that enables the problem to attain a global optimum [1]. To date, optimization has been applied to real-world problems represented by mathematical models to address challenges in engineering, finance, and science [2,3]. The development of optimization algorithms has garnered significant interest in addressing optimization issues across various domains, including engineering, science, economics, and business [4,5,6]. A real-world optimization problem can be resolved if it is mathematically expressed. When addressing real-world challenges, it is crucial to optimally derive the desired variables. Consequently, intelligent algorithms have been acknowledged as vital tools for solving these problems. These algorithms are categorized into deterministic and stochastic types [7]. However, deterministic algorithms present several limitations, such as the necessity for complex mathematical computations, implementation challenges, and a tendency to become trapped in local optima [2,8]. Stochastic algorithms that employ non-deterministic, derivative-free methods have been utilized to mitigate these limitations. In this regard, metaheuristic optimization techniques, classified as stochastic algorithms, provide promising solutions to fulfil this requirement.
Such algorithms are based on theorems derived from scientific laws and mathematical modelling with specific constraints, as well as theorems that mimic the behavior of swarms with shared intelligence [2]. The goal of metaheuristics is to identify search or optimization methods that are superior when applied to instances of a given class of problems. In all swarm-based optimization algorithms, the problem is optimized using a set of random solutions. Each time the algorithm is executed, this set of random solutions is evaluated and improved using a fitness function. The probability of obtaining the overall optimal solution increases with a sufficient number of random solutions and iterations.
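To make this generic loop concrete, the following is a minimal, hypothetical sketch of a population-based optimizer in Python. The update rule, the function and parameter names (swarm_optimize, n_agents, n_iter), and the sphere test function are illustrative assumptions only, not the WHO/IWHO equations described later in the paper.

```python
import numpy as np

def swarm_optimize(fitness, dim, bounds, n_agents=40, n_iter=500, seed=0):
    """Generic swarm-style loop: a placeholder illustration, not the
    actual WHO/IWHO update rules (those are defined in Section 2)."""
    rng = np.random.default_rng(seed)
    low, high = bounds
    # 1) Initialize a set of random candidate solutions.
    pop = rng.uniform(low, high, size=(n_agents, dim))
    fit = np.apply_along_axis(fitness, 1, pop)
    best = pop[np.argmin(fit)].copy()
    for _ in range(n_iter):
        # 2) Perturb each agent toward the current best (placeholder move).
        step = rng.normal(0.0, 0.1, size=pop.shape) * (high - low)
        pop = np.clip(pop + 0.5 * (best - pop) + step, low, high)
        # 3) Re-evaluate with the fitness function and keep the best found.
        fit = np.apply_along_axis(fitness, 1, pop)
        if fit.min() < fitness(best):
            best = pop[np.argmin(fit)].copy()
    return best, fitness(best)

# Example: minimize the sphere function in 10 dimensions.
best_x, best_f = swarm_optimize(lambda x: np.sum(x**2), dim=10, bounds=(-10, 10))
```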
Metaheuristic algorithms are characterized by two primary phases: exploration and exploitation. The efficacy of the balance between these phases indicates the overall quality of the algorithm. Exploration pertains to the algorithm’s capacity to survey the entire search space, whereas exploitation involves each search agent’s ability to identify the optimal solution within its local vicinity. The effectiveness of an algorithm in addressing a specific problem set may not necessarily translate to optimization problems of a different nature or type [9]. It is acknowledged that no single optimization algorithm is universally applicable to all problems. Metaheuristic algorithms can be designed to efficiently navigate the search space and deliver optimal solutions within a reasonable time frame.
The Genetic Algorithm (GA) [10] is among the earliest population-based algorithms. The particle swarm optimization (PSO) algorithm [11] has been extensively examined and has inspired various algorithms, including the Grey Wolf Optimizer (GWO) [12] and the Harris Hawks Optimizer (HHO) [8], among others, in terms of their operational principles [11]. The Hunger Games Search (HGS) algorithm [13] was inspired by the hunger drive of animals, a common theme in swarm-based algorithms. Over the past two decades, numerous algorithms have been developed and implemented in this domain [14,15,16].
The No Free Lunch (NFL) theorem underscores the importance of leveraging problem-specific knowledge to enhance performance, thereby providing an opportunity to develop various strategies for fine-tuning a metaheuristic algorithm as desired [17]. Hybridization techniques can be employed to obtain superior solutions [18,19]. Hybridization offers three significant advantages: by integrating different strategies or algorithms, it capitalizes on the flexibility of metaheuristics to tackle large-scale problems that individual metaheuristics cannot manage alone; it improves problem-solving performance; and it yields more robust algorithms [20].
Some examples of hybridization from the literature are examined below.
The authors of one paper highlight that hybrid metaheuristic algorithms improve the effectiveness of individual algorithms by combining their unique advantages. Many hybrid algorithms have been created for feature selection, with the goal of identifying the best feature subsets from datasets. These methods avoid getting stuck in local optima and prevent early convergence while thoroughly examining the search space. The algorithms strike a balance between exploration and exploitation, and the improved algorithms produce results that are nearly optimal. By integrating multiple methods, hybrid metaheuristics enhance both convergence and the quality of solutions [21].
A further study on hybridization explores hybrid meta-heuristic algorithms for optimizing the dimensions of hybrid renewable energy systems (HRES). HRES enhances energy sustainability in remote areas; however, optimization is complex due to multiple factors. Hybrid meta-heuristic algorithms yield more precise results than traditional methods for HRES optimization. The paper reviews these algorithms for both single-objective and multi-objective design optimization. In single-objective optimization, algorithms such as GA-TS, PSO-HS, and the GA-Markov model are employed to minimize costs and enhance reliability. For multi-objective optimization, algorithms like FPA-SA, MOPSO-NSGA-II, and ACO-ABC are utilized to optimize cost, emissions, and reliability simultaneously. Low-level co-evolutionary hybridization is most commonly employed to balance algorithm exploration and exploitation. The authors note an increased use of hybrid algorithms for multi-objective HRES optimization [22]. Another study evaluates an autonomous green energy system using hybrid metaheuristic methods. It presents an HRES comprising wind turbines, solar photovoltaic panels, a biomass gasifier, and fuel cells with hydrogen storage for an off-grid university campus in Turkey. The research examines electricity production and optimization algorithms to minimize annual system cost while ensuring reliable supply. Using a Hybrid Firefly Genetic Algorithm (HFGA), which outperforms other algorithms, the optimal sizing is determined. The standalone system proves most cost-effective, achieving a 100% renewable energy fraction [23].
Recently, metaheuristic algorithms have been widely employed to address real-world problems, with applications spanning engineering challenges [24,25], efficient use of air conditioning technology [25], and the training of artificial neural networks [2]. Furthermore, a substantial body of literature exists on the integration of metaheuristics into machine learning algorithms [26,27].
For instance, in one study, the authors examined an HVAC system [28]. That paper introduces a hybrid model that combines physical system modeling with symbolic regression, noted for its effectiveness with limited data and its physical expressiveness; the model has been tested on air conditioning systems to forecast energy performance. In the present work, the IWHO is developed as an optimization algorithm for machine learning (MLP) training. The algorithm’s optimization effectiveness and its impact on MLP training are evaluated, with applications to standard optimization tests and MLP training using the UCI Energy Efficiency dataset. The IWHO excels in optimization and training stability. Enhanced with a Random Walk strategy, the IWHO addresses premature convergence issues and delivers superior results compared to the classical WHO and other metaheuristic algorithms, as validated by the Wilcoxon test. As an MLP trainer, the IWHO achieves lower error rates and higher explanatory power (R²: 0.98) than the classical WHO, providing precise outcomes in energy efficiency and HVAC systems. Box plots and convergence curves demonstrate that the IWHO yields more stable results near the global optimum. Although it is slower by 23%, its accuracy makes this acceptable. The IWHO’s adaptable nature suits various optimization problems and machine learning applications, ranging from energy efficiency to engineering design. The algorithm’s primary advantages are its novel optimization approach, integration with machine learning, and successful application to energy efficiency, with statistically significant results and stable performance.
In this study, the IWHO algorithm was utilized as an advisor for an artificial neural network, a machine-learning algorithm, thereby contributing to this field. The wild horse optimization algorithm considered in this study is a swarm-based algorithm [27]. The group hierarchy in a swarm of wild horses is inspired by common swarm behaviors such as the satisfaction of motives. The Wild Horse Optimizer (WHO) algorithm, developed in 2022, is based on swarm characteristics and the hierarchy of wild horses, and several studies have since addressed it; it is pertinent to provide examples of these. For instance, to augment the exploitation capacity of the WHO algorithm and prevent it from becoming trapped in local optima, a random running strategy (RRS) was employed to ensure a balance between exploration and exploitation and to strengthen competition with the waterhole mechanism (CWHM); in addition, the dynamic inertia weight strategy (DIWS) was used to achieve a globally optimal result [29]. To address the limitations of the WHO algorithm, such as its inadequate search accuracy and slow convergence speed, a Hybrid Multi-Strategy Improved Wild Horse Optimizer (HMSWHO) was proposed: the Halton sequence is utilized to increase the diversity of the swarm, the adaptive parameter TDR is employed for the exploration-exploitation balance, and the simplex method is applied to improve the worst position in the swarm [30]. In another study aimed at enhancing the WHO algorithm, a new algorithm was developed by implementing two strategies to increase global search efficiency and avoid local optima: a quantum acceleration strategy, which enhances the individual abilities of each horse, and a chaotic mapping strategy, which prevents entrapment in local optima [31].
Numerous studies have applied the WHO algorithm to various engineering challenges. In one study, the WHO algorithm was employed as a controller in a wind turbine system to standardize the wind speed through a specific controller and avert potential failures [32]. Another study hybridized the WHO algorithm with the dwarf mongoose optimization (DMO) algorithm to assess its efficacy as a controller for reducing electricity costs in a small-grid system model [33]. Within the domain of energy engineering, the WHO algorithm has demonstrated its effectiveness as an optimization controller [34]. It has also been used for attribute selection [35]. In the context of scheduling problems, the WHO algorithm has been successfully applied to the planning of work processes in workshops by hybridizing it with various strategies [36]. The stability of the WHO algorithm served as a key motivation for this study. The quality of the algorithm was evaluated using the CEC 2019 [37] benchmark functions, and the algorithm was hybridized with the Random Walk (RW) strategy to enhance outcomes. The RW strategy augments the potential of metaheuristic algorithms by introducing randomness into local search processes and generating alternative solutions [38,39]. The findings of this study indicated a significant improvement in the performance of the WHO algorithm across most functions. Previously, Eker et al. conducted a similar investigation using the RW-ARO algorithm, which resulted from the hybridization of the artificial rabbit optimization (ARO) algorithm and the RW strategy, achieving successful results in control engineering [40].
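A minimal sketch of such a random-walk local search is given below, under the assumption of Gaussian steps and greedy acceptance; the actual RW operator of the IWHO is specified in Section 2, and the function name, step distribution, and scale here are illustrative.

```python
import numpy as np

def random_walk_refine(fitness, x_best, bounds, n_steps=300, scale=0.01, seed=0):
    """Hypothetical Random Walk (RW) local search: perturb the current best
    solution with small random steps and keep improvements. The step count
    (300) mirrors the setting reported in Section 4."""
    rng = np.random.default_rng(seed)
    low, high = bounds
    x, fx = x_best.copy(), fitness(x_best)
    for _ in range(n_steps):
        candidate = np.clip(x + rng.normal(0.0, scale * (high - low), x.shape),
                            low, high)
        fc = fitness(candidate)
        if fc < fx:  # greedy acceptance: only downhill moves are kept
            x, fx = candidate, fc
    return x, fx

# Example: refine a starting point on the sphere function.
x, fx = random_walk_refine(lambda v: np.sum(v**2), np.ones(10), (-10, 10))
```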
In this study, the proposed IWHO hybrid algorithm was employed to train the cooling load (CL) and heating load (HL) predictors for HVAC systems in buildings via MLP, with the IWHO designated as the trainer. The WHO algorithm was selected as an alternative. This study utilized the Energy Efficiency dataset from the UCI repository, which addresses the energy efficiency problem [41].
Previous studies have explored related topics. One study assessed the predictive capabilities of Artificial Neural Networks (ANN) and XGBoost surrogate models for heating, cooling, lighting loads, and equivalent primary energy requirements, using EnergyPlus simulation data across various office cell model versions. The XGBoost models exhibited superior accuracy in predicting these loads. The mc-intersite-proj-th sampling method slightly enhanced prediction accuracy compared to maximin Latin hypercube sampling for both models. XGBoost models, which consist of collections of piecewise constant functions, perform optimally when the simulated loads show minimal variation or smaller gradients across the design space. This implies that fewer regression trees may suffice for precise predictions, or that the same number could yield more accurate results. In this case study, XGBoost models demonstrated greater accuracy for lighting loads and primary energy needs than for heating and cooling loads, which have simpler slanted shapes with higher gradients not approaching zero [42]. Another study addressed the increasing prevalence of electric vehicles, for which accurately predicting energy consumption is essential for effective power grid management. This research evaluated eleven machine learning models, including Ridge Regression, Lasso Regression, K-Nearest Neighbors, Gradient Boosting, Support Vector Regression, Multi-Layer Perceptron, XGBoost, CatBoost, LightGBM, Gaussian Process Regression (GPR), and Extra Trees Regressor, using data from Colorado. The models were assessed using metrics such as Mean Absolute Error (MAE), Mean Squared Error (MSE), R², Root Mean Squared Error (RMSE), and Normalized RMSE (NRMSE). The Extra Trees Regressor emerged as the most precise in forecasting energy consumption, while Gradient Boosting and K-Nearest Neighbors (KNN) also demonstrated strong performance; non-linear and linear regression models faced challenges in predicting extreme energy levels. A limitation of this study is its exclusive reliance on data from Colorado. Although accurate, the computational demands of the Extra Trees Regressor may affect its scalability for real-time applications. Future research should consider exploring deep learning models, expanding datasets, and employing time-series analysis to enhance forecasting precision [43].
Khishe and Mohammadi trained a passive sonar dataset using the SSA-MLP hybrid system, achieving high classification accuracy [44]. He et al. proposed a metaheuristic-based algorithm model for designing underwater wireless sensor networks to enhance energy efficiency and extend network lifetime, suggesting a hybrid hierarchical chimpanzee optimization algorithm to efficiently manage clustering and multihop routing procedures [45].
Building on these advancements, our research introduces the IWHO-MLP framework, which utilizes the Improved Wild Horse Optimizer for training Multi-Layer Perceptrons (MLPs). Comparative analyses using boxplots and convergence curves reveal that IWHO-MLP consistently outperforms both traditional and hybrid methods, achieving lower MSE and RMSE and higher R² values on the energy efficiency dataset and CEC 2019 benchmark functions. These findings not only confirm but also advance the current state-of-the-art, highlighting the efficacy of metaheuristic-driven optimization in neural network training.
In the existing literature, there are studies on artificial neural networks that incorporate the WHO algorithm. A comprehensive study of face recognition systems in the banking sector employed methods such as artificial neural networks, decision trees, and adaptive neuro-fuzzy inference systems, with the WHO algorithm contributing to the optimization of these methods [44]. Additionally, the WHO algorithm has been utilized as an optimizer in artificial neural networks for the classification of long-term storage of medical images in the healthcare domain [45].
The proposed approach offers several key strengths and advantages:
Enhanced performance: the IWHO algorithm demonstrated superior results compared with the original WHO algorithm and other alternatives across most benchmark functions.
Stability and consistency: boxplot analysis indicated that the IWHO algorithm consistently exhibited stable performance.
Improved exploration-exploitation balance: the Random Walk strategy mitigates early convergence and prevents entrapment in local optima.
Competitive performance: the IWHO surpasses other algorithms when addressing the challenging test sets of CEC 2019.
Statistical significance: the Wilcoxon signed-rank test confirmed the distinctiveness and efficacy of the IWHO algorithm.
Efficient MLP training: the IWHO excels in training a Multi-Layer Perceptron for the energy efficiency problem, achieving lower MSE values and more reliable outcomes.
Improved prediction accuracy: the IWHO algorithm delivers higher prediction rates for Cooling Load (CL) factors than the original WHO algorithm.
Time efficiency: despite its increased complexity, the IWHO completes the optimization process within a reasonable timeframe.
Flexibility and hybridization potential: the success of the IWHO suggests that the WHO algorithm possesses a flexible structure that is amenable to integration with other strategies.
Real-world applicability: the IWHO algorithm demonstrates potential for addressing practical issues such as optimizing HVAC systems in smart buildings for energy efficiency.
The remainder of the paper is organized as follows: Section 2 discusses the WHO algorithm, the RW search strategy, and the proposed IWHO hybridization structure; Section 3 describes the Multi-Layer Perceptron architecture; Section 4 presents the experimental results, including the evaluation of the IWHO algorithm on the CEC 2019 benchmark and MLP training; and Section 5 presents the conclusions. The proposed hybrid IWHO and the optimization process with alternative algorithms are given in Figure 1 below.
3. Multilayer Perceptron Architecture
The Multi-Layer Perceptron (MLP) is an artificial neural network architecture that serves as a nonparametric estimator, applicable to classification, prediction, and regression tasks. The brain is an information-processing device with extraordinary capabilities, surpassing current technological products in various fields such as vision, speech recognition, and learning. These applications have clear economic advantages when implemented on machines. By understanding how the brain accomplishes these functions, we can develop formal algorithms to solve these tasks and implement them on computers. The human brain is fundamentally different from a computer. While a computer typically has a single processor, the brain consists of an enormous number of processing units, approximately $10^{11}$ neurons, which operate in parallel. Although the precise details are not fully understood, these processing units are thought to be much simpler and slower than a computer processor. Another distinguishing feature of the brain, which is believed to contribute to its computational power, is its extensive connectivity. Each neuron in the brain has connections, called synapses, to around $10^{4}$ other neurons, all functioning in parallel. In a computer, the processor is active and the memory is separate and passive; in the brain, however, it is believed that processing and memory are distributed throughout the network: processing is carried out by the neurons, and memory resides in the synapses between neurons. The information-processing system in the brain requires a fundamental theory, algorithm, and hardware. This includes data input for the desired operation, an algorithm to perform the function, and the capability to execute commands using a specific hardware implementation. The training of neural networks, which involves a mathematical function, is made possible through statistical techniques. One of the architectures designed for this training is the MLP structure [48].
The aim of the perceptron is to correctly classify the set of externally applied stimuli $x_1, x_2, \ldots, x_m$ into one of two classes, $C_1$ or $C_2$. In Figure 4, the bias is denoted by $b$, $x_i$ denotes the input values, $w_i$ the connection (synaptic) weights, $v$ the induced local field of the neuron, and $\varphi(\cdot)$ the transfer function. Given the induced local field $v = \sum_{i=1}^{m} w_i x_i + b$, the decision boundary $v = 0$ can also be defined as a hyperplane. We can express the output of the perceptron as a dot product, $v = \mathbf{w}^{\top}\mathbf{x}$, by augmenting the input vector $\mathbf{x}$ and weight vector $\mathbf{w}$ to include the bias input and bias weight, respectively.
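As a small illustration of this model, the sketch below computes the induced local field and thresholds it, then shows the augmented dot-product form; all variable names are ours, not from the paper.

```python
import numpy as np

def perceptron_output(x, w, b):
    """Single perceptron: v is the induced local field; the sign of v
    selects the class (threshold transfer function)."""
    v = np.dot(w, x) + b          # v = w^T x + b
    return 1 if v > 0 else -1

# Equivalently, augment x with a constant 1 so the bias becomes a weight:
x_aug = np.array([1.0, 0.5, -1.2])   # [1, x1, x2]
w_aug = np.array([0.3, 0.8, 0.1])    # [b, w1, w2]
v = np.dot(w_aug, x_aug)             # dot-product form of the local field
```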
During testing, the output y is calculated with the given weights w for input x. To implement a given task, it is necessary to learn the weights w, which are the parameters of the system, such that the correct outputs are produced given the inputs.
The hyperplane can be utilized to partition the input space into two regions: one where $v$ is positive and another where $v$ is negative. This concept is foundational in implementing a linear separation function. In Figure 5, the detector can then classify the input into one of two classes by examining the sign of the output: if the output is positive, the input belongs to class $C_1$; if negative, it belongs to class $C_2$ [49].
If $v > 0$, $\mathbf{x}$ belongs to $C_1$; otherwise, $\mathbf{x}$ belongs to $C_2$. With $\varphi(\cdot)$ as a threshold function, an activation function is needed to compute the output $y$ used in the risk calculation. The transfer function is the mathematical function that defines the properties of the network and is chosen according to the problem that the network has to solve. The sigmoid was chosen as the transfer function in this paper because it is continuous and differentiable and takes values in the interval [0, 1]: $\varphi(v) = 1/(1 + e^{-av})$, where the parameter $a$ is the slope of the transfer function.
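A brief sketch of this transfer function with the slope parameter $a$ as above, together with its closed-form derivative (one reason the sigmoid is convenient for training):

```python
import numpy as np

def sigmoid(v, a=1.0):
    """Sigmoid transfer function with slope parameter a: continuous,
    differentiable, with outputs in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-a * v))

def sigmoid_derivative(v, a=1.0):
    """Closed-form derivative, which gradient-based training exploits."""
    s = sigmoid(v, a)
    return a * s * (1.0 - s)
```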
A perceptron with a single layer of weights can only approximate linear functions of the input and cannot solve problems like XOR, where the discriminant to be estimated is nonlinear. Similarly, a perceptron cannot be used for nonlinear regression. This limitation does not apply to feedforward networks with hidden layers between the input and output layers. When used for classification, such Multi-Layer Perceptrons can implement nonlinear discriminants, and when used for regression, they can approximate nonlinear functions of the input. Typically, the network consists of a set of source nodes that constitute the input layer, one or more hidden layers of computation nodes, and an output layer of computation nodes. These neural networks are called Multi-Layer Perceptrons (MLPs), representing a generalization of the single-layer perceptron. MLPs have been applied successfully to solve difficult and diverse problems by training them in a supervised manner with the popular algorithm known as “backprop”. The learning process performed with this algorithm is called backpropagation learning. As seen in Figure 6, in an MLP, function signals flow in the forward propagation direction, while the error signal flows backward through the network and the weights are corrected to minimize the error. An error signal begins at an output neuron of the network and travels backward through the network. Each neuron in the network performs its computations based on a function that depends on the error in some way [49,50].
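For contrast with the metaheuristic training used in this paper, the following minimal sketch shows one classical backpropagation update for a one-hidden-layer MLP. In this study, the MLP weights are instead optimized by the IWHO; the layer sizes in the example loosely mirror the UCI Energy Efficiency dataset (eight inputs, two outputs), and all names are illustrative.

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

def backprop_step(x, d, W1, b1, W2, b2, lr=0.1):
    """One backpropagation update. x: input vector, d: desired output."""
    # Forward pass: function signals flow forward.
    h = sigmoid(W1 @ x + b1)                      # hidden activations
    y = sigmoid(W2 @ h + b2)                      # network output
    # Backward pass: the error signal starts at the output neurons...
    e = y - d
    delta2 = e * y * (1.0 - y)                    # output local gradients
    # ...and travels backward through the network.
    delta1 = (W2.T @ delta2) * h * (1.0 - h)      # hidden local gradients
    # Weight corrections move toward smaller squared error.
    W2 -= lr * np.outer(delta2, h); b2 -= lr * delta2
    W1 -= lr * np.outer(delta1, x); b1 -= lr * delta1
    return W1, b1, W2, b2, np.mean(e**2)

# Example: 8 inputs -> 5 hidden -> 2 outputs (e.g., HL and CL).
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(5, 8)), np.zeros(5)
W2, b2 = rng.normal(size=(2, 5)), np.zeros(2)
W1, b1, W2, b2, mse = backprop_step(rng.normal(size=8), np.array([0.4, 0.6]),
                                    W1, b1, W2, b2)
```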
Learning is a process by which the free parameters of a neural network are adapted through a process of stimulation by the environment in which the network is embedded, as shown in Figure 7. The type of learning is determined by the manner in which the parameter changes take place [50].
Error metrics serve as a quantitative measure of the deviation between predicted outcomes and actual values. While a single result may not yield extensive insights, it facilitates the selection of the most appropriate regression model by providing a numerical basis for comparison with other model outcomes. The supervisor can provide the neural network with the desired response for a specific training vector, representing the optimal action the network should undertake. The network parameters are adjusted under the combined influence of the training vector and the error signal, defined as the difference between the desired and actual responses of the network. This tuning process is conducted iteratively and incrementally, with the objective of ensuring that the neural network eventually emulates the supervisor, which is presumed to be optimal in a statistical sense. Through this process, the environmental information available to the supervisor is transferred as comprehensively as possible to the neural network during training. Once this condition is achieved, the supervisor can be dispensed with, allowing the neural network to independently interact with the environment. Error values can be calculated using metrics such as mean squared error (MSE), root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and the coefficient of determination (R²). In Equation (14), $N$ is the number of training samples, $m$ is the number of output values, $d$ is the desired output, and $y$ is the actual output.
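For concreteness, standard definitions of MSE, RMSE, and R² consistent with these variables are:

```latex
\mathrm{MSE}=\frac{1}{N}\sum_{i=1}^{N}\sum_{j=1}^{m}\bigl(d_{ij}-y_{ij}\bigr)^{2},
\qquad
\mathrm{RMSE}=\sqrt{\mathrm{MSE}},
\qquad
R^{2}=1-\frac{\sum_{i}\bigl(d_{i}-y_{i}\bigr)^{2}}{\sum_{i}\bigl(d_{i}-\bar{d}\bigr)^{2}}
```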
4. Experimental Results
The main goal of this paper is to solve a global non-parametric standardized function set using the WHO algorithm hybridized with the Random Walk strategy (IWHO) and to compare the results with those of alternative competitive metaheuristic algorithms. A notation table summarizing all parameters of the algorithm is provided in Table 1.
The analysis tools used include statistical tables, boxplots, and convergence curves. The CEC 2019 functions are among the most challenging non-linear function sets due to their structure, which delays reaching the global solution, and they are widely accepted for evaluating algorithm quality. The primary rationale for selecting the 10 functions of the CEC 2019 set in our research is that these functions are internationally recognized as a challenging and diverse benchmark. The CEC 2019 functions are designed to evaluate the effectiveness of optimization algorithms, encompassing a wide range of problem types and complexities. This variety is sufficient for an objective assessment of both the algorithms’ convergence speed and their ability to identify the global optimum. As indicated in Table 2, the selected 10 functions vary in dimension, range, and problem type, providing adequate challenge and diversity to thoroughly evaluate the algorithms’ performance. These functions constitute the core and most frequently cited subset of the CEC 2019 benchmark. The experiments were run on an Intel Core i7-10700K CPU with 32 GB of RAM, using MATLAB R2021a with parallel processing enabled. For the IWHO, 40 search agents, 500 iterations, and 300 Random Walk steps were chosen. Each function was tested in 51 independent runs, a number that ensures the results are objective and reliable.
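A small harness of the kind one might use to reproduce these statistics is sketched below; the function and key names are illustrative assumptions, and run_once would wrap a single IWHO run on one CEC 2019 function.

```python
import numpy as np

# Settings reported in this section; the dict keys are illustrative names.
CONFIG = {"n_agents": 40, "n_iter": 500, "rw_steps": 300, "n_runs": 51}

def collect_stats(run_once, n_runs=CONFIG["n_runs"]):
    """run_once(seed) -> best objective value of one independent run.
    Returns the statistics reported in Table 3: mean, best, worst, std."""
    finals = np.array([run_once(seed) for seed in range(n_runs)])
    return {"mean": finals.mean(), "best": finals.min(),
            "worst": finals.max(), "std": finals.std(ddof=1)}

# Example with a stand-in run function (replace with IWHO on a benchmark):
stats = collect_stats(lambda seed: np.random.default_rng(seed).random())
```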
Table 2 presents the features of the CEC 2019 functions. In Table 3, the mean, best value, worst value, and standard deviation are used as statistical measures to evaluate the quality of the values obtained by the algorithms.
When examining Table 3, the proposed IWHO algorithm achieves the best value on all functions except CEC03 and CEC04. For CEC03, all algorithms yield the same result with different standard deviations. For CEC04, WHO performs very well, and the proposed IWHO algorithm ranks second.
The IWHO algorithm proposed in this paper is compared with the WHOFWA and WHOW algorithms, which were previously proposed as variants of WHO. It is also compared with the RW-DO algorithm, which employs the same random walk strategy used in the IWHO algorithm. As shown in Table 4, the IWHO algorithm exhibits superior performance on seven functions.
In Table 5, the nonparametric Wilcoxon test was used to evaluate whether the proposed IWHO algorithm differs from the other algorithms when working with different data sets, assessing its originality. In this context, W (Win) represents a victory, T (Tie) a draw, and L (Lose) a defeat. The significance level was set at p = 0.05. From the results, it can be concluded that the proposed IWHO algorithm is unique and original, as it produced significantly different results with 24 wins, 1 tie, and 5 losses out of 30 results.
When Table 5 is analyzed, the Wilcoxon signed-rank test was employed to assess whether there was a statistically significant difference in the performance of two algorithms. In Table 5, W/T/L stands for ‘Win’, ‘Tie’, and ‘Lose’, respectively, reflecting the status of the IWHO algorithm in comparison to the other algorithms, and a p-value under 0.05 was deemed to indicate statistical significance. The IWHO algorithm demonstrated notable superiority (W) over the CAPSA and HHO algorithms for almost all functions. When compared to the WHO algorithm, it was superior in some functions and either matched or fell short in others. Remarkably, against CAPSA and HHO, the IWHO excelled in nine or all ten functions. The p-values were mostly very small (e.g., 5.15 × 10⁻¹⁰), indicating that the observed differences are unlikely to be due to chance and that the IWHO’s superiority is statistically sound. The results of the Wilcoxon signed-rank test reveal that the IWHO algorithm is statistically significantly superior in nearly all CEC 2019 functions, especially when compared to the CAPSA and HHO algorithms. Although it surpassed the WHO algorithm in certain functions, it showed similar or lower performance in others. These findings imply that the IWHO is generally a more effective and dependable optimization algorithm than its competitors.
A Bonferroni correction was also applied. Taking this evaluation into account, all elements of the h vector were 1. This result shows that the differences remain significant under the Bonferroni correction.
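A sketch of how such a comparison could be computed with SciPy is given below, assuming per-run final objective values for two algorithms and a Bonferroni-adjusted threshold over the ten functions; crediting wins to the algorithm with the lower median is our assumption for these minimization problems.

```python
import numpy as np
from scipy.stats import wilcoxon

def compare(iwho_runs, other_runs, alpha=0.05, n_tests=10):
    """Wilcoxon signed-rank comparison on one function, with a
    Bonferroni-adjusted significance level (alpha / number of tests).
    Inputs are the 51 final objective values from independent runs."""
    stat, p = wilcoxon(iwho_runs, other_runs)
    h = int(p < alpha / n_tests)       # 1 = significant after correction
    if p >= alpha:
        outcome = "T"                  # tie: no significant difference
    else:                              # lower median wins (minimization)
        outcome = "W" if np.median(iwho_runs) < np.median(other_runs) else "L"
    return outcome, p, h

# Example with synthetic run data (replace with real per-run results):
rng = np.random.default_rng(0)
print(compare(rng.normal(1.0, 0.1, 51), rng.normal(1.5, 0.1, 51)))
```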
This paper performs a sensitivity analysis to determine the optimal parameter values for the Random Walk strategy within the proposed IWHO algorithm. For this purpose, the CEC 2019 benchmark, consisting of ten distinct functions, was chosen. The parameter C was tested at values of 0.12, 0.13, and 0.14, while the parameter ps remained constant at 0.2. The effects of these adjustments on the IWHO algorithm were evaluated. Although the results for CEC06 did not produce the best standard deviation, the outcomes for all other functions were favorable. These results are detailed in Table 6 and Table 7.
Upon reviewing Figure 8, it becomes evident that the values produced by the proposed IWHO algorithm are more closely clustered across all functions, making it the algorithm nearest to the optimal point compared to the others. The whiskers and boxes for the CEC01 function in both the WHO and IWHO algorithms are notably lower and shorter than those of HHO and CapSA. This indicates that WHO and IWHO achieve superior (lower MSE) and more consistent results. The lowest whisker point suggests that these algorithms may be closer to, or have reached, the global optimum. For the CEC02 function, WHO, IWHO, and CapSA show very short whiskers and low boxes, while HHO is significantly higher and more variable. It is likely that WHO and IWHO have reached the global optimum. Similarly, for the CEC03 function, the whiskers for WHO and IWHO are the lowest, whereas HHO is considerably higher, further indicating that WHO and IWHO are closer to the global optimum. Across the functions CEC04-CEC10, WHO and IWHO display the lowest whisker values, suggesting that these algorithms yield superior results (closer to the global optimum) compared to the others. Although a few outlier values are observed in the CEC01, CEC03, CEC04, and CEC10 functions, the consistency of results from fifty-one independent runs highlights the algorithm’s stable structure. Consequently, it can be concluded that the global solutions of this algorithm are within the best limits and closer to the optimal global solution.
As seen in the box plot model in Figure 9, box plots are displayed with whiskers extending from the box to the minimum and maximum non-outlier values, revealing the spread of most of the data. The box itself represents the interquartile range (IQR), which encompasses the middle 50% of the data, from the first quartile (Q1) to the third quartile (Q3). The thick line within the box signifies the median, or the 50th percentile, of the data. Any data point outside the whiskers is considered an outlier and is depicted as a red dot. The lowest value reached by the whisker, or the lowest outlier, might represent the optimal value (global optimum) identified by an optimization algorithm, provided it corresponds to the true minimum of the function.
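A minimal sketch of how such box plots can be produced follows; synthetic data stand in for the per-run results, and red dots mark outliers as described above.

```python
import numpy as np
import matplotlib.pyplot as plt

# Synthetic final objective values from 51 runs per algorithm (illustrative).
rng = np.random.default_rng(1)
runs = {name: rng.normal(mu, s, 51) for name, mu, s in
        [("WHO", 1.2, 0.15), ("IWHO", 1.0, 0.05),
         ("HHO", 2.0, 0.40), ("CapSA", 1.6, 0.30)]}

fig, ax = plt.subplots()
ax.boxplot(list(runs.values()), labels=list(runs.keys()),
           flierprops={"marker": "o", "markerfacecolor": "red"})  # red outliers
ax.set_ylabel("Final objective value (51 runs)")
plt.show()
```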
Convergence curves show how the algorithms approach the optimal value over 500 iterations, whether they get stuck at local minimum points, and whether they are subject to premature convergence. In this context, Figure 10 shows that the proposed IWHO algorithm progresses gradually in all functions without early convergence. Observing that the IWHO algorithm does not stagnate over a given range of iterations, but instead continually moves towards better solutions, can be considered evidence that the IWHO algorithm reaches the global solution without getting stuck at local optima. As a result, it can be observed that the IWHO algorithm converges towards the optimal point across the entire function set.
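For illustration, a convergence curve is simply the best-so-far objective value recorded at each iteration; the trace below is synthetic, standing in for the real per-iteration records.

```python
import numpy as np
import matplotlib.pyplot as plt

# Synthetic per-iteration objective values over 500 iterations (illustrative).
rng = np.random.default_rng(2)
raw = np.abs(rng.normal(1.0, 0.5, 500)) * np.exp(-np.linspace(0, 5, 500))
best_so_far = np.minimum.accumulate(raw)   # monotone non-increasing trace

plt.semilogy(best_so_far, label="IWHO (illustrative)")
plt.xlabel("Iteration"); plt.ylabel("Best objective value"); plt.legend()
plt.show()
```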