Performance Evaluation of Beluga Whale Optimization–Long Short-Term Memory–Random Forest Networks for Trajectory Control and Energy Optimization in Excavator Systems

Nguyen, Van Hien; Ahn, Kyoung Kwan

doi:10.3390/math13071177

by Van Hien Nguyen ¹ and Kyoung Kwan Ahn ²,*

¹ Graduate School of Mechanical and Automotive Engineering, University of Ulsan, Ulsan 44610, Republic of Korea

² School of Mechanical Engineering, University of Ulsan, Ulsan 44610, Republic of Korea

* Author to whom correspondence should be addressed.

Mathematics 2025, 13(7), 1177; https://doi.org/10.3390/math13071177

Submission received: 3 March 2025 / Revised: 25 March 2025 / Accepted: 27 March 2025 / Published: 2 April 2025

(This article belongs to the Section E2: Control Theory and Mechanics)

Abstract

Over the past few years, reducing energy consumption in hydraulic excavators has gained increasing attention, driving significant research in this field. One effective strategy involves integrating hydrostatic transmission (HST) and hydraulic pump/motor (HPM) systems into hydraulic excavators. However, challenges like disturbances, throttling-induced pressure drops, and fluid leakage often hinder both positional accuracy and energy efficiency. To tackle these issues, our study proposes a sophisticated dynamic forecasting model for positional control, integrating beluga whale optimization (BWO), long short-term memory (LSTM), and random forest (RF) techniques. The approach begins with dynamic data evaluation using Pearson’s correlation analysis to identify tuning parameters that have moderate to strong correlations with control variables, which are then used as inputs for predictive modeling. Initially, a standalone LSTM framework is developed to estimate the system’s positional output, with BWO optimizing four key tuning parameters. Subsequently, a hybrid BWO-enhanced LSTM-RF system is deployed to capture complex nonlinear patterns, improving the accuracy of motion trajectory predictions. Simulations and experiments confirm that our approach achieves a positional error below 3 mm, ensuring precise tracking and providing reliable data for operators. Compared to conventional proportional–integral–derivative (PID) controllers, standalone LSTM-RF, and a hybrid controller combining particle swarm optimization (PSO), LSTM, a gated recurrent unit (GRU), and PID (PSO-LSTM-GRU-PID), our method achieves superior tracking precision and energy savings of 12.46%, 8.98%, and 3.97%, respectively.

Keywords:

energy consumption; excavators; beluga whale optimization; random forest; tracking precision; hydrostatic transmission

MSC:

93C10; 93D21

1. Introduction

The escalating pressures of resource scarcity and environmental degradation have heightened the demand for innovative solutions in energy-intensive industries such as infrastructure and agriculture, where heavy machinery, notably hydraulic excavators (HEs), significantly contributes to global energy consumption and greenhouse gas emissions [1]. HEs, indispensable in these sectors, account for a substantial share of operational energy use, making them critical targets for advancements in energy efficiency and sustainability [2]. Current hybrid architectures, integrating an internal combustion engine (ICE) with advanced powertrains, aim to enhance energy efficiency in HEs. For instance, Do et al. [3] proposed an electric hybrid hydraulic excavator that optimizes ICE–hydraulic pump synergy, improving efficiency across diverse conditions. However, this approach struggles with torque management under high loads due to the limited capacity of the hydraulic pump. Similarly, Nguyen et al. [4] introduced a hydrostatic transmission paired with a planetary gear system, expanding the ICE’s operational range and enhancing adaptability, yet it falls short in delivering precise actuator control essential for operational accuracy and safety. These limitations, namely insufficient adaptability to dynamic conditions and high computational complexity, highlight the need for novel control strategies that effectively balance energy efficiency and robust performance in challenging HE scenarios.

Recent advancements in machine learning and metaheuristic optimization have spurred the development of intelligent control systems to tackle these challenges. Within the last five years, significant contributions include Ben Hazem et al. [5], who optimized an adaptive neuro-fuzzy inference system (ANFIS) using an enhanced particle swarm optimization (PSO) algorithm for improved actuator precision, and Song et al. [6], who employed a hybrid adaptive quantum PSO (HAQPSO) to tune PID parameters, achieving notable accuracy gains. Nguyen et al. [7] advanced this field with a PSO-LSTM-GRU-PID model, integrating PSO with LSTM and GRU networks to achieve positional errors below 3 mm and energy savings of up to 10.89%. However, its computational complexity hinders real-time applicability under rapidly shifting conditions. Likewise, Xuanyu et al. [8] proposed a beluga whale optimization–long short-term memory–gated recurrent unit (BWO-LSTM-GRU) framework for dynamic trajectory prediction, yet its reliance on GRU- and PID-like elements limits adaptability and increases processing demands, reducing its effectiveness in high-stakes hydraulic environments.

Despite these strides, existing methods often fail to reconcile precision, adaptability, and computational efficiency. The PSO-LSTM-GRU-PID model, for instance, integrates particle swarm optimization with a combination of LSTM and GRU networks alongside a PID controller, resulting in a highly complex architecture. This complexity not only increases the computational burden and makes real-time processing challenging but also complicates hyperparameter tuning. As a result, performance may be less robust under the pronounced nonlinearity, time delays, and noise typical of hydraulic environments. Similarly, the BWO-LSTM-GRU framework, while improving tracking accuracy, lacks the flexibility to handle abrupt operational shifts due to its dependence on GRU components. To address these shortcomings, this study proposes the beluga whale optimization–long short-term memory–random forest (BWO-LSTM-RF) framework. By replacing the GRU and traditional PID elements with a random forest (RF) module, this approach reduces computational complexity while enhancing adaptability to dynamic behaviors. Leveraging BWO to optimize four key LSTM-RF hyperparameters, the proposed model achieves boom cylinder trajectory errors below 3 mm and energy savings of up to 12.46%, outperforming conventional PID, LSTM-RF, and PSO-LSTM-GRU-PID systems by 12.46%, 8.98%, and 3.97%, respectively. Validated through AMESim simulation and experimental trials on a physical platform, this model-free design minimizes processing demands, offering a scalable, efficient solution for hybrid hydraulic excavators (HHEs).

The remainder of this paper is organized as follows: Section 2 outlines the system setup and its operational dynamics, Section 3 details the hybrid algorithmic control principles, Section 4 evaluates performance through simulation and experimental analysis, Section 5 examines economic implications, and Section 6 concludes with key findings and future directions.

2. Overview of the Setup

2.1. Drivetrain Analysis

To improve power conservation, this work builds on the drivetrain configuration whose success in hydraulic applications was demonstrated in [4] and which is depicted in Figure 1. At the core of its functionality, the HST and HPM regulate the planetary gear’s output shaft, enabling seamless integration with the HM and ensuring smooth transitions between ‘boom-up’ and ‘boom-down’ modes under varying conditions. Leveraging this proven framework, we implement the BWO-LSTM-RF controller to enhance positional accuracy and energy efficiency across a range of speeds and operational scenarios. This algorithm-driven approach actively enhances the system’s intrinsic advantages, delivering an effective solution for practical HE operations.

Within this system framework, the motor, engine, and hydrostatic transmission components align with the sun, carrier, and ring gears, respectively, forming the core of the planetary gear arrangement. To understand the system’s dynamic performance, it is crucial to analyze the rotational speeds of these gears, which are governed by Equation (1) [4], providing the mechanical foundation that underpins the efficacy of the BWO-LSTM-RF control strategy.

$$\omega_{r} \times d_{r} = \omega_{ca} \times (d_{r} + d_{s}) - \omega_{s} \times d_{s} \tag{1}$$

where $d_{r}$ and $\omega_{r}$ denote the diameter and speed of the ring gear, $d_{s}$ and $\omega_{s}$ denote those of the sun gear, and $\omega_{ca}$ is the speed of the carrier.

The HST ratio is described by Equation (2).

$$i_{hst} = \frac{\omega_{r}}{\omega_{hpm}} \tag{2}$$

where $\omega_{hpm}$ represents the speed of the pump and $i_{hst}$ is the transmission ratio of the HST. The pump speed itself follows from Equation (3).

$$\omega_{hpm} = \frac{q_{c}}{D_{hpm} \eta_{v}} \tag{3}$$

where $\eta_{v}$ represents the volumetric efficiency of the HPM, $q_{c}$ indicates its flow rate, and $D_{hpm}$ is its displacement. By applying Equations (2) and (3), the appropriate HPM displacement and HST ratio can be identified based on the necessary flow rate. This approach leads to numerous potential combinations for the ICE and EMG speeds, as described by Equation (1).
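As a rough illustration of how Equations (1) to (3) chain together, the following Python sketch computes the HPM speed from a required flow rate, converts it to a ring gear speed through the HST ratio, and then solves Equation (1) for the carrier speed. The function names, the numerical values, and the unit choices (cm³/min, cm³/rev, rev/min) are assumptions made for illustration only; they are not taken from the system specification.

```python
def hpm_speed(q_c, d_hpm, eta_v):
    """Equation (3): HPM speed from flow rate, displacement, and volumetric efficiency."""
    return q_c / (d_hpm * eta_v)


def ring_speed(i_hst, omega_hpm):
    """Equation (2) rearranged for the ring gear speed."""
    return i_hst * omega_hpm


def carrier_speed(omega_r, omega_s, d_r, d_s):
    """Equation (1) solved for the carrier speed."""
    return (omega_r * d_r + omega_s * d_s) / (d_r + d_s)


# Hypothetical operating point (illustrative values, not from the paper).
q_c = 20000.0        # required flow rate, cm^3/min
d_hpm = 20.0         # HPM displacement, cm^3/rev
eta_v = 0.95         # volumetric efficiency
i_hst = 1.0          # HST ratio
d_r, d_s = 3.0, 1.0  # ring and sun gear diameters (any consistent unit)
omega_s = 500.0      # sun gear speed, rev/min

omega_hpm = hpm_speed(q_c, d_hpm, eta_v)
omega_r = ring_speed(i_hst, omega_hpm)
omega_ca = carrier_speed(omega_r, omega_s, d_r, d_s)
print(omega_hpm, omega_r, omega_ca)
```

Once the ring gear speed is fixed by the flow demand, Equation (1) still leaves one degree of freedom, so sweeping `omega_s` in this sketch produces the family of speed combinations mentioned above.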

Equation (4) determines the torque for each gear [4].

$$T_{r} : T_{s} : T_{ca} = d_{r} : d_{s} : (d_{r} + d_{s}) \tag{4}$$

where $T_{ca}$, $T_{s}$, and $T_{r}$ represent the torque values associated with the carrier, sun, and ring gears, respectively. The torque generated by the HPM is determined by the pressure at its output, as outlined in Equation (5).

$$T_{hpm} = \frac{p_{c} D_{hpm}}{2 \pi \eta_{m}} = i_{hst} T_{r} \tag{5}$$

where $T_{hpm}$ signifies the torque produced by the HPM, $p_{c}$ is the pressure at the HPM’s output, $D_{hpm}$ is the displacement of the HPM, and $\eta_{m}$ is its hydro-mechanical efficiency. Through the HST ratio, $T_{hpm}$ determines the torque of the ring gear.

By applying Equations (4) and (5), it is possible to determine $T_{ca}$ and $T_{s}$, as given in Equations (6) and (7).

$$T_{ca} = \frac{(d_{r} + d_{s}) \, p_{c} D_{hpm}}{2 \pi \, i_{hst} \, d_{r} \, \eta_{m} \eta_{hst}} \tag{6}$$

$$T_{s} = \frac{d_{s} \, p_{c} D_{hpm}}{2 \pi \, i_{hst} \, d_{r} \, \eta_{m} \eta_{hst}} \tag{7}$$

where $p_{c}$ represents the critical pressure in the system, essential for analyzing the hydraulic performance and efficiency.
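The torque relations in Equations (5) to (7) can be checked numerically with a short Python sketch. All values below are hypothetical, SI units are assumed (pressure in Pa, displacement in m³/rev, torque in N·m), and $\eta_{hst}$ is taken here to be an HST efficiency, which is an interpretation because the text does not define it.

```python
import math


def hpm_torque(p_c, d_hpm, eta_m):
    """Equation (5): HPM torque from outlet pressure and displacement."""
    return p_c * d_hpm / (2.0 * math.pi * eta_m)


def carrier_torque(p_c, d_hpm, i_hst, d_r, d_s, eta_m, eta_hst):
    """Equation (6): torque on the carrier."""
    return (d_r + d_s) * p_c * d_hpm / (2.0 * math.pi * i_hst * d_r * eta_m * eta_hst)


def sun_torque(p_c, d_hpm, i_hst, d_r, d_s, eta_m, eta_hst):
    """Equation (7): torque on the sun gear."""
    return d_s * p_c * d_hpm / (2.0 * math.pi * i_hst * d_r * eta_m * eta_hst)


# Hypothetical operating point (illustrative values, not from the paper).
p_c = 10.0e6         # HPM outlet pressure, Pa
d_hpm = 20.0e-6      # HPM displacement, m^3/rev
i_hst = 1.0          # HST ratio
d_r, d_s = 3.0, 1.0  # ring and sun gear diameters (any consistent unit)
eta_m = 0.90         # hydro-mechanical efficiency of the HPM
eta_hst = 0.85       # assumed HST efficiency

t_hpm = hpm_torque(p_c, d_hpm, eta_m)
t_ca = carrier_torque(p_c, d_hpm, i_hst, d_r, d_s, eta_m, eta_hst)
t_s = sun_torque(p_c, d_hpm, i_hst, d_r, d_s, eta_m, eta_hst)
print(t_hpm, t_ca, t_s)
```

The ratio `t_ca / t_s` in this sketch equals $(d_{r} + d_{s}) / d_{s}$, which is the proportion required by Equation (4).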

2.2. Analysis of Critical Parameters

Table 1 summarizes the key specifications of the system’s primary components. In this configuration, a main pump is utilized, which plays a vital role in system operation. The ICE and EMG deliver essential energy, with their efficiency characteristics detailed in [4]. Furthermore, the battery’s energy capacity is evaluated; to prolong its durability by avoiding severe depletion and excessive charging, the working range is generally kept within 30–90% of full capacity [9].

2.3. Energy Management Strategy

As highlighted previously, the energy management strategy (EMS) plays a pivotal role in determining the system’s overall energy performance. To further improve performance, we adopt an equivalent consumption minimization strategy (ECMS), designed specifically to identify operating conditions that minimize fuel consumption in HEs. Given the unique attributes of our system, the ECMS serves as the EMS. The underlying cost function, detailed in Equation (8), provides a quantitative framework for enhancing fuel economy [4].

$$\dot{m}_{\Sigma} = \dot{m}_{eng} + s(t) \times \dot{m}_{mot} \tag{8}$$

where $\dot{m}_{\Sigma}$ signifies the overall fuel usage, while $\dot{m}_{eng}$ and $\dot{m}_{mot}$ denote the energy consumption rates for the engine and the motor, respectively. The equivalence factor $s(t)$ converts electrical energy consumption into its fuel equivalent, as delineated by Equation (9) [10].

$$s(t) = \begin{cases} \dfrac{1}{\eta_{eng} \times \eta_{eng} \times \eta_{mge}} \times \left(1 - \left(\dfrac{SOC - SOC_{a}}{4 \times (SOC_{max} - SOC_{min})}\right)^{3}\right), & \omega_{s} T_{s} > 0 \\ \dfrac{\eta_{mot} \times \eta_{d}}{\eta_{eng}} \times \left(1 - \left(\dfrac{SOC - SOC_{a}}{0.5 \times (SOC_{max} - SOC_{min})}\right)^{3}\right), & \omega_{s} T_{s} \leq 0 \end{cases} \tag{9}$$

where $\eta_{eng}$ corresponds to the efficiency of the engine; $\eta_{mge}$ indicates the power generation performance of the motor; $\eta_{mdr}$ signifies the operational efficiency of the motor in driving mode; and $\eta_{d}$ and $\eta_{ch}$ represent the discharging and charging performance metrics of the battery, respectively. Additionally, $SOC_{max}$ signifies the maximum state of charge, $SOC_{min}$ denotes the minimum state of charge, and $SOC_{a}$ indicates the average state of charge, calculated over a specific operational cycle.

Equation (10) determines the SOC for a battery [11], taking into account multiple parameters: the starting state of charge $SOC_{init}$, the charge transfer efficiency $\eta_{bat}$, the motor power output $P_{mot}$, and the battery’s rated energy storage $E_{bat}$.

$$SOC = SOC_{init} + \int_{0}^{t} \frac{P_{mot}}{3600 \, \eta_{bat} E_{bat}} \, dt \tag{10}$$

where $P_{mot}$ can be expressed by Equation (11):

$$P_{mot} = \frac{\omega_{s} T_{s}}{9550} \tag{11}$$
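Equations (10) and (11) can be evaluated on sampled data with a simple rectangular integration, as in the Python sketch below. It assumes that speed is given in rev/min and torque in N·m (so that the factor 9550 yields power in kW), that $E_{bat}$ is in kWh, and that time is in seconds (hence the factor 3600). These unit choices, the function names, and the numbers are illustrative assumptions. The sign convention follows Equation (10) exactly as written.

```python
def motor_power_kw(omega_s_rpm, torque_s_nm):
    """Equation (11): motor power in kW from speed (rev/min) and torque (N*m)."""
    return omega_s_rpm * torque_s_nm / 9550.0


def soc_trajectory(soc_init, power_kw, dt_s, eta_bat, e_bat_kwh):
    """Equation (10), discretized with a fixed time step dt_s (seconds)."""
    soc = [soc_init]
    for p in power_kw:
        soc.append(soc[-1] + p * dt_s / (3600.0 * eta_bat * e_bat_kwh))
    return soc


# Hypothetical profile: constant 10 kW for 360 s on a 10 kWh battery.
p_const = motor_power_kw(1000.0, 95.5)  # 1000 * 95.5 / 9550 = 10 kW
trajectory = soc_trajectory(0.5, [p_const] * 360, dt_s=1.0, eta_bat=1.0, e_bat_kwh=10.0)
print(trajectory[0], trajectory[-1])
```

In a controller, the resulting SOC would typically be compared against the working range (30–90% of full capacity in this study) before the equivalence factor is evaluated.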

In the ECMS, $\dot{m}_{eng}$ and $\dot{m}_{mot}$ can be expressed by Equations (12) and (13) [12].

$$\dot{m}_{eng} = \int_{0}^{t} \frac{\omega_{ca} T_{ca}}{9550 \, \eta_{eng}} \, dt \tag{12}$$

$$\dot{m}_{mot} = \begin{cases} \displaystyle\int_{0}^{t} \frac{\omega_{s} T_{s}}{9550 \times \eta_{mge} \eta_{bat}} \, dt, & P_{mot} > 0 \\ \displaystyle\int_{0}^{t} \frac{\omega_{s} T_{s} \eta_{mot} \eta_{bat}}{9550} \, dt, & P_{mot} \leq 0 \end{cases} \tag{13}$$

where $\eta_{eng}$, $\eta_{mot}$, and $\eta_{bat}$ indicate the efficiencies of the engine, motor, and battery, respectively, while $\dot{m}_{eng}$ and $\dot{m}_{mot}$ represent the energy outputs of the ICE and the battery’s equivalent contribution.

With the system setup and energy management strategy established, the next section delves into the core principles of the hybrid BWO-LSTM-RF control algorithm.

3. Core Principles of Hybrid Algorithmic Control

To improve both positional accuracy and power efficiency within the system, this study employs the BWO-LSTM-RF controller to orchestrate the interplay of its key components. Specifically, cylinder positioning is refined through precise valve adjustments that modulate the control valve’s aperture, while the HST optimizes energy usage by regulating system dynamics. This synergistic approach, facilitated by the BWO-LSTM-RF framework, is depicted in Figure 2, with its operational details elaborated in the subsequent subsections. Within this schematic, $x_{d}$ represents the desired command (for instance, a specified position or speed), while $y_{r}$ denotes the measured output, providing feedback to ensure that the system’s response closely follows $x_{d}$ and minimizes any deviation.

3.1. Beluga Whale Optimization (BWO) Algorithms

Zhong et al. [13] introduced the beluga whale optimization (BWO) algorithm, which mimics the hunting patterns of beluga whale groups along with the phenomenon of whale fall. This algorithm is primarily structured into three phases: exploration, exploitation, and whale fall, with detailed procedures outlined as follows:

Step 1: Define the position matrix for beluga whales in the search space:

$$Y = \begin{bmatrix} y_{1,1} & y_{1,2} & \dots & y_{1,d} \\ y_{2,1} & y_{2,2} & \dots & y_{2,d} \\ \vdots & \vdots & \ddots & \vdots \\ y_{n,1} & y_{n,2} & \dots & y_{n,d} \end{bmatrix} \tag{14}$$

where $n$ represents the population size of beluga whales and $d$ indicates the dimensionality of the variables. The associated fitness value for each beluga whale is as follows:

$$F_{Y} = \begin{bmatrix} f(y_{1,1}, y_{1,2}, \dots, y_{1,d}) \\ f(y_{2,1}, y_{2,2}, \dots, y_{2,d}) \\ \vdots \\ f(y_{n,1}, y_{n,2}, \dots, y_{n,d}) \end{bmatrix} \tag{15}$$

Subsequently, the transition from the exploration phase to the exploitation phase hinges on the equilibrium factor $B_{f}$, formulated as follows:

$$B_{f} = B_{0} \left(1 - \frac{t}{2T}\right) \tag{16}$$

Here, $t$ signifies the current iteration number, $T$ denotes the maximum number of iterations, and $B_{0}$ is a random value between 0 and 1. A value of $B_{f} > 0.5$ indicates that the population is in the exploration phase, while $B_{f} \leq 0.5$ signifies the exploitation phase.

Step 2: The exploration phase of the algorithm is modeled based on the swimming patterns of beluga whales. The position update equation for beluga whales is as follows:

$$\begin{cases} Y_{i,j}^{t+1} = Y_{i,p_{j}}^{t} + (Y_{r,p_{1}}^{t} - Y_{i,p_{j}}^{t}) (1 + r_{1}) \sin(2 \pi r_{2}), & j = \text{even} \\ Y_{i,j}^{t+1} = Y_{i,p_{j}}^{t} + (Y_{r,p_{1}}^{t} - Y_{i,p_{j}}^{t}) (1 + r_{1}) \cos(2 \pi r_{2}), & j = \text{odd} \end{cases} \tag{17}$$

where $Y_{i,j}^{t+1}$ indicates the updated position of the i-th beluga whale in the j-th dimension, $p_{j}$ represents a random index within the d-dimensional space, $Y_{i,p_{j}}^{t}$ denotes the current position of the i-th beluga whale in dimension $p_{j}$, and $Y_{r,p_{1}}^{t}$ is the current position of a randomly selected beluga whale $r$. The random numbers $r_{1}$ and $r_{2}$, drawn from $(0, 1)$, enhance the stochastic nature of the update. The terms $\sin(2 \pi r_{2})$ and $\cos(2 \pi r_{2})$ reflect the fin orientations of beluga whales in odd or even dimensions, capturing their synchronized or mirrored movements during swimming or diving.
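The phase selection of Equation (16) and the exploration update of Equation (17) can be sketched in plain Python as follows. This is a minimal reading of the equations as written in this section, not the reference BWO implementation: the way the random indices $p_{j}$ are drawn (a random permutation of the dimensions), the use of one $r_{1}$, $r_{2}$ pair per whale, and all function names are assumptions.

```python
import math
import random


def balance_factor(t, T, rng):
    """Equation (16): B_f = B_0 * (1 - t / (2T)), with B_0 drawn from [0, 1)."""
    return rng.random() * (1.0 - t / (2.0 * T))


def explore(Y, i, rng):
    """Equation (17): exploration update for whale i (dimension j counted from 1)."""
    n, d = len(Y), len(Y[0])
    r = rng.randrange(n)         # randomly selected whale
    p = rng.sample(range(d), d)  # assumed: random permutation gives p_1, ..., p_d
    r1, r2 = rng.random(), rng.random()
    new_position = []
    for j in range(1, d + 1):
        trig = math.sin if j % 2 == 0 else math.cos  # sin for even j, cos for odd j
        base = Y[i][p[j - 1]]
        step = (Y[r][p[0]] - base) * (1.0 + r1) * trig(2.0 * math.pi * r2)
        new_position.append(base + step)
    return new_position


rng = random.Random(0)
population = [[rng.uniform(-1.0, 1.0) for _ in range(4)] for _ in range(5)]
if balance_factor(t=1, T=100, rng=rng) > 0.5:
    print("exploration:", explore(population, 0, rng))
else:
    print("exploitation phase selected for this whale")
```

Because $B_{0}$ is random and the factor $(1 - t/2T)$ shrinks from 1 to 0.5, exploration becomes less likely as iterations progress and is excluded entirely at $t = T$.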

Step 3: The exploitation phase of the algorithm is modeled after the foraging behavior of beluga whales and incorporates the Levy flight strategy to boost convergence, expressed as follows:

$$Y_{i}^{t+1} = r_{3} Y_{best}^{t} - r_{4} Y_{i}^{t} + C_{1} \cdot L_{F} \cdot (Y_{r}^{t} - Y_{i}^{t}) \tag{18}$$

where $Y_{best}^{t}$ is the optimal position found, $r_{3}$ and $r_{4}$ are random numbers between 0 and 1, $Y_{i}^{t}$ denotes the current position of the i-th beluga whale, and $Y_{r}^{t}$ indicates the position of a randomly selected beluga whale. The parameter $C_{1} = 2 r_{4} (1 - t / T_{max})$ measures the intensity of random jumps in the Levy flight.

The Levy flight function $L_{F}$ is defined as follows:

$$L_{F} = 0.05 \times \frac{u \times \sigma}{|v|^{1/\beta}} \tag{19}$$

$$\sigma = \left(\frac{\Gamma(1 + \beta) \times \sin(\pi \beta / 2)}{\Gamma((1 + \beta)/2) \times \beta \times 2^{(\beta - 1)/2}}\right)^{1/\beta} \tag{20}$$

where $u$ and $v$ are random numbers following a normal distribution, with $\beta$ set to 1.5.
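A compact Python sketch of Equations (18) to (20) is given below. It assumes standard normal draws for $u$ and $v$, draws a fresh Levy step for every dimension, and treats $T_{max}$ as the maximum iteration count $T$; these choices and the function names are illustrative assumptions, not details confirmed by the text.

```python
import math
import random

BETA = 1.5  # Levy exponent used in the text


def levy_sigma(beta=BETA):
    """Equation (20): scale factor of the Levy flight."""
    num = math.gamma(1.0 + beta) * math.sin(math.pi * beta / 2.0)
    den = math.gamma((1.0 + beta) / 2.0) * beta * 2.0 ** ((beta - 1.0) / 2.0)
    return (num / den) ** (1.0 / beta)


def levy_flight(rng, beta=BETA):
    """Equation (19): one Levy step from two standard normal draws (assumed)."""
    u, v = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
    return 0.05 * u * levy_sigma(beta) / abs(v) ** (1.0 / beta)


def exploit(Y, i, best, t, T, rng):
    """Equation (18): exploitation update for whale i."""
    r = rng.randrange(len(Y))  # randomly selected whale
    r3, r4 = rng.random(), rng.random()
    c1 = 2.0 * r4 * (1.0 - t / T)  # jump intensity C_1
    return [
        r3 * best[j] - r4 * Y[i][j] + c1 * levy_flight(rng) * (Y[r][j] - Y[i][j])
        for j in range(len(best))
    ]


rng = random.Random(0)
whales = [[rng.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(5)]
best = whales[0]  # placeholder for the best position found so far
print(levy_sigma())
print(exploit(whales, 1, best, t=10, T=100, rng=rng))
```

For $\beta = 1.5$, the scale factor evaluates to roughly 0.697, so the 0.05 prefactor keeps typical Levy steps small while still allowing occasional long jumps when $|v|$ is close to zero.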

Step 4: Model the whale-fall phase when a beluga whale faces threats, updating the position equation as follows:

$$Y_{i}^{t+1} = r_{5} Y_{i}^{t} - r_{6} Y_{r}^{t} + r_{7} Y_{step} \tag{21}$$

where $r_{5}$, $r_{6}$, and $r_{7}$ are random numbers between 0 and 1.

The whale-fall step size $Y_{step}$ is formulated as follows:

$$Y_{step} = (u_{b} - l_{b}) \exp\left(- C_{2} \frac{t}{T}\right) \tag{22}$$

where $C_{2} = 2 W_{f} \times n$ represents the step factor linked to the whale-fall probability and population size, and $u_{b}$ and $l_{b}$ are the upper and lower boundaries of the variables.

The whale-fall probability $W_{f}$ is given by the following:

$$W_{f} = 0.1 - 0.05 \, t / T \tag{23}$$
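The whale-fall phase in Equations (21) to (23) can be sketched as below. Clipping the result to the search bounds is an added assumption (the text does not say how out-of-range positions are handled), and the function names and example values are hypothetical.

```python
import math
import random


def whale_fall_probability(t, T):
    """Equation (23): probability decreasing linearly from 0.1 to 0.05."""
    return 0.1 - 0.05 * t / T


def whale_fall_step(ub, lb, t, T, n):
    """Equation (22): per-dimension step size, with C_2 = 2 * W_f * n."""
    c2 = 2.0 * whale_fall_probability(t, T) * n
    return [(u - l) * math.exp(-c2 * t / T) for u, l in zip(ub, lb)]


def whale_fall(Y, i, ub, lb, t, T, rng):
    """Equation (21): whale-fall update for whale i, clipped to [lb, ub] (assumed)."""
    n = len(Y)
    r = rng.randrange(n)  # randomly selected whale
    r5, r6, r7 = rng.random(), rng.random(), rng.random()
    step = whale_fall_step(ub, lb, t, T, n)
    new = [r5 * Y[i][j] - r6 * Y[r][j] + r7 * step[j] for j in range(len(step))]
    return [min(max(x, l), u) for x, l, u in zip(new, lb, ub)]


rng = random.Random(0)
pod = [[1.0, 2.0], [3.0, 4.0], [0.5, 4.5]]
lower, upper = [0.0, 0.0], [5.0, 5.0]
for i in range(len(pod)):
    # Apply the update only when a random draw falls below W_f.
    if rng.random() < whale_fall_probability(t=50, T=100):
        pod[i] = whale_fall(pod, i, upper, lower, t=50, T=100, rng=rng)
print(pod)
```

Since $C_{2}$ scales with the population size, larger populations shrink the step size faster over the course of the run.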

3.2. Long Short-Term Memory (LSTM) Algorithm

LSTM [14,15,16] is a notable variant of recurrent neural networks (RNNs) that mitigates the vanishing gradient problem common in conventional RNNs. It accomplishes this by utilizing three primary gates: the forget gate, the input gate, and the output gate. These gates operate in concert to selectively capture and update long-term information, thereby enabling the network to model dependencies across lengthy sequences. In practical applications, LSTM has been effectively applied to various time-series prediction tasks, including forecasting crude oil prices and estimating the operational lifespan of electronic components such as power MOSFET devices. Figure 3 illustrates the internal structure of an LSTM cell, showcasing how these gates coordinate to manage data flow and preserve long-range dependencies.

The forget gate employs a logistic (sigmoid) activation to determine the proportion of historical data to retain in the cell state, where a value of 1 signifies full retention of past information while 0 indicates complete removal:

$$g_{f} = \mathrm{sigmoid}(W_{f} \cdot [h_{t-1}, x_{t}] + b_{f}) \tag{24}$$

The input gate controls the extent to which new information enters the cell state. It scales a candidate state, which is then combined with the retained portion of the previous cell state. This process is mathematically described as follows:

$$g_{i} = \mathrm{sigmoid}(W_{i} \cdot [h_{t-1}, x_{t}] + b_{i}) \tag{25}$$

$$\tilde{S}_{t} = \tanh(W_{s} \cdot [h_{t-1}, x_{t}] + b_{s}) \tag{26}$$

$$S_{t} = g_{f} \cdot S_{t-1} + g_{i} \cdot \tilde{S}_{t} \tag{27}$$

The output gate calculates its result by integrating the prior state with the present input, which can be expressed mathematically as follows:

$$g_{o} = \mathrm{sigmoid}(W_{o} \cdot [h_{t-1}, x_{t}] + b_{o}) \tag{28}$$

$$h_{t} = g_{o} \cdot \tanh(S_{t}) \tag{29}$$

In this framework, $x_{t}$ represents the input vector at the current time step. The weight matrices $W_{f}$, $W_{i}$, and $W_{o}$, along with their corresponding bias terms $b_{f}$, $b_{i}$, and $b_{o}$, are associated with the forget, input, and output gates, respectively. The prior hidden state is indicated by $h_{t-1}$. The outputs from these gates are denoted as $g_{f}$, $g_{i}$, and $g_{o}$. Moreover, $S_{t}$ signifies the current cell state while $\tilde{S}_{t}$ denotes the candidate state. Lastly, the parameters $W_{s}$ and $b_{s}$ govern the integration of new information into the temporary cell memory.
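To make the gate equations concrete, the sketch below performs a single LSTM step, Equations (24) to (29), in plain Python with lists standing in for vectors. The products between gate outputs and states are taken element-wise. The parameter dictionary layout, the helper names, and the all-zero weights in the example are assumptions chosen so the result is easy to verify by hand; a trained network would use learned weights.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(W, b, v):
    """Compute W . v + b for a weight matrix given as a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) + bias for row, bias in zip(W, b)]


def lstm_step(x_t, h_prev, s_prev, p):
    """One LSTM step; returns the new hidden state h_t and cell state S_t."""
    z = h_prev + x_t                                                  # [h_{t-1}, x_t]
    g_f = [sigmoid(a) for a in affine(p["W_f"], p["b_f"], z)]         # Eq. (24)
    g_i = [sigmoid(a) for a in affine(p["W_i"], p["b_i"], z)]         # Eq. (25)
    s_cand = [math.tanh(a) for a in affine(p["W_s"], p["b_s"], z)]    # Eq. (26)
    s_t = [f * s + i * c
           for f, s, i, c in zip(g_f, s_prev, g_i, s_cand)]           # Eq. (27)
    g_o = [sigmoid(a) for a in affine(p["W_o"], p["b_o"], z)]         # Eq. (28)
    h_t = [o * math.tanh(s) for o, s in zip(g_o, s_t)]                # Eq. (29)
    return h_t, s_t


# Hypothetical cell with 2 hidden units, 1 input, and all-zero parameters.
hidden, inputs = 2, 1
params = {name: [[0.0] * (hidden + inputs) for _ in range(hidden)]
          for name in ("W_f", "W_i", "W_s", "W_o")}
params.update({name: [0.0] * hidden for name in ("b_f", "b_i", "b_s", "b_o")})

h, s = lstm_step([0.3], [0.0, 0.0], [2.0, -2.0], params)
print(h, s)
```

With zero weights every gate outputs 0.5 and the candidate state is 0, so the cell state is simply halved, which shows the forget gate acting as a retention factor.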

3.3. Random Forest (RF) Technique

The random forest (RF) is an ensemble learning approach developed by Breiman to address classification challenges [17]. It mitigates overfitting by aggregating numerous decision trees, each constructed from distinct bootstrap samples of the dataset [18]. These trees are grown to their fullest depth or halted by a termination condition, employing a randomly chosen subset of features at each node to reduce impurity, typically assessed via the Gini index:

$$G(t) = 1 - \sum_{k=1}^{K} p_{k}^{2} \tag{30}$$

where $p_{k}$ represents the fraction of samples belonging to class $k$. Each tree produces a predicted class $\tilde{y}_{i}(x)$ for an input $x$, and the final RF prediction $\tilde{y}(x)$ is derived through a majority vote across all $N$ tree predictions:

$$\tilde{y}(x) = \mathrm{majority}(\{\tilde{y}_{1}(x), \tilde{y}_{2}(x), \dots, \tilde{y}_{N}(x)\}) \tag{31}$$

where “majority” refers to the most common class among the classifications from all trees. This method not only curtails overfitting but also enhances generalization capabilities, positioning the RF as a robust and dependable tool for classification tasks [19]. Figure 4 depicts the structure of the RF classification framework.
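The two RF building blocks in Equations (30) and (31) are small enough to write out directly. The sketch below uses only the Python standard library; the function names and the example labels are hypothetical.

```python
from collections import Counter


def gini(labels):
    """Equation (30): Gini impurity of the class labels at one node."""
    m = len(labels)
    return 1.0 - sum((count / m) ** 2 for count in Counter(labels).values())


def majority_vote(tree_predictions):
    """Equation (31): most common class among the per-tree predictions."""
    return Counter(tree_predictions).most_common(1)[0][0]


# Hypothetical node contents and per-tree votes.
print(gini(["up", "up", "down", "down"]))           # evenly mixed node
print(gini(["up", "up", "up"]))                     # pure node
print(majority_vote(["up", "down", "up", "up", "down"]))
```

A pure node has zero impurity, and an evenly split two-class node has impurity 0.5. In this sketch, ties in the vote are resolved in favor of the class encountered first.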

3.4. Assessment Metrics for Controller Effectiveness

In this study, we evaluate the performance of the proposed controller using two key metrics: positional accuracy and system energy usage. To measure positional accuracy, the cylinder displacement error is assessed through the root mean square error (RMSE) and the mean absolute error (MAE). These metrics facilitate a quantitative comparison between the proposed controller and other controllers, allowing for well-informed conclusions regarding their ability to achieve precise and accurate control of the cylinder’s position. The RMSE and MAE are defined as follows:

$$RMSE = \sqrt{\frac{1}{m} \sum_{j=1}^{m} e_{j}^{2}} \tag{32}$$

$$MAE = \frac{1}{m} \sum_{j=1}^{m} |e_{j}| \tag{33}$$

where $e_{j}$ ($j = 1, 2, 3, \dots, m$) denotes the j-th displacement error sample and $m$ is the total number of samples.
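Both metrics follow directly from the sampled displacement error, as in this minimal Python sketch. The error values are made up for illustration and are not measurements from the study.

```python
import math


def rmse(errors):
    """Equation (32): root mean square error."""
    return math.sqrt(sum(e * e for e in errors) / len(errors))


def mae(errors):
    """Equation (33): mean absolute error."""
    return sum(abs(e) for e in errors) / len(errors)


# Hypothetical displacement errors in millimetres.
errors_mm = [3.0, -4.0, 0.0, 0.0]
print(rmse(errors_mm), mae(errors_mm))
```

Because the RMSE squares each sample, it penalizes occasional large deviations more heavily than the MAE, which is why the two metrics are reported together.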

4. Performance Evaluation and Experimental Analysis

To rigorously evaluate the effectiveness of the proposed control approach, this study conducted comprehensive assessments across multiple scenarios using both simulations and experimental trials.

The performance and precision of the BWO-LSTM-RF algorithm were validated through data collected from a test bench designed to replicate boom cylinder dynamics. Integrated with an ECMS for energy management, the algorithm was implemented and controlled via Simulink, enabling precise trajectory tracking and optimized energy usage. Concurrently, a simulation model incorporating BWO-LSTM-RF and an ECMS was developed and tested in Python 3.8.8, with its lifting performance benchmarked against experimental results from the test bench. The close correspondence between simulation predictions and experimental outcomes underscores the algorithm’s accuracy, with only minor deviations observed between the virtual and real-world data.

4.1. Development of the Simulation Environment

4.1.1. Model Configuration

A simulation environment was constructed using AMESim (version 2310), as depicted in Figure 5. This setup was designed to assess the controllability of the system and validate its potential for energy savings.

4.1.2. Evaluation of Simulation Outcomes and Discussion

Figure 6 illustrates a comparative analysis of the boom cylinder’s position along with its tracking error profiles during both the upward and downward movements. Both the LSTM-RF and BWO-LSTM-RF controllers outperform the PID controller in tracking precision, with the BWO-LSTM-RF variant achieving enhanced accuracy and faster convergence by fine-tuning four key hyperparameters of the LSTM-RF framework. To ensure an objective evaluation, the mean absolute error (MAE) and root mean square error (RMSE) for all three controllers are presented in Figure 7a,b. Table 2 shows that BWO-LSTM-RF consistently delivers the lowest RMSE, reduced to approximately 8.59% of that of PSO-LSTM-GRU-PID, 36.97% of that of LSTM-RF, and 60.7% of that of PID. The MAE values follow a similar trend, being reduced to 7.33% of that of PSO-LSTM-GRU-PID, 36.54% of that of LSTM-RF, and 60.69% of that of PID. These findings underscore the marked improvement in tracking accuracy attributable to optimized hyperparameter settings.

In most instances, the controller that we developed exhibits exceptional positional accuracy, primarily due to the effective integration of its system components. To further evaluate the system’s performance, Figure 8 presents the ICE’s operational characteristics: the speed profile (Figure 8a), the torque characteristics (Figure 8b), and the energy consumption (Figure 8c). Beyond positional accuracy, energy usage remains a vital performance indicator. A mode-by-mode analysis reveals that the BWO-LSTM-RF controller achieves energy consumption reductions of 3.301 kJ, 7.304 kJ, and 10.997 kJ compared to the PSO-LSTM-GRU-PID, LSTM-RF, and PID controllers, respectively. Collectively, these findings affirm that the proposed controller offers remarkable energy efficiency.

In conclusion, the simulation results provide a thorough assessment of the BWO-LSTM-RF controller’s effectiveness in the system. Even amid complex noise, this controller sustains exceptional positional accuracy while achieving notable energy reductions across various operating conditions. Remarkably, it delivers energy savings of 3.69% compared to the PSO-LSTM-GRU-PID controller, 7.83% compared to the LSTM-RF controller, and 11.33% compared to the PID controller, as validated by statistical tests. Moreover, the improved efficiency is further corroborated by the comparative data summarized in Table 3.
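The relation between an absolute energy reduction and a percentage saving can be made explicit with a short Python sketch. It assumes that the reductions of 3.301 kJ, 7.304 kJ, and 10.997 kJ and the savings of 3.69%, 7.83%, and 11.33% refer to the same simulated cycle, and it back-calculates the baseline energies they would imply. These implied totals are derived here for illustration and are not values reported in the text.

```python
def percent_saving(e_baseline, e_proposed):
    """Saving of the proposed controller relative to a baseline, in percent."""
    return 100.0 * (e_baseline - e_proposed) / e_baseline


def implied_baseline(delta_kj, saving_percent):
    """Baseline energy implied by an absolute reduction and a percentage saving."""
    return delta_kj / (saving_percent / 100.0)


# Reported (reduction in kJ, saving in %) pairs for each baseline controller.
reported = {
    "PSO-LSTM-GRU-PID": (3.301, 3.69),
    "LSTM-RF": (7.304, 7.83),
    "PID": (10.997, 11.33),
}

for name, (delta, pct) in reported.items():
    baseline = implied_baseline(delta, pct)  # back-calculated, not reported
    proposed = baseline - delta              # implied BWO-LSTM-RF energy
    print(name, round(baseline, 2), round(proposed, 2))
```

If the three pairs are mutually consistent, the implied energy of the proposed controller should come out nearly the same in every row, which offers a quick sanity check on the reported figures.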

To further validate the energy savings reported in Table 3, we conducted statistical tests to compare the energy consumption of the BWO-LSTM-RF controller against each baseline controller (PSO-LSTM-GRU-PID, LSTM-RF, and PID). Time-series energy consumption data were extracted from Figure 8c, providing 70 paired samples per controller (assuming a 1 s sampling interval over 69.49 s). The Shapiro–Wilk test indicated non-normal distributions for all differences (e.g., $p = 0.0130 < 0.05$ for BWO-LSTM-RF vs. PSO-LSTM-GRU-PID), so we performed Wilcoxon signed-rank tests. All comparisons showed statistically significant energy savings with p-values far below the 0.05 threshold ($p < 10^{-4}$), meaning that differences this large would be extremely unlikely (probability below 0.01%) if the controllers did not actually differ in energy consumption:

BWO-LSTM-RF vs. PSO-LSTM-GRU-PID: 3.69% savings ( $p < 10^{- 4}$ ).
BWO-LSTM-RF vs. LSTM-RF: 7.83% savings ( $p < 10^{- 4}$ ).
BWO-LSTM-RF vs. PID: 11.33% savings ( $p < 10^{- 4}$ ).

An analysis of algorithm runtimes, as detailed in Table 4, indicates that the PSO-LSTM-GRU-PID controller requires the highest training duration at 1977.65 s, with the BWO-LSTM-RF and LSTM-RF controllers requiring 1680.93 and 1185.45 s, respectively. The incorporation of PSO markedly extends the training period compared to the less computationally intensive LSTM-RF, while PID, owing to its simplicity, requires no training phase. Notably, all algorithms share an identical testing time of 69.49 s, suggesting uniform testing efficiency despite their varying training complexities.

4.2. Experimental Setup and Methodology

4.2.1. System Configuration and Instrumentation

A dedicated experimental testbed was constructed to assess the real-world performance of the BWO-LSTM-RF controller within the system. As depicted in Figure 9, this setup was engineered to replicate the operating conditions of a hydraulic excavator boom in a laboratory environment. The controller was deployed using MATLAB/Simulink software R2023a and connected via Simulink Desktop Real-Time. Communication with system components was facilitated through a PCI card that transmitted signals to an electrical enclosure managing the ICE (1) along with other key elements. For data acquisition, sensors measuring speed and torque were affixed to the ICE shaft to monitor energy consumption, while a sensor on the cylinder was employed to assess tracking performance.

In the experimental phase, four controllers were evaluated side-by-side to emphasize the practical advantages of the proposed method. Additionally, during cylinder extraction, the efficiency map of the ICE (illustrated in Figure 10) confirms that performance is a function of both torque and speed.

A robust control framework was developed using MATLAB/Simulink, as shown in Figure 11, to govern the real-time operation of a hydraulic actuator. This framework not only replicates the system dynamics for validation of the proposed control strategy but also actively manages the experimental setup, ensuring precise actuator positioning and enhanced energy efficiency.

By utilizing this MATLAB/Simulink-based control framework, we implemented the BWO-LSTM-RF model. Table 5 presents an overview of the key parameters and features of the algorithm used in our research, describing the model’s structure, input sequence duration, activation mechanisms, and application of the Adam optimizer. Furthermore, the table specifies how BWO is utilized for hyperparameter optimization, with the goal of reducing the mean squared error (MSE).

4.2.2. Performance Evaluation and Analysis

Figure 12 and Figure 13 illustrate the tracking performance metrics for the four controllers evaluated in this study. The data indicate that the proposed BWO-LSTM-RF controller consistently outperforms the PID, LSTM-RF, and PSO-LSTM-GRU-PID controllers, maintaining a maximum positioning error of less than 0.02 m (see Figure 12). Additionally, the BWO-LSTM-RF controller exhibits faster response times and more rapid convergence across various operational conditions, including different raising and lowering scenarios with variable speeds and loads, thanks to its optimized control parameters. Table 6 and Figure 13a,b demonstrate that BWO-LSTM-RF consistently achieves the lowest error metrics, with reported RMSE reductions of 9.41% relative to PSO-LSTM-GRU-PID, 34.88% relative to LSTM-RF, and 61.34% relative to PID, and reported MAE reductions of 6.55%, 33.66%, and 60.86%, respectively. These results underscore the benefit of the BWO-based tuning.
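As an illustration of how the error metrics in Table 6 are typically obtained, the following minimal sketch computes RMSE, MAE, and a relative reduction between two controllers. The function names and the sample trajectories are hypothetical and chosen only for demonstration; they are not the experimental data.

```python
import math


def rmse(reference, measured):
    """Root-mean-square error between two equally long sequences."""
    errors = [r - m for r, m in zip(reference, measured)]
    return math.sqrt(sum(e * e for e in errors) / len(errors))


def mae(reference, measured):
    """Mean absolute error between two equally long sequences."""
    errors = [abs(r - m) for r, m in zip(reference, measured)]
    return sum(errors) / len(errors)


def relative_reduction(baseline, proposed):
    """Percentage by which `proposed` is lower than `baseline`."""
    return 100.0 * (baseline - proposed) / baseline


# Hypothetical reference and measured cylinder positions (mm), illustration only.
reference = [0.0, 10.0, 20.0, 30.0, 40.0]
measured = [0.0, 9.0, 22.0, 29.0, 42.0]

print("RMSE:", rmse(reference, measured))
print("MAE:", mae(reference, measured))
print("Reduction from 10 to 9:", relative_reduction(10.0, 9.0), "%")
```

RMSE penalizes large deviations more strongly than MAE, which is why the two metrics are usually reported together.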

By analyzing the recorded torque and speed data, the efficiency for each operating point is estimated using the engine efficiency map shown in Figure 14. This estimation is then employed to calculate the energy consumption of the engine.
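The procedure described above can be sketched as follows, assuming uniformly sampled speed and torque signals, positive shaft power, and a nearest-grid-point lookup in the efficiency map. The grid values, the function names (`lookup_efficiency`, `consumed_energy_j`), and the sample data are placeholders introduced for illustration; they are not read from Figure 14, and the authors' actual interpolation method is not stated.

```python
import math

# Hypothetical coarse efficiency map (placeholder values, not from Figure 14).
SPEED_GRID_RPM = [150.0, 300.0]
TORQUE_GRID_NM = [10.0, 20.0]
EFFICIENCY = [
    [0.22, 0.28],  # efficiencies at 150 rpm for 10 and 20 N·m
    [0.25, 0.32],  # efficiencies at 300 rpm for 10 and 20 N·m
]


def lookup_efficiency(speed_rpm, torque_nm):
    """Return the efficiency stored at the nearest grid point."""
    i = min(range(len(SPEED_GRID_RPM)),
            key=lambda k: abs(SPEED_GRID_RPM[k] - speed_rpm))
    j = min(range(len(TORQUE_GRID_NM)),
            key=lambda k: abs(TORQUE_GRID_NM[k] - torque_nm))
    return EFFICIENCY[i][j]


def consumed_energy_j(speed_rpm, torque_nm, dt_s):
    """Sum shaft power divided by efficiency over all samples (joules)."""
    total = 0.0
    for n, t in zip(speed_rpm, torque_nm):
        omega = 2.0 * math.pi * n / 60.0   # shaft speed in rad/s
        shaft_power = t * omega            # mechanical power in W
        total += shaft_power / lookup_efficiency(n, t) * dt_s
    return total


# Hypothetical measurements sampled once per second.
speeds = [150.0, 300.0]
torques = [10.0, 20.0]
print("Energy (J):", consumed_energy_j(speeds, torques, dt_s=1.0))
```

A finer map with bilinear interpolation would give smoother estimates; the nearest-point lookup is used here only to keep the sketch short.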

Owing to the enhanced performance of the proposed controller, the ICE undergoes a rapid acceleration between 10.5 and 11 s, as illustrated in Figure 15a. During this phase, the ICE attains a peak speed of 282 rpm, which, in turn, facilitates a more prompt cylinder response and a swifter decrease in displacement error.

Figure 15c depicts the energy consumption comparison across four working modes, highlighting the superior efficiency of the BWO-LSTM-RF controller. This approach yields reduced energy usage, securing savings of 12.46% relative to the PID controller, 8.98% relative to the LSTM-RF controller, and 3.97% relative to the PSO-LSTM-GRU-PID controller.

Moreover, the enhanced efficiency is further corroborated by the comparative data summarized in Table 7.
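The reported savings appear consistent with a relative difference in consumed energy. As a worked check under that assumption, with $E_{\mathrm{base}}$ denoting the energy of the baseline controller and $E_{\mathrm{BWO}}$ that of BWO-LSTM-RF (symbols introduced here for illustration), the Table 7 values for PSO-LSTM-GRU-PID give:

$$
\text{saving} = \frac{E_{\mathrm{base}} - E_{\mathrm{BWO}}}{E_{\mathrm{base}}} \times 100\% = \frac{94.122 - 90.384}{94.122} \times 100\% \approx 3.97\%
$$

The same expression applied to the LSTM-RF and PID rows yields values close to the reported 8.98% and 12.46%, up to rounding.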

To validate the energy savings reported in Table 7, we conducted statistical tests using time-series energy consumption data extracted from Figure 15c, providing 70 paired samples per controller (assuming a 1 s sampling interval over 69.49 s). The Shapiro–Wilk test indicated non-normal distributions for all differences (p < 0.05), so we performed Wilcoxon signed-rank tests. All comparisons showed statistically significant energy savings, with p-values far below the 0.05 threshold ($p < 10^{-4}$), meaning that differences this large would be very unlikely (probability below 0.01%) if the controllers did not actually differ in energy consumption:

BWO-LSTM-RF vs. PSO-LSTM-GRU-PID: 3.97% savings ( $p < 10^{- 4}$ ).
BWO-LSTM-RF vs. LSTM-RF: 8.98% savings ( $p < 10^{- 4}$ ).
BWO-LSTM-RF vs. PID: 12.46% savings ( $p < 10^{- 4}$ ).
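The paired comparison above can be sketched with a small, dependency-free implementation of the Wilcoxon signed-rank test. This is a simplified version that uses the normal approximation without tie or continuity corrections, and the 70 paired samples are synthetic. It covers only the Wilcoxon step, not the preceding Shapiro–Wilk check. In practice a statistics library (for example SciPy's `scipy.stats.shapiro` and `scipy.stats.wilcoxon`) would normally be used; whether the authors did so is not stated.

```python
import math
import random


def wilcoxon_signed_rank(x, y):
    """Two-sided Wilcoxon signed-rank test (normal approximation).

    Zero differences are dropped and tied absolute differences receive
    average ranks. Returns (W_plus, p_value).
    """
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")

    # Rank the absolute differences, averaging ranks within ties.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1

    # Sum of ranks belonging to positive differences.
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    mean = n * (n + 1) / 4.0
    std = math.sqrt(n * (n + 1) * (2 * n + 1) / 24.0)
    z = (w_plus - mean) / std
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided p-value
    return w_plus, p_value


# Synthetic paired samples (70 per controller), for illustration only.
random.seed(0)
baseline = [1.50 + random.uniform(-0.05, 0.05) for _ in range(70)]
proposed = [b - random.uniform(0.01, 0.20) for b in baseline]

w_plus, p = wilcoxon_signed_rank(baseline, proposed)
print("W+ =", w_plus, "p =", p)
```

Because every synthetic difference is positive, the statistic takes its maximum value and the p-value is far below the 0.05 threshold, mirroring the kind of result reported above.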

A comparative evaluation of algorithm runtimes, as presented in Table 8, indicates that the PSO-LSTM-GRU-PID controller demands the longest training duration at 2005.22 s, followed by the BWO-LSTM-RF controller at 1702.15 s and the LSTM-RF controller at 1201.87 s. The inclusion of PSO substantially extends the training phase compared to the more streamlined LSTM-RF, whereas the PID controller, due to its straightforward design, requires no training. All algorithms exhibit an identical testing time of 69.49 s, demonstrating uniform testing performance despite their diverse training requirements. In a MATLAB environment, these training durations are anticipated to closely align with those from Python-based simulations, owing to the similarities in their core processing frameworks and model structures.

The results indicate that the BWO-LSTM-RF controller effectively mitigates challenges such as noise disturbances, throttling-induced pressure losses, and fluid leakage while simultaneously achieving high positional accuracy and efficient power management throughout the system. Experimental evaluations further confirm its robust performance, particularly in excavator boom operations. Moreover, as detailed in Table 9, the small discrepancies in energy consumption between simulation and experimental trials (4.91% to 6.11%) underscore the controller's consistency and reliability in practical applications.

The experimental results presented above demonstrate the superior performance of the proposed BWO-LSTM-RF controller in terms of trajectory accuracy, energy efficiency, and operational consistency across various working conditions. To further contextualize these findings within the broader landscape of hydraulic excavator control, a comparative analysis with recent studies is essential. Table 10 provides a comprehensive comparison of the proposed controller against state-of-the-art methods reported in the literature, focusing on both trajectory control accuracy (measured by RMSE) and energy efficiency (measured by energy saving percentage). This comparison highlights the advancements achieved by BWO-LSTM-RF over existing approaches, reinforcing its potential for practical applications in hydraulic excavator systems.

5. Economic Analysis of Proposed Algorithm

The BWO-LSTM-RF framework offers considerable potential to reshape the design and operation of future hydraulic excavators, while delivering notable economic and environmental benefits. In terms of design, the framework's ability to achieve positional accuracy below 3 mm and energy savings of up to 12.46% compared to traditional PID controllers could reduce the reliance on oversized hydraulic components, which are often used to compensate for control inaccuracies. This advancement may enable manufacturers to develop lighter, more compact excavators, lowering material costs and enhancing mobility without compromising performance. Operationally, the improved control precision and responsiveness, demonstrated in both simulation and experimental results (e.g., the RMSE reduced by 9.41% relative to PSO-LSTM-GRU-PID, by 34.88% relative to LSTM-RF, and by 61.34% relative to PID, as shown in Figure 13), could enhance productivity. These improvements minimize downtime and optimize trajectory tracking in dynamic construction environments.

Economically, the energy efficiency gains translate into substantial cost savings for operators. For instance, a 12.46% reduction in energy consumption, as evidenced in Table 11, could save approximately USD 3332 annually per 48 t excavator in fuel costs, particularly in large-scale construction projects where fleets of machines operate for extended periods. This estimate assumes that an excavator operates 10 h per day for 250 days per year, with a load scaled from 800 kg in the experiment to 48 t in real conditions, and uses diesel priced at USD 1.20 per liter as of March 2025. Environmentally, these energy savings contribute to sustainability efforts by reducing greenhouse gas emissions. With the same operational assumptions, the 12.46% decrease could reduce CO₂ emissions by approximately 7.3 tons per machine annually, based on a carbon dioxide emission factor of 2.63 kg per liter of diesel [21].
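The per-machine estimate can be reproduced approximately from the annual fuel volumes in Table 11 together with the diesel price and emission factor stated above. The sketch below is a simple arithmetic check; the function and variable names are hypothetical.

```python
DIESEL_PRICE_USD_PER_L = 1.20  # diesel price stated in the text
CO2_KG_PER_L = 2.63            # emission factor stated in the text

# Annual fuel use of the 48 t excavator, taken from Table 11 (litres).
FUEL_PID_L = 22285
FUEL_BWO_L = 19509


def annual_savings(fuel_baseline_l, fuel_proposed_l):
    """Return yearly fuel, cost, and CO2 savings for one machine."""
    saved_l = fuel_baseline_l - fuel_proposed_l
    return {
        "fuel_l": saved_l,
        "cost_usd": saved_l * DIESEL_PRICE_USD_PER_L,
        "co2_t": saved_l * CO2_KG_PER_L / 1000.0,  # kg converted to tonnes
    }


savings = annual_savings(FUEL_PID_L, FUEL_BWO_L)
print(savings)
```

With the rounded table values this gives a saving of 2776 L, about USD 3331, and about 7.3 t of CO₂ per year, in line with the figures quoted above; the small gap to USD 3332 presumably comes from rounding in the table.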

Given the global scale of the construction industry, the proposed framework has significant potential for widespread impact. Based on annual sales data and market trends, the global fleet of hydraulic excavators is estimated to be approximately 6.5 million units in 2025, despite a projected 8% decline in global construction equipment sales to 1.08 million units in that year [22]. With a forecasted recovery in sales to 1.2 million units by 2028, the global fleet is projected to grow to around 6.8 million units by that year [22]. If the BWO-LSTM-RF framework were adopted across this global fleet, the cumulative fuel savings and CO₂ reductions could be substantial. For example, applying the per-machine savings to the estimated 6.5 million units in 2025 could result in annual fuel cost savings of approximately USD 21.66 billion and a CO₂ emission reduction of around 47.47 million tons, potentially increasing to USD 22.65 billion and 49.66 million tons by 2028. Furthermore, the control precision and energy efficiency improvements offered by the BWO-LSTM-RF framework could be adapted to other construction machines, such as forklifts, further amplifying its potential to reduce fuel consumption and emissions on a global scale.
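As a rough check of the fleet-level figures, assuming every machine in the fleet achieves the same saving as the 48 t case considered above:

$$
6.5 \times 10^{6} \times \text{USD } 3332 \approx \text{USD } 21.66 \times 10^{9}, \qquad 6.5 \times 10^{6} \times 7.3\ \text{t} \approx 47.5 \times 10^{6}\ \text{t}
$$

The slightly different CO₂ value of 47.47 million tons quoted above presumably reflects an unrounded per-machine saving.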

6. Conclusions

This study introduces the BWO-LSTM-RF control framework to enhance positional accuracy and energy efficiency in boom excavator operations. The key findings are summarized as follows:

The BWO-LSTM-RF controller delivers exceptional accuracy, robustness, and responsiveness across diverse operating conditions, effectively handling disturbances such as leakage, pressure drops, and noise, as validated by both simulations and experiments.
By leveraging the BWO algorithm, the framework optimizes four critical LSTM-RF hyperparameters, ensuring adaptable and robust operation under varying external conditions.
The proposed framework outperforms benchmark controllers in tracking accuracy and energy savings, achieving significant reductions in energy consumption compared to PID, LSTM-RF, and PSO-LSTM-GRU-PID.

Future directions: future efforts will focus on reducing the delay in cylinder actuation to further improve tracking precision in HEs. Additionally, upcoming studies will investigate techniques to recover and reuse potential energy from weight displacement, thereby enhancing overall efficiency. To further validate the universal applicability of the proposed BWO-LSTM-RF framework, we plan to apply and test it under real-world operational conditions, encompassing all phases of excavator tasks and multi-actuator coordination.

Author Contributions

V.H.N. conducted the complete research and methodological design, developed and verified the model and control algorithm through AMESim simulation and Python, organized the experimental setup, and drafted the initial manuscript while also assisting with the setup and reviewing the manuscript. K.K.A. provided overall project leadership and financial support, offering critical guidance and oversight throughout the study as well as contributing to the review and refinement of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by “Regional Innovation Strategy (RIS)” through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (MOE), South Korea (2021RIS-003).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

HEs	Hydraulic excavators
HHEs	Hybrid hydraulic excavators
EERSs	Electric energy regeneration systems
HERSs	Hydraulic energy regeneration systems
ICE	Internal combustion engine
EMG	Electric motor/generator
BAT	Battery
HST	Hydrostatic transmission
HPM	Hydraulic pump/motor
HM	Hydraulic motor
EHCVP	Electrical hydraulic continually variable powertrain
EMS	Energy management strategy
ECMS	Equivalent consumption minimization strategy
A-ECMS	Adaptive equivalent consumption minimization strategy
SOC	State of charge
BC	Cylinder
VB	Control valve of cylinder
RV	Regeneration valve of cylinder
CV	Check valve
CE	Clutch of engine
CM	Clutch of electric motor/generator
CP (DbC)	Clutch of HPM (double clutch)
CH (DbC)	Clutch of HM (double clutch)
Ring	Ring gear
Sun	Sun gear
ST	Sensor torque
SS	Sensor speed
SD	Sensor displacement
SV	Sensor velocity
PSO	Particle swarm optimization
LSTM	Long short-term memory
GRU	Gated recurrent unit
PID	Proportional–integral–derivative
BWO	Beluga whale optimization
RF	Random forest

References

1. Ding, R.; Zhang, J.; Xu, B.; Cheng, M.; Pan, M. Energy efficiency improvement of heavy-load mobile hydraulic manipulator with electronically tunable operating modes. Energy Convers. Manag. 2019, 188, 447–461.
2. Chen, M.; Zhao, D. The gravitational potential energy regeneration system with closed-circuit of boom of hydraulic excavator. Mech. Syst. Signal Process. 2017, 82, 178–192.
3. Do, T.C.; Dinh, T.Q.; Yu, Y.; Ahn, K.K. Innovative powertrain and advanced energy management strategy for hybrid hydraulic excavators. Energy 2023, 282, 128951.
4. Nguyen, V.H.; Do, T.C.; Ahn, K.K. Hybrid powertrain with dual energy regeneration for boom cylinder movement in a hydraulic excavator. Autom. Constr. 2025, 171, 105974.
5. Ben Hazem, Z.; Guler, N.; Altaif, A.H. A study of advanced mathematical modeling and adaptive control strategies for trajectory tracking in the Mitsubishi RV-2AJ 5-DOF Robotic Arm. Discov. Robot. 2025, 1, 2.
6. Song, H.; Li, G.; Li, Z.; Xiong, X. Trajectory Control Strategy and System Modeling of Load-Sensitive Hydraulic Excavator. Machines 2023, 11, 10.
7. Nguyen, V.H.; Do, T.C.; Ahn, K.K. Implementing PSO-LSTM-GRU Hybrid Neural Networks for Enhanced Control and Energy Efficiency of Excavator Cylinder Displacement. Mathematics 2024, 12, 3185.
8. Xuanyu, L.; Mengting, J.; Wenshuai, Z.; Yudong, W. Intelligent real-time prediction for shield machine position on the basis of BWO-LSTM-GRU. Eng. Res. Express 2024, 6, 015105.
9. Hu, B.; Li, J. A Deployment-Efficient Energy Management Strategy for Connected Hybrid Electric Vehicle Based on Offline Reinforcement Learning. IEEE Trans. Ind. Electron. 2022, 69, 9644–9654.
10. Wei, C.; Chen, Y.; Sun, X.; Zhang, Y. Optimal Equivalent Consumption Minimization Strategy for Plug-In Hybrid Electric Vehicle with Improved Genetic Algorithm. SAE Int. J. Electrified Veh. 2020, 9, 143–154.
11. Zhou, D.; Al-Durra, A.; Matraji, I.; Ravey, A.; Gao, F. Online Energy Management Strategy of Fuel Cell Hybrid Electric Vehicles: A Fractional-Order Extremum Seeking Method. IEEE Trans. Ind. Electron. 2018, 65, 6787–6799.
12. Yang, C.; Du, X.; Wang, W.; Yuan, L.; Yang, L. Variable optimization domain-based cooperative energy management strategy for connected plug-in hybrid electric vehicles. Energy 2024, 290, 130206.
13. Zhong, C.; Li, G.; Meng, Z. Beluga whale optimization: A novel nature-inspired metaheuristic algorithm. Knowl.-Based Syst. 2022, 251, 109215.
14. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780.
15. Ibrahim, M.S.; Abbas, W.; Waseem, M.; Lu, C.; Lee, H.H.; Fan, J.; Loo, K.H. Long-Term Lifetime Prediction of Power MOSFET Devices Based on LSTM and GRU Algorithms. Mathematics 2023, 11, 3283.
16. Kim, G.I.; Jang, B. Petroleum Price Prediction with CNN-LSTM and CNN-GRU Using Skip-Connection. Mathematics 2023, 11, 547.
17. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32.
18. Wang, X.; Zhai, M.; Ren, Z.; Ren, H.; Li, M.; Quan, D.; Chen, L.; Qiu, L. Exploratory study on classification of diabetes mellitus through a combined Random Forest Classifier. BMC Med. Inform. Decis. Mak. 2021, 21, 105.
19. Hechifa, A.; Lakehal, A.; Nanfak, A.; Saidi, L.; Labiod, C.; Kelaiaia, R.; Ghoneim, S.S.M. Improved intelligent methods for power transformer fault diagnosis based on tree ensemble learning and multiple feature vector analysis. Electr. Eng. 2024, 106, 2575–2594.
20. Nguyen, T.H.; Do, T.C.; Phan, V.D.; Ahn, K.K. Working Performance Improvement of a Novel Independent Metering Valve System by Using a Neural Network-Fractional Order-Proportional-Integral-Derivative Controller. Mathematics 2023, 11, 4819.
21. Yu, Y.; Do, T.C.; Park, Y.; Ahn, K.K. Energy saving of hybrid hydraulic excavator with innovative powertrain. Energy Convers. Manag. 2021, 244, 114447.
22. Sleight, C. Global Construction Equipment Market Expected. 2024. Available online: https://offhighwayresearch.com/491.article (accessed on 1 March 2025).

Figure 1. Schematic overview of the system’s architecture.

Figure 2. Schematic overview of the proposed position forecasting system.

Figure 3. Schematic depiction of the LSTM cell architecture.

Figure 4. Classification structure of the RF algorithm.

Figure 5. Schematic representation of the simulation environment developed with AMESim.

Figure 6. Simulation-based assessment of the boom cylinder’s position tracking performance. (a) Displacement of the boom cylinder (b) Displacement error.

Figure 7. Comparison of RMSE and MAE metrics across different simulation operating modes. (a) RMSE (b) MAE.

Figure 8. Performance assessment of the ICE: subfigure (a) shows speed, (b) displays torque, and (c) illustrates energy consumption.

Figure 9. Experimental test bench, comprising (1) PC, (2) electrical box, (3) clutch carrier, (4) HPM, (5) load, (6) hydraulic system, (7) HM, (8) double clutch, (9) HPM, (10) ICE, (11) EMG, (12) clutch electric motor/generator, and (13) clutch engine.

Figure 10. Efficiency contour illustrating the operational performance of the engine.

Figure 11. MATLAB/Simulink representation of the experimental system configuration.

Figure 12. Experimental assessment of the system’s tracking performance, highlighting displacement accuracy and error dynamics. (a) Displacement of the boom cylinder (b) Displacement error.

Figure 13. Experimental comparison of RMSE and MAE metrics across various operating modes. (a) RMSE (b) MAE.

Figure 14. Efficiency map delineating the operational points of the ICE in the system.

Figure 15. Performance assessment of the ICE: subfigure (a) shows speed, (b) displays torque, and (c) illustrates energy consumption.

Table 1. Technical specifications for key system elements [4].

Element	Characteristic	Measured Value	Unit
Boom cylinder	Piston diameter	50	mm
	Rod diameter	28	mm
	Stroke	0.75	m
HPM	Displacement	30	cc/rev
	Pressure	250	bar
HM	Displacement	10	cc/rev
	Pressure	120	bar
HST	Displacement	33	cc/rev
	Pressure	270	bar
EMG	Rated power	5.5	kW
ICE	Rated power	7.5	kW
Battery	Voltage range per cell	2.5–4.2	V
	Cells in parallel configuration	1	cell
	Cells in series arrangement	3	cell

Table 2. Tracking performance metrics and improvements in controllers.

Controller	RMSE	MAE
BWO-LSTM-RF	4.04	3.16
PSO-LSTM-GRU-PID	4.42 (−8.59%)	3.41 (−7.33%)
LSTM-RF	6.41 (−36.97%)	4.98 (−36.54%)
PID	10.28 (−60.7%)	8.04 (−60.69%)

Table 3. Simulation controller performance metrics.

Algorithm	Total Time	Energy Consumption	Energy Saving	p-Value
BWO-LSTM-RF	69.49 s	85.942 kJ	-	-
PSO-LSTM-GRU-PID	69.49 s	89.243 kJ	3.69%	$p < 10^{- 4}$
LSTM-RF	69.49 s	93.236 kJ	7.83%	$p < 10^{- 4}$
PID	69.49 s	96.939 kJ	11.33%	$p < 10^{- 4}$

Table 4. Comparison of algorithm running times.

Algorithm	Real Time	Training Time	Unit
BWO-LSTM-RF	69.49	1680.93	second
PSO-LSTM-GRU-PID	69.49	1977.65	second
LSTM-RF	69.49	1185.45	second
PID	69.49	-	second

Table 5. Summary of BWO-LSTM-RF model configuration and hyperparameter optimization parameters.

BWO-LSTM-RF Model Overview
Model Architecture	Stacked LSTM and random forest with BWO optimization
Sequence Length	Variable
Activation Functions	ReLU for LSTM layers, linear for output dense layer
Loss Function	Mean squared error (MSE)
Optimizer	Adam (with learning rates: 0.1, 0.01, 0.001)
Normalization	StandardScaler for input features
Training Data Noise	Not explicitly defined
Data Split Ratio	80% training, 20% testing
Dropout Rate	Not explicitly defined in BWO-LSTM-RF
Evaluation Metric	MSE on validation/test set, R² score
BWO Hyperparameter Optimization
Optimization Technique	Beluga whale optimization (BWO)
Parameters Optimized	LSTM units
Population Size	50 beluga whales
Iterations	100
Best Solution	Optimal parameters minimizing MSE

Table 6. Tracking performance metrics and improvements in controllers.

Controller	RMSE	MAE
BWO-LSTM-RF	3.28	2.15
PSO-LSTM-GRU-PID	3.51 (−9.41%)	3.28 (−6.55%)
LSTM-RF	4.94 (−34.88%)	4.78 (−33.66%)
PID	8.38 (−61.34%)	5.48 (−60.86%)

Table 7. Experimental controller performance metrics.

Algorithm	Total Time	Energy Consumption	Energy Saving	p-Value
BWO-LSTM-RF	69.49 s	90.384 kJ	-	-
PSO-LSTM-GRU-PID	69.49 s	94.122 kJ	3.97%	$p < 10^{- 4}$
LSTM-RF	69.49 s	99.310 kJ	8.98%	$p < 10^{- 4}$
PID	69.49 s	103.240 kJ	12.46%	$p < 10^{- 4}$

Table 8. Comparison of algorithm running times.

Algorithm	Real Time	Training Time	Unit
BWO-LSTM-RF	69.49	1702.15	second
PSO-LSTM-GRU-PID	69.49	2005.22	second
LSTM-RF	69.49	1201.87	second
PID	69.49	-	second

Table 9. Comparison of energy consumption between simulation and experimental trials.

Algorithm	Simulation	Experimental	Percent Difference
BWO-LSTM-RF	85.942 kJ	90.384 kJ	4.91%
PSO-LSTM-GRU-PID	89.243 kJ	94.122 kJ	5.18%
LSTM-RF	93.236 kJ	99.310 kJ	6.11%
PID	96.939 kJ	103.240 kJ	6.10%

Table 10. Performance comparison of control algorithms and systems for hydraulic excavator trajectory and energy saving.

Ref	Algorithm	System	RMSE	Energy Saving (%)
[6] (2023)	HAQPSO-PID	LS	8.1798 (boom)	N/A
	QPSO-PID	LS	8.3153 (boom)	N/A
	PSO-PID	LS	10.0977 (boom)	N/A
	ZN-PID	LS	12.9459 (boom)	N/A
[7] (2024)	PSO-LSTM-GRU-PID	AEHCVP	4.78	10.89
	LSTM-GRU-PID	AEHCVP	6.7	8.3
	PID	AEHCVP	9.27	0
[20] (2023)	NNFOPID	NIMV	15.95	29.25
	FOPID	NIMV	22.16	7.71
	PID	NIMV	23.85	0
This study	Proposed	AEHCVP	3.28	12.46
	PSO-LSTM-GRU-PID	AEHCVP	3.51	8.83
	LSTM-RF	AEHCVP	4.94	3.81
	PID	AEHCVP	8.38	0

Table 11. Comparative performance metrics of proposed controllers for a 48 t excavator.

Test bench
Algorithm	Time	Energy Consumption	Saving
BWO-LSTM-RF	69.49 s	90.384 kJ	12.46%
PSO-LSTM-GRU-PID	69.49 s	94.122 kJ	8.83%
LSTM-RF	69.49 s	99.310 kJ	3.81%
PID	69.49 s	103.240 kJ	-
Algorithm	Time	Fuel Consumption	Saving
BWO-LSTM-RF	1 h	0.1300 L	12.46%
PSO-LSTM-GRU-PID	1 h	0.1354 L	8.83%
LSTM-RF	1 h	0.1429 L	3.81%
PID	1 h	0.1485 L	-
48 t excavator
Algorithm	Time	Fuel Consumption	Saving
BWO-LSTM-RF	1 year	19,509 L	12.46%
PSO-LSTM-GRU-PID	1 year	20,317 L	8.83%
LSTM-RF	1 year	21,437 L	3.81%
PID	1 year	22,285 L	-
Algorithm	Time	Fuel Cost	Saving
BWO-LSTM-RF	1 year	USD 23,411	12.46%
PSO-LSTM-GRU-PID	1 year	USD 24,380	8.83%
LSTM-RF	1 year	USD 25,724	3.81%
PID	1 year	USD 28,093	-
Algorithm	Time	CO₂ Emissions	Saving
BWO-LSTM-RF	1 year	51,309 kg	12.46%
PSO-LSTM-GRU-PID	1 year	53,434 kg	8.83%
LSTM-RF	1 year	56,379 kg	3.81%
PID	1 year	58,609 kg	-

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
