Analysis of Instantaneous Energy Consumption and Recuperation in Electric Buses During SORT Tests Using Linear and Neural Network Models

Edward Kozłowski; Magdalena Zimakowska-Laskowska; Piotr Wiśniowski; Boris Šnauko; Piotr Laskowski; Jan Laskowski; Jonas Matijošius; Andrzej Świderski; Adam Torok

doi:10.3390/en18195107

,

and

¹

Faculty of Management, Lublin University of Technology, 38D Nadbystrzycka Str., 20-618 Lublin, Poland

²

Motor Transport Institute, 80 Jagiellońska Str., 03-301 Warsaw, Poland

³

Center of European Projects, 9 Halicka Str., 31-036 Kraków, Poland

⁴

Faculty of Automotive and Construction Machinery Engineering, Warsaw University of Technology, 84 Narbutta Str., 02-524 Warsaw, Poland

Energies2025, 18(19), 5107;https://doi.org/10.3390/en18195107

This article belongs to the Special Issue Challenges and Future Trends of Energy Management Systems for Electric Vehicles

Version Notes

Order Reprints

Abstract

With the growing deployment of electric buses (e-buses), accurate energy use modelling has become essential for fleet optimisation and operational planning. Using the SORT methodology, this study analyses instantaneous energy consumption and recuperation (IECR). Three vehicle configurations were tested (one battery with pantograph, four batteries, and eight batteries), each with ten repeatable runs. Four approaches were compared: a baseline linear regression, an extended linear model (ELM) due to the state, a feed-forward neural network, and a recurrent neural network (RNN). The extended linear model achieved a determination coefficient of R² = 0.9124 (residual standard deviation 4.26) compared with R² = 0.7859 for the baseline, while the determination coefficient for the RNN is 0.9343, and the RNN provided the highest accuracy on the test set (the correlation coefficient between real and predicted values is 0.9666). The results confirm the dominant influence of speed and acceleration on IECR and show that battery configuration mainly affects consumption during acceleration. Literature-consistent findings indicate that regenerative systems can recover 25–51% of braking energy, with advanced control methods further improving recovery. Despite non-normality and temporal dependence of residuals, the state-aware linear model remains interpretable and competitive, whereas recurrent networks offer superior fidelity. These results support real-time energy management, charging optimisation, and reliable range prediction for electric buses in urban public transport.

Keywords:

electric buses; energy consumption; recuperation; predictive modelling; recurrent neural network

1. Introduction

With the rising prevalence of electric buses, modelling their energy efficiency emerges as a critical challenge, supporting effective operational planning and fleet optimisation. Energy consumption in these vehicles is influenced by multiple aspects, including driving dynamics, operating conditions, terrain topography, and vehicle technical parameters. Accurate modelling allows improving energy efficiency, optimising charging strategies, and extending battery life. Analysing energy consumption in electric buses can support the optimisation of energy management strategies and improve the efficiency of transportation systems. Using statistical methods, machine learning algorithms, and physical modelling enables precise energy consumption forecasting and the adjustment of operational strategies [1]. In urban conditions, eco-driving practices have been shown to reduce vehicle energy use, supporting operational goals in public transport [2]. Additionally, vehicle configuration has been shown to shape energy demand; a comparative study of short versus long city buses reported systematic differences in energy consumption with implications for e-mobility planning [3].

For public transport applications, the Standardised On-Road Test (SORT) methodology serves as an important tool to assess vehicle energy efficiency in quasi-real operating environments. It involves three driving cycles, SORT 1 (intensive urban), SORT 2 (moderate urban), and SORT 3 (suburban), each characterised by differences in speed profiles, stop patterns, and passenger loads [4,5]. Unlike previous test cycles (e.g., ESC, ETC), the SORT methodology evaluates the entire vehicle, not just the engine, allowing for accurate energy consumption comparisons and the effectiveness of recuperation systems. The results of these tests are widely used in tenders for rolling stock purchases and operational optimisation processes. Within such evaluations, broader sustainability indicators for surface public transport should also be considered [6].

Predictive models have assumed greater importance for public transport in recent years, contributing to real-time optimisation of energy efficiency. Models based on predictive control (MPC) can adapt energy management strategies to current road conditions, battery status, and predicted driving speeds, reducing energy consumption by up to several per cent compared to traditional methods [7,8,9,10,11]. Modern modelling approaches increasingly employ machine and deep learning techniques, including long short-term memory (LSTM) and bidirectional LSTM (BiLSTM), which enhance predictive accuracy and allow adaptation to dynamic operating conditions. Predictive models can also account for component ageing, translating into extended battery life and reduced fleet maintenance costs [12].

A key element in improving the efficiency of electric buses is the strategy of energy recuperation during braking. Supercapacitors, flywheels, or flow batteries offer high cycle life and fast energy exchange, making them promising for energy storage in SORT-based systems. These findings align with emerging research on dynamic energy transfer at intersections, which leverages regenerative braking to optimise system-wide energy recovery [13]. Modern energy recovery systems can recover from 25 to as much as 51% of braking energy. The use of advanced control methods, such as threshold logic, fuzzy control, or genetic algorithms, can increase this recovery by another dozen or so per cent compared to basic strategies [14,15]. Optimising energy recuperation translates into reduced overall energy consumption, extended vehicle range, and better battery protection, especially when integrated with hybrid energy storage systems.

Despite progress, the literature indicates significant research gaps in analysing instantaneous energy consumption in electric buses during SORT tests. There is a lack of comprehensive studies combining instantaneous energy consumption profiles with linear and non-linear models in real-world operating scenarios. Previous work shows that non-linear models, such as neural networks, better represent complex relationships than linear models, but they require the inclusion of additional variables, time delays, and environmental factors to improve forecast accuracy [4,16].

This paper analyses electric buses’ instantaneous energy consumption and recuperation (IECR) based on SORT tests. The study included vehicles equipped with one battery with a pantograph, four batteries, and eight batteries, each of which underwent ten test runs. The study applied linear regression together with artificial neural networks to capture how driving parameters influence energy consumption. The obtained results enabled the assessment of the effect of speed and acceleration on instantaneous energy consumption and recuperation, as well as the analysis of the impact of battery configuration on vehicle energy efficiency. These results can provide a basis for further optimising public transport systems through advanced energy management strategies and precise range forecasting of electric buses in urban environments.

2. Literature Review

Modelling energy consumption in electric bus drive systems is key to optimising public transport. Abdelaty et al. [17] and Abdelaty & Mohamed [18] demonstrated the high effectiveness of machine learning models, such as neural networks, in predicting energy consumption. Li et al. [1] proposed a model combining physical and historical data, which allows for more accurate predictions. Energy consumption modelling in electric vehicles has been widely studied in the scientific literature. Kozłowski et al. [19] indicated that vehicle speed, acceleration, and the transformations of these characteristics are key factors determining energy consumption in electric vehicles. These studies demonstrated that dynamic speed changes can significantly affect energy efficiency, which is particularly important in urban conditions. Further research by Kozłowski et al. [4] focused on analysing instantaneous energy consumption and the recuperation (IECR) process, drawing on SORT test measurements. In parallel to consumption modelling, EV routing studies integrate battery range and charger topology; GA-based methods are a common and effective option [20]. These results confirm the significance of efficient energy control strategies, especially in optimising energy recovery systems.

Dabčević et al. [21] developed a model to analyse bus trip data, considering average speed and distance travelled. Vepsäläinen et al. [22] highlighted the computational efficiency of predictive methods, which is crucial for applications in real-world operating conditions. Kozłowski et al. [19] analysed the availability of technical systems using a semi-Markov model, which allows for considering random periods of idleness and operation of electric vehicles. Applying this model in public transport analysis can be crucial for predicting the reliability of electric buses and their effective management. At the system level, initiatives such as Bus Rapid Transit (BRT) have proven effective in improving the efficiency and sustainability of bus operations, thereby facilitating the integration of predictive modelling with urban mobility planning [23]. In addition, Kozłowski et al. [4] investigated instantaneous energy use and the recuperation process using data obtained from SORT trials. These results confirm the importance of effective energy management, especially in optimising energy recovery systems. In the context of predictive methods, Kłosowski et al. [24] applied LSTM neural networks to monitor complex industrial systems, demonstrating the potential application of this technology in modelling energy consumption in electric buses. Furthermore, Pawlik et al. [25] demonstrated the effectiveness of artificial neural networks in diagnosing machines operating under variable conditions, suggesting the possibility of using similar techniques in the operational analysis of electric vehicles.

An important issue in energy consumption analysis is obtaining measurement data. Electric vehicles use, among others, on-board diagnosis (OBD) systems, the control area network (CAN) bus, on-board sensors, and external data loggers. OBD-II enables real-time acquisition of parameters such as speed, battery current and voltage, and state of charge (SOC), which allows for conducting research in real-world conditions [26,27,28,29]. Access to the CAN bus enables the recording of detailed data from the drivetrain and auxiliary systems, characterised by a higher frequency of measurements [30]. On-board sensors allow for direct measurement of current, voltage, speed, acceleration, and environmental conditions [29,31]. Data loggers enable long-term recording of parameters from multiple sources, integrating data from GPS and environmental sensors, which is particularly useful in predictive models [28,30,31].

Energy consumption models can be regression-based or based on neural networks. Linear regression models are interpretable and straightforward but often fail to model non-linear dependencies [32,33,34]. Neural networks are good at modelling non-linearities [35,36], while recurrent architectures (RNN, LSTM, GRU) enable the representation of temporal dependencies and achieve higher forecast accuracy [37,38,39].

Speed, acceleration, vehicle mass, and route profile primarily influence instantaneous energy consumption. Higher speeds and rapid accelerations increase energy consumption due to increased aerodynamic drag and power requirements (Suarez et al., 2022; Skuza et al., 2024 [40,41]). Higher mass—due to the number of passengers or battery configuration—increases consumption, especially on hills [42]. Route configuration, number of stops, and traffic density also play a significant role [43,44].

Recovering electrical energy during braking is one of the key mechanisms for improving efficiency. Chidambaram et al. [45] show that a high recuperation level at low charging currents benefits battery life. The braking strategy significantly impacts energy balance—aggressive methods allow for the recovery of more energy at the expense of faster cell degradation. In contrast, gentle methods are more beneficial for battery durability [46]. Additionally, advanced control strategies, such as fuzzy control or hybrid systems with supercapacitors, can increase energy recovery efficiency by up to 15% [47,48]. The number of batteries in an electric vehicle can affect its energy consumption, mainly due to increased weight.

The literature on braking energy recovery shows that the recovery strategy affects the efficiency of energy recovery. An aggressive strategy enables the recovery of a larger amount of electrical energy during braking, which translates into an increase in range. Although a gentle method produces less electrical energy, it may be more convenient for the vehicle driver and drivetrain [49]. Comparative assessment of regenerative braking performance in a fully electric car was performed. An aggressive method causes a higher current load on the battery, which may cause its faster degradation and shorten its lifespan. In turn, a gentle method is better for battery longevity. Driving style also affects the speed of battery regeneration and energy recovery efficiency from braking [50]. Another factor influencing energy consumption is the configuration of the battery system [42]. An increase in the number of batteries means an increase in weight. This relationship is directly proportional—the more batteries, the greater the weight. An increase in weight also has an impact on increasing rolling resistance. Optimising the weight and arrangement of batteries affects the vehicle’s energy efficiency and maximises range [49]. The number of batteries also increases the range. For most users, the optimal range is 350–480 km. Further increase in battery capacity becomes less effective [42,49].

Comparisons of laboratory test results (e.g., SORT) with data from real-world routes indicate significant differences resulting from the influence of traffic conditions, driver behaviour, and environmental factors [51]. Therefore, combining both data sources to obtain reliable models is recommended.

Energy efficiency assessment requires the use of statistical analysis methods and model validation methods [52,53]. It is important to use visual inspection of residual plots and error measures (RMSE, MAE, correlation coefficient) to assess model fit comprehensively.

Environmental factors also influence energy consumption. Low temperatures increase energy demand due to increased heating consumption and reduced battery efficiency, which can increase consumption by up to 35–50% in winter conditions [54,55]. High temperatures also increase consumption due to air conditioning [36]. Unfavourable weather conditions, such as wind, rain, or snow, further reduce efficiency [56,57].

3. Materials and Methods

Electric buses’ IECR were analysed during the SORT test. The study included three vehicle configurations as follows:

Electric bus with one battery and fitted with a pantograph;
Electric bus with four batteries;
Electric bus with eight batteries.

The tests were conducted on standard 12-metre low-floor city buses in three battery configurations: one pack with a pantograph (nominal capacity 72.4 kWh), four packs (nominal capacity 290 kWh), and eight packs (nominal capacity 579 kWh). All buses were loaded to their permissible gross vehicle weight (approx. 18,000 kg) to ensure comparable test conditions.

Ten test runs were conducted for each variant. The analysis sought to identify the effect of battery number on energy consumption and recuperation while driving and to assess the dependence of IECR on speed and driving dynamics (acceleration). Modelling was conducted using the following:

A linear model considering the contrast describing the system state (acceleration, braking, steady driving, stopping);
A complex linear model considering the states of the system;
Artificial neural network;
Recurrent neural network.

Linear models, despite the generally non-linear nature of the dependence of IECR on velocity and acceleration, linearly use variables and their transformations in estimating the effect on IECR.

3.1. Measuring Equipment

The following devices were used for the measurements:

Yokogawa WT-1806 power analyser (Musashino, Japan);
APPA 32 current clamp (Shin-Tien, Taiwan);
LAB-EL LB-702 thermohygrobarometer—measuring probe No. 1371 and reading panel No. 1073 (Łódź, Poland);
CFM Master anemometer model 8901; serial number 8809870 (Frankfurt, Germany);
CORSYS-DATRON measuring head type L-400 serial no. 41434 (Darmstadt, Germany);
DINI ARGEO non-automatic electronic scale type WWSD6T; serial number 01001852335 (Modena, Italy).

Speed was measured using a DATRON optical head, synchronised with energy consumption and recovery measurements. The measurement system displayed a speed profile to the driver by the SORT cycle requirements, ensuring high repeatability of runs.

Current clamps were connected at points in the vehicle’s high-voltage system (PowerBox, main HV connectors), allowing for the recording of current flow during acceleration, braking, constant speed driving, and standstill. These points are illustrated in the measurement system diagrams. Voltage measurements were performed at dedicated points in the electrical system, enabling battery performance monitoring during operation and charging.

3.2. Measurement Location and Conditions

The tests were carried out on Road No. 56 near Stary Jasiniec. The measurement section, 2100 m in length, featured a maximum gradient of 0.35% and was located at an elevation of 98 m above sea level, with a smooth road surface. During the trials, atmospheric conditions remained stable, with wind speeds along the route between 0.40 and 0.54 m/s, air temperature ranging from 5.2 to 4.7 °C, relative humidity of 60.4–65.3%, and pressure between 1005.1 and 1005.3 hPa.

3.3. SORT Test Procedure

The SORT developed by UITP includes three variants as follows:

SORT 1—intense city traffic (average speed approx. 12 km/h, 6–8 stops/km, stop time up to 35% of driving time);
SORT 2—moderate city traffic (approx. 18 km/h, 3–5 stops/km, standing still approx. 20% of the time);
SORT 3—suburban route (approx. 25 km/h, 1–2 stops/km, stops approx. 10% of the time).

The tests maintained the full load per its permissible gross vehicle weight. Multiple iterations were performed for each vehicle variant to obtain data with minimal deviation from the reference profile. The results were measured in kWh/km, recording energy consumption and recuperation during braking.

3.4. Data Analysis

The collected data were time-synchronised, and the dependence of IECR on vehicle speed, acceleration, and driving states was then modelled. The effect of battery number on energy consumption and recovery characteristics was considered. The models were assessed for fit and predictive ability.

4. Research Tools

This section presents the research tools applied in the study, including both statistical and computational methods. The aim was to evaluate the instantaneous energy consumption and recuperation under different operational conditions of the buses. The methods range from linear models, which provide a simplified view of the system, to advanced neural networks, which enable capturing complex non-linear dependencies. The following subsections describe the tools in detail.

4.1. Linear Model

Let

{(I E C R_{t}, b_{t}, v_{t}, a_{t})}_{1 \leq t \leq n}

denote a dataset, where

I C E_{t}

denotes the energy consumed during 1 s of measurement (Wh),

v_{t}

is the average vehicle speed (

k m / h

),

a_{t} i s

acceleration

(m / s^{2})

, and

b_{t} \in {1 b, 4 b, 8 b}

is the number of batteries installed in the bus. Since the variable describing the number of batteries is a qualitative variable, we choose 1 battery with a pantograph as the reference level. Speed and acceleration are the main factors influencing the amount of energy consumed and recovered, which is why four vehicle states were distinguished. Therefore, the vehicle’s journey is described by its stay in the states. We distinguish between four states:

A

—acceleration,

B

—braking,

D

—driving, and

S

—stop. Therefore, a function

ϕ (v, a) : R^{+} \times R \to {A, B, D, S}

of the form was constructed

(v, a) = \{\begin{array}{l} S, & v < v_{m i n}, \\ A, & v \geq v_{m i n} and a > a_{m i n}, \\ B, & v \geq v_{m i n} and a < - α_{m i n}, \\ D, & other . \end{array}

The value

v_{m i n}

denotes the minimum vehicle speed below which we assume that the bus is in state S, and

α_{m i n}

is the absolute value of acceleration. If the bus speed exceeds

v_{m i n}

and acceleration is below the level—

α_{m i n}

, we assume that the vehicle is in state B (braking). If the speed exceeds

v_{m i n}

and acceleration exceeds

α_{m i n}

, we assume that the vehicle is in state A (acceleration). When the speed exceeds

v_{m i n}

and the absolute value of acceleration is below

α_{m i n}

, then we accept that the vehicle is in state D (driving).

To check the influence of the number of batteries installed in the bus, speed, and acceleration on the IECR at each moment

t

,

1 \leq t \leq n

, we consider a linear model as follows:

\begin{array}{c} I E C R_{t} = θ_{0} + θ_{1} v_{t} + θ_{2} a_{t} + θ_{3} a_{t}^{2} + θ_{4} v_{t} a_{t} + θ_{5} I_{t}^{ϕ} (B) + θ_{6} I_{t}^{ϕ} (D) + \\ θ_{7} I_{t}^{ϕ} (S) + θ_{8} I_{t}^{b} (4 b) + θ_{9} I_{t}^{b} (8 b) + ε_{t}, \end{array}

(1)

where the predictors

v_{t}, a_{t}, a_{t}^{2}, v_{t} a_{t}

linearly influence the size of IECR, and

I_{t}^{b} (k) = \{\begin{array}{l} 1, & b_{t} = k, \\ 0, & b_{t} \neq k \end{array}, I_{t}^{ϕ} (s) = \{\begin{array}{l} 1, & ϕ (v_{t}, a_{t}) = s, \\ 0, & ϕ (v_{t}, a_{t}) \neq s \end{array}

for

k ∣ i n \in {4 b, 8 b}

and

s \in {A, B, D, S}

, while external factors’ influence is represented by a random variable with a normal distribution

N (0, σ^{2})

.

Parameter

θ_{0}

corresponds to the intercept in the model for the case of a bus with one battery and is in a state of acceleration, and

θ_{5}

is responsible for the change in IECR for the state of braking.

θ_{6}

is the state of driving,

θ_{7}

is the for the state of stop,

θ_{8}

is the change in IECR for a bus with four batteries, and

θ_{9}

for a bus with eight batteries. The remaining parameters in model (1) are responsible for the vehicle dynamics influence on the instantaneous energy consumption. Therefore, for the training set, we solve the problem

\begin{matrix} \underset{(θ_{0}, θ_{1}, \dots, θ_{9}) \in R^{10}}{m i n} \sum_{t = 1}^{n} (I E C R_{t} - θ_{0} - θ_{1} v_{t} - θ_{2} a_{t} - θ_{3} a_{t}^{2} - θ_{4} v_{t} a_{t} - θ_{5} I_{t}^{ϕ} (B) - \\ {θ_{6} I_{t}^{ϕ} (D) - θ_{7} I_{t}^{ϕ} (S) - θ_{8} I_{t}^{b} (4 b) - θ_{9} I_{t}^{b} (8 b))}^{2} \end{matrix}

(2)

The squares method is used to estimate the structural parameters

θ_{0}, θ_{1}, \dots, θ_{9}

[58,59]. IECR predictions estimate

\begin{matrix} {\hat{I E C R}}_{t} = {\hat{θ}}_{0} + {\hat{θ}}_{1} v_{t} + {\hat{θ}}_{2} a_{t} + {\hat{θ}}_{3} a_{t}^{2} + {\hat{θ}}_{4} v_{t} a_{t} + {\hat{θ}}_{5} I_{t}^{ϕ} (B) + {\hat{θ}}_{6} I_{t}^{ϕ} (D) + \\ {\hat{θ}}_{7} I_{t}^{ϕ} (S) + {\hat{θ}}_{8} I_{t}^{b} (4 b) + {\hat{θ}}_{9} I_{t}^{b} (8 b) + ε_{t}, \end{matrix}

(3)

4.2. Extended Linear Model Concerning States

Since we use

ϕ (v, a) : R^{+} \times R \to {A, B, D, S}

to determine the vehicle states, we analyzed the linear dependence of battery quantity, speed, and acceleration, as well as the transformation of these variables for each state (acceleration, braking, driving) and the influence of battery quantity on the stop state. The extended linear model (ELM) concerning states is presented as follows:

I E C R_{t} = \sum_{j \in {A, B, D, S}} I_{t}^{ϕ} (j) f_{j} (Θ_{j}, v_{t}, a_{t}) + ε_{t}

(4)

f_{A} (Θ_{A}, v_{t}, a_{t}) = θ_{0}^{A} + θ_{1}^{A} I_{t}^{b} (4 b) + θ_{2}^{A} I_{t}^{b} (8 b) + θ_{3}^{A} v_{t} + θ_{4}^{A} a_{t} + θ_{5}^{A} a_{t}^{2} + θ_{6}^{A} v_{t} a_{t},

(5)

f_{B} (Θ_{B}, v_{t}, a_{t}) = θ_{0}^{B} + θ_{1}^{B} I_{t}^{b} (4 b) + θ_{2}^{B} I_{t}^{b} (8 b) + θ_{3}^{B} v_{t} + θ_{4}^{B} a_{t} + θ_{5}^{B} a_{t}^{2} + θ_{6}^{B} v_{t} a_{t},

(6)

f_{D} (Θ_{D}, v_{t}, a_{t}) = θ_{0}^{D} + θ_{1}^{D} I_{t}^{b} (4 b) + θ_{2}^{D} I_{t}^{b} (8 b) + θ_{3}^{D} v_{t} + θ_{4}^{D} a_{t} + θ_{5}^{D} a_{t}^{2} + θ_{6}^{D} v_{t} a_{t},

(7)

f_{S} (Θ_{S}, v_{t}, a_{t}) = θ_{0}^{S} + θ_{1}^{S} I_{t}^{b} (4 b) + θ_{2}^{S} I_{t}^{b} (8 b)

(8)

The structural parameters in (4)–(8) satisfy the constraints. Thus, the problem of estimation of structural parameters in model (4) is a linear quadratic task with constraints, namely, for the acceleration state

Θ_{A} = (θ_{0}^{A}, \dots, θ_{6}^{A}) \in R_{+}^{7}

, for the braking state

Θ_{B} = (θ_{0}^{B}, \dots, θ_{6}^{B}) \in \times R_{-}^{4} \times R_{+} \times R_{-} \times R_{+}

, for the driving state

Θ_{D} = (θ_{0}^{D}, \dots, θ_{6}^{D}) \in R^{7}

, and for the stop state

Θ_{S} = (θ_{0}^{S}, θ_{1}^{S}, θ_{2}^{S}) \in R^{3}

.

To estimate the structural parameters in the ELM (4), we solve the problem

\underset{Θ \in D}{m i n} \sum_{t = 1}^{n} {(I E C R_{t} - \sum_{j \in {A, B, D, S}} I_{t}^{ϕ} (j) f_{j} (Θ_{j}, v_{t}, a_{t}))}^{2}

(9)

where

Θ = (Θ_{A}, Θ_{B}, Θ_{D}, Θ_{S})

, and the set of feasible solutions is

D = R_{+}^{7} \times R_{-}^{4} \times R_{+} \times R_{-} \times R_{+} \times R^{9}

. To estimate the significance of the structural parameters, we use the boostrapping technique. Namely, to estimate the distribution of each structural parameter, we perform resampling with replacement for the dataset

{(I C E_{t}, b_{t}, v_{t}, a_{t})}_{1 \leq t \leq n}

and solve the problem (9) based on the sample. By repeating

k -

the sampling process, we obtain sequences of estimators of the structural parameters

{θ_{i j}^{A}}_{1 \leq i \leq k}

,

{θ_{i j}^{B}}_{1 \leq i \leq k}

,

{θ_{i j}^{D}}_{1 \leq i \leq k}

for

j \in {0,1, \dots, 6}

and

{θ_{i j}^{S}}_{1 \leq i \leq k}

for

j \in {0,1, 2}

. From the central limit theorem, for

k \to \infty

, the sampling distribution tends to the normal distribution.

For each parameter, we determine the mean values

{\hat{θ}}_{j}^{s} = \frac{1}{k} \sum_{i = 1}^{k} θ_{i j}^{s}

and standard deviation

σ_{θ_{j}^{s}} = \sqrt{\frac{1}{k} \sum_{i = 1}^{k} {(θ_{i j}^{s} - {\hat{θ}}_{j}^{s})}^{2}}

for

s \in {A, B, D, S}

.

We assume that the prediction value of IECR is

{\hat{I E C R}}_{t} = \sum_{j \in {A, B, D, S}} I_{t}^{ϕ} (j) f_{j} ({\hat{Θ}}_{j}, v_{t}, a_{t}),

(10)

where

{\hat{Θ}}_{A}, {\hat{Θ}}_{B}, {\hat{Θ}}_{D}, {\hat{Θ}}_{S}

are the estimates of the structural parameters for Equations (5)–(8). For models (1) and (4), the residuals

ε_{t} = I E C R_{t} - {\hat{I E C R}}_{t}

denotes the differences between the predicted and empirical values of IECR for

1 \leq i \leq n

and

ε = (ε_{1}, ε_{2}, \dots, ε_{n})

denotes the residual vector, while the value

σ^{2} = \frac{ε^{T} ε}{n - m}

denotes the variance of the residuals, where

m

denotes the number of structural model’s parameters. To assess the impact of individual predictors [58,59] for each structural parameter at the significance level

α = 0.05

, we create a null hypothesis.

H₀:

θ_{i}^{s} = 0

(the influence of the

i

th factor is insignificant on IECR).

The alternative hypothesis is as follows:

H₁:

θ_{i}^{s} \neq 0

(the

i

th factor significantly affects IECR).

A statistic that describes the ratio of a parameter

θ_{i}^{s}

to the standard deviation of that parameter

σ_{θ_{i}^{s}}

as follows:

t_{i} = \frac{\hat{θ_{i}^{s}}}{σ_{θ_{i}^{s}}} \sqrt{k - m}

(11)

There is a t distribution with

(k - m)

degrees of freedom. Because the structural parameters for the ELM (4) can be estimated for Equations (5)–(8) separately, for the states acceleration, braking, and driving, the number of degrees of freedom is

k - 7

, while for the stop state, the number of degrees of freedom is

k - 3

. When testing the significance of parameters in Equation (1), the number of degrees of freedom is equal to

k - 10

. The test probability is as follows:

p_{i} = 2 * P (T > | t_{i} |),

where T is a random variable with a t distribution and an appropriate number of degrees of freedom. If

p_{i} \geq α

, then at the significance level

α

, there is no basis for rejecting the hypothesis

H_{0}

. Then, the structural parameter is insignificantly different from zero, and the predictor has a nonsignificant effect on explaining the variability of IECR. If

p_{i} < α

, at the significance level

α

, we reject the null hypothesis

H_{0}

in favour of

H_{1}

. Then, the structural parameter is significantly different from zero, and the predictor significantly impacts explaining IECR variability.

4.3. Neural Networks

Neural networks (NNs) were used to analyse IECR. For this purpose, the dataset

{(I E C R_{t}, b_{t}, v_{t}, a_{t})}_{1 \leq t \leq n}

is presented in the form

{(I E C R_{t}, x_{t})}_{1 \leq t \leq n}

, where

x_{t} = (I_{t} (1 b), I_{t} (4 b), I_{t} (8 b), v_{t}, a_{t}, a_{t}^{2}, v_{t} a_{t}, I_{t}^{ϕ} (A), I_{t}^{ϕ} (B), I_{t}^{ϕ} (D), I_{t}^{ϕ} (S))

and

1 \leq t \leq n

. NNs are among the most widely used machine learning techniques and are typically used for both classification and regression modelling [24,25,58,60,61,62], where the difference in the applications of regression or classification problems lies primarily in the selection of the activation function, especially for the last layer of the network. Each NN consists of inputs, weights (parameters), an aggregation function (defined as the dot product of the inputs and weights), an activation function, and outputs. In multilayer networks, the inputs of the first layer are predictors from the training set. However, the inputs of the subsequent layers are defined as the outputs of the activation function of the current layer. The result of the aggregation function of a neural network is sent to the activation function

g_{j}

as follows:

Z_{j} (x) = g_{j} (w_{j} Z_{j - 1} (x) + α_{j}),

(12)

where

w_{j}

denotes the weight vector and

α_{j}

denotes the bias for

j = 1, 2, \dots, K

. For the input layer, we take

Z_{0} (x) = x

. The output of the last layer (activation of the output layer) is defined as the output obtained from the NN. Thus, from (12), the last layer is defined

\begin{array}{r} Z_{K} (x) = g_{K} (w_{K} g_{K} (w_{K - 1} Z_{K - 2} (x) + α_{K - 1}) + α_{K}) = \dots \\ = g_{K} (w_{k} g_{K - 1} (w_{K - 1} g_{K - 2} (\dots g_{2} (w_{2} g_{1} (w_{1} x + α_{1}) + α_{2}) \dots) + α_{K - 1}) + α_{K}) \end{array}

The analysed networks used a linear activation function of the form

L (x) = a x + b

To identify the regression relationship using a neural network, the task is

m i n R ({w_{j}}_{1 \leq j \leq K}, {α_{j}}_{1 \leq j \leq K}) .

(13)

where the objective function is defined as Mean Squared Error (MSE). The MSE is estimated as the average value of the squared differences between the actual and predicted values. The result obtained for the network’s last layer is taken as the forecast

Z_{K} (\cdot)

. If the predictors are correlated, we can regularise some layers by introducing a case for large values of the structural parameters. Given the above function’s purpose. We can present it in the form of

R ({w_{j}}_{1 \leq j \leq K}, {α_{j}}_{1 \leq j \leq K}) = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - Z_{K} (x_{i}))}^{2} + λ_{1} \sum_{i \in S} {‖w_{i}‖}_{L_{1}} + λ_{2} \sum_{i \in S} {‖w_{i}‖}_{L_{2}}

(14)

where

S \in {1,2, \dots, K}

denotes the set of network layers for which we apply regularisation techniques, where

{‖\cdot‖}_{L_{1}}

and

{‖\cdot‖}_{L_{2}}

denote

L_{1}

and

L_{2}

norms, respectively. For NNs without regularisation, we assume

λ_{1} = λ_{2} = 0

.

In feed-forward neural networks, the entire input dataset is processed simultaneously. In contrast, recurrent neural networks (RNNs) handle the input sequentially, retaining information about previously processed elements. Through this feedback mechanism, RNNs incorporate the results of earlier iterations into subsequent computations. The backwards propagation algorithm estimates weights and biases, i.e., solves the problem (13).

4.4. Assessment of Model Fit and Prediction

For the analysis of the model fit [56,57,61], the coefficient of determination was estimated

R^{2}

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \overset{⃐}{y})}^{2}}

(15)

where

y_{i}

denotes the observed value of the dependent variable,

{\hat{y}}_{i}

is the predicted value based on the model,

i = 1,2, \dots, n

, and

\overset{⃐}{y}

is the mean value of the dependent variable. The coefficient of determination shows how much of the variability in the dependent variable is explained by the model. Additionally, the correlation coefficient between this variable’s actual and predicted values was determined as follows:

r = \frac{\sum_{i = 1}^{n} (y_{i} - \overset{⃐}{y}) ({\hat{y}}_{i} - {\overset{⃐}{y}}_{p r e d})}{\sqrt{\sum_{i = 1}^{n} {(y_{i} - \overset{⃐}{y})}^{2} \sum_{i = 1}^{n} {({\hat{y}}_{i} - {\overset{⃐}{y}}_{p r e d})}^{2}}}

(16)

where

{\overset{⃐}{y}}_{p r e d}

denotes the mean of the predicted values. To analyse the forecast discrepancies [58,59], the following indicators are used:

Mean Squared Error (MSE)

M S E = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}

(17)

Root Mean Squared Error (RMSE)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(18)

Mean Absolute Error (MAE)

M A E = \frac{1}{n} \sum_{i = 1}^{n} | y_{i} - {\hat{y}}_{i} |

(19)

5. Case Study: Energy Consumption and Recuperation Analysis

During the electric buses’ run, the consumed and recovered energy were analysed in conditions consistent with the SORT procedure. Using a Yokogawa WT-1806 power analyser, the consumed and recovered energy were recorded. The readings as time series

{\{I E C_{t_{j}}\}}_{0 \leq j \leq n}

and

{\{I E R_{t_{j}}\}}_{0 \leq j \leq n}

present a sequence representing consumed and recovered energy accordingly, and the sequence

{\{I E C R_{t_{j}}\}}_{0 \leq j \leq n}

presents IECR, where

I E C R_{t_{j}} = I E C_{t_{j}} + I E R_{t_{j}}

. The impact of the bus run on consumed and recovered energy was analysed, specifically the speed, vehicle acceleration, and battery capacity. The vehicle’s run was described by its periods in

A

—acceleration,

B

—braking,

D

—driving, and

S

—stop states, as well as the number of batteries installed in the vehicle. Particular attention was paid to instantaneous energy consumption during acceleration and energy recovery during braking. For each battery count, 10 runs lasting 3 min were performed. High repeatability of the driving conditions was achieved thanks to the driver’s precise driving, who reproduced the speed profile in each run with high compliance with the assumed cycle. Figure 1a shows the area circled in blue, illustrating the range between the highest and lowest speeds at each test moment, while Figure 1b shows the standard deviations as a function of cycle time. To facilitate the analysis of journey repeatability, Figure 1b shows the standard deviation (black curve) of speed measurements during the SORT test, where the states in which the vehicle was moving are marked for each moment.

Figure 1. The range of speed during the SORT test and the standard deviation of speed at each moment.

In the SORT test, the speed pattern reveals recurring intervals of acceleration, uniform-speed operation, deceleration, and halting. The narrow speed range within each phase (Figure 1a) indicates high repeatability, which is crucial for comparing battery types. The standard deviation values (Figure 1b) confirm this stability—the lowest deviations occur during constant speed driving and stopping. In contrast, the highest deviations are observed during acceleration and braking, reflecting the natural speed variability during these phases.

An example SORT test run and electricity consumption for a bus with eight batteries are shown in Figure 2.

Figure 2. The example of a bus journey according to SORT with eight batteries.

The vehicle speed curve (Figure 2a) reflects a typical SORT cycle, with clearly defined standstill phases, acceleration, constant speed, and braking. Instantaneous energy consumption (Figure 2b) peaks during intense acceleration, dropping to almost zero during standstill and coasting. Analysis of instantaneous energy recuperation (Figure 2c) shows significant negative power values during braking, indicating effective energy recovery during these phases. The most significant recuperation potential is observed during the transition from high speed to standstill, which has significant implications for the vehicle’s energy balance.

To evaluate how the number of batteries, acceleration, and speed affect performance, the dataset was split into training (80%) and testing (20%) subsets. The partitioning was performed by vehicle state, with 80% of the records from each state allocated to the training subset and the remaining 20% to the testing subset. This approach decoupled the analysis from the entire trip sequence (time series) and, consequently, from the specific operating conditions of the vehicle.

5.1. Linear Model

Using a linear model (1), we explain the impact of battery capacity, speed, and acceleration on the training set’s IECR. The estimator values, standard deviations of these parameters, the

t_{i}

test statistic (11), and

p_{i}

the test probability for each parameter are presented in Table 1.

Table 1. The values of parameters, standard deviation,

t_{i}

statistical values, and

p_{i}

probability test for the linear model.

For model (1), the value of the coefficient of determination

R^{2}

is 0.7859, which indicates a good fit of the model to the experimental data, with a standard deviation of the residuals of 6.97. Most explanatory variables statistically affect the IECR, including vehicle speed

v_{t}

, acceleration

a_{t}

, its square

a_{t}^{2}

, and the interaction of speed with acceleration

v_{t} a

. The influence of vehicle operating states, such as braking (braking), driving (driving), and standstill (stop), also turned out to be significant, suggesting that driving dynamics significantly shape IECR.

In turn, the number of batteries in the four- and eight-battery configurations is characterised by high test probability values (

p_{i} > 0.05

), indicating a lack of significant effect of this variable on IECR within the adopted linear model. These results suggest that the effect of battery number on energy consumption is only revealed under different operating conditions or is masked by the dominant influence of vehicle motion parameters. For the linear model (1), the residuals were analysed. The graph below shows the histogram, cumulative distribution function, residual autocorrelation, and Q-Q plot (visual verification of the normality postulate of residuals).

Analysis of the residual series

{ε_{t}}_{1 \leq t \leq n}

of the linear model (1), based on Figure 3, indicates the presence of outliers and significant autocorrelation for lags τ = 1 and τ = 2. The distribution of the residuals deviates from normality, as evidenced by the “fat tails” visible in the Q-Q plot and the differences between the empirical and theoretical distribution functions. The histogram and density function reveal asymmetry and a greater concentration of values around zero, but with a few outlying observations. These results suggest that the assumptions regarding normality and independence of the residuals are not fully met, which may indicate the influence of additional factors not accounted for in the model.

Figure 3. Residuals analysis for a linear model: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

We used the Shapiro–Wilk, Cramér–von Mises, Anderson–Darling, Lilliefors (Kolmogorov–Smirnov), D’Agostino Omnibus, and Jarque–Bera tests, as well as the independence of the residuals using the Ljung-Box test for shifts

1 \leq τ \leq 10

. Table 2 provides the statistics and test probability values.

Table 2. Results of normality and residual correlation tests for the linear model.

Analysis of the distribution of the linear model residuals revealed significant deviations from normality. All tests performed, Shapiro–Wilk, Cramér–von Mises, Anderson–Darling, Lilliefors (Kolmogorov–Smirnov), D’Agostino Omnibus, Jarque–Bera, and Ljung–Box, yielded test probability values close to zero, which is below the significance level of α = 0.05, so the hypotheses of a normal distribution of residuals should be rejected.

Such consistent results across all tests suggest that the residuals exhibit substantial deviations from normality, which may result from non-linear relationships between variables, the presence of outliers, or heteroscedasticity. The results of the Ljung–Box test additionally indicate the presence of autocorrelation in the residuals, which may indicate the validity of considering models with a different structure, e.g., non-linear or recurrent neural networks.

Then, model (1) was validated on the test set. Figure 4 shows the actual and predicted values of the IECR.

Figure 4. Real and predicted instantaneous consumption energy for the linear model.

A comparison of actual and predicted values for instantaneous energy consumption and recovery indicates a good agreement with the linear model concerning the overall trend. Data points are distributed along the red line (the line of perfect agreement between actual and predicted values). However, some scatter is visible, particularly for extreme positive and negative values. This scatter suggests that the model may have limited accuracy in predicting extreme energy consumption and recovery values, which may be due to non-linear relationships not captured by the model. Nevertheless, prediction accuracy is relatively high within the range of average values.

5.2. Extended Linear Model Concerning States

For model (4) for the acceleration state, the values of the estimators, standard deviations of these parameters, the value of

t_{i}

the test statistic (11), and

p_{i}

the test probability for each parameter are presented in Table 3.

Table 3. The values of parameters, standard deviation,

t_{i}

statistical values, and

p_{i}

probability test for the linear model for the acceleration state.

In the case of the linear model describing the acceleration condition, all main variables showed a significant effect on the explained variable, at a significance level of α = 0.05. The parameters of the number of batteries (both four- and eight-battery configurations) are statistically significant, indicating a noticeable effect of the power supply configuration on the IECR value during acceleration. The speed variable

v_{t}

, acceleration

a_{t}

, and their interactions

v_{t} a_{t}

are characterised by very high statistical values

t_{i}

and

p_{i}

, values close to zero, confirming their key role in the model. The value for the component

a_{t}^{2}

is also significant, although with a higher p-value compared to the other predictors. Such a high level of significance for parameters related to driving dynamics suggests that in the acceleration state, the variability of the IECR is strongly dependent on vehicle speed and acceleration, and the effect of battery number, although smaller than that of the motion parameters, remains significant.

For model (4) for the braking state, the values of the estimators, the standard deviations of these parameters, the value

t_{i}

test statistic (11), and

p_{i}

the test probability for each parameter are presented in Table 4.

Table 4. The values of parameters, standard deviation, statistical

T

values, and test probability for the linear model for the braking state.

For the linear model describing the braking state, most of the parameters were highly statistically significant, and the

p_{i}

test probability values were close to zero, which indicates their strong influence on IECR in this driving phase. The configuration of the number of batteries (four- and eight-battery configurations), the variable speed, and the product of velocity and acceleration are significant at a level bordering on absolute statistical certainty.

The structural parameters with acceleration variables

a_{t}

and the square of acceleration

a_{t}^{2}

show higher

p_{i}

test probability values, which suggests a minor role of these variables in the energy recovery process during braking compared to

v_{t}

the vehicle speed and the interaction

v_{t} a_{t}

of speed and acceleration. The interactive component, in turn,

v_{t} a_{t}

is significant and reflects the intricate interplay of speed and acceleration during energy recuperation.

The results indicate that in the braking state, the dominant factor determining the IECR is the vehicle speed, the product of acceleration and speed, and the number of batteries. In contrast, the influence of acceleration is complementary.

For model (4) for the driving state, the values of the estimators, standard deviations of these parameters, the value

t_{i}

test statistic (11), and

p_{i}

the test probability for each parameter are shown in Table 5.

Table 5. The values of parameters, standard deviation

t_{i}

statistical values, and

p_{i}

probability test for the linear model for the driving state.

Due to the linear model describing the driving state (driving), the analysis indicates that, of all variables, only speed

v_{t}

and the interactive term

v_{t} a_{t}

significantly affect IECR (

p_{i}

< 0.05). The test probability value for speed is extremely low,

3.998 \times 10^{- 62}

, which confirms its dominant influence on IECR in the steady-state driving phase.

In turn, the number of batteries (both four- and eight-battery configurations) and acceleration-related variables (

a_{t} i a_{t}^{2})

do not reach statistical significance, suggesting that they have limited significance in this condition. The intercept in the model is also significantly different from zero.

For model (4) for the stop state, the values of the estimators, the standard deviations of these parameters, the value

t_{i}

test statistic (11), and

p_{i}

the test probability for each parameter are presented in Table 6.

Table 6. The values of parameters, standard deviation, statistical

T

values, and test probability for the linear model for the stopped state.

In the model describing the stopped state (stop), the only statistically significant parameter is the intercept (intercept), whose value is positive (

θ_{0}^{D} = 0

.769), and the test probability is 5.463 × 10⁻⁶³ (below the significance level of 0.05), which indicates that the average energy consumption is 0.769 Wh during the stopped state. Neither the number of batteries in the four- nor eight-battery configuration significantly affects the dependent variable IECR (for these structural parameters, the test probability is above the significance level of 0.05).

Model (4) is characterised by a high value of the coefficient of determination

R^{2}

= 0.9124, which means that it reasonably reflects the variability of IECR in the states of acceleration, braking, driving, and stopping, with a low standard deviation of the residuals equal to 4.2648.

For the ELM (4), the residuals were analysed. Figure 5 presents the histogram, cumulative distribution function, autocorrelation of the residuals, and Q-Q plot.

Figure 5. Residuals analysis for a linear model with states: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

The analysis of the residuals for model (4) indicates the presence of extreme values in the series

{ε_{t}}_{1 \leq t \leq n}

and a significant autocorrelation for the τ shifts ∈ {1, 2}. The histogram and density function reveal a distribution close to normal, although with visible “fat tails”, as confirmed by the Q-Q plot. The empirical distribution function differs from the theoretical normal distribution function, but the agreement is better than the baseline model (1). Statistical tests (Shapiro–Wilk, Cramér–von Mises, Anderson–Darling, Jarque–Bera, Lilliefors/Kolmogorov–Smirnov, D’Agostino Omnibus) confirm deviations from normality, and the Ljung–Box test for shifts 1 ≤ τ ≤ 10 indicates partial dependence between subsequent residuals. The values of the statistics and test probabilities are summarised in Table 7.

Table 7. Results of normality tests and correlation tests for residuals of the linear model with states.

On normality tests (Shapiro–Wilk, Anderson–Darling, Cramér–von Mises, Lilliefors, Jarque–Bera, D’Agostino Omnibus), we see very low test probability values (below the 0.05 significance level), which indicates the rejection of the hypothesis of normality of the residual distribution in the model considering job states. Additionally, the Ljung–Box test confirms the presence of autocorrelation in the residuals. This indicates that model (4) fits the data better than model (1); however, the distribution of the residuals deviates from normality and shows significant time dependence.

Then, model (4) was validated on the test set. Figure 6 shows the actual and predicted values of the instantaneous energy consumption and recuperation.

Figure 6. Real and predicted instantaneous consumption energy for the linear model with states.

The model validation (4) results on the test set demonstrate good agreement between the actual and predicted values of IECR. The points align along the perfect fit line (red line), confirming high prediction accuracy. However, a greater discrepancy is visible at the extreme values, possibly due to the smaller number of observations in these ranges and non-linear effects not captured by the linear model.

5.3. Neural Network

For the neural network without regularisation (

λ_{1} = 0

,

λ_{2} = 0

) and with the dropout technique applied (20%), the dependency model is presented in Table 8.

Table 8. NN architecture without regularisation (dropout 20%).

The neural network model consists of five layers: an input layer with 11 features, two dense layers with ten and five neurons, respectively, a dropout layer to reduce the risk of overfitting, and an output layer with a single neuron. All dense layers use a linear activation function, which is justified by the nature of the regression problem. The total number of parameters is 181.

On the training set, the coefficient of determination

R^{2}

for the NN is 0.9059, while the standard deviation of the residuals is 4.4332. The residuals were analysed for the neural network, and the visual analysis is presented in Figure 7.

Figure 7. Residuals analysis for the neural network: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

The figure shows the occurrence of extreme values in a series of residuals

{ε_{t}}_{1 \leq t \leq n}

and a significant correlation with lag

τ \in {1, \dots, 12}

. The Q-Q plot reveals “fat tails”, and comparing the empirical and theoretical (normal) distribution functions indicates significant deviations from the normal distribution. The normality of the residual distribution was verified using the Shapiro–Wilk, Anderson–Darling, Cramér–von Mises, Lilliefors (KS), Jarque–Bera, and D’Agostino Omnibus tests, and independence was verified using the Ljung–Box test for 1 ≤ τ ≤ 10. Detailed results are presented in Table 9.

Table 9. Results of normality tests and correlation tests for residuals of the neural network.

Table 9 presents the results of eight statistical tests (normality and autocorrelation) for the neural network model residuals. For each test, the test probability value is significantly below the 0.05 significance level, indicating rejection of the null hypothesis of normality in the residuals. The results of the Jarque–Bera, D’Agostino Omnibus, and Ljung–Box tests confirm that the residuals exhibit both significant deviations from normality and autocorrelation. The neural network model presented in Table 8, as well as models (1) and (4), do not fully reflect the data’s variability structure.

Figure 8 compares actual and predicted values for IECR obtained from the NN model. A strong correlation is visible between the observed data and the predictions, with most points falling along the red line (the line of perfect fit), confirming the model’s high agreement with reality. Deviations from the red line mainly concern extreme values, where greater scatter in the predictions appears.

Figure 8. Real and predicted instantaneous consumption energy for the neural network.

5.4. Recurrent Neural Network

For the recurrent neural network without regularisation and with the dropout technique (20%), the dependency model is presented in Table 10.

Table 10. RNN architecture without regularisation (dropout 20%).

Table 10 shows the detailed architecture of the recurrent neural network without regularisation, which uses a 20% dropout technique. The table shows the type and order of layers, the shape of the output data, the number of trainable parameters, and the activation functions used in each layer. The total number of model parameters is 531, including weights and biases of all layers.

On the training set, the coefficient of determination

R^{2}

for the RNN is 0.9343, while the standard deviation of the residuals is 3.7047. The residuals were also analysed for the neural network. Figure 9 presents the histogram, cumulative distribution function, Q-Q plot, and autocorrelation of the residuals.

Figure 9. Residuals analysis for recurrent neural network: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

The residuals reveal the presence of extreme values in the residual series

{ε_{t}}_{1 \leq t \leq n}

and significant autocorrelation for the shifts τ ∈ {1, 2, 5… 12}. The Q–Q plot indicates the presence of “fat tails”, and comparing cumulative distribution functions confirms apparent differences between the theoretical and empirical normal distributions. The normality of the residual distribution was verified using the Shapiro–Wilk, Cramér–von Mises, Anderson–Darling, Lilliefors (Kolmogorov–Smirnov), Jarque–Bera, and D’Agostino Omnibus tests, while the independence of the residuals was checked using the Ljung–Box test for shifts 1 ≤ τ ≤ 10. The results of the statistics and test probabilities are shown in Table 11.

Table 11. Results of normality tests and correlation tests for residuals of the recurrent neural network.

Table 11 shows the results of normality tests and correlation tests for the residuals of the RNN for the Shapiro–Wilk, Cramér–von Mises, Anderson–Darling, Lilliefors (Kolmogorov–Smirnov), Jarque–Bera, D’Agostino Omnibus, and Ljung–Box tests. All test probability values are below the 0.05 significance level, indicating a significant deviation of the residual distribution from normality and correlation in the time series.

Figure 10 compares actual and predicted instantaneous energy consumption values obtained from the RNN. The distribution of points along the perfect fit line indicates a high degree of agreement between the predictions and the measurements. However, a greater scatter is visible for the extreme values, particularly in high and low energy consumption areas.

Figure 10. Real and predicted instantaneous consumption energy values for a recurrent neural network.

Figure 11 shows the speed and quality of learning for the neural network models. The batch was divided into two parts for each epoch: 85% was used for training the network and 15% was used for validation. The training process involved 200 epochs.

Figure 11. History of training of neural networks.

Figure 11 shows the training curves of a classical NN and RNN, along with their corresponding validation curves. Both models exhibit a sharp decline in the Mean Squared Error in the initial epochs, followed by a stabilisation at a low level after approximately 50 epochs. This curve indicates an effective model fit and no signs of overfitting.

5.5. Prediction Comparison

To compare the predictions of the models on the test set, the correlation coefficient (

r

) between the actual and predicted values was estimated. Instantaneous energy consumption and recuperation, Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) were evaluated. The values these coefficients (16)–(19) represent are shown in Table 12.

Table 12. The correlation coefficient, Mean Squared Error, Root Mean Squared Error, and Mean Absolute Error for models on the test set.

Table 12 compares the prediction quality of four models: linear, extended linear, neural network, and recurrent neural network. The best prediction results were obtained for the recurrent neural network, characterised by the highest correlation coefficient (r = 0.9666) and the lowest error values (MSE = 13.72; RMSE = 3.70; MAE = 2.24), indicating its highest accuracy in representing the test data. The Taylor diagram illustrates the graphical agreement between the models and the reference data.

The Taylor diagram (Figure 12) illustrates the fit of four models: linear model, extended linear model, classical neural network, and recurrent neural network (RNN). Both neural models have the highest correlation coefficient (above 0.95) and a standard deviation close to the reference value, indicating greater precision in representing the data than linear models.

Figure 12. Comparison of model fit.

6. Discussion of the Results

During SORT tests, the analysis of IECR in electric buses revealed a clear impact of both driving parameters (speed, acceleration) and battery configuration on vehicle energy efficiency. The obtained results confirm observations from the literature that driving profile—particularly acceleration intensity and frequency—is a key factor determining instantaneous energy consumption [40,41].

A comparison of three vehicle configurations (one battery with a pantograph, four batteries, eight batteries) showed that increasing the number of batteries. Therefore, vehicle weight leads to a noticeable increase in energy consumption during acceleration. This effect is consistent with previous studies, which indicate a significant role of the total vehicle weight in shaping energy losses, especially on routes with frequent speed changes [42,43]. At the same time, larger battery capacity promotes range extension, but, as emphasised by Yin et al. [48], this increase becomes increasingly inefficient above a certain threshold, and excessive weight increases can negatively impact the overall energy balance.

Regarding energy recovery, the obtained results confirm that its effectiveness is closely related to braking dynamics. Intensive braking phases generated higher energy recovery, which is consistent with the results of studies that strategies based on higher braking force values allow for the recovery of 25–51% of kinetic energy [15,48]. For the cases examined, the ratio of recovered to consumed energy corresponded to values reported in the literature. Nevertheless, refinements to the braking strategy, such as the application of fuzzy control or threshold algorithms, may further raise this indicator and improve the efficiency of the system as a whole [63].

The applied modelling approaches confirmed that non-linear models (NNs and RNNs) better represent the complex relationships between IECR and driving parameters than linear models. Higher values of the coefficient of determination and lower prediction errors (RMSE, MAE) for neural networks indicate their greater ability to capture non-linearities characteristic of real-world driving profiles. These results are consistent with the observations of other authors, according to which recurrent architectures (LSTM, GRU) provide perfect accuracy in the analysis of temporal data in transport [37,38]. The extended linear model, concerning vehicle states, illustrates the dependencies between the vehicle driving dynamics, the number of batteries installed in the bus, and NN models.

Analysis of the impact of driving states revealed that the most significant differences between linear and non-linear models occurred during acceleration phases, where power demand is highest. These differences were less significant during constant speed and standstill phases, confirming that the main benefits of advanced modelling are related to conditions of high-speed variability. These results fit into the broader literature context, which indicates the need for adaptive, scenario-specific predictive models in public transport [4].

The strict implementation of the speed profile consistent with the SORT methodology ensured the high repeatability of the measurement results, which enables direct comparisons between the tested variants. SORT, as a method for assessing the energy efficiency of entire vehicles in near-real-world conditions, remains an indispensable tool in research on public transport optimisation [4,5].

Based on the present findings, the most critical next step for validating and deploying the proposed modelling approach is systematically collecting and integrating real-world operational data. While SORT tests ensure high repeatability and provide reliable benchmarks, they cannot fully capture the stochastic variability of urban traffic, driver behaviour, and environmental conditions. Extending the models with operational datasets would allow for assessing their robustness in diverse scenarios and strengthening their applicability to fleet-level decision-making, including scheduling and charging optimisation. Testing under different climatic conditions and accounting for operational factors such as breakdowns, defects, or battery degradation remain important complementary directions; however, immediate priority should be given to verifying model fidelity against in-service data.

In summary, the obtained results confirm the following:

The number of batteries significantly influences the instantaneous energy consumption, especially during dynamic driving phases.
The effectiveness of recuperation depends on the braking strategy and can be increased by implementing advanced control methods.
The extended linear model concerning states is more straightforward to interpret and simple to adapt for IECR prediction; on the other hand, the quality of prediction is comparable to neural networks.
Neural network models are more effective than classical linear models in predicting IECR, especially under conditions of high-speed variability.
Standardisation of test conditions (SORT) is crucial for reliable comparison of configurations and energy efficiency evaluation.

7. Conclusions

The analysis was conducted to compare the effectiveness of linear models and neural networks, including recurrent architectures, in predicting instantaneous energy consumption and recuperation. The linear models, both the baseline (1) and the extended model with state variables (4), achieved coefficients of determination of 0.7859 and 0.9124, respectively, in the extended version. However, parameter analysis revealed that not all explanatory variables were statistically significant for the classic model—for example, the number of batteries, whose effect on IECR was insignificant. It is worth emphasising that the extended linear model showed that vehicle dynamics and battery number may be crucial for estimating the IECR prediction for different states. Analysis of the residuals revealed the presence of outliers, significant time correlations in the range τ ∈ {1,…,12}, and apparent deviations from the normality of the distribution, as confirmed by the Shapiro–Wilk, Anderson–Darling, Lilliefors, Cramér–von Mises, Jarque–Bera, and D’Agostino Omnibus tests, while the Ljung–Box test additionally indicated the lack of independence of the residuals.

Using a classical neural network allowed for a significantly higher coefficient of determination

R^{2}

(around 0.91) with a lower standard deviation of the residuals. The values of

R^{2}

for the extended linear model concerning states and the neural network models are very similar. Fit plots confirmed excellent agreement of the forecasted values with the actual data, especially in the range of average loads. However, analysis of the residuals still revealed the presence of extreme values and “fat tails” of the distribution and temporal correlations for selected delays. Introducing a recurrent architecture (RNN) further improved the accuracy of short-term forecasts, especially during rapid load changes and transitions between energy consumption and recuperation phases. Fit plots for the RNN were characterised by minimal phase shifts relative to the actual data, indicating a good ability to represent the dynamics of the processes. However, “fat tails” and some temporal correlations were still observed in the residuals for selected delays.

A comparison of the model fit and stability based on the Taylor diagram and summary indices demonstrated the advantage of neural models over linear models. NN models, particularly RNNs, achieved better correlation with measured data, lower RMSD errors, and minor shifts in the amplitude of IECR changes. The neural network training process was stable over 200 epochs, with the data divided into training (85%) and validation (15%), which avoided overfitting. The results indicate that the most effective approach for applications requiring high-precision representation of instantaneous IECR values is to use neural architectures, especially recurrent ones, due to their ability to model temporal dependencies. Linear models are more easily interpretable and can be used in simplified cases requiring low computational complexity; however, their limited flexibility translates into larger fitting errors. However, concerning states, the extended linear model achieved prediction accuracy comparable to that of neural networks. Regardless of the method used, it may be important to implement techniques that limit the influence of outliers and eliminate temporal correlations in the residuals. At the same time, further research may include expanding the set of variables with additional parameters describing the vehicle state and environmental conditions, as well as testing hybrid architectures combining RNNs with attention mechanisms.

Author Contributions

Conceptualisation, E.K., M.Z.-L., P.W., A.Ś., and A.T.; methodology; E.K., M.Z.-L., P.W., B.Š., P.L., and J.L.; software; E.K., B.Š., and J.L.; validation; E.K., B.Š., and P.L.; formal analysis; E.K., M.Z.-L., P.W., J.M., A.Ś., and A.T.; investigation; E.K., M.Z.-L., B.Š., J.L., and J.M.; resources; M.Z.-L., P.W., P.L., and A.Ś.; data curation; M.Z.-L., P.W., P.L., and A.Ś.; writing—original draft preparation; E.K., M.Z.-L., B.Š., J.L., J.M., and A.T.; writing—review and editing; P.W., P.L., J.L., and J.M.; visualisation; E.K., B.Š., J.M., and A.T.; supervision; M.Z.-L., P.L., A.Ś., and A.T.; project administration; M.Z.-L. and P.W.; funding acquisition; E.K., M.Z.-L., and A.Ś. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are available after request to the corresponding author.

Acknowledgments

This article is the result of a scientific internship by Edward Kozłowski, Magdalena Zimakowska-Laskowska, Piotr Wiśniowski, and Jan Laskowski, which took place at the Budapest University of Technology and Economics from 7 January to 7 February 2025.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

BiLSTM	Bidirectional Long Short-Term Memory
BRT	Bus Rapid Transit
CAN	Controller Area Network
ELM	Extended Linear Model
GRU	Gated Recurrent Unit
IECR	Instantaneous Energy Consumption and Recuperation
LSTM	Long Short-Term Memory
MAE	Mean Absolute Error
MPC	Model Predictive Control
MSE	Mean Squared Error
NN	Neural Network
OBD/OBD-II	On-Board Diagnostics (II gen.)
RMSE	Root Mean Squared Error
RNN	Recurrent Neural Network
SOC	State of Charge
SORT	Standardised On-Road Test

References

Li, X.; Wang, T.; Li, J.; Tian, Y.; Tian, J. Energy Consumption Estimation for Electric Buses Based on a Physical and Data-Driven Fusion Model. Energies 2022, 15, 4160. [Google Scholar] [CrossRef]
Caban, J. The investigation of eco-driving possibilities in passenger car used in urban traffic. Transp. Res. Procedia 2021, 55, 212–219. [Google Scholar] [CrossRef]
Burdzik, R.; Konieczny, Ł.; Jaworski, R.; Laskowski, D.; Polak, R. Comparison of Energy Consumption of Short and Long City Buses in Terms of Assessing the Needs for e-Mobility. In Research Methods and Solutions to Current Transport Problems; Siergiejczyk, M., Krzykowska, K., Eds.; Springer International Publishing: Cham, Switzerland, 2020; Volume 1032, pp. 74–83. [Google Scholar] [CrossRef]
Kozłowski, E.; Zimakowska-Laskowska, M.; Dudziak, A.; Wiśniowski, P.; Laskowski, P.; Stankiewicz, M.; Šnauko, B.; Lech, N.; Gis, M.; Matijošius, J. Analysis of Instantaneous Energy Consumption and Recuperation Based on Measurements from SORT Runs. Appl. Sci. 2025, 15, 1681. [Google Scholar] [CrossRef]
Mišanović, S.; Spasojević, V. Measurement of the Fuel Consumption of Buses for Public Transport by the Methodology “SORT” (Standardised On-Road Test Cycles). Int. J. Veh. Mech. Engines Transp. Syst. 2015, 41, 2. [Google Scholar]
Al-lami, A.; Torok, A. Sustainability Indicators of Surface Public Transportation. Sustainability 2023, 15, 15289. [Google Scholar] [CrossRef]
Xie, S.; He, H.; Peng, J. An energy management strategy based on stochastic model predictive control for plug-in hybrid electric buses. Appl. Energy 2017, 196, 279–288. [Google Scholar] [CrossRef]
Zhang, Z.; He, H.; Guo, J.; Han, R. Velocity prediction and profile optimization based real-time energy management strategy for Plug-in hybrid electric buses. Appl. Energy 2020, 280, 116001. [Google Scholar] [CrossRef]
Tian, X.; Cai, Y.; Sun, X.; Zhu, Z.; Xu, Y. A Novel Energy Management Strategy for Plug-in Hybrid Electric Buses Based on Model Predictive Control and Estimation of Distribution Algorithm. IEEE/ASME Trans. Mechatron. 2022, 27, 4350–4361. [Google Scholar] [CrossRef]
Yan, M.; Li, G.; Li, M.; He, H.; Xu, H.; Liu, H. Hierarchical predictive energy management of fuel cell buses with launch control integrating traffic information. Energy Convers. Manag. 2022, 256, 115397. [Google Scholar] [CrossRef]
Li, M.; Liu, H.; Yan, M.; Wu, J.; Jin, L.; He, H. Data-driven bi-level predictive energy management strategy for fuel cell buses with algorithmics fusion. Energy Convers. Manag. X 2023, 20, 100414. [Google Scholar] [CrossRef]
Jia, C.; He, H.; Zhou, J.; Li, J.; Wei, Z.; Li, K. Learning-based model predictive energy management for fuel cell hybrid electric bus with health-aware control. Appl. Energy 2024, 355, 122228. [Google Scholar] [CrossRef]
Koniak, M.; Jaskowski, P.; Madleňák, R.; Zaranka, J. Analysis of real-time energy transfer possibilities at intersections with consideration of energy storage. Arch. Transp. 2025, 73, 195–206. [Google Scholar] [CrossRef]
Yin, Z.; Ma, X.; Su, R.; Huang, Z.; Zhang, C. Regenerative Braking of Electric Vehicles Based on Fuzzy Control Strategy. Processes 2023, 11, 2985. [Google Scholar] [CrossRef]
Sandrini, G.; Chindamo, D.; Gadola, M. Regenerative Braking Logic That Maximizes Energy Recovery Ensuring the Vehicle Stability. Energies 2022, 15, 5846. [Google Scholar] [CrossRef]
Kim, Y.-J.; Lee, S.-J.; Jin, H.-S.; Suh, I.-A.; Song, S.-Y. Comparison of linear and nonlinear statistical models for analyzing determinants of residential energy consumption. Energy Build. 2020, 223, 110226. [Google Scholar] [CrossRef]
Abdelaty, H.; Al-Obaidi, A.; Mohamed, M.; Farag, H.E.Z. Machine learning prediction models for battery-electric bus energy consumption in transit. Transp. Res. Part. Transp. Environ. 2021, 96, 102868. [Google Scholar] [CrossRef]
Abdelaty, H.; Mohamed, M. A Prediction Model for Battery Electric Bus Energy Consumption in Transit. Energies 2021, 14, 2824. [Google Scholar] [CrossRef]
Kozłowski, E.; Borucka, A.; Oleszczuk, P.; Leszczyński, N. Evaluation of readiness of the technical system using the semi-Markov model with selected sojourn time distributions. Eksploat. Niezawodn.–Maint. Reliab. 2024, 26, 191545. [Google Scholar] [CrossRef]
Lech, N.; Nikończuk, P. The method of route optimization of electric vehicle. Procedia Comput. Sci. 2022, 207, 4454–4462. [Google Scholar] [CrossRef]
Dabčević, Z.; Škugor, B.; Cvok, I.; Deur, J. A Trip-Based Data-Driven Model for Predicting Battery Energy Consumption of Electric City Buses. Energies 2024, 17, 911. [Google Scholar] [CrossRef]
Vepsäläinen, J.; Otto, K.; Lajunen, A.; Tammi, K. Computationally efficient model for energy demand prediction of electric city bus in varying operating conditions. Energy 2019, 169, 433–443. [Google Scholar] [CrossRef]
Sakhno, V.; Kravchenko, O.; Korpach, A.; Korpach, O.; Bosenko, V.; Dižo, J.; Blatnický, M. THE CHOICE AND REASONING OF THE BUS RAPID TRANSIT SYSTEMS FOR CITY TRANSPORT. Sci. J. Silesian Univ. Technol. Ser. Transp. 2024, 123, 259–275. [Google Scholar] [CrossRef]
Kłosowski, G.; Rymarczyk, T.; Niderla, K.; Kulisz, M.; Skowron, Ł.; Soleimani, M. Using an LSTM network to monitor industrial reactors using electrical capacitance and impedance tomography—A hybrid approach. Eksploat. Niezawodn.–Maint. Reliab. 2023, 25, 11. [Google Scholar] [CrossRef]
Pawlik, P.; Kania, K.; Przysucha, B. Fault diagnosis of machines operating in variable conditions using artificial neural network not requiring training data from a faulty machine. Eksploat. Niezawodn.–Maint. Reliab. 2023, 25, 168109. [Google Scholar] [CrossRef]
Achariyaviriya, W.; Wongsapai, W.; Janpoom, K.; Katongtung, T.; Mona, Y.; Tippayawong, N.; Suttakul, P. Estimating Energy Consumption of Battery Electric Vehicles Using Vehicle Sensor Data and Machine Learning Approaches. Energies 2023, 16, 6351. [Google Scholar] [CrossRef]
Janpoom, K.; Suttakul, P.; Achariyaviriya, W.; Fongsamootr, T.; Katongtung, T.; Tippayawong, N. Investigating the influential factors in real-world energy consumption of battery electric vehicles. Energy Rep. 2023, 9, 316–320. [Google Scholar] [CrossRef]
Oh, G.; Leblanc, D.J.; Peng, H. Vehicle Energy Dataset (VED), A Large-Scale Dataset for Vehicle Energy Consumption Research. IEEE Trans. Intell. Transp. Syst. 2022, 23, 3302–3312. [Google Scholar] [CrossRef]
Ko, K.; Lee, T.; Jeong, S. A Deep Learning Method for Monitoring Vehicle Energy Consumption with GPS Data. Sustainability 2021, 13, 11331. [Google Scholar] [CrossRef]
Whitmore, G.; Rockstroh, T.; Haenel, P.; Wilbrand, K.; Pomrehn, M. A Data Driven Approach for Real-World Vehicle Energy Consumption Prediction; SAE Technical Paper 2024-01-2870; SAE International: Detroit, MI, USA, 2024. [Google Scholar] [CrossRef]
Dermendzhiyski, Y.; Pavlov, N.; Dacova, D. Electric Vehicle Energy Consumption Measurement Using GPS Data Logger. In Proceedings of the 2023 15th Electrical Engineering Faculty Conference (BulEF), Varna, Bulgaria, 16–19 September 2023; pp. 1–4. [Google Scholar] [CrossRef]
Bahij, M.; Labbadi, M.; Cherkaoui, M.; Chatri, C.; Lakrit, S. A Comparison Study of Machine Learning Methods for Energy Consumption Forecasting in Industry. In Digital Technologies and Applications; Motahhir, S., Bossoufi, B., Eds.; Springer International Publishing: Cham, Switzerland, 2021; Volume 211, pp. 165–175. [Google Scholar] [CrossRef]
Shiau, Y.-H.; Yang, S.-F.; Adha, R.; Muzayyanah, S. Modeling Industrial Energy Demand in Relation to Subsector Manufacturing Output and Climate Change: Artificial Neural Network Insights. Sustainability 2022, 14, 2896. [Google Scholar] [CrossRef]
Özçelep, Y.; Sevgen, S.; Samli, R. A study on the hydrogen consumption calculation of proton exchange membrane fuel cells for linearly increasing loads: Artificial Neural Networks vs Multiple Linear Regression. Renew. Energy 2020, 156, 570–578. [Google Scholar] [CrossRef]
Ilbeigi, M.; Ghomeishi, M.; Dehghanbanadaki, A. Prediction and optimization of energy consumption in an office building using artificial neural network and a genetic algorithm. Sustain. Cities Soc. 2020, 61, 102325. [Google Scholar] [CrossRef]
Lee, H.; Kim, D.; Gu, J.-H. Prediction of Food Factory Energy Consumption Using MLP and SVR Algorithms. Energies 2023, 16, 1550. [Google Scholar] [CrossRef]
Bai, Y.; Xie, J.; Liu, C.; Tao, Y.; Zeng, B.; Li, C. Regression modeling for enterprise electricity consumption: A comparison of recurrent neural network and its variants. Int. J. Electr. Power Energy Syst. 2021, 126, 106612. [Google Scholar] [CrossRef]
Al-Haddad, Y.; Abdu İbrahim, A.; Naeem, R. Forecasting Energy Consumption in Smart Grids: A Comparative Analysis of Recurrent Neural Networks. Iraqi J. Comput. Inform. 2024, 50, 123–132. [Google Scholar] [CrossRef]
Zhang, Y.; Yang, J.; Zhou, S.; Li, P. Building energy consumption prediction based on Bayesian optimized LSTM model. In International Conference on Algorithms, High Performance Computing, and Artificial Intelligence (AHPCAI 2023); Saxena, S., Zhao, C., Eds.; SPIE: Bellingham, WA, USA, 2023; p. 62. [Google Scholar] [CrossRef]
Suarez, J.; Makridis, M.; Anesiadou, A.; Komnos, D.; Ciuffo, B.; Fontaras, G. Benchmarking the driver acceleration impact on vehicle energy consumption and CO₂ emissions. Transp. Res. Part D Transp. Environ. 2022, 107, 103282. [Google Scholar] [CrossRef] [PubMed]
Skuza, A.; Szumska, E.M.; Jurecki, R.; Pawelec, A. Modeling the Impact of Traffic Parameters on Electric Vehicle Energy Consumption. Energies 2024, 17, 5423. [Google Scholar] [CrossRef]
Basso, R.; Kulcsár, B.; Egardt, B.; Lindroth, P.; Sanchez-Diaz, I. Energy consumption estimation integrated into the Electric Vehicle Routing Problem. Transp. Res. Part. Transp. Environ. 2019, 69, 141–167. [Google Scholar] [CrossRef]
Arnau, F.J.; Pla, B.; Bares, P.; Perin, A.T. Eco-Driving Optimization of a Signalized Route With Extended Traffic State Information. IEEE Intell. Transp. Syst. Mag. 2023, 15, 35–45. [Google Scholar] [CrossRef]
Khasawneh, H.J.; Alzubi, M.A.M.; Habib, M.; Al-Tarifi, M.A. Investigation of Road Topography on Energy Consumption of Battery Electric Vehicles (BEVs). In Proceedings of the 2025 15th International Renewable Energy Congress (IREC), Hammamet, Tunisia, 2–4 February 2025; pp. 1–4. [Google Scholar] [CrossRef]
Chidambaram, R.K.; Chatterjee, D.; Barman, B.; Das, P.P.; Taler, D.; Taler, J.; Sobota, T. Effect of Regenerative Braking on Battery Life. Energies 2023, 16, 5303. [Google Scholar] [CrossRef]
Bhat, A.; Suryawanshi, A.; Jadhav, O.; Jawale, S. Optimizing Regenerative Braking in Electric Vehicle: A Comprehensive Study of Control Strategies and Energy Recovery. In Proceedings of the 2025 3rd IEEE International Conference on Industrial Electronics: Developments & Applications (ICIDeA), Bhubaneswar, India, 21–22 February 2025; pp. 1–4. [Google Scholar] [CrossRef]
Geng, C.; Ning, D.; Guo, L.; Xue, Q.; Mei, S. Simulation Research on Regenerative Braking Control Strategy of Hybrid Electric Vehicle. Energies 2021, 14, 2202. [Google Scholar] [CrossRef]
Yin, Z.; Ma, X.; Zhang, C.; Su, R.; Wang, Q. A Logic Threshold Control Strategy to Improve the Regenerative Braking Energy Recovery of Electric Vehicles. Sustainability 2023, 15, 16850. [Google Scholar] [CrossRef]
Spichartz, P.; Sourkounis, C. Assessment of recuperation strategies for electric vehicles by simulations and measurements. In Proceedings of the 2016 International Symposium on Power Electronics, Electrical Drives, Automation and Motion (SPEEDAM), Capri, Italy, 22–24 June 2016; pp. 1226–1231. [Google Scholar] [CrossRef]
Jafari, M.; Gauchia, A.; Zhang, K.; Gauchia, L. Simulation and Analysis of the Effect of Real-World Driving Styles in an EV Battery Performance and Aging. IEEE Trans. Transp. Electrif. 2015, 1, 391–401. [Google Scholar] [CrossRef]
Barauskas, A.; Brilingaitė, A.; Bukauskas, L.; Čeikutė, V.; Čivilis, A.; Šaltenis, S. Test-data generation and integration for long-distance e-vehicle routing. GeoInformatica 2023, 27, 737–758. [Google Scholar] [CrossRef]
Kozak, M.; Piepho, H.-P. What’s normal anyway? Residual plots are more telling than significance tests when checking ANOVA assumptions. J. Agron. Crop Sci. 2018, 204, 86–98. [Google Scholar] [CrossRef]
Nouri, A.; Frisch, J.; Van Treeck, C. Statistical methodologies for verification of building energy performance simulation. In Proceedings of the 17th IBPSA Conference, Bruges, Belgium, 1–3 September 2021. [Google Scholar] [CrossRef]
Hao, M.; Ji, J.; Bie, Y. Impact of Ambient Temperature on Electric Bus Energy Consumption in Cold Regions: Case Study of Meihekou City, China. In Smart Transportation Systems 2021; Qu, X., Zhen, L., Howlett, R.J., Jain, L.C., Eds.; Springer: Singapore, 2021; Volume 231, pp. 95–103. [Google Scholar] [CrossRef]
Szilassy, P.Á.; Földes, D. Impact of Ambient Temperature, Passenger Load, and Delay on the Consumption of Battery Electric Buses. Period. Polytech. Transp. Eng. 2024, 52, 113–119. [Google Scholar] [CrossRef]
Vepsäläinen, J.; Ritari, A.; Lajunen, A.; Kivekäs, K.; Tammi, K. Energy Uncertainty Analysis of Electric Buses. Energies 2018, 11, 3267. [Google Scholar] [CrossRef]
Li, P.; Zhang, Y.; Zhang, Y.; Zhang, K.; Jiang, M. The effects of dynamic traffic conditions, route characteristics and environmental conditions on trip-based electricity consumption prediction of electric bus. Energy 2021, 218, 119437. [Google Scholar] [CrossRef]
Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer: New York, NY, USA, 2009. [Google Scholar] [CrossRef]
James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning: With Applications in R; Springer: New York, NY, USA, 2021. [Google Scholar] [CrossRef]
Tomiło, P. Classification of the Condition of Pavement with the Use of Machine Learning Methods. Transp. Telecommun. J. 2023, 24, 158–166. [Google Scholar] [CrossRef]
Tomiło, P.; Laskowski, J.; Laskowska, A. Artificial neural network model based on Kolmogorov-Arnold representation theorem and retention mechanism for real-time aircraft flight phases classification. Eng. Appl. Artif. Intell. 2025, 160, 112004. [Google Scholar] [CrossRef]
Majerek, D.; Rymarczyk, T.; Wójcik, D.; Kozłowski, E.; Rzemieniak, M.; Gudowski, J.; Gauda, K. Machine Learning and Deterministic Approach to the Reflective Ultrasound Tomography. Energies 2021, 14, 7549. [Google Scholar] [CrossRef]
Biao, J.; Xiangwen, Z.; Yangxiong, W.; Wenchao, H. Regenerative Braking Control Strategy of Electric Vehicles Based on Braking Stability Requirements. Int. J. Automot. Technol. 2021, 22, 465–473. [Google Scholar] [CrossRef]

Figure 1. The range of speed during the SORT test and the standard deviation of speed at each moment.

Figure 2. The example of a bus journey according to SORT with eight batteries.

Figure 3. Residuals analysis for a linear model: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

Figure 4. Real and predicted instantaneous consumption energy for the linear model.

Figure 5. Residuals analysis for a linear model with states: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

Figure 6. Real and predicted instantaneous consumption energy for the linear model with states.

Figure 7. Residuals analysis for the neural network: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

Figure 8. Real and predicted instantaneous consumption energy for the neural network.

Figure 9. Residuals analysis for recurrent neural network: (a) histogram and density function of residuals, (b) cumulative distribution function of residuals, (c) normal quantile–quantile plot, (d) autocorrelation function of time series of residuals.

Figure 10. Real and predicted instantaneous consumption energy values for a recurrent neural network.

Figure 11. History of training of neural networks.

Figure 12. Comparison of model fit.

Table 1. The values of parameters, standard deviation,

t_{i}

statistical values, and

p_{i}

probability test for the linear model.

Table 1. The values of parameters, standard deviation,

t_{i}

statistical values, and

p_{i}

probability test for the linear model.

	$θ_{i}$	$S_{θ_{i}}$	$t_{i}$	$p_{i}$
Intercept	7.7494	0.5992	12.9319	$1.249 \times 10^{- 37}$
$v_{t}$	−0.0199	0.0092	−2.1668	0.0303
$a_{t}$	−4.8876	0.6160	−7.9340	$2.619 \times 10^{- 15}$
$a_{t}^{2}$	5.9673	0.5391	11.0685	$3.885 \times 10^{- 28}$
$v_{t} a_{t}$	1.0696	0.0179	59.7217	0
State: braking	−10.8159	0.8578	−12.6087	$6.942 \times 10^{- 36}$
State: driving	−4.0014	0.5442	−7.3529	$2.267 \times 10^{- 13}$
State: stop	−7.1746	0.6111	−11.7410	$2.097 \times 10^{- 31}$
4 batteries	0.1800	0.2568	0.7008	0.4835
8 batteries	0.3663	0.2431	1.5070	0.1319

Table 2. Results of normality and residual correlation tests for the linear model.

Test	Statistics	p-Value
Shapiro–Wilk normality test	0.8330	0
Anderson–Darling test	260.7239	0
Cramér–von Mises test	51.7989	0
Lilliefors (Kolmogorov–Smirnov) test	0.1912	0
Jarque–Bera test	14,838.5399	0
D’Agostino Omnibus test	1496.8640	0
Ljung–Box test	1233.4281	0

Table 3. The values of parameters, standard deviation,

t_{i}

statistical values, and

p_{i}

probability test for the linear model for the acceleration state.

Table 3. The values of parameters, standard deviation,

t_{i}

statistical values, and

p_{i}

probability test for the linear model for the acceleration state.

	$θ_{i}^{A}$	$S_{θ_{i}}$	$t_{i}$	$p_{i}$
Intercept	4.7581	0.6251	240.6906	0
4 batteries	1.2854	0.4177	97.3119	0
8 batteries	3.3831	0.4325	247.3615	0
$v_{t}$	0.0069	0.0175	12.4784	8.999 × 10⁻³⁶
$a_{t}$	0.5345	0.7113	23.7639	8.862 × 10⁻¹²²
$a_{t}^{2}$	0.0120	0.0995	3.8072	7.069 × 10⁻⁵
$v_{t} a_{t}$	1.2276	0.0446	870.7524	0

Table 4. The values of parameters, standard deviation, statistical

T

values, and test probability for the linear model for the braking state.

Table 4. The values of parameters, standard deviation, statistical

T

values, and test probability for the linear model for the braking state.

	$θ_{i}^{B}$	$S_{θ_{i}}$	$t_{i}$	$p_{i}$
Intercept	−1 × 10⁻⁹	6.1195 × 10⁻¹⁶	−51,675,295.49	0
4 batteries	−1.27 × 10⁻⁹	4.3039 × 10⁻¹⁷	−734,749,089.34	0
8 batteries	−0.4778	0.2974	−50.8013	0
$v_{t}$	−0.4649	0.0219	−672.3865	0
$a_{t}$	5.103 × 10⁻¹⁹	1.942 × 10⁻¹⁵	0.0083	0.4967
$a_{t}^{2}$	−1.726 × 10⁻¹⁷	9.126 × 10⁻¹⁶	−0.5982	0.2749
$v_{t} a_{t}$	0.2785	0.0303	290.3297	0

Table 5. The values of parameters, standard deviation

t_{i}

statistical values, and

p_{i}

probability test for the linear model for the driving state.

Table 5. The values of parameters, standard deviation

t_{i}

statistical values, and

p_{i}

probability test for the linear model for the driving state.

	$θ_{i}^{D}$	$S_{θ_{i}}$	$t_{i}$	$p_{i}$
Intercept	−1.7340	0.4654	−3.7260	0.0002
4 batteries	0.2915	0.3148	0.9261	0.3548
8 batteries	0.2049	0.2977	0.6881	0.4917
$v_{t}$	0.1751	0.0094	18.6364	3.998 × 10⁻⁶²
$a_{t}$	−13.0527	9.3604	−1.3944	0.1637
$a_{t}^{2}$	−73.5850	57.7703	−1.2738	0.2032
$v_{t} a_{t}$	0.7112	0.2179	3.2633	0.0012

Table 6. The values of parameters, standard deviation, statistical

T

values, and test probability for the linear model for the stopped state.

Table 6. The values of parameters, standard deviation, statistical

T

values, and test probability for the linear model for the stopped state.

	$θ_{i}^{S}$	$S_{θ_{i}}$	$t_{i}$	$p_{i}$
Intercept	0.7690	0.0436	17.6492	5.463 × 10⁻⁶³
4 batteries	0.0102	0.0614	0.1660	0.8682
8 batteries	−0.0734	0.0611	−1.202	0.2296

Table 7. Results of normality tests and correlation tests for residuals of the linear model with states.

Test	Statistics	p-Value
Shapiro–Wilk normality test	0.8996	0
Anderson–Darling test	131.9078	0
Cramér–von Mises test	26.4533	0
Lilliefors (Kolmogorov–Smirnov) test	0.1580	0
Jarque–Bera test	7067.8145	0
D’Agostino Omnibus test	651.1530	0
Ljung–Box test	769.2797	0

Table 8. NN architecture without regularisation (dropout 20%).

Layer (Type)	Output Shape	Parameters	Activation
Input Layer	(None, 11)
Dense 1	(None, 10)	120	Linear
Dropout	(None, 10)	0
Dense 1	(None, 5)	55	Linear
Output Dense	(None, 1)	6	Linear

Table 9. Results of normality tests and correlation tests for residuals of the neural network.

Test	Statistics	p-Value
Shapiro–Wilk normality test	0.9008	0
Anderson–Darling test	179.6719	0
Cramér–von Mises test	37.1023	0
Lilliefors (Kolmogorov–Smirnov) test	0.1578	0
Jarque–Bera test	2255.0859	0
D’Agostino Omnibus test	689.9610	0
Ljung–Box test	1372.5825	0

Table 10. RNN architecture without regularisation (dropout 20%).

Layer (Type)	Output Shape	Parameters	Activation
Input Layer	(None, 11)	0
Dense 1 (Simple RNN)	(None, 11, 8)	80	Linear
Dropout	(None, 11.8)	0
Dense 2 (Flatten)	(1, 88)	0	$-$
Dense 2	(1, 5)	445	Linear
Output Dense	(1, 1)	6	Linear

Table 11. Results of normality tests and correlation tests for residuals of the recurrent neural network.

Test	Statistics	p-Value
Shapiro–Wilk normality test	0.8769	0
Anderson–Darling test	180.4883	0
Cramér–von Mises test	37.5676	0
Lilliefors (Kolmogorov–Smirnov) test	0.1650	0
Jarque–Bera test	8241.3118	0
D’Agostino Omnibus test	1154.2360	0
Ljung–Box test	750.2810	0

Table 12. The correlation coefficient, Mean Squared Error, Root Mean Squared Error, and Mean Absolute Error for models on the test set.

	r	MSE	RMSE	MAE
Linear Model	0.8864878	48.49007	6.963481	4.047660
Extended Linear Model	0.9557132	18.18811	4.264752	2.747482
Neural Network	0.9531719	19.65345	4.433222	2.755827
Recurrent Neural Network	0.9666154	13.72478	3.704697	2.241747

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Analysis of Instantaneous Energy Consumption and Recuperation in Electric Buses During SORT Tests Using Linear and Neural Network Models

Abstract

1. Introduction

2. Literature Review

3. Materials and Methods

3.1. Measuring Equipment

3.2. Measurement Location and Conditions

3.3. SORT Test Procedure

3.4. Data Analysis

4. Research Tools

4.1. Linear Model

4.2. Extended Linear Model Concerning States

4.3. Neural Networks

4.4. Assessment of Model Fit and Prediction

5. Case Study: Energy Consumption and Recuperation Analysis

5.1. Linear Model

5.2. Extended Linear Model Concerning States

5.3. Neural Network

5.4. Recurrent Neural Network

5.5. Prediction Comparison

6. Discussion of the Results

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Article Metrics

Citations

Article Access Statistics