Addressing Mixed-Integer Nonlinear Energy Management in Hybrid Vehicles: Comparing Genetic Algorithm and Sequential Quadratic Programming Within Model Predictive Control

Herkenrath, Ferris; Koßler, Silas; Günther, Marco; Pischinger, Stefan

doi:10.3390/en19061535

Open AccessArticle

Addressing Mixed-Integer Nonlinear Energy Management in Hybrid Vehicles: Comparing Genetic Algorithm and Sequential Quadratic Programming Within Model Predictive Control

Chair of Thermodynamics of Mobile Energy Conversion Systems (TME), RWTH Aachen University, 52074 Aachen, Germany

^*

Author to whom correspondence should be addressed.

Energies 2026, 19(6), 1535; https://doi.org/10.3390/en19061535

Submission received: 24 February 2026 / Revised: 12 March 2026 / Accepted: 17 March 2026 / Published: 20 March 2026

(This article belongs to the Special Issue Challenges and Research Trends of Energy Management)

Download

Browse Figures

Versions Notes

Abstract

Model Predictive Control (MPC) has emerged as a promising approach for energy management in hybrid electric vehicles, enabling predictive optimization of powertrain operation. The energy management problem in parallel hybrid powertrains constitutes a Mixed-Integer Nonlinear Programming (MINLP) problem, combining continuous decision variables such as torque distribution with discrete decisions including engine on/off states and clutch engagement. This problem structure presents distinct challenges for different optimization approaches. Gradient-based methods such as Sequential Quadratic Programming (SQP) solve continuous, differentiable optimization problems and require auxiliary methods to handle integer variables, while metaheuristic approaches such as Genetic Algorithms (GA) can handle the mixed-integer structure directly at the cost of increased computational effort. This study presents a systematic comparison between GA and SQP as optimization solvers within an MPC framework for a P1P3 parallel hybrid powertrain. A multi-objective cost function is formulated to simultaneously optimize system efficiency, battery state of charge management, and noise emissions. Both approaches are evaluated across the WLTC as well as a real-world RDE scenario. On the WLTC, both MPC approaches reduce fuel consumption by 0.5–1.0% and improve system efficiency by 3.7–4.6% compared to a state-of-the-art deterministic reference strategy optimized for fuel consumption. At the same time, both approaches additionally achieve substantial reductions in noise emissions compared to the deterministic reference, which was not optimized for acoustic behavior. On both cycles, the GA-based MPC achieves favorable performance compared to SQP, with the performance gap widening from the WLTC to the RDE cycle. Both methods achieve real-time capability, yet SQP reduces computational time by a factor of four compared to GA. As long as computational resources in automotive ECUs remain constrained, this efficiency advantage positions gradient-based optimization for series production applications, whereas metaheuristic methods offer greater flexibility for concept development stages with relaxed real-time requirements. The findings contribute to the understanding of optimization algorithm selection for MINLP energy management problems in hybrid electric vehicles.

Keywords:

energy management system; model predictive control; multi-objective optimization; genetic algorithm; sequential quadratic programming; MINLP

1. Introduction

The transportation sector remains a significant contributor to greenhouse gas emissions, with passenger vehicles accounting for a substantial share of CO₂ output in the European Union [1,2,3]. Hybrid electric vehicles (HEVs) offer a promising pathway to reduce fuel consumption and emissions by combining internal combustion engines with electric propulsion systems [4,5].

The energy management system (EMS) is central to realizing the efficiency potential of hybrid powertrains, as it governs the power distribution between the combustion engine and electric machines during vehicle operation [4]. It can address multiple, often conflicting objectives including fuel efficiency, battery state of charge (SoC), and driving comfort aspects such as noise emissions [5]. This multi-objective nature necessitates optimization-based control strategies capable of balancing these competing demands.

Model Predictive Control (MPC) has emerged as a particularly suitable framework for hybrid vehicle energy management, as it enables predictive optimization over a finite horizon while incorporating system constraints and multiple objectives [6,7,8,9]. By utilizing a system model to predict future states, MPC determines optimal control inputs that minimize a defined cost function subject to operational boundaries [10]. This predictive capability is especially advantageous in hybrid powertrains where anticipating future driving conditions can significantly improve energy utilization.

Beyond solver selection, which is the focus of the present study, the broader literature on MPC-based energy management has explored adaptive and learning-based extensions. Adaptive MPC strategies adjust prediction models or cost function parameters during operation to improve robustness across varying driving conditions [11], while adaptive dynamic programming enables the online learning of control policies without requiring complete system models [12]. Deep reinforcement learning has been applied to hybrid vehicle energy management to handle high-dimensional state spaces and achieve near-optimal performance across diverse scenarios [13,14]. Event-triggered MPC formulations reduce computational burden by executing the optimization only when relevant state changes occur [15]. These approaches address the control framework itself, whereas the choice of optimization solver for the underlying MINLP problem has received comparatively little attention and constitutes the focus of this study.

The underlying optimization problem in parallel hybrid powertrains constitutes a Mixed-Integer Nonlinear Programming (MINLP) problem. The decision space comprises continuous variables such as torque distribution among the combustion engine and electric machines, as well as discrete variables including engine on/off states and clutch engagement [16]. This mixed-integer structure, combined with the nonlinear characteristics of for example component efficiency maps, creates a non-convex solution space that poses distinct challenges for optimization algorithms.

Gradient-based methods such as Sequential Quadratic Programming (SQP) are widely employed for MPC applications due to their computational efficiency and well-established convergence properties for continuous problems [17,18]. When applied to MINLP problems, however, these methods require decomposition into multiple continuous subproblems, solving each operating mode separately and subsequently selecting the optimal solution. In contrast, metaheuristic approaches such as Genetic Algorithms (GA) can handle the mixed-integer structure directly through their population-based, derivative-free search mechanism [19,20,21]. While GAs offer flexibility in handling arbitrary problem structures, they typically require higher computational effort compared to gradient-based methods. The traditionally higher computational demand of metaheuristic approaches has historically constrained their applicability in real-time control systems. However, the computational capabilities of automotive Electronic Control Units (ECUs) have increased substantially in recent years. The demand for computing power is quickly increasing in the automotive domain, and car manufacturers as well as tier-one suppliers are gradually introducing multicore ECUs in their electronic architectures [22,23]. These multicore architectures offer higher levels of parallelism and enable the implementation of more complex control algorithms [22,24]. In parallel, optimization methods for automatic control are increasingly being deployed on embedded hardware platforms for application-specific needs such as guaranteed communication latency and cost effectiveness [25,26]. This development greatly broadens the scope of applications to which optimization methods can be applied in sectors such as automotive [25]. Together with advances in solver algorithms, these hardware improvements continue to expand the boundaries of real-time optimization in embedded control systems [23,24,26]. While these technological advances create new opportunities for deploying both gradient-based and metaheuristic optimization in automotive applications. This study focuses on GA and SQP as representatives of the two principal solver paradigms, metaheuristic and gradient-based, while approaches such as Dynamic Programming are excluded due to the curse of dimensionality in multi-objective problems with multiple continuous control variables.

The selection of GA and SQP as representative solvers is motivated by their position as established methods within the two principal paradigms for solving MINLP problems in real-time control applications. GA represents the class of derivative-free, population-based metaheuristic methods, which also includes Particle Swarm Optimization and Simulated Annealing. SQP represents gradient-based iterative methods that solve a sequence of continuous subproblems. Modern MINLP frameworks such as Bonmin employ branch-and-bound strategies in which continuous NLP relaxations are solved at each node of the search tree, typically using gradient-based solvers such as Ipopt [27]. While these frameworks offer theoretical advantages for convex problems [28], the non-convex nature of the powertrain energy management problem limits their applicability to heuristic modes, and the computational overhead of the branch-and-bound tree poses challenges for real-time implementation at the sampling rates required in this study. Mixed-Integer Quadratic Programming (MIQP) formulations require the cost function and constraints to be quadratic, which does not hold for the nonlinear component efficiency maps considered here without introducing additional approximation errors. Hybrid MPC frameworks that combine integer and continuous optimization typically rely on gradient-based or metaheuristic core solvers and thus do not constitute a fundamentally different paradigm. By comparing GA and SQP, this study therefore covers the two ends of the spectrum between global search capability and computational efficiency that is relevant for embedded energy management applications.

Despite the extensive literature on both optimization approaches individually, comparisons between metaheuristic and gradient-based solvers within MPC frameworks for hybrid vehicle energy management remain limited. Existing studies predominantly focus on either approach in isolation, leaving open questions regarding their relative performance across different driving scenarios and optimization objectives. This gap is particularly relevant for powertrain concept development, where the choice of optimization algorithm can significantly influence design decisions.

This study presents a systematic comparison between GA and SQP as optimization solvers within an MPC-based energy management system for a P1P3 parallel hybrid powertrain in a Model-in-the-Loop environment coupling the MPC controller in implemented in MATLAB R2024b with the vehicle plant model implemented in GT Suite v2024. A multi-objective cost function addressing system efficiency, SoC management, and noise emissions is formulated, with the cost function parameterized through Design of Experiments (DoE). The comparative evaluation encompasses the regulatory WLTC as well as a real-world RDE scenario, a short urban and rural phase followed by highway driving conditions. The contribution of this work lies in providing quantitative evidence for algorithm selection in MINLP energy management problems.

This paper is organized as follows: Section 2 describes the vehicle and powertrain model under investigation. Section 3 presents the MPC framework and cost function formulation. Section 4 details the implementation of both optimization algorithms. Section 5 discusses the comparative results, and Section 6 concludes with implications for algorithm selection in hybrid vehicle energy management.

2. Vehicle and Powertrain Model

This section describes the 48 V mild hybrid powertrain configuration used in this study. First, the P1P3 parallel hybrid architecture is introduced, followed by a description of the available operating modes and their constraints. Subsequently, the main component specifications are presented. The powertrain and vehicle configurations were derived based on a market projection for the year 2030+ for a typical C-class vehicle [29].

The powertrain and vehicle simulation models are implemented in GT-Suite and were developed and validated against experimental data in a preceding study [30]. Component parameters, including the ICE efficiency map, electric machine characteristics, battery model, and transmission data, are derived from test bench measurements and supplier specifications. The co-simulation environment coupling the MPC controller in MATLAB with the GT-Suite plant model was established in [29] and is adopted without modification for the present study.

2.1. Powertrain Architecture

The investigated powertrain follows a P1P3 parallel hybrid configuration, combining features of both parallel and serial hybrid topologies. In this architecture, two electric machines (EM) are integrated alongside a conventional internal combustion engine (ICE), enabling multiple operating modes. A simplified representation of the powertrain layout is shown in Figure 1.

The first electric machine (EM1) is positioned directly behind the ICE and before the clutch, corresponding to the P1 position. This configuration enables EM1 to function as a starter-generator and to facilitate serial operation when the clutch is disengaged. The second electric machine (EM2) is located after the transmission at the P3 position, providing capabilities for electric driving, regenerative braking, and load point shifting during parallel operation [31,32].

The ICE can be mechanically decoupled from the drivetrain via a clutch, allowing independent operation of the combustion engine and enabling purely electric propulsion. A seven speed dual clutch transmission (DCT) connects the powertrain to the front axle through a differential.

2.2. Component Specifications

The main powertrain components and their specifications are summarized in Table 1. The ICE provides a maximum power of 80 kW, suitable for the investigated mid-size passenger vehicle application. The electric machines are dimensioned asymmetrically: EM2 with 20 kW enables electric driving and load point shifting, while EM1 with 15 kW is primarily designed for engine starting and limited serial operation.

2.3. Operating Modes

The P1P3 configuration enables three following operating modes which are electric, serial and parallel mode.

In electric mode the clutch is open and the ICE is deactivated. EM2 solely provides the required traction torque, limited by the available battery power and the electric machine’s torque–speed characteristics. This mode is typically employed at low vehicle speeds and during coasting.

In serial mode the clutch remains open while the ICE operates. EM1 functions as a generator, converting mechanical energy from the ICE to electrical energy, which is either stored in the battery or directly used by EM2 for propulsion. This mode allows the ICE to operate at efficiency-optimized points independent of vehicle speed. However, in the investigated powertrain configuration, the serial operating range is significantly constrained by the power rating of EM1. As illustrated in Figure 2, the ICE can only be operated below the 15 kW power hyperbola during serial mode, effectively limiting this mode to low-load driving conditions.

In parallel mode the clutch is closed, mechanically coupling the ICE to the drivetrain. Both the ICE and EM2 can simultaneously contribute to traction, enabling load point shifting strategies. EM2 can either assist the ICE (boosting) or act as a generator for battery charging (load point shift).

3. MPC Framework and Problem Formulation

This section presents the Model Predictive Control framework employed for energy management. First, the general MPC principle is introduced. Subsequently, the multi-objective cost function is formulated, followed by a description of the system constraints. Finally, the Mixed-Integer Nonlinear Programming structure of the resulting optimization problem is discussed.

3.1. Model Predictive Control Principle

Model Predictive Control is an optimization-based control strategy that utilizes a system model to predict future states over a finite horizon and determines optimal control inputs by minimizing a cost function subject to constraints [33].

At each time step k, the MPC solves an optimization problem over a prediction horizon N. The system state at the next time step is predicted based on the current state x(k) and control input u(k) using a reduced-order model as shown in Equation (1):

x (k + 1) = f (x (k), u (k))

(1)

Equation (2) expresses the optimization objective to minimize a cost function J_N over the horizon:

J_{N} (x, u) = \sum_{k = 0}^{N - 1} l (x_{u} (k), u (k)) + V_{f} ((x (N))

(2)

where l(x,u) represents the stage cost at each time step and V_f(x(N)) denotes the terminal cost. After solving the optimization problem, only the first control input is applied, and the procedure is repeated at the next time step in a receding horizon fashion [34].

In this study, the MPC framework is applied to optimize the torque distribution among the internal combustion engine (ICE) and the two electric machines (EM1 and EM2) of the P1P3 parallel hybrid powertrain. The multi-objective cost function addresses fuel consumption and noise emissions while maintaining a balanced battery state of charge over the driving cycle (see Section 3.2). The gear shifting strategy is not included within the MPC optimization itself but operates through a rule-based approach. However, the shifting behavior is accounted for within the predictor model that serves as the reduced-order representation for the MPC, ensuring that anticipated gear changes are reflected in the state predictions.

3.2. Multi-Objective Cost Function

The energy management problem at hand involves multiple, partially conflicting objectives. In this study, the cost function addresses three optimization targets: battery state of charge management, ICE efficiency, and noise emissions [35]. These objectives are combined using the weighted sum method according to Equation (3):

J = \frac{1}{N} \cdot \sum_{k = 0}^{N - 1} (w_{η} \cdot J_{η, I C E, k} + w_{N o i s e} \cdot J_{N o i s e, k}) + w_{S o C} \cdot J_{S o C, N}

(3)

The weighted sum method is selected as the scalarization approach, consistent with established practice in MPC-based hybrid vehicle energy management. This choice is motivated primarily by the requirement for a fair solver comparison. The weighted sum formulation produces a single scalar cost function that is treated identically by both optimization approaches, ensuring that differences in the results are attributable to the solver mechanisms rather than to differences in how each algorithm handles the problem formulation. Alternative approaches such as the ε-constraint method introduce additional inequality constraints on the secondary objectives, which GA and SQP handle through fundamentally different mechanisms (penalty functions versus Lagrangian relaxation), potentially confounding the comparison. Furthermore, the weighted sum formulation enables systematic weight tuning through the DoE methodology described in Section 4.4.

The weighting factors w_η, w_Noise, and w_SoC determine the relative importance of each objective, while N denotes the prediction horizon length and k the discrete time step index. The costs J_η,ICE,k and J_Noise,k are evaluated at each time step, whereas the terminal cost J_SoC,N only penalizes deviations from the reference state of charge SoC_Ref at the end of the horizon according to Equation (4):

J_{S o C} = (S o C - S o C_{R e f} - S o C_{v} (v))

(4)

The speed-dependent correction term SoC_v accounts for recuperation potential at high vehicle speeds v and is defined by Equation (5):

S o C_{v} (v) = - (5120 \cdot 10^{- 5} \cdot v^{2} - 5653 \cdot 10^{- 5} \cdot v + 1277 \cdot 10^{- 4})

(5)

This correction permits a temporary reduction in the target state of charge when subsequent braking phases offer significant energy recovery opportunities.

The ICE efficiency cost J_η,ICE penalizes operation at low-efficiency points. The current efficiency η_ICE is normalized between the best and worst achievable values η_ICE,best and η_ICE,worst according to Equation (6):

J_{η, I C E} = \frac{η_{I C E, b e s t} - η_{I C E}}{η_{I C E, b e s t} - η_{I C E, w o r s t}}

(6)

The acoustic passenger comfort is assessed using an approach following [36]. The masking noise generated by rolling and aerodynamic drag is compared with the ICE noise at the driver’s headrest position. To obtain a characteristic evaluation measure for the noise emissions over the entire driving cycle, the logarithmic sound intensity levels L_Veh and L_ICE, with the reference sound intensity I₀ = 10−12 W/m², are first converted into intensities according to Equation (7):

I_{i} = I_{0} \cdot 10^{\frac{L_{i}}{10}}

(7)

From this, the integral of the deviation between I_ICE and I_Veh is calculated according to Equation (8), if (I_ICE − I_Veh) > 0. The ratio N_ICE−Veh in J/m² considers both the duration and the intensity of the masking noise overshoot:

N_{I C E - V e h} = \int_{0}^{t} (I_{I C E} - I_{V e h}) d t

(8)

3.3. System Constraints

The optimization problem is subject to various constraints that arise from physical limitations and operational boundaries of the powertrain components. Safe battery operation and longevity require the state of charge SoC to remain within prescribed limits SoC_min = 0.2 and SoC_max = 0.8, as expressed by Equation (9):

{S o C}_{m i n} \leq S o C \leq {S o C}_{m a x}

(9)

The torque outputs of the powertrain actuators are constrained by their respective operating envelopes, which depend on the current rotational speed ω. For the internal combustion engine, Equation (10) bounds the torque T_ICE between zero and speed-dependent maximum:

{0 \leq T}_{V K M} \leq T_{V K M, m a x}

(10)

Analogous limitations apply to the electric machines EM1 and EM2, with their torques T_EM1 and T_EM2 restricted according to Equations (11) and (12):

{c_{E M 2, D r} \cdot T}_{E M 2, m i n} (n_{E M 2}) \leq T_{E M 2} \leq c_{E M 2, D r} {\cdot T}_{E M 2, m a x} (n_{E M 2})

(11)

{c_{E M 1, D r} \cdot T}_{E M 1, m i n} (n_{E M 1}) \leq T_{E M 1} \leq c_{E M 1, D r} {\cdot T}_{E M 1, m a x} (n_{E M 1})

(12)

During sustained high-load operation, thermal derating effects may further reduce the admissible torque range of the electric machines where c_i are the state dependent derating factors. The combined electrical power of both machines P_Mot must also remain within the battery discharge and charge limits P_Bat,min and P_Bat,max, as specified in Equation (13):

P_{B a t, m i n} (T_{B a t}, S o C) \leq P_{M o t} (T_{E M 2}, n_{E M 2}, T_{E M 1}, n_{E M 1}) \leq P_{B a t, m a x} (T_{B a t}, S o C)

(13)

Finally, the torque contributions from all actuators must satisfy the driver’s traction demand T_demand at the wheel. Equation (14) formulates this balance considering the respective transmission ratios i_ICE, i_EM1, and i_EM2:

T_{E M 2} \cdot i_{E M 2} + T_{E M 1} \cdot i_{E M 1} + T_{I C E} \cdot i_{I C E} = T_{D e m a n d}

(14)

3.4. Mixed-Integer Problem Structure

The energy management optimization constitutes a Mixed-Integer Nonlinear Programming (MINLP) problem due to the combination of continuous and discrete decision variable. Continuous variables include the torque distribution among ICE, EM1, and EM2. These can take any value within their respective bounds. Discrete variables arise from the operating mode selection: engine on/off state and clutch engagement. These binary decisions determine whether the powertrain operates in electric, serial, or parallel mode. The combination of discrete mode selection with continuous torque optimization creates a non-convex solution space. This structure has significant implications for the choice of optimization algorithm, as discussed in the following section.

An alternative approach to handling MINLP problems is the conversion to Mixed-Integer Linear Programming (MILP) formulations through least-squares approximation of the nonlinear terms [37]. While this technique has been applied successfully in domains such as power system dispatch, the powertrain energy management problem considered here involves multiple coupled component efficiency maps with pronounced nonlinear characteristics across the operating range. Representing these maps with sufficient accuracy in a piecewise-linear formulation would require a considerable number of segments, increasing the problem dimension. For the present application, the selected optimization approaches handle the nonlinearities directly, which was considered more practical given the existing component models and the real-time requirements of the MPC framework.

4. Optimization Algorithms

This section describes the two optimization algorithms employed to solve the MPC optimization problem: Genetic Algorithm (GA) and Sequential Quadratic Programming (SQP). These algorithms represent fundamentally different approaches to optimization. GA belongs to the class of metaheuristic, derivative-free methods that explore the solution space through population-based stochastic search, offering flexibility in handling arbitrary problem structures including mixed-integer formulations and non-differentiable models [38,39]. SQP, in contrast, is a gradient-based method that iteratively solves quadratic subproblems, providing computational efficiency but requiring differentiable system models and problem reformulation when discrete variables are present [40]. Following the algorithm descriptions, the reduced-order models and the Design of Experiments approach for cost function weight tuning are presented.

4.1. Genetic Algorithm

GA operates on a population of candidate solutions representing different combinations of operating modes and torque distributions over the prediction horizon, iteratively improving them through selection, crossover, and mutation operators.

4.1.1. Algorithm Structure

The GA implementation follows the standard evolutionary cycle illustrated in Figure 3. The search process begins with the initialization of a population, where each individual encodes a potential solution to the energy management problem. In the context of this study, an individual’s chromosome contains the torque values for ICE, EM1, and EM2 as well as the operating mode selection over the prediction horizon. Each individual is then evaluated using the cost function defined in Section 3.2, where lower cost corresponds to higher fitness. Based on these fitness values, individuals are selected for reproduction, with better-performing solutions having higher probability of contributing to the next generation. The selected individuals undergo crossover, combining genetic information from two parents, and mutation, introducing random variations to maintain exploration of the solution space. This evolutionary cycle repeats until a termination criterion is met, such as reaching a maximum number of generations or achieving convergence in the fitness function [41,42].

4.1.2. Implementation

The optimization is executed at each control time step with a sampling interval of 0.1 s, a receding horizon of five seconds and a horizon step size of 0.5 s. The prediction horizon length and the discretization step size within the horizon represent key design parameters of the MPC formulation. Their selection involves a trade-off between optimization performance and computational feasibility, as a longer horizon provides the optimizer with more information about future driving conditions, while finer discretization increases the resolution of the predicted trajectory. To quantify these effects, a parameter study is conducted with the GA-based MPC on the WLTC.

Figure 4 (left column) shows the influence of the prediction horizon length on system efficiency and real-time capability at a constant step size of 0.5 s. System efficiency increases with the horizon length, rising from 44.68% at one second to 45.14% at five seconds. Beyond five seconds, the efficiency gain diminishes, with only 0.06 percentage points improvement between five and 7.5 s. The computational effort, however, increases progressively with the horizon length. At five seconds (RTC = 0.646), the simulation remains well within real-time capability. At 7.5 s, the RTC rises to 2.735, exceeding the real-time threshold and rendering this configuration unsuitable for online implementation.

Figure 4 (right column) shows the influence of the step size on system efficiency and corrected fuel consumption at a constant horizon length of five seconds. Reducing the step size from 0.5 to 0.25 s improves fuel consumption from 3.810 to 3.780 L/100 km and increases system efficiency from 45.14% to 45.34%. However, halving the step size doubles the number of decision variables per horizon, increasing computational effort accordingly. Conversely, increasing the step size to 1.25 s reduces system efficiency to 43.04% and increases fuel consumption to 3.860 L/100 km, effectively eliminating the benefit of the MPC over the deterministic reference.

Based on these results, a prediction horizon of five seconds discretized into ten steps of 0.5 s is selected. This configuration achieves system efficiency within 0.2 percentage points of the finest discretization tested while maintaining real-time capability with sufficient margin. The same horizon length and step size are applied to the SQP-based implementation to ensure a fair comparison.

The GA directly handles the mixed-integer nature of the problem by encoding both continuous variables (torques) and discrete variables (operating mode). An upstream predictor module receives the current driver torque demand T_driver, future vehicle speed v_veh, and current gear state, and predicts the power demand as well as the gear trajectory with these values over the prediction horizon. Based on the predicted states, the constraint module determines the feasible operating region considering component limits and battery power boundaries. The optimizer then searches for the optimal control sequence within this feasible region.

The GA operates with a population size of 36 individuals. A mutation rate of 16.67% is applied to maintain diversity in the search space. The algorithm terminates when either a maximum of 200 generations is reached or the relative improvement in the best fitness value remains below 1 × 10⁻⁶ for 20 consecutive generations.

4.2. Sequential Quadratic Programming

As an alternative to the metaheuristic GA approach, Sequential Quadratic Programming is implemented to solve the MPC optimization problem. SQP is a gradient-based iterative method that approximates the nonlinear optimization problem as a sequence of quadratic subproblems [40].

4.2.1. Mode-Specific Optimization

Since the energy management problem constitutes a Mixed-Integer Problem combining continuous torque variables with discrete mode decisions, it cannot be directly solved by gradient-based methods. Therefore, the optimization is structured as three separate mode-specific subproblems that are solved in parallel, as illustrated in Figure 5:

Each subproblem represents one of the three operating modes: electric mode (clutch open, ICE off), serial mode (clutch open, ICE on), and parallel mode (clutch closed, ICE on). The three optimization problems are solved simultaneously, and the mode yielding the lowest cost is selected for implementation. This parallel evaluation approach transforms the original MINLP into multiple continuous NLPs that can be efficiently solved using gradient-based methods.

4.2.2. Optimal Control Problem Formulation

Each mode-specific optimization is formulated as an Optimal Control Problem (OCP). The objective is to find the optimal control trajectory that minimizes a cost functional while satisfying system dynamics and constraints. Following the formulation in [43], the cost functional consists of two components: the Lagrange cost l(x, z, u), which accumulates path costs over the prediction horizon, and the Mayer cost M(x(T)), which penalizes the terminal state. The state vector x contains the battery state of charge, z represents algebraic variables, u denotes the control input vector comprising the component torques, and T specifies the prediction horizon length. The system dynamics are described by the function f, while inequality constraints are captured by g. The resulting OCP is formulated according to Equations (15)–(18).

\underset{x (\cdot), z (\cdot), u (\cdot),}{minimize} \int_{0}^{T} l (x (t), z (t), u (t)) + M (x (T))

(15)

s . t . x (0) = {\bar{x}}_{0}

(16)

0 = f (\dot{x} (t), x (t), z (t), u (t)) t \in [0, T]

(17)

0 \geq g (x (t), z (t), u (t)) t \in [0, T]

(18)

For numerical solution, the continuous OCP must be transcribed into a finite-dimensional optimization problem. This is achieved using the multiple shooting method [44], which discretizes the horizon into N time steps and treats the state and control values at each discretization point as optimization variables. The discretized state transition from time step k to k + 1 is represented by the function Φ_k. Applying this discretization yields the Nonlinear Programming problem given by Equations (19)–(22).

\underset{\begin{matrix} x_{0}, \dots x_{N} \\ z_{0}, \dots z_{N} \\ u_{0}, \dots u_{N} \end{matrix}}{minimize} \sum_{k = 0}^{N - 1} (t_{k + 1} - t_{k}) \cdot l (x_{k}, z_{k}, u_{k}) + M (x_{k})

(19)

{s . t . x}_{0} = {\bar{x}}_{0}

(20)

[\begin{matrix} x_{k + 1} \\ z_{k} \end{matrix}] = ϕ_{k} (x_{k}, u_{k}), k = 0, \dots, N - 1

(21)

0 \geq g_{k} (x_{k}, z_{k}, u_{k}), k = 0, \dots, N - 1

(22)

4.2.3. Control Variables per Mode

The control vector formulation differs for each operating mode due to the mechanical coupling constraints imposed by the powertrain architecture. These constraints determine which variables can be independently controlled and which are defined by kinematic relationships.

In parallel mode, the clutch is closed and the ICE is mechanically coupled to the drivetrain. Equation (23) defines the control vector u_parallel, which comprises the torques of EM2 (T_EM2) and ICE (T_ICE).

u_{p a r a l l e l} = [T_{E M 2}, T_{I C E}]

(23)

In parallel mode, EM1 is not an independent control variable. Due to the mechanical coupling between EM1 and the ICE through the belt drive, the EM1 torque is determined by the ICE torque according to Equation (26), reducing the independent control inputs to T_EM2 and T_ICE.

Due to the mechanical coupling through the closed clutch, the ICE speed n_ICE is kinematically constrained. Equation (24) describes how n_ICE is determined by the wheel speed n_wheel, the differential ratio i_diff, and the current gear ratio i_gear.

n_{I C E} = n_{W h e e l} \cdot i_{D i f f} \cdot i_{G e a r}

(24)

In serial mode, the clutch is open, decoupling the ICE from the drivetrain. This removes the kinematic constraint on ICE speed, introducing it as an additional degree of freedom. The control vector u_serial is therefore extended to include the ICE speed n_ICE, as shown in Equation (25).

u_{s e r i a l} = [T_{E M 2}, T_{I C E}, n_{I C E}]

(25)

The EM1 torque T_EM1 remains kinematically coupled to the ICE through the EM1 gear ratio i_EM1, and is therefore not an independent control variable. Equation (26) describes the relationship between ICE torque and EM1 torque.

T_{I C E} = - i_{E M 1} \cdot T_{E M 1}

(26)

In electric mode, the ICE is deactivated and only EM2 provides traction torque. The torque delivered at the wheels T_wheel depends on the EM2 torque T_EM2, the EM2 gear ratio i_EM2, and the differential ratio i_diff, as given by Equation (27).

T_{W h e e l} = T_{E M 2} \cdot i_{D i f f} \cdot i_{E M 2}

(27)

4.2.4. Implementation with Acados

The SQP-based MPC is implemented using the acados v0.4.3 software framework [43]. Acados provides efficient solvers for optimal control problems and enables automatic C-code generation for the solution algorithm. The OCP is defined in MATLAB using CasADi for symbolic expressions [45], and acados generates the corresponding solver as an S-Function block that can be directly integrated in Simulink.

The SQP solver is initialized at each MPC time step using a warm-start strategy, where the solution from the previous time step serves as the initial guess for the current optimization. This approach exploits the temporal similarity between consecutive MPC problems, as the driving conditions and system states typically change only incrementally between time steps. Warm-starting reduces the number of SQP iterations required for convergence and mitigates the risk of the solver converging to unfavorable local optima. However, since the optimization is decomposed into three mode-specific subproblems that are solved independently (Section 4.2.1), the warm-start applies within each mode but does not provide information transfer across modes. The mode selection itself remains a greedy decision at each time step, which represents the primary structural limitation of the SQP approach compared to the GA’s integrated search over mode and torque simultaneously.

The SQP solver is configured with the default settings provided by the acados framework. The NLP solver tolerances for stationarity, equality, inequality, and complementarity residuals are set to 1 × 10⁻⁶, with a maximum of 200 SQP iterations. The quadratic subproblems are solved using HPIPM with a maximum of 100 QP iterations.

4.3. Reduced-Order Models

Both optimization approaches require reduced-order models (ROMs) to predict system behavior over the horizon. However, the model requirements differ between GA and SQP due to their different solution mechanisms.

4.3.1. Model Requirements

GA can utilize arbitrary model structures including lookup tables, piecewise functions, and other non-differentiable representations. The only requirement is that the model can be evaluated for a given set of inputs. This flexibility allows the use of measurement-based efficiency maps and empirical correlations also used in the plant model directly without mathematical reformulation.

SQP requires models that are continuously differentiable to enable gradient computation. Lookup tables and discontinuous functions must be approximated by smooth analytical expressions or polynomial fits. This constraint limits modeling flexibility and accuracy but enables efficient gradient-based search.

4.3.2. Implemented Models

While some models can be implemented identically for GA and SQP, others require different formulations due to the differentiability requirements of gradient-based optimization. Models based on analytical expressions, such as the battery model, are used in identical form for both approaches. However, models that rely on lookup tables in their original form, such as the ICE efficiency and transmission efficiency models, must be reformulated for the SQP implementation.

The battery state of charge change ΔSoC over a time step Δt is calculated from the battery current I_Bat according to Equation (28).

∆ S o C = I_{B a t} \cdot ∆ t

(28)

The battery current is derived from a power balance considering the open-circuit voltage U_Bat, the internal resistance R_Bat, and the electrical power demand P_electrical from the electric machines and auxiliary consumers. This relationship is given by Equation (29).

I_{B a t} = \frac{U_{B a t} - \sqrt{U_{B a t}^{2} - 4 {\cdot R}_{B a t} \cdot P_{E l e c t r i c a l}}}{2 \cdot R_{B a t}}

(29)

The open-circuit voltage and internal resistance are functions of the current SoC and cell temperature, obtained from characteristic maps. Since these parameters vary slowly compared to the optimization time scale and limited horizon length, they are treated as constants over the prediction horizon.

The ICE efficiency η_ICE characterizes the conversion of fuel energy to mechanical work. It is determined from the brake-specific fuel consumption (BSFC) as a function of engine speed n_ICE and torque T_ICE. Equation (30) describes the relationship between ICE efficiency, indicated engine power P_ICE, fuel mass flow rate ṁ_fuel, and the lower heating value of the fuel H_u.

η_{I C E} = \frac{P_{I C E, i}}{{\dot{m}}_{b} (n_{I C E}, T_{I C E}) \cdot H_{u}}

(30)

For the GA implementation, the BSFC is obtained directly from a lookup table. For the SQP implementation, this characteristic map is approximated by a smooth polynomial function.

The transmission efficiency η_gear accounts for losses in the gearbox and depends on multiple operating parameters. Equation (31) expresses the transmission efficiency as a function of oil temperature T_oil, input speed n_gear,in, current gear ratio i_gear, and transmitted torque T_wheel.

η_{G e a r} = f (T_{O i l}, n_{G e a r, i n}, i_{G e a r}, T_{W h e e l})

(31)

Like the ICE efficiency model, the GA implementation uses the original lookup table representation, while the SQP implementation employs a smooth analytical approximation.

To quantify the approximation error introduced by the polynomial fits in the SQP implementation, each fitted function is validated against the original lookup table at all grid points. Table 2 summarizes the results. The fuel mass flow rate model, which directly determines BSFC and thus fuel consumption, achieves R² = 0.9999 with a maximum deviation of 2.002%. The engine noise model achieves R² = 0.9946 (RMSE 0.8631%) and the masking noise model R² = 0.9990 (RMSE 0.3501%). These results confirm that the polynomial approximations faithfully represent the original characteristic maps.

For the present set of reduced-order models, the polynomial approximations achieve sufficient accuracy to ensure a fair comparison. However, as the model complexity increases, for instance through the inclusion of transient emission models with additional state dependencies, maintaining comparable approximation quality for the SQP formulation becomes increasingly challenging.

4.4. Cost Function Weight Tuning

The performance of the MPC strongly depends on the weighting factors in the cost function.

In multi-objective optimization, varying these weights generates a set of non-dominated solutions known as the Pareto front, where no solution can improve one objective without degrading another [46,47]. This trade-off characteristic is inherent to the conflicting objectives of fuel efficiency and noise reduction, while maintaining SoC.

To determine weights, a Design of Experiments (DoE) approach is employed. The DoE systematically varies the three weighting factors w_SoC, w_η,ICE, and w_Noise using a space-filling design while evaluating the resulting performance metrics. The WLTC serves as the design cycle for this optimization. Each weight combination is simulated in co-simulation, and the resulting fuel consumption, end-of-cycle SoC, and average ICE efficiency are recorded.

The DoE comprises 230 design points and 17 validation points, covering the full weight space from 0 to 1 for each objective. A third-order polynomial response surface model fitted to the design points achieves R² of 0.92 for normalized fuel consumption and R² of 0.89 for end-of-cycle SoC, confirming that the model captures the principal trends in the design space. The validation points yield comparable accuracy (R² = 0.89 for fuel consumption, R² of 0.87 for SoC), indicating that the response surface does not overfit the design data.

The relationship between weights and performance metrics is captured using a response surface model, which enables identification of weight combinations that achieve the desired optimization targets.

Figure 6 presents the response surfaces for fuel consumption (left) and end-of-cycle SoC (right) as functions of w_SoC and w_η,ICE with w_Noise held at the selected value of 0.21. The individual DoE simulation points are overlaid as markers. Correlation analysis across all 247 DoE points reveals that the two primary KPIs respond to different weights. Fuel consumption correlates positively with w_SoC and negatively with w_η,ICE. End-of-cycle SoC, in contrast, correlates with w_SoC but is practically independent of w_η,ICE. This decoupled sensitivity structure is visible in Figure 6, where the SoC contour lines run nearly vertically across the entire w_η,ICE range.

The primary objective is defined as minimizing fuel consumption while maintaining a balanced end-of-cycle SoC near the reference value of 0.5. Based on the DoE analysis, the optimal weighting factors are determined as shown in Table 3. ICE efficiency receives the highest weight, reflecting its dominant influence on fuel consumption, while the SoC weight is set to the minimum value that maintains charge-sustaining operation on the design cycle. The noise weight contributes to improved acoustic behavior. Correlation analysis across all DoE points reveals that the two primary KPIs respond to different weights. Fuel consumption is governed jointly by w_SoC and w_η,ICE, whereas end-of-cycle SoC depends almost exclusively on w_SoC and is practically independent of w_η,ICE. This decoupled sensitivity structure is visible in Figure 6, where the SoC contour lines run nearly vertically across the entire w_η,ICE range. Table 3 illustrates this behavior through representative weight variations that isolate the effect of individual weights. Reducing w_SoC causes battery depletion regardless of w_η,ICE, while increasing it ensures charge-sustaining operation at the cost of higher fuel consumption. Varying w_η,ICE affects fuel consumption but leaves end-of-cycle SoC nearly unchanged. The high ratio of w_η,ICE to w_SoC in the selected weight set therefore does not compromise SoC management, as the two weights govern independent KPIs. This weight set is applied to both the GA-based and SQP-based MPC implementations to ensure a fair comparison under identical tuning conditions.

The selected weight set thus represents a Pareto-efficient compromise in which w_SoC is set to the minimum value that maintains charge-sustaining operation, allowing w_η,ICE to maximize its fuel-consumption-reducing effect. The ratio w_η,ICE/w_SoC does not compromise SoC management because the two weights govern independent KPIs. During sustained highway driving, the parallel operating mode inherently involves load-point shifting, which charges the battery as a byproduct of ICE efficiency optimization, providing an additional mechanism that supports charge-sustaining operation.

The present study applies a single weight set calibrated on the WLTC to all driving scenarios. Adaptive weight strategies, including multi-cycle calibration and online recalibration during vehicle operation, represent a promising extension to improve robustness across diverse driving conditions and will be addressed in future work.

5. Results

This section presents the comparative evaluation of the GA-based and SQP-based MPC implementations. The evaluation encompasses two driving cycles: the WLTC as the design cycle, and a RDE scenario representing mixed urban, rural and highway driving conditions.

The WLTC serves a dual purpose in this study. First, it provides the basis for cost function weight calibration through the DoE process described in Section 4.4. Second, it represents the primary benchmark for comparing both optimization approaches under conditions for which they were explicitly tuned.

The RDE cycle, in contrast, serves as validation scenario to assess the generalization capability of both approaches. Importantly, the weight set optimized for the WLTC is applied without modification to the RDE simulations. This deliberate choice tests whether the calibrated strategies transfer effectively to driving conditions that differ significantly from the design cycle. The RDE cycle features acceleration and deceleration with low average speeds in the beginning followed by sustained high-speed driving in the second and last third of the cycle. Figure 7 shows the velocity profiles of the two cycles.

Following an introduction of the performance metrics and the fuel consumption correction methodology, the results for each cycle are discussed, concluding with an analysis of computational performance. In this study perfect prediction of future driving conditions are assumed. These assumptions ensure identical boundary conditions for both solvers, which are necessary to isolate the effect of the optimization algorithm. Evaluating robustness to prediction uncertainty, driver variability, and component aging is addressed in ongoing research.

5.1. Performance Metrics and Fuel Consumption Correction

The comparison between the two optimization approaches is based on four key performance indicators (KPIs): corrected fuel consumption, average system efficiency, noise emissions, and computational performance.

5.1.1. Fuel Consumption Correction

Comparing fuel consumption between different energy management strategies in hybrid vehicles requires careful consideration of the battery state of charge at the end of the cycle. If one strategy depletes the battery more than another, its raw fuel consumption appears lower, but this comparison would be misleading since the electrical energy was effectively borrowed from the battery.

To ensure fair comparison, the fuel consumption is corrected for the net energy change (NEC) of the high-voltage battery [48]. The NEC correction factor f_NEC quantifies the ratio of electrical energy change to chemical fuel energy consumed over the cycle. It is calculated from the battery voltage U_Bat, battery current I_Bat, fuel mass flow rate ṁ_fuel, and the lower heating value of the fuel H_u. The integration is performed over the total cycle duration t_Total. Equation (32) defines the NEC correction factor.

f_{N E C} = \frac{\int_{0}^{t_{T o t a l}} (U_{B a t} \cdot I_{B a t}) d t}{\int_{0}^{t_{T o t a l}} ({\dot{m}}_{F u e l} \cdot H_{u, F u e l}) d t}

(32)

The raw fuel consumption f_FuelCon is expressed in liters per 100 km and is calculated from the fuel volume flow rate V_dot,fuel and the total distance traveled s_Total according to Equation (33).

F C = 100 \cdot \frac{\int_{0}^{t_{T o t a l}} {\dot{V}}_{F u e l} d t}{s_{T o t a l}}

(33)

The corrected fuel consumption f_FuelCon,corr is then obtained by applying the NEC correction factor to the raw consumption, as given by Equation (34).

{F C}_{N E D} = F C \cdot (1 - f_{N E C})

(34)

This correction effectively penalizes strategies that deplete the battery (f_NEC < 0) and credits strategies that charge it (f_NEC > 0). For meaningful comparison, the end-of-cycle SoC deviation should remain within ±3% of the reference value. Larger deviations reduce the reliability of the correction.

5.1.2. System Efficiency

The average system efficiency η_sys represents the time-averaged efficiency of the complete powertrain. The calculation depends on the power flow direction, distinguishing between traction with battery discharging, traction with battery charging, and regenerative braking. In each case, the efficiency is defined as the ratio of useful power output to total power input.

During traction with simultaneous battery discharging (P_Wheel > 0 and P_Bat > 0), the wheel power P_Wheel represents the useful output, while the sum of battery power P_Bat and fuel power (ṁ_fuel · H_u) constitutes the input. Equation (35) defines the system efficiency for this operating condition.

η_{S y s} = \frac{P_{W h e e l}}{|P_{B a t}| + {\dot{m}}_{F u e l} \cdot H_{u}} P_{W h e e l} > 0 \land P_{B a t} > 0

(35)

During traction with simultaneous battery charging (P_Wheel > 0 and P_Bat ≤ 0), both the wheel power and the absolute battery charging power represent useful outputs, with fuel power as the sole input. Equation (36) applies to this case.

η_{S y s} = \frac{P_{W h e e l} + |P_{B a t}|}{{\dot{m}}_{F u e l} \cdot H_{u}} P_{W h e e l} > 0 \land P_{B a t} \leq 0

(36)

During regenerative braking (P_Wheel < 0 and P_Bat ≤ 0), the recovered battery power represents the useful output, while the sum of braking power P_Wheel and any fuel power constitutes the input. Equation (37) describes this operating condition.

η_{S y s} = \frac{|P_{B a t}|}{P_{W h e e l} + {\dot{m}}_{F u e l} \cdot H_{u}} P_{W h e e l} < 0 \land P_{B a t} \leq 0

(37)

The system efficiency accounts for all energy conversion losses including the ICE, electric machines, transmission, and battery. This metric captures the overall energy utilization effectiveness and is particularly sensitive to the operating mode distribution, as electric operation typically achieves higher system efficiency than combustion-based modes.

5.1.3. Computational Performance Metric

The real-time capability (RTC) quantifies the computational burden of each algorithm. It is defined as the ratio of the time required to simulate the cycle t_Sim to the actual cycle duration t_Cycle, as given by Equation (38).

R T C = \frac{t_{S i m}}{t_{C y c l e}}

(38)

An RTC value below 1 indicates real-time capability, meaning the algorithm can compute control actions faster than required for online implementation.

5.1.4. Noise Metric

The cumulative noise metric ΔE_Noise quantifies the excess masking noise overshoot integrated over the driving cycle. It is derived from the difference between the ICE interior noise intensity and the vehicle rolling and wind noise intensity at each time step, accumulated only when the ICE noise exceeds the masking threshold. The metric is expressed in J/m² and captures both the duration and intensity of acoustically perceptible engine noise events. A detailed derivation of this metric, including the underlying transfer functions and the psychoacoustic masking model, is provided in [36].

5.2. Design Cycle Results (WLTC)

The WLTC serves as the design cycle for both optimization approaches, with simulations conducted at an ambient temperature of 23 °C according to regulatory specifications [34]. Since both algorithms were calibrated using the same DoE-optimized weight set on this cycle, the WLTC results primarily reveal differences in the optimization algorithms’ ability to find optimal solutions given identical tuning. For comparison, a state-of-the-art deterministic EMS is included, which was calibrated using a DoE approach with the primary objective of minimizing fuel consumption without explicit consideration of noise emissions [49,50,51].

Figure 8 summarizes the KPIs for both MPC approaches and the deterministic reference strategy. Regarding corrected fuel consumption, the GA-based MPC achieves 3.82 L/100 km, representing a reduction of 1.0% compared to the deterministic reference (3.86 L/100 km). The SQP-based MPC reaches 3.84 L/100 km, corresponding to a reduction of 0.5% relative to the deterministic strategy. The system efficiency results demonstrate the effectiveness of both MPC approaches in improving overall powertrain energy utilization. The GA-based MPC achieves an average system efficiency of 45.14%, which corresponds to a relative improvement of 4.6% compared to the deterministic reference (43.14%). The SQP-based approach reaches 44.73%, representing a 3.7% improvement over the deterministic strategy. Both MPC implementations thus achieve notably higher system efficiency than the fuel-optimized reference. The most pronounced differences between the strategies appear in the noise emission results. The deterministic reference, optimized primarily for fuel consumption, produces cumulative noise emissions of 23.0 µJ/m². In contrast, the GA-based MPC reduces noise emissions to 0.12 µJ/m², corresponding to a reduction of 99.5%. The SQP-based MPC achieves 0.20 µJ/m², representing a reduction of 99.1%. These substantial reductions demonstrate that the multi-objective cost function effectively guides both algorithms toward ICE operating points where engine noise remains below the masking threshold, a capability not present in the fuel-focused deterministic strategy.

It should be noted that the deterministic reference strategy was calibrated with the primary objective of minimizing fuel consumption and does not include noise emissions in its optimization targets. The comparison therefore quantifies the noise reduction achieved by including acoustic behavior in the MPC cost function, rather than the noise reduction attributable to the predictive nature of the MPC alone. A noise-optimized deterministic strategy would provide an additional benchmark to isolate the contribution of the MPC framework from the effect of multi-objective formulation. However, since the primary focus of this study is the comparison between GA and SQP under identical cost function formulations, and a noise-optimized deterministic strategy does not represent current state of the art in powertrain concept development, such a benchmark is not considered here.

The operating behavior underlying these performance differences is illustrated in Figure 9, which shows the fuel share and energy share diagrams for both MPC approaches. Both algorithms concentrate ICE operation in the high-efficiency region of the BSFC map, with operating points clustered along the optimal efficiency line. The EM2 energy share diagrams in the lower portion of Figure 9 reveal similar patterns for both approaches, with regenerative braking energy concentrated in the high-efficiency region of the electric machine map.

Table 4 quantifies the operating mode distribution. The deterministic strategy operates 42.9% in parallel mode, 11.5% in serial mode, and 45.6% in electric mode. Both MPC approaches show a distinctly different pattern: the GA-based MPC operates 48.0% in parallel mode, only 0.9% in serial mode, and 51.1% in electric mode. Similarly, the SQP-based MPC achieves 47.8% parallel, 1.3% serial, and 50.8% electric operation. The near-complete avoidance of serial mode by both MPC approaches (reduction of 92% for GA and 89% for SQP compared to the deterministic strategy) is consistent with the EM1 power limitation discussed in Section 2.2, which restricts serial operation to low-load conditions where it offers limited efficiency benefits.

The similarity in mode distributions between GA and SQP on the design cycle is notable. Both approaches select parallel mode within 0.2 percentage points of each other, and electric mode within 0.3 percentage points. This convergence shows that for the WLTC, both optimization algorithms identify similar optimal mode sequences despite their fundamentally different search mechanisms.

To assess the stochastic variability of the GA-based approach, three independent simulation runs with different random seeds are conducted on the WLTC. Table 5 reports the mean and standard deviation of the resulting KPIs. The corrected fuel consumption varies by less than 0.3% across runs (coefficient of variation 0.14%), and end-of-cycle SoC remains within 0.08 percentage points of the mean. These results indicate that the GA-based MPC produces repeatable solutions despite the stochastic nature of the evolutionary search. This behavior is consistent with the MPC framework, where the GA is re-initialized at each control time step with a sampling interval of 0.1 s, effectively limiting the accumulation of stochastic variation over the cycle.

5.3. Real-World Driving Results (RDE)

The RDE cycle represents real-world driving conditions that differ from the standardized WLTC. As described in Section 5, the cost function weights remain unchanged from the WLTC calibration, making this scenario a test of generalization capability rather than cycle-specific optimization. The RDE cycle features urban driving with frequent acceleration and deceleration at low speeds in the initial phase, followed by sustained high-speed highway driving in the latter portions, as shown in Figure 7. Figure 10 presents the performance comparison for the RDE cycle. The corrected fuel consumption results show that the GA-based MPC achieves 3.674 L/100 km, representing a reduction of 0.7% compared to the deterministic reference (3.700 L/100 km). The SQP-based MPC reaches 3.699 L/100 km, corresponding to a minimal reduction of less than 0.1% relative to the deterministic strategy. The performance gap between GA and SQP widens compared to the WLTC; while both approaches achieved similar fuel consumption on the design cycle (0.5% difference), the RDE cycle reveals a difference of 0.7% between GA and SQP. This near-parity results from SQP operating with a less charge-sustaining SoC trajectory compared to the deterministic strategy. While SQP achieves lower raw fuel consumption, the NEC correction largely offsets this advantage. The efficiency metrics in Figure 10 reveal a notable trade-off between ICE efficiency and system efficiency. The SQP-based MPC achieves higher ICE efficiency (35.42%) compared to GA (35.37%), yet the GA-based approach attains higher system efficiency (35.88% vs. 33.55% for SQP). This apparent contradiction, higher ICE efficiency but lower system efficiency for SQP, can be explained by examining the operating behavior in Figure 11. Figure 11 shows the fuel share and energy share diagrams for the RDE cycle. The upper row displays the ICE operating points, while the lower row shows the EM2 energy distribution. Comparing the SQP approach (Figure 11b) with the GA approach (Figure 11a), the SQP concentrates ICE operation more tightly in the high-efficiency region of the BSFC map, explaining the higher average ICE efficiency observed in Figure 10.

However, the EM2 energy share diagrams in the lower portion of Figure 11 reveal the source of SQP’s lower system efficiency. The SQP approach operates EM2 in regions of lower electric machine efficiency compared to GA.

The GA-based approach (Figure 11a, bottom) shows EM2 operation more concentrated in the high-efficiency region (85–90% efficiency contours), while SQP (Figure 11b, bottom) exhibits broader distribution extending into lower-efficiency regions.

This trade-off demonstrates that optimizing individual component efficiency does not necessarily translate to optimal system efficiency. The SQP’s mode-specific optimization structure, which solves separate problems for each operating mode, appears to prioritize ICE efficiency without fully accounting for the resulting impact on electric machine operation. The GA’s integrated optimization, encoding both mode selection and torque distribution within the same chromosome, achieves better coordination between ICE and EM2 operation, resulting in higher overall system efficiency despite lower ICE efficiency.

Since the SQP solver employs warm-starting from the previous time step (Section 4.2.4), the observed performance gap is more likely attributable to the decomposed optimization structure than to convergence issues within individual subproblems.

The noise emission results on the RDE cycle show that the deterministic reference produces cumulative noise emissions of 548.42 µJ/m², considerably higher than the WLTC value (23.0 µJ/m²) due to the different speed and load profiles. The SQP-based MPC achieves the lowest noise emissions at 2.05 µJ/m², corresponding to a reduction of 99.6% compared to the deterministic reference. The GA-based MPC reduces noise emissions to 5.64 µJ/m², representing a reduction of 99.0%. While the absolute noise levels are higher than in the WLTC, the MPC approaches continue to demonstrate their effectiveness in avoiding acoustically unfavorable ICE operating points.

The widening performance gap between GA and SQP from the design cycle (WLTC) to the validation cycle (RDE) provides insight into the generalization characteristics of both approaches. On the WLTC, both algorithms achieved fuel consumption within 0.5% of each other. On the RDE cycle, this gap increases to 0.7%, with GA maintaining its advantage. Furthermore, the analysis of Figure 11 reveals that SQP’s decomposed optimization structure leads to suboptimal coordination between powertrain components. While achieving favorable ICE operating points, the approach compromises EM2 efficiency, ultimately resulting in lower system efficiency. The GA’s population-based global search, which considers the complete solution space simultaneously, achieves more balanced component utilization and thus better overall system performance on driving conditions outside the calibration domain. Given the high model fidelity of the SQP polynomial fits reported in Section 4.3, the observed performance differences between GA and SQP are primarily attributable to the optimization algorithms rather than to discrepancies in the underlying models.

5.4. Computational Performance

The SQP-based approach demonstrates approximately four times faster computation compared to the GA. Figure 12 compares the computational performance of both approaches, measured as the Real-Time Capability (RTC) during WLTC simulation.

This significant speed advantage stems from the efficient gradient-based search and the optimized C-code generated by acados. Despite this difference, both approaches remain real-time capable (RTC < 1), meaning either could be deployed for online energy management.

The computational advantage of SQP becomes particularly relevant for applications requiring extensive simulation studies, such as (DoE) calibration. The faster execution enables more comprehensive parameter studies within practical time constraints. However, for online vehicle implementation where control updates occur at 0.1 s intervals, both algorithms provide sufficient computational margin.

All simulations are conducted on a standard laptop computer equipped with an Intel Core i7 processor and 16 GB RAM. The MPC optimization is executed on a single core. The real-time capability reported in this study is evaluated in a Model-in-the-Loop environment. The RTC metric is well suited for comparing the computational performance of the two solver approaches within this MiL framework but does not allow direct conclusions regarding embedded deployment. Hardware-in-the-Loop testing and embedded benchmarking on representative ECU hardware represent necessary validation steps toward series production and are planned as a part of future work.

6. Conclusions

This study presented a systematic comparison between Genetic Algorithm and Sequential Quadratic Programming as optimization solvers within a Model Predictive Control framework for energy management of a P1P3 parallel hybrid powertrain. Both approaches were evaluated using the same cost function weights calibrated through Design of Experiments on the WLTC and subsequently applied to an RDE validation cycle without recalibration.

On the WLTC (design cycle), the GA-based MPC achieves a corrected fuel consumption of 3.82 L/100 km, representing a reduction of 1.0% compared to the deterministic reference strategy (3.86 L/100 km). The SQP-based MPC reaches 3.84 L/100 km, corresponding to a reduction of 0.5%. Both MPC approaches improve system efficiency relative to the deterministic reference, with GA achieving 45.14% (+4.6%) and SQP achieving 44.73% (+3.7%). The operating mode distributions are similar for both algorithms, with approximately 48% parallel mode, 51% electric mode, and less than 2% serial mode operation. Noise emissions are reduced by more than 99% for both MPC approaches compared to the deterministic reference, which was optimized for fuel consumption without explicit consideration of acoustic behavior. On the RDE validation cycle, the GA-based MPC achieves 3.674 L/100 km, a reduction of 0.7% compared to the deterministic reference (3.700 L/100 km), while SQP reaches 3.699 L/100 km with less than 0.1% reduction. The performance gap between GA and SQP widens from 0.5% on the WLTC to 0.7% on the RDE cycle, indicating differences in generalization behavior when operating outside the calibration domain. System efficiency improvements relative to the deterministic reference amount to 9.2% for GA (35.88%) and 2.1% for SQP (33.55%). The energy share analysis reveals that SQP achieves higher ICE efficiency (35.42% versus 35.37% for GA) but operates EM2 in lower-efficiency regions, while GA achieves more balanced component utilization across the powertrain. This trade-off demonstrates that optimizing individual component efficiency does not necessarily translate to optimal system efficiency. The SQP’s mode-specific optimization structure, which solves separate problems for each operating mode, appears to prioritize ICE efficiency without fully accounting for the resulting impact on electric machine operation. The GA’s integrated optimization, encoding both mode selection and torque distribution within the same chromosome, achieves better coordination between ICE and EM2 operation, resulting in higher overall system efficiency despite lower ICE efficiency. Noise emission reductions of 99.6% for SQP and 99.0% for GA are observed compared to the deterministic reference. Regarding computational performance, the SQP-based approach computes approximately 4.3 times faster than GA, with Real-Time Capability values of 0.151 versus 0.646, respectively. Both approaches remain real-time capable for online vehicle implementation. Beyond computational efficiency, the two approaches differ in their requirements for reduced-order models. SQP requires continuously differentiable formulations, while GA can directly incorporate non-differentiable model structures such as lookup tables. This flexibility in model integration represents a structural advantage for applications where high-fidelity component models are available but may not be readily reformulated into differentiable expressions.

For concept development applications where solution quality and model flexibility take precedence over computational efficiency, the GA-based approach represents the favorable choice based on the results of this study. For applications requiring extensive parameter studies or embedded implementation with minimal computational resources, the SQP-based approach offers advantages through faster execution and automatic code generation. Both optimization approaches provide viable solutions for MPC-based energy management, with the selection depending on the specific requirements of the application context.

Author Contributions

Conceptualization, F.H. and S.K.; methodology, F.H. and S.K.; software, F.H. and S.K.; validation, F.H. and S.K.; formal analysis, F.H.; investigation, F.H. and S.K.; resources, M.G. and S.P.; data curation, F.H.; writing—original draft preparation, F.H. and S.K.; writing—review and editing, M.G. and S.P.; visualization, F.H. and S.K.; supervision, S.P.; project administration, M.G.; funding acquisition, M.G. All authors have read and agreed to the published version of the manuscript.

Funding

Parts of this research were funded by FVV e.V.//Science for a moving society, project number 1499 “Modelling Approaches of Vehicle Induced PN/PM and Emission Reduction Strategies”.

Data Availability Statement

Most relevant data generated or analyzed during this study are included in this published article. Additional data are partly available on request from the corresponding author with the permission of the FVV e.V.//Science for a moving society and their members.

Acknowledgments

We thank Max Decker (FVV e.V.//Science for a moving society) and Frank Bunar (IAV GmbH) as well as the FVV working group for the useful discussions, ideas, and support while conducting the corresponding research project.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

BSFC	Brake-Specific Fuel Consumption
DCT	Dual Clutch Transmission
DoE	Design of Experiments
ECU	Electronic Control Unit
EM	Electric Machine
EMS	Energy Management System
GA	Genetic Algorithm
HEV	Hybrid Electric Vehicle
ICE	Internal Combustion Engine
KPI	Key Performance Indicator
MINLP	Mixed-Integer Nonlinear Programming
MPC	Model Predictive Control
NEC	Net Energy Change
OCP	Optimal Control Problem
RDE	Real Driving Emissions
ROM	Reduced-Order Model
RTC	Real-Time Capability
SoC	State of Charge
SQP	Sequential Quadratic Programming
WLTC	Worldwide Harmonized Light Vehicles Test Cycle

References

Available online: https://Climate.Ec.Europa.Eu/Eu-Action/Transport-Decarbonisation/Overview_en (accessed on 10 March 2026).
Available online: https://www.eea.europa.eu/en/analysis/indicators/greenhouse-gas-emissions-from-transport (accessed on 10 March 2026).
Haas, T.; Sander, H. Decarbonizing Transport in the European Union: Emission Performance Standards and the Perspectives for a European Green Deal. Sustainability 2020, 12, 8381. [Google Scholar] [CrossRef]
Tran, D.-D.; Vafaeipour, M.; El Baghdadi, M.; Barrero, R.; Van Mierlo, J.; Hegazy, O. Thorough State-of-the-Art Analysis of Electric and Hybrid Vehicle Powertrains: Topologies and Integrated Energy Management Strategies. Renew. Sustain. Energy Rev. 2020, 119, 109596. [Google Scholar] [CrossRef]
Ehsani, M.; Singh, K.V.; Bansal, H.O.; Mehrjardi, R.T. State of the Art and Trends in Electric and Hybrid Electric Vehicles. Proc. IEEE 2021, 109, 967–984. [Google Scholar] [CrossRef]
Lü, X.; Li, S.; He, X.; Xie, C.; He, S.; Xu, Y.; Fang, J.; Zhang, M.; Yang, X. Hybrid Electric Vehicles: A Review of Energy Management Strategies Based on Model Predictive Control. J. Energy Storage 2022, 56, 106112. [Google Scholar] [CrossRef]
Huang, Y.; Wang, H.; Khajepour, A.; He, H.; Ji, J. Model Predictive Control Power Management Strategies for HEVs: A Review. J. Power Sources 2017, 341, 91–106. [Google Scholar] [CrossRef]
Zhou, Q.; Du, C. A Quantitative Analysis of Model Predictive Control as Energy Management Strategy for Hybrid Electric Vehicles: A Review. Energy Rep. 2021, 7, 6733–6755. [Google Scholar] [CrossRef]
Sampathnarayanan, B.; Serrao, L.; Onori, S.; Rizzoni, G.; Yurkovich, S. Model Predictive Control as an Energy Management Strategy for Hybrid Electric Vehicles. In Proceedings of the ASME 2009 Dynamic Systems and Control Conference; ASMEDC: Hollywood, CA, USA, 2009; Volume 2, pp. 249–256. [Google Scholar]
Mayne, D.Q.; Rawlings, J.B.; Rao, C.V.; Scokaert, P.O.M. Constrained Model Predictive Control: Stability and Optimality. Automatica 2000, 36, 789–814. [Google Scholar] [CrossRef]
Borhan, H.; Vahidi, A.; Phillips, A.M.; Kuang, M.L.; Kolmanovsky, I.V.; Di Cairano, S. MPC-Based Energy Management of a Power-Split Hybrid Electric Vehicle. IEEE Trans. Control Syst. Technol. 2012, 20, 593–603. [Google Scholar] [CrossRef]
Tang, W.; Wang, Y.; Jiao, X.; Ren, L. Hierarchical Energy Management Strategy Based on Adaptive Dynamic Programming for Hybrid Electric Vehicles in Car-Following Scenarios. Energy 2023, 265, 126264. [Google Scholar] [CrossRef]
Hu, X.; Liu, T.; Qi, X.; Barth, M. Reinforcement Learning for Hybrid and Plug-In Hybrid Electric Vehicle Energy Management: Recent Advances and Prospects. IEEE Ind. Electron. Mag. 2019, 13, 16–25. [Google Scholar] [CrossRef]
Li, J.; Liu, J.; Yang, Q.; Wang, T.; He, H.; Wang, H.; Sun, F. Reinforcement Learning Based Energy Management for Fuel Cell Hybrid Electric Vehicles: A Comprehensive Review on Decision Process Reformulation and Strategy Implementation. Renew. Sustain. Energy Rev. 2025, 213, 115450. [Google Scholar] [CrossRef]
Hu, Z.; Su, R.; Ling, K.-V.; Guo, Y.; Ma, R. Resilient Event-Triggered MPC for Load Frequency Regulation with Wind Turbines Under False Data Injection Attacks. IEEE Trans. Autom. Sci. Eng. 2024, 21, 7073–7083. [Google Scholar] [CrossRef]
Robuschi, N.; Zeile, C.; Sager, S.; Braghin, F. Multiphase Mixed-Integer Nonlinear Optimal Control of Hybrid Electric Vehicles. Automatica 2021, 123, 109325. [Google Scholar] [CrossRef]
Diehl, M.; Ferreau, H.J.; Haverbeke, N. Efficient Numerical Methods for Nonlinear MPC and Moving Horizon Estimation. In Nonlinear Model Predictive Control: Towards New Challenging Applications; Springer: Berlin/Heidelberg, Germany, 2009; pp. 391–417. [Google Scholar]
Quirynen, R.; Cairano, S. Di Sequential Quadratic Programming Algorithm for Real-Time Mixed-Integer Nonlinear MPC. In Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC); IEEE: New York, NY, USA, 2021; pp. 993–999. [Google Scholar]
Ploskas, N.; Sahinidis, N.V. Review and Comparison of Algorithms and Software for Mixed-Integer Derivative-Free Optimization. J. Glob. Optim. 2022, 82, 433–462. [Google Scholar] [CrossRef]
Belotti, P.; Kirches, C.; Leyffer, S.; Linderoth, J.; Luedtke, J.; Mahajan, A. Mixed-Integer Nonlinear Optimization. Acta Numer. 2013, 22, 1–131. [Google Scholar] [CrossRef]
Lü, X.; Wu, Y.; Lian, J.; Zhang, Y.; Chen, C.; Wang, P.; Meng, L. Energy Management of Hybrid Electric Vehicles: A Review of Energy Optimization of Fuel Cell Hybrid Power System Based on Genetic Algorithm. Energy Convers. Manag. 2020, 205, 112474. [Google Scholar] [CrossRef]
Monot, A.; Navet, N.; Bavoux, B.; Simonot-Lion, F. Multisource Software on Multicore Automotive ECUs—Combining Runnable Sequencing with Task Scheduling. IEEE Trans. Ind. Electron. 2012, 59, 3934–3942. [Google Scholar] [CrossRef]
Hrovat, D.; Di Cairano, S.; Tseng, H.E.; Kolmanovsky, I.V. The Development of Model Predictive Control in Automotive Industry: A Survey. In Proceedings of the 2012 IEEE International Conference on Control Applications; IEEE: New York, NY, USA, 2012; pp. 295–302. [Google Scholar]
Di Cairano, S.; Kolmanovsky, I.V. Automotive Applications of Model Predictive Control. In Handbook of Model Predictive Control; Springer: Berlin/Heidelberg, Germany, 2019; pp. 493–527. [Google Scholar]
Ferreau, H.J.; Almér, S.; Verschueren, R.; Diehl, M.; Frick, D.; Domahidi, A.; Jerez, J.L.; Stathopoulos, G.; Jones, C. Embedded Optimization Methods for Industrial Automatic Control. IFAC-PapersOnLine 2017, 50, 13194–13209. [Google Scholar] [CrossRef]
Norouzi, A.; Heidarifar, H.; Borhan, H.; Shahbakhti, M.; Koch, C.R. Integrating Machine Learning and Model Predictive Control for Automotive Applications: A Review and Future Directions. Eng. Appl. Artif. Intell. 2023, 120, 105878. [Google Scholar] [CrossRef]
Bonami, P.; Biegler, L.T.; Conn, A.R.; Cornuéjols, G.; Grossmann, I.E.; Laird, C.D.; Lee, J.; Lodi, A.; Margot, F.; Sawaya, N.; et al. An Algorithmic Framework for Convex Mixed Integer Nonlinear Programs. Discret. Optim. 2008, 5, 186–204. [Google Scholar] [CrossRef]
Kronqvist, J.; Bernal, D.E.; Lundell, A.; Grossmann, I.E. A Review and Comparison of Solvers for Convex MINLP. Optim. Eng. 2019, 20, 397–455. [Google Scholar] [CrossRef]
Maurer, R.; Kossioris, T.; Sterlepper, S.; Günther, M. Zero-Impact Tailpipe Emission Powertrains: Identify Technical Solutions to Achieve Powertrains with Zero-Impact Tailpipe Emissions under Consideration of a 2030+ Legislative Vehicle and Air Pollutant Scenario. Final. Rep. 2023, 1334, 43–51. [Google Scholar]
Maurer, R.; Kossioris, T.; Hausberger, S. How to Define and Achieve Zero-Impact Emissions in Road Transport? Transp. Res. D Transp. Environ. 2023, 116, 103619. [Google Scholar] [CrossRef]
Zhang, B.; Shi, P.; Mou, X.; Li, H.; Zhao, Y.; Zheng, L. Energy Management Strategy for P1 + P3 Plug-In Hybrid Electric Vehicles. World Electr. Veh. J. 2023, 14, 332. [Google Scholar] [CrossRef]
Tao, J.; Huang, Z.; Di, J. Dual-Motor Hybrid Dedicated Transmission Based on P1/P3 Architecture. In Proceedings of China SAE Congress 2020: Selected Papers; Springer: Berlin/Heidelberg, Germany, 2022; pp. 1651–1669. [Google Scholar]
Grüne, L.; Pannek, J. Nonlinear Model Predictive Control; Springer International Publishing: Cham, Germany, 2017; ISBN 978-3-319-46023-9. [Google Scholar]
Available online: https://arxiv.org/abs/1803.06928 (accessed on 10 March 2026).
Scholz, D. Optimierung Interaktiv; Springer: Berlin/Heidelberg, Germany, 2018; ISBN 978-3-662-57952-7. [Google Scholar]
Kexel, J.; Müller, J.; Herkenrath, F.; Hermsen, P.; Günther, M.; Pischinger, S. Virtual Plug-In Hybrid Concept Development and Optimization under Real-World Boundary Conditions. Vehicles 2024, 6, 1216–1248. [Google Scholar] [CrossRef]
Liang, Z.; Chung, C.Y.; Zhang, W.; Wang, Q.; Lin, W.; Wang, C. Enabling High-Efficiency Economic Dispatch of Hybrid AC/DC Networked Microgrids: Steady-State Convex Bi-Directional Converter Models. IEEE Trans. Smart Grid 2025, 16, 45–61. [Google Scholar] [CrossRef]
Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley: Boston, MA, USA, 1989; ISBN 9780201157673. [Google Scholar]
Holland, J.H. Adaptation in Natural and Artificial Systems; The MIT Press: Cambridge, MA, USA, 1992; ISBN 9780262275552. [Google Scholar]
Andrei, N. Modern Numerical Nonlinear Optimization; Springer International Publisher: Berlin/Heidelberg, Germany, 2025; ISBN 9783031946936. [Google Scholar]
Fleming, P.J.; Fonseca, C.M. Genetic Algorithms in Control Systems Engineering. IFAC Proc. Vol. 1993, 26, 605–612. [Google Scholar] [CrossRef]
Katoch, S.; Chauhan, S.S.; Kumar, V. A Review on Genetic Algorithm: Past, Present, and Future. Multimed. Tools Appl. 2021, 80, 8091–8126. [Google Scholar] [CrossRef]
Verschueren, R.; Frison, G.; Kouzoupis, D.; Frey, J.; Duijkeren, N.; van Zanelli, A.; Novoselnik, B.; Albin, T.; Quirynen, R.; Diehl, M. Acados—A Modular Open-Source Framework for Fast Embedded Optimal Control. Math. Program. Comput. 2022, 14, 147–183. [Google Scholar] [CrossRef]
Bock, H.G.; Plitt, K.J. A Multiple Shooting Algorithm for Direct Solution of Optimal Control Problems. IFAC Proc. Vol. 1984, 17, 1603–1608. [Google Scholar] [CrossRef]
Andersson, J.A.E.; Gillis, J.; Horn, G.; Rawlings, J.B.; Diehl, M. CasADi: A Software Framework for Nonlinear Optimization and Optimal Control. Math. Program. Comput. 2019, 11, 1–36. [Google Scholar] [CrossRef]
Deb, K. Multi-Objective Optimization Using Evolutionary Algorithms; John Wiley & Sons: Hoboken, NJ, USA, 2001; ISBN 9780471873396. [Google Scholar]
Miettinen, K. Nonlinear Multiobjective Optimization; Springer: Boston, MA, USA, 1998; Volum 12, ISBN 978-1-4613-7544-9. [Google Scholar]
Liu, W. Hybrid Electric Vehicle System Modeling and Control; Wiley: Hoboken, NJ, USA, 2017; ISBN 9781119279327. [Google Scholar]
Balazs, A. Optimierte Auslegung von Ottomotorischen Hybridantriebssträngen Unter Realen Fahrbedingungen. Ph.D. Thesis, RWTH Aachen University, Aachen, Germany, 2015. [Google Scholar]
Seibel, J. Optimierte Auslegung von Ottomotoren in Hybrid-Antriebssträngen. Ph.D. Thesis, RWTH Aachen University, Aachen, Germany, 2008. [Google Scholar]
Jannik, K. Lebenszyklusbasierte Entwicklung Und Bewertung von Nachhaltigen Antriebskonzepten. Ph.D. Thesis, RWTH Aachen University, Aachen, Germany, 2025. [Google Scholar]

Figure 1. Illustration of the P1P3 powertrain and positioning of components. EM1 is the electric machine close to the internal combustion engine (ICE) and EM2 is the electric machine behind the 7-Speed Dual Clutch Transmission (DCT).

Figure 2. Operating range of the EM1 below the 15 kW power curve (blue area) in the Brake-Specific Fuel Consumption (BSFC) map. The P1P3 hybrid vehicle under investigation can only be operated in this range.

Figure 3. Flowchart for the individual steps of the genetic algorithm, own representation based on content from [41].

Figure 4. Sensitivity of the GA-based MPC to prediction horizon length (left column, step size 0.5 s) and discretization step size (right column, horizon length 5 s) on the WLTC. (Top row): average system efficiency. (Bottom left): real-time capability (RTC). (Bottom right): corrected fuel consumption.

Figure 5. Principle of model predictive control with SQP. Discrete operation mode selection is resolved by parallelized calculation.

Figure 6. Response surfaces from DoE weight optimization showing fuel consumption (left) and end-of-cycle SoC (right). The red star indicates the selected optimum.

Figure 7. Speed profiles of the WLTC (top) and the RDE cycle (bottom) examined for validation.

Figure 8. Results of the MPC approaches with GA in blue and SQP in gray for corrected fuel consumption, ICE efficiency, system efficiency, and cumulative noise emissions in WLTC. For comparison purposes, the results of a state-of-the-art deterministic EMS, which was primarily optimized for fuel consumption using a DoE approach, are shown in black.

Figure 9. Fuel share (top) and energy share of the electric machine EM2 (bottom) for both MPC approaches (a) GA and (b) SQP for the WLTC. The dashed line highlights the BSFC optimum curve.

Figure 10. Results of the MPC approaches with GA in blue and SQP in gray for corrected fuel consumption, ICE efficiency, system efficiency, and cumulative noise emissions in the RDE cycle under investigation. For comparison purposes, the results of a state-of-the-art deterministic EMS, which was primarily optimized for fuel consumption using a DoE approach, are shown in black.

Figure 11. Fuel share (top) and energy share of the electric machine EM2 (bottom) for both MPC approaches (a) GA and (b) SQP for the RDE cycle under investigation. The dashed line highlights the BSFC optimum curve.

Figure 12. Real-time capacity evaluation for the two MPC approaches and, for comparison purposes, the RTC value for the deterministic EMS.

Table 1. Overview of powertrain components.

	Parameter	Unit	Value
ICE	Max. power	kW	80
	Max. torque	Nm	140
	Max. speed	min⁻¹	6000
EM1	Max. power	kW	15
	Max. torque	Nm	18
EM2	Max. power	kW	20
	Max. torque	Nm	66
	Max. speed	min⁻¹	7000
	Gear ratio	-	3.7

Table 2. Validation of SQP polynomial fits against original lookup tables used by the GA implementation.

Model	R² -	RMSE %	Max Deviation %
Fuel mass flow rate	0.9999	0.01	2.00
Engine Noise	0.9946	0.86	7.75
Masking Noise	0.9990	0.35	1.23

Table 3. Weight set after Design of Experiment optimization.

Scenario	w_SOC -	w_η,ICE -	w_Noise -	Fuel Cons. in L/100 km	SoC_End -
Selected optimum	0.14	0.59	0.21	3.82	0.505
Increased w_SOC	0.42	0.7	0.23	3.94	0.511
Decreased w_SOC	0.04	0.67	0.15	3.80	0.302
Increased w_η,ICE	0.18	0.9	0.2	3.86	0.513
Decreased w_η,ICE	0.17	0.16	0.32	3.90	0.507
Equal weights	0.36	0.35	0.32	3.97	0.509

Table 4. Distribution of operating time in parallel, serial, and electrical mode for the two MPC approaches and for comparison purposes with the deterministic EMS in WLTC.

Energy Management System		Temporal Share Mode in %
	Parallel	Serial	Electric
Deterministic	42.9	11.5	45.6
MPC GA	48	0.9	51.1
MPC SQP	47.8	1.3	50.8

Table 5. Repeatability of GA-based MPC results over three independent WLTC runs.

	Mean	Std. Deviation	Coefficient of Variation in %
FC_Corr in L/100 km	3.821	0.005	0.14
SoC_End	0.506	0.0004	0.09
∆E_Noise in µJ/m²	0.123	0.015	12.4

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Herkenrath, F.; Koßler, S.; Günther, M.; Pischinger, S. Addressing Mixed-Integer Nonlinear Energy Management in Hybrid Vehicles: Comparing Genetic Algorithm and Sequential Quadratic Programming Within Model Predictive Control. Energies 2026, 19, 1535. https://doi.org/10.3390/en19061535

AMA Style

Herkenrath F, Koßler S, Günther M, Pischinger S. Addressing Mixed-Integer Nonlinear Energy Management in Hybrid Vehicles: Comparing Genetic Algorithm and Sequential Quadratic Programming Within Model Predictive Control. Energies. 2026; 19(6):1535. https://doi.org/10.3390/en19061535

Chicago/Turabian Style

Herkenrath, Ferris, Silas Koßler, Marco Günther, and Stefan Pischinger. 2026. "Addressing Mixed-Integer Nonlinear Energy Management in Hybrid Vehicles: Comparing Genetic Algorithm and Sequential Quadratic Programming Within Model Predictive Control" Energies 19, no. 6: 1535. https://doi.org/10.3390/en19061535

APA Style

Herkenrath, F., Koßler, S., Günther, M., & Pischinger, S. (2026). Addressing Mixed-Integer Nonlinear Energy Management in Hybrid Vehicles: Comparing Genetic Algorithm and Sequential Quadratic Programming Within Model Predictive Control. Energies, 19(6), 1535. https://doi.org/10.3390/en19061535

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Addressing Mixed-Integer Nonlinear Energy Management in Hybrid Vehicles: Comparing Genetic Algorithm and Sequential Quadratic Programming Within Model Predictive Control

Abstract

1. Introduction

2. Vehicle and Powertrain Model

2.1. Powertrain Architecture

2.2. Component Specifications

2.3. Operating Modes

3. MPC Framework and Problem Formulation

3.1. Model Predictive Control Principle

3.2. Multi-Objective Cost Function

3.3. System Constraints

3.4. Mixed-Integer Problem Structure

4. Optimization Algorithms

4.1. Genetic Algorithm

4.1.1. Algorithm Structure

4.1.2. Implementation

4.2. Sequential Quadratic Programming

4.2.1. Mode-Specific Optimization

4.2.2. Optimal Control Problem Formulation

4.2.3. Control Variables per Mode

4.2.4. Implementation with Acados

4.3. Reduced-Order Models

4.3.1. Model Requirements

4.3.2. Implemented Models

4.4. Cost Function Weight Tuning

5. Results

5.1. Performance Metrics and Fuel Consumption Correction

5.1.1. Fuel Consumption Correction

5.1.2. System Efficiency

5.1.3. Computational Performance Metric

5.1.4. Noise Metric

5.2. Design Cycle Results (WLTC)

5.3. Real-World Driving Results (RDE)

5.4. Computational Performance

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI