Convex Optimization for the Energy Management of Hybrid Electric Vehicles Considering Engine Start and Gearshift Costs

This paper presents a novel method to solve the energy management problem for hybrid electric vehicles (HEVs) with engine start and gearshift costs. The method is based on a combination of deterministic dynamic programming (DP) and convex optimization. As demonstrated in a case study, the method yields globally optimal results while returning the solution in much less time than the conventional DP method. In addition, the proposed method handles state constraints, which allows for the application to scenarios where the battery state of charge (SOC) reaches its boundaries.


Introduction
Hybrid electric vehicles (HEVs) are one option to reduce the CO 2 emissions of passenger light-duty vehicles.Such hybrid vehicles consist of at least two power sources, typically an internal combustion engine and one or more electric motors, as well as an energy buffer, typically a battery.The control of the power flows among these devices is commonly referred to as the energy management or supervisory control [1].There are many approaches [2] to design an energy management strategy, covering heuristic approaches [3][4][5][6] as well as optimization-based approaches, such as deterministic dynamic programming (DP) [7][8][9][10][11][12][13], Pontryagin's minimum principle (PMP) [14][15][16][17][18][19][20][21] and stochastic dynamic programming (SDP) [22][23][24][25][26]. Recently, convex optimization [27] has attracted attention in the research field of energy management for HEVs.It is seen as an alternative method for the optimization of the power flows in HEVs with the advantage of being computationally more efficient than for example DP or indirect methods such as PMP are.Convex optimization has already been employed for dimensioning of powertrains [28][29][30], energy efficiency analysis [31] and online control of HEVs [32,33].However, inherently discrete decision variables, such as the engine on/off decision or the gear decision, are typically determined using heuristic approximations.The reason is the fact that discrete signals are not convex and thus cannot be optimized using convex optimization.One approach to optimize discrete decision problems is to solve a mixed-integer problem [32], but its drawback is the large computational burden for optimizations on typical driving missions.A more efficient approach consists of splitting up the problem into two parts, where the discrete decision variables are optimized prior to the convex optimization [33] with the possibility to iterate between the two steps [34].Based on this methodology, the authors of [30] showed that the optimal engine on/off strategy can be calculated for a serial HEV.
However, in all those convex optimization approaches, costs for switching the engine on/off state or changing the gears are not considered.As reported for instance in [35,36], applying optimization-based methods for the energy management of parallel hybrid vehicles without penalizing engine start or gearshift events can lead to unacceptably frequent engine starts and gearshifts, eventually leading to an increased fuel consumption, excessive wear of components and/or an uncomfortable driving experience.Therefore, in the DP, PMP and SDP approaches, researchers [26,[37][38][39][40][41] have introduced penalties or costs for the changes in the engine or the gear state such that unacceptably frequent starts and shifts can be prevented.To do so in convex optimization-based approaches, heuristics based on signal filtering have been presented so far [29].However, a corresponding optimal solution for methods relying on convex optimization has yet to be found.
Therefore, this paper presents a method to solve the energy management problem for a parallel HEV taking such engine start and gearshift costs into account by using a combination of convex optimization and DP.In a case study, the method is shown to converge to the globally optimal solution while still being computationally efficient.Moreover, the proposed method can handle state constraints.Although not presented in this paper, the algorithm can potentially be applied to an efficient sizing of hybrid powertrains or to the on-board control of hybrid vehicles in the form of receding horizon control.
The paper is structured as follows: Section 2 details the vehicle model used,; Section 3 explains the mathematical problem; Section 4 presents the novel method; Section 5 compares the novel method to DP; Section 6 discusses the novel method; and Section 7 concludes on the ideas shown in this paper.

Vehicle Model
The vehicle under investigation is a pre-transmission parallel HEV passenger car of the executive class.Figure 1 illustrates the configuration of the powertrain architecture which consists of an automatic gearbox, an electric motor, the power electronics, a battery, a clutch and a turbocharged spark-ignited internal combustion engine.The vehicle does not exist yet physically, therefore, the basis for the subsequent vehicle model is a generic vehicle model according to commonly accepted modeling standards [1].To simulate the vehicle, a quasi-static backward approach is used.This widely adopted method of simulation [8,12,42,43] was shown to yield reasonable estimates of fuel consumption when compared to measurement data [44].

Figure 1.
Hybrid electric vehicle (HEV) architecture considered in this paper: pre-transmission parallel HEV.

Introduction to Convex Vehicle Modeling
In order to apply the concepts presented in Section 4 below, the generic, nonlinear vehicle model has to be approximated by a convex model, see [27] for a general introduction to convex optimization, and [29,45] for an introduction to convex modeling and optimization of HEVs.
A convex description of the model requires the component models to be expressed as a convex function of the free optimization variables which are used by the convex solver.These free optimization variables typically comprise more free variables than are actually required to solve the problem using different methods such as, for instance, DP.Examples of such free optimization variables are the torques of the motor and the engine, the electrical power consumption of the motor, the fuel consumption, the battery current and the battery state of charge (SOC).Inherently discrete variables, such as the engine on/off decision or the gear decision, are not convex, and thus, they cannot be included in a convex description.However, for given trajectories of the non-convex variables, the remaining model can be convex.
In this paper, the energy management of the vehicle is assumed to have three decision variables, namely the engine on/off, the gear, and the torque split between the engine and the motor.Since the engine on/off and the gear decision are inherently discrete, these variables are assumed to be known prior to solving a convex optimization problem.The only convex decision variable is the torque split.Therefore, the power flows from the road load up to the gearbox input, where the torque split is decided, can be expressed by any nonlinear function.Only the power flows upstream of the gearbox input have to be described in a convex form.In the following, the approximations needed to obtain a suitable model are explained in detail.All vehicle parameters used in the model are listed in Table 1.

Road Load Model
Assuming that a quasi-static driving cycle is represented by its velocity v, acceleration a, and road slope α for each instant in time k ∈ {1, 2, . . ., N }, the wheel speed ω w and the wheel torque T w required to follow the driving cycle are calculated by [1]: The variable r w denotes the wheel radius, ρ air the air density, c d the aerodynamic drag coefficient, A the frontal area of the vehicle, c r the rolling friction coefficient, m v the total vehicle weight, a g the gravitational acceleration, m r the equivalent mass of the rotating parts of the powertrain, and g the gear engaged (m r depends on the choice of the gear).

Gearbox Model
The speed ω g and the torque T g at the gearbox input are calculated by: with i g being the gear ratio of gear g ∈ {1, . . ., 7}, and η g,0 , η g,1 , ω g,1 being the parameters to model the speed-dependent gearbox losses which account for the increased friction at higher gearbox input speeds [46].The values for the efficiency parameters were chosen based on expert knowledge.Moreover, losses occurring during gearshifts are assumed to consume m g = 0.01 g fuel equivalent per gearshift, i.e., The gear g is assumed to be controlled by the gear control variable u g : These shift losses in Equation ( 5) account for the fuel-equivalent energetic effort to change the gears.The losses can be estimated by first principle models similarly to those presented in [47,48] for a dual-clutch transmission.Otherwise, for the optimization, some artificial shift costs can be assigned in order reduce the number of gearshifts.These artificial shift costs are then considered in the cost function of the optimization without including them in the calculation of the fuel consumption [10,26,40].For the method presented in this paper, the model for the gearshift costs is not restricted to the form presented in Equation (5).More complex models are allowed since the gear control variable is not optimized via convex optimization.

Torque Split Model
The torque split between the electric motor and the internal combustion engine is determined by the motor torque u ts T m such that the engine torque T e is determined by: This equation is convex in any of the variables involved.

Electric Motor Model
The electric motor, with a nominal power of 40 kW, is directly coupled to the input shaft of the gearbox such that the motor speed ω m is always equal to ω g .The power of the electric motor including the power electronics is approximated by a second-order polynomial with speed-dependent coefficients (the time index k being neglected for readability): This description is convex in the variable T m , which is required in the convex optimization problem in Equation ( 27) introduced later.Although Equation ( 8) potentially allows for wasting of the electrical energy, the optimal solution will always hold with equality [29].
The operation of the motor is limited by its minimum and maximum torque limitations, as well as by its maximum speed: The model fit for the electric motor compared to the underlying measurement data is shown in Figure 2. The measurement data were obtained for a 25 kW permanent magnet synchronous motor.The torque axis is then linearly scaled with the nominal power while the efficiencies are kept constant.For more information on the scaling method, see for instance [21,49].Clearly, the fit is very good in the entire operating range.

Engine Model
The mass flow ṁf of the fuel consumed is approximated by a second-order polynomial with speed-dependent coefficients (the time index k being neglected): where ω e denotes the rotational speed of the engine.This description is convex in the variable T e , which is needed in the convex optimization problem in Equation ( 27) introduced later.
At all times, the engine is only allowed to operate within its limits defined by a maximum torque, minimum speed and a maximum speed condition, i.e., The model fit of the engine compared to measurement data is shown in Figure 3.The measurement data were obtained for a modern 125 kW turbocharged spark-ignited combustion engine.The torque axis is then linearly scaled with the nominal power while the efficiencies are kept constant.For more information on the scaling method, see for instance [21,49].The fit is good, with only small errors at high torques at high speeds.The engine on/off decision is determined via the variable u e ∈ {0, 1} such that the engine state e is dictated by: here, the value e = 1 represents the "on" state; and e = 0 represents the "off" state.If the engine is on, the engine speed is equal to w g , unless the first gear is engaged, where the clutch is allowed to slip.If the engine is off, the engine speed is equal to zero, and the engine is disconnected from the rest of the powertrain.Moreover, each engine start is assumed to consume m e = 0.3 g fuel equivalent, i.e., These start costs represent the fuel-equivalent costs to start the engine.For example, these costs can be calculated based on an electrical energy consumption to crank up the engine to its start speed and a fuel consumption to synchronize the engine speed with the gearbox input speed, see for instance [39,48].In the optimization, artificial costs can be assigned in order to reduce frequent engine starts.For the method proposed in this paper, the type of loss model is not restricted to the form Equation (15).A more detailed start model [39,48] could also be employed because the engine on/off decision is not assumed to be convex.

Battery Model
The battery is assumed to be of the LiFePO 4 type [50,51] with the data taken from [29].It is modeled as an equivalent circuit with an open-circuit voltage V oc in series with a constant internal resistance R i [1].In the nonlinear model, the open-circuit voltage is a function of the SOC as shown in Figure 4.In the convex model, the open-circuit voltage is assumed to be constant.The assumptions of a constant open-circuit voltage and a constant resistance are reasonable in the typical operating range of the battery [2,17,28,34].Figure 4 shows a comparison of the two models.In the convex case, the discrete-time equation of the dynamics for the battery SOC reads: where I b is the battery current, P aux is a constant power consumed by electric auxiliary units, and ∆t is the constant sampling time of 1 s.Equation ( 16) is convex in SOC and in I b , however, Equation (17) has to be reformulated to: in order to be convex in the optimization variable of interest, which here is I b only.
The battery current and the battery SOC are constrained to: respectively.

Overview
In summary, the control variables u and the states x of the dynamical system are given by: x(k)

Problem Formulation
The problem to be solved is the energy management problem formulated as an optimal control problem as follows: subject to the vehicle model in Equations ( 1)-( 22) and subject to the charge-sustenance condition SOC(N + 1) = SOC(1) = SOC 0 with SOC 0 being the initial state of charge.
The energy management problem can be solved using DP.However, to account for the engine start and gearshift losses, three state variables are required, i.e., the battery SOC, the engine state, and the gear currently engaged.Unfortunately, the computation time of this pure DP approach is large even for a coarse discretization for the SOC.Moreover, the accuracy of the solution is influenced by the discretization of the problem.
A more efficient approach to solve the problem was proposed in [41] where DP and PMP are combined such that the SOC state can be removed from the DP.As such, DP optimizes the equivalent fuel consumption for a given equivalence factor and for given engine start and gearshift costs, and the optimal solution is found by iterating on the equivalence factor until a charge sustaining solution is found.The advantage of this solution is the fact that the algorithm finds a close-to-optimal solution in a relatively short amount of time.Its main drawback is the fact that finding a charge-sustaining equivalence factor cannot be guaranteed, and therefore a heuristic approach has to be used instead.Another drawback consists of the fact that state constraints on the SOC cannot be handled.
In this paper, these drawbacks are avoided by extending the ideas presented in [41].Accordingly, convex optimization is used to find a charge-sustaining equivalence factor and to handle the state constraints, while DP is used to find the optimal engine on/off and gearshift strategy.

Iterative Algorithm
As shown in [41], by application of PMP, the optimal control problem (23) can be reformulated to with s denoting the equivalence factor between fuel consumption and electrical energy consumption.This equivalence factor is basically the Lagrange multiplier of the Euler-Lagrange formulation of the optimal control problem, sometimes also referred to as the dual variable.The dynamics of s are given by the necessary conditions of PMP.If the SOC dynamics are independent of the SOC, then s can be shown to be piecewise constant [52].If in addition no state constraints are present on the SOC, s is constant for the entire driving cycle.The ultimate goal is then to find the optimal trajectory of s resulting in a charge-sustaining solution.As shown in the following, the aforementioned properties can be used to derive an efficient algorithm.By further defining: the authors of [41] showed that Equation ( 24) is equal to: i.e., by instantaneously minimizing the function H(.), the optimal torque split u ts can be found immediately for a given combination of u g and u e .Therefore, if the charge-sustaining value of s is known, problem in Equation ( 26) can be solved with DP using the gear g and the engine state e as the only state variables.The problem then remains to find the charge-sustaining value of s which, in [41], is effected by a root finding algorithm.However, that method does not yield optimal results if state constraints on the SOC are present or if engine start costs are considered.The reason is that in those cases, it is not possible to find the optimal equivalence factor s using a root finding algorithm because s is not constant.
Therefore, in this paper, the difference to the approach described in [41] is the fact that the equivalence factor is determined by solving a convex optimization problem instead of applying a root finding algorithm.The convex optimization problem to be solved is formulated as follows (bold symbols represent optimization variables): Minimize: subject to: for all k ∈ {1, . . ., N }.
Note that the gearshift and engine on/off sequence must be defined prior to solving Equation ( 27), otherwise the problem is not convex.As a consequence, the gearbox torque T g (k) and the speeds ω e (k) and ω g (k) can be precalculated for the entire driving cycle.
The problem in Equation ( 27) is parsed with CVX [53] and solved with SeDuMi [54].As a result, CVX yields the dual variable associated with Equation (27f).In this case, the dual variable is identical the equivalence factor s.
As illustrated in Figure 5, the above ideas are combined to perform a sequential optimization of the engine on/off and of the gearshift strategy using DP and the power split strategy using convex optimization.The input to the DP is the equivalence factor s in , and the output of the DP consists of the gearshift strategy u g and the engine on/off strategy u e .Here, s in stands for s in (k) ∀k ∈ {1, . . ., N }.A similar notation is used for s, s out , u e , u g .In a next step, a new equivalence factor s out is calculated by solving the convex optimization problem in Equation ( 27) for the previously calculated values of u g and u e .
Assuming that the optimal value of s in ≡ s is known, the optimal values of u g and u e are obtained by using DP [41].The symbol stands for optimal, and ≡ stands for identity of the trajectory.On the other hand, solving the convex problem for the optimal values of u g and u e yields the optimal value of s out ≡ s [30,34].Therefore, if s in ≡ s , then s out ≡ s in ≡ s .As a consequence, the identity "s out ≡ s in " is a necessary condition for the optimality of a solution.To show that this condition is also a sufficient condition, the condition "if and only if s out ≡ s in ⇒ s out ≡ s " has to be shown.To do so, two further scenarios need to be considered for the case if s in = s : in the first scenario, it may happen that solving Equation (26) yields u g and u e although s in = s (for example if the torque split strategy is not optimal).Therefore, the conclusion follows that solving Equation ( 27) yields s out ≡ s , and therefore s out = s in .
In the second scenario, solving Equation ( 26) yields u g = u g and/or u e = u e .Then, solving Equation (27) results in s out = s and also s out = s in as explained in the following: Assume that the values of s in are lower than those of the optimal solution s .This means that electric energy is considered to be cheap in the optimization problem in Equation (26).Hence, the solution to Equation (26) results in an energy management strategy (i.e., u ts , u g , u e ) in which the vehicle is operated in the pure electric mode for more time, implying also that the engine is used less than in the optimal solution."Using the engine less" means that recharging the battery via operating point shifting is done at lower additional loads than optimal, but also that the total engine runtime is less than optimal.Therefore, the SOC is not sustained over the entire driving cycle.
Then, at the next step of the sequential optimization, the optimal power split is calculated via convex optimization for the values of u e and u g obtained at the previous step.Since the solver for the convex problem in Equation ( 27) tries to find an SOC sustaining solution for the given engine runtime that is lower than optimal, the engine has to be operated at higher loads to generate more electric energy during the intervals where the engine is on.In the optimization problem in Equation ( 27), an increased use of the engine can only be achieved by considering the electric energy to be expensive, requiring the values of the equivalence factor s out to be higher than those of s in .Therefore, if s in = s , then s out = s in .Finally, it is concluded that if and only if s out ≡ s in holds, then s out ≡ s .
Based on these arguments, an iterative algorithm can be constructed which starts with an initial guess of s in and converges towards a solution which fulfills the condition s in ≡ s out .In the following, this iterative algorithm is referred to as the DP-C method.Figure 6 illustrates the entire DP-C method which is explained in detail below.Step 1: Initialization of the algorithm The iteration counter is initialized by j = 0.The initial equivalence factor s (0) in is initialized by a constant value for the entire driving cycle.In the case studies below, a constant value of 2.9 is chosen.Notice that in the simulation results below, instead of the fuel mass flow the fuel power is optimized.
More details are given in the Results Section below.Alternatively, some initial engine on/off strategy u (−1) e and gearshift strategy u (−1) g can be chosen to obtain an initial value s (0) in by solving the convex problem in Equation (27).
Step 2: Use of DP to obtain u In this step, based on s (j) in , DP is employed to solve Equation (26).Notice that in total only two states are needed, namely one for the gear and one for the engine on/off state.A state for the battery SOC is not included, allowing the DP problem for a typical driving mission to be solved in less than a second on a notebook with an Intel Core i7 CPU 64-bit with 2.80 GHz.
As a result, new values of u (j) g and u e are obtained which are optimal for s (j) in .However, the solution does not guarantee charge sustenance since in this step, the state for the SOC and the corresponding final state constraint were removed from the problem.
Step 3: Use of convex optimization to obtain s

(j) out
For the given values of u (j) g and u (j) e , the power split is calculated by solving Equation (27).As a result, the charge-sustaining equivalence factor s (j) out is obtained.

Step 4: Check for convergence
To check whether the DP-C method has converged, the trajectories of s (j) in and s (j) out are compared by calculating the root mean square error: If the value of E is lower than some predefined tolerance E T OL , the iteration is terminated and the results can be returned.If the value of E is larger than E T OL , the iteration counter is updated by j ← j + 1, and the update law described in Step 5 below is applied.
In practice, it is more convenient to define a convergence tolerance ∆F C T OL for the fuel consumption and the additional condition that u g and u e do not change between any two consecutive iterations.For the case studies below, this stopping criterion was used with ∆F C T OL = 1 × 10 −5 L/100 km.However, this method of checking the convergence is not equivalent to the one described in the first paragraph above.Therefore, checking whether E decreases at each iteration is recommended.
Step 5: Update law Performing Steps 2 and 3 does not necessarily yield the optimal solution since the optimal equivalence factor is unknown.Initializing the equivalence factor s in with a too high value will result in a too low value for s out (and vice versa), as explained above.Therefore, the optimal s lies between s in and s out .To ensure convergence towards s , damping is introduced by including an update law for the equivalence factor s (j) in before step 2. The update law used here is taken from [30], which is: with the convergence parameter q.The value of the convergence parameter is initially chosen as q = 0.2.When the change of the root mean square error Equation ( 28) between any two consecutive iterations falls below a certain threshold value, the convergence parameter is reduced by q ← 0.7 • q.By using this update law, s out converges towards s for j → ∞.By using the tolerance criterion introduced in Step 4 (∆F C T OL ), the algorithm terminates after two to ten iterations, depending on the driving cycle and the initial value for the equivalence factor.

Results
For the generation of the optimization results shown below, the indicator for the fuel consumption in Equation (27a) is replaced by the fuel power P f which is defined by with H l being the lower heating value of the fuel.The fuel power allows for a more intuitive interpretation of the equivalence factor s since it then "converts battery power to an equivalent fuel power that must be added to the actual fuel power to attain a charge-sustaining control strategy" [2] (p.66).
The resulting values for the equivalence factor can also be compared more easily to those found in the literature.

Performance Comparison with the Convex Vehicle Model
To assess the performance of the DP-C method, it is compared to a pure DP implementation of the convex vehicle model described in Section 2. For the following case studies, unless stated otherwise, the minimum and maximum values of the SOC are assumed to be 20% and 80%, respectively.The SOC state of the DP method is discretized with a resolution of 1% SOC.
Both the DP-C and the DP method are applied on three well-known driving cycles, namely the New European Driving Cycle (NEDC), the Federal Test Procedure (FTP), and the Common Artemis Driving Cycle (CADC).In a further case, referred to as "CADC bounded", the DP-C and the DP method are both applied to the CADC with tighter SOC constraints, i.e., 0.50 ≤ SOC ≤ 0.65.
Table 2 summarizes the fuel consumption, the number of engine starts, the number of gearshifts, and the computation time for the four cases considered.The computation time was identified on a standard notebook with an Intel Core i7 CPU (64-bit, 2.80 GHz).
As Table 2 shows, the fuel consumption (FC) calculated by the DP-C method is always slightly lower than the one calculated by the DP method.The number of engine starts and the number of gearshifts are identical for both methods on the NEDC, whereas on the other driving cycles, these numbers are very similar.The differences are explained by the fact that the DP method requires a discretized state space for the continuous state of the SOC, whereas the DP-C method does not.The discretization of the SOC state for the DP method results in errors occurring during the interpolation of the cost-to-go function in the backward recursion of the DP.As a consequence, this cost-to-go function yields a sub-optimal control performance when compared to an ideal implementation.However, these discretization errors could only be reduced by increasing the resolution of the discretized state space, which would lead to an increased computational burden.Figures 7-10 show the trajectories of the SOC, engine on/off state, the gear state, and the equivalence factor for the four cases considered above.For the NEDC (Figure 7) and the FTP (Figure 8), the trajectories show an almost perfect match of the DP-C and the DP methods (except for one up-and downshift in the FTP).For the two CADC cases, the trajectories do not match well.As described above, the mismatch is attributable to the error introduced by the discretization of the SOC state.Particularly at the border of the feasible state space, the interpolation errors between the feasible and the infeasible state space lead to an error in the cost-to-go function [47].
Comparing the computational burden associated with the DP-C and the DP methods, the DP-C significantly outperforms the DP method.As Table 2 shows, the DP-C method requires less than 10% of the time required by the DP method.In the case of the "CADC bounded", the DP-C method still requires only about 20% of the time required by the DP method, although a reduced number of discrete supporting points for the SOC state are used in the DP method.
Summarizing the findings described in this section, the DP-C method yields more accurate results in much less time than a pure DP implementation.

Performance Comparison with the Nonlinear Vehicle Model
To assess the performance of the DP-C method on the nonlinear vehicle model, the control strategy obtained with the DP-C method is applied to the nonlinear model.The result obtained is then compared to the result of a pure DP implementation including the nonlinear model.In this case study, the only differences of the nonlinear model compared to the convex model are the use of the original consumption maps instead of second-order polynomials in Equations ( 8) and (11), as well as the use of the nonlinear open-circuit voltage characteristic presented in Figure 4.
To increase the accuracy of the DP-C method on the nonlinear vehicle model, the DP-C algorithm is modified at Step 2 ("DP for u g and u e "): Instead of using the convex vehicle model, the nonlinear model is used.The advantage of this modification results in a more accurate gearshift and engine on/off strategy than in the case where the convex model is used.Therefore, the engine on/off and gearshift strategies can be different from those in the corresponding convex case.
As Table 3 shows for the NEDC and the FTP driving cycles, the fuel consumption obtained with the DP-C method can still be lower than the amount calculated with DP even for the nonlinear model.However, in the cases of the CADC and "CADC bounded", the fuel consumption obtained by applying the DP-C method is higher by 0.3% than that calculated with DP.The degraded performance in fuel consumption of the DP-C method on the nonlinear model compared to the performance on the convex model is a direct consequence of the model approximations required to solve the convex problem in Equation (27).Nonetheless, the results obtained with the DP-C method are still quite acceptable.In terms of computation time, the DP-C method is more than 90% faster than the DP method in the cases where the full range of the SOC is discretized (NEDC, FTP, CADC), and it is 75% faster in the case with the reduced SOC discretization (CADC bounded).
In conclusion, even for the nonlinear model, the DP-C method is able to yield close-to-optimal results in much less time than the pure DP method.

Accuracy of the DP Method
In the case of the convex model, the above values for the fuel consumption calculated by DP are always worse than those obtained with the DP-C method.The accuracy of the DP solutions could generally be improved by increasing the resolution of the discretization of the SOC state.However, the improvements in accuracy are very small compared to the additional computational burden required.

Advantages and Disadvantages of the DP-C Method
A substantial advantage of the DP-C method is its ability to yield globally optimal results for the convex vehicle model much faster than the DP method.In the case of a nonlinear model, the performance of the DP-C method is very close to that of the DP method.Moreover, the equivalence factor related to the battery state is obtained directly from the convex solver without any error-prone calculation via the optimal cost-to-go from DP. Furthermore, more accurate results can be obtained with the DP-C method than with the DP method because no discretization of the SOC state is necessary.In addition, as opposed to the method presented in [41], DP-C is also able to handle state constraints, which means that it detects the jumps in the equivalence factor.
The only drawback of the DP-C method are the modeling errors introduced by the convex modeling of the powertrain.However, in the case of the HEV in this paper, the errors are small.

Conclusions
This paper presents a method to calculate the globally optimal energy management strategy for a parallel HEV on a given driving cycle taking into account penalties to avoid frequent engine start and/or gearshift events.The proposed method combines DP and convex optimization in an iterative scheme.The algorithm converges to the globally optimal solution after a few iterations.The optimal gearshift and engine on/off strategy is evaluated by DP while convex optimization is used to determine the optimal power split strategy.The proposed method delivers globally optimal results with respect to the convex vehicle model, even in the presence of state constraints.When compared to the basic DP algorithm, the proposed method results in a substantial reduction of the evaluation time.
For the convex vehicle model, the proposed method delivers a higher precision than DP (0.1%-0.2% lower fuel consumption) while incurring significantly less computational effort (75%-98% less).The higher precision is due to the fact that convex optimization does not require a discretization of the continuous control and state variables, which inherently introduces interpolation errors.To evaluate the magnitude of the error that is introduced by using convex model approximations, the strategy that was optimized for the convex model is applied to the nonlinear model.Compared to the true globally optimal solution obtained by applying DP on the nonlinear model, the proposed method results in a slight deterioration of precision only (up to 0.3% increased fuel consumption).
The proposed method can be extended to other vehicle topologies and different formulations of the energy management problem.For example, the battery state of health or the engine temperature could be considered by including further continuous state variables.Additional discrete state variables, such as for example the clutch state, as well as additional continuous and discrete control variables, such as for example the desired clutch state, the motor speed or torque of a series-parallel hybrid vehicle, could be included as well.

Figure 2 .
Figure 2. Fit of the convex motor model.

Figure 3 .
Figure 3. Fit of the convex engine model.

Figure 4 .
Figure 4. Comparison of the nonlinear and the approximated model for the open-circuit voltage of the battery.

Figure 5 .
Figure 5. Sequential application of dynamic programming (DP) and convex optimization.

Figure 6 .
Figure 6.Iteration scheme of the DP-C algorithm.

Figure 7 .
Figure 7.Comparison of the results obtained with DP and the DP-C algorithm on the New European Driving Cycle ((NEDC) driving cycle.

Figure 8 .
Figure 8.Comparison of the results obtained with DP and the DP-C algorithm on the Federal Test Procedure (FTP) driving cycle.

Figure 9 .
Figure 9.Comparison of the results obtained with DP and the DP-C algorithm on the Common Artemis Driving Cycle (CADC) driving cycle.

Figure 10 .
Figure 10.Comparison of the results obtained with DP and the DP-C algorithm on the CADC driving cycle with limited SOC range.

Table 1 .
Parameters of the pre-transmission parallel HEV executive class passenger car.SOC: state of charge.

Table 2 .
Results with the convex vehicle model.

Table 3 .
Results with the nonlinear vehicle model.