The Development of an Optimal Control Strategy for a Series Hydraulic Hybrid Vehicle

Department of Mechanical and Automation Engineering, Dayeh University, Changhua 51591, Taiwan; c.w.lclsea@gmail.com (C.-W.H.); vutrivien@tdt.edu.vn (T.-V.V.) * Correspondence: ckchen@mail.dyu.edu.tw; Tel.: +886-4-8511-888 (ext. 4107); Fax: +886-8-511-224 † These authors contributed equally to this work. ‡ Present address: Faculty of Electrical and Electronics Engineering, Ton Duc Thang University, HoChiMinh 700000, Vietnam


Introduction
The depletion of gasoline resources and environmental concerns have necessitated a more efficient transportation system.In 2009, more than 70% of US oil was consumed by the transportation sector, and light-duty vehicles accounted for 45% [1].Therefore, improving the efficiency of light-duty vehicle fleets is one of the most effective ways to reduce dependency on oil.
As a short-and mid-term solution, hybrid vehicles have become popular for researchers and manufacturers worldwide.In the recent past, studies have indicated that a further advantage can be realized from hydraulic hybrid vehicles.Hydraulic pump/motor (P/M) units achieve a higher magnitude of power density than that of an electric generator/motor [2].With the development of modern technology, hydraulic bladder accumulators can withstand high charging/discharging rates with above 95% round-trip efficiency [3].This feature allows hydraulic hybrid vehicles (HHVs) to achieve higher fuel economy by regenerative braking.However, the low energy density of a hydraulic accumulator means that the accumulator in a hydraulic hybrid vehicle can only provide energy to propel the wheels for a very short time.For a predefined driving cycle, the size of accumulator in a hydraulic hybrid system is normally calculated to be large enough to store as much vehicle braking energy as possible.It means that the system requires more space, and significant mass must be reserved for energy-storage components.Hence a compromise between total braking energy storage and the amount of mass is necessary.
In hybrid vehicles, the function of the control system is to determine how to coordinate the power sources to satisfy power demand and the dynamic constraints in the most convenient way.The main objective is to reduce the overall energy consumption.In this work, the power management system is divided into two.The first level or supervisory controller determines the desired reference value for engine torque and engine speed to achieve desired vehicle velocity.The second controller, the actuator, calculates the control signal of the amount of fuel mass injected into the cylinder of the engine and the displacements of hydraulic pump and pump/motor drive the components of the series hydraulic hybrid vehicle (SHHV) to meet their corresponding references.
Based on the selected control strategy, the main function of the supervisory controller is to coordinate multiple power sources in order to satisfy the power demand of the driveline with minimum fuel consumption in the most convenient way.In general, control strategies for hybrid propulsion systems can be classified into three categories: rule-based, semi-optimal, and global optimal.The rule-based control strategy is real-time implementable power management.This control strategy uses several rules that take account of the vehicle's load and maintain the engine within its most efficient operating range.Most papers regarding rule-based control strategy for HHV used the state-of-charge (SOC) of the accumulator as the sole state variable [4][5][6][7][8].
Dynamic Programming (DP) finds a global optimal solution for sequential or multi-stage decision problems.The algorithm searches for the optimal decision at discrete points in a time sequence using chosen cost functions.In this approach, the power split between the two energy sources is selected as the control variable and the total equivalent fuel consumption over a given driving cycle is defined as the cost function.DP has been shown to be a powerful tool for optimal control in automotive applications.It is used to optimize powertrain parameters and the gear shifting strategy in conventional vehicles [9][10][11] as well as determine the control variables that rely on torque splitting or power splitting in hybrid vehicles [6,8,12].DP also has been successfully used to replace the rule-based control strategy for a hybrid electric vehicle [13] or to optimize the rule-Based control strategy for a hydraulic-electric hybrid vehicle [14].In the previous study [15], we used rule-based control strategy for the first-level (supervisor) controller while Model Predictive Control (MPC) framework was applied for the second-level (actuator) one.
In this study, a physics-based forward-facing simulation model of series hydraulic hybrid vehicle for a 3.5-ton light-duty truck is first developed, then initial rule-based power management strategies-i.e., thermostatic and modulated-pressure-are applied for the proposed system under two different driving conditions.For further improvement of fuel economy, DP methodology is then applied to achieve a benchmark solution of the proposed system.DP provides better results than rule-based ones.However, the optimal control rules derived from DP cannot directly apply to the forward-facing system.Instead, results derived by DP are extracted to improve rule-based control strategies.The thresholds of the thermostatic strategy and the reference of the accumulator pressure of the modulated-pressure control strategy are modified by DP.The new rule-based strategies, referred to as DP-based thermostatic and modulated-pressure strategies, are applied to the supervisor controller for the proposed system.With the DP-based control strategies, further improvements in fuel economy are achieved.The simulation results provide evidence that the series hydraulic hybrid architecture is not only applicable for heavy-duty trucks but also for a light-duty ones.The results also indicate that the thermostatic strategy is suitable for urban driving conditions.As a conclusion, by applying suitable power management for certain driving conditions, the hydraulic hybrid architecture can help to achieve better fuel economy for both urban and highway driving conditions.When working under the urban driving cycle, Japan 1015, the proposed system can achieve an improvement of 71.53% using DP-based thermostatic power management strategy in comparison to the same 3.5-ton light-duty conventional truck.For a highway driving, such as the highway fuel economy test (HWFET) driving cycle, the proposed system also obtains an improvement of 56.78% using DP-based modulated power management strategy.

System Description
In a conventional vehicle, the engine provides power through the transmission and driveshaft to propel the wheels.In a series HHV, the engine is connected to a hydraulic pump/motor instead of connecting to the mechanical transmission.A schematic of a series hydraulic hybrid powertrain configuration for a rear-wheel-drive light-duty truck is shown in Figure 1.A variable displacement hydraulic pump is connected to an engine to convert mechanical power into hydraulic power.The power is stored in the accumulator or converted back to mechanical power using a variable displacement pump/motor.The pump/motor is a reversible energy conversion component.When working as a motor, it converts the hydraulic power into mechanical power to propel the wheels.When the brake pedal is depressed, the pump/motor works as a pump.The momentum of the vehicle is used to pressurize fluid from a low-pressure reservoir and energy is then stored in a high pressure accumulator.In this manner, the mechanical braking power is converted into hydraulic power.The braking energy is captured and stored in the accumulator in the form of high hydraulic pressure.The low-pressure reservoir provides fluid for the hydraulic system.power is stored in the accumulator or converted back to mechanical power using a variable displacement pump/motor.The pump/motor is a reversible energy conversion component.When working as a motor, it converts the hydraulic power into mechanical power to propel the wheels.
When the brake pedal is depressed, the pump/motor works as a pump.The momentum of the vehicle is used to pressurize fluid from a low-pressure reservoir and energy is then stored in a high pressure accumulator.In this manner, the mechanical braking power is converted into hydraulic power.The braking energy is captured and stored in the accumulator in the form of high hydraulic pressure.The low-pressure reservoir provides fluid for the hydraulic system.

Forward Simulation Model
Using the Physical Network approach of the Simscape modeling environment, SimHydraulics software is a modeling environment within Simulink and MATLAB 2014a (MathWorks, Massachusetts, MA, USA) for engineering design and for the simulation of hydraulic power and control systems.It contains a comprehensive library of hydraulic blocks that extend the Simscape libraries of basic hydraulic, electrical and one-dimensional translational and rotational mechanical elements and utility blocks.This toolbox allows a transient analysis of hydro-mechanical systems, specifically when modeling scenarios with hydraulic actuators as part of a control system.It is also appropriate for systems that allow lumped parameters.
In this work, a model of conventional vehicle with dual clutch transmission (DCT) [16] was used as a foundation for modeling the SHHV system.The model of the SHHV is extended from the conventional vehicle model.The dual clutch transmission is removed.The high-pressure accumulator, the low-pressure reservoir, the variable displacement pump and the variable displacement pump/motor are connected to establish a series hydraulic hybrid powertrain, as shown in Figure 2.

Forward Simulation Model
Using the Physical Network approach of the Simscape modeling environment, SimHydraulics software is a modeling environment within Simulink and MATLAB 2014a (MathWorks, Massachusetts, MA, USA) for engineering design and for the simulation of hydraulic power and control systems.It contains a comprehensive library of hydraulic blocks that extend the Simscape libraries of basic hydraulic, electrical and one-dimensional translational and rotational mechanical elements and utility blocks.This toolbox allows a transient analysis of hydro-mechanical systems, specifically when modeling scenarios with hydraulic actuators as part of a control system.It is also appropriate for systems that allow lumped parameters.
In this work, a model of conventional vehicle with dual clutch transmission (DCT) [16] was used as a foundation for modeling the SHHV system.The model of the SHHV is extended from the conventional vehicle model.The dual clutch transmission is removed.The high-pressure accumulator, the low-pressure reservoir, the variable displacement pump and the variable displacement pump/motor are connected to establish a series hydraulic hybrid powertrain, as shown in Figure 2.
The model is implemented in Simulink, using corresponding hydraulic, mechanical and other fundamental blocks from the Simscape toolbox.The key component blocks are a diesel engine, hydraulic variable pumps and gas-charged accumulators.The hydraulic valve subsystem comprises several pipes, and hydraulic directional valves.The vehicle model block has the longitudinal dynamics and tire models for a rear-wheel-drive (RWD) vehicle.The important parameters of the system are listed in Table 1.Details of system modeling and model validation can be found in the work of [17].The model is implemented in Simulink, using corresponding hydraulic, mechanical and other fundamental blocks from the Simscape toolbox.The key component blocks are a diesel engine, hydraulic variable pumps and gas-charged accumulators.The hydraulic valve subsystem comprises several pipes, and hydraulic directional valves.The vehicle model block has the longitudinal dynamics and tire models for a rear-wheel-drive (RWD) vehicle.The important parameters of the system are listed in Table 1.Details of system modeling and model validation can be found in the work of [17].

Driving Cycles
A common method that is used to describe the operating condition of a vehicle is a velocity versus time profile, also known as a drive cycle.These profiles are used in association with certain test procedures to validate the vehicle's performance, both for government regulation and for internal evaluation by manufacturers.
To evaluate the effectiveness of different sizes of hydraulic components and the performance of the system, two typical driving cycles, Japan 1015 and HWFET, were used as velocity reference trajectories.The respective speed profiles are presented in Figure 3a,b.The Japan 1015 cycle is used in Japan to test emissions and fuel economy for light-duty vehicles.This driving test has the typical stop-and-go patterns that are found in urban driving conditions and is an attractive application for HHVs.The HWFET, a chassis dynamometer-driving schedule that was developed by the United States Environmental Protection Agency, is used to determine the fuel economy of light-duty vehicles under highway conditions.
the system, two typical driving cycles, Japan 1015 and HWFET, were used as velocity reference trajectories.The respective speed profiles are presented in Figure 3a,b.The Japan 1015 cycle is used in Japan to test emissions and fuel economy for light-duty vehicles.This driving test has the typical stop-and-go patterns that are found in urban driving conditions and is an attractive application for HHVs.The HWFET, a chassis dynamometer-driving schedule that was developed by the United States Environmental Protection Agency, is used to determine the fuel economy of light-duty vehicles under highway conditions.

Control System Development
As has been mentioned, the function of the control system in a hybrid system is to determine how to coordinate the power sources to satisfy power demand and the dynamic constraints in the most convenient way.The main objective is to reduce the overall energy consumption.In this work, the power management system is divided into two levels as presented in Figure 4.The first level, or supervisory controller, determines the desired reference value for engine torque and engine speed to achieve the desired vehicle velocity.The second controller, the actuator controller, calculates the amount of fuel mass injected into the cylinder of the engine ans the displacements of hydraulic pump and pump/motor to drive the outputs of the SHHV to their corresponding references.

Control System Development
As has been mentioned, the function of the control system in a hybrid system is to determine how to coordinate the power sources to satisfy power demand and the dynamic constraints in the most convenient way.The main objective is to reduce the overall energy consumption.In this work, the power management system is divided into two levels as presented in Figure 4.The first level, or supervisory controller, determines the desired reference value for engine torque and engine speed to achieve the desired vehicle velocity.The second controller, the actuator controller, calculates the amount of fuel mass injected into the cylinder of the engine ans the displacements of hydraulic pump and pump/motor to drive the outputs of the SHHV to their corresponding references.In this work, the supervisory controller is first developed based on two rule-based control strategies, namely thermostatic and modulated-pressure, to generate the reference values of engine speed and engine torque.Furthermore, DP is applied to achieve optimal control trajectories of engine speed and engine torque for a given driving cycle.These optimal control trajectories will function as references for the actuator controller.

Thermostatic Control Strategy
The thermostatic control approach is traditionally utilized for series hybrid electric vehicles (HEVs) and adopted for HHVs [6,8].The concept of a thermostatic control algorithm is shown in Figure 5.In this approach, the engine operates at a constant output power-the threshold-when the pressure of the accumulator is within [pmin, pmax].If the pressure drops below pmin the engine produces greater power, as determined by a 1D-lookup table.Using the demand power, the engine speed and engine torque references are estimated.To prevent the unnecessary conversion of energy and to In this work, the supervisory controller is first developed based on two rule-based control strategies, namely thermostatic and modulated-pressure, to generate the reference values of engine speed and engine torque.Furthermore, DP is applied to achieve optimal control trajectories of engine speed and engine torque for a given driving cycle.These optimal control trajectories will function as references for the actuator controller.

Thermostatic Control Strategy
The thermostatic control approach is traditionally utilized for series hybrid electric vehicles (HEVs) and adopted for HHVs [6,8].The concept of a thermostatic control algorithm is shown in Figure 5.In this approach, the engine operates at a constant output power-the threshold-when the pressure of the accumulator is within [p min , p max ].If the pressure drops below p min the engine produces greater power, as determined by a 1D-lookup table.Using the demand power, the engine speed and engine torque references are estimated.To prevent the unnecessary conversion of energy and to capture braking energy effectively, the engine is only activated if the pressure in the accumulator is within [p min , p max ] and when the vehicle is not being decelerated.
In this work, the supervisory controller is first developed based on two rule-based control strategies, namely thermostatic and modulated-pressure, to generate the reference values of engine speed and engine torque.Furthermore, DP is applied to achieve optimal control trajectories of engine speed and engine torque for a given driving cycle.These optimal control trajectories will function as references for the actuator controller.

Thermostatic Control Strategy
The thermostatic control approach is traditionally utilized for series hybrid electric vehicles (HEVs) and adopted for HHVs [6,8].The concept of a thermostatic control algorithm is shown in Figure 5.In this approach, the engine operates at a constant output power-the threshold-when the pressure of the accumulator is within [pmin, pmax].If the pressure drops below pmin the engine produces greater power, as determined by a 1D-lookup table.Using the demand power, the engine speed and engine torque references are estimated.To prevent the unnecessary conversion of energy and to capture braking energy effectively, the engine is only activated if the pressure in the accumulator is within [pmin, pmax] and when the vehicle is not being decelerated.

Thermostatic Control Strategy
The term modulated-pressure originates from the need to maintain the pressure of the accumulator at a constant value to avoid the unnecessary energy conversion from the engine to the accumulator.[8] showed that by considering the reference value for the accumulator's pressure as a function of vehicle velocity demand, the modulated-pressure control strategy produces an improvement in fuel economy.A schematic for this control strategy is shown in Figure 6.Two PI controllers are used to estimate the power demand of the engine, to ensure that the vehicle's velocity and the accumulator pressure both track their corresponding reference values.

Thermostatic Control Strategy
The term modulated-pressure originates from the need to maintain the pressure of the accumulator at a constant value to avoid the unnecessary energy conversion from the engine to the accumulator.[8] showed that by considering the reference value for the accumulator's pressure as a function of vehicle velocity demand, the modulated-pressure control strategy produces an improvement in fuel economy.A schematic for this control strategy is shown in Figure 6.Two PI controllers are used to estimate the power demand of the engine, to ensure that the vehicle's velocity and the accumulator pressure both track their corresponding reference values.

The Fundamental Formulation of DP
Many textbooks and papers have been published on the subject of dynamic programming (DP) theory and its applications.DP is a powerful numerical method for solving optimal control problems.DP is a computational technique that extends the decision-making concept to sequences of decisions, which together define an optimal policy and trajectory.An optimal solution for the original problem can be found by using optimal solutions of sub-problems for each decision.The sub-problems are solved recursively in the same fashion.This method uses Bellman's Principle of Optimality [18]: "An optimal policy has the property that whatever the initial state and initial decision are, the remaining decisions must constitute an optimal policy with regard to the state resulting from the first decision."For a given system, DP is used to find the optimal control input that minimizes a certain cost function.The main advantage of DP is that the global optimal solution is guaranteed for any type of problem.However, the computational cost for DP grows exponentially with the number of state  theory and its applications.DP is a powerful numerical method for solving optimal control problems.DP is a computational technique that extends the decision-making concept to sequences of decisions, which together define an optimal policy and trajectory.An optimal solution for the original problem can be found by using optimal solutions of sub-problems for each decision.The sub-problems are solved recursively in the same fashion.This method uses Bellman's Principle of Optimality [18]: "An optimal policy has the property that whatever the initial state and initial decision are, the remaining decisions must constitute an optimal policy with regard to the state resulting from the first decision."For a given system, DP is used to find the optimal control input that minimizes a certain cost function.The main advantage of DP is that the global optimal solution is guaranteed for any type of problem.However, the computational cost for DP grows exponentially with the number of state variables and inputs for the underlying dynamic system.Bellman called this difficulty the "curse of dimensionality."The main drawback of this approach is that all disturbances must be known a priori.Therefore, DP is often not a useful method for the design of real-time control systems.However, DP is a very useful tool for providing an optimal performance benchmark.In the design of a causal real-time controller, this benchmark is then used to assess the quality of a controller.In some cases, the optimal non-realizable solution provides an insight into how the suboptimal but realizable control system should be designed.
In order to apply DP, the dynamic system is considered as a discrete-time system, which is expressed as: the dynamic states, x k P X k P IR n δ , the control inputs, u k P U k P IR m δ , and the disturbances, w k P D k P IR d δ , are discrete variables both in time (index k) and value.The control inputs, u k , are limited to the subset, U k , which depends on the values of the state variables, x k ; i.e., u k P U k px k q.As mentioned previously, the disturbance, w k , must be known in advance for all k P r0, N ´1s.To evaluate the fuel economy of a vehicle, the vehicle's speed profile is considered as a known disturbance.
The control variable, u, and the state variable, x, must be discretized in time and space, as shown in Figure 7.Because the computation time and the amount of memory required for DP grows exponentially with the number of states and control inputs, this number must be as small as possible.The selection of grid sizes for the states and the control inputs are a compromise between the representation of the system's dynamics and computation time.If the grid size is too large, the state does not change as the control changes.However, if the grid size is too small, DP requires a large number of computations.Based on the principle of optimality, the calculation of the DP can be done in reverse following algorithm: At k = N, if there is no constraint on the final state, If there is constraint on the final state, the final cost is defined as: Step N − 1: Step k, for 0 Since the system states are discretized into a finite set of all possible states, the cost-to-go Jx(xk) is Based on the principle of optimality, the calculation of the DP can be done in reverse following algorithm: At k = N, if there is no constraint on the final state, g N px N q " 0. If there is constraint on the final state, the final cost is defined as: Step N ´1: Appl.Sci.2016, 6, 93 8 of 18 Step k, for 0 ď k ă N ´1: Since the system states are discretized into a finite set of all possible states, the cost-to-go J x (x k ) is also defined for these possible states.To solve this problem, the value of J k (x) can be interpolated from the nearest discretized state value of the cost-to-go J k+1 by following equation:

The Backward Model for the SHHV
For a given driving cycle, the power demand is a known priority.A DP algorithm is to find the optimal power-splitting factor that distributes the demand for power to the engine and the accumulator in order to minimize the total fuel consumption for the engine.To evaluate the fuel consumption of the engine for a given driving cycle using DP, a backward model of the SHHV is developed in this section.The diagram for the backward model of the proposed SHHV is shown in Figure 8.
Step N − 1: Step k, for 0 Since the system states are discretized into a finite set of all possible states, the cost-to-go Jx(xk) is also defined for these possible states.To solve this problem, the value of Jk(x) can be interpolated from the nearest discretized state value of the cost-to-go Jk+1 by following equation: (5)

The Backward Model for the SHHV
For a given driving cycle, the power demand is a known priority.A DP algorithm is to find the optimal power-splitting factor that distributes the demand for power to the engine and the accumulator in order to minimize the total fuel consumption for the engine.To evaluate the fuel consumption of the engine for a given driving cycle using DP, a backward model of the SHHV is developed in this section.The diagram for the backward model of the proposed SHHV is shown in Figure 8.The demand for power to propel the vehicle is estimated using the following equation: The pump/motor power is determined by: P p{m " # P w {η p{m if P w ą 0 pmotor modeq P w η p{m if P w ă 0 ppump modeq (7) where η p{m is the total efficiency of the pump/motor which is based on the Pourmovahed model [19].
During the propulsion phase, the requested power, Pp/m, can be provided only by either the engine or the accumulator.For convenience, a power-splitting factor, u P r´8, 1s, is used to illustrate.It is obvious that for a given power demand, the power-splitting factor is arbitrary within a range that depends on the engine power rate.The power flows and the relationship for the power-splitting factor is briefly described in Table 2. Using power-splitting factor as a control input, the engine power is P p " p1 ´uq P p{m P e " P p {η p (8) Using the experimental data for the engine, as shown in Figure 9, the engine speed and the engine torque are interpolated for a given power demand as follows: ω e " l 2 pP e ´l1 q 3 `l3 pP e ´l1 q 2 `l4 P e `l5 T e " π 30 where l 1 = 37.75, l 2 = 0.0311, l 3 = 0.044, l 4 = 52.83,and l 5 = 387.5 are the fitting coefficients.Using the experimental data for the engine, as shown in Figure 9, the engine speed and the engine torque are interpolated for a given power demand as follows:  At each particular operating point, the fuel consumption for the engine is approximately estimated by: .m f " k 00 `k10 ω e `k20 ω 2 e `k11 ω e T e `k01 T e `k02 Since DP is used to minimize the total fuel consumption over the duty cycle, the objective function is represented by: where T is the length of the duty cycle's time.
The power provided/absorbed by the accumulator is given by: P a " uP p{m (12) In terms of the accumulator power, the dynamic for pressure in the accumulator is The state-of-charge (SOC) of the accumulator and not the pressure represents the current available energy in a hydraulic accumulator.The SOC for an accumulator is defined by: SOC " p ´pmin p max ´pmin (14) where p is the current pressure, and p min and p max are the minimum and maximum working pressures for the accumulator.When the SOC = 0, there is no available energy in the accumulator and when the SOC = 1, the accumulator is fully charged.Selecting the SOC of the accumulator as a state variable, the dynamics for the system are: .

The Optimization of Energy Management Using DP
In order to apply DP algorithm, the continuous dynamic equations must by discretized.If the duty time, T, is divided into N equals increments, T s , the dynamic equation, Equation ( 15) is approximated by the following difference equation: x pk `1q " x pkq `1.4p ´1{1.4 pr ppp max ´pmin q x pkq `pmin q 1{1.4 P a pkq V a pp max ´pmin q T s (16) where x(k) is the state of charge for the accumulator at time k.The integral part of the cost function is approximated by the summation: where ∆m f,k is the fuel consumption, which is assumed to be constant during the time increment T s , for the engine at time k.The static equations, from Equation ( 6) to Equation ( 9), are represented in the form of equality constraints: P w pkq " ˆm v pk `1q ´v pkq T s `mgsinθ `m f r gcosθ `0.5ρC d A f v 2 pkq ˙v pkq P p{m pkq " P w pkq {η p{m P p pkq " p1 ´uk q P p{m pkq {η p{m P e pkq " P p pkq {η p ω e pkq " l 2 pP e pkq ´l1 q 3 `l3 pP e pkq ´l1 q 2 `l4 P e pkq `l5 T e pkq " π 30 1000P e pkq ω e pkq ∆m f ,k " k 00 `k10 ω e pkq `k20 ω 2 e pkq `k11 ω e pkq T e pkq `k01 T e pkq `k02 T 2 e pkq P a pkq " u k P w pkq (18) The physical constraints are given below and if these constraints are violated, an infinite penalty is given to the cost function.
The DP algorithm can be implemented backwards or forwards, but backward implementation is more convenient and saves computation time and memory space.Therefore, in this work, a normal backward DP is used.Initially, for a given drive cycle, the duty time, 0 ď t ď T, is divided into N equal increments, T s .DP is implemented from k " N ´1 backward to k " 0. At k " N, a soft constraint is applied to penalize any deviation in the final SOC value from the initial SOC, in order to ensure that the net energy that is stored in the accumulator is zero.Therefore, a comparison of fuel economy of the SHHV with the conventional vehicle can be made: where α is a penalty weight.At each time step k pk " N ´1, . . ., 0q, all of the allowable power-splitting factor, u, is applied for each of the allowable state values to find the optimal policy, which minimizes the fuel consumption rate of the engine for a given last stage of operation.To limit the number of calculations, the allowable state and the control values must be quantized as listed in Table 3. Step N ´1: Step k, for 0 ď k ď N ´1: where ∆m f ,k px k , u k q is the fuel consumption for the engine at time step k.
The simulation results for the use of DP for the discretized SHHV model in the Japan 1015 driving cycle are shown from Figure 10 to Figure 12.The lower part of Figure 10 shows the optimal SOC trajectories that correspond to different initial conditions of the accumulator.The constraint on the final value of the SOC ensures that there is no pre-charged energy in the accumulator during the test and that the engine does not provide any unused energy.
For each initial condition of the accumulator, the system demonstrates different fuel consumption figures, as shown in Figure 11.When the initial value for the SOC is small, the assistance provided by the accumulator is small.In this case, the engine must operate at high rate of power that is not the optimal region.When the value for the SOC is too high, there is not enough space to store braking energy.A certain amount of braking energy cannot be recovered.The fuel consumption for the system is increased.The figure shows that the lowest fuel consumption for the system is 21.5 km/L if the initial value of the SOC is about 90%.
The behavior of the SHHV system during the 480 to 660 s intervals for the Japan 1015 driving cycle is shown in Figure 12.It is seen that the vehicle follows the driving cycle with a reasonable deviation in speed.The negative portion of the pump/motor displacement factor indicates that the component is in the motor mode and the positive one indicates that the component is in the pump mode.The negative portion of the accumulator power profile indicates that the accumulator is charged and the positive portion of the accumulator power profile indicates that the accumulator delivers the power to propel the wheels.
cycle is shown in Figure 12.It is seen that the vehicle follows the driving cycle with a reasonable deviation in speed.The negative portion of the pump/motor displacement factor indicates that the component is in the motor mode and the positive one indicates that the component is in the pump mode.The negative portion of the accumulator power profile indicates that the accumulator is charged and the positive portion of the accumulator power profile indicates that the accumulator delivers the power to propel the wheels.deviation in speed.The negative portion of the pump/motor displacement factor indicates that the component is in the motor mode and the positive one indicates that the component is in the pump mode.The negative portion of the accumulator power profile indicates that the accumulator is charged and the positive portion of the accumulator power profile indicates that the accumulator delivers the power to propel the wheels.Interestingly, when the SOC approaches zero, it means that the stored energy is fully used at each vehicle start and the accumulator has enough space for effective regenerative braking.All of the braking energy throughout the driving test schedule is captured and reused.Avoiding transient operation of the engine and taking into account the low energy density characteristic, the accumulator is only used to absorb excess engine power during the tracking phase, although there is no direct charging.Therefore, the power-splitting factor remains between zero and one.Interestingly, when the SOC approaches zero, it means that the stored energy is fully used at each vehicle start and the accumulator has enough space for effective regenerative braking.All of the braking energy throughout the driving test schedule is captured and reused.Avoiding transient operation of the engine and taking into account the low energy density characteristic, the accumulator is only used to absorb excess engine power during the tracking phase, although there is no direct charging.Therefore, the power-splitting factor remains between zero and one.
For HWFET, the optimal trajectory of the accumulator pressure is shown in Figure 13.The results demonstrate that using a DP algorithm, the accumulator is always at high pressure right before any acceleration and the energy is consumed before hard braking.The dynamic performance of the system is improved and the braking energy is captured and reused effectively.Interestingly, when the SOC approaches zero, it means that the stored energy is fully used at each vehicle start and the accumulator has enough space for effective regenerative braking.All of the braking energy throughout the driving test schedule is captured and reused.Avoiding transient operation of the engine and taking into account the low energy density characteristic, the accumulator is only used to absorb excess engine power during the tracking phase, although there is no direct charging.Therefore, the power-splitting factor remains between zero and one.
For HWFET, the optimal trajectory of the accumulator pressure is shown in Figure 13.The results demonstrate that using a DP algorithm, the accumulator is always at high pressure right before any acceleration and the energy is consumed before hard braking.The dynamic performance of the system is improved and the braking energy is captured and reused effectively.In this work, when applying DP, the simulation result indicated that working along the engine minimum BSFC line is the optimal solution as shown in Figure 14.Besides the duration of the engine In this work, when applying DP, the simulation result indicated that working along the engine minimum BSFC line is the optimal solution as shown in Figure 14.Besides the duration of the engine at a certain operating point and the engine speed are the two important factors that affect the system efficiency.at a certain operating point and the engine speed are the two important factors that affect the system efficiency.

The Evaluation of Improvement in Fuel Economy
The fuel economy of a vehicle is defined as the volumetric fuel consumption per traveled

The DP-Based Control Strategy
Although the optimal control inputs derived by applying DP cannot to be applied to the forward-facing simulation system, they can be functioned as the references.When using a rule-based control strategy, such as thermostatic and modulated-pressure strategies, the reference values of engine torque and engine speed are derived based on the accumulator pressure.
Under the Japan 1015 cycle, the results for DP show that the optimal power threshold is about 20 kW and lower and upper threshold pressures are 150 bar and 270 bar, respectively.These values were used as the thresholds to derive the improved thermostatic control strategy.The difference between the initial and the DP-based thermostatic strategies can be seen from the trajectories for the pressure in the accumulator, as shown in Figure 16.

The DP-Based Control Strategy
Although the optimal control inputs derived by applying DP cannot to be applied to the forward-facing simulation system, they can be functioned as the references.When using a rule-based control strategy, such as thermostatic and modulated-pressure strategies, the reference values of engine torque and engine speed are derived based on the accumulator pressure.
Under the Japan 1015 cycle, the results for DP show that the optimal power threshold is about 20 kW and lower and upper threshold pressures are 150 bar and 270 bar, respectively.These values were used as the thresholds to derive the improved thermostatic control strategy.The difference between the initial and the DP-based thermostatic strategies can be seen from the trajectories for the pressure in the accumulator, as shown in Figure 16.

The DP-Based Control Strategy
Although the optimal control inputs derived by applying DP cannot to be applied to the forward-facing simulation system, they can be functioned as the references.When using a rule-based control strategy, such as thermostatic and modulated-pressure strategies, the reference values of engine torque and engine speed are derived based on the accumulator pressure.
Under the Japan 1015 cycle, the results for DP show that the optimal power threshold is about 20 kW and lower and upper threshold pressures are 150 bar and 270 bar, respectively.These values were used as the thresholds to derive the improved thermostatic control strategy.The difference between the initial and the DP-based thermostatic strategies can be seen from the trajectories for the pressure in the accumulator, as shown in Figure 16.For HWFET, the effectiveness of different estimations of the reference pressure is shown in Figure 17.It is seen that when the pressure remains constant, the accumulator has a reduced capacity as an energy buffer.The output power of the engine fluctuates more significantly because of the variation in the power demand.For the modulated-pressure strategy, the accumulator is used as an energy buffer.Therefore, the engine operates at its highest efficiency Figure 18.For HWFET, the effectiveness of different estimations of the reference pressure is shown in Figure 17.It is seen that when the pressure remains constant, the accumulator has a reduced capacity as an energy buffer.The output power of the engine fluctuates more significantly because of the variation in the power demand.For the modulated-pressure strategy, the accumulator is used as an energy buffer.Therefore, the engine operates at its highest efficiency Figure 18.Using DP, an optimal benchmark solution was estimated.The fuel economy figures for the proposed SHHV using DP and DP-based control strategies are listed in Table 4. Using DP, the proposed system shows a respective improvement of 98.91% and 68.79% over a conventional vehicle for the Japan 1015 and HWFET driving cycles.These gains come from effective regenerative braking and highly efficient operation of the components.However, because driving cycles must be known a priori to allow DP, the optimal control rules that are derived from DP cannot be directly applied to the forward-facing system.Instead, the DP results must be studied and used to improve the initial rule-based control strategies.
The results for the use of DP show that the optimal power threshold is around a value of 20 kW and the optimal range of pressure in the accumulator is from 150 bar to 270 bar.These values were used as the parameters for the thermostatic control strategy, which is DP-based.The respective improvements in the fuel economy for the system are 71.53% and 56.78%, for Japan 1015 and HWFET.

Conclusions
DP optimization is used to estimate the global optimal power-splitting factor for the proposed system for specific driving cycles.This result provides a benchmark solution to assess the quality of the control system that is developed.It gives an insight into how the suboptimal but realizable control system should be designed.
The results derived by Dynamic Programming (DP) are used to improve rule-based control strategies.For the DP-based control strategy, further improvement in fuel economy is achieved.The gains come from effective braking energy recovery, highly efficient operation of components and optimized engine operation.
In the future, since the power-splitting device was not available for the SHHV, the engine speed, engine-pump displacement and pump/motor displacement will have to be used as control variables to give a more accurate solution.

Figure 1 .
Figure 1.Schematic and Control Signal Paths of Series Hydraulic Hybrid Vehicle (SHHV).

Figure 1 .
Figure 1.Schematic and Control Signal Paths of Series Hydraulic Hybrid Vehicle (SHHV).

Figure 4 .
Figure 4. Hierarchical Control System of the SHHV.

Figure 4 .
Figure 4. Hierarchical Control System of the SHHV.

3. 3 .
Dynamic Programming (DP) Optimal Control Strategy with DP Application 3.3.1.The Fundamental Formulation of DP Many textbooks and papers have been published on the subject of dynamic programming (DP)

Figure 7 .
Figure 7. State and Control Input Discretization in Time and Space.

Figure 7 .
Figure 7. State and Control Input Discretization in Time and Space.

Figure 9 .
Figure 9. Engine Torque and Engine Speed Estimation.

Figure 11 .
Figure 11.Relationship between Fuel Economy and Initial SOC.

Figure 11 .
Figure 11.Relationship between Fuel Economy and Initial SOC.

Figure 11 .
Figure 11.Relationship between Fuel Economy and Initial SOC.

Figure 17 .
Figure 17.Pressure Trajectories with Primary and DP-Based Modulated-Pressure Strategies.

Figure 17 .
Figure 17.Pressure Trajectories with Primary and DP-Based Modulated-Pressure Strategies.

Figure 17 .
Figure 17.Pressure Trajectories with Primary and DP-Based Modulated-Pressure Strategies.

Figure 18 .Figure 18 .
Figure 18.Number of Occurrence of Engine Operating Points on BSFC map.

Table 2 .
Power-splitting factor and power flows coordinating.

Table 2 .
Power-splitting factor and power flows coordinating.

Table 3 .
State and Control Input Discretization.

Table 4 .
Fuel Economy Estimation with Dynamic Programming (DP) and DP-Based Control Schemes.