Next Article in Journal
New Molecular Materials for Direct Air Capture of Carbon Dioxide Using Electro-Swing Chemistry
Previous Article in Journal
The New Robotic Platform Hugo™ RAS for Colorectal Surgery: A Single-Center Initial Experience
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research on Energy Management Strategy for Range-Extended Electric Vehicles Based on Eco-Driving Speed

1
Transportation College, Jilin University, Changchun 130022, China
2
Materials Science and Engineering College, Jilin University, Changchun 130022, China
3
National Key Laboratory of Automotive Chassis Integration and Bionics, Automotive Engineering College, Jilin University, Changchun 130022, China
4
Army Academy of Aromred Forces, Sergeant School, Student Army Second Brigade, Changchun 130117, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(23), 12738; https://doi.org/10.3390/app152312738 (registering DOI)
Submission received: 18 October 2025 / Revised: 25 November 2025 / Accepted: 29 November 2025 / Published: 2 December 2025

Abstract

To achieve the optimal energy allocation between the auxiliary power unit (APU) and battery of connected automated range-extended electric vehicle (CAR-EEV), the hierarchical eco-driving control with dynamic game energy management were investigated and the optimization design of APU working mode was carried out from a multi-objective perspective. Initially, the acceleration and speed of the host vehicle were adjusted in real time, based on the driving status of the preceding vehicle, and the ecological driving speed was obtained in the adaptive car-following eco-driving mode. The dynamic game energy management strategy was proposed, leveraging the real-time interactive information between the vehicle and the traffic environment, and intelligently allocating and scheduling the energy flow within the powertrain. Dynamic game optimization was adopted to achieve dynamic decision-making and control optimization on whether to switch the APU operating speed or not. The multi-objective optimization analyses are carried out based on the weight coefficient matrix. The hierarchical dynamic game energy management strategy based on eco-driving speed (HDGEMS) is implemented through dynamic games and exhibits excellent performance. This strategy enables dynamic adjustment of power distribution between the APU and the battery, thereby allowing the APU to operate efficiently under optimal operating conditions. Meanwhile, it effectively reduces secondary charging losses and the dynamic switching time of the APU, and ultimately achieves energy optimization. Eventually, the results of simulation and experimental thoroughly indicated that economy improvement, emission reduction, and battery life enhancement of CAR-EEV were effectively kept in balance under the control of the proposed HDGEMS with intelligent optimization mode. New research ideas and technical directions are provided for the field of EMS, which is expected to promote technological progress in the industry.

1. Introduction

Connected automated range-extended electric vehicles (CAR-EEVs) equipped with intelligent information systems have introduced complexities in their energy management strategies (EMSs) [1]. To address these challenges, extensive research is needed to develop a robust EMS capable of effectively managing multiple energy sources within the hybrid system for reliable power distribution. For the hierarchical eco-driving control, the adaptive car-following eco-driving strategy is an intelligent driving control approach, aiming to enhance driving safety and reduce energy consumption. The dynamic game energy management strategy is a more advanced energy scheduling method that utilizes the real-time interaction information between the vehicle and the traffic environment to intelligently allocate and schedule the energy flow of the power system [2]. For CAR-EEVs, the design of the APU operating mode and the multi-objective optimization (MOO) are the keys to achieving energy savings and emission reductions [3]. Therefore, through the optimized design of the APU operating mode/area, the influence of different operating modes of the APU system on the energy consumption, emissions, and battery health is studied from a multi-objective perspective, which is important for implementing the optimal energy distribution and realizing the comprehensive control effect of robust control in the multi-source V2X information environment [4].

1.1. Review of Literature

The CAR-EEV’s unique configurations with mechanical decoupling, flexible oil–electric distribution, and intelligent information system on board requires designing an optimal EMS for APU mode with V2X multi-source information [5]. APU operating more at the optimal point/area and minimizing rotational speed fluctuations can achieve optimal energy distribution via CAR-EEV advantages, enhancing overall control performance in energy consumption, emissions, and battery health. However, optimized parameters may only match specific driving cycle conditions and possess limited generalization ability [6,7].
In the eco-driving strategy, the fuel economy of CAR-EEV correlates closely with speed distribution as it directly impacts total energy consumption. The latest research shows that combining eco-driving and speed planning can effectively reduce energy consumption [8]. Deep learning-based driving behavior prediction technology has demonstrated strong potential, with driver habits accurately captured to optimize strategies; meanwhile, breakthroughs have been achieved in advanced battery management systems in improving the accuracy of battery life prediction, providing a more reliable basis for energy management. An eco-driving-based dynamic speed planning algorithm was proposed which can quickly control the speed within the speed limit range and enable the car to obtain relatively low energy consumption [9]. In urban areas with continuous signalized intersections, computing a reference speed curve to avoid worthless idling enhances energy savings. Model predictive control (MPC) was used to calculate reference speed in dense traffic, in which vehicles are forced to follow the car in front [10]. CAR-EEV fuel-saving performance hinges on speed distribution, making reasonable speed planning essential for the overall eco-driving strategy. In the domain of eco-driving strategies, hierarchical strategies are relatively common. The hierarchical optimization framework decomposed the hybrid optimization problem into discrete and continuous ones. A multi-step optimal-based decision-making architecture is developed to compute optimal acceleration profiles for adaptive cruise control in V2X-enabled environments [11]. Although studies on eco-driving and driving styles have been conducted, relatively little research exists regarding the relationship between drivers’ driving styles and vehicle eco-driving optimization, especially for eco-driving speeds on long-distance urban roads [12]. From the above analysis, it is evident that in the eco-driving strategy, the fuel economy of CAR-EEV is closely associated with speed distribution. The speed distribution, which determines the total energy demand, is a crucial factor in the strategy [13]. Moreover, as environmental information and driving styles impact the eco-driving strategy, reasonable speed planning should be incorporated.
Many studies focus on macroscopic traffic management and control, using dynamic game algorithms for energy management strategies and multi-objective optimization to enhance vehicle performance on complex roads. As vehicle networking and traffic info interact, establishing a V2X-based game decision method to solve control conflicts is a focus [14]. Game theory has been widely applied in the fields of intelligent transportation and vehicle control to investigate interactions among multiple intelligent agents [15]. In CAR-EEV energy management research, it is extensively applied to travel characteristics, route selection, power distribution, and mode switching. Game theory excels in explaining small spatiotemporal-scale–agent interactions, culminating in more studies in the CAR-EEV field. Fuzzy Markov chains in a game model predictive controller was introduced to predicted candidate vehicle motion [16]. Moreover, the optimization of EMS performance is a typical MOO problem, requiring independent and balanced results [17,18,19]. With the interaction of connected vehicles and traffic environment information, establishing a V2X-based game decision-making method to solve multi-objective control conflicts has become a research hotspot. The proposed EMS utilizes the CAR-EEV configuration to attain optimal energy distribution (minimizing large-span dynamic fluctuations in rotational speed and maximizing operation at the optimal point/area). Besides minimizing system energy consumption and emissions, extending battery life is another optimization goal. Because previous empirical or semi-empirical battery life models were limited to specific business systems and lacked transferability, most battery anti-aging EMSs with discrete solution processes are difficult to implement in vehicle control. A clear SoC reference trajectory is essential for planning battery power distribution to approach the global optimal [20].
Despite the numerous existing studies on eco-driving, driving styles, and energy management strategies, deficiencies in current research still persist. A hierarchical dynamic game energy-management strategy based on eco-driving speed (HDGEMS, EMShe-dg) was investigated and the optimization design of the APU working mode was carried out from a multi-objective perspective, as proposed. At the upper level, environmental and driver factors directly influence driver behavior and decision-making, contributing to the inability to ensure the accuracy and safety of the eco-driving strategy. Thus, driver behavior characteristics are integrated into the car-following model from environmental and personal aspects, to fulfill the speed planning task. Considering the fuzziness of the reinforcement learning–reward function, imitation learning is adopted as the framework of the car-following model to imitate expert strategies for eco-driving optimization. At the lower level, the achievement of the APU multi-point working mode with dynamic game is focused. This method integrates imitation learning and dynamic game, with the expectation of significantly improving vehicle performance under complex road conditions.

1.2. Motivation and Innovation

In this paper, a dynamic game optimization method based on V2X information is proposed. Promisingly, the proposed EMShe-dg with dynamic game optimization exhibited a significant multi-objective performance comprehensive balancing ability. The main contributions of this paper are highlighted in the following points. (1) Considering the impact of environmental and driver factors on the accuracy and safety of ecological driving strategies, driver behavior characteristics are integrated into the following model from both environmental and personal perspectives to complete the speed planning task of ecological driving strategies. (2) The APU multi-point working mode with dynamic game is designed and a dynamic game method with future driving prediction is used for APU operating speed-switching working, which is an intelligent decision-making mode that considers V2X information to achieve real-time driving feature identification and quantitative information. (3) A general framework of MOO is proposed for EMShe-dg, with optimization of battery charge/discharge planning and APU working mode meeingt the MOO requirements of energy conservation, emission reduction, and battery lifespan enhancement. Compared with existing methods, the profit structure of the proposed HDGEMS game model in this study not only takes into account the economic benefits of energy allocation, but also innovatively integrates riding comfort indicators, making energy allocation decisions more aligned with actual driving needs, thereby providing a more accurate basis for power distribution between the APU and the battery and further optimizing energy allocation effects; additionally, the practical scope of imitation learning has been effectively expanded in this study, and by introducing richer historical driving data and real-time feedback information, the model can better adapt to different driving scenarios, achieving improved performance in energy conservation and emission reduction while battery life has been better prolonged.

2. Materials and Methods

The factors that affect car following can be mainly divided into two aspects. The first is the influence of the environment itself on driver behavior, and the second is the inherent influence of individual driver behavior habits on the following strategy.

2.1. Theoretical Reference

For imitation learning methods, appropriate reward functions do not need to be manually set. Consequently, in complex and ill-defined environments, they outperform reinforcement learning methods. In actual tasks, there may be multiple intentions when drivers are following a vehicle; different drivers have different driving habits, such as aggressive and mild type, which will then lead to difficulties in choosing a reward function. To solve these problems, an adaptive learning strategy and imitation learning method was proposed, which fuses driving styles and enabled the conduction of further research and improvement. Figure 1 illustrates the algorithm flowchart of the adaptive car-following strategy based on imitation learning in this paper.
This section proposes the following model with an adaptive factor for the inherent phenomenon of environmental influence on driver behavior. The main idea is to construct a framework of a GAN network with adaptability based on the basic principle of the GAN network: that is, instead of estimating the reward function, directly learn the strategy of the following model. The formula is as shown in (1):
min π max D ( 0 , 1 ) s × A E π [ log D ( s , a ) + E π E [ log ( 1 D ( s , a ) ] λ H ( π ) H ( π ) ¯ ¯ E π [ log ( α s ) ]
where πE is the expert strategy; π is the strategy that needs to be learned and trained; D is used as a discriminator to distinguish state-action pairs; and H(π) represents causal entropy.
The adaptive strategy is model-free, but needs to be interactively trained with the environment. It treats the environment as a black box and is end-to-end differentiable. Due to internal potential variation factors such as expert level and preferences under different strategies, the generated trajectories will also have significant changes among different individuals. Even the same person when facing the same situation will make different decisions, as well, which leads to the generation of multiple different strategies. In this scenario, construct an expert strategy set: π E = π E 0 , π E 1 , } , and redefine the generation process s 0 ρ 0 c p c , π p ( π c ) 0 a t π a t s t , s t + 1 P s t + 1 a t , s t of expert trajectories τ E . In this process, an adaptive parameter c is defined, and p ( c ) is the prior probability distribution of   c . The goal of this algorithm is to recover the strategy π ( a s , c ) under the adaptive parameter c .
In order to make the adaptive factor more closely fit with the strategy, achieve a degree of interdependence, and make the adaptive factor and the strategy have greater correlation, mutual information is added to the optimization function.
I c ; τ = H ( c ) + E c p ( c ) , a π ( s , c ) [ log p ( c τ ) ]
where I c ; τ is the mutual information; H(c) is the entropy of the adaptive parameter c; p(c) is the prior probability distribution of the adaptive parameter c; π(·|s,c) is the probability distribution of actions given the state s and parameter c; τ is the expert trajectory; p(c|τ) is the posterior probability distribution of c given the expert trajectory τ; and E is the mathematical expectation.
Then add the adaptive factors to the optimization objective function to obtain
m i n θ ψ   m a x ω   E π θ log D ω s , a + E π E log 1 D ω s , a λ 1 I c ; τ λ 2 H π θ
where θ     a n d   ψ are the relevant parameters of the strategy to be learned; ω is the discriminator parameter; D ω s , a   is the discriminator function with parameter ω ; π θ is the strategy to be learned; and λ 1 and λ 2 are the weight coefficients.
This can make the mixed trajectories generated by experts more clearly organized, but in the calculation process the posterior probability p ( c | τ ) is difficult to calculate. Here the Q value is directly used instead; that is
L 1 ( π , Q ) I ( c ; τ ) min θ _ ψ   max ω   E π θ [ log D ω ( s , a ) ] + E [ log ( 1 D ω ( s , a ) ] λ 1 L 1 ( π , Q ) λ 2 H ( π θ )
where L1(π,Q) is a metric related to the policy π and the Q-value (since the posterior probability p(c|τ) is difficult to calculate, it is used to approximate the mutual information I c ; τ ).
During vehicle driving, one should not only consider the influence of the driving environment on the model, but should also satisfy the comfort and safety of the driver in the car-following task. In this section, combined with vehicle dynamics itself, the car-following task will be optimized by using driver-style characteristics. Establishing a personalized car-following model requires driving data that can reflect driving styles. For this objective, the data needs to be efficiently classified in accordance with various driving styles. Combined with the inter-vehicle distance and the speed of the proxy vehicle, the reciprocal of time-to-collision (TTCI) is defined as follows:
T T C I = v d
where v represents the speed difference between the following vehicle and the leading vehicle and d represents the vehicle distance.
A method of fusing driver style into the following strategy was proposed to make the following model have personalized characteristics and improve the overall following task. This method can effectively solve the multi-objective optimization problem in the following task, mainly including the error between the actual and the expected vehicle distance, reflecting driving style; the relative speed maintaining the following behavior and with the leading vehicle; and the acceleration and deceleration for ensuring comfort. The trajectory data of different driving styles (conservative and aggressive) obtained in the previous section indicate that the differences between the two are reflected in behavioral indicators such as time interval, collision time, speed, and acceleration, which can be used to model and quantify driving styles. In typical follow-up scenarios, drivers of the same style often have similar time intervals, while drivers of different styles have significant differences. The formula for vehicle distance can be expressed as
d d e s = v f t h + d 0
where v f is the driving speed of the following vehicle and t h represents time progress. In a conservative driving style, t h = 3.16 s ; in an aggressive driving style, t h = 1.92 s . d 0 represents the safe distance when the car is driving at a very low speed.
According to the basic knowledge of the longitudinal dynamics of the front and rear vehicles, it can be deduced that
x ( k + 1 ) = A ( k ) + B u ( k ) y ( k ) = C x ( k ) ( k ) = [ Δ d ( k ) Δ v ( k ) v ( k ) ] u ( k ) = a ( k ) A = 1 Δ t 0 0 1 0 0 0 1 B = 0.5 Δ t 2 Δ t Δ t
where k represents the k -th time point; t is the sampling interval of 0.1 s; v represents the speed of the leading vehicle; and a represents the acceleration of the leading vehicle.
The goal of the vehicle following is for the driver to adjust the vehicle’s speed to that of the leading vehicle, and keep the vehicle distance close to the expected value: that is, d e r r k 0 ; v k 0 , where ∆derr(k) represents the vehicle distance error and is defined as
d e r r k = d d d e s
For the purpose of providing passengers with comfort during vehicle following, the absolute values of a and j k must also be as small as possible; that is, a ( k ) 0 ,   j ( k ) 0 . j k represents the derivative of a , and its formula is as follows:
j k = a k a k 1 t
Solving optimization problems requires satisfying a series of constraints. Firstly, to prevent collision with the preceding vehicle, the distance between vehicles must not be less than the minimum value; secondly, to ensure that subsequent vehicles are in a suitable state, the distance should not exceed the maximum following distance; and finally, the values of velocity a and j(k) should also be limited between the minimum and maximum values. To construct a manageable optimization problem, the cost function can be defined as follows:
J ξ k , u k 1 , Δ u k =   i = 1 N P     η k + i k η ref   k + i k Q 2   + i = 1 N c 1     Δ u k + i k R 2
where η r e f ( k + i k ) represents the reference vector; Q and   R are the weighting matrices for control and objectives; and x k + i k and u ( k + i k ) are the open-loop predicted state and control quantity at time point k. In the following task, it is defined as follows:
η r e f k + i k   = Δ d d e s k + i k 0 0 0 T , Δ u k + i k   = j k + i k Δ t Q   = 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1
where R represents a one-dimensional vector with a value of 10.
The optimization objective and constraint conditions of model predictive control are
s . t .   d 0 Δ d k d m a x , v m i n v k v m a x , a m i n a k a m a x , j m i n j k j m a x
Considering driving style, the aggressive strategy should have safeness and comfortableness; the conservative driving strategy should have greater safeness and comfortableness. The final cost function is defined as
F f o l l o w i n g = m i n θ ψ   m a x ω   E π θ log D ω s , a E π E D ω s , a λ 0 η π θ λ I L I π θ , Q ϑ λ 2 H π θ λ 3 J ξ k , u k 1 , Δ u k
Regarding the composition of the dataset, driving data from 100 drivers of different regions, age groups, and driving experiences has been collected. The driving duration covers multiple scenarios: short trips (10–30 min), medium trips (30–60 min), and long trips (over 60 min), with a total of 2300 samples. Analysis of such rich and diverse data enables a more comprehensive capture of differences in driving styles among various drivers. In terms of network architecture, the constructed GAN framework includes a generator with 5 fully connected layers, each followed by a ReLU activation function to introduce nonlinear characteristics and enhance the network’s expressive ability, with a learning rate set to 0.001. The discriminator consists of 4 fully connected layers, also using the ReLU activation function, with a learning rate of 0.0005. This design ensures network complexity while avoiding overfitting. Loss evolution shows that in the early stages of training, the loss decreases rapidly, stabilizes in the later stages, and finally reaches a relatively stable low-loss state. Meanwhile, mean absolute error (MAE) and root mean square error (RMSE) are used as indicators of model accuracy. In the driving style classification task, the MAE reaches 0.15 and the RMSE is 0.2, indicating that the model can classify driving styles relatively accurately, providing a reliable basis for the learning and training of subsequent adaptive strategies.

2.2. Designing of Hierarchical Dynamic Game EMS Based on Eco-Driving Speed

As environmental information and driving styles impact the eco-driving strategy, reasonable speed planning should be incorporated. The upper level involves speed planning design based on car-following behavior, while the lower level is occupied by the energy management strategy optimizer.
As illustrated in Figure 2, the powertrain of the CAR-EEV is composed of a traction battery, APU, drive motor, mechanical drive mechanism, intelligent information system, and control system. This system was selected by this study as the research object for the design of the EMS.
In terms of APU system work-area optimization, there are three types of rule-based APU system power-output forms (work mode/area): namely, fixed-point (constant speed, constant torque), multi-point (constant speed, variable torque) and optimal-curve power following (variable speed, variable torque), as shown in Figure 2. The APU system operates at variable speed and torque on the optimum curve and at constant speed and torque in multi-point mode, both of which allow the APU system to follow the vehicle power requirement. When the operating point speed is considered to be continuous, the APU operating mode changes from single point to multiple points, and the number of operating points increases, to work along the optimal curve, commonly referred to as a power-following strategy. The different operating modes of the APU system can thus be described in terms of the number of APU operating points (Ncsop) and the power range (εi) at the speed where this operating point is located. Therefore, the operating strategy of the APU system can be structurally described as a decision logic based on Ncsop and εi: by synergistically optimizing these two dimensional parameters, the system can dynamically switch between fixed-point, multi-point, and power-tracking modes.
The vehicle’s required power Preq is calculated based on the target vehicle speed under the driving cycles, and the distribution range of Preq is statistically analyzed by dividing it into several power ranges. The cumulative power distribution function is obtained as follows:
F p ( t ) = t p r e q ( t ) d t
where Fp(t) is the cumulative power distribution function and preq(t) is the probability density function of the vehicle’s required power.
The principle of the APU system working area under the power probability density function is shown in Figure 3.
Divide the Preq interval evenly into Ncsop sections, with each section corresponding to a constant speed operating point of the APU system. By combining the maximum and minimum power values of the i-th interval and the power fluctuation margin coefficient εi, the upper and lower power limits of the APU system at operating speed ni can be calculated as follows:
P c s o p _ i _ u p = P c s o p _ i _ o p t + ( P c s o p _ i _ max P c s o p _ i _ min ) ε i 2 P c s o p _ i _ l o w = P c s o p _ i _ o p t ( P c s o p _ i _ max P c s o p _ i _ min ) ε i 2
Δ P i _ ε = P c s o p _ i _ u p P c s o p _ i _ l o w
ε i = Δ P i _ ε P c s o p _ i _ max P c s o p _ i _ min
where Pcsop_i_max, Pcsop_i_min, Pcsop_i_up and Pcsop_i_low are the maximum, minimum, upper and lower power in the i-th interval; Pcsop_i_opt is the power of the APU system operating at the optimal curve; and εi is the power fluctuation margin coefficient. ∆Pi_ε is the power coverage size. The torque at operating speed ni in the i-th interval is calculated as follows:
T i _ ε _ u p = 9550 P c s o p _ i _ u p n i T i _ ε _ l o w = 9550 P c s o p _ i _ l o w n i
where Ti_ε_up and Ti_ε_low are the upper and lower torque limits of the APU system operating in the i-th section; the switching thresholds for two adjacent working points are set to
P i _ c r _ u o = P c s o p _ i _ max P i _ c r _ l o w = P c s o p _ i _ min
where Pi_cr_up is the APU power threshold when switching between the i-th interval and the i+1-th interval, and Pi_cr_low is the APU power threshold when switching between the i-th interval and the i−1-th interval.
Finally, the target power and torque values of the APU real-time operating point are calculated as follows:
P i _ c r _ u p = P c s o p _ i _ max P i _ c r _ l o w = P c s o p _ i _ min
T A P U _ a c t = 9550   P A P U _ a c t n A P U _ a c t = 9550   P A P u _ a c t n i
where PAPU_act, nAPU_act and TAPU_act are the real-time target output power, speed, and torque values of the APU system, respectively.
The study uses the DP algorithm, which combines the data of engine fuel consumption, emissions, and ISG engine efficiency characteristics to obtain the optimal operating curve of APU system energy-consumption emissions characteristics [6]. Based on this curve, the APU system operating mode/area control strategy is designed.

2.3. Principle of APU Operating Point-Switching Decision Based on Dual Dynamic Game

In order to improve the cognitive intelligence of CAR-EEV vehicle decisions and achieve intelligent vehicle control, in this section the power allocation and the decision as to whether to switch APU operating modes/points are addressed, based on the dynamic game in the context of multi-source V2X information. A feasibility analysis of the APU operating point switch under multi-point modes is also required. If the conditions for switching to the optimal operating point are met, it is necessary to consider whether the APU system can operate continuously for a certain period of time, to better meet the power demand without switching again; because of this conversion, the dual energy source system has better work efficiency after the point is switched [18]. This involves achieving efficient operation of both the APU and battery systems, which enables connected CAR-EEV to engage in multi-step game interactions with V2X, thereby achieving higher system efficiency.
Considering the complexity, non-uniqueness, and coordination of APU multi-point operating modes in the power-following process, an intelligent switching non-cooperative dynamic game decision algorithm is established. Game theory paradigms are used to address potential conflicts between point/mode switches, considering energy consumption, emission reduction, and battery-life friendliness. The detailed modeling of power demand for the powertrain system under complex scenarios is refined into four parts: game benefits that integrate energy-saving and emission reduction, V2X information, reducing battery-capacity decay, and comprehensive performance indicators. Finally, a comprehensive performance index is established, incorporating different weight coefficients and considering vehicle dynamics constraints and road speed limits as the game’s total payoff function or optimization objective. The dynamic game interactive-decision process is unfolded using game trees, and subgame-perfect Nash equilibria for each continuous stage of the game are sought [21]. The potential conflicts during the real-time judgment of APU operating point/mode switches are evaluated. A detailed design of the game-mode switch rule table is developed by combining standardized switching thresholds with whether APU operating-mode switching has been completed. The decision process for the APU system-operating point switching involves current power, target power demand, the span of current speed to target speed (ni+1ni), the demand torque at the current speed, and the span of target torque (Ti+1Ti). The main factors influencing the speed switching of the APU system include one subjective factor (driving style) and two objective factors (road congestion and battery level). In a multi-source information environment, considering the control target from multiple perspectives, the APU system operating point-switching model is more complex than conventional control modes (based on threshold switching). The most significant difference from conventional operating conditions is the predictability of future driving conditions. Therefore, an intelligent decision system has the opportunity to fully consider the impact of global optimization modes when calculating future power demand and energy management. APU work point-switching strategy based on the dynamic game algorithm is shown in Figure 4. Firstly, it is necessary to determine whether the APU system intends to switch the operating point power: is there a change in the target speed ∆ni > 0? This is determined by comparing the current APU operating point with the speed difference from the optimal target operating point under power demand to establish the target speed. If the demand for the optimal operating point does not require a speed change, and the operating condition demand duration exceeds a predefined time threshold, it indicates that the target operating point provides sufficient working time and space, and the APU operating point maintains its original working mode. Conversely, if the conditions for game initiation are met, indicating a conflict of interest in whether the APU operating point should switch, a game is initiated. In this case, the dynamic game algorithm of the EMS system is activated. During the game, efforts are made to determine the optimal timing for operating point switching, calculate the payoff function, construct the payoff matrix, solve the optimal strategy combination for the current stage, and choose the target operating speed. Finally, a dynamic game is carried out based on the APU’s multi-point strategy selection, until the termination conditions of the game are met.
The detailed determination processes of the utility functions and the Nash equilibrium for the two participants have been clearly presented, as follows:
C s = α max ( 0 , S S max ) + α max ( 0 , S min S )
C c y c l e cos t = β C c y c l e P b a t t e r y η P max b a t t e r y
C d e m a n d = γ max ( 0 , P d e m a n d P b a t t e r y )
where Cs is a state-of-charge maintenance cost, S is the current SoC, Smin, Smax are the target SoC range, α is the penalty coefficient for SoC deviation from the target range, η is the charge–discharge efficiency, β is the impact coefficient of charge–discharge cycles on lifespan, and Pmax-battery is the maximum output power of the battery. Ccycle is the calculation formula for the per-cycle cost.
Combining the above three costs, the utility function of the battery (Ubattery) can be expressed as
U b a t t e r y = ( C s + C c y c l e cos t + C d e m a n d )
Procedure for determining the Nash equilibrium: the strategy space of the battery (with different output powers Pbattery, whose value range is [0, Pmax-battery]) and the strategy space of the APU (with different output powers PAPU, whose value range is [0, Pmax-APU]), and whether to switch operating points are specified; for each strategy combination of the battery and APU, the corresponding utility function values Ubattery(Pbattery, PAPU) and UAPU(Pbattery, PAPU) are calculated, and the payoff matrix is constructed. Then, the iterative method is employed to find the Nash equilibrium.

2.4. Multi-Objective Evaluation Model for EMS Considering Energy Consumption Emissions and Battery Life

While the fixed operation of APU at high-efficiency operating points can reduce energy consumption and emissions, this mode of operation can increase battery usage and shorten battery life. The parameter optimization of the HDGEMS design considers 3 primary objectives, and the HDGEMS described in Section 2 is optimized from a multi-objective perspective. The optimization of the APU system operation mode/area is conducted based on the MOO research framework, in this section, in order to verify whether the proposed MOO method can achieve a better balance among energy consumption, emissions, and battery life. The EMS performance index considers three metrics: the oil–electric conversion loss rate (Coil_ele), comprehensive exhaust emissions (Ecom), and the battery-capacity loss rate (Qloss) [22]. The APU serves as one of the energy sources for CAR-EEV, and all fuel consumption and emissions originate from the engine. This section outlines the evaluation criteria used to assess fuel consumption and exhaust emissions, as follows. Since the engine is mechanically linked to the generator in the APU system, solely considering the engine’s oil consumption characteristics fails to accurately represent the vehicle’s overall energy consumption. Taking into account the engine efficiency, the Coil_ele is calculated as follows:
C o i l _ e l e = 1 360 η e l e η o i l ρ
where ηele is the power-generation efficiency of the generator; ηoil is the efficiency between fuel and the effective power of the engine; and ρ is the calorific value of petrol, which is 4.6 × 107 J/kg. It is evident that Coil_ele defines the energy-transmission loss rate, and a smaller Coil_ele value corresponds to better fuel economy.
The conventional exhaust gases (CO, CH and NOx) are considered in the APU comprehensive exhaust-emission characteristic function Ecom, and the Ecom is calculated as follows:
E c o m = ξ C O E C O + ξ C H E C H + ξ N O x E N O x
where ECO, ECH and ENOx are CO, CH and NOx emission characteristic functions and ξCO, ξCH and ξNOx are the weight coefficients of ECO, ECH and ENOx, respectively. [ξCO, ξCH, ξNOx]T = [0.4, 0.3, 0.3]T.
Undoubtedly, the repeated charge and discharge cycles of the battery storage system will affect the aging speed of the battery. The optimal charging capacity can be achieved by modifying and controlling the charging-current profiles, and the negative impact of current on battery life can be minimized [23]. The battery-capacity loss rate Qloss is calculated to evaluate the battery-life model.
Due to the limited space of this paper, this research primarily focuses on the application of Qloss (for simplicity), and detailed experimental information on the fitting and determination of battery-life model parameters can be found in Ref. [23]. In this article, the fitting parameters in this paper are set as Q1 = 0.495 and Q2 = 0.379. The operating temperature is also a crucial factor affecting the battery life. In this study, the battery operating temperature is standardized at 25 °C. Finally, the Qloss is obtained as follows:
Q l o s s = 0 T c y c 0.495 × d I b a t ( t ) d t × exp 0.379 I b a t ( t ) A h c e l l × N c y c × D O D d t
where DOD is the depth of discharge (DOD = 0.7); Ahcell is the cumulative capacity of the battery; Ibat(t) is the battery current; Tcyc is the total cycle time; and Ncyc is the number of cycles.
In engineering practice, determining the preferred solution is critical. In this paper, the linear normalization method is selected for optimal decision, which is a practical and efficient multi-objective normalization optimization method, and can clearly reflect the weight assigned to the optimization objective. In addition, it is necessary to standardize the cost function for each dimension before using the linear normalization method. Furthermore, the three objective functions, Coil_ele, Ecom and Qloss possess different physical connotations. In order to reach the final quantitative decision, these objective functions need to be combined into a one-dimensional cost function. This study adopts a multi-objective normalization method and uses the following normalization formula:
X ( n o r ) = X p X p ( min ) X p ( max ) X p ( min )
where X(nor) is the normalized result of the objective function; Xp is the original data in the Pareto optimal solution set; and Xp(max) and Xp(min) are, respectively, the maximum and minimum values of the original data in the Pareto optimal solution set.
The comprehensive optimal performance function of the APU system, called the optimal comprehensive vehicle performance (Icom_ovp), is defined as follows:
I c o m _ o v p = 1 ω C C o i l _ e l e + ω E E c o m + ω Q Q l o s s _ b a t t
where Icom_ovp is the comprehensive performance evaluation index; ωC, ωE, and ωQ, respectively, represent the weight coefficients assigned to Coil_ele, Ecom, and Qbatt_loss.
Coil_ele defines the energy-transmission loss rate, and a smaller Coil_ele value corresponds to better fuel economy. Ecom reflects the comprehensive emission level, and a smaller Ecom value corresponds to better exhaust quality. Qbatt_loss is used to quantify the degree of performance degradation during battery use, including capacity degradation, internal resistance increase, etc., reflecting its remaining usable life. The selection of weight coefficients plays a crucial role in determining the final results. Based on the hierarchical analysis method [24], the multi-objective evaluation matrix is obtained as shown in Table 1, and the weight vector result is calculated as [ωC, ωE, ωQ]T = [0.41, 0.13, 0.46]T. This result becomes an important reference for the selection of weight coefficients. It should be noted that considering the catalytic conversion effect of the three-way catalyst weakens the relationship between the exhaust-emission level and the direct engine-emission content. Therefore, the weight factor of Ecom is relatively small, while the weight factors of Coil_ele and Qbatt_loss are comparable. It is worth mentioning that different multi-objective evaluation matrices will produce different weight factors. In this study, the selected weight parameters are [ωC, ωE, ωQ]T = [0.4, 0.2, 0.4]T. The source of the matrix elements is determined by inviting experts and combining design experience with the AHP scaling method to compare and score each indicator pairwise, and then calculating the geometric mean to obtain the matrix elements.

3. Results and Discussion

By means of simulation tests and experimental bench tests, the energy management strategy proposed in Section 2 is validated in this section, and the effectiveness of HDGEMS and the proposed MOO method in achieving the optimal balance between energy consumption, emissions, and battery life is confirmed.

3.1. Optimizing the Vehicle Model

The vehicle-dynamics model and the proposed HDGEMS are created in AVL/Cruise (2019) software and MATLAB/Simulink (R2022a) software, respectively; the co-simulation model is shown in Figure 5.
At the initial stage of the simulation, a vehicle dynamics model is established in the AVL/Cruise software, and various vehicle parameters—such as mass, dimensions, and powertrain characteristics—are inputted. Subsequently, the proposed HDGEMS model is constructed in MATLAB/Simulink, including the eco-driving speed module, dynamic game module, and other relevant components. Initial conditions for co-simulation are set, such as initial vehicle speed and initial battery SoC. High-performance workstations are adopted as the hardware for the simulation, which can meet the requirements of massive data calculation and model operation during the co-simulation process. To ensure the accuracy and reliability of the simulation results, key data obtained from the simulation (e.g., vehicle speed and power) are compared and analyzed with the test data of actual vehicles under the same or similar driving scenarios, and the accuracy of the vehicle dynamics model is verified. The results derived from the aforementioned simulation platform are illustrated in Figure 6. The dynamically varying vehicle speed and increasing mileage under eco-driving reproduce the actual driving conditions, and the distribution of the APU system’s operating area under the multi-point strategy reflects its operating status under diverse power demands, which provides critical basis and support for subsequently verifying the balance effect of the HDGEMS and the proposed MOO method among energy consumption, emissions, and battery life.

3.2. Comparative Analysis and Discussion of Experimental Results

To verify the universality and compare the optimal effect of the HDGEMS with the dynamic game algorithm, tests were carried out by simulation to determine the efficiency of the proposed strategy. To test the universality and compare the optimal effectiveness of the HDGEMS, the HDGEMS with the Eco-driving Speed module (EMShe-dg) and the HDGEMS with the Eco-driving Speed module off (EMSheoff-dg) are selected for comparative study. The EMSheoff-dgoff without the Eco-driving Speed module is expressed as a revision strategy (EMShe-dg), and the HDGEMS without the Eco-driving Speed module (EMSheoff-dg) and the HDGEMS without the dynamic game (EMShe-dgoff) were set as control groups. As a result, many more APU power transients can be observed throughout the CS phases, and the detailed results of the optimization of the performance for the different APU operating modes are shown in Table 2 and Figure 7.
When comparing EMSheoff-dgoff with EMShe-dg, the latter always has a visible advantage in terms of APU work-point adaptability. This better adaptability is reflected in three aspects. (1) The APU system is controlled at more efficient and better operating points, due to the real-time control performance of the eco-driving speed algorithms, resulting in better economics. (2) The role of dynamic algorithms in intelligent real-time decision-making of operating point-switching time can enable more power-following points to work in high-efficiency areas, to realize better economic performance, While a higher frequency of operating point switching may elevate energy consumption and emissions, this involves a subtle balance and trade-off. (3) The dual objectives of “power enhancement and fuel consumption reduction” are accomplished by extending battery-discharge cycles. While energy efficiency is enhanced, precise tracking of the battery SoC reference curve is concurrently secured, thereby sustaining current stability. By virtue of an appropriate discharge state, energy loss induced by DOD is effectively mitigated, and battery service life is significantly prolonged. It is noteworthy that, analogous to the EMShe-dg technology, although some APU operating points and energy distribution modes during cycle advancement are adjusted to optimize the Icom_ovp index, electrical energy remains efficiently utilized up to the cycle termination.

3.3. Experimental Results Considering Different Operating Modes of the APU System

As shown in Figure 8, operating-point distribution MAPs for the APU under four strategies are acquired, where the gray, red, and blue hollow circles in the figure denote the operating-point distribution results of the traditional APU system power following modes (EMSpfm), EMSheoff-dgoff and EMShe-dg, respectively.
As shown in Table 3, in modes Ncsop = 3 and Ncsop = 4, the APU system can be operated in the corresponding constant-speed and variable-torque operating regions, according to the strategy rules, to respond to the demand power. The distribution of operating points is uniform, the fluctuation of operating speed is small, and the operating points can be operated closer to the optimal operating curve. The statistical results show that under the EMShe-dg, 27.1% of the APU’s original operating points are switched to optimal operating points compared to the EMSheoff-dgoff. The control results regarding energy consumption, emissions, and battery capacity loss are presented in Table 3.
It is clear that the EMShe-dg strategy is superior to the EMSheoff-dgoff strategy, giving better control results in terms of energy consumption, emissions, and battery-capacity loss. The overall evaluation index Icom_ovp is substantially improved. When the output power of the APU is insufficient to satisfy the vehicle’s demand, the excess electrical energy will be subjected to secondary conversion for battery power supply. By means of regulating the APU operating points via the EMS, the balance between the deviation from the optimal operating point, dynamic operating-point switching, and the achievement of secondary charging can be attained. The primary source of efficiency loss stems from the APU’s deviation from its optimal operating point, rather than the energy loss associated with secondary charging. Through the utilization of speed sequence information related to driving characteristics, the optimal strategy can be achieved. By adopting the dynamic game algorithm based on V2X information and the eco-driving speed module, the distribution of power between the APU and the system of battery can be precisely modulated, which not only allows the APU system to operate stably at its optimal operating point, but also minimizes, to the greatest extent, the losses associated with secondary charging and the dynamic-time overhead incurred during APU-operating point switching.
The control strategy proposed in this paper demonstrates prominent efficacy in energy consumption, emissions, and battery life. As noted in the prior study [25], DP functions not only as the optimal offline benchmark, but also as the optimal control approach, taking Icom_ovp as its objective function. Given the discrepancies in units and magnitudes across the various performance indicators, the normalization parameters for these indicators will be computed first, to enable the horizontal comparison of the strategies. The computational methods for these parameters are outlined as follows:
μ i k = I i k I i k min I i k max I i k min
where μik is the normalized index of the k-th evaluation index on i-strategy; Iik is the index value; and Iikmin and Iikmax are the minimum and maximum values of the index.
The data employed for the normalization analysis in this study comprises two categories: one refers to the simulation results of the aforementioned strategies, and the other denotes the single-objective data derived from the Pareto solution set. The statistical results post normalization processing are illustrated in the Figure 9. It can be observed from the figure that the HDGEMS scheme proposed in this study demonstrates the most exceptional performance. By employing two independent APU operation modes for comparative analysis, respectively, the disparities in performance can be visualized more distinctly; the results reveal that the performance of the two modes is largely consistent. This suggests that the EMShe-dg scheme proposed in this study attains a sophisticated balance among the three objectives. This control strategy yields exceptional performance in energy consumption control, emission optimization, and battery-lifespan management. Notably, the substantial mitigation of battery-life degradation enables a significant reduction in the overall vehicle in an economical way.
To explore the degree of influence of different variables on EMS results, thereby identifying the most critical influencing factors and providing directions for subsequent optimization and in-depth research, sensitivity analysis was conducted in the study; the initial battery SoC and APU system power limit were selected as key variables, with multiple different initial SoC levels (specifically, 0.2, 0.4, 0.6, and 0.8) set and other conditions kept unchanged. The results indicate that the initial SoC has a relatively significant impact on energy consumption and battery life: when the initial SoC is low, the APU system needs to start more frequently and output higher power to meet vehicle driving demands, leading to increased energy consumption, while the number of charge–discharge cycles of the battery increases and the degree of deep discharge (DOD) intensifies, accelerating battery aging; conversely, when the initial SoC is high, the startup frequency of the APU system decreases, energy consumption is relatively reduced, and the battery-discharge process is gentler, which is beneficial for extending battery life. However, the impact of initial SoC on emissions is relatively minor, as emissions are mainly related to the operating efficiency and combustion status of the APU system. For the variable of APU system power limit, different power limit values (such as 30 kW, 40 kW, 50 kW, and 60 kW) were set, and simulation results show that the APU system power limit has a significant impact on energy consumption and emissions. When the power limit is low, the APU system cannot provide sufficient energy in response to high power demands, resulting in insufficient vehicle power, which necessitates frequent adjustments to the operating mode and increases energy consumption and emissions; when the power limit is excessively high, the APU system may operate in low-efficiency regions under certain working conditions, which also increases energy consumption and emissions. In terms of battery life, a moderate power limit can reduce the charge–discharge pressure on the battery while meeting vehicle power demands, which is conducive to extending battery life. The speed-adjustment threshold in the eco-driving speed algorithm is also a key variable; simulations were conducted with different threshold settings, and the results show that the threshold directly affects the distribution of operating points and switching frequency of the APU system. A smaller threshold causes more frequent adjustments to the operating points of the APU system, and although this can make the vehicle closer to the eco-driving speed, to a certain extent, it increases the switching frequency of the APU system’s operating points, which may lead to increased energy consumption and emissions. The frequent charge–discharge of the battery can also affect its service life: a larger threshold results in relatively stable operating points of the APU system, but may fail to fully utilize the advantages of the eco-driving speed algorithm, thereby affecting the optimization effect on energy consumption and emissions.

3.4. Implementation of Experimental Test and Its Results

The results demonstrate that performance is significantly affected by different EMSs, and further verification needs to be completed on the AVL test bench. The configuration of the test bench is depicted in Figure 10, while Table 4 provides the parameters of the components. It should be noted that to ensure the reliability and repeatability of the results, 10 simulation runs are conducted separately for the cases adopting the HDGEMS, the MOO method, and traditional energy-management strategies, and the average value of the 10 simulation results is calculated to represent the central tendency of the energy consumption, emission, and battery-lifespan indicators.
The specific experimental procedure is as follows: first, the parameters of vehicle and the preset driving cycle (CCDC) are entered into the road-load simulation software. Subsequently, the engine and dynamometer are respectively controlled for speed and torque by the AVL system, to simulate the vehicle’s road load. The Preq calculated in real time incorporates the influencing factors of DC bus voltage and current, comprehensively. Next, the battery SoC is collected by the control system, the HDGEMS is implemented, and the target engine speed and target generator torque are output. Finally, power output is achieved by means of the coordinated control strategy (APU). To verify the power trajectory tracking effect, the calculated power output is used as the target power in the experiment. Due to the differences in the final state of charge among the three strategies, it is unreasonable to directly evaluate the fuel consumption values. The comparison of the 100 km equivalent fuel consumption (Ge) of the three control strategies after adjusting the final SoC (the fuel efficiency under the corrected SoC) is shown Table 5.
The total consumption of EMShe-dg is slightly lower than EMSpfm by about 13.8%, and EMShe-dg is slightly better than EMSheoff-dgoff. And it is observed that the fuel consumption results (Ge, Coil_ele) of the three control strategies are comparable. Owing to the fact that the power demand of the EMSpfm system undergoes dynamic fluctuations and remains relatively high, considerable potential for optimization remains. The experimental results show that there is a significant difference in fuel consumption between the EMSheoff-dgoff and the EMShe-dg, indicating that the proposed EMShe-dg performs well in fuel savings. Compared with the EMSpfm, the EMSheoff-dgoff and EMShe-dg reduced the amount of ecosystem by approximately 3.57% and 4.51%, respectively, demonstrating that the EMShe-dg can achieve higher emission quality. On the other hand, EMSpfm has relatively poor emission quality, due to frequent changes in APU operating points. The proposed EMShe-dg effectively reduces the number of switches by adjusting APU operations to the optimal efficiency range, which is the main reason for its overall consumption reduction. Compared with the EMSpfm, the EMSheoff-dgoff and EMShe-dg significantly reduced Q loss by approximately 23.9% and 33.8%, respectively. In terms of battery life, when the battery capacity of CAR-EEV drops below 80% of the new battery state, it can be considered as the end of battery life. Under the current proposed configuration, the battery life is approximately 1.8 times longer than traditional rule-based strategies that do not consider V2X information during travel. The experimental results are consistent with the simulation conclusions. In addition, the proposed EMShe-dg has been optimized by adjusting the APU operating point to reduce battery-capacity degradation rate, thereby further extending battery life. Through a comprehensive analysis of the three aforementioned factors, notable differences are identified between the two strategies. The research outcomes confirm the feasibility of the proposed energy management scheme. With the support of the parameter adjustment module based on MOO, the proposed HDGEMS system facilitates comprehensive optimization of vehicle performance (achieving a favorable balance across multiple objectives), with a significant improvement in overall performance. This research and the optimization approach provide a more practical reference for the design optimization of EMCS.

4. Conclusions

In order to achieve the optimal energy allocation of two energy sources of CAR-EEV from a multi-objective perspective, an HDGEMS is introduced and a dynamic game optimization method based on V2X information is proposed in this paper. It holds pioneering significance in achieving multi-objective optimal energy distribution, and the long-standing challenge that traditional methods struggle with to balance economy, emission reduction, and battery life has been effectively addressed. The proposed HDGEMS showed good performance by implementing a dynamic game- and parameter-adaptive algorithm for the fuzzy algorithm. Through the real-time optimization of ride-comfort evaluation indicators for road conditions, dynamic adjustment of power distribution between the APU and the battery can be realized. This thereby enables the APU to operate efficiently under optimal operating conditions, and ultimately minimizes energy losses during secondary charging and dynamic switching of the APU, to the maximum extent. This model allows the current operating point under APU multi-point modes to engage in multi-step game interactions with the target operating point, thereby achieving greater energy-saving and emission-reduction potential, higher battery operational efficiency, and better longevity characteristics. The results show that when comparing EMS wdg-wadm with EMS dg-adm, the latter always has a visible advantage in terms of APU work-point adaptability. The simulation and experimental results indicated that economic improvement, emission reduction, and prolonging the battery life of CAR-EEV were kept in balance effectively, under the control of the proposed HDGEMS, with the intelligent optimization method. The design and the related conclusions obtained from the study lay the research foundation for the implementation of the top-level intelligent mode-switching-based energy management strategy for optimal energy allocation. In terms of data acquisition, the V2X information relied on in the study may be inaccurate or delayed, which impairs the real-time performance and accuracy of the dynamic game-based optimization method. In practical applications, significant differences exist in road conditions and traffic rules across different regions, posing challenges to the universality of the model and making it difficult to achieve optimal energy distribution in all scenarios. For subsequent research, targeted improvements should be made to the data acquisition method and the universality of the model, to address these limitations.

Author Contributions

H.L. and W.S. managed the project and conceptualized the scheme; H.L. and K.Y. conceived the control method; H.L., Z.S. and S.W. completed the modeling and performed the simulation experiments; L.L. and Z.S. finished the manuscript; Q.X. and S.L. collected the data and reviewed the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Research on Collaborative Optimization of Optimal Speed Control and Energy Management for Connected Automated Range-Extended Electric Vehicle, grant number 232699HJ0103108631, and Design of Key Mechanical Components and Development of Intelligent Control System for Customized Material Handling Equipment, grant number 2023220103000068. And the APC was funded by Research on Collaborative Optimization of Optimal Speed Control and Energy Management for Connected Automated Range-Extended Electric Vehicle.

Data Availability Statement

Data are provided within the manuscript.

Acknowledgments

The authors gratefully acknowledge the financial support from the Research on Collaborative Optimization of Optimal Speed Control and Energy Management for Connected Automated Range-Extended Electric Vehicles (No. 232699HJ0103108631), and Design of Key Mechanical Components and Development of Intelligent Control System for Customized Material Handling Equipment (No. 2023220103000068).

Conflicts of Interest

The authors declare that there are no conflicts of interests regarding the publication of this paper.

References

  1. Lei, Z.Z.; Qin, D.T.; Hou, L.L.; Peng, J.; Liu, Y.; Chen, Z. An adaptive equivalent consumption minimization strategy for plug-in hybrid electric vehicles based on traffic information. Energy 2020, 190, 116409. [Google Scholar] [CrossRef]
  2. Zhang, Y.J.; Wei, C.F.; Liu, Y.G.; Chen, Z.; Hou, Z.; Xu, N. A novel optimal power management strategy for plug-in hybrid electric vehicle with improved adaptability to traffic conditions. J. Power Sources 2021, 489, 229512. [Google Scholar] [CrossRef]
  3. Li, J.; Wang, Y.; Chen, J.; Zhang, X. Study on energy management strategy and dynamic modeling for auxiliary power units in range-extended electric vehicles. Appl. Energy 2017, 194, 363–375. [Google Scholar] [CrossRef]
  4. Wang, Y. Research on Intelligent Energy Management and Control Strategy for Hybrid Electric Bus Based on Vehicle Network Information. Ph.D. Thesis, Jilin University, Changchun, China, 2016. [Google Scholar]
  5. Lei, Z.Z. Research on Energy Management Strategy for Plug in Hybrid Electric Vehicles Integrating Traffic Information and Adaptive Control. Ph.D. Thesis, Chongqing University, Chongqing, China, 2019. [Google Scholar]
  6. Peng, J.; He, H.; Xiong, R. Rule based energy management strategy for a series–parallel plug-in hybrid electric bus optimized by dynamic programming. Appl. Energy 2017, 185, 1633–1643. [Google Scholar] [CrossRef]
  7. Bai, Y.; Li, J.; He, H.; Dos Santos, R.C.; Yang, Q. Optimal design of a hybrid energy storage system in a plug-in hybrid electric vehicle for battery lifetime improvement. IEEE Access 2020, 8, 142148–142158. [Google Scholar] [CrossRef]
  8. Sun, X.; Zhou, Y.; Zhang, X.; Huang, L.; Lian, J. Real-time optimal EMS of adaptive charge sustenance for fuel cell hybrid buses based on driving characteristics recognition. Energy Convers. Manag. 2022, 254, 115192. [Google Scholar] [CrossRef]
  9. Morlock, F.; Heppeler, G.; Wohlhaupter, U.; Sawodny, O. Range extension for electric vehicles by optimal velocity planning considering different driver types. In Proceedings of the Conference on Control Technology and Applications (CCTA), Maui, HI, USA, 27–30 August 2017. [Google Scholar]
  10. Li, L.; You, S.; Yang, C.; Yan, B.; Song, J.; Chen, Z. Driving-behavior-aware stochastic model predictive control for plug-in hybrid electric buses. Appl. Energy 2016, 162, 868–879. [Google Scholar] [CrossRef]
  11. Lin, R.; Chu, H.; Gao, J.; Chen, H. Charging management and pricing strategy of electric vehicle charging station based on mean field game theory. Asian J. Control. 2024, 26, 3173. [Google Scholar] [CrossRef]
  12. Wang, S.; Lin, X. Eco-driving control of connected and automated hybrid vehicles in mixed driving scenarios. Appl. Energy 2020, 271, 115233. [Google Scholar] [CrossRef]
  13. Nie, Z.; Jia, Y.; Wang, W.; Chen, Z.; Outbib, R. Co-optimization of speed planning and energy Management for intelligent fuel cell hybrid vehicle considering complex traffic conditions. Energy 2022, 247, 123476. [Google Scholar] [CrossRef]
  14. He, H.; Wang, Y.; Han, R.; Han, M.; Bai, Y.; Liu, Q. An improved MPC-based energy management strategy for hybrid vehicles using V2V and V2I communications. Energy 2021, 225, 120273. [Google Scholar] [CrossRef]
  15. Cheng, S.; Li, L.; Chen, X.; Fang, S.-N.; Wang, X.-Y.; Wu, X.-H.; Li, W.-B. Longitudinal autonomous driving based on game theory for intelligent hybrid electric vehicles with connectivity. Appl. Energy 2020, 268, 115030. [Google Scholar] [CrossRef]
  16. Roumila, Z.; Rekioua, D.; Rekioua, T. Energy management based fuzzy logic controller of hybrid system wind/photovoltaic/diesel with storage battery. Int. J. Hydrogen Energy 2017, 42, 19525–19535. [Google Scholar] [CrossRef]
  17. Song, Z.; Li, J.; Han, X.; Xu, L.; Lu, L.; Ouyang, M.; Hofmann, H. Multi-objective optimization of a semi-active battery/supercapacitor energy storage system for electric vehicles. Appl. Energy 2014, 135, 212–224. [Google Scholar] [CrossRef]
  18. Liu, H.; Lei, Y.; Fu, Y.; Li, X. Adaptive parameter optimal energy management strategy based on multi-objective optimization for range extended electric vehicle. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2021, 236, 1809–1823. [Google Scholar] [CrossRef]
  19. Yong, Z.; Li-Juan, Y.; Qian, Z.; Xiao-Yan, S. Multi-objective optimization of building energy performance using a particle swarm optimizer with less control parameters. J. Build. Eng. 2020, 32, 101505. [Google Scholar] [CrossRef]
  20. Liu, H.; Lei, Y.; Fu, Y.; Li, X. A novel hybrid-point-line energy management strategy based on multi-objective optimization for range-extended electric vehicle. Energy 2022, 247, 123357. [Google Scholar] [CrossRef]
  21. Ding, X.; Li, T.; Zhao, F.; Kong, D.; Wei, M. Research on Energy Management Strategy for Extended-Range Electric Vehicle Based on Bargaining Game. Energy Technol. 2024, 12, 202301118. [Google Scholar] [CrossRef]
  22. Liu, H.; Lei, Y.; Sun, W.; Chang, C.; Jiang, W.; Liu, Y.; Hu, J. Research on approximate optimal energy management and multi-objective optimization of connected automated range-extended electric vehicle. Energy 2024, 306, 132368. [Google Scholar] [CrossRef]
  23. Song, D.; Yang, L.; Zeng, X.; Wang, X.; Liang, W.; Yang, N. Optimization of hybrid electric vehicle battery life based on driving cycle synthesis. J. Jilin Univ. Eng. Ed. 2021, 51, 781–791. [Google Scholar]
  24. Liu, H.; Lei, Y.; Fu, Y.; Li, X. Parameter matching and optimization for power system of range-extended electric vehicle based on requirements. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2020, 234, 3316–3328. [Google Scholar] [CrossRef]
  25. Kong, Y.; Xu, N.; Zhang, Y.J. Acquisition of full-factor trip information for global optimization energy management in multi-energy source vehicles and the measure of the amount of information to be transmitted. Energy 2021, 236, 121423. [Google Scholar] [CrossRef]
Figure 1. Algorithm structure diagram of the adaptive car-following model.
Figure 1. Algorithm structure diagram of the adaptive car-following model.
Applsci 15 12738 g001
Figure 2. Configuration of the studied CAR-EEV and principle of the three-point energy management strategy.
Figure 2. Configuration of the studied CAR-EEV and principle of the three-point energy management strategy.
Applsci 15 12738 g002
Figure 3. Schematic diagram of APU system working area under power probability density function.
Figure 3. Schematic diagram of APU system working area under power probability density function.
Applsci 15 12738 g003
Figure 4. Diagram of APU work point-switching strategy based on dynamic game algorithm.
Figure 4. Diagram of APU work point-switching strategy based on dynamic game algorithm.
Applsci 15 12738 g004
Figure 5. Simulation platform of the CAR-EEV for the MOO problem: (a) simulation platform of the CAR-EEV; (b) Simulink Environment Vehicle Controller.
Figure 5. Simulation platform of the CAR-EEV for the MOO problem: (a) simulation platform of the CAR-EEV; (b) Simulink Environment Vehicle Controller.
Applsci 15 12738 g005
Figure 6. Eco-driving vehicle speed output: (a) vehicle speed and power under eco-driving; (b) APU system operating area under multi-point strategy.
Figure 6. Eco-driving vehicle speed output: (a) vehicle speed and power under eco-driving; (b) APU system operating area under multi-point strategy.
Applsci 15 12738 g006
Figure 7. Optimization: detailed results of the HDGEMS with dynamic game and adaptive adjustment module.
Figure 7. Optimization: detailed results of the HDGEMS with dynamic game and adaptive adjustment module.
Applsci 15 12738 g007
Figure 8. MAP diagram of the APU operating-point distribution in different operating modes in HDGEMS.
Figure 8. MAP diagram of the APU operating-point distribution in different operating modes in HDGEMS.
Applsci 15 12738 g008
Figure 9. Comparison of normalized evaluation indicators: (i) Ncsop = 3; (ii) Ncsop = 4.
Figure 9. Comparison of normalized evaluation indicators: (i) Ncsop = 3; (ii) Ncsop = 4.
Applsci 15 12738 g009
Figure 10. The experimental-platform system structure.
Figure 10. The experimental-platform system structure.
Applsci 15 12738 g010
Table 1. Multi-objective Evaluation Matrix Coil_ele, Ecom and Qbatt_loss.
Table 1. Multi-objective Evaluation Matrix Coil_ele, Ecom and Qbatt_loss.
Evaluation MatrixCoil_eleEcomQbatt_loss
Coil_ele1/14/12/3
Ecom1/41/11/4
Qbatt_loss4/37/21/1
Table 2. Test results of performance of the HDGEMS with dynamic game and adaptive adjustment module.
Table 2. Test results of performance of the HDGEMS with dynamic game and adaptive adjustment module.
HDGEMSModule StatusEvaluation Index
Eco-Driving Speed Dynamic GameCoil_eleEcomQbatt_lossIcom_ovp
EMSheoff-dgoffOffOff0.7270.54512.1%0.77
EMSheoff-dgOffOn0.7230.53811.3%0.81
EMShe-dgoffOnOff0.7240.53610.8%0.83
EMShe-dgOnOn0.7180.5329.3%0.86
Note: all parameters listed in this table (Coil_ele, Ecom, Qbatt_loss, Icom_ovp ) are dimensionless indices.
Table 3. Test performance results of the different APU operating modes in HDGEMS.
Table 3. Test performance results of the different APU operating modes in HDGEMS.
HDGEMSAPU Operating ModesEvaluation Index
Coil_eleEcomQbatt_lossIcom_ovp
EMSpfmpower following0.7650.65313.1%0.66
EMSheoff-dgoffNcsop = 30.7340.56712.5%0.73
EMShe-dgNcsop = 30.7270.54210.8%0.77
EMSheoff-dgoffNcsop = 40.7310.54711.5%0.79
EMShe-dgNcsop = 40.7200.5189.3%0.81
Note: all parameters listed in this table (Coil_ele, Ecom, Qbatt_loss, Icom_ovp) are dimensionless indices.
Table 4. Parameters of the vehicle components of the experimental platform.
Table 4. Parameters of the vehicle components of the experimental platform.
ComponentsParametersValueComponentsParametersValue
EngineMaximum power/(kW)65BatteryContinuous discharge capacity (C)2
Minimum speed/(r/min)950 Nominal voltage/(V)320
Maximum speed/(r/min)6500 Capacity/(A·h)25
GeneratorMaximum power/(kW)60MotorPeak power/(kW)75
Maximum speed/(r/min)9000 Maximum speed/(r/min)9000
Table 5. Test performance results of the different EMS.
Table 5. Test performance results of the different EMS.
StrategyGe (L/100 km)GeCoil_eleCoil_eleEcomEcomQbatt_lossQbatt_loss
EMSpfm4.780.7310.53214.2%
EMSheoff-dgoff4.33−9.41%0.723−1.09%0.513−3.57%10.8%−23.9%
EMShe-dg4.12−13.8%0.719−1.64%0.508−4.51%9.4%−33.8%
Note: all parameters listed in this table (Coil_ele, ∆Coil_ele, Ecom, ∆Ecom, Qbatt_loss, ∆Qbatt_loss) are dimensionless indices.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, H.; Yang, K.; Sun, W.; Liu, L.; Su, Z.; Xiao, Q.; Wang, S.; Li, S. Research on Energy Management Strategy for Range-Extended Electric Vehicles Based on Eco-Driving Speed. Appl. Sci. 2025, 15, 12738. https://doi.org/10.3390/app152312738

AMA Style

Liu H, Yang K, Sun W, Liu L, Su Z, Xiao Q, Wang S, Li S. Research on Energy Management Strategy for Range-Extended Electric Vehicles Based on Eco-Driving Speed. Applied Sciences. 2025; 15(23):12738. https://doi.org/10.3390/app152312738

Chicago/Turabian Style

Liu, Hanwu, Kaicheng Yang, Wencai Sun, Le Liu, Zihang Su, Qiaoyun Xiao, Song Wang, and Shunyao Li. 2025. "Research on Energy Management Strategy for Range-Extended Electric Vehicles Based on Eco-Driving Speed" Applied Sciences 15, no. 23: 12738. https://doi.org/10.3390/app152312738

APA Style

Liu, H., Yang, K., Sun, W., Liu, L., Su, Z., Xiao, Q., Wang, S., & Li, S. (2025). Research on Energy Management Strategy for Range-Extended Electric Vehicles Based on Eco-Driving Speed. Applied Sciences, 15(23), 12738. https://doi.org/10.3390/app152312738

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop