iEVEM: Big Data-Empowered Framework for Intelligent Electric Vehicle Energy Management
Abstract
:1. Introduction
- Particularity of Knowledge-Implied EVE Data. Except for traditional big data characteristics [19], EVE big data imply underlying complex domain-specific mechanisms, e.g., the EV energy recovery during braking. General data-driven approaches, which often neglect the incorporation of inherent domain knowledge, face significant challenges in accurately modeling EVE status [8,12,20]. Despite the presence of frameworks incorporating knowledge embedding, they are insufficient to fully support the knowledge embedding of EVEM due to its more complex and diverse knowledge (e.g., various non-linear and uncontrollable electrochemical reactions). Thus, developing a framework that facilitates the seamless embedding of subtle and domain-specific knowledge is of paramount importance for achieving precise EVEM.
- Constraints of Resource-Limited EVEM Systems. Networked EVEM systems comprise heterogeneous devices with varying computational and communication capabilities. Traditional edge- and cloud-based schemes, while widely adopted, are often constrained by computational limitations (e.g., on-site EV devices with restricted processing power and memory) or communication bottlenecks (e.g., limited vehicle-to-everything bandwidth under dynamic network conditions). These inherent constraints significantly hinder their ability to ensure the prompt and reliable responses required for latency-sensitive applications [21], e.g., failure to promptly alert about battery abnormalities potentially results in fatal consequences like spontaneous combustion or explosion. Considerable studies have focused on the issue of resource constraints, while only limited works attempt to reduce end-to-end (E2E) latency in such resource constraints. Consequently, an efficient EVEM framework capable of operating within limited resources is indispensable for practical deployment.
- Deficiencies of Distributed EVEM Systems and Isolated EVE Data. To protect the privacy [22,23,24] of different EV stakeholders like manufacturers, vendors, and consumers, EVEM systems are physically distributed and networked, and EVE data are strictly isolated and unassociated. While there are many works dedicated to addressing data isolation issues (e.g., federated learning), they primarily focus on simplex horizontal or vertical federated learning scenarios. Richer scenarios with multi-party, multi-level, and multi-scale spatio-temporal joint analysis are demanded in EVEM systems. The property critically affects the feasibility and efficiency of the traditional big data framework. Therefore, addressing these challenges within the framework design is crucial to ensure comprehensive and practical EVEM.
- (1)
- We conduct the first comprehensive investigation on intelligent EVEM, aiming to provide insight into this emerging promising field. Particularly, we introduce the background on EVE and clarify essential EVEM applications at the driver-, enterprise-, and social levels, effectively highlighting the practical significance of EVEM. Meanwhile, we systematically identify and extract the key challenges associated with designing and implementing a framework for intelligent EVEM.
- (2)
- We propose a novel big data framework, termed iEVEM, to address the challenges mentioned above. Specifically, we construct a layered architecture of EVE data processing and analysis, starting from the physical layer, which manages heterogeneous and isolated EVE data for data collection. This is followed by the data layer and algorithm layer, which enable supporting the efficient design of knowledge-enhanced intelligent solutions, ultimately supporting diverse intelligent EVEM applications in the application layer. Additionally, an edge–cloud collaborative system architecture is introduced to facilitate practical application deployment while effectively addressing the resource constraints of distributed systems. The framework outlines a standardized development process for intelligent EVEM solutions, which serves as a tutorial to guide practitioners in developing intelligent applications in practice.
- (3)
- We conducted a proof-of-concept case study of iEVEM using real-world data to validate its effectiveness. For EV energy consumption outlier detection, the experimental results demonstrate that iEVEM achieves significant improvements in both detection accuracy, with gains of up to 47.48% higher, and response speed, being at least 3.07× faster compared with state-of-the-art methods. The case study has demonstrated the effectiveness of our framework, enabling further innovation and progress in EVEM and even other fields like smart cities and intelligent industry. Furthermore, we also highlight several important open issues and research directions for the further development and refinement of intelligent EVEM and relevant fields.
2. Background Knowledge
2.1. The Current Status of EVE
2.2. The Factors Affecting the Development of EVE
- Safety. The safety of EVs is a primary concern for consumers. Recent incidents involving battery fires and explosions have raised public apprehension. Taking Tesla EV as an example, more than 20 cars of the Model X/S series suffered battery thermal runaway accidents from 2018 to 2019 [26], highlighting the need for improvements in EV safety, which impacts consumers’ willingness to purchase EVs.
- Convenience. Convenience is significantly impacted by users’ range anxiety. This anxiety arises from two main issues. On one hand, the state-of-charge (i.e., battery level) is nonlinearly influenced by various factors, such as driving habits, road conditions, and temperature, making it difficult for users to estimate their remaining range reliably [8]. On the other hand, insufficient charging infrastructure creates concerns regarding the feasibility of charging [25], especially during long trips. For example, in the United States, there are approximately 120,000 charging stations for over 2 million EVs in 2021, resulting in a ratio of about 1 charging station per 17 vehicles. The inaccurate estimation of battery levels and the imbalance between vehicles and piles may hinder the continued growth and acceptance of EVs.
- Economy. The economic viability of EVs is affected by multiple factors, including government policies, subsidy levels, and electricity pricing [27]. Many countries provide incentives, such as purchase subsidies and tax reductions, to encourage EV sales. In China, for example, subsidies can reach up to 20,000 RMB (approximately 3000 USD) per vehicle. However, as the subsidies decrease, the total cost of ownership may rise, affecting consumer purchase decisions.
3. Essential EVEM Applications
3.1. Driver-Level Applications
3.1.1. Driving Safety
3.1.2. Energy Economy
3.2. Enterprise-Level Applications
3.2.1. Quality Control
3.2.2. Cost Reduction
3.3. Social-Level Applications
3.3.1. Environmental Protection
3.3.2. Public Welfare
4. Challenges to the EVEM Framework
4.1. Data Challenges
- ①
- It is difficult to accurately model EVE using general methods due to underlying complex knowledge. On one hand, given the inherent complexity, nonlinearity, and uncontrollability of energy reactions, EVE status is hard to model formally, which brings great challenges for existing mechanism-driven methods. On the other hand, lacking effective solutions to integrate inherent knowledge, pure data-driven methods struggle to model EVE accurately [8,12] and cannot support reliable EVEM.
- ②
- Unassociated fragmented EVE data pose challenges to multi-scale spatio-temporal correlation analysis. Since EVE status is impacted by a range of factors varying over space and time, multi-scale joint analysis is necessary for EVEM. However, EVE data are collected and possessed in distributed manners and isolated at different owners (e.g., drivers, enterprises, and government agencies) without a way to associate them [39]. This seriously impedes the feasibility of joint analysis.
4.2. System Challenges
- ③
- Rapid response is difficult to satisfy by conventional schemes with limited system resources. In naturally distributed EVEM systems, low E2E latency is challenging with the limited computing capabilities of edge nodes (e.g., vehicle-mounted devices) and communication resources between nodes (e.g., moving EVs) [21]. Specifically, predominating cloud-based methods requiring massive data uploading suffer from prolonged communication time. Local-based methods, processing data locally entirely, result in an unacceptable computation time and cannot support efficient EVEM.
- ④
- Isolated EVEM systems pose challenges to multi-party joint analysis. Numerous EVEM applications inherently require multiple stakeholders (e.g., drivers, enterprises, and government agencies) to participate. However, with widespread and growing privacy concerns [23,24,40] of participants, all data are best kept locally to prevent privacy leakage. Hence, the strictly isolated systems severely hinder the feasibility of joint analysis across multiple parties.
5. Data Intelligence Architecture of iEVEM
5.1. The Physical Layer
- Production Phase refers to the process from raw materials (e.g., electrolyte) to concrete power battery products (e.g., cell, module, and pack) [41]. Production data are collected by manufacturing equipment, mainly containing battery monitoring records (e.g., current, voltage, and resistance), which are generally structured in a predefined format like spreadsheets and acquired continuously near real-time following specific industrial standards (e.g., ISO 12405 [42] in the European Union).
- Service Phase indicates the usage of finished products like EVs and charging piles, where service data record their operating information. Particularly, EV service data are collected by on-board sensors [39] and mainly perceive the status of eic systems, i.e., the battery (e.g., temperature), motor (e.g., velocity), and controller (e.g., regenerative braking). Following the national standard (e.g., GB/T 32960 [43] in China), service data are also collected in structured with a prescribed format and frequencies.
- Maintenance Phase indicates the status of out-of-service, including repairing and recycling. The maintenance data are collected by checkout equipment, which includes the testing information of productions, e.g., fault in repairing and RUL in recycling. Among them, repairing data are usually formatted in semi-structure and varied with enterprises, while recycling data tend to be structured in the required testing procedures along with the increasingly published recycling standards (e.g., UL 1974 [44] in America).
- Domain Expert refers to EVE-domain specialists, and the expert knowledge indicates the information converted by prior experience, which is evolved in the aforementioned phases (e.g., working procedures in production, energy mechanisms in service, and repairing logs in maintenance). Knowledge is usually formatted in semi-structured (e.g., worksheet) and unstructured data (e.g., text), and as an additional input for intelligent solution construction, knowledge representation and embedding are crucial.
5.2. The Data Layer
5.2.1. Data Association
5.2.2. Knowledge-Enhanced Feature Engineering
- Step 1: Feature Extraction transforms raw data into sets of features with underlying patterns. Traditional feature extraction, relying on straightforward mathematics properties (e.g., mean and variance), ignores physical meaning with potentially critical information unexplored (e.g., the peak of the incremental capacity curve is a decisive factor for capacity estimation [45]). Knowledge embedding effectively alleviates the issue by forming feature candidates for each data dimension in advance, which facilitates extracting meaningful features by the feat of expert experience.
- Step 2: Feature Selection intends to identify relevant features for given tasks from feature candidates, which is usually achieved by feature importance ranking. However, existing methods (e.g., decision tree) are prone to unstable ranking since they strongly rely on sample data. To enhance the reliability of task-oriented feature selection, expert knowledge is used to guide the identification of critical relevant features (e.g., expert knowledge can be utilized to assign feature weights in feature ranking) for given tasks. For example, the feature selection can be conducted in a stepwise manner. First, feature grouping is performed. Data-driven feature correlations (e.g., Pearson correlation coefficient) can be used to construct an undirected graph where nodes represent individual features and edges represent the correlation weights between features. Expert knowledge can help verify and adjust the correlation weights. Ultimately, feature grouping is achieved through subgraph partitioning, where edges with weights below a certain threshold are removed. Next, feature ranking is performed based on the feature groups. Specifically, representative features are randomly selected from the groups, and experts indicate their relevance (1 for relevant and 0 for not relevant). After multiple rounds of feature sample and group labeling, task-oriented group weights are generated. These weights are normalized and combined with data-based ranking to produce a stable feature ranking, which minimizes expert labeling costs by only requiring assessments of a few features within each group.
- Step 3: Dimensionality Reduction refers to the process of reducing the number of features while preserving sufficient and necessary information, which significantly contributes to subsequent efficient data analysis and model performance. Traditional reduction methods (e.g., principal component analysis and autoencoders) reduce features by changing feature spaces, where transformed dimensions lack clear physical meanings. Domain knowledge helps obtain refined features with practical meaning retained in original feature spaces (e.g., reduce redundant features according to physical correlations or integrate multiple features into one with practical meaning). For example, we only select one feature in each feature group in step 2 to reduce redundancy, as features within the same group exhibit high correlation.
5.3. The Algorithm Layer
5.3.1. General Model
- Mechanism-driven Models are constructed based on the fundamental insights of underlying EVE mechanisms (e.g., physical laws and chemical reactions), which emphasize interpretability and physical fidelity, making them indispensable for EVEM. These models are mainly developed in formalized mathematical expressions for representing the intrinsic principles (e.g., the electrochemical and thermal dynamics of batteries, the operational characteristics of motors, and the energy flow in powertrain systems). For example, equivalent circuit models [7] are widely used to describe battery behavior, leveraging electrical circuit analogies to represent processes like charge transfer and diffusion. While mechanism-driven models exhibit strong interpretability, they often face challenges in terms of adaptability to complex, nonlinear, and uncontrollable energy reactions and systems. Nonetheless, these models remain a reserve and cornerstone for EVEM.
- Data-driven Models are constructed to uncover patterns, relationships, and decision-making rules directly from data, bypassing the need for explicit physical or mechanistic understanding. Such methods are primarily developed by statistics, machine learning, and deep learning. By virtue of learning patterns and relationships from massive historical data, the solution is built automatically based on mined rules. In the context of EVEM, supervised learning algorithms [32], such as decision trees in machine learning and neural networks in deep learning [20], are commonly used to predict battery degradation and RUL based on historical usage patterns. As another model basis of EVEM, the primary strength of data-driven models lies in their ability to automatically learn complex, nonlinear, and uncontrollable relationships from data without domain knowledge. However, these methods also exhibit notable drawbacks in their stability and reliability, suffering from their poor interpretability.
5.3.2. Knowledge-Enhanced Algorithm Construction
- Step 1: Problem Definition abstracts and models the target problem, including task types (e.g., classification or regression) and requirements (e.g., optimization objectives and constraint conditions) from real scenarios, which should be expressed explicitly with the aid of domain experts. For instance, expert knowledge in the text form can be transformed into optimization formulas through a large language model [46].
- Step 2: Algorithm Development indicates the design of specified intelligent solutions. Depending on the task type and requirements from the problem definition, practicable general models are selected from the model pool (i.e., mechanism- or data-driven models), whose characteristics have been elaborated in advance by experts. After that, the algorithm is designed (e.g., construct a novel one or modify general models) with further consideration of available data, application demands, and muttons with knowledge guidance (e.g., the optimum parameters are set by prior experience). Moreover, in a knowledge-enhanced way, in addition to expert-guided practicable general model selection and proper parameter setting (e.g., learning rate during training), knowledge representation and embedding are utilized for algorithm design to further improve performance. For example, the knowledge-enhanced mechanism-driven approach utilizes expert knowledge to set and adjust parameters such as the drag coefficient and frontal area within the mechanism-driven vehicle dynamic model during new vehicle design. These parameters vary among different vehicle types, allowing for an accurate simulation of real-world energy consumption during the development stage. The early identification of potential issues in energy consumption performance ultimately provides valuable feedback for product improvement. The knowledge-enhanced data-driven approach presents the correlation of EV energy components in the knowledge graph with expert help, where nodes represent components (e.g., battery, motor, and air conditioner) and edges capture their dependencies (e.g., energy flow). If a component fails, a data-driven GNN [47], leveraging feature propagation and aggregation between nodes and their neighbors, can trace the connections to identify the root cause, such as linking abnormal motor performance to upstream issues like battery instability or inverter faults. Expert knowledge can be used to refine the construction of knowledge graphs by enhancing node attributes and edge properties within the energy domain [48]. Specifically, for attributes of each component node, knowledge-enhanced feature engineering helps select the most critical features for energy anomalies. Besides, with the help of experts, the properties of edges not only depict energy flow but also reflect the mechanistic influences and relational weights between components. A more pronounced example of integrating a mechanism-driven, data-driven, and knowledge-enhanced approach is charging optimization with cutting-edge multi-agent reinforcement learning. Specifically, each user can be viewed as an agent, and data-driven methods are applied to recommend optimal charging strategies for these agents based on user behavior data. The mechanism-driven models of calculating charging stations’ load balancing serve as the reward model. In a knowledge-enhanced way, user satisfaction degree is integrated into the learning process through Reinforcement Learning from Human Feedback [49], allowing for the embedding of expert knowledge (i.e., human evaluation) into the reward structure.
- Step 3: Solution Validation is the feasibility evaluation of constructed solutions before application launch. However, practical challenges arise for traditional methods (e.g., cross-validation) due to the time and labor costs caused by the data availability (e.g., insufficient failure data make the verification of fault diagnosis difficult), label accessibility (e.g., limited labeled samples for cross-validation), and experiment producibility (e.g., battery degradation requiring years to manifest). Therefore, the validation design needs to rely on domain experts to fully consider actual situations (e.g., constructing a simulation environment by domain experts) to address this dilemma.
6. Edge–Cloud Collaborative System Architecture of iEVEM
6.1. EVEM Systems
6.2. Edge–Cloud Collaborative Solution
6.2.1. Edge–Cloud Collaborative Storage
6.2.2. Edge–Cloud Collaborative Computing
- Model Lightweight involves deploying an entire small and efficient model directly on edge devices. In such scenarios, edge devices can independently accomplish tasks without relying on cloud resources, ensuring prompt and robust responses even under poor communication conditions (e.g., vehicles performing in-situ energy-efficient route planning while traveling through a tunnel with limited connectivity). To achieve such lightweight models, techniques, such as model distillation, pruning, and quantization merit further exploration, as they enable the reduction in model complexity while maintaining a sufficient accuracy for real-time applications.
- Model Partition refers to the strategy of splitting parts of a large-scale model between the cloud and edge devices. For example, in energy component fault diagnosis using a GNN, the first few GNN layers are executed at vehicles for extracting shallow features (e.g., local anomalies in the voltage or current). The extracted features are then sent to the cloud, where the remaining layers of GNN are carried out to perform a deeper fault diagnosis, such as identifying root causes. Uploading features instead of massive raw data effectively reduces communication time and thus the response latency. The communication-efficient technologies like traffic compression (e.g., quantization and sampling) are crucial for further minimizing response latency.
- Model Cascade refers to synergy-varisized functional models at the edge device and the cloud server in a staged manner. Take EV fault diagnosis as an example; an EV can perform a quick self-check using a lightweight local model to detect potential anomalies and provide rapid alerts. If the local model identifies an ambiguous or complex fault, the cloud-based large model can be engaged for a more accurate and comprehensive diagnosis. Dynamic cascading (i.e., determining when to involve the cloud model based on task) is conducive to the trade-off between latency and accuracy, adapting to real-time requirements and system constraints effectively.
7. Case Study: Outlier Detection of EV Energy Consumption
7.1. Scenario
7.2. Experimental Setup
7.2.1. Dataset
7.2.2. Metrics
- For evaluating the general performance of iEVEM, the Area Under the Curve (AUC) [50] is adopted as a primary indicator of reliability, which is a widely recognized metric to measure classification performance, particularly in scenarios involving an imbalance between positive and negative samples. Specifically, the AUC is calculated as follows:
- For evaluating the effectiveness and necessity of iEVEM components, the Mean Absolute Percentage Error (MAPE) [51], indicating the energy consumption estimation precision, is used for reflecting reliability. Specifically,
7.2.3. Comparatives
7.3. Implementation
7.3.1. The Implementation of Data Intelligence Architecture
- The Physical Layer. The application of detecting outlier vehicles in energy consumption primarily involves data collection from onboard sensors in service vehicles. As introduced in the datasets, vehicles collect 638 dimensional data every second, and these data record various states and information of the vehicle’s operation. It is worth noting that if data from other sources are introduced, such as road information (e.g., road grade), environmental information (e.g., weather), and traffic conditions (e.g., congestion), it would be helpful. However, considering the currently available data, we only focus solely on the onboard sensor data from vehicles.
- The Data Layer. According to the domain knowledge, the data dimensional can be roughly divided into six categories: (1) basic information, such as collection date, vehicle identity number, and battery type, (2) vehicle statuses, such as speed, location, and temperature inside the car, (3) battery statuses, such as state-of-charge, current, and voltage, (4) appliance status like the current and voltage of the motor, air conditioning, and lights, (5) failure statuses comprise the indication of all components’ fault, (6) mechanical information such as seat angle, tire pressure, and others. Based on the expert labeling, 74 attributes were selected from the original 638 dimensions, with empirically irrelevant attributes to energy consumption (e.g., the group of failure statuses and mechanical information) being systematically eliminated. Besides, 49 additional features (e.g., acceleration derived from velocity and time) were constructed based on 74 attributes with essential physical and statistical laws.
- The Algorithm Layer. As for the algorithm design, referring to the expert business understanding, a two-step solution was constructed, comprising a regression sub-task of rational energy consumption estimation with extreme gradient boosting (i.e., XGBoost [52]) and a classification sub-task of outlier detection with a Gaussian distribution instead of conventional unsupervised one-step methods. In the first step, XGBoost operates by aggregating multiple decision trees to produce the final prediction, where each tree is trained to fit the residuals of the previous tree. Assuming there are K trees in total, given each vehicle’s input features x obtained from the data layer, sub-task can be approximated as follows:In the second step, the difference between the actual energy consumption value y and the rational value determines the vehicle’s degree of deviation :In practice, energy consumption deviations are normal and permissible. It is only when the deviation exceeds a certain threshold that vehicles may be identified as outliers. Therefore, the sub-task can be summarized as an indicator function:
7.3.2. The Implementation of Edge–Cloud Collaborative System Architecture
- Edge–cloud collaborative storage. Data were collected in real-time from each vehicle and initially stored locally. Due to local storage limitations and the allowance for data sharing between enterprises and their vehicles (e.g., through data-sharing agreements), local data would be uploaded for centralized storage backup when vehicle-enterprise network resources are available. Only data from a certain time frame are retained locally and are periodically overwritten. Nevertheless, data privacy should be protected between different enterprises and third parties (i.e., governments), that is, the aggregated data within each enterprise must be strictly stored locally to prevent any potential privacy breaches.
- Edge–cloud collaborative computing. For online model inference, an edge–cloud collaborative prototype was constructed with a Jetson Nano serving as the edge device (representing the EV’s on-site computer) and an NVIDIA 2080 Ti acting as the cloud server (representing the enterprise cloudlet). The 10 Mbps edge–cloud bandwidth followed the LTE standard. The edge–cloud communication was configured with a 10 Mbps bandwidth, adhering to the LTE standard, to simulate realistic vehicle-enterprise network conditions. In this setup, a model cascade was employed for efficient edge–cloud collaboration. Specifically, the rational energy consumption estimation sub-task was deployed on the edge device to process local data and minimize the need for massive raw data uploads, thereby reducing bandwidth usage. The cloud server, in turn, aggregated the energy consumption deviations reported by multiple edges and performed a centralized outlier detection sub-task using Flink. The collaboration ensures a balance between local processing efficiency and cloud-level computational scalability, meeting the requirements of real-time and large-scale EVEM. Among them, the first task requires a large amount of data for training. For offline model training, we simulate the process among multiple enterprises based on our cloud server. Without loss of generality, we consider three enterprises and a trusted third party (e.g., government) in our settings, where the enterprises represent edges and the government acts as the cloud. Since the training time does not affect application performance, we focus solely on the training accuracy, disregarding communication conditions. The objective function L of XGBoost is expressed as follows:
7.4. Main Results
7.4.1. The General Performance
7.4.2. Ablation Experiments
- The Impact of Knowledge-enhanced Approach. We evaluated the impact of the data intelligence architecture by the MAPE of rational energy consumption estimation with different data processing and analysis, i.e., mechanism-driven and data-driven methods. The mechanism-driven method is built upon vehicle dynamics referring to [51], i.e., an analytical formulation of vehicle velocity and road grade, which is a representative work for EV energy consumption calculation. Specifically, the energy consumption E is calculated by the integration of power P over time t, i.e.,Following the formula derivation in the literature, the energy consumption estimation model used in our case study is expressed as follows:
- The Impact of Edge–cloud Collaborative Deployment. We first compared the performance differences between federated learning and centralized training. As illustrated in Figure 6, the loss function and validation set performance during the training process are depicted, with the x-axis representing the number of iterations and the y-axis displaying the mean squared error and MAPE, respectively. In each iteration, a tree is generated, with a maximum of 50 trees utilized in our settings. As illustrated in Figure 6a, it is evident that federated learning converges more slowly and exhibits greater fluctuations compared to centralized training. However, this does not adversely affect the model’s performance after convergence. As shown in Figure 6b, the results indicate that federated learning can achieve a performance comparable to that of centralized training while preserving privacy. Then, we evaluated the impact of the edge–cloud collaborative system architecture on the E2E latency of outlier detection with conventional cloud computing. Shown in Figure 5b, the E2E latency of iEVEM is significantly lower where the identical two-step model is adopted. Specifically, the E2E latency of edge–cloud collaborative deployment is approximately 185 ms, which is significantly lower compared to the 685 ms observed in the cloud-based deployment. It is worth noting that the collaborative scheme reduces traffic by more than compared to the cloud-based scheme. The reduction is attributed to the transformation of the raw data into energy consumption values at the edge of the proposed two-step model. Hence, the edge–cloud collaboration can effectively reduce the traffic and thus E2E latency, enabling efficient EVEM.
8. Open Issues
- Multimodal Data Fusion for EVEM: In addition to the structured data discussed, incorporating broader and more diverse data modalities [28] should be considered to further enhance the effectiveness and accuracy of intelligent EVEM. For instance, integrating the visual data and point-cloud data of the road environment can provide richer contextual information, facilitating more precise vehicle energy consumption modeling and prediction. Developing efficient approaches for subtle multimodal data fusion remains a critical challenge.
- Automatic EVEM Knowledge Embedding: A simple attempt at knowledge-enhanced modeling is proven to be effective in this article. However, automated knowledge embedding is essential for handling the vast, diverse, and ever-changing EVEM knowledge. For example, integrating new findings in battery materials or regularly revised energy management standards will require a systematic and automated approach. Nevertheless, achieving such a unified, automatic, and scalable knowledge embedding mechanism poses significant technical challenges and demands further investigation.
- Dynamic Resource Management of EVEM Systems: Given the dynamic and often unpredictable nature of EVEM system resources (e.g., vehicle-to-cloud communication may degrade significantly inside tunnels or during network congestion), developing an agile platform for dynamic resource and scheme management is critical. For example, such a platform could enable seamless switching from in-situ energy-efficient route planning to cloud-based solutions when exiting tunnels or encountering better network conditions. Addressing this issue effectively will require novel strategies to adapt EVEM operations to varying resource availability in real time.
9. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Heidrich, O.; Dissanayake, D.; Lambert, S.; Hector, G. How cities can drive the electric vehicle revolution. Nat. Electron. 2022, 5, 11–13. [Google Scholar] [CrossRef]
- Simpkins, G. Benefits of electric vehicle adoption. Nat. Rev. Earth Environ. 2023, 4, 432. [Google Scholar] [CrossRef]
- Böhm, M.; Nanni, M.; Pappalardo, L. Gross polluters and vehicle emissions reduction. Nat. Sustain. 2022, 5, 699–707. [Google Scholar] [CrossRef]
- Husain, I.; Ozpineci, B.; Islam, M.S.; Gurpinar, E.; Su, G.J.; Yu, W.; Chowdhury, S.; Xue, L.; Rahman, D.; Sahu, R. Electric drive technology trends, challenges, and opportunities for future electric vehicles. Proc. IEEE 2021, 109, 1039–1059. [Google Scholar] [CrossRef]
- Herberz, M.; Hahnel, U.J.; Brosch, T. Counteracting electric vehicle range concern with a scalable behavioural intervention. Nat. Energy 2022, 7, 503–510. [Google Scholar] [CrossRef]
- Zhang, J.; Wang, Y.; Jiang, B.; He, H.; Huang, S.; Wang, C.; Zhang, Y.; Han, X.; Guo, D.; He, G.; et al. Realistic fault detection of li-ion battery via dynamical deep learning. Nat. Commun. 2023, 14, 5940. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.; Luo, W.; Xu, S.; Yan, Y.; Huang, L.; Wang, J.; Hao, W.; Yang, Z. Electric Vehicle Lithium-Ion Battery Fault Diagnosis Based on Multi-Method Fusion of Big Data. Sustainability 2023, 15, 1120. [Google Scholar] [CrossRef]
- Kim, D.; Shim, H.G.; Eo, J.S. A machine learning method for ev range prediction with updates on route information and traffic conditions. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 28 February–1 March 2022; Volume 36, pp. 12545–12551. [Google Scholar]
- Shen, H.; Zhou, X.; Ahn, H.; Lamantia, M.; Chen, P.; Chen, P. Personalized Velocity and Energy Prediction for Electric Vehicles with Road Features in Consideration. IEEE Trans. Transp. Electrif. 2023, 9, 3958–3969. [Google Scholar] [CrossRef]
- Cuchỳ, M.; Vokřínek, J.; Jakob, M. Multi-Objective Electric Vehicle Route and Charging Planning with Contraction Hierarchies. In Proceedings of the International Conference on Automated Planning and Scheduling, Banaff, AB, Canada, 1–6 June 2024; Volume 34, pp. 114–122. [Google Scholar]
- Zhang, Y.; Yin, Z.; Xiao, H.; Luo, F. Coordinated Planning of EV Charging Stations and Mobile Energy Storage Vehicles in Highways With Traffic Flow Modeling. IEEE Trans. Intell. Transp. Syst. 2024, 25, 21572–21584. [Google Scholar] [CrossRef]
- Khiari, J.; Olaverri-Monreal, C. Uncertainty-Aware Vehicle Energy Efficiency Prediction Using an Ensemble of Neural Networks. IEEE Intell. Transp. Syst. Mag. 2023, 15, 109–119. [Google Scholar] [CrossRef]
- Zhu, Z.; Chen, W.; Xia, R.; Zhou, T.; Niu, P.; Peng, B.; Wang, W.; Liu, H.; Ma, Z.; Gu, X.; et al. Energy forecasting with robust, flexible, and explainable machine learning algorithms. AI Mag. 2023, 44, 377–393. [Google Scholar] [CrossRef]
- Du, P.; Xiao, T.; Chakraborty, C.; Cao, H.; Alfarraj. Energy-efficient UAVs and BSs management in distributed Edge intelligence empowered IoV networks. IEEE Internet Things J. 2024. Early Access. [Google Scholar] [CrossRef]
- Al-Turjman, F.; Altrjman, C. Enhanced Medium Access for Traffic Management in Smart-Cities’ Vehicular-Cloud. IEEE Intell. Transp. Syst. Mag. 2021, 13, 273–280. [Google Scholar] [CrossRef]
- Yan, G.; Liu, K.; Liu, C.; Zhang, J. Edge Intelligence for Internet of Vehicles: A Survey. IEEE Trans. Consum. Electron. 2024, 70, 4858–4877. [Google Scholar] [CrossRef]
- Zhou, Z.; Yu, H.; Xu, C.; Chang, Z.; Mumtaz, S.; Rodriguez, J. BEGIN: Big data enabled energy-efficient vehicular edge computing. IEEE Commun. Mag. 2018, 56, 82–89. [Google Scholar] [CrossRef]
- Yin, L.; Luo, J.; Qiu, C.; Wang, C.; Qiao, Y. Joint task offloading and resources allocation for hybrid vehicle edge computing systems. IEEE Trans. Intell. Transp. Syst. 2024, 55, 10355–10368. [Google Scholar] [CrossRef]
- Li, B.; Kisacikoglu, M.C.; Liu, C.; Singh, N.; Erol-Kantarci, M. Big data analytics for electric vehicle integration in green smart cities. IEEE Commun. Mag. 2017, 55, 19–25. [Google Scholar] [CrossRef]
- So, D.; Oh, J.; Jeon, I.; Moon, J.; Lee, M.; Rho, S. BiGTA-Net: A Hybrid Deep Learning-Based Electrical Energy Forecasting Model for Building Energy Management Systems. Systems 2023, 11, 456. [Google Scholar] [CrossRef]
- Yang, H.; Zheng, K.; Zhang, K.; Mei, J.; Qian, Y. Ultra-reliable and low-latency communications for connected vehicles: Challenges and solutions. IEEE Netw. 2020, 34, 92–100. [Google Scholar] [CrossRef]
- Khamfroush, H. Resource-aware Federated Data Analytics in Edge–Enabled IoT Systems. In Proceedings of the AAAI Symposium Series, Arlington, VA, USA, 7–9 November 2024; Volume 3, p. 305. [Google Scholar]
- Zhang, X.; Mavromatis, A.; Vafeas, A.; Nejabati, R.; Simeonidou, D. Federated Feature Selection for Horizontal Federated Learning in IoT Networks. IEEE Internet Things J. 2023, 10, 10095–10112. [Google Scholar] [CrossRef]
- Liu, Y.; Kang, Y.; Zou, T.; Pu, Y.; He, Y.; Ye, X.; Ouyang, Y.; Zhang, Y.Q.; Yang, Q. Vertical Federated Learning: Concepts, Advances, and Challenges. IEEE Trans. Knowl. Data Eng. 2024, 10, 3615–3634. [Google Scholar] [CrossRef]
- IEA. Global EV Outlook 2024. Available online: https://www.iea.org/reports/global-ev-outlook-2024 (accessed on 10 February 2025).
- Hu, X.; Gao, F.; Xiao, Y.; Wang, D.; Gao, Z.; Huang, Z.; Ren, S.; Jiang, N.; Wu, S. Advancements in the safety of Lithium-Ion Battery: The trigger, consequence and mitigation method of thermal runaway. Chem. Eng. J. 2024, 481, 148450. [Google Scholar] [CrossRef]
- Peng, R.; Tang, J.H.C.G.; Yang, X.; Meng, M.; Zhang, J.; Zhuge, C. Investigating the factors influencing the electric vehicle market share: A comparative study of the European Union and United States. Appl. Energy 2024, 355, 122327. [Google Scholar] [CrossRef]
- Lin, M.; You, Y.; Meng, J.; Wang, W.; Wu, J.; Stroe, D.I. Lithium-ion batteries SOH estimation with multimodal multilinear feature fusion. IEEE Trans. Energy Convers. 2023, 38, 2959–2968. [Google Scholar] [CrossRef]
- Zhang, J.; Huang, C.; Chow, M.Y.; Li, X.; Tian, J.; Luo, H.; Yin, S. A data-model interactive remaining useful life prediction approach of lithium-ion batteries based on PF-BiGRU-TSAM. IEEE Trans. Ind. Informatics 2023, 20, 1144–1154. [Google Scholar] [CrossRef]
- Cao, Y.; Yi, J.; Liu, Y.; Zhao, C.; Li, D.; Zhang, Y.; Han, Z. Joint Routing and Charging Optimization of Electric Passenger Vehicles With Uninterruptible Charging Service. Chem. Eng. J. 2024, 11, 18180–18192. [Google Scholar] [CrossRef]
- Wang, R.; Wang, H.; Zhu, K.; Yi, C.; Wang, P.; Niyato, D. Mobile Charging Services for the Internet of Electric Vehicles: Concepts, Scenarios, and Challenges. IEEE Veh. Technol. Magazine. 2023, 18, 110–119. [Google Scholar] [CrossRef]
- Zhu, Q.; Huang, Y.; Lee, C.F.; Liu, P.; Zhang, J.; Wik, T. Predicting electric vehicle energy consumption from field data using machine learning. IEEE Trans. Transp. Electrif. 2024, 11, 2120–2132. [Google Scholar] [CrossRef]
- McGovern, M.E.; Bruder, D.D.; Huemiller, E.D.; Rinker, T.J.; Bracey, J.T.; Sekol, R.C.; Abell, J.A. A review of research needs in nondestructive evaluation for quality verification in electric vehicle lithium-ion battery cell manufacturing. J. Power Sources 2023, 561, 232742. [Google Scholar] [CrossRef]
- Xu, L.; Wu, F.; Chen, R.; Li, L. Data-driven-aided strategies in battery lifecycle management: Prediction, monitoring, and optimization. Energy Storage Mater. 2023, 59, 102785. [Google Scholar] [CrossRef]
- Zhang, Z.; Li, J.; Guan, D. Value chain carbon footprints of Chinese listed companies. Nat. Commun. 2023, 14, 2794. [Google Scholar] [CrossRef]
- Liu, Y.; Zhang, C.; Hao, Z.; Cai, X.; Liu, C.; Zhang, J.; Wang, S.; Chen, Y. Study on the life cycle assessment of automotive power batteries considering multi-cycle utilization. Energies 2023, 16, 6859. [Google Scholar] [CrossRef]
- Kalakanti, A.K.; Rao, S. Charging Station Planning for Electric Vehicles. Systems 2022, 10, 6. [Google Scholar] [CrossRef]
- Zhao, H.; Wu, H.; Lu, N.; Zhan, X.; Xu, E.; Yuan, Q. Lane Changing in a Vehicle-to-Everything Environment: Research on a Vehicle Lane-Changing Model in the Tunnel Area by Considering the Influence of Brightness and Noise Under a Vehicle-to-Everything Environment. IEEE Intell. Transp. Syst. Mag. 2023, 15, 225–237. [Google Scholar] [CrossRef]
- Xiang, C.; Feng, C.; Xie, X.; Shi, B.; Lu, H.; Lv, Y.; Yang, M.; Niu, Z. Multi-Sensor Fusion and Cooperative Perception for Autonomous Driving: A Review. IEEE Intell. Transp. Syst. Mag. 2023, 15, 36–58. [Google Scholar] [CrossRef]
- Hahn, D.; Munir, A.; Behzadan, V. Security and Privacy Issues in Intelligent Transportation Systems: Classification and Challenges. IEEE Intell. Transp. Syst. Mag. 2021, 13, 181–196. [Google Scholar] [CrossRef]
- Kwade, A.; Haselrieder, W.; Leithoff, R.; Modlinger, A.; Dietrich, F.; Droeder, K. Current status and challenges for automotive battery production technologies. Nat. Energy 2018, 3, 290–300. [Google Scholar] [CrossRef]
- ISO 12405; Electrically Propelled Road Vehicles—Test Specification for Lithium-Ion Traction Battery Packs and Systems—Part 4: Performance Testing. ISO: Geneva, Switzerland, 2018.
- GB/T 32960; Electric Vehicle Battery Management System. Standardization Administration of China: Beijing, China, 2016.
- UL 1974; The Standard for Evaluation for Repurposing Batteries. UL: Northbrook, IL, USA, 2018.
- Li, X.; Wang, Z.; Yan, J. Prognostic health condition for lithium battery using the partial incremental capacity and Gaussian process regression. J. Power Sources 2019, 421, 56–67. [Google Scholar] [CrossRef]
- Zhang, J.; Wang, W.; Guo, S.; Wang, L.; Lin, F.; Yang, C.; Yin, W. Solving general natural-language-description optimization problems with large language models. arXiv 2024, arXiv:2407.07924. [Google Scholar]
- Wang, D.; Chen, Z.; Ni, J.; Tong, L.; Wang, Z.; Fu, Y.; Chen, H. Hierarchical graph neural networks for causal discovery and root cause localization. arXiv 2023, arXiv:2302.01987. [Google Scholar]
- Wehner, C.; Kertel, M.; Wewerka, J. Interactive and intelligent root cause analysis in manufacturing with causal Bayesian networks and knowledge graphs. In Proceedings of the 2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring), Florence, Italy, 20–23 June 2023. [Google Scholar] [CrossRef]
- Li, Z.; Yang, Z.; Wang, M. Reinforcement learning with human feedback: Learning dynamic choices via pessimism. arXiv 2023, arXiv:2305.18438. [Google Scholar]
- Han, S.; Hu, X.; Huang, H.; Jiang, M.; Zhao, Y. Adbench: Anomaly detection benchmark. Adv. Neural Inf. Process. Syst. 2022, 35, 32142–32159. [Google Scholar] [CrossRef]
- Zhang, J.; Wang, Z.; Liu, P.; Zhang, Z. Energy consumption analysis and prediction of electric vehicles based on real-world driving data. Appl. Energy 2020, 275, 115408. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. Secureboost: Xgboost: A scalable tree boosting system. IEEE Intell. Syst. 2016, 7, 785–794. [Google Scholar] [CrossRef]
- Cheng, K.; Fan, T.; Jin, Y.; Liu, Y.; Chen, T.; Papadopoulos, D.; Yang, Q. Secureboost: A lossless federated learning framework. IEEE Intell. Syst. 2021, 36, 87–98. [Google Scholar] [CrossRef]
Real-World Data (R = 1%, D = 5) | Deviation Degree (R = 1%) | Injection Ratio (D = 5) | E2E Latency (ms) | |||
---|---|---|---|---|---|---|
Hard ( = 3) | Easy ( = 7) | Hard ( = 0.01%) | Easy ( = 10%) | |||
KNN | 0.8195 | 0.8181 | 0.8219 | 0.7851 | 0.8372 | 27560 |
CBLOF | 0.7304 | 0.7147 | 0.7402 | 0.6353 | 0.7854 | 568 |
IForest | 0.7185 | 0.6755 | 0.7447 | 0.6431 | 0.7865 | 694 |
ECOD | 0.5303 | 0.5184 | 0.5484 | 0.5297 | 0.5389 | 9651 |
DSVDD | 0.5000 | 0.4996 | 0.5000 | 0.4998 | 0.5000 | 675 |
iEVEM | 0.9644 | 0.9467 | 0.9748 | 0.9591 | 0.9668 | 185 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Guo, S.; Zhao, C. iEVEM: Big Data-Empowered Framework for Intelligent Electric Vehicle Energy Management. Systems 2025, 13, 118. https://doi.org/10.3390/systems13020118
Guo S, Zhao C. iEVEM: Big Data-Empowered Framework for Intelligent Electric Vehicle Energy Management. Systems. 2025; 13(2):118. https://doi.org/10.3390/systems13020118
Chicago/Turabian StyleGuo, Siyan, and Cong Zhao. 2025. "iEVEM: Big Data-Empowered Framework for Intelligent Electric Vehicle Energy Management" Systems 13, no. 2: 118. https://doi.org/10.3390/systems13020118
APA StyleGuo, S., & Zhao, C. (2025). iEVEM: Big Data-Empowered Framework for Intelligent Electric Vehicle Energy Management. Systems, 13(2), 118. https://doi.org/10.3390/systems13020118