Review of Various Machine Learning Approaches for Predicting Parameters of Lithium-Ion Ba tt eries in Electric Vehicles

: Ba tt ery management systems (BMSs) play a critical role in electric vehicles (EVs), relying heavily on two essential factors: the state of charge (SOC) and state of health (SOH). However, accurately estimating the SOC and SOH in lithium-ion (Li-ion) ba tt eries remains a challenge. To address this, many researchers have turned to machine learning (ML) techniques. This study provides a comprehensive overview of both BMSs and ML, reviewing the latest research on popular ML methods for estimating the SOC and SOH. Additionally, it highlights the challenges involved. Beyond traditional models like equivalent circuit models (ECMs) and electrochemical battery models, this review emphasizes the prevalence of a support vector machine (SVM), fuzzy logic (FL), k-nearest neighbors (KNN) algorithm, genetic algorithm (GA), and transfer learning in SOC and SOH estimation.


Introduction
The transportation sector significantly impacts carbon emissions and must seek to reduce its environmental footprint.Governments worldwide have established CO2 emission reduction targets for transportation [1].Many major automobile-producing nations have also committed to reducing CO2 emissions.Over the past decade, fossil fuel usage has declined as renewable energy sources like wind, solar, and hydroelectric power have expanded rapidly.This shift has made lithium-ion (Li-ion) batteries popular due to their high power and energy density, energy efficiency, and extended life cycle.These features make Li-ion batteries suitable for various applications, including portable electronic devices [2], electric vehicles [3], and energy storage systems.However, Li-ion battery performance degrades over time [4], necessitating an understanding of the battery's state of health (SOH) and state of charge (SOC) [5].To estimate these states accurately, battery management system (BMS) software has been developed to manage SOC and SOH while addressing potential risks like overheating and overvoltage [6].Given concerns about battery system safety and performance, a well-designed BMS is essential to prevent battery failure [7].With enhanced battery protection, SOC and SOH estimation, cell balancing, and thermal management [8], batteries can operate more safely and efficiently, with SOC and SOH estimation being crucial components.The SOC represents the battery's current capacity to its full charge, and the SOH signifies the battery's capacity compared to its original nominal capacity when brand new.A Li-ion battery is considered to reach its end of life (EOL) when its capacity decreases.
The research and development of battery state estimation methods have been significantly improved with the recent advancements in artificial intelligence (AI) and machine learning (ML) [9].Estimating the states of batteries has gained some interest, and this is primarily attributed to the widespread adoption of data-driven approaches, particularly those based on ML.This popularity is driven by the greater availability of battery data and advancements in computing power capabilities.
During operation, the complex chemical processes within a battery pose a significant challenge for parameter measurement and prediction.Long-term degradation prediction is especially challenging.While physics-based (PB) models can capture electrochemical cycling behavior [10,11] and have incorporated specific capacity degradation mechanisms, many degradation modes remain poorly understood, and existing models suffer from poor identifiability of physical mechanisms [12].
Contrary to traditional model-based approaches, ML models identify high-dimensional patterns in data, independent of underlying physical processes.This makes their study unaffected by hard-to-study factors, making ML less favorable for battery state estimation [13].However, ML model-based methods encounter certain challenges.ML models may have more parameters than physics-based models, necessitating extensive training data.They can also face issues when generalizing to unseen scenarios.Learning from available data may require dimensionality reduction and problem-specific feature engineering [14].It is argued that, without considering physical processes, ML is unlikely to yield highly accurate health forecasting for situations far beyond available data.Combining ML models with other models like PB can yield improved results.This paper aims to comprehensively review recent machine learning applications in estimating the SOC and SOH for batteries.It is a valuable resource for researchers seeking to employ machine learning techniques to address battery-related challenges.
The subsequent sections of this paper are organized as follows.Section 2 covers relevant aspects of Li-ion batteries for EVs, including the BMS, SOC, and SOH.While this may not directly pertain to machine learning applications, it offers essential background for researchers new to the field.Section 3 provides a separate review of ML methods and their applications in SOC and SOH estimation.This includes a general introduction to ML, its historical development, five main categories, common approaches, and their applications.Section 4 outlines future challenges and developments based on current research.Finally, Section 5 concludes the paper.

Li-Ion Battery for Electric Vehicles
Numerous battery types have been utilized, with the lithium-ion battery (LIB) [15,16] gaining attention from both research communities and the automotive industries due to its exceptional attributes, including its high energy density, low self-discharge rate, elevated voltage, extensive lifespan, remarkable reliability, and rapid recharging capabilities.To maintain safe operating conditions, a battery management system (BMS) is employed, responsible for determining optimal charging and discharging times.Figure 1 illustrates the typical structure and functions of the BMS, which involve collecting operational data from the battery pack, processing this information to estimate the battery's current state, and subsequently regulating the battery in alignment with the estimated parameters.A BMS must stay robust over its life cycle.But, predicting battery states in modern EVs is tough due to lithium-ion batteries' nonlinear behavior.These rechargeable batteries react swiftly to temperature, charge, discharge, and DoD changes.Aging induces significant nonlinear shifts in battery traits.A battery pack comprises cells, whose performance degrades over time, affecting overall pack performance.Regular monitoring is essential for functionality.Techniques for assessing Li-ion battery states include the SOC, SOH, RUL, capacity, impedance, and available power.Estimating SOC and SOH is more intricate than other parameters [17][18][19][20].

State of Charge (SOC)
A battery management system extends a battery lifespan by monitoring overcharging and over-discharging.Accurate assessment of battery cell capacity ensures timely replacement.Cell health is evaluated by comparing initial and current capacity.The SOC is crucial for the SOH calculation, cell balancing, and power management.It resembles a fuel gauge but is not directly measurable.While Equation ( 1) is the standard, some studies use modified versions.Coulomb efficiency measures the ratio of discharging to charging capacity.The SOC dynamics equation, based on Coulomb counting, is as follows [21].
where SOC(t) is the SOC at time t, SOC(t0) is the SOC at time t0, I(t) is the discharge current from t0 to t, η is the Coulomb efficiency, and Qfull is the capacity of the battery.

State of Health (SOH)
The state of health (SOH) refers to a battery's capacity when fully charged in its current cycle compared to when new.Over time, the battery's capacity diminishes, a process called degradation.Research shows that lithium-ion batteries degrade gradually at first, then rapidly later, with the point of rapid degradation defined as the Knee-point (Figure 2) [22].Accurate SOH prediction helps anticipate the Knee-point and mitigate premature battery failures.The main objective of SOH estimation is to track Li-ion battery aging over time, with multiple approaches providing valuable insights.Battery capacity directly indicates performance.The term "health" can be unclear, leading to alternative definitions in some studies [23].For example, instead of measuring capacity, which can be challenging, some research defines health based on the easier measurement of electric charge or discharge power.
where  is the electric quantity of charge or discharge of the battery in its current state, and  is the nominal electric quantity of charge or discharge of the battery when it is first installed.The SOH can, to some extent, reflect battery aging, with the primary cause being an increase in internal resistance.Therefore, it can be defined based on internal resistance.
where  is the internal resistance of the battery at its end of life (EOL),  is the internal resistance of the battery in its current state, and  is the internal resistance of the battery when it was first installed.However, it is challenging to measure precisely as the internal resistance is easily influenced by the temperature and SOC.
Estimating the remaining useful life (RUL) of a battery is equally crucial as determining its SOH.The RUL represents the cumulative charge and discharge cycles a battery undergoes before reaching its current capacity decay and failure threshold [24].The RUL is closely linked to the SOH and can be defined as follows.
where  is the cycle numbers in the entire life of the battery, and  is its current number of charge and discharge cycles.Although there are some studies on the RUL separately, most studies combine the RUL and SOH [25].The SOH is predicted first, followed by the RUL, based on the SOH, where the RUL is typically referred to as the quantity of charge or discharge cycles until the battery's SOH is at 0%.
Various metrics, like capacity and resistance, indicate a battery's health index (HI) for SOH assessment.Data-driven methods are vital for estimating a battery's SOH and RUL.The choice of the HI impacts prediction accuracy, crucial for SOH and RUL estimation.HIs serve as training data for SOC estimation, either directly (e.g., capacity, resistance) or indirectly (e.g., voltage, current).Fusion models enhance accuracy by combining multiple HIs.Recent research has analyzed HI selection, comparing benefits and drawbacks for SOH and RUL calculation.The literature includes a comprehensive comparison, as summarized in Table 1.Although not the primary focus of this article, the selection of appropriate HIs from the table can facilitate the estimation of the state of charge (SOC) or SOH, including the RUL.


Online monitoring is possible for battery aging


Unable to monitor during application


Multiple factors that affect the aging of battery performance of the battery can be included


Increased computation, requiring preprocessing due to redundant information

Estimation Methods for the SOC and SOH
Different classifications are employed to categorize techniques for estimating the SOC and SOH.Some researchers consolidate these methods into distinct categories that share notable similarities.Figure 3 illustrates that the upcoming section will look into methods associated with machine learning.The direct measurement method is sometimes called the experimental estimation method.The approach mainly depends on experimental data for measurement.It involves using techniques and algorithms with observable battery metrics like voltage, current, and temperature to deduce a battery's state.There are two prevalent techniques for estimating both the SOC and SOH.
The ampere-hour counting method or Coulomb counting method estimates the battery status by cumulatively measuring the charged and discharged current.The estimation of the SOH can be calculated by the following equation: where    represents the total amount of power charged or discharged (the SOC from 0% to 100% or 100% to 0%) in the current cycle, and  is the rated charge of the battery.
This approach offers advantages like minimal processing power demands and easy integration.Nevertheless, it does not account for alterations in a battery's internal state.When current measurement becomes inaccurate due to calibration and measurement issues, it leads to errors in the SOC.
On the other hand, open circuit voltage (OCV) estimates the SOC based on the open circuit voltage for an extended period.Relaxation is a somewhat constant function of the SOC.In this method, the open circuit voltage  can be defined as follows.

𝑈 𝑈 𝐼𝑅
where  is the OCV, U is the terminal voltage, I is the battery current, and R is the battery's internal voltage.
The process is simple and feasible but unsuitable for real-world usage due to the lengthy resting period required during testing.While the open circuit voltage remains consistent, the battery's actual state of charge (SOC) and state of health (SOH) differ when it operates at varying temperatures [26].Nevertheless, internal resistance is a variable feature of a battery's internal chemistry, which depends on the battery capacity.Generally, as the SOC and SOH decrease, the internal resistance tends to rise.Hence, precise SOC and SOH estimation can be grounded in a reliable measurement of internal resistance [26].Ohm's law can be employed to estimate a battery's internal resistance based on changes in the voltage curve.
where ∆U and ∆I are the step changes in voltage and current, respectively.The current internal resistance value can be obtained by measuring the voltage and current in different current pulse methodologies.The ohmic internal resistance is typically low and exhibits minimal variation during charge and discharge cycles.However, it is sensitive to external influences, making precise measurement challenging.Furthermore, certain studies have highlighted the temperature's substantial influence on internal resistance growth [27].Apart from the previously discussed methods, the direct measurement approach encompasses additional techniques such as electrochemical impedance spectroscopy (EIS), destructive testing, and ultrasonic measurements.Nonetheless, many of these techniques are primarily suitable for laboratory use and offer limited practical utility in real-world scenarios.
The model-based method has been widely used in the literature to model measured battery parameters.The model-based approach usually involves two elements: filtering and modeling.This approach estimates the battery state by identifying model parameters.Estimation accuracy is primarily influenced by two factors: the model's accuracy and the chosen filtering method [28].
The Kalman filter (KF) is utilized for estimating the state of charge (SOC) in power batteries.It employs the concept of Kalman filtering to determine the current "optimal value" based on the "measured value" from the present instrument and the "predicted amount" from the previous hour.The prediction and measurement errors are often referred to as noise.The Kalman filter can be divided into three segments: (1) prediction of the current state based on the estimated state from the previous moment, encompassing state prediction and prediction of the random state distribution; (2) computation of the Kalman gain; and (3) updating of the current predicted state and predicted state distribution using the measured value and Kalman gain [29].Initially applied to linear systems, the Kalman filter has seen modifications such as the extended Kalman filter (EKF) [30], dual extended Kalman filter (DEKF) [31], unscented Kalman filter (UKF) [32], and others in the various literature for SOC and SOH estimation.The advantages of the Kalman filter method include real-time computer data processing, applicability to nonlinear systems, and positive effects on predicting the charge state of electric vehicles during operation.Its primary limitation is its reliance on the accuracy of the battery model, necessitating the development of a reliable battery model to enhance precision.The KF is a valuable tool for handling state estimation problems with Gaussian noise.However, it has limitations when dealing with non-Gaussian state and measurement noise in battery operation.
Particle filters (PFs) and their variations are a more suitable choice for nonlinear systems.A PF [33], based on the Monte Carlo method, employs particle sets to represent probabilities and is applicable to various state-space models.It sequentially samples random state particles from posterior probabilities, involving four steps: prediction, weight update, resampling, and output.Advancements in PFs have yielded positive results in state of charge (SOC) and state of health (SOH) estimation.Its strength lies in handling nonlinear, non-Gaussian systems.However, challenges remain.One major concern is the need for a substantial number of samples to accurately approximate the system's posterior probability density.Additionally, the resampling stage can lead to a loss of sample diversity, resulting in sample depletion issues.
An H-infinity filter (HIF) addresses sensitivity to model errors and the assumption of noise statistics in KF-based algorithms.By introducing the H-infinity norm [34] and boundary condition, an HIF mitigates these issues.An HIF establishes a sub-filter to limit uncertainty and model perturbation effects, reducing estimation errors even in worst-case scenarios without specifying disturbances or model uncertainties.Performance depends on design specifications.Research suggests that an HIF is a straightforward and highly reliable solution for time-variant systems, e.g., SOC and SOH estimation in LIBs [35].Compared to the Kalman filter, the HIF offers robustness improvements, making it a robust Kalman filter variant.In SOC estimation, the HIF outperforms [36].Furthermore, models play a vital role in model-based estimation.The equivalent circuit model (ECM) and electrochemical model (EM) are two key models.Here is a brief overview of these essential elements.
The equivalent circuit model (ECM) is also be referred to as the electrical battery model or electrical equivalent circuit model (EECM) [37].Since ECMs are commonly used, they become quite popular for machine learning methods.Figure 4 provides a brief explanation of several typical ECMs.Each ECM uses the same basic components: an ideal voltage source, a series resistance, and one or more groups of resistor-capacitors (RCs) that are wired in series with the resistance.UL and RL are used to represent the voltage and resistance of the external load circuit.
1.The battery's internal resistance equivalent model is simple to use.The open circuit voltage of the battery (OCV, denoted as UOC), which is represented by the ideal voltage source, is a crucial input parameter used in a variety of estimation techniques.When using a constant charge or discharge current I(t), the battery's internal resistance, also known as the series or ohmic resistance (abbreviated as R0), increases.2. Rp-Cp pairs (known as RCs), in which the resistance is always connected in parallel with the capacitance, are used for modeling the battery behavior under dynamic operating conditions.Sometimes, multiple sets of RCs are connected in series to improve accuracy.For example, when there are two RCs in series, the model is called a second-order RC ECM. 3. The resistance of the battery differs during charging and discharging, and if the effect of this factor is considered, a resistor and diode in series will be used.In this way, only one resistor will work in the ECM during charging and discharging.For example, the commonly used RC model can be expressed as the following: where  is the voltage drop across the RC pair.In many cases, enhancing simulation accuracy with more intricate models comes at the cost of complicating parameter identification.Generally, many studies deem a firstorder RC model or a second-order RC model adequate [38,39].Electrochemical models (EMs) depict the underlying processes within a battery's operation, striving to replicate the chemical reactions occurring in a Li-ion battery during charging, discharging, and resting periods.An EM, with its ability to capture electrochemical kinetics, material properties, and ion transport within the cell, sometimes surpasses an ECM in providing a more comprehensive view of the cell's internal state.EM models fall into four primary categories [40]: the pseudo-two-dimensional (P2D) model [41], single particle (SP) model [42], enhanced single particle (ESP) model [43], and multiphysics coupling (MC) model [44].
The EM model, compared to ECM, accurately tracks battery changes across diverse conditions and enables continuous monitoring throughout its lifespan.However, its complexity stems from intricate partial differential equations describing ion motion and chemical reactions, including solid-phase and electrolyte-phase electric potential, electrochemical kinetics, and electrolyte ionic diffusivity [45].Various models exist, such as CC-CV charge, OCV mathematical, exponential, polynomial, and ensemble models.Life cycle models offer an alternative for state of health (SOH) prediction, based on extensive offline tests, distinct from real-time parameter-based methods [46][47][48].

SOC and SOH Estimation using Machine Learning
Machine learning methods like Coulomb counting estimate the SOC by tracking current flow in/out of the battery and are prone to errors and reliant on the initial SOC.The OCV method correlates battery voltage with the SOC using OCV-SOC curves.But the mapping accuracy is crucial and susceptible to temperature changes.Kalman filters blend measurements with system dynamics, adapting to conditions, but require a battery model and intensive computation.ML models, including neural networks and regression, learn SOC patterns from data and are versatile but require labeled training data and face complexity/overfitting challenges.For the SOH, the capacity fade rate monitors the capacity decline but may miss early degradation and be affected by cycle counting inaccuracies.Impedance spectroscopy offers insights into internal changes but requires specialized equipment and interpretation.Voltage relaxation measures recovery after load removal and is sensitive to aging but needs precise voltage measurements and is specific to certain degradation modes.Combining methods yields better results by adapting to the battery type, data availability, and application needs.Advancements in computer technology enable the utilization of AI, ML, RL, and DL for SOC and SOH estimation [49].This section introduces machine learning for non-experts, highlighting its learning capability from data [50].As seen in Figure 5, the machine learning encompasses supervised, unsupervised, and reinforcement learning [50].Supervised learning relies on labeled examples for training.It necessitates correct labels for given inputs, enabling the algorithm to make accurate predictions.This can be categorized into regression, predicting a number from an infinite set, and classification, predicting from a discrete set of outputs.Unsupervised learning, on the other hand, operates without predefined output labels.It identifies patterns or structures within the data autonomously.This can include clustering, grouping data without labels, anomaly detection for identifying unusual events, and dimensionality reduction for compressing large datasets while retaining essential information.

Reinforcement learning
Reinforcement learning, inspired by behaviorist psychology, emphasizes online learning and maintains a balance between exploration and exploitation.It does not require prior data but learns from environmental feedback on actions.It is employed to solve problems where an entity learns strategies to maximize rewards or achieve specific goals through interaction.
Consequently, supervised learning methods for regression are paramount.It is noteworthy that classification models derive from regression models, discretizing their output.Therefore, algorithms initially designed for classification, like support vector machines and k-nearest neighbors algorithms, find extensive application in battery state prediction.Pedro Domingos' taxonomy categorizes these algorithms into five tribes, symbolists, connectionists, evolutionaries, Bayesians, and analogizers, each offering distinct approaches applicable across domains and providing foundational theories for other algorithms within their respective tribes, as illustrated in Table 2.  [55] Michael Jordan [56] Connectionists Neuroscience Simulation of the brain's thinking process

Evolutionary biology
Simulating the process of species evolution

Genetic programming
John Holland [60] Hod Lipson [61] Analogizers Psychology The similarity between old and new knowledge Netflix recommendation system Support vector machines, nearest neighbors Peter Hart [62] Vladimir Vapnik [63] Douglas Hofstadter [64] Several significant methods for estimating the SOC and SOH have been introduced.Traditional direct measurement techniques are simple but vulnerable to aging, temperature, sensor drift, and external disturbances.Model-based estimation methods yield precise results but exhibit limited robustness and complexity.Machine learning-based SOC and SOH estimation methods show the best results among all evaluated methods, offering high accuracy under aging and temperature conditions.While these methods may demand substantial computational time, advancements in computing technology have mitigated this issue.The abundance of data enables ML techniques to enhance BMS performance, especially during the offline training phase, without considering chemical reactions and battery models.
Estimating the SOC and SOH remains challenging due to battery behavior variations and complex EV battery architecture.ML's ability to improve using existing data and established algorithms, rather than building accurate theoretical models, has driven its increased adoption.The availability of more battery data and improved computing power further supports data-driven ML applications.
Joint estimation of SOH and SOC is crucial, with capacity-based and resistance-based SOH estimation processes playing a significant role in updating SOC estimation.Researchers categorize ML algorithms for battery state estimation into non-probabilistic and probabilistic methods, with studies emphasizing non-probabilistic algorithms, including neural networks, support vector machines, fuzzy logic, and their derivatives.This section provides a comprehensive overview of ML techniques for SOC and SOH estimation, offering comparisons.The overall workflow can be seen in Figure 6.
When employing machine learning to estimate the SOC and SOH of batteries, various crucial aspects come into play.These aspects, as shown, help to capture pertinent details regarding the battery's behavior and condition.

Neural Network (NN)
Neural network, also known as artificial neural network (ANN), methods are categorized into three main groups as described in reference [70].The first category is the feedforward neural network (FFNN) method, which includes variations like the backpropagation neural network (BPNN), Elman neural network (ENN), radial basis function neural network (RBF-NN), and time delay neural network (TDNN).The second category is deep learning (DL), encompassing deep belief networks (DBNs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs).The last category is the hybrid method, combining two or three approaches.Other methods, such as the extreme learning machine (ELM), can also be considered part of the FFNN category.While the classification in [70] is logical, we will focus on widely used NN methods.
We can observe the basic structure of a feedforward neural network (FFNN), which was the original neural network developed for artificial intelligence.All neural network methods have evolved from the FFNN, making it a popular method for SOC estimation in lithium-ion batteries due to its simplicity, ease of training, and good generalization [71,72].
A neural network comprises interconnected neurons organized into three primary layers.The input layer, with four inputs for example, can represent parameters like voltage, current, temperature, and time in battery state estimation.The output layer has three outputs, typically SOC, SOH, and SOP, in battery state estimation.Hidden layers can create complex nonlinear functions by integrating input data.The number of hidden layers can make it a deep neural network (DNN) if there are more than three hidden layers.Each neuron has an input and an output, with inputs from the previous layer and outputs to the next layer in the middle-hidden layer.The input and output of the input and output layers are the data.In each neuron, it is calculated as the following: where y is the output of this neuron, f is the activation function, and usually a sigmoid function is used,  is the weight coefficient of the input  , while  is the input of this neuron, and b is the bias term. and b are determined based on training on a known dataset by using a perceptron.The value of  is updated according to the following equation until it converges.
←      (11) where α is the learning rate, which is used to control the step size, y is the true output of the training data, and  is the predicted output.This perceptron is obtained based on the gradient descent method.The same idea can be used for complex neural networks to update the parameters based on the chain rule of derivation.
Employing a neural network to generate output from an input is called feedforward.Conversely, the process of adjusting the parameters in a neural network based on the disparity between the output and actual results is known as backpropagation.The welltrained model exhibits a notable likelihood of providing accurate outputs for data with both known inputs and unknown outputs, denoted as its generalization ability.Several enhanced approaches rooted in neural networks have emerged for diverse SOC and SOH prediction challenges.This includes the introduction of fundamental principles for a selection of frequently utilized techniques.

Feedforward Neural Network (FFNN)
A FFNN, also known as an FNN in some of the literature, is distinct from a fuzzy neural network (FNN), which is also abbreviated as an FNN.To prevent confusion, this article refers to the feedforward neural network as FFNN.An NN can handle extensive data in complex nonlinear systems independently of a battery's electrochemical processes, making it a widely used tool for battery state estimation.An FFNN's effectiveness in estimating a battery's SOH has been confirmed using calendar aging data under different degradation conditions [73] and one year of real-time data from a BMS [74].The BPNN [75] was effective at various temperatures.
An NN model [76] estimated battery capacity degradation, while the SOC [77] for both lithium phosphate batteries (LPBs) and lithium-ion batteries was estimated using a two-layer neural network with a tensing activation function, consistently achieving an MSE under 10%.The SOC and SOH were estimated using data-driven FFNNs [73] of a multilayer perceptron (MLP) developed with a charge and discharge dataset collected over ten months.The SOC and SOH were predicted based on the cell voltage, current, storage temperature, aging period, and storage state.One study [78] reduced the input dimension for the FFNN, simplifying the network.Another approach [79] used the sample entropy of discharge voltage as the input for the FFNN to predict the battery's SOH intervals accurately.An FFNN can also be combined with models like the ECM and KF for SOC and SOH estimation.
An RBF-NN, a type of FFNN, employs the radial basis function as an activation function.It can prevent training from converging to local minima [80] and reduce reliance on dataset quality [81], often achieving higher estimation accuracy.An improved RBF-NN in [82] estimated the SOC with a generalization error of less than half of other methods.An improved radial basis function neural network (IRBFNN) [83] was proposed to improve SOH estimation.An RBF can be combined with other models, as seen in a battery model [84,85] used to derive state-space equations for an EKF.The works in [86,87] were similar, with H-infinity and UKF used separately.

Deep Neural Network (DNN)
To enhance neural network predictions, deep neural networks (DNNs) can be employed.For instance, the estimation of the SOC utilized a deep feedforward neural network (DFNN) [88].This model exhibited excellent estimation performance under constant ambient temperature conditions, outperforming models using complex differential equations like the EKF in computational efficiency.To estimate a battery's SOH across different SOC and current conditions [89], a paper employed the SOC to adjust the kernel function within the DNN, making it adaptable to real-world scenarios.A comparison between a DNN, linear regression, SVM, k-NN, and ANN for SOH estimation revealed the DNN's superior performance.However, it is important to note that DNN-based approaches require more computation time and resources.This issue can be alleviated by reducing neural connections [90] and downsizing the dataset [91].

Extreme Learning Machine (ELM)
The FFNN algorithm, which relies on gradient descent, may exhibit slow performance and demand extensive computational resources.The extreme learning machine (ELM) was introduced to address this issue.The ELM is built upon feedforward neural networks [92] and can encompass single or multiple layers of hidden nodes.These nodes serve various purposes such as classification, regression, clustering, sparse approximation, compression, and feature learning.In an ELM, the parameters of these hidden nodes can be configured.These hidden nodes can either be randomly selected and remain unaltered or be passed down without modifications.Typically, the output weights of the hidden nodes are trained in a single step, akin to learning a linear model.As shown in Figure 7, w and b are randomly given and will never be changed; only β is learned in a single step, while these parameters need to be solved and consume a lot of computational resources.Obviously, this greatly improves the solution efficiency of the neural network.However, the ELM has been found to be effective enough in many applications including in the SOC and SOH estimation of batteries [93,94].However, compared to traditional learning methods, estimating the weights of an ELM is much faster and requires less computation.It is still important to admit that the ELM approach will always be less accurate than an FFNN because it has fewer trainable weights.
As mentioned previously, the ELM effectively reduces computational effort with a minor trade-off in accuracy, which is crucial for SOC/SOH estimation in engineering applications.In a study by researchers [95], they combined ELM and Thevenin models to achieve online identification of internal resistance for battery SOH estimation.The ELM, trained offline using a collected dataset, exhibited faster learning than a traditional FFNN.This method is suitable for online implementation.In another study by researchers [96], an online sequential ELM approach was proposed to efficiently utilize old and new data.Initially, a subset of samples determined the output weight of the ELM, and this weight was updated with new samples.This approach offers quick learning, strong generalization, and high accuracy.It was noted that the ELM, compared to an RBF, demonstrated a lower computational load and improved estimation error.

Recurrent Neural Network (RNN)
Long short-term memory (LSTM) is a recurrent neural network (RNN) technique frequently found in research and stands out for its ability to consider time dynamics.This characteristic contributes to its popularity in tasks like speech recognition and SOC/SOH estimation, which are inherently time-dependent [97][98][99][100].Figure 8   Figure 8 shows a simplified vector representation of a single hidden layer neural network.The hidden layer in the recurrent neural network (RNN) considers time, and its activation function is incorporated, representing the hidden layer's connection weight.
During training, the hidden layer adjusts based on the previous instant, primarily associated with short-term memory.However, for long-term memory, LSTM is recommended.To work with the data in the hidden layer, C is introduced to the LSTM, combining the functions of the forgetting gate, input gate, and output gate.This enables long-term memory retention and adjustments during training.LSTM is particularly effective for scenarios where a battery's state is influenced by its usage history.
The characteristics of common neural network (NN) methods can be summarized concisely.Neural networks use activation functions to introduce nonlinearity, making them universal function approximators and allowing them to learn battery degradation patterns.Deep neural networks (DNNs) are effective at modeling complex data with many inputs and can achieve high accuracy, especially with extensive aging data.However, DNNs require significant computational power and large datasets.Convolutional neural networks (CNNs) are commonly employed for image recognition tasks.However, their utility extends to battery state estimation as well.Their architecture, featuring convolutional and pooling layers, excels at automatically discerning key features from raw data.A CNN was initially applied for state of health (SOH) estimation using voltage, current, and charging capacity data from a partial charging cycle.The CNN proved to be a practical choice for online SOH estimation, offering high accuracy, quick testing, and minimal memory usage.Another CNN model [101] was introduced for estimating remaining useful life (RUL), complete capacity, and internal resistance based on a single cycle's input data.This model incorporates two closely connected layers for convolutional feature extraction, utilizing voltage, SOC, and current data.CNNs, with their advantageous traits, are frequently used with other methods for battery state estimation.For instance, to harness the strengths of both a CNN's feature extraction and recurrent neural networks' time-series prediction, researchers [102][103][104] combined a CNN and an RNN by adding a convolutional layer to the LSTM.This optimized network structure fully leverages a CNN's automatic feature extraction while reducing the computational load.

Other NN Variants
While neural networks (NNs) exhibit strong performance in estimating battery life at the beginning, their effectiveness diminishes toward the end of a battery's life cycle.As a result, researchers have strived to enhance NNs and introduce novel approaches.These include the adaptive neural network (ADNN) [105], confidence prediction neural network (CPNN) [106], and adaptive recurrent neural network (ARNN) [107], which is RNNbased.Furthermore, the authors in [108] optimized the recurrent nonlinear autoregressive with exogenous inputs (RNARX) method, a well-known RNN subclass.They combined it with the lightning search algorithm (LSA), an intelligent optimization technique that simplifies SOC estimation under various temperatures and drive cycles.Another model [109] showcased SOC forecasting over 54 real driving cycles using the nonlinear autoregressive with an external input (NARX) technique based on NNs.This model took inputs such as battery pack temperatures, current, and voltage and output the SOC, achieving a low MSE of 9.2727 × 10 −7 .
An NN's versatility extends beyond its state determination; it aids in modeling and complements other methods.For instance, NN models measure battery electrochemical properties using activation functions, enabling SOH calculation based on voltage-capacity slopes with fuzzy logic (FL).Other researchers [110] employed a structured NN with sub-NNs to estimate voltage, RC parameters, and SOH.Additionally, artificial neural networks (ANNs) [111] were combined with an extended Kalman filter (EKF) for SOC estimation at room temperature with an RMSE of 2%.It is important to acknowledge that NNs are computationally intensive.Table 3 summarizes the pros and cons of current NN techniques.It is worth noting that the Adaptive Neuro-Fuzzy Inference System (ANFIS) is another significant NN method, which incorporates fuzzy logic (FL) principles into its design.As neural networks regained attention and became increasingly popular in various research fields, support vector machines (SVMs) [112] emerged as one of the most widely adopted machine learning techniques.The conventional SVM primarily addresses classification problems.To handle regression problems, the support vector regression (SVR) concept was introduced as an extension of SVMs.The fundamental principles are illustrated in a simple two-dimensional linear case that can be seen in Figure 9.
An SVM endeavors to establish hyperplanes in high-dimensional space to differentiate data from different classes.It achieves an optimal separation boundary when maximizing the distance between the hyperplane and the closest data point of any class.The hyperplane simplifies to a straight line in this two-dimensional linear scenario.Notably, option "a" outperforms option "b" by offering a broader range of tolerances.It effectively separates white circles and squares, which may exist outside the sample data, while "b" does not.The support vector, representing the distance between the nearest sample point and the hyperplane, plays a pivotal role in constructing the hyperplane.It is named a "support vector" because it supports the tolerance space from positive and negative directions, enhancing generalization ability.This concept is the basis for support vector machines in machine learning.An SVR shares the core idea with an SVM but differs in the solution direction.An SVM aims to maximize the "distance" to the nearest sample point from the hyperplane, while an SVR strives to minimize the "distance" to the farthest sample point from the hyperplane.An SVM employs kernel functions to transform sample points into a higher-dimensional space for tackling multidimensional or nonlinear complex problems.Take the twodimensional XOR problem depicted in Figure 10.In two dimensions, it is impossible to distinguish between positive and negative signs using a linear line.However, this separation becomes achievable by mapping it to a three-dimensional space with a linear plane.An SVM uses a regularization parameter, mitigates overfitting, and demonstrates robust generalization abilities.It adeptly handles nonlinear data, even in high dimensions, thanks to kernel functions.However, selecting the appropriate kernel function is not a straightforward task.Additionally, an SVM may not be suitable for large datasets due to the need to store support vectors in memory, resulting in increased training time and computational demands.In contrast, neural networks (NNs) are better suited for large volumes of data, while SVMs may outperform when the dataset is relatively small.
An SVM overcomes an NN's drawbacks like a local extremum, complex network structures, extensive sample training, and high computational costs.It excels with finite small sample datasets and enhances algorithm generalization.Therefore, an SVM, including support vector regression (SVR), finds widespread use in SOC/SOH estimation [113].In an SVM, diagnosis relies on functions defined in the input space, while learning involves parameter calculation.For SOH estimation, a two-layer structure SVM [114] was employed.The introduced cost function constitutes the core of this method.Other researchers [115] developed a two-level SVM for RUL estimation, initially providing a rough estimate using classification, then refining it through regression.Researchers are actively exploring improved algorithms and fusion with other methods to enhance SVM applicability.Four primary avenues of research to enhance SVM estimating performance include the following.
An initial estimation of battery internal resistance [116] and terminal voltage using an SVM can be used to comprehend battery aging under different conditions.These variables exhibit a linear relationship with battery capacity.Hence, effective features derived directly from partial constant current charging or discharging voltage curves [117,118] are employed as SVM model inputs or features.
The most common approach involves modifying the kernel function to enhance model performance.For example, the Gaussian kernel's double deviation parameter significantly influences an output curve's properties.This enhanced kernel function can adapt to curve shapes with various curvatures, preventing over-and underfitting.Other approaches [119] decompose the kernel function into two terms, representing overall degradation trends and small SOH fluctuations separately, thereby improving estimation accuracy.
Combining SVM with other methods as an auxiliary approach is third method.For instance, a robust and real-time SOC and SOH estimation method was developed.It utilized SVM estimation as the initial capacity of a Kalman filter (KF) and updated the current capacity.An SVR model [120] combined with possibilistic clustering classification (PCC) was proposed to predict the RUL under different operating states.Researchers [121] estimated the SOC based on a method that combined an optimized SVR and a double search optimization process, which was found to be both more accurate and easier to use than an artificial neural network (ANN).
The least squares support vector machine (LS-SVM), a significant SVM variant, is considered the fourth improvement method.A traditional SVM requires extensive computation to solve quadratic programming problems with large datasets, making it challenging for real-world applications.However, the LS-SVM faces non-sparseness issues, leading to a complex model with poor generalization.A fixed-size LS-SVM algorithm [122] was proposed to address this, showing substantial improvement in calculation speed over the traditional SVM.For regression problems, the least squares support vector regression (LS-SVR) algorithm was employed to estimate the SOH.
Using an SVM, the estimation error is comparable to the unscented Kalman filter (UKF).An SVM [123] was applied to data gathered using electrochemical impedance spectroscopy (EIS), which considers capacity fading and internal resistance.The error of the SVM was similar, approximately 2%.Like other machine learning methods, the SVM requires many calculations during training, can only perform offline prediction, and has limited practical applicability.To address this, several improvements have been proposed.By dynamically updating the vectors in an SVM, it can make online predictions.Application to NASA data demonstrated that the average absolute estimation error for the SOH was 2% to 3%.In [124], an alternative improvement was adopted.First, offline training was performed, and the obtained model was used in the initial estimation stages.Then, SVMs were updated based on the current online data.This reduced the absolute error to 1%.
Previous studies indicate that SVM-based methods generally offer high estimation precision.However, achieving this precision comes at the cost of high computational complexity.For instance, other researchers [125] compared support vector regression (SVR) and the recent gradient boost algorithm extreme gradient boosting (XGBoost) for SOC estimation.They pointed out that to obtain precise predictions from SVR, the appropriate kernel function must be configured with workable parameter settings.Even with two SVR parameters tuned, the root mean square error (RMSE) is still significant when compared to XGBoost.Consequently, SVR tends to be time-and energy-intensive, resulting in a low level of practical applications.

Fuzzy Logic Method (FL)
Fuzzy logic (FL) can be employed in conjunction with various machine learning techniques to enhance their applicability, such as the sigmoid function in neural networks like the adaptive neuro-fuzzy inference system (ANFIS) method.As a result, FL holds significant importance and is often discussed independently.Fuzzification, the fuzzy rule base, the inference engine, and defuzzification are integral components of the FL interface system structure [126].FFL was initially applied for battery state estimation using EIS data from an electrochemical model.It estimated both the SOC and SOH by combining an ARMA, NN, and FL.Researchers [127] used measured data from a life cycle test on 95 Liion batteries for SOH estimation, with errors ranging from 1.4% to 9.2% [128].
Fuzzy identification divided the SOH into four states, healthy, acceptable, weak, and bad, with an estimation error of less than 5%.FL models nonlinear systems without needing mathematical models or ECMs.It considers input variables like ambient temperature, current, and a battery's terminal voltage for accurate state estimation.
FL's flexibility allows for specifying output membership functions for the SOH, like healthy, acceptable, and unhealthy.An online SOH prediction method [129] calculates crucial parameters under specific conditions and predicts the SOH.FL, in combination with other methods like ANFIS, can be very useful.
ANFIS, a neuro-fuzzy model, has five layers and minimal error in SOC estimation [130].It is particularly accurate in vehicle simulators, making inferences under uncertainty.Combining FL and SVM enhances SOC estimation accuracy, while fusing fuzzy adaptive factors and Kalman filters improves robustness and accuracy [131].FL's computational complexity is relatively high but supports both data processing and offline parameter calculations.On average, FL's reported estimation errors range from 1.4% to 10% [132].

Genetic Algorithm (GA) and Particle Swarm Optimization (PSO)
Evolutionary algorithms, including the genetic algorithm (GA) and particle swarm optimization (PSO), play a crucial role in machine learning, particularly in SOC/SOH estimation.The GA is widely applied in engineering optimization.It leverages natural selection and genetic mechanisms to find optimal solutions.It offers advantages like direct manipulation of structural objects, inherent parallelism, global search capabilities, probabilistic search methods, and adaptive search direction adjustment.
The essential workflow of a GA can be seen in Figure 11.Initially, the population to be optimized is encoded into genetic chromosome genes, forming the first generation, with each chromosome string representing an individual.These chromosomes can undergo crossover and mutation, creating new individuals.The best of these new individuals forms the second generation, and this process continues until the population stabilizes, leading to the best possible outcome.PSO, inspired by bird behavior, enables birds to find food collectively.They communicate their distances to food and help each other locate the ideal solution, successfully applied in engineering problems.Another paper [133] demonstrated the use of genetic algorithms (GAs) in identifying battery model parameters in state of charge (SOC) estimation.A novel SOC estimation method employing a GA and gray model (GM) was developed for improved repeatability and accuracy [134].PSO has been used for model parameter identification in battery state estimation [135,136] and often combined with machine learning (ML) techniques.PSO-LSTM optimized LSTM hyperparameters, reducing the error to 0.5%.PSO is also combined with support vector regression (SVR) [137], outperforming SVM in SOC estimation.PSO-SVR is robust, even with noisy training data [138].In SOC/SOH estimation, GA and PSO are commonly used for parameter optimization in ML methods like neural networks (NNs) and support vector machines (SVMs).

K-Nearest Neighbor (KNN)
The KNN is a simple classification method [139] and assigns new sample points to the same class as their closest training samples in a feature It is a lazy learning approach, meaning it relies heavily on training cases without involving abstraction steps.In regression, a KNN follows a similar principle.It identifies the K closest points (x1, x2, …, xk) to the new point based on distance and calculates their weighted average, with closer points having greater influence.This average becomes the value for the new point.The geometric representation of this fundamental principle can be seen in Figure 12.KNN regression, a simple and practical method, was utilized for battery state estimation.For example, five key parameters from constant charge curves were used, which included the initial charge voltage, CC charge capacity, CV charge capacity, final charge voltage, and final charge current.These parameters served as inputs for the KNN regression model.A model for estimating the remaining useful life (RUL) [140] was also developed using a KNN.It calculated the RUL by taking a weighted average of similar cells' useful lives with comparable degradation rates.
While a KNN is straightforward, efficient, and provides reasonably accurate results, it has not received as much attention as neural networks (NNs) and support vector machines (SVMs).This can be attributed to the requirement of understanding battery degradation across its entire range because a KNN model cannot extrapolate beyond observed data.Additionally, a KNN may not be resource-efficient, particularly with large datasets or high-dimensional features, as it needs to store all data and locate neighbor nodes.

Transfer Learning
Transfer learning is regarded as a powerful way to enhance conventional machine learning methods.One paper discussed the data-driven solutions, particularly transfer learning [141], to making more accurate predictions about battery parameters, such as the state of charge (SOC), state of health (SOH), and remaining useful life (RUL).Transfer learning for battery parameter estimation has its drawbacks.For example, pre-trained models might not directly apply to battery parameter estimation due to differences in their target domain, potentially leading to less accurate results.Pre-trained models are often complex, requiring significant computational resources and memory, making them less practical for resource-constrained battery applications.Furthermore, learning models can be less interpretable, making it challenging to understand the reasons behind their predictions, which is crucial in battery-related applications.Finetuning large pre-trained models can be time-consuming, especially when dealing with limited data, delaying the deployment of battery parameter estimation systems.Thus, there is a need for careful consideration and domain-specific adaptations when applying transfer learning to battery parameter estimation tasks.
Another study introduced a Gaussian mixture model [142] for lifetime clustering to assess the effectiveness of proposed health indicators.Additionally, an automatic reference battery selection method was suggested to identify the most relevant candidates for model training.A long short-term memory network with probabilistic regression was then employed, providing mean values and confidence intervals through Bayesian inference, showing relative reductions of 43.7% and 45.3% compared to conventional methods.
A semi-supervised self-learning method [143] was proposed to handle the uncertainty for probabilistic predictions.The results achieved a mean prediction error of below 4.1% for long lifespan batteries and 5.21% for short lifespan batteries, providing possible solutions for some real-world scenarios.Another paper [144] presents a method for accurate battery pack health prognosis using universal health indicators and model fusion.These indicators are adaptable to various battery types, cell numbers, and aging statuses that were further extended to dynamic working conditions using deep learning frameworks.The model's performance improved through migration and fusion, making it suitable for different battery packs and working conditions with high accuracy and reliability of mean absolute errors of less than 2.5% and root mean square errors less than 3.1% in various applications.
Lastly, researchers [145,146] introduced a novel method for predicting remaining useful life involving optimizing health indicators and applying online model correction with transfer learning.The process uses Gaussian regression to optimize health indicator thresholds for end-of-life determination.The combination of transfer learning and gated recurrent neural networks was designed to predict remaining useful life based on the optimized health indicators, enabling online applications.A self-correction strategy was used to improve prediction performance during operating cycles, showing the method's accuracy and predicting remaining useful life with an error of less than five cycles.

Hybrid Method (HM)
Hybrid methods combine multiple techniques for better accuracy and robust results, mainly focusing on integrating various ML methods.These methods, known as modellevel ensemble learning (EL), merge outputs from different base learners.They often use a weighted average of these base learners to produce the final output.For example, in [147], logic and Gaussian process regression models were combined using the mean method for SOH estimation.In [148], base learners like the ANN, SVM, gray model, and ARIMA were combined with a time-varying weight assignment approach.Probabilistic integration [149] is used to enhance base learners, making the output based on their probability distribution.For example, eight CNN models [76] were pre-trained with aging data from battery cells, using transfer learning and ensemble methods.EL compensates for base learner performance flaws, eliminating the need for precise parameter optimization.Instead, it allows for random initialization of model parameters and enhances accuracy.

Comparison of Several Types of Machine Learning
Several ML-based methods for SOC and SOH (including the RUL) estimation are compared in Table 4. Numerous relevant studies exist, but the presented sections offer insight into the current research status.Notably, variations in these methods show significant differences in practice.We consolidated key information and ultimate accuracy.These studies used different batteries, datasets, and assessment criteria.Therefore, precise comparative conclusions were challenging due to different dataset, battery, testing conditions, and criteria used.It was evident that the RBF exhibited the highest error, whereas the FFNN and RNN were presently the most promising techniques for estimating both the SOC and SOH.The ELM, SVM, and transfer learning also showed promise but warranted additional investigation through more iterations.An SVM parameter was optimized using an algorithm of particle swarm optimization, and the performance of the prediction model was assessed using cross-validation.

SOC
Absolute error: 1.5% SVR [117] 2020 Part of the differential temperature curves in a voltage range, which were found to be strongly related to the SOH, was adopted to establish a relationship with the SOH using SVR.Evolving quantum fuzzy neural network [167] 2023 The capacity fading of the cell extracted from the incomplete discharge conditions in dynamic driving scenarios was predicted.
Voltage, Current, Capacity SOC Relative error: <5% LSTM with transfer learning [168] 2022 Early aging data of a battery using transfer learning was used to improve the SOH estimation.

Challenges and Future Works
The existing literature reveals numerous studies on SOC and SOH estimation, each with merits and drawbacks.Therefore, thoroughly analyzing these methods is essential to determining suitable applications.ML-based approaches have become popular despite minimal knowledge of a battery's internal workings and reliance on abundant data and powerful processors for SOC estimation.However, imbalanced data can lead to overfitting and underfitting issues.Data quality and quantity significantly impact the ML method's accuracy and performance.Precision in data acquisition, reading, transmission, and analysis is crucial, as errors can lead to poor data quality.Effective techniques must be developed to eliminate inaccurate or biased data.Small sample problems can affect model generalizability and performance.Clustering algorithms, reasoning models, crowdsourcing database technology, and transfer learning may address these issues.
Current estimation methods grapple with robustness concerns, particularly considering complex battery cell behavior.Prognostics are essential as many battery cell parameters are not directly measurable, influenced by environmental factors.Online and offline estimation challenges are intertwined.Online algorithms work with new data during system operation, while offline methods require all data and model parameters before training.Most ML-based methods lean toward offline estimation, potentially leading to challenges in real-time settings.
Spontaneous combustion of EV batteries is a reported issue.Inconsistencies in battery pack design, overcharging, and thermal runaway contribute to this problem.The weakest cell governs pack performance, and low-capacity cells face overcharging and over-discharging, reducing battery life and raising combustion risks.Complexities in EV battery packs pose challenges for BMSs in accurate SOC/SOH estimation.
Some ML algorithms lack model interpretability.Improved interpretability is crucial, as it provides explicit physical meaning to estimation models.Hyperparameter selection through trial and error can lead to computation complexities and unsatisfactory SOC and SOH results.Early-cycle accurate battery life prediction can enhance cell development and effectiveness given initial cycle anomalies and subtle deterioration.One significant disadvantage of transfer learning is the potential for overfitting, where a pre-trained model is finetuned on a new task with limited data.It could become too specialized and perform poorly on unseen data, reducing generalization and accuracy.In addition, domain shifts can occur due to variations in data distribution, which can be challenging to address as new batteries emerge from the market.

Conclusions
The paper discussed the state of charge (SOC) and state of health (SOH) estimation methods, including direct measurement and model-based approaches.Machine learning (ML) techniques, such as neural networks and deep learning, were introduced as effective tools for SOC and SOH prediction, leveraging data to enhance accuracy.Various ML algorithms were discussed for battery state estimation, offering promise for improving BMS performance in EV batteries.The paper also highlighted ongoing research involving ML applications for SOC and SOH prediction, shedding light on current challenges and prospective developments.
Our literature review showed that data-driven strategies are preferable when substantial data are available, surpassing model-based approaches.Neural networks (NNs) have gained significant attention in recent years, followed by support vector machines (SVMs), fuzzy logic (FL), the k-nearest neighbors (k-NN) algorithm, and genetic algorithms (GAs).Studies suggest that NN methods outperform SVMs in specific conditions, while optimized support vector regression (SVR) excels in SOC estimation accuracy.However, NNs exhibit lower mean absolute error (MSE) and handle nonlinear data effectively.An SVM proves practical with smaller datasets but poses challenges in human intervention, learning progress delays, and scalability limitations.
Overall, ML-based methods for SOC and SOH estimation remain a primary research focus for BMSs, with NNs and SVMs being the primary techniques.Conventional methods like the equivalent circuit model (ECM), electrochemical model (EM), and FL continue to validate findings.The challenge lies in translating research outcomes into practical applications.Further real-time estimation research will require different system-level configurations, battery physics-informed learning, intelligent thermal management, and geographically sensitive learning.Furthermore, sometimes human-in-the-loop can be helpful to increase the robustness of a battery management system.Explainable artificial intelligence (XAI) is making waves in battery technology.XAI can explain its decisions or outputs in a way that humans can understand.

Figure 1 .
Figure 1.Structure and functions of a BMS.

Figure 2 .
Figure 2. Capacity degradation and knee-point obtained using the Bacon-Watts model.

Figure 4 .
Figure 4. Three typical ECMs of a Li-ion battery.

Figure 5 .
Figure 5. Categories and roles of machine learning.

Figure 7 .
Figure 7. Basic principles of an ELM.

Figure 8 .
Figure 8. Basic principles of an RNN and LSTM.

Figure 9 .
Figure 9. Geometric representations of an SVM and SVR.

Generation 2 Figure 11 .
Figure 11.Basic workflow of a GA.

Table 1 .
Comparison of advantages and disadvantages of various health indicators (HIs).

Table 2 .
Five tribes of machine learning.
Machine learning necessitates a substantial volume of curated data, particularly for estimating SOC and SOH through experimentation.Leveraging these datasets for methodological investigations without requiring new experiments is a valuable approach.Examples of assessable datasets can be obtained from the following websites.
[67]ttery voltage serves as a foundational indicator for estimating both the SOC and SOH.It mirrors the stored energy in the battery and fluctuates with SOC changes, which can be directly measured or inferred from other sensor data.Thenumber of charge-discharge cycles influences battery health, with more cycles leading to capacity fading and a diminished SOH.Therefore, the cycle count as a feature, aids in SOH prediction.Center for Advanced Life Cycle Engineering (CALCE)[65]: The CALCE at the University of Maryland provides battery data for research purposes, covering battery state estimation, remaining useful life prediction, accelerated battery degradation modeling, and reliability analysis.NASAdatasets[66]:ThePrognosticCenter of Excellence (PCoE) at NASA Ames maintains the Prognostic Data Repository, which contains various battery datasets.Among these are the PCoE Battery Dataset, featuring cycling data for 3,418,650 cells with a 2 Ah capacity cycled to 70% or 80% of initial capacity at different temperatures using a custom-built battery tester.Additionally, there is the Randomized Battery Usage Dataset, providing cycling data for commercial cells, and the HIRF Battery Dataset, which includes data from testing battery packs for a small aircraft.Also, the Small Satellite Power Simulation Dataset includes data from testing battery packs for a small satellite.BatteryArchive.org[67]:It is an open-access battery dataset repository built on opensource software.Its goal is to standardize battery testing and reporting by offering a user-friendly interface for comparing battery datasets. An article published by Springer stresses the importance of standardizing battery testing and reporting through open databases [68] to enhance overall field research and advance toward more efficient and reliable battery systems.However, the reliability of these datasets varies depending on the specific research context and use case.Hence, it is crucial to carefully assess datasets based on their requirements and objectives. Carnegie Mellon University Kilthub Dataset [69]: A dataset of lithium-ion battery discharge data including voltage, temperature, and current from electric vertical takeoff and landing (eVTOL) duty cycles (https://kilthub.cmu.edu/articles/dataset/eV-TOL_Battery_Dataset/14226830(accessed on 21 May 2024)).
illustrates the fundamental concepts of an RNN and LSTM in a way that differs slightly.

Table 3 .
Comparison of NN-based SOC and SOH estimation methods.

Table 4 .
Comparison of various ML methods for the SOC and SOH (including RUL) estimation.