A Review on Battery Model-Based and Data-Driven Methods for Battery Management Systems

Battery state estimation is fundamental to battery management systems (BMSs). An accurate model is needed to describe the dynamic behavior of the battery and to evaluate fundamental quantities such as the state of charge (SOC) or the state of health (SOH). This paper presents an overview of the most commonly used battery models, namely equivalent electrical circuit models and data-driven ones, discussing the importance of battery modeling and the various approaches used to model lithium batteries. In particular, it provides a detailed analysis of the electrical circuit models commonly used for lithium batteries, including equivalent circuit and thermal models. Furthermore, a comprehensive overview of data-driven approaches is presented. The advantages and limitations of each type of model are discussed. Finally, the paper concludes with a discussion of current research trends and future directions in the field of battery modeling.


Introduction
Lithium batteries are the most promising technology for electric vehicles (EVs) and energy storage systems (ESSs). The main goals for each battery-powered system are to extend the battery's lifetime and improve its reliability. The battery management system (BMS) is a pivotal component in the real-time monitoring of battery conditions. Since quantities such as the remaining charge cannot be measured directly, they must be estimated using model-based estimation algorithms. The battery models presented in the literature fall into two main categories: model-based methods, such as equivalent electrical circuits, and data-driven methods, such as neural networks and support vector machines. Model-based methods, such as equivalent circuit models (ECMs), are the most widely used to study the dynamics of the battery [1][2][3][4][5][6][7]. ECMs represent the complex electrochemical processes occurring within a battery as a simplified circuit with various components, so that engineers and researchers can analyze and simulate the battery's performance under different conditions [8][9][10][11]. The equivalent circuit model typically consists of idealized components such as resistors, capacitors, and voltage sources, representing the various physical phenomena occurring within the battery [12][13][14][15][16]. These components are interconnected to mimic the battery's internal resistance, capacitance, and voltage dynamics. This modeling technique makes it possible to study the battery's response to different loads, charging and discharging rates, temperature variations, and other factors [17,18]. It enables the estimation of essential battery parameters such as the state of charge (SOC), state of health (SOH), internal resistance, and capacity [19][20][21][22][23].
Battery modeling through equivalent electrical circuits is crucial in various applications, including electric vehicles, renewable energy systems, portable electronics, and grid energy storage [24][25][26][27][28][29]. It helps optimize battery designs, develop battery management systems, and predict battery performance over time.
However, it is essential to note that the model's accuracy depends on the chosen circuit topology and the availability of accurate battery characterization data. Actual batteries are complex systems with non-linear behavior, and creating an accurate equivalent circuit model requires careful consideration of various factors.
For this reason, data-driven methods have become increasingly popular in battery modeling thanks to their ability to describe complex non-linear phenomena. These models use machine learning techniques to learn the relationship between input variables (such as state of charge, temperature, and current) and output variables (such as voltage and capacity) from a large dataset of battery measurements.
This overview aims to present a detailed analysis of the most used models in the literature. The paper is organized as follows: Section 2 describes the most commonly used ECMs, from the simplest to the most complex. Kinetic and thermal models are presented in Section 3. Section 4 illustrates the data-driven approaches. Finally, the results are discussed and conclusions are drawn.

Equivalent Electrical Circuit Models
To adequately simulate the battery storage of a complex system such as the one in Figure 1, ensuring adequate accuracy at minimum computational cost, it is possible to refer to electrochemical, electrical, or "data-driven" models. The electrochemical ones require considerable processing time to compute the information associated with chemical reactions at the microscopic level [30,31]. On the other hand, analytical-mathematical models exploit empirical equations to rapidly estimate only efficiency and residual charge, without allowing the evaluation of the internal parameters or of the relationship between voltage and current at the terminals. Therefore, electrical models, expressed through voltage or current generators, resistors, and capacitors, are the most versatile for analyzing a battery's static and dynamic behavior [32]. It should be noted that a model often used in battery energy storage systems (BESSs) to relate the battery power to the state of charge [33] is the so-called generic model, which expresses the SOC as a function of the energy stored in the battery relative to its maximum capacity. The SOC increases or decreases depending on whether the battery is charging or discharging, as expressed by Equations (1) and (2), where the power $P_{BESS}$ in Equation (1) indicates the charge power (>0), while in Equation (2) it indicates the discharge power (<0). Moreover, $\eta_C$ and $\eta_D$ indicate the charging and discharging efficiencies, respectively, and depend on the particular technology [34,35] and on the energy capacity of the system ($EC_{BESS}$). This model allows evaluating the relation $P_{BESS} = f(SOC)$ without specifying the operating context (voltage, current) in which this occurs. Therefore, an assessment of the dynamic behavior of the battery as a function of the SOC is important to better understand its operation. In mathematical terms, this is equivalent to saying that the voltage at the terminals can be expressed as a function $f$ of the SOC, current, and temperature (Equation (3)), in which the function $f$ governs the dynamics of the battery at a macroscopic level (battery pack).
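The generic model described above can be illustrated with a minimal Python sketch. Since the original Equations (1) and (2) are not reproduced here, the update rule below is an assumed, commonly used energy-balance form (efficiency multiplies the charge power and divides the discharge power); the function name `update_soc`, the default efficiencies, and the clamping to [0, 1] are illustrative choices, not taken from the paper.

```python
def update_soc(soc, p_bess_kw, dt_h, ec_bess_kwh, eta_c=0.95, eta_d=0.95):
    """Advance the SOC by one time step of length dt_h (hours).

    Assumed form (not the paper's exact equations):
      charge    (P > 0): SOC += eta_c * P * dt / EC
      discharge (P < 0): SOC += P * dt / (eta_d * EC)
    """
    if p_bess_kw >= 0:
        delta = eta_c * p_bess_kw * dt_h / ec_bess_kwh
    else:
        delta = p_bess_kw * dt_h / (eta_d * ec_bess_kwh)
    # Keep the result inside the physically meaningful range.
    return min(1.0, max(0.0, soc + delta))
```

As the text points out, such a model links power and SOC but says nothing about terminal voltage or current, which is why circuit models are introduced next.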

Simple Battery Model
The most straightforward version in the literature is the simple battery model [36,37], which allows a quantitative study of battery behavior without investigating the internal electrochemical processes in depth. This equivalent circuit model (ECM) consists of an ideal voltage generator, the OCV, and an internal series resistance, $R_{int}$, both of which generally depend on the SOC and temperature.
The circuit in Figure 2 is described by Equation (4). The open-circuit voltage (OCV) represents the voltage at the terminals under no-load (equilibrium) conditions. It has a non-linear relationship with the SOC, while $R_{int}$ determines the instantaneous response of the battery and limits the internal current flow. As described in [38,39], it is influenced by two main terms: the electronic resistance, depending on the resistivity of the materials, and the ionic resistance, which relates the current flow to the internal electrochemical processes (ion mobility, conductivity of the electrolyte, and effective surface area of the electrodes). Moreover, since this internal resistance tends to increase with degradation, it is related to the SOH and allows the degradation mechanism to be investigated.
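A minimal sketch of the simple battery model follows, assuming the usual form $V = OCV(SOC) - R_{int} \cdot I$ with discharge current taken as positive. The OCV-SOC table, the value of `r_int_ohm`, and the function names are hypothetical placeholders for illustration only; a real table would come from low-rate characterization tests.

```python
from bisect import bisect_right

# Hypothetical OCV-SOC lookup table for a single cell (illustrative values).
SOC_PTS = [0.0, 0.1, 0.5, 0.9, 1.0]
OCV_PTS = [3.0, 3.4, 3.7, 4.0, 4.2]

def ocv(soc):
    """Open-circuit voltage by linear interpolation of the table."""
    soc = min(1.0, max(0.0, soc))
    j = max(1, min(bisect_right(SOC_PTS, soc), len(SOC_PTS) - 1))
    x0, x1 = SOC_PTS[j - 1], SOC_PTS[j]
    y0, y1 = OCV_PTS[j - 1], OCV_PTS[j]
    return y0 + (y1 - y0) * (soc - x0) / (x1 - x0)

def terminal_voltage(soc, current_a, r_int_ohm=0.05):
    """Terminal voltage of the simple model (assumed sign: I > 0 on discharge)."""
    return ocv(soc) - r_int_ohm * current_a
```

Because the model has no dynamic element, the voltage responds instantaneously to any current change, which is the limitation the RC networks of the following sections address.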

Equivalent Electrical Circuit-1RC
The circuit of Figure 3 allows the study of the dynamic and transient behavior: a parallel RC network is added to model the transient response caused by the flow of lithium ions in the electrolyte and in the negative electrode, where $R_1$ and $C_1$ constitute the polarization resistance and the polarization capacitance, respectively. These terms describe how the voltage varies as a function of the current. These parameters, as well as the OCV and $R_0$, are functions of the SOC and the battery temperature [40]. The dynamic behavior of the voltage is obtained by applying Kirchhoff's voltage law (KVL) to the circuit in Figure 3, which yields Equation (5). There, $v_{RC}$ can be obtained by applying Ohm's law to the resistor and the characteristic equation of the capacitor, reported in Equations (6) and (7), respectively. Given the topology of the circuit, at each instant of time the voltage satisfies $v_{RC} = v_{C1} = v_{R1}$. Applying Kirchhoff's current law, the battery current can be expressed as in Equation (8), which is valid at each instant of time and can be rewritten, taking into account Equations (6) and (7), as Equation (9). Given the considerations above, this becomes Equation (10). In addition, using Equation (10) together with Equations (6) and (7), the voltage across the RC network can be expressed as in Equation (11). Solving the differential Equation (11) and inserting the result into Equation (5) allows the investigation of the dynamic behavior of the battery (diffusive phenomena between the electrodes) over time, i.e., of the response to a current excitation through the time constant $\tau_1 = R_1 C_1$.
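The chain of relations described above can be written out as follows. This is a reconstruction from the verbal description, not a copy of the original Equations (5)-(11): the symbol $v$ for the terminal voltage and the sign convention (current $i$ positive on discharge) are assumptions.

```latex
% Assumed convention: i > 0 on discharge, v = terminal voltage.
\begin{align}
v &= OCV - R_0\, i - v_{RC} \tag{5} \\
v_{R1} &= R_1\, i_{R1} \tag{6} \\
i_{C1} &= C_1 \frac{dv_{C1}}{dt} \tag{7} \\
i &= i_{R1} + i_{C1} \tag{8} \\
i &= \frac{v_{R1}}{R_1} + C_1 \frac{dv_{C1}}{dt} \tag{9} \\
i &= \frac{v_{RC}}{R_1} + C_1 \frac{dv_{RC}}{dt} \tag{10} \\
\frac{dv_{RC}}{dt} &= -\frac{v_{RC}}{R_1 C_1} + \frac{i}{C_1} \tag{11}
\end{align}
% For a constant current I applied at t = 0 with v_{RC}(0) = 0:
\begin{equation*}
v_{RC}(t) = R_1 I \left(1 - e^{-t/\tau_1}\right), \qquad \tau_1 = R_1 C_1 .
\end{equation*}
```

Under this step excitation the RC voltage settles to $R_1 I$ after a few time constants, so the total steady-state drop is $(R_0 + R_1) I$.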

Equivalent Electrical Circuit-nRC
The previous model can be generalized to n RC networks (Figure 4) to introduce more time constants and, therefore, analyze the behavior of the ESS in greater detail. However, the added complexity increases the computational cost, since the value of each component of the equivalent circuit must be extracted by fitting the experimental curves, which limits its use in real-time applications. The 2RC model of Figure 4 offers an excellent compromise between precision and computational cost. It has two distinct time constants, $\tau_1 = R_1 C_1$ and $\tau_2 = R_2 C_2$, relating to two phases of the voltage transient, a faster one and a slower one, respectively. From a physical point of view, as mentioned in the previous section, RC networks are associated with ion diffusion in the electrolyte and in the porous active elements (electrodes), with charge transfer, and with the electric double layer (EDL) effect [42]. The EDL is a region, established between an electric conductor and an ionic one (in this case, the electrolyte) on account of local charge inhomogeneities, in which the electrochemical phenomena occur. It mainly influences the electronic flow, so that during charging the cations present at the cathode-electrolyte interface are removed, leaving much solvent available and thus amplifying electrolyte decomposition [40,43]. Analyzing the circuit in Figure 5, it is clear that Equation (5) is modified by adding the voltage term due to the second RC network. As in the previous section, the corresponding relations can then be obtained for this circuit. Figure 6 shows in more detail the behavior of the circuit in Figure 5 in response to a generic current variation (a pulse, in the specific case proposed). The resistance $R_0$ represents the instantaneous voltage contribution in the transition between no-load and loaded conditions. It can be evaluated, using Ohm's law, from the ohmic drop following the previous cycling phase (charge or discharge). The resistance is obtained as the ratio between the voltage step, i.e., the difference between the end-of-charge (or end-of-discharge) voltage and the voltage at the first instant after the curve reverses (the $\Delta V$ variation in the figure), and the current involved in the process (which must be doubled if the same current is used in the transition between phases). However, this parameter depends non-linearly on temperature and SOC. The parallel network $R_1$, $C_1$ is associated with the fast time constant and describes the behavior of the battery during short transients, being associated with the electrochemical polarization due to charge transfer [41,45]. Similarly, the $R_2$, $C_2$ network is associated with longer transients (with slow time constant $\tau_2 = R_2 C_2$) linked to concentration polarization (changes in the concentration of the electrolyte due to the electronic flow at the interface with the electrodes); it therefore depends on the formation of a concentration gradient during ion diffusion or the relaxation phase [41,46-48]. For these reasons, this model is also called the double polarization (DP) model, and it allows the tracking of the internal electrochemical phenomena responsible for the non-linear relationship between terminal voltage and residual charge. In addition, the resistances and capacitances of the two RC networks also depend on the state of charge and the temperature, as will be detailed below, with $C_1$ and $C_2$ being necessary to characterize the dynamic response during the transfer of energy (and therefore power) between the battery and the mains/load. Furthermore, the dependencies on the state of charge and the temperature are contained directly in the model.
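The pulse response and the $R_0$ estimate discussed above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' implementation: the OCV and all circuit parameters are held constant over the pulse (reasonable only for short pulses at nearly constant SOC and temperature), the current is piecewise constant and positive on discharge, and the function names and numerical values are hypothetical.

```python
import math

def simulate_2rc(currents, dt, ocv, r0, r1, c1, r2, c2):
    """Terminal voltage of a 2RC model for a piecewise-constant current.

    Each RC voltage follows dv/dt = -v/(R*C) + i/C, integrated exactly
    over one step of length dt during which the current is constant.
    """
    a1 = math.exp(-dt / (r1 * c1))  # decay factor of the fast network
    a2 = math.exp(-dt / (r2 * c2))  # decay factor of the slow network
    v1 = v2 = 0.0                   # cell assumed at rest initially
    voltages = []
    for i in currents:
        voltages.append(ocv - r0 * i - v1 - v2)
        v1 = a1 * v1 + r1 * (1.0 - a1) * i
        v2 = a2 * v2 + r2 * (1.0 - a2) * i
    return voltages

def estimate_r0(voltages, currents, k):
    """Ohmic resistance from the instantaneous voltage step at sample k."""
    return (voltages[k - 1] - voltages[k]) / (currents[k] - currents[k - 1])
```

Applied to a rest-pulse-rest profile, the first sample after the current step isolates the ohmic drop, while the subsequent curvature carries the information on the two time constants, which would typically be identified by curve fitting.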

General Non-Linear Model
The 1RC and 2RC models presented can be equipped with an additional bulk capacitor placed in series with the OCV, giving rise to the PNGV (Partnership for a New Generation of Vehicles) and GNL (general non-linear) models, respectively. This additional element models the variations of the OCV during load application and reflects the capacity of the storage system [43,49-52] (Figure 7). These dependencies will be integrated into the 2RC model through an accurate identification of the OCV, obtained with dedicated discharges at low C-rates, and a precise specification of the capacity exhibited by the cell as a function of the discharge power and the operating temperature.

RC Equivalent Model
This model uses two capacitors and three resistors, as shown in Figure 8; note that the ideal voltage generator associated with the open-circuit voltage (OCV) is absent. In detail [41,47], the capacitor $C_b$, of modest value, models the surface effects of the storage and the double layer (EDL). In contrast, $C_c$ represents the battery's energy capacity (stored charge) and has a much higher value. The resistances $R_t$, $R_e$, and $R_c$ represent, respectively, the losses associated with the terminal section (interfacing with the external load), those associated with the final section, and the equivalent series resistance (ESR) of the capacitor, which describes the Joule losses. Following the conventions of Figure 8, the model is described by a set of coupled equations [40,41,47], where Equation (17) represents the transfer function relating the output voltage to the current source. Although this model has the same number of passive components as the DP model, the latter remains superior. These comparative evaluations further support the choice of the 2RC, or double polarization, model.
Comparing the 1RC and 2RC dynamic models, the 2RC model provides a more accurate representation of battery behavior, particularly in terms of voltage and capacity estimation. The additional resistor-capacitor element in the 2RC model allows for better modeling of the battery's internal resistance and polarization effects, leading to improved accuracy in predicting battery performance.
When comparing the PNGV and GNL models with the RC equivalent model, the RC equivalent model provides a simpler and more computationally efficient approach to battery modeling. However, the PNGV and GNL models offer more detailed representations of battery behavior, particularly regarding thermal and aging effects. As such, the choice of model depends on the specific application and the level of detail required for accurate battery management.
Overall, while each model has strengths and weaknesses, using electrical circuit models for lithium batteries is crucial for effective battery management and optimization.

State of Charge and Temperature Evaluation
The state of charge and the operating temperature strongly influence the models presented in the previous sections. Therefore, to accurately describe the storage system, it is necessary to introduce a model estimating the SOC and a thermal model evaluating the temperature profile.

Thermal Model
The temperature directly influences the performance of the storage system. A heat-exchange relationship governs the interaction with the surrounding environment, and it also influences the variability of the characteristic parameters of the representative model. Each identified parameter must therefore refer to a specific temperature value or range.
To model the temperature variation and the heat exchange, the power dissipated by the Joule effect must be considered. Figure 9 shows a model in which the thermal behavior is represented by equivalent electrical components: a current generator for the thermal power, the thermal resistance ($R_t$) characterizing the thermal gradient of the exchange (the temperature difference between the ESS and the external environment), and the storage system as a thermal capacity ($C_t$). The upper and lower potentials of the circuit refer, respectively, to the temperature of the storage system (measured by temperature sensors) and to the reference temperature of 0 °C, which plays the role of ground in electrical terms. In addition, the thermal resistance sits between the (thermal) potential of the ESS and the one associated with heat exchange with the (controlled) external reference temperature, so as to guarantee a non-instantaneous response to possible sudden variations in the external temperature [47]. In this way, the cell temperature, indicated with $T_{batt}$, can be expressed through an energy balance [47]. Solving it in the Laplace domain ($s$) yields the temperature profile in compact form. The balance can be adapted to the 2RC model adopted by expressing $P_{Joule}$ in terms of the electrical quantities of the circuit. In detail, $R_t = 1/(h_{conv} A_{batt})$, where $h_{conv}$ is the coefficient of convective heat exchange with the surrounding environment (air), expressed in W m⁻² K⁻¹, and $A_{batt}$ is the surface area of the storage system; dimensional analysis therefore shows that this resistance is expressed in K/W. Similarly, $C_t$ is a function of the specific heat of the material constituting the ESS, expressed in J kg⁻¹ K⁻¹, and of its mass (expressed in kg); its value can be estimated using an optimization algorithm in which the temperature profile recorded by the BMS is compared with the one simulated with the previous equations.
As demonstrated in [53][54][55], the variation of the model parameters is significant only for pronounced variations of $T_{batt}$, i.e., with gradients $\Delta T \gg 10$ °C; for small variations in the temperature profile, the parameters can therefore be considered unchanged. It should be noted that the environments in which BESSs are placed have an actively controlled temperature in order to stabilize performance and avoid excessive increases (or reductions) that could jeopardize their behavior (such as thermal runaway, although in such contexts there is also the corrective intervention of the BMS). At the same time, convective heat exchange with the surrounding environment takes place, described by a dedicated relationship in which $T_{batt}(t)$ is the temperature profile as a function of time, obtainable from (3.36), and where the non-instantaneous response to temperature change is modeled by the time constant $\tau_t = R_t C_t$.
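The lumped thermal model of this section can be sketched as a first-order system. The energy balance used below, $C_t \, dT_{batt}/dt = P_{Joule} - (T_{batt} - T_{amb})/R_t$, is an assumed standard form consistent with the description (one thermal resistance, one thermal capacity, time constant $\tau_t = R_t C_t$), and the expression of $P_{Joule}$ for the 2RC circuit as the sum of the losses in $R_0$, $R_1$, and $R_2$ is likewise an assumption. All function names and numerical values are illustrative.

```python
import math

def thermal_resistance(h_conv, a_batt):
    """R_t in K/W from the convective coefficient (W m^-2 K^-1) and area (m^2)."""
    return 1.0 / (h_conv * a_batt)

def joule_power_2rc(i, v1, v2, r0, r1, r2):
    """Assumed Joule losses of a 2RC circuit: ohmic plus the two RC resistors."""
    return r0 * i * i + v1 * v1 / r1 + v2 * v2 / r2

def step_temperature(t_batt, p_joule, t_amb, r_t, c_t, dt):
    """Exact one-step update of the first-order balance for constant inputs."""
    decay = math.exp(-dt / (r_t * c_t))   # tau_t = R_t * C_t
    t_steady = t_amb + r_t * p_joule      # temperature reached as t -> infinity
    return t_steady + (t_batt - t_steady) * decay
```

With constant dissipated power, the temperature rises exponentially toward $T_{amb} + R_t P_{Joule}$; fitting $C_t$ then amounts to matching this simulated profile to the one recorded by the BMS.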

Kinetic Battery Model
As for the temperature profile, it is necessary to monitor the behavior of the SOC, a crucial parameter for analyzing the circuit's response to a current excitation and for the identification of the 2RC electrical parameters. In the literature, the Kinetic Battery Model is combined with the 2RC model to evaluate the state of charge. This model treats the battery as a structure composed of two wells (Figure 10): one contains the available charge, which flows directly into the load, while the other contains the bound charge, which refills the available-charge well through a valve, $k$. The difference between the well heights regulates the exchanged charge. $h_1$ represents the SOC, which becomes zero when the battery is fully discharged. The SOC can be expressed in terms of $C_{available}$ and $C_{max}$, the available and maximum battery capacities, respectively. $C_{unavailable}$ is computed in detail in [56].
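The two-well picture can be made concrete with a short simulation. The equations below follow the form in which the kinetic battery model is commonly stated (well contents `y1`, `y2`, capacity fraction `c` of the available well, heights `h1 = y1 / c` and `h2 = y2 / (1 - c)`, flow through the valve proportional to `h2 - h1`); they are an assumption here, since the source does not reproduce them, and the parameter values and function names are hypothetical. The SOC is taken as the normalized height of the available well, in line with the statement that $h_1$ represents the SOC.

```python
def kibam_step(y1, y2, current_a, dt_s, c=0.6, k=1e-3):
    """One explicit Euler step of the two-well model (charges in A*s)."""
    h1 = y1 / c               # height of the available-charge well
    h2 = y2 / (1.0 - c)       # height of the bound-charge well
    flow = k * (h2 - h1)      # charge flow through the valve (A)
    return y1 + (flow - current_a) * dt_s, y2 - flow * dt_s

def simulate_kibam(capacity_as, current_profile, dt_s, c=0.6, k=1e-3):
    """Return the well contents after applying a current profile (A)."""
    y1, y2 = c * capacity_as, (1.0 - c) * capacity_as  # start fully charged
    for i in current_profile:
        y1, y2 = kibam_step(y1, y2, i, dt_s, c, k)
    return y1, y2

def kibam_soc(y1, capacity_as, c=0.6):
    """SOC taken as the normalized height of the available well."""
    return y1 / (c * capacity_as)
```

During a rest period the bound charge keeps flowing into the available well, so the estimated SOC partially recovers; this recovery effect is what a plain coulomb counter cannot reproduce.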

Data-Driven Models
Data-driven models have become increasingly popular in battery modeling due to their ability to accurately capture complex non-linear behaviors and their flexibility in handling various data types. These models use machine learning techniques to learn the relationship between input variables (such as state of charge, temperature, and current) and output variables (such as voltage and capacity) from a large dataset of battery measurements. One advantage of data-driven models is their ability to capture the dynamic behavior of batteries, which is crucial for accurate state estimation and prediction. Traditional physics-based models often rely on simplifying assumptions and may not capture the full complexity of battery behavior. Data-driven models, on the other hand, can learn from large datasets and adapt to changes in battery behavior over time. Another advantage is their ability to handle different data types, such as time-series data or images of electrode microstructures. This allows for a more holistic approach to battery modeling, in which multiple data types are integrated to provide a more comprehensive understanding of battery behavior. However, one challenge in using data-driven models is the need for high-quality data, which requires careful experimental design and data collection procedures. Unbalanced data can make a model prone to overfitting or underfitting [57]. These issues are well studied, and researchers have developed general guidelines to address them [58]. Additionally, the choice of machine learning algorithm and model architecture can significantly impact the performance and interpretability of the model (Figure 11). Despite these challenges, data-driven models have shown promising results in accurately predicting battery behavior and have the potential to improve battery management systems significantly. As more data becomes available and machine learning techniques continue to advance, data-driven models are expected to play a significant role in the future of battery modeling. Several machine learning techniques can be used in the data-driven approach for battery modeling. These include:
1. Artificial Neural Networks (ANNs): ANNs are a type of machine learning algorithm inspired by the structure and function of the human brain (Figure 12). They consist of interconnected nodes, or neurons, that process and transmit information. One specific application of ANNs in battery modeling is state of charge (SOC) estimation. The SOC is a critical parameter for battery management, as it represents the amount of charge remaining in a battery and is crucial for determining its performance and lifespan. ANNs are particularly well suited for SOC estimation because they can handle complex relationships between input variables, such as voltage, current, and temperature, and the output variable, the SOC. The training process involves feeding the ANN with a large dataset of input-output pairs, where the input variables represent the battery's operating conditions and the output variable is the corresponding SOC value. The ANN learns the relationship between the input and output variables and builds a mathematical model that predicts the SOC for new input data. This model can then be used in real time to estimate the SOC of a battery based on its current operating conditions. One advantage of using ANNs for SOC estimation is their ability to handle non-linear relationships between input and output variables. This is particularly useful for batteries, as their behavior can be highly non-linear and dependent on factors such as temperature and aging. Moreover, ANNs can be trained to account for different battery chemistries, making them versatile for use with different types of batteries. This is important because different battery chemistries have different charge-discharge characteristics, and an accurate SOC estimation model must take this into account. Neural networks have recently been used in several works to evaluate the SOC of batteries. Table 1 summarizes the various results.
Random Forests: Random forests are an ensemble learning technique that combines multiple decision trees to make predictions. They build many decision trees and use the average prediction from all the trees as the final prediction. Random forests are useful for battery modeling as they can handle high-dimensional data and reduce overfitting.

5. Gaussian Processes (GPs): Gaussian processes are a probabilistic machine learning technique that can be used for regression tasks [74,75]. They model the relationship between input and output variables as a distribution over functions whose values are jointly Gaussian, which provides an uncertainty estimate alongside each prediction. Since the forecast is based on a Gaussian distribution, it can be improved using adaptive fitting [76]. The typical trend of a GPR model is illustrated in Figure 14. Gaussian processes are helpful for battery modeling as they can handle noisy and sparse data, and they are a powerful technique for battery SOC estimation. One advantage of using GPs for SOC estimation is their ability to handle non-linear relationships between input and output variables. Batteries exhibit non-linear behavior due to aging, temperature, and discharge rate. Traditional linear models struggle to capture these complexities, leading to inaccurate SOC estimations. GPs, on the other hand, can capture these non-linear relationships and provide more accurate predictions. Another significant advantage of GPs is their ability to estimate the uncertainty of their predictions. This is crucial for battery management, as it allows for more informed decision-making. Batteries are subject to various uncertainties, such as measurement errors and environmental factors, which can affect their performance and lead to potential failures. By considering the uncertainty in the SOC estimation, battery management systems can take appropriate actions to prevent failures and ensure optimal battery performance.
6. Fuzzy Logic: Fuzzy logic is another popular approach for battery SOC estimation. It is a mathematical framework that can handle imprecise and uncertain information, making it well suited for battery systems that exhibit non-linear and uncertain behavior. In fuzzy logic, input variables such as voltage, current, and temperature are mapped to linguistic terms such as "low", "medium", and "high". These terms define fuzzy sets, which represent the different states of the battery. The rules describing how the input variables affect the SOC are defined using expert knowledge and experience. The fuzzy logic system then takes in the linguistic inputs and uses these rules to calculate the SOC. One advantage of this approach is its ability to handle imprecise and uncertain inputs. Batteries often experience variations in their behavior due to aging and environmental conditions, and fuzzy logic can account for these uncertainties and provide more accurate SOC estimations. Another advantage of fuzzy logic is its interpretability. The rules used in the fuzzy logic system can be easily understood by humans, making it easier to validate and improve the model. This is particularly useful for battery management systems, where it is essential to understand clearly how the SOC estimation is calculated. However, one limitation of fuzzy logic is that it relies heavily on expert knowledge and assumptions about the data. This can be a disadvantage when the data is complex and cannot be easily captured by simple rules. Additionally, fuzzy logic may struggle with large datasets, as it requires significant computational resources to process linguistic inputs and apply rules.
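As an illustration of the last technique in the list, the fuzzy-logic workflow (fuzzification into linguistic terms, expert rules, combination into a crisp SOC) can be sketched as a small zero-order Sugeno-type system. Everything numerical here is a hypothetical placeholder: the membership breakpoints, the six rule consequents, the 5 A current scale, and the function names are invented for illustration and are not taken from the reviewed works.

```python
def tri(x, a, b, c):
    """Triangular membership function: 0 at a and c, 1 at the peak b."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def fuzzy_soc(voltage_v, discharge_a):
    """Crisp SOC estimate from terminal voltage and discharge current."""
    v = min(4.2, max(3.2, voltage_v))      # clamp to the modeled range
    i = min(5.0, max(0.0, discharge_a))
    # Fuzzification: linguistic terms for each input.
    mu_v = {"low": tri(v, 2.7, 3.2, 3.7),
            "medium": tri(v, 3.2, 3.7, 4.2),
            "high": tri(v, 3.7, 4.2, 4.7)}
    mu_i = {"small": 1.0 - i / 5.0, "large": i / 5.0}
    # Expert rules (hypothetical): the same voltage under a larger load
    # implies a higher SOC, because the terminal voltage sags under load.
    rules = {("low", "small"): 0.05, ("low", "large"): 0.20,
             ("medium", "small"): 0.45, ("medium", "large"): 0.60,
             ("high", "small"): 0.90, ("high", "large"): 1.00}
    num = den = 0.0
    for (v_term, i_term), soc in rules.items():
        w = min(mu_v[v_term], mu_i[i_term])  # AND of the two antecedents
        num += w * soc
        den += w
    return num / den                          # weighted average of rule outputs
```

The rule table is directly readable, which reflects the interpretability advantage mentioned above, but its quality depends entirely on the expert who writes it.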
Overall, the choice of machine learning technique will depend on the specific goals and characteristics of the battery modeling project.It is essential to carefully consider the data and problem to determine the most suitable approach.

Conclusions
Modeling lithium-ion batteries through electrical circuit models and data-driven approaches plays a crucial role in accurately estimating parameters and the state of charge (SOC) for battery management systems (BMSs) in electric vehicles and other applications. Research in this area focuses on developing accurate models that can capture the dynamic behavior of lithium-ion batteries under different operating conditions and driving styles. It also compares different battery modeling methods, including electrical circuits and data-driven models, for accurate SOC estimation in BMSs. This study compares the pros and cons of these modeling methods and explores future research directions in this domain.
In particular, for the electrical circuit models, the comparison between the 1RC and 2RC dynamic models revealed that the 2RC model provides a more accurate representation of battery behavior, particularly regarding voltage and capacity estimation. The additional resistor-capacitor element in the 2RC model allows for better modeling of the battery's internal resistance and polarization effects, leading to improved accuracy in predicting battery performance. On the other hand, when comparing the PNGV and GNL models with the RC equivalent model, it was found that the RC equivalent model provides a simpler and more computationally efficient approach to battery modeling. However, the PNGV and GNL models offer more detailed representations of battery behavior, particularly regarding thermal and aging effects. The choice of model depends on the specific application and the level of detail required for accurate battery management.
The data-driven approach to modeling lithium-ion batteries addresses the inconsistent and varied characteristics of battery cells, which pose challenges for battery pack modeling. This approach aims to reduce computational work while maintaining good model accuracy, and it offers a promising solution to the challenges of modeling lithium-ion batteries, particularly in electric vehicle applications. By leveraging data-driven methods, researchers aim to enhance the accuracy and efficiency of battery modeling, contributing to the development of advanced battery management systems for electric vehicles and renewable energy storage applications. In conclusion, research on modeling lithium-ion batteries through electrical circuit models and data-driven approaches provides valuable insights into developing accurate and reliable models for battery management systems, ensuring the safe and efficient operation of electric vehicles and other applications.

Figure 6. Response of the 2RC-type ECM to a current pulse in the Pulse Discharge Test (PDT) [44].

Figure 9. Thermal model adopted to monitor the temperature variations of the BESS.

Figure 12. The general architecture of the 3-layer neural network.

Table 1. NN used for SOC estimation.

SVMs can be trained using a dataset of input variables such as voltage, current, and temperature, together with the corresponding SOC values. The SVM then finds the hyperplane that best separates the data points and creates a model that can accurately predict the SOC for new input data. Additionally, SVMs have a robust generalization ability, meaning they can perform well on unseen data. This is important for battery SOC estimation, as the model needs to accurately predict the SOC for various operating conditions, not just the ones it was trained on. Furthermore, SVMs effectively handle noisy data, which is common in battery systems due to external factors such as sensor errors or variations in battery chemistry. By accounting for noise in the training process, SVMs can create a more accurate SOC estimation model. Table 2 shows a summary of the works in which SVMs were used for SOC estimation.

Table 2. Summary of the contributions that used SVMs for SOC estimation.