A Data-Driven Method with Fusing Mechanism Information for Li-Ion Battery State of Charge Estimation

Xiao, Zhanghua; Rao, Jingzhi; Ji, Cheng; Ma, Fangyuan; Wang, Jingde; Sun, Wei

doi:10.3390/pr13113597


by Zhanghua Xiao ^1,†, Jingzhi Rao ^1,†, Cheng Ji ^2, Fangyuan Ma ^1, Jingde Wang ^1,* and Wei Sun ^1,*

^1 College of Chemical Engineering, Beijing University of Chemical Technology, Beijing 100029, China

^2 School of Chemistry and Chemical Engineering, Huaiyin Normal University, Huaian 223300, China

^* Authors to whom correspondence should be addressed.

^† These authors contributed equally to this work.

Processes 2025, 13(11), 3597; https://doi.org/10.3390/pr13113597

Submission received: 18 August 2025 / Revised: 1 November 2025 / Accepted: 4 November 2025 / Published: 7 November 2025

(This article belongs to the Special Issue Women’s Special Issue Series: Processes)


Abstract

Lithium-ion batteries are widely used as a high-power, rechargeable energy storage medium. Accurate estimation of the battery state of charge (SOC) in the battery management system (BMS) is imperative for ensuring the safe and stable operation of electric vehicles. This paper proposes an SOC estimation method that combines the equivalent circuit model (ECM) and the ampere-hour integration method (Ah-I method) with a physics-informed neural network. These two sources of mechanistic information are used as a priori knowledge to constrain the estimation of SOC. First, the Rint model is selected as the physical analysis model of the lithium-ion battery, and the Ah-I method is chosen as the auxiliary model for SOC output estimation. A deep learning network is then employed to establish the mapping between the battery input parameters and the SOC output. Finally, the SOC is estimated by fusing the physical model and the data-driven model. The results demonstrate that the method accurately estimates the state of charge of lithium batteries, with a root mean square error within 1%. The validity of the method was further confirmed through comparison with other approaches.

Keywords:

SOC estimation; physical informed neural network; equivalent circuit model; lithium-ion battery

1. Introduction

The development of electric vehicles has led to a surge in demand for battery-related products and services, while the emergence of new energy technologies has also driven significant advancements in battery applications. Lithium-ion batteries are being employed on a large scale in the electric vehicle industry due to their extended cycle life, high energy density, and stable performance [1,2]. Battery management systems (BMS), which are integral to electric vehicles, are designed to ensure the smooth operation of batteries under complex and harsh operating conditions. The key issues that a BMS addresses include state of charge (SOC) estimation, state of health (SOH) evaluation, and remaining useful life (RUL) evaluation [3]. These quantities are closely tied to the state of the lithium-ion battery. Among them, SOC is regarded as the most crucial for the BMS. Accurate SOC estimation provides the basis for other monitoring metrics of the BMS, including SOH and RUL. Moreover, the SOC supports a range of control schemes, such as equalization management. SOC is defined as the percentage of charge currently contained in a battery relative to its maximum charge capacity [4,5]. Its role is analogous to that of the fuel gauge in traditional fuel vehicles. Because of the complex electrochemical characteristics of lithium batteries, SOC cannot be measured directly and must be estimated, and an accurate estimate is critical for the BMS: overcharging or over-discharging can shorten battery life or even cause permanent damage.

In recent years, research on SOC estimation has primarily focused on four categories of methods: the open-circuit voltage method [6,7], the ampere-hour integration method (Ah-I method) [8,9], mechanism modeling-based methods, and data-driven methods. The open-circuit voltage method builds a look-up table from the measured mapping between the open-circuit voltage and the SOC of the battery. Obtaining an accurate mapping requires precise experimental measurements; moreover, the voltage must be maintained at a constant level throughout the measurement interval to eliminate overpotential effects, which makes each measurement time-consuming. As a result, the method is not readily implementable in practical applications. The Ah-I method obtains the battery SOC from the change in battery capacity, which is calculated by integrating the discharge current. This method is simple to compute and highly efficient. However, it requires frequent calibration because of the low efficiency of the energy conversion and changes in the battery discharge rate. The resulting errors accumulate over time, so other methods must be used alongside it in practice.

Mechanism-based approaches include electrochemical modeling (EM) and equivalent circuit modeling (ECM). The most widely used EMs for SOC estimation are the single-particle model (SPM) [10] and the pseudo-two-dimensional model (P2D) [11]. These models rely on partial differential equations with a substantial number of unknown parameters, which leads to high computational complexity. The ECM is predicated on the theory of porous electrodes and concentrated solutions [12]. It describes the battery mechanism by equating the internal current process of the battery to an electronic circuit process and performing circuit analysis to establish a mathematical model. These models are often combined with state observers in practical applications [13].

Advances in computer technology have accelerated the development of data-driven SOC estimation methods, which have attracted increasing attention. These methods use measurable indicators such as the voltage, current, and temperature of the battery cells to model the SOC directly, circumventing the intricate modeling of battery behavior while retaining reasonable accuracy and practicality. Chemali et al. [14] used a Long Short-Term Memory recurrent neural network (LSTM-RNN) to map the current, voltage, and temperature directly to the SOC, thus avoiding the filters and inference algorithms used in mechanism modeling and achieving accurate SOC estimation. To further enhance the SOC estimation accuracy by capturing context from both earlier and later points in the sequence, Yang et al. [15] employed a bidirectional LSTM (BiLSTM) to improve the model’s ability to process input sequences bidirectionally. However, there is room for improvement, as deep learning methods require substantial computational resources and training data. Battery state estimation can be made more efficient and precise by improving data quality [16].

These methods have yielded a multitude of sophisticated network structures and algorithm designs. The SOC estimation methods based on these designs are both powerful and straightforward to implement; however, their accuracy still needs to be improved. Li-ion batteries are intricate systems that exhibit significant nonlinearity. On the one hand, if relevant information is not fully taken into account, the performance will be suboptimal [17]. On the other hand, considering an excessive number of factors may result in overfitting, and the incorporation of excessive dimensions has been shown to compromise the accuracy and stability of the algorithm [18]. The prevailing deep learning-based methods are purely data-driven and overlook the electrochemistry of the battery. Consequently, they fail to consider pertinent information, resulting in suboptimal performance. Research in other domains has demonstrated that incorporating mechanism-related knowledge into machine learning can enhance performance [19]. For instance, Fangfang Yang et al. [20] incorporated LSTM into SOC estimation and leveraged LSTM to rectify the original UKF estimation, thereby attaining better SOC estimation than the LSTM model alone. In a similar vein, Jinpeng Tian et al. [21] proposed a model-based approach that decomposes the measured voltage and current sequences into open circuit voltage (OCV), ohmic response, and polarization voltage, among other quantities. This approach expands the inputs available to DNNs, thereby facilitating more efficient learning of the mapping between measurable signals and SOC. The methodology incorporates the mechanism information of the simplified RC model as a data enhancement input to the DNN, and the results demonstrate that it improves SOC estimation performance.

These methods illustrate that augmenting the input variables with more relevant data can lead to substantial improvements in prediction accuracy. However, such methods require a more extensive processing flow, and they do not integrate comprehensive information about the battery operation mechanism within the deep learning model itself.

To address these challenges, we propose a neural network model that integrates mechanistic information for accurate SOC estimation. This model integrates domain expertise from the battery field into a data-driven SOC estimation method. The method utilizes a straightforward and effective ECM and the Ah-I method, enhancing the mapping between input variables and SOC. This approach requires only a modest increase in computational cost to achieve enhanced SOC estimation performance. Furthermore, we determine the relative weights of the two types of mechanistic information within the loss function that yield the best SOC estimation performance, and the error metrics are significantly reduced as a result. Finally, the proposed method was applied experimentally to a range of operating conditions, thereby substantiating the applicability of the fusion model. The primary innovations and contributions of this paper are as follows: (1) A novel SOC estimation method based on a physics-informed neural network is proposed. This method utilizes ECM and the Ah-I method, enhancing the prediction performance of the fused model in the battery domain. (2) The proposed method integrates the ECM, the Ah-I method, and the deep learning model, leveraging the valuable insights offered by these models. Specifically, the mechanism model is employed to constrain the input and output data of the deep learning model, thereby enhancing its efficacy. (3) The efficacy of the proposed method is substantiated by a comparison with the estimation results of other types of deep learning models, in which the fusion model yields superior estimation results. This comparison provides a solution for the study of SOC estimation under different operating conditions.

The contribution of this work lies in the deep fusion of mechanistic models with data-driven approaches to obtain more accurate SOC estimation models. Existing hybrid models fall into two categories. In the first, the mechanism model runs in parallel with the data model: the mechanism model calculates the base values, the data-driven model maps the error values, and the two are subsequently combined. In the second, the data-driven and mechanistic models are connected in series: the data-driven model maps the parameters that are then fed into the mechanistic model to obtain the SOC estimates. Existing hybrid methods thus assist SOC estimation with data and mechanism separately, without achieving tight binding. In contrast, this study tightly combines the two types of information so that the data process itself contains mechanism information, and it therefore differs from both existing categories.

2. Data Sets and Performance Evaluation Metrics

In order to validate the effectiveness of the methods employed in this study and provide a visual representation, this section describes the datasets and evaluation metrics used.

2.1. Data Sets

Three datasets were used to validate the models in this paper: the Panasonic 18650PF (Panasonic Corporation, Osaka, Japan) dataset [22], CALCE’s A123 battery (A123 Systems, LLC, Waltham, MA, USA) dataset, and the INR 18650-20R battery (Samsung SDI Co., Ltd., Cheonan, Republic of Korea) dataset [23]. The specifications of the datasets are presented in Table 1, and they are described in turn below.

The tests included in the Panasonic 18650PF dataset were conducted by Dr. Phillip Kollmeyer of the University of Wisconsin-Madison (phillip.kollmeyer@gmail.com). The test environment consisted of an 8-cubic-foot thermal chamber in which a brand-new 2.9 Ah Panasonic 18650PF battery was tested using a 25-amp, 18-volt universal power tester channel. The dataset comprises six test temperatures, ranging from −20 °C to 25 °C. The test conditions comprise nine driving cycles: cycles 1–4, US06, HWFET, UDDS, LA92, and a neural network (NN) cycle. Cycles 1–4 are each composed of a random assortment of the US06, HWFET, UDDS, LA92, and NN drive cycles. The NN drive cycle combines elements of the US06 and LA92 drive cycles with additional dynamics designed to facilitate the training of neural networks; it is used here for the validation analysis of the model proposed in this paper. The drive power curves have been scaled based on an electric Ford F-150 truck equipped with a 35 kWh battery pack, which effectively simulates the SOC changes that occur during real vehicle driving.

The A123 battery dataset was developed by the Battery Research Group at the Center for Advanced Life Cycle Engineering (CALCE) at the University of Maryland. The battery was tested with a BT2000 tester (Arbin Instruments, College Station, TX, USA). The dataset contains data at eight temperatures, spanning a range from −10 °C to 50 °C. The collected data cover three distinct operating conditions, namely DST, US06, and FUDS, which collectively represent a spectrum of potential driving scenarios.

The INR 18650-20R battery dataset encompasses a range of operating conditions, as previously delineated, in addition to an auxiliary BJDST condition. This condition is intended to simulate dynamic stress testing in Beijing’s urban areas.

The aforementioned data acquisition conditions are derived from the potential driving conditions that prevail in the actual scene. Further detailed information on these driving conditions can be found in the U.S. Advanced Battery Consortium (USABC) brochure [24].

The current–voltage variation curves of the aforementioned six operating conditions at an ambient temperature of 25 °C are illustrated in Figure 1. The figure shows that the current–voltage variations differ notably between operating conditions. Note that the charging current is specified as positive, while the discharging current is specified as negative.

2.2. Performance Evaluation Metrics

In order to evaluate the precision of the proposed network and the quality of curve fitting between predicted and true values, the mean absolute error (MAE), root mean square error (RMSE), and maximum absolute error (MAX) are employed as assessment metrics. The MAE is the mean of the absolute differences between the predicted and true values and is used to assess the overall error in the general case. The RMSE is the square root of the mean squared difference between the predicted and true values. This metric is more sensitive to large errors and better reflects error fluctuations. MAX is the largest absolute difference between the predicted and true values, which identifies the worst performance exhibited by the model. The expressions are provided below for reference.

\begin{matrix} \mathrm{MAE} = \frac{1}{n} \sum_{i = 1}^{n} |\hat{y}_{i} - y_{i}| \end{matrix}

(1)

\begin{matrix} \mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(\hat{y}_{i} - y_{i})}^{2}} \end{matrix}

(2)

\begin{matrix} \mathrm{MAX} = \max_{1 \leq i \leq n} |\hat{y}_{i} - y_{i}| \end{matrix}

(3)

where $n$ denotes the total number of samples, $\hat{y}_{i}$ denotes the predicted value of SOC and $y_{i}$ denotes the actual value of SOC.
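As a concrete illustration of Equations (1)–(3), the following minimal Python sketch computes the three metrics for a short list of paired values. The function name `soc_error_metrics` and the sample values are hypothetical and chosen only for illustration; they are not taken from the datasets.

```python
import math


def soc_error_metrics(y_pred, y_true):
    """Return (MAE, RMSE, MAX) for paired SOC predictions and reference values."""
    if len(y_pred) != len(y_true) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    abs_err = [abs(p - t) for p, t in zip(y_pred, y_true)]
    n = len(abs_err)
    mae = sum(abs_err) / n                             # Equation (1)
    rmse = math.sqrt(sum(e * e for e in abs_err) / n)  # Equation (2)
    max_err = max(abs_err)                             # Equation (3)
    return mae, rmse, max_err


# Hypothetical SOC values in percent (illustrative only).
y_true = [80.0, 60.0, 40.0, 20.0]
y_pred = [81.0, 59.0, 43.0, 20.0]
mae, rmse, max_err = soc_error_metrics(y_pred, y_true)
```

In this toy case the single 3% outlier raises the RMSE above the MAE, which illustrates why the RMSE is described as more sensitive to large errors.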

The experiments described in this paper were conducted on a machine with an AMD Ryzen 5 5600H CPU and an RTX3050Ti (16 G), running the Windows 11 operating system, using the TensorFlow framework and Python 3.9.

3. Methods

3.1. Overall SOC Estimation Framework

Two common approaches for integrating mechanism and data in the context of SOC estimation are the parallel and series methods [25], as illustrated in Figure 2a,b.

The parallel approach uses the mechanistic model as a complement to the data-driven model. For instance, Chen et al. [26] introduced an additional sliding window average voltage as slow time-varying information at the input of the LSTM-RNN, which can facilitate the learning of the mapping of rapid changes in battery voltage and current to the internal characteristics of the battery. The serial approach uses the mechanistic model as a preprocessing tool. For instance, Hanqing Yu et al. [27] incorporated the simplified electrochemical model parameters as an extension to the model. However, these methods require a more extensive processing flow, and they do not integrate rigorous information about the battery operation mechanism within the deep learning model itself.

In view of this, we propose the fused hybrid approach shown in Figure 2c. The model combines the mechanistic information from the ECM and the Ah-I method with a neural network to implement a physics-informed neural network for SOC estimation. Specifically, the data inputs, such as current and voltage, are used in two computations: one for the data-driven loss and the other for the losses associated with the two types of mechanistic information. Training balances the two types of losses, thereby facilitating a more accurate mapping between input and output and enhancing the performance of SOC estimation. The integration of physical laws into the model enhances its generalization capability when confronted with unseen problems beyond the distribution of training data, thus facilitating reasonable predictions of physical phenomena even in data-scarce environments. The overall network model structure is shown in Figure 3.

The overall SOC estimation framework of this paper is illustrated in Figure 4, which is composed of three sections. The initial component entails the aggregation of battery data, encompassing the acquisition of fundamental battery parameters in accordance with the test conditions under various operating conditions at distinct temperatures. Subsequently, the obtained basic battery data are subjected to pre-processing, including deletion of duplicate values, outlier screening and normalization. The second part of the framework involves the auxiliary calculation of the ECM and Ah-I method, which includes the calibration of the mapping relationship between OCV-SOC. Subsequently, the PINN model is trained with a neural network as the core, adding two parts of mechanism information into the network for constraints. The inputs are current, voltage, temperature, and time, and the output of the network is the SOC result. The final stage of the process involves error analysis of the trained results and comprehensive evaluation of the model estimation performance.

3.2. Physics-Informed Neural Networks

This section outlines the general architecture of physics-informed neural networks (PINN). A physics-informed neural network comprises two components: a conventional neural network and a network that encodes the physical information. A feedforward neural network comprises three principal layers: an input layer, a fully connected hidden layer with a nonlinear activation function for each neuron, and an output layer. A weight matrix (w) and a bias (b) are applied between each pair of layers. During the training phase, the weight matrix and the bias are optimized to minimize an objective function that typically penalizes the deviation of the neural network predictions from the training data. A PINN can be employed to directly learn the nonlinear mapping between the inputs and the solution of a differential equation, as illustrated in Figure 5.

Following the notation of reference [28], the general form of the equation that can be fitted by a physics-informed neural network is as follows:

\begin{matrix} \frac{\partial u}{\partial t} = - N[u; \lambda], \quad x \in \Omega, \ t \in [0, T] \end{matrix}

(4)

where $u(t, x)$ is the solution of the system and $N[u; \lambda]$ is a nonlinear operator connecting the state variable $u$ and the system parameter $\lambda$. In this context, the variable $t$ denotes the time and $x$ denotes the system input. The domain $\Omega$ can be defined in accordance with the a priori knowledge of the dynamical system, and $[0, T]$ represents the time interval over which the system evolves. The model parameter $\lambda$ may be either a constant or an unknown quantity. In order to enforce the physical laws that describe the dynamical system, we define the physics-informed neural network $f(t, x)$.

\begin{matrix} f(t, x) = \frac{\partial u}{\partial t} + N[u; \lambda] \end{matrix}

(5)

It should be noted that the nonlinear operator $N[u; \lambda]$ reduces to $N[u]$ in the case where the system parameter $\lambda$ is known. The overall architectural configuration is illustrated in Figure 5. A neural network is employed to predict $u(t, x)$ based on inputs $t$ and $x$. To ascertain $f(t, x)$, we utilise automatic differentiation of the neural network components that predict $u(t, x)$ [29]. Accordingly, the derivative of $u(t, x)$ with respect to time $t$ and the system input $x$ is computed. Therefore, the neural network that predicts $f(t, x)$ has the same parameters but a distinct activation function compared to the neural network that predicts $u(t, x)$. The shared parameters of the two neural networks are optimized through the minimization of the loss function.

\begin{matrix} \mathrm{MSE} = \omega_{1} \mathrm{MSE}_{u} + \omega_{2} \mathrm{MSE}_{f} \end{matrix}

(6)

where

\begin{matrix} \mathrm{MSE}_{u} = \frac{1}{N_{u}} \sum_{i = 1}^{N_{u}} {|u(t_{u}^{i}, x_{u}^{i}) - u^{i}|}^{2} \end{matrix}

(7)

and

\begin{matrix} \mathrm{MSE}_{f} = \frac{1}{N_{f}} \sum_{i = 1}^{N_{f}} {|f(t_{f}^{i}, x_{f}^{i})|}^{2} \end{matrix}

(8)

Here, $\omega_{1}$ and $\omega_{2}$ are the weights of the two loss terms. The term $\mathrm{MSE}_{u}$ represents the mean square error loss associated with the initial data set, and $N_{u}$ refers to the total number of training data. $\mathrm{MSE}_{f}$ denotes the mean square error over a finite set of collocation points, with $N_{f}$ representing the total number of collocation points. The number of collocation points and the quantity of training data affect both the accuracy of the predictions and the time required to optimize the loss function. The error $\mathrm{MSE}_{u}$ enforces the boundary conditions on the independent variable $x$, while $\mathrm{MSE}_{f}$ enforces the physical properties of the dynamical system expressed by the residual in Equation (5). In other words, it penalizes deviations of the predictions from the laws of physics. In this study, the Rint model and the ampere-hour integral model were selected as physical characteristics. The objective is to identify the neural network parameters (weights and biases) that minimize (6), given the training dataset and known system parameters $\lambda$. If the parameter $\lambda$ is unknown, it is treated as an additional trainable variable and optimized together with the network parameters. Accordingly, the loss function in a standard PINN comprises two distinct contributions: one term enforces the partial differential equation at the collocation points, while the other aligns the solution of the partial differential equation with the observed data. The specific training process is shown in Algorithm 1.

Algorithm 1. Training procedure of the PINN.

Input: Training samples X, divided in batches into training sets and validation sets.

Initialize PINN parameters W0 and B0.

While epoch ≤ threshold:

Calculate point residuals:

R_data = u (t,x) − u_measured //Deviation of the predicted SOC from the measured SOC, as in Equation (7)

R_PDE = f (t,x) //Includes Rint model and ampere-hour integration module

Calculate total residuals:

Loss = ω1 MSE (R_data) + ω2 MSE (R_PDE)

Backpropagation updates parameters

epoch = epoch + 1

Output: PINN parameters W and B
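The loss evaluated in each iteration of Algorithm 1 can be sketched in a few lines of plain Python. This is a minimal sketch of the weighted sum in Equation (6) only, not the authors' TensorFlow implementation; the function names `mse` and `pinn_loss`, the weights, and the numeric values are assumptions made for illustration.

```python
def mse(residuals):
    """Mean of squared residuals."""
    return sum(r * r for r in residuals) / len(residuals)


def pinn_loss(u_pred, u_obs, f_residual, w1=1.0, w2=1.0):
    """Weighted PINN loss w1 * MSE_u + w2 * MSE_f, following Equation (6).

    u_pred, u_obs: predicted and measured outputs at the data points (Equation (7)).
    f_residual:    physics residuals f(t, x) at the collocation points (Equation (8)).
    """
    data_residual = [p - o for p, o in zip(u_pred, u_obs)]
    return w1 * mse(data_residual) + w2 * mse(f_residual)


# Hypothetical values for illustration only.
loss = pinn_loss(
    u_pred=[0.9, 0.8],
    u_obs=[1.0, 0.8],
    f_residual=[0.2, -0.2],
    w1=1.0,
    w2=0.5,
)
```

In an actual training loop the physics residuals would come from automatic differentiation of the network output, and the weights `w1` and `w2` would be tuned, so the sketch only shows how the two terms are combined.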

3.3. Rint Model

An equivalent circuit model is a theoretical construct that seeks to elucidate the operational principles of a battery by equating the current process within the battery to an electronic circuit process. This is achieved through the application of circuit analysis, which enables the establishment of a mathematical model based on the theory of porous electrodes and concentrated solutions. In light of the aforementioned considerations pertaining to model complexity, accuracy, and suitability, this paper elects to represent battery mechanism information with the Rint model. The Rint model is a basic model comprising solely an ideal voltage source (E) and a series resistance. The structure of the Rint model is illustrated in Figure 6.

In accordance with Kirchhoff’s voltage law, the circuit equation for the entire loop can be expressed as follows:

\begin{matrix} U(t) = E_{\mathrm{ocv}}(\mathrm{SOC}) - I(t) R_{0} \end{matrix}

(9)

where $U(t)$ is the terminal voltage, $I(t)$ is the current, and $R_{0}$ is the series resistance. Differentiating Equation (9) with respect to time gives:

\begin{matrix} \frac{\partial U(t)}{\partial t} = \frac{\partial E_{\mathrm{ocv}}(\mathrm{SOC}(t))}{\partial \mathrm{SOC}(t)} \frac{\partial \mathrm{SOC}(t)}{\partial t} - \frac{\partial I(t)}{\partial t} R_{0} \end{matrix}

(10)

In this context, $E_{\mathrm{ocv}}(\mathrm{SOC})$ represents the ideal voltage source, which is strongly correlated with SOC and is also referred to as the OCV. A variety of open circuit voltage models have been proposed in the literature [30], including polynomial, exponential, and hybrid models. On the basis of an assessment of model accuracy and ease of derivation, the open circuit voltage model selected in this study is a sixth-order polynomial model, as given in the following expression:

\begin{matrix} E_{\mathrm{ocv}}(z(t)) = K_{0} + K_{1} z(t) + K_{2} {z(t)}^{2} + K_{3} {z(t)}^{3} + K_{4} {z(t)}^{4} + K_{5} {z(t)}^{5} + K_{6} {z(t)}^{6} \end{matrix}

(11)

In this context, $z(t)$ represents SOC at time $t$, while $K_{0}$ to $K_{6}$ are polynomial coefficients. The model parameters are identified from experimental data. As illustrated in Figure 7, the voltage data obtained from the slow charging and discharging process reflect the OCV in a near-equilibrium state. The coefficients relating $E_{\mathrm{ocv}}$ to $z(t)$ can thus be obtained by fitting the polynomial to the data from the slow pulse discharge process.
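To make the Rint relations concrete, the sketch below evaluates the polynomial OCV model of Equation (11), the terminal voltage of Equation (9), and the residual of the differentiated form in Equation (10). All names and numbers are hypothetical: the coefficients `K_DEMO` and the resistance `R0_DEMO` are placeholders chosen for easy arithmetic, not values identified from the datasets, and SOC is assumed to be expressed as a fraction between 0 and 1.

```python
def ocv(z, k):
    """Evaluate E_ocv(z) = K0 + K1*z + ... + K6*z**6 (Equation (11)) by Horner's scheme."""
    result = 0.0
    for coeff in reversed(k):
        result = result * z + coeff
    return result


def docv_dz(z, k):
    """Derivative dE_ocv/dz, the first factor on the right-hand side of Equation (10)."""
    result = 0.0
    for power in range(len(k) - 1, 0, -1):
        result = result * z + power * k[power]
    return result


def terminal_voltage(z, current, r0, k):
    """Equation (9): U = E_ocv(SOC) - I * R0."""
    return ocv(z, k) - current * r0


def rint_residual(du_dt, dz_dt, di_dt, z, r0, k):
    """Residual of Equation (10); zero when the derivatives satisfy the Rint model."""
    return du_dt - (docv_dz(z, k) * dz_dt - di_dt * r0)


# Hypothetical coefficients K0..K6 and series resistance (assumed, not fitted).
K_DEMO = [3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.2]
R0_DEMO = 0.05  # ohm

u_demo = terminal_voltage(z=0.5, current=2.0, r0=R0_DEMO, k=K_DEMO)
```

A residual of this form is the kind of quantity that a physics loss term can penalize; how the derivatives are obtained inside the network (for example by automatic differentiation) is left outside this sketch.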

3.4. Ampere-Hour Integration Method

The Ah-I method is one of the most frequently employed techniques for estimating the SOC of a battery. The real-time SOC value can be determined by either adding the initial SOC value to the charging amount or subtracting the discharging amount. The calculation equation can be expressed as follows:

\begin{matrix} \mathrm{SOC}(t) = \mathrm{SOC}_{0} - \frac{\eta_{1}}{\eta_{2} C_{n}} \int_{0}^{t} I(t) \, dt \end{matrix}

(12)

where $\mathrm{SOC}_{0}$ is the initial SOC, which has a significant impact on the accuracy of the SOC estimate and is typically calculated by the OCV method; $\eta_{1}$ is the Coulombic efficiency, which represents the relationship between the actual discharge current and the theoretical discharge current; $\eta_{2}$ represents the ratio of the battery’s discharging capacity to its charging capacity and is defined as the charging and discharging efficiency of the battery; $C_{n}$ is the total battery capacity; and $\int_{0}^{t} I(t) \, dt$ is the integral of the discharge current of the battery over the time period $[0, t]$. In the traditional Ah-I method for estimating the SOC [31], the Coulombic and charge/discharge efficiencies are typically regarded as constants, as is the total capacity $C_{n}$ of the battery determined in the capacity experiment. In practice, however, these parameters are influenced by the interplay of ambient temperature, charge/discharge efficiency and battery health, and they exhibit pronounced time-varying characteristics [32]. Accordingly, Equation (12) is rewritten in differential form, as shown below:

\begin{matrix} \frac{\partial \mathrm{SOC}(t)}{\partial t} = - \alpha I(t) \end{matrix}

(13)

where $\alpha = \frac{\eta_{1}}{\eta_{2} C_{n}}$. This parameter is treated in this paper as a trainable parameter that participates in the training of the physics-informed neural network.
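The following minimal sketch discretizes Equation (12) with a rectangle rule and evaluates a finite-difference version of the residual of Equation (13). It rests on several assumptions that are not stated in the text: the discharge current is taken as positive (the sign convention implied by Equation (12)), both efficiencies are set to 1, SOC is a fraction between 0 and 1, and the capacity is the 2.9 Ah nominal value of the Panasonic cell converted to ampere-seconds. The function names are hypothetical.

```python
def coulomb_count(soc0, currents, dt, alpha):
    """Discrete form of Equation (12): SOC_{k+1} = SOC_k - alpha * I_k * dt."""
    soc = [soc0]
    for current in currents:
        soc.append(soc[-1] - alpha * current * dt)
    return soc


def ah_residuals(soc, currents, dt, alpha):
    """Finite-difference residual of Equation (13): dSOC/dt + alpha * I."""
    return [
        (soc[k + 1] - soc[k]) / dt + alpha * currents[k]
        for k in range(len(currents))
    ]


# Assumed parameters: eta_1 = eta_2 = 1 and C_n = 2.9 Ah expressed in ampere-seconds.
ETA_1, ETA_2 = 1.0, 1.0
C_N = 2.9 * 3600.0
ALPHA = ETA_1 / (ETA_2 * C_N)

# Constant 2.9 A discharge (1C) for 1800 s, sampled once per second.
currents = [2.9] * 1800
soc_trace = coulomb_count(soc0=1.0, currents=currents, dt=1.0, alpha=ALPHA)
residuals = ah_residuals(soc_trace, currents, dt=1.0, alpha=ALPHA)
```

Under these assumptions half an hour at 1C removes half of the charge, so the trace ends near an SOC of 0.5 and the residuals vanish up to rounding. In the paper the parameter α is trained rather than fixed, which this sketch does not attempt to reproduce.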

4. Results and Discussion

This section begins by evaluating the SOC estimation methods with data collected at 25 °C, followed by a calibration of the relative weights of the data loss and the physical information loss in SOC estimation by PINN. It then explores the impact of ambient temperature on the estimator under varying operating conditions. Finally, it presents a comparative analysis of the SOC estimation outcomes obtained with alternative methods. The hyperparameters are set as follows: layers: 4; units: 5120; batch_size: full batch; epochs: 2000; optimizer: Adam.

4.1. SOC Estimation Results at 25 °C

This subsection focuses on SOC estimation at 25 °C. The equivalent circuit model and the ampere-hour integration method are incorporated into the PINN as two separate sources of physical information, and the effect of each on the SOC estimation is discussed. As illustrated in Figure 8, the network without mechanistic information is denoted as NN. The network with the equivalent circuit model information added is denoted as PINN-1. The network with the ampere-hour integration information added is denoted as PINN-2. The network containing both types of mechanistic information is denoted as PINN-1&2.

As previously mentioned in Section 2, the dataset utilized in this section is the Panasonic 18650PF dataset from the University of Wisconsin, wherein cycle1~cycle4 serve as the training and validation sets and the NN condition serves as the test condition. The evaluation metrics for the SOC estimation results are presented in Figure 9. The traditional artificial neural network-based approach to SOC estimation achieves an RMSE of 2.63%, and Figure 8 shows that its estimate fluctuates greatly, with a maximum error of 10.74% near time step 70,000. The two mechanistic information models, which incorporate the equivalent circuit model and the ampere-hour integral, respectively, both improve the estimation of SOC compared to the initial network. The RMSE of PINN-1 decreases from 2.63% to 1.25% (a relative reduction of 52.47%), and the maximum error is reduced to 3.89%. A similar trend is observed for PINN-2, which also decreases in all three indexes. The RMSE of PINN-1&2, the method proposed in this paper, is further reduced to 0.56%, a relative reduction of 78.71%, and the maximum error is reduced to 3.45%. These results reflect the efficacy of the proposed method and demonstrate that integrating the two types of mechanism information, the Rint equivalent circuit model and the ampere-hour integration method, into the neural network can effectively reduce the estimation error of the SOC and enhance the SOC estimation accuracy.
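The relative reductions quoted for the RMSE follow directly from the reported values, as the short check below shows. The helper name `relative_reduction` is hypothetical; the three RMSE values are those given in the text.

```python
def relative_reduction(baseline, improved):
    """Relative reduction of an error metric in percent."""
    return (baseline - improved) / baseline * 100.0


# RMSE values (in %) reported in the text for NN, PINN-1, and PINN-1&2.
rmse_nn, rmse_pinn1, rmse_pinn12 = 2.63, 1.25, 0.56

reduction_pinn1 = round(relative_reduction(rmse_nn, rmse_pinn1), 2)    # 52.47
reduction_pinn12 = round(relative_reduction(rmse_nn, rmse_pinn12), 2)  # 78.71
```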

These results can be explained by the mechanistic information incorporated. First, the Rint model carries information about the internal physical processes of the battery and describes its dynamic behavior during charging and discharging. For example, the change in SOC during discharge is correlated with the change in the ideal voltage source of the Rint model. In addition, the Rint equation relates current and voltage during discharge. Coupling this equation to the neural network through the PINN helps the network learn part of the intrinsic connection between the input and output signals, which improves SOC estimation. The ampere-time integration information embedded in PINN-2 directly expresses the relationship between the battery SOC and the input current, and the network learns this relationship to improve its estimate. PINN-1&2 integrates both types of information, so its predictions are constrained by both mechanistic relations, and it attains the best estimation results. Of particular note is the tight constraint on the maximum error, which helps mitigate the risk of hazardous battery discharge caused by an erroneous SOC estimate.
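As a minimal sketch of how the two mechanistic relations could enter a network as residuals, the Python snippet below assumes the common Rint form U = E_ocv - I·R0 (symbols as in Figure 6) and a discrete ampere-time update SOC[k+1] = SOC[k] - η·I[k]·Δt/Q, with discharge current taken as positive. The sign convention, the function names, and the numerical values are assumptions made for illustration; this is not the authors' implementation.

```python
def rint_residual(u_meas, i_meas, e_ocv, r0):
    """Residual of the Rint relation U = E_ocv - I * R0 at each sample.

    A residual of zero means the sample satisfies the circuit equation.
    Discharge current is assumed positive (illustrative convention).
    """
    return [u - (e - i * r0) for u, i, e in zip(u_meas, i_meas, e_ocv)]


def ah_residual(soc_pred, i_meas, dt_s, capacity_ah, eta=1.0):
    """Residual of the discrete ampere-time update between consecutive steps.

    SOC is a fraction in [0, 1]; capacity is converted from Ah to A*s.
    """
    q_as = capacity_ah * 3600.0
    return [
        soc_pred[k + 1] - soc_pred[k] + eta * i_meas[k] * dt_s / q_as
        for k in range(len(soc_pred) - 1)
    ]


def mean_square(values):
    """Mean of squared residuals, usable as a physics loss term."""
    return sum(v * v for v in values) / len(values)


# Hypothetical samples: a 2.9 Ah cell discharged at 2.9 A (1C) for 36 s
# loses 1% SOC, so a prediction of 1.00 -> 0.99 leaves a near-zero residual.
print(mean_square(ah_residual([1.00, 0.99], [2.9, 2.9], 36.0, 2.9)))
# Hypothetical Rint sample: E_ocv = 4.0 V, I = 2.0 A, R0 = 0.05 ohm gives U = 3.9 V.
print(mean_square(rint_residual([3.9], [2.0], [4.0], 0.05)))
```

In a PINN, terms of this kind would be added to the data loss so that predictions violating either relation are penalized during training.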

In conventional PINN applications, the data-driven loss and the physical information loss are frequently weighted equally, i.e., no distinction is made between their proportions of the total loss. This is reasonable for most applications governed by physical rules, because those rules are strict physical laws such as heat diffusion and fluid flow equations. However, when such laws are integrated into the neural network, they introduce rigidity, which can drive the model toward a local rather than a global optimum [33]. The two types of laws proposed in this paper bear a strong resemblance to each other; consequently, a variable weighting strategy is employed in practice, and the optimal ratio of the two losses is determined by comparing the SOC estimation results obtained with different loss weights in the PINN.

As shown in Equation (14), the proportions of the two components of the total loss are denoted by ω_{1} and ω_{2}. For the purposes of the experiment, the following assumption is made:

\begin{matrix} ω_{1} = 1 - ω_{2} \end{matrix}

(14)

Subsequently, four values of ω_{2} were selected for estimation validation: 0, 0.1, 0.5, and 0.9. These values correspond to estimation using only the data-driven loss (A), data loss dominance (B), equal weighting of the two losses (C), and physical information loss dominance (D), respectively. To evaluate the strengths and weaknesses of the four ratios, they are integrated as a dynamic optimization option within the weight adjustment mechanism and incorporated during model training, as illustrated in Figure 10.
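The weighting in Equation (14) can be sketched in a few lines of Python. The sketch assumes that the total loss is the weighted sum ω1·L_data + ω2·L_physics, which is consistent with ω2 = 0 corresponding to a purely data-driven loss; the function name and the example loss values are hypothetical.

```python
def total_loss(data_loss: float, physics_loss: float, omega2: float) -> float:
    """Weighted total loss with omega1 = 1 - omega2 (Equation (14))."""
    if not 0.0 <= omega2 <= 1.0:
        raise ValueError("omega2 must lie in [0, 1]")
    omega1 = 1.0 - omega2
    return omega1 * data_loss + omega2 * physics_loss


# The four settings examined in the text.
SETTINGS = {"A": 0.0, "B": 0.1, "C": 0.5, "D": 0.9}

# Hypothetical loss values, used only to show how the weighting shifts.
example_data_loss, example_physics_loss = 2.0, 4.0
for label, omega2 in SETTINGS.items():
    value = total_loss(example_data_loss, example_physics_loss, omega2)
    print(f"{label}: omega2={omega2:.1f}, total loss={value:.2f}")
```

Tying ω1 to ω2 leaves a single free parameter, so the four settings span the range from a purely data-driven loss to a physics-dominated one.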

The corresponding results are displayed in Figure 11. As ω_{2} increases, the estimation indexes change non-monotonically: the estimation error first decreases and then increases. The data-loss-dominated setting (B) gives the best result, with an RMSE of only 0.56%. In contrast, the RMSE is 1.44% when the two losses are equally weighted (C) and 3.42% when the physical information loss dominates (D). Taking A as the baseline, the network with only the data-driven loss does not account for the internal operating mechanism of the battery. Instead, it relies on a black-box model to establish the mapping between the input signals and the SOC, so its estimate is mediocre. When the two losses are equally weighted, the model incorporates both the relationships within the data and the constraints of the battery mechanism model, thereby attaining better results than A. The results for B and D can be attributed to the non-rigid nature of the physical information: this mechanistic information yields a more precise estimate when it serves as an auxiliary to the data-driven loss. To illustrate, consider the Rint model, which describes the current-voltage relationship within a Li-ion battery. This model lacks a component that directly relates current and voltage to SOC, so it requires the additional relationship between open-circuit voltage and SOC for a more precise estimate. Relying on this information alone is therefore inadequate for accurate SOC estimation. The data-loss-dominated approach, on the other hand, uses this mechanistic information to address the limitations of the data-driven black-box approach without compromising the dominance of the data, in accordance with the indirect mapping of the Rint model. Consequently, for this dataset and test setup, the data-loss-dominated approach consistently yields more accurate estimation results.

4.2. Estimation Results of SOC at Other Temperatures

In addition to the working conditions, the dynamic characteristics of lithium-ion batteries are influenced by other environmental factors. In particular, changes in ambient temperature have a significant impact on the electrochemical reactions within the battery [34]. The accuracy with which the BMS estimates the SOC under extreme temperature conditions is a critical factor affecting battery longevity. In severe cases, inaccurate estimates may even damage the battery and cause hazardous accidents such as explosions [35]. It is therefore necessary to verify the proposed method under diverse temperature conditions. In this subsection, the proposed method is evaluated at −20 °C, −10 °C, 0 °C, 25 °C, and 40 °C (the 40 °C case uses the A123 dataset).

For temperatures from −20 °C to 25 °C, the training set comprises cycles 1 to 4, while the test set comprises four working conditions, namely NN, LA92, UDDS, and US06. The A123 dataset was used for 40 °C. It comprises the DST, FUDS, and US06 conditions, so the leave-one-out method was applied for training and testing (two conditions were allocated for training, and the remaining condition was used for testing). The SOC estimation results obtained by the proposed method are shown in Figure 12.
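The leave-one-out protocol over the three A123 working conditions can be sketched as follows. The function name is illustrative; the condition labels are those given in the text.

```python
def leave_one_out(conditions):
    """Yield (training_conditions, test_condition) for every held-out condition."""
    for held_out in conditions:
        training = [c for c in conditions if c != held_out]
        yield training, held_out


for training, test in leave_one_out(["DST", "FUDS", "US06"]):
    print(f"train on {training}, test on {test}")
```

Each of the three resulting splits corresponds to one row of Table 3.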

SOC estimation is investigated at varying temperatures and driving conditions, which allows a more comprehensive evaluation and validation of the method. As illustrated in Figure 12 and substantiated by Table 2, the proposed method exhibits consistent estimation accuracy across all temperatures, with the MAE remaining within 1%. The overall estimation error decreases as the temperature increases from −20 °C (low temperature) to 25 °C (room temperature). The mean maximum error across the four working conditions is shown in Figure 12. Furthermore, the three indicators increase as the temperature rises from 25 °C (room temperature) to 40 °C (high temperature), with the maximum estimation error occurring at 40 °C for the US06 condition.

Both excessively high and excessively low ambient temperatures are unfavorable for battery operation, which is consistent with the effect of temperature on the battery's internal electrochemical environment. From the perspective of the equivalent circuit model, the modelled internal resistance of a lithium battery depends on temperature. For instance, a decrease in temperature can impede the intercalation of lithium ions into the electrode. Lithium ions that accumulate around the carbon anode can then deposit as metallic lithium (lithium plating) on the electrode surface. This, in turn, diminishes the transfer rate of active lithium ions and the activity of the internal electrochemical reaction, so the charge transfer resistance rises. Elevated temperatures accelerate ion migration and thus facilitate the intercalation kinetics of lithium ions, but they also promote undesirable side reactions such as dissolution and corrosion of the solid electrolyte interface (SEI) film. The poor stability of the SEI film can then degrade the performance of the carbon anode. Furthermore, at elevated temperatures, the inactive materials within the cell (e.g., binders) may become ineffective. These effects may bias the circuit model quantities at elevated temperatures.

The variation can also be explained from the perspective of the ampere-time integration method. Temperature directly affects both the maximum battery capacity and the discharge Coulombic efficiency. As the temperature is reduced, the charge transfer resistance increases significantly, and the charge transfer resistance of a discharged battery is usually much higher than that of a charged battery. Consequently, the Coulombic efficiency of a battery is reduced at low temperatures. Lithium plating at low temperatures deposits lithium on the electrode surface, which reduces the battery capacity. At elevated temperatures, the distribution of ions becomes uneven during operation, whether charging or discharging. This may lead to ion mixing, and the heat produced during mixing affects the Coulombic efficiency. Elevated temperatures also contribute to battery ageing, which degrades performance, shortens service life, and is accompanied by a substantial decline in capacity.
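To illustrate why a temperature-induced change in capacity matters for the ampere-time integration method, the sketch below integrates SOC forward for the same current profile under two capacities. It assumes the update SOC[k+1] = SOC[k] - η·I[k]·Δt/Q with discharge current positive; the reduced capacity of 2.6 Ah is a hypothetical stand-in for a cold cell, not a measured value, and the function name is illustrative.

```python
def ampere_time_soc(soc0, currents_a, dt_s, capacity_ah, eta=1.0):
    """Integrate SOC forward: SOC[k+1] = SOC[k] - eta * I[k] * dt / Q."""
    q_as = capacity_ah * 3600.0  # capacity in ampere-seconds
    soc = [soc0]
    for current in currents_a:
        soc.append(soc[-1] - eta * current * dt_s / q_as)
    return soc


# One hour of 1 A discharge, sampled once per minute.
profile = [1.0] * 60
nominal = ampere_time_soc(1.0, profile, 60.0, capacity_ah=2.9)
# Hypothetical reduced capacity (illustration only).
derated = ampere_time_soc(1.0, profile, 60.0, capacity_ah=2.6)

print(f"final SOC with Q = 2.9 Ah: {nominal[-1]:.4f}")
print(f"final SOC with Q = 2.6 Ah: {derated[-1]:.4f}")
print(f"SOC gap from capacity mismatch: {nominal[-1] - derated[-1]:.4f}")
```

If the integration keeps using the nominal capacity while the usable capacity has dropped, the SOC estimate drifts by the gap shown, which is one way temperature can bias this mechanistic constraint.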

The results of the SOC estimation under different operating conditions demonstrate the robustness of the proposed method and its ability to adapt to the battery’s varying conditions at different temperatures. This is attributable to the incorporation of battery mechanism information in the proposed method, with the Rint module and Ah-I module serving as the abstracted information of battery operation. These modules facilitate the adaptation of the SOC estimation method to complex operating conditions.

4.3. Comparison with Other Different Methods

To verify the superiority of the proposed method, a range of data-driven and hybrid models were selected for comparison, including Support Vector Regression (SVR), Long Short-Term Memory (LSTM) neural networks, a hybrid model [32], Position-encoded Attention LSTM (PALSTM) [36], a Transformer, and an XGBoost model [37]. To ensure fairness, the same preprocessing is applied to all comparison methods. The first three use the Panasonic dataset with US06 as the test set. The comparison with PALSTM uses the CALCE dataset. The Transformer and XGBoost models were compared using the Panasonic dataset. The hyperparameters of the Transformer are set as follows: Attention Head Size: 64, Number of Attention Heads: 4, Feed-Forward Dimension: 256, Number of Transformer Layers: 2, Batch Size: 32. The hyperparameters of XGBoost are set as follows: Max Depth: [3, 5, 7], Number of Estimators: [100, 150, 200], Learning Rate: [0.01, 0.1, 0.2], Random State: 42, Cross-Validation: TimeSeriesSplit (n_splits = 5).
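The XGBoost search space and the time-ordered cross-validation described above can be sketched without external dependencies. The grid values are those listed in the text. The splitter is a plain re-implementation intended to mirror the expanding-window behaviour of scikit-learn's `TimeSeriesSplit`; it is not the library call itself, and the helper names are illustrative.

```python
from itertools import product

# Search space as listed in the text (keys follow common XGBoost naming).
XGB_GRID = {
    "max_depth": [3, 5, 7],
    "n_estimators": [100, 150, 200],
    "learning_rate": [0.01, 0.1, 0.2],
}


def expand_grid(grid):
    """Return every hyperparameter combination as a dict."""
    keys = list(grid)
    return [dict(zip(keys, values)) for values in product(*(grid[k] for k in keys))]


def time_series_splits(n_samples, n_splits=5):
    """Yield (train_indices, test_indices) with training data always preceding test data."""
    test_size = n_samples // (n_splits + 1)
    if test_size == 0:
        raise ValueError("too few samples for the requested number of splits")
    first_test = n_samples - n_splits * test_size
    for start in range(first_test, n_samples, test_size):
        yield range(0, start), range(start, start + test_size)


print(len(expand_grid(XGB_GRID)), "candidate settings")
for train_idx, test_idx in time_series_splits(12, n_splits=5):
    print(f"train [0, {train_idx.stop}), test [{test_idx.start}, {test_idx.stop})")
```

Keeping every training window strictly earlier than its test window avoids leaking future samples into the fit, which matters for sequential battery data.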

The comparison effect of the methods in the Panasonic dataset is demonstrated in Figure 13.

As shown in Figure 13, the proposed method is the most effective of the compared methods, in terms of both the SOC estimation curves and the SOC estimation errors. The local zoomed-in plot shows that SVR deviates most from the reference, followed by LSTM, while the estimation curve of the proposed method closely follows the reference value. The error plot shows that the error of the proposed method stays within the ±2% range, exceeding it only slightly during the final estimation period. SVR exhibits significant errors in both the initial and final stages, whereas LSTM exhibits minor errors initially but significant fluctuations in the later stages. Figure 14 provides a more detailed evaluation of the estimation results. The overall errors of the basic methods, SVR and LSTM, are substantial, with the maximum error of LSTM reaching 17.84%. In contrast, the errors of the proposed method and the hybrid method from the literature [32] are lower, and the RMSE and MAE of the proposed method are lower than those of the hybrid method [32]. The maximum error of the proposed method is marginally higher than that of the hybrid method, yet both are below 5%.
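The three evaluation indexes used throughout this comparison can be computed as sketched below. The sketch assumes the standard definitions of RMSE, MAE, and maximum absolute error, with SOC given as a fraction and the results reported in percent; the SOC traces are hypothetical and serve only to exercise the function.

```python
import math


def soc_metrics(soc_true, soc_pred):
    """Return (RMSE, MAE, MAX) of the SOC error, each in percent."""
    errors = [p - t for p, t in zip(soc_pred, soc_true)]
    n = len(errors)
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mae = sum(abs(e) for e in errors) / n
    max_err = max(abs(e) for e in errors)
    return rmse * 100.0, mae * 100.0, max_err * 100.0


# Hypothetical SOC traces (fractions of full charge).
reference = [1.00, 0.90, 0.80, 0.70]
estimate = [1.00, 0.91, 0.78, 0.70]
rmse, mae, max_err = soc_metrics(reference, estimate)
print(f"RMSE={rmse:.2f}%  MAE={mae:.2f}%  MAX={max_err:.2f}%")
```

Because RMSE weights large deviations more heavily than MAE, a method can rank differently on the two, and MAX captures the single worst point, which is why the three indexes are reported together.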

A detailed comparison with PALSTM [36] is carried out on the CALCE dataset under three test conditions. As shown in Table 3, the RMSE of the proposed method is lower than that of PALSTM in all test cases. In the FUDS case, the MAE of the proposed method (0.82%) is marginally higher than that of PALSTM (0.80%), but for the MAX metrics, the proposed method consistently yields lower values. The findings indicate that the proposed method demonstrates superior performance in all three cases.

The performance of the Transformer model and the XGBoost model [37] is demonstrated in Figure 15 and Figure 16, respectively, on the Panasonic dataset. The results of the overall comparison are presented in Figure 17. It can be observed that, on the same dataset, both the Transformer and XGBoost models exhibit inferior performance compared to the method proposed in this paper. The Transformer model demonstrates the poorest performance, and the XGBoost model also exhibits similar deficiencies. The method proposed in this study demonstrates optimal performance under all four operational conditions.

In order to further explore and validate the practical value of this study, the computational time for methods other than the Hybrid and PALSTM models was calculated. For further details, please refer to Table 4.

These findings are derived from the Panasonic dataset. The training time of the proposed model increases with the size of the dataset. Training on the Panasonic dataset, which contains approximately 100,000 data points, requires approximately 13 min on average, while the validation and testing phases require less than 0.1 s. In contrast, the A123 dataset and the INR 18650-20R (NCM) dataset each contain approximately 10,000 samples, for which the training time is approximately 2.5 min and validation and testing take approximately 0.012 s. The proposed method therefore fulfils both the performance and time requirements for practical application in automotive environments.

5. Conclusions

The PINN model proposed in this paper integrates the battery mechanism with a data-driven model, yielding more precise SOC estimates. Information from two mechanisms is incorporated into the data-driven model independently, and the resulting SOC estimates are compared with those of the original model. The findings indicate that both modules enhance the SOC estimation performance of the model. With both modules, the RMSE, MAE and MAX were reduced by 78.71%, 80.42% and 67.88%, respectively, compared to the initial model. The effect of different constraint ratios on the estimation was further investigated, and the optimal constraint ratio was determined. Furthermore, the model performance was assessed under diverse operating conditions, encompassing different temperature ranges, and a comparative analysis was conducted with other methods. This study demonstrates that a hybrid approach, integrating battery mechanistic information with data-driven models, improves the accuracy with which the BMS estimates the SOC.

Despite these promising results, several aspects warrant further investigation. Incorporating temperature-dependent electrochemical reactions and long-term aging mechanisms could improve the robustness and adaptability of the model. Moreover, real-time implementation and validation under complex driving or charging conditions remain important directions for future research. These efforts would further strengthen the applicability of the proposed method in practical battery management systems.

Author Contributions

Conceptualization, Z.X., J.R. and C.J.; methodology, Z.X., J.R. and C.J.; software, Z.X., J.R., C.J. and F.M.; validation, Z.X. and J.R.; formal analysis, Z.X. and J.R.; investigation, Z.X. and J.R.; resources, Z.X., J.R., C.J. and F.M.; data curation, Z.X., J.R., J.W. and W.S.; writing—original draft preparation, Z.X. and J.R.; writing—review and editing, J.R., C.J., F.M., J.W. and W.S.; visualization, Z.X. and J.R.; supervision, J.W. and W.S.; project administration, J.W. and W.S.; funding acquisition, J.W. and W.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China [grant number 22278018].

Data Availability Statement

The data used in this manuscript are public datasets, available from the following sources: https://doi.org/10.17632/wykht8y7tg.1 [22]; Battery Data | Center for Advanced Life Cycle Engineering.

Acknowledgments

The authors acknowledge the support received from the foundation.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

ampere-time integration	Ah-I
battery management system	BMS
electrochemical modeling	EM
equivalent circuit model	ECM
maximum absolute error	MAX
mean absolute error	MAE
open circuit voltage	OCV
pseudo-two-dimensional model	P2D
remaining useful life	RUL
root mean square error	RMSE
single-particle model	SPM
state of charge	SOC
state of health	SOH

References

1. Tao, T.; Ji, C.; Dai, J.; Rao, J.; Wang, J.; Sun, W.; Romagnoli, J. Data-based health indicator extraction for battery SOH estimation via deep learning. J. Energy Storage 2024, 78, 109982.
2. Li, F.; Zuo, W.; Zhou, K.; Li, Q.; Huang, Y. State of charge estimation of lithium-ion batteries based on PSO-TCN-Attention neural network. J. Energy Storage 2024, 84, 110806.
3. Hussein, H.M.; Aghmadi, A.; Abdelrahman, M.S.; Rafin, S.M.S.H.; Mohammed, O. A review of battery state of charge estimation and management systems: Models and future prospective. WIREs Energy Environ. 2024, 13, e507.
4. Dini, P.; Colicelli, A.; Saponara, S. Review on Modeling and SOC/SOH Estimation of Batteries for Automotive Applications. Batteries 2024, 10, 34.
5. Liu, Y.; He, Y.; Bian, H.; Guo, W.; Zhang, X. A review of lithium-ion battery state of charge estimation based on deep learning: Directions for improvement and future trends. J. Energy Storage 2022, 52, 104664.
6. Zhang, S.; Zhang, X. A novel non-experiment-based reconstruction method for the relationship between open-circuit-voltage and state-of-charge/state-of-energy of lithium-ion battery. Electrochim. Acta 2022, 403, 139637.
7. Bao, W.; Liu, H.; Sun, Y.; Zheng, Y. A Fast Prediction of Open-Circuit Voltage and a Capacity Estimation Method of a Lithium-Ion Battery Based on a BP Neural Network. Batteries 2022, 8, 289.
8. Chang, W.-E.; Kung, C.-C. An Improved AhI Method with Deep Learning Networks with State of Charge Estimation of Lithium-Ion Battery. IEEE Access 2024, 12, 55465–55473.
9. Xiong, R.; Cao, J.; Yu, Q.; He, H.; Sun, F. Critical Review on the Battery State of Charge Estimation Methods for Electric Vehicles. IEEE Access 2018, 6, 1832–1843.
10. Romero-Becerril, A.; Alvarez-Icaza, L. Comparison of discretization methods applied to the single-particle model of lithium-ion batteries. J. Power Sources 2011, 196, 10267–10279.
11. Doyle, M.; Fuller, T.F.; Newman, J. Modeling of Galvanostatic Charge and Discharge of the Lithium/Polymer/Insertion Cell. J. Electrochem. Soc. 1993, 140, 1526.
12. Johnson, V.H. Battery performance models in ADVISOR. J. Power Sources 2002, 110, 321–329.
13. Shrivastava, P.; Soon, T.K.; Idris, M.Y.I.B.; Mekhilef, S. Overview of model-based online state-of-charge estimation using Kalman filter family for lithium-ion batteries. Renew. Sustain. Energy Rev. 2019, 113, 109233.
14. Chemali, E.; Kollmeyer, P.J.; Preindl, M.; Ahmed, R.; Emadi, A. Long Short-Term Memory Networks for Accurate State-of-Charge Estimation of Li-ion Batteries. IEEE Trans. Ind. Electron. 2018, 65, 6730–6739.
15. Yang, F.; Song, X.; Xu, F.; Tsui, K.-L. State-of-Charge Estimation of Lithium-Ion Batteries via Long Short-Term Memory Network. IEEE Access 2019, 7, 53792–53799.
16. Okafor, N.U.; Alghorani, Y.; Delaney, D.T. Improving Data Quality of Low-cost IoT Sensors in Environmental Monitoring Networks Using Data Fusion and Machine Learning Approach. ICT Express 2020, 6, 220–228.
17. Yuan, X.; Huang, B.; Wang, Y.; Yang, C.; Gui, W. Deep Learning-Based Feature Representation and Its Application for Soft Sensor Modeling with Variable-Wise Weighted SAE. IEEE Trans. Ind. Inform. 2018, 14, 3235–3243.
18. Qiu, X.; Wu, W.; Wang, S. Remaining useful life prediction of lithium-ion battery based on improved cuckoo search particle filter and a novel state of charge estimation method. J. Power Sources 2020, 450, 227700.
19. Karpatne, A.; Atluri, G.; Faghmous, J.H.; Steinbach, M.; Banerjee, A.; Ganguly, A.; Shekhar, S.; Samatova, N.; Kumar, V. Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data. IEEE Trans. Knowl. Data Eng. 2017, 29, 2318–2331.
20. Yang, F.; Li, W.; Li, C.; Miao, Q. State-of-charge estimation of lithium-ion batteries based on gated recurrent neural network. Energy 2019, 175, 66–75.
21. Tian, J.; Xiong, R.; Lu, J.; Chen, C.; Shen, W. Battery state-of-charge estimation amid dynamic usage with physics-informed deep learning. Energy Storage Mater. 2022, 50, 718–729.
22. Kollmeyer, P. Panasonic 18650PF Li-Ion Battery Data; Mendeley Data: Amsterdam, The Netherlands, 2018; Version 1.
23. Xing, Y.; He, W.; Pecht, M.; Tsui, K.L. State of charge estimation of lithium-ion batteries using the open-circuit voltage at various ambient temperatures. Appl. Energy 2014, 113, 106–115.
24. U.S. Department of Energy Idaho Field Office. USABC Electric Vehicle Battery Test Procedures Manual Revision 2. Available online: https://www.osti.gov/servlets/purl/214312-wzdRsH/webviewable/ (accessed on 18 February 2025).
25. Sansana, J.; Joswiak, M.N.; Castillo, I.; Wang, Z.; Rendall, R.; Chiang, L.H.; Reis, M.S. Recent trends on hybrid modeling for Industry 4.0. Comput. Chem. Eng. 2021, 151, 107365.
26. Chen, J.; Zhang, Y.; Wu, J.; Cheng, W.; Zhu, Q. SOC estimation for lithium-ion battery using the LSTM-RNN with extended input and constrained output. Energy 2023, 262, 125375.
27. Yu, H.; Zhang, L.; Wang, W.; Li, S.; Chen, S.; Yang, S.; Li, J.; Liu, X. State of charge estimation method by using a simplified electrochemical model in deep learning framework for lithium-ion batteries. Energy 2023, 278, 127846.
28. Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707.
29. Cai, S.; Wang, Z.; Wang, S.; Perdikaris, P.; Karniadakis, G.E. Physics-Informed Neural Networks for Heat Transfer Problems. J. Heat Transf. 2021, 143, 060801.
30. Weng, C.; Sun, J.; Peng, H. A unified open-circuit-voltage model of lithium-ion batteries for state-of-charge estimation and state-of-health monitoring. J. Power Sources 2014, 258, 228–237.
31. Fleischer, C.; Waag, W.; Bai, Z.; Sauer, D.U. Adaptive On-line State-of-available-power Prediction of Lithium-ion Batteries. J. Power Electron. 2013, 13, 516–527.
32. Ghaeminezhad, N.; Ouyang, Q.; Wei, J.; Xue, Y.; Wang, Z. Review on state of charge estimation techniques of lithium-ion batteries: A control-oriented approach. J. Energy Storage 2023, 72, 108707.
33. Wang, S.; Teng, Y.; Perdikaris, P. Understanding and Mitigating Gradient Flow Pathologies in Physics-Informed Neural Networks. SIAM J. Sci. Comput. 2021, 43, A3055–A3081.
34. Shen, L.; Li, J.; Liu, J.; Zhu, L.; Shen, H.T. Temperature Adaptive Transfer Network for Cross-Domain State-of-Charge Estimation of Li-Ion Batteries. IEEE Trans. Power Electron. 2023, 38, 3857–3869.
35. Ma, S.; Jiang, M.; Tao, P.; Song, C.; Wu, J.; Wang, J.; Deng, T.; Shang, W. Temperature effect and thermal impact in lithium-ion batteries: A review. Prog. Nat. Sci. Mater. Int. 2018, 28, 653–666.
36. Shah, S.A.A.; Niazi, S.G.; Deng, S.; Azam, H.M.H.; Yasir, K.M.M.; Kumar, J.; Xu, Z.; Wu, M. A novel positional encoded attention-based Long short-term memory network for state of charge estimation of lithium-ion battery. J. Power Sources 2024, 590, 233788.
37. Jiayang, H.; Jun, X.; Chuanping, L.; Delong, J.; Xuesong, M. State of charge estimation for lithium-ion batteries based on battery model and data-driven fusion method. Energy 2024, 290, 130056.

Figure 1. Current and voltage curves at 25 °C, (a,b) DST, (c,d) US06, (e,f) UDDS, (g,h) FUDS, (i,j) HWFET, (k,l) LA92, (m,n) BJDST.

Figure 2. The configuration of the fusion model. (a) Parallel, (b) serial, and (c) hybrid. (a) The data-driven model is used to learn and compensate for the differences between the mechanistic model and the empirical data to obtain the output. The term ‘error’ is used to denote the discrepancy between the calculated value of the model and the true value. The term ‘calculated value’ is used to denote the value that has been computed by the model. (b) Processing the mechanistic model as a preprocessor on the input data and subsequently using the data-driven model to obtain the output. (c) Fusion of mechanistic models with data-driven models to form hybrid models to obtain outputs.

Figure 3. Structure of the physics-informed neural network proposed in this paper.

Figure 4. Block diagram of the overall SOC estimation flow.

Figure 5. General structure of a physics-informed neural network.

Figure 6. Schematic diagram of the Rint model. Eocv is the ideal voltage source, R0 is the internal resistor, and U is the terminal voltage.

Figure 7. A plot of the low current OCV versus SOC.

Figure 8. SOC estimation results under test conditions. (a), (b), (c), and (d) correspond to NN, PINN-1, PINN-2, and PINN-1&2, respectively, with SOC estimation results on the left side of the figure and SOC estimation errors on the right side of the figure.

Figure 9. Performance of SOC estimation for corresponding methods.

Figure 10. Weight Optimization Diagram.

Figure 11. Performance of SOC estimation corresponding to different ω₂, where A, B, C, and D represent the results for ω₂ values of 0, 0.1, 0.5, and 0.9, respectively.

Figure 12. SOC estimation results of the PINN model at different temperatures. Panasonic 18650PF: (a) −20 °C, (b) −10 °C, (c) 0 °C, (d) 25 °C; INR 18650-20R: (e) 25 °C; CALCE: (f) 40 °C.

Figure 13. Comparison of SOC estimation results between PINN, SVR and LSTM methods.

Figure 14. Evaluation index results of SVR, LSTM, Hybrid [32] and PINN.

Figure 15. Evaluation index results of Transformer model.

Figure 16. Evaluation index results of XGBoost model [37].

Figure 17. The results of the overall comparison.

Table 1. Data set specifications.

Battery Dataset	Capacity/Ah	Cell Chemistry	Sampling Frequency/Hz	Working Condition Number
Panasonic 18650PF	2.9	LiCoO₂	10	9
INR 18650-20R	2.0	LiNiMnCo	1	4
CALCE	1.1	LiFePO₄	1	3

Table 2. Maximum error results of SOC estimation at different temperatures and operating conditions.

Temp	Working Condition/MAX (%)
Temp	NN	LA92	UDDS	US06	DST	FUDS	BJDST
−20 °C (Panasonic 18650PF)	5.61	7.25	5.18	3.18	-	-	-
−10 °C (Panasonic 18650PF)	5.40	4.87	4.10	6.35	-	-	-
0 °C (Panasonic 18650PF)	5.92	3.91	2.67	2.60	-	-	-
25 °C (Panasonic 18650PF)	3.45	3.08	4.46	3.11	-	-	-
25 °C (INR 18650-20R)	-	-	-	4.47	1.98	2.18	2.47
40 °C (CALCE)	-	-	-	8.00	5.09	5.06	-

Table 3. Comparison between the proposed PINN method and PALSTM.

Training Data	Test Data	RMSE (%)		MAE (%)		MAX (%)
Training Data	Test Data	PALSTM	Proposed	PALSTM	Proposed	PALSTM	Proposed
FUDS US06	DST	1.2	0.80	1.11	0.61	-	4.16
DST US06	FUDS	1.01	0.96	0.80	0.82	-	3.95
DST FUDS	US06	1.13	0.82	0.99	0.71	-	3.04

Table 4. Comparison table of processing times for different models.

Methods	Training Time (min)	Testing Time (s)
Proposed	13	0.1
SVR	0.2	0.04
LSTM	5	2
Transformer	10	80
XGBoost	5	0.15

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
