Fault Diagnosis for Lithium-Ion Battery Pack Based on Relative Entropy and State of Charge Estimation

: Timely and accurate fault diagnosis for a lithium-ion battery pack is critical to ensure its safety. However, the early fault of a battery pack is difficult to detect because of its unobvious fault effect and nonlinear time-varying characteristics. In this paper, a fault diagnosis method based on relative entropy and state of charge (SOC) estimation is proposed to detect fault in lithium-ion batteries. First, the relative entropies of the voltage, temperature and SOC of battery cells are calculated by using a sliding window, and the cumulative sum (CUSUM) test is adopted to achieve fault diagnosis and isolation. Second, the SOC estimation of the short-circuit cell is obtained, and the short-circuit resistance is estimated for a quantitative analysis of the short-circuit fault. Furthermore, the effectiveness of our method is validated by multiple fault tests in a thermally coupled electrochemical battery model. The results show that the proposed method can accurately detect different types of faults and evaluate the short-circuit fault degree by resistance estimation. The voltage/temperature sensor fault is detected at 71 s/58 s after faults have occurred, and a short-circuit fault is diagnosed at 111 s after the fault. In addition, the standard error deviation of short-circuit resistance estimation is less than 0.12 Ω /0.33 Ω for a 5 Ω /10 Ω short-circuit resistor.


Introduction
To relieve the petrochemical energy crisis and environmental pollution problem, energy storage batteries characterized by electrochemical energy storage plays a crucial role in electric vehicles (EVs) and electricity systems.Among electrochemical energy storage batteries, lithium-ion batteries (LIBs) have attracted widespread attention and are regarded as the most promising batteries [1,2] owing to their outstanding characteristics of energy density, long cycle-life and environmental protection.Additionally, the comprehensive research progress in emerging electrolyte systems for ultralow-temperature lithium batteries offers promising prospects for enhancing their performance under extreme conditions, thereby facilitating the broader application of energy storage devices in cold climates [3].Nowadays, LIBs have been widely used in electric power storage, EVs, electronic devices, electric boats and even electric aircraft.However, safety accidents caused by LIBs have occurred frequently in recent years, such as fires and explosions, which has aroused public concerns and inhibited LIBs from further development to some extent.Therefore, to prevent safety accidents involving LIBs, it is very necessary to detect and isolate the faults of lithium-ion battery packs in the early fault period.
The reasons for safety accidents with LIBs include natural battery aging and battery abuse.Battery abuse is a major factor and can be categorized into mechanical abuse, electrical abuse and thermal abuse [4].Moreover, battery abuse will give rise to a variety of battery faults and accelerate battery degradation, even causing a thermal runaway [5,6].The main cause of thermal runaway is internal short circuit (ISC), which can be divided into micro-short circuit (MSC) and hard-short circuit.However, because a hard-short circuit will immediately lead to thermal runaway, it is meaningless to attempt to detect a hard-short circuit because there is not enough time to take action to avoid thermal runaway [7,8].Therefore, it is critical to detect ISC faults, especially MSC faults.However, other types of faults will also turn into ISC faults if they are not detected in a limited period.Consequently, developing a multiple-fault diagnosis and identification method is very important for lithium-ion battery packs [9].
Common battery faults include overcharge/over-discharge, connection fault, external/internal short circuit, sensor fault and inconsistency within the battery pack.A variety of methods have been proposed for the detection of one type of these faults or multifaults in lithium-ion battery packs.The detection methods can be classified into two categories: model-based fault diagnosis methods [10][11][12][13][14] and non-model-based fault diagnosis methods [15][16][17][18][19][20][21][22].Model-based methods usually establish the battery model to obtain the parameters using fault information and a set of residual signals.Then, the battery faults can be detected by comparing the residual signals with a fault threshold.For example, Xiong et al. proposed a sensor fault diagnosis method by using an unscented Kalman filter (UKF) for SOC estimation, then combining it with a coulomb counting method to detect and isolate current/voltage sensor faults [10].Zhang et al. proposed a model-based and entropy method for multi-fault diagnosis; the terminal voltage residual and temperature residual based on an extended Kalman filter (EKF) were used to detect and isolate multi-faults in LIBs [11].Gao et al. developed the relationship between the main parameters of the model and SOC based on a cell difference model, with EKF used for SOC estimation and recursive least squares (RLS) for MSC diagnosis [12].These model-based methods can effectively detect and isolate battery faults, but most of these fault diagnosis methods require the offline identification of model parameters, which indicates fault diagnosis performance degradation under different operating conditions and temperatures.In addition, if all parameters of the battery pack are identified online in a short time, it will produce a large computational burden.Non-model-based diagnostic methods (including machine learning, statistical analysis and information entropy) have been extensively applied to detect faults in LIBs.Naha et al. proposed a supervised machine learning approach to classify the faults with features extracted from charging curves [15].In addition, the entropy or correlation coefficient of terminal voltage changes significantly if a fault has occurred, which can be used for fault detection by comparing the terminal voltage in the fault with that in normal operation.Kang et al. designed the interleaved voltage topology and correlation coefficient method for multi-fault diagnosis (voltage sensor fault, connection fault and short-circuit fault) [16].Zhao et al. proposed a fault diagnosis method based on a machine learning algorithm and a 3σ multilevel screening strategy, and the fault occurrence frequencies of terminal voltage were calculated [17].Moreover, information entropy methods were developed, and the entropy was calculated in real time using sliding windows to detect faults in LIBs [18,19].Chen et al. identified the internal state parameters of a faulty battery using RLS and an EKF, and the parameters were subsequently analyzed using the local outlier factor to diagnose the fault in the battery [20].Li et al. utilized the interclass correlation coefficient method to extract fault information from the voltage by using a sliding window, and then they designed a loop that connects the first and last voltages to locate faults in battery packs [21].Ma et al. established a fault-free voltage model by a graph neural network and conducted fault diagnosis by comparing it with the measured voltage [22].These methods have been validated in the test set and performed well in fault diagnosis; however, the accuracy of these methods largely depends on large numbers of fault data and is susceptible to measurement noise.
In addition, because most of the existing methods focus on single type of fault diagnosis, the quantitative analysis of short-circuit fault is still relatively few; thus, in this paper, a multi-fault diagnosis method based on relative entropy and SOC estimation is proposed for a lithium-ion battery pack.The faults include voltage sensor fault, temperature sensor fault and short-circuit fault.First, the cell with measured median voltage is selected as the reference cell, and the relative entropies of the voltage and temperature of the other cells to those of the reference cell are calculated using a sliding window.Second, the relative entropies are evaluated by the cumulative sum (CUSUM) test to achieve fault diagnosis and isolation.Next, the equivalent circuit model parameters of the reference cell are identified using recursive least squares with the variable forgetting factor (VFFRLS), and the SOC of the short-circuit cell is estimated based on the EKF.Furthermore, the short-circuit resistance is estimated with RLS for the quantitative analysis of short-circuit fault.
The main contributions of this research lie in the following aspects: (1) The proposed fault diagnosis method can apply to various operating environments because it is not necessary for a battery model to be built to obtain fault residuals.Importantly, the computational burden and hardware redundancy are not increased to accomplish fault diagnosis and quantitative analysis with the proposed method.(2) The forgetting mechanism can effectively extract the fault information of the voltage and temperature of the battery pack, and CUSUM is used to evaluate the fault information, ensuring the efficiency and accuracy of fault diagnosis.Specifically, the voltage/temperature sensor fault was detected at 71 s/58 s after faults occurred under the US06 condition, and a short-circuit fault was diagnosed at 111 s after the fault with a 5 Ω short-circuit resistor under Federal Urban Driving Schedule (FUDS).The standard error deviation of short-circuit resistance estimation is less than 0.12 Ω/0.33 Ω for a 5 Ω/10 Ω short-circuit resistor under FUDS.(3) Not only can various faults (voltage/temperature sensor faults, short-circuit fault) be diagnosed in the proposed method, but quantitative analysis on short-circuit fault is performed by estimating the resistance of the short-circuit resistor to obtain the degree of short-circuit fault.The remainder of this paper is organized as follows.The multi-fault diagnosis method is introduced in Section 2. The results of the virtual battery pack with the sensor faults and short-circuit faults are analyzed and discussed in Section 3. Finally, the key conclusions are summarized in Section 4.

Methods
In this section, a relative entropy method is adopted to calculate the relative entropy of cells' voltage, temperature and SOC in real time, and the CUSUM is applied to achieve fault diagnosis and isolation.During the charge or discharge process, the difference in SOC between a normal cell and a fault cell can be expressed as ∆SOC, and the CUSUM of ∆SOC is represented as CU∆SOC during the charge or discharge period.When the CU∆SOC is positive in both the charging and discharging phases, it indicates that the SOC of the fault cell is consistently lower than that of the normal cell during this period.Therefore, it can be diagnosed that the fault may be an internal short-circuit fault.Conversely, if the CU∆SOC is negative during the charge phase, it can be implied that the fault cell is likely an aged cell.Furthermore, with the diagnosed short-circuit fault, we quantitatively estimate short-circuit resistance.The specific implementation process is shown in Figure 1.

Relative Entropy Calculation
Relative entropy has been widely used in information theory and data mining due to its excellent performance in measuring the difference between two probabilities' distributions [23].Assuming that P(x) and Q(x) are two probability distribution functions for a discrete random variable x, the relative entropy of P to Q is defined by where the sum of P(x) and Q(x) are both 1. Usually, P and Q come from frequency distributions, so the relative entropy is always positive.If P and Q are identical, the relative entropy is zero.The higher the relative entropy, the lower the similarity between two probability distributions.In this work, relative entropy is used to detect battery fault information.First, for a series-connected battery pack, the cell with the median voltage is selected and regarded as the reference cell.Next, the relative entropy of the measured values (voltage, temperature and SOC) of the other cells to the reference cell is calculated.Despite the inconsistency of cells in the battery pack, this difference is not obvious proof that the battery pack can operate normally.However, if a sensor fault occurs in a battery, the value measured by the sensor will definitely deviate from that of the reference cell.Similarly, if a short-circuit fault occurs in a battery, the measured temperature and voltage will also greatly differ from that of the reference cell.Therefore, to obtain the fault information in real time, it is necessary to calculate the relative entropy of temperature, voltage and SOC for each cell, adopting the sliding window at each sampling moment.Here, taking the voltage of a battery pack as an example, the raw voltage matrix of n cells within sampling time t1 to t2 is represented as Batteries 2024, 10, x FOR PEER REVIEW  Relative entropy has been widely used in information theory and data mining its excellent performance in measuring the difference between two probabilities' d tions [23].Assuming that () and () are two probability distribution functio The voltage of the reference cell during this period can be expressed as U m = u t1 m . . ., u t2 m T .
Next, in sampling time t1 to t2, a matrix C is defined as where k indicates the number of intervals from t1 to t2; normally, k is set to 10 [19,24].c k,n refers to the number of incidences of U n in u min + (x − 1) u max −u min k , u min + x u max −u min k during t1 to t2, u max and u min are the maximum and minimum values of U, respectively, and x = 1 • • • k.According to C, the frequency distribution matrix P is obtained thus: The relative entropy RD t2 of the measured temperature and voltage of all cells to that of the reference cell is defined as . Like the Shannon entropy, the sliding window length has a great influence on the entropy value: neither too small nor too large can reveal the battery fault information; here, the sliding window size is chosen as 100 [19,24].

Relative Entropy Evaluation
In order to accurately detect faults in a battery and reduce the risk of false/missed alarms, a CUSUM test is applied to evaluate the relative entropy of carrying fault information; the details can be referred to [11,25].Fault diagnosis can be regarded as a binary hypothesis problem; based on the likelihood ratio principle, the CUSUM of the log-likelihood ratio of the sequence can be formulated as where P H1 and P H0 are the probability density functions of the sequence under hypotheses H1 (fault) and H0 (no fault), respectively, and the probability distribution is viewed as Gaussian distribution.RE t i,m represents the relative entropy of the measured voltage or temperature of a cell i at sampling point t.If a fault occurs, the CUSUM of log-likelihood ratios S t i will increase continually to exceed the alarm threshold of voltage J U (J T for temperature).The alarm threshold has been calibrated by extensive experimental or simulation tests.

Quantitative Assessment Method of Short-Circuit Fault
To evaluate the severity of a battery's short-circuit fault, it involves estimating the shortcircuit current or resistance by quantitative analysis.If a short-circuit fault occurs, the real discharge electricity of a cell during discharge is larger than that calculated by coulombic counting, and the real charge electricity is smaller than that calculated by coulombic counting during charging.The battery model without/with short-circuit resistance is shown in Figure 2a/Figure 2b, respectively.In this work, firstly, the parameters (R 0 , R 1 , C 1 ) of the reference cell in the equivalent circuit model in Figure 2a are identified online by using VFFRLS.Secondly, with the identification parameters, the actual SOC of the shortcircuit cell in Figure 2b is estimated by the EKF method.Next, the short-circuit current in Figure 2b is obtained by comparing it with the coulombic counting method.Furthermore, the short-circuit resistance is calculated by using RLS.
is shown in Figure 2a/Figure 2b, respectively.In this work, firstly, the parameters (R0, R1, C1) of the reference cell in the equivalent circuit model in Figure 2a are identified online by using VFFRLS.Secondly, with the identification parameters, the actual SOC of the short-circuit cell in Figure 2b is estimated by the EKF method.Next, the short-circuit current in Figure 2b is obtained by comparing it with the coulombic counting method.Furthermore, the short-circuit resistance is calculated by using RLS.

Online Parameter Identification
Specifically, according to Kirchhoff's law, the dynamical electrical equations can be obtained, as shown in Equation ( 6). is internal resistance,   constitutes a network of mass transport effects and dynamic voltage performance, and  describes the diffusion voltage.Open-circuit voltage  is a nonlinear function of SOC, and  represents the battery terminal voltage;  is the load current, taken as positive for discharging and negative for charging.
By Laplace transformation and bilinear transformation (discretization), Equation ( 6) can be transformed to Equation (7), where a1, a2 and a3 are the identified parameters.With the identified parameters, the resistance and capacitance parameters in Figure 2a can be obtained as follows.
The process of parameter identification by the recursive least squares method with the forgetting factor is below.First, the parameter vectors  and the error covariance matrices  in Equation ( 8) are initialized, and then the parameters (a1, a2 and a3) are adjusted adaptively according to the samples of load current and measured voltage.
Because forgetting factor  has great influence on recognition results, a VFFRLS algorithm is adopted to select the appropriate forgetting factor.This algorithm adaptively searches the optimal forgetting factor according to the estimation error.The forgetting factor is updated as follows.

Online Parameter Identification
Specifically, according to Kirchhoff's law, the dynamical electrical equations can be obtained, as shown in Equation ( 6).R 0 is internal resistance, R 1 C 1 constitutes a network of mass transport effects and dynamic voltage performance, and U 1 describes the diffusion voltage.Open-circuit voltage U oc is a nonlinear function of SOC, and U t represents the battery terminal voltage; I L is the load current, taken as positive for discharging and negative for charging. .
By Laplace transformation and bilinear transformation (discretization), Equation ( 6) can be transformed to Equation (7), where a1, a2 and a3 are the identified parameters.With the identified parameters, the resistance and capacitance parameters in Figure 2a can be obtained as follows.
The process of parameter identification by the recursive least squares method with the forgetting factor is below.First, the parameter vectors θ 0 and the error covariance matrices P 0 in Equation ( 8) are initialized, and then the parameters (a1, a2 and a3) are adjusted adaptively according to the samples of load current and measured voltage.
Because forgetting factor λ has great influence on recognition results, a VFFRLS algorithm is adopted to select the appropriate forgetting factor.This algorithm adaptively searches the optimal forgetting factor according to the estimation error.The forgetting factor is updated as follows.

Short-Circuit Resistance Estimation Based on SOC
With the identified parameters of the reference cell, the SOC of the short-circuit cell can be estimated by employing a closed-loop method of EKF.By discretizing Equation ( 6) and combining it with the coulombic counting method, the state space equations of the battery model can be expressed as follows where k is the time index and ∆t is the time interval.η is coulomb efficiency, assumed to be 1, and C n is battery available capacity.
x k and y k are the state vector and output vector at time k, respectively.f and g denote the state function and measurement function, respectively.w k−1 and v k are the independent zero-mean Gaussian process noise and measurement noise with the covariance Q k and R k .The computational procedure of the EKF algorithm is comprehensively described in [26].Our paper focuses on the diagnosis of internal short-circuit faults in lithium-ion batteries, so we will not elaborate further on SOC estimation here.
With the actual SOC estimation of the short-circuit battery, we can obtain the relationship between the short-circuit current and the load current, as shown in Equation (11).In consequence, the short-circuit resistance can be estimated online according to the recursive least square method.

Data Preparation and Preprocessing
The multi-fault diagnosis of a lithium-ion battery pack was accomplished based on relative entropy and SOC estimation, including battery short-circuit fault, voltage sensor fault and temperature sensor fault.For the sensor fault tests, because sensors can experience several faults, such as deviation from the measured values to the real value, drifting, the loss of accuracy and the freezing of measured values due to inherent aging and external shocks or vibrations, an equivalent sensor fault experiment was carried out by injecting a strong noise into raw measurement data [11,16,27].For the short-circuit fault test, destructive short-circuit experiments, such as nail penetration and squeezing, are too difficult to control, so that they are not suitable for studying the early stage of battery short-circuit.The method of cells connecting in parallel with a resistance has become an alternative experiment for battery fault tests [8,11,12,28,29].For simulating the short-circuit fault, a resistor with a value of 5 Ω/10 Ω was selected to be connected in parallel with cell 5, representing different levels of short-circuit conditions.A dynamic thermally coupled battery model was constructed using the simulation software AutoLion-ST V2020 with a given initial value and current excitation, as shown in Figure 3 Three complete charge and discharge cycles of a battery under Federal Urban Driving Schedule (FUDS) are illustrated in Figure 3a, and the schematic diagram of different fault tests can be shown in Figure 3b.Because an electrochemical model can accurately describe battery electrochemical reactions and performance, an electrochemical-thermal coupling model in the AutoLion-ST software was developed by coupling a pseudo-two-dimensional model and the lumped mass thermal model.The parameters of the coupling model in this work were taken from [30][31][32][33]; the effectiveness of this model has been validated in [33].The specification of the cylindrical cell was designed and is shown in Table 1.mass thermal model.The parameters of the coupling model in this work were taken from [30][31][32][33]; the effectiveness of this model has been validated in [33].The specification of the cylindrical cell was designed and is shown in Table 1.Moreover, the SOC-OCV curve of the battery was conducted to estimate SOC, as depicted in Figure 4 A fully-charged battery was discharged to a cut-off voltage of 2.5 V, followed by charging to a cut-off voltage of 4.2 V, and the whole charge-discharge process was under a low current of 0.05 C. Two working conditions of FUDS and US06 driving schedules (US06) were considered; the specific operating condition components can be  Moreover, the SOC-OCV curve of the battery was conducted to estimate SOC, as depicted in Figure 4 A fully-charged battery was discharged to a cut-off voltage of 2.5 V, followed by charging to a cut-off voltage of 4.2 V, and the whole charge-discharge process was under a low current of 0.05 C. Two working conditions of FUDS and US06 driving schedules (US06) were considered; the specific operating condition components can be referred to in [34].To reflect the variation of cells, different initial SOCs of the cells were selected, and the maximum inconsistency was 3%.In addition, the testing ambient temperature and the initial temperature of the battery were under 25 • C. The injected Gaussian white noise with standard deviation was 3 mV for the voltage sensor and 0.03 • C for the temperature sensor to acquire original data.A battery pack consisting of a single module with eight cells in series was built to verify the effectiveness of the proposed fault diagnosis method.Short-circuit fault and sensor fault tests were conducted under different operating conditions, where the fault parameters were selected mainly based on the experimental values in [8,11,14,35]; and the specific fault parameters for testing are listed in Table 2.
referred to in [34].To reflect the variation of cells, different initial SOCs of the cells were selected, and the maximum inconsistency was 3%.In addition, the testing ambient temperature and the initial temperature of the battery were under 25℃.The injected Gaussian white noise with standard deviation was 3 mV for the voltage sensor and 0.03℃ for the temperature sensor to acquire original data.A battery pack consisting of a single module with eight cells in series was built to verify the effectiveness of the proposed fault diagnosis method.Short-circuit fault and sensor fault tests were conducted under different operating conditions, where the fault parameters were selected mainly based on the experimental values in [8,11,14,35]; and the specific fault parameters for testing are listed in Table 2.

The Relative Entropy Calculation at Normal Condition
The relative entropy of voltage and temperature is used as fault information, and the CUSUM method is adopted for fault evaluation in the proposed multi-fault diagnosis method.Therefore, to obtain the statistical distribution of two hypotheses and an alarm threshold, a large number of tests were performed under both normal and fault conditions to compromise between missed alarms and false alarms.For the consideration of calculation efficiency and detection sensitivity, the sliding window was chosen as 100, and  and  were set to 150, which is more than two times their maximum CUSUM under normal conditions.To test the diagnosis effect of the proposed method on normal batteries, a normal experiment was conducted under FUDS conditions.The voltage/temperature of each cell and the relative entropies of the voltage/temperature of each cell to the reference cell in the battery pack are shown in Figure S1 and Figure S2, respectively.Here, the inconsistency in SOC in the battery pack was set to 3%, which is within a reasonable range to ensure the normal operation of the battery pack.In Figure S1, one can observe that the voltages of individual cells have certain differences during the discharge period, but the calculated voltage relative entropy fluctuates within a certain range.In addition, because of the inconsistency in all cells, the temperature difference reached a maximum at the end of discharge in Figure S2.Moreover, the relative entropies of the voltage and temperature of each cell do not deviate significantly; as can be seen from the zoomed-in figure, even the inconsistency and measurement noise were added to the battery pack.

The Relative Entropy Calculation at Normal Condition
The relative entropy of voltage and temperature is used as fault information, and the CUSUM method is adopted for fault evaluation in the proposed multi-fault diagnosis method.Therefore, to obtain the statistical distribution of two hypotheses and an alarm threshold, a large number of tests were performed under both normal and fault conditions to compromise between missed alarms and false alarms.For the consideration of calculation efficiency and detection sensitivity, the sliding window was chosen as 100, and J U and J T were set to 150, which is more than two times their maximum CUSUM under normal conditions.To test the diagnosis effect of the proposed method on normal batteries, a normal experiment was conducted under FUDS conditions.The voltage/temperature of each cell and the relative entropies of the voltage/temperature of each cell to the reference cell in the battery pack are shown in Figure S1 and Figure S2, respectively.Here, the inconsistency in SOC in the battery pack was set to 3%, which is within a reasonable range to ensure the normal operation of the battery pack.In Figure S1, one can observe that the voltages of individual cells have certain differences during the discharge period, but the calculated voltage relative entropy fluctuates within a certain range.In addition, because of the inconsistency in all cells, the temperature difference reached a maximum at the end of discharge in Figure S2.Moreover, the relative entropies of the voltage and temperature of each cell do not deviate significantly; as can be seen from the zoomed-in figure, even the inconsistency and measurement noise were added to the battery pack.

The Results of Sensor Faults Diagnosis
When only one sensor (either voltage or temperature) shows abnormal data, we attribute it to sensor malfunction.It is only when both sensors on the same battery simultaneously exhibit abnormal readings that we attribute the fault to the battery itself.Therefore, upon detecting sensor faults, we do not attempt to correct the data from these faulty sensors.Instead, we discard the data generated by these faulty sensors and issue a warning.The sensor fault test was carried out by injecting a large amount noise into the original measurement data.The results of the voltage sensor fault on cell 1 and the temperature sensor fault on cell 6 under FUDS are depicted in Figure 5a,c, respectively.Because most of the sensor faults were drifting, here noise was used to simulate the fault.The noise was injected at time of 5000 s.From Figure 5a,c, one can observe that before the sensor fault happened, the measured voltage of cell 1 and the measured temperature of cell 6 fluctuated within a small noise range, just like the other cells.With noise disturbances of 0.02 V for voltage and 0.2 • C for temperature to the measurement signal, the sensor accuracy was apparently distorted, but it is difficult to detect this using the conventional threshold methods.Furthermore, the relative entropies of the voltage and temperature of each cell to the reference cell were calculated in sensor fault conditions, as shown in Figure 5b,d.From Figure 5b, it can be seen that the voltage relative entropy of cell 1 gradually deviates from that of the other cells after 5000 s.Similarly, in Figure 5d, the temperature relative entropy of cell 6 follows the same trend as the voltage relative entropy in Figure 5b.By comparing the relative entropy of cells at normal and fault, it can be indicated that the relative entropies of normal cells are not always kept at the minimum but fluctuate in a small range.Furthermore, the CUSUM method was applied to reduce the impact on sensor fault diagnosis; the result is shown in Figure 6.One can see that when the CUSUM of relative entropy exceeded a preset threshold, the voltage sensor fault was detected at 93 s after voltage disturbance was added to cell 1 and the temperature sensor fault was detected at 65 s after temperature disturbance was added to cell 6, which indicates that the sensor fault can be detected and located in a short time using our proposed method.

The Results of Sensor Faults Diagnosis
When only one sensor (either voltage or temperature) shows abnormal data, we attribute it to sensor malfunction.It is only when both sensors on the same battery simultaneously exhibit abnormal readings that we attribute the fault to the battery itself.Therefore, upon detecting sensor faults, we do not attempt to correct the data from these faulty sensors.Instead, we discard the data generated by these faulty sensors and issue a warning.The sensor fault test was carried out by injecting a large amount noise into the original measurement data.The results of the voltage sensor fault on cell 1 and the temperature sensor fault on cell 6 under FUDS are depicted in Figure 5a and Figure 5c, respectively.Because most of the sensor faults were drifting, here noise was used to simulate the fault.The noise was injected at time of 5000 s.From Figure 5a,c, one can observe that before the sensor fault happened, the measured voltage of cell 1 and the measured temperature of cell 6 fluctuated within a small noise range, just like the other cells.With noise disturbances of 0.02 V for voltage and 0.2 °C for temperature to the measurement signal, the sensor accuracy was apparently distorted, but it is difficult to detect this using the conventional threshold methods.Furthermore, the relative entropies of the voltage and temperature of each cell to the reference cell were calculated in sensor fault conditions, as shown in Figure 5b,d.From Figure 5b, it can be seen that the voltage relative entropy of cell 1 gradually deviates from that of the other cells after 5000 s.Similarly, in Figure 5d, the temperature relative entropy of cell 6 follows the same trend as the voltage relative entropy in Figure 5b.By comparing the relative entropy of cells at normal and fault, it can be indicated that the relative entropies of normal cells are not always kept at the minimum but fluctuate in a small range.Furthermore, the CUSUM method was applied to reduce the impact on sensor fault diagnosis; the result is shown in Figure 6.One can see that when the CUSUM of relative entropy exceeded a preset threshold, the voltage sensor fault was detected at 93 s after voltage disturbance was added to cell 1 and the temperature sensor fault was detected at 65 s after temperature disturbance was added to cell 6, which indicates that the sensor fault can be detected and located in a short time using our proposed method.

The Results of Short-Circuit Fault Diagnosis
A short-circuit fault test was conducted by connecting a resistor (5 Ω) in parallel with cell 5 under FUDS conditions; the results have been shown in Figure S3.It can be seen that the voltage of cell 5 is significantly lower and the temperature of cell 5 higher than those of the other cells after the short-circuit fault occurred.Because of the short-circuit fault on cell 5, the power consumption of the short-circuit battery cell was accelerated and reached the cut-off voltage quickly, which increased the gap between the short-circuit battery cell and the other cells to the maximum at the end of discharge.To avoid over-discharging in the short-circuit test, the battery pack was discharged until at 10-20% of its SOC.With a resistor (5 Ω) connecting it in parallel with cell 5, the voltage profile and temperature profile of all cells is depicted in Figure S3a and Figure S3c, respectively.To detect the short-circuit fault of the battery pack, the relative entropy of the voltage and temperature of all cells was calculated as fault information, as shown in Figure S3b,d.One can see that the relative entropies of the voltage and temperature of cell 5 deviated from the other cells after the short-circuit fault occurred.Furthermore, the CUSUM of the voltage and temperature relative entropy of all cells was calculated, as shown in Figure S4a,b, respectively.One can see that the CUSUM of the voltage and temperature relative entropy of cell 5 exceeded the predetermined thresholds at 4109 s and 4111 s, respectively, which indicates that a short-circuit fault of cell 5 was detected, but the severity of the short-circuit is not available.

The Results of Short-Circuit Fault Diagnosis
A short-circuit fault test was conducted by connecting a resistor (5 Ω) in parallel with cell 5 under FUDS conditions; the results have been shown in Figure S3.It can be seen that the voltage of cell 5 is significantly lower and the temperature of cell 5 higher than those of the other cells after the short-circuit fault occurred.Because of the short-circuit fault on cell 5, the power consumption of the short-circuit battery cell was accelerated and reached the cut-off voltage quickly, which increased the gap between the short-circuit battery cell and the other cells to the maximum at the end of discharge.To avoid over-discharging in the short-circuit test, the battery pack was discharged until at 10-20% of its SOC.With a resistor (5 Ω) connecting it in parallel with cell 5, the voltage profile and temperature profile of all cells is depicted in Figure S3a and Figure S3c, respectively.To detect the shortcircuit fault of the battery pack, the relative entropy of the voltage and temperature of all cells was calculated as fault information, as shown in Figure S3b,d.One can see that the relative entropies of the voltage and temperature of cell 5 deviated from the other cells after the short-circuit fault occurred.Furthermore, the CUSUM of the voltage and temperature relative entropy of all cells was calculated, as shown in Figure S4a and Figure S4b, respectively.One can see that the CUSUM of the voltage and temperature relative entropy of cell 5 exceeded the predetermined thresholds at 4109 s and 4111 s, respectively, which indicates that a short-circuit fault of cell 5 was detected, but the severity of the short-circuit is not available.
To analyze the severity of the short-circuit of the battery pack, a quantitative evaluation was performed using SOC estimation.The SOC estimation of the reference cell (cell 4) was obtained by the EKF based on online identification parameters, as shown in Figure S5a.Here, the SOC-AH curve represents the SOC estimated using the coulombic counting method.After a resistor of 5 Ω connected with cell 5 in parallel at the time of 4000 s (when To analyze the severity of the short-circuit of the battery pack, a quantitative evaluation was performed using SOC estimation.The SOC estimation of the reference cell (cell 4) was obtained by the EKF based on online identification parameters, as shown in Figure S5a.Here, the SOC-AH curve represents the SOC estimated using the coulombic counting method.After a resistor of 5 Ω connected with cell 5 in parallel at the time of 4000 s (when the short-circuit fault occurred), one can see that the SOC of cell 5 dropped rapidly and deviated from that of the reference cell due to the continuous power consumption by the short-circuit resistor.During this period, the SOC of a normal cell remains higher than that of a short-circuited cell, implying that the ∆SOC is consistently positive throughout the discharge phase.Moreover, the resistance of the short-circuit fault was estimated using RLS, as shown in Figure S5b.It can be observed that the mean absolute error of resistance estimation is 0.17 Ω and the standard deviation is 0.12 Ω, indicating an accurate estimation of short-circuit resistance with a small margin of error.
Furthermore, to compare short-circuit faults of different degrees, the voltage relative entropy and temperature relative entropy of all cells were obtained in the short-circuit fault of connecting a 10 Ω resistor with cell 5 in parallel, as shown in Figure S6a,b, respectively.It can be clearly observed that the deviation of the voltage relative entropy and temperature relative entropy of cell 5 was small because of the slight short-circuit fault.Usually, the smaller the resistance of connecting with cell 5 in parallel, the more serious the short-circuit fault.In addition, the CUSUM of the voltage relative entropy and temperature relative entropy of all cells were calculated to detect the short-circuit fault; the results are depicted in Figure S7.It can be seen that the CUSUM of the voltage relative entropy of cell 5 exceeded the predetermined thresholds at 4101 s and that the CUSUM of temperature relative entropy exceeded at 4200 s.By comparing the results with the short-circuit fault of connecting a 5 Ω resistor with cell 5 in parallel, one can notice that it takes longer to detect a short-circuit fault in connecting a 10 Ω resistor.Moreover, the SOC estimations of the short-circuit cell and the reference cell are shown in Figure S8a.Similarly, one can observe that the SOC of cell 5 drops rapidly and deviates from that of the reference cell at 4000 s (when the short-circuit fault occurred), which indicates that the more severe the short-circuit is, the faster the SOC drops.The resistance of the short-circuit fault was estimated and is shown in Figure S8b.The mean absolute error of resistance estimation is 0.42 Ω, and standard deviation is 0.33 Ω, indicating the effectiveness of the proposed quantitative evaluation method.

Method Comparison under US06 Condition
To further verify the performance of the proposed method, a variety of fault tests, including a short-circuit, a voltage sensor fault with 0.05 V noise and a temperature sensor fault with 0.5 • C noise, were performed to compare the proposed method with a correlation coefficient-based [36] method under US06 conditions.To simplify analysis, here, the shortcircuit fault created by connecting a 10 Ω resistor with cell 5 in parallel was selected as an example.Firstly, the load current of the battery pack and the voltage and temperature of all cells are shown in Figure S9a-c, respectively.It can be seen that the voltage and temperature change in cell 5 significantly deviated from those of the other cells because of the shortcircuit fault.Secondly, the fault diagnosis result based on the correlation coefficient method is displayed in Figure 7 In the correlation coefficient method, the sliding window was set to 200 for the consideration of diagnosis accuracy and computational efficiency.In Figure 7, the dashed lines parallel to the time axis are the appropriate fault thresholds derived from the tests, with minimum voltage and temperature thresholds of 0.94 and 0.6, respectively.The correlation coefficients of the voltage and temperature of each cell to the other cells were calculated in real time by a sliding window, and the maximum correlation coefficient of voltage/temperature was taken as the final diagnostic signal.From Figure 7a, it can be seen that the correlation coefficients of voltage can apparently detect the fault even with the inconsistency and noise interference.Meanwhile, the correlation coefficient of temperature fluctuates considerably in the presence of noise and inconsistency in Figure 7b.The voltage exceeded the threshold at 74 s after the fault occurred, while the temperature exceeded the threshold at 298 s.According to the abnormal correlation coefficient of voltage and temperature, it can be inferred that the short-circuit fault occurred at 298 s because a short-circuit fault influences changes in voltage and temperature.As a contrast, a shortcircuit fault diagnosis based on the proposed method under US06 conditions is displayed in Figure S10.According to the CUSUM result of voltage relative entropy and temperature relative entropy in Figure S10c,d, a short-circuit fault can be diagnosed at 121 s after the fault has occurred; the time it takes to detect a short-circuit fault using our method is faster than that of the correlation coefficient method (298 s).Finally, the quantitative analysis of the short-circuit fault using SOC estimation is shown in Figure S11.The resistance of the short-circuit fault is estimated with 0.37 Ω standard deviation and 0.51 Ω mean absolute error.
In addition, a fault diagnosis test based on the Shannon entropy method was conducted, taking the voltage of a 5 Ω short-circuit resistor as an example.As shown in Figure S12a, it is difficult to directly diagnose the fault based only on Shannon entropy.Meanwhile, the correlation coefficient of Shannon entropy (Shannon RE-CC) [28] was calculated, as shown in Figure S12b.It can be seen that the Shannon RE-CC method is very sensitive to noise interference and inconsistency effects, leading to misdiagnosis or missed diagnosis by the set threshold.Therefore, the Shannon entropy-based method was not selected for comparison.correlation coefficient method (298 s).Finally, the quantitative analysis of the short-circuit fault using SOC estimation is shown in Figure S11.The resistance of the short-circuit fault is estimated with 0.37 Ω standard deviation and 0.51 Ω mean absolute error.In addition, a fault diagnosis test based on the Shannon entropy method was conducted, taking the voltage of a 5 Ω short-circuit resistor as an example.As shown in Figure S12a, it is difficult to directly diagnose the fault based only on Shannon entropy.Meanwhile, the correlation coefficient of Shannon entropy (Shannon RE-CC) [28] was calculated, as shown in Figure S12b.It can be seen that the Shannon RE-CC method is very sensitive to noise interference and inconsistency effects, leading to misdiagnosis or missed diagnosis by the set threshold.Therefore, the Shannon entropy-based method was not selected for comparison.
Moreover, different short-circuit fault degrees were tested; the fault diagnosis results are listed in Tables 3 and 4. By comparing the short-circuit fault diagnosis of our proposed method with the correlation coefficient method in different operating conditions, it can be found the proposed method performs better in short-circuit fault diagnosis for battery packs.To be specific, the proposed method had a significant advantage of diagnosis time; the smaller the short-circuit fault degree, the shorter the diagnosis time of the proposed method than that of the coefficient correlation-based diagnosis method.Moreover, different short-circuit fault degrees were tested; the fault diagnosis results are listed in Tables 3 and 4. By comparing the short-circuit fault diagnosis of our proposed method with the correlation coefficient method in different operating conditions, it can be found the proposed method performs better in short-circuit fault diagnosis for battery packs.To be specific, the proposed method had a significant advantage of diagnosis time; the smaller the short-circuit fault degree, the shorter the diagnosis time of the proposed method than that of the coefficient correlation-based diagnosis method.

Conclusions
In conclusion, a fault diagnosis method based on relative entropy and SOC estimation is proposed to ensure the normal operation of a battery system, including the diagnosis of voltage sensor faults, temperature sensor faults and short-circuit faults.First, by selecting a cell with the median voltage in the battery pack to the reference cell, the relative entropies of the voltage and temperature of each cell to that of the reference one are calculated in real time according to the sliding window.Second, the relative entropy of voltage and temperature serve as fault information, and the CUSUM method is adopted for fault evaluation, indicating that the fault information of the battery is amplified by the relative entropy and the risk of false/missed alarms is reduced by the CUSUM method.In addition, VFFRLS is used for the parameter identification of the reference cell.Furthermore, by calculating the ∆SOC between a normal cell and a fault cell during both the charge and discharge phases, it becomes possible to distinguish an internal short-circuit cell from a low-capacity cell.This method can distinguish between micro-short circuit batteries and low-capacity fault batteries (or aged batteries), but it does not determine whether battery aging is due to irreversible side reactions at the positive electrode (such as oxygen release).In fact, our method cannot specifically diagnose the occurrence of irreversible side reactions (such as oxygen release) at the positive electrode of lithium batteries.Thereafter, the SOC of the short-circuit cell is estimated by an EKF, and the short-circuit resistance is estimated according to the proposed method.The effectiveness of the proposed method has been validated and compared with the correlation coefficient method under different fault types and fault degrees in FUDS and US06 with the interference of noise and inconsistency, and the results show that the proposed method performs well in both different conditions and different fault types.Meanwhile, the robustness of the proposed method was verified under the interference of noise and inconsistency in two working conditions of FUDS and US06.It should be noted that not only is the qualitative diagnosis of a short-circuit fault performed in the proposed method, but the resistance of a short-circuit resistor is estimated for an analysis of the degree of short-circuit faults.Importantly, computational effort and hardware redundancy do not increase with the proposed method, including relative entropy calculation, parameter identification and SOC estimation.In the future, the lower diagnosis time, estimation accuracy and verification of the proposed method in real-world vehicles will be the focus of our research.

Figure 1 .
Figure 1.The schematic process of the multi-fault diagnosis method.

Figure 1 .
Figure 1.The schematic process of the multi-fault diagnosis method.

Figure 2 .
Figure 2. Structure of the equivalent circuit model: (a) first-order model; (b) first-order model with ISC.

Figure 2 .
Figure 2. Structure of the equivalent circuit model: (a) first-order model; (b) first-order model with ISC.

Figure 3 .Table 1 .
Figure 3. Schematic diagram of the virtual battery pack experiment: (a) Schematic diagram of the battery pack charging and discharging test; (b) schematic diagram of the different fault tests.Table 1. Design parameters of cylindrical cell.Parameters (Units) Value Positive electrode material NCM111 Negative electrode material GRAPHITE Diameter × Height (mm) 18.6 × 65.2 Cell weight (g) 43.39 Nominal capacity (Ah) 2.15@1 C Nominal voltage (V) 3.65 Discharge cutoff voltage (V) 2.5 Charge cutoff voltage (V) 4.2

Figure 3 .
Figure 3. Schematic diagram of the virtual battery pack experiment: (a) Schematic diagram of the battery pack charging and discharging test; (b) schematic diagram of the different fault tests.

Figure 5 .
Figure 5. Results of sensor faults under FUDS conditions: (a) Voltage curve of each cell; (b) Voltage relative entropy curve of each cell to that of the reference cell; (c) Temperature curve of each cell; (d) Temperature relative entropy curve of each cell to that of the reference cell.

Figure 5 .
Figure 5. Results of sensor faults under FUDS conditions: (a) Voltage curve of each cell; (b) Voltage relative entropy curve of each cell to that of the reference cell; (c) Temperature curve of each cell; (d) Temperature relative entropy curve of each cell to that of the reference cell.

Batteries 2024 , 16 Figure 6 .
Figure 6.Results of sensor fault diagnosis under FUDS conditions: (a) CUSUM of voltage relative entropy; (b) CUSUM of temperature relative entropy.

Figure 6 .
Figure 6.Results of sensor fault diagnosis under FUDS conditions: (a) CUSUM of voltage relative entropy; (b) CUSUM of temperature relative entropy.

Figure 7 .
Figure 7. Short-circuit fault diagnosis results based on the correlation coefficient method: (a) Maximum correlation coefficient of the voltage of each cell to that of other cells; (b) Maximum correlation coefficient of the temperature of each cell to that of the other cells.

Figure 7 .
Figure 7. Short-circuit fault diagnosis results based on the correlation coefficient method: (a) Maximum correlation coefficient of the voltage of each cell to that of other cells; (b) Maximum correlation coefficient of the temperature of each cell to that of the other cells.
. Voltage and voltage relative entropies of all cells under FUDS conditions: (a) Voltage curve of each cell; (b) Voltage relative entropy curve of each cell to that of reference cell.Figure S2.Temperature and temperature relative entropies of all cells in the battery pack under FUDS conditions: (a) Temperature curve of each cell; (b) Temperature relative entropy curve of each cell to that of reference cell.Figure S3.Short-circuit fault by connecting a resistor (5 Ω) in parallel with cell 5 under FUDS condition: (a) Voltage curve of each cell; (b) Voltage relative entropy curve of each cell to that of reference cell; (c) Temperature curve of each cell; (d) Temperature relative entropy curve of each cell to that of reference cell.
Figure S4.Short-circuit fault diagnosis by connecting a resistor (5 Ω) in parallel with cell 5 under FUDS condition: (a) CUSUM result of voltage relative entropy; (b) CUSUM result of temperature relative entropy.Figure S5.Results of quantitative analysis of short-circuit fault by connecting a 5 Ω resistor with cell 5 in parallel: (a) Comparison of SOC estimation of short-circuit cell and reference cell by using the proposed method with that of coulombic counting method; (b) Result of short-circuit resistance estimation.Figure S6.Relative entropy of all cells in short-circuit fault by connecting a 10 Ω resistor with cell 5 in parallel: (a) Voltage relative entropy curve of each cell to that of reference cell; (b) Temperature relative entropy curve of each cell to that of reference cell.Figure S7.Short-circuit fault diagnosis by connecting a 10 Ω resistor with cell 5 in parallel under FUDS condition: (a) CUSUM result of voltage relative entropy; (b) CUSUM result of temperature relative entropy.Figure S8.Results of quantitative analysis of short-circuit fault by connecting a 10 Ω resistor with cell 5 in parallel: (a) Comparison of SOC estimation of short-circuit cell and reference cell by using the proposed method with that of coulombic counting method; (b) Result of short-circuit resistance estimation.Figure S9.The test of short-circuit fault with a 10 Ω resistor connecting with cell 5 in parallel: (a) Load current curve of battery pack; (b) Voltage curves of all cells; (c) Temperature curves of all cells.Figure S10.Short-circuit fault diagnosis based on the proposed method under US06 operating condition: (a) voltage relative entropy curves; (b) CUSUM result of voltage relative entropy; (c) Temperature relative entropy curves; (d) CUSUM result of temperature relative entropy.Figure S11.Results of quantitative analysis of short-circuit fault under US06: (a) Comparison of SOC estimation of short-circuit cell and reference cell by using the proposed method with that of coulombic counting method; (b) Result of short-circuit resistance estimation.Figure S12.The results of Shannon entropy and correlation coefficient of battery pack voltage under short-circuit conditions in the US06 driving cycle are presented: (a) Battery pack voltage Shannon entropy; (b) The median correlation coefficient of Shannon entropy.

Table 1 .
Design parameters of cylindrical cell.

Table 2 .
Specific setting parameters for battery fault tests.

Table 2 .
Specific setting parameters for battery fault tests.

Table 3 .
[36]arison of the results of the methods proposed under FUDS and US06 with the method based on correlation coefficients[36].

Table 3 .
[36]arison of the results of the methods proposed under FUDS and US06 with the method based on correlation coefficients[36].

Table 4 .
Resistance estimation for different short-circuit degree under different operating conditions..