A Comprehensive Signal-Based Fault Diagnosis Method for Lithium-Ion Batteries in Electric Vehicles

: To enhance the operational reliability and safety of electric vehicles (EVs), big data plat-forms for EV supervision are rapidly developing, which makes a large quantity of battery data available for fault diagnosis. Since fault types related to lithium-ion batteries play a dominant role, a comprehensive fault diagnosis method is proposed in this paper, in pursuit of an accurate early fault diagnosis method based on voltage signals from battery cells. The proposed method for battery fault diagnosis mainly includes three parts: variational mode decomposition in the signal analysis part to separate the inconsistency of cell states, critical representative signal feature extraction by using a generalized dimensionless indicator construction formula and effective anomaly detection by sparsity-based clustering. The signal features of the majority of signal-based battery fault detection studies are found to be particular cases with a specific set of parameter values of the proposed indicator construction formula. With the sensitivity and stability balanced by appropriate moving-window size selection, the proposed signal-based method is validated to be capable of earlier anomaly detection, false-alarm reduction, and anomalous performance identification, compared with traditional approaches, based on actual pre-fault operating data from three different situations.


Introduction
Lithium-ion batteries are usually used in the power supply and storage of electric vehicles (EVs) because of their superior performance [1]. With a number of favorable policies issued and implemented in recent years, the demand for reliability and safety of lithium-ion batteries is steadily on the increase. As a consequence, potential battery abnormality and faults require accurate detection, isolation, and identification as early as possible, so that fault-tolerant operation for minimizing the battery performance degradation can be implemented in time to avoid dangerous situations and battery failure [2,3]. Generally, battery fault diagnosis methods include the following categories: model-based, data-driven, knowledge-based, and signal-based methods [4], and fault types of EV battery system consist of battery faults [5][6][7] (such as overcharging, overdischarging, overheating, loose connections, insulation failure, state inconsistency, external/internal short circuits, electrolyte leakage, swelling, accelerated degradation and thermal runaway), sensor faults, and actuator faults [8].
As for the model-based methods for lithium-ion batteries, by utilizing filters and observers (such as Kalman filter-based methods [7][8][9][10], particle filter [11], Lunberger ob-server [12], and adaptive observers [13]), the equivalent circuit model [14], the electrochemical model [6], and other models [15] at battery cell level are applied to trace and update parameters related to battery states, with excellent real-time monitoring performance [16]. However, the parameters are still challenging to be accurately estimated by signals collected in actual EV operation within a short period. Besides, a large quantity of studies on fractional order models (FOMs) along with their derivatives [17,18], and models on battery pack level [19][20][21] have been dedicated to enhance the accuracy of depicting complicated electrochemical behaviors, material properties, constructions, and the nonlinear fading mechanism of lithium-ion batteries [22]. But on account of almost open-loop estimation process, insufficient adaptivity makes it dissatisfactory to employ model-based methods in most practical cases. Moreover, the computational cost required is often not affordable by current BMS hardware [23] and potential failure in convergence under several working conditions [24]. Even under relatively ideal working conditions, the collected signals in a battery pack could still be influenced by the differences in the state of charge (SOC), state of health (SOH), internal impedance, and other external environmental factors, so it is necessary to make an effort to effectively distinguish various actual faults from severe inconsistency [19]. A further reason that makes it unfeasible to adopt model-based methods for time-efficient battery fault diagnosis in practice is that, their ideal boundary conditions are hardly satisfied in actual operation of EVs.
As for the non-model-based methods for battery fault diagnosis, on-board sensors are utilized to monitor various physical signals such as thermal, electrical, mechanical, etc., and the records are uploaded to the data platform after real-time processing by the battery management system (BMS). As one of the most fundamental signals to analyze, time series of cell voltage values have been typically adopted to realize efficient battery fault diagnosis in extensive studies, based on voltage and voltage difference thresholds [25], machine learning techniques [26], local outliers, the correlation [27], the information entropy [28], and their derivatives [29]. Without the requisite efforts in battery modeling, these methods are available for online applications and offline data analysis, but the performance is inevitably influenced by inconsistency of cell states and the quantity/quality of data records as well as measurement noise. The raw data containing the information to determine diversified states of the battery cells, are usually accompanied by precursory fault signals, noises brought by measuring facilities, and data loss during transmission and storage.
In brief: (1) High computing capability for precise mechanism modeling and signal filtering is required, which is difficult for practical implementation of model-based fault diagnosis methods by current BMS hardware; (2) Besides, corresponding information contained in data records related to some subtle precursory abnormal signals, and the impact of inconsistency on the fault detection should be separate away to enhance the robustness of the diagnosis method; (3) Moreover, the adaptivity of the fault threshold should be considered to make the method reliable to different extents of abnormal performance observed in the sensor signals; (4) Finally, considering the large environmental noise, inefficiently placed temperature sensors in reality, and the hysteresis characteristics of temperature signal records, electric signals are technically preferable in practical application, but traditional battery fault diagnosis methods based on voltage signals are hardly available to identify the detailed abnormal performance; Considering all the challenging factors, a signal-based battery fault diagnosis method based on the voltage signals is proposed in the article. The overall flowchart is shown in Figure 1. The paper contains the following solutions in detail: (1) Considering the excitations composed of various frequency components, some electrochemical behaviors can be prominently analyzed to illustrate the dynamic outputs of lithium-ion batteries mainly in the high frequency range, for there are small time constants of the response related to polarization, whereas the states of battery cells can be estimated by measuring in the low frequency range, as well as the diffusion process. Variational mode decomposition (VMD) algorithm is utilized to find out the components of voltage signals that are strongly associated with either the local response to external excitations, or long-term variation tendencies of battery states. In this way, corresponding functions will be realized on signal components in different frequency ranges. To our best knowledge, using VMD in signal analysis can attain some satisfying performance [30], compared with using other signal process methods such as empirical mode decomposition [23] that suffers from end effect and mode mixing, and wavelet decomposition [31] of which the wavelet basis becomes hard to determine, due to coupled performances and occurrence of different types of faults; (2) Influenced by multiple external factors, it is hard to make robust judgments on abnormal signals by directly employing methods on raw signal data. Therefore, a generalized dimensionless indicator (GDI) extraction formula with moving-window observation is proposed to reduce the impact of the quality and the quantity of training data, and to effectively balance the sensitivity and stability of the signal features depending on the applied situation. Extensive interpretability of the GDI formula with reasonable physical significance will be validated within a list of effective signal features in existing literatures; (3) As both short daily data and a considerable amount of historical data are likely to be used to determine a specific threshold, the clustering algorithm, as one of the unsupervised learning methods for effective anomaly detection, is utilized to find out the existing outliers in two kinds of sequences of proposed features extracted from a flexible number of training samples. The validation is made upon the battery pack data derived from the real-world thermal runaway cases, and the name of battery/EV manufacturers cannot be revealed due to their regulations related to data confidentiality, thus photos of BMS or burned ruins unavailable; (4) To illustrate the feasibility of identifying some specific abnormal patterns in the moving-window observed voltage signals, three case studies with preferably selected parameters of the proposed formula are presented, and the time-priority weight is considered over multiple voltage segments in the quantified anomaly evaluation procedure for a definitive diagnosis report.
The remainder of this paper is organized as follows: Section 2 specifies the VMD algorithm and utilization, and GDI construction formula is introduced as the feature extraction method with moving-window observation in Section 3. Then the sparsity-based clustering method is introduced and implemented upon weighed feature sequences to obtain the diagnosis results in Section 4. The performance of the proposed method is verified, compared with several different methods in three practical cases in Section 5. At last, conclusions are presented with future work in Section 6.

Signal Processing with VMD
The performance of using VMD for non-stationary and nonlinear signal analysis has been validated in extensive studies on mechanical vibration signals [32][33][34][35]. Enlightened by this outstanding signal processing technique, the assumption can be made that we can learn from machinery fault diagnosis to perform battery fault diagnosis, because the battery voltage signals share a lot of similarities with machinery vibration signals. Regarded as response signals carrying the information characterizing system states and potential faults, both of the two signals are observable outputs of sophisticated nonlinear systems with high uncertain excitations under actual operating circumstances. In order to acquire significant components for anomaly detection and fault diagnosis, this signal analysis method is introduced.
VMD was proposed by Dragomiretskiy and Zosso for adaptive and non-recursive signal analysis [36]. In order to distinguish the true fault signals from battery inconsistencies shown in voltage signals when an actual electric vehicle is operating, VMD is used to decompose voltage signals into narrow-band components with independent center fre-quencies. The decomposed battery signals are considered to contain two main components: the dynamic part, as the transient state response of actual vehicle operating without the offset caused by the battery state, and the static part, derived from varying battery states with the actual vehicle operating. The operation state of charging or discharging can be easily classified by current and speed in practice.
The framework of VMD is to evaluate the moderate bandwidth of k number of decomposed components called intrinsic mode functions (IMFs), which are compact around independent center frequencies. In frequency spectrum, the cell voltage signals are initially processed by using the Hilbert transform, and then the formulated constrained variational optimization problem can be expressed by Equation (1): where denotes the k-th component of the signal, denotes center frequency of the k-th component of the signal, f denotes the original signal, denotes the Dirac distribution, and * denotes the convolution operator. After using a quadratic penalty term and Lagrangian multipliers, the constrained variational optimization problem can be further expressed by Equation (2): where denotes the balancing parameter of the data-fidelity constraint. Considering the actual sampling frequency and current ratio, the order of magnitude can be 10 2 to 10 3 . Then it can be solved with the alternate direction method of multipliers, and all the IMFs gained from solutions in spectral domain of frequencies are shown by Equation (3): where ( ), ( ), ( ) and ( ) respectively denote the Fourier transforms of ( ), ( ), ( ) and ( ) to be determined. The components in the time domain can be finally obtained as the real part from the inverse Fourier transform of the filtered signal.
Considering the specific problem of battery fault diagnosis, the decomposed IMFs with different main frequency bands will be further classified into two primary categories, for k could be larger than 2, with the algorithm parameters determined by the actual sampling frequency. IMFs with lower frequency bands can efficiently express the variation of battery states with longer time scales, whereas components containing higher frequency bands can be loaded with potential transient features indicating certain abnormality as shown in Figure 2b. In this way, voltage data records are necessarily processed to reduce the impact of the uncorrelated state factors of cells. The features are then extracted and interpreted to fuse the evidence concerning the abnormalities in the following section.

Generalized Dimensionless Indicator Extraction Formula
Dimensionless indicators (DIs) are diagnosis characteristics that are sensitive to battery abnormalities and faults, instead of working conditions, for they are less affected by the disturbance with the size of the moving window appropriately selected. In existing studies [37], DIs are traditionally defined using Equation (4): where | | denotes the signal amplitudes, ( )denotes the probability density function (PDF) of corresponding signal amplitudes, and , are two different positive constants. If the quantity of signal records N is sufficient in temporal moving window, and the weight may as well be the same for each discrete record in the data vector x , ,…, , the discrete calculation will be approximately as shown in Equation (5): and (6): where Several dimensionless signal features frequently used are interpreted in DI definition as shown in Table 1. In fact, the first six of them have been extensively utilized for fault diagnosis of rolling bearings, while the last two with their extended forms have been typically applied to battery fault diagnosis.

DI Interpretation and Calculation Formula
Discrete GDI Interpretation where denotes the movingwindow size.
Traditional DI formula It can be inferred from Table 1 that the first four features strictly follow the DI construction formula in Equation (4), but are hardly the same as the others, particularly in the last two cases. That is, though correlation coefficient and entropy have been widely used in signal-based battery fault diagnosis study, there is a lack of generalized formula to realize a comprehensive coverage along with the others efficiently. Moreover, redefined dimensionless indicators (RDIs) under separately discussed conditions [38] are as shown in Equation (7): where constants , , ∈ 0.5, 5 are assigned with a step size of = 0.5, and constant ε ∈ 1, 5 , ∈ 0.5, 5 (ε ≠ ) are respectively assigned with step size = 1, = 0.5. In this way, hundreds of different RDIs could be obtained and well-chosen with various selection methods, but it is obvious that there is a lack of relevant physical meaning to explicitly account for every one of them, thus impeding subsequent fault identification. Besides, by means of introducing additional variables to imply different signal sources for the numerator and the denominator, the machinery trouble-free signals and the characteristics of the fault signals were together included in the DI definition [39]. Based on these studies, the GDIs are hereby defined as Equation (8): where , denote the simply processed signal vectors from single or multiple signal sources or constants, p(·) denotes the PDF of signal records, l , m are positive constants, and , denote the order of GDIs in terms of signal source complexity, as many features are first-order indexes as the first half in Table 1. The steps of the feature construction and extraction part are as follows: • Firstly, the collected voltage data records should be divided into segments. The components of each segment are obtained after VMD analysis. To reveal cell inconsistency, the static parts will be efficiently used.

•
Then, the decomposed components of each segment, especially the dynamic parts, are observed in moving windows with a proper size determined by the actual sampling frequency. In order to enhance the flexibility of the formula, , are obtained after processing the decomposed components, like data standardization and scale transformation with simple functions.
• Next, constant , are assigned positive integers, for , -order roots of the integrals ( ) and can be theoretically regarded as the first moment, second moment, third moment etc. of , .

•
Finally, the order of a GDI is evaluated by the supplemental constant , which means that there may be not more than potential signal sources with different signal dimensions, as seen in the correlation indicator, or th-power of , th-moment of a data vector from a certain signal resource, such as the skewness indicator and the kurtosis indicator, or combination of these scenarios.
It is noteworthy that, there could be different values of and , as the correlation indicator interpretation further proved in the Appendix. In contrast to those existing DI extraction methods, the construction procedure of GDI is accompanied with additional physical significance instead of value assignment with a certain interval in reasonable ranges to parameters in the formula and then seeking for the ones containing distinct discrepancy among them, which is critical for further identification of the abnormal signal pattern. This can be realized by the following fact that, the positivity and negativity of the identified outliers in different kinds of feature sequences can also be considered as a useful indicator of specific abnormal patterns in the moving-window observed voltage signals, such as overvoltage/undervoltage, and abnormal rising/falling tendencies.
Note that in order to eliminate the end effect and mode mixing, both z and s equal signed voltage signals minus the average sequence of static components. As a typical set of the preferable parameters, the discrete parameter values in this paper are assigned = 3, = 3, = 2, = 1/ shown in Figure 3 for the following illustrations.

Clustering upon Feature Sequences
As a prevailing unsupervised learning method, clustering is adopted in this paper to find the outliers that represent cell abnormalities in feature sequences. Generally, there are three kinds of ideas for clustering [40]: • Normal data records belong to a cluster in the data, whereas anomalies don't belong to any cluster; • Normal data records lie close to their closest cluster centroid, whereas anomalies are far away from their closest cluster centroid; • Normal data records belong to large and dense clusters, while anomalies either belong to small or sparse clusters.
For the first kind of clustering methods, they are not optimized to find battery cells with the abnormalities and the inducements directly, such as density-based spatial clustering of applications with noise (DBSCAN) clustering and spiking neural networks (SNN), because the main aim of the underlying clustering algorithm in this paper is to obtain the clusters from the feature sequences without completely known thresholds in advance. Besides, a certain outlier may become the centroid of one cluster itself, and the techniques satisfying the second will be invalid to detect such anomalies, such as k-means clustering and self-organizing maps (SOMs). Therefore, utilizing clustering algorithms conforming to the third is preferable in this paper, which means that, the specific cell data records would be abnormal if the density of their clusters were lower than a threshold. In essence, it is more feasible and convincible to determine the actual quantity of abnormal cells (usually starting from one single cell in reality) that may lead to a devastating consequence, than to determine a certain adaptive-questionable threshold by supervised learning with very limited training data. In fact, after implementing the clustering algorithm optimized by searching for the least sum of centroid distances, clusters that contain only very few records in feature sequences indicate potential cell abnormalities.
Here, two types of sequences related to GDIs are utilized as the main material for anomaly detection in the proposed method: the deviation from the mean GDI sequence, and the forward difference of the deviation sequences shown in Figure 4. Certainly it is available to observe the variation tendency of the sequences with longer time scales, but the typical range of voltage sampling frequency is 10 −1 to 10 0 Hz, we pay attention to the horizontally-compared difference for detailed investigation, along with the difference of the GDI sequences within a relatively short period. That is to say, the difference in the variation of original signals between the last moments of temporal/previous observation, can be assessed in many dimensionless ways, so that abnormal patterns could be detected sensitively and intuitively by finding out the outliers in various sequences made of GDIs. To make quantified evaluations, time-priority is one of the most important factors to consider, by which the degree of cell abnormality can be determined and accumulated. If the sparsity degree of a certain cluster with minimum GDI units at moment is low enough, the damping factor is firstly introduced in the procedure of anomaly evaluation to manifest the impact of time-priority in a specific segment. Then in order to make horizontal comparisons, absolute values of the minor ones divided by the standard deviation at corresponding moments are added to the anomaly counting vector as shown in Equation (9): where ( ) denotes the vector to accumulate the degree of abnormality in current data segment up to moment , ( ) denotes the feature vector including detected sparse clusters whose member amount is less than threshold (preferably 2) at sample moment , denotes the non-negative exponential damping constant.
If it is required to investigate some very temporal details in online applications, constant is assigned a large positive value, so that the horizontal comparison shall play a leading role in the evaluation procedure. In contrast, the analysis on multiple integrated historical voltage data segments may prefer a small positive value for constant , to achieve an overall judgment on contributions at different detected moments of similar weights. As a consequence, the quantified degree of cell abnormality can be determined moment by moment as shown in Figure 5, with very little contributions to from actually normal cells at potentially misclassified valid moments. In this way, false alarms can be significantly reduced, compared with classical threshold-based supervised learning methods, such as 3 principle thresholds, and quantile linear thresholds. Finally, the sequence of can be obtained by applying the proposed algorithm to a few realtime voltage data vectors, or even a flexible amount of historical pack voltage data segments adaptively, verified in the following section.

Results and Discussion
Comparisons of the results obtained by implementing different techniques from the existing studies are made in this section. There are three practical fault cases of electric vehicles driven by batteries with Li(NiMnCo)O2 cathodes, and the results will be respectively discussed below. The voltage sampling frequency of all cases is 0.1 Hz with 1 mV resolution. The detailed algorithms chosen to make comparisons with our proposed method are the correlation-based method, the entropy-based method with data normalization, and the threshold-based method by the practical BMS. However, considering that there may be either a fixed threshold or a time-varying threshold to determine, eliminating as many false alarms as possible is an important objective, with the same size of the moving window in each case to observe the voltage signals.

Case 1: Pre-Fault Data up to BMS Alarm of a Thermal Runaway Accident
The pre-fault dataset starts from the last discharging segment and ends at the 440th moment of credible BMS detection as shown in Figure 2. It can be inferred from Figure 5 that outliers are seen at the 430th, the 433rd, and the 439th moment, and negative values of the differential GDIs can indicate an abnormal falling tendency of the 27th cell's voltage. In turn, positive values of the detected outliers could imply that abnormal rising of cell voltage data takes place.
Using a fixed threshold, such as 0.5 typically used in [27], no credible detection can be made by correlation coefficients in Figure 6a. Besides, the quantity of false detections will dramatically increase if a time-varying threshold is adaptively used to investigate the whole segment. After using a fixed threshold, the earliest credible detection could be given at the 433rd moment by the correlation-based method, but the specific voltage data performance of the abnormal cell remains unknown, because correlation coefficients can be significantly reduced by diversified means.
As shown in Figure 6b, the earliest outlier can be logically detected with 3σ thresholds at the 436th moment, but the quantity of false detections is extremely disappointing.
The results are shown in Table 2. Since no precursory signals are credible before the voltage plunging, the earliest fault detection confirms the excellence of the sensitivity of the proposed method.

Case 2: Voltage Data up to the First BMS Alarm (the 1280th Moment) with Intermittent Inconspicuous Anomaly Occurrence
In the second case, frequent occurrence of inconspicuous under-voltage can be found in the historical voltage signals, and BMS detection was triggered at the 1280th moment. The dataset as shown in Figure 7 begins from the beginning of the uploaded data, containing multiple charging and discharging segments. Four relevant undervoltage instances are mainly concerned in the case shown in subplots of Figure 7. Before the final moment detected by BMS, anomaly detection cannot be realized by correlation coefficients without false detections, regardless of whether a time-varying threshold or a fixed threshold is used, as shown in Figure 8a.
As for using an entropy-based detection as shown in Figure 8b, it is appropriate to adopt a time-varying threshold compared with a fixed one, then the anomaly detections can be made roughly in accordance with the proposed method as in Figure 8c. However, it is still incapable of acquiring the knowledge about the specific voltage data performances of the abnormal cell, and false alarms take place frequently in the entropy sequences.
It can be inferred from the result of the proposed method that, after using the proposed method, four successful outlier detections can be credibly made in the concerned local parts. Then we make a little discussion on the abnormal performance identification in this typical case below. The abnormal pattern of voltage signals of the relevant cell can be recognized as multiple short-time under-voltage instances of the 58th cell, because positive outliers in differential GDI deviation sequence of that cell can be observed exactly a window size later from negative outliers of the same cell. In fact, the duration of a short voltage pulse could be determined by subtracting the moving window size from the time interval of the proposed feature's sign reversion. In turn, if an overvoltage pulse emerges, the sign of the outliers in the sequences will change from positive when the pulse slides into the moving window observed, to negative as the pulse slides out.
The results are listed in Table 3, and there is also a lack of definitive overall conclusion by using the entropy-based method, in spite of the similar detections made in the concerned local parts. The comprehensive result in Figure 9 indicates that, despite the short fluctuation at the beginning, the 58th cell has been evaluated as the most abnormal cell, starting from the 472nd moment to the last moment with the damping factor = 0.1 in (9) by the proposed method. To sum up, types, durations, and severity of local abnormalities can be effectively identified by the proposed method, together with a comprehensive evaluation that is steadily attained as early as possible, on cell abnormalities over multiple segments in this case. Little contribution to

Case 3: Last Discharging Segment before Fault Occurrence During Fast Charging
In the third case, due to historical data accessibility of the electric vehicle that suffered a very severe fault in a fast charging process, the utilized dataset shown in Figure 10 is the whole last discharging segment before the fault occurrence when it was charged. The dataset shown in Figure 10 contains two different types of patterns that are recorded in actual vehicle operation: aggresively varying discharging with excessively long sampling intervals, thus failed to immediately trace the accurate transient variation of voltage signals in the first half, and a steady voltage reduction with a little standby-mode fluctuation in the second half. By virtue of easily implemented manual temporal-series examinations upon the short dataset, an overvoltage pulse of the 52nd cell can be considered to be the major abnormal performance in the second half. Since no anomaly detection by BMS is available as a comparison, the robustness of the methods will be mainly verified in this case.
Adopting a time-varying threshold in the sequence of correlation coefficients, the voltage records of the 52nd cell are considered as outliers from the 24th to 30th moment, but it is likely to make lots of false detections shown in Figure 11a with no further performance interpretation available. Unfortunately, no valid detections related to the 52nd cell could be significantly made by entropy-based methods in Figure 11b regardless of the threshold type selected.

Detected by BMS
It can be inferred from the result of the proposed method in Figure 11c that, the moments while aggresively varying voltage amplitudes can be effectively treated as invalid clustering moments in the first half, because the quantity of the feature points in the minor clusters is not low enough, thus avoiding such false alarms. In the second half, an outlier with large negative amplitude at the 36th moment can be recognized and assigned with a large weight value. The 3 mV rising edge of the abnormal voltage pulse is not as obviously detected as its 5 mV falling edge, which makes a positive outlier that should have been obviously detected hidden in the forward difference of the GDI deviation sequences.  The results are listed in Table 4 and provide us with the verification on the balanced performance of sensitivity and stability of the proposed method, thus robust to diversified actual operating conditions.

Conclusions
In this paper, a comprehensive signal-based fault diagnosis method for lithium-ion batteries in electric vehicles is proposed. The reduced hardware cost attributed to no effort required in battery modeling makes it suitable for online applications, and it is proven that the method can detect the initial stages of the fault, by capturing the abnormal varying tendencies or short-time pulses, with corresponding variation reflected in recognized outliers in various feature sequences. Besides, this method contains the following highlights: The cell inconsistency in battery states can be separately analyzed by VMD, which is explained from the perspective of different frequency bands of lithium-ion battery responses. The characteristics of dynamic responses under both charging and discharging conditions can get involved in fault diagnosis, whereas the static components can be used in the path of battery aging mechanism investigation and cell state estimation.
Next, the formula of GDI construction is proposed in (8) as the part of signal feature extraction, and existing features in battery fault diagnosis papers, such as correlation coefficient, entropy, and diversified signal indexes have been proven to be particular cases of the generalized formula. Preferable parameter values with a practical significance have been suggested and taken as the example to extract the critical characteristic sequences carried with signed indicators from voltage data signals. The moving window observation with an appropriate size is indispensably adopted here to satisfy the demand on balancing the stability and sensitivity of the method, no matter online or offline. The first-order forward difference of the GDI deviation sequences is representatively used to illustrate the subsequent process of signal fault diagnosis.
Moreover, the excellence of unsupervised learning is verified in signal anomaly detection, because a well-compatible sparsity-based clustering method is utilized to find out the outliers in feature sequences. Credible detections can be made by the proposed judgment, whereas the judgment leads to either a growing number of false negative fault, or frequent false positive faults in other threshold-based ways. In addition, to draw an overall diagnostic conclusion, a damping factor as the time-prior weight factor with temporal standardization of the absolute value is introduced in the comprehensive scores of abnormal degree quantification as in (9). The counting is validated to be very useful for early stage detections if slight abnormalities of a certain battery cell take place intermittently.
At last, the fault identification is realized by well-founded discussions on the sign variation, intervals, and time-priority weighed standardized absolute values of the outliers in the sequences, so that local abnormal signal patterns can be correctly identified in terms of types, duration, and severity with good robustness, which is hardly feasible in other existing signal-based methods.
The future work is to make modified evaluations based on multiple feature and method fusion from diversified physical signal sensors [41], and to enhance the adaptive ability of thresholds of various algorithms upon different types of batteries. With the development of data acquisition and storage techniques, an increasing number of decomposed signal components from different types of sensors will be available for further specific studies, so that the sensor failure and cell state inconsistency shall be truly isolated.
Author Contributions: Methodology, X.C.; software, X.C.; validation, X.C., C.Z., W.Z. and L.Z.; resources, C.Z., W.Z., J.J. and Y.J.; data curation, X.C. and Y.J.; writing-original draft preparation, X.C.; writing-review and editing, C.Z., J.J., W.Z., Y.J. and L.Z.; funding acquisition, C.Z., W.Z. and J.J.; All authors have read and agreed to the published version of the manuscript. Then we turn to investigate the numerators of the two sub-GDIs ( , ), the numerator of is as shown in (A4): where denotes the product of corresponding elements in vector where , , , denote the corresponding elements in vector , and , , with , = 2, = , , = , = 1. So far, with respect to the feasibility of interpreting the 8th listed signal feature by the proposed GDI formula, a strict proof by expression transformation and parameter assignment has been given.