Article

Early Prediction of the Health Conditions for Battery Cathodes Assisted by the Fusion of Feature Signal Analysis and Deep-Learning Techniques

1 School of Transportation Science and Engineering, Beihang University, Beijing 102206, China
2 College of Automotive Engineering, Jilin University, Changchun 130022, China
3 Breathe Technology Ltd., Cambridge CB3 0QH, UK
4 School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083, China
5 Department of Mechanical Engineering, Imperial College London, London SW7 2AZ, UK
6 Electrochemical Innovation Lab, Department of Chemical Engineering, University College London, London WC1E 7JE, UK
7 School of Engineering and Materials Science, Queen Mary University of London, Mile End Rd., Bethnal Green, London E1 4NS, UK
* Authors to whom correspondence should be addressed.
Batteries 2022, 8(10), 151; https://doi.org/10.3390/batteries8100151
Submission received: 24 August 2022 / Revised: 19 September 2022 / Accepted: 28 September 2022 / Published: 1 October 2022

Abstract

With the rapid development of clean energy vehicles, the health diagnosis and prognosis of lithium batteries remain challenging in practical applications. Accurate state-of-health (SOH) and remaining useful life (RUL) estimation provides crucial information for improving the safety, reliability and longevity of batteries. In this paper, a fusion of a deep-learning model and feature signal analysis is proposed to realize accurate and fast estimation of the health conditions of battery cathodes. Specifically, the long short-term memory (LSTM) network and differential thermal voltammetry (DTV) are used to verify the fusion method. Firstly, DTV feature signal analysis is performed on battery charging and discharging data, and useful feature variables are selected with Pearson correlation analysis. Next, a deep-learning model with the LSTM as its core is constructed and trained on time-series datasets built from these features. Finally, validation and error analysis of the proposed model are provided, showing a maximum mean absolute error of 0.6%. The proposed method enables highly accurate models for SOH and RUL estimation that could potentially be deployed in the cloud for offline battery degradation tracking.


1. Introduction

Lithium-ion batteries (LIBs) are used as efficient secondary power storage devices in electric vehicles (EVs), space shuttles and smart grids [1,2,3]. However, the performance of LIBs deteriorates with continued usage [4,5,6]. When the available capacity of a battery drops below 80% of its initial capacity, its risk of failure increases significantly. Therefore, accurate estimation of the remaining useful life (RUL) and prediction of the state of health (SOH) of high-energy electrodes under dynamic operating conditions are important for improving the safety and reliability of commercial batteries.
Currently, there are three main approaches to battery SOH and RUL prediction: model-based methods, data-driven methods and feature signal analysis methods [7,8,9].
Model-based methods make predictions by constructing battery models that are mechanistic, empirical, semi-empirical, or a fusion of mechanistic and empirical formulations. For example, an empirical exponential growth model of LIB impedance degradation has been fitted to degradation datasets using standard particle filtering (PF) and resampling techniques [10]. A simplified equivalent circuit model has been combined with the Gauss-Hermite particle filter, an extension of the PF technique, to track capacity values [11]. A reformulated pseudo-2D (P2D) model was developed for RUL prediction to reduce model complexity [12]. Model-based methods offer the highest interpretability and expose a sufficient set of intermediate variables, but they are relatively complex to establish. Furthermore, model accuracy is easily affected by variable current and temperature under different external conditions. Although a high-fidelity battery model can effectively improve estimation accuracy, it also greatly increases computational complexity, which makes it unsuitable for the microcontroller computing environment on board a vehicle [13,14].
Data-driven models rely directly on historical observations to predict SOH. They have a higher fitting capability for non-linear data and a wider range of applications, and have become increasingly popular for capacity estimation of LIBs in recent years [15,16,17]. Data-driven methods rely entirely on experimental data and usually do not require much knowledge of the internal mechanisms of the battery. Capacity is estimated by learning the complex dependence of battery capacity on features extracted from measurements such as voltage, current and temperature. Liu et al. proposed a data-driven approach to forecast the capacity fade trajectory, extracting features with physical meaning from discharge voltage curves [18]. Severson et al. used machine-learning tools to both predict and classify cells by cycle life, making use of discharge voltage curves from early cycles that have yet to exhibit capacity degradation [19]. A framework combining a deep convolutional neural network (DCNN) with a double-layer long short-term memory (LSTM) network was proposed for battery SOH estimation and RUL prediction [20]. Data-driven methods are the easiest to implement, but they depend heavily on the quality of the input features and are relatively less interpretable. Because battery degradation data are time series, the LSTM is well suited to predicting battery RUL, and in this paper the LSTM network is used as the core of the model.
Feature signal analysis methods, based on electrochemical analysis and signal processing techniques, extract signal characteristics with which to analyze the battery degradation process. Several methods, such as incremental capacity analysis (ICA) [21], differential voltage analysis (DVA) [22], electrochemical impedance spectroscopy (EIS) [23] and differential thermal voltammetry (DTV) [24], have been proposed to extract suitable and reliable battery degradation characteristics. In these analyses, curve parameters such as peak and valley positions, peak heights and peak areas are usually compared and selected as indirect features that correlate significantly with battery capacity fade. Incremental capacity (IC) and IC peak area analysis have been employed to assess the consistency of battery modules, which is helpful for rapid evaluation and screening of retired batteries for second use [25]. A synthetic approach based on specific electrode behavior with proper adjustment was proposed to simulate various scenarios of battery degradation modes [26]. Weng et al. extended the ICA-based SOH monitoring approach from single cells to battery modules and proposed an IC peak tracking approach based on ICA principles for on-board implementation [27]. Incremental capacity analysis has also been applied to identify the various contributions to capacity fade [28]. These signal analysis methods focus on extracting information from signals to analyze battery health status. They are usually used as auxiliary tools in conjunction with model-based or data-driven methods, and their accuracy places high demands on sampling conditions, data transmission rate and data pre-processing.
Recently, with the rapid increase in available data and computing power, deep-learning methods have found wide application in search engines, data mining, machine translation, speech and image processing, and other fields. An electrode degradation estimation model built with deep-learning methods generalizes well, and the modeling process does not require complex knowledge of the internal mechanisms of the battery, which makes accurate estimation easier to obtain. However, like other data-driven methods, deep-learning methods are less interpretable and depend heavily on the quality of the input data. These drawbacks can be compensated for by training the model with diagnostic data that correlate strongly with battery degradation mechanisms and SOH. Cui et al. proposed an online SOH estimation framework based on ICA and a Time Pattern Attention Mechanism Long Short-Term Memory (TPA-LSTM) network [29]. Pradyumna et al. used impedance curves obtained from an electrochemical impedance spectroscopy (EIS) test and a convolutional neural network (CNN) to estimate capacity [30]. The DTV method, proposed by Wu et al. in 2015 [24], is a feature signal analysis method that bridges the microscopic phase transition characteristics of a battery and its macroscopic thermal and electrical characteristics. Phase transitions in the electrodes cause a change in entropy, which is a function of temperature, and thus lead to a change in temperature. The DTV analysis method therefore establishes a link between macroscopic measurable physical quantities and microscopic characteristics of the electrode, reflecting information on the lithium ions involved in electrochemical reactions at different aging stages. Compared with other commonly used methods, such as EIS and ICA, DTV is more suitable as a diagnostic method for practical applications because it requires only cell voltage and surface temperature measurements: it needs neither the accurate current measurements required by ICA nor the additional hardware required by EIS. Combined with the DTV analysis method, deep learning could become an effective and practical tool for tracking the health conditions of batteries.
In this paper, a fusion of data-driven methods and feature signal analysis methods is proposed to realize early prediction of the SOH and RUL of high-energy-density lithium-ion battery cathodes. To validate the proposed method, a data-driven model with an LSTM network as its core is combined with DTV feature signal analysis to construct the fusion model. Firstly, DTV feature signal analysis is performed on battery degradation data. Using the Savitzky-Golay (SG) filter and Pearson correlation analysis, highly correlated features are extracted from the Oxford University battery degradation datasets: the peak and valley information of the DTV curve and the singular values of the DTV vector. Next, a deep-learning model with an LSTM network as its core is constructed and trained with these features. The dropout technique is introduced to mitigate overfitting, and Monte Carlo (MC) simulation is used to characterize the prediction uncertainty. Finally, model validation and error analysis are performed. This work demonstrates that features extracted from the DTV curve can be used in a deep-learning model to improve long-term SOH and RUL prediction based on early degradation data. Within the CHAIN framework, the proposed model could provide accurate offline SOH and RUL estimation, or real-time online prediction when combined with a fast local empirical model, promoting the development of next-generation cloud-vehicle synergistic management systems [31,32,33,34].

2. Degradation Data

In this section, DTV feature signal analysis is performed on the battery aging datasets from Oxford University [35,36]. Using Pearson correlation analysis, two kinds of feature variables are selected as inputs to the deep-learning model established later: singular values of the DTV vector, and peak and valley information from the DTV curves.

2.1. Battery Aging Experimental Data

In this paper, the selected LCO/NCO battery degradation dataset was obtained from Oxford University and covers eight 740 mAh pouch cells (labelled #1 to #8). The cells consist of graphite anodes and lithium cobalt oxide/lithium nickel cobalt oxide (LCO/NCO) cathodes. The eight cells were cycled on an 8-channel Big MPG 205 battery tester and housed in an MK53 hot chamber at 40 °C with constant-current charge and discharge. Table 1 shows the specific parameters and experimental conditions. The C-rate is the charging or discharging current expressed relative to the nominal capacity of the cell. The voltage, current and temperature data recorded for two consecutive cycles of the test are shown in Figure 1a. Figure 1b shows the capacity degradation curves of the eight cells. At the end of the cycle test, the discharge capacities of batteries #2 and #5 drop sharply, and the surface temperature shows a clear change as well. Furthermore, battery #6 does not reach end of life (EOL). Therefore, batteries #1, #3, #4, #7 and #8 are selected for training and testing our model.

2.2. Differential Thermal Voltammetry Analysis and Feature Extraction

2.2.1. Noise Removal

Practical measurements contain noise caused by fluctuations in signal sampling. The temperature data in particular are much noisier than the voltage and current data, as shown in Figure 1a. Furthermore, the differential operation amplifies the error caused by noise, so the resulting DTV curve is very noisy.
To limit the effect of noise, the data in this work were extracted from the 1C constant-current (CC) charge cycles of a characterization test at a time interval of 20 s. If the selected voltage interval is too small, noise has a large impact on the results; if it is too large, valuable information may be lost. Because the voltage interval is proportional to the time interval during CC charging, 20 s is chosen as the interval.
Even with a fixed time interval, the DTV curves still contain a lot of noise, so the data are further processed with a Savitzky-Golay (SG) filter. The SG filter is a type of low-pass filter that is well adapted to data smoothing. It preserves the shape and height of waveform peaks, which makes it suitable for filtering the DTV curves to obtain high-quality features [37]. The filter can be described as follows:
$$y(i) = \sum_{j=-p}^{p} \frac{1}{N_c}\, C_j\, x(i+j)$$
where $x(i)$ represents the original input signal, $y(\cdot)$ the smoothed output signal, $C_j$ the coefficients given by the SG filter and $N_c$ the number of convoluting integers, which is equal to the smoothing window size $2p + 1$.
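As a rough illustration of the convolution above, the following sketch smooths a signal with a 5-point Savitzky-Golay window. The window size (p = 2), the quadratic fit order and the function name `sg_smooth` are assumptions made for illustration; the article does not state its filter settings. The tabulated weights (-3, 12, 17, 12, -3)/35 already include the normalization factor.

```python
# 5-point quadratic Savitzky-Golay weights (classic tabulated values).
# Window size and polynomial order are assumptions, not the article's settings.
SG_WEIGHTS_5 = [-3 / 35, 12 / 35, 17 / 35, 12 / 35, -3 / 35]


def sg_smooth(signal, weights=SG_WEIGHTS_5):
    """Smooth `signal` with the symmetric convolution y(i) = sum_j w_j * x(i + j).

    Samples near the edges that lack a full window are copied through unchanged.
    """
    p = len(weights) // 2
    smoothed = list(signal)
    for i in range(p, len(signal) - p):
        smoothed[i] = sum(
            w * signal[i + j] for j, w in zip(range(-p, p + 1), weights)
        )
    return smoothed
```

A quadratic input passes through this filter unchanged, which is the property that lets the SG filter keep peak heights better than a plain moving average. In practice a library routine such as `scipy.signal.savgol_filter` would normally be used instead.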
Figures 2 and 3 compare the filtered battery surface temperature and DTV curves with the original curves obtained from the raw data. The DTV data shown as Curve 2 in Figure 3a still contain noise, so they are filtered as well. The DTV curve contains obvious peaks and valleys, and the SG filter is suitable for preserving these important features.
The final result is Curve 3 in Figure 3a, which is the DTV curve used in the rest of this work. Because the smoothness of a function is measured by how many times it can be continuously differentiated, Figure 3b evaluates the smoothing effect by differentiating the DTV curve. As shown in Figures 2 and 3, the selected time interval combined with the SG filter effectively smooths the original temperature and DTV curves while preserving their features, including the peak and valley information.

2.2.2. Differential Thermal Voltammetry Analysis

The DTV method is an important tool for tracking battery health conditions and can be used to evaluate the patterns of the battery degradation process. Battery degradation is a complex physicochemical process, so analyzing its internal mechanisms and measuring the relevant parameters can be challenging. The change of entropy is a function of temperature, and the DTV method provides clear information related to entropy. Some feature variables, such as the positions and heights of the peaks and valleys of the curve, are directly related to the impedance increase and the nonuniformity of electrode performance during degradation, and reflect the phase transition characteristics. Throughout the degradation process, the positive and negative electrodes pass through different phase combinations. The transition between these phases may lead to a sudden change in entropy, which in turn may produce inflection points on the DTV curve. Specifically, the peak position describes the peak potential during the (de)intercalation stage, and its shift during aging describes the impedance rise and stoichiometric drift of the battery. The peak height represents the maximum heating rate of these stages. The peak width describes the potential window of the combined phase in the two electrodes. The peak area gives information about the heat generated during the (de)intercalation stage. Because the phase transition characteristics are closely related to battery degradation, DTV features are closely linked to battery degradation.
The DTV value is calculated by differentiating the battery surface temperature with respect to the terminal voltage during charging:
$$\mathrm{DTV} = \frac{dT/dt}{dV/dt} = \frac{dT}{dV}$$
where $T$ represents the battery surface temperature and $V$ the battery terminal voltage. That is, DTV can be obtained from only the temperature and voltage data of the battery during charging or discharging.
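The definition above can be approximated from sampled data with finite differences. The sketch below is only an illustration under the assumption that voltage and temperature are sampled at the same instants; the function name `dtv_curve` is hypothetical. Because both series share the same time step, the step cancels and the ratio reduces to ΔT/ΔV.

```python
def dtv_curve(voltage, temperature):
    """Return (midpoint voltages, dT/dV) from simultaneously sampled V and T."""
    if len(voltage) != len(temperature):
        raise ValueError("voltage and temperature must have the same length")
    v_mid, dtv = [], []
    for k in range(1, len(voltage)):
        dv = voltage[k] - voltage[k - 1]
        if dv == 0:
            # A flat voltage step carries no dT/dV information; skip it
            # rather than divide by zero.
            continue
        dtemp = temperature[k] - temperature[k - 1]
        v_mid.append(0.5 * (voltage[k] + voltage[k - 1]))
        dtv.append(dtemp / dv)
    return v_mid, dtv
```

Dividing two small differences amplifies measurement noise, which is why the temperature signal and the resulting curve are smoothed before features are read off.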
Figure 4 shows that, during charging, the peak heights decrease and the peak positions shift to higher voltages as the battery degrades. Furthermore, the valley gradually becomes smaller and moves towards higher potential. The DTV method thus reflects battery degradation characteristics by bridging microscopic phase transition characteristics and macroscopic signal characteristics.

2.3. Features Extraction of Battery Degradation and Correlation Analysis

2.3.1. Features Extraction

From the final DTV curve, characteristic information such as the position, size and width of the peaks and valleys is selected as candidate input features. These values are highly correlated with the change in battery impedance and are also direct and intuitive. The peaks and valleys can be expressed as follows:
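$$V_{peak} = \left\{ V_i \;\Big|\; \left.\frac{d\,DTV}{dV}\right|_{V_i} = 0 \ \text{and}\ f(V_i) \ge f(V),\ \forall V \in (V_{i-1}, V_{i+1}) \right\}$$
$$DTV_{peak} = f(V_{peak})$$
$$V_{valley} = \left\{ V_i \;\Big|\; \left.\frac{d\,DTV}{dV}\right|_{V_i} = 0 \ \text{and}\ f(V_i) \le f(V),\ \forall V \in (V_{i-1}, V_{i+1}) \right\}$$
$$DTV_{valley} = f(V_{valley})$$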
where $f(\cdot)$ is the mapping between voltage and DTV, and $V_{i-1}$ and $V_{i+1}$ are the voltages at the previous and next sampling points, respectively. As shown in Figure 5a, the coordinates of the first peak, the valley and the second peak are [FV1, FV2], [FV3, FV4] and [FV5, FV6], respectively; that is, FV1 to FV6 represent the values of $V_{peak1}$, $DTV_{peak1}$, $V_{valley}$, $DTV_{valley}$, $V_{peak2}$ and $DTV_{peak2}$. The evolution of these feature variables for battery #1 with increasing aging cycles is shown in Figure 5b,c.
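On sampled data, the conditions above amount to comparing each point with its two neighbours. The following sketch shows one way to do this and to assemble FV1 to FV6; it assumes a smoothed DTV curve sampled at increasing voltage with two peaks separated by a valley, and the function names are hypothetical rather than taken from the article.

```python
def find_extrema(v, dtv):
    """Return lists of (V, DTV) pairs for interior local maxima and minima."""
    peaks, valleys = [], []
    for i in range(1, len(dtv) - 1):
        if dtv[i] > dtv[i - 1] and dtv[i] >= dtv[i + 1]:
            peaks.append((v[i], dtv[i]))
        elif dtv[i] < dtv[i - 1] and dtv[i] <= dtv[i + 1]:
            valleys.append((v[i], dtv[i]))
    return peaks, valleys


def peak_valley_features(v, dtv):
    """Build [FV1, ..., FV6] from the first two peaks and the valley between them."""
    peaks, valleys = find_extrema(v, dtv)
    if len(peaks) < 2:
        raise ValueError("expected at least two peaks")
    (v1, d1), (v2, d2) = peaks[0], peaks[1]
    between = [pt for pt in valleys if v1 < pt[0] < v2]
    if not between:
        raise ValueError("no valley between the first two peaks")
    vv, dv = between[0]
    return [v1, d1, vv, dv, v2, d2]
```

On noisy curves this neighbour test reports many spurious extrema, which illustrates why smoothing is needed before peak and valley features can be extracted reliably.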
DTV feature signal analysis usually focuses on the changes in peaks and valleys, but accurate peak and valley information is difficult to obtain in practice because of noise, sampling intervals, electrochemical effects and other factors. Different measurement conditions can have a large impact on the results. Even when accurate peak and valley information can be obtained, operations such as DTV curve smoothing and peak and valley feature extraction increase the computational complexity of the overall battery degradation estimation model.
Therefore, in addition to the commonly used peak and valley information, we also use singular value decomposition (SVD) to extract the singular values of the DTV data. SVD is a simple and direct mathematical method that extracts feature information directly from the unfiltered DTV data. It avoids the extra effort of locating and extracting features from the peaks and valleys of the DTV curves and thus reduces the complexity of data processing.
The principle of singular value decomposition could be described as follows:
$$M_{DTV} = U\, \Sigma_{DTV}\, V^{T}$$
where $M_{DTV}$ is the matrix of DTV data with $k$ rows and $l$ columns. A column of DTV data can be seen as a matrix of $k$ rows and $l$ columns, where $k$ is the length of the data and $l$ is equal to one. SVD factorizes it into three matrices: $U$ is a $k \times k$ orthogonal matrix whose columns are the eigenvectors of $M M^{T}$, $V$ is an $l \times l$ orthogonal matrix, and $\Sigma_{DTV}$ is a $k \times l$ matrix that is nonzero only on its main diagonal. These diagonal entries are the singular values of the DTV vector.
The singular values are ordered along the diagonal from the largest (upper left) to the smallest (lower right) and decay very quickly, so the important information in the DTV matrix is concentrated in the leading singular values. In this paper, singular values are extracted from the DTV data of each cycle. The largest singular value is chosen as an additional feature, FV7, and Figure 5d shows how FV7 changes with battery degradation.
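When the DTV data of one cycle are treated as a single column (l = 1), as described above, the decomposition has exactly one singular value, and it equals the Euclidean norm of the vector, because the product of the transposed matrix with itself is then the 1 × 1 matrix containing the sum of squared entries. The sketch below computes FV7 under that reading. The function name is hypothetical, and if the DTV data were instead arranged as a matrix with several columns, a full SVD routine would be needed.

```python
import math


def fv7_from_dtv(dtv):
    """Largest (and only) singular value of a k x 1 DTV column vector.

    For a single column M, M^T M is the scalar sum of squares, so the
    singular value is its square root, i.e. the Euclidean norm.
    """
    return math.sqrt(sum(x * x for x in dtv))
```

For example, `fv7_from_dtv([3.0, 4.0])` evaluates to 5.0. No smoothing or peak search is involved, which matches the stated motivation of reducing the data-processing effort.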

2.3.2. Correlation Analysis

To improve the prediction accuracy and efficiency, the features with high correlation coefficients are selected through Pearson correlation analysis [38]. The Pearson correlation coefficient is defined as follows:
$$r_{xy} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2}\,\sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2}}$$
where $n$ is the number of samples in the series, $x$ and $y$ are the two variables being compared, and $\bar{x}$ and $\bar{y}$ denote the mean values of $x$ and $y$, respectively.
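The coefficient can be computed directly from its definition. The following is a minimal sketch with a hypothetical function name; here `x` would hold one feature (for example FV2 per cycle) and `y` the measured capacity per cycle.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (spread_x * spread_y)
```

The result lies between -1 and 1. A feature that falls as capacity falls gives a value near 1, and one that rises as capacity falls gives a value near -1, so feature selection would typically compare the absolute value against the chosen threshold.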
The correlation between battery capacity and the extracted features is shown in Figure 6, and the correlation analysis results for all batteries are listed in Table 2. The features extracted from the different batteries correlate strongly with capacity, and most of the correlation coefficients are above 90%. FV1 of battery #1, FV3 of battery #7 and FV1 of battery #8 do not reach 90% but still exceed 85%. Furthermore, the singular value feature exceeds 94% on each of the four batteries. Finally, features FV2, FV4, FV5 and FV7 are selected as the input features of the model: FV2, FV4 and FV5 are used together as one set of inputs, and FV7 is used as a second, separate input set.

3. Methodology

This section describes the framework of the proposed deep-learning method, which integrates the DTV analysis method with a data-driven method.

3.1. LSTM Network Structure

The LSTM neural network is a variant of the recurrent neural network (RNN) that is suited to long-term dependence problems and effectively mitigates the vanishing and exploding gradient problems. The LSTM introduces a unit state and uses three types of gates (the input gate, the forget gate and the output gate) to hold and control information. Its main feature is a main chain of information carried through the units, with the sigmoid function $\sigma(\cdot)$ and the hyperbolic tangent function $\tanh(\cdot)$ used in each network unit [39]. Each LSTM unit operates as follows:
The first step is to use the forget gate to determine which information is discarded from the unit state $c_{j-1}$ when the LSTM unit receives $x_j$ and $h_{j-1}$. The sigmoid activation function of the forget gate outputs a number between 0 and 1, where “1” means that the state value is completely retained and “0” means that it is completely discarded. This number is used to update the unit state $c_j$. The forget gate can be expressed as:
$$f_j = \sigma(W_f [h_{j-1}, x_j] + b_f)$$
Secondly, a sigmoid activation function and a $\tanh$ activation function form the input gate. The sigmoid function decides which values to update, and the $\tanh$ layer produces new information, creating a candidate state that takes part in the update of the unit state. These two steps can be described as follows:
$$i_j = \sigma(W_i [h_{j-1}, x_j] + b_i)$$
$$g_j = \tanh(W_g [h_{j-1}, x_j] + b_g)$$
The third step fuses the data from the forget and input gates, so that the unit state changes from $c_{j-1}$ to $c_j$:
$$c_j = c_{j-1} \times f_j + i_j \times g_j$$
Finally, the output gate generates a new hidden state $h_j$ based on the unit state $c_j$:
$$o_j = \sigma(W_o [h_{j-1}, x_j] + b_o)$$
$$h_j = o_j \times \tanh(c_j)$$
where $W$ represents the weights of each layer, $b$ the biases, $h_{j-1}$ the output of the previous step and $x_j$ the current input.
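The gate equations can be traced step by step in a few lines of code. The sketch below is a plain-Python illustration of a single LSTM step, not the TensorFlow implementation used in the article; the parameter layout (a dictionary mapping each gate name to a weight matrix and bias vector) is an assumption chosen for readability.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(weights, bias, vec):
    """Compute W @ vec + b for a list-of-rows matrix W."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]


def lstm_step(x, h_prev, c_prev, params):
    """One LSTM step. `params` maps 'f', 'i', 'g', 'o' to (W, b) pairs,
    where each W acts on the concatenation [h_prev, x]."""
    z = list(h_prev) + list(x)
    f = [sigmoid(a) for a in affine(*params["f"], z)]    # forget gate
    i = [sigmoid(a) for a in affine(*params["i"], z)]    # input gate
    g = [math.tanh(a) for a in affine(*params["g"], z)]  # candidate state
    o = [sigmoid(a) for a in affine(*params["o"], z)]    # output gate
    c = [cp * fj + ij * gj for cp, fj, ij, gj in zip(c_prev, f, i, g)]
    h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
    return h, c
```

With all weights and biases set to zero, every sigmoid gate outputs 0.5 and the candidate state is 0, so the unit state is simply halved at each step. This makes the role of the forget gate as a multiplicative retention factor easy to see.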
Since battery degradation data are time-series data, recurrent neural networks (RNNs) are well suited to battery SOH and RUL prediction. However, battery degradation is a long-term dependency problem, whereas general RNNs can only handle short-term dependencies and cannot avoid the vanishing and exploding gradient problems. For these reasons, the LSTM neural network is used as the main body of the model in this paper.

3.2. Component and MC Simulation

In this paper, the optimization algorithm based on the RMSprop method is used to train the LSTM network. The RMSprop algorithm has been empirically proven to be an effective and practical algorithm for deep-learning neural network optimization. It works very well for RNNs, and is one of the frequently adopted optimization methods nowadays.
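For readers unfamiliar with RMSprop, the sketch below shows its usual update rule: each gradient is divided by a running root-mean-square of recent gradients. The learning rate of 0.0012 is the value reported later in this article; the decay factor `rho`, the constant `eps` and the function name are assumptions, and framework implementations may differ in details.

```python
import math


def rmsprop_step(weights, grads, cache, lr=0.0012, rho=0.9, eps=1e-7):
    """One RMSprop update.

    `cache` holds the running average of squared gradients per weight.
    Returns the updated (weights, cache).
    """
    new_cache = [rho * c + (1.0 - rho) * g * g for c, g in zip(cache, grads)]
    new_weights = [w - lr * g / (math.sqrt(c) + eps)
                   for w, g, c in zip(weights, grads, new_cache)]
    return new_weights, new_cache
```

Because the step is normalized per weight, parameters with persistently large gradients take smaller effective steps, which is one reason the method tends to behave well on recurrent networks.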
Neural networks are prone to overfitting, a phenomenon in which the model performs well on the training datasets but poorly on the test datasets. Overfitting degrades the generalization ability and robustness of the model. Srivastava et al. proposed the dropout technique to address overfitting more effectively [40]. During training, dropout randomly removes neurons and their connections from the neural network. Each time dropout is applied, it is equivalent to sampling a smaller network, consisting of the remaining neurons, from the original network. Training a neural network with dropout can thus be regarded as training many sparse networks with extensive weight sharing, where each sparse network is rarely trained. The network becomes less sensitive to the specific weights of individual neurons, which makes the model more generalizable and less dependent on particular local features.
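The random removal of neurons can be sketched as a mask applied to a layer's activations. The example below uses the inverted-dropout variant, which rescales the kept activations during training so that no rescaling is needed at prediction time; this variant, the function name and the default rate of 0.2 (the value reported later in this article) are illustrative choices, not a description of the authors' code.

```python
import random


def dropout(activations, rate=0.2, rng=None, training=True):
    """Zero each activation with probability `rate` and rescale the rest."""
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    if not training or rate == 0.0:
        return list(activations)
    if rng is None:
        rng = random.Random()
    keep = 1.0 - rate
    # Kept units are divided by `keep` so the expected activation is unchanged.
    return [a / keep if rng.random() >= rate else 0.0 for a in activations]
```

Each call draws a new mask, so each training step effectively updates a different thinned network that shares weights with all the others.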
Both the predicted values and their confidence levels are important for assessing the performance of a predictor. The LSTM RNN can learn the long-term correlations in time-series data and predict data points multiple steps ahead, but it cannot provide prediction confidence. Therefore, we introduce MC simulation to characterize the prediction uncertainty. MC simulation requires a numerical algorithm that describes the system. All information about the degradation of battery performance is contained in the capacity datasets. Therefore, the LSTM RNN trained on capacity data is taken as the numerical algorithm that represents the battery degradation process; that is, the output of our LSTM model is used as the input to the MC simulation.
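The article does not spell out how the MC samples are drawn. One common option, assumed here purely for illustration, is to run the trained model many times with some source of randomness (for example, dropout left active) and to collect one predicted SOH trajectory per run. Given such samples, the sketch below turns them into an RUL distribution by finding where each trajectory first falls below the 80% end-of-life threshold mentioned in the Introduction. All function names are hypothetical.

```python
def first_crossing(soh, threshold=0.8):
    """Index of the first cycle whose SOH is below `threshold`, or None."""
    for cycle, value in enumerate(soh):
        if value < threshold:
            return cycle
    return None


def rul_distribution(sampled_trajectories, threshold=0.8):
    """Summarize end-of-life cycles over sampled SOH trajectories.

    Trajectories that never cross the threshold are ignored.
    """
    crossings = [first_crossing(t, threshold) for t in sampled_trajectories]
    crossings = sorted(c for c in crossings if c is not None)
    if not crossings:
        return None
    return {
        "min": crossings[0],
        "max": crossings[-1],
        "mean": sum(crossings) / len(crossings),
        "n": len(crossings),
    }
```

The spread between `min` and `max` plays the role of the interval over which the RUL probability density is distributed; a histogram of the crossings would give the density itself.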

3.3. Input and Output Data Structure

The specific structure of our inputs and outputs is shown in Figure 7.
Here, FV denotes the features, M the batch size, SOH the capacity and N the sequence length. The time series are constructed from the extracted features, and the input takes the form of a three-dimensional time-series tensor. The extracted feature variables are first arranged into a two-dimensional matrix. Next, two-dimensional fragments of length N are extracted from this matrix. These fragments are then stacked into a three-dimensional tensor, grouped according to the batch size M, to form the final input. The features of every two consecutive cycles are used as one time series to predict the capacity of the second of these cycles, and adjacent time series overlap so as to make full use of the data.
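The windowing described above can be sketched as follows, assuming the features are stored as one row per cycle and that the target of each window is the SOH of its last cycle. The defaults (sequence length 2, stride 1, batch size 2) follow the values reported in this article; the function names are hypothetical.

```python
def make_windows(features, soh, seq_len=2, stride=1):
    """Slice per-cycle feature rows into overlapping windows.

    Returns (X, y): X[k] holds `seq_len` consecutive feature rows and
    y[k] is the SOH of the last cycle in that window.
    """
    if len(features) != len(soh):
        raise ValueError("features and soh must cover the same cycles")
    windows, targets = [], []
    for start in range(0, len(features) - seq_len + 1, stride):
        windows.append([list(row) for row in features[start:start + seq_len]])
        targets.append(soh[start + seq_len - 1])
    return windows, targets


def make_batches(items, batch_size=2):
    """Group consecutive items into batches (the last one may be smaller)."""
    return [items[k:k + batch_size] for k in range(0, len(items), batch_size)]
```

Each batch of windows then has the shape (batch size, sequence length, number of features), which is the three-dimensional tensor layout that recurrent layers expect.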

3.4. Framework of the Proposed RUL and SOH Estimation Model

The specific framework of LSTM-based battery SOH and RUL prediction proposed in this paper is shown in Figure 8. The framework is divided into four parts: preprocessing, filtering and feature extraction, model construction and training, validation and error analysis.
In the first and second parts, data preprocessing and feature extraction are carried out. As described in detail in Section 2, two kinds of features are extracted to train and test the deep-learning model: peak and valley information from the DTV curves and singular values of the DTV vector. In the third part, a deep-learning model with an LSTM network as its core is constructed. The three-dimensional time-series tensor obtained in the second part is used as the training input, and the RMSprop technique is used to improve the convergence speed during training. Furthermore, the dropout technique is introduced to suppress overfitting, and MC simulation is introduced to provide uncertainty information. Finally, the proposed battery health condition prediction model is quantitatively analyzed and validated using the mean absolute error (MAE), the root mean square error (RMSE) and box plot analysis. Furthermore, the proposed model is verified on the other four cells.
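The two error metrics named above have standard definitions, sketched here with hypothetical function names. Both take the measured and predicted SOH series of the test cycles.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error between two equally long sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and equally long")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error between two equally long sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and equally long")
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    )
```

RMSE penalizes large individual errors more strongly than MAE, so reporting both gives a sense of whether the error is spread evenly or dominated by a few cycles.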

4. Result and Discussion

In this section, the proposed model is validated and its errors are analyzed. Different division proportions of the dataset are investigated first, in order to maintain accuracy with a minimum amount of training data, that is, to predict battery degradation as early as possible. Next, the robustness and stability of the model are demonstrated, and the accuracy of the predictions obtained with singular values as the input features is calculated.

4.1. Experiment Setup

We use TensorFlow to build the whole network and the compute unified device architecture (CUDA) to accelerate computation. The TensorFlow version is 2.5.0 and the CUDA version is 11.2. All experiments are performed on a server with an AMD EPYC 7543 32-core CPU and an NVIDIA A40 GPU running Ubuntu 18.04.
The accuracy of the model and the training process are strongly affected by the hyperparameters of the LSTM network, which are set in advance rather than learned by the network. The batch size determines the number of samples used by the model in one training step and affects both the quality and the speed of optimization. A larger batch size speeds up training but makes the model more likely to converge to a local optimum. After tuning, a batch size of 2 is found to be optimal. The learning rate also has a large impact on training: a large learning rate causes the gradient to oscillate and training may even fail to converge, whereas a small learning rate leads to very slow convergence. After tuning, a learning rate of 0.0012 is selected. The number of epochs is the number of passes over all the sample data. Too many epochs lead to slow training and overfitting, and too few may lead to underfitting. In our experiments, convergence is reached at 100 epochs and further training may lead to overfitting, so the number of epochs is set to 100. The number of nodes in the network directly determines its performance. Too few nodes may leave the network with insufficient information-processing capability and cause underfitting, whereas with too many nodes the limited information in the training set is insufficient to train all neurons in the hidden layer, which leads to overfitting. We searched the range 16–128, and the results show that 64 is optimal. The dropout rate is the probability of discarding a neuron: too low a rate has limited effect, while too high a rate leads to insufficient network training. After tuning, 0.2 is found to be the best value.
The sequence length represents the length of the input sequence, i.e., the size of the input window, and the sequence stride represents the step by which the window slides. Because the amount of training data is small, we set the sequence length to 2 and the sequence stride to 1 in order to make full use of the training data. The specific model structure and hyperparameters are shown in Table 3.
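The windowing described above can be sketched in plain Python. The function name `make_windows` is hypothetical, and pairing each window with the SOH of the cycle that follows it is an assumption made for illustration; the source does not spell out the labelling rule in this section.

```python
def make_windows(features, targets, sequence_length=2, stride=1):
    """Slice per-cycle feature rows into overlapping input windows.

    features: list of per-cycle feature vectors, in chronological order
    targets:  list of per-cycle SOH values, same length as features
    Returns (windows, labels). Each window holds `sequence_length`
    consecutive feature vectors; its label is the SOH of the next cycle
    (an assumption of this sketch).
    """
    windows, labels = [], []
    # A window may only start where a label still exists after its end.
    last_start = len(features) - sequence_length
    for start in range(0, last_start, stride):
        end = start + sequence_length
        windows.append(features[start:end])
        labels.append(targets[end])
    return windows, labels
```

With a sequence length of 2 and a stride of 1, five recorded cycles yield three training windows, so almost every sample is reused, which is the stated motivation for these settings.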

4.2. Validation of Early Prediction Ability

In this subsection, the SOH and RUL prediction results under different division proportions of the dataset are investigated to verify the early prediction ability of the proposed model. The red dotted line represents the predicted SOH and the green dotted line represents the real SOH. The division proportion is marked with the black lines. The purple area is generated with Monte Carlo simulation and gives the probability distribution of RUL. Three division proportions of the dataset are examined in this subsection: 10%, 20% and 30%. As shown in Figure 9a, the prediction error is 0.714% when 10% of the data are used for training. The RUL is 4600 cycles, and the probability density of the battery RUL is distributed between 4079 and 4194 cycles. Figure 9b shows that the prediction error is 0.473% when 20% of the data are used for training. The RUL is 3500 cycles, and the probability density of the battery RUL is distributed between 3254 and 3312 cycles. The precision is significantly higher than with 10% of the data. Figure 9c shows that the prediction error is 0.363% when 30% of the data are used for training. The RUL is 2800 cycles, and the probability density of the battery RUL is distributed between 2820 and 2856 cycles. Figure 9d shows the box plot of the prediction errors, which limits the effect of extreme outliers and gives a visual summary of the error distribution, with the small rectangle in each box representing the MAE. The box plot characterizes the error distribution through the maximum, minimum, median, upper quartile and lower quartile. As the proportion of the training set increases, the upper quartile decreases from 1.1% to 0.75% and then to 0.5%, and the lower quartile decreases from 0.4% to 0.25% and then to 0.15%. The MAE also decreases from 0.8% to 0.6% and then to 0.3%.
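One way to turn a set of Monte Carlo SOH trajectories into an RUL interval is sketched below. How the paper generates its samples is not described in this section, so they are simply taken as input here; the function names and the EOL threshold of 0.8 are hypothetical choices for illustration only.

```python
def first_failure_cycle(cycles, soh, eol=0.8):
    """Return the first cycle whose SOH is at or below the EOL threshold, or None."""
    for cycle, value in zip(cycles, soh):
        if value <= eol:
            return cycle
    return None


def rul_interval(cycles, sampled_trajectories, start_cycle, eol=0.8):
    """Summarize RUL over Monte Carlo samples as (min, max) in cycles.

    Each trajectory is one sampled SOH prediction over `cycles`.
    Samples that never reach EOL are skipped.
    """
    ruls = []
    for soh in sampled_trajectories:
        failure = first_failure_cycle(cycles, soh, eol)
        if failure is not None:
            ruls.append(failure - start_cycle)
    if not ruls:
        return None
    return min(ruls), max(ruls)
```

Because the data are recorded at discrete cycle numbers, the interval from such a procedure can only resolve failure to the spacing of the recorded cycles.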
When 30% of the data are used for training, the error is lower and the probability density distribution of RUL is closer to the real failure cycle, but the predicted RUL interval lies after the actual failure cycle. With 20% of the data, the prediction accuracy is slightly lower than with 30%, and the predicted RUL interval lies before the actual failure cycle. Under the strict condition that only 10% of the data are used for training, and given that the dataset is recorded in 100-cycle increments, some information may be missing, which affects the accuracy. Nevertheless, the model accuracy is still decent, with a prediction error of 0.714%. These results indicate that our method can achieve early SOH estimation and RUL prediction, that is, the proposed model maintains sufficiently high accuracy with a small amount of training data.

4.3. Validation of Robustness

In this subsection, the robustness and reliability of the proposed model are investigated. The capacity of some of the eight batteries does not fall below the EOL threshold, which makes it impossible to derive RUL predictions for them. Therefore, batteries #3, #4, #7 and #8 are selected for verification. The red dotted line shows the predicted SOH and the green dotted line shows the real SOH. With the Monte Carlo method, the probability distribution of RUL is obtained and shown as the purple area. In these validations, the first 20% of the data are used as the training dataset, and the following cycle is used as the starting cycle for prediction.
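A chronological split of this kind can be sketched as follows. The function name and both parameters are hypothetical. The `start_fraction` argument mirrors the "Data Start" column of Table 4, and applying the training ratio to what remains after that offset is an assumption, since the source does not state the exact convention.

```python
def chronological_split(samples, start_fraction=0.0, train_fraction=0.2):
    """Drop the first `start_fraction` of samples, then split the rest in time order.

    Returns (train, test) without shuffling, so the training part always
    precedes the part used for prediction.
    """
    if not 0.0 <= start_fraction < 1.0:
        raise ValueError("start_fraction must lie in [0, 1)")
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must lie in (0, 1)")
    start = int(len(samples) * start_fraction)
    remaining = samples[start:]
    n_train = int(len(remaining) * train_fraction)
    return remaining[:n_train], remaining[n_train:]
```

Keeping the split chronological matters for early prediction, because a shuffled split would leak late-life cycles into the training set.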
Figure 10 shows the validation of the robustness and reliability of the model using the commonly chosen peak and valley information as features. The proposed method is validated on all four cells, and the prediction errors are distributed around 0.4%. As shown in Figure 10a, the RUL is 2900 cycles, and the probability density of the battery RUL is distributed at 2700 cycles for battery #3. Figure 10b shows that the RUL is 2500 cycles and the probability density of the battery RUL is distributed between 2418 and 2469 cycles for battery #4. Figure 10c shows that the RUL is 4300 cycles and the probability density of the battery RUL is distributed between 3932 and 3968 cycles for battery #7. Figure 10d shows that the RUL is 3000 cycles and the probability density of the battery RUL is distributed between 2950 and 2984 cycles for battery #8. It can be seen that the predicted RUL probability densities do not cover the actual failure cycle precisely, but they are distributed in a small interval close to it. A possible reason is that the original dataset is recorded in 100-cycle increments, which may result in some missing information.
As shown in Figure 10e,f, the predictions with the data of the first 20% of cycles lost produce a mean absolute error of around 0.4%, which is close to the 0.473% obtained without data loss. Furthermore, the lower and upper quartiles and the MAE in the box plot are distributed around 0.25%, 0.6% and 0.75%, respectively, further demonstrating the robustness and stability of our model.
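The error statistics quoted from the box plots and bar charts (MAE, RMSE, quartiles) can be computed with the Python standard library alone. The sketch below is illustrative; the function name and the choice of the "inclusive" quartile method are assumptions, and the source does not state which quartile convention its plots use.

```python
import math
import statistics


def error_summary(y_true, y_pred):
    """Return MAE, RMSE and box-plot statistics of the absolute errors."""
    if len(y_true) != len(y_pred) or len(y_true) < 2:
        raise ValueError("need two equally long sequences with at least two points")
    abs_err = [abs(t - p) for t, p in zip(y_true, y_pred)]
    mae = sum(abs_err) / len(abs_err)
    rmse = math.sqrt(sum(e * e for e in abs_err) / len(abs_err))
    # Quartiles of the absolute errors, as drawn in a box plot.
    q1, median, q3 = statistics.quantiles(abs_err, n=4, method="inclusive")
    return {
        "mae": mae,
        "rmse": rmse,
        "min": min(abs_err),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(abs_err),
    }
```

RMSE is never smaller than MAE for the same errors, so a large gap between the two hints at a few large deviations rather than a uniformly higher error.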

4.4. Prediction Results with SVD Features

In this subsection, the singular values are utilized as the input features for training, and the four batteries are used to verify the proposed model. Furthermore, all of the SOH and RUL prediction results are obtained with the data of 20% of cycles lost. The red dotted line and the green dotted line represent the predicted SOH and the real SOH, respectively. The purple area shows the probability distribution of RUL, obtained with Monte Carlo simulations.
Figure 11 shows the prediction results using the singular values of the DTV vector as features. As shown in Figure 11a–d, the predictions with the data of 20% of cycles lost produce a mean absolute error of around 0.4%, which shows that the accuracy of prediction using the singular values of the DTV vector as features is slightly lower. However, the singular values can be extracted directly from the original DTV data without complicated operations such as filtering, and the accuracy is high enough for practical application. Furthermore, the predictions in this section start from the 20% point of the data, which supports the robustness and reliability of using singular values as features as well.
Figure 11e,f shows the box plot and the error analysis of the model. The lower quartiles are all distributed around 0.3%, and both MAE and RMSE are distributed at the same level. The upper quartile using singular values as features for batteries #7 and #8 reaches 0.8%, and the MAE and RMSE reach around 0.6% and 0.7%, respectively. It can be seen that extracting the singular values of the DTV data as features for prediction is feasible and can make the process simpler with comparable accuracy. The errors for batteries #7 and #8 are slightly larger than those for the other batteries, which is attributed to the inconsistency between batteries.
Generally, the singular values have a strong correlation with the capacity data and can encompass the information of peaks and valleys as an independent set of feature inputs. The prediction results using singular values as input features produce a tolerable error. Furthermore, applying singular value decomposition to the DTV data enables simpler data processing and feature extraction methods to extract strongly correlated features. The singular values could replace the peak-valley information, reducing the size of the input matrix and improving the efficiency of the model.
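For reference, the standard definition of singular values is written out below. How the DTV data are arranged into a matrix is not restated in this section, so the single-column case in the last line is only an illustrative assumption, and $A$, $v$, $n$ and $r$ are generic symbols rather than the paper's notation.

```latex
\begin{aligned}
A &= U \Sigma V^{\mathsf{T}}, \qquad
\Sigma = \operatorname{diag}(\sigma_1, \dots, \sigma_r), \qquad
\sigma_1 \ge \dots \ge \sigma_r > 0, \\
\sigma_i &= \sqrt{\lambda_i\!\left(A^{\mathsf{T}} A\right)}, \qquad i = 1, \dots, r, \\
A &= v \in \mathbb{R}^{n \times 1},\ v \neq 0
\;\Longrightarrow\;
\sigma_1 = \lVert v \rVert_2 = \sqrt{\sum_{k=1}^{n} v_k^{2}} .
\end{aligned}
```

Here $\lambda_i$ denotes the $i$-th nonzero eigenvalue of $A^{\mathsf{T}} A$ and $r$ is the rank of $A$. In the single-column case the decomposition yields exactly one feature per cycle, which would be consistent with a single singular-value feature replacing several peak and valley features.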

4.5. Computation Cost and Comparison with Other Methods

Table 4 shows the computation cost of training and prediction. As shown in Table 4, the training time is about 20 s (16.4–21.8 s) and the prediction time is about 0.5–0.7 s. The results show that the proposed method has low computational cost and high computational efficiency. The computation cost of training deep-learning models is usually large and increases dramatically with the data volume. The high efficiency of the proposed method benefits from extracting features from the DTV data, which reduces the amount of data, and from using correlation analysis to further filter the features, which reduces the number of features. At the same time, the amount of data we use is inherently small, and high-precision prediction is achieved with this small amount of data, which further supports the effectiveness of the proposed method.
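The correlation-based filtering mentioned above can be sketched with a plain Pearson coefficient. The function names and the threshold of 0.95 are hypothetical; the source reports the coefficients in Table 2 but does not state a selection threshold in this section.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equally long sequences with at least two points")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (norm_x * norm_y)


def select_features(feature_columns, capacity, threshold=0.95):
    """Keep the names of features whose |r| with capacity meets the threshold."""
    return [
        name
        for name, values in feature_columns.items()
        if abs(pearson(values, capacity)) >= threshold
    ]
```

Dropping weakly correlated features before training shrinks the input matrix, which is one of the two data reductions the paragraph credits for the short training time.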
Table 5 compares the prediction results of the proposed method with those of other works that have also conducted studies based on the Oxford battery dataset. These studies have utilized traditional machine learning algorithms and some hybrid algorithms to build battery degradation models. Most of their input features come directly from voltage and temperature data or are extracted with ICA methods, rather than from the DTV data used in this paper. In [41], a model with support vector regression (SVR) as the core is constructed, with dT and ICA as the features, respectively. In [42], a model with random forest as the core is constructed, with constant current charging time (CCCT) as the feature. In [43], support vector machine (SVM), multiple linear regression (MLR) and Gaussian process regression (GPR) are used as the core to build three models, which are fused through the random forest algorithm, with voltage, temperature and ICA as the features. In [44], models with CNN and gated recurrent unit (GRU) as the core are established, respectively, and the final outputs of the two models are concatenated and projected to the output to build a GRU-CNN model, with features extracted from voltage. Compared with most prediction methods in the table, the method designed in this paper has higher SOH prediction accuracy. This benefits from the input features provided by DTV analysis and the excellent ability of the LSTM network to handle long-term dependencies in time series. At the same time, we use less data as the training set, which further supports the advantage of this method. The comparison results show that the designed SOH prediction method has good prediction performance.

5. Conclusions

In this paper, the fusion of a data-driven method and a feature signal analysis approach is investigated to predict the SOH as well as the RUL of battery cathodes. The DTV feature signal is calculated from the collected battery voltage and temperature information. Two kinds of features are extracted from the DTV data as inputs to the data-driven model: the singular values of the DTV vector and the peak and valley information of the DTV curves. Subsequently, a deep-learning model is constructed with the LSTM network as the core. Furthermore, the dropout method is utilized to suppress overfitting, and the Monte Carlo (MC) method is introduced to provide uncertainty-related information.
Different division proportions of the dataset are investigated first. The results show that the prediction accuracy improves significantly as the training set grows. However, using 20% of the data balances accuracy against the amount of data required, with the SOH prediction error maintained at 0.473%, which makes it the best choice of training set for early prediction. Next, the robustness and reliability of the proposed model are validated by removing the first 20% of the data for different start-up cycles. The SOH prediction error is 0.433%, which is very close to the 0.473% obtained without data loss. Finally, the singular values are utilized as input features, giving an SOH prediction error of 0.582%, which is slightly higher than 0.473% but reduces the complexity of data processing while preserving the prediction accuracy.
The main contributions of this work can be described as follows. (1) The DTV curve, which can characterize the phase transitions of the electrodes, is obtained by cleaning, reducing, filtering and processing the raw voltage and temperature data, and is used to analyze the battery degradation process. (2) Two kinds of features are extracted; the singular values are simpler to obtain, reducing the cost and the computational complexity. (3) Only a small portion of data from early battery cycles is used for model training to achieve high accuracy of battery RUL prediction. (4) The robustness and reliability are investigated and validated with the data of 20% of cycles lost. Generally, the proposed LSTM-based model has good early prediction accuracy and robustness, showing that the fusion of data-driven methods and feature signal analysis methods has good prospects for application in the estimation of electrode health conditions. This work could improve the prediction accuracy, and it has the potential to support highly accurate offline prediction on the cloud end and, combined with a fast local model in CHAIN, real-time accurate estimation of SOH and RUL on the local end.

Author Contributions

Conceptualization, S.Y., X.L. (Xinhua Liu) and W.W.; methodology, X.L. (Xinhua Liu), W.W., L.Z. and T.Z.; software, W.W. and X.Y.; validation, W.W., L.Z. and H.Y.; investigation, H.W.; resources, X.L. (Xuekun Lu) and H.W.; writing—original draft preparation, W.W.; writing—review and editing, L.Z. and H.Y.; visualization, W.W., S.C. and F.L.; supervision, S.Y. and X.L. (Xinhua Liu); project administration, S.Y.; funding acquisition, S.Y. and X.L. (Xinhua Liu). All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by the National Natural Science Foundation of China (No. 52102470).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in the Oxford University Research Archive at https://dx.doi.org/10.5287/bodleian:KO2kdmYGg (accessed on 11 April 2022), reference numbers [35,36].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, L.-S.; Gao, X.-L.; Liu, X.-H.; Zhang, Z.-J.; Cao, R.; Cheng, H.-C.; Wang, M.-Y.; Yan, X.-Y.; Yang, S.-C. CHAIN: Unlocking Informatics-Aided Design of Li Metal Anode from Materials to Applications. Rare Met. 2022, 41, 1477–1489.
  2. Zhou, C.C.; Su, Z.; Gao, X.L.; Cao, R.; Yang, S.C.; Liu, X.H. Ultra-High-Energy Lithium-Ion Batteries Enabled by Aligned Structured Thick Electrode Design. Rare Met. 2022, 41, 14–20.
  3. Zhang, L.; Hu, X.; Wang, Z.; Ruan, J.; Ma, C.; Song, Z.; Dorrell, D.G.; Pecht, M.G. Hybrid Electrochemical Energy Storage Systems: An Overview for Smart Grid and Electrified Vehicle Applications. Renew. Sustain. Energy Rev. 2021, 139, 110581.
  4. Zhang, J.F.; Zhang, D.W.; Wu, X.W.; Ren, Q.L.; Qu, Z.G. Current-Voltage Characteristics and Breakdown of Different Structural Planar Microelectrodes in Atmospheric Air. AIP Adv. 2021, 11, 065301.
  5. Gao, X.L.; Liu, X.H.; Xie, W.L.; Zhang, L.S.; Yang, S.C. Multiscale Observation of Li Plating for Lithium-Ion Batteries. Rare Met. 2021, 40, 3038–3048.
  6. Meng, X.; Xu, Y.; Cao, H.; Lin, X.; Ning, P.; Zhang, Y.; Garcia, Y.G.; Sun, Z. Internal Failure of Anode Materials for Lithium Batteries—A Critical Review. Green Energy Environ. 2020, 5, 22–36.
  7. Li, Y.; Liu, K.; Foley, A.M.; Zülke, A.; Berecibar, M.; Nanini-Maury, E.; Van Mierlo, J.; Hoster, H.E. Data-Driven Health Estimation and Lifetime Prediction of Lithium-Ion Batteries: A Review. Renew. Sustain. Energy Rev. 2019, 113, 109254.
  8. Cai, L.; Meng, J.; Stroe, D.I.; Luo, G.; Teodorescu, R. An Evolutionary Framework for Lithium-Ion Battery State of Health Estimation. J. Power Sources 2019, 412, 615–622.
  9. Tao, H.; Lian, C.; Liu, H. Multiscale Modeling of Electrolytes in Porous Electrode: From Equilibrium Structure to Non-Equilibrium Transport. Green Energy Environ. 2020, 5, 303–321.
  10. An, D.; Choi, J.H.; Kim, N.H. Prognostics 101: A Tutorial for Particle Filter-Based Prognostics Algorithm Using Matlab. Reliab. Eng. Syst. Saf. 2013, 115, 161–169.
  11. Hu, C.; Jain, G.; Tamirisa, P.; Gorka, T. Method for Estimating Capacity and Predicting Remaining Useful Life of Lithium-Ion Battery. Appl. Energy 2014, 126, 182–189.
  12. Ramadesigan, V.; Chen, K.; Burns, N.A.; Boovaragavan, V.; Braatz, R.D.; Subramanian, V.R. Parameter Estimation and Capacity Fade Analysis of Lithium-Ion Batteries Using Reformulated Models. J. Electrochem. Soc. 2011, 158, A1048.
  13. Su, X.; Wang, S.; Pecht, M.; Ma, P.; Zhao, L. Prognostics of Lithium-Ion Batteries Based on Different Dimensional State Equations in the Particle Filtering Method. Trans. Inst. Meas. Control 2017, 39, 1537–1546.
  14. Zhou, Y.; Huang, M. Lithium-Ion Batteries Remaining Useful Life Prediction Based on a Mixture of Empirical Mode Decomposition and ARIMA Model. Microelectron. Reliab. 2016, 65, 265–273.
  15. Chen, C.; Xiong, R.; Yang, R.; Li, H. A Novel Operational Data-Driven Battery Open-Circuit Voltage Characterization Mining Method for Large-Scale Applications. Green Energy Intell. Transp. 2022, 1, 100001.
  16. Li, D.; Zhang, Z.; Liu, P.; Wang, Z.; Zhang, L. Battery Fault Diagnosis for Electric Vehicles Based on Voltage Abnormality by Combining the Long Short-Term Memory Neural Network and the Equivalent Circuit Model. IEEE Trans. Power Electron. 2021, 36, 1303–1315.
  17. She, C.; Wang, Z.; Sun, F.; Liu, P.; Zhang, L. Battery Aging Assessment for Real-World Electric Buses Based on Incremental Capacity Analysis and Radial Basis Function Neural Network. IEEE Trans. Ind. Inform. 2020, 16, 3345–3354.
  18. Liu, X.; Zhang, X.Q.; Chen, X.; Zhu, G.L.; Yan, C.; Huang, J.Q.; Peng, H.J. A Generalizable, Data-Driven Online Approach to Forecast Capacity Degradation Trajectory of Lithium Batteries. J. Energy Chem. 2022, 68, 548–555.
  19. Severson, K.A.; Attia, P.M.; Jin, N.; Perkins, N.; Jiang, B.; Yang, Z.; Chen, M.H.; Aykol, M.; Herring, P.K.; Fraggedakis, D.; et al. Data-Driven Prediction of Battery Cycle Life before Capacity Degradation. Nat. Energy 2019, 4, 383–391.
  20. Kong, D.; Wang, S.; Ping, P. State-of-Health Estimation and Remaining Useful Life for Lithium-Ion Battery Based on Deep Learning with Bayesian Hyperparameter Optimization. Int. J. Energy Res. 2022, 46, 6081–6098.
  21. Tang, X.; Zou, C.; Yao, K.; Chen, G.; Liu, B.; He, Z.; Gao, F. A Fast Estimation Algorithm for Lithium-Ion Battery State of Health. J. Power Sources 2018, 396, 453–458.
  22. Weng, C.; Cui, Y.; Sun, J.; Peng, H. On-Board State of Health Monitoring of Lithium-Ion Batteries Using Incremental Capacity Analysis with Support Vector Regression. J. Power Sources 2013, 235, 36–44.
  23. Messing, M.; Shoa, T.; Habibi, S. Estimating Battery State of Health Using Electrochemical Impedance Spectroscopy and the Relaxation Effect. J. Energy Storage 2021, 43, 103210.
  24. Wu, B.; Yufit, V.; Merla, Y.; Martinez-Botas, R.F.; Brandon, N.P.; Offer, G.J. Differential Thermal Voltammetry for Tracking of Degradation in Lithium-Ion Batteries. J. Power Sources 2015, 273, 495–501.
  25. Jiang, Y.; Jiang, J.; Zhang, C.; Zhang, W.; Gao, Y.; Guo, Q. Recognition of Battery Aging Variations for LiFePO4 Batteries in 2nd Use Applications Combining Incremental Capacity Analysis and Statistical Approaches. J. Power Sources 2017, 360, 180–188.
  26. Dubarry, M.; Truchot, C.; Liaw, B.Y. Synthesize Battery Degradation Modes via a Diagnostic and Prognostic Model. J. Power Sources 2012, 219, 204–216.
  27. Weng, C.; Feng, X.; Sun, J.; Peng, H. State-of-Health Monitoring of Lithium-Ion Battery Modules and Packs via Incremental Capacity Peak Tracking. Appl. Energy 2016, 180, 360–368.
  28. Dubarry, M.; Liaw, B.Y. Identify Capacity Fading Mechanism in a Commercial LiFePO4 Cell. J. Power Sources 2009, 194, 541–549.
  29. Cui, X.; Chen, Z.; Lan, J.; Dong, M. An Online State of Health Estimation Method for Lithium-Ion Battery Based on ICA and TPA-LSTM. In Proceedings of the IEEE Industrial Electronics and Applications Conference (IEACon), Penang, Malaysia, 22–23 November 2021; pp. 130–135.
  30. Pradyumna, T.K.; Cho, K.; Kim, M.; Choi, W. Capacity Estimation of Lithium-Ion Batteries Using Convolutional Neural Network and Impedance Spectra. J. Power Electron. 2022, 22, 850–858.
  31. Pang, M.C.; Yang, K.; Brugge, R.; Zhang, T.; Liu, X.; Pan, F.; Yang, S.; Aguadero, A.; Wu, B.; Marinescu, M.; et al. Interactions Are Important: Linking Multi-Physics Mechanisms to the Performance and Degradation of Solid-State Batteries. Mater. Today 2021, 49, 145–183.
  32. Yang, S.; He, R.; Zhang, Z.; Cao, Y.; Gao, X.; Liu, X. CHAIN: Cyber Hierarchy and Interactional Network Enabling Digital Solution for Battery Full-Lifespan Management. Matter 2020, 3, 27–41.
  33. Yang, S.; Zhang, Z.; Cao, R.; Wang, M.; Cheng, H.; Zhang, L.; Jiang, Y.; Li, Y.; Chen, B.; Ling, H.; et al. Implementation for a Cloud Battery Management System Based on the CHAIN Framework. Energy AI 2021, 5, 100088.
  34. Liu, X.; Zhang, L.; Yu, H.; Wang, J.; Li, J.; Yang, K.; Zhao, Y.; Wang, H.; Wu, B.; Brandon, N.P.; et al. Bridging Multiscale Characterization Technologies and Digital Modeling to Evaluate Lithium Battery Full Lifecycle. Adv. Energy Mater. 2022, 12, 2200889.
  35. Birkl, C. Oxford Battery Degradation Dataset 1 VO-RT-Aggregated Database. OP. Available online: https://ora.ox.ac.uk/objects/uuid:03ba4b01-cfed-46d3-9b1a-7d4a7bdf6fac (accessed on 11 April 2022).
  36. Birkl, C. Diagnosis and Prognosis of Degradation in Lithium-Ion Batteries VO-RT-Thesis. OP. Available online: https://ora.ox.ac.uk/objects/uuid:7d8ccb9c-1469-4209-9995-5871fc908b54 (accessed on 11 April 2022).
  37. Chen, J.; Jönsson, P.; Tamura, M.; Gu, Z.; Matsushita, B.; Eklundh, L. A Simple Method for Reconstructing a High-Quality NDVI Time-Series Data Set Based on the Savitzky-Golay Filter. Remote Sens. Environ. 2004, 91, 332–344.
  38. Schober, P.; Schwarte, L.A. Correlation Coefficients: Appropriate Use and Interpretation. Anesth. Analg. 2018, 126, 1763–1768.
  39. Greff, K.; Srivastava, R.K.; Koutnik, J.; Steunebrink, B.R.; Schmidhuber, J. LSTM: A Search Space Odyssey. IEEE Trans. Neural Netw. Learn. Syst. 2017, 28, 2222–2232.
  40. Mele, B.; Altarelli, G. Lepton Spectra as a Measure of b Quark Polarization at LEP. Phys. Lett. B 1993, 299, 345–350.
  41. Tian, J.; Xiong, R.; Shen, W. State-of-Health Estimation Based on Differential Temperature for Lithium Ion Batteries. IEEE Trans. Power Electron. 2020, 35, 10363–10373.
  42. Lin, C.; Xu, J.; Shi, M.; Mei, X. Constant Current Charging Time Based Fast State-of-Health Estimation for Lithium-Ion Batteries. Energy 2022, 247, 123556.
  43. Lin, M.; Wu, D.; Meng, J.; Wu, J.; Wu, H. A Multi-Feature-Based Multi-Model Fusion Method for State of Health Estimation of Lithium-Ion Batteries. J. Power Sources 2021, 518, 230774.
  44. Fan, Y.; Xiao, F.; Li, C.; Yang, G.; Tang, X. A Novel Deep Learning Framework for State of Health Estimation of Lithium-Ion Battery. J. Energy Storage 2020, 32, 101741.
Figure 1. Battery aging cycle schemes and capacity degradation profiles. (a) Completed aging cycle test for voltage, current and temperature. (b) Battery capacity degradation curves of the eight batteries.
Figure 2. Comparison of initial temperature curves and processed by SG.
Figure 3. Comparison between original and processed curves. (a) DTV curves: Curve 1 is the initial DTV curve, Curve 2 is the DTV curve after filtering temperature, Curve 3 is the DTV curve after filtering temperature and DTV. (b) Differential of DTV curves: The red curve is the initial DTV differential; the green curve is the SG processed DTV differential.
Figure 4. Aging Evolution of DTV curves.
Figure 5. DTV curves and feature variables of the battery #1 under different cycles. (a) Features extraction from DTV curves under different cycles. (b) Extraction features of FV1, FV3 and FV5 with the cycle number. (c) Extraction features of FV2, FV4 and FV6 with the cycle number. (d) Extraction features of FV7 with the cycle number.
Figure 6. Pearson correlation analysis of capacity and feature values for battery #1.
Figure 7. Timeseries construction of training dataset.
Figure 8. Framework of the proposed RUL and SOH estimation model.
Figure 9. Validation of early SOH and RUL estimation ability of the proposed model. (a) Estimation results for battery #1 with split portion of 1:9. (b) Estimation results for battery #1 with split portion of 2:8. (c) Estimation results for battery #1 with split portion of 3:7. (d) Box-plot analysis for battery #1 for three proportions.
Figure 10. Validation of robustness of the proposed model. (a) Estimation results for battery #3. (b) Estimation results for battery #4. (c) Estimation results for battery #7. (d) Estimation results for battery #8. (e) Box-plot analysis for four batteries. (f) RMSE and MAE analysis for four batteries.
Figure 11. SOH and RUL estimation results of the proposed model using singular value as input features. (a) Estimation results for battery #3. (b) Estimation results for battery #4. (c) Estimation results for battery #7. (d) Estimation results for battery #8. (e) Box-plot analysis for four batteries. (f) RMSE and MAE analysis for four batteries.
Table 1. Technical specifications and battery test procedures of the eight batteries.
| Technical Specifications | | Cycling Tests | |
|---|---|---|---|
| Anode material | Graphite | Charge test | CC charge at 2C |
| Cathode material | LCO/NCO | | |
| Nominal capacity [Ah] | 0.74 | | |
| Nominal voltage [V] | 3.7 | | |
| Discharge cutoff voltage [V] | 2.7 | Discharge test | Artemis drive cycle discharge |
| Charge cutoff voltage [V] | 4.2 | | |
| Weight [g] | 19.5 ± 0.5 | | |
Table 2. Pearson correlation coefficients between feature variables and capacities.
| Battery Label | FV1 | FV2 | FV3 | FV4 | FV5 | FV6 | FV7 |
|---|---|---|---|---|---|---|---|
| #1 | 0.87 | 0.96 | 0.93 | 0.96 | 0.98 | 0.95 | 0.99 |
| #3 | 0.91 | 0.92 | 0.94 | 0.96 | 0.96 | 0.91 | 0.97 |
| #4 | 0.96 | 0.95 | 0.98 | 0.96 | 0.97 | 0.95 | 0.99 |
| #7 | 0.95 | 0.94 | 0.89 | 0.91 | 0.98 | 0.92 | 0.94 |
| #8 | 0.89 | 0.95 | 0.91 | 0.96 | 0.98 | 0.94 | 0.98 |
Table 3. Specific hyperparameters and structure of model.
| Criteria | Values |
|---|---|
| Species of layers | (LSTM, Dropout, LSTM, Dropout, Dense) |
| Units of layers | (64, -, 64, -, 1) |
| Dropout | 20% |
| Activation Function | (tanh, -, tanh, -, sigmoid) |
| Sequence length | 2 |
| Batch Size | 2 |
| Epoch | 100 |
| Optimizer | RMSprop |
| Learning rate | 0.0012 |
| Training loss | Mean squared error |
Table 4. Computation cost of training and prediction.
| Battery Label | Data Start | Data Split Portion | Feature | Training Time (s) | Prediction Time (s) |
|---|---|---|---|---|---|
| #1 | 0% | 1:9 | 2, 4, 5 | 19.047 | 0.698 |
| | 0% | 2:8 | 2, 4, 5 | 20.661 | 0.637 |
| | 0% | 3:7 | 2, 4, 5 | 21.800 | 0.611 |
| | 0% | 2:8 | 7 | 19.979 | 0.576 |
| | 20% | 2:8 | 2, 4, 5 | 20.119 | 0.741 |
| #3 | 20% | 2:8 | 2, 4, 5 | 18.729 | 0.582 |
| | | | 7 | 18.234 | 0.517 |
| #4 | 20% | 2:8 | 2, 4, 5 | 16.367 | 0.706 |
| | | | 7 | 17.450 | 0.561 |
| #7 | 20% | 2:8 | 2, 4, 5 | 19.079 | 0.571 |
| | | | 7 | 19.000 | 0.577 |
| #8 | 20% | 2:8 | 2, 4, 5 | 18.961 | 0.552 |
| | | | 7 | 18.493 | 0.525 |
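Table 5. RMSE of SOH estimation result.
| Method | Data Start | Data Split Portion | Feature | RMSE (%) | | | | | | | |
| | | | | #1 | #2 | #3 | #4 | #5 | #6 | #7 | #8 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ours | 0% | 1:9 | 2, 4, 5 | 0.714 | - | - | - | - | - | - | - |
| | | 2:8 | | 0.473 | - | - | - | - | - | - | - |
| | | 3:7 | | 0.363 | - | - | - | - | - | - | - |
| | | 2:8 | 7 | 0.582 | - | - | - | - | - | - | - |
| | 20% | 2:8 | 2, 4, 5 | 0.433 | - | 0.390 | 0.324 | - | - | 0.347 | 0.415 |
| | | | 7 | - | - | 0.373 | 0.308 | - | - | 0.485 | 0.480 |
| SVR [41] | 0% | Leave-one-out cross validation | dT | 1.43 | 2.81 | 1.08 | 1.50 | 3.62 | 1.17 | 1.80 | 1.39 |
| | | | ICA | 0.65 | 0.97 | 0.45 | 0.41 | 3.51 | 0.23 | 0.40 | 0.49 |
| Random forest [42] | | | CCCT | 0.47 | 0.97 | 0.36 | 0.44 | 0.44 | 0.37 | 0.66 | 0.42 |
| SVM [43] | | | Voltage, Temperature, ICA | 0.85 | 2.33 | 0.75 | 0.58 | 0.64 | 0.93 | 0.59 | 0.57 |
| MLR [43] | | | | 0.84 | 2.02 | 0.73 | 0.55 | 0.60 | 0.97 | 0.37 | 0.43 |
| GPR [43] | | | | 0.94 | 2.11 | 0.81 | 0.45 | 0.51 | 0.94 | 0.53 | 0.62 |
| Fusion of SVM MLR GPR [43] | | | | 0.70 | 0.79 | 0.57 | 0.42 | 0.48 | 0.86 | 0.51 | 0.51 |
| GRU [44] | | Train: #1 #2 #3 #5 #6 #7; Test: #4 #8 | Voltage | - | - | - | 1.43 | - | - | - | 0.88 |
| CNN [44] | | | | - | - | - | 1.24 | - | - | - | 1.56 |
| GRU-CNN [44] | | | | - | - | - | 0.61 | - | - | - | 0.65 |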
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wang, W.; Zhang, L.; Yu, H.; Yang, X.; Zhang, T.; Chen, S.; Liang, F.; Wang, H.; Lu, X.; Yang, S.; et al. Early Prediction of the Health Conditions for Battery Cathodes Assisted by the Fusion of Feature Signal Analysis and Deep-Learning Techniques. Batteries 2022, 8, 151. https://doi.org/10.3390/batteries8100151

APA Style

Wang, W., Zhang, L., Yu, H., Yang, X., Zhang, T., Chen, S., Liang, F., Wang, H., Lu, X., Yang, S., & Liu, X. (2022). Early Prediction of the Health Conditions for Battery Cathodes Assisted by the Fusion of Feature Signal Analysis and Deep-Learning Techniques. Batteries, 8(10), 151. https://doi.org/10.3390/batteries8100151
