1. Introduction
Lithium-ion batteries (LIBs) are utilized for electric vehicles (EVs), space shuttles and smart grid as an efficient secondary power storage device [
1,
2,
3]. However, the performance of LIBs will deteriorate with continuous usage [
4,
5,
6]. When the available capacity of a battery drops below 80% of its initial capacity, its risk of failure increases significantly. Therefore, accurate estimation of the remaining useful life (RUL) and prediction of state of health (SOH) of high energy electrodes under dynamic operating conditions are important to improve the safety and reliability of commercial batteries.
Currently, there are three main approaches for battery SOH and RUL prediction, including model-based methods, data-driven methods and feature signal analysis methods, respectively [
7,
8,
9].
The model-based methods make prediction by constructing battery models that are either mechanistic, empirical, semi-empirical or a fusion of mechanistic and empirical formulation. An empirical exponential growth model for LIBs impedance degradation is considered, using standard particle filtering (PF) and resampling techniques to fit the degradation datasets [
10]. A simplified equivalent circuit model was utilized and the Gauss-Hermite particle filter technique was applied to track the capacity values, which is an extension of PF technique [
11]. A reformulated pseudo 2D (P2D) model was developed for RUL predictions to reduce model complexity [
12]. The model-based methods have the highest interpretability and can observe enough set of intermediate variables, but the establishment process is relatively complex. Furthermore, the accuracy of model is easily affected by variable current and temperature under the influence of different external conditions. Though the high-fidelity battery model could effectively improve the estimation accuracy, it will greatly increase the complexity of operations as well so that it is not suitable for vehicle-end microcontroller computing environment [
13,
14].
The data-driven model depends on historical observations directly to approach the SOH prediction. Data-driven methods have higher fitting capability for non-linear data and wider application fields, which have become increasingly popular for capacity estimation of LIBs in recent years [
15,
16,
17]. The data-driven methods rely entirely on experimental data and usually does not require much knowledge of internal mechanisms of battery. Capacity estimation could be performed by learning the complex dependence of battery capacity on features extracted from measurements such as voltage, current and temperature. Liu et al. proposed a data-driven approach to forecast capacity fade trajectory, extracting features with physical meanings from discharge voltage curves [
18]. Severson et al. utilized machine-learning tools to both predict and classify cells by cycle life, making use of discharge voltage curves from early cycles to exhibit capacity degradation [
19]. A framework-combined deep convolution neural network (DCNN) with double-layer long short-term memory (LSTM) was proposed to approach battery SOH estimation and RUL prediction [
20]. The data-driven methods have the strongest implement ability, but highly dependent on the quality of the input data features and relatively less interpretable. Due to the nature of timeseries of battery degradation data, the LSTM is well suitable for prediction of battery RUL and in this paper the LSTM network is utilized as the core.
Feature signal analysis methods, based on electrochemical analysis techniques and signal processing techniques, are implemented to provide certain signal characteristics to approach analysis of battery degradation process. Several methods such as incremental capacity analysis (ICA), Ref. [
21] differential voltage analysis (DVA), Ref. [
22] electrochemical impedance spectroscopy (EIS) [
23] and differential thermal voltammetry (DTV) [
24] have been proposed to extract suitable and reliable battery degradation characteristics. In error analysis, the curve parameters such as peak and valley positions, peak heights and areas are usually compared and selected as indirect features, which have a significant correlation with battery capacity fade. The incremental capacity (IC) and IC peak area analysis are employed to assess the consistency of battery module, which is helpful for rapid evaluation and screening of retired batteries for second usage [
25]. A synthetic approach based on specific electrode behavior with proper adjustment was proposed to simulate various scenarios of battery degradation modes [
26]. Weng et al. extended the ICA-based SOH monitoring approach from single cells to battery modules and an IC peak tracking approach based on ICA principles was proposed to achieve on-board implementation [
27]. An incremental capacity analysis was applied to identify various contributions to capacity fade [
28]. These signal analysis methods focus on extracting information from signals to analyze the battery health status, usually used as an auxiliary technical tool in conjunction with model-based methods or data-driven methods whose accuracy requires high sampling conditions, data transmission rate and data pre-processing.
Recently, with the rapid increase in the amount of available data and computing power, deep-learning methods have seen wide applicability in search engines, data mining, machine translation, speech and image processing, etc. The electrode degradation estimation model built based on deep-learning methods has strong generalization and the modeling process does not require complex knowledge of the internal mechanism of battery, making it easy to obtain more accurate estimation. However, similar to other data-driven methods, deep-learning methods are also less interpretable and highly dependent on the quality of the input data. These drawbacks, however, could be compensated by training the model with diagnostic data that strongly correlate with battery degradation mechanisms and SOH. Cui et al. proposed an online SOH estimation framework based on ICA and Time Pattern Attention Mechanism Long Short-Term Memory (TPA-LSTM) network [
29]. Pradyumna et al. using impedance curves obtained from an electrochemical impedance spectroscopy (EIS) test and a convolutional neural network (CNN) to estimate capacity [
30]. The DTV method, proposed by Wu et al. in 2015, Ref. [
24] is a feature signal analysis method which could bridge microscopic phase transition characteristics of a battery with its and macroscopic thermal and electrical characteristics. The phase transition occurs in electrodes and causes a change in entropy, which is a function of temperature, and thus leads to a change in temperature. Therefore, the DTV analysis method establishes a link between macroscopic measurable physical quantities and microscopic characteristics of the electrode, reflecting the information on lithium ions involved in electrochemical reactions at different aging stages. Compared with other commonly used methods, such as EIS and ICA, DTV is more suitable as a diagnostic method for practical applications. Since the method requires only cell voltage measurements and surface temperature. It does not require accurate current measurements such as ICA or additional hardware such as EIS. Combined with DTV analysis method, the deep-learning method could become an effective and practical tool for tracking the health conditions of batteries.
In this paper, the fusion of data-driven methods and feature signal analysis methods are proposed to realize early prediction of the SOH and RUL of high-energy density lithium-ion battery cathodes. To validate the proposed method, a data-driven model with LSTM network as the core and DTV feature signal analysis are utilized to construct the fusion model. Firstly, the DTV feature signal analysis is executed based on battery degradation data. With Savitzky-Golay (SG) filter method and Pearson correlation analysis, highly correlated features are extracted based on battery degradation datasets of Oxford University, including information of valleys and peaks from DTV curve and singular values of DTV vector, respectively. Next, a deep-learning model with LSTM network as the core is constructed and trained with these features. Furthermore, the dropout technique is introduced to solve the overfitting problems and the Monte Carlo simulation is utilized to characterize the prediction uncertainty. Finally, the model validation and error analysis are performed. This work demonstrate that features extracted from DTV curve could be utilized in deep-learning model to improve and optimize long-term SOH and RUL prediction based on the early degradation data. In framework of CHAIN, the proposed model could approach accurate offline SOH and RUL estimation or real-time online prediction combined with local fast empirical model, promoting the development of the next-generation cloud-vehicle synergistic management system [
31,
32,
33,
34].
3. Methodology
This section mainly describes the framework of the proposed deep-learning method integrated of DTV analysis method and data-driven method.
3.1. LSTM Network Structure
LSTM neural network is a variant of RNN, which is suitable for long-term dependence problem and can effectively solve the gradient disappearance and gradient explosion problems. The LSTM introduces unit states and uses three types of gates, input gate, forget gate and output gate, to hold and control information. The main feature of the LSTM neural network is that there is a main chain of information recorded through the units, using the sigmoid function
and the hyperbolic function
in each network unit, respectively [
39]. Each unit of LSTM is described as follows:
The first step is to use the forget gate to determine which information is discarded from the unit state
when the LSTM unit obtains the data for
and
. The sigmoid activation function of the forgetting gate will output a number between 0 and 1, where “1” means that the status value is completely retained and “0” means that the value is completely discarded. This number will be utilized for update of unit status
. The forget gate can be expressed as:
Secondly, a sigmoid activation function and a
activation function are used to form the input gate. The sigmoid function decides which values to update and then uses the tanh layer to obtain new information, consequently creating a new candidate state to participate in the update of unit state. These two steps could be described as follow:
The third step is the fusion of data from the forget and input gates. Next, the unit state changes from
to
:
The final output gate generates a new hidden state
based on the unit state
.
where the
represents the weights of layer, b the deviations,
the output of last cycle and
the current output.
Since the battery degradation data is time-series data, recurrent neural networks (RNN) are well suited for the battery SOH and RUL prediction problems. Furthermore, battery degradation issue is a long-term dependency problem and general RNNs are only capable of solving short-term dependency problems. The problems of gradient disappearance and gradient explosion cannot be avoided as well, for which the LSTM neural network is utilized as the main body of the model in this paper.
3.2. Component and MC Simulation
In this paper, the optimization algorithm based on the RMSprop method is used to train the LSTM network. The RMSprop algorithm has been empirically proven to be an effective and practical algorithm for deep-learning neural network optimization. It works very well for RNNs, and is one of the frequently adopted optimization methods nowadays.
Neural networks suffer from overfitting problem, a phenomenon where the model performs well on the training datasets but poorly on the test datasets. The overfitting phenomenon will make the generalization ability and robustness of model deteriorate. Srivastava et al. proposed a dropout technique to solve overfitting problem more effectively [
40]. During the training process, the dropout technique randomly removes neurons and their connections from the neural network. Whenever a dropout process is executed, it is equivalent to obtaining a smaller network from the original network consisting of the neurons left. Thus, training a neural network with the dropout technique can be considered as training many sparse networks with extensive weights sharing, where each sparse network is rarely trained. The network becomes less sensitive to the specific weights of the neurons, which allows the model to be more generalizable and not too dependent on certain local features.
Both the predicted values and their confidence levels are important parameters for assessing the performance of the predictor. The LSTM RNN is able to learn the long-term correlation of time series data and predict multi-step advanced data points, but it cannot provide prediction confidence. Therefore, we introduce MC simulation methods to characterize the prediction uncertainty. MC simulations require a numerical algorithm to describe the system. All information about the degradation process of battery performance is contained in the capacity datasets. Therefore, the LSTM RNN trained based on capacity data is defined as the numerical algorithm implemented by the battery degradation process, that is, the output of our LSTM model is utilized as the input data of MC simulations.
3.3. Input and Output Data Structure
The specific structure of our inputs and outputs is shown in
Figure 7.
Where FV represents features, M the batch size, SOH the capacity and N the sequence length. The timeseries are constructed using the extracted features and the input is constructed as a three-dimensional timeseries tensor. The extracted feature variables are first constructed into a two-dimensional matrix. Next, two-dimensional matrix fragments are extracted from the above two-dimensional matrix according to the sequence length N. Next, a series of two-dimensional matrix fragments are constructed into a three-dimensional tensor as the final input according to a given combination of batch sizes M. The features of each two cycles are used as a time series to predict the capacity of the second of these cycles, and the two adjacent timeseries will have an overlap of data points to make full use of our data.
3.4. Framework of the Proposed RUL and SOH Estimation Model
The specific framework of LSTM-based battery SOH and RUL prediction proposed in this paper is shown in
Figure 8. The framework is divided into four parts: preprocessing, filtering and feature extraction, model construction and training, validation and error analysis.
In the first and second part, the data preprocessing and feature extraction are executed. As described in detail in
Section 2, two kinds of features are extracted to train and test the deep-learning model, including information of valleys and peaks from DTV curves and singular values of DTV vector, respectively. In the third part, a deep-learning model with LSTM network as the core is constructed. The three-dimensional timeseries tensor obtained in the second part is utilized as input for training our model, and the RMSprop technique is used to improve the convergence speed during the training process. Furthermore, the Dropout technique is introduced to suppress overfitting and MC simulation is introduced to give uncertainty-related information in the computation process. Finally, the proposed battery health condition prediction model is quantitatively analyzed and validated. The mean absolute error (MAE) analysis, root mean square error (RMSE) analysis, and box plot analysis are utilized and compared. Furthermore, the proposed model is verified on the other four cells.
4. Result and Discussion
In this section, the validation and error analysis of our proposed model are executed. Different division proportions of the dataset are investigated firstly to ensure accuracy with a minimum amount of data, that is, approaching prediction of battery degradation early as soon as possible. Next, the robustness and stability of our model are proved and the accuracy of prediction results using singular values as input features of the model is calculated.
4.1. Experiment Setup
We use TensorFlow to build the whole network and use the compute unified device architecture (CUDA) to accelerate computation. The version of TensorFlow is 2.5.0 and the version of CUDA is 11.2. The overall experiment is performed on a server with AMD EPYC 7543 32-Core Processor CPU and NVIDIA A40 GPU, and the operating system is ubuntu 18.04.
The accuracy of the model and the training process are greatly affected by the hyperparameters in the LSTM network, which are pre-given rather than learned by the network. Batchsize determines the number of samples selected by the model in one training. The value of Batchsize will affect the optimization degree and speed of the model. A larger Batchsize will speed up the training speed, but it is easy to make the model converge at the local optimum point. After debugging, the Batchsize is selected as 2 is optimal. The value of the learning rate has a great impact on the training process. A large learning rate will cause the gradient to oscillate and may even fail to converge and a small learning rate will lead to a very slow convergence rate. After debugging, the learning rate is selected as 0.0012 as the best. Epoch represents the training times of all sample data. A large epoch will lead to slow model training and overfitting and a small epoch may lead to underfitting. In the experiment, it is found that convergence can be achieved by setting the epoch to 100, continuing training may lead to overfitting. Therefore, set the epoch to 100. The number of nodes in the network directly determines the performance of the network, too few nodes may lead to insufficient information processing capability of the network, which may lead to underfitting and too many neurons may lead to the limited amount of information contained in the training set being insufficient to train all neurons in the hidden layer, leading to overfitting. We debug in the range of 16–128, and the results show that 64 is optimal. Dropout rate is the probability of discarding neurons, too low probability has limited effect, while too high probability will lead to insufficient network training. After debugging, it is found that 0.2 is the best. The sequence length represents the length of the input sequence or the size of the input window, and sequence stride represents the step size of each sliding of the window. Due to the small amount of training data, in order to make full use of the training data, we set the sequence length to 2, and the sequence stride set to 1. The specific model structure and hyperparameters are shown in
Table 3.
4.2. Validation of Early Prediction Ability
In this subsection, the variation of SOH and RUL prediction results with different division proportions of the dataset are investigated to verify the early prediction ability of the proposed model. The Red dotted line represents the predicted SOH and the green dotted line represents the real SOH. The division proportion is demonstrated with the black lines. The purple area is generated with Monte Carlo simulation to obtain the probability distribution of RUL. Three division proportions of dataset are executed in this subsection, including 10%, 20% and 30%, respectively. As shown in
Figure 9a, the error of prediction is 0.714% when using 10% of data for training. The RUL is 4600 cycles, and the probability density of battery RUL is distributed between 4079 and 4194 cycles.
Figure 9b shows that the error of prediction is 0.473% when using 20% of data for training. The RUL is 3500 cycles and the probability density of battery RUL is distributed between 3254 and 3312 cycles. The precision is significantly higher than the results using 10% of data for training.
Figure 9c shows that the error of prediction is 0.363% when using 30% of data for training. The RUL is the 2800 cycles, and the probability density of battery RUL is distributed between 2820 and 2856 cycles.
Figure 9d shows the box plot of the predicted results, which avoids the effect of extreme outliers and provides a visual table of the error distribution conditions with small rectangles in each box representing the MAE. The box plot could show the characteristic of the error distribution through maximum, minimum, median, upper quartile and lower quartile. With the proportion of the training set increasing, the upper quartile gradually decreases from 1.1% to 0.75% and to 0.5%, and the lower quartile gradually decreases from 0.4% to 0.25% and to 0.15%. The MAE also decreases from 0.8% to 0.6% and to 0.3%.
When using 30% of data for training, though the error is lower and the probability density distribution of RUL is closer to the real failure cycle, the predicted RUL interval is distributed after the actual failure cycle. The accuracy of prediction result using 20% of data for training is slightly lower than that of using 30% of the results—and the predicted RUL intervals are distributed before the actual failure cycle. At the same time, under the strict condition that only 10% of the data is used for training, and the dataset is in 100 cycles, which may lead to some missing information and have an impact on the accuracy. However, the model accuracy is still decent with a prediction error of 0.714%. Therefore, the results indicate that our method could achieve early SOH estimation and RUL prediction, that is, the proposed model could ensure high enough accuracy with the least amount of data for training.
4.3. Validation of Robustness
In this subsection the robustness and reliability of the proposed model is investigated. Some of the eight battery capacity data did not reach below EOL, making it impossible to derive RUL predictions. Therefore, the battery #3, #4, #7 and #8 are selected for verification. The red dotted line illustrates the predicted SOH and the green dotted line illustrates the real SOH. With Monte Carlo method, the probability distribution of RUL is obtained as the purple part. In these validations, the first 20% of the data are used as training dataset, and the cycle after is used as the starting cycle for prediction.
Figure 10 shows the validation of the robustness and reliability of the model using the commonly chosen information of peaks and valleys as features. The proposed method is validated on all four cells and the prediction errors are distributed around 0.4%. As shown in
Figure 10a, the RUL is 2900 cycles, and the probability density of battery RUL is distributed at 2700 cycles for battery #3.
Figure 10b shows that the RUL is 2500 cycles and the probability density of battery RUL is distributed between 2418 and 2469 cycles for battery #4.
Figure 10c shows that the RUL is 4300 cycles, and the probability density of battery RUL is distributed between 3932 and 3968 cycles for battery #7.
Figure 10d shows that the RUL is 3000 cycles and the probability density of battery RUL is distributed between 2950 and 2984 cycles for battery #8. It could be seen that the predicted RUL probability densities do not cover the actual effectiveness cycle very precisely, but they are distributed in a small interval close to practical failure cycle. The reason for not covering the practical failure cycle may be that the original dataset is in 100-cycle increments, which may result in some information missing.
As shown in
Figure 10e,f, the prediction results with the data of 20% cycles lost could produce a prediction mean absolute error of around 0.4%, which is close to 0.473% without data lost. Furthermore, the lower and upper quartiles and MAE of box line plot are distributed around 0.25%, 0.6% and 0.75%, respectively, further proving the robustness and stability of our model.
4.4. Predict Result with SVD Feature
In this subsection the singular values are utilized as the input features for training and the four batteries are utilized to verify the verify the proposed model. Furthermore, all of the prediction results of SOH and RUL are obtained with data of 20% cycles lost. The red dotted line and the green dotted line represent the predicted SOH and real SOH, respectively. The purple area demonstrates the probability distribution of RUL, obtained with Monte Carlo simulations.
Figure 11 shows the prediction results using the singular values of DTV vector as features. As shown in
Figure 11a–d, the prediction results with data of 20% cycles lost could produce a prediction mean absolute error of around 0.4%, which shows that the accuracy of prediction using singular values of DTV vector as features is slightly lower. However, the singular values can be extracted directly from the original DTV data without complicated operations such as filtering, and the accuracy is high enough for practical application. Furthermore, the predictions in this section using the singular value as features are taken as the starting point of 20%, which proves the robustness and reliability of using singular values as features as well.
Figure 11e,f shows the results of the box plot and the error analysis of the model. The lower quartiles are all distributed around 0.3% and both MAE and RMSE are distributed at the same level. The upper quartile using singular values as features for battery #7 and #8 reaches 0.8%, and the MAE and RMSE reach around 0.6% and 0.7%, respectively. It can be seen that it is feasible to extract the singular values of DTV data as features for prediction, and could make the process simpler with comparable accuracy.
Figure 11 shows the results of the prediction using the singular values of DTV as features and the error is slightly larger than the results for other batteries, which is caused by the inconsistency of batteries.
Generally, the singular values have a strong correlation with the capacity data, which could encompass the information of peaks and valleys as an independent set of feature inputs. The prediction results using singular values as input features could produce a tolerable error. Furthermore, the utilization of singular value decomposition for DTV enables the use of simpler data processing and feature extraction methods to extract strongly correlated features. The singular value could replace the peak-valley information, reducing the size of input matrix and improving the efficiency of the model.
4.5. Validation of Computation Cost and Compared with Other Methods
Table 4 shows the computation cost of training and prediction. As shown in
Table 4. The training time is about 20 s, and the prediction time is about 0.5 s. The results show that the method proposed by us has low calculation consumption and high calculation efficiency. The computation cost of deep learning models in training is usually huge, and will increase dramatically with the increase of data volume. The high calculation efficiency of our proposed method is benefit from extracting features from DTV data, which reduces the amount of data, and using correlation analysis methods to further filter features, which reduces the number of features. At the same time, the amount of data we use is inherently small, and we have achieved high-precision prediction under the premise of using small amounts of data, which further proves the superiority and effectiveness of our proposed method.
Table 5 shows the prediction results of the proposed method in this paper and compared with other works that have also conducted studies based on Oxford battery dataset. This research have utilized traditional machine learning algorithms and some hybrid algorithms to build battery degradation models. Most of their input features come directly from voltage and temperature data or use ICA methods to extract features, rather than DTV data used in this paper. The model with support vector regression (SVR) as the core is constructed, with dT and ICA as the features, respectively [
41]. A model with random forest as the core was constructed, and constant current charging time (CCCT) was taken as the feature [
42]. Support vector machine (SVM), multiple linear regression (MLR) and gaussian process regression (GPR) are used as the core to build models, respectively, and the three models are fused through the random forest algorithm, with voltage, temperature and ICA as the characteristics. Ref. [
43] establish models with CNN and gate recurrent unit (GRU) as the core, respectively, and the final output of the two models concatenated and projected to the output to build a GRU-CNN model, and the features are extracted from voltage. Ref. [
44] compared with most prediction methods in the table, the method designed in this paper has higher SOH prediction accuracy. It benefits from the input characteristics provided by DTV analysis and the excellent ability of LSTM network to solve long-term dependent time series problems. At the same time, we use less data as the training set, which further proves the superiority of this method. The comparison results show that the designed SOH prediction method has good prediction performance.
5. Conclusions
In this paper, the fusion of a data-driven method and a feature signal analysis approach is investigated to predict the SOH as well as RUL of battery cathodes. The DTV feature signal analysis is calculated based on the collected battery voltage and temperature information. Two kinds of features are extracted as the input features of the data-driven model from DTV data, including the singular values of the DTV vector and the peaks and valleys information of DTV curves, respectively. Subsequently, a deep-learning model is constructed with LSTM network as the core. Furthermore, dropout method is utilized to suppress overfitting and MC method is introduced to give uncertainty-related information.
Different division proportions of the dataset are investigated firstly. The results show that the prediction accuracy improves significantly with the increase of training set. However, 20% of the data ensures both accuracy and uses the relatively less amount of data with the prediction error of SOH maintained 0.473%, which is the best choice as a training set during early prediction. Next, the robustness and reliability of the proposed model are validated by removing the first 20% of the data for different start-up cycles. The prediction error of SOH is 0.433%, which is very close to the prediction error of 0.473% without data loss. Finally, the singular values are utilized as input features with the prediction SOH error of 0.582%, which is slightly higher than the error of 0.473% but can reduce the complexity of data processing while ensuring the prediction accuracy.
The main contributions of this work could be described as follows. (1) The DTV curve, which can characterize the phase-transition electrodes, is obtained and used to analyze the battery degradation process by cleaning, reducing, filtering and processing the raw voltage and temperature data. (2) Two kinds of features are extracted, and the singular values are simpler, reducing the cost and decreasing the computational complexity. (3) Only a small portion of data from early battery cycles is used for model training to achieve high accuracy of battery RUL prediction. (4) The robustness and reliability are investigated and validated with the data of 20% cycles lost. Generally, the proposed LSTM-based model has good early prediction accuracy and robustness, proving that the fusion of data-driven methods and feature signal analysis methods has good prospects for application in electrode health conditions estimation. This work could improve the prediction accuracy and has potential to approach cloud-end highly accurate offline prediction and achieving local-end real-time accurate estimation of SOH and RUL combined with local fast model in CHAIN.