1. Introduction
As the number of vehicles powered by fossil-fuel-burning internal combustion engines has increased, carbon dioxide emissions have also increased, and the climate change problem has emerged as a global topic. To overcome this problem, studies on electric vehicles (EVs) to replace internal combustion engine vehicles have been gaining momentum [
1]. Lithium-ion batteries (LIBs) are used as an energy source in EVs because of their advantages, which include a high energy density, long life, and high efficiency [
2]. A LIB may overheat as a result of over-charging and over-discharging depending on various driving conditions for the EV, causing fires and explosions [
3,
4]. In other words, the state of a LIB has a significant effect on the safety of an EV, which implies that real-time estimation of the battery state is imperative [
5]. Real-time estimation of the battery state refers to estimations of its state of health (SOH) and state of charge. Among these, its SOH is an important indicator of battery life and is defined as a percentage of the capacity of the fully charged battery at the current point in time relative to the initial capacity [
6]. As a battery’s deterioration progresses, its lifespan decreases, reducing its safety [
7]. Therefore, its SOH is an indicator that is directly related to the safety of the battery, and it is crucial to estimate it accurately [
8,
9,
10]. The performance of a LIB deteriorates as a result of a variety of conditions, such as repeated charging and discharging, its internal temperature, and charging and discharging current [
11,
12]. Therefore, it is difficult to clearly formulate a method for determining the performance degradation of a battery. It is necessary to develop an algorithm that can accurately estimate the SOH [
13].
Methods for estimating the SOH of a battery include model- and data-based estimation methods. Model-based estimation methods use an electrochemical battery equivalent circuit model and adaptive filters to estimate the SOH [
14]. The adaptive filters used include the Kalman filter (KF) and particle filter (PF). The KF uses a recursive method to compare the previous estimate and the current measured value to reduce errors, thereby increasing the estimation accuracy [
15]. However, a regular KF is used in linear models and is not suitable for battery models that include nonlinearity. Therefore, Sepasi et al. [
16] estimated the SOH by linearizing a nonlinear system with an extended Kalman filter (EKF). However, the EKF produces an estimation error because the state variable is estimated by linearizing the nonlinear system through a Jacobean matrix [
14]. Chen et al. [
17] used an unscented Kalman filter (UKF) to estimate the SOH. The UKF facilitates more accurate SOH estimation because the state variable is estimated relatively error-free by using approximates without obtaining the Jacobian matrix for linearization, unlike the EKF. In the case of the aforementioned EKF and UKF, the noise must satisfy a Gaussian distribution. In contrast, the PF method is suitable for estimating the state of a LIB, which is a nonlinear and non-Gaussian system because of various electrochemical reactions [
18]. The PF finds random particle samples with weights that are related to the post-probability density. When this process is repeated, the weight tends to become 0 [
19]. To solve this problem, Datong et al. [
20] estimated the SOH by providing a distribution closer to the actual post-distribution through an unscented particle filter (UPF) that combined the UKF and PF algorithms. However, there were limitations in solving physical and chemical changes and various noises in the LIB model [
21].
Meanwhile, data-based methods estimate the SOH by using a machine learning approach without requiring the analysis of electrochemical reactions [
22]. Recently, many studies have been conducted using data-based methods. Nuhic et al. [
23] analyzed data to find patterns and estimated the SOH using the support vector machine, which is an algorithm for approximating nonlinearities. Datong et al. [
24] used the relevance vector machine to improve the accuracy and reliability of a battery’s SOH estimation. Li et al. [
25] estimated the SOH based on ensemble learning by extracting six deterioration features from battery data. Furthermore, Ren et al. [
26] proposed an algorithm to estimate the SOH by combining a deep neural network and an autoencoder. You et al. [
27] estimated the SOH based on recurrent neural networks (RNNs) using the voltage and current data over time. In RNNs, the memory used to remember the sequences in long inputs becomes shorter, which may cause the vanishing and exploding gradient problem. In previous studies, general data preprocessing techniques (e.g., Normalization) and mathematical battery models that are generally applied to data preprocessing were used. These limit the SOH accuracy of LIB with nonlinearity. To preprocess the data with such nonlinearities, Maher et al. [
28] estimated the SOH by applying continuous wavelet transform (CWT) to the voltage profile of the battery. Jun Xu et al. [
29] estimated SOH by applying discrete wavelet transform (DWT). The CWT facilitates the extraction of useful features related to battery deterioration by converting one-dimensional (time) battery data into two-dimensional (time-frequency domain) data. The DWT facilitates the extraction of electrochemical features from signals with a broad frequency range through multi-resolution analysis.
In this study, we proposed a preprocessing method suitable for SOH estimation using both CWT and DWT. In the case of CWT, it was possible to increase the estimation accuracy. In the case of DWT, since a neural network was not used in existing papers, a SOH estimation method was proposed using LSTM. To confirm the usefulness of the proposed algorithm, the SOH estimation technique by normalization and SOH estimation results were compared. From this, it was found that the SOH estimation accuracy using the wavelet technology proposed in this study was high. It was confirmed that the proposed algorithm is more suitable for data with nonlinearity than the conventional preprocessing method. Furthermore, regarding the impacts of the specific wavelet-related methods, we found that it was more appropriate to apply the DWT methods to the LSTM than to the CNN and CWT.
The remainder of this paper is organized as follows.
Section 2 describes the data composition.
Section 3 describes CWT and DWT, which are preprocessing methods, the CNN and LSTM methods, which are machine learning methods, were explained, as well as the neural network model structure for SOH estimation.
Section 4 compares the SOH estimation performances of the LSTM and CNN models to verify the accuracy of the proposed model, the SOH estimation performance of the proposed method was examined, and its validity was determined.
2. Data Composition
An actual EV battery operates in various driving environments. Therefore, to obtain battery data similar to the actual EV driving environment, this study utilized a randomized battery usage dataset measured at the NASA Ames Prognostics Center of Excellence [
30]. The extension of the data is .mat, and the size of the entire data is about 1 GB.
Table 1 shows the characteristics of the LIB. Here, the random walk profile consists of a random current in the range −4.5 to 4.5 A. A random current (−4.5, −3.75, −3, −2.25, −1.5, −0.75, 0.75, 1.5, 2.25, 3.0, 3.75, and 4.5 A) is used for charging and discharging. A negative or positive current indicates charging or discharging, respectively. A random current was applied for 5 min to end one cycle, and a break of 1 s was repeated. For safe operation, the maximum and minimum threshold voltages were 4.2 and 3.2 V, respectively. Additionally, if the voltage is exceeding the threshold, the 1 step is finished within 5 min. The random walk continues for 1500 steps. After RW, the reference charge and reference discharge are performed twice.
Figure 1 shows the whole process of data cycles. The discharge reference is used to obtain the capacity. Therefore, this paper uses reference discharge for SOH estimation.
Figure 2 shows the measurements of the first 50 RW steps and the last 50 RW steps of the RW9 cell. The actual measurement time of 50 RW steps should be about 4.2 h, but they are 3.5 h and 1 h, respectively. This is because when the minimum and maximum threshold voltages are reached, the current step ends and the next step begins. In the last 50 steps, the time is significantly reduced because the minimum mill maximum threshold voltage is often reached. The reason the threshold voltage is often reached is that the internal resistance increases.
As the number of charging and discharging cycles for a LIB increase, the battery’s capacity decreases, and the internal resistance increases.
Figure 3 is the reference discharge profile. It can be seen that as the discharge cycle increases, the discharge time becomes shorter, and the temperature increases. Through this, it can be seen that as the charge and discharge cycles increase, the internal resistance increases, which affects the capacity degradation. Therefore, in this paper, voltage and temperature data were used by utilizing the discharge curves of a total of four cells of lithium-ion batteries RW9, RW10, RW11, and RW12. Voltage data and temperature data were used to extract feature points through wavelet transformation and train the neural network model.
4. Simulation
Figure 8 shows the structure of the simulation system. The MATLAB/Simulink Wavelet Toolbox was used to extract features using wavelet transforms, and the MATLAB/Simulink Deep Learning Toolbox was used to implement the LSTM model. The simulation system was implemented by constructing the proposed SOH estimation model for a LIB. The experimental setting is a desktop with CPU Intel i7 10700K, GPU NVIDIA GeForce GTX 1660 S 6 GB, and RAM 16 GB.
4.1. Learning Using Data
Using the composed LIB dataset, we trained the CNN and LSTM neural networks. When training each neural network, we optimized the neural network using Adam Optimizer and used the regression function as an activation function to estimate the LIB’s SOH. Furthermore, to examine the learning performance of the neural network models, we used 80% of the LIB in each cell dataset as the training dataset and 20% as the test dataset to determine each model’s performance.
4.2. Comparison of Model Performances Using the Battery Dataset
To compare the LIB SOH estimation performances of the LSTM neural network model using the DWT, as expressed in
Table 4, the estimation accuracy was determined for each data point using the Equation below.
Figure 9 shows the actual SOH values and SOH estimation graphs obtained using the neural network models.
In Equations (22)–(25), n denotes the number of test data points. Here, RMSE represents the robustness of the estimation, MAE accuracy refers to the estimation accuracy of SOH, and MAX is the maximum error value. The SOH estimation accuracy using Normalization and LSTM for the voltage and temperature were 94.85% and 93.49%, respectively. The SOH estimation accuracy using CWT and CNN for the voltage and temperature were 96.82% and 94.68%, respectively. The SOH estimation accuracy using DWT and LSTM for the voltage and temperature were 98.92% and 96.45%, respectively. All three models exhibited excellent performance when using the voltage data.
All three models exhibited excellent performance when using the voltage data. This is due to the time-dependent nature of the voltage and temperature data associated with the charging and discharging of lithium-ion batteries. In lithium-ion batteries, the internal resistance increases as the number of repetitions of charging and discharging increases. The continuity of the voltage curve that occurs at this time is superior to that of the temperature curve as shown in
Figure 3. In particular, as the charge and discharge time elapses, the temperature curve becomes irregular and the continuity decreases. The learning algorithm applied in this study was able to solve the nonlinearity of the data to some extent. However, it was confirmed that the SOH estimation accuracy was relatively low because the temperature characteristic curve was very discontinuous. Instead, we found that the SOH accuracy was excellent for the same algorithm because the voltage curve had better continuity than the temperature curve. When comparing the performance of the model using wavelet and Normalization, it can be seen that the accuracy of the wavelet is high. This confirms that the proposed algorithm is advantageous for nonlinear battery data learning.
The performance was verified by comparing the relative errors of the three models. A boxplot was then used to compare the performances of the three models to evaluate the variance of the relative error. The mathematical equation of the relative error is the same as in Equation (26).
Here,
are the estimated and actual SOH values, respectively. The boxplot mainly focused on the maximum, minimum, median, third quartile, and first quartile. In
Table 5, IQR is the difference between the third quartile and the first quartile. The relative errors of different SOHs are compared in
Figure 10 and the data are described in
Table 6.
The estimation accuracies of the SOH were compared in the case of using temperature data. When comparing the LSTM with normalization, the variance of the relative error of the LSTM was narrow and maximum value small. In addition, it was confirmed that there were no outliers in the relative error for the LSTM; however, outliers occurred when using normalization. Although the estimation accuracy of the CNN, comparing CNN and normalization, was high, the variance of the error was wide and the maximum value relatively large. Outliers existed in both models. When comparing the LSTM and CNN, it was confirmed that the variance of the relative error of the LSTM was narrow and maximum value smaller. The LSTM did not have outliers, but CNN had outliers.
For the voltage data case, the estimation accuracies of the SOH were compared. When comparing the LSTM with normalization, the variance of the relative error of the LSTM was narrow and the maximum value small. In both models, there were no outliers in the relative error. When the CNN and normalization are compared, the variance of the relative error of CNN was narrow and the maximum value small. However, there was an outlier for CNN, but not for normalization. Finally, when comparing LSTM and CNN, it can be seen that the variance and maximum value of the relative error of the LSTM were small, and there were no outliers.
One reason for this difference between the two models was possibly the difference in learning. The CNN model did not consider the historical values based on time because it learned the entire image information, which was not related to a specific time. In contrast, the LSTM model was trained while considering the historical values based on time. Therefore, in terms of the SOH estimation accuracy of the LIB, the LSTM training method was more appropriate and fully reflects changes in the characteristic values (voltage and temperature) of the lithium ions caused by charging and discharging over time.
5. Conclusions
In this study, to improve the SOH estimation accuracy of a LIB, we introduced wavelet (CWT and DWT)-based preprocessing methods, which overcame the limitations of conventional studies when processing nonlinear characteristic data, and proposed a learning optimization method based on these. To confirm the usefulness of the proposed algorithm, we compared the SOH estimation results of the LIB using the wavelet technique and Normalization, which are existing data preprocessing techniques.
When the temperature data were used, which is a variable related to the SOH estimation of a LIB, the SOH accuracy was 93.49% for the technique using normalization and LSTM, 94.38% for the technique using CWT and CNN, and 96.45% for the technique using DWT and LSTM. When the relative errors were compared, the relative error variance of the DWT and LSTM was relatively narrow and the maximum value relatively low. Therefore, the accuracy was excellent when the wavelet was used compared to the existing data preprocessing technique (normalization). When CNN and LSTM were compared, the estimation accuracy of the LSTM was excellent.
When voltage data were used, the SOH accuracy was 94.85% for the using normalization and LSTM, 97.71% for using CWT and CNN, and 98.92% for the using DWT and LSTM method. When the relative errors were compared, the relative error variances of DWT and LSTM were the narrowest and minimum values low, similar to the temperature data. Therefore, as with the temperature data, it was found that the accuracy was excellent and the estimation accuracy of the LSTM was excellent when the wavelet was used compared with the conventional pretreatment.
By combing all of the aforementioned results, the SOH estimation accuracy using voltage data was greater than that when temperature data were used, and the SOH estimation accuracy was higher when DWT and LSTM were used. This confirmed the usefulness of the algorithm proposed in this study and it was demonstrated that the voltage is the most important factor of the LIB SOH accuracy estimation related variables. Therefore, it was confirmed that our proposed wavelet-based data processing and LSTM training methods, which showed excellent characteristics reflecting the time-series data, were useful in improving the SOH estimation accuracy.
In the future, we aim to establish a method for further improving the SOH estimation accuracy by adding the effects of data for characteristics other than the voltage and temperature of a LIB that was the focus of this study.