2.1. Proposed RUL Prediction Approach
The suggested model framework for predicting the RUL is shown in
Figure 1. Initially, the original data’s insightful properties were retrieved, exposing important trends and patterns that might be used to predict the batteries’ RUL.
After that, a PCA with memory feature integration was used to complete feature fusion. In order to forecast the RUL as the output, the encoded features were finally given into the DNN model.
A set of four lithium-ion batteries (B0005, B0006, B0007, and B0018) from the NASA dataset were subjected to three distinct operational profiles: charge, discharge, and impedance, all conducted at room temperature. These batteries were selected for predicting the remaining useful life (RUL). The NASA dataset provides time series data for each charge and discharge cycle, including measurements of voltage, current, temperature, and impedance. Each cycle follows the widely adopted constant current–constant voltage (CC-CV) pattern for both charging and discharging.
The testing process was repeated until the battery capacity degraded to 70% of its initial value. A complete cycle comprises both a charge and a discharge sequence.
Figure 2 illustrates a typical charge–discharge cycle of a lithium-ion battery. All charge–discharge cycles were compared against this reference pattern and those that deviated significantly were excluded from the dataset, as such anomalies could negatively impact the performance of the neural network.
Charging process: Initially, the battery is charged with a constant current (CC) until the voltage is raised to a maximum upper limit. Then, the voltage is kept constant (CV) while the current drops to a certain lower limit.
Discharging process: The entire discharge process is at a constant current (CC) of a specific value until the voltage drops to a certain lower limit.
- B.
Feature Extraction
In Deep Neural Networks, feature engineering plays a pivotal role in the success of prediction models, especially in complex scenarios like battery life prediction. The dataset gathered from charging and discharging cycles of batteries presents a significant challenge: the number of data points varies across cycles, with some having as few as 800 data points and others as many as 5000. This disparity makes it impractical to directly feed raw data into the neural network, necessitating data preprocessing and feature extraction to ensure consistent and meaningful input.
Feature extraction is crucial because, in theory, incorporating a broader set of features enhances the model’s accuracy. A well-structured representation of the data is essential for identifying trends and key features that contribute to better predictions.
As batteries undergo repeated charge and discharge cycles, they degrade due to several mechanisms, including the loss of active lithium ions, lithium plating, electrode degradation, metal dissolution, and an increase in internal impedance [
18]. These factors lead to a reduction in the battery’s capacity over time. Consequently, time series data of current, voltage, and temperature during charging and discharging cycles reveal distinct patterns when analyzed against the cycle count.
For instance, the figures for a B0005 battery, depicted in
Figure 3, illustrate the variation in voltage, current, and temperature of the battery cell B0005 over time during the charging and discharging processes across multiple cycles.
Figure 3a–c corresponds to the discharge cycles while
Figure 3d–f corresponds to the charge cycles. By analyzing these curves, it is evident that certain parameters of lithium-ion batteries are highly sensitive to the cycle count and exhibit clear trends. For instance,
Figure 3a highlights whether all discharging cycles follow a typical pattern, as shown in
Figure 2, and highlights how the time taken to reach the minimum terminal voltage decreases as the cycle count increases. Similarly,
Figure 3e highlights how the constant current (CC) charging time shortens with each successive cycle.
These characteristic features, extracted from the original data for each cycle, serve as critical inputs for predicting the battery’s remaining useful life (RUL). Identifying and leveraging these features enables the neural network to make more accurate and reliable predictions.
These distinguishing characteristics were taken from the original data for every battery cycle in order to forecast the RUL. While plotting each cycle for each battery, we observed errors in some battery cycles, such as missing data points and graphs showing abnormalities compared to others. These cycles were skipped during model training to prevent potential errors in the model. However, some of these cycles were retained to evaluate the model’s accuracy. The following features are extracted from the charging and discharging operation.
Charging Features:
Upper Cut-off Voltage Time
(tmin(i), Vi), Vi ≥ Upper Cut off Voltage, i = 1, 2, 3, … where tmin(i) represents the time when the battery terminal voltage reaches the maximum value first and Vi represents the value of output voltage in the ith cycle.
Constant Current Charging Time (CCCT)
(tmin(i), Ai), Ai ≤ Constant Current Value, i = 1, 2, 3, … Ai represents the value of the current value in the ith cycle.
Constant Voltage Charging Time (CVCT)
(tmin(i), Vi), Vi £ Constant Voltage Value, i = 1, 2, 3, … where Vi represents the value of the output voltage in the ith cycle.
Maximum Charging Voltage Time
The time required to reach the maximum battery-measured voltage. (tmax, Vmax) = {(ti, Vi| max(Vi))}, i = 1, 2, 3, …, m, where tmax is the time stamp for which the maximum voltage Vmax is measured in the ith cycle.
Maximum Temperature Time
Time when the battery reaches its maximum temperature. (tmax, Tmax) = {(ti, Ti| max(Ti))}, i = 1, 2, 3, …, m, where tmax is the time stamp for which the maximum temperature Tmax is measured at the ith cycle
Area Under Voltage Curve
Area covered by the voltage vs. time curve.
Area Under Current Curve
Area covered by the current curve vs. time.
Maximum Power Time
(tmax, (V*I)max) = {(ti, V*I| max(V*I))}, i = 1, 2,…, m, where the tmax is the time stamp for which the maximum current and voltage product is measured at the ith cycle.
Discharging Features:
Discharging Time
During the discharging process, the time for which the battery terminal voltage reaches its minimum voltage.
Maximum Temperature Time
Time which gives the maximum temperature at the discharging process for each cycle.
Area Under Voltage Curve
Area covered by the voltage measured vs. time plot.
Area Under Current Curve
Area covered by the current vs. time plot.
Discharging Capacity
The remaining capacity at each discharging cycle.
When combining the charge cycle and discharge cycle, there are 13 features for one cycle.
- C.
Generating Labels and Data Normalization
For the training phase of the RUL prediction model, correct labels are necessary after the features have been extracted from the raw data. To obtain the supervised data (x
i,y
i), the appropriate RUL (y
i) for the ith cycle characteristics (x
i) must be calculated. The ith cycle’s RUL was determined as
where L is the total count of charge and discharge cycles.
To mitigate any adverse effects resulting from varying value ranges, data normalization is an essential component of Neural Network models [
19]. We used the minimum–maximum normalization technique to rescale all of the dataset’s features inside the interval [0–1]. The following is the min–max normalization formula [
19].
where x
max denotes the greatest value of feature x and x
min denotes its minimum value.
By ensuring that all features have the same scale, this normalization strategy can help the neural network model perform better and coverage more quickly.
- D.
Memory Features
Time Series Analysis can be used to characterize the patterns in the data; therefore, memory-based characteristics spanning numerous cycles—which have not been used much—can be applied here. Utilizing a window that spans many cycles, extracted features are used with memory to capture the properties seen in the preceding n cycles. By increasing the number of preceding cycles (n), testing loss was decreased.
For instance, in order to predict the RUL for the ith cycle, yi, all of the features from the previous n cycles, {xi, xi−1, xi−2, xi−3, …, xi−n}, were used in addition to the features of the current cycle, xi. Several window sizes were used to test the proposed DNN in order to increase the number of cycles (n).
- E.
Feature Fusion
Adding memory features to the DNN model increases the number of input features, leading to potential redundancy, higher computational complexity, and inefficiency. To address this, we reduced the dimensionality of the input data using two proven techniques: Principal Component Analysis (PCA), Nonlinear PCA, and an autoencoder.
- (1)
Principal Component Analysis (PCA)
In this project, to handle the high-dimensional dataset and reduce the risk of errors such as overfitting, we used Principal Component Analysis (PCA) as the first encoding technique. PCA simplifies the dataset by identifying new variables, known as principal components (PCs), which are linear combinations of the original features. These components retain the most significant information from the original data while reducing dimensionality. The process of applying PCA involves the following steps [
20]:
Standardization: The data is standardized to ensure all features have a mean of zero and a standard deviation of one, preventing any feature from dominating the analysis.
Covariance matrix calculation: A covariance matrix is computed to measure how different features vary together, identifying relationships between them.
Eigen decomposition: The eigenvalues and eigenvectors of the covariance matrix are calculated to determine the directions (eigenvectors) with the highest variance (eigenvalues).
Selection of principal components: The principal components with the largest eigenvalues are selected, capturing the most variance and ensuring that only the most meaningful information is retained.
Projection: The original dataset is projected onto the space defined by the selected principal components, resulting in a reduced-dimensional representation of the data for further analysis.
Figure 4 illustrates the original data compared with the decoded data for the case of n = 3 in PCA.
- (2)
Nonlinear PCA
In this study, Nonlinear PCA was not used due to the large size of the dataset, making it computationally impractical to handle. Instead, we employed an autoencoder, which provides a scalable and efficient solution for dimensionality reduction by learning nonlinear representations of the data.
- (3)
Autoencoder
An autoencoder is a type of neural network used for unsupervised learning, primarily for dimensionality reduction and feature extraction [
21]. It aims to learn an efficient, compressed representation of input data by encoding it into a lower-dimensional space and then reconstructing it back to the original form. The autoencoder consists of two main parts:
Encoder: Compresses the input data into a lower-dimensional latent space, capturing the most essential features.
Decoder: Reconstructs the original data from the compressed representation, ensuring that the important information is retained.
In this project, we only consider the encoder, as the encoded representation is used as input to the DNN model. The decoder is only employed to verify the accuracy of the autoencoder model by evaluating how well it reconstructs the original input.
Figure 5 illustrates the original data compared with the decoded data for the case of n = 3 in the autoencoder.
- F.
Training Neural Network Models with and without encoded Features
Table 1 represents the number of input features for PCA and autoencoder along with their outputs, categorized by different number of previous cycles.
The proposed model for remaining useful life prediction is a Deep Neural Network (DNN) that leverages both current and previous cycle features. The output from the encoder serves as input to the DNN, along with the target output, as the DNN operates under a supervised learning framework. In Deep Neural Networks, each input passes through a nonlinear activation function to generate the input for the subsequent layer. The mapping in the ith layer can be expressed as follows:
where
represents the nonlinear activation function, W denotes the weight matrix, and b corresponds to the bias of the ith layer.
During the training phase, the weights and bias parameters of the layers are optimized interactively to minimize the loss function, which is defined by comparing model output with the target output. After tuning the hyperparameters, the selected DNN model consists of four fully connected layers. Each layer has 10, 7, 4, and 1 neurons, respectively. The ReLU activation function is used for the hidden layers, while the sigmoid function is applied to the output layer. The model’s performance during the testing is evaluated using the Root Mean Squared Error (RMSE%) value.
2.2. Nissan Leaf Gen 01 Battery Module
The adoption of electric vehicles (EVs) has surged rapidly in recent years. In 2022 alone, 10.5 million new EVs and plug-in hybrid vehicles were introduced to the market. This represents a significant growth in market share, with EVs accounting for 14% of global vehicle sales in 2022, up from 9% in 2021. Projections suggest that by 2030, EVs could capture up to 35% of the market. Most EVs are powered by lithium-ion battery packs, which are composed of multiple battery modules. These batteries typically reach the end of their automotive service life after approximately 10 years. However, even after this, many still retain 60–67% of their original capacity, making them suitable for secondary applications such as energy storage systems (ESSs), particularly for solar power plants [
22].
- B.
Why Use This Model when BMSs Already Exist?
EVs are equipped with a sophisticated battery management system (BMS), which continuously monitors and estimates the remaining useful life (RUL) of the battery during operation. However, once the battery pack is removed from the vehicle and disconnected from the BMS, the system’s memory is reset and the ability to predict the RUL is lost. This presents a significant challenge when repurposing retired EV batteries for second-life applications.
Therefore, this study aims to develop a predictive model for estimating the remaining useful life (RUL) of Nissan Leaf Gen 01 battery modules after their removal from the BMS. The proposed method focuses on utilizing simple tests to predict the battery’s remaining life, ensuring the feasibility of second-life applications in energy storage systems.
- C.
About the Battery Module
The Nissan Leaf Generation 01 battery pack consists of 48 modules connected in series, with each module comprising four lithium-ion cells. The anode material is a combination of LiMn
2O
4 and LiNiO
2, while the cathode is composed of graphite. The configuration of each battery module is 2P2S, meaning two parallel sets are connected in series, resulting in an overall pack configuration of 2P-96S. The nominal capacity of the battery pack is 66.2 Ah (24 kWh) at a 0.3C rate, where the C-rate defines the rate of charge or discharge relative to the total capacity. For instance, a 1C rate would fully charge or discharge the battery in one hour, while a 2C rate would complete the process in 30 min. Due to high current requirements managed by the battery management system (BMS), the usable capacity is set to 65.6 Ah. Each module has an internal resistance of 1 mΩ, and the series resistance across the pack is 2 mΩ. The operating voltage range for a single cell is 2.8 V to 4.2 V, with a mean cell voltage of 3.8 V, yielding a typical module voltage of 7.6 V and a maximum module voltage of 8.4 V. Each module weighs approximately 3.8 kg, with an energy density of 213 Wh/L and a specific energy of 132 Wh/kg.
Table 2 summarizes the battery module parameters.
- D.
Data Collection
Due to the difference in capacity between the Nissan Leaf Gen 01 battery module and the training set, it was necessary to ensure that 1.5 A was driven through each cell. Given the 2P-2S configuration of the module, a total current of 3 A was applied during the charging process.
Charging was carried out using a RIGOL DP 932U DC power supply in constant current mode until the battery voltage reached 8.4 V. Once this voltage reached that value, the current gradually decreased while the voltage kept constant. The charging process was stopped when the current was reduced up to 40 mA.
- 2.
Discharging Process
Discharging was conducted using a constant current method at 4 A, with the assumption that each cell in the module drains 2 A, consistent with the training dataset. This process was carried out using a KIKUSUI PLZ1205W electronic load. Data collection was performed using version 5.1.0 of the SD023-PLZ-5W (Wavy for PLZ-5W) software, provided by KIKUSUI.
Figure 6 illustrates the complete charging and discharging setups used in the laboratory for the data collation process for Nissan Leaf Gen 01 battery modules.
- E.
Data Normalization
The charging and discharging current and voltage curves over time of the Nissan Leaf module exhibit the same pattern as those of the 18650 battery datasets. The only difference between the training dataset and the Nissan Leaf Gen 01 battery module lies in the capacity when comparing their performance. Therefore, normalization was performed using standard data for each battery type. This process brings both battery values to a scale between 0 and 1, making the model well-suited for predicting the remaining useful life (RUL) of Nissan Leaf battery modules. These methods ensure that predictions remain robust despite the constraints encountered during data collection. It is important to note that this approach does not consider the battery’s chemistry or physical structure.
Figure 7 illustrates the charge and discharge curves plotted using the gathered data. The standard times can be calculated as follows:
where 2 Ah is the capacity and 1.5 A is the charging current for the 18650 battery.
where 24 kW is the capacity, 48 is the number of modules in a pack, 3 A is the charging current, and 8.4 V is the max voltage of a Nissan Leaf battery module.
2.3. Experiment
We obtained the lithium-ion battery data from NASA’s Prognostic Center of Excellence (PCoE). For our model, we selected four battery cells designated as B0005, B0006, B0007, and B0018. These batteries underwent various operational profiles, including charging, discharging, and impedance testing, all conducted at room temperature. Charging was performed in constant current (CC) mode at 1.5 A until the battery voltage reached 4.2 V, followed by constant voltage (CV) mode until the charge current decreased to 20 mA. Discharging occurred at a constant current (CC) of 2 A until the battery voltages dropped to 2.7 V, 2.5 V, 2.2 V, and 2.5 V for batteries B0005, B0006, B0007, and B0018, respectively.
The experiments were terminated when the batteries met the end-of-life (EOL) criteria, defined as a 30% reduction in rated capacity (from 2 Ah to 1.4 Ah).
We used batteries B0005, B0006, and B0007, which had 167 cycles, to train the model, while battery B0018, with 132 cycles, was utilized for testing. Although the dataset from NASA includes several hundred cycles, the proposed model is capable of functioning effectively with cells that have a higher cycle count, especially since we normalized the target output using min–max normalization.
The DNN model was then trained using the encoded features from batteries B0005, B0006, and B0007, and tested with the encoded features of battery B0018 and CALCE lab data. The predicted results were compared among the DNN, Neural Network with memory features (NNwMF), Support Vector Machine (SVM), and Linear Regression (LR) models using RMSE% values. Under the NNwMF framework, we considered the DNN with PCA encoded memory features, DNN with memory features without encoding, DNN with memory features encoded using an autoencoder, and an LSTM model.
- B.
Model Retraining
As the above-mentioned model is used to predict the remaining useful life (RUL) of Nissan Leaf batteries, maintaining the same test conditions as those in the training dataset can be challenging. Consequently, measuring certain parameters, such as temperature variation during the charging and discharging processes, becomes difficult. During the data collection phase, we focused solely on charging and discharging currents, voltage, and power over time, while neglecting temperature features. As a result, we had to retrain the most effective model obtained during our initial training phase using the new set of 12 input features, making it suitable for predicting the remaining useful life (RUL) of the Nissan Leaf Gen 01 battery modules.