1. Introduction
In recent years, LiBs have dominated the relatively short-term energy storage market by virtue of their important characteristics such as high energy density, long service life, and continuous cost reduction [
1,
2]. However, LiBs degrade over the course of their use as the number of cycles increases [
3,
4]. When the degradation of LiBs reaches a certain level (failure threshold), the capacity and power will decrease rapidly if they continue to be used, which may cause more damage and even dangerous situations such as explosions [
5,
6,
7,
8]. The evaluation of RUL is regarded as a key indicator for measuring the operational effectiveness of LiBs. It is important to construct a good battery capacity degradation prediction model for the early warning of battery failures, the prevention of dangerous accidents, and reductions in economic losses. In general, when the maximum discharge capacity of an LiB falls to 70% or 80% of its rated capacity, the battery is considered to reach the capacity failure threshold and is no longer suitable for charge/discharge operation and needs to be replaced or discarded [
9]. Generally, according to the definition of RUL, predicting the RUL of LiBs is actually predicting the number of charge/discharge cycles that can be performed between the current state and the failure threshold.
RUL prediction methods for LiBs are based on two main categories: model-based methods and data-driven methods [
10]. Model-based methods aim to develop a mathematical model to describe the degradation trajectory of the battery dynamics, which simulates the degradation process of LiBs by the means of physical, electrochemical, or equivalent circuit models. Differently from model-based methods, data-driven methods use historical monitoring data to directly predict battery degradation trends [
11], such as Gaussian process regression [
12,
13] and relevance vector machines [
14,
15]. Meanwhile, many deep learning methods, including long short-term memory [
16,
17], recurrent neural networks [
18,
19], convolutional neural networks [
20,
21,
22], and attention mechanisms [
23,
24,
25] are also beginning to make an impact in the field. The superiority of data-driven methods is in direct prediction tasks using historical monitoring data. However, there is the limitation that battery capacity is difficult to measure directly in practice. As a result, an indirect prediction method was created, which extracts an indicator that can reflect the health status of LiBs from degradation process, i.e., a health indicator (HI). In addition, although data-driven methods are currently the most popular and widely used methods, they ignore the inherent physical characteristics [
26] of LiBs, which can affect prediction accuracy and lack interpretability. To address this issues, Ma et al. incorporated domain knowledge into deep learning to improve RUL prediction accuracy [
27], while Sandrucci et al. integrated experimental design and chemometric data analysis to help capacity prediction for lithium metal batteries [
28]. More generally, physics-informed machine learning has been widely used for the health management of LiBs and has become an effective method for predicting battery degradation [
29,
30,
31].
Currently, the focus of most researchers has been on improving single-value RUL prediction accuracy for LiBs. However, relying only on a single value for prediction has significant limitations when the degradation of LiBs is irreversible. In addition, the degradation of LiBs involves very complex physicochemical processes, and it is difficult to accurately control the change process through a numerical prediction model, and the prediction model may also be affected by external environmental factors during the operation. In practice, the working environment of LiBs is not consistent. A good RUL prediction model can not only take into account the working state of LiBs in different operating environments but also accurately describe the degradation trend. RUL interval prediction meets the requirements in this regard. Thus, there is a need for a data processing method that can construct time series intervals with upper and lower limits.
However, for the health state management of LiBs, relatively few studies have been conducted on interval prediction. Pan et al. achieved the interval prediction of the health state of LiBs by constructing a nonlinear regression model by combining fuzzy information granularity and a support vector machine [
32]. In order to mitigate the effect of capacity rebound on battery RUL prediction during the degradation of LiBs, Pang et al. introduced linguistic descriptions based on the prediction model in [
33]. By predicting the capacity rebound points individually and superimposing their results with the original prediction data, more accurate predictions were achieved and these predictions were presented in the form of intervals. Unlike the above works, which all use fixed-scale segmentation, a strategy for predicting RUL intervals of LiBs using fuzzy information granulation combined with trend-filtered segmentation was proposed in [
34]. But when confronted with a small amount of battery degradation data or a smoother degradation curve, the model may have poor prediction results due to insufficient training data. Moreover, since fuzzy information granularity inherently simplifies the complexity of the data, it makes the battery degradation data, which is originally a small amount of data and changes gently, become even more abbreviated after processing, further affecting the prediction accuracy.
On the other hand, the degradation data of LiBs is essentially a time series. The prediction accuracy for a time series can be improved via two ways: one is to improve the prediction model, and the other is to make effective use of the time series. As we all know, periodicity is an important characteristic of time series, and the reasonable use of periodicity can effectively improve prediction accuracy. Based on this, the prediction of periodic time series has been studied widely. For example, Wang et al. proposed a network-based periodic energy consumption prediction method and applied it to a refrigeration system [
35]. To address this issue, a method to construct non-periodic time series into periodic time series was proposed and used to predict the SOH of LiBs in our previous work [
36]. In this paper, the time series construction method is applied to generate long periodic time series data and form a dataset to address the problems of insufficient training data and smooth degradation in RUL prediction based on fuzzy information granulation. The experiments, based on the CALCE battery dataset and NCA battery dataset, show the effectiveness and advantage of the proposed prediction method. The main contributions of this paper are as follows: (1) A periodic time series dataset is generated by using the construction method of periodic time series. (2) An RUL interval prediction method, by combining periodic time series and trend filtering segmentation-based fuzzy information granulation, is proposed. (3) The proposed RUL interval prediction method is verified and analyzed based on two public battery datasets. It is worth noting that this paper is dedicated to obtaining better accuracy in predicting the RUL of LiBs by combining fuzzy information granulation and the construction method of periodic time series, which are both from our previous work [
34,
36]. In short, we expect to obtain better prediction performance through a reasonable combination of simple or existing methods, rather than obtaining this by directly improving the predictive model itself. This paper provides a valuable new idea for the works [
32,
37,
38,
39] that use the fuzzy information granulation in the field of the health management of LiBs to improve prediction accuracy via the same way or a similar way to that used in this paper.
The rest of this paper is organized as follows. The used battery dataset, periodic time series construction method, fuzzy information granulation, and evaluation indicators are introduce briefly in
Section 2. In
Section 3, the proposed RUL interval prediction method for LiBs is described in detail. In
Section 4, we report how the experiments based on the CALCE battery dataset and NCA battery dataset were performed and analyzed. Finally, we conclude this paper in
Section 5.
3. RUL Interval Prediction Method of LiBs
Based on the two public battery datasets and the periodic time series construction method as well as fuzzy information granulation shown in
Section 2, an RUL interval prediction method for LiBs will be proposed and described in this section.
The proposed RUL interval prediction method for LiBs, which consists of five steps, is presented in
Figure 6. Let
be the extracted HI degradation time series of LiBs and
be the time window. The details of the proposed method of RUL interval prediction for LiBs are as follows.
Step 1: Data acquisition and HI extraction. HIs with high correlation with the capacity, which are found by using the Pearson coefficient and MIC, are selected to characterize capacity, and outliers in the HI degradation data are removed.
Step 2: Trend filtering and fuzzy information granulation. Trend filtering is an effective method used to analyze potential linear trends in time series. The original time series are filtered and analyzed using
trend filtering [
42] to determine the points where the trend changes. Different time window lengths are matched for different ranges of trend segments by the trial-and-error method. Building upon this, fuzzy information granulation is utilized on the time series. After the time series
X is divided into
, the time window data are transformed into fuzzy particles using fuzzy information granulation. The data in the time window
are constructed into a fuzzy granule,
, under the triangular affiliation function. The fuzzy granule
contains three parts of information,
,
, and
, which are arranged in order to form the time subsequences
,
, and
, which are the lower limit Low, upper limit Up, and median
m, respectively.
Step 3: Construction of periodic time series. After completing the trend-filtering segmentation of an HI of LiBs, the HI becomes two subsequences with upper and lower limits. Repeatedly executing Step 1 and Step 2, the upper subsequence Up of the HI of LiBs can be obtained, forming a non-periodic dataset. On this basis, these non-periodic data are converted into a dataset with periodic characteristics by different permutation strategies, and the best periodic time series are determined based on the experimental results as the basic data for subsequent experiments.
Step 4: Building prediction model. The fuzzy granulated Low and Up are used to train the VMD-GRU prediction model. The granulated sequences of Low and Up are first given to VMD to decompose into individual modal components. Then, each component is divided into a training set and test set. Finally, the training set is fed into a GRU to train the prediction model and find the best model. Particularly, in the data division stage, the training set not only contains the original training data of the predicted batteries but also combines other battery degradation data that constitute the periodic time series.
Step 5: RUL prediction with VMD-GRU. The test data of each modal component are provided to the trained VMD-GRU prediction model, and the prediction data of each modal component of the Up and Low are reconstructed to achieve RUL interval prediction for LiBs.
It should be noted that, for the convenience of comparative analysis, the same VMD-GRU prediction model as in [
34] is adopted in Step 4 of
Figure 6. VMD is an adaptive, fully non-recursive method for modal variational and signal processing. Compared with the classical empirical mode decomposition (EMD), VMD can effectively reduce modal aliasing by presetting the number of modes and bandwidth constraints and can extract all modal components simultaneously, which makes its computational efficiency high, but the number of modes
K needs to be preset. Nevertheless, considering the modal aliasing suppression and parallel decomposition ability, VMD is chosen over EMD in this paper. On the other hand, the GRU merges the input gate and forget gate of long short-term memory into a single update gate, which can solve the problems such as the inability of long-term memory and gradient in backpropagation in recurrent neural networks, and its training speed and memory utilization is greatly improved relative to long short-term memory due to the reduction in the number of parameters. Based on the above considerations, the VMD-GRU model is selected as the prediction model in this paper as shown in Step 4 of
Figure 6. Of course, using other available data-based prediction models in Step 4 is entirely acceptable, and this will not affect the usability of the proposed prediction method.
4. Experiments and Analyses
In this section, we detail how the effectiveness and advantage of the proposed RUL interval prediction method were verified via several experiments based on the CALCE battery dataset and NCA battery dataset. Before validating the proposed prediction method, in order to enhance the feasibility of the experimental design, a series of focusing on the performance of periodic time series under different sorting strategies was specially designed. Taking battery CS2_35 as an example, the prediction errors under different sorting strategies of multiple constructed periodic time series were analyzed to find out the optimal sorting. This step was crucial for optimizing the prediction accuracy. Finally, by combining the data of the CALCE batteries and NCA batteries, the experiments on RUL interval prediction for LiBs based on periodic time series were designed and implemented as a means of verifying the practical application and superiority of the proposed method.
4.1. HI Extraction
Generally, the capacity degradation of a battery is measured by its capacity and internal resistance. However, LiBs are not completely charged and discharged regularly during the actual charging/discharging process, and the internal resistance is constantly changing, so it is not possible to effectively and accurately obtain the data of LiBs online. At the same time, there are many factors that affect the battery degradation, such as voltage, current, temperature, and humidity. Therefore, voltage and current, which are easy to measure and easy to obtain, were chosen as HIs in this paper. The impedance value of the battery in operation is provided in some public datasets of professional laboratories, so the impedance value of the battery was also used as a kind of HI.
In this paper, the charging/discharging curves of different cycles were selected to be studied, and the charging/discharging characteristics of battery CS2_35 are shown in
Figure 7 as an example.
Figure 7a,b represent the variation in CC charging voltage and CV charging current with time during charging, and
Figure 7c,d represent the variation in voltage and impedance with time during CC discharging, respectively.
In
Figure 7a, the CC charging time shrinks considerably during the battery charging/discharging cycle from 1 to 750, which indicates that the battery capacity decreases with its charging/discharging process continuously. Meanwhile, according to the degree of sparsity between the CC charging voltage curves of different cycle cycles, the capacity degradation of this battery is mainly divided into three phases: between cycle cycles 1 and 150, the capacity of the battery decreases at a relatively fast rate; between cycle cycles 150 and 600, the capacity of the battery decreases at a slow rate and may be accompanied by some capacity rebound phenomenon; between cycle cycles 600 and 750, the battery capacity degradation rate is significantly faster compared to that in the other two stages. This pattern is consistent with the capacity degradation diagram of battery CS2_35. Similarly, the CC discharge voltage curve in
Figure 7c exhibits a similar phenomenon. In contrast, the CV charge current curve in
Figure 7b exhibits that the CV charge current time increases with the increase in the battery charging/discharging cycles and does not show a gap in the CV charge current time between different charging/discharging cycles, which indicates that the correlation between this HI and its capacity degradation curve is lower than the correlation between the CC charge voltage time and CC discharge voltage time and the battery capacity degradation curve. In
Figure 7d, the impedance basically does not change with time during the battery cyclic charging/discharging cycles, but the impedance value of each cycle is different in magnitude, and its impedance value increases with the increase in battery charging/discharging cycles, and it also exhibits the gap between different cyclic cycles, so it is also possible to take the average of the impedance value of each cycle as the HI.
In summary, four HIs were extracted, including CC charging time (CCCT), CV charging time (CVCT), CC discharge time (CCDT), and the resistance of the CC discharge process (RVICCDP). The four HIs extracted from battery CS2_35 are shown in
Figure 8, from which it can be seen that the CCCT and CCDT basically show a degradation trend consistent with the capacity degradation, while the trends presented by the CVCT and the RVICCDP are quite different from the capacity degradation of battery CS2_35. In order to further analyze the relationship between the proposed HIs and capacity, correlation analysis was performed, as described in
Section 4.2.
4.2. Correlation Analysis
Four HIs were extracted by analyzing the information during the operation of LiBs. In order to illustrate the relationship between the extracted HIs and the capacity, the analysis was carried out from two perspectives: quantitative and qualitative. The quantitative analysis was performed using Pearson and maximal information coefficient (MIC) correlation analyses, and the qualitative analysis is illustrated by scatter plots.
4.2.1. Quantitative Analysis
Pearson coefficient: The mathematical expression of the Pearson correlation coefficient is
where
X is the capacity and
Y is the HIs extracted in this paper. The Pearson correlation coefficient has a value between
and 1. If the coefficient is in the interval
, the larger the value is, the stronger the positive correlation is. On the contrary, if the coefficient is in the interval
, the smaller the value is, the stronger the negative correlation is.
MIC: The main idea of MIC is to discretize the relationship between two variables in two dimensions and represent it using a scatter plot. The probability density of the scatter can then measure the coefficient of mutual information between the two variables, thus capturing linear as well as non-linear associations. Different grid drawings cause the samples to fall into different 2D grids, and the number of scatters in each grid interval is counted and compared to the total number of samples to obtain a joint distribution,
, of HIs and capacity. Based on the feature edge distribution
and target edge distribution
, the mutual information
between HIs and capacity can be found, whose mathematical expression is
Based on (
5), MIC can be described as
where
a and
b denote the number of grids divided in the directions
x and
y, respectively;
B is a constant, typically set to the 0.6 power of the data volume.
Taking CALCE battery dataset as an example, the correlation analysis between HIs and capacity is shown in
Table 1, from which the correlation coefficient of CCDT is the highest among the four HIs, indicating that CCDT is able to reflect the capacity degradation of LiBs.
4.2.2. Qualitative Analysis
The scatter plots of capacity versus the four HIs are shown in
Figure 9, from which all the four HIs and capacity have some correlation. Both CCCT and CCDT decrease with decreasing capacity, indicating a positive correlation between these two HIs and capacity, and CCDT shows almost a straight line with capacity, which also indicates a stronger positive correlation between CCDT and capacity. Meanwhile RVICCDP increases with decreasing capacity, indicating a negative correlation between this HI and capacity. This is because as the capacity of LiBs decreases with the increase in time, the impedance of the battery also increases, so RVICCDP varies oppositely to the capacity. CVCT is negatively correlated with capacity because as the battery capacity decreases, the internal impedance of the battery increases, so CVCT increases with the decrease in battery capacity.
In order to demonstrate that the proposed HIs can effectively reflect the battery degradation, both quantitative and qualitative analyses were employed. Considering cost-effectiveness and operational simplicity, we focused on the charging/discharging processes of LiBs as the basis for evaluating the HIs. This study showed that there is a significant correlation between these indicators and the battery capacity. Based on this result, the CCDT with the highest correlation between these metrics and battery capacity was found. Thus, CCDT was used as the main HI in the subsequent experiments.
In this paper, the prediction method that relies on a single HI (CCDT) was chosen. The main reason is that an RUL prediction method based on a single HI is less computationally demanding compared to multi-HI fusion (e.g., CCDT + CCCT), which is important for the practical implementation and real-time operation of the RUL prediction method. Of course, better prediction accuracy can be obtained if multi-HI fusion is used, but there will be a compromise in the complexity of the prediction model and the amount of computation, which will be an obstacle to the practical implementation and real-time operation of the prediction method.
4.3. Experiments on RUL Interval Prediction for LiBs Based on Periodic Time Series
4.3.1. Verification Experiments and Analysis
Taking battery CS2_35 as an example, after selecting and sorting the fuzzy granular data using the corresponding strategy, the first 37% of the fuzzy data of battery CS2_35 and the data of previous batteries were used as the training set, and the remaining data of battery CS2_35 were used as the test set, and the VMD-GRU model, whose parameters were the same as those in [
34] and are shown in
Table 2, was applied to RUL prediction for LiBs. The prediction accuracy and interval prediction effect are shown in
Table 3 and
Table 4, respectively, where ↑ and ↓ represent the correlations from low to high and high to low, respectively. The core code can be found on our personal website (
https://github.com/wenjie015/wenjie015.github.io/blob/master/files/RUL%20interval%20prediction%20for%20battery%20CS2_35.ipynb, accessed on 12 May 2025) built on Github.
In order to show the superiority of the proposed methods more intuitively, a graphical spider web diagram is used to express the error of each method, as shown in
Figure 10. To improve the clarity and readability of the cobweb diagram, more intuitive or simplified aliases are used to express each method. The time series shown in
Table 3 are, from top to bottom, SrI, SdI, SII_1, SII_2, SIII_1, SIII_2, SIII_3, SIII_4, SIII_5, SIII_6, SIV_1, SIV_2, SIV_3, and SIV_4. It is important to note that the prediction of one interval of strategy III upper limit Up does not have the same ascending and descending time series alignment as the lower limit Low and is represented here by an alias only. Moreover, the original strategy method that did not construct a periodic time series is denoted by S_raw.
From
Table 3 and
Figure 10, it is seen that Strategy I based on periodic time series has obvious prediction advantages when the same prediction model is used. Especially in the prediction of Low under fuzzy conditions, Strategy I shows an absolute advantage compared with the original strategy. In Strategy I and Strategy III, the prediction effect of the periodic time series according to the correlation gradually increasing is significantly better than that of the correlation gradually decreasing. Therefore, when constructing periodic time series, it is better to choose the sorting method with gradually increasing correlation. Combined with the results in
Table 4, when constructing periodic time series selecting correlation data (Strategy I and Strategy II), both the model prediction accuracy and the comprehensive evaluation indicator of intervals show better results; thus the correlation selection of data in the method based on periodic time series has a greater impact than the correlation sorting. Among Strategies III and IV, Strategy III is more stable on the whole, so the connection order of data is also a non-negligible factor when choosing data connections. In addition, when data from different sets of batteries are mixed together, the poorer results of Strategies III and IV show that the correlation between data should be fully considered when selecting the data to be connected. In conclusion, for the selection of connected data, the selection of data from the same group is preferred to data from dissimilar groups, and the correlation sorting method is superior to stochastic sorting. Hence, Strategy I was used in the subsequent experiments to implement RUL interval prediction for LiBs.
The prediction results for the four batteries in CALCE are shown in
Figure 11, and it is obvious that they all effectively achieve RUL interval prediction. Compared with the prediction results in [
34] without constructing periodic time series, the prediction curves of upper and lower bound shown in
Figure 11 more accurately depict the degradation trend of capacity and its fluctuation characteristics. Therefore, when performing RUL interval prediction, the method of constructing periodic time series can capture the fluctuation characteristics of time series more accurately, thus improving the accuracy of RUL interval prediction.
The experimental results of RUL interval prediction for five NCA batteries are shown in
Figure 12. Compared with the original strategy of not taking the construction of periodic time series in [
34], the method based on periodic time series has advantages. As shown in
Figure 12, when the prediction starting points are all the first 37% of the fuzzy information particles, it exhibits a very different prediction effect from that in [
34]. In [
34], when the prediction starting point of NCA batteries is 37%, the prediction interval is far from the true value, which is due to the smoothness of the NCA batteries’ degradation and the adoption of a larger time window in the fuzzy granulation, which leads to insufficient data for the training of prediction model, whereas the proposed method avoids this problem.
Since this paper mainly improves the prediction accuracy by constructing periodic time series and does not involve fuzzy granularity to divide the window problem, and a fuzzy information granule contains the charging/discharging information of multiple battery degradation cycles, it only improves the fit with the original fuzzy information granule in terms of the prediction accuracy, so the interval prediction at the failure threshold of CALCE batteries is consistent with [
34]. Since the NCA battery degradation data is not provided at the failure threshold, the range of interval prediction at the failure threshold for NCA batteries is not given in this subsection. The prediction accuracy of the method of constructing periodic time series proposed in this paper and the evaluation indicators for interval prediction will be described in
Section 4.3.2.
4.3.2. Comparison Experiments and Analysis
When the prediction starting point is early, NCA batteries experience poor prediction. Thus, in this subsection, we detail how experiments were designed using NCA battery aging data to validate the feasibility of the interval prediction scheme based on periodic time series as well as its effectiveness in addressing the data loss caused by the granularity of fuzzy information. Meanwhile, to show the superiority of the proposed method in terms of prediction performance and its generalization ability, the experiments were conducted on the CALCE battery dataset to compare and analyze the prediction results of the interval prediction based on periodic time series with the prediction results of the interval prediction model based on the fuzzy information granular and trend filtering segmentation in [
34].
For the convenience of analyzing and evaluating the prediction results, the interval prediction model in [
34] is denoted as Method I. Further, a new interval prediction model, called Method II, was formed based on Method I by introducing periodic time series. The main difference between Model I and Model II is that Model I uses the original data directly for the training of the prediction model, whereas Model II first joins the original data to make them periodic using the proposed periodic time series construction method and then trains the prediction model with the generated periodic data.
Table 5 shows the prediction error evaluation results of these two methods (Method I and Method II) on different battery datasets. As shown in
Table 5, in the MAE for the CALCE battery dataset, for example, the prediction error evaluation indicator of Method II is lower than that of Method I in most cases, and there exist only a few cases in which the prediction error evaluation indicator of Method II is higher than that of Method I. Moreover, the error evaluation indicator of Method II is relatively smooth, with not much difference, while the error evaluation indicator of Method I fluctuates more. In the NCA battery dataset, the prediction error evaluation indicators of Method II are all lower than that of Method I, and the prediction error evaluation indicator of Method II is only one-tenth of that of Method I. In summary, the proposed prediction method, by constructing periodic time series, has great advantages in prediction values, which fully shows its effectiveness and superiority.
The evaluation results of Method I and Method II for interval prediction for the CALCE battery dataset are shown in
Table 6. In particular, the results of using fixed-scale segmentation (segmentation width was 15) instead of trend filtering segmentation, which is denoted as Model Basis, are also presented in
Table 6 for a broader comparison. As can be seen from
Table 6, the value of
for Model I is smaller than that of Model Basis, indicating that the trend filtering segmentation has a higher prediction accuracy. Looking at
again, the one under the Model I is the larger for the same battery, further proving the superiority of the trend filtering segmentation in terms of prediction accuracy. Finally, based on the
value, it can be concluded that Model I has better overall prediction than Model Basis. On the other hand, the
P values of both Model I and Model II are greater than 20% for the CALCE battery dataset, which indicates that both Method I and Method II are capable of interval prediction. Although Method II does not have an advantage in terms of
and
, the
value of Method II is significantly better than that of Method I.
The evaluation results of Method I and Method II for interval prediction for the NCA battery dataset are shown in
Table 7. In
Table 7, for the NCA battery dataset, although most of the interval prediction widths
of Method II are larger than those of Method I, the
P values of Method II are significantly better than those of Method I. However, Method II has an absolute advantage in the coverage of the predicted intervals over the true values
. From the analysis in [
34], it can be seen that in the comprehensive evaluation indicator, the influential
P value mainly depends on the interval prediction width
, and the
P values of Method II are all better than those of Method I, which further indicates the great advantage of Method II in the coverage of true values. In conclusion, the prediction performance of Method II is better than that of Method I, which indicates that the proposed method of constructing periodic time series is more feasible and effective in RUL interval prediction.
Remark 1. Fuzzy information granulation has been widely used in the field of the health management of LiBs, such as in [32,37,38,39]. Considering the tendency of fuzzy information granulation to require the length of data, the combination of the periodic time series construction method and fuzzy information granulation used in this paper provides a valuable new idea for related works, i.e., works using fuzzy information granulation can try to develop and improve by combining the construction method of periodic time series via the same way or a similar way to that used in this paper, which is one of the potential benefits of this paper. Remark 2. The two battery datasets from CALCE and NCA were both obtained in controlled lab conditions, while real-world datasets may exhibit higher noise and environmental variability, which may result in well-performing prediction models trained using laboratory data to perform poorly when used with real-world data. There are two pathways to address this issue: (1) train prediction models using data generated from real scenarios so that they can take into account the noise and environmental variability in real-world datasets; (2) design prediction models with better generalization capability so that prediction models trained using laboratory data also perform well on real-world datasets. Both of these pathways encounter some challenges in their use: the first pathway needs to address the privacy of real data, while the second pathway places higher demands on the design of prediction models. These challenges are also directions and topics for future attention and research.