1. Introduction
Lithium-ion batteries have the advantages of high operating voltage, high energy density, long cycle life, no memory effect, and environmental friendliness. They have been widely used in portable electronic devices such as mobile phones and notebook computers, and are gradually expanding into the fields of electric vehicles and large battery energy storage. As one of the important energy storage technologies, lithium-ion batteries play an important role in changing the energy consumption structure and electrifying traditional energy sources [
1,
2,
3,
4]. However, the storage capacity of lithium-ion batteries will continue to decline as they age, which shows a decrease in power, causing the battery capacity to decay and the internal resistance to increase. In existing research, battery state of health (SOH) is usually used as a quantitative indicator to evaluate battery aging. Nevertheless, the battery SOH will be affected by many factors, such as temperature, discharge depth and state of charge (SOC), etc., which further increases the difficulty of estimating the battery SOH. Therefore, in order to avoid safety hazards caused by battery capacity fading and maximum available power reduction, it is crucial to select an appropriate estimation method to complete battery SOH identification.
Generally speaking, these methods are mainly divided into two categories: traditional estimation methods and data-driven estimation methods.
Traditional methods focus on identifying and analyzing important external characteristic parameters in lithium-ion batteries. They usually analyze the collected experimental data such as battery current, voltage, temperature, etc., and relatively directly obtain the characteristic parameters that can reflect battery degradation and estimate battery SOH. The method includes experimental analysis method [
5], electrochemical model method [
6] and equivalent circuit model method [
7].
The experimental analysis method evaluates the battery SOH by directly measuring certain characteristic parameters of the battery, mainly including capacity measurement, ohmic internal resistance measurement and impedance measurement. Although the experimental analysis method can accurately calibrate the battery SOH, the limitations of battery experimental equipment, experimental sites, and experimental test plans will increase the difficulty of implementing this method [
8]. The electrochemical model is a first-principle model that can not only accurately simulate the external characteristics of the battery, but also simulate the distribution and changes in the internal characteristics [
9]. The electrochemical model can deeply describe the microscopic reactions inside the battery and has very clear physical meanings. However, the electrochemical model contains complex partial differential equations and numerous electrochemical parameters, which not only brings accuracy advantages but also greatly increases the complexity and computational load of the model. A large number of unmeasured parameters in the model affect the application of the model in battery management systems, and its ability to dynamically track changes in environmental conditions is limited. The equivalent circuit model method usually uses traditional circuit components (such as capacitors, internal resistances, voltage sources, etc.) to form a circuit network to describe the external characteristics of the battery. The equivalent circuit model has good applicability to various working states of the battery. The state equation of the model can be easily derived, and the dynamic response and attenuation behavior of the battery can be constructed by identifying the model parameters [
10]. However, the estimation results under this method usually depend on the structure of the equivalent circuit, resulting in increasing model errors.
Since the aging process of a lithium-ion battery is a dynamic coupling process and its internal electrochemical mechanism is very complex, traditional battery SOH estimation methods have problems such as model complexity and limited robustness, dynamic adaptability and accuracy. They also have limited ability to describe the dynamic and static comprehensive characteristics of lithium-ion batteries, which is not conducive to practical application. The data-driven method provides another solution for estimating battery SOH. By collecting a large amount of historical data and analyzing the original data, we can only establish the relationship between battery capacity and external features from the perspective of data information, without deeply exploring the complex internal mechanism changes in lithium-ion batteries. Relying on the data-driven method’s ability to autonomously learn data information, this method has received widespread attention in battery SOH estimation [
11,
12,
13,
14].
Data-driven methods include the optimization algorithm method, sample entropy method, performance characteristics method, and machine learning method [
15,
16,
17,
18,
19,
20]. The optimization algorithm method used is to identify the battery model parameters through a suitable algorithm and then use the model parameters to estimate the battery SOH. A multi-time joint framework for predicting the battery SOC, SOH and SOP (state of power) has been established in [
18]. The sample entropy method is a useful tool for assessing the predictability of time series, while also quantifying the regularity of the data series. For example, in [
21], Sun et al. have established a battery capacity model through the sample entropy of voltage and temperature sequences, and then have estimated the battery SOH. In addition, the data-driven method can also complete the battery SOH estimation by analyzing the aging characteristics related to the degradation process of lithium-ion batteries, that is, the performance characteristics method. This method is to establish a mapping relationship between these parameters and the lithium-ion battery capacity through the evolution of dominant characteristics during the degradation process [
22,
23]. These dominant parameters are usually named aging features. Most of the current research focuses on the aging features that are obtained in a relatively stable working environment with a single working mode. The aging features obtained in this way are time-consuming and unfavorable for practical development. Machine learning only relies on battery historical data and uses advanced algorithms to train complex non-linear degradation behaviors of battery to achieve accurate estimation of the battery SOH [
24,
25,
26,
27,
28]. Commonly used methods include neural networks (NN), support vector machines (SVM), Monte Carlo, Gaussian process regression (GPR) and long short-term memory networks (LSTM), etc. Although the data-driven method is easy to implement, the implementability of this method is based on sufficient aging data, and it usually takes several months or even longer to obtain aging data. That is to say, there are several issues with these methods, which have to be overcome: (1) Most references that extract the battery feature need to consider the battery’s entire working condition, which increases the computing burden and lab force. (2) Different working conditions will affect the performance of the lithium-ion battery in reality, but most research has extracted features from one single stable working condition (e.g., 1 C charging/discharging rate), and the performance of these features under dynamic working conditions is doubtful.
In summary, considering that in actual scenarios, batteries usually work in complex and changeable environments, it is often necessary to obtain useful battery aging features in a very short time to perform battery SOH calibration. At the same time, considering the practical factors of the high cost of obtaining label data in the battery field, the insufficient amount of data is directly related to the effect of data information analysis and mining and affects the accuracy of the battery SOH estimation. Therefore, this paper considers how to obtain effective battery data information from the current pulse test stage, extract battery aging features in a short time, and complete accurate battery SOH estimation under small sample data.
Based on the above analysis, this paper addresses the issues discussed above from three aspects. First, the response voltage curve characteristics of the pulse test current at four rates are analyzed, and the aging features in a very short time that can reflect the battery capacity decay are extracted. Then, the proposed method in this paper is verified in two cases. The first case is based on the complete current pulse test data and considers using the aging features under all SOC intervals as the battery SOH estimation model input while verifying the universality of the proposed method in this paper. The second case is to assume that the current pulse test process is incomplete while ensuring that the auxiliary data and the source domain data set come from aging feature data in different SOC intervals. Six ways of dividing the auxiliary and the source domain data set are considered to verify the mapping relationship for battery SOH estimation model of the proposed method under different SOC intervals is not affected. Finally, it is also verified that the proposed method can still ensure the accuracy of lithium-ion battery SOH estimation in the above two application modes, whether in a single-temperature operating mode or in different temperature operating modes.
In detail, the main contributions of this paper are summarized as follows:
- (1)
This paper analyzes the voltage response curve characteristics under pulse current, and aging features are extracted over a very short time, which provides a basis for establishing the nonlinear relationship between the battery SOH and the aging feature.
- (2)
In the case of inconsistent initial SOC values, the proposed method can still ensure that the estimation accuracy of the battery SOH under different SOC interval is not affected.
- (3)
The proposed method in this paper can still ensure the battery SOH estimation accuracy while reducing the model feature dimension and the demand for target domain label data.
This paper is organized as follows:
Section 2 introduces the method of extracting the aging feature. The related algorithms Tradaboost.R2 are introduced in
Section 3. It also still introduces the battery SOH estimation under the complete SOC range. Battery SOH estimations under different SOC ranges are presented in
Section 4. The conclusions and future work are given in
Section 5. The specific framework for the proposed method is shown in
Figure 1.
2. Aging Feature under Pulse Current
In this work, lithium iron phosphate battery cells (LFP/C) are considered, while
Table 1 describing the cell’s basic electric parameters. Five LFP/C cells are aged at two different temperatures, i.e.,
°C and
°C, which is divided into two modes and summarised in
Table 1. The capacity variations in the five LFP/C cells are presented in
Figure 2. During the non-constant charging and discharging stage, three SOC and pulse currents at four rates are considered. The whole battery testing process are shown in
Figure 3. First, the battery is charged to SOC = 20%, SOC = 50% and SOC = 80% with a constant current of 1 C rate. And in each SOC stage, pulse charge and discharge currents of 4 C, 2 C, 1 C and 0.5 C are applied to the battery. A negative current indicates a charging process and a positive current indicates a discharging process.
Figure 4 shows the current changes in the battery under the pulse test.
The test process is divided into three parts, namely reference performance test (RPT), which is also called capacity test, pulse test and aging test. RPT test is used for capacity calibration. The aging features extracted in this paper come from the battery data in the pulse test stage, while the aging test is used to accelerate the aging of the battery. This paper considers two temperature modes, that is, accelerated aging modes with different temperatures selected during the aging test stage.
Figure 4 and
Figure 5 show the current/voltage changes in the lithium-ion battery under three SOC during the pulse current test. It can be seen that the response voltage value changes significantly in different SOC stages, for example, in the charging stage where the SOC changes in the range of
to
, compared with the two stages of SOC = 20% and SOC = 50%, the response voltage value of the corresponding point is the largest at the stage of SOC = 80%, and the response voltage value of SOC = 50% is larger than that of SOC = 20%. It can be said that when the SOC is in the charging stage belonging to this area, the response voltage value of the corresponding point increases with the increase in the initial SOC value. Under the same conditions, the same results can be obtained in the discharging stage as in the charging stage.
Since the pulse current value is always at a stable value, this paper only considers the changing characteristics of the response voltage under the current pulse test. Now, we analyze the specific changing trend of the current pulse test response voltage curve in detail, taking the response voltage curve at the SOC = 20% stage as an example. During the charging stage,
Figure 6 shows the lithium-ion battery at SOC = 20% at different rates. From the response voltage change curve, it can be seen that the overall trend of the response voltage curve is downward (showing a “red–green–blue” color change); similarly, the discharge stage under this condition also shows the same trend, as shown in
Figure 7.
Let us analyze the specific changing trends of the response voltage curves under different initial SOC stages. Taking the charging response voltage curve at a rate of 4 C as an example,
Figure 8 shows the response voltage change curve of lithium-ion battery under different initial SOC stages. It can be seen that as the initial value of SOC changes, the voltage response curve changes. When SOC = 20%, the overall trend is downward, but when SOC = 50% and SOC = 80%, the overall trend of the response voltage curve is upward. In the discharging stage under the same conditions, no matter which SOC stage it is in, the overall trend of the response voltage curve is downward, as shown in
Figure 9.
The aging features selected in this paper come from the data in the pulse test stage. The purpose of extracting the aging features is to find features that are highly correlated with battery SOH as model input. According to
Figure 6,
Figure 7,
Figure 8 and
Figure 9, the battery voltage shows an obvious downward or upward trend, and it is feasible to perform feature extraction based on this phenomenon.
According to
Figure 10, the four voltage values
,
,
,
on the response voltage curve and the integrated area S between the response voltage values
and
are selected as the battery aging features. Based on the above analysis, ten aging features can be obtained at each charging/discharging rate of each SOC stage. For example, during the charging period at a rate of 4 C, there are four response voltage values and one integration area, and there are a total of five aging features. Similarly, there are five aging features during the discharge stage, so there are a total of ten aging features at a rate of 4 C. Therefore, in each SOC stage, there are a total of forty aging features, and by analogy, in the three SOC stages, a total of one hundred and twenty aging features can be obtained.
3. Lithium-Ion Battery SOH Estimation under Complete SOC Range
Due to the diversity of dynamic working conditions in the battery aging experiment test process, such as differences in charge and discharge rates, variability in operating temperature, and randomness of the current SOC, which increases the types of source domain data and affects the effectiveness of the lithium-ion battery SOH estimation model. On the other hand, the dynamic working conditions of the battery are changing, and the previous battery aging data information is not necessarily applicable to the current new application scenarios. Considering how to select data relevant to the current scenario from diverse source domain data is a prerequisite to ensure accurate estimation of the battery SOH under multiple working conditions.
This paper selects five lithium-ion batteries in two temperature modes as shown in
Table 1. First, the battery aging features are extracted from the current pulse test data. Secondly, considering the impact of differences in different battery data on model estimation accuracy, a battery SOH estimation framework based on the sample transfer learning method is established. Then, by constructing an auxiliary data set to enhance the learning ability of the battery SOH model, it can obtain an accurate estimation of the battery SOH under dynamic testing conditions. Finally, comparative experiments are designed and combined with multiple error index methods to verify the effectiveness of the method.
3.1. Lithium-Ion Battery SOH Estimation Framework Based on the Tradaboost.R2 Algorithm
The Tradaboost.R2 algorithm is evolved from the Adaboost.R2 algorithm, which is essentially a machine learning method based on the concept of ensemble learning. The purpose of the Adaboost.R2 algorithm is to update and iterate two weight parameters; one is the weight parameter for the sample data, and the other is the weight parameter for the weak learner. The significance of updating the sample data weight parameters is to adjust the distribution of the training data set for the next iteration based on the previous round for the source domain. In other words, in each iteration process, the weight coefficient for the source domain sample data that makes the estimation result error of the target domain smaller is reduced in the next iteration. And for the source domain sample data that makes the prediction result of the target domain have a larger error, its weight coefficient will be increased in the next iteration, because in the next iteration, these sample data that make the error result larger may make the error result smaller in this iteration, but if the error result does not decrease in this iteration, then it continues to increase the weight of the sample data and proceeds to the next round of iteration. On the other hand, the purpose of updating the weight parameters for the weak learner is to increase the weight coefficient for the weak learner with a smaller overall error result.
Tradaboost.R2 is based on the Adaboost.R2 algorithm, adding the original source domain data and target domain data to the source domain, target domain and auxiliary data set. The auxiliary data set comes from a small amount of labeled data in the target domain. At this time, the training set consists of the source domain data and the auxiliary data. The goal of the Tradaboost.R2 algorithm is to reduce the sample data in the auxiliary data set that makes the prediction result error larger, so that after several iterations, the sample data in the auxiliary set related to the source domain data will have a larger weight coefficient, and the weight coefficient of sample data in the auxiliary data set that is not related to the source domain will be reduced.
This paper chooses extreme learning machines (ELMs) as the weak learner; an ELM is actually a single hidden layer feedforward neural network method. Compared with the traditional feedforward neural network, the structure still includes an input layer, a hidden layer and an output layer. However, in terms of training effect, since the ELM does not require the backpropagation method based on gradient descent to adjust the weight coefficients, its network training speed is fast and it has good generalization ability, while it does not easily fall into local minimum points. Therefore, this paper will use an ELM as the weak learner to establish a lithium-ion battery SOH estimation framework based on the Tradaboost.R2 method.
Consider a large amount of labeled source domain data , where is input data and is output data. Dividing the target domain data into two parts, the auxiliary data set and the testing set , where the auxiliary data set is a small amount of labeled data in the target domain, and the corresponding sample label is , while the test data set is a large amount of unlabeled data in the target domain. Now the entire training data consist of the source domain data set and auxiliary data set, that is , and the corresponding sample label is . Assume that the amount of sample data in the source domain is and the number of samples in the auxiliary data set is . For simplicity of writing, the test data in the target domain are written as . Therefore, the current goal is to predict the output of the corresponding sample through the trained model when the model has a new test sample input . The specific process of the Tradaboost.R2 method is shown in Algorithm 1.
3.2. Battery SOH Estimation Based on Complete SOC Range
Based on the research and analysis of battery aging features in
Section 2, it can be seen that 40 battery aging features can be extracted from each SOC interval, and a total of 120 aging features are obtained. In order to establish a battery SOH estimation model under different temperature modes, all the aging features of the three SOC intervals are used as the input of the model, and the application scenarios in single-temperature mode and multi-temperature mode are considered, respectively. Next, considering that the source domain and the target domain come from two different battery data sets, the entire data set is divided into source domain, auxiliary and target domain data sets. By using a small amount of target domain label sample data to establish an auxiliary data set, the impact of insufficient source domain information on the model estimation accuracy is reduced.
Table 2 shows the division of source domain, auxiliary and target domain data sets under the complete SOC interval, where
represents source domain data,
represents auxiliary data and
represents target domain data. At the same time, in order to verify the effectiveness of the battery SOH estimation model under different temperature modes, six scenarios are considered and represented by the symbol
, where
–
are single-temperature scenarios, that is, the source domain and the target domain have the same temperatures, and
–
are multi-temperature scenarios, that is, battery data from the source domain and target domain at different temperatures.
Algorithm 1: Tradaboost.R2 algorithm. |
|
Furthermore, in order to verify the effectiveness of the battery SOH estimated model, the mean absolute error (MAE), mean square error (MSE) and root mean square error (RMSE) analysis methods are used, which are expressed in Equation (
1)–(
3). Finally, the coefficient of determination
is used to verify the performance of the estimated model, where the larger the
value, the better the model fitting effect.
where
N is the number of samples, and
is the true value for battery SOH,
is the estimated value for battery SOH, and the
is the mean value for battery SOH.
This paper selects the Adaboost method, GPR, SVM and CNN-LSTM, as well as the case where the auxiliary set is not considered in the proposed method as the comparison methods. The symbol represents the estimated results based on the Adaboost.R2 method, the symbol represents the estimated results based on the GPR method, the symbol represents the estimated results based on the SVM method, the symbol - represents the estimated results based on the CNN-LSTM method, the symbol - represents the estimated results without the transfer learning step (that is, the proposed method does not include the auxiliary data), and the symbol represent the estimated results of the method which proposed in this paper.
Due to the different temperatures of lithium-ion batteries, the entire battery data can be divided into two modes, namely mode 1 and mode 2. Regarding the division of data sets in single-temperature mode, take the three battery data sets in mode 1 as an example. There are three battery data sets in mode 1, namely No.1, No.2 and No.3. Choose one of these three data sets as the target domain, and the other two data sets as the source domain, such as scenario 1 (
) and scenario 2 (
). In the multi-temperature mode, select the data set in one mode as the source domain, such as No.1 in mode 1, and the data set in another temperature mode as the target domain, such as No.4 in mode 2, and divide it into scenarios 4, 5 and 6 (
,
,
). In these two modes, in order to improve the estimation accuracy of the model and verify the necessity of transfer learning, the construction of auxiliary data sets is considered, as shown in
Table 2.
3.3. Analysis and Discussion of Experimental Results
In this part, based on the previous aging feature extraction and model optimization, the battery SOH estimation under single-temperature and multi-temperature modes are analyzed. First, the estimation results under single-temperature mode are analyzed, and the experimental results based on different models are considered to verify the necessity of transfer learning.
Figure 11 and
Figure 12 show the battery SOH estimation results for scenario 1 (
)–scenario 3 (
) under the single-temperature mode. It can be found that in the three scenarios, the battery SOH estimation based on the
method are closer to the reference value, and its effect of tracking the reference value is more obvious, while the battery SOH estimation based on the
-
method obviously does not accurately track the true value. In addition, considering that the Tradaboost.R2 algorithm is improved from the Adaboost.R2 algorithm, the battery SOH estimation based on the basic algorithm Adaboost is also unsatisfactory. At the same time, regardless of the scenario, the trend of estimation results based on CNN-LSTM, GPR and SVM is not ideal.
Table 3 and
Table 4 give three error values and coefficients of determination for
–
. It can be seen from these two tables that the MAE of the estimated value based on the
method does not exceed 1.5%, which is significantly smaller than the MAE of other comparison methods; in terms of MSE, the MSE based on the
method is lower than 0.015% and the corresponding RMSE is less than 1.5%, which also shows that the estimation accuracy based on the
method is higher than the other methods. On the other hand, in terms of verifying the model fitting effect, the coefficient of determination
based on the
method is not less than 88% in the three scenarios. Compared with the corresponding coefficient of determination values of other methods, the model fitting effect based on the
method is better. Therefore, it can be seen that the method after the transfer learning step can improve the accuracy of the estimation results, but the estimation results without the transfer learning step cannot guarantee the estimation accuracy, which further illustrates the necessity of transfer learning.
On the other hand, according to
Table 2, it can be seen that when no auxiliary set is constructed, taking
as an example, the MAE is 0.0503, which shows that the estimation model obtained only with the help of source domain data information is not well adapted to the test environment of the target domain. When constructing the auxiliary data set, only the first 15% of the label data information in the target domain can obtain better estimation results to verify the necessity of transfer learning. It can be seen that lithium-ion battery SOH estimation can be completed with a small amount of label sample data. Therefore, this method verifies that the accuracy of battery SOH estimation based on a small number of label sample data is not affected.
Besides that,
Table 3 and
Table 4 present the error estimation results of the battery SOH under different models. These tables also depict the error estimation results of the battery SOH under different types. In order to analyze the estimated result clearly, we use the estimated results based on GPR, SVM, CNN-LSTM, Adaboost.R2 and without using the transfer method under the same conditions to compare with the Tradaboost.R2 algorithm estimated results.
After analyzing the battery SOH estimation results in single-temperature mode, this part starts to analyze the multi-temperature mode. Battery No.5 in mode 2 only has ten sample points that select battery No.4 in mode 2 as the target domain, and the three batteries No.1, No.2 and No.3 in mode 1 are used as the source domain to verify the battery SOH estimation under multi-temperature mode.
Figure 13 and
Figure 14 show the battery SOH estimation results for scenario 4 (
)–scenario 6 (
) under multiple-temperature mode. It can be found that in the three scenarios, the battery SOH estimation based on the
method is closer to the reference value, and its effect of tracking the reference value is more obvious, while the battery SOH estimation based on the
-
method obviously does not accurately track the true value. In addition, considering that the Tradaboost.R2 algorithm is improved from the Adaboost.R2 algorithm, the battery SOH estimation based on the basic algorithm Adaboost is also unsatisfactory. At the same time, regardless of the scenario, the trend of estimation results based on CNN-LSTM, GPR and SVM is not ideal.
Table 4 and
Table 5 give three error values and coefficients of determination for scenario
–
. It can be seen from these two tables that the MAE of the battery SOH estimated value based on the
method does not exceed 1.3%, which is significantly smaller than the MAE of other comparison methods; in terms of MSE, the MSE based on the
method is lower than 0.033% and the corresponding RMSE is less than 1.8%, which also shows that the estimation accuracy based on the
method is higher than that of other methods. On the other hand, in terms of verifying the model fitting effect, the coefficient of determination
based on the
method is not less than 85% in the three scenarios. Compared with the corresponding coefficient of determination values of other methods, the model fitting effect based on the
method is better than other methods.
In the three scenarios under multi-temperature mode, taking scenario as an example, when the auxiliary data set is not constructed, the MAE is 0.0553, compared with the MAE value (0.0108) under the method proposed in this paper, which shows that the estimation model obtained only with the help of source domain information has no way to learn the data information for the target domain, and when the top 15% of the label data in the target domain is used to build an auxiliary data set, the error value of the prediction results has dropped significantly. This shows that 15% of labeled data information plays an important role in model training and verifies the necessity of transferring data information. The same results are also reflected in scenarios and , which means that when the auxiliary data set only accounts for 15% of the target domain, the accuracy of battery SOH estimation under multi-temperature mode still can be guaranteed.
4. Lithium-Ion Battery SOH Estimation under Different SOC Ranges
In the previous section, we discuss the battery SOH estimation problem under the complete SOC interval, which means that when establishing the estimation model framework, all aging features are used as input of the model. However, if there is a lack of aging features in a certain range, will it make it difficult to estimate the batter SOH? Under laboratory conditions, pulse test data for three complete SOC intervals can be easily obtained. However, in actual applications, depending on the needs of different application scenarios, it may not be possible to complete the pulse test under three complete SOC intervals. At this time, it is necessary to consider whether the battery SOH estimation under different SOC intervals can be achieved. In other words, comparing with the battery SOH estimation under the complete SOC interval, can it be ensured that the mapping relationship under different SOC intervals will not be affected.
4.1. Data Set Division under Different SOC Intervals
In order to illustrate the aging features under different SOC intervals, the symbol is used to represent the forty aging features at SOC = 20%, the symbol is used to represent the forty aging features at SOC = 50% and the symbol is used to represent the forty aging features at SOC = 80%.
In order to verify that the mapping relationship of the battery SOH estimation model under different SOC intervals is not affected, aging feature data from different initial SOC is selected when constructing the auxiliary and the source domain data set. For example, it is now necessary to use the label data information of battery No.1 to estimate SOH of battery No.3. If battery No.3 only contains pulse test data with SOC = 20%, and battery No.1 contains pulse test data with a complete SOC range, then the target domain data set of the model is the aging features
of battery No.3, and source domain data set contains the aging characteristics
of battery No.1. Under the assumption of this scenario condition, the dimensions of the source domain and target domain data of the model are different. Therefore, the aging feature
of battery No.1 can be selected as the auxiliary data set of the model to learn the data information of
, and the aging feature
or
can be used as the source domain data set of the model, thus ensuring the dimensions of the three data sets are the same, and the selection of the auxiliary data set only requires aging features from the same SOC interval as the target domain data set. At the same time, there is no restriction that the auxiliary data must come from the same dataset as the target domain. Therefore, based on the battery pulse test data in different SOC intervals, in each scenario, the source domain, auxiliary data and target domain data can be divided as shown in
Table 6.
4.2. Battery SOH Estimation Based on Different SOC Ranges
When considering the battery SOH estimation based on sample transfer learning under different SOC intervals, six scenarios are considered and are divided as shown in
Table 7. Scenario 1 (
)–Scenario 3 (
) verify the battery SOH estimation based on a single-temperature mode, that is, although the source domain and the target domain data set come from different battery data, they remain in the same temperature mode. The battery SOH estimation under multi-temperature mode is reflected by Scenario 4 (
)–Scenario 6 (
), that is, the source domain and the target domain data set are derived from different temperature modes. The purpose is to verify the performance based on the idea that when using data in different temperature modes, the battery SOH estimation can still be achieved. In each scenario of this part, the source domain, auxiliary data and target domain data set are divided in six ways. Different from the construction method of the auxiliary data set under the complete SOC interval, here, there is no requirement that the auxiliary data must come from the same target domain data set. It is only required that the auxiliary data set and the target domain data set come from the aging features under the same SOC interval. This greatly reduces the demand for target domain label data and has more practical research significance.
4.3. Analysis and Discussion of Experimental Results
In this section, the battery SOH estimation under single-temperature and multi-temperature modes are analyzed, respectively. First, the experimental results under the single-temperature mode are analyzed. Based on
Section 3.2, the battery SOH estimation under the six division methods is verified in each scenario; secondly, compared with the battery SOH estimation framework under the complete SOC interval, the feature dimension of the model is not only reduced but also the demand for target domain label data is reduced. Therefore, in this section, the estimation results under the six partitioning methods will be compared with the estimation results based on the
method in the complete SOC interval of
Section 3.2. In addition, since the Tradaboost.R2 algorithm is an ensemble method, considering the final estimation result is synthesized based on other sub-models, the relative error between each sub-model and the final result is analyzed. Similarly, the battery SOH estimation under multi-temperature mode is also analyzed in the same way.
Figure 15 and
Figure 16 show the battery SOH estimation results for scenario 1 (
)–scenario 3 (
). It can be seen from
Figure 15 that in
and
, compared with the estimation results for the other five methods, the estimation results based on method 6 (
) are closer to the reference value, and its effect of tracking the reference value is more obvious. The absolute error also can reflect the better estimation effect is shown in
Figure 16b.
Table 8 and
Table 9 give three error values and coefficients of determination for
–
. It can be seen from these two tables that the optimal estimation results can be obtained based on
in
and
, while the optimal results in
are obtained by
. In the optimal estimation results of these three scenarios, the MAE does not exceed 0.75%, which is significantly smaller than the MAE under the other methods; at the same time, the MSE is lower than 0.0095% and the corresponding RMSE does not exceed 0.98%; compared with the complete SOC interval based on
method, the optimal estimation errors of the three scenarios are smaller than the estimation errors based on the
method. On the other hand, in terms of verifying the model fitting effect, the coefficient of determination
of the optimal result reaches 90% in all three scenarios. Compared with the coefficient of determination value based on the
method, it can be seen that the model fitting effect based on different SOC intervals is better than the model effect based on the complete SOC interval. This can also show that while the feature dimension and the demand for label data in the target domain are reduced, the battery SOH estimation accuracy has not been reduced.
Table 8 and
Table 9 present the error estimation results of the battery SOH under different SOC ranges. In order to ensure that the mapping relationship of the lithium-ion battery SOH estimation model in different SOC intervals is not affected, it adopts the way of dividing the data set in
Table 6 and
Table 7.
After analyzing the battery SOH estimation results in single-temperature mode, this part starts to analyze the multi-temperature mode. We select battery No.4 in mode 2 as the target domain and use the three batteries No.1, No.2 and No.3 in mode 1 as the source domain to verify the battery SOH estimation under multi-temperature mode.
Figure 17 and
Figure 18 show the battery SOH estimation results for scenario 4 (
)–scenario 6 (
). It can be seen from
Figure 17 that in
and
, comparing with the estimation results of the other five methods, the estimation results based on method 5 (
) are closer to the reference value, and its effect of tracking the reference value is more obvious, which the absolute error also can reflect the better estimation effect that shown in
Figure 18b. Completing the verification of the estimation results of three scenarios under the multi-temperature mode, which demonstrates that the method proposed in this paper has stronger generalization ability in estimating battery SOH under different temperature modes.
Table 9 and
Table 10 give three error values and coefficients of determination for
–
. It can be seen from these two tables that the optimal estimation results can be obtained based on
in
and
, while the optimal results in
are obtained by
. In the optimal estimation results of these three scenarios, the MAE does not exceed 1.2%, which is significantly smaller than the MAE under other methods; at the same time, the MSE is lower than 0.02% and the corresponding RMSE does not exceed 1.4%. Compared with the complete SOC interval based on the
method, the optimal estimation errors of the three scenarios are smaller than the estimation errors based on the
method. On the other hand, in terms of verifying the model fitting effect, the coefficient of determination
of the optimal result reaches 92% in all three scenarios. Therefore, no matter in terms of battery SOH estimation accuracy or model fitting effect, the optimal results based on different SOC intervals are better than complete SOC intervals.
Based on the above discussion, it can be proved from three aspects that the method proposed in this paper can achieve higher accuracy in battery SOH estimation:
- (1)
In the comparison model that combines various errors to depict the performance of the method, we use the estimated results based on GPR, SVM, CNN-LSTM, Adaboost and without using the transfer method under the same conditions to compare with the Tradaboost.R2 algorithm estimated results. The various error results show that the estimated results based on the Tradaboost.R2 algorithm are better than other algorithms.
- (2)
The performance of model fitting is measured by the coefficient of determination . The better the model fitting performance, the closer its value is to 1. The values of the coefficient of determination prove that the estimated results based on the Tradaboost.R2 algorithm are better than other algorithms.
- (3)
This paper proposes a lithium-ion battery SOH estimation framework based on the Tradaboost.R2 method under pulse testing. The proposed method can ensure that the mapping relationship of the lithium-ion battery SOH estimation model in different SOC intervals is not affected.
- (4)
By verifying the estimation results based on the complete SOC interval and different SOC intervals, it can be seen that while the model feature dimension and the demand for target domain label data is reduced, the estimation accuracy of the model can still be guaranteed.