Incremental Capacity Analysis on Commercial Lithium-Ion Batteries using Support Vector Regression: A Parametric Study

: Incremental capacity analysis (ICA) has been used pervasively to characterize the degradation mechanisms of the lithium-ion batteries, and several online state-of-health estimation algorithms are built based on ICA. However, the stairs and the noises in the discrete sampled voltage data obstruct the calculation of the capacity differentiation over voltage (d Q /d V ), therefore we need methods to ﬁt the sampled voltage ﬁrst. In this paper, the support vector regression (SVR) algorithm is used to smooth the sampled voltage curve using Gaussian kernels. A parametric study has been conducted to show how to enhance the performances of the SVR algorithm, including (1) speeding up the algorithm by downsampling; (2) avoiding overﬁtting and under-ﬁtting using proper standard deviation σ in the Gaussian kernel; (3) making precise capture of the characteristic peaks. A novel method using linear approximation has been proposed to help judge the accuracy of the SVR algorithm in tracking the ICA peaks. And advanced SVR algorithms using double σ and using cost function that directly regulates the differentiation result have been proposed. The advanced SVR algorithm can make accurate curve ﬁtting for ICA with overall error less than 1% (maximum 3%) throughout cycle lives, for four kinds of commercial lithium-ion batteries with LiFePO 4 and LiNi x Co y Mn z O 2 cathodes, making it promising to be further applied in online SOH estimation algorithms.


Introduction
The market of electric vehicles is growing at an unprecedented rate in recent years, driven by the motivations to find cleaner energy powertrain systems for future transportations [1][2][3].Battery energy storage system, which the heart of the energy powertrain systems of electric vehicles, must have considerable energy density and long cycle life to guarantee the desired performances of electric vehicles.Lithium-ion batteries stand out from other kinds of energy storage technologies, because of their better energy efficiency and longer cycle life [4][5][6][7].
One challenge for the application of lithium-ion batteries in electric vehicles is the state-of-health (SOH) monitoring [8,9].The SOH is 'a state' that reflects the aging condition in comparison with Energies 2018, 11, 2323 2 of 21 a fresh battery.The quantitative definition of SOH can be founded on the battery capacity [10,11] or the impedance [12][13][14].The SOH monitoring is regarded as one of the key problems for battery management, because the SOH affects the accurate estimation of battery states, thereby determining the capability of power output of the battery system [15][16][17].
Incremental capacity analysis (ICA) is a method that is being pervasively used in battery SOH diagnosis in recent years, after its first proposal by Balewski & Brenet [18] and propagated by Dubarry et al. [19,20].The ICA technique is performed by differentiating the capacity Q with respect to the voltage V, therefore the voltage plateaus that reflects the thermodynamics of battery will be transformed into dQ/dV peaks, which are convenient for degradation analysis [21,22].dQ/dV is also called the incremental capacity (IC).The ICA is capable of revealing the mechanism of both the capacity loss and the impedance increment [23][24][25], and can be further applied for online SOH monitoring [26,27].
The major difficulty in performing the ICA comes from processing the differentiations of the discretized voltage data with sampling levels and noise.The sampling levels and noises can lead to large fluctuations in the IC curves, since differentiating discretized voltage at plateaus will return values near zero and infinity [28].Curve fitting seems to be a must to perform ICA.A typical curve fitting method is to find continuous functions that can fit the discretized voltage data with minimum error.Then the differentiations can be acquired by differentiating the fitted continuous functions.There several available curve fitting methods [29][30][31], among which the SVR method is a kind of technique that comes from modern theory of machine learning (ML).Usually the ML-based methods are used for predicting the capacity loss [32][33][34], rather than fitting the curves of battery voltage.Weng et al. [35] firstly used the SVR to fit the battery voltage curves, and conducted ICA by differentiating the fitted functions.The SVR method is believed to have good properties: (1) insensitive to measurement noises; (2) robust to data range and size; (3) effective in extracting a signature that shows strong dependence to battery age, making it promising for further online ICA on diagnosing the battery SOH.
However, the SVR method in [35] was not widely propagated for SOH diagnosis after its first publication, because the influences of the key parameters in the SVR algorithm on the IC results have not been clarified.The other researchers may find it difficult to learn the usage of the SVR method, because they may encounter problems such as: (1) how to reduce the computational time of the SVR method; (2) how to judge the accuracy of the curve fitting by the SVR method; (3) does the SVR method fit for lithium-ion cells with other chemistries other than lithium phosphate etc.Those problems hindered a wider application of the SVR method in processing ICA.
This paper dedicates to solve the problems that hindered the wider application of the SVR method in processing ICA.Parametric studies are conducted to investigate the influence of the key parameters in the SVR algorithm on the performance of curve fitting.The relationship between the computational time and the data size is clarified.The influence of the standard deviation σ in the Gaussian kernel on the accuracy of curve fitting has been investigated.The principle for selecting proper σ to avoid over-fitting and under-fitting is proposed.Improvements of the previous SVR algorithm have been proposed by (1) using double σ to adapt to the curve shape with different curvatures; (2) changing the cost function for optimization.The proper parameters are double checked by the degradation data for commercial lithium-ion cells with LFP = LiFePO 4 and with NCM = Li(Ni x Co y Mn 1−x−y )O 2 , recommended for further usage of ICA.

Experimental
Degradation data are collected to check the validity and universality of the proposed SVR method.The data come from the University of Michigan, Ann Arbor (UoM) and Tsinghua University (THU).The details for the accelerating degradation tests can be found in the marked references.The current listed in the Table 1 refers to that used to generate the voltage curve that needs to be fitted by the SVR method.Figure 1 shows the voltage profiles for the four commercial lithium-ion cells after being Energies 2018, 11, 2323 3 of 21 cycled by hundreds of times.The voltage profiles in Figure 1 are to be fitted by the SVR method for further derivation of IC curves.Two types of batteries have cathode of LFP, whereas the other two have the cathode with NCM.The mass ratio of Ni:Co:Mn for the batteries with NCM cathode is 1:1:1.

The Canonical Form of the Support Vector Regression
The SVR method is an application of the support vector machine in artificial intelligence, which was initially developed by Drucker et al. at the AT&T Bell Laboratories, for data regression [37].Assume x and y represent the input and output data to be fitted, respectively.Here for battery sampling data, x can be the time τ, the charged/discharged capacity Q, or the state-of-charge SOC, whereas y is the sampled voltage.X = {x1, x2, x3… xn} and Y = {y1, y2, y3… yn} are the data set of x and y, respectively.n is the total number of data for curve fitting.For the SVR method, the output data y is fitted using a linearized parametric model: where , , , ,... , vector for kernels.The arbitrary kernel function κ(xk, x) is defined as:

The Canonical Form of the Support Vector Regression
The SVR method is an application of the support vector machine in artificial intelligence, which was initially developed by Drucker et al. at the AT&T Bell Laboratories, for data regression [37].Assume x and y represent the input and output data to be fitted, respectively.Here for battery sampling data, x can be the time τ, the charged/discharged capacity Q, or the state-of-charge SOC, whereas y is the sampled voltage.X = {x 1 , x 2 , x 3 . . .x n } and Y = {y 1 , y 2 , y 3 . . .y n } are the data set of x and y, respectively.n is the total number of data for curve fitting.For the SVR method, the output data y is fitted using a linearized parametric model: where x), κ(x 2 , x), . . .κ(x n , x)] T is the vector for kernels.The arbitrary kernel function κ(x k , x) is defined as: where σ, which is the standard deviation of the Gaussian kernel, is a preset parameter to control the shape of the κ(x k , x).Define ε as the precision parameter that set some tolerances for the fitting error, and slack variables ξ + k and ξ − k to cope with infeasible constraints: Figure 2 shows the graphical definition of ε, ξ + k and ξ − k for the SVR algorithm.The region of green belt, looking like a tube, indicates that there will be no penalty in the cost function when the fitted curve is quite close to the raw data.
, exp 2σ where σ, which is the standard deviation of the Gaussian kernel, is a preset parameter to control the shape of the κ(xk, x).Define ε as the precision parameter that set some tolerances for the fitting error, and slack variables k ξ + and k ξ − to cope with infeasible constraints: Figure 2 shows the graphical definition of ε, k ξ + and k ξ − for the SVR algorithm.The region of green belt, looking like a tube, indicates that there will be no penalty in the cost function when the fitted curve is quite close to the raw data.
where w = 100 is the weighting factor, ||•||1 denotes the l1 norm in the coefficient space.
To further reformulate Equation (4) into a canonical form of Linear Programming (LP) problem, the coefficient β should be decomposed into: Then the SVR using l 1 regularization formulates the optimization problem as follows: Costfunction#1 : min where w = 100 is the weighting factor, ||•|| 1 denotes the l 1 norm in the coefficient space.
To further reformulate Equation (4) into a canonical form of Linear Programming (LP) problem, the coefficient β should be decomposed into: where α + k and α − k are nonnegative and satisfy α + k • α − k = 0. Then the SVR problem in Equation ( 4) can be further reformulated into the carnonical form of LP problem: min Energies 2018, 11, 2323 and The optimization problem in Equation ( 6) can be solved using the function linprog in MATLAB (2016a ® ).The computational complexity of the problem is Θ(n 2 ).n is usually large for battery test data, therefore it should be resampled with a distance of D, to downsize the optimization to n/D, and the computational complexity will be reduced to Θ(n 2 /D 2 ).For example, assume that the raw data R is R = {r 1 , r 2 , . . ., r n }, then the resampled data will be S = {s j | s j = r j•D , j = 1,2, . . ., [n/D]} = {r D , r 2×D , . . ., r [n/D]×D }.
The optimal result usually give near-zero value for most of the β i , and those β i that are much larger than zero (>10 −4 ) are regarded as significant, and corresponding x i is called the support vector sv i , of which the total number is N sv .Therefore the model for curve fitting is built as: Figure 3 illustrates the selection of sv i according to the optimal solution found by the Cost Function #1 in Equation (4).There are 22 sv i that has β i >10 −4 , therefore N sv = 22 is not that large comparing with the original data size n ~10 4 .Note that some sv i locate close with each other, therefore it looks that there are only 15 separate sv i (marked with cyan diamonds) as in Figure 3.

Double σ to Enhance the Quality of Curve Fitting
In Reference [35] a fixed σ is used for the SVR method, however, sometimes the SVR method cannot always make good fit for a full range of SOC with a fixed σ in application.The reason for the problem is that the shape of the kernel κ(xk, x) is mainly controlled by σ (as shown in Figure 4a), although its coefficient β can regulate the shape a little bit.If the shape of the original voltage matches that of the kernel, the SVR method with a single fixed σ will have a good fit result, otherwise in some SOCs the SVR method will return overfitted or underfitted results.On this account, the SVR method in Reference [35] has been upgraded for fitting the data with different curvatures.That is the SVR algorithm using double σ in the Gaussian kernel.Reformulating the Equation (1) with two different kernels κ1(x) and κ2(x): , , where the Gaussian kernels yields to:

Double σ to Enhance the Quality of Curve Fitting
In Reference [35] a fixed σ is used for the SVR method, however, sometimes the SVR method cannot always make good fit for a full range of SOC with a fixed σ in application.The reason for the problem is that the shape of the kernel κ(x k , x) is mainly controlled by σ (as shown in Figure 4a), although its coefficient β can regulate the shape a little bit.If the shape of the original voltage matches that of the kernel, the SVR method with a single fixed σ will have a good fit result, otherwise in some SOCs the SVR method will return overfitted or underfitted results.On this account, the SVR method in Reference [35] has been upgraded for fitting the data with different curvatures.That is the SVR algorithm using double σ in the Gaussian kernel.

Double σ to Enhance the Quality of Curve Fitting
In Reference [35] a fixed σ is used for the SVR method, however, sometimes the SVR method cannot always make good fit for a full range of SOC with a fixed σ in application.The reason for the problem is that the shape of the kernel κ(xk, x) is mainly controlled by σ (as shown in Figure 4a), although its coefficient β can regulate the shape a little bit.If the shape of the original voltage matches that of the kernel, the SVR method with a single fixed σ will have a good fit result, otherwise in some SOCs the SVR method will return overfitted or underfitted results.On this account, the SVR method in Reference [35] has been upgraded for fitting the data with different curvatures.That is the SVR algorithm using double σ in the Gaussian kernel.Reformulating the Equation (1) with two different kernels κ1(x) and κ2(x): , , where the Gaussian kernels yields to: Reformulating the Equation (1) with two different kernels κ 1 (x) and κ 2 (x): where the Gaussian kernels yields to: Energies 2018, 11, 2323 Here there are two different σ, for which σ 1 < σ 2 .The kernel with a smaller σ is used to fit the voltage segment with big curvatures, whereas that with a larger σ is used to fit those with small curvatures, as shown in Figure 4b.The correlated functions for optimal solution should be changed accordingly, however, they will not be listed here to save contents.

Criterion for Evaluating the Quality of Curve Fitting by SVR
The accuracy of the SVR method should be judged in order to fulfill further improvement.Here we are proposing a new criterion using linear approximation of discretized sampling data to judge the accuracy of the curve fitting.Further discussion on evaluating the quality of the curve fitting by SVR will be based on the criterion of linear approximation.Figure 5 illustrates the criterion for evaluating the quality of curve fitting by SVR, according to the property of the discretized sampling data with stairs and noises.Figure 5a shows a common case of the voltage sampling data, with clear stair-like shape caused by digital sampling.The actual differentiation y' = dy/dx at Y i is approximately the slope of the red line, or tanθ, as calculated in Equation ( 12): where N Yi is the number of points that satisfy y k = Y i in the output data sequence of Y = {y 1 , y 2 , y 3 . . .y n }, δY and δX are the resolution of sampling for the output and input data, respectively.Here δY = 1 mV and δX = 0.1 s, representing a sampled data sequence in a real case.Moreover, for Figure 5a, the y' at y k = Y i should be 10 −3 , taking that N Yi = 10.Therefore in an ideal case, the SVR method should return a result of ŷ = f (x) close to 10 −3 for all the x k that have y k = Y i .Figure 5b demonstrates the ideal curve fitting case for the SVR method.Both the ŷ = f (x) and ŷ = f (x) (yellow line in Figure 5b) looks smooth when they cross those points with y k = Y i , and the fitted differentiation ŷ = f (x) is close to the slope tanθ that we have calculated from Figure 5a.
Energies 2018, 11, x FOR PEER 7 of 21 Here there are two different σ, for which σ1 < σ2.The kernel with a smaller σ is used to fit the voltage segment with big curvatures, whereas that with a larger σ is used to fit those with small curvatures, as shown in Figure 4b.The correlated functions for optimal solution should be changed accordingly, however, they will not be listed here to save contents.

Criterion for Evaluating the Quality of Curve Fitting by SVR
The accuracy of the SVR method should be judged in order to fulfill further improvement.Here we are proposing a new criterion using linear approximation of discretized sampling data to judge the accuracy of the curve fitting.Further discussion on evaluating the quality of the curve fitting by SVR will be based on the criterion of linear approximation.Figure 5 illustrates the criterion for evaluating the quality of curve fitting by SVR, according to the property of the discretized sampling data with stairs and noises.Figure 5a shows a common case of the voltage sampling data, with clear stair-like shape caused by digital sampling.The actual differentiation y' = dy/dx at Yi is approximately the slope of the red line, or tanθ, as calculated in Equation ( 12): where    The selection of δY is important for the shape of the IC curve calculated by the linear approximation.Define δR as the real resolution of the discrete sampling, δY should be the multiples of δR (δY = K•δR, K is an integer) in order to avoid fluctuations caused by unfair counting.Unfair counting means that if δY ≠ K•δR, sometimes δY contains K•δR, while sometimes δY contains (K + 1)•δR, leading to inevitable fluctuations in the processed results for linear approximation.As the sampling accuracies of voltage in this paper are all 1 mV, δY should be the multiples of 1mV for calculating the linear approximation.Larger δY can reduce the fluctuations in the differentiations results by the linear approximation, as shown in Figure 6.But too large δY will lead to loss of useful information of the original data.The tradeoff should be considered when selecting a proper δY.Therefore in this paper, δY = 2 mV for the cell with LFP cathode, whereas δY = 5 mV for the cell with NCM cathode.The selection of δY is important for the shape of the IC curve calculated by the linear approximation.Define δR as the real resolution of the discrete sampling, δY should be the multiples of δR (δY = K•δR, K is an integer) in order to avoid fluctuations caused by unfair counting.Unfair counting means that if δY = K•δR, sometimes δY contains K•δR, while sometimes δY contains (K + 1)•δR, leading to inevitable fluctuations in the processed results for linear approximation.As the sampling accuracies of voltage in this paper are all 1 mV, δY should be the multiples of 1 mV for calculating the linear approximation.Larger δY can reduce the fluctuations in the differentiations results by the linear approximation, as shown in Figure 6.But too large δY will lead to loss of useful information of the original data.The tradeoff should be considered when selecting a proper δY.Therefore in this paper, δY = 2 mV for the cell with LFP cathode, whereas δY = 5 mV for the cell with NCM cathode.

Changing the Cost Functions to Improve the Accuracy of Incremental Capacity Analysis
The slack variables that are critical in the Cost Function #1 (Equation ( 4)) come from Equation (3), with the sampled sequence y as the data to be fitted.y', which is linearly approximated in Equation ( 12), can be used as a new reference for defining the new slack variables k ς in Equation ( 13).
where the reference differentiation y' is marked as y'*, in order to be distinguished from the real differentiation of y.In Equation (13), γ is the precision parameter that sets tolerance for the fitting error, and slack variables k ς + and k ς − are used to cope with infeasible constraints.Furthermore, the fitting function should be: where μ is omitted due to the first-order differentiation, κ' is the first-order differentiation of the Gaussian kernel functions, as defined in Equation (15).
Then the Cost Function #2, which is established according to the deviations in the first-order differentiation can be built by Equation ( 16).
( ) where λ = 100 is the weighting factor.Similarly with that in Equation ( 5), the coefficient β is decomposed into nonnegative k α + and k α − , satisfying , in order to form a carnonical form of LP problem.Moreover, the SVR problem with Cost Function #2 can be reformulated into the

Changing the Cost Functions to Improve the Accuracy of Incremental Capacity Analysis
The slack variables that are critical in the Cost Function #1 (Equation ( 4)) come from Equation (3), with the sampled sequence y as the data to be fitted.y', which is linearly approximated in Equation ( 12), can be used as a new reference for defining the new slack variables ς k in Equation (13).
where the reference differentiation y' is marked as y'*, in order to be distinguished from the real differentiation of y.In Equation (13), γ is the precision parameter that sets tolerance for the fitting error, and slack variables ς + k and ς − k are used to cope with infeasible constraints.Furthermore, the fitting function should be: where µ is omitted due to the first-order differentiation, κ' is the first-order differentiation of the Gaussian kernel functions, as defined in Equation (15).
Then the Cost Function #2, which is established according to the deviations in the first-order differentiation can be built by Equation (16).
CostFunction#2 : min where λ = 100 is the weighting factor.Similarly with that in Equation ( 5), the coefficient in order to form a carnonical form of LP problem.Moreover, the SVR problem with Cost Function #2 can be reformulated into the carnonical form of the LP problem as in Equation (6).And the vectors and matrices are changed from Equation (7) into Equation ( 17): and Solving the LP problem in Equations ( 16)-( 18), the fitted first-order differentiation can be: which is the direct fitting results of the IC by the SVR algorithm.

Result and Discussions
The performances of curve-fitting by the SVR algorithm will be discussed in this section.The reasons of over-fitting and under-fitting have been studied in details, and correlated improvements to avoid those problems are discussed.

The Influence of the Data Length on the Performances of the SVR Algorithm
Parametric studies are conducted on the influence of the data length n on the goodness of fitting and on the computational time of the SVR algorithm.Here we choose σ = 0.20 for the SVR algorithm, identical with that set in Reference.[35]. Figure 7a shows that as D increases, the data length n/D used as the input of the SVR algorithm decreases from approximately 3000 to 100.There is always under-fitting at the SOC near 0% and 100% for all data lengths.And the degree of under-fitting will become worse for the algorithm with shorter data lengths.The phenomenon is similar for both the Cell A with LFP cathode and Cell B with NCM cathode, because similar phenomenon can be seen in both Figure 7a,b.Figure 7c shows that the IC curves look similar for 200 < n/D < 3000 for Cell A with LFP cathode.However, for n/D < 200, the characteristic IC peak located between 3.35 V and 3.4 V disappears.Therefore downsampling with too few points of data is not acceptable to keep the accuracy of IC curves.Similar phenomenon can be observed in Figure 7d for Cell B with NCM cathode.
The data length also determines the computational time of the SVR algorithm, as shown in Figure 7e.As the computational complexity for the LP problem in Equation ( 6) is Θ(n 2 ), the downsampling will help save much time in computation by reducing the computational complexity to Θ (n 2 /D 2 ).The computational times for the SVR algorithm that is used to conduct curve-fitting for cell with LFP and NCM cathode have been compared in Figure 7e.The workstation that is used for the SVR algorithm has an Intel ® Core™ i7-6820HK CPU @2.70 GHz with a RAM of 16.0 GB, and the version of the MATLAB ® used for computation is R2016a.The computational times display shapes of polynomial n 2 as in Figure 7e as expected, therefore n must be well-controlled within a lower range to save the time used for curve fitting.to Θ (n 2 /D 2 ).The computational times for the SVR algorithm that is used to conduct curve-fitting for cell with LFP and NCM cathode have been compared in Figure 7e.The workstation that is used for the SVR algorithm has an Intel ® Core™ i7-6820HK CPU @2.70 GHz with a RAM of 16.0 GB, and the version of the MATLAB ® used for computation is R2016a.The computational times display shapes of polynomial n 2 as in Figure 7e as expected, therefore n must be well-controlled within a lower range to save the time used for curve fitting.In summary, there is a trade-off between the computational time and the accuracy of curve fitting when selecting a proper data length n for the SVR algorithm.Nevertheless, the accuracy of the SVR algorithm in fitting the IC curve is not majorly determined by n as long as there is enough data In summary, there is a trade-off between the computational time and the accuracy of curve fitting when selecting a proper data length n for the SVR algorithm.Nevertheless, the accuracy of the SVR algorithm in fitting the IC curve is not majorly determined by n as long as there is enough data that can reflect the overall shape of the voltage curve.Later discussions show that the accuracy of the SVR algorithm is much more influenced by the σ and the cost function.Improvements will be fulfilled by selecting proper σ and cost function.Therefore the computational time has higher priority when selecting the divider D for downsampling.The data length n/D is controlled within 400-500 points for further discussion, balancing both the computational time and accuracy of curve fitting.

The Influence of the σ in the Gaussian Kernel on the Performance of the SVR Algorithm
The σ in the Gaussian kernel (Equation ( 2)) controls the shape of individual support vectors, and the fitted curve ŷ is a combination of several kernels, therefore σ influences the accuracy of curve fitting by the SVR algorithm.Figure 8a,b illustrate the curve fitting results of the voltage for Cell A with LFP cathode and Cell B with NCM cathode, respectively.The problem of under-fitting as shown in Figure 7a,b have been solved by selecting smaller σs.For Cell A with LFP cathode, σ = 0.06 is much better than σ = 0.20, whereas for Cell B with NCM cathode, σ = 0.08 is much better than σ = 0.20.The reason that why the SVR algorithm with smaller σ performs better than that with large σ is that the basic curvatures formed by support vectors with smaller σ can be finer than those with larger σ.The number of support vectors (N sv ) will increase for the SVR algorithm with smaller σ, because the algorithm is trying to capture the finer details of the voltage curves.that can reflect the overall shape of the voltage curve.Later discussions show that the accuracy of the SVR algorithm is much more influenced by the σ and the cost function.Improvements will be fulfilled by selecting proper σ and cost function.Therefore the computational time has higher priority when selecting the divider D for downsampling.The data length n/D is controlled within 400-500 points for further discussion, balancing both the computational time and accuracy of curve fitting.

The Influence of the σ in the Gaussian Kernel on the Performance of the SVR Algorithm
The σ in the Gaussian kernel (Equation ( 2   However, overfitting will appear if too small σ is chosen in the SVR algorithm both for the Cell A with LFP cathode and for the Cell B with NCM cathode, as shown in Figure 8c,d.The mechanism of overfitting in the IC curves can be explained by Figure 5c,d.The SVR algorithm has seemingly stronger capability to capture finer details of the voltage curves with smaller σ, therefore they will reflect the stairs and noises in the IC results.Overfitting is inevitable when extremely small σ is set in the SVR algorithm.Here when σ < 0.04, overfitting will occur for both cells with LFP cathode and NCM cathode.On the other hand, too large σ will lead to under-fitting in the voltage curves especially for SOC near 0% and 100%, as we already know from Figure 8a,b.Some IC peaks will disappear for the SVR algorithm with σ > 0.10.Therefore if we wish to use single σ in the SVR algorithm, a proper range can be 0.04 < σ < 0.10.Here we choose σ = 0.06 for Cell A with LFP cathode, and σ = 0.08 for Cell B with NCM cathode in the following discussions.

Double σ to Enhance the Accuracy of the SVR Algorithm
Overfitting or underfitting is still possible to occur because the shape of the voltage curves will change during degradation after cycling.If overfitting or under-fitting only appears when curve fitting is conducted for the cycled data, it means that the fixed σ for the fresh cell no longer fits for the changed curvature in the cycled voltage curve.Large σ fits for voltage curve with small curvatures, whereas small σ fits for that with big curvatures.As the σ is preset and fixed in Section 4.2, the problem caused by cell degradation is usually possible for the SVR algorithm.One way to solve the problem is to use double σ, of which the smaller one is used to fit the arc shape with big curvature, whereas the larger one to fit that with small curvature.The mathematical formulation of the SVR algorithm using double σ can be referred in Section 3.2.
Figure 9a,b show the curve fitting results for Cell A and Cell B using the SVR algorithm with double σ, respectively.The support vectors with larger σ is prone to be located at the SOC near 0% and 100%, also at the position with small curvature.The sitting of the support vectors with larger σ (σ 2 = 0.20) help smooth out the curve fitting results, with lower possibility to have overfitting problems, as those shown in Figure 8.Moreover, the voltage curve is still mainly "supported" by the smaller σ (σ 1 = 0.06 for LFP cell, and 0.08 for NCM cell) with high accuracy.Figure 9c,d display the IC curves derived from fitting voltage for Cell A with LFP cathode and Cell B with NCM cathode, respectively.The overfitting problem still occurs at minor positions for the SVR algorithm with single σ, as in the magnified figures in Figure 9c,d.Note that without magnification, the overfitting problem in the IC curves cannot be noticed from Figure 8c,d.The fluctuations in the IC curves have been successfully smoothed out with the help of double σ in the SVR algorithm.Hence, utilizing double σ, one smaller for the accuracy of fitting and another bigger one for avoiding overfitting, can substantially improve the quality of voltage curve fitting for lithium-ion batteries.Up to now the SVR algorithm in Reference [35] has been herein improved with better accuracy in curve fitting and with lower possibility of overfitting.The SVR algorithm in Reference [35] cannot capture all the voltage curvatures at all SOCs, but with double σ, the problem has been solved.After checking the curve fitting results for the cells before and after degradation, we recommend that the double σ can be set as {σ 1 = 0.06, σ 2 = 0.20} for LFP cells, and {σ 1 = 0.08, σ 2 = 0.20} for NCM cells.
Energies 2018, 11, x FOR PEER 13 of 21 However, overfitting will appear if too small σ is chosen in the SVR algorithm both for the Cell A with LFP cathode and for the Cell B with NCM cathode, as shown in Figure 8c,d.The mechanism of overfitting in the IC curves can be explained by Figure 5c,d.The SVR algorithm has seemingly stronger capability to capture finer details of the voltage curves with smaller σ, therefore they will reflect the stairs and noises in the IC results.Overfitting is inevitable when extremely small σ is set in the SVR algorithm.Here when σ < 0.04, overfitting will occur for both cells with LFP cathode and NCM cathode.On the other hand, too large σ will lead to under-fitting in the voltage curves especially for SOC near 0% and 100%, as we already know from Figure 8a,b.Some IC peaks will disappear for the SVR algorithm with σ > 0.10.Therefore if we wish to use single σ in the SVR algorithm, a proper range can be 0.04 < σ < 0.10.Here we choose σ = 0.06 for Cell A with LFP cathode, and σ = 0.08 for Cell B with NCM cathode in the following discussions.

Double σ to Enhance the Accuracy of the SVR Algorithm
Overfitting or underfitting is still possible to occur because the shape of the voltage curves will change during degradation after cycling.If overfitting or under-fitting only appears when curve fitting is conducted for the cycled data, it means that the fixed σ for the fresh cell no longer fits for the changed curvature in the cycled voltage curve.Large σ fits for voltage curve with small curvatures, whereas small σ fits for that with big curvatures.As the σ is preset and fixed in Section 4.2, the problem caused by cell degradation is usually possible for the SVR algorithm.One way to solve the problem is to use double σ, of which the smaller one is used to fit the arc shape with big curvature, whereas the larger one to fit that with small curvature.The mathematical formulation of the SVR algorithm using double σ can be referred in Section 3.2.
Figure 9a,b show the curve fitting results for Cell A and Cell B using the SVR algorithm with double σ, respectively.The support vectors with larger σ is prone to be located at the SOC near 0% and 100%, also at the position with small curvature.The sitting of the support vectors with larger σ (σ2 = 0.20) help smooth out the curve fitting results, with lower possibility to have overfitting problems, as those shown in Figure 8.Moreover, the voltage curve is still mainly "supported" by the smaller σ (σ1 = 0.06 for LFP cell, and 0.08 for NCM cell) with high accuracy.Figure 9c,d display the IC curves derived from fitting voltage for Cell A with LFP cathode and Cell B with NCM cathode, respectively.The overfitting problem still occurs at minor positions for the SVR algorithm with single σ, as in the magnified figures in Figure 9c,d.Note that without magnification, the overfitting problem in the IC curves cannot be noticed from Figure 8c,d.The fluctuations in the IC curves have been successfully smoothed out with the help of double σ in the SVR algorithm.Hence, utilizing double σ, one smaller for the accuracy of fitting and another bigger one for avoiding overfitting, can substantially improve the quality of voltage curve fitting for lithium-ion batteries.Up to now the SVR algorithm in Reference [35] has been herein improved with better accuracy in curve fitting and with lower possibility of overfitting.The SVR algorithm in Reference [35] cannot capture all the voltage curvatures at all SOCs, but with double σ, the problem has been solved.After checking the curve fitting results for the cells before and after degradation, we recommend that the double σ can be set as {σ1 = 0.06, σ2 = 0.20} for LFP cells, and {σ1 = 0.08, σ2 = 0.20} for NCM cells.

Changing the Cost Function to Improve the Accuracy of ICA Using the SVR Algorithm
Up to now, the accuracy of curve fitting by the SVR algorithm has been greatly improved comparing with that in Reference [35].The following deficiencies of the SVR algorithm have been overcome: (1) the deficiency in accurately fitting the voltage throughout the full SOC ranges (from 0% to 100%); (2) the deficiency of possible overfitting or under-fitting that can be displayed in the IC curves; (3) the questionable capability of the SVR algorithm to fit the voltage curve for lithium-ion cells with NCM cathode.In this section, we are trying to further seek finer fitting of the voltage curves of lithium-ion batteries using the SVR algorithm.And a criterion for judging the accuracy of the curve fitting in the view of the differentiation has been proposed as in Equation ( 20): where RMSE means the overall root-mean-square-error for the differentiation ^' y calculated from the fitted function ŷ , the subscript i is the index of the data, and n is the data length.
The Cost Function #2 (Equation [15], which directly regulates the LP algorithm by the voltage differentiation rather than the voltage, is utilized to further improve the quality of the SVR algorithm as discussed in Sections 4.2 and 4.3.Not only the influence of the changes in the cost functions, but also the performances of choosing single σ and double σ with different cost functions have been compared.Therefore the performances of four kinds of SVR algorithms have been compared, as listed in Table 2.The discussion of the performances focus on the (1) fitting accuracy, with or without overfitting/under-fitting; (2) whether the algorithm can return a continuous function of the fitted curves, y = f(x) for the voltage curve and y' = f'(x) for the IC curve.Recommendations follow the correlated discussions for further usage of the algorithm in ICA.

Changing the Cost Function to Improve the Accuracy of ICA Using the SVR Algorithm
Up to now, the accuracy of curve fitting by the SVR algorithm has been greatly improved comparing with that in Reference [35].The following deficiencies of the SVR algorithm have been overcome: (1) the deficiency in accurately fitting the voltage throughout the full SOC ranges (from 0% to 100%); (2) the deficiency of possible overfitting or under-fitting that can be displayed in the IC curves; (3) the questionable capability of the SVR algorithm to fit the voltage curve for lithium-ion cells with NCM cathode.In this section, we are trying to further seek finer fitting of the voltage curves of lithium-ion batteries using the SVR algorithm.And a criterion for judging the accuracy of the curve fitting in the view of the differentiation has been proposed as in Equation ( 20): where RMSE means the overall root-mean-square-error for the differentiation ŷ calculated from the fitted function ŷ, the subscript i is the index of the data, and n is the data length.The Cost Function #2 (Equation [15], which directly regulates the LP algorithm by the voltage differentiation rather than the voltage, is utilized to further improve the quality of the SVR algorithm as discussed in Sections 4.2 and 4.2.Not only the influence of the changes in the cost functions, but also the performances of choosing single σ and double σ with different cost functions have been compared.Therefore the performances of four kinds of SVR algorithms have been compared, as listed in Table 2.The discussion of the performances focus on the (1) fitting accuracy, with or without overfitting/under-fitting; (2) whether the algorithm can return a continuous function of the fitted curves, y = f (x) for the voltage curve and y' = f '(x) for the IC curve.Recommendations follow the correlated discussions for further usage of the algorithm in ICA.10a,b) come from y' approximated by Equation (12).Obvious deviations can be seen for the SVR algorithm with single σ and Cost Function #1 for both cases, called as "Fair" in Table 2. Improvements can be seen for the SVR algorithm with double σ and Cost Function #1, as we expected from the discussion in Section 4.3, called as "Good" in Table 2. Further improvements can be seen for the SVR algorithm with Cost Function #2, regardless of the number of σ that is used in the SVR algorithm, called as "Better" in Table 2.The further improvements in the IC curve using Cost Function #2 come from that the SVR algorithm directly regulates the estimation error by the Ref IC, which we believe to be the best linear approximation by Equation (12).However, the price paid for using Cost Function #2 is that we cannot rebuild the voltage curve y = f (x), because the offset constant µ has been erased from the function of y' = f'(x) when calculating the first-order differentiation.
cases, called as "Fair" in Table 2. Improvements can be seen for the SVR algorithm with double σ and Cost Function #1, as we expected from the discussion in Section 4.3, called as "Good" in Table 2. Further improvements can be seen for the SVR algorithm with Cost Function #2, regardless of the number of σ that is used in the SVR algorithm, called as "Better" in Table 2.The further improvements in the IC curve using Cost Function #2 come from that the SVR algorithm directly regulates the estimation error by the Ref IC, which we believe to be the best linear approximation by Equation ( 12).However, the price paid for using Cost Function #2 is that we cannot rebuild the voltage curve y = f(x), because the offset constant μ has been erased from the function of y' = f'(x) when calculating the first-order differentiation.
The SVR algorithms with "single σ + Cost Function #2" and "double σ + Cost Function #1" are recommended for further aging analysis based on ICA.The SVR algorithm with "single σ + Cost Function #1" is not recommended because its fair accuracy in fitting the IC curve.The SVR algorithm with "double σ + Cost Function #2" is also not recommended, because it has higher computational complexity than that with "single σ + Cost Function #2", while the two algorithms have similar accuracy in the view of RMSE.Moreover, if we focus on the location and height of the characteristic peaks in the IC curves as shown in Figure 10c,d, the SVR algorithm with "single σ + Cost Function #2" and with "double σ + Cost Function #2" also show similar results.Although the accuracy of the SVR algorithm with "double σ + Cost Function #1" is not as good as that with "single σ + Cost Function #2", it is still recommended because it can provide explicit continuous functions of the voltage curves, which is important to conduct further research on the curvatures of the voltage at any arbitrary SOC.The SVR algorithms with "single σ + Cost Function #2" and "double σ + Cost Function #1" are recommended for further aging analysis based on ICA.The SVR algorithm with "single σ + Cost Function #1" is not recommended because its fair accuracy in fitting the IC curve.The SVR algorithm with "double σ + Cost Function #2" is also not recommended, because it has higher computational complexity than that with "single σ + Cost Function #2", while the two algorithms have similar accuracy in the view of RMSE.Moreover, if we focus on the location and height of the characteristic peaks in the IC curves as shown in Figure 10c,d, the SVR algorithm with "single σ + Cost Function #2" and with "double σ + Cost Function #2" also show similar results.Although the accuracy of the SVR algorithm with "double σ + Cost Function #1" is not as good as that with "single σ + Cost Function #2", it is still recommended because it can provide explicit continuous functions of the voltage curves, which is important to conduct further research on the curvatures of the voltage at any arbitrary SOC.

ICA for Characterizing the Degradation of Lithium-ion Batteries Using the SVR Algorithm
The SVR algorithm with "single σ + Cost Function #2", that with "double σ + Cost Function #1", and that with "double σ + Cost Function #2" are further used to perform ICA to characterize the degradation of commercial lithium-ion batteries, as shown in Figure 11.Table 3 collects the statistics of features of the characteristic peaks.The accuracy of the IC results calculated by the SVR algorithm with "single σ + Cost Function #2" and "double σ + Cost Function #2" are quite similar, therefore we only show the results for "single σ + Cost Function #2" to save contents.

ICA for Characterizing the Degradation of Lithium-ion Batteries Using the SVR Algorithm
The SVR algorithm with "single σ + Cost Function #2", that with "double σ + Cost Function #1", and that with "double σ + Cost Function #2" are further used to perform ICA to characterize the degradation of commercial lithium-ion batteries, as shown in Figure 11.Table 3 collects the statistics of features of the characteristic peaks.The accuracy of the IC results calculated by the SVR algorithm with "single σ + Cost Function #2" and "double σ + Cost Function #2" are quite similar, therefore we only show the results for "single σ + Cost Function #2" to save contents.However, problem emerges when we try to found quantitative relationship between the SOH and the IC peaks, especially for the SVR algorithm with Cost Function #1.Quantified features of IC peaks as shown in Figure 12, including the peak location, the peak height, and the integrated area surrounding the peak have been used for establishing relationships between the IC peaks and the SOHs of lithium-ion cells in order to build online SOH estimation algorithms.In Reference [35], quantified relationship has been built between the peak height and the battery SOH.Table 3 shows the accuracy of the peak locations and peak heights using the SVR algorithm with different settings.The calculation of the relative error is conducted from the data in Figure 11, with the reference from the linear approximation by Equation (12).The characteristic peak is picked as long as the peak height increases/decreases monotonically with the SOH decreasing.Although the accuracy of the peak location can be precise with error less than 0.5%, the error in the peak height of the SVR algorithm with "double σ + Cost Function #1" can be as large as 15%-16%, whereas that with "single σ + Cost Function #2" can be up to 8%.The accuracy of the SVR algorithm with "double σ + Cost Function #1" is not acceptable for further quantified ICA analysis, if the peak height is regarded as an indicator.Figure 11 shows that the IC curves calculated by the two SVR algorithms are both close to the Ref IC calculated by the linear approximation in Equation (12).The RMSE for the SVR algorithm with "single σ + Cost Function #2" is usually slightly smaller than that with "double σ + Cost Function #1", indicating that Cost Function #2 is dominating the fitting accuracy of the IC curves.Using fixed σ in the SVR algorithm to fit the data throughout the full life cycle is acceptable, because the RMSE changes a little at different SOHs, for all the cells in Figure 11.Moreover, {σ 1 = 0.06, σ 2 = 0.20} is good for cells with LFP cathode, because the fitting results for Cell A and Cell C are quite similar.Similarly, {σ 1 = 0.08, σ 2 = 0.20} is good for the cells with NCM cathode, because the fitting results for Cell B and Cell D are quite similar.Other combinations of {σ 1 , σ 2 } are also possible, readers who have interest could check according to the discussions made in Sections 4.2 and 4.2.
However, problem emerges when we try to found quantitative relationship between the SOH and the IC peaks, especially for the SVR algorithm with Cost Function #1.Quantified features of IC peaks as shown in Figure 12, including the peak location, the peak height, and the integrated area surrounding the peak have been used for establishing relationships between the IC peaks and the SOHs of lithium-ion cells in order to build online SOH estimation algorithms.In Reference [35], quantified relationship has been built between the peak height and the battery SOH.Table 3 shows the accuracy of the peak locations and peak heights using the SVR algorithm with different settings.The calculation of the relative error is conducted from the data in Figure 11, with the reference from the linear approximation by Equation (12).The characteristic peak is picked as long as the peak height increases/decreases monotonically with the SOH decreasing.Although the accuracy of the peak location can be precise with error less than 0.5%, the error in the peak height of the SVR algorithm with "double σ + Cost Function #1" can be as large as 15%-16%, whereas that with "single σ + Cost Function #2" can be up to 8%.The accuracy of the SVR algorithm with "double σ + Cost Function #1" is not acceptable for further quantified ICA analysis, if the peak height is regarded as an indicator.Nevertheless, as the height of the peak calculated by the linear approximation might be influenced by the noise in the raw data, sometimes we can alter to the integrated area surrounding the peak, as the shaded area marked in Figure 12.The errors of the fitted IC curves can be well controlled under 3% through the full cycle life for the SVR algorithm with "single σ + Cost Function #2" for all kinds of commercial lithium-ion batteries, after introducing the integrated area near the IC peak as an SOH indicator.Table 4 lists that most of the error is less than 1%, unless the cell reaches the end of its life.However, the SVR algorithm with "double σ + Cost Function #1" still has an error as large as 10% for Cell A with LFP cells, although it has less error (up to 5%) than using the peak height as the SOH indicator.
In summary, the SVR algorithm with Cost Function #2 performs much better than those with Cost Function #1.And the SVR algorithm with Cost Function #2 is promising to be further used for online SOH estimation, helping build quantified relationship between the IC peaks and the battery SOH.For the SVR algorithm with "single σ + Cost Function #2", σ = 0.06 is good for cells with LFP cathode, whereas σ = 0.08 is good for cells with NCM cathode.A smaller σ can also cause overfitting in the results of the SVR algorithm with "single σ + Cost Function #2", the reason of which is similar  Nevertheless, as the height of the peak calculated by the linear approximation might be influenced by the noise in the raw data, sometimes we can alter to the integrated area surrounding the peak, as the shaded area marked in Figure 12.The errors of the fitted IC curves can be well controlled under 3% through the full cycle life for the SVR algorithm with "single σ + Cost Function #2" for all kinds of commercial lithium-ion batteries, after introducing the integrated area near the IC peak as an SOH indicator.Table 4 lists that most of the error is less than 1%, unless the cell reaches the end of its life.However, the SVR algorithm with "double σ + Cost Function #1" still has an error as large as 10% for Cell A with LFP cells, although it has less error (up to 5%) than using the peak height as the SOH indicator.
In summary, the SVR algorithm with Cost Function #2 performs much better than those with Cost Function #1.And the SVR algorithm with Cost Function #2 is promising to be further used for online SOH estimation, helping build quantified relationship between the IC peaks and the battery SOH.For the SVR algorithm with "single σ + Cost Function #2", σ = 0.06 is good for cells with LFP cathode, whereas σ = 0.08 is good for cells with NCM cathode.A smaller σ can also cause overfitting in the results of the SVR algorithm with "single σ + Cost Function #2", the reason of which is similar with that discussed in Section 4.2.The SVR algorithm with Cost Function #1 also has its own advantage that it can rebuild the voltage curve, for which that with Cost Function #2 cannot.The SVR algorithm with "double σ + Cost Function #1" can perform well in many cases, although sometimes the error can be as large as 10% for fitting the peak heights.One should choose proper settings of the SVR algorithm for specific applications, and the discussions in this paper can guide the selections of proper parameters for the SVR algorithms.

Conclusions
This paper studies the performances of the SVR algorithm on the ICA for commercial lithium-ion batteries.The SVR algorithm, which serves a curve fitting method using Gaussian kernel, can smooth the discrete sampled voltage data with continuous functions, thereby making further differentiations applicable for the ICA.Parametric studies have been conducted to improve the performance of the SVR algorithm in fitting the stair like sampled voltage from four kinds of commercial lithium-ion batteries.The computational time can be reduced by downsampling the raw data with proper resampling distance D. The possible problem of overfitting and under-fitting can be solved by choosing proper standard deviation σ in the Gaussian kernel in the SVR algorithm.Smaller σ is prone to cause overfitting, whereas larger σ is likely to bring under-fitting, therefore an SVR algorithm with double σ (one large and one small) is proposed to compensate the disadvantages of using single σ.The cost function, which is previously set to regulate the error in voltage fitting, has been further modified to directly regulate the error in the differentiations.The new cost function greatly improves the accuracy of the SVR algorithm in calculating the IC curves, making it promising to be further applied in online SOH estimation.The maximum error in the characteristic IC peaks calculated by the SVR algorithm with the new cost function can be less than 3% for all the data during the full cycle life of all the four kinds of lithium-ion batteries.After viewing all the curve fitting results, σ = 0.06 is good for cells with LFP cathode, whereas σ = 0.08 is good for cells with NCM cathode.

Table 1 .
Degradation tests of commercial lithium-ion batteries for validating the methodologies.The abbreviation for cathode means: (1) LFP = LiFePO 4 , (2) NCM = Li(Ni x Co y Mn 1−y )O 2 , (3) LMO = LiMn 2 O 4 ; whereas that for anode means: C = carbon or graphite based anode.The voltage ranges comply with the instructions provided by the manufacturers.cathode with NCM.The mass ratio of Ni:Co:Mn for the batteries with NCM cathode is 1:1:1.

Table 1 .Figure 1 .
Figure 1.The voltage profile for cells with different cycle numbers.(a) Battery A, charge; (b) battery B, charge; (c) battery C, charge; (d) battery D, discharge.

Figure 1 .
Figure 1.The voltage profile for cells with different cycle numbers.(a) Battery A, charge; (b) battery B, charge; (c) battery C, charge; (d) battery D, discharge.

Figure 2 .
Figure 2. The definition of ε, k ξ + and k ξ − for support vector regression.

Figure 2 .
Figure 2. The definition of ε, ξ + k and ξ − k for support vector regression.

Figure 4 .
Figure 4. Using double σ to enhance the quality of curve fitting, (a) SVR algorithm with single σ; (b) SVR algorithm with double σ.

Figure 4 .
Figure 4. Using double σ to enhance the quality of curve fitting, (a) SVR algorithm with single σ; (b) SVR algorithm with double σ.

Figure 4 .
Figure 4. Using double σ to enhance the quality of curve fitting, (a) SVR algorithm with single σ; (b) SVR algorithm with double σ.
NYi is the number of points that satisfy yk = Yi in the output data sequence of Y = {y1, y2, y3… yn}, δY and δX are the resolution of sampling for the output and input data, respectively.Here δY = 1 mV and δX = 0.1 s, representing a sampled data sequence in a real case.Moreover, for Figure5a, the y' at yk = Yi should be 10 −3 , taking that NYi = 10.Therefore in an ideal case, the SVR method should return a result of ( ) close to 10 −3 for all the xk that have yk = Yi.Figure5bdemonstrates the ideal curve fitting case for the SVR method.Both the ( ) in Figure5b) looks smooth when they cross those points with yk = Yi, and the fitted differentiation ( ) the slope tanθ that we have calculated from Figure5a.

Figure 5 .
Figure 5. Establishment of criterion for evaluating the quality of curve fitting by SVR.(a) The approximate linear differentiation that the SVR method dedicates to capture.(b) A case of good fit for the SVR method.(c) A case of bad fit, caused by overfitting the shape of stairs for the discretized data.(d) A case of bad fit, caused by overfitting the shape of noise.

Figure
Figure5cillustrates an overfitting case caused by the stair-like sampled voltage.Peaks can be observed in the dx/dy curves for each level of voltage stairs, if the SVR method tracks them closely.Figure5dillustrates an overfitting case caused by noises in the sampled data.Extra peaks that superimposed on the peaks in Figure5ccommence in the dx/dy curves, if the SVR method tries to follow the noise.If the SVR method keeps tracking the shape of the sampled voltage with Cost Function #1 (Equation (4)), the over-fitting cases in Figure5c,d will be inevitable.Therefore we try to propose a new cost function to regulate the curve-fitting of the SVR method.The selection of δY is important for the shape of the IC curve calculated by the linear approximation.Define δR as the real resolution of the discrete sampling, δY should be the multiples

Figure 5 .
Figure 5. Establishment of criterion for evaluating the quality of curve fitting by SVR.(a) The approximate linear differentiation that the SVR method dedicates to capture.(b) A case of good fit for the SVR method.(c) A case of bad fit, caused by overfitting the shape of stairs for the discretized data.(d) A case of bad fit, caused by overfitting the shape of noise.

Figure
Figure5cillustrates an overfitting case caused by the stair-like sampled voltage.Peaks can be observed in the dx/dy curves for each level of voltage stairs, if the SVR method tracks them closely.Figure5dillustrates an overfitting case caused by noises in the sampled data.Extra peaks that superimposed on the peaks in Figure5ccommence in the dx/dy curves, if the SVR method tries to follow the noise.If the SVR method keeps tracking the shape of the sampled voltage with Cost Function #1 (Equation (4)), the over-fitting cases in Figure5c,d will be inevitable.Therefore we try to propose a new cost function to regulate the curve-fitting of the SVR method.The selection of δY is important for the shape of the IC curve calculated by the linear approximation.Define δR as the real resolution of the discrete sampling, δY should be the multiples of δR (δY = K•δR, K is an integer) in order to avoid fluctuations caused by unfair counting.Unfair counting means that if δY = K•δR, sometimes δY contains K•δR, while sometimes δY contains (K + 1)•δR, leading to inevitable fluctuations in the processed results for linear approximation.As the sampling accuracies of voltage in this paper are all 1 mV, δY should be the multiples of 1 mV for calculating the linear approximation.Larger δY can reduce the fluctuations in the differentiations results by the linear approximation, as shown in Figure6.But too large δY will lead to loss of useful information of the original data.The tradeoff should be considered when selecting a proper δY.Therefore in this paper, δY = 2 mV for the cell with LFP cathode, whereas δY = 5 mV for the cell with NCM cathode.

Figure 6 .
Figure 6.The selection of proper sampling resolution for the approximate linear differentiation in Figure 5a.(a) Cell A with LFP cathode.(b) Cell B with NCM cathode.

Figure 6 .
Figure 6.The selection of proper sampling resolution for the approximate linear differentiation in Figure 5a.(a) Cell A with LFP cathode.(b) Cell B with NCM cathode.

Figure 7 .
Figure 7.The relationship between the data length and the performances of the SVR algorithm.(a) The fitting of the voltage for Cell A with LFP cathode.(b) The fitting of the voltage for Cell B with NCM cathode.(c) The fitting results of the IC curves for Cell A with LFP cathode.(d) The fitting results of the IC curves for Cell B with NCM cathode.(e) Computational time versus the data length for the SVR algorithm.

Figure 7 .
Figure 7.The relationship between the data length and the performances of the SVR algorithm.(a) The fitting of the voltage for Cell A with LFP cathode.(b) The fitting of the voltage for Cell B with NCM cathode.(c) The fitting results of the IC curves for Cell A with LFP cathode.(d) The fitting results of the IC curves for Cell B with NCM cathode.(e) Computational time versus the data length for the SVR algorithm.
)) controls the shape of individual support vectors, and the fitted curve ŷ is a combination of several kernels, therefore σ influences the accuracy of curve fitting by the SVR algorithm.Figure8a,b illustrate the curve fitting results of the voltage for Cell A with LFP cathode and Cell B with NCM cathode, respectively.The problem of under-fitting as shown in Figure7a,b have been solved by selecting smaller σs.For Cell A with LFP cathode, σ = 0.06 is much better than σ = 0.20, whereas for Cell B with NCM cathode, σ = 0.08 is much better than σ = 0.20.The reason that why the SVR algorithm with smaller σ performs better than that with large σ is that the basic curvatures formed by support vectors with smaller σ can be finer than those with larger σ.The number of support vectors (Nsv) will increase for the SVR algorithm with smaller σ, because the algorithm is trying to capture the finer details of the voltage curves.

Figure 8 .
Figure 8.The influence of the σ in the Gaussian kernel on the accuracy of curve fitting.(a) The fitting results of the voltage curve for Cell A with LFP cathode, σ = 0.06 and 0.20.(b) The fitting results of the voltage curve for Cell B with NCM cathode, σ = 0.08 and 0.20.(c) The fitting results of the IC curve for Cell A with LFP cathode, from small σ = 0.02 to large σ = 0.40.(d) The fitting results of the IC curve for Cell B with NCM cathode, from small σ = 0.02 to large σ = 0.40.

Figure 8 .
Figure 8.The influence of the σ in the Gaussian kernel on the accuracy of curve fitting.(a) The fitting results of the voltage curve for Cell A with LFP cathode, σ = 0.06 and 0.20.(b) The fitting results of the voltage curve for Cell B with NCM cathode, σ = 0.08 and 0.20.(c) The fitting results of the IC curve for Cell A with LFP cathode, from small σ = 0.02 to large σ = 0.40.(d) The fitting results of the IC curve for Cell B with NCM cathode, from small σ = 0.02 to large σ = 0.40.

Figure 9 .
Figure 9.Using double σ to improve the accuracy of the SVR algorithm.(a) The curve fitting of the voltage for Cell A with LFP cathode.(b) The curve fitting of the voltage for Cell B with NCM cathode.(c) The fitting result of the IC curve for Cell A with LFP cathode.(d) The fitting result of the IC curve for Cell B with NCM cathode.

Figure 10a ,
Figure 10a,b compares the IC curves returned by the four different SVR algorithms for Cell A with LFP cathode and Cell B with NCM cathode, respectively.The reference IC curves (the black

Figure 9 .
Figure 9.Using double σ to improve the accuracy of the SVR algorithm.(a) The curve fitting of the voltage for Cell A with LFP cathode.(b) The curve fitting of the voltage for Cell B with NCM cathode.(c) The fitting result of the IC curve for Cell A with LFP cathode.(d) The fitting result of the IC curve for Cell B with NCM cathode.

Figure
Figure 10a,b compares the IC curves returned by the four different SVR algorithms for Cell A with LFP cathode and Cell B with NCM cathode, respectively.The reference IC curves (the black dotted line marked as Ref IC in the Figure10a,b) come from y' approximated by Equation(12).Obvious deviations can be seen for the SVR algorithm with single σ and Cost Function #1 for both cases, called as "Fair" in Table2.Improvements can be seen for the SVR algorithm with double σ and Cost Function #1, as we expected from the discussion in Section 4.3, called as "Good" in Table2.Further improvements can be seen for the SVR algorithm with Cost Function #2, regardless of the number of σ that is used in the SVR algorithm, called as "Better" in Table2.The further improvements in the IC curve using Cost Function #2 come from that the SVR algorithm directly regulates the estimation error by the Ref IC, which we believe to be the best linear approximation by Equation(12).However, the price paid for using Cost Function #2 is that we cannot rebuild the voltage curve y = f (x), because the offset constant

Figure 10 .Figure 10 .
Figure 10.Comparisons of the fitting results of IC curves for the SVR algorithms with different settings.Comparison of the fitted IC curves for (a) Cell A with LFP cathode; (b) Cell B with NCM Figure 10.Comparisons of the fitting results of IC curves for the SVR algorithms with different settings.Comparison of the fitted IC curves for (a) Cell A with LFP cathode; (b) Cell B with NCM cathode.Comparison of the locations and heights of the peaks in the IC curves for (c) Cell A with LFP cathode; (d) Cell B with NCM cathode.

Figure 11
Figure11shows that the IC curves calculated by the two SVR algorithms are both close to the Ref IC calculated by the linear approximation in Equation(12).The RMSE for the SVR algorithm with "single σ + Cost Function #2" is usually slightly smaller than that with "double σ + Cost Function #1", indicating that Cost Function #2 is dominating the fitting accuracy of the IC curves.Using fixed σ in the SVR algorithm to fit the data throughout the full life cycle is acceptable, because the RMSE changes a little at different SOHs, for all the cells in Figure11.Moreover, {σ1 = 0.06, σ2 = 0.20} is good for cells with LFP cathode, because the fitting results for Cell A and Cell C are quite similar.Similarly, {σ1 = 0.08, σ2 = 0.20} is good for the cells with NCM cathode, because the fitting results for Cell B and Cell D are quite similar.Other combinations of {σ1, σ2} are also possible, readers who have interest could check according to the discussions made in Sections 4.2 and 4.3.However, problem emerges when we try to found quantitative relationship between the SOH and the IC peaks, especially for the SVR algorithm with Cost Function #1.Quantified features of IC peaks as shown in Figure12, including the peak location, the peak height, and the integrated area surrounding the peak have been used for establishing relationships between the IC peaks and the SOHs of lithium-ion cells in order to build online SOH estimation algorithms.In Reference[35], quantified relationship has been built between the peak height and the battery SOH.Table3shows the accuracy of the peak locations and peak heights using the SVR algorithm with different settings.The calculation of the relative error is conducted from the data in Figure11, with the reference from the linear approximation by Equation(12).The characteristic peak is picked as long as the peak height increases/decreases monotonically with the SOH decreasing.Although the accuracy of the peak location can be precise with error less than 0.5%, the error in the peak height of the SVR algorithm with "double σ + Cost Function #1" can be as large as 15%-16%, whereas that with "single σ + Cost Function #2" can be up to 8%.The accuracy of the SVR algorithm with "double σ + Cost Function #1" is not acceptable for further quantified ICA analysis, if the peak height is regarded as an indicator.

Figure 12 .
Figure 12.Quantifying the features of IC peak for further online SOH estimation.

Figure 12 .
Figure 12.Quantifying the features of IC peak for further online SOH estimation.

Table 2 .
Comparison of the SVR algorithms with different parametric settings.

Table 2 .
Comparison of the SVR algorithms with different parametric settings.
Energies 2018, 11, x FOR PEER 16 of 21 cathode.Comparison of the locations and heights of the peaks in the IC curves for (c) Cell A with LFP cathode; (d) Cell B with NCM cathode.

Table 3 .
The accuracy of the recommended SVR algorithms in capturing the feature of the characteristic peaks (peak location and height) in the IC curves.

Table 3 .
The accuracy of the recommended SVR algorithms in capturing the feature of the characteristic peaks (peak location and height) in the IC curves.

Table 4 .
The accuracy of the recommended SVR algorithms in capturing the features of the characteristic peaks (height integration near the peak) in the IC curves.