Lithium-Ion Battery SOH Estimation Method Based on Multi-Feature and CNN-BiLSTM-MHA

: Electric vehicles can reduce the dependence on limited resources such as oil, which is conducive to the development of clean energy. An accurate battery state of health (SOH) is beneficial for the safety of electric vehicles. A multi-feature and Convolutional Neural Network–Bidirectional Long Short-Term Memory–Multi-head Attention (CNN-BiLSTM-MHA)-based lithium-ion battery SOH estimation method is proposed in this paper. First, the voltage, energy, and temperature data of the battery in the constant current charging phase are measured. Then, based on the voltage and energy data, the incremental energy analysis (IEA) is performed to calculate the incremental energy (IE) curve. The IE curve features including IE, peak value, average value, and standard deviation are extracted and combined with the thermal features of the battery to form a complete multi-feature sequence. A CNN-BiLSTM-MHA model is set up to map the features to the battery SOH. Experiments were conducted using batteries with different charging currents, and the results showed that even if the nonlinearity of battery SOH degradation is significant, this method can still achieve a fast and accurate estimation of the battery SOH. The Mean Absolute Error (MAE) is 0.1982%, 0.1873%, 0.1652%, and 0.1968%, and the Root-Mean-Square Error (RMSE) is 0.2921%, 0.2997%, 0.2130%, and 0.2625%, respectively. The average Coefficient of Determination ( R 2 ) is above 96%. Compared to the BiLSTM model, the training time is reduced by an average of about 36%.


Introduction
Electric vehicles have the advantages of good environmental protection, high energy efficiency, and low operating costs.They can reduce the dependence on limited resources such as oil and help reduce carbon emissions, making them widely used in personal transportation, commercial transportation, and special purpose vehicles [1][2][3].As an important energy storage device for electric vehicles, lithium-ion batteries play a crucial role in the lifecycle of electric vehicles.However, with the use of batteries, their performance and state of health (SOH) will gradually deteriorate, limiting the service life of vehicles.Therefore, the SOH has become a key indicator for evaluating the degree of battery aging [4,5].More importantly, the safety of batteries is closely related to their SOH.When batteries age or are damaged, the probability of catastrophic events such as self-ignition or explosion significantly increases [6,7].Therefore, accurately estimating the SOH of batteries is an indispensable measure to ensure the safety of power batteries and extend their lifespan, which is conducive to improving the reliability of battery management systems [8][9][10].
Domestic and international scholars have proposed various methods for estimating the SOH of lithium-ion batteries, which can generally be divided into two categories: model-based methods [11][12][13][14][15][16][17][18] and data-driven methods [19][20][21][22][23][24][25][26].Model-based methods involve establishing appropriate equivalent models of lithium-ion batteries to simulate average value, and standard deviation, are extracted.It is verified through the Pearson correlation coefficient method that the IE curve features have a strong correlation with the battery SOH.Subsequently, the IE curve features are combined with the thermal features of the battery to form a complete multi-feature sequence.Compared with the relatively single features extracted by traditional feature extraction methods, the multi-features extracted in this paper contain a more comprehensive nonlinear aging trend of the battery SOH.Therefore, the deep learning model can learn richer battery SOH aging information, thereby establishing a more accurate battery SOH estimation model.
(2) A highly efficient CNN-BiLSTM-MHA deep learning model is set up.The unique capability of the Convolutional Neural Network (CNN) to extract key trends in sequences is utilized in this paper.It effectively combines the original multi-feature sequence into local abstract features, reducing the computational burden on subsequent models.Following the CNN, Bidirectional Long Short-Term Memory (BiLSTM), which has the ability to capture long-term dependencies in sequences, is used.By calculating the sequence in both the forward and backward directions, the model can keenly capture the aging trend of the battery SOH.Additionally, Multi-head Attention (MHA) is applied to the hidden states of BiLSTM, effectively alleviating the problem of limited information propagation in hidden layers caused by gradient explosion and gradient vanishing in BiLSTM, thereby enhancing the model's generalization capability.Consequently, the CNN-BiLSTM-MHA model can achieve excellent SOH estimation performance, including accuracy, speed, and robustness.
(3) Comprehensive and rigorous SOH estimation validation and model comparison: Following the method proposed in this paper, SOH estimation is conducted using four battery aging datasets with different charging rates.The CNN-BiLSTM-MHA model is systematically compared with the CNN-BiLSTM, BiLSTM, BiGRU, and SVR models.The experimental results indicate that, even with the pronounced nonlinear degradation process of the battery SOH, the proposed method demonstrates superior SOH estimation performance.Across the four datasets, the Mean Absolute Error (MAE) remains within 0.2%, the Root-Mean-Square Error (RMSE) within 0.3%, and the average Coefficient of Determination (R 2 ) exceeds 96%.Compared to the BiLSTM model, the MAE and RMSE are reduced by approximately 36% and 35%, respectively, and the training time is reduced by an average of about 36%.
In summary, the measurement data were utilized for comprehensive feature extraction in this paper, and an SOH estimation model combining accuracy and speed was established, which was effectively demonstrated through experimental verification and comparison.
The rest of this paper is structured as follows: Section 2 provides a detailed introduction to multi-features and their extraction methods.Section 3 introduces the proposed CNN-BiLSTM-MHA model and analyzes its advantages.Section 4 introduces the acquisition of battery aging data and uses four different charging rate datasets to conduct experiments based on the method proposed in this paper.The experimental results are obtained, followed by model comparison to verify the superiority of the SOH estimation model established in this paper.Section 5 concludes this paper.Finally, Section 6 presents the limitations of this study.

Feature Extraction
Constant current-constant voltage (CC-CV) charging is the most widely used charging method for lithium-ion batteries.As shown in Figure 1, it depicts the curves of voltage and current over time during the CC-CV charging process of a lithium-ion battery.In this paper, data from the constant voltage charging phase are selected for a comprehensive and detailed analysis of the IE curve.Combined with the temperature during the constant voltage charging phase, comprehensive health indices are extracted, and the Pearson correlation coefficient method is utilized to analyze the correlation of features extracted from the IE curve.
voltage and current over time during the CC-CV charging process of a lithium-ion battery.In this paper, data from the constant voltage charging phase are selected for a comprehensive and detailed analysis of the IE curve.Combined with the temperature during the constant voltage charging phase, comprehensive health indices are extracted, and the Pearson correlation coefficient method is utilized to analyze the correlation of features extracted from the IE curve.

Feature Extraction Based on IE Curve and Correlation Analysis
Traditional incremental capacity analysis (ICA) is commonly applied to analyze the aging mechanism of batteries during the constant current charging and discharging phases.By collecting capacity and terminal voltage data during the constant current charging and discharging phases, the original capacity-voltage (Q-V) curve is transformed into an incremental capacity (dQ/dV-V) curve through first-order differentiation, allowing for the extraction of more comprehensive features from the transformed curve.However, battery charging and discharging involve changes in both current and voltage.Battery capacity calculation is only related to current and time during charging and discharging, ignoring the voltage variable.Therefore, in this paper, the original energyvoltage (E-V) curve is transformed into an IE (dE/dV-V) curve through first-order differentiation, which is conducive to analyzing and extracting more comprehensive and effective features to reflect the battery aging process.
Taking the laboratory cycling aging dataset of batteries with a 0.5C charging rate as an example, the E-V curves and IE curves at different cycle numbers are shown in Figure 2 and Figure 3, respectively.

Feature Extraction Based on IE Curve and Correlation Analysis
Traditional incremental capacity analysis (ICA) is commonly applied to analyze the aging mechanism of batteries during the constant current charging and discharging phases.By collecting capacity and terminal voltage data during the constant current charging and discharging phases, the original capacity-voltage (Q-V) curve is transformed into an incremental capacity (dQ/dV-V) curve through first-order differentiation, allowing for the extraction of more comprehensive features from the transformed curve.However, battery charging and discharging involve changes in both current and voltage.Battery capacity calculation is only related to current and time during charging and discharging, ignoring the voltage variable.Therefore, in this paper, the original energy-voltage (E-V) curve is transformed into an IE (dE/dV-V) curve through first-order differentiation, which is conducive to analyzing and extracting more comprehensive and effective features to reflect the battery aging process.
Taking the laboratory cycling aging dataset of batteries with a 0.5C charging rate as an example, the E-V curves and IE curves at different cycle numbers are shown in Figures 2 and 3

IE during Constant Current Charging Phase
Analyzing Figure 2 reveals that the vertical span of the E-V curve, i.e., the span of IE, decreases with an increase in the cycle number.This indicates that during the constant current charging phase, the battery's acceptable external energy decreases due to battery aging.By calculating the energy added during each cycle of constant current charging, the results shown in Figure 4 are obtained.From Figure 4, it can be observed that the overall trend of the energy added during the battery's constant current charging phase exhibits a decreasing trend, but there are also local sharp increases and decreases.These fluctuations are somewhat associated with the phenomenon of battery capacity regeneration.Therefore, using IE as a feature can effectively establish a mapping with the battery SOH through deep learning models.
The IE is calculated as follows: where Δ represents IE, and  and  respectively denote the final and initial energy during the constant current charging phase.

IE during Constant Current Charging Phase
Analyzing Figure 2 reveals that the vertical span of the E-V curve, i.e., the span of IE, decreases with an increase in the cycle number.This indicates that during the constant current charging phase, the battery's acceptable external energy decreases due to battery aging.By calculating the energy added during each cycle of constant current charging, the results shown in Figure 4 are obtained.From Figure 4, it can be observed that the overall trend of the energy added during the battery's constant current charging phase exhibits a decreasing trend, but there are also local sharp increases and decreases.These fluctuations are somewhat associated with the phenomenon of battery capacity regeneration.Therefore, using IE as a feature can effectively establish a mapping with the battery SOH through deep learning models.
The IE is calculated as follows: where ∆E represents IE, and E end and E start respectively denote the final and initial energy during the constant current charging phase.

The Peak and Average Values of the IE Curve
Analyzing the IE curve shown in Figure 3 reveals that with an increase in the cycle number, the peak of the IE curve decreases correspondingly.This peak reflects the peak rate of IE.By extracting the peaks of the IE curves for each cycle, the results illustrated in Figure 5 are obtained.As seen in Figure 5, the peaks of the IE curve exhibit a stair-step decreasing trend, indicating that during the battery aging process, the peak rate of IE periodically enters a stable phase after a steep decline.This reflects the maximum intensity of internal chemical reactions within the battery.

The Peak and Average Values of the IE Curve
Analyzing the IE curve shown in Figure 3 reveals that with an increase in the cycle number, the peak of the IE curve decreases correspondingly.This peak reflects the peak rate of IE.By extracting the peaks of the IE curves for each cycle, the results illustrated in Figure 5 are obtained.As seen in Figure 5, the peaks of the IE curve exhibit a stair-step decreasing trend, indicating that during the battery aging process, the peak rate of IE periodically enters a stable phase after a steep decline.This reflects the maximum intensity of internal chemical reactions within the battery.Analyzing the IE curve shown in Figure 3 reveals that with an increase in the cycle number, the peak of the IE curve decreases correspondingly.This peak reflects the peak rate of IE.By extracting the peaks of the IE curves for each cycle, the results illustrated in Figure 5 are obtained.As seen in Figure 5, the peaks of the IE curve exhibit a stair-step decreasing trend, indicating that during the battery aging process, the peak rate of IE periodically enters a stable phase after a steep decline.This reflects the maximum intensity of internal chemical reactions within the battery.To comprehensively analyze the intensity of internal chemical reactions within the battery through the IE curve, the average values of the IE curves for each cycle are calculated.The results are presented in Figure 6.As Figure 6 shows, the average values of the IE curve, representing the average rate of IE, exhibit a decreasing trend overall, with occasional minor sharp increases and decreases.This indicates that as the battery ages, the average intensity of internal chemical reactions gradually weakens.
The average value of the IE curve is calculated as follows: To comprehensively analyze the intensity of internal chemical reactions within the battery through the IE curve, the average values of the IE curves for each cycle are calculated.The results are presented in Figure 6.As Figure 6 shows, the average values of the IE curve, representing the average rate of IE, exhibit a decreasing trend overall, with occasional minor sharp increases and decreases.This indicates that as the battery ages, the average intensity of internal chemical reactions gradually weakens.
The average value of the IE curve is calculated as follows: By extracting the peak value and average value of the IE curve, it is possible to effectively reflect the peak rate and average rate of IE, thus establishing a certain correlation with the internal chemical reactions within the battery.Therefore, selecting the peaks and averages of the IE curve as features can effectively establish a mapping with the battery SOH through deep learning models.

The Standard Deviation of the IE Curve
The standard deviation reflects the degree to which the values in the dataset deviate By extracting the peak value and average value of the IE curve, it is possible to effectively reflect the peak rate and average rate of IE, thus establishing a certain correlation with the internal chemical reactions within the battery.Therefore, selecting the peaks and averages of the IE curve as features can effectively establish a mapping with the battery SOH through deep learning models.

The Standard Deviation of the IE Curve
The standard deviation reflects the degree to which the values in the dataset deviate from the mean value.In the context of the IE curve, it indicates the stability of the IE rate.By calculating the standard deviation of the IE curve for each cycle, the results presented in Figure 7 are obtained.Figure 7 shows that the standard deviation of the IE curve exhibits a decreasing trend, indicating that as the battery ages, the IE rates of each cycle tend towards their average rate.The stability of the IE rate gradually increases, while there are also sharp increases and decreases associated with battery capacity regeneration.Therefore, selecting this feature can effectively establish a mapping with the battery SOH through deep learning models.
The standard deviation of the IE curve is calculated as follows: where Std represents the standard deviation of the IE curve, f (x i ) represents the discrete data points of the IE curve, f denotes the mean value of the IE curve, and N is the number of data points.

Correlation Analysis
To further validate the correlation between features extracted from the IE curve and the battery SOH, the Pearson correlation coefficient method is used for correlation analysis between IE curve features and the SOH in this paper.The Pearson correlation coefficient method is commonly used to measure the degree of the linear relationship between two variables, with values ranging from −1 to 1.A correlation coefficient of 1 indicates a perfect positive correlation between two variables, −1 indicates a perfect negative correlation, and 0 indicates no linear relationship between the variables.Therefore, if the absolute value of the Pearson correlation coefficient between the extracted features and battery SOH approaches 1, it indicates that the extracted features can better reflect the decay of the battery SOH in terms of a linear relationship.
The Pearson correlation coefficient is defined as follows: where  represents the Pearson correlation coefficient,  and  respectively denote the -th values in the two variables,  and  represent the mean values of the two variables, and  denotes the sample size.By calculating the Pearson correlation coefficients between the features extracted from the IE curve and battery SOH, the results are obtained as shown in Table 1.

Correlation Analysis
To further validate the correlation between features extracted from the IE curve and the battery SOH, the Pearson correlation coefficient method is used for correlation analysis between IE curve features and the SOH in this paper.The Pearson correlation coefficient method is commonly used to measure the degree of the linear relationship between two variables, with values ranging from −1 to 1.A correlation coefficient of 1 indicates a perfect positive correlation between two variables, −1 indicates a perfect negative correlation, and 0 indicates no linear relationship between the variables.Therefore, if the absolute value of the Pearson correlation coefficient between the extracted features and battery SOH approaches 1, it indicates that the extracted features can better reflect the decay of the battery SOH in terms of a linear relationship.
The Pearson correlation coefficient is defined as follows: where r represents the Pearson correlation coefficient, X i and Y i respectively denote the i-th values in the two variables, X and Y represent the mean values of the two variables, and n denotes the sample size.By calculating the Pearson correlation coefficients between the features extracted from the IE curve and battery SOH, the results are obtained as shown in Table 1.From Table 1, it can be observed that the Pearson correlation coefficients between the features extracted from the IE curve and battery SOH are all close to 1.This indicates a very strong correlation between the features and battery SOH.Therefore, selecting these features from the IE curve allows for the aging pattern of the battery to be accurately tracked by deep learning models.

Thermal Features
The temperature and SOH of lithium-ion batteries are closely related [27,28], primarily due to the physical and chemical changes that occur within the battery.At elevated temperatures, secondary reactions and degradation phenomena may occur, such as the oxidation of the cathode material and decomposition of the electrolyte.These reactions and degradation phenomena accelerate the aging process of the battery, leading to a decrease in the SOH.Conversely, at low temperatures, the flowability of the electrolyte inside the battery decreases significantly, resulting in slower ion transport within the battery.This can lead to reduced efficiency during the battery charging and discharging processes, thereby rapidly decreasing the battery SOH.However, at an appropriate temperature, the flowability of the electrolyte, chemical reaction rates, and material stability within the lithium-ion battery reach optimal conditions.As a result, the aging rate of the battery slows down, and capacity regeneration phenomena may occur.
Considering the impact of temperature on the SOH of lithium-ion batteries, the average temperature of the lithium-ion battery during the constant current charging phase is extracted as a thermal feature in this paper.This enables deep learning models to learn richer information about battery aging, thereby accurately tracking the aging patterns of the battery.
The average charging temperature is calculated as follows: where T avg represents the average charging temperature, T 1 to T n denote the temperatures of each sampling point, and n represents the number of sampling points.

CNN
A CNN is a deep learning algorithm inspired by the structure and principles of the visual cortex in animals.It effectively learns spatial hierarchical relationships in data by simulating the visual perception mechanism of animals.While CNNs are primarily applied to image recognition and processing tasks, their ability to capture local abstract features also makes them perform well on time series data processing tasks.Therefore, it can be effectively applied in the field of battery SOH estimation [29].
Due to the excellent performance of One-Dimensional Convolutional Neural Networks (1D CNNs) in time series processing tasks [30], a 1D CNN is utilized in this paper to process time series data.The 1D CNN is used to extract key trends in the IE curve features and thermal features, transforming the critical trends in the original features into abstract local important features.This provides valuable information for the analysis and prediction of subsequent models.
The CNN in this paper includes the convolutional layer and the pooling layer.The basic mathematical formulas and descriptions of these operations are as follows: (1) Convolutional layer: where y represents the output of the convolutional layer, x represents the one-dimensional input signal, f represents the convolution kernel (filter), s represents the index within the convolution kernel, and t represents the index of time or space.
For each filter in the convolutional layer, such an operation is performed, generating the corresponding feature map.
(2) Activation Function: The ReLU function is chosen as the activation function.The ReLU activation function is defined as the following: where h(t) represents the output after activation, max denotes the maximum operation, and y(t) is the input.This operation means that if the convolutional output y(t) is greater than 0, the output of the activation function is y(t); if it is less than or equal to 0, the output is 0. This nonlinear transformation helps the model learn complex data representations.
(3) Max Pooling Layer: The max pooling operation selects the maximum value within a specified range as the output for that range.The max pooling operation is defined as the following: where m(t) represents the output after the pooling operation, h is the input, p represents the pooling window size, and max operation applies within the range of indices from t to t + p − 1, i.e., selecting the maximum value within each pooling window.
The structure diagram of the 1D CNN in this paper is shown in Figure 8: This operation means that if the convolutional output () is greater than 0, the output of the activation function is (); if it is less than or equal to 0, the output is 0. This nonlinear transformation helps the model learn complex data representations.
(3) Max Pooling Layer: The max pooling operation selects the maximum value within a specified range as the output for that range.The max pooling operation is defined as the following: where () represents the output after the pooling operation, ℎ is the input,  represents the pooling window size, and  operation applies within the range of indices from  to  +  − 1, i.e., selecting the maximum value within each pooling window.
The structure diagram of the 1D CNN in this paper is shown in Figure 8: In the figure, the output of the convolutional layer is determined by multiple original sequence data, depending on the choice of the size of the convolutional kernel and stride.The output of the convolutional layer is activated through the ReLU activation function and finally obtained through the max pooling layer, which constitutes the output of the entire 1D CNN.

y(t−1) y(t) y(t+1) h(t+1) h(t) h(t−1) m(t)
The CNN in this paper first utilizes convolutional operations to extract abstract local important features from the original feature sequences.Subsequently, pooling operations are employed to reduce the dimensionality of the abstract features, resulting in abstract local important features.These abstract local important features can more accurately describe the periodic variations and anomalies in the sequences, thereby alleviating to some extent the overfitting risks caused by the multicollinearity of the original In the figure, the output of the convolutional layer is determined by multiple original sequence data, depending on the choice of the size of the convolutional kernel and stride.The output of the convolutional layer is activated through the ReLU activation function and finally obtained through the max pooling layer, which constitutes the output of the entire 1D CNN.
The CNN in this paper first utilizes convolutional operations to extract abstract local important features from the original feature sequences.Subsequently, pooling operations are employed to reduce the dimensionality of the abstract features, resulting in abstract local important features.These abstract local important features can more accurately describe the periodic variations and anomalies in the sequences, thereby alleviating to some extent the overfitting risks caused by the multicollinearity of the original features.Moreover, since the abstract local important features are fewer in quantity compared to the original feature sequences, they effectively reduce the computational burden of subsequent models, thereby accelerating the overall training speed.

BiLSTM-MHA
BiLSTM is a special type of a recurrent neural network (RNN) that comprehensively captures long-term dependencies in sequence data from both the forward and backward directions.Its main applications include text generation, speech recognition, machine translation, and time series prediction, among others.In this paper, BiLSTM is utilized to process time series data, where the input sequence is the output of the aforementioned CNN.
The computation process of a standard LSTM is as follows: (1) First, utilizing the external state from the previous time step and the current input at the current time step, the forget gate, input gate, output gate, and candidate state cell are computed.The mathematical formulations are defined as the following: where f t , i t , o t , and ∼ C t represent the forget gate, input gate, output gate, and candidate state cell, respectively.W f , W i , W o , and W C denote the weights for the forget gate, input gate, output gate, and candidate state cell, while b f , b i , b o , and b C represent the biases.σ denotes the Sigmoid activation function, which compresses the result between 0 and 1, and tanh represents the hyperbolic tangent activation function, which compresses the result between −1 and 1.
(2) Updating the memory cell is conducted by combining the forget gate and the input gate.The mathematical formulation is defined as the following: where C t represents the updated memory cell, C t−1 denotes the memory cell to be updated, f t and i t respectively represent the forget gate and the input gate, and ∼ C t represents the candidate cell state.
(3) Combining with the output gate, the information from the internal state is passed to the external state.The mathematical formulation is defined as the following: where h t represents the external state, C t denotes the updated memory cell, o t represents the output gate, and tanh is the hyperbolic tangent activation function.
For BiLSTM, at each time step, both the forward and backward external states are computed simultaneously.Then, these two external states are combined to form the final external state.
Although BiLSTM performs well in handling time series tasks, it suffers from the problem of limited information propagation.This is because at each time step, information can only be propagated through hidden states.For sequences with long-term dependencies, the hidden states may be affected by issues like vanishing or exploding gradients, leading to inefficient information propagation.To address the limitation of information propagation in BiLSTM, MHA is applied to the hidden state sequences of BiLSTM in this paper.
MHA is a mechanism in deep learning used to handle sequential data.It captures the relationships between different positions in the input sequence and improves the representation of the sequence by adding appropriate weights to it through the parallel computation and concatenation of multiple attention heads.This enhances the model's generalization ability.
Assuming the input sequence is X = {x 1 , x 2 , . . . ,x n }, where x i is the i-th element in the sequence and has a dimension of d, the computation process of MHA is as follows: (1) Linear transformations are applied to the input sequence to obtain queries (Q), keys (K), and values (V).The mathematical formulations are defined as the following: where W Q , W K , and W V represent the corresponding weight matrices.
(2) For each pair of query q i and key k j , the attention score is calculated.The mathematical formulation is defined as the following: where Attention q i , k j represents the attention score, √ d k is the scaling factor, d k is typically equal to d/h, and h is the number of attention heads.
(3) The attention scores are normalized.The mathematical formulation is defined as the following: where α ij denotes the attention weight, and the Softmax function is applied to normalize Attention q i , k j .(4) The attention weights are used to weight and sum the values V, obtaining the attention representation for each query q i .The mathematical formulation is defined as the following: where Attention(q i , V) represents the attention representation for each query q i , α ij denotes the attention weight, and v j represents the value vector corresponding to the j-th element.
(5) All attention head outputs are concatenated, and another linear transformation is performed.The mathematical formulation is defined as the following: where MHA(X) represents the output of the MHA, Concat denotes the concatenation operation, Attention 1 , . . ., Attention h represent the outputs of each attention head, and W O is the weight matrix.The structure of BiLSTM-MHA in this paper is shown in Figure 9.
In this paper, the input of BiLSTM-MHA is the output of the CNN.The CNN effectively extracts the key variation trends of the original feature sequence as abstract local important features and reduces the amount of sequence data, thereby alleviating the computational burden of BiLSTM-MHA.MHA is applied to the hidden states of BiLSTM, allowing the model to consider the importance of input sequences at different positions at each time step.This helps alleviate the vanishing or exploding gradient problems in BiLSTM, enabling the hidden states of BiLSTM to be updated correctly, thus improving model training stability and prediction accuracy.
(5) All attention head outputs are concatenated, and another linear transformation is performed.The mathematical formulation is defined as the following: where MHA() represents the output of the MHA, Concat denotes the concatenation operation, Attention , … , Attention represent the outputs of each attention head, and  is the weight matrix.The structure of BiLSTM-MHA in this paper is shown in Figure 9.

{
Hidden layer Multi-Head Attention In this paper, the input of BiLSTM-MHA is the output of the CNN.The CNN effectively extracts the key variation trends of the original feature sequence as abstract local important features and reduces the amount of sequence data, thereby alleviating the computational burden of BiLSTM-MHA.MHA is applied to the hidden states of BiLSTM, allowing the model to consider the importance of input sequences at different positions at each time step.This helps alleviate the vanishing or exploding gradient problems in BiLSTM, enabling the hidden states of BiLSTM to be updated correctly, thus improving model training stability and prediction accuracy.

Experimental Data
This paper conducts battery charge-discharge cycle aging experiments on cylindrical lithium-ion batteries with the same specifications in indoor ambient temperature conditions.The rated capacity of the batteries is 2.5 Ah.Firstly, constant current-constant voltage charging is used at rates of 0.1C, 0.2C, 0.3C, and 0.5C until the battery terminal voltage reaches the upper cutoff voltage.Then, the voltage is kept constant for constant voltage charging until the current decreases to the cutoff current.After a 5-min rest period to stabilize the internal chemical state of the battery, a constant current discharge at 0.5C is performed until the battery terminal voltage decreases to the lower cutoff voltage.There are various definitions of the battery SOH, and the capacity ratio is selected in this paper to define the SOH of lithium-ion batteries.This definition combines reliability and intuitiveness and is defined as the following [31]: where Q max capacity represents the current maximum discharge capacity of the lithium battery, and Q rated capacity represents the rated capacity of the lithium battery when it left the factory.Figure 10a-d respectively represent the SOH of four lithium-ion batteries charged at 0.1C, 0.2C, 0.3C, and 0.5C as the number of charge-discharge cycles increases.From the figure, it can be observed that the SOH of the batteries generally decreases with the increase in charge-discharge cycles.However, due to various comprehensive factors affecting the batteries during the charge-discharge cycle process, this process exhibits significant nonlinearity.Additionally, noticeable capacity regeneration phenomena occur at multiple different numbers of charge-discharge cycles.These phenomena pose certain difficulties for estimating the SOH of lithium-ion batteries.
The cycling aging parameters for batteries at four different charge rates are shown in Table 2.
with the increase in charge-discharge cycles.However, due to various comprehensive factors affecting the batteries during the charge-discharge cycle process, this process exhibits significant nonlinearity.Additionally, noticeable capacity regeneration phenomena occur at multiple different numbers of charge-discharge cycles.These phenomena pose certain difficulties for estimating the SOH of lithium-ion batteries.The cycling aging parameters for batteries at four different charge rates are shown in Table 2.

Experimental Results
According to the second part of this paper, the IE curve features and thermal features of lithium-ion batteries at different charge rates are extracted, combined with their corresponding SOH values, to form a complete dataset.In order to maintain the overall distribution balance of the dataset and ensure that the model has good generalization ability, reducing the risk of underfitting or overfitting and obtaining reliable evaluation results, the training set and test set are split in a 1:1 ratio.That is, the first 50% of the cycles are used as the training set, and the last 50% of the cycles are used as the test set.
The training set data are input into the CNN-BiLSTM-MHA model constructed in the third part of this paper.In this model, the default number of filters for the CNN and the size of the convolutional kernel are 64 and 3, respectively.The default number of

Experimental Results
According to the second part of this paper, the IE curve features and thermal features of lithium-ion batteries at different charge rates are extracted, combined with their corresponding SOH values, to form a complete dataset.In order to maintain the overall distribution balance of the dataset and ensure that the model has good generalization ability, reducing the risk of underfitting or overfitting and obtaining reliable evaluation results, the training set and test set are split in a 1:1 ratio.That is, the first 50% of the cycles are used as the training set, and the last 50% of the cycles are used as the test set.
The training set data are input into the CNN-BiLSTM-MHA model constructed in the third part of this paper.In this model, the default number of filters for the CNN and the size of the convolutional kernel are 64 and 3, respectively.The default number of neurons for BiLSTM is 128, which can be adjusted appropriately based on the complexity of the task.The number of attention heads and the dimension of keys are both set to 2. The learning rate is adaptively updated by the Adam optimizer, which helps stabilize the internal parameters of the model and converge faster to the global optimal solution.The training is conducted for 100 epochs.
The experimental flowchart is shown in Figure 11.( )

Correlation analysis
Step3 Step4: SOH estimation Performance analysis ) The SOH estimation results are shown in Figure 12a-d.
To rigorously evaluate the accuracy and speed of SOH estimation, this experiment employs common metrics for assessing the performance of regression models, namely the MAE, RMSE, R 2 , and training time.The MAE, RMSE, and R 2 are defined as the following: In this experiment, n represents the number of charge-discharge cycles, y i denotes the true SOH value, y represents the average true SOH value, and ŷi stands for the estimated SOH value.For the MAE and RMSE, smaller values indicate a higher estimation accuracy of the model.For R 2 , a value closer to 1 indicates a better fitting performance of the model.The SOH estimation performance of the proposed method is shown in Table 3.Based on Figure 13 and Table 3, it can be observed that due to conducting comprehensive IEA, combining the IE curve features and thermal features enables the deep learning model to establish a mapping relationship between features and the battery SOH effectively.The CNN-BiLSTM-MHA model constructed in this paper leverages the excellent ability of CNNs to extract various key trends in the sequence.The abstract features extracted by the CNN make the combination of IE curve features and thermal features more reasonable and alleviate the computational burden of subsequent BiLSTM.Meanwhile, BiLSTM performs well in learning the correlation between sequences, and the utilization of MHA to weight the hidden states of BiLSTM further enhances its stability.The experimental results demonstrate that the proposed method achieves accurate and fast SOH estimation results across four different charge rates in the cycle aging experiment data, particularly for the true values of the SOH at charging rates of 0.1C and 0.2C, as these two charging rates result in longer charging times for each cycle of the battery, and the battery charge-discharge cycle aging experiment was conducted at room temperature.Considering the temperature difference between day and night, the average temperature difference between each cycle at these two charging rates is significant.Therefore, the capacity regeneration and degradation phenomena are more obvious at these two charging rates.Due to the consideration of thermal features, the proposed method can accurately estimate the SOH.To further demonstrate the estimation performance of the proposed method, an evaluation is conducted based on the four aforementioned metrics: the MAE, RMSE, R 2 , and training time.
From the perspective of estimation accuracy, across the four different charge rates, the MAE of the estimation results is within 0.2%, and the RMSE is within 0.3%.For the 0.2C, 0.3C, and 0.5C charge rate data, the R 2 are all above 95%.For the 0.1C charge rate data, due to the pronounced nonlinearity in the variation in its true SOH values, R 2 is slightly lower compared to the estimation results of other charge rates.However, both the MAE and RMSE reach considerable standards, and R 2 still reaches 92.62%, indicating that the proposed method can provide an accurate estimation of the battery SOH.
From the perspective of training time, with a significant amount of data in the battery charge-discharge cycle process, the training time varies due to the different numbers of charge-discharge cycles and the complexity of the tasks.The maximum training time is only 15.36 s.To further demonstrate the estimation performance of the proposed method, an evaluation is conducted based on the four aforementioned metrics: the MAE, RMSE, R 2 , and training time.
From the perspective of estimation accuracy, across the four different charge rates, the MAE of the estimation results is within 0.2%, and the RMSE is within 0.3%.For the 0.2C, 0.3C, and 0.5C charge rate data, the R 2 are all above 95%.For the 0.1C charge rate data, due to the pronounced nonlinearity in the variation in its true SOH values, R 2 is slightly lower compared to the estimation results of other charge rates.However, both the MAE and RMSE reach considerable standards, and R 2 still reaches 92.62%, indicating that the proposed method can provide an accurate estimation of the battery SOH.
From the perspective of training time, with a significant amount of data in the battery charge-discharge cycle process, the training time varies due to the different numbers of charge-discharge cycles and the complexity of the tasks.The maximum training time is only 15.36 s.

Experimental Comparisons
In order to validate the superiority of the CNN-BiLSTM-MHA model proposed in this paper, comparative experiments are conducted with the CNN-BiLSTM, BiLSTM, BiGRU, and SVR models.Using the aforementioned laboratory data with different charge rates, the comparative experimental results are shown in Figure 13a-d.
The MAE, RMSE, and R 2 comparison of the estimation results for different charging rate across various models are shown in Tables 4-7.The comparison of the training time between the proposed model and the CNN-BiLSTM, BiLSTM, and BiGRU models at different charging rates is shown in Table 8.From Tables 4-8, it can be observed that using IE curve features combined with thermal features, the proposed model outperforms the other four models in terms of both the MAE and RMSE (which are lower) and R 2 (which is higher).Additionally, the training time of the proposed CNN-BiLSTM-MHA model is significantly advantageous compared to the BiLSTM model.This indicates that the CNN-BiLSTM-MHA model has better SOH estimation accuracy and speed across different charge rates compared to other models.Furthermore, it is noted that BiGRU, having a simpler gating structure and lower computational complexity compared to BiLSTM, exhibits slightly shorter training times.However, its estimation accuracy is inferior to BiLSTM.Considering that the CNN reduces the computational burden of subsequent models, BiLSTM is selected as part of the combined model in this paper.
Comparing the estimation results of the CNN-BiLSTM model with the BiLSTM model reveals that the CNN effectively extracts key trends in the original IE curve features and thermal features, transforming them into abstract features by effectively combining the original features.This allows the subsequent BiLSTM to easily learn forward and backward dependencies between sequences.Compared to the BiLSTM model, the CNN-BiLSTM model achieves an average reduction of approximately 22% in the MAE and RMSE and an average reduction of approximately 39% in training time.This demonstrates the effectiveness of the CNN in handling original features.
In this paper, MHA is applied to the hidden states of BiLSTM, effectively alleviating the gradient vanishing and exploding problems in BiLSTM, ensuring that the hidden states are updated correctly.Comparing the estimation results of the CNN-BiLSTM-MHA model with the CNN-BiLSTM model, it is observed that while the CNN-BiLSTM-MHA model experiences an average increase of approximately 4% in training time, the MAE and RMSE are reduced by an average of approximately 18% and 17%, respectively.This indicates that combining MHA with BiLSTM can achieve significant improvements in estimation accuracy with minimal computational burden.
In conclusion, the proposed method and model can provide a rapid and accurate estimation of the SOH for data from different charging rate.

Conclusions
A lithium-ion battery SOH estimation method based on multi-feature and CNN-BiLSTM-MHA is proposed in this paper, significantly enhancing the accuracy and speed of SOH estimation.
In this paper, the IE curves are obtained from the constant current charging phase data of the battery.A comprehensive and detailed analysis of the IE curves is conducted, and IE curve features including IE, peak value, average value, and standard deviation are extracted.These IE curve features provide comprehensive and accurate information about the battery SOH.The Pearson correlation coefficient method is also used to validate the high correlation between IE curve features and the battery SOH.In addition to the IE curve features, thermal features of the battery are also incorporated.Therefore, the deep learning model can learn the relationship between the internal chemical reaction changes during the charge-discharge cycle of the battery and SOH, leading to an accurate SOH estimation model.
Based on comprehensive IEA, the extraction of IE curve features combined with thermal features, an outstanding CNN-BiLSTM-MHA model is proposed in this paper.This model combines the advantages of CNNs in extracting important abstract features and BiLSTM in learning long-term sequential dependencies.Therefore, the model achieves both speed and accuracy in battery SOH estimation.Additionally, the use of MHA enhances the stability of BiLSTM, further improving the generalization and robustness of the model.The model undergoes rigorous experimental validation on datasets with four different charging currents.Comparative experiments with the CNN-BiLSTM, BiLSTM, BiGRU, and SVR models further confirm the excellent estimation accuracy and speed of the CNN-BiLSTM-MHA model.Even in the presence of significant nonlinearity in the battery SOH decay process, the MAE is 0.1982%, 0.1873%, 0.1652%, and 0.1968%, and the RMSE is 0.2921%, 0.2997%, 0.2130%, and 0.2625%, respectively.The average R 2 exceeds 96%.Compared to the BiLSTM model, the CNN-BiLSTM-MHA model achieves an average reduction of approximately 36% in the MAE and RMSE and an average reduction of approximately 36% in training time.
In addition, the results of this study provide valuable references for future research work.By revealing the relationship between voltage, energy, and temperature data and the battery SOH, it lays the foundation for developing more accurate and efficient SOH estimation models and stimulates researchers to explore methods for using measurement data for accurate and fast SOH estimation.In the future, combining more types of sensor data and adopting more advanced deep learning algorithms may further improve the accuracy and real-time performance of SOH estimation.Optimizing the data collection and processing process to achieve SOH estimation using fragmented data will be an important development direction.In summary, this paper not only provides an effective solution for the accurate and rapid estimation of battery SOH but also points out the direction for future research and application, with important academic and practical value.We will optimize data processing and strengthen the establishment of SOH estimation models in the future, exploring the use of fragmented data to achieve the accurate and fast estimation of the SOH.

Limitations of Study
Although this study employed a data-driven approach to estimate the battery SOH by measuring voltage, energy, and temperature data, there are still some limitations.Firstly, the source and diversity of the dataset are key factors.Although this study used four datasets with different charging currents, the data used mainly came from specific types and models of batteries, which may result in the unknown performance of the estimated model when faced with other types of batteries, affecting the universality of the results.Secondly, the uncertainty and errors in the data measurement process may also have an impact on the research results.The measurement of voltage, energy, and temperature is influenced by various factors such as sensor accuracy, environmental conditions, and operating specifications, which may lead to noise and bias in the measurement data, thereby affecting the accuracy of SOH estimation.We have taken corresponding measures to avoid the influence of environmental conditions and operating specifications as much as possible.Finally, the time range and experimental conditions of this study are relatively limited, and it has not

Figure 4 .
Figure 4.The curve of IE variation with cycles.

Figure 4 .
Figure 4.The curve of IE variation with cycles.

Figure 4 .
Figure 4.The curve of IE variation with cycles.2.1.2.The Peak and Average Values of the IE Curve

Figure 5 .
Figure 5.The peak value of the IE curve varies with cycles.

Figure 5 .
Figure 5.The peak value of the IE curve varies with cycles.

22 Figure 6 .
Figure 6.The average value of the IE curve varies with cycles.

Figure 6 .
Figure 6.The average value of the IE curve varies with cycles.

Figure 7 .
Figure 7.The standard deviation of the IE curve varies with cycles.

Figure 7 .
Figure 7.The standard deviation of the IE curve varies with cycles.

Figure 13 .
Figure 13.(a) Comparison of SOH estimation results for charging at 0.1C; (b) comparison of SOH estimation results for charging at 0.2C; (c) comparison of SOH estimation results for charging at 0.3C; (d) comparison of SOH estimation results for charging at 0.5C.

Figure 13 .
Figure 13.(a) Comparison of SOH estimation results for charging at 0.1C; (b) comparison of SOH estimation results for charging at 0.2C; (c) comparison of SOH estimation results for charging at 0.3C; (d) comparison of SOH estimation results for charging at 0.5C.

Table 1 .
The Pearson correlation coefficient between the features of each IE curve and the battery SOH.

Table 4 .
Evaluation indicators at 0.1C charging rate.

Table 5 .
Evaluation indicators at 0.2C charging rate.

Table 6 .
Evaluation indicators at 0.3C charging rate.

Table 7 .
Evaluation indicators at 0.5C charging rate.

Table 8 .
Training time comparison.