1. Introduction
Accurate, efficient, and reliable estimation of the State-of-Health (SOH) of lithium-ion batteries is critical for enhancing the range and service life of electric vehicles. Economically, SOH estimation extends battery life, reduces maintenance costs, and facilitates cascade utilization. With the rapid development of new energy vehicles, SOH estimation has emerged as a key supporting technology that promotes the sustainable advancement of the industry and contributes to the goal of carbon neutrality, offering both commercial value and social benefits [
1,
2,
3,
4,
5,
6]. There are various definitions for the SOH of batteries. This paper uses the capacity ratio to define the SOH of lithium-ion batteries. This definition is both intuitive and reliable, as shown in Equation (1):
where
indicates the current maximum discharge capacity of a lithium battery, and
indicates the rated capacity of the lithium battery.
In recent years, numerous domestic and foreign scholars have proposed various estimation methods for assessing the health status of lithium batteries [
7,
8]. These methods can be categorized into three main types: direct measurement, model-based, and data-driven approaches. The primary direct measurement techniques include Coulomb counting [
8], open-circuit voltage (OCV) [
9], and electrochemical impedance spectroscopy [
10,
11,
12]. Among these, the Coulomb counting method estimates the capacity decay rate by integrating the current. While this method is straightforward, it is susceptible to inaccuracies due to sensor precision, integral error, and self-discharge effects, and thus, its long-term accuracy is limited. The open-circuit voltage method evaluates the SOH based on the OCV-SOC [
13] curve deviation after the battery has rested. However, this method requires a long static equilibrium time and is susceptible to temperature variations, SOC errors, and polarization interference, resulting in low dynamic applicability. Electrochemical impedance spectroscopy (EIS) analyzes the aging mechanism by extracting impedance parameters, such as ohmic and charge-transfer impedance, through AC excitation. This method is highly dependent on equipment accuracy, and the algorithm is complex, posing significant challenges due to high cost and high-frequency noise interference, making its real-time application difficult.
Model-based methods [
14,
15] can be roughly categorized into three types: empirical models [
14], equivalent circuit models [
16,
17], and electrochemical models [
18]. The empirical model relies on experimental data to fit the aging law, such as capacity decay and internal resistance growth, and establishes a statistical relationship between SOH, cycle times, temperature, and other parameters. While this model is computationally efficient and easy to deploy, it is limited by the data range, cannot analyze the aging mechanism, and exhibits significant errors when the data boundary is exceeded. The equivalent circuit model uses a resistance–capacitance network (such as the Rint model) to mathematically characterize the dynamic characteristics of the battery. It enables the quantitative estimation of the battery’s SOH through real-time online identification of characteristic parameters, such as ohmic internal resistance. This approach is both real-time and physically explanatory; however, identifying parameters in higher-order models can be challenging, and the simulation accuracy for high-frequency responses is often limited. The electrochemical model is based on the porous electrode theory, quantifying microscopic processes such as lithium-ion diffusion and aging mechanisms (such as lithium deposition), and constructing partial differential equations (such as the P2D model). The mechanism is clear, but the calculation is complex. It is only applicable to laboratory research and difficult to apply in real-time vehicles.
The data-driven method [
19,
20,
21,
22,
23,
24,
25] relies on historical operational data from the battery to extract the corresponding features and directly builds an SOH estimation model based on machine learning algorithms, thereby avoiding the complexities of the physical modeling [
26] process. Traditionally, extracted features include current, voltage, and temperature; however, these metrics lack mechanistic characteristics. The common incremental capacity analysis (ICA) method is based on the voltage–capacity relationship. By plotting the incremental capacity (IC) curve, key change points in the SOH degradation process can be identified. This method indirectly characterizes the equilibrium of internal chemical reactions and offers good physical interpretability. However, ICA only analyzes the relationship between capacity and voltage, ignoring changes in battery energy, which is a more comprehensive indicator of battery performance. Data-driven methods mainly include Support Vector Machine (SVM) [
27], Random Forest (RF) [
28,
29], etc. Although SVM shows good classification and regression performance on medium and small-scale datasets, due to its high dependence on input features and kernel function selection, it is difficult to accurately depict the timing law of battery SOH changes using the number of cycles. At the same time, the computational cost is relatively high when dealing with high-dimensional and diverse battery data, which restricts the wide application of this method in engineering practice. Although RF has strong nonlinear modeling ability and the advantage of resisting overfitting, its nature as a static integrated model makes it difficult to effectively learn the temporal correlation characteristics in the degradation process of battery SOH. Furthermore, when dealing with data from different battery models or working conditions, this model has insufficient generalization performance, and the prediction results are prone to discontinuous fluctuations, thereby reducing the stability of state estimation. These traditional machine learning methods often show obvious limitations in dealing with complex nonlinear and time-varying characteristics. In recent years, deep learning techniques [
26,
30,
31,
32,
33], such as Long Short-Term Memory (LSTM) [
34,
35], Gated Recurrent Unit (GRU) [
36,
37], and Transformer [
38,
39], have emerged. Although LSTM can effectively model the dependencies of a long-term series, when dealing with data such as battery SOH, that degrade slowly and evolve over a long period of time, it still faces the problem of memory capacity attenuation caused by vanishing gradients. Meanwhile, this model is highly sensitive to the amount of training data and hyperparameter configuration, leaving it insufficient to capture the early degradation characteristics of batteries. As a structurally simplified gated recurrent network, GRU improves the training efficiency by streamlining the gated structure, but its representational ability is relatively limited. In the task of battery SOH prediction, this model has insufficient modeling accuracy for the long-term trend of subtle changes. When encountering degradation characteristics that are not obvious or large data fluctuations, the prediction results are prone to instability. With its global attention mechanism, the Transformer can effectively capture the dependencies of long sequences. However, in the task of battery SOH estimation, which is limited by the amount of available data, its attention mechanism is prone to unstable distribution. Furthermore, this model has a large number of parameters and a high training cost, and it is difficult to achieve effective deployment under the conditions of limited computing resources or small samples. These methods are capable of automatic feature extraction, enhanced nonlinear modeling, and time series processing. They perform well in SOH estimation but require large amounts of model data and can be computationally expensive. In contrast, hybrid modelling approaches [
32,
35,
38] can leverage the advantages of different networks. For example, the CNN-LSTM [
40,
41,
42] hybrid architecture has become one of the current mainstream research methods because it combines the capabilities of local feature extraction and time series modeling. Among them, the CNN layer can automatically identify the local key features in the input data, while the LSTM layer can effectively capture the temporal sequence law of SOH changing with the number of cycles. However, this model still has several deficiencies in practical applications. Using the linear combination mode of its fully connected layers to accurately describe the complex nonlinear degradation characteristics of the battery is difficult, and the model lacks an explanatory mechanism for the contribution degree of features. When the individual differences of the battery are significant or the working conditions change drastically, the stability and generalization performance of the prediction results decline. Furthermore, although the deep network structure helps improve the model performance, it also creates problems, such as increased training time and difficulties in parameter tuning.
To overcome the above problems, this paper proposes the CNN-KAN-BiLSTM hybrid deep learning model. Among them, KAN is a deep learning method based on the Kolmogorov–Arnold approximation theorem. By utilizing B-Spline basis functions for feature transformation, KAN can effectively approximate complex nonlinear relationship. By integrating the advantages of convolutional feature extraction, nonlinear transformation enhancement and bidirectional time series modeling, the accuracy and robustness of SOH estimation are improved. This model first uses CNN to extract the local patterns of the input features, then adopts the Kolmogorov–Arnold Network (KAN) to replace the traditional fully connected layer to enhance the nonlinear mapping ability. Finally, the temporal evolution law of SOH is learned from both the positive and negative directions through the BiLSTM module to enhance the model’s ability to capture the degradation trend of batteries. Compared with the traditional hybrid structure, CNN-KAN-BiLSTM has achieved key improvements in multiple aspects. First, in terms of estimation accuracy, the KAN layer has adaptive activation ability, which can fit more complex nonlinear relationships and effectively improve the estimation accuracy. Second, in terms of computational complexity, the KAN module has a compact structure and is highly efficient in training without significantly increasing the computational burden of the overall model. Third, in terms of interpretability, the construction form of KAN makes the mapping process of intermediate features more physically readable, which is helpful for analyzing the specific influence of different features on SOH. Fourth, in terms of robustness, BiLSTM can capture temporal features from both forward and reverse directions, enhancing the model’s ability to adapt to inconsistent battery data and noise disturbances. Therefore, CNN-KAN-BiLSTM not only structurally optimizes the deficiencies of the existing models in terms of function expression, generalization ability, and interpretability, but also provides a more forward-looking solution for complex SOH modeling problems.
In this paper, a novel SOH estimation framework for hybrid models is proposed. First, incremental energy analysis (IEA) is proposed. Based on the measured charging data of lithium batteries, the incremental energy (
IE) curve is plotted according to the voltage-energy relationship. The SOH of lithium batteries is characterized using the features extracted from the
IE curve. Compared to ICA, IEA accounts for energy loss and provides a more comprehensive description of the battery aging process. Subsequently, a CNN-KAN-BiLSTM deep learning model is proposed. CNN provides basic local structural information, KAN performs nonlinear reconstruction of features and enhances the response of key variables, and BiLSTM [
38] further integrates time-dependent information. This approach enables accurate modeling and stable estimation of SOH variation trends. The main contributions of this paper are summarized as follows:
- (1)
Based on the data of the lithium battery charging stage, this paper constructs a multi-feature sequence integrating IE features and temperature features, and introduces voltage mean variance (VVM) and temperature mean standard deviation (SDTM) to characterize the internal inconsistencies of the battery pack. This sequence retains the multi-dimensional feature structure of each period and constitutes a time series in a cyclic order, which more effectively reflects the evolution trend of SOH over time and significantly improves the prediction accuracy and robustness of the model.
- (2)
This paper proposes the CNN-KAN-BiLSTM model for accurately estimating the SOH of batteries, integrating local feature extraction, nonlinear modeling and temporal trend perception. Experiments were conducted under different working conditions covering battery pack and single cells with multiple charging rates. The characteristics of incremental energy and incremental capacity were compared respectively, as well as models such as CNN-KAN, CNN-LSTM, KAN, and BiLSTM. The results show that the proposed model achieves the optimal performance on all datasets, with MAE < 0.4%, RMSE < 0.5%, and R2 > 97%, fully verifying its accuracy, stability and generalization ability.
- (3)
In this paper, experimental verification was carried out using public datasets. This paper also experimentally verified the SOH estimation method proposed in this study by using the NASA battery dataset and the CALCE battery dataset. The experimental results show that in the face of public battery datasets from different sources and with various working conditions, this method demonstrates strong robustness and generalization ability in terms of SOH estimation accuracy, curve smoothness, and the ability to fit the degradation trend, further verifying the adaptability and practical value of the proposed model in multiple scenarios.
4. Experimental Procedure, Results, and Analysis
4.1. Experimental Data
The experimental data were obtained from aging tests of a lithium battery pack measured in the laboratory as well as from three battery cells tested under different charging rates (0.1 C, 0.2 C, and 0.5 C). The materials used in the experiment are Lishen 18650 lithium batteries, each with a nominal capacity of 2.5 Ah and a voltage of 3.6 V, respectively. The lithium battery pack consists of six series-connected lithium-ion battery cells with identical specifications. Aging tests of the battery pack and individual cells were conducted at room temperature using a Neware high-performance battery test system built in the laboratory, and the aging data of the lithium battery pack and each cell were measured. The testing system comprises a battery cell testing device, a battery pack testing device, and a host computer, all characterized by high responsiveness and precision.
The aging test of the battery pack begins by charging at a constant current of 1.2 A until the end voltage of the battery pack reaches the cut-off voltage of 24.9 V. Next, the lithium battery pack is charged at a constant voltage of 24.9 V until the current drops to 68 mA, at which point the charging is stopped. Subsequently, a constant discharge is carried out at 2.4 A until the battery terminal voltage drops to 19.3 V. The experiment is terminated when the maximum discharge capacity of the battery pack reaches the failure threshold, which is approximately 65% of the rated capacity. The battery cell (0.5 C) is charged at a constant current of 1.25 A until the battery terminal voltage reaches the cut-off voltage of 4.2 V. It is then charged at a constant voltage of 4.2 V until the current drops to 48 mA, at which point the charging is stopped. Finally, a constant discharge is carried out at 1.25 A until the battery terminal voltage drops to 3 V. The experiment is terminated when the maximum discharge capacity of the single battery reaches the failure threshold, which is about 80% of the rated capacity.
4.2. Experimental Procedure
This experiment primarily involved SOH estimation for a lithium battery pack and battery cells with varying charging rates. The specific experimental steps are shown in
Figure 6 and described as follows:
- Step 1.
The raw data, such as current, voltage, energy, and temperature of the battery pack and individual were measured and calculated using the experimental equipment during the charging stage.
- Step 2.
Based on the recorded energy, voltage and temperature data, the incremental energy (IE) curves of the battery pack and each cell were plotted respectively, and the relevant features, such as peak value, average value, and standard deviation, were extracted. Meanwhile, statistical features, such as average temperature and temperature variance, were extracted from the temperature data to jointly construct a multi-feature sequence integrating energy-temperature information. Subsequently, the Pearson correlation coefficient was used to evaluate the correlations between the above-mentioned various characteristics and the SOH of battery pack and cells to verify its effectiveness in SOH modeling.
- Step 3.
The processed datasets were divided into training and test sets. The extracted features were input into the CNN-KAN-BiLSTM deep learning model for training. The results of the battery SOH estimation were estimated, and evaluation metrics, MAE, RMSE, and R2 were calculated for subsequent performance analysis.
4.3. Experimental Results and Analysis
Figure 7 shows the SOH of a battery pack and three individual battery cells at different charging rates, plotted as a function of the number of cycles. The figure demonstrates that the SOH values for both the battery pack and each cell generally exhibit a downward trend with increasing cycles, however, this decline is not smooth. SOH fluctuations may occur due to temperature effects. Following each charge–discharge cycle, the battery is typically rested for some time, during which electrode polarization may partially recover, leading to brief capacity increases.
It is worth noting that in the
Figure 7b,d, the SOH curves of the two batteries in the early cycling stage are slightly above 100%. This is mainly attributed to the common slight capacity increase phenomenon in the initial cycle of lithium-ion batteries, that is, the released effective capacity briefly exceeds the nominal capacity. According to the SOH calculation method adopted in this paper, that is, the ratio of the maximum measured discharge capacity to the nominal capacity, a calculation result with SOH slightly exceeding 100% will occur. This phenomenon does not reflect abnormal battery performance, but is caused by the structural activation process of the graphite anode material during the initial cycle, including microscopic changes such as the expansion of interlayer spacing and the shedding of particles into nanosheets, which cause it to release more effective lithium intercalation sites and enhance reversible capacity. This capacity “rebound” process is more significant under conditions such as high temperature or depth of discharge (DOD) [
43]. If the battery manufacturing process is uniform and the initial activation of the negative electrode material is sufficient (such as moderate compaction and good wetting of the electrolyte), its initial state may be close to the theoretical capacity upper limit. In the subsequent cycle, due to the limited space for structural activation, the phenomenon of SOH exceeding 100% may not be significantly manifested, as shown in the battery in
Figure 7c. It should be pointed out that a SOH slightly above 100% is an acceptable technical phenomenon in practical applications and will not have a substantial impact on the judgment of the SOH change trend or the validity of model evaluation.
To rigorously evaluate the accuracy of SOH estimation, the performance of the regression model was assessed using common metrics: MAE, RMSE, and R
2. Their definitions are provided in Equations (24), (25), and (26), respectively:
where
is the number of charge-discharge cycles,
indicates the true SOH,
is the average true SOH, and
is the estimated SOH. Smaller MAE and RMSE values indicate higher model accuracy, while R
2 closer to 1 indicates better model fit. The experimental results are shown in
Figure 8:
The figure shows the SOH estimation results of the proposed model for lithium batteries across four different working conditions. It includes data from a battery pack (
Figure 7a) and three groups of battery cells charged at varying rates (0.1 C, 0.2 C, and 0.5 C;
Figure 7b–d). The estimated SOH curves closely align with the actual SOH curves, demonstrating strong fitting accuracy under all working conditions. The estimated trends are highly consistent with the real degradation processes, highlighting the model’s excellent generalization capability. Under the condition where the SOH degradation of the lithium battery pack and the 0.5 C battery cell is relatively stable, the estimated curve closely matches the true value, indicating the model’s strong capability in capturing overall degradation trends. For the 0.1 C and 0.2 C cells which exhibit fluctuating SOH, the model effectively identifies key change points without significant deviation in noisy environments, demonstrating its good stability.
The CNN-KAN-BiLSTM deep learning model effectively integrates the advantages of each layer. The CNN layer extracts local change patterns from input features, enhancing the ability to perceive and capture micro-features related to degradation behavior. The KAN layer performs nonlinear modeling on the features extracted by CNN, mapping them into a space that better describes the degradation process. Compared to traditional models, KAN exhibits a stronger nonlinear representation capability while maintaining interpretability. The BiLSTM layer accepts the KAN output, generates state representations at each time step, and ultimately estimates SOH. This architecture leverages the strengths of each layer, forming a closed-loop system that achieves effective SOH estimation. As shown in
Table 3, MAE across all working conditions is below 0.4%, R
2 exceeds 97%, and the R
2 for the lithium battery pack reaches as high as 99.46%. These results indicate that this model not only has excellent performance and efficiency but also demonstrates strong adaptability to the actual complex and variable real-world data.
To further verify the feasibility and effectiveness of the proposed SOH estimation method for lithium batteries, this paper presents a feature comparison experiment alongside four deep learning model comparison experiments. Specifically, the feature comparison experiment involved extracting features from the incremental capacity (IC) curve, including peak value, average value, standard deviation, and area, while incorporating battery temperature characteristics to form a comprehensive multi-feature sequence. These features were then compared with those proposed in this paper, and the respective sequences were input into the deep learning model for training to assess the estimation performance of both methods on battery SOH. The model comparison experiment tested CNN-KAN, CNN-LSTM, KAN, and BiLSTM models using identical input features to compare their SOH estimation effectiveness. The results of the feature comparison experiment are presented in
Figure 9.
The evaluation parameters for the SOH estimation in the feature comparison experiment are shown in
Table 4.
The comparison of SOH estimation curves in
Figure 8 clearly illustrates the impact of the two feature types on model performance. Assessment results based on IEA features are generally smoother, demonstrating better fitting accuracy and improved detection of key change points during abrupt SOH changes and fluctuation phases. This indicates that IEA features effectively enhance model adaptability and stability under nonlinear and complex conditions.
By comparing SOH estimation results based on incremental capacity (ICA) and incremental energy (IEA) features across various working conditions, the superiority of IEA becomes evident. For the battery pack, the MAE with IEA (0.3910%) is significantly lower than with ICA (0.7568%). RMSE decreases from 0.9800% (ICA) to 0.4798% (IEA), while R2 increases from 97.78% to 99.47%. Similar trends are observed for 0.1 C, 0.2 C, and 0.5 C cells, highlighting IEA’s effectiveness in capturing battery aging behavior and energy degradation trends.
The model comparison results are displayed in
Figure 10.
By observing the SOH estimation effects of the aforementioned models under various working conditions, it is evident that the models demonstrate good estimation performance. However, the proposed CNN-KAN-BiLSTM model achieves highly consistent fitting with the real SOH curve across all conditions. When the degradation curve exhibits abrupt changes and jitter, the model maintains stable outputs. Although the CNN-KAN model effectively extracts local features and performs nonlinear mapping, its lack of temporal modeling results in slight deficiencies in capturing long-term SOH degradation trends. The CNN-LSTM model captures temporal dependencies but lacks KAN’s nonlinear feature optimization, leading to lower accuracy in complex working conditions. When used alone, the BiLSTM model fails to extract local incremental energy features, resulting in poor overall performance. The CNN-KAN model outperforms CNN-LSTM in short-term SOH estimation due to its nonlinear mapping of CNN-extracted features; however, its long-term estimation errors increase without temporal modeling. The CNN-LSTM model is suitable for long-term estimation, but gradient vanishing during drastic SOH changes reduces its accuracy. Conversely, the KAN model relies solely on nonlinear mapping and lacks feature extraction and temporal modeling capabilities, leading to poor performance. The proposed CNN-KAN-BiLSTM model performs exceptionally in estimating the lithium-ion battery SOH. By comparing real and estimated SOH values, the model accurately captures degradation trends with minimal errors, indicating its strong capability to learn SOH variations. The RMSE and MAE of the CNN-KAN-BiLSTM model are significantly lower than those of other models, demonstrating higher accuracy and reduced errors.
Analysis reveals that the proposed model maintains high accuracy across various battery conditions, especially during nonlinear SOH degradation. The CNN-KAN-BiLSTM model utilizes CNN for local feature extraction, KAN for nonlinear transformation, and BiLSTM for handling temporal dependency, effectively learning SOH patterns and improving generalization. Consequently, this method improves both the accuracy and stability of lithium-ion battery SOH estimation, providing a reliable solution for intelligent monitoring.
In addition, this paper also incorporates comparative evaluation experiments of some existing models, and the experimental results are shown in
Figure 11.
6. Conclusions
In this paper, a method for SOH estimation of lithium-ion batteries based on incremental energy characteristics and CNN-KAN-BiLSTM is proposed. Key features are extracted from the incremental energy curve, which incorporates specific inconsistencies of the battery pack, while temperature features are combined to form an effective set of input features. The CNN-KAN-BiLSTM model is trained to achieve high-precision SOH estimation. The experimental results show that this method effectively estimates the SOH of the battery. The feasibility and superiority of this method are confirmed by comparative experiments. Both the RMSE and MAE are below 1%, demonstrating excellent performance.
Although the CNN-KAN-BiLSTM model proposed in this paper shows good prediction accuracy and generalization ability in the battery SOH estimation task, there are still certain limitations. First, the current model has not yet introduced the self-attention mechanism. Therefore, when dealing with high-dimensional features or complex working conditions, it may have insufficient responses to key features, limiting its adaptability to multiple types of batteries and variable working conditions. Second, the experiment only constructed features and conducted modeling based on the data from the charging stage. Although the charging stage has the advantages of strong stability and high data quality, it ignored the characteristic information during the discharging process, which might lead to the model having an insufficient understanding of the battery’s state changes throughout its life cycle. In addition, this paper mainly conducts experimental verification based on laboratory data and partial data from the two public datasets of NASA and CALCE. Although it is somewhat representative, there is still a gap compared with the actual operating environment of energy storage power stations or new energy vehicles. The stability and practicality of the model under real complex working conditions still need to be further tested. Therefore, future research can consider introducing the self-attention mechanism to enhance the recognition ability of key features. At the same time, it integrates the full-cycle data of charging and discharging, and combines the real vehicle and power station operation data in actual application scenarios to construct an SOH estimation model that is closer to the real working conditions.
Future work could introduce a self-attention mechanism to enhance the model’s focus on critical features, enabling adaptation to diverse battery types and working conditions. This improvement would enhance generalization capability and estimation accuracy, ultimately supporting the intelligent management of new energy vehicles and energy storage systems. Additionally, integrating operational data from energy storage power stations with measurement data from new energy vehicles could build a SOH estimation model that is more aligned with real-world applications. Large-scale, multi-cycle data from power stations can reveal long-term battery degradation patterns, while real vehicle data reflect dynamic characteristics under complex conditions. Combining these data sources would enhance model adaptability in practical environments and advance SOH estimation technology toward engineering applications and intelligence.