1. Introduction
Lithium-ion batteries [
1] have become essential energy storage units in electric vehicles and large-scale energy storage systems due to their high energy density [
2], long cycle life, and low self-discharge rates [
3]. However, performance degradation is inevitable during use, manifesting as capacity reduction [
4], increased internal resistance, and other issues that directly affect the battery’s reliability and safety [
5]. Battery state of health (SOH), a key indicator for assessing battery performance degradation, has thus become a primary focus of concern.
In recent years, numerous academic studies have been conducted both domestically and internationally to achieve precise estimation of the SOH of batteries. Research methods can be classified into two categories: model-based methods [
6,
7] and data-driven methods [
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25,
26,
27,
28]. Model-based methods describe the dynamic behavior and internal state of batteries by establishing corresponding mathematical models, which can be further divided into equivalent circuit models [
6] and electrochemical models [
7]. The equivalent circuit model employs a circuit network composed of components such as resistors, capacitors, and voltage sources to transform complex electrochemical behaviors into quantifiable circuit parameters. In contrast, the electrochemical model is based on the internal reaction mechanisms of the battery and precisely describes the kinetics of electrode processes and material transport characteristics using approaches such as partial differential equations. Although these methods have clear physical interpretations, they still face limitations in practical applications, including model complexity, difficulty in parameter identification, and sensitivity to operating conditions.
The data-driven approach first extracts features from battery operation data and then establishes the mapping relationship between the SOH and feature parameters using machine learning methods, without relying on complex electrochemical mechanisms. Existing studies primarily extract voltage [
8,
9], current [
10,
11,
12], and temperature [
13] as features. Additionally, mechanism-based features can be derived through differential voltage analysis (DVA) [
8], incremental capacity analysis (ICA) [
10,
11,
12], and differential thermal voltammetry (DTV) [
13], among others. However, these methods often overlook the battery’s power output characteristics under dynamic working conditions. Regarding model development, traditional machine learning models such as Support Vector Machine (SVM) [
14], Gaussian Process Regression (GPR) [
15], Extreme Learning Machine (ELM) [
16,
17], and Random Forest (RF) [
18] have been employed for SOH estimation. Although these methods offer high computational efficiency, they typically depend on manually extracted features as inputs and struggle to capture the highly nonlinear and long-term time-dependent characteristics inherent in battery degradation. With advances in deep learning, recurrent neural networks and their variants—such as Long Short-Term Memory (LSTM) [
19,
20], Gated Recurrent Unit (GRU) [
21], Convolutional Neural Network–Long Short-Term Memory (CNN-LSTM) [
22,
23,
24], and Transformer [
25,
26] models—have been applied for model training. For example, Gu et al. [
25] proposed an improved CNN–Transformer model that estimates the SOH of lithium-ion batteries using techniques such as feature selection and normalization. However, these models treat all input data equally and cannot distinguish key information, which limits their accuracy in characterizing the degradation process. Furthermore, Temporal Convolutional Networks (TCNs) [
27,
28] effectively capture local temporal features during battery degradation through a causal dilated convolutional structure. For instance, Liu et al. [
27] developed an SOH prediction model integrating multi-channel TCN and GRU with a self-attention mechanism (GRU-SA), significantly improving prediction accuracy.
Although existing deep learning-based SOH estimation methods, including CNN, T LSTM, Transformer, TCN, and hybrid models combining these architectures with attention mechanisms, have demonstrated promising performance, these methods primarily focus on model architecture optimization and rely on conventional signal features (voltage, current, temperature), often operating at either the module or cell level without modeling correlations between module-level and cell-level degradation.
In contrast, the proposed method is a hybrid TCN–Transformer model enhanced with an SE mechanism. It extracts module-cell and cell-level electrochemically informed features, enabling the model to capture both local and global dependencies while preserving electrochemical interpretability. This two-level hierarchical, mechanism-informed feature design allows the model to reflect both overall battery performance decay and internal cell inconsistency [
29], providing more accurate and physically meaningful SOH estimation. The main contributions of this study are summarized as follows:
- (1)
Beyond conventional signal processing, this study proposes a mechanism-informed feature extraction framework. At the module level, the length of the incremental power curve is selected as a feature, which serves as a macroscopic indicator capturing the cumulative effects of internal resistance increase and polarization intensification. At the cell level, the SOC at the knee point of the combined temperature-weighted voltage inconsistency curve is extracted, which is physically linked to the accelerated degradation stage triggered by strong polarization and thermoelectric feedback under high SOC conditions. These two-level hierarchical features offer highly complementary physical criteria for SOH estimation, addressing both overall performance decay and internal inconsistency evolution.
- (2)
To effectively fuse the above features with distinct electrochemical implications, this work proposes the TCN–Transformer–SE model. This approach dynamically assigns weights to the two-level hierarchical features using the Squeeze-and-Excitation (SE) mechanism, ensuring that the model prioritizes the most degradation-relevant information. The local temporal perception capabilities of the TCN are deeply integrated with the global dependency modeling of the Transformer to comprehensively capture the dynamic electrochemical processes. Additionally, the Huber loss function is employed to improve the model’s robustness against outliers.
- (3)
Comprehensive aging experiments on a lithium-ion battery pack are conducted to validate the proposed method. The validation not only confirms the estimation accuracy but, more importantly, substantiates the electrochemical rationale behind the feature selection and model design. The superiority and interpretability of the proposed SOH estimation framework are rigorously demonstrated through multi-feature comparison experiments, ablation studies, data sensitivity analyses, and comparative experiments with benchmark models.
The remainder of this article is organized as follows.
Section 2 provides a detailed introduction to the methods for extracting two-level hierarchical features.
Section 3 presents the proposed TCN–Transformer–SE model and analyzes its advantages.
Section 4 describes experimental procedures, results, and analysis. Finally,
Section 5 concludes the study.
2. Two-Level Hierarchical Feature Extraction
To comprehensively analyze the degradation behavior of battery modules and cells, this study proposes a two-level hierarchical feature extraction method. At the module level, the length of the incremental power curve during the constant current charging stage is selected as the key indicator. At the cell level, the SOC at the knee point of the combined temperature-weighted voltage inconsistency curve within the high SOC range throughout the charging cycle is extracted as the key indicator.
2.1. Incremental Power Analysis
This study proposes a feature extraction method based on incremental power. Under constant current charging conditions, the current remains constant while the power variation is mainly determined by the voltage response. As the battery ages, the internal resistance increases and the polarization effect intensifies [
30], causing the voltage to stabilize with respect to the capacity variation. This subsequently leads to a change in the rate of power variation with capacity, ultimately manifesting as a change in the length of the incremental power curve. ICA mainly relies on the differential transformation of voltage to capacity to characterize battery degradation. However, not all degradation features can be manifested through peak values during the battery aging process. Especially in multi-cycle usage, the degradation performance of the battery is usually a gradual change, and peak information is difficult to comprehensively capture this continuous evolution process. In contrast, the incremental power method proposed in this paper is based on the differential relationship between power and capacity, and can comprehensively reflect the energy conversion process of the battery. The rate of change of power with respect to capacity within each cycle is calculated using the numerical difference method, as illustrated in
Figure 1, thereby obtaining the incremental power relationship. The calculation is as follows:
where
P and
C represent power and capacity, respectively, and
i is the sampling point index within a single charging cycle.
Figure 1 presents the incremental power curves of several representative cycles before filtering. It can be observed that the curves contain significant oscillations with multiple local peaks, which are mainly caused by the differential calculation process that amplifies small variations and measurement noise in the original signals. These fluctuations may interfere with the accurate extraction of aging-related features. Before extracting features, Gaussian filtering (σ = 7, kernel size = 57) was applied to smooth the curve, effectively removing high-frequency noise while fully preserving the primary features that reflect the battery’s aging trend. The final incremental power curve is shown in
Figure 2.
Based on this, the curve length corresponding to each cycle is used as a feature. The length is defined as follows:
where
L represents the curve length,
xi represents the abscissa of the curve at the
i-th sampling point, which is the index of the sampling point, and
yi represents the ordinate of the curve at the
i-th sampling point, which is the rate of change of power at the
i-th point with respect to capacity.
n is the total number of sampling points of the curve.
2.2. Combined Temperature-Weighted Voltage Inconsistency Curve
In the high SOC range, the battery’s polarization phenomenon is significantly intensified. Even minor differences among individual cells are amplified by thermoelectric feedback, resulting in a pronounced knee point in the voltage inconsistency curve. This inflection point marks the transition from a slow degradation phase to an accelerated failure stage in battery aging. Because temperature variations not only influence polarization but also exacerbate uneven voltage distribution through thermal feedback, this paper simultaneously extracts temperature and voltage data from each channel during the cyclic charging process to develop a combined temperature-weighted voltage inconsistency index. Traditional DVA and DTA features typically focus on either voltage or temperature changes separately, making it difficult to capture this complex interaction effect. By incorporating temperature weighting, this method not only more accurately identifies transition points in battery degradation but also better addresses the complex battery behavior in high SOC ranges.
First, calculate the temperature variance for each temperature acquisition channel over time:
where
si2 represents the temperature variance of the
i-th channel,
Ti,t is the temperature value of the
i-th channel at time
t,
Tiavg is the average temperature of the
i-th channel, and
n is the total number of channels.
Normalization yields the weighting coefficient:
where
s is the sum of the temperature variances of all channels, and
wi is the normalized temperature weight of the
i-th channel.
Channels with larger variances exhibit greater thermal volatility and are assigned higher weights in the overall inconsistency assessment. Based on this, the combined temperature-weighted voltage inconsistency is defined as follows:
where
W represents the total sum of the temperature weights across all channels,
Vi,t denotes the voltage values of the
i-th channel at time
t,
Vtavg is the average voltage across channels after temperature weighting, and
Ct is the combined temperature-weighted voltage inconsistency at time
t.
As shown in
Figure 3, the curve exhibits a distinct knee point within the 0.8–1.0 SOC range, corresponding to the increased voltage difference caused by polarization effects and thermal coupling at the end of charging. This curve quantifies the combined effect of voltage and temperature disparities across cells. In battery degradation studies, knee points observed in aging-related curves are commonly interpreted as transitions from a relatively slow degradation regime to a more rapid degradation regime, where the slope and degradation rate increase significantly after the knee point [
31]. Similarly, identification methods for battery degradation trajectories have defined knee points as the boundary between linear and nonlinear (accelerated) decay stages [
32]. In our study, the observed knee point in the temperature-weighted voltage inconsistency curve within the high-SOC region marks a transition to a faster increase in cell-to-cell divergence, which can be interpreted as indicative of the onset of accelerated degradation in the battery pack. Therefore, the SOC at this knee point is identified as a key indicator of consistency degradation.
2.3. Feature Sensitivity Analysis and Degradation Characterization Evaluation
To systematically evaluate the sensitivity of the extracted features to the battery degradation process and their characterization capabilities, this section selects the two-level hierarchical features for analysis. Firstly, the features and SOH are normalized, and they are uniformly mapped to the [0, 1] interval to eliminate the influence of units:
where
X represents the original feature value, and
Xmin and
Xmax represent the minimum and maximum values of this feature over the entire cycling process.
SOH represents the original battery health status, while
SOHmin and
SOHmax represent its minimum and maximum values, respectively.
Xnorm and
SOHnorm represent the normalized feature value and SOH, respectively.
To quantify the degree of drift of the characteristics over the aging process of the battery, the absolute change in each parameter relative to the first cycle is calculated:
where
i represents the cycle index,
Xnorm(
i) and
SOHnorm(
i) represent the normalized feature value and SOH at the
i-th cycle, and
Xnorm(1) and
SOHnorm(1) represent the normalized feature value at the first cycle. Δ
X(
i) and Δ
SOH(
i) represent the changes in the normalized feature value and normalized SOH at the
i-th cycle relative to the first cycle.
Subsequently, the Pearson correlation coefficient
R was used to analyze the linear correlation between Δ
X and Δ
SOH:
where
R represents Pearson correlation coefficient, and
and
represent the mean values of Δ
X(
i) and Δ
SOH(
i) over all cycles.
n represents the total number of cycles.
The value range of Pearson correlation coefficient R is [−1, 1]. The closer the absolute value is to 1, the stronger the linear correlation between the feature change and the degradation of SOH will be, indicating that this feature is more sensitive to battery aging. Meanwhile, the corresponding p-value (probability of significance) is calculated using a t-test to assess the statistical significance of the correlation, with p < 0.05 indicating a statistically significant correlation.
As shown in
Figure 4, the variations in the two features both exhibit strong linear correlations with the variation in SOH. Among them, the Pearson correlation coefficient of the length of incremental power curve feature reaches 0.994 (
p < 0.001), with the data points closely distributed around the y = x reference line. This indicates that the change in this feature is almost synchronized with SOH degradation, demonstrating extremely high sensitivity to battery aging. The Pearson correlation coefficient of the SOC at the knee point feature is 0.9937 (
p < 0.001), which also shows a strong correlation with SOH variation, further verifying the sensitivity of this feature to the battery degradation process.
Figure 5 illustrates the evolution of feature variations with respect to the cycle number. It can be observed that as the number of cycles increases, both features gradually deviate from their initial values, intuitively reflecting the progressive aging process of the battery. From the smoothed trend curves, the length of the incremental power curve feature exhibits a stable monotonic increasing trend without obvious local fluctuations, demonstrating good stability and strong capability for degradation characterization. The SOC at the knee point feature also shows an overall increasing trend, and its variation remains consistent with the battery aging process, effectively reflecting the dynamic characteristics of degradation.
4. Experimental Process, Results and Analysis
4.1. Original Data and Evaluation Indicators
The SOH estimation experiment was conducted on a computing platform equipped with an Intel Core i9-12900H CPU (5.00 GHz) manufactured by Intel in the United States and an NVIDIA GeForce RTX 3060 GPU (6 GB) manufactured by NVIDIA in the United States. The deep learning model was implemented in a Python 3.7.16 environment using PyTorch 1.13.0 with CUDA 11.7.
The experimental data for this study were obtained from a lithium-ion battery pack aging test system developed in the laboratory. This test platform consists of a host computer, an Arbin 60 V 50 A 4CH LBT high-precision battery testing device (Arbin Instruments, College Station, TX, USA), and an auxiliary channel, as shown in
Figure 9. The Arbin 60 V 50 A 4CH LBT device is used to perform charge and discharge tests on the battery pack. The auxiliary channel utilizes a thermocouple temperature sensor (Arbin Instruments, College Station, TX, USA) to monitor the battery temperature and a voltage transmitter (Arbin Instruments, College Station, TX, USA) to measure the battery voltage.
The test subject is a battery pack composed of eight Lishen 18650 cylindrical lithium-ion cells, each with a rated capacity of 2.5 Ah, connected in series, with the positive electrode made of LiNi
xCo
yMn
zO
2. Each cycle sequentially includes three stages: constant current charging, constant voltage charging, and constant current discharging, completing a full charge–discharge cycle. The battery pack was cycled for a total of 796 cycles. All the cycle tests were conducted under the environmental temperature ranging from 24 °C to 36 °C. For the main SOH estimation experiments, the dataset was split evenly into training and test sets (50%:50%). To facilitate the training process and ensure consistent scaling across features, the SOH values were normalized to a range of 0 to 1. The cycle test parameters for the lithium-ion battery pack are presented in
Table 2, and the complete experimental procedure is illustrated in
Figure 10.
To evaluate the estimation accuracy and fitting performance of the model for SOH, this study selects the mean absolute error (MAE), root mean square error (RMSE), coefficient of determination (R
2) and mean absolute percentage error (MAPE) as performance evaluation metrics. The formulas are defined as follows:
where
ye represents the estimated value of the
i-th sample,
yi represents the true value of the
i-th sample,
yt is the mean of the true values, and
n is the total number of samples.
These evaluation metrics carry clear physical significance in battery SOH estimation. MAE and RMSE reflect the average deviation between the estimated SOH values and the true SOH values: smaller values indicate higher estimation accuracy, which reliably supports battery management system decisions such as remaining useful life prediction and maintenance warnings. R2 measures the model’s goodness of fit to the SOH degradation trend over battery aging, where values closer to 1 indicate higher coincidence between the estimated curve and the true degradation trajectory, reflecting a stronger capability to capture the SOH variation trend. MAPE measures SOH estimation accuracy from the perspective of relative error, intuitively reflecting the percentage of estimation deviation relative to the true SOH, facilitating performance comparison across different aging stages.
4.2. Multi-Feature Comparison Analysis
To verify the effectiveness of the selected features in SOH estimation, this study evaluated the performance of various feature combinations. The two features proposed in this paper were pairwise combined with the constant current average voltage (AV) and the constant voltage charging duration (CD), then input into the TCN–Transformer–SE estimation model. The output accuracy was compared to demonstrate the superiority of the feature combination proposed in this study. The experimental results are presented in
Table 3 and
Figure 11.
Among all the evaluated feature combinations, the pairing of curve length and the SOC at the knee point demonstrated the best estimation performance, with evaluation metrics significantly outperforming those of other feature combinations (MAE = 0.0031, RMSE = 0.0038, R2 = 0.9937, MAPE = 0.3820). When curve length and SOC at the knee point were each combined with the constant current average voltage, the feature combinations based on curve length (curve length + AV: R2 = 0.9767; curve length + CD: R2 = 0.9365) generally outperformed the corresponding combinations based on SOC at the knee point (SOC + AV: R2 = 0.9720; SOC + CD: R2 = 0.8854). This indicates that the curve length feature has a stronger discriminative ability in characterizing the battery aging state. Although constructing of the SOC at the knee point feature involves temperature data, feature-level fusion not only enables efficient utilization of the existing thermocouple temperature sensor but also enhances the model’s representational capacity through the complementarity of the two features. Compared to other combinations involving the inflection point SOC, the combination of curve length and SOC reduces the MAE to 0.0031, the RMSE to 0.0038, and the MAPE to 0.3820, and it increases the R2 to 0.9937.
4.3. Ablation Studies
To investigate the specific contributions of each component in the model to its overall performance, a series of ablation experiments were conducted in this study. The model’s performance significantly deteriorated when either the SE or Huber component was removed. The experimental results are presented in
Table 4 and
Figure 12.
The experimental results demonstrate that incorporating the Huber loss function significantly enhances model performance. Compared to the model without the Huber loss, the MAE decreased from 0.0046 to 0.0031, the RMSE dropped from 0.0057 to 0.0038, and the MAPE decreased from 0.5752 to 0.3820. Additionally, R2 increased from 0.9858 to 0.9937. These findings confirm the effectiveness of the Huber loss function in improving model robustness.
Meanwhile, the introduction of the SE mechanism has significantly improved the model’s performance. When using the Huber loss in all cases, compared to the model without the SE mechanism, the MAE decreased from 0.0054 to 0.0031, the RMSE decreased from 0.0071 to 0.0038, the MAPE decreased from 0.6873 to 0.3820, and the R2 increased from 0.9778 to 0.9937. These results demonstrate that the SE mechanism effectively enhances the model’s ability to focus on key features and improves the efficiency of feature utilization.
4.4. Data Sensitivity Analysis
Under various data partitioning conditions, the TCN–Transformer–SE model demonstrates excellent generalization capabilities. This study designed a verification experiment with low data dependency. In this experiment, the top 30%, 40%, and 50% of cyclic data from the dataset were used as training sets to simulate early deployment scenarios in practical applications. The results are presented in
Table 5 and
Figure 13.
It can be observed that both models are capable of capturing the overall degradation trend of the battery throughout the entire cycling process. Even when only 30% of the data is used for training, the predicted curves still follow the general downward trajectory of the actual SOH, demonstrating the robustness of the proposed framework under limited training data conditions.
However, noticeable differences between the two models can be observed in the later degradation stage. As shown in
Figure 13a, the predictions of the baseline TCN–Transformer model exhibit relatively larger fluctuations and deviations from the actual SOH curve. In contrast, the TCN–Transformer–SE model produces smoother estimation curves that align more closely with the actual SOH trajectory.
As the proportion of training data increases from 30% to 50%, the estimation accuracy of both models gradually improves. The predicted curves become more consistent with the ground-truth SOH, and the deviation in the late-cycle region is further reduced. In particular, under the 50:50 data split shown in
Figure 13c, the TCN–Transformer–SE model almost overlaps with the actual SOH curve across most cycles, indicating its strong capability in capturing both local degradation patterns and long-term dependencies.
Furthermore, the SE mechanism enhances the model’s ability to adaptively recalibrate feature importance, which helps suppress noise-induced fluctuations and extract more representative degradation information. Consequently, the TCN–Transformer–SE model consistently outperforms the baseline model across all training ratios, achieving the best performance when sufficient training data are available.
4.5. Public Dataset Validation
To further verify the effectiveness of the proposed method, this paper selected the CS2-35 and CS2-36 public lithium-ion battery datasets provided by the CALCE Center of the University of Maryland. All the CS2 batteries in this dataset were charged using the standard constant current and constant voltage charging scheme. As a classic dataset for battery aging research, it provides strong support for verifying the effectiveness and generalization ability of the model.
Based on this dataset, the TCN–Transformer–SE model was verified. The experimental results are shown in
Table 6 and
Figure 14. From
Table 6, it can be seen that the model achieved an estimation accuracy of MAE = 0.0067, RMSE = 0.0096, R
2 = 0.9852, and MAPE = 0.9161% on the CS2-35 battery and an estimation accuracy of MAE = 0.0092, RMSE = 0.0107, R
2 = 0.9513, and MAPE = 1.2339% on the CS2-36 battery.
Figure 14 shows the comparison curves of the estimated SOH results and the true values for the two batteries. From the figure, it can be seen that the estimated curve of the model maintains a high consistency with the true SOH trajectory, and it can track the aging process of the battery relatively accurately. The proposed method can achieve good estimation results on the public dataset, further verifying its effectiveness and reliability.
4.6. Comparative Experiments with Other Models
The proposed TCN–Transformer–SE model is compared and evaluated against a variety of representative sequence models currently in use. The baseline models included in the comparison are GRU, LSTM, Transformer, TCN, CNN-LSTM, and TCN–Transformer. All models were trained using the first 50% of the cycle data, and their estimation performance was assessed on the remaining 50%. The results are presented in
Table 7 and
Figure 15. Key hyperparameter settings for different models are shown in
Table 8.
Compared to other baseline models, the proposed TCN–Transformer–SE demonstrates superior estimation accuracy on the dataset. The RF model (R2 = 0.7846), as a traditional machine learning method, offers advantages in handling small-sample data and providing feature importance rankings; however, it lacks the inherent capability to model temporal dependencies, limiting its effectiveness for sequential aging data. The GRU model (R2 = 0.8694) features a streamlined structure with high parameter efficiency, yet its performance is somewhat limited for capturing complex battery degradation patterns due to its simpler gating mechanism. The LSTM model (R2 = 0.9002) exhibits stronger time series modeling capabilities through its comprehensive gating mechanism, effectively addressing the vanishing gradient problem, but it treats all input features equally and cannot dynamically prioritize degradation-relevant information as aging mechanisms evolve. The GPR model (R2 = 0.9135), as a probabilistic approach, provides uncertainty quantification along with predictions, which is valuable for risk assessment, yet it struggles to capture the highly nonlinear patterns in long-term degradation trajectories. The Transformer model (R2 = 0.9236) benefits from its self-attention mechanism in capturing global dependencies across the entire sequence, enabling parallel computation and long-range modeling, but it may overlook local fluctuations critical for detecting early-stage degradation signals. The TCN model (R2 = 0.9320) leverages its unique causal dilated convolution structure to excel in local feature extraction and parallel computation while maintaining a flexible receptive field, yet it faces challenges in modeling ultra-long-range dependencies across hundreds of cycles. The CNN-LSTM model (R2 = 0.9723) achieves a significant performance boost by combining the CNN’s local feature extraction with LSTM’s temporal modeling strengths, effectively capturing both spatial patterns and temporal dynamics, but it lacks an adaptive mechanism to weight features differently across aging phases. The TCN–Transformer hybrid model (R2 = 0.9778) further integrates TCN’s local feature extraction with Transformer’s global dependency modeling, achieving even better performance by addressing the limitations of each individual component, yet without dynamic feature prioritization, it cannot fully exploit the most informative features at each aging stage. Finally, the TCN–Transformer–SE model, which incorporates the SE mechanism, effectively enhances the contribution of key features through adaptive feature weighting, addressing the limitations of above models and attaining the best results across all metrics (MAE = 0.0031, RMSE = 0.0038, R2 = 0.9937, MAPE = 0.3820).
5. Conclusions
This study proposes a novel method for estimating SOH of lithium-ion battery packs. Incremental power feature and combined temperature-weighted voltage inconsistency characteristic are calculated at the module and cell levels, respectively, providing comprehensive inputs for SOH estimation. The proposed TCN–Transformer–SE method effectively addresses challenges such as the difficulty RNNs face in capturing long-term dependencies and the limited receptive fields of convolutional neural networks (CNNs) by leveraging a collaborative mechanism that integrates local perception, global modeling, and feature adaptation. In experiments on SOH estimation of battery packs, this method consistently demonstrated excellent estimation performance and showed strong applicability in scenarios with limited data.
However, several limitations should also be acknowledged. First, the proposed method requires simultaneous access to both module-level and cell-level features, which may increase the complexity of data acquisition in practical battery management systems. Second, the combined temperature–voltage inconsistency indicator relies heavily on the high SOC range, where polarization effects are most pronounced. In real-world applications, batteries may not frequently operate in this region, potentially affecting feature extraction effectiveness.
In terms of general applicability, the proposed method has been evaluated on both a laboratory battery pack dataset and a publicly available dataset, where both features demonstrated consistent estimation performance across different data sources. These results indicate that the proposed framework exhibits promising generalization capability. However, the current validation is primarily based on data from lithium nickel cobalt manganese oxide (LiNixCoyMnzO2) and lithium cobalt oxide (LiCoO2) battery chemistries, and the applicability of the method to other chemistries (e.g., lithium iron phosphate, lithium titanate) requires further investigation. Future research will focus on exploring the generalization performance of this method across different battery pack systems.