1. Introduction
Lithium-ion battery state-of-health (SOH) estimation is a critical function in a battery management system (BMS), especially for electric vehicles (EVs) and renewable energy storage, as it directly impacts operational safety and reliability [
1]. Over time, lithium-ion batteries experience degradation in the form of capacity loss and increased internal resistance [
2]. This degradation can lead to reduced performance, including decreased range and power [
3]. In addition, aging batteries can pose serious safety hazards such as thermal runaway, which may lead to fire or explosion if not managed properly [
4]. Therefore, obtaining accurate and timely SOH information enables predictive maintenance, optimizes energy use, and helps prevent unexpected failures [
5]. However, SOH cannot be directly measured during battery operation, which makes SOH estimation challenging [
6]. Therefore, it is necessary to develop stable estimation algorithms for SOH prediction by voltage, current, temperature and other signals, which has become an important research focus in recent years [
7].
Currently, battery SOH estimation methods are mainly categorized into two main groups: modeling methods and data-driven methods. Modeling approaches for battery SOH estimation are generally categorized into equivalent circuit models (ECMs) and electrochemical models. ECMs simulate the dynamic behavior of batteries using combinations of resistors, capacitors, and voltage sources. These models are widely used due to their simplicity and low computational cost. However, they are empirical in nature, lack direct physical interpretability, and their parameters can drift significantly with aging and operating conditions, requiring frequent recalibration [
8]. In contrast, electrochemical models, such as the pseudo two-dimensional model, are derived from fundamental principles of thermodynamics and electrochemistry. They can accurately capture internal states like lithium-ion concentration and reaction kinetics, offering high fidelity and predictive capability. However, these models are computationally demanding and rely on detailed knowledge of battery structure and material parameters, which limits their application in practical BMS implementations [
9]. Due to the limitations of both model types, more studies have turned to more flexible data-driven methods to overcome the limitations of the modeling approach.
With the rapid development of artificial intelligence, SOH estimation with data-driven methods has attracted widespread attention. Data-driven methods extract features from battery data and utilize algorithms to explore the relationship between in features and capacity. Compared with other methods, data-driven approaches do not require complex experimental equipment or rely on an in-depth knowledge of the battery’s internal mechanical states. Furthermore, they can effectively improve the accuracy of SOH estimation in different application scenarios and battery charging and discharging modes, and show good adaptability and estimation capability, which is promising for a wide range of research. The two key steps of data-driven are feature extraction and model construction. Researchers usually extract features such as battery charging and discharging voltages, currents, and temperatures from battery databases, which respond to the health status such as battery capacity. Feature extraction is a key step in building SOH estimation models, and as research continues, various feature extraction strategies have emerged. For example, Deng et al. introduced a voltage-partitioning strategy to obtain the difference in discharge capacity between two cycles △Q(V) from a non-monotonic or impulse discharge voltage profile, and a filtering strategy to obtain a smoothed voltage profile under dynamic discharge conditions, and the standard deviation of the discharge capacity profile and △Q(V) were selected as the health features [
10]. Li et al. At the same time, internal aging features, such as charge transfer resistance, solid-phase diffusion coefficient, and electrode volume fraction, are extracted by EM, and external health features are extracted from the voltage and temperature curves which are divided into multiple phases [
11]. Tao et al. Based on the analysis of the aging mechanism of the batteries and the change rule of the curves, a total of 61 health indexes related to the degradation of the performance are extracted from the cycling curves. The Discrete Wavelet Transform (DWT) is used for noise reduction in the original metrics during preprocessing. A normalized mutual information (MI) feature selection algorithm based on maximum correlation and minimum common redundancy is introduced to select a set of optimized operating condition indicators [
12].
Subsequently, algorithms are modeled to establish the relationship between these features and capacity, and commonly used algorithms include including linear regression (LR) [
13], support vector machine (SVM) [
14], random forest (RF) [
15], decision tree (DT) [
16] and others. In order to further improve the accuracy and efficiency of SOH estimation, many researchers have developed integrating intelligent optimization algorithms with data-driven models. In recent years, many high-level research results have proved the effectiveness of such methods. For example, Fu et al. proposed an improved Arithmetic Optimization–Backpropagation Neural Network (AO-BPNN) for state of charge (SOC) estimation, which achieved satisfactory results under dynamic conditions [
17]. Chen et al. combined Particle Swarm Optimization (PSO) with Extreme Learning Machines (ELM) for SOH estimation, demonstrating enhanced predictive accuracy [
18]. Li et al. use the PSO algorithm to optimize the parameter configuration of the group method of data handling (GMDH). The optimized GMDH was used for the SOH estimation. The results show that the error of the proposed PSO-GMDH estimation model in estimating the subsequent SOH of a battery using its historical data is kept within 0.89% [
19]. Sheng et al., based on the MTR-GPR model, extracted the information from different datasets based on the distributional similarity to make full use of the aging data of batteries from multiple sources, and at the same time, reduce the negative impact of distributional differences [
20]. Tong et al. proposed an SVR model optimized using the Dragon Beetle Optimizer (DBO) and conducted comparative experiments across different training data scales to validate the model in SOH estimation [
21]. However, the performance of individual models may vary considerably on different datasets [
22]. This suggests that different models may perform well in different data environments and that no single model consistently achieves optimal results in all scenarios [
23]. To address this problem, increasing attention has been directed toward multiple model algorithms being collaboratively optimized into stacked integrated learning models for more accurate and reliable SOH estimation. For example, Zhi et al., proposed a genetic algorithm-particle swarm optimization (GA-PSO) algorithm to optimize the parameter values of SVR, improving the estimation accuracy and convergence speed of SVR [
24]. Li et al. proposed a novel stacked-integrated model, which consists of five basic models, integrating LightGBM, XGBoost, random forest (RF), SVR, and GPR through LR fusion to ensure model diversity. and GPR to ensure model diversity and improve SOH estimation accuracy. The average SOH estimation error of the stacked integrated model is lower than that of the five basic models [
25]. To further improve the performance of data-driven methods in SOH estimation, many researchers have focused on improving feature processing and optimizing algorithmic models. This paper also aims to contribute to this research area by exploring relevant improvement methods.
In addition, the estimation accuracy of SOH estimation models depends on the correlation between health features and battery SOH, and the adaptability of the algorithmic model’s own operation logic to the battery SOH. The selection of hyperparameters is a common challenge for most estimation models, and hyperparameter optimization plays a key role in improving the predictive performance and stability of the models [
26]. However, the hyperparameters of most machine learning models still rely on offline adjustment or empirical selection and lack a mechanism to dynamically adjust with the battery operating conditions, which makes it difficult to ensure that the model maintains optimal performance under different operating conditions.
This study proposes the construction of the IPSO-SVR model for dynamic hyper-parameter tuning, integrating numerous intelligent optimization algorithms to address these issues. The method leverages the strengths of the genetic algorithm (GA), whale optimization algorithm (WOA), and simulated annealing algorithm (SA) to enhance PSO, while simultaneously incorporating dynamic optimization and adjustment of SVR hyperparameters. This approach aims to improve the model’s global optimization capability and convergence stability, thereby enhancing the accuracy, robustness, and generalization performance of SOH estimation. This research presents the training and validation of the proposed IPSO-SVR model utilizing a public battery aging experimental dataset and conducts a comparative analysis with classical PSO-SVR and other data-driven methodologies to assess the efficacy of the current approach. The primary objective of this paper is as follows:
A health feature extraction method based on the value of the change in the voltage of a single point of the critical zone over the cycle period from the voltage profiles of different cycles of the battery is proposed to provide a more comprehensive description of the battery aging process. This approach permits a fast and flexible estimation of the battery capacity.
In order to address the performance fluctuations of a single model on different datasets, this paper proposes the IPSO-SVR model for dynamic hyper-parameter tuning integrating multiple intelligent optimization algorithms, allowing us to take advantage of the complementary strengths of multiple models.
The estimation results of the IPSO-SVR model in this paper are compared with the traditional model to confirm the model advantages with the help of relevant evaluation indexes, and the experimental results are analyzed and discussed at the same time.
The rest of the paper is organized as follows:
Section 2 describes the dataset used in this study and the health factor extraction process.
Section 3 describes the characteristics and structure of the estimated model. In
Section 4, the estimation results of the model are analyzed and compared with the base model. Finally,
Section 5 summarizes the paper.
3. IPSO-SVR Model Construction
The predictive efficacy of the identical model may fluctuate between batteries due to the varied aging trajectories of various batteries. These discrepancies result in inadequate estimation precision of individual models in conventional machine learning techniques. Numerous studies have employed methodologies like the implementation of advanced algorithms to assess SOH [
36,
37]. Nonetheless, traditional algorithms based on a singular model have specific benefits and intrinsic worth for reintegration and use, and they have previously been widely applied across various application scenarios. This study introduces the IPSO-SVR model for dynamic hyper-parameter tuning incorporating multiple intelligent optimization algorithms to enhance the resilience and generalization capacity of the SOH estimate process.
3.1. Algorithmic Principle
In SVR modeling for SOH estimation of lithium-ion batteries, optimizing hyperparameters presents challenges in balancing global optimization efficiency with local convergence accuracy when depending solely on a single intelligent algorithm. Consequently, this work proposes the development of the IPSO-SVR model for dynamic hyper-parameter tuning incorporating multiple intelligent optimization algorithms. The method leverages the strengths of WOA, GA, SA, and other algorithms to enhance PSO, while simultaneously incorporating dynamic hyperparameters to facilitate the adaptive optimization of SVR hyperparameters, thereby synergistically improving hyperparameter search efficacy and model performance.
3.2. Dynamic Hyper-Parameter Optimization
The selection of hyperparameters is vital for the efficacy of machine learning models, particularly in SVR models, where the choice of penalty coefficients and kernel function parameters directly influences generalization performance and predictive accuracy. The conventional fixed hyperparameter optimization method typically establishes hyperparameters during the training phase and maintains them constant throughout the optimization process, resulting in inflexibility when confronted with diverse datasets and variations in model complexity, potentially leading to suboptimal model performance.
The proposed dynamic hyperparameter optimization method modifies and calibrates hyperparameter values in real time throughout the model training process, allowing for adaptation to the evolving data features during training. This study demonstrates a fusion strategy, where hyperparameter adjustments are progressively refined during the program’s iterative phase by using dynamic inertia weights and adaptive cross-variation probabilities. Initially, elevated inertia weights and increased cross-variance probability enable the model to thoroughly investigate the hyperparameter space, thereby preventing premature convergence to local optima; as iterations progress, both inertia weights and cross-variance probability are systematically diminished, facilitating a transition to a more localized and refined search to accurately identify the optimal value within the hyperparameter space.
In contrast to the conventional fixed hyperparameter approach, this dynamic adjustment mechanism enhances the model’s generalization capability and predictive accuracy, while also markedly increasing the algorithm’s robustness, allowing for superior adaptation to diverse datasets and optimization requirements. The empirical results demonstrate that the fusion algorithm employing dynamic hyperparameter optimization exhibits significant advantages in the hyperparameter optimization search process of the SVR model, thereby validating its rationality and efficacy in real application.
3.3. Fusion Principle
The PSO algorithm, a conventional hyper-parameter optimization method, offers high computational efficiency and rapid convergence in optimizing parameters for the SVR model; however, it is also susceptible to limitations such as a tendency to converge to local optima and a swift decline in population diversity. This paper proposes three optimization strategies that integrate GA, SA and WOA with the PSO algorithm to leverage the strengths of each method and achieve synergistic benefits.
Figure 4 shows the flowchart of the IPSO-SVR prediction process, using the SA-based fusion strategy as an example.
The GA-PSO fusion technique employs the selection, crossover, and mutation mechanisms of GA to successfully preserve population diversity and enhance search space coverage, thereby compensating for the limitations of PSO, which is susceptible to converging on local optima in the later stages.
The WOA-PSO fusion algorithm utilizes the WOA’s robust global search in the initial phase, remarkably enhancing the depth and precision of exploration while mitigating the risk of local optima.
The SA-PSO fusion algorithm is based on the SA, which enhances stability and accuracy during the later stages of convergence. This technique integrates the rapid convergence attributes of PSO to achieve the synchronized advancement of exploration and exploitation.
In essence, these three fusion algorithms leverage the strengths of several algorithms to enhance the standard PSO-SVR model and address the shortcomings of individual techniques. Practical experiments and subsequent calculations reveal that these fusion algorithms exhibit clear theoretical rationale and offer exceptional optimization performance in certain applications, thus validating their practical significance and superiority in the hyperparameter optimization of SVR models.
3.4. Indicators for Model Evaluation
In order to objectively and accurately assess the reliability of lithium-ion battery SOH estimation results, it is essential to select appropriate evaluation indicators. The evaluation indexes can quantify the difference between the model prediction and the actual data, which facilitates quantitative analysis and model performance comparison, and thus guides the optimization and improvement of the model. The evaluation indexes selected in this paper include mean absolute error (MAE), root mean square error (RMSE), mean square error (MSE), and coefficient of determination (R2).
The average absolute error can intuitively reflect the degree of absolute deviation between the predicted value and the real value, and the smaller its value indicates the better prediction performance of the model. The formula is as follows:
The root mean square error averages the squares of the prediction errors before taking the square root, and can effectively reflect the overall degree of deviation in the prediction errors. Lower RMSE indicates better model accuracy. The calculation formula is as follows:
The mean square error, by averaging the squares of the prediction errors, can effectively penalize larger error values and thus reflect the accuracy of the model more sensitively. The calculation formula is as follows:
The coefficient of determination is used to measure the degree of fit between the predicted and true values of the model, and its value ranges from 0 to 1, with values close to 1 indicating that the model has a good fit. The formula is as follows:
In the above four Formulas (4) to (6), is the true value, is the predicted value, is the number of samples, and is the average value of the true value. Through the calculation and analysis of the above evaluation indexes, the performance of the SOH estimation model can be comprehensively and accurately assessed to ensure that the model has a high degree of reliability and stability when it is applied to practical problems.
4. Analysis and Comparison of Experimental Results
This part verifies the effectiveness of the IPSO-SVR model for dynamic hyper-parameter tuning incorporating multiple intelligent optimization algorithms in SOH estimation. Firstly, the health characteristics after Pearson’s correlation coefficient analysis are inputted into the IPSO-SVR model for SOH estimation, and the estimation results are visualized through graphs and charts, and the estimation results are evaluated through relevant evaluation indexes. The estimation results are also compared with those of traditional models to verify the superiority of the estimation model proposed in this paper.
4.1. Experimental Design
This paper utilizes B0005, B0006, and B0007 batteries from the NASA battery dataset for capacity estimation. The data concerning charge and discharge cycles throughout the lifespan of lithium-ion batteries serves as a critical foundation for constructing this estimation model, with a total of 166 cycles recorded in the dataset pertaining to the batteries’ cycle life. During model training, 60% of the data is used as training data, while 40% serves as test data; specifically, data pertaining to lithium-ion batteries from the initial 99 cycles constitute the training set, while data from the final 67 cycles comprise the test data.
To effectively validate the optimization effectiveness and stability of IPSO-SVR model, we selected an Intel® Core™ Ultra 9 185H CPU (2.50 GHz) and 32 GB of LPDDR5X memory, sourced from Intel Corporation (Santa Clara, CA, USA). The model was trained and tested using MATLAB R2024b software on the Windows 11 operating system.
4.2. IPSO-SVR Model Estimation Results
Figure 5 illustrates the error percentage curve derived from the capacity estimate outcomes of the IPSO-SVR model applied to the NASA dataset. The graphic distinctly illustrates the extent of the inaccuracy in SOH estimation by the IPSO-SVR model across various cells and fusion procedures, thus demonstrating the algorithm’s predictive ability. The absolute maximum error percentage is approximately 2% for cell B0005, indicating a little estimation error and precise prediction, whereas it is nearly 4% for cell B0006 and around 3% for cell B0007. The estimation errors of various batteries exhibit distinct variances and oscillations in accuracy while utilizing fusion GA, SA, and WOA, respectively. This may pertain to elements such as the intrinsic variations of lithium-ion batteries and the dataset, which could be associated with the inherent nonlinearity and uncertainty characteristics of lithium-ion batteries during the aging process. The degradation of battery capacity is not a straightforward linear decline but is influenced by various physicochemical phenomena, including the formation of solid electrolyte interfacial (SEI) membranes, structural alterations in electrode materials, and electrolyte decomposition. The strength and rate of these mechanisms among various battery monomers exhibit modest variations, resulting in a nonlinear, multi-stage capacity decline trajectory. Furthermore, minute variations in the production process of battery monomers, material imperfections, and internal structural discrepancies contribute to unpredictable alterations in the aging rate and pattern. Such nonlinearities and uncertainties are challenging to effectively represent using existing data-driven models, resulting in discrepancies in prediction accuracy across various battery datasets. Overall, the estimation findings indicate that the IPSO-SVR model exhibits high accuracy, particularly with the B0005 dataset, where various fusion procedures have yielded superior estimation outcomes.
Using the B0005 battery as an example, it is evident from
Figure 6 that in the prediction set, the application of the three fusion strategies—fusion GA, SA, and WOA—yields prediction results with minimal volatility, all closely approximating the actual value. The dynamic hyper-parameter adjustment model design technique, which incorporates many intelligent optimization methods, demonstrates superior performance in the SOH estimate procedure.
The error data in
Table 3 indicates that the IPSO-SVR model demonstrates exceptional accuracy and stability for SOH estimate on the NASA dataset. The minimum R
2 value across all three datasets, utilizing various fusion strategies, is 0.8835, with the majority exceeding 0.9. The maximum MAE is 1.1359%, while most values are below 0.5%. The maximum RMSE is 1.3501%. And the maximum MSE is 0.0182%, with the majority remaining under 0.005%. The IPSO-SVR model exhibits low error values and high estimation capability across various evaluation metrics. The estimation models demonstrate superior results for B0005 and B0007 compared to B0006, potentially due to the inherent properties of the dataset. The estimation models, after implementing various fusion strategies, consistently yield low-error results, demonstrating that these strategies, leveraging the inherent characteristics of the fusion algorithms, effectively integrate with the PSO-SVR model to deliver precise estimations.
4.3. Comparison of the IPSO-SVR Model with Conventional Models
This research compares the IPSO-SVR model, which employs various fusion strategies for dynamic hyper-parameter tuning using numerous intelligent optimization algorithms, with the conventional PSO-SVR estimation model to further verify its superiority. To facilitate a more equitable comparison of each model’s performance, both the PSO-SVR model and the IPSO-SVR model utilize identical health features derived from the NASA dataset as inputs, employing the same top 60% of data as the training set, and implemented on the same experimental platform. Its construction process and performance objectives were designed based on existing models reported in previous studies, serving as a reference benchmark [
38].
Figure 7 presents the estimation results of the various models for each battery set. The three estimation curves labeled WOA-PSO-SVR, SA-PSO-SVR, and GA-PSO-SVR in
Figure 7 represent the estimation results of the IPSO-SVR model under different optimization strategies. Only the findings and errors of the test data are displayed for clarity.
Figure 7 illustrates the comparative estimation results of B0005, B0006, and B0007 across various estimation models. The visual representation facilitates the discernment of differences in estimation outcomes among the models, revealing that the IPSO-SVR exhibits superior estimation accuracy across all datasets. Particularly with the B0006 dataset, the inherent properties of the dataset result in significant estimation errors for the traditional model, with errors increasing progressively as the number of cycles increases. The estimation results of the IPSO-SVR model across the three fusion procedures closely align with the actual values, yielding superior estimation outcomes. This compellingly demonstrates the superiority of the proposed model in this study, which circumvents the prevalent severe errors found in traditional models, thus highlighting the high estimation accuracy and stability of the IPSO-SVR model across various datasets.
To further illustrate the superiority of the suggested model proposed in this study,
Table 4 presents the error levels according to several error evaluation criteria for the classic PSO-SVR model. The findings across multiple error evaluation criteria indicate that the conventional PSO-SVR exhibits significant estimation errors for the NASA lithium-ion dataset in this study. Upon analyzing and computing data from
Table 3 in the preceding section, the estimation outcomes exhibit significant errors across various fusion strategies for each evaluation metric: MAE is recorded at 0.4143%, RMSE at 0.5887%, MSE at 0.0035%, and R
2 at 0.9577 for B0005. Calculations indicate that the MAE decreased by around 50%, the RMSE by around 36%, the MSE by roughly 56%, and the R
2 increased by 7%. The error reduction for the remaining two datasets is comparable to the results for B0005. The IPSO-SVR model attains enhanced accuracy in capacity and SOH estimation by employing three distinct fusion optimization procedures alongside dynamic hyperparameter optimization.
4.4. Discussion
Although the proposed IPSO-SVR model demonstrates high accuracy and robustness in SOH estimation tasks, it still presents certain limitations in terms of model complexity, deployment adaptability, and general applicability, which warrant further discussion.
First, while the IPSO-SVR model enhances the capability of parameter optimization, it also introduces a significant computational burden during the training phase. Under the configuration of 100 maximum iterations and a population size of 20, the training and prediction process for a single battery over 166 cycles in the NASA dataset took approximately 15 to 40 s on our experimental platform, with an average of 23 s. Although this time consumption is acceptable for offline training and can be performed periodically, it may pose a potential bottleneck in resource-constrained scenarios or in cases requiring frequent model updates.
Despite its real-time prediction capabilities, deploying the IPSO-SVR model on embedded platforms may encounter challenges such as memory limitations and high algorithmic complexity. Future work will address these issues by exploring lightweight model variants, reducing redundant optimization stages, and tailoring algorithmic implementations to hardware specifications.
Moreover, the generalization ability of the model across different datasets and battery chemistries has not yet been fully verified. The experiments conducted in this study are based on the publicly available NASA dataset, which, although a widely accepted benchmark, differs from modern commercial batteries in terms of sample source and electrochemical characteristics. Therefore, the model’s stability and accuracy under alternative battery types, operating conditions, or charging/discharging strategies remain to be validated. In future work, we plan to incorporate multi-source datasets to further evaluate and improve the model’s practical robustness and reliability.
5. Conclusions
The assessment of state of health for lithium-ion batteries is a critical component in battery management, maintenance, and operation processes, particularly given the swift advancement of the global electric vehicle market and the extensive utilization of lithium-ion batteries as power sources [
39]. This paper quantifies battery SOH, generates IC curves from the NASA dataset to extract health features, and develops the IPSO-SVR model for dynamic hyper-parameter tuning, integrating various intelligent optimization algorithms to address the inadequacies in SOH estimation accuracy of traditional models in complex and variable application scenarios. A more precise prediction of battery SOH is achieved, which successfully forecasts the variation in battery SOH. The primary conclusions of this paper are as follows:
The ICA method was used to generate IC curves. From these curves, five health-related features were extracted, and both Pearson and Spearman rank correlation analysis confirmed their strong association with SOH.
This paper proposes the Improved Particle Swarm Optimization-Support Vector. Regression model for dynamic hyper-parameter tuning incorporating multiple intelligent optimization algorithms. Prediction results demonstrate that the IPSO-SVR model is both robust and generalizable. Across three battery packs, the minimum R2 was 0.8835 (with the vast majority above 0.90), the maximum MAE was 1.1359% (most values below 0.50%), the maximum RMSE was 1.3501%, and the maximum MSE was 0.0182% (most values below 0.005%). Even in the worst-case fusion scenario, estimation accuracy remained high.
Compared with the conventional PSO-SVR model, the proposed IPSO-SVR exhibits markedly stronger generalization: MAE was reduced by approximately 50%, RMSE by about 36%, MSE by around 56%, and R2 increased by 7%.
On the experimental platform used in this study, the average training time of the IPSO-SVR model was 23 s, with a maximum of 40 s. Although this represents an increase in time complexity relative to traditional models, it remains within an acceptable range.
In future work, we will validate the accuracy of the IPSO-SVR model on a broader range of battery datasets and devote efforts to continuously enhancing its generalization capability. We also plan to integrate the complementary strengths of traditional estimation models—leveraging their proven merits while further improving SOH prediction accuracy—and to explore the incorporation of emerging algorithms into SOH estimation, with the goal of developing models that deliver even better overall performance.
This research contributes to sustainable energy management by improving the accuracy of battery SOH estimation, which in turn enables more efficient and proactive battery management in applications like EVs and grid storage. By using the IPSO-SVR model to maintain batteries in optimal condition, the battery lifespan can be extended, delaying replacement needs and allowing each battery to deliver more service over its life. Extending battery life means fewer batteries need to be manufactured and discarded, which helps optimize resource utilization (reducing the demand for raw materials such as lithium and cobalt) and decreases waste. Consequently, the environmental impact associated with battery production and disposal is mitigated. Overall, the proposed approach supports the development of more sustainable energy storage systems and aligns with broader goals of sustainable development.