Next Article in Journal
Aging Estimation and Clustering of Used EV Batteries for Second-Life Applications
Previous Article in Journal
Electro-Thermal Modeling and Parameter Identification of an EV Battery Pack Using Drive Cycle Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

State-of-Health Estimation of LiFePO4 Batteries via High-Frequency EIS and Feature-Optimized Random Forests

1
School of Automotive Studies, Tongji University, Shanghai 201804, China
2
Shanghai Qiyuan Green Power Technology Co., Ltd., Beijing 100044, China
*
Author to whom correspondence should be addressed.
Batteries 2025, 11(9), 321; https://doi.org/10.3390/batteries11090321
Submission received: 24 July 2025 / Revised: 23 August 2025 / Accepted: 27 August 2025 / Published: 28 August 2025

Abstract

Accurate state-of-health (SOH) estimation of lithium iron phosphate (LiFePO4) batteries is critical for ensuring the safety and performance of electric vehicles, particularly under extreme operating conditions. This study presents a data-driven SOH prediction framework based on high-frequency electrochemical impedance spectroscopy (EIS) measurements conducted at −5 °C across various states of charge (SOCs). Feature parameters were extracted from the impedance spectra using equivalent circuit modeling. These features were optimized through Bayesian weighting and subsequently fed into three machine learning models: Random Forest (RF), Gradient Boosting (GB), and Extreme Gradient Boosting (XGB). To mitigate SOC-dependent variations, the models were trained, validated, and tested using features from different SOC levels for each aging cycle. This work provides a practical and interpretable approach for battery health monitoring using high-frequency EIS data, even under sub-zero temperature and partial-SOC conditions. The findings offer valuable insights for developing SOC-agnostic SOH estimation models, advancing the reliability of battery management systems in real-world applications.

1. Introduction

With the rapid development of electric vehicles (EVs), battery safety has become a critical issue. Battery-related failures account for a substantial proportion of performance degradation and safety incidents in EVs [1,2]. Among various battery chemistries, lithium iron phosphate (LFP) batteries are widely used in cost-effective EVs due to their high thermal stability, long cycle life, and intrinsic safety [3,4]. In addition to LFP, nickel-rich layered oxides such as lithium nickel manganese cobalt oxide (NMC) and lithium nickel cobalt aluminum oxide (NCA) are extensively adopted in high-energy-density EVs, owing to their superior specific capacity and energy density. However, these chemistries are generally more susceptible to thermal runaway and require stricter safety management. Ensuring safe operation over the entire battery life cycle therefore necessitates continuous monitoring of the battery health state [5]. In this context, accurate estimation and prediction of the state of health (SOH) [6] are essential for early fault diagnosis and second-life applications [7].
SOH prediction typically involves four stages: data acquisition, health indicator extraction, mapping of features to SOH, and generalization validation on new data [8]. Common data sources include voltage [9], current [10], temperature, and electrochemical impedance spectroscopy (EIS) [11,12]. However, voltage, current, and temperature signals are highly dynamic under real-world conditions, making it difficult to extract stable and reliable features. EIS, on the other hand, provides richer electrochemical information but is usually collected offline, which limits its real-time applicability due to time-consuming measurements.
Although voltage, current, and temperature signals [13] are highly dynamic under real-world conditions, they can still yield informative health indicators through appropriate signal processing techniques. For instance, incremental capacity analysis (ICA) has been extensively applied to voltage–capacity curves, where peak positions, amplitudes, and areas are extracted as robust indicators of degradation [14]. A random forest (RF)-based approach in [9] averaged IC and IIC peak coordinates to assess battery consistency, achieving 97.2% classification accuracy. However, such methods are often sensitive to noise, baseline drift, and data preprocessing. More recent advances have sought to enhance robustness, such as adaptive ICA schemes that adjust smoothing parameters and sampling intervals according to operating conditions, thereby enabling feature extraction from partial charging data [15]. These developments demonstrate that even under dynamic load profiles, voltage and current signals can provide reliable features if properly processed.
Other approaches employ hybrid pulse power characterization (HPPC) combined with equivalent circuit models (ECMs) [16] to extract resistance and capacitance values. For example, the elite opposition-based learning snake optimizer (EOLSO) was applied in [17] to parameterize a second-order RC model, outperforming conventional algorithms. While ECM-based methods offer physical interpretability, they may struggle to represent complex diffusion dynamics within the battery.
Features can be derived from ECM parameterization or from the distribution of relaxation times (DRTs) [18]. DRT approaches, in particular, enable the extraction of relaxation times, peak positions, and integrated areas, though they are highly sensitive to regularization parameters and demand high data integrity to avoid false peaks [19]. In parallel, data-driven methods, especially deep neural networks, have shown the ability to learn features directly from raw signals [20,21]. These approaches can achieve impressive accuracy but rely on large, high-quality datasets and often lack interpretability, which limits their deployment in safety-critical applications.
Overall, SOH modeling approaches can be categorized as physics-based or data-driven. Physics-based models are grounded in the electrochemical mechanisms and dynamics of batteries and offer good interpretability. However, they are difficult to generalize across different chemistries and operational conditions. In contrast, Data-driven methods, including machine learning models [22,23,24], offer flexibility and strong predictive capability but are susceptible to overfitting and require substantial training data.
To enable accurate SOH prediction using EIS data within a limited frequency range, this study proposes the following contributions: (1) An interpretable empirical prediction model is developed; (2) Effective feature parameters are identified from partial EIS data; (3) The influence of state of charge (SOC) is considered in optimizing feature combinations; (4) The proposed method is validated using EIS measurements under sub-zero temperature conditions, highlighting the effectiveness of inductive-phase impedance features in SOH prediction.

2. Experimental Materials and Methods

To prevent temperature fluctuations caused by environmental influences, the test chamber was precisely maintained at −5 °C throughout the experiment. The experimental subjects were three prismatic LiFePO4 (LFP) batteries (Model: LF50F) produced by EVE Power Co., Ltd. (Jingmen, China), each measuring 148.3 mm (L) × 26.7 mm (W) × 127 mm (H) with a nominal capacity of 50 Ah. In order to investigate the electrochemical impedance responses under sub-zero temperature conditions (−5 °C) at various states of charge (SOC), the following experimental procedures were designed:
To obtain impedance data at different stages of battery health, the cell underwent aging cycles under a controlled ambient temperature of 25 °C using a 1C constant current-constant voltage (CC-CV) charging and 1C constant current discharging protocol. Each aging cycle consisted of 40 full charge–discharge operations. The discharge capacity recorded at the final cycle of each aging phase was used as the reference capacity to indicate the state of health (SOH) at that stage.
To capture the impedance response at different SOC levels, the cell was conditioned in a thermal chamber at −5 °C for 2 h. Based on the findings from reference [25], which investigated the rest period effects on LiFePO4 battery impedance measurements, a 2 h stabilization time was adopted in this experiment. The SOC was adjusted using controlled charging and discharging steps. After each aging cycle, electrochemical impedance spectroscopy (EIS) measurements were conducted at SOC levels of 0%, 20%, 40%, 60%, 80%, and 100% using an electrochemical workstation. The EIS tests were performed with an AC excitation current of 3A over a frequency range from 10 kHz to 1 kHz. As illustrated in Figure 1, the voltage amplitude responses obtained from the EIS measurements under different SOC conditions remain below 10 mV across the selected frequency range (1–10 kHz). This indicates that the applied perturbation (3A excitation current) is sufficiently small to avoid nonlinear polarization effects. Therefore, the experimental data satisfy the small-signal linearity condition, ensuring that the extracted impedance parameters accurately reflect the intrinsic electrochemical characteristics of the battery system.
In this experiment, temperature sensors were deployed at both the geometric center of the battery and the positive tab to monitor the temperature of the cell body and the current connection point, respectively. Temperature data were synchronously recorded using the integrated data acquisition system of the Neware battery testing equipment, which had been properly calibrated and adjusted for measurement accuracy.

3. Methodology

3.1. Feature Extraction

Figure 2 presents the Nyquist plots of impedance spectra obtained at various states of charge (0–100%) under a low-temperature condition of −5 °C, during a specific aging stage. All measurements were performed in the ultra-high frequency range from 10 kHz to 1 kHz, and a clear SOC-dependent variation in impedance behavior can be observed. Notably, the impedance curves exhibit a distinct bending toward the inductive region at higher frequencies, where the imaginary component becomes positive.
This phenomenon is attributed to multiple low-temperature effects, including increased charge transfer resistance, limited ionic mobility, and reduced electrode reaction kinetics, which together cause a rise in the inductive part of the impedance. Although inductive responses in this frequency region are often disregarded in conventional battery EIS analysis—primarily due to their weak correlation with dominant electrochemical storage mechanisms—they may still encode meaningful information. For instance, they can reflect interfacial dynamics, current collector inductance, or transient electrode behavior, especially under extreme conditions. Therefore, this study aims to extract and analyze meaningful features from the EIS curves in the high-frequency inductive region to deepen the understanding of battery characteristics at low temperatures.
To characterize the inductive arc observed in the ultra-high-frequency region of the impedance spectra, an equivalent circuit model (ECM) was constructed, as shown in Figure 1. This model consists of a series connection of an ohmic resistor R o h m and a parallel branch composed of a resistor R L and an inductor L . Such a structure is designed to capture the dynamic high-frequency behavior that becomes prominent under low-temperature conditions due to slow interfacial kinetics and limited ion mobility.
The total impedance of this circuit, denoted Z e c m , is expressed as:
Z e c m = R o h m + 1 1 R L + j ω L = R o h m + R L 1 + j ω R L L
The real and imaginary components of this impedance are given by:
Z e c m = R o h m + R L 1 + ( ω R L L ) 2
Z e c m = ω R L 2 L 1 + ( ω R L L ) 2
By eliminating the frequency-dependent terms, the impedance locus satisfies the following circular relationship:
( Z e c m R o h m ) 2 + ( Z e c m ) 2 = R L 2 2
This represents a semicircle in the complex plane, centered at ( R o h m + R L / 2,0 ) with radius R L / 2 , corresponding to the inductive arc located in the fourth quadrant of the Nyquist plot.
To extract model parameters from experimental data, a least-squares fitting method was adopted. The impedance data points ( Z i , Z i ) , measured within the ultra-high-frequency range (10 kHz to 1 kHz), are assumed to satisfy the implicit form of the circular equation:
Z i 2 + Z i 2 + a 1 Z i + a 2 = 0
This is reformulated as a linear least-squares problem by defining:
s Z i , Z i = a 1 Z i + Z i + a 2
f Z i , Z i = Z i 2 + Z i 2
The objective is to minimize the total squared error:
E = i = 1 n   s Z i , Z i f Z i , Z i 2
This leads to a solvable linear system, from which the optimal coefficients a 1 and a 2 are obtained. These coefficients are then used to compute the center x 1 and radius r 1 of the fitted circle. Based on the identified circle parameters, the corresponding equivalent circuit parameters are derived as:
R L = 2 r 1
R o h m = x 1 r 1
The inductance L is further calculated by substituting the real part of the measured impedance at a fixed high frequency (e.g., 10 kHz) into the analytical expression:
Z ω = R o h m + R L 1 + ( ω R L L ) 2
Solving this equation for L yields the final inductive parameter. This methodology minimizes the total fitting error and effectively transforms a nonlinear geometric identification problem into a linear least-squares formulation, making it well-suited for robust parameter extraction from high-frequency impedance data under extreme low-temperature conditions.

3.2. Feature Weight Optimization

To achieve reliable and accurate estimation of battery State of Health (SOH) under varying environmental and operational conditions, such as temperature and State of Charge (SOC), we propose a hybrid machine learning framework that integrates Bayesian optimization (BO) for adaptive feature weighting with two ensemble regression algorithms: Random Forest (RF) and XGBoost.
Each input sample is represented by a three-dimensional feature vector extracted from Electrochemical Impedance Spectroscopy (EIS) measurements in the ultra-high-frequency range. Specifically, the three circuit parameters identified in the previous section are selected as physically interpretable features that reflect the battery’s internal electrochemical dynamics:
x = f 1 , f 2 , . . . , f n R n
These features are highly sensitive to changes in battery aging and low-temperature behavior and therefore serve as informative inputs for SOH modeling.
To further enhance prediction accuracy, a feature weighting strategy is employed, where each feature is assigned an optimal weight w i learned via Bayesian optimization. The weighted feature vector is given by:
x w = w 1 f 1 , w 2 f 2 , . . . , w n f n
The weighted features are then fed into both RF and XGBoost models, which are trained to regress the SOH value. This combination of physics-informed features and data-driven optimization enables the model to generalize well across different SOC levels and temperature conditions.
These features correspond to specific frequency domain characteristics of the battery, capturing impedance-related information at fixed measurement points. However, under varying operational scenarios, the relative importance of these features may change. To accommodate this, a set of trainable feature weights is introduced:
w = w 1 , w 2 , . . , w n , with   w n > 0 , i = 1 n   w i = 1
The original feature vector is then transformed via element-wise multiplication, resulting in a weighted feature input:
x weighted = x w = w 1 f 1 , w 2 f 2 , . . . , w n f n
Rather than using fixed weights, the vector w is optimized through Bayesian Optimization. BO constructs a probabilistic surrogate model over the space of weight vectors and seeks the optimal weight configuration that minimizes the model prediction error. Specifically, for a given regression model M ( ) , the following loss function is defined as the BO objective:
L ( w ) = 1 N i = 1 N     y i M x i w 2
Here, y i denotes the measured SOH label of the i -th sample, and y ˆ i = M ( x i w ) is the model prediction based on the weighted feature input. At each iteration, BO proposes a candidate weight vector w , retrains the model with weighted features, and evaluates the resulting RMSE. This process is repeated for a predefined number of steps, yielding the optimal feature weight vector w * that is subsequently used in final model training.

3.3. SOH Evaluation Model

3.3.1. Bayesian-Optimized Random Forest (BO-RF)

Random Forest is a bagging-based ensemble learning model that constructs multiple decision trees using bootstrapped data and random feature splits. In the proposed framework, the input to each tree is the optimized weighted feature vector x weighted , and the final SOH prediction is obtained by averaging the outputs of all trees:
y ˆ = 1 M m = 1 M   T m x weighted
where T m is the prediction of the m -th tree, and M is the total number of trees in the forest. The model is trained to minimize the mean squared error:
L RF = 1 N i = 1 N   ( y i y ˆ i ) 2
This structure allows RF to capture nonlinear interactions among the weighted features while maintaining resistance to overfitting through ensemble averaging.

3.3.2. Bayesian-Optimized XGBoost (BO-XGBoost)

Reference [26] has demonstrated that the XGBoost-based approach can achieve prediction accuracy of up to 90% using only a limited number of features, thereby highlighting its potential for embedded BMS applications. Building upon this, the present study integrates Bayesian weight optimization with machine learning models to enable highly accurate SOH prediction. XGBoost is a gradient boosting framework that builds decision trees in a stage-wise manner, each learning to correct the residuals of the previous model. After applying the optimized feature weights w * , the prediction is expressed as:
y ˆ = k = 1 K   f k x weighted
where f k F is the k -th regression tree and F denotes the space of all possible trees. The model is trained to minimize the following regularized loss function:
L XGB = i = 1 N   ( y i y ˆ i ) 2 + k = 1 K   Ω ( f k )
Ω f k = γ T k + 1 2 λ j = 1 T k   w j 2
Here, T k is the number of leaves in the k -th tree, w j represents the output value of leaf j , and γ , λ are regularization hyperparameters. The inclusion of Ω ( f k ) penalizes overly complex trees, improving the model’s generalization ability. The use of first- and second-order gradient information in tree construction also accelerates convergence and enhances robustness.
The complete prediction process consists of three stages: (1) feature extraction from EIS data, (2) feature reweighting via Bayesian optimization to learn w * , and (3) final SOH regression using either the BO-RF or BO-XGBoost model. This framework not only allows flexible adaptation to diverse operating conditions but also emphasizes interpretable feature importance and optimized model accuracy across battery life cycles.
To provide a clear visualization of the proposed SOH prediction framework, Figure 3 illustrates the overall architecture of both Bayesian-optimized Random Forest (BO-RF) and Bayesian-optimized GBDT models (BO-GBoost/XGBoost). The input feature vector is first reweighted using Bayesian optimization, after which the reweighted features are passed into the ensemble models for regression. For the RF-based model, predictions from each tree are averaged, whereas in the GBDT-based model, each tree sequentially fits the residuals from the previous stage to refine the final prediction.
This pipeline is repeated across different SOC levels to assess model robustness. The resulting optimal weights and RMSE values provide insight into both feature importance and model suitability.

4. Results and Discussions

4.1. High-Frequency Feature Extraction

As shown in Figure 4, R L consistently exhibits positive correlations with SOH across all SOCs, with only minor variations, indicating that its relationship with SOH is largely independent of SOC and thus highly generalizable. Similarly, R o h m maintains strong negative correlations with SOH across different SOC levels, further confirming its robustness as a health-related feature.
In contrast, L displays substantial fluctuations across both SOCs and cells, suggesting that it is influenced by factors beyond the battery itself, such as measurement equipment and testing conditions. Consequently, L is not considered as a feature for SOH prediction in this study.
These findings demonstrate that R L and R o h m are SOC-generalizable indicators suitable for constructing SOH prediction models that are robust across a wide range of SOC conditions, whereas the variability of L highlights the need for further investigation to mitigate external influences in future studies.

4.2. Model Comparison Across SOC Conditions

Figure 5 illustrates the comparison between predicted SOH values and measured SOH values for three ensemble learning models—Random Forest, GBoost, and XGBoost—across the training set, validation set, and test set. The input features encompass R L , R o h m , x 1 and r 1 . The training set incorporates feature parameters at 0%, 20%, 40%, and 60% SOC, while the validation and test sets utilize parameters at 80% and 100% SOC, respectively.
In each figure, a direct visual comparison of the predicted and measured SOH values is presented. Across all datasets, the Random Forest model exhibits a high degree of accuracy, with its predicted values closely following the measured values, as indicated by the points densely clustered around the red diagonal line. This is further supported by its RMSE values in Table 1: 0.066 for the training set, 0.011 for the validation set, and an extremely low 0.001 for the test set.
Table 1 summarizes the RMSE values for SOH predictions using three machine learning models. The Random Forest model achieves RMSEs of 0.066, 0.184, and 0.170 for the training, validation, and test sets, respectively, indicating consistently accurate predictions. Visual inspection further confirms a tight alignment of predicted values with the measured SOH.
In comparison, Gradient Boosting shows very low training RMSE (0.011) but higher validation (0.191) and test (0.210) errors, suggesting overfitting. Its predicted points are more dispersed around the diagonal, reflecting lower prediction consistency. XGBoost achieves training, validation, and test RMSEs of 0.001, 0.203, and 0.194, respectively. While slightly outperforming Gradient Boosting in some cases, its predictions still deviate more from measured values than those of Random Forest.
Overall, considering both RMSE metrics and the visual distribution of predictions, Random Forest demonstrates superior accuracy and consistency, making it the most suitable model for battery SOH estimation.

5. Conclusions

This study presents a robust and interpretable framework for estimating the state of health (SOH) of LiFePO4 batteries using high-frequency electrochemical impedance spectroscopy (EIS) data under sub-zero temperature conditions. The key findings and contributions are summarized as follows:
(1)
Feature Extraction and Optimization: High-frequency EIS measurements at −5 °C were successfully utilized to extract SOC-generalizable features, including R L and R o h m , through equivalent circuit modeling. These features exhibited strong correlations with SOH, demonstrating their reliability as health indicators across varying SOC levels.
(2)
Bayesian-Optimized Machine Learning Models: The integration of Bayesian optimization with ensemble learning models, particularly Random Forest (RF), significantly enhanced prediction accuracy. This highlights the effectiveness of combining physics-informed features with data-driven optimization for robust SOH estimation.
(3)
SOC-Agnostic Framework: By training and validating the models on features extracted from diverse SOC levels, the proposed framework demonstrated robustness against SOC-dependent variations. This approach ensures practical applicability in real-world scenarios where batteries operate under partial and fluctuating SOC conditions.
(4)
Practical Implications: The study provides a scalable and interpretable solution for battery health monitoring, even under extreme low-temperature conditions. The use of high-frequency EIS data, coupled with feature optimization, offers a balance between accuracy and computational efficiency, making it suitable for deployment in battery management systems (BMS).
In conclusion, this work advances the field of battery health diagnostics by offering a reliable, data-driven SOH estimation framework. The proposed methodology not only addresses the challenges posed by low-temperature and partial-SOC conditions but also sets a foundation for future innovations in battery management technologies.

Author Contributions

Conceptualization, Z.Y. and X.W. (Xueyuan Wang); methodology, Z.Y.; software, Z.Y.; validation, Z.Y. and X.W. (Xueyuan Wang); formal analysis, Z.Y.; investigation, Z.Y.; resources, X.W. (Xuezhe Wei), H.D. and L.L.; data curation, Z.Y.; writing—original draft preparation, Z.Y.; writing—review and editing, Z.Y.; visualization, Z.Y.; supervision, X.W. (Xuezhe Wei), H.D. and X.W. (Xueyuan Wang); project administration, L.L.; funding acquisition, X.W. (Xuezhe Wei), H.D., X.W. (Xueyuan Wang) and L.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (Grant number: 52207242).

Conflicts of Interest

Author Lifang Liu was employed by the company Shanghai Qiyuan Green Power Technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Wang, Z.; Huang, G.; Chen, Z.; An, C. Accidents involving lithium-ion batteries in non-application stages: Incident characteristics, environmental impacts, and response strategies. BMC Chem. 2025, 19, 94. [Google Scholar] [CrossRef]
  2. Shi, P.; Zhu, H.; Dong, X.; Hai, B. Research progress on thermal runaway warning methods and fire extinguishing technologies for lithium-ion batteries. World Electr. Veh. J. 2025, 16, 81. [Google Scholar] [CrossRef]
  3. Comanescu, C. Ensuring safety and reliability: An overview of lithium-ion battery service assessment. Batteries 2025, 11, 6. [Google Scholar] [CrossRef]
  4. Chen, W.; Han, X.; Pan, Y.; Yuan, Y.; Kong, X.; Liu, L.; Sun, Y.; Shen, W.; Xiong, R. Defects in lithium-ion batteries: From origins to safety risks. Green Energy Intell. Transp. 2025, 4, 100235. [Google Scholar] [CrossRef]
  5. Saxena, V. Enhancing EV battery lifecycle management: Robotic disassembly, design for disassembly, and sustainable solutions. J. Energy Storage 2025, 119, 116368. [Google Scholar] [CrossRef]
  6. Madani, S.S.; Shabeer, Y.; Allard, F.; Fowler, M.; Ziebert, C.; Wang, Z.; Panchal, S.; Chaoui, H.; Mekhilef, S.; Dou, S.X.; et al. A comprehensive review on lithium-ion battery lifetime prediction and aging mechanism analysis. Batteries 2025, 11, 127. [Google Scholar] [CrossRef]
  7. Zheng, M.; You, Y.; Lu, J. Understanding materials failure mechanisms for the optimization of lithium-ion battery recycling. Nat. Rev. Mater. 2025, 10, 355–368. [Google Scholar] [CrossRef]
  8. Li, Q.; Song, R.; Wei, Y. A review of state-of-health estimation for lithium-ion battery packs. J. Energy Storage 2025, 118, 116078. [Google Scholar] [CrossRef]
  9. Sunil, S.; Pattipati, K.R.; Balasingam, B. Piecewise linear approximation of battery open-circuit voltage characteristics using dynamic programming. IEEE Trans. Instrum. Meas. 2025, 74, 2521912. [Google Scholar] [CrossRef]
  10. Sun, J.; Liu, X.; Li, X.; Chen, S.; Xing, S.; Guo, Y. State of health estimation of lithium-ion battery based on constant current charging time feature extraction and internal resistance compensation. Energy 2025, 315, 134472. [Google Scholar] [CrossRef]
  11. M, S.; V, K.; P, H. Electrochemical impedance spectroscopy as a diagnostic and prognostic tool for EV batteries: A review. J. Electrochem. Sci. Technol. 2025, 16, 267–286. [Google Scholar] [CrossRef]
  12. Choi, W.; Shin, H.; Kim, J.M.; Choi, J.; Yoon, W. Modeling and applications of electrochemical impedance spectroscopy (EIS) for lithium-ion batteries. J. Electrochem. Sci. Technol. 2020, 11, 1–13. [Google Scholar] [CrossRef]
  13. Vasta, E.; Scimone, T.; Nobile, G.; Eberhardt, O.; Dugo, D.; De Benedetti, M.M.; Lanuzza, L.; Scarcella, G.; Patanè, L.; Arena, P.; et al. Models for battery health assessment: A comparative evaluation. Energies 2023, 16, 632. [Google Scholar] [CrossRef]
  14. Bian, X.; Liu, L.; Yan, J. A model for state-of-health estimation of lithium ion batteries based on charging profiles. Energy 2019, 177, 57–65. [Google Scholar] [CrossRef]
  15. Zhao, J.; Zhang, X.; Hu, C. Lithium-ion battery state-of-health estimation using voltage-position encoding CNN and incremental capacity analysis with a novel smoothing parameter selection strategy. J. Energy Storage 2025, 130, 117296. [Google Scholar] [CrossRef]
  16. Nejad, S.; Gladwin, D.T.; Stone, D.A. A systematic review of lumped-parameter equivalent circuit models for real-time estimation of lithium-ion battery states. J. Power Sources 2016, 316, 183–196. [Google Scholar] [CrossRef]
  17. Li, W.; Xiong, Y.; Zhang, S.; Fan, X.; Wang, R.; Wong, P. A novel method of parameter identification for lithium-ion batteries based on elite opposition-based learning snake optimization. World Electr. Veh. J. 2025, 16, 268. [Google Scholar] [CrossRef]
  18. Peng, J.; Gao, Y.; Cai, L.; Zhang, M.; Sun, C.; Liu, H. State of health estimation for lithium-ion batteries using electrochemical impedance spectroscopy and a multi-scale kernel extreme learning machine. World Electr. Veh. J. 2025, 16, 224. [Google Scholar] [CrossRef]
  19. Plank, C.; Rüther, T.; Jahn, L.; Schamel, M.; Schmidt, J.P.; Ciucci, F.; Danzer, M.A. A review on the distribution of relaxation times analysis: A powerful tool for process identification of electrochemical systems. J. Power Sources 2024, 594, 233845. [Google Scholar] [CrossRef]
  20. Qi, G.; Du, G.; Wang, K. Progress in estimating the state of health using transfer learning–based electrochemical impedance spectroscopy of lithium-ion batteries. Ionics 2025, 31, 2337–2349. [Google Scholar] [CrossRef]
  21. Meng, J.; Hu, D.; Lin, M.; Peng, J.; Wu, J.; Stroe, D.I. A domain-adversarial neural network for transferable lithium-ion battery state-of-health estimation. IEEE Trans. Transp. Electrif. 2025, 11, 7732–7742. [Google Scholar] [CrossRef]
  22. Li, Z.; Zhang, X.; Gao, W. State of health estimation of lithium-ion battery during fast charging process based on BiLSTM-transformer. Energy 2024, 311, 133418. [Google Scholar] [CrossRef]
  23. Qi, N.; Yan, K.; Yu, Y.; Li, R.; Huang, R.; Chen, L.; Su, Y. Machine learning and neural network supported state of health simulation and forecasting model for lithium-ion battery. Front Energy 2024, 18, 223–240. [Google Scholar] [CrossRef]
  24. Qu, J.; Liu, F.; Ma, Y.; Fan, J. A neural-network-based method for RUL prediction and SOH monitoring of lithium-ion battery. IEEE Access 2019, 7, 87178–87191. [Google Scholar] [CrossRef]
  25. Hosen, M.S.; Gopalakrishnan, R.; Kalogiannis, T.; Jaguemont, J.; Van Mierlo, J.; Berecibar, M. Impact of relaxation time on electrochemical impedance spectroscopy characterization of the most common lithium battery technologies—Experimental study and chemistry-neutral modeling. World Electr. Veh. J. 2021, 12, 77. [Google Scholar] [CrossRef]
  26. Tonima, M.A.; DeHart, A.; Tabakci, D.; Tisapramotkul, P.; Munro-West, A.; Mehra, A.; Shoa, T. Electrochemical impedance spectroscopy (EIS) and machine learning based battery state of health (SoH) estimation. In Proceedings of the 2023 IEEE International Conference on Prognostics and Health Management (ICPHM), Montreal, QC, Canada, 5–7 June 2023; pp. 212–223. [Google Scholar] [CrossRef]
Figure 1. Voltage amplitude responses of the battery under different SOC levels (0–100%) at −5 °C within one aging stage.
Figure 1. Voltage amplitude responses of the battery under different SOC levels (0–100%) at −5 °C within one aging stage.
Batteries 11 00321 g001
Figure 2. Nyquist plots of the battery impedance at different SOC levels (0–100%) under −5 °C and the corresponding ECM.
Figure 2. Nyquist plots of the battery impedance at different SOC levels (0–100%) under −5 °C and the corresponding ECM.
Batteries 11 00321 g002
Figure 3. Schematic diagram of the proposed SOH prediction models with Bayesian-optimized feature weights. (a) Random Forest architecture. (b) GBDT framework (GBoost/XGBoost).
Figure 3. Schematic diagram of the proposed SOH prediction models with Bayesian-optimized feature weights. (a) Random Forest architecture. (b) GBDT framework (GBoost/XGBoost).
Batteries 11 00321 g003
Figure 4. Pearson correlation coefficients between the extracted equivalent circuit parameters ( R L , R o h m , and L ) and SOH under different SOC conditions (0%, 20%, 40%, 60%, 80%, and 100%) for three battery cells. (a) Pearson correlation coefficients between R L , R o h m , and L and SOH for Cell B1. (b) Pearson correlation coefficients between R L , R o h m , and L and SOH for Cell B2. (c) Pearson correlation coefficients between R L , R o h m , and L and SOH for Cell B3.
Figure 4. Pearson correlation coefficients between the extracted equivalent circuit parameters ( R L , R o h m , and L ) and SOH under different SOC conditions (0%, 20%, 40%, 60%, 80%, and 100%) for three battery cells. (a) Pearson correlation coefficients between R L , R o h m , and L and SOH for Cell B1. (b) Pearson correlation coefficients between R L , R o h m , and L and SOH for Cell B2. (c) Pearson correlation coefficients between R L , R o h m , and L and SOH for Cell B3.
Batteries 11 00321 g004
Figure 5. Predicted versus measured SOH values for the machine learning models. (a) Training set results. (b) Validation set results. (c) Test set results.
Figure 5. Predicted versus measured SOH values for the machine learning models. (a) Training set results. (b) Validation set results. (c) Test set results.
Batteries 11 00321 g005aBatteries 11 00321 g005b
Table 1. RMSE values of SOH predictions using different models.
Table 1. RMSE values of SOH predictions using different models.
ModelRMSE of Training SetRMSE of Validation SetRMSE of Test Set
Random Forest0.0660.1840.170
Gradient Boosting0.0110.1910.210
XGBoost0.0010.2030.194
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yan, Z.; Wang, X.; Wei, X.; Dai, H.; Liu, L. State-of-Health Estimation of LiFePO4 Batteries via High-Frequency EIS and Feature-Optimized Random Forests. Batteries 2025, 11, 321. https://doi.org/10.3390/batteries11090321

AMA Style

Yan Z, Wang X, Wei X, Dai H, Liu L. State-of-Health Estimation of LiFePO4 Batteries via High-Frequency EIS and Feature-Optimized Random Forests. Batteries. 2025; 11(9):321. https://doi.org/10.3390/batteries11090321

Chicago/Turabian Style

Yan, Zhihan, Xueyuan Wang, Xuezhe Wei, Haifeng Dai, and Lifang Liu. 2025. "State-of-Health Estimation of LiFePO4 Batteries via High-Frequency EIS and Feature-Optimized Random Forests" Batteries 11, no. 9: 321. https://doi.org/10.3390/batteries11090321

APA Style

Yan, Z., Wang, X., Wei, X., Dai, H., & Liu, L. (2025). State-of-Health Estimation of LiFePO4 Batteries via High-Frequency EIS and Feature-Optimized Random Forests. Batteries, 11(9), 321. https://doi.org/10.3390/batteries11090321

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop