1. Introduction
With the global transformation of energy structures and the innovation of energy technologies, lithium-ion batteries (LIBs) have emerged as a highly efficient and clean energy storage solution, finding extensive applications in electric vehicles (EVs), renewable energy storage systems, etc. [
1,
2]. LIBs are widely regarded as the cornerstone of modern energy storage technologies due to their high energy density, long cycle life, and low self-discharge rate [
3,
4]. However, the performance and safety of LIBs heavily depend on the accurate monitoring and management of their internal states, particularly under extreme conditions such as high or low temperatures and overloading, which can easily induce overcharging or over-discharging, leading to thermal runaway [
5]. There are severe risks due to the thermal runaway, such as overheating, fire, and even explosion [
6,
7]. Therefore, accurate SOC estimation is a critical measure to prevent such catastrophic events. As one of the core technologies for efficient battery management, precise SOC monitoring enables timely detection of overcharging, over-discharging, or abnormal temperatures, thereby mitigating safety risks, ensuring overall system safety, and extending battery lifespan [
8].
Current state-of-charge (SOC) estimation methods primarily include the ampere-hour integration technique [
9], open-circuit voltage (OCV) method [
10], and data-driven approaches [
11,
12,
13]. Nevertheless, these exhibit severe limitations in practical applications: The accuracy of ampere-hour integration critically depends on initial SOC precision and current-sampling frequency, where accumulated integration errors under significant battery nonlinearity induce substantial deviations between estimated and true SOC values [
14]. The OCV method requires offline measurements, inherently compromising real-time performance; moreover, its OCV-SOC relationship deteriorates with battery aging and temperature variations, further reducing estimation accuracy [
15]. While data-driven methods achieve high precision and efficiency, they necessitate extensive high-quality datasets, demonstrate limited generalizability across operating conditions, and impose high computational demands, collectively hindering widespread implementation [
16,
17]. Consequently, equivalent circuit model (ECM)-based methods integrated with Kalman filtering have gained significant traction [
18,
19].
Recent studies have proposed advanced algorithms to address these challenges. For example, research [
20] introduced an improved variable forgetting factor recursive least squares (IVFFRLS) method combined with AEKF for LIB SOC estimation. By eliminating rounding errors in traditional recursive least squares and optimizing parameter boundaries using genetic algorithms, the IVFFRLS method significantly enhanced parameter identification accuracy. Additionally, the incorporation of a covariance matching method in the AEKF enabled adaptive noise parameter updates, improving SOC estimation accuracy and robustness. Research [
21] proposed an intelligent adaptive extended Kalman filtering (IAEKF) method, which dynamically updated the covariance matrix by detecting voltage error sequence distribution changes in real time. This approach significantly improved SOC estimation accuracy while maintaining computational efficiency and demonstrated robust performance against model parameter variations, offering a reliable solution for complex operating conditions. Research [
22] developed a joint SOC estimation algorithm based on an improved variable forgetting factor recursive least squares method and adaptive square-root unscented Kalman filtering (ASRUKF). By correcting the forgetting factor using multi-step voltage residuals, the algorithm enhanced parameter identification accuracy. The introduction of square-root decomposition in the UKF framework prevented covariance matrix negative definiteness, while the adaptive noise covariance adjustment mechanism dynamically optimized the filtering process. This method exhibited excellent estimation accuracy and robustness across various operating conditions, maintaining reliable SOC tracking even with voltage and current measurement deviations. Research [
23] proposed an augmented adaptive extended Kalman filtering (AAEKF) algorithm, which integrated Kalman filtering with adaptive robust control theory to enhance SOC estimation accuracy for EV LIBs. This method innovatively addressed the sensitivity of traditional AEKF to battery model accuracy, demonstrating superior adaptability to system nonlinearity, parameter fluctuations, and environmental temperature variations.
Nevertheless, current methodologies exhibit three interlocked limitations under extreme temperature variations and unknown initial SOC conditions: (1) Pronounced temperature fluctuations induce significant activation energy shifts in lithium-ion diffusion kinetics, while electrolyte conductivity plummets to 15% of its room-temperature value at −20 °C, causing severe model-parameter mismatch and ohmic polarization distortion [
24]. (2) The prevalent engineering challenge of unknown initial SOC leads to cumulative error amplification through coulomb counting integration. (3) Critically, most existing models rely on manual parameter calibration. Static parameterization inherently limits adaptive capability by failing to dynamically optimize temperature sensitivity coefficients—as seen in the EKF. Although adaptive variants, including the AEKF, IAEKF, and AAEKF, incorporate dynamic parameters, their adjustment factors still require manual tuning and thus fail to achieve optimal performance. These critical bottlenecks underscore the urgent demand for autonomous parameter identification frameworks, motivating the core contributions of this work:
- (1)
A Bayesian optimization-based adaptive extended Kalman filtering (BO-AEKF) algorithm is proposed, which significantly enhances SOC estimation accuracy by incorporating a Bayesian optimization mechanism, enabling adaptability to various operating conditions and environmental disturbances.
- (2)
A dynamic parameter adaptive updating strategy is introduced to address model errors caused by temperature variations, effectively improving model stability and estimation accuracy.
- (3)
The validation results of the BO-AEKF algorithm under various operating conditions demonstrate that its accuracy and stability outperform traditional methods. With the initial value tracking mechanism, the algorithm achieves rapid convergence under uncertain initial states, further enhancing its robustness.
The remainder of this paper is organized as follows:
Section 2 establishes a dual-polarization (DP) model and dynamically updates model parameters through a temperature-adaptive mechanism, completing parameter identification.
Section 3 details the construction of the BO-AEKF algorithm and validates its high-precision estimation capabilities and rapid convergence characteristics under varying initial conditions.
Section 4 summarizes the findings and proposes potential directions for future research.
2. Materials and Methods
This study constructs a lithium battery state estimation framework with temperature adaptability and enhanced robustness. The implementation procedure is as follows: first, adopt the dual-polarization equivalent circuit model to characterize battery dynamics; then, achieve cross-temperature parameter identification via adaptive RLS incorporating dynamic forgetting factors using public battery datasets, ensuring real-time compensation of temperature-dependent model parameters; finally, the proposed BO-AEKF eliminates manual tuning requirements through intelligent optimization, exhibits rapid convergence characteristics under severe initial state uncertainty, significantly enhances adaptability to initial errors and operational transients, ultimately providing high-accuracy closed-loop solutions for complex scenarios.
2.1. Establishment of the Equivalent Circuit Model
In the modeling of LIBs, the selection of a model necessitates a balance between accuracy and computational efficiency. The dual-polarization model has been chosen as our primary focus due to its ability to precisely describe the current distribution and voltage response of LIBs, coupled with its concise state-space equation structure [
25]. This model achieves high accuracy in characterizing voltage dynamics and facilitates rapid analysis and numerical simulations, providing a reliable theoretical foundation for optimizing LIB performance. Leveraging these advantages, we ultimately adopted the DP model as the subject of our study. The equivalent circuit of the DP model is illustrated in
Figure 1. By incorporating two RC (resistor–capacitor) networks, this model accurately captures the dynamic characteristics of LIBs, including ohmic polarization, electrochemical polarization, and concentration polarization phenomena.
Based on the equivalent circuit, the following equations can be established:
among them,
represents the open-circuit voltage,
denotes the ohmic internal resistance,
and
correspond to the electrochemical polarization resistance and capacitance, respectively,
and
represent the concentration polarization resistance and capacitance, respectively,
and
signify the electrochemical polarization voltage and concentration polarization voltage, respectively, and
stands for the terminal voltage.
2.2. Parameter Identification
This study utilizes the publicly available University of Maryland Battery Dataset (UMD Battery Dataset) from the University of Maryland Energy Research Center (UMERC). This high-quality dataset, which focuses on A123 lithium iron phosphate batteries, has been widely employed in research on lithium battery modeling, state estimation, and lifespan prediction [
26]. The dataset encompasses comprehensive performance data under various experimental conditions, including dynamic stress tests (DSTs), Federal Urban Driving Schedule (FUDS), Supplemental Federal Test Procedure (US06), and constant current charge–discharge tests, ensuring its representativeness and comprehensiveness. The basic information of the battery is shown in
Table 1. The experimental data were collected over a temperature range of −10 °C to 50 °C using multiple cells of the same LiFePO
4 battery model (exact cell quantity unspecified in raw data). The OCV-SOC relationship and capacity characteristics were derived from low-current constant-current charge/discharge tests (0.05 C rate) across all temperature points.
As shown in
Figure 2, the DST, FUDS, and US06 driving cycles are displayed, offering a comprehensive visualization of their operational dynamics.
The DST driving cycle, characterized by frequent low-amplitude current fluctuations typical of urban driving, is employed in this study for model parameter calibration due to its suitability for parameter identification and dynamic performance testing. Meanwhile, the FUDS driving cycle, which simulates prolonged urban driving with smoother current variations, is utilized to evaluate system energy efficiency and SOC estimation accuracy. In contrast, the US06 driving cycle, featuring rapid current fluctuations representative of high-speed driving scenarios, is adopted to validate high-power output performance and thermal stability. To comprehensively assess battery performance, this study leverages the DST cycle for parameter identification, while the FUDS and US06 cycles are chosen for SOC estimation validation, as they better reflect real-world operational complexity.
To begin, a low-current constant-current charge–discharge experimental method [
27] was employed to obtain the correspondence between OCV and SOC at various temperatures, as well as the corresponding discharge capacity (
) at different temperatures (
T). Subsequently, a 12th-order polynomial fitting method was applied to fit the OCV-SOC curves [
28], while a 3rd-order polynomial fitting method was also used for comparison. The corresponding equations are as follows:
where
and
are coefficients to be determined. The resulting curves are shown in
Figure 3. As illustrated in
Figure 3a, under low-temperature conditions (
T < 0 °C), the OCV exhibits a significant downward trend as the temperature decreases. Notably, the most pronounced voltage drop occurs within the SOC range of 0 to 0.2, indicating that low temperatures have a substantial impact on the electrochemical performance of the battery, particularly in the low-SOC region. Meanwhile,
Figure 3b demonstrates that within a broad temperature range (−10 °C to 50 °C), the battery capacity shows a positive correlation with temperature, with the available capacity gradually increasing as the ambient temperature rises. These findings underscore the importance of incorporating temperature adaptability corrections into the model.
Finally, parameter identification was conducted using data from the DST driving cycle. An adaptive dynamic genetic factor recursive least squares (RLS) method [
29,
30] was employed for online system parameter identification. The key steps include initialization, predicted output calculation, parameter updating, dynamic genetic factor updating, and parameter transformation, as illustrated in the flowchart in
Figure 4.
Set the initial parameter estimates and the initial covariance matrix . The initial parameter estimates are determined based on the preliminary values of , , , , and . The initial covariance matrix is typically chosen as a large positive definite matrix.
The initial parameter estimates are given by
The input vector is defined as
The parameter estimation vector is
Compute the gain matrix
, update the parameter estimates
, and update the covariance matrix
:
where
is a tunable parameter less than and close to 1, and
is a positive adaptation coefficient.
Convert
to
,
,
,
, and
:
where
, and
the time step. Subsequently, the values can be further transformed according to Equation (9).
According to the steps outlined above, iteratively executing the procedure allows for the estimation of the values of , , , , and at all time instants.
2.3. SOC Estimation
To improve the accuracy and robustness of SOC estimation for lithium-ion batteries, this study proposes a novel BO-AEKF algorithm, which integrates Bayesian optimization with the AEKF [
31]. Furthermore, to address model errors induced by environmental temperature variations, dynamic updates are applied to key parameters such as the open-circuit voltage
and discharge capacity
, thereby enhancing the model’s precision and adaptability. The overall estimation framework, incorporating constant current charge–discharge experiments and parameter identification, is illustrated in
Figure 5.
EKF estimates the battery SOC by performing local linearization of the nonlinear system dynamics using first-order Taylor expansion, iteratively updating the SOC estimate via state and observation equations; however, its core defect lies in the reliance on preset, fixed process noise covariance matrix and measurement noise covariance matrix, which leads to cumulative estimation bias under dynamic operating conditions due to model mismatch. As an improvement based on the EKF algorithm, the AEKF introduces a sliding-window residual statistics mechanism, enabling self-adjustment of the process noise covariance matrix and measurement noise covariance matrix according to system dynamics; nevertheless, the adjustment factors for scaling these covariance matrices require manual tuning, inherently preventing the identification of an optimal parameter combination. To overcome this limitation, the BO-AEKF framework employs a Bayesian optimization approach, constructing a Gaussian process surrogate model to approximate the black-box relationship between the adjustment factors and SOC estimation accuracy and subsequently performing automatic hyperparameter optimization via acquisition function maximization to enhance SOC estimation precision, with this method proving particularly suitable for practical application scenarios characterized by complex operating conditions and significant parameter variations. The detailed implementation steps are depicted in
Figure 6 and described as follows:
where
η represents the charge/discharge efficiency.
The state vector is initialized as , and the covariance matrix is initialized. The process noise covariance matrix and measurement noise covariance matrix are set, while the Jacobian matrix is initialized. These matrices can be preliminarily estimated based on the system’s noise characteristics.
The prior state
for the next time step is predicted using the state equation. The open-circuit voltage
is calculated using the prior estimate and the OCV-SOC curve at the corresponding temperature. The observation matrix is updated to
, and the terminal voltage
is estimated using the observation equation. The prior state covariance is predicted as
The Kalman gain is computed as
The posterior state is updated using
where
is the measured terminal voltage. The error covariance matrix is updated as
where
is the identity matrix. The measurement noise covariance matrix
and process noise covariance matrix
are adaptively adjusted using
where
is the error between the measured and estimated terminal voltages, and
are adjustment factors.
Bayesian optimization is employed to optimize the adjustment factors
and
and output the SOC estimation result. The objective function is defined as the RMSE of the SOC estimation error:
where
is the true SOC value,
is the EKF-estimated SOC value, and
N is the number of data points. The optimization variables
and
are defined within specific ranges, which can be manually narrowed based on experience to reduce iteration counts.
Initialization involves randomly selecting several initial points, computing the objective function values, and constructing an initial surrogate model. A Gaussian process (GP) surrogate model is employed, which assumes the objective function follows a multivariate Gaussian distribution:
where
is the mean function, and
is the kernel function. The squared exponential kernel is used:
where
is the signal variance, and
and
are the length scales in the
and
dimensions, respectively.
The next evaluation point is selected using the Expected Improvement (EI) acquisition function:
where
is the current best objective function value. Iterative optimization proceeds as follows: (1) select the next evaluation point
using the acquisition function; (2) compute the objective function value
; (3) add the new point
to the observation dataset; (4) update the GP surrogate model based on the new observations; and (5) terminate if the maximum iteration count or convergence criteria are met; otherwise, continue. Finally, the optimal adjustment factors
are output and incorporated into the AEFK to obtain the final results.
3. Results
To validate the effectiveness of the proposed BO-AEKF algorithm, parameter identification was first performed using the AFFRLS algorithm on DST condition data at ambient temperatures of −10 °C, 0 °C, 25 °C, and 50 °C. The results are presented in
Table 2, with the parameter identification results at 25 °C illustrated in
Figure 7.
The parameter identification results in
Figure 7 indicate that all parameters exhibit significant fluctuations during the initial phase (approximately 0–2000 s), primarily stemming from algorithmic artifacts—where initial guess deviations cause overcompensation—and model limitations in capturing rapid polarization. Consequently, the observed significant fluctuations and polarity sign jumps in
/
/
during this period constitute expected phenomena before gradually converging. Specifically,
and
stabilize after 3000 s, whereas
reaches a steady state after 4000 s. Although
and
exhibit minor fluctuations up to 6000 s, their overall trends maintain stability. Ultimately, all parameters converge to reasonable ranges without divergence, and the terminal voltage estimation error—despite larger initial deviations during early identification—rapidly converges after 20 s and remains maintained below 0.005 V. This confirms that the identification algorithm effectively suppresses noise interference, yielding reliable and robust parameter identification results that establish an accurate basis for lithium battery modeling.
The parameter identification results in
Table 2 indicate that both the ohmic resistance
and the polarization resistances
and
exhibit a decreasing trend with increasing temperature. Specifically,
decreases from 0.29 Ω at −10 °C to 0.189 Ω at 50 °C, while
and
decrease from 0.02148 Ω and 0.1928 Ω to 0.001646 Ω and 0.02983 Ω, respectively. This reduction in resistance is attributed to enhanced ionic conductivity and improved reaction kinetics at elevated temperatures. Elevated temperatures significantly reduce both ohmic and polarization resistances, thereby enhancing battery performance and efficiency.
Based on the identified parameters, tests were conducted under FUDS and US06 conditions at ambient temperatures of −10 °C, 0 °C, 25 °C, and 50 °C. To comprehensively evaluate the performance of the BO-AEKF algorithm, it was compared with the traditional EKF and AEKF algorithms, using the SOC calculated by the ampere-hour integration method as the reference [
32]. Additionally, the effects of temperature on the open-circuit voltage and battery discharge capacity were fully considered during the testing process. The experimental results are shown in
Figure 8 and
Figure 9 and
Table 3.
From
Figure 8 and
Figure 9, it is evident that the BO-AEKF algorithm demonstrates superior estimation accuracy and robustness across a wide temperature range under both FUDS and US06 operating conditions, with particularly notable improvements under the more dynamic US06 condition. By integrating Bayesian optimization with adaptive strategies, the BO-AEKF algorithm achieves enhanced SOC estimation accuracy across varying temperatures. As indicated in
Table 3, the RMSE and MAX data comparisons reveal that BO-AEKF outperforms both EKF and AEKF in terms of precision and robustness under different temperatures and dynamic conditions. An exception occurs in the FUDS condition at 50 °C, where the inherent thermal drift of the model was not fully mitigated, leading to increased overall error. However, the MAX point shifted toward the direction of reduced error, resulting in a smaller MAX value, while the RMSE remained significantly lower. Thus, from an overall accuracy perspective, BO-AEKF consistently exhibits lower errors across various temperatures. For instance, under the FUDS condition at −10 °C, the RMSE was optimized by 18.5% compared to EKF, while at 25 °C, a 14% improvement in accuracy was maintained. The advantages of BO-AEKF are even more pronounced under the highly dynamic US06 condition, with RMSE reductions of 51.9% at 0 °C and 40.6% at 50 °C, demonstrating its ability to deliver precise state estimations even under conditions of high dynamic uncertainty.
Moreover, BO-AEKF exhibits exceptional stability in extreme temperature environments. For example, under the US06 condition at −10 °C, its MAX value was only 0.0035, representing a 74.5% reduction compared to EKF and a 64.3% reduction compared to AEKF. This robustness stems from its Bayesian optimization layer, which dynamically adjusts the system noise covariance and observation noise covariance to adapt to model uncertainties induced by temperature variations. Although the MAX error in the FUDS condition at 50 °C was slightly higher than that of AEKF, the RMSE remained superior, indicating that BO-AEKF prioritizes maintaining overall accuracy over merely suppressing peak errors in extreme temperatures.
Compared to other algorithms, the core advantage of BO-AEKF lies in its dynamic parameter adaptation. By leveraging Bayesian optimization to adjust noise parameters in real time, it maintains stability across wide temperature ranges and highly dynamic conditions. In conclusion, the BO-AEKF algorithm offers significant advantages for state estimation in photovoltaic and electric vehicle energy storage systems, particularly in complex scenarios involving wide temperature ranges and high dynamics. Overall, BO-AEKF demonstrates higher accuracy and greater adaptability under extreme temperatures and complex operating conditions, providing reliable technical support for the development of lithium battery management systems.
To address the practical engineering challenge of unknown initial SOC in real-world lithium battery operations, this study conducted a series of comparative experiments to systematically evaluate the initial value adaptability and convergence characteristics of the BO-AEKF algorithm. To ensure the algorithm’s robustness and stability across varying initial conditions, the objective function of the Bayesian optimization layer was explicitly defined:
The BO-AEKF algorithm was tested under FUDS and US06 driving cycles at 25 °C, with different initial SOC values as inputs. The results, as depicted in
Figure 10 and summarized in
Table 4, demonstrate that the algorithm consistently converges to the true SOC value within a short timeframe, regardless of the initial state. Specifically, the RMSE remained below 3% for the FUDS cycle and within 3.5% for the more dynamic US06 cycle, confirming its high estimation accuracy.
In the initial phase, estimation results exhibited fluctuations due to the uncertainty of the initial SOC value. Notably, the amplitude of these fluctuations increased proportionally with the deviation between the initial state and the true value. However, as the algorithm iterated, these fluctuations gradually diminished and stabilized, highlighting the robust nature of the BO-AEKF algorithm in mitigating the impact of initial errors.
This enhanced robustness and adaptability stem from the algorithm’s adaptive mechanisms and robustness-enhancing design, which effectively address the issue of initial state uncertainty. These features provide reliable SOC estimation support for BMS, offering significant implications for optimizing charging/discharging strategies and extending battery lifespan. Overall, the BO-AEKF algorithm demonstrates superior performance in handling initial state uncertainties, making it a valuable tool for advanced BMS applications.
4. Discussion
Comparative analysis reveals that the BO-AEKF algorithm introduces Bayesian optimization to adjust the parameters of the AEKF, achieving remarkable improvements of up to 74.5% in MAX error and 51.9% in RMSE compared to the traditional EKF. Notably, the algorithm demonstrates the rare and valuable capability to converge within 100 s under large initial SOC error conditions. While the computational complexity of BO-AEKF is higher than that of both EKF and AEKF, it is important to note that this increase is not exponential or multiplicative. The EKF relies on linearizing state and measurement functions, entailing minimal computational overhead. The AEKF adapts process and measurement noise covariances to enhance accuracy, with slightly higher complexity than EKF but still lower than BO-AEKF. BO-AEKF performs iterative calculations via Bayesian optimization to find optimal parameters, but the computational growth is controlled through strategies like adaptive iteration termination and feature dimension reduction—thus avoiding a multiplicative complexity surge. Despite the higher complexity, BO-AEKF achieves superior SOC estimation accuracy while maintaining a feasible computational burden, demonstrating a more balanced trade-off between precision and efficiency in practical applications. Therefore, the BO-AEKF algorithm is highly suitable for computation-constrained environments such as real-time control in electric vehicle BMS and grid energy storage systems.
Although significant progress has been made in the accuracy and robustness of SOC estimation, several research directions warrant further exploration.
Firstly, regarding model adaptability, current studies primarily focus on the impact of temperature as a single factor on SOC estimation models. However, in real-world applications like electric vehicle charging/discharging and grid energy storage peak regulation, battery performance is often influenced by the synergistic effects of multiple factors, such as aging, charge/discharge rates, and battery types. Future research could establish more comprehensive model parameter mapping relationships based on multi-factor coupling experiments, exploring the combined effects of battery degradation, current magnitude, and different lithium battery materials on SOC estimation. This would contribute to the development of a more universal battery state estimation framework.
Secondly, in terms of algorithm optimization, although the BO-AEKF algorithm demonstrates excellent accuracy, its high computational complexity poses a significant burden on onboard computing units. Given the stringent real-time requirements of practical BMS systems, future optimizations could focus on two directions: (1) leveraging the advantages of deep learning algorithms to establish lightweight network structures that replace certain computational modules and (2) investigating simplified model methods based on covariance matrix feature extraction, reducing computational complexity through mathematical techniques such as feature space dimensionality reduction. These approaches aim to achieve an optimal balance between accuracy and efficiency. (3) Based on the Bayesian optimization algorithm, develop a cross-paradigm evaluation framework to achieve joint operation with UKF or PF, enhancing SOC estimation accuracy.
Lastly, in the context of engineering application validation, all current experiments are based on standard test data under laboratory conditions. However, real-world operating conditions impose higher demands on parameter adjustments. Subsequent research could integrate GAN networks for generating and classifying actual operating condition data and constructing adaptive parameter adjustment mechanisms based on condition recognition. Additionally, large-scale empirical studies should be conducted in real-world electric vehicle battery systems and energy storage stations. Long-term operational testing in practical engineering environments would not only further validate the algorithm’s applicability but also provide more reliable practical insights for parameter optimization.
The in-depth exploration of these research directions will facilitate the transition of SOC estimation technology from experimental research to engineering applications, offering technical support for the development of more reliable and efficient battery management systems.