A Novel Fusion Method for State-of-Charge Estimation of Lithium-Ion Batteries Based on Improved Genetic Algorithm BP and Adaptive Extended Kalman Filter

The lithium-ion battery is the power source of an electric vehicle, so it is of great significance to estimate the state of charge (SOC) of lithium-ion batteries accurately to ensure vehicle safety. To improve the accuracy of the parameters of the equivalent circuit model for batteries, a second-order RC model for ternary Li-ion batteries is established, and the model parameters are identified online based on the forgetting factor recursive least squares (FFRLS) estimator. To improve the accuracy of SOC estimation, a novel fusion method, IGA-BP-AEKF, is proposed. Firstly, an adaptive extended Kalman filter (AEKF) is used to predict the SOC. Then, an optimization method for BP neural networks (BPNNs) based on an improved genetic algorithm (IGA) is proposed, in which pertinent parameters affecting AEKF estimation are utilized for BPNN training. Furthermore, a method with evaluation error compensation for AEKF based on such a trained BPNN is proposed to enhance SOC evaluation precision. The excellent accuracy and stability of the suggested method are confirmed by the experimental data under FUDS working conditions, which indicates that the proposed IGA-BP-EKF algorithm is superior, with the highest error of 0.0119, MAE of 0.0083, and RMSE of 0.0088.


Introduction
Due to increased environmental requirements and the expansion of the lithium-ion battery industry across the world, there is greater demand on the market for completely electric vehicles [1]. Lithium-ion batteries in the automotive industry are quite common due to their high degree of safety, high power density, and dependable charging and discharging [2]. The most important type of data to monitor in moving vehicles with new energy is the state of charge (SOC) of their batteries. In the battery management system, with accurate SOC estimation, the issue of the overcharging and over-discharging of batteries could be avoided and the life of batteries could be extended through battery equalization, which could ensure the normal and safe functioning of new energy vehicles under demanding operating circumstances [3]. The SOC, however, cannot currently be acquired directly due to the complexity of the internal chemical processes in lithium-ion batteries and the limits of the available technology, and can only be inferred from the performance of the battery's exterior characteristics [4,5].
2. In view of the fact that the EKF algorithm cannot cope with real-time changes in noise during SOC prediction, an adaptive open-window estimation approach is introduced to further reduce the noise interference. 3. A genetic algorithm is optimized by introducing a cosine function to build adaptive crossover probability and variation probability, and the annealing technique to enhance the ability to jump out of local extremes. The improved GA solves the problem of easily maturing in the early stages and stagnating in the later stages, and thus avoids falling into local optimal solutions. 4. The improved GA is used to optimize the initial parameters of the BP neural network (BPNN) that is used to compensate for the estimation error of AEKF. The proposed estimation strategy is applied to numerical simulation experiments, and BP-AEKF, GA-BP-AEKF, and IGA-BP-AEKF are compared. The latter has better performance in accuracy and robustness.

Battery Modeling
The precision of the parameter identification of the lithium-ion battery model has a significant bearing on the accuracy of SOC calculation via the equivalent circuit approach [36]. The Thevenin model [37], second-order RC model [38], and Rint model [39] are now the most popular models. The accuracy of the model is enhanced with increasing RC order, but its computational complexity also rises. A second-order RC model is developed in this paper with a detailed analysis, as seen in Figure 1. This model performs better dynamically than the first-order model because it splits the cell polarization into electrochemical polarization and concentration polarization. As shown in the figure, U OC is the open circuit voltage; U L is the terminal voltage; I represents the battery current, taking the discharge direction as the positive direction; R 0 is the battery's internal resistance in ohms; R 1 and R 2 represent electrochemical and concentration polarization resistance, respectively; C 1 and C 2 , respectively, represent electrochemical polarization capacitance and concentration polarization capacitance; and U 1 and U 2 are the voltages at the ends of R 1 and R 2 , respectively.
The function relation of the equivalent circuit model is shown as follows: According to [16], the SOC of a lithium battery can be expressed as: The state space in Expression (3) is obtained after the discretization of Equations (1) and (2).
where SOC 0 is the initial value of SOC; τ 1 and τ 2 are time constants; ∆t is the sampling period; Q N is the rated capacity of the cell; η is the coulombic efficiency; and ∂U OC ∂SOC was obtained using Equation (4) of the OCV-SOC fitting relationship.

OCV-SOC Acquisition
It was demonstrated that the open circuit voltage and SOC have the same trend and show specific correspondence with the SOC values [21]. After the OCV-SOC fitting relationship is established, U OC can be obtained from the SOC predicted by the AEKF algorithm, and then, used for model parameter identification. In this paper, experiments were conducted with LiMn 2 O 4 lithium batteries with a rated capacity of 35 Ah, a charging cutoff voltage of 4.2 V, a discharging cutoff voltage of 2.6 V, and a rated voltage of 3.7 V. The OCV-SOC curves were obtained as shown in Figure 2. The authors of [39] investigated the effects of temperature, data points, and aging degree when obtaining the OCV-SOC relationship for Li-ion batteries, and when 11 data points were selected to model the OCV-SOC relationship in the OCV test data, the highest power of the polynomial fit function should not exceed an order of 10. When the highest power of the polynomial is larger, an overfitting phenomenon will occur; when the highest power of the polynomial is smaller, an underfitting phenomenon will occur, so the polynomial with the highest power of seven is selected as the fitting function under comprehensive consideration. The fitting results are shown in Figure 3.      The fitted polynomial equation of the seventh order is:

FFRLS-Based Parameter Identification
In the field of system identification, recursive least squares (RLS) has been widely used because of its advantages such as ease of understanding, good convergence effect, and fast speed [13]. However, RLS has the problems of data saturation and data accumulation, which affect the accuracy of current parameter identification; the introduction of forgetting factors can effectively prevent data saturation and enhance the accuracy of identification with new data [40]. The specific formula of the forgetting factor recursive least squares method is derived from [40], and the detailed flow of the algorithm is shown in Figure 4. The fitted polynomial equation of the seventh order is:

FFRLS-Based Parameter Identification
In the field of system identification, recursive least squares (RLS) has been widely used because of its advantages such as ease of understanding, good convergence effect, and fast speed [13]. However, RLS has the problems of data saturation and data accumulation, which affect the accuracy of current parameter identification; the introduction of forgetting factors can effectively prevent data saturation and enhance the accuracy of identification with new data [40]. The specific formula of the forgetting factor recursive least squares method is derived from [40], and the detailed flow of the algorithm is shown in Figure 4.   In the figure, k θ is the estimated vector of the discriminated parameters at the previous moment; K is the gain matrix; P is the error covariance matrix; k ϕ is the observed data matrix; k y is the estimated output; E is the unit matrix; and λ is the forgetting factor with a value of 0.96.

AEKF Estimation of SOC
KF is widely used in linear control systems. The core theory aims to predict the current value based on the estimated value at the previous moment, update the gain and error matrix using the errors of observable variables, and finally, correct the current estimated value to make it close to the true value. The extended Kalman filter, as a derivative algorithm of the Kalman filter, inherits the core of the algorithm of the Kalman filter and can use Taylor expansion to ignore the higher-order terms above the first order and linearize the nonlinear system with the first-order terms, which can be used for the charge In the figure, θ k is the estimated vector of the discriminated parameters at the previous moment; K is the gain matrix; P is the error covariance matrix; ϕ k is the observed data matrix; y k is the estimated output; E is the unit matrix; and λ is the forgetting factor with a value of 0.96.

AEKF Estimation of SOC
KF is widely used in linear control systems. The core theory aims to predict the current value based on the estimated value at the previous moment, update the gain and error matrix using the errors of observable variables, and finally, correct the current estimated value to make it close to the true value. The extended Kalman filter, as a derivative algorithm of the Kalman filter, inherits the core of the algorithm of the Kalman filter and can use Taylor expansion to ignore the higher-order terms above the first order and linearize the nonlinear system with the first-order terms, which can be used for the charge state estimation of lithium batteries [41].
The state equation and the observation equation are: T is the state variable; y k = U L (k) is the observed variable; u k is the input vector; and ω k and v k are zero-mean Gaussian white noise. At each moment point, upon linearizing f (x k , u k ) and g(x k , u k ), we have: Affected by temperature, battery aging, and other factors, there will be real-time changes in interference noise affecting the SOC estimation of a Li-ion battery, but when estimating SOC based on EKF, the process noise covariance matrix Q and observation noise covariance matrix R are often set as constants substituted into the recursive process, which will lead to a decrease in SOC estimation accuracy. To solve the error problem caused by the above reasons, the Sage-Husa adaptive filtering algorithm is introduced to construct the adaptive extended Kalman filtering algorithm (AEKF) so that the noise covariance can be updated adaptively in the recursive process of the algorithm. The adaptive noise covariance equation is: where M is the dimension of the observation, which is the window size, and is a weighted average of the previous M new interest variance; e k is the new interest matrix, which is the difference between the estimated voltage and the measured voltage; H k is the new interest covariance matrix; P is the error covariance matrix; and K is the Kalman gain. A flow chart of the AEKF-based algorithm is shown in Figure 5.
where M is the dimension of the observation, which is the window size, and is a weighted average of the previous M new interest variance; e k is the new interest matrix, which is the difference between the estimated voltage and the measured voltage; H k d K is the Kal-man gain. A flow chart of the AEKF-based algorithm is shown in Figure 5.

BP Neural Network
The BP neural network has a great learning ability because its prediction approach involves adjusting the weights of each layer in turn per the error of the predicted value by continuously correcting the weights between the layers of the model. The input, hidden, and output layers of a BP neural network comprised them [20]. Typically, the input layer is made up of variables that have a strong linear correlation with the output but a weak linear correlation with each other. The intermediate layer's primary job is to handle the logical connection between the input and output, and the output layer generates the predicted values we are aiming for. The topology based on BPNN is shown in Figure 6.

BP Neural Network
The BP neural network has a great learning ability because its prediction approach involves adjusting the weights of each layer in turn per the error of the predicted value by continuously correcting the weights between the layers of the model. The input, hidden, and output layers of a BP neural network comprised them [20]. Typically, the input layer is made up of variables that have a strong linear correlation with the output but a weak linear correlation with each other. The intermediate layer's primary job is to handle the logical connection between the input and output, and the output layer generates the predicted values we are aiming for. The topology based on BPNN is shown in Figure 6.  In Figure 6, nm w and mk w are the connection weights between the different layers of the network. In [20,23], when using BPNN to compensate for EKF estimation errors, the input variables to the BPNN were [ ] , and the output was . When using the BPNN to compensate for the predictive error of the AEKF, the AEKF-filtered  In Figure 6, w nm and w mk are the connection weights between the different layers of the network. In [20,23], when using BPNN to compensate for EKF estimation errors, the input variables to the BPNN were SOC(k) U 1 (k) U 2 (k) K 1 K 2 K 3 , and the output was Err = SOC ture − SOC EKF . When using the BPNN to compensate for the predictive error of the AEKF, the AEKF-filtered SOC(k), U 1 (k), U 2 (k), AEKF gain K, and the error value e of the observed variable are used as input variables, so the input variables are SOC(k) U 1 (k) U 2 (k) K 1 K 2 K 3 e , and the output layer is the estimation error of AEKF, Err = SOC ture − SOC AEKF . It is demonstrated that a three-layer BPNN can fulfill the prediction requirements of nonlinear systems when the number of neurons in the hidden layer is properly determined [27]. The number of neurons in the hidden layer is determined using the empirical formula y = √ n + l + a [21], where n and l are the number of neurons in the input and output layers, respectively, and a is a random number between 0 and 10, which is most suitable when y = 15 according to several experiments. Therefore, a three-layer BPNN with seven inputs and one output is constructed.

IGA-BP
Due to the unpredictability of the BPNN's initial parameter selection, local optima are a common problem when employing BPNN for AEKF estimate error compensation. Because of this, the initial weights of the BPNN are optimized in this paper using the stochastic search capability of the genetic algorithm (GA), a stochastic search algorithm invented by John H's team [42], and simulating the biological genetic and evolutionary elimination mechanism to choose the genetically best individuals via genetic and mutational operations on the offspring. The GA-BP algorithm process is as follows.
Population initialization: Firstly, individuals are coded and arranged in a decimal way, which is advantageous in that the coding and decoding steps can be omitted, making the algorithm process simpler. The gene of each individual is composed of w nm and w mk .
Determination of the fitness function: The fitness function is the criterion for the GA model to screen the offspring, which can effectively reflect the gap between individuals in the population. In this paper, we construct the fitness function based on the estimated error sum of squares of AEKF.
N is the number of training samples. Selection operation: The roulette wheel method is usually chosen to determine the next generation, and N individuals in the population require N repetitions. If the fitness value of individual i is F i , the probability that individual i is left behind at each selection is: where k is the coefficient.
Crossover operation: Since individuals are coded with real numbers, the real number crossover method is chosen, and chromosomes l, a l , and chromosomes k, a k , are crossed at position j as follows: where b is a random number, b ∈ 0, 1 .
Mutation operation: the j gene of individual i is selected for mutation, and the mutation mode is shown in Formula (10). where a max and a min are the upper and lower thresholds of gene a ij , f (g) = r(1 − g /G max ), r ∈ 0, 1 . g is the number of current iterations, and is G max the maximum number of iterations.
After several iterations of the genetic algorithm, the number of good individuals will grow exponentially. Since the population size is limited, the dramatic increase in good individuals will lead to the rapid disappearance of other individuals. This is similar to the law of "survival of the fittest" in nature, but a population with mostly genetically similar good individuals will lead to increased inbreeding, and the algorithm will easily fall into the local extreme value trap. If the crossover probability is too small, the algorithm will be slowed down, and if it is too large, highly adapted individuals will be destroyed quickly. If the mutation probability is too small, it will be difficult to generate new individuals, and if it is too large, the algorithm will be meaningless. In addition, this paper establishes a fitness function based on the prediction value error, so the lower the value of fitness, the better the individual is. Therefore, the cosine function is introduced to design the adaptive crossover probability and variation probability, and thus, improve population diversity. (12) where p c is the crossover probability, p c1 and p c2 are the maximum crossover probabilities of high-quality and low-quality individuals, respectively, F max , F min , and F avg are the maximum, minimum, and average fitness values in the population, respectively, and F is the fitness value of the individuals to be crossed. (13) where p m is the probability of variation, p m1 and p m2 are the minimum variation rates of excellent and inferior individuals, respectively, and k 1 and k 2 are coefficients. At the same time, the simulated annealing algorithm (SA) was introduced in the GA [43]. SA is based on the fact that the free motion entropy of the molecules of a substance increases at high temperatures, leading to high activity, causing them to rearrange randomly and slowly stabilizing the molecules during the cooling process. The SA algorithm accepts non-optimal solutions with a certain probability according to the Metropolis criterion, which, to a certain extent, compensates for the easy maturation of GA in the early stage and stagnation in the later stage.
According to the Metropolis criterion, the optimal individuals in the old and new populations are determined. The probability of accepting a new individual is: If the fitness value of the optimal individual in the new population is lower than that of the old individual, the individual is accepted as the new optimal value; otherwise, the acceptance probability of the new individual is calculated according to the formula, while a random number between 0 and 1 is generated, and if the acceptance probability of the new individual is greater than the random number, the new individual is accepted; otherwise, it is not accepted. At the end of each genetic operation, annealing is required. T = T 0 * k, T 0 is the initial temperature, k is the annealing coefficient, and k ∈ [0.9, 1]. When the algorithm starts, T is larger and the GA is more likely to accept the inferior solution in the new population. The value gradually decreases with the increase in population generations, and the probability of accepting the inferior new solution becomes lower and lower, and when the value of T tends to 0, it basically no longer accepts any deteriorating new solutions, and can only accept a new solution with a state lower than the current solution; thus, this is the key to preventing the GA from prematurely maturing and jumping out of the local extremes. A flow chart of the improved genetic algorithm for optimizing the BPNN is shown in Figure 7.

SOC Estimation Based on the Joint IGA-BP-AEKF Algorithm
The equivalent circuit method is computationally simple and can reflect the operating characteristics of the battery based on the external characteristics of the battery, and the data-driven method can be trained using a large amount of data so as to find the mapping relationship between the input and output. The fusion algorithm combines the advantages of both, and the optimized BPNN is trained offline. The trained BPNN compensates for the SOC estimation error of the AEKF so that the SOC estimation result is closer to the real value. The specific flow of the fusion algorithm is shown in Figure 8. From the figure, we can see that the whole prediction model is divided into three parts: parameter identification, AEKF prediction of SOC, and IGA-BP compensation.
Step 1: BPNN training. A 3-layer BPNN with seven inputs and one output is established, and the implied layer neuron number is 15. The voltage and current data under

SOC Estimation Based on the Joint IGA-BP-AEKF Algorithm
The equivalent circuit method is computationally simple and can reflect the operating characteristics of the battery based on the external characteristics of the battery, and the data-driven method can be trained using a large amount of data so as to find the mapping relationship between the input and output. The fusion algorithm combines the advantages of both, and the optimized BPNN is trained offline. The trained BPNN compensates for the SOC estimation error of the AEKF so that the SOC estimation result is closer to the real value. The specific flow of the fusion algorithm is shown in Figure 8.

SOC Estimation Based on the Joint IGA-BP-AEKF Algorithm
The equivalent circuit method is computationally simple and can reflect the operating characteristics of the battery based on the external characteristics of the battery, and the data-driven method can be trained using a large amount of data so as to find the mapping relationship between the input and output. The fusion algorithm combines the advantages of both, and the optimized BPNN is trained offline. The trained BPNN compensates for the SOC estimation error of the AEKF so that the SOC estimation result is closer to the real value. The specific flow of the fusion algorithm is shown in Figure 8. From the figure, we can see that the whole prediction model is divided into three parts: parameter identification, AEKF prediction of SOC, and IGA-BP compensation.
Step 1: BPNN training. A 3-layer BPNN with seven inputs and one output is established, and the implied layer neuron number is 15. The voltage and current data under From the figure, we can see that the whole prediction model is divided into three parts: parameter identification, AEKF prediction of SOC, and IGA-BP compensation.
Step 1: BPNN training. A 3-layer BPNN with seven inputs and one output is established, and the implied layer neuron number is 15. The voltage and current data under UDDS conditions are collected, the AEKF is used to estimate the SOC under UDDS conditions, and the K k , e k , and x + k generated about the AEKF estimation process are saved. Figure 9 shows the SOC estimation under UDDS conditions using the AEKF. conditions, and the k K , k e , and k x + generated about the AEKF estimation process are saved. Figure 9 shows the SOC estimation under UDDS conditions using the AEKF. The Kalman filter gain k K , the end voltage observation error k e , and the state prediction value k x + are used as the input of the IGA-optimized BPNN, and Err is used as the output. The learning rate is set to 0.001, and the expectation value is 1 × 10 −5 . However, the data need to be normalized according to Equation (15) before input to avoid the different ranges of the input variables that lead to network training Failure.
where: max x and min x are the maximum and minimum values of the original data. In total, 80% of the data are chosen as the training sample and 20% as the test sample. When the predicted desirable value is obtained, the intended BPNN training is proven to be accomplished [21].
Step 2: Joint estimation of parameter identification and AEKF algorithm. As can be seen from the figure, based on the real-time battery measurement data, FFRLS is used to realize the online identification of the battery model parameters, and then, the AEKF is used to complete the estimation of SOC and the update of k K , k e , and k x + based on the obtained model parameters, and at the same time, the updated open-circuit voltage value is obtained from the OCV-SOC curve based on the SOC value at this time. This is substituted into the FFRLS algorithm to complete the two-step identification of the model parameters, and therefore, to achieve the joint estimation of model parameters and the AEKF algorithm.
Step 3: Joint estimation of BPNN and AEKF. k K , k e , k x + , and Err are input into the successfully trained BPNN. Finally, the sum of the optimal estimate after AEKF filtering and the output of the BPNN are obtained as the optimal estimate of the fusion algorithm.

Experimental Verification Analysis
In actual operation, the battery cannot be guaranteed to work at a constant current all the time, and the current output of the battery is complex and variable. This study was conducted in a MATLAB working environment to simulate the battery operation mode for the experiments, assuming that the battery was fully charged at the initial moment and setting the initial value of SOC to 1. The validity of the proposed estimation method is demonstrated by the complex operating condition FUDS. The current and voltage data of FUDS operating conditions are shown in Figure 10. The Kalman filter gain K k , the end voltage observation error e k , and the state prediction value x + k are used as the input of the IGA-optimized BPNN, and Err is used as the output. The learning rate is set to 0.001, and the expectation value is 1 × 10 −5 . However, the data need to be normalized according to Equation (15) before input to avoid the different ranges of the input variables that lead to network training Failure. (15) where: x max and x min are the maximum and minimum values of the original data. In total, 80% of the data are chosen as the training sample and 20% as the test sample. When the predicted desirable value is obtained, the intended BPNN training is proven to be accomplished [21].
Step 2: Joint estimation of parameter identification and AEKF algorithm. As can be seen from the figure, based on the real-time battery measurement data, FFRLS is used to realize the online identification of the battery model parameters, and then, the AEKF is used to complete the estimation of SOC and the update of K k , e k , and x + k based on the obtained model parameters, and at the same time, the updated open-circuit voltage value is obtained from the OCV-SOC curve based on the SOC value at this time. This is substituted into the FFRLS algorithm to complete the two-step identification of the model parameters, and therefore, to achieve the joint estimation of model parameters and the AEKF algorithm.
Step 3: Joint estimation of BPNN and AEKF. K k , e k , x + k , and Err are input into the successfully trained BPNN. Finally, the sum of the optimal estimate after AEKF filtering and the output of the BPNN are obtained as the optimal estimate of the fusion algorithm.

Experimental Verification Analysis
In actual operation, the battery cannot be guaranteed to work at a constant current all the time, and the current output of the battery is complex and variable. This study was conducted in a MATLAB working environment to simulate the battery operation mode for the experiments, assuming that the battery was fully charged at the initial moment and setting the initial value of SOC to 1. The validity of the proposed estimation method is demonstrated by the complex operating condition FUDS. The current and voltage data of FUDS operating conditions are shown in Figure 10.

Validation of the Parameter Identification Effect
In the model identification effect validation experiments, the FFRLS algorithm is used to estimate the end voltage values using the data collected in the laboratory. Figure 11 (left) shows the comparison between the estimated end voltage based on FFRLS and the measured voltage under the second-order RC equivalent circuit model. The estimated voltage basically matches with the measured voltage, which verifies that the second-order equivalent model has high stability and that the FFELS identification effect is good. Figure  11 (right) shows that the voltage estimation error tends to increase gradually as the charging and discharging proceeds, which is caused by the more intense internal chemical reaction that occurs when the lithium battery is discharged to the cutoff voltage state, and the maximum absolute value of the end voltage error based on the FFRLS algorithm is calculated to be 0.035. The results show that the FFRLS can identify the model parameters well under the second-order RC model for subsequent SOC estimation use.

AEKF Algorithm Validation
In order to verify that the SOC estimation accuracy of the EKF algorithm is effectively improved after the introduction of the adaptive algorithm, the EKF and AEKF algorithms are simulated and validated using FUDS operating data as a comparison object. Except for the introduction of the adaptive algorithm, the initial parameters of the two algorithms are the same. The simulation results are shown in Figure 12(left). The SOC estimates under both algorithms follow the true values well, but the estimates of the AEKF algorithm are closer to the reference values. The error results are shown in Figure 12(right); the SOC error estimated by the AEKF is smaller than that of the EKF, but the SOC estimation error curves of both the EKF and AEKF models show large fluctuations when the current changes abruptly, which is caused by the large errors in the model parameters identified by FFRLS when the current changes abruptly, which coincides with the identification voltage error curve in Figure 11(right). To further examine the model prediction performance,

Validation of the Parameter Identification Effect
In the model identification effect validation experiments, the FFRLS algorithm is used to estimate the end voltage values using the data collected in the laboratory. Figure 11(left) shows the comparison between the estimated end voltage based on FFRLS and the measured voltage under the second-order RC equivalent circuit model. The estimated voltage basically matches with the measured voltage, which verifies that the second-order equivalent model has high stability and that the FFELS identification effect is good. Figure 11(right) shows that the voltage estimation error tends to increase gradually as the charging and discharging proceeds, which is caused by the more intense internal chemical reaction that occurs when the lithium battery is discharged to the cutoff voltage state, and the maximum absolute value of the end voltage error based on the FFRLS algorithm is calculated to be 0.035. The results show that the FFRLS can identify the model parameters well under the second-order RC model for subsequent SOC estimation use.

Validation of the Parameter Identification Effect
In the model identification effect validation experiments, the FFRLS algorithm is used to estimate the end voltage values using the data collected in the laboratory. Figure 11 (left) shows the comparison between the estimated end voltage based on FFRLS and the measured voltage under the second-order RC equivalent circuit model. The estimated voltage basically matches with the measured voltage, which verifies that the second-order equivalent model has high stability and that the FFELS identification effect is good. Figure  11 (right) shows that the voltage estimation error tends to increase gradually as the charging and discharging proceeds, which is caused by the more intense internal chemical reaction that occurs when the lithium battery is discharged to the cutoff voltage state, and the maximum absolute value of the end voltage error based on the FFRLS algorithm is calculated to be 0.035. The results show that the FFRLS can identify the model parameters well under the second-order RC model for subsequent SOC estimation use.

AEKF Algorithm Validation
In order to verify that the SOC estimation accuracy of the EKF algorithm is effectively improved after the introduction of the adaptive algorithm, the EKF and AEKF algorithms are simulated and validated using FUDS operating data as a comparison object. Except for the introduction of the adaptive algorithm, the initial parameters of the two algorithms are the same. The simulation results are shown in Figure 12(left). The SOC estimates under both algorithms follow the true values well, but the estimates of the AEKF algorithm are closer to the reference values. The error results are shown in Figure 12(right); the SOC error estimated by the AEKF is smaller than that of the EKF, but the SOC estimation error curves of both the EKF and AEKF models show large fluctuations when the current changes abruptly, which is caused by the large errors in the model parameters identified by FFRLS when the current changes abruptly, which coincides with the identification voltage error curve in Figure 11(right). To further examine the model prediction performance,

AEKF Algorithm Validation
In order to verify that the SOC estimation accuracy of the EKF algorithm is effectively improved after the introduction of the adaptive algorithm, the EKF and AEKF algorithms are simulated and validated using FUDS operating data as a comparison object. Except for the introduction of the adaptive algorithm, the initial parameters of the two algorithms are the same. The simulation results are shown in Figure 12(left). The SOC estimates under both algorithms follow the true values well, but the estimates of the AEKF algorithm are closer to the reference values. The error results are shown in Figure 12(right); the SOC error estimated by the AEKF is smaller than that of the EKF, but the SOC estimation error curves of both the EKF and AEKF models show large fluctuations when the current changes abruptly, which is caused by the large errors in the model parameters identified by FFRLS when the current changes abruptly, which coincides with the identification voltage error curve in Figure 11(right). To further examine the model prediction performance, the root mean square error (RMSE) and mean absolute error (MAE) are used to evaluate the two models. Table 1 shows that the maximum error of EKF is 0.0576 and that of AEKF is 0.0432. The RMSE and MAE values of AEKF are lower than those of EKF, so the noise reduction capability of the adaptive algorithm is better and the SOC estimation accuracy is significantly improved.
where N is the total amount of data, Y i the true value, and f i is the predicted value. the root mean square error (RMSE) and mean absolute error (MAE) are used to evaluate the two models. Table 1 shows that the maximum error of EKF is 0.0576 and that of AEKF is 0.0432. The RMSE and MAE values of AEKF are lower than those of EKF, so the noise reduction capability of the adaptive algorithm is better and the SOC estimation accuracy is significantly improved.
( ) where N is the total amount of data, i Y the true value, and i f is the predicted value.

IGA-BP-AEKF Algorithm Validation
The SOC estimation results of the proposed joint algorithm and the unimproved algorithm were compared with all initial values being equal, and the experimental results are shown in Figure. From Figure 13(left), it can be seen that the prediction trends of the three algorithms are basically consistent with the true values, but the SOC curve fit of the IGA-BP-AEKF algorithm is closer to the reference value, reflecting its strong tracking ability. From Figure 13(right), we can see that the maximum error of AEKF is significantly reduced after the BPNN compensation, and the BPNN compensation of AEKF after IGA optimization has the best effect. In the SOC < 0.5 stage, unlike the other two algorithms, the IGA-BP-AEKF model does not have a large abrupt change at a certain moment, indicating that IGA enhances the global optimization-seeking ability in the genetic process, has more advantages in searching the optimal initial parameters of BPNN, and has strong anti-interference ability, which is consistent with the performance of the error curve. From Table 2, we know that IGA-BP-AEKF has the smallest data values under all three evaluation criteria, which indicates that the improved joint algorithm has better stability in estimating SOC.

IGA-BP-AEKF Algorithm Validation
The SOC estimation results of the proposed joint algorithm and the unimproved algorithm were compared with all initial values being equal, and the experimental results are shown in figure. From Figure 13(left), it can be seen that the prediction trends of the three algorithms are basically consistent with the true values, but the SOC curve fit of the IGA-BP-AEKF algorithm is closer to the reference value, reflecting its strong tracking ability. From Figure 13(right), we can see that the maximum error of AEKF is significantly reduced after the BPNN compensation, and the BPNN compensation of AEKF after IGA optimization has the best effect. In the SOC < 0.5 stage, unlike the other two algorithms, the IGA-BP-AEKF model does not have a large abrupt change at a certain moment, indicating that IGA enhances the global optimization-seeking ability in the genetic process, has more advantages in searching the optimal initial parameters of BPNN, and has strong anti-interference ability, which is consistent with the performance of the error curve. From Table 2, we know that IGA-BP-AEKF has the smallest data values under all three evaluation criteria, which indicates that the improved joint algorithm has better stability in estimating SOC.

Conclusions
In view of the shortage of related research on the SOC estimation of lithium-ion batteries, a novel fusion method for SOC estimation based on improved Genetic Algorithm BP and an adaptive extended Kalman filter is proposed in this paper. The experimental results indicate that the proposed IGA-BP-EKF algorithm is superior to some other related ones. The maximum absolute value of the terminal voltage error based on the FFRLS algorithm for online parameter identification in the second-order RC model is calculated to be 0.035, which is good enough for subsequent accurate SOC estimation. The experimental results under FUDS working conditions show that the maximum error of the IGA-BP-EKF algorithm is 0.0119, MAE is 0.0083, and RMSE is 0.0088, which greatly improves the accuracy of the SOC estimation of lithium-ion batteries compared with BP-AEKF and GA-BP-AEKF. In future work, the effects of temperature and other factors should be considered on the SOC estimation of Li-ion batteries, and experimental operating condition data should be enriched and improved.
Author Contributions: Conceptualization, L.C. and C.S.; methodology and writing-original draft preparation, L.C. and C.S.; software, C.S.; validation, L.C., Z.Z., and C.S.; project administration, S.C.; funding acquisition, S.C. All authors have read and agreed to the published version of the manuscript.

Data Availability Statement:
The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest:
The authors declare no conflict of interest.

Conclusions
In view of the shortage of related research on the SOC estimation of lithium-ion batteries, a novel fusion method for SOC estimation based on improved Genetic Algorithm BP and an adaptive extended Kalman filter is proposed in this paper. The experimental results indicate that the proposed IGA-BP-EKF algorithm is superior to some other related ones. The maximum absolute value of the terminal voltage error based on the FFRLS algorithm for online parameter identification in the second-order RC model is calculated to be 0.035, which is good enough for subsequent accurate SOC estimation. The experimental results under FUDS working conditions show that the maximum error of the IGA-BP-EKF algorithm is 0.0119, MAE is 0.0083, and RMSE is 0.0088, which greatly improves the accuracy of the SOC estimation of lithium-ion batteries compared with BP-AEKF and GA-BP-AEKF. In future work, the effects of temperature and other factors should be considered on the SOC estimation of Li-ion batteries, and experimental operating condition data should be enriched and improved.
Author Contributions: Conceptualization, L.C. and C.S.; methodology and writing-original draft preparation, L.C. and C.S.; software, C.S.; validation, L.C., Z.Z. and C.S.; project administration, S.C.; funding acquisition, S.C. All authors have read and agreed to the published version of the manuscript.

Data Availability Statement:
The data presented in this study are available upon request from the corresponding author.