This study aims to systematically evaluate the optimization performance of the proposed improved AO-AVOA dual-strategy intelligent optimization algorithm and its effectiveness in predicting the state of health (SOH) of lithium-ion batteries. First, the convergence behavior and global search capability of the improved AO-AVOA are validated using standard unimodal and multimodal benchmark functions. Subsequently, the algorithm is integrated with a backpropagation (BP) neural network to construct the AO-AVOA-BP prediction model. This model is then applied to SOH estimation tasks based on the NASA and CALCE battery datasets, and experimental results are analyzed accordingly. Finally, by comparing the prediction error metrics under different optimization models, training set proportions, and combinations of health indicators, the model’s accuracy, stability, and generalization ability are comprehensively assessed. These evaluations confirm the effectiveness and practical value of the proposed method for modeling battery health status in real-world scenarios.
5.1. Performance Evaluation of the Improved AO-AVOA Dual-Strategy Intelligent Optimization Algorithm
To verify the performance and feasibility of the proposed AO-AVOA dual-strategy intelligent optimization algorithm, numerical simulation experiments are conducted using a set of standard benchmark functions. Unimodal functions, characterized by a single global optimum, are commonly employed to assess the convergence speed and precision of optimization algorithms. These functions effectively reflect the algorithm’s ability to locate optimal solutions efficiently within smooth search spaces. In contrast, multimodal functions feature more complex search space landscapes with multiple local optima, simulating the non-convexity and multi-solution nature of real-world optimization problems. They are essential for testing an algorithm’s capacity to escape local optima and accurately identify global solutions, serving as key indicators of global search ability and robustness [
32]. Accordingly, six representative standard test functions (see
Table 2) are selected in this study. These functions span various levels of complexity and search space characteristics, enabling a comprehensive evaluation of the proposed algorithm’s performance across different optimization scenarios.
To validate the performance advantages of the proposed improved AO-AVOA, the original AO algorithm and the AVOA are selected as baseline comparators. In the experimental setup, all algorithms are configured with a maximum of 500 iterations and a population size of 30. Each algorithm is independently executed 30 times on each benchmark function to mitigate the influence of stochastic factors. The performance evaluation primarily relies on two indicators: the mean and standard deviation of fitness values obtained during the optimization process. The mean fitness value reflects the algorithm’s search accuracy, while the standard deviation measures its stability. The mean fitness, calculated as shown in Equation (26), provides a quantitative assessment of the algorithm’s global optimization capability within the solution space.
times represents the number of times the algorithm runs, representing the fitness value of the ith solution.
When the mean fitness value approaches the theoretical optimum, it indicates that the algorithm possesses high precision and superior convergence performance in global search. To further quantify the distribution characteristics of the solutions, the standard deviation is adopted as a metric for quantitative characterization, as defined in Equation (27).
Table 3 presents the optimization results of the improved AO-AVOA compared with other algorithms across various benchmark functions. In the unimodal function tests, the improved AO-AVOA demonstrates outstanding performance in function evaluations, effectively converging to the theoretical optimum. Its convergence speed and accuracy show significant improvements compared with the AO and AVOAs. Although it does not fully reach the global optimum, its solution accuracy is noticeably superior to that of the compared algorithms, confirming its innovative advantages during the local search phase. In the multimodal function tests, all three algorithms are capable of finding the theoretical optimum; however, in function evaluations, the improved AO-AVOA, while not fully converging to the global optimum, still maintains performance superior to the other algorithms. These results indicate that the improved AO-AVOA not only possesses strong global exploration capabilities but is also effective in addressing complex global optimization problems.
Figure 9 illustrates the convergence curves of the improved AO-AVOA compared with the original AO and AVOAs across several standard benchmark functions. As observed in
Figure 8, the improved AO-AVOA consistently demonstrates faster convergence rates and higher convergence accuracy in both unimodal and multimodal optimization tasks. Compared with the original AO and AVOAs, it exhibits stronger global search capabilities during the early stages of optimization and more stable local convergence performance in the later stages. These results further validate the effectiveness of the proposed improvements in enhancing the algorithm’s overall optimization performance. The findings confirm that the improved AO-AVOA not only possesses robust global exploration ability but is also well-suited for solving complex global optimization problems.
In the unimodal function tests, the improved AO-AVOA demonstrated exceptionally high convergence efficiency on functions and , approaching the global optimum almost immediately during the initial phase. This indicates strong search stability and precision. On function , although the early-stage convergence trajectory resembled that of the AO algorithm, the overall convergence speed was faster. The optimization path gradually aligned with that of AVOA in the later stages, ultimately achieving the highest solution accuracy. This highlights the algorithm’s adaptive capability across different phases of the search process.
In the multimodal function tests, although the improved AO-AVOA did not exactly reach the theoretical optimum on function , it exhibited a significantly faster convergence rate compared to the other algorithms, indicating effective exploration performance. On function , the algorithm successfully reached the global optimum within approximately 10 iterations, substantially outperforming the compared methods. On function , despite a slower start and slight lag behind the AO algorithm, the improved algorithm accelerated in the later stages by effectively escaping local optima, eventually yielding a superior solution. These results validate the robustness and global search potential of the proposed approach in complex multimodal environments.
5.2. Analysis of Model Prediction Results
To verify the SOH prediction capability of the proposed AO-AVOA-BP model, a comparative analysis was conducted with the BP neural network, PSO-BP neural network, AO-BP neural network, and AVOA-BP neural network models. The SOH prediction results and corresponding errors for the NASA and CALCE datasets under different models are shown in
Figure 10 and
Figure 11, respectively.
As illustrated in
Figure 10 and
Figure 11, the proposed AO-AVOA-BP model achieved the highest prediction accuracy across all four battery SOH prediction tasks on both the NASA and CALCE datasets, demonstrating a significant advantage in performance. Specifically, the model exhibited minimal error fluctuations, low peak-to-valley differences, and a notably low frequency of outliers. These characteristics indicate strong stability, generalization capability, and robustness across multiple sets of battery degradation data. This superior performance can be primarily attributed to the optimized design of the algorithmic framework. The AO algorithm provides effective global search capabilities, reducing the likelihood of the parameter training process becoming trapped in local optima. The AVOA component enhances dynamic adaptability to non-stationary degradation patterns through its adaptive perturbation mechanism. Meanwhile, the BP neural network offers powerful nonlinear mapping capabilities, enabling it to accurately capture the degradation trend of SOH. The synergy among these three components significantly enhances the model’s capacity for modeling complex electrochemical degradation behavior and ensures high prediction stability.
It is noteworthy that during the 91st cycle test of the CS2-36 battery, the AO-AVOA-BP model exhibited a relatively large local prediction error. However, the model was able to respond promptly and autonomously correct the deviation, allowing the prediction curve to quickly return to a reasonable trajectory and maintain overall accuracy in the subsequent cycles. This indicates that the model not only possesses strong initial prediction capability but also demonstrates robust dynamic recovery and error-suppression mechanisms. It can effectively adjust its parameters in response to data disturbances or abrupt degradation behavior, thereby maintaining the continuity and reliability of the prediction results.
In contrast, the AVOA-BP model exhibited larger and more frequent fluctuations in prediction error during SOH estimation. This instability primarily stems from its lack of global optimization capability, making it prone to local optima and leading to suboptimal parameter tuning. Moreover, its limited ability to model nonlinear degradation features results in inadequate generalization performance. Additionally, when confronted with sudden degradation events or anomalous data, the model shows weak adaptive correction capacity, making it difficult to rectify prediction deviations in a timely manner, thereby exacerbating error instability.
To compare the differences in training efficiency and computational cost among various models, this study conducts a comparative analysis of the convergence speed and computational complexity of BP, PSO-BP, AO-BP, AVOA-BP, and the proposed AO-AVOA-BP model. This analysis not only reveals efficiency differences beyond error performance but also provides a reference for evaluating the feasibility and adaptability of the models in practical engineering applications.
Table 4 presents a systematic comparison of the convergence speed, computational complexity, and respective advantages and disadvantages of the different methods, where n denotes the population size, d represents the problem dimension, and T refers to the number of iterations.
As shown in
Table 4, different methods exhibit notable differences in convergence efficiency and computational complexity. The traditional BP model has relatively low computational cost but suffers from slow convergence and a tendency to fall into local optima. Single optimization-based approaches (e.g., PSO-BP, AO-BP, and AVOA-BP) achieve improved convergence speed; however, their stability and adaptability remain limited. In contrast, the AO-AVOA-BP model integrates both global search capability and adaptive mechanisms, achieving the fastest overall convergence and greater robustness. Nevertheless, its computational cost is relatively higher, suggesting that a trade-off between accuracy and efficiency should be considered in practical applications.
To comprehensively evaluate the performance of each prediction model in the battery SOH estimation task, this study incorporates multiple evaluation metrics to comparatively analyze their accuracy and stability. By quantifying the prediction errors, the influence of different algorithmic structures on model performance can be further elucidated. Specifically, the SOH prediction results are systematically assessed using Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE). The NASA battery performance metrics are summarized in
Table 5, and the statistical results of the t-test and analysis of variance (ANOVA) are reported in
Table 6. The CALCE battery performance metrics are presented in
Table 7, and the t-test and ANOVA results are given in
Table 8.
Using the B0005 and CS2-35 battery datasets as representative cases, a systematic comparative analysis was conducted to evaluate the performance of the proposed AO-AVOA-BP neural network model in the lithium-ion battery SOH prediction task. Experimental results demonstrate that the proposed model achieved significant advantages across multiple evaluation metrics, confirming its enhanced prediction accuracy and generalization capability. Specifically, on the B0005 battery dataset, the proposed model attained MAE, RMSE, and MAPE values of 0.0059, 0.0065, and 0.8320%, respectively. Compared with the BP model, these values were reduced by 80.78%, 82.04%, and 80.65%, respectively; compared with the PSO-BP model, they were reduced by 63.58%, 63.28%, and 63.20%, respectively; compared with the AO-BP model, they were reduced by 45.37%, 45.38%, and 44.86%, respectively; and compared with the AVOA-BP model, they were reduced by 67.58%, 69.63%, and 67.38%, respectively. These results indicate that the proposed model has a clear advantage in reducing prediction bias and suppressing error fluctuations.
Further analysis based on the statistical test results in
Table 6 indicates that AO-AVOA-BP and the conventional BP model exhibit highly significant differences (
p < 0.001) across all battery datasets, demonstrating that the proposed model provides a reliable advantage in prediction accuracy over the baseline. Moreover, the comparisons between AO-AVOA-BP and AO-BP as well as AVOA-BP reveal statistically significant differences in most cases (e.g., datasets B0006, B0007, and B0018,
p < 0.05), suggesting that the dual-mode improvement mechanism (the coupling of AO and AVOA) further enhances model performance compared with a single optimization approach. In contrast, for certain datasets (e.g., the comparison with AO-BP on B0005,
p > 0.05), the differences did not reach statistical significance, implying that the AO optimization strategy alone can achieve a comparable level of performance improvement in specific scenarios, though it still falls short of the robustness of the proposed model overall.
In addition, the ANOVA results further confirmed the significance of performance differences among models. For datasets B0005, B0006, and B0007, the ANOVA tests yielded extremely high F-values and very low p-values (p < 0.001), which not only highlight the highly significant differences in prediction accuracy across models but also statistically substantiate the comprehensive advantage of the AO-AVOA-BP model in the overall prediction tasks.
On the CS2-35 battery dataset, the proposed model also demonstrates consistently high accuracy, with MAE, RMSE, and MAPE values of 0.0087, 0.0115, and 1.095%, respectively. Compared with the BP model, these values decreased by 75.00%, 70.66%, and 75.28%, respectively; compared with the PSO-BP model, they decreased by 63.45%, 57.25%, and 63.84%, respectively; compared with the AO-BP model, they decreased by 36.96%, 35.75%, and 37.66%, respectively; and compared with the AVOA-BP model, they decreased by 51.40%, 51.88%, and 52.10%, respectively. These results not only reaffirm the model’s advantage in reducing overall prediction errors but also indicate that the proposed model maintains stable predictive performance across different battery types and operating conditions, demonstrating strong robustness and generalization capability.
Further analysis of the statistical test results in
Table 8 reveals that AO-AVOA-BP and the conventional BP model exhibit highly significant differences (
p < 0.001) on the CS2-35 to CS2-37 battery datasets, with the large F-values obtained from the ANOVA tests corroborating the superior overall prediction accuracy of the proposed model. Meanwhile, comparisons with AO-BP also indicate statistically significant improvements in most cases (e.g., the t-tests on CS2-35 and CS2-36 both yielded
p < 0.001), suggesting that the synergistic effect of the dual optimization mechanism can further enhance error control. However, in the comparisons with AVOA-BP, the results vary across different battery datasets. For instance, on CS2-35 and CS2-36, the
p-values were greater than 0.05, indicating that the differences were not statistically significant under these conditions; in contrast, on CS2-37 and CS2-38, the t-test results (
p < 0.01) demonstrated clear differences. These findings imply that although AO-AVOA-BP exhibits overall superiority, its performance advantages may vary under different operating conditions, particularly in scenarios where AVOA-BP alone already provides relatively comparable predictive performance.
Overall, the combined analysis of error metrics and statistical tests demonstrates that the AO-AVOA-BP model significantly improves SOH prediction accuracy on the CALCE battery datasets, with its advantages being statistically significant in most cases. This not only further verifies the robustness and generalization ability of the model across different battery platforms but also statistically substantiates the rationality and effectiveness of its improved structural design.
It should be noted that although the AO-AVOA-BP model demonstrates significant predictive advantages on both the NASA and CALCE datasets, certain limitations remain in terms of computational cost, scalability, and hyperparameter sensitivity. For example, while the dual-mode optimization mechanism enhances global search capability and dynamic adaptability, it also increases training complexity. Its generalization performance and engineering adaptability under larger-scale or more complex operating conditions still require further validation. In addition, the model exhibits sensitivity to certain hyperparameter settings, which may affect stability across different scenarios. Therefore, future research may focus on reducing computational overhead, improving model universality and adaptability, and exploring embedded deployment in practical battery management systems to further facilitate its real-world application.
Based on the comparative results from both the B0005 and CS2-35 datasets, it is evident that the proposed AO-AVOA-BP model significantly enhances the global search capability and local convergence behavior of the neural network through a multi-strategy optimization approach. This leads to superior error control performance compared to the traditional BP model and its single or simply integrated optimization variants. In particular, the notable improvement in the MAPE metric indicates that the proposed model captures the actual SOH degradation trend more accurately, thereby reducing cumulative prediction errors caused by abnormal fluctuations or localized deviations. It not only effectively addresses the error accumulation issue associated with the parameter sensitivity of traditional BP models but also overcomes the limitations of single optimization algorithms (e.g., PSO, AVOA) under complex operating conditions. Overall, the proposed model exhibits outstanding performance not only on individual datasets but also maintains stable advantages across different datasets, demonstrating strong generalizability and potential for real-world engineering applications. This provides a reliable technical foundation for large-scale deployment in battery health management (BHM) systems.
To further validate the performance advantages of the proposed AO-AVOA-BP neural network model in lithium-ion battery SOH prediction tasks, a systematic comparison was conducted using two representative batteries, B0005 and CS2-35. The model was benchmarked against several state-of-the-art prediction models, including TCA-PSA and BO-LSTM. The comparative results are presented in
Table 9.
As shown in
Table 9, the AO-AVOA-BP neural network achieved the lowest MAE and RMSE values across both the B0005 and CS2-35 battery datasets, indicating superior prediction accuracy and stronger residual error control. Specifically, for the B0005 dataset, the AO-AVOA-BP model obtained an MAE of 0.0079 and an RMSE of 0.0089, significantly outperforming BO-LSTM (0.0127, 0.0092) and MFPA-TCN (0.0093, 0.0097). On the CS2-35 dataset, the model further surpassed alternatives such as CVIT and CNN-DBLSTM, achieving an MAE of 0.0087 and an RMSE of 0.0115, demonstrating robust error control capability under varying SOH degradation curve patterns.
Notably, the AO-AVOA-BP model exhibited outstanding performance in terms of the MAPE metric, achieving 1.0943% on the B0005 dataset and 1.095% on the CS2-35 dataset, which is substantially lower than the 7.0077% reported by the CNN-BiGRU-QR model. These results indicate that the proposed model not only outperforms other high-complexity deep learning methods in terms of absolute error but also achieves higher accuracy on a relative scale. This enables it to effectively avoid excessive relative errors during the steep slope changes typically observed at the end of battery’s life, reflecting its strong full-sequence fitting capability and robustness.
The performance improvement of the AO-AVOA-BP model primarily stems from the introduction of a hybrid intelligent optimization strategy that integrates AO and AVOA to jointly optimize the initial weights and hyperparameters of the BP neural network. Compared with models that rely solely on gradient descent or a single heuristic algorithm, this dual-optimization mechanism enables more efficient global search during the early training stages and fine-grained local convergence in the later stages, thereby significantly enhancing the BP network’s ability to capture nonlinear degradation trends and extreme points. This “structural enhancement + weight optimization” strategy also effectively overcomes the traditional limitations of BP networks, such as slow convergence and susceptibility to local minima.
Overall, the AO-AVOA-BP model demonstrates stable performance across two publicly available datasets with distinct sampling strategies, aging rates, and operating conditions, indicating its strong generalization capability across different battery types and sampling mechanisms. This consistency across datasets is of practical significance, particularly for industrial scenarios involving diverse cell chemistries and uncertain operating environments. Compared with structurally complex, resource-intensive end-to-end models, AO-AVOA-BP achieves higher accuracy and stability with a relatively small parameter scale, providing an efficient, interpretable, and easily deployable modeling solution for lithium-ion battery SOH prediction tasks.