This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming
by
Tiantian Qian
Tiantian Qian 1,2,*,
Kaifeng Zhang
Kaifeng Zhang 2,
Difen Shi
Difen Shi 1 and
Lei Zhang
Lei Zhang 1
1
School of Electronic Engineering, Nanjing Xiaozhuang University, Nanjing 211171, China
2
School of Automation, Southeast University, Nanjing 210096, China
*
Author to whom correspondence should be addressed.
Energies 2025, 18(21), 5638; https://doi.org/10.3390/en18215638 (registering DOI)
Submission received: 1 October 2025
/
Revised: 22 October 2025
/
Accepted: 24 October 2025
/
Published: 27 October 2025
Abstract
The hybrid energy storage system (HESS) that combines battery with hydrogen storage exploits complementary power/energy characteristics, but most studies optimize capacity and operation separately, leading to suboptimal overall performance. To address this issue, this paper proposes a bi-level co-optimization framework that integrates deep reinforcement learning (DRL) and mixed integer programming (MIP). The outer layer employs the TD3 algorithm for capacity configuration, while the inner layer uses the Gurobi solver for optimal operation under constraints. On a standalone PV–wind–load-HESS system, the method attains near-optimal quality at dramatically lower runtime. Relative to GA + Gurobi and PSO + Gurobi, the cost is lower by 4.67% and 1.31%, while requiring only 0.52% and 0.58% of their runtime; compared with a direct Gurobi solve, the cost remains comparable while runtime decreases to 0.07%. Sensitivity analysis further validates the model’s robustness under various cost parameters and renewable energy penetration levels. These results indicate that the proposed DRL–MIP cooperation achieves near-optimal solutions with orders of magnitude speedups. This study provides a new DRL–MIP paradigm for efficiently solving strongly coupled bi-level optimization problems in energy systems.
Share and Cite
MDPI and ACS Style
Qian, T.; Zhang, K.; Shi, D.; Zhang, L.
Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies 2025, 18, 5638.
https://doi.org/10.3390/en18215638
AMA Style
Qian T, Zhang K, Shi D, Zhang L.
Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies. 2025; 18(21):5638.
https://doi.org/10.3390/en18215638
Chicago/Turabian Style
Qian, Tiantian, Kaifeng Zhang, Difen Shi, and Lei Zhang.
2025. "Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming" Energies 18, no. 21: 5638.
https://doi.org/10.3390/en18215638
APA Style
Qian, T., Zhang, K., Shi, D., & Zhang, L.
(2025). Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies, 18(21), 5638.
https://doi.org/10.3390/en18215638
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.