Next Article in Journal
Advances in Numerical Reservoir Simulation for In Situ Upgrading of Heavy Oil via Steam-Based Technologies
Previous Article in Journal
An Automated Method for Optimizing the Energy Efficiency of Multi-Story Student Residence Halls Using Façade Photovoltaic Installations
Previous Article in Special Issue
LightGBM Medium-Term Photovoltaic Power Prediction Integrating Meteorological Features and Historical Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming

1
School of Electronic Engineering, Nanjing Xiaozhuang University, Nanjing 211171, China
2
School of Automation, Southeast University, Nanjing 210096, China
*
Author to whom correspondence should be addressed.
Energies 2025, 18(21), 5638; https://doi.org/10.3390/en18215638 (registering DOI)
Submission received: 1 October 2025 / Revised: 22 October 2025 / Accepted: 24 October 2025 / Published: 27 October 2025
(This article belongs to the Special Issue AI Solutions for Energy Management: Smart Grids and EV Charging)

Abstract

The hybrid energy storage system (HESS) that combines battery with hydrogen storage exploits complementary power/energy characteristics, but most studies optimize capacity and operation separately, leading to suboptimal overall performance. To address this issue, this paper proposes a bi-level co-optimization framework that integrates deep reinforcement learning (DRL) and mixed integer programming (MIP). The outer layer employs the TD3 algorithm for capacity configuration, while the inner layer uses the Gurobi solver for optimal operation under constraints. On a standalone PV–wind–load-HESS system, the method attains near-optimal quality at dramatically lower runtime. Relative to GA + Gurobi and PSO + Gurobi, the cost is lower by 4.67% and 1.31%, while requiring only 0.52% and 0.58% of their runtime; compared with a direct Gurobi solve, the cost remains comparable while runtime decreases to 0.07%. Sensitivity analysis further validates the model’s robustness under various cost parameters and renewable energy penetration levels. These results indicate that the proposed DRL–MIP cooperation achieves near-optimal solutions with orders of magnitude speedups. This study provides a new DRL–MIP paradigm for efficiently solving strongly coupled bi-level optimization problems in energy systems.
Keywords: hybrid energy storage system (HESS); hydrogen storage; battery energy storage system; co-optimization; bi-level; deep reinforcement learning (DRL) hybrid energy storage system (HESS); hydrogen storage; battery energy storage system; co-optimization; bi-level; deep reinforcement learning (DRL)

Share and Cite

MDPI and ACS Style

Qian, T.; Zhang, K.; Shi, D.; Zhang, L. Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies 2025, 18, 5638. https://doi.org/10.3390/en18215638

AMA Style

Qian T, Zhang K, Shi D, Zhang L. Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies. 2025; 18(21):5638. https://doi.org/10.3390/en18215638

Chicago/Turabian Style

Qian, Tiantian, Kaifeng Zhang, Difen Shi, and Lei Zhang. 2025. "Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming" Energies 18, no. 21: 5638. https://doi.org/10.3390/en18215638

APA Style

Qian, T., Zhang, K., Shi, D., & Zhang, L. (2025). Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies, 18(21), 5638. https://doi.org/10.3390/en18215638

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop