Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming
Abstract
1. Introduction
- (1)
- Capacity Optimization
- (2)
- Operation Optimization
- (3)
- Co-Optimization of Capacity and Operation
- (1)
- Strong coupling between long-term investment and short-term operation;
- (2)
- High nonlinearity and mixed-integer characteristics;
- (3)
- Environmental uncertainty and computational complexity.
- (1)
- A novel bi-level collaborative optimization model for capacity and operation is proposed. In the capacity optimization layer, the model incorporates an adaptive mechanism for net load fluctuations into capacity boundary calculation; in the operation optimization layer, a dynamic operation model considering the degradation processes of battery and hydrogen energy storage is established to improve the model’s accuracy and engineering applicability.
- (2)
- A novel hybrid solution algorithm that combines RL and mixed-integer programming (MIP) is proposed for the bi-level optimization model. The proposed approach leverages the adaptive learning and environment-perception capabilities of RL to dynamically respond to complex and uncertain scenarios, while employing MIP to ensure accurate optimal operation and strict constraint satisfaction. This hybrid algorithm achieves near-optimal performance with significantly reduced computational time.
- (3)
- The proposed model is validated through multi-scenario simulations and sensitivity analyses, demonstrating its robustness and generalization capability. The results show that the proposed method maintains stable optimization performance under varying operating conditions and system parameters.
2. System Model
2.1. Inner-Layer Operation Optimization
2.1.1. Objective Function
2.1.2. Constraints
Hydrogen Energy Storage System Constraints
- (1)
- Power operation constraints.
- (2)
- Mutual-exclusion constraint (to prevent the electrolyzer and fuel cell from operating simultaneously).
- (3)
- Hydrogen storage tank dynamic balance equation.
- (4)
- Hydrogen storage tank state constraints.
- (5)
- To ensure the feasibility and stability of the system during multi-day continuous operation, periodic constraints are imposed.
- (6)
- To prevent frequent start–stop cycling of the electrolyzer, which may accelerate its degradation, a start–stop operation constraint is imposed.
Battery System Constraints
- (1)
- Power operation constraints
- (2)
- Charging/discharging mutual-exclusion constraint (to prevent simultaneous charging and discharging)
- (3)
- Battery state balance equation
- (4)
- Battery state upper and lower bound constraints
System Power Balance Equation
2.2. Outer-Layer Capacity Optimization
2.2.1. Objective Function
2.2.2. Constraints
- (1)
- System reliability constraints
- (2)
- Capacity configuration boundary constraints
- (3)
- Battery charging/discharging duration constraints
3. A Cooperative DRL–MIP Framework for HESS Capacity Configuration and Operation Optimization
3.1. Collaborative Optimization Mechanism
3.1.1. Outer Layer Design
3.1.2. Inner Layer Design
- State and action space design
- 2.
- Reward function design
- 3.
- Network architecture and training strategy
- (1)
- Network architecture
- (2)
- TD3 core strategy
- (3)
- Network update mechanism
- (4)
- Soft update mechanism
- (5)
- Prioritized experience replay
4. Results and Discussion
4.1. Case Setting
4.2. Algorithmic Solution and Results Analysis
4.3. Comparative Analysis
4.4. Sensitivity Analysis
4.4.1. Sensitivity Analysis of Key Component Costs
- (1)
- Sensitivity to Electrolyzer Power Cost
- (2)
- Sensitivity to Fuel Cell Power Cost
- (3)
- Sensitivity to Hydrogen Tank Cost
- (4)
- Sensitivity to Lithium Battery Power Cost
- (5)
- Sensitivity to Lithium Battery Energy-Capacity Cost
4.4.2. Sensitivity Analysis of Renewable Energy Penetration
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations and Nomenclature
| HESS | Hybrid energy storage system |
| DRL | Deep reinforcement learning |
| EL | Electrolyzer |
| BESS | Battery energy storage system |
| MIP | Mixed integer programming |
| FC | Fuel cell |
| HST | Hydrogen storage tank |
| MDP | Markov Decision Process |
| TD3 | Twin Delayed Deep Deterministic Policy Gradient |
| SoC | State of charge |
| SoH | State of hydrogen storage tank |
| Cost related variables | |
| Total daily operation and maintenance cost of each component | |
| Battery investment cost | |
| Unit power investment cost of the electrolyzer | |
| Unit power investment cost of the fuel cell | |
| Minimized daily total cost of the system | |
| Salvage value | |
| Ratio of maintenance costs to the investment cost of the electrolyzer | |
| Ratio of maintenance cost to the investment cost of the fuel cell | |
| Degradation cost coefficient of electrolyzer | |
| Degradation cost coefficient of fuel cell | |
| Power and energy related variables | |
| Net load power at time t | |
| Electrolyzer power | |
| Fuel cell power | |
| Battery charging/discharging power | |
| Rated power of the electrolyzer | |
| Rated power of the fuel cell | |
| Minimum and maximum of the net load | |
| Surplus and deficit energy | |
| Maximum continuous energy requirement of the system | |
| Deficit or surplus power | |
| Capacity of the hydrogen storage tank | |
| Hydrogen production rate | |
| Hydrogen consumption rate | |
| Accelerated aging coefficient of battery | |
| Efficiency and parameters | |
| Charging and discharging efficiencies of the battery | |
| Electricity-to-hydrogen conversion efficiency of the electrolyzer | |
| Hydrogen-to-electricity conversion efficiency of the fuel cell | |
| Hydrogen compression storage efficiency | |
| Hydrogen decompression efficiency | |
| a,b | Parameters of the battery aging model |
| Difference between the starting state and the ending state | |
| i | Discount rate |
| λ | Margin factor |
| T | Length of optimization time horizon |
| Technical lifetime | |
| Designed service life of the electrolyzer | |
| Designed service life of the fuel ce | |
| Required minimum charging/discharging duration of the battery system | |
| Lower heating value (LHV) of hydrogen | |
| Total number of start–stop cycles | |
| Binary variables | |
| Binary variables-the status of the electrolyzer: 1 on, 0 off | |
| Binary variables-the status of the fuel cell: 1 on, 0 off | |
| Binary variables-the charging states of the battery: 1 on, 0 off | |
| Binary variables-the discharging states of the battery: 1 on, 0 off | |
| Variables related to reinforcement learning and neural networks | |
| Feature vector of the net-load profile | |
| Normalized value of the current best capacity configuration | |
| Normalized value of the historical information features | |
| Power deviation threshold | |
| Numerical stability term | |
| Base improvement reward | |
| Improvement amplification coefficient | |
| Superlinear exponent | |
| Reward cap | |
| Penalty coefficient for no improvement | |
| Penalty coefficient for PD violation | |
| Normalization scale parameters | |
| Optimal actor policy | |
| θ | Parameters of the actor network |
| Expected total system cost | |
| Policy network | |
| Parameter set of the policy function | |
| Actor network learning rate | |
| Action output by the actor network under state s | |
| i-th critic network’s Q-function (action–value function) | |
| Parameters of the critic network | |
| Critic learning rate | |
| Loss function of the critic network | |
| Immediate reward | |
| Discount factor | |
| Minimum of the two Critic network outputs | |
| Random noise added to the target action | |
| Soft update parameter | |
| Parameters of the i-th target critic network | |
| Parameters of the target actor network | |
| Priority of sample i | |
| Priority sampling hyperparameter | |
| Temporal difference error | |
| Importance sampling weight | |
| Importance sampling hyperparameter | |
References
- Zhou, Z.; Ma, Z.; Mu, T. Hybrid energy storage capacity optimization based on VMD-SG and improved Firehawk optimization. Electr. Power Syst. Res. 2025, 239, 111218. [Google Scholar] [CrossRef]
- Lu, Q.; Yang, Y.; Chen, J.; Liu, Y.; Liu, N.; Cao, F. Capacity optimization of hybrid energy storage systems for offshore wind power volatility smoothing. Energy Rep. 2023, 9, 575–583. [Google Scholar] [CrossRef]
- Wu, X.; Shang, W.; Feng, G.; Huang, B.; Xiong, X. Coordinated control algorithm of hydrogen production-battery based hybrid energy storage system for suppressing fluctuation of PV power. Int. J. Hydrogen Energy 2024, 88, 931–944. [Google Scholar] [CrossRef]
- El-Ghazaly, M.; Abdel-Salam, M.; Nayel, M.; Hashem, M. Techno-economic utilization of hybrid optimized gravity-supercapacitor energy-storage system for enriching the stability of grid-connected renewable energy sources. J. Energy Storage 2025, 107, 115002. [Google Scholar] [CrossRef]
- Elkholy, M.; Schwarz, S.; Aziz, M. Advancing renewable energy: Strategic modeling and optimization of flywheel and hydrogen-based energy system. J. Energy Storage 2024, 101, 113771. [Google Scholar] [CrossRef]
- Hu, S.; Yang, H.; Ding, S.; Tian, Z.; Guo, B.; Chen, H.; Yang, F.; Xu, N. Model simulation and multi-objective capacity optimization of wind power coupled hybrid energy storage system. Energy 2025, 319, 134887. [Google Scholar] [CrossRef]
- Al-Quraan, A.; Athamnah, I. Economic tri-level control-based sizing and energy management optimization for efficiency maximization of stand-alone HRES. Energy Convers. Manag. 2024, 302, 118140. [Google Scholar] [CrossRef]
- Guven, A.F.; Abdelaziz, A.Y.; Samy, M.M.; Barakat, S. Optimizing energy dynamics: A comprehensive analysis of hybrid energy storage systems integrating battery banks and supercapacitors. Energy Convers. Manag. 2024, 312, 118560. [Google Scholar] [CrossRef]
- Li, B.; Wang, H.; Tan, Z. Capacity optimization of hybrid energy storage system for flexible islanded microgrid based on real-time price-based demand response. Int. J. Electr. Power Energy Syst. 2022, 136, 107581. [Google Scholar] [CrossRef]
- Wang, J.; Deng, H.; Qi, X. Cost-based site and capacity optimization of multi-energy storage system in the regional integrated energy networks. Energy 2022, 261, 125240. [Google Scholar] [CrossRef]
- Hu, Y.; Yang, B.; Wu, P.; Wang, X.; Li, J.; Huang, Y.; Su, R.; He, G.; Yang, J.; Su, S.; et al. Optimal planning of electric-heating integrated energy system in low-carbon park with energy storage system. J. Energy Storage 2024, 99, 113327. [Google Scholar] [CrossRef]
- Rowe, K.; Mokryani, G.; Cooke, K.; Campean, F.; Chambers, T. Bi-level optimal sizing, siting and operation of utility-scale multi-energy storage system to reduce power losses with peer-to-peer trading in an electricity/heat/gas integrated network. J. Energy Storage 2024, 83, 110738. [Google Scholar] [CrossRef]
- Wang, Y.; Zhang, Y.; Xue, L.; Liu, C.; Song, F.; Sun, Y.; Liu, Y.; Che, B. Research on planning optimization of integrated energy system based on the differential features of hybrid energy storage system. J. Energy Storage 2022, 55, 105368. [Google Scholar] [CrossRef]
- Li, X.; Li, M.; Habibi, M.; Najaafi, N.; Safarpour, H. Optimization of hybrid energy management system based on high-energy solid-state lithium batteries and reversible fuel cells. Energy 2023, 283, 128454. [Google Scholar] [CrossRef]
- Ye, Y.; Xu, B.; Wang, H.; Zhang, J.; Lawler, B.; Ayalew, B. Deep reinforcement learning-based energy management system enhancement using digital twin for electric vehicles. Energy 2024, 312, 133384. [Google Scholar] [CrossRef]
- Wu, Y.; Huang, Z.; Li, D.; Li, H.; Peng, J.; Guerrero, J.M.; Song, Z. Integrated battery thermal and energy management for electric vehicles with hybrid energy storage system: A hierarchical approach. Energy Convers. Manag. 2024, 317, 118853. [Google Scholar] [CrossRef]
- Barelli, L.; Bidini, G.; Ciupageanu, D.A.; Pelosi, D. Integrating hybrid energy storage system on a wind generator to enhance grid safety and stability: A levelized cost of electricity analysis. J. Energy Storage 2021, 34, 102050. [Google Scholar] [CrossRef]
- Roy, P.; Liao, Y.; He, J. Economic dispatch for grid-connected wind power with battery-supercapacitor hybrid energy storage system. IEEE Trans. Ind. Appl. 2023, 59, 1118–1128. [Google Scholar] [CrossRef]
- Bharatee, A.; Ray, P.K.; Ghosh, A. A power management scheme for grid-connected PV integrated with hybrid energy storage system. J. Mod. Power Syst. Clean. Energy 2022, 10, 954–963. [Google Scholar] [CrossRef]
- Wu, X.; Liu, L.; Wu, Y.; Luo, C.; Tang, Z.; Kerekes, T. Near-optimal energy management strategy for a grid-forming PV and hybrid energy storage system. IEEE Trans. Smart Grid 2025, 16, 1422–1433. [Google Scholar] [CrossRef]
- Dsouza, O.D.; Shilpa, G.; Rajnikanth; Irusapparajan, G. Optimized energy management for hybrid renewable energy sources with hybrid energy storage: An SMO-KNN approach. J. Energy Storage 2024, 96, 112152. [Google Scholar] [CrossRef]
- Sathishkumar, R.; Venkateswaran, M.; Deepamangai, P.; Rajan, P.S. An efficient power management control strategy for grid-independent hybrid renewable energy systems with hybrid energy storage: Hybrid approach. J. Energy Storage 2024, 96, 112685. [Google Scholar] [CrossRef]
- Adam, A.H.A.; Chen, J.; Kamel, S.; Safaraliev, M.; Matrenin, P. Power management and control of hybrid renewable energy systems with integrated diesel generators for remote areas. Int. J. Hydrogen Energy 2024, 89, 320–341. [Google Scholar] [CrossRef]
- Manandhar, U.; Zhang, X.; Beng, G.H.; Subramanian, L.; Lu, H.H.C.; Fernando, T. Enhanced energy management system for isolated microgrid with diesel generators, renewable generation, and energy storages. Appl. Energy 2023, 350, 121624. [Google Scholar] [CrossRef]
- Behera, P.K.; Pattnaik, M. Supervisory power management scheme of a laboratory scale wind-PV based LVDC microgrid integrated with hybrid energy storage system. IEEE Trans. Ind. Appl. 2024, 60, 4723–4735. [Google Scholar] [CrossRef]
- Ramu, S.K.; Vairavasundaram, I.; Palaniyappan, B.; Bragadeshwaran, A.; Aljafari, B. Enhanced energy management of DC microgrid: Artificial neural networks-driven hybrid energy storage system with integration of bidirectional DC-DC converter. J. Energy Storage 2024, 88, 111562. [Google Scholar] [CrossRef]
- Nkwanyana, T.B.; Siti, M.W.; Wang, Z.; Mulumba, W. Hybrid energy storage lifespan optimization based on an enhanced fuel-cell degradation model and meta-heuristic algorithm. Energy Rep. 2024, 12, 5712–5727. [Google Scholar] [CrossRef]
- Duong, H.-N.; Tran, L.; Vu, T.; Vo-Duy, T.; Nguyễn, B.-H. A global optimal benchmark for energy management of microgrid (GoBuG) integrating hybrid energy storage system. IEEE Trans. Smart Grid 2024, 15, 5429–5440. [Google Scholar] [CrossRef]
- Zhang, K.; Zou, G.; Zhang, J.; Li, H.; Sun, Y.; Li, G. Microgrid energy management strategy considering source-load forecast error. Int. J. Electr. Power Energy Syst. 2025, 164, 110372. [Google Scholar] [CrossRef]
- Tang, Y.; Xun, Q.; Liserre, M.; Yang, H. Energy management of electric-hydrogen hybrid energy storage systems in photovoltaic microgrids. Int. J. Hydrogen Energy 2024, 80, 1–10. [Google Scholar] [CrossRef]
- Sepehrzad, R.; Moridi, A.R.; Hassanzadeh, M.E.; Seifi, A.R. Intelligent energy management and multi-objective power distribution control in hybrid micro-grids based on the advanced fuzzy-PSO method. ISA Trans. 2021, 112, 199–213. [Google Scholar] [CrossRef]
- Chekira, O.; Boujoudar, Y.; El Moussaoui, H.; Boharb, A.; Lamhamdi, T.; El Markhi, H. An improved microgrid energy management system based on hybrid energy storage system using ANN NARMA-L2 controller. J. Energy Storage 2024, 98, 113096. [Google Scholar] [CrossRef]
- Wang, J.; Lyu, C.; Bai, Y.; Yang, K.; Song, Z.; Meng, J. Optimal scheduling strategy for hybrid energy storage systems of battery and flywheel combined multi-stress battery degradation model. J. Energy Storage 2024, 99, 113208. [Google Scholar] [CrossRef]
- Elkholy, M.H.; Senjyu, T.; Metwally, H.; Farahat, M.; Irshad, A.S.; Hemeida, A.M.; Lotfy, M.E. A resilient and intelligent multi-objective energy management for a hydrogen-battery hybrid energy storage system based on MFO technique. Renew. Energy 2024, 222, 119768. [Google Scholar] [CrossRef]
- Yang, C.; Li, X.; Chen, L.; Mei, S. Intra-day and seasonal peak shaving oriented operation strategies for electric–hydrogen hybrid energy storage in isolated energy systems. Sustainability 2024, 16, 7010. [Google Scholar] [CrossRef]
- Han, F.; Zeng, J.; Lin, J.; Gao, C. Multi-stage distributionally robust optimization for hybrid energy storage in regional integrated energy system considering robustness and nonanticipativity. Energy 2023, 277, 127729. [Google Scholar] [CrossRef]
- Shan, J.; Lu, R. Multi-objective economic optimization scheduling of CCHP micro-grid based on improved bee colony algorithm considering the selection of hybrid energy storage system. Energy Rep. 2021, 7, 326–341. [Google Scholar] [CrossRef]
- Deng, J.; Wang, X.; Chen, T.; Meng, F. An energy router based on multi-hybrid energy storage system with energy coordinated management strategy in island operation mode. Renew. Energy 2023, 212, 274–284. [Google Scholar] [CrossRef]
- Pang, B.; Zhu, H.; Tong, Y.; Dong, Z. Optimal design and control of battery-ultracapacitor hybrid energy storage system for BEV operating at extreme temperatures. J. Energy Storage 2024, 101, 113963. [Google Scholar] [CrossRef]
- Li, M.; Wang, L.; Wang, Y.; Chen, Z. Sizing optimization and energy management strategy for hybrid energy storage system using multiobjective optimization and random forests. IEEE Trans. Power Electron. 2021, 36, 11421–11430. [Google Scholar] [CrossRef]
- Xu, F.; Li, X.; Jin, C. Optimal capacity configuration and dynamic pricing strategy of a shared hybrid hydrogen energy storage system for integrated energy system alliance: A bi-level programming approach. Int. J. Hydrogen Energy 2024, 69, 331–346. [Google Scholar] [CrossRef]
- He, Y.; Guo, S.; Zhou, J.; Song, G.; Kurban, A.; Wang, H. The multi-stage framework for optimal sizing and operation of hybrid electrical-thermal energy storage system. Energy 2022, 245, 123248. [Google Scholar] [CrossRef]
- Li, C.; Zhang, X. Optimal sizing of hybrid energy storage system under multiple typical conditions of sources and loads. Int. J. Sustain. Energy 2025, 44, 2439298. [Google Scholar] [CrossRef]
- Tsao, Y.-C.; Banyupramesta, I.G.A.; Lu, J.-C. Optimal operation and capacity sizing for a sustainable shared energy storage system with solar power and hydropower generator. J. Energy Storage 2025, 110, 115173. [Google Scholar] [CrossRef]
- Wang, G.; Blondeau, J. Optimal combination of daily and seasonal energy storage using battery and hydrogen production to increase the self-sufficiency of local energy communities. J. Energy Storage 2024, 92, 112206. [Google Scholar] [CrossRef]
- Yang, H.; Chu, Y.; Ma, Y.; Zhang, D. Operation strategy and optimization configuration of hybrid energy storage system for enhancing cycle life. J. Energy Storage 2024, 95, 112560. [Google Scholar] [CrossRef]
- Li, H.; Sun, D.; Li, B.; Wang, X.; Zhao, Y.; Wei, M.; Dang, X. Collaborative optimization of VRB-PS hybrid energy storage system for large-scale wind power grid integration. Energy 2023, 265, 126292. [Google Scholar] [CrossRef]
- Liang, Z.; Chung, C.Y.; Wang, Q.; Chen, H.; Yang, H.; Wu, C. Fortifying renewable-dominant hybrid microgrids: A bi-directional converter-based interconnection planning approach. Engineering 2025, 51, 130–143. [Google Scholar] [CrossRef]
- Lou, Q.; Li, Y.; Li, Z.; Han, L.; Xu, Y.; Yi, Z. Multi-stage planning approach for distribution network considering long-term variations in load and renewable energy. Energies 2025, 18, 152. [Google Scholar] [CrossRef]
- Huang, B.; Zhao, T.; Yue, M.; Wang, J. Bi-level adaptive storage expansion strategy for microgrids using deep reinforcement learning. IEEE Trans. Smart Grid 2023, 15, 1362–1375. [Google Scholar] [CrossRef]
- Fujimoto, S.; van Hoof, H.; Meger, D. Addressing Function Approximation Error in Actor-Critic Methods. In Proceedings of the 35th International Conference on Machine Learning (ICML 2018), Stockholm, Sweden, 10–15 July 2018. [Google Scholar]










| Reference | Co-Optimization Module | Solution Methods | Whether Scenario Uncertainty Is Considered | Whether Energy Storage Degradation Is Considered |
|---|---|---|---|---|
| [39] | Capacity level: nonlinear programming (NLP); Operation level: dynamic programming model (DP) | Capacity level: multi-start space-reduction algorithm; Operation level: dynamic programming algorithm; Interactive iteration between two levels | No | Yes, only battery |
| [40] | Capacity level: multi-objective NLP; Operation level: DP | Capacity level: multi-objective grey wolf optimization algorithm; Operation level: dynamic programming algorithm; Interactive iteration between two levels | No | Yes, only battery |
| [41] | Capacity level: NLP; Operation level: mixed-integer linear programming (MILP) | Capacity level: improved PSO–GA hybrid algorithm; Operation level: commercial solver; Interactive iteration between two levels | Partially considered by typical day data based on clustering | No |
| [42] | Capacity level: NLP; Operation level: MILP | Capacity level: PSO algorithm; Operation level: commercial solver; Interactive iteration between two levels | Partially considered by typical day data based on clustering | Yes, only battery |
| [43] | Capacity level: multi-objective NLP; Operation level: MILP | Capacity level: NSGA-III algorithm; Operation level: commercial solver; Interactive iteration between two levels | Partially considered by typical day data based on clustering | No |
| [44] | Capacity level: NLP; Operation level: mixed-integer nonlinear programming (MINLP) | Capacity level: sequential quadratic programming algorithm; Operation level: commercial solver; Interactive iteration between two levels | No | No |
| [45] | Capacity level: multi-objective NLP; Operation level: MILP | Capacity level: ε-constraint multi-objective optimization algorithm; Operation level: commercial solver; Interactive iteration between two levels | No | No |
| [46] | Capacity level: NLP; Operation level: MINLP | Transformed into a single-level MILP and solved by a commercial solver | No | Yes, only battery |
| [47] | Capacity level: NLP; Operation level: MINLP | Transformed into a single-level MILP and solved by a commercial solver | No | No |
| Component | Economic/Technical Parameter | Value | Unit |
|---|---|---|---|
| EL (Electrolyzer) | 786 | USD/kWh | |
| 0.8 | - 1 | ||
| 0.4 | - | ||
| 70,000 | Hour | ||
| HFC (Fuel Cell) | 286 | USD/kW | |
| 0.6 | - | ||
| 0.3 | - | ||
| 30,000 | Hour | ||
| HT (Hydrogen Tank) | 1143 | USD/kg | |
| 0 | - | ||
| 1 | - | ||
| 0.97 | - | ||
| 0.98 | - | ||
| Battery | 429 | USD/kW | |
| 357 | USD/kWh | ||
| 0.98 | - | ||
| 0.98 | - | ||
| 0.1 | - | ||
| 0.9 | - | ||
| 10 | USD/kWh | ||
| HESS (Hybrid Energy Storage System) | 20 | Year | |
| i | 0.08 | - | |
| Other | 33.33 | kWh/kg |
| Decision Variable | Optimized Result | Unit |
|---|---|---|
| 312.23 | kW | |
| 173.26 | kW | |
| 225.90 | kg | |
| 71.60 | kW | |
| 174.68 | kWh | |
| Minimum daily total cost | 209.10 | USD |
| Cases | (kW) | (kW) | (kg) | (kW) | (kWh) | (USD) | Computation Time (s) | |
|---|---|---|---|---|---|---|---|---|
| Case 1 | DRL + Gurobi | 312.23 | 173.26 | 225.90 | 71.60 | 174.68 | 209.10 | 1.3 |
| Case 2 | GA + Gurobi | 262.88 | 93.19 | 112.77 | 133.42 | 460.72 | 219.34 | 250 |
| PSO + Gurobi | 279.96 | 115.35 | 146.30 | 104.32 | 353.56 | 211.87 | 225 | |
| Gurobi | 309.16 | 173.33 | 220.43 | 74.39 | 186.89 | 208.73 | 1800 | |
| Case 3 | Battery-only | - | - | - | 383.57 | 3297.74 | 473.35 | - |
| Hydrogen-only | 383.57 | 238.93 | 76.25 | - | - | 140.19 | - | |
| (USD/kW) | (kW) | (kW) | (kg) | (kW) | (kWh) | (USD) |
|---|---|---|---|---|---|---|
| 550 | 319.07 | 172.27 | 232.94 | 64.53 | 158.19 | 186.71 |
| 629 | 274.13 | 109.31 | 132.36 | 109.37 | 386 | 198.5 |
| 707 | 307.63 | 156.85 | 217.02 | 75.94 | 194.1 | 203.3 |
| 786 | 312.23 | 173.26 | 225.9 | 71.6 | 174.68 | 209.1 |
| 864 | 270.75 | 106.74 | 122.61 | 112.82 | 406.87 | 218.51 |
| 943 | 266.8 | 102.38 | 109.69 | 116.77 | 435.96 | 225.15 |
| 1021 | 255.13 | 93.96 | 72.45 | 128.44 | 521.15 | 231.62 |
| (USD/kW) | (kW) | (kW) | (kg) | (kW) | (kWh) | (USD) |
|---|---|---|---|---|---|---|
| 200 | 304.59 | 167.7 | 210.58 | 81.9 | 208.87 | 205.98 |
| 229 | 317.57 | 175.64 | 230.91 | 68.55 | 163.37 | 207.27 |
| 257 | 317.28 | 173.78 | 231.34 | 68.11 | 162.31 | 207.79 |
| 286 | 312.23 | 173.26 | 225.9 | 71.6 | 174.68 | 209.1 |
| 314 | 318.06 | 173.25 | 231.93 | 65.71 | 160.49 | 210.95 |
| 343 | 280.19 | 108.37 | 154.28 | 103.38 | 338.23 | 213.84 |
| 371 | 273.75 | 109.99 | 132.11 | 109.82 | 385.45 | 215.13 |
| (USD/kg) | (kW) | (kW) | (kg) | (kW) | (kWh) | (USD) |
|---|---|---|---|---|---|---|
| 800 | 334.53 | 181.63 | 255.82 | 49.54 | 120.08 | 187.76 |
| 914 | 320.07 | 175.68 | 234.14 | 63.54 | 155.56 | 194.59 |
| 1029 | 286.16 | 115.47 | 164.88 | 96.73 | 327.47 | 206.44 |
| 1143 | 312.23 | 173.26 | 225.9 | 71.6 | 174.68 | 209.1 |
| 1257 | 253.81 | 102.13 | 65.9 | 129.77 | 534.64 | 215.21 |
| 1371 | 251.5 | 90.13 | 59.94 | 132.91 | 548.25 | 216.69 |
| 1486 | 254.86 | 95.91 | 57.52 | 128.72 | 553.54 | 219.14 |
| (USD/kW) | (kW) | (kW) | (kg) | (kW) | (kWh) | (USD) |
|---|---|---|---|---|---|---|
| 300 | 305.77 | 169.31 | 212.97 | 80.29 | 203.19 | 207.81 |
| 343 | 306.89 | 171.54 | 215.44 | 77.49 | 197.67 | 208.86 |
| 386 | 306.34 | 171.57 | 215.45 | 78.51 | 199.52 | 209.82 |
| 429 | 312.23 | 173.26 | 225.9 | 71.6 | 174.68 | 209.1 |
| 471 | 273.55 | 125.53 | 131.58 | 110.09 | 386.62 | 213.42 |
| 514 | 314.97 | 180.85 | 259.79 | 57.69 | 143.77 | 216.62 |
| 557 | 278.22 | 146.66 | 131.77 | 107.87 | 386.84 | 217.57 |
| (USD/kWh) | (kW) | (kW) | (kg) | (kW) | (kWh) | (USD) |
|---|---|---|---|---|---|---|
| 250 | 249.66 | 88.36 | 53.83 | 134.04 | 561.86 | 179.71 |
| 286 | 244.15 | 99.73 | 53.04 | 146.54 | 611.26 | 198.23 |
| 321 | 264.4 | 99.8 | 101.19 | 119.17 | 455.11 | 203.01 |
| 357 | 312.23 | 173.26 | 225.9 | 71.6 | 174.68 | 209.1 |
| 393 | 285.23 | 150.21 | 168.27 | 98.33 | 304.06 | 217.79 |
| 429 | 311.65 | 168.48 | 225.19 | 77.5 | 177.39 | 218.42 |
| 464 | 325.33 | 174.73 | 241.01 | 58.71 | 142.63 | 220.08 |
| Renewable Energy Penetration Scenario | (kW) | (kW) | (kg) | (kW) | (kWh) | (USD) |
|---|---|---|---|---|---|---|
| High | 312.23 | 173.26 | 225.9 | 71.6 | 174.68 | 209.1 |
| Medium | 318.37 | 212.37 | 51.83 | 117.86 | 372.66 | 203.48 |
| Low | 123.18 | 290.64 | 1083.62 | 69.76 | 327.83 | 470.96 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Qian, T.; Zhang, K.; Shi, D.; Zhang, L. Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies 2025, 18, 5638. https://doi.org/10.3390/en18215638
Qian T, Zhang K, Shi D, Zhang L. Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies. 2025; 18(21):5638. https://doi.org/10.3390/en18215638
Chicago/Turabian StyleQian, Tiantian, Kaifeng Zhang, Difen Shi, and Lei Zhang. 2025. "Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming" Energies 18, no. 21: 5638. https://doi.org/10.3390/en18215638
APA StyleQian, T., Zhang, K., Shi, D., & Zhang, L. (2025). Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming. Energies, 18(21), 5638. https://doi.org/10.3390/en18215638
