A Health-Aware Hybrid Reinforcement–Predictive Control Framework for Sustainable Energy Management in Photovoltaic–Electric Vehicle Microgrids
Abstract
1. Introduction
1.1. Research Objectives
- To develop a hybrid control architecture that integrates stochastic model predictive control (SMPC) with DRL, enabling adaptive, data-driven decision-making under uncertainty.
- To incorporate a degradation-aware optimisation layer that explicitly manages battery SOH and prolongs the lifespan of energy storage systems through intelligent charge/discharge regulation.
- To evaluate the proposed H-RPEM on a real PV-EV MG dataset, benchmarking its performance against that of conventional predictive and rule-based control methods in terms of cost, emissions, and technical resilience.
1.2. Research Questions
- How can RL and predictive optimisation be effectively combined into a single control framework that maintains adaptability while satisfying physical and operational constraints?
- In what ways does the explicit consideration of battery degradation influence the economic and environmental performance of PV-EV MGs during real-time operation?
- To what extent can the proposed hybrid controller maintain robustness and stability under stochastic variations in renewable generation, grid tariffs, and EV charging demand?
2. Methodology
2.1. Data Pre-Processing and Stochastic Characterisation
- PV irradiance and PV power were obtained from the NREL MIDC database for a representative UK latitude, with PV output computed using Equation (3).
- Load demand was synthesised from standard residential and commercial load shape templates and scaled to match the microgrid size considered.
- Cell temperature was estimated from the ambient temperature using a standard NOCT model.
- EV fleet power was reconstructed by aggregating all overlapping charging sessions into a 15 min power profile.
2.2. MG Model and Energy Balance
2.3. Hybrid Reinforcement–Predictive Control (H-RPEM)
2.4. Algorithmic Structure of the H-RPEM Controller
| Algorithm 1: Health-Aware Hybrid Reinforcement–Predictive Energy Manager |
![]() |
3. Results and Discussion
3.1. Quantitative Evaluation of Economic, Environmental, and Health-Aware Performance
3.2. Sensitivity and Robustness Analysis of the H-RPEM Controller
3.3. Learning Dynamics and Convergence Behaviour of the Hybrid Controller
3.4. Limitations
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- International Energy Agency (IEA). Global EV Outlook 2023: Catching up with Climate Ambitions; IEA: Paris, France, 2023; Available online: https://www.iea.org/reports/global-ev-outlook-2023 (accessed on 12 November 2025).
- Manousakis, N.M.; Karagiannopoulos, P.S.; Tsekouras, G.J.; Kanellos, F.D. Integration of renewable energy and electric vehicles in power systems: A review. Processes 2023, 11, 1544. [Google Scholar] [CrossRef]
- Alhawsawi, E.Y.; Salhein, K.; Zohdy, M.A. A comprehensive review of existing and pending university campus microgrids. Energies 2024, 17, 2425. [Google Scholar] [CrossRef]
- Bullich-Massagué, E.; Cifuentes-García, F.J.; Glenny-Crende, I.; Cheah-Mañé, M.; Aragüés-Peñalba, M.; Díaz-González, F.; Gomis-Bellmunt, O. A review of energy storage technologies for large-scale photovoltaic power plants. Appl. Energy 2020, 274, 115213. [Google Scholar] [CrossRef]
- Cavus, M.; Allahham, A.; Adhikari, K.; Zangiabadia, M.; Giaouris, D. Control of microgrids using an enhanced Model Predictive Controller. In Proceedings of the 11th International Conference on Power Electronics, Machines and Drives (PEMD 2022), Newcastle, UK, 21–23 June 2022; pp. 660–665. [Google Scholar] [CrossRef]
- Ahmad, S.; Shafiullah, M.; Ahmed, C.B.; Alowaifeer, M. A review of microgrid energy management and control strategies. IEEE Access 2023, 11, 21729–21757. [Google Scholar] [CrossRef]
- Baillieul, J.; Samad, T. (Eds.) Encyclopedia of Systems and Control, 2nd ed.; Springer: London, UK, 2021. [Google Scholar] [CrossRef]
- Mayne, D.Q.; Rawlings, J.B.; Rao, C.V.; Scokaert, P.O.M. Constrained model predictive control: Stability and optimality. Automatica 2000, 36, 789–814. [Google Scholar] [CrossRef]
- Schwenzer, M.; Ay, M.; Bergs, T.; Abel, D. Review on model predictive control: An engineering perspective. Int. J. Adv. Manuf. Technol. 2021, 117, 1327–1349. [Google Scholar] [CrossRef]
- Joshal, K.S.; Gupta, N. Microgrids with model predictive control: A critical review. Energies 2023, 16, 4851. [Google Scholar] [CrossRef]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef]
- Zhou, K.; Zhou, K.; Yang, S. Reinforcement learning-based scheduling strategy for energy storage in microgrid. J. Energy Storage 2022, 51, 104379. [Google Scholar] [CrossRef]
- Yu, P.; Zhang, H.; Song, Y.; Wang, Z.; Dong, H.; Ji, L. Safe reinforcement learning for power system control: A review. Renew. Sustain. Energy Rev. 2025, 223, 116022. [Google Scholar] [CrossRef]
- François-Lavet, V.; Taralla, D.; Ernst, D.; Fonteneau, R. Deep reinforcement learning solutions for energy microgrids management. In Proceedings of the European Workshop on Reinforcement Learning (EWRL 2016), Barcelona, Spain, 3–4 December 2016; Available online: https://dial.uclouvain.be/pr/boreal/object/boreal:177152 (accessed on 10 November 2025).
- Gao, Y.; Matsunami, Y.; Miyata, S.; Akashi, Y. Multi-agent reinforcement learning dealing with hybrid action spaces: A case study for off-grid oriented renewable building energy system. Appl. Energy 2022, 326, 120021. [Google Scholar] [CrossRef]
- Islam, K.S.; Dutta, A.; Mruthyunjaya, S. Hybrid ML–RL approach for smart grid stability prediction and optimized control strategy. arXiv 2025, arXiv:2508.19541. [Google Scholar]
- Ahsan, F.; Dana, N.H.; Sarker, S.K.; Li, L.; Muyeen, S.M.; Ali, M.F.; Tasneem, Z.; Hasan, M.M.; Abhi, S.H.; Islame, M.R.; et al. Data-driven next-generation smart grid towards sustainable energy evolution: Techniques and technology review. Prot. Control Mod. Power Syst. 2023, 8, 43. [Google Scholar] [CrossRef]
- Birkl, C.R.; Roberts, M.R.; Howey, D.A. Degradation diagnostics for lithium-ion cells. J. Power Sources 2017, 341, 373–386. [Google Scholar] [CrossRef]
- Fan, Y.; Xiao, F.; Li, C.; Yang, G.; Tang, X. A novel deep learning framework for state of health estimation of lithium-ion battery. J. Energy Storage 2020, 32, 101741. [Google Scholar] [CrossRef]
- Lin, X.; Xi, L.; Wang, Z. Battery degradation-aware energy management strategy with driving pattern severity factor feedback correction algorithm. J. Clean. Prod. 2024, 450, 141969. [Google Scholar] [CrossRef]
- Le, C.N.; Vinayagam, A.; Tran, P.T.; Stojcevski, S.; Dinh, T.N.; Stojcevski, A.; Chandran, J. State of health aware adaptive scheduling of battery energy storage system charging and discharging in rural microgrids using long short-term memory and convolutional neural networks. Energies 2025, 18, 5641. [Google Scholar] [CrossRef]
- Yi, H.; Du, Z.; Chen, H.; Zhang, K. Multi-objective optimization framework for PEMFC hybrid marine power systems: Integrating dynamic lifetime degradation and energy management. Ocean. Eng. 2025, 340, 122248. [Google Scholar] [CrossRef]
- Bai, Y.; Li, J.; He, H.; Dos Santos, R.C.; Yang, Q. Optimal design of a hybrid energy storage system in a plug-in hybrid electric vehicle for battery lifetime improvement. IEEE Access 2020, 8, 142148–142158. [Google Scholar] [CrossRef]
- Wang, L.; Li, Q.; Ding, R.; Sun, M.; Wang, G. Integrated scheduling of energy supply and demand in microgrids under uncertainty: A robust multi-objective optimization approach. Energy 2017, 130, 1–14. [Google Scholar] [CrossRef]
- Hasani, R.; Mohammadi, M.; Samanfar, A. Integrated multiobjective energy management for a smart microgrid incorporating electric vehicle charging stations and demand response programs under uncertainty. Int. J. Energy Res. 2025, 1, 9531493. [Google Scholar] [CrossRef]
- Zheng, Y.; Jia, J.; An, D. Energy management for microgrids with hybrid hydrogen–battery storage: A reinforcement learning framework integrated multi-objective dynamic regulation. Processes 2025, 13, 2558. [Google Scholar] [CrossRef]
- Pan, W.; Yu, X.; Guo, Z.; Qian, T.; Li, Y. Online EVs vehicle-to-grid scheduling coordinated with multi-energy microgrids: A deep reinforcement learning-based approach. Energies 2024, 17, 2491. [Google Scholar] [CrossRef]
- Korkas, C.D.; Tsaknakis, C.D.; Kapoutsis, A.C.; Kosmatopoulos, E. Distributed and multi-agent reinforcement learning framework for optimal electric vehicle charging scheduling. Energies 2024, 17, 3694. [Google Scholar] [CrossRef]
- Xie, H.; Song, G.; Shi, Z.; Zhang, J.; Lin, Z.; Yu, Q.; Fu, H.; Song, X.; Zhang, H. Reinforcement learning for vehicle-to-grid: A review. Adv. Appl. Energy 2025, 17, 100214. [Google Scholar] [CrossRef]
- Cavus, M.; Bell, M. Enabling smart grid resilience with deep learning-based battery health prediction in EV fleets. Batteries 2025, 11, 283. [Google Scholar] [CrossRef]
- Cavus, M.; Ayan, H.; Dissanayake, D.; Sharma, A.; Deb, S.; Bell, M. Forecasting electric vehicle charging demand in smart cities using hybrid deep learning of regional spatial behaviours. Energies 2025, 18, 3425. [Google Scholar] [CrossRef]
- Khayat, Y.; Shafiee, Q.; Heydari, R.; Naderi, M.; Dragičević, T.; Simpson-Porco, J.W.; Dörfler, F.; Fathi, M.; Blaabjerg, F.; Guerrero, J.M.; et al. On the secondary control architectures of AC microgrids: An overview. IEEE Trans. Power Electron. 2019, 35, 6482–6500. [Google Scholar] [CrossRef]
- Valakhorrasani, A. Electric Vehicle Charging Patterns. Kaggle Dataset. 2023. Available online: https://www.kaggle.com/datasets/valakhorasani/electric-vehicle-charging-patterns (accessed on 11 November 2025).
- Lin, Y.; Zhou, L.; Yan, J.; He, S. A hybrid data-driven model for state of health estimation of lithium-ion batteries with capacity recovery. Renew. Energy 2025, in press. [Google Scholar] [CrossRef]
- Apribowo, C.H.B.; Ashidqi, M.D.; Nizam, M.; Purwanto, A. Data-driven modeling of lithium-ion battery degradation using XGBoost with extended Kalman filter-based internal resistance correction. Results Eng. 2025, 28, 108100. [Google Scholar] [CrossRef]










| Parameter | Symbol/Value | Unit |
|---|---|---|
| Battery energy capacity | kWh | |
| Maximum charge/discharge power | kW | |
| Charge/discharge efficiency | Unitless | |
| Sampling interval | h | |
| Discount factor | - | |
| Learning rate | - | |
| Prediction horizon | Steps | |
| Price tariff range | GBP 0.10–0.26 | Per kWh |
| CO2 emission intensity | 0.30–0.55 | kg kWh−1 |
| Degradation scaling factor | Unitless | |
| PV temperature coefficient | per °C | |
| PV area | m2 | |
| PV nominal efficiency | Unitless | |
| Reinforcement-predictive weight | - | |
| Simulation horizon | h | |
| Time-varying electricity price | €/kWh | |
| Grid-imported power | kW |
| Method | Cost (GBP) | CO2 (kg) | SOH (–) | Renewable (%) | Peak (kW) |
|---|---|---|---|---|---|
| Baseline | 1.325 | 1.293 | 0.891 | 0.378 | 1.199 |
| SMPC | 0.914 | 0.915 | 0.941 | 0.397 | 0.898 |
| H-RPEM | 0.761 | 0.792 | 0.958 | 0.424 | 0.903 |
| Configuration | Cost (–) | CO2 (–) | SOH (–) |
|---|---|---|---|
| Full H-RPEM | 0.761 | 0.792 | 0.958 |
| No degradation term | 0.894 | 0.902 | 0.753 |
| No PV awareness | 0.869 | 0.887 | 0.881 |
| No health term | 0.894 | 0.902 | 0.902 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Cavus, M.; Bell, M. A Health-Aware Hybrid Reinforcement–Predictive Control Framework for Sustainable Energy Management in Photovoltaic–Electric Vehicle Microgrids. Batteries 2026, 12, 5. https://doi.org/10.3390/batteries12010005
Cavus M, Bell M. A Health-Aware Hybrid Reinforcement–Predictive Control Framework for Sustainable Energy Management in Photovoltaic–Electric Vehicle Microgrids. Batteries. 2026; 12(1):5. https://doi.org/10.3390/batteries12010005
Chicago/Turabian StyleCavus, Muhammed, and Margaret Bell. 2026. "A Health-Aware Hybrid Reinforcement–Predictive Control Framework for Sustainable Energy Management in Photovoltaic–Electric Vehicle Microgrids" Batteries 12, no. 1: 5. https://doi.org/10.3390/batteries12010005
APA StyleCavus, M., & Bell, M. (2026). A Health-Aware Hybrid Reinforcement–Predictive Control Framework for Sustainable Energy Management in Photovoltaic–Electric Vehicle Microgrids. Batteries, 12(1), 5. https://doi.org/10.3390/batteries12010005


