Reinforcement Learning-Based Energy Management for Sustainable Electrified Urban Transportation with Renewable Energy Integration: A Case Study of Alexandria, Egypt
Abstract
1. Introduction
1.1. Motivation
1.2. Literature Review
1.3. Contributions
- A comprehensive urban transportation energy system model is developed that integrates metro station loads, E-bus station, EV parking, PV generation, hydrogen production, and grid interaction within a unified operational framework.
- A renewable-aware energy utilization strategy is proposed, enabling surplus PV power to be effectively allocated to internal loads, EV and E-bus charging, or hydrogen production that fuels a set of gas turbines, thereby reducing energy wastage and grid dependence.
- The energy management problem is formulated as a factored Markov decision process (FMDP), capturing the interactions among heterogeneous subsystems while maintaining computational tractability.
- A multi-agent reinforcement learning–based coordination strategy is developed to jointly manage demand-side flexibility and distributed energy resources under stochastic operating conditions.
2. Methodology
2.1. System Model
2.1.1. PV System
2.1.2. Electrolyzer
2.1.3. Electric Vehicles Model
2.1.4. Hydrogen Fueled Gas Turbines
2.2. Problem Formulation
2.2.1. Cost Functions
2.2.2. Operational Constraints
2.2.3. FMDP Formulation
- States
- Actions
- Reward
2.2.4. Multi-Agent RL Algorithm
2.3. Methodological Scope and Limitations
3. Case Study
3.1. System Description
3.1.1. Metro Line Stations and Way-Sides
3.1.2. E-Bus Station
3.1.3. EVs Parking Garage
3.1.4. Green-Hydrogen Generating Station & HGT
3.2. PV System Integration and Optimal Energy Management
4. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Nomenclature
| ηPV | the efficiency of the PV array |
| the maximum PV power (MW) | |
| SRt | the solar radiation at time t (W/m2) |
| SRSTC | the solar radiation under standard testing conditions (STC) (W/m2) |
| tc | the temperature coefficient of the maximum output power (1/K) |
| the temperature of the PV cell at time t (K) | |
| the temperature of the PV cell under STC (K) | |
| the ambient temperature at time t (K) | |
| the electrolyzer’s output power (MW) | |
| the PV power input to the electrolyzer (EL) at time t (MW) | |
| , | the maximum and minimum power input to the EL (MW) |
| the conversion efficiency of the EL | |
| the EV’s factor of willingness | |
| a factor representing the next journey of the ith EV | |
| the maximum capacity of an EV’s battery (MWh) | |
| the state of charge (SOC) of an EV at time t as percentage of its battery’s maximum capacity | |
| , | the percentage maximum and minimum SOC of an EV at time t, respectively |
| , | the charging and discharging efficiency of an EV, respectively |
| , | the charging and discharging power of an EV (MW), respectively |
| , | the charging and discharging rates of an EV (MW/h), respectively |
| , | the EV’s arrival and departure time, respectively |
| , | the starting and stopping time of EV’s charging, respectively |
| , | the starting and stopping time of EV’s discharging, respectively |
| the efficiency of the hydrogen-fueled gas turbine (HGT) | |
| the output power of the HGT (MW) | |
| , | the maximum and minimum power input to a HGT (MW), respectively |
| , | the minimum and maximum power output of a HGT (MW), respectively |
| , | the electric power bought and sold (MW), respectively |
| , | the energy buying and selling prices ($/MWh) |
| the dissatisfaction cost of nth load at time t ($) | |
| the dissatisfaction energy price at time t ($/MWh) | |
| the dissatisfaction coefficient | |
| the consumed power of the nth load at time t (MW) | |
| the rated power of the nth load at time t (MW) | |
| , | the power bought from and sold to the electricity grid at time t (MW) |
| , | the power traded with EVs (MW) |
| the regenerative braking power (MW) | |
| the loads power at time t (MW) | |
| the power traded with the transportation system at time t (MW) | |
| the power of the lighting loads at time t (MW) | |
| the power of the HVAC loads at time t (MW) |
References
- Bowen, G.; Yang, H.; Zhang, T.; Liu, X.; Wang, X. Technoeconomic analysis of rooftop PV system in elevated metro station for cost-effective operation and clean electrification. Renew. Energy 2024, 226, 120305. [Google Scholar] [CrossRef]
- El-Zonkoly, A. Optimal P2P based energy trading of flexible smart inter-city electric traction system and a wayside network: A case study in Alexandria, Egypt. Electr. Power Syst. Res. 2023, 223, 109708. [Google Scholar] [CrossRef]
- Li, X.; Zhao, Y.; Zhang, W.; Wang, F.; Yin, W.; Liu, K. Photovoltaic potential prediction and techno-economic analysis of China railway stations. Energy Rep. 2023, 10, 3696–3710. [Google Scholar] [CrossRef]
- Shen, X.; Wei, H.; Wei, L. Study of trackside photovoltaic power integration into the traction power system of suburban elevated urban rail transit line. Appl. Energy 2020, 260, 114177. [Google Scholar] [CrossRef]
- Laimon, M.; Yusaf, T. Towards energy freedom: Exploring sustainable solutions for energy independence and self-sufficiency using integrated renewable energy-driven hydrogen system. Renew. Energy 2024, 222, 119948. [Google Scholar] [CrossRef]
- Guan, B.; Liu, X.; Zhang, T.; Wang, X. Hourly energy consumption characteristics of metro rail transit: Train traction versus station operation. Energy Built Environ. 2023, 4, 568–575. [Google Scholar] [CrossRef]
- Yu, Y.; You, S.; Ye, T.; Wang, Y.; Guo, X.; Chen, W.; Liu, T.; Wei, S.; Na, Y. Characteristics and assessment of the electricity consumption of metro systems: A case study of Tianjin, China. Energy Sci. Eng. 2023, 11, 4408–4420. [Google Scholar] [CrossRef]
- Guan, B.; Yang, H.; Li, H.; Gao, H.; Zhang, T.; Liu, X. Energy consumption characteristics and rooftop photovoltaic potential assessment of elevated metro station. Sustain. Cities Soc. 2023, 99, 104928. [Google Scholar] [CrossRef]
- Savaş, S.; Külahcı, K. Machine Learning-Based Energy Consumption and Carbon Footprint Forecasting in Urban Rail Transit Systems. Appl. Sci. 2026, 16, 1369. [Google Scholar] [CrossRef]
- Gad, K.; Tonini, F.; Agati, G.; Borello, D.; Colombo, E. Energy demand prediction and scenario analysis for vehicle traction: A bottom-up approach applied to the Italian railway system. Energy Rep. 2025, 13, 4196–4208. [Google Scholar] [CrossRef]
- Tang, Z.; Yin, H.; Yang, C.; Yu, J.; Guo, H. Predicting the electricity consumption of urban rail transit based on binary nonlinear fitting regression and support vector regression. Sustain. Cities Soc. 2021, 66, 102690. [Google Scholar] [CrossRef]
- He, D.; Teng, X.; Chen, Y.; Liu, B.; Wu, J. Piston wind and energy saving based on the analysis of fresh air in the subway system. Sustain. Energy Technol. Assess. 2022, 50, 101805. [Google Scholar] [CrossRef]
- Cheng, Y.; Fang, C.; Yuan, J.; Zhu, L. Design and application of a smart lighting system based on distributed wireless sensor networks. Appl. Sci. 2020, 10, 8545. [Google Scholar] [CrossRef]
- Akbari, S.; Hashemi-Dezaki, H.; Fazel, S. Optimal clustering-based operation of smart railway stations considering uncertainties of renewable energy sources and regenerative braking energies. Electr. Power Syst. Res. 2022, 213, 108744. [Google Scholar] [CrossRef]
- Gabaldón, A.; García-Garre, A.; Ruiz-Abellón, M.C.; Guillamón, A.; Molina, R.; Medina, J. Management of railway power system peaks with demand-side resources: An application to periodic timetables. Sustainability 2023, 15, 2746. [Google Scholar] [CrossRef]
- Mohajer, B.K.; Mousavi, S.M.G. Energy management optimization in smart railway stations with the ability to charge plug-in hybrid electric vehicles. J. Energy Storage 2023, 70, 107867. [Google Scholar] [CrossRef]
- Vamvakas, D.; Michailidis, P.; Korkas, C.; Kosmatopoulos, E. Review and Evaluation of Reinforcement Learning Frameworks on Smart Grid Applications. Energies 2023, 16, 5326. [Google Scholar] [CrossRef]
- Li, H.; Dai, X.; Goldrick, S.; Kotter, R.; Aslam, N.; Ali, S. Reinforcement Learning for EV Fleet Smart Charging with On-Site Renewable Energy Sources. Energies 2024, 17, 5442. [Google Scholar] [CrossRef]
- Liu, X.; Yeh, S.; Plötz, P.; Ma, W.; Li, F.; Ma, X. Electric bus charging scheduling problem considering charging infrastructure integrated with solar photovoltaic and energy storage systems. Transp. Res. Part E 2024, 187, 103572. [Google Scholar] [CrossRef]
- Liu, X.; Liu, X.C.; Xie, C.; Ma, X. Impacts of photovoltaic and energy storage system adoption on public transport: A simulation-based optimization approach. Renew. Sustain. Energy Rev. 2023, 181, 113319. [Google Scholar] [CrossRef]
- Jarvis, P.; Climent, L.; Arbelaez, A. Smart and sustainable scheduling of charging events for electric buses. TOP 2024, 32, 22–56. [Google Scholar] [CrossRef]
- Wu, F.; Yang, J.; Li, B.; Crisostomi, E.; Rafiq, H.; Rashed, G.I. Uncertain scheduling potential of charging stations under multi-attribute uncertain charging decisions of electric vehicles. Appl. Energy 2024, 374, 124036. [Google Scholar] [CrossRef]
- Fan, P.; Bu, S.; Li, S.; Fang, S.; Zhang, C.; Ke, S. Resilience enhancement strategy for multi-microgrids with electric vehicles under extreme weather conditions. CSEE J. Power Energy Syst. 2026, 1–12. [Google Scholar] [CrossRef]
- Ke, S.; Zhang, K.; Mai, W.; Guo, R.; He, S.; Tian, J.; Chung, C.Y. Maximizing intra-day V2G feasible capacity of EVs: A cross-disciplinary approach with traffic flow and prospect theory. IEEE Trans. Transp. Electr. 2026. [Google Scholar] [CrossRef]
- Guan, B.; Li, H.; Yang, H.; Zhang, T.; Liu, X.; Wang, X. Leveraging cost-effectiveness of photovoltaic-battery system in metro station under time-of-use pricing tariff. J. Clean. Prod. 2024, 434, 140268. [Google Scholar] [CrossRef]
- Taneja, S.; Jain, A.; Bhadoriya, Y. Green Hydrogen as a Clean Energy Resource and Its Applications as an Engine Fuel. Eng. Proc. 2023, 59, 159. [Google Scholar] [CrossRef]
- Yang, Y.; Xu, X.; Luo, Y.; Liu, J.; Hu, W. Distributionally robust planning method for expressway hydrogen refueling station powered by a wind–PV system. Renew. Energy 2024, 225, 120210. [Google Scholar] [CrossRef]
- Ahammed, T.; Khan, I. Ensuring power quality and demand-side management through IoT-based smart meters in a developing country. Energy 2022, 250, 123747. [Google Scholar] [CrossRef]
- Egypt: Alexandria–Abou Qir Metro Line. Available online: https://www.aiib.org/en/projects/details/2022/approved/Egypt-Alexandria-Abou-Qir-Metro-Line.html (accessed on 31 December 2025).
- Mwasalat Misr Company. Available online: https://mwasalatmisr.com/ (accessed on 31 December 2025).














| Ref. | PV System | EVs | E-Buses | Green-Hydrogen | EM | Uncertainties | Elevated Railway Stations | ||
|---|---|---|---|---|---|---|---|---|---|
| RES | Energy Price | Loads | |||||||
| [1] | √ | ● | ● | ● | ● | ● | ● | ● | √ |
| [2] | √ | √ | ● | ● | √ | √ | √ | ● | √ |
| [3] | √ | ● | ● | ● | ● | ● | ● | ● | √ |
| [4] | √ | ● | ● | ● | √ | ● | ● | ● | √ |
| [8] | √ | ● | ● | ● | √ | ● | ● | ● | √ |
| [14] | √ | ● | ● | ● | √ | √ | ● | ● | ● |
| [15] | ● | ● | ● | ● | √ | ● | ● | ● | ● |
| [16] | √ | √ | ● | ● | √ | √ | ● | ● | √ |
| [25] | √ | ● | ● | ● | √ | ● | ● | ● | √ |
| This paper | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| ηPV | 95% |
| 200 W/1.6 m2 | |
| SRSTC | 1000 W/m2 |
| tc | −0.41%/°C |
| 25 °C |
| Batteries’ Rated Capacity (kWh) | (kW) | (%) | (%) | Mean | Standard Deviation | |||
|---|---|---|---|---|---|---|---|---|
| Arrival | Departure | Arrival | Departure | |||||
| 8 | 1.6 | 1–3 | 95 | 20 | 6 a.m. | 8 p.m. | 60 min | 60 min |
| 17 | 3.4 | |||||||
| 18 | 3.6 | |||||||
| 48 | 9.6 | |||||||
| Agent ID | Action Set |
|---|---|
| [0.75, 0.85, 0.95, 1] of | |
| [0.7, 0.8, 0.9, 1] of | |
| [0.6, 0.7, 0.8, 0.9] of | |
| [0.25, 0.35, 0.45, 0.55] of | |
| [0.6, 0.7, 0.8, 0.9] of | |
| [0.25, 0.35, 0.45, 0.55] of |
| Energy Traded | Energy Cost ($) | |||
|---|---|---|---|---|
| Scenario 1 | Scenario 2 | Scenario 3 | ||
| UTS and UG | Sold | 4.7533 × 103 | 7.724 × 103 | 7.5557 × 103 |
| bought | 5.7568 × 103 | 3.4427 × 103 | 3.4297 × 103 | |
| EVs and UG | Sold | 136.5771 | 11.8405 | 3.1739 |
| bought | 450.8924 | 263.7876 | 11.3364 | |
| EVs and UTS | Sold | X | X | 10.4 |
| bought | X | X | 201.961 | |
| EVs internal trading | X | 168.3943 | 168.3943 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
El-Zonkoly, A. Reinforcement Learning-Based Energy Management for Sustainable Electrified Urban Transportation with Renewable Energy Integration: A Case Study of Alexandria, Egypt. Sustainability 2026, 18, 2352. https://doi.org/10.3390/su18052352
El-Zonkoly A. Reinforcement Learning-Based Energy Management for Sustainable Electrified Urban Transportation with Renewable Energy Integration: A Case Study of Alexandria, Egypt. Sustainability. 2026; 18(5):2352. https://doi.org/10.3390/su18052352
Chicago/Turabian StyleEl-Zonkoly, Amany. 2026. "Reinforcement Learning-Based Energy Management for Sustainable Electrified Urban Transportation with Renewable Energy Integration: A Case Study of Alexandria, Egypt" Sustainability 18, no. 5: 2352. https://doi.org/10.3390/su18052352
APA StyleEl-Zonkoly, A. (2026). Reinforcement Learning-Based Energy Management for Sustainable Electrified Urban Transportation with Renewable Energy Integration: A Case Study of Alexandria, Egypt. Sustainability, 18(5), 2352. https://doi.org/10.3390/su18052352

