Coordinated Energy Storage Optimization for Power Quality in High-Renewable Distribution Networks
Abstract
1. Introduction
- IUF-Oriented Imbalance Mitigation: A global reward function is explicitly designed to minimize the Current Unbalance Factor (IUF) derived from symmetrical components, directly targeting phase current asymmetry associated with transformer thermal stress and neutral conductor loading.
- Physics-Informed Multi-Agent Learning Framework: A linearized DistFlow-based environment is embedded within the reinforcement learning framework, enabling agents to learn physically consistent control policies that account for branch impedance and power flow sensitivity in distribution networks.
- Centralized Training and Decentralized Execution (CTDE): The MADDPG architecture enables centralized offline training using global state information while allowing decentralized real-time execution based solely on locally measured states. This structure significantly reduces communication requirements and avoids repeated centralized optimization, improving the scalability and practical applicability of the proposed strategy for large-scale LVDNs.
2. System Framework
2.1. Overall System Structure
2.2. Establishment of the System Model
- (1)
- Power balance and power flow constraints
- (2)
- Energy storage constraints
- (3)
- Optimization Model
3. Three-Phase Unbalance Optimization Based on MADRL
3.1. Markov Decision Process Model
3.2. Constraint Enforcement in the RL Framework
3.3. MADDPG Algorithm
| Algorithm 1: Multi-agent energy storage scheduling based on MADDPG | |
| 1 | Initialize Env, the multi-agent Actor–Critic networks, and the experience replay memory |
| 2 | For to , do: |
| 3 | for all agents. |
| 4 | to , do: |
| 5 | , do: |
| 6 | |
| 7 | End for |
| 8 | in the environment. |
| 9 | Observe next state, reward rᵢᵗ for each agent, and store transition in 𝓡. |
| 10 | If the number of samples in 𝓡 > batch size B then: |
| 11 | Sample a mini-batch of transitions from the replay memory 𝓡 |
| 12 | For each agent i do |
| 13 | |
| 14 | |
| 15 | Update actor policy using the sampled policy gradient |
| 16 | End for |
| 17 | Update target networks |
| 18 | End if |
| 19 | End For |
| 20 | End for |
4. Case Study
4.1. Parameter Settings
4.2. Example Result
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| PV | Photovoltaic |
| PCC | Point of common coupling |
| MADDPG | Multi-agent deep deterministic policy gradient |
| BESS | Battery energy storage systems |
| IUF | Current unbalance factor |
| DG | Distributed generation |
| LVDNs | Low-voltage distribution networks |
| DER | Distributed energy resource |
| OPF | Optimal power flow |
| VUF | Voltage unbalance factor |
| CTDE | Centralized training and decentralized execution |
| State of charge of battery energy storage | |
| Current unbalance factor | |
| Ramp-rate limit of storage unit | |
| Nominal voltage magnitude | |
| Line reactance | |
| Line resistance | |
| PCC phase current | |
| Positive sequence current | |
| Negative sequence current |
References
- Hamza, E.A.; Sedhom, B.E.; Badran, E.A. Impact and Assessment of the Overvoltage Mitigation Methods in Low-voltage Distribution Networks with Excessive Penetration of PV Systems: A Review. Int. Trans. Electr. Energ. Syst. 2021, 31, e13161. [Google Scholar] [CrossRef]
- Licari, J.; Rhaili, S.E.; Micallef, A.; Staines, C.S. Addressing Voltage Regulation Challenges in Low Voltage Distribution Networks with High Renewable Energy and Electrical Vehicles: A Critical Review. Energy Rep. 2025, 14, 2977–2997. [Google Scholar] [CrossRef]
- Gabdullin, Y.; Azzopardi, B. Impacts of Photovoltaics in Low-Voltage Distribution Networks: A Case Study in Malta. Energies 2022, 15, 6731. [Google Scholar] [CrossRef]
- Hungbo, M.; Gu, M.; Meegahapola, L.; Littler, T.; Bu, S. Impact of Electric Vehicles on Low-voltage Residential Distribution Networks: A Probabilistic Analysis. IET Smart Grid 2023, 6, 536–548. [Google Scholar] [CrossRef]
- Helm, S.; Hauer, I.; Wolter, M.; Wenge, C.; Balischewski, S.; Komarnicki, P. Impact of Unbalanced Electric Vehicle Charging on Low-Voltage Grids. In Proceedings of the 2020 IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe); IEEE: New York, NY, USA, 2020; pp. 665–669. [Google Scholar]
- Ibrahim, I.A.; Hossain, M.J. Low Voltage Distribution Networks Modeling and Unbalanced (Optimal) Power Flow: A Comprehensive Review. IEEE Access 2021, 9, 143026–143084. [Google Scholar] [CrossRef]
- Mukwekwe, L.; Venugopal, C.; Davidson, I.E. A Review of the Impacts and Mitigation Strategies of High PV Penetration in Low Voltage Networks. In 2017 IEEE PES Power Africa; IEEE: New York, NY, USA, 2017; pp. 274–279. [Google Scholar]
- Zeraati, M.; Golshan, M.E.H.; Guerrero, J.M. Voltage Quality Improvement in Low Voltage Distribution Networks Using Reactive Power Capability of Single-Phase PV Inverters. IEEE Trans. Smart Grid 2018, 10, 5057–5065. [Google Scholar] [CrossRef]
- Ju, Y.; Liu, W.; Zhang, Z.; Zhang, R. Distributed Three-Phase Power Flow for AC/DC Hybrid Networked Microgrids Considering Converter Limiting Constraints. IEEE Trans. Smart Grid 2022, 13, 1691–1708. [Google Scholar] [CrossRef]
- Pratama, N.A.; Rahmawati, Y. Evaluation of Unbalanced Load Impacts on Distribution Transformer Performances. Front. Energy Syst. Power Eng. 2020, 2, 28–35. [Google Scholar]
- Moses, P.S.; Masoum, M.A. Three-Phase Asymmetric Transformer Aging Considering Voltage-Current Harmonic Interactions, Unbalanced Nonlinear Loading, Magnetic Couplings, and Hysteresis. IEEE Trans. Energy Convers. 2012, 27, 318–327. [Google Scholar] [CrossRef]
- Ulinuha, A.; Sari, E.M. The Influence of Harmonic Distortion on Losses and Efficiency of Three-Phase Distribution Transformer. In Proceedings of the Journal of Physics: Conference Series; IOP Publishing: Bristol, UK, 2021; Volume 1858, p. 012084. [Google Scholar]
- Chicco, G.; Pons, E.; Russo, A.; Spertino, F.; Porumb, R.; Postolache, P.; Toader, C. Assessment of Unbalance and Distortion Components in Three-Phase Systems with Harmonics and Interharmonics. Electr. Power Syst. Res. 2017, 147, 201–212. [Google Scholar] [CrossRef]
- Wang, J.; Li, D. A Three-Phase-Unbalance Mitigation Strategy Based on the Synergy of Intelligent Phase-Shifting Switches and Flexible Self-Balancing Switches. Electronics 2026, 15, 776. [Google Scholar] [CrossRef]
- Babu, M.N.; Dhal, P.K. Impact of Load Flow and Network Reconfiguration for Unbalanced Distribution Systems. Meas. Sens. 2024, 32, 101078. [Google Scholar] [CrossRef]
- Lotfi, H.; Hajiabadi, M.E.; Parsadust, H. Power Distribution Network Reconfiguration Techniques: A Thorough Review. Sustainability 2024, 16, 10307. [Google Scholar] [CrossRef]
- Almeida, D.; Pasupuleti, J.; Ekanayake, J. Comparison of Reactive Power Control Techniques for Solar PV Inverters to Mitigate Voltage Rise in Low-Voltage Grids. Electronics 2021, 10, 1569. [Google Scholar] [CrossRef]
- Aboshady, F.M.; Pisica, I.; Zobaa, A.F.; Taylor, G.A.; Ceylan, O.; Ozdemir, A. Reactive Power Control of PV Inverters in Active Distribution Grids with High PV Penetration. IEEE Access 2023, 11, 81477–81496. [Google Scholar] [CrossRef]
- Nour, A.M.M.; Hatata, A.Y.; Helal, A.A.; El-Saadawi, M.M. Review on Voltage-violation Mitigation Techniques of Distribution Networks with Distributed Rooftop PV Systems. IET Gener. Transm. Distrib. 2020, 14, 349–361. [Google Scholar] [CrossRef]
- Alzahrani, A.; Alharthi, H.; Khalid, M. Minimization of Power Losses through Optimal Battery Placement in a Distributed Network with High Penetration of Photovoltaics. Energies 2019, 13, 140. [Google Scholar] [CrossRef]
- Aljohani, T.M.; Saad, A.; Mohammed, O.A. Two-Stage Optimization Strategy for Solving the VVO Problem Considering High Penetration of Plug-In Electric Vehicles to Unbalanced Distribution Networks. IEEE Trans. Ind. Appl. 2021, 57, 3425–3440. [Google Scholar] [CrossRef]
- Shang, X.-J.; Mishra, Y.; Yang, Y.; Liu, J.-L.; Yu, Z.-G.; Tian, Y.-C. Optimal Coordinated Scheduling of Electric Vehicles and Battery Energy Storage Systems. IEEE Trans. Consum. Electron. 2025, 71, 2944–2954. [Google Scholar] [CrossRef]
- Zamzam, T.; Shaban, K.; Gaouda, A.; Massoud, A. Performance Assessment of Two-Timescale Multi-Objective Volt/Var Optimization Scheme Considering EV Charging Stations, BESSs, and RESs in Active Distribution Networks. Electr. Power Syst. Res. 2022, 207, 107843. [Google Scholar] [CrossRef]
- Zhang, D.; Shafiullah, G.M.; Das, C.K.; Wong, K.W. Optimal Allocation of Battery Energy Storage Systems to Enhance System Performance and Reliability in Unbalanced Distribution Networks. Energies 2023, 16, 7127. [Google Scholar] [CrossRef]
- Nazir, N.; Almassalkhi, M. Stochastic Multi-Period Optimal Dispatch of Energy Storage in Unbalanced Distribution Feeders. Electr. Power Syst. Res. 2020, 189, 106783. [Google Scholar] [CrossRef]
- Chicco, G.; Mazza, A. 100 Years of Symmetrical Components. Energies 2019, 12, 450. [Google Scholar] [CrossRef]
- Gan, L.; Low, S.H. Convex Relaxations and Linear Approximation for Optimal Power Flow in Multiphase Radial Networks. In Proceedings of the 2014 Power Systems Computation Conference, Wroclaw, Poland, 18–22 August 2014; pp. 1–9. [Google Scholar]
- Ye, Y.; Wang, H.; Chen, P.; Tang, Y.; Strbac, G. Safe Deep Reinforcement Learning for Microgrid Energy Management in Distribution Networks with Leveraged Spatial–Temporal Perception. IEEE Trans. Smart Grid 2023, 14, 3759–3775. [Google Scholar] [CrossRef]
- Wang, W.; Yu, N.; Gao, Y.; Shi, J. Safe Off-Policy Deep Reinforcement Learning Algorithm for Volt-VAR Control in Power Distribution Systems. IEEE Trans. Smart Grid 2020, 11, 3008–3018. [Google Scholar] [CrossRef]
- Bahrami, S.; Chen, Y.C.; Wong, V.W.S. Deep Reinforcement Learning for Demand Response in Distribution Networks. IEEE Trans. Smart Grid 2021, 12, 1496–1506. [Google Scholar] [CrossRef]
- Pinthurat, W.; Hredzak, B. Multi-Agent Deep Reinforcement Learning for Mitigation of Unbalanced Active Powers Using Distributed Batteries in Low Voltage Residential Distribution System. Electr. Power Syst. Res. 2025, 245, 111599. [Google Scholar] [CrossRef]
- Zhang, Z.; Zhang, D.; Qiu, R.C. Deep Reinforcement Learning for Power System Applications: An Overview. CSEE J. Power Energy Syst. 2020, 6, 213–225. [Google Scholar] [CrossRef]
- Sun, X.; Qiu, J. Two-Stage Volt/Var Control in Active Distribution Networks with Multi-Agent Deep Reinforcement Learning Method. IEEE Trans. Smart Grid 2021, 12, 2903–2912. [Google Scholar] [CrossRef]
- Wang, Y.; Xiao, M.; You, Y.; Poor, H.V. Optimized Energy Dispatch for Microgrids with Distributed Reinforcement Learning. IEEE Trans. Smart Grid 2024, 15, 2946–2956. [Google Scholar] [CrossRef]
- Xu, H.; Sun, H.; Nikovski, D.; Kitamura, S.; Mori, K.; Hashimoto, H. Deep Reinforcement Learning for Joint Bidding and Pricing of Load Serving Entity. IEEE Trans. Smart Grid 2019, 10, 6366–6375. [Google Scholar] [CrossRef]
- Fan, J.; Liebman, A.; Wang, H. Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network. In Proceedings of the 2024 IEEE Power & Energy Society General Meeting (PESGM), Seattle, WA, USA, 21–25 July 2024; pp. 1–5. [Google Scholar]
- Zhang, Y.; Wang, J. GEFCom2014 Probabilistic Solar Power Forecasting Based on K-Nearest Neighbor and Kernel Density Estimator. In Proceedings of the 2015 IEEE Power & Energy Society General Meeting, Denver, CO, USA, 26–30 July 2015; pp. 1–5. [Google Scholar]










| Literature | Centralized OPF | Distributed BESS | RL-Based | Direct IUF Objective | CTDE | Distributed Real-Time Execution |
|---|---|---|---|---|---|---|
| [14,15,16] | ✓ | ✕ | ✕ | ✕ | ✕ | ✕ |
| [23,25] | ✓ | ✓ | ✕ | △ | ✕ | ✕ |
| [21,22] | ✓ | ✓ | ✕ | △ | ✕ | ✕ |
| [27] | △ | ✓ | ✕ | △ | ✕ | △ |
| [26,27,28] | ✕ | △ | ✕ | ✕ | △ | ✓ |
| [32,33,34] | ✕ | △ | ✓ | ✕ | △ | ✓ |
| [31] | ✕ | ✓ | ✓ | △ | ✕ | ✓ |
| This work | ✕ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Agent No. | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| Node | 1 | 4 | 10 | 12 | 17 | 19 |
| Phase | A | B | A | B | B | C |
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Learning Rate | 1 × 10−3 | Discount Factor | 0.95 |
| Soft Update Coefficient | 0.01 | Replay Buffer | 100,000 |
| Actor network layers | 3 | Actor network neurons | 256 |
| Critic network layers | 3 | Critic network neurons | 512 |
| Actor activation function | Relu/ Tanh | Critic activation function | Relu |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Duan, R.; Jiang, Y.; Zhu, X.; Song, X.; Luo, J.; Jia, Y. Coordinated Energy Storage Optimization for Power Quality in High-Renewable Distribution Networks. Energies 2026, 19, 2373. https://doi.org/10.3390/en19102373
Duan R, Jiang Y, Zhu X, Song X, Luo J, Jia Y. Coordinated Energy Storage Optimization for Power Quality in High-Renewable Distribution Networks. Energies. 2026; 19(10):2373. https://doi.org/10.3390/en19102373
Chicago/Turabian StyleDuan, Ruiqin, Yan Jiang, Xinchun Zhu, Xiaolong Song, Junjie Luo, and Youwei Jia. 2026. "Coordinated Energy Storage Optimization for Power Quality in High-Renewable Distribution Networks" Energies 19, no. 10: 2373. https://doi.org/10.3390/en19102373
APA StyleDuan, R., Jiang, Y., Zhu, X., Song, X., Luo, J., & Jia, Y. (2026). Coordinated Energy Storage Optimization for Power Quality in High-Renewable Distribution Networks. Energies, 19(10), 2373. https://doi.org/10.3390/en19102373
