Deep Reinforcement Learning-Based Multi-Objective Optimization for Virtual Power Plants and Smart Grids: Maximizing Renewable Energy Integration and Grid Efficiency
Abstract
1. Introduction
1.1. Characteristics of VPPs
1.2. Characteristics of Smart Grids
1.3. The Relationship Between VPPs and Smart Grids
1.4. Research Innovation Points
2. Literature Review
2.1. Review of Technological Advances and Future Development of Smart Grids
2.2. A Review of Multi-Dimensional Issues and Optimization Strategies for VPPs
3. AI-Optimized Integrated Optimization Algorithm
3.1. Integrated Optimization Policy
3.2. Optimization Objectives and Constraints
3.3. Power Grid Optimization Research
3.4. Optimization Algorithms
3.5. Application of AI Technology in Optimization Algorithms
3.6. Comparison of Existing Optimization Algorithms
4. AI-Based Integrated Optimization Algorithm for VPPs
4.1. Algorithm Design Framework
4.2. Data Acquisition and Processing
4.3. Algorithm Implementation and Testing
4.4. Application and Prospect of Emerging Technologies
5. Experimental Results
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Bo, Q.; Zhang, X.; Cao, X.; Zhang, J.; Qiu, X.; Wu, Y. Robust Optimal Dispatching of Power Grid with the Participation of Virtual Power Plants. J. Phys. Conf. Ser. 2024, 2788, 012023. [Google Scholar] [CrossRef]
- Tang, X.; Wang, J.; Wang, Y.; Wan, Y. The Optimization of Supply–Demand Balance Dispatching and Economic Benefit Improvement in a Multi-Energy Virtual Power Plant within the Jiangxi Power Market. Energies 2024, 17, 4691. [Google Scholar] [CrossRef]
- Jia, Y.; Lai, C.S.; Xu, Z.; Chai, S. Adaptive Partitioning Approach to Self-Sustained Smart Grid. IET Gener. Transm. Distrib. 2017, 11, 485–494. [Google Scholar] [CrossRef]
- Kiasari, M.M.; Aly, H.H. A Proposed Controller for Real-Time Management of Electrical Vehicle Battery Fleet with MATLAB/SIMULINK. J. Energy Storage 2024, 99, 113235. [Google Scholar] [CrossRef]
- Wang, L.; Li, D. Optimizing Domestic Energy Management with a Wild Mice Colony-Inspired Algorithm: Enhancing Efficiency and Coordination in Smart Grids through Dynamic Distributed Energy Storage. Heliyon 2024, 10, e35462. [Google Scholar] [CrossRef]
- Khemakhem, S.; Rekik, M.; Krichen, L. Home Energy Management Based on Plug-in Electric Vehicle Power Control in a Residential Smart Grid. Int. J. Digit. Signals Smart Syst. 2019, 3, 173–186. [Google Scholar] [CrossRef]
- Abdelkader, S.; Amissah, J.; Abdel-Rahim, O. Virtual Power Plants: An In-Depth Analysis of Their Advancements and Importance as Crucial Players in Modern Power Systems. Energy Sustain. Soc. 2024, 14, 52. [Google Scholar] [CrossRef]
- Ji, X.; Li, C.; Wang, J.; Wang, Y.; Hou, F.; Guo, S. Energy Management Optimization Strategy of Virtual Power Plant Based on Deep Reinforcement Learning. J. Phys. Conf. Ser. 2022, 2384, 012041. [Google Scholar] [CrossRef]
- Tian, L.; Cheng, L.; Guo, J.; Wang, X.; Yun, Q.; Gao, W. A Review on the Study of Management and Interaction Mechanism for Distributed Energy in Virtual Power Plants. Power Syst. Technol. 2020, 44, 2097–2108. [Google Scholar] [CrossRef]
- Hongying, L. Research on the Application of Artificial Intelligence Technology in Power System Intelligent Dispatching Automation. J. Phys. Conf. Ser. 2021, 2083, 042047. [Google Scholar] [CrossRef]
- Li, Z. The Application of Internet of Things Technology in Smart Grids. Integr. Circuit Appl. 2023, 40, 194–195. [Google Scholar]
- Yu, Y. A Brief Description of the Basics of the Smart Grid. J. Tianjin Univ. Sci. Technol. 2020, 53, 551–556. [Google Scholar] [CrossRef]
- Deng, J.; Jiang, F.; Tu, C. Study of NIST’s interoperable smart grid technology architecture. Power Syst. Prot. Control. 2020, 48, 9–21. [Google Scholar]
- Lida, H.; Yu, C.; Yong, W. Enhancing dynamic energy network management using a multiagent cloud-fog structure. Renew. Sustain. Energy Rev. 2022, 162, 112439. [Google Scholar] [CrossRef]
- Ebrie, A.S.; Kim, Y.J. Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems. Systems 2024, 12, 106. [Google Scholar] [CrossRef]
- Foroushan, A.S.A.; Leila, B.; Sasan, P.; Mohammadali, N.; Matti, L. A New two-layer model for energy management in the smart distribution network containing flexi-renewable virtual power plant. Electr. Power Syst. Res. 2021, 194, 107085. [Google Scholar] [CrossRef]
- Sreenivasulu, G.; Balakrishna, P. Optimal Dispatch of Renewable and Virtual Power Plants in Smart Grid Environment through Bilateral Transactions. Electr. Power Compon. Syst. 2021, 49, 488–503. [Google Scholar] [CrossRef]
- Zhu, S.; Wang, H. Paillier-Based Data Aggregation and Stimulation Scheme in the Smart Grids. Comput. Eng. 2021, 47, 166–174. [Google Scholar] [CrossRef]
- Li, K. Research on Data Aggregation and User Query Privacy Protection in Smart Grid. Ph.D. Thesis, North China Electric Power University (Beijing), Beijing, China, 2023. Available online: https://link.cnki.net/doi/10.27140/d.cnki.ghbbu.2023.000163 (accessed on 1 September 2024).
- Wiezorek, C.; Backe, C.; Werner, S.; Firvida, M.B.; Wulkow, C.; Strunz, K. Validating Algorithms for Flexible Load Control in a Smart Grid Laboratory Environment. In Proceedings of the 2021 IEEE PES Innovative Smart Grid Technologies Europe (ISGT Europe), Espoo, Finland, 18–21 October 2021; 2021; pp. 1–5. [Google Scholar] [CrossRef]
- Xiong, Y. Smart Grid Based on Bi-Level Programming False Data Injection Attacks Research. Ph.D. Thesis, Chongqing University of Posts and Telecommunications, Chongqing, China, 2022. Available online: https://link.cnki.net/doi/10.27675/d.cnki.gcydx.2022.000487 (accessed on 1 September 2023).
- Iqbal, S.; Sarfraz, M.; Ayyub, M.; Tariq, M.; Chakrabortty, R.K.; Ryan, M.J.; Alamri, B. A Comprehensive Review on Residential Demand Side Management Strategies in Smart Grid Environment. Sustainability 2021, 13, 7170. [Google Scholar] [CrossRef]
- Akbari, E.; Naghibi, A.F.; Veisi, M.; Shahparnia, A.; Pirouzi, S. Multi-objective economic operation of smart distribution network with renewable-flexible virtual power plants considering voltage security index. Sci. Rep. 2024, 14, 70095. [Google Scholar] [CrossRef]
- Zhang, Y.; Wang, A.; Zhang, H. Overview of Smart Grid Development in China. Power Syst. Prot. Control. 2021, 49, 180–187. [Google Scholar] [CrossRef]
- Li, Z.; Ai, Q.; Zhang, Y.; Yin, S.; Sun, D.; Li, X. Data drive technology in the application of the virtual power plant review. Power Grid Technol. 2020, 44, 2411–2419. [Google Scholar] [CrossRef]
- van Summeren, L.F.M.; Wieczorek, A.J.; Bombaerts, G.J.T.; Verbong, G.P.J. Community energy meets Smart Grids: Reviewing goals, structure, and roles in Virtual Power Plants in Ireland, Belgium and the Netherlands. Energy Res. Soc. Sci. 2020, 63, 101415. [Google Scholar] [CrossRef]
- Zhang, L.; Dai, G.; Nie, Q.; Tong, Z. Economic dispatch model of virtual power plant considering electricity consumption under a carbon trading mechanism. Power Syst. Prot. Control. 2020, 48, 154–163. [Google Scholar] [CrossRef]
- Mei, G.; Gong, J.; Zheng, Y. Scheduling strategy for multi-energy complementary virtual power plant considering the correlation between wind and solar output and carbon emission quota. Proc. CSU-EPSA 2021, 33, 62–69. [Google Scholar] [CrossRef]
- Wang, R.; Wu, J.; Cai, Z.; Liu, G.; Zhang, H.; Cai, J. Optimal dispatching of virtual power plant containing electric vehicles in multi-cooperative market. South. Power Syst. Technol. 2021, 15, 45–55. [Google Scholar] [CrossRef]
- Liu, Y.; Fan, Y. Optimal scheduling strategy for virtual power plant considering 5G base station technology, energy-storage, and energy-saving measures. Proc. CSU-EPSA 2022, 34, 8–15. [Google Scholar] [CrossRef]
- Liu, D.; Cheng, P.; Wang, X.; Li, J.; Qin, G. Real-time electricity management optimization algorithm for the household prosumer in smart grid. South. Power Syst. Technol. 2022, 16, 20–28. [Google Scholar] [CrossRef]
- Liu, D.; Li, Z.; Xu, E.; Zhang, S.; Ji, P.; Wang, H.; Gao, C. Flexible block order trading clearing model for new power systems. Power Syst. Technol. 2022, 46, 4150–4159. [Google Scholar] [CrossRef]
- Zhang, D.; Yun, Y.; Wang, X.; He, J. Multi-time scale of new energy scheduling optimization for virtual power plant considering uncertainty of wind power and photovoltaic power. Acta Energiae Solaris Sin. 2022, 43, 529–537. [Google Scholar] [CrossRef]
- Bai, X.; Fan, Y.; Wang, T.; Liu, Y.; Nie, X.; Yan, C. Dynamic aggregation method of virtual power plants considering reliability of renewable energy. Electr. Power Autom. Equip. 2022, 42, 102–110. [Google Scholar] [CrossRef]
- Bianca, G.; Tudor, C.; Ionut, A. Virtual Power Plant Optimization in Smart Grids: A Narrative Review. Future Internet 2022, 14, 128. [Google Scholar] [CrossRef]
- Kong, X.; Xiao, J.; Liu, D.; Wu, J.; Wang, C.; Shen, Y. Robust stochastic optimal dispatching method of multi-energy virtual power plant considering multiple uncertainties. Appl. Energy 2020, 279, 115707. [Google Scholar] [CrossRef]
- Cheng, R.; Zhou, B.; Shi, J.; Li, J.; Zhao, W.; Mao, T.; Wang, T.; Xu, Y.; Guo, Y. Review of Key Technologies for Mega-City Virtual Power Plants upon Regional Unified Power Market. South. Power Syst. Technol. 2023, 17, 90. [Google Scholar] [CrossRef]
- Hou, H.; Ge, X.; Cao, X. Coalition Game Optimization Method for Multiple Virtual Power Plants Considering Carbon Trading. Proc. CSU-EPSA 2023, 35, 77–85. [Google Scholar] [CrossRef]
- Guo, W.; Liu, P.; Shu, X. Optimal dispatching of electric-thermal interconnected virtual power plant considering market trading mechanism. J. Clean. Prod. 2021, 279, 123446. [Google Scholar] [CrossRef]
- Gao, R.; Guo, H.; Zhang, R.; Mao, T.; Xu, Q.; Zhou, B.; Yang, P. A Two-Stage Dispatch Mechanism for Virtual Power Plant Utilizing the CVaR Theory in the Electricity Spot Market. Energies 2019, 12, 3402. [Google Scholar] [CrossRef]
- Xie, M.; Huang, Y.; Li, Y.; Liu, M. Evolutionary Game Decision and Mechanism Analysis of Dynamical Aggregation of Distributed Energy Resources into Virtual Power Plant. Power Syst. Technol. 2023, 47, 4958–4977. [Google Scholar] [CrossRef]
- Zeng, X.; Tang, C. Research on optimization of virtual power plants dispatch by considering the consumption of new energy under time-of-use electricity price environment. J. Electr. Power Sci. Technol. 2023, 38, 24–34. [Google Scholar] [CrossRef]
- Wei, H.; Wang, W.; Kao, X. A novel approach to hybrid dynamic environmental-economic dispatch of multi-energy complementary virtual power plant considering renewable energy generation uncertainty and demand response. Renew. Energy 2023, 219, 119406. [Google Scholar] [CrossRef]
- Liu, X.O. Research on optimal dispatch method of virtual power plant considering various energy complementary and energy low carbonization. Int. J. Electr. Power Energy Syst. 2022, 136, 107670. [Google Scholar] [CrossRef]
- 1547-2018; IEEE Standard for Interconnection and Interoperability of Distributed Energy Resources with Associated Electric Power Systems Interfaces. Institute of Electrical and Electronics Engineers (IEEE): Piscataway, NJ, USA, 2018; pp. 1–138. Available online: https://ieeexplore.ieee.org/document/8332112 (accessed on 5 June 2025).
Dimension | Traditional Power Grids | Smart Grids |
---|---|---|
Architecture | Centralized generation (thermal, hydro) | Distributed integration (PV, wind, storage) |
Energy Flow | Unidirectional (generation → customer) | Bidirectional (user-to-grid) |
Method | Time Scale Adaptability | Multi-Objective Dynamic Balance | Privacy Protection Mechanism | Computational Complexity | 200-Node Delay |
---|---|---|---|---|---|
Traditional MPC [11,12,13] | Single scale | Fixed weight | no | O(n3) | 298.7 ms |
Standard DRL (DQN) [8] | Single scale | Linear weighting | Centralized storage | O(2d) | 12.3 ms |
Federal DRL [16] | Dual-time collaboration | Dynamic Pareto | Blockchain encryption | O(n log n) | 5.2 ms |
Index | Current Level | Expected Increase | Concrete Impact | Data Source |
---|---|---|---|---|
Peak load | 1000 MW (2023) | Reduce by 10% | Reduce the pressure on the power grid and reduce the cost of power supply | Shenzhen VPP platform actual operation data |
Load balancing capacity | 75% | Increase by 15% | Improve grid stability and reduce the risk of power outages | Shenzhen VPP platform actual operation data |
Utilization rate of wind energy | 30% | Increase by 20% | Increase the proportion of renewable energy and reduce dependence on fossil fuels | Reference [2] |
Solar integration capacity | 25% | Increase by 25% | Improve the efficiency of solar power generation and optimize resource allocation | Reference [7] |
Economic dispatch cost | 1 million CNY | Reduce by 25% | Reduce operating costs through intelligent scheduling and resource allocation | Shenzhen VPP platform actual operation data |
Grid reliability | 99.5% | Increase by 0.5% | Improve the reliability and stability of the power grid | Shenzhen VPP platform actual operation data |
Energy efficiency | 40% | Increase by 10% | Improve energy efficiency and reduce energy waste | Shenzhen VPP platform actual operation data |
Algorithm Type | Core Formula | Application Scenario | Advantage | Limitation |
---|---|---|---|---|
DRL dynamic optimization | High volatility real-time scheduling | Adapt to environmental changes and optimize long-term returns | The training data demand is large and the computational complexity is high | |
Robust optimization | Extreme uncertainty scenario | Guarantee worst-case performance | Conservatism may reduce average efficiency | |
GNN topology optimization | Optimization of large-scale power grid structure | Model topology relationships explicitly to improve interpretability | Sensitive to graph structure quality | |
Federated learning | Multi-agent privacy protection collaboration | High data privacy and distributed computing efficiency | The communication cost is high and the convergence speed is slow |
Data Acquisition Methods | Pros | Cons | Pretreatment Technique | Tools |
---|---|---|---|---|
Sensor data | Strong real-time and high precision | High cost and complex maintenance | Data cleansing | Python 3.11, R 4.2 |
Market data | Large amount of data and wide coverage | There may be noise and inconsistencies | Feature selection | Weka 3.8, Scikit-learn 1.2 |
Social media data | Reflect user behavior and be dynamic | Data quality is uneven | Data normalization | Pandas 1.5, NumPy 1.23 |
Remote sensing data | Wide space coverage and easy access | The analysis is complicated and the processing time is long | Data interpolation | ArcGIS 10.9, QGIS 3.28 |
Algorithm | Iteration Limitation | Convergence Condition | Hyperparameter Setting |
---|---|---|---|
DRL | 200 cycles/500,000 steps | Reward fluctuation is less than 1% | The learning rate is 0.001 and the discount factor is 0.99 |
GA | 300 generations | The change in fitness is less than 0.5% | The crossover rate was 0.8 and the variation rate was 0.05 |
PSO | 500 iterations | The global optimum remains unchanged | Inertia weight: 0.8 → 0.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Tang, X.; Wang, J. Deep Reinforcement Learning-Based Multi-Objective Optimization for Virtual Power Plants and Smart Grids: Maximizing Renewable Energy Integration and Grid Efficiency. Processes 2025, 13, 1809. https://doi.org/10.3390/pr13061809
Tang X, Wang J. Deep Reinforcement Learning-Based Multi-Objective Optimization for Virtual Power Plants and Smart Grids: Maximizing Renewable Energy Integration and Grid Efficiency. Processes. 2025; 13(6):1809. https://doi.org/10.3390/pr13061809
Chicago/Turabian StyleTang, Xinfa, and Jingjing Wang. 2025. "Deep Reinforcement Learning-Based Multi-Objective Optimization for Virtual Power Plants and Smart Grids: Maximizing Renewable Energy Integration and Grid Efficiency" Processes 13, no. 6: 1809. https://doi.org/10.3390/pr13061809
APA StyleTang, X., & Wang, J. (2025). Deep Reinforcement Learning-Based Multi-Objective Optimization for Virtual Power Plants and Smart Grids: Maximizing Renewable Energy Integration and Grid Efficiency. Processes, 13(6), 1809. https://doi.org/10.3390/pr13061809