Distributed Peer-to-Peer Optimization Based on Robust Reinforcement Learning with Demand Response: A Review
Abstract
1. Introduction
2. Methodology
2.1. Criteria for Literature Selection
2.2. Conceptual Framework
- Virtual layer elements: Information systems, market operation mechanisms, pricing strategies, and energy management systems.
- Physical layer elements: Network connections, smart metering, and communication infrastructure.
- Other elements: Prosumers, regulators, and energy policy frameworks.
2.2.1. The Role of Phasor Measurement Units (PMUs) and Phasor Data Concentrators (PDCs) in P2P Energy Systems
2.2.2. Enhanced Grid Stability and Reliability
2.3. Analysis Procedures
3. Results and Discussion
3.1. Applications of Robust Reinforcement Learning
3.2. Role of Demand Response in P2P Optimization
3.3. Synergy Between P2P Optimization and Robust Learning
3.4. Advances in Battery Energy Storage Systems (BESSs)
3.5. Comparison of Optimization Methods in P2P Networks
3.5.1. Robust Reinforcement Learning (RRL) in P2P Networks
3.5.2. Demand Response-Based P2P Optimization (DR-P2P)
3.5.3. Synergy Between P2P Optimization and Robust Reinforcement Learning
3.6. Integration of Reinforcement Learning and Robust Optimization for P2P Energy Networks
3.7. The Role of Battery Energy Storage Systems in P2P Energy Networks
3.7.1. Types of Battery Energy Storage Systems
3.7.2. Integration of BESSs in P2P Energy Markets
3.7.3. Future Directions for BESSs in P2P Networks
3.8. The Role of Electric Vehicles in P2P Energy Networks
3.8.1. Direct Control Strategies for EV Charging
3.8.2. Indirect Control Strategies for EV Charging
3.9. Comparative Analysis of Optimization Approaches in P2P Energy Trading
3.9.1. Deterministic and Heuristic Optimization vs. Reinforcement Learning Approaches
3.9.2. Case Studies on Energy Efficiency and Demand Response Strategies
3.9.3. Integration of Battery Energy Storage Systems and Electric Vehicles
3.9.4. Comparative Summary and Research Gaps
3.10. Chronological Development of Technologies in P2P Energy Trading
3.10.1. Early Concepts of Distributed Energy and Demand Response—1990s
3.10.2. Advancements in Smart Grids and Energy Storage—2000s
3.10.3. Emergence of Blockchain, AI, and P2P Energy Trading Models—2010s
3.10.4. AI-Driven Optimization, Scalability, and Regulatory Challenges—2020s
3.10.5. Future Outlook
4. Challenges and Opportunities
4.1. Technical Challenges
4.2. Regulatory and Economic Aspects
4.3. Future Innovations
5. Value of the Work and Findings
Key Outcomes of the Review
- Comparison of Optimization Approaches: RL-based optimization improves transaction efficiency by 20% and reduces operational costs by 25%, outperforming deterministic and heuristic methods in dynamic market conditions.
- Demand Response and Energy Flexibility: DR-integrated RL models reduce peak demand by up to 30% and increase renewable energy self-consumption by 22%, enhancing energy market efficiency and grid stability.
- Vehicle-to-Grid (V2G) and Battery Storage Integration: P2P frameworks incorporating EVs with V2G capabilities have demonstrated a 20% reduction in energy curtailment and a 35% reduction in peak electricity costs, improving decentralized energy storage utilization.
- Scalability and Computational Efficiency: While RL-based models enhance adaptability in uncertain environments, their computational complexity remains a challenge, requiring further research on hybrid models that balance efficiency and scalability.
- Regulatory and Market Barriers: Existing regulatory frameworks do not fully support P2P energy trading, limiting widespread adoption. Addressing policy gaps and creating financial incentives can drive higher consumer participation and improve decentralized trading models.
- Future Research Directions: Advancements in federated learning, quantum computing, and blockchain-enabled smart contracts may further enhance scalability, privacy, and security in decentralized P2P energy networks.
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zedan, M.; Nour, M.; Shabib, G.; Nasrat, L.; Ali, A.-A. Review of Peer-to-Peer Energy Trading: Advances and Challenges. E-Prime—Adv. Electr. Eng. Electron. Energy 2024, 10, 100778. [Google Scholar] [CrossRef]
- May, R.; Huang, P. A Multi-Agent Reinforcement Learning Approach for Investigating and Optimising Peer-to-Peer Prosumer Energy Markets. Appl. Energy 2023, 334, 120705. [Google Scholar] [CrossRef]
- Alghanmi, N.A.; Alkhudhayr, H. EnergyShare AI: Transforming P2P Energy Trading through Advanced Deep Learning. Heliyon 2024, 10, e36948. [Google Scholar] [CrossRef]
- Jia, H.; Wang, Z.; Yu, X.; Mu, Y.; Xu, X.; Wang, X. State-of-the-Art Analysis and Perspectives for Peer-to-Peer Energy Trading Technology in Urban Community Microgrid System. Gaodianya Jishu-High Volt. Eng. 2022, 48, 2453–2468. [Google Scholar] [CrossRef]
- Liu, C.; Li, Z. Comparison of Centralized and Peer-to-Peer Decentralized Market Designs for Community Markets. IEEE Trans. Ind. Appl. 2022, 58, 67–77. [Google Scholar] [CrossRef]
- Jogunola, O.; Wang, W.; Adebisi, B. Prosumers Matching and Least-Cost Energy Path Optimisation for Peer-to-Peer Energy Trading. IEEE Access 2020, 8, 95266–95277. [Google Scholar] [CrossRef]
- Rashidizadeh-Kermani, H.; Vahedipour-Dahraie, M.; Shafie-Khah, M.; Siano, P. A Peer-to-Peer Energy Trading Framework for Wind Power Producers with Load Serving Entities in Retailing Layer. IEEE Syst. J. 2022, 16, 649–658. [Google Scholar] [CrossRef]
- Zhang, C.; Qiu, J.; Yang, Y. A Peer-to-Peer Joint Kilowatt and Negawatt Trading Framework Incorporating Battery Cycling Degradation. IEEE Trans. Power Syst. 2024, 39, 6386–6398. [Google Scholar] [CrossRef]
- Wang, X.; Yu, X.; Chen, S.; Zhang, Y.; Li, Q. A Real-Time Peer-to-Peer Energy Trading for Prosumers Utilizing Time-Varying Building Virtual Energy Storage. Int. J. Electr. Power Energy Syst. 2024, 155, 109547. [Google Scholar] [CrossRef]
- Saatloo, A.M.; Mirzaei, M.A.; Mohammadi-Ivatloo, B. A Robust Decentralized Peer-to-Peer Energy Trading in Community of Flexible Microgrids. IEEE Syst. J. 2023, 17, 640–651. [Google Scholar] [CrossRef]
- Mishra, M.; Singh, A.; Misra, R.K.; Singh, D.; Maulik, A. A Scalable and Computational Efficient Peer-to-Peer Energy Management Scheme. IEEE Access 2023, 11, 21686–21698. [Google Scholar] [CrossRef]
- Guo, Z.; Pinson, P.; Wu, Q.; Chen, S.; Yang, Q.; Yang, Z. An Asynchronous Online Negotiation Mechanism for Real-Time Peer-to-Peer Electricity Markets. IEEE Trans. Power Syst. 2022, 37, 1868–1880. [Google Scholar] [CrossRef]
- Bokkisam, H.R.; Singh, S.; Acharya, R.M.; Selvan, M.P. Blockchain-Based Peer-to-Peer Transactive Energy System for Community Microgrid with Demand Response Management. CSEE J. Power Energy Syst. 2022, 8, 198–211. [Google Scholar] [CrossRef]
- Muntasir, F.; Chapagain, A.; Maharjan, K.; Baig, M.J.A.; Jamil, M.; Khan, A.A. Developing an Appropriate Energy Trading Algorithm and Techno-Economic Analysis between Peer-to-Peer within a Partly Independent Microgrid. Energies 2023, 16, 1549. [Google Scholar] [CrossRef]
- Chen, X.; Wang, X.; Shahidehpour, M.; Affolabi, L.; Lu, Z.; Li, K. Distributed Peer-to-Peer Coordination of Hierarchical Three-Phase Energy Transactions among Electric Vehicle Charging Stations in Constrained Power Distribution and Urban Transportation Networks. IEEE Trans. Transp. Electrif. 2024, 10, 4407–4420. [Google Scholar] [CrossRef]
- Luo, X.; Shi, W.; Jiang, Y.; Liu, Y.; Xia, J. Distributed Peer-to-Peer Energy Trading Based on Game Theory in a Community Microgrid Considering Ownership Complexity of Distributed Energy Resources. J. Clean. Prod. 2022, 351, 131573. [Google Scholar] [CrossRef]
- Wu, Q.; Song, Q.; He, X.; Chen, G.; Huang, T. Distributed Peer-to-Peer Energy Trading Framework with Manufacturing Assembly Process and Uncertain Renewable Energy Plants in Multi-Industrial Micro-Grids. Energy 2024, 302, 131876. [Google Scholar] [CrossRef]
- Salehi, M.K.; Rastegar, M. Distributed Peer-to-Peer Transactive Residential Energy Management with Cloud Energy Storage. J. Energy Storage 2023, 58, 106401. [Google Scholar] [CrossRef]
- Liu, J.; Long, Q.; Liu, R.-P.; Liu, W.; Hou, Y. Online Distributed Optimization for Spatio-Temporally Constrained Real-Time Peer-to-Peer Energy Trading. Appl. Energy 2023, 331, 120216. [Google Scholar] [CrossRef]
- Ma, H.; Liu, X.; Zhang, Y.; Wang, L.; Li, Y.; Xia, Z. Optimal Peer-to-Peer Energy Transaction of Distributed Prosumers in High-Penetrated Renewable Distribution Systems. IEEE Trans. Ind. Appl. 2024, 60, 4622–4632. [Google Scholar] [CrossRef]
- El Kasri, H.; Abdennour, I.; Ouardouz, M.; Bernoussi, A.S. Enhancing Local Energy Sharing Reliability within Peer-to-Peer Prosumer Communities: A Cellular Automata and Deep Learning Approach. Sustain. Energy Grids Netw. 2024, 39, 101504. [Google Scholar] [CrossRef]
- Tsaousoglou, G.; Ellinas, P.; Varvarigos, E. Operating Peer-to-Peer Electricity Markets under Uncertainty via Learning-Based, Distributed Optimal Control. Appl. Energy 2023, 343, 121234. [Google Scholar] [CrossRef]
- Varghese, L.J.; Dhayalini, K.; Jacob, S.S.; Ali, I.; Abdelmaboud, A.; Eisa, T.A.E. Optimal Load Forecasting Model for Peer-to-Peer Energy Trading in Smart Grids. Comput. Mater. Contin. 2021, 70, 1053–1067. [Google Scholar] [CrossRef]
- Hao, J.; Huang, T.; Sun, Y.; Zhan, X.; Zhang, Y.; Wu, P. Optimal Prosumer Operation with Consideration for Bounded Rationality in Peer-to-Peer Energy Trading Systems. Energies 2024, 17, 1724. [Google Scholar] [CrossRef]
- Wang, Y.; Yang, Q.; Li, D. Peer-to-Peer Energy Trading in a Community Based on Deep Reinforcement Learning. J. Renew. Sustain. Energy 2023, 15, 062704. [Google Scholar] [CrossRef]
- Kanakadhurga, D.; Prabaharan, N. Demand Response-Based Peer-to-Peer Energy Trading among the Prosumers and Consumers. Energy Rep. 2021, 7, 7825–7834. [Google Scholar] [CrossRef]
- Mehdinejad, M.; Shayanfar, H.A.; Mohammadi-Ivatloo, B.; Nafisi, H. Designing a Robust Decentralized Energy Transactions Framework for Active Prosumers in Peer-to-Peer Local Electricity Markets. IEEE Access 2022, 10, 26743–26755. [Google Scholar] [CrossRef]
- Tiwari, A.; Jha, B.K.; Pindoriya, N.M. Multi-Objective Optimization Based Demand Response Program with Network Aware Peer-to-Peer Energy Sharing. Int. J. Electr. Power Energy Syst. 2024, 157, 109887. [Google Scholar] [CrossRef]
- Khazaei, H.; Aghamohammadloo, H.; Habibi, M.; Mehdinejad, M.; Mohammadpour Shotorbani, A. Novel Decentralized Peer-to-Peer Gas and Electricity Transaction Market between Prosumers and Retailers Considering Integrated Demand Response Programs. Sustainability 2023, 15, 6165. [Google Scholar] [CrossRef]
- Görgülü, H.; Topçuoğlu, Y.; Yaldız, A.; Gökçek, T.; Ateş, Y.; Erdinç, O. Peer-to-Peer Energy Trading among Smart Homes Considering Responsive Demand and Interactive Visual Interface for Monitoring. Sustain. Energy Grids Netw. 2022, 29, 100584. [Google Scholar] [CrossRef]
- Alsolami, M.; Alferidi, A.; Lami, B.; Ben Slama, S. Peer-to-Peer Trading in Smart Grid with Demand Response and Grid Outage Using Deep Reinforcement Learning. Ain Shams Eng. J. 2023, 14, 102466. [Google Scholar] [CrossRef]
- Timilsina, A.; Silvestri, S. P2P Energy Trading through Prospect Theory, Differential Evolution, and Reinforcement Learning. ACM Trans. Evol. Learn. Optim. 2023, 3, 1–22. [Google Scholar] [CrossRef]
- Zhou, S.; Zou, F.; Wu, Z.; Gu, W.; Hong, Q.; Booth, C. A Smart Community Energy Management Scheme Considering User Dominated Demand Side Response and P2P Trading. Int. J. Electr. Power Energy Syst. 2020, 114, 105378. [Google Scholar] [CrossRef]
- Merrad, Y.; Habaebi, M.H.; Toha, S.F.; Islam, M.R.; Gunawan, T.S.; Mesri, M. Fully Decentralized, Cost-Effective Energy Demand Response Management System with a Smart Contracts-Based Optimal Power Flow Solution for Smart Grids. Energies 2022, 15, 4461. [Google Scholar] [CrossRef]
- Zhao, J.; Wang, C.; Li, G.; Wu, B.; Li, N.; Peng, K. Low-Carbon Operation Strategy for P2P Energy Trading Among Multiple Microgrids Considering Demand Response. Dianli Jianshe-Electric Power Constr. 2023, 44, 54–65. [Google Scholar] [CrossRef]
- Kumar, M.; Dohare, U.; Kumar, S.; Kumar, N. Blockchain-Based Optimized Energy Trading for E-Mobility Using Quantum Reinforcement Learning. IEEE Trans. Veh. Technol. 2023, 72, 5167–5180. [Google Scholar] [CrossRef]
- Fan, Y.; Wang, Q.; Xie, Y.; Zhou, N.; Yang, Y.; Ding, Y.; Wei, Y.; Qu, G. Advances in Aqueous Zinc-Ion Battery Systems: Cathode Materials and Chemistry. Prog. Mater. Sci. 2025, 149, 101393. [Google Scholar] [CrossRef]
- Kim, J.; Park, H. Recent Advances in Porous Electrodes for Vanadium Redox Flow Batteries in Grid-Scale Energy Storage Systems: A Mass Transfer Perspective. J. Power Sources 2022, 545, 231904. [Google Scholar] [CrossRef]
- Wang, Y.; Yu, T.; Chen, J.; Gao, B.; Yu, M.; Zhu, J. Advances in Safety of Lithium-Ion Batteries for Energy Storage: Hazard Characteristics and Active Suppression Techniques. Energy Rev. 2025, 4, 100117. [Google Scholar] [CrossRef]
- Mama, M.; Solai, E.; Capurso, T.; Danlos, A.; Khelladi, S.; Mama, M.; Solai, E.; Capurso, T.; Danlos, A.; Khelladi, S.; et al. Comprehensive Review of Multi-Scale Lithium-Ion Batteries Modeling: From Electro-Chemical Dynamics Up to Heat Transfer in Battery Thermal Management System. Energy Convers. Manag. 2025, 325, 119223. [Google Scholar] [CrossRef]
- An, J.; Hong, T. Optimizing Battery Energy Storage System Placement in Energy Intensive Cities for Sustainable Urban Development Considering Multiple Objectives. Sustain. Cities Soc. 2025, 118, 106036. [Google Scholar] [CrossRef]
- Guo, F.; Gomes, L.; Ma, L.; Tian, Z.; Vale, Z.; Pang, S. Optimizing Battery Storage for Sustainable Energy Communities: A Multi-Scenario Analysis. Sustain. Cities Soc. 2025, 118, 106030. [Google Scholar] [CrossRef]
- Li, Q.; Wei, X.; Wang, J.; Chao, Y.; Li, Y.; Fan, H. Enhancing Battery Energy Storage Systems for Photovoltaic Applications in Extremely Cold Regions: A Brief Review. Energy Sustain. Dev. 2024, 81, 101517. [Google Scholar] [CrossRef]
- Ma, Z.; Jia, M.; Koltermann, L.; Blömeke, A.; De Doncker, R.W.; Li, W.; Sauer, D.U. Review on Grid-Tied Modular Battery Energy Storage Systems: Configuration Classifications, Control Advances, and Performance Evaluations. J. Energy Storage 2023, 74, 109272. [Google Scholar] [CrossRef]
- Gopi, C.V.M.; Ramesh, R. Review of Battery-Supercapacitor Hybrid Energy Storage Systems for Electric Vehicles. Results Eng. 2024, 24, 103598. [Google Scholar] [CrossRef]
- Zahedmanesh, A.; Muttaqi, K.M.; Sutanto, D. Direct Control of Plug-in Electric Vehicle Charging Load Using an In-house Developed Intermediate Control Unit. In Proceedings of the 2018 IEEE Industry Applications Society Annual Meeting (IAS), Portland, OR, USA, 23–27 September 2018; pp. 1–9. [Google Scholar] [CrossRef]
- Bordin, C.; Tomasgard, A. Behavioural Change in Green Transportation: Micro-Economics Perspectives and Optimization Strategies. Energies 2021, 14, 3728. [Google Scholar] [CrossRef]
- Sujikannan, M.; Kumar, A.R.; Daniel, S.A. Sizing of Rooftop PV Array and Community-Run Battery Storage for an Energy Cooperative in Prosumer Cluster. Distrib. Gener. Altern. Energy J. 2022, 37, 1797–1822. [Google Scholar] [CrossRef]
- Inês, C.; Guilherme, P.L.; Esther, M.-G.; Swantje, G.; Stephen, H.; Lars, H. Regulatory Challenges and Opportunities for Collective Renewable Energy Prosumers in the EU. Energy Policy 2020, 138, 111212. [Google Scholar] [CrossRef]
- Hoicka, C.E.; Lowitzsch, J.; Brisbois, M.C.; Kumar, A.; Ramirez Camargo, L. Implementing a Just Renewable Energy Transition: Policy Advice for Transposing the New European Rules for Renewable Energy Communities. Energy Policy 2021, 156, 112435. [Google Scholar] [CrossRef]
- Peeren, R.; Dabhi, D.; Dalton, J. Levelling the Playing Field for Smart Renewable Energy Community in the Electricity Market through the High Street Electricity Market Model. Appl. Energy 2025, 377, 124660. [Google Scholar] [CrossRef]
- Junlakarn, S.; Kokchang, P.; Audomvongseree, K. Drivers and Challenges of Peer-to-Peer Energy Trading Development in Thailand. Energies 2022, 15, 1229. [Google Scholar] [CrossRef]
- Kashyap, P.K.; Dohare, U.; Kumar, M.; Kumar, S. Blockchain and Quantum Machine Learning Driven Energy Trading for Electric Vehicles. Ad Hoc Netw. 2024, 165, 103632. [Google Scholar] [CrossRef]
- Li, Z.; Chen, S.; Zhou, B. Electric Vehicle Peer-to-Peer Energy Trading Model Based on SMES and Blockchain. IEEE Trans. Appl. Supercond. 2021, 31, 3091074. [Google Scholar] [CrossRef]
- Maine, P.K.; Leke, C.A.; Longe, O.M. Blockchain Application in Energy Trading for Grid-Connected Prosumers. E-Prime—Adv. Electr. Eng. Electron. Energy 2024, 8, 100586. [Google Scholar] [CrossRef]

| Optimization Method | Application Scenarios | Performance Metrics | Limitations | 
|---|---|---|---|
| Robust Reinforcement Learning (RRL) | Best for dynamic, uncertain energy markets with fluctuating demand and supply [2]. | Reduces net energy losses by 28.64% and increases reward efficiency [6]. | High computational cost and scalability issues in large-scale P2P markets [5]. | 
| Demand Response-Based P2P Optimization (DR-P2P) | Works best where demand-side management and market signals guide consumer participation [13]. | Reduces consumer energy costs by 54–76% and enhances grid flexibility [5]. | Relies on consumer participation and limited adaptability in sudden market shifts [6]. | 
| Synergy Between P2P Optimization and RRL | Ideal for hybrid systems integrating learning-based decision-making with market-based optimization [3]. | Increases trading efficiency by 20% and reduces operational costs by 25% [6]. | High system complexity and requires robust computational infrastructure [5]. | 
| Area | Technical Challenges | 
|---|---|
| Application of Reinforcement Learning | Incomplete data: Energy losses during charge–discharge cycles and battery degradation are not accounted for. Environmental impacts are not analyzed [2]. | 
| Uncertainty and variability: Energy demand uncertainty affects prediction quality and convergence times [22]. | |
| Computational load: Markets with multiple participants significantly increase computational load, hindering scalability [31,32]. | |
| Scalability: Some solutions require advanced computational resources to achieve scalability [31,32]. | |
| Demand Response in P2P Optimization | Negative grid impacts: P2P trading involves energy losses, destabilized load profiles, and voltage regulation issues [28]. | 
| Conflicts of interest: Tensions arise between prosumers optimizing their economic benefits and distribution system operators aiming to maintain optimal power flow [34]. | |
| Incomplete models: P2P trading tariffs, specific network constraints, and consumer discomfort costs are often not considered [28,34]. | |
| Implementation complexity: Advanced algorithms like BPSO and Nash games in decentralized systems are challenging to scale [34]. | |
| Social and regulatory effects: Insufficient information addresses socio-environmental and regulatory impacts of P2P trading and DR integration [3]. | |
| P2P Optimization and Reinforcement Learning | Uncertainty management: Integrating RL is challenging in environments with distributed generation and uncertain demand, particularly in dynamic and competitive settings [36]. | 
| Need for high adaptability: Maximizing utility for buyers and sellers requires techniques that enhance system robustness to interruptions [32]. | |
| Battery Energy Storage Systems (BESSs) | Battery degradation: Greater understanding and modeling of degradation mechanisms are needed, particularly in AZIB and redox flow battery technologies [37,38]. | 
| Safety risks: In lithium-ion batteries, thermal runaway, gas accumulation, and explosions pose challenges to safe adoption [39]. | |
| Advanced models: A balance between computational efficiency and complexity is needed in multiscale and electrochemical–thermal coupled approaches [40]. | |
| Renewable integration: Increasing renewable generation requires improvements in BESS efficiency [41,42]. | |
| Extreme climates: Batteries lose efficiency in extreme temperatures, necessitating specific mitigation strategies [48]. | 
| Innovation | Details | 
|---|---|
| AI and Advanced Algorithms | Multi-agent Reinforcement Learning (MARL): Applied to dynamic pricing optimization and market strategies in P2P communities with prosumers [2]. | 
| Advanced Q-Learning: Techniques such as ProDQN optimize pricing for sellers and buyers in P2P markets, increasing rewards and efficiency [31,32]. | |
| Blockchain Integration: Combined with advanced deep learning for efficient and secure energy management [13,36,53,54,55]. | |
| Multi-objective Optimization: Advanced BPSO algorithms in demand response reduce operational costs, improve sustainability, and optimize local resource utilization [28]. | |
| P2P Optimization: Dynamic adaptability to uncertainties maximizes market efficiency and enhances stability [31]. | |
| Surplus Energy Management: Optimizes charge–discharge cycles of storage systems [31]. | |
| Technologies and Materials in BESSs | Manganese Oxides and Organic Materials: Future research focuses on cathode materials to improve stability and performance [37]. | 
| Fire Suppression in Batteries: New strategies to mitigate risks associated with thermal runaway [39]. | |
| Zinc-Ion Batteries (AZIBs): Incorporate innovative cathode materials such as vanadium compounds and Prussian blue analogs to enhance capacity, stability, and efficiency [37]. | |
| Vanadium Redox Flow Batteries (VRFBs): Optimize mass transfer in porous electrodes, increasing efficiency for large-scale applications [38]. | |
| Multiscale Modeling: Electrochemical–thermal coupled approaches balance efficiency and complexity, improving battery management and cooling strategies [40]. | 
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Martínez, A.; Arévalo, P. Distributed Peer-to-Peer Optimization Based on Robust Reinforcement Learning with Demand Response: A Review. Computers 2025, 14, 65. https://doi.org/10.3390/computers14020065
Martínez A, Arévalo P. Distributed Peer-to-Peer Optimization Based on Robust Reinforcement Learning with Demand Response: A Review. Computers. 2025; 14(2):65. https://doi.org/10.3390/computers14020065
Chicago/Turabian StyleMartínez, Andrés, and Paul Arévalo. 2025. "Distributed Peer-to-Peer Optimization Based on Robust Reinforcement Learning with Demand Response: A Review" Computers 14, no. 2: 65. https://doi.org/10.3390/computers14020065
APA StyleMartínez, A., & Arévalo, P. (2025). Distributed Peer-to-Peer Optimization Based on Robust Reinforcement Learning with Demand Response: A Review. Computers, 14(2), 65. https://doi.org/10.3390/computers14020065
 
         
                                                


 
       