Abstract
In the context of dual-carbon goals, Park-Level Integrated Energy Systems (PIES) are pivotal for enhancing renewable energy integration and promoting clean, efficient energy use. However, the non-linearity introduced by multi-energy coupling and the high dimensionality of operational data pose substantial challenges for conventional scheduling optimization methods. To address these challenges, this paper proposes a multi-objective scheduling framework for PIES based on deep reinforcement learning. The scheduling task is formulated as a Markov Decision Process (MDP) and solved with the Trust Region Policy Optimization (TRPO) algorithm, which is well suited to continuous action spaces. The state and action spaces are designed according to system constraints and user demands, and a comprehensive reward function is constructed to pursue three objectives simultaneously: minimizing operating cost, minimizing carbon emissions, and maximizing exergy efficiency. Comparative analyses against other AI-based algorithms show that the proposed method significantly reduces operating costs and carbon emissions while improving overall exergy efficiency, validating the model's effectiveness and superiority for the complex multi-objective scheduling problems of modern energy systems.
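For concreteness, the three objectives named above could be combined into a single scalar reward via a weighted sum; the following is a minimal sketch under that assumption, with hypothetical weights \(\lambda_1,\lambda_2,\lambda_3\) and normalization constants \(C^{\mathrm{op}}_{\max}, E^{\mathrm{CO_2}}_{\max}\) that are not specified in the abstract:

\[
r_t \;=\; -\,\lambda_1\,\frac{C_t^{\mathrm{op}}}{C^{\mathrm{op}}_{\max}}
\;-\;\lambda_2\,\frac{E_t^{\mathrm{CO_2}}}{E^{\mathrm{CO_2}}_{\max}}
\;+\;\lambda_3\,\eta_t^{\mathrm{ex}},
\qquad \lambda_1+\lambda_2+\lambda_3=1,
\]

where \(C_t^{\mathrm{op}}\), \(E_t^{\mathrm{CO_2}}\), and \(\eta_t^{\mathrm{ex}}\) denote the per-step operating cost, carbon emissions, and exergy efficiency, respectively; the signs penalize cost and emissions while rewarding higher exergy efficiency. The actual reward design used in the paper may differ.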