1. Introduction
Driven by the global goals of carbon neutrality and carbon peaking, the energy system is undergoing a profound low-carbon transition. The large-scale grid integration of renewable energy and the clustered operation of distributed energy resources have emerged as pivotal trends in future energy development [
1,
2]. As a critical enabler for integrating distributed generators, energy storage systems, flexible loads, and other resources, Virtual Power Plants (VPPs) transform distributed energy from “disordered grid connection” to “ordered collaboration” by aggregating the regulation potential of scattered energy resources. This provides an effective pathway for enhancing the flexibility of power systems and promoting the accommodation of renewable energy [
3,
4]. With the continuous expansion in the number and scale of VPPs, the collaborative operation of multi-VPP clusters has increasingly become a critical research topic in power system dispatch and management [
5,
6].
As the core link connecting the power supply side and the demand side, the operational efficiency of a distribution system directly impacts the accommodation level of distributed energy and the overall system benefits. Distribution System Operators (DSOs), acting as the managers and dispatchers of distribution systems, are responsible for coordinating the operational behaviors of multiple VPPs, balancing supply and demand, reducing carbon emissions, and controlling operational costs [
7]. However, each VPP operates as an independent market entity with the core objective of maximizing its own benefits. Its energy management strategies (such as generation schedule formulation, load regulation, and energy storage charging/discharging control) are not only constrained by internal resource endowments but also closely coupled with the decisions of other VPPs and the dispatching strategies of the DSO [
8]. This complex interaction, characterized by multiple stakeholders, conflicting objectives and diverse constraints, renders traditional centralized dispatching methods inadequate for balancing the interests of all parties, resulting in limited feasibility and economic efficiency of dispatching schemes [
9]. Furthermore, with the gradual maturation of carbon trading markets, carbon emission costs have evolved into a significant component of the energy system operational costs. Consequently, integrating carbon constraints into the collaborative dispatching process of multiple VPPs further accentuates the complexity of the optimization problem, which is driven by both economic and low-carbon objectives [
10,
11].
Currently, extensive research has been conducted on the optimal dispatching and collaborative management of VPPs. Regarding the optimal dispatching of a single VPP, existing studies primarily focus on the optimal allocation of internal resources. By establishing optimization models that incorporate factors such as the uncertainty of renewable energy output and energy storage operation constraints, these studies aim to minimize costs or maximize benefits [
12,
13]. Reference [
14] constructs an intraday optimal dispatching model for VPPs considering wind power output fluctuations, which is solved using the particle swarm optimization algorithm, effectively improving the operational economy of VPPs. Reference [
15] incorporates carbon trading costs into the VPP utility function and proposes a dispatching strategy that balances economic and low-carbon objectives. However, such studies typically overlook the interactive relationships between multiple VPPs and thus cannot adapt to the operational requirements of multi-VPP clusters. In the field of multi-VPP collaborative management, the energy resources among VPPs exhibit spatiotemporal complementarity [
16]. Therefore, conducting collaborative optimization that accounts for multi-VPP clusters in a unified regional distribution network can achieve interest balance and optimization among VPPs, leading to better economic benefits. Reference [
17] proposes a low-carbon optimal dispatching method for distribution systems with multiple VPPs, reducing system carbon emissions by coordinating the output plans of each VPP. Furthermore, distributed coordination methods achieve the alignment of local optimality and global optimization through information interaction and autonomous decision-making among multiple subjects. Reference [
18] adopts the Alternating Direction Method of Multipliers (ADMM) to realize distributed optimization of multiple VPPs, balancing dispatching efficiency and privacy protection. Nevertheless, existing distributed methods mostly focus on economic objective optimization, with insufficient consideration of low-carbon constraints. Moreover, they fail to effectively characterize the interest game relationships among participants, resulting in inadequate fairness and feasibility of collaborative dispatching schemes.
As an effective tool for analyzing interest interactions among multiple entities, game theory provides a novel solution for the multi-VPP collaborative management [
19]. In particular, the Stackelberg game, a non-cooperative game model, effectively coordinates interest conflicts between entities by characterizing the hierarchical “leader–follower” decision-making relationship [
20]. In the power system field, the Stackelberg game has been widely applied to multi-entity collaboration problems, such as those involving power grids and distributed generators, as well as microgrids and load aggregators [
6,
7,
12]. Reference [
21] designs a transaction mechanism between electricity retailers and VPPs based on the Stackelberg game, enhancing the economic benefits of both parties. However, existing studies utilizing the Stackelberg game mostly target single VPPs or two-party interaction scenarios, lacking a systematic analysis of multi-VPP cluster collaboration. Furthermore, they do not fully integrate carbon trading mechanisms and complex constraints (such as energy storage operation and flexible load response), making it difficult to meet the practical requirements for the low-carbon economic collaborative operation of multiple VPPs [
22,
23].
Additionally, research on multi-VPP collaborative management has yielded substantial results, with scholars worldwide focusing on guiding distributed resources to participate in market games through carbon trading mechanisms [
24,
25,
26]. However, the existing research lacks an in-depth investigation into the integration of stepwise carbon trading mechanisms and flexible load constraints within a multi-VPP Stackelberg game framework [
27]. Specifically, the dynamic game relationship between carbon trading costs and adaptive load response strategies has been overlooked, resulting in sub-optimal economic and environmental synergies in low-carbon economic dispatch strategies. Therefore, this work proposes a Stackelberg game-based method for the low-carbon economic optimization and collaborative management of multiple VPPs.
The main research contents are as follows:
- (1)
A Stackelberg game model for multi-VPP energy management is constructed, clarifying the DSO as the leader and multiple VPPs as the followers, to characterize the interest interaction relationship between the two parties.
- (2)
A dynamic pricing game model for the leader (DSO) and an energy management game model for the followers (VPPs) are established, respectively. The VPP utility function comprehensively considers multi-dimensional factors such as energy transaction costs, fuel costs, and carbon trading costs. Furthermore, a strategy space subject to constraints regarding energy balance, energy storage operation, power conversion, and flexible load response is constructed.
- (3)
A solution approach integrating the Nonlinear-based Chaotic Harris Hawks Optimization (NCHHO) algorithm and CPLEX solver is introduced to efficiently obtain the equilibrium solution of the Stackelberg game, thereby achieving a win–win outcome for both the DSO and multiple VPPs.
The main innovations of this paper are reflected in three aspects: First, a multi-VPP Stackelberg game framework balancing the profit between the DSO and VPPs is constructed. By incorporating carbon trading costs into the VPP utility function, the collaborative optimization of low-carbon economic objectives is realized. Second, multi-constraint conditions such as energy storage operation and flexible load response are systematically considered, improving the description of the VPP strategy space and enhancing the engineering feasibility of dispatching schemes. Third, the NCHHO algorithm is successfully applied to the problem of optimization for multi-VPP collaborative dispatching.
The remainder of this paper is organized as follows:
Section 2 provides a Stackelberg game model for energy management of multiple VPPs. The dynamic pricing of the leader and energy management of the follower VPPs are presented in
Section 3 and
Section 4, respectively. The Stackelberg game-based optimal scheduling model for multiple VPPs is described in
Section 5. In
Section 6, a case study is conducted, and the results are discussed. Finally, conclusions are drawn in
Section 7.
2. Stackelberg Game Model for Energy Management of Multiple VPPs
In the scenario of clustered multiple-VPP operation, each VPP acts as an independent energy aggregation unit. The output of its internal Distributed Energy Resources (DERs) and load demand exhibit real-time dynamic fluctuations, frequently leading to an imbalance between internal power generation and consumption within a given time interval. To address this challenge, this work employs a multi-VPP method for energy management.
To facilitate orderly energy interaction between multiple VPPs and the distribution system, a transaction mechanism involving the Distribution System Operator (DSO) and multiple VPPs is designed, as illustrated in
Figure 1. As the core dispatching and transaction entity, the DSO is responsible for formulating unified transaction prices for electricity purchase and sale. VPPs with power surplus can sell excess electricity to the DSO at the electricity selling price, thereby monetizing surplus energy. Conversely, power-deficit VPPs can purchase the required electricity from the DSO at the purchase price, ensuring a reliable power supply for their internal loads. Meanwhile, based on the aggregated power purchase and sale data reported by each VPP, the DSO conducts two-way transactions with the power market by leveraging the grid price (the price for purchasing electricity from the microgrid) and the grid feed-in price (the price for selling electricity to the microgrid) in the power market. The DSO generates revenue through the transaction price difference, forming a three-level closed-loop energy transaction system of “VPP-DSO-Power Market”.
Considering the distinct investment and operational entities of the DSO and VPPs, their divergent interest demands, and the DSO’s role as the distribution system manager, a leader–follower hierarchical Stackelberg game framework is constructed in this work, as shown in
Figure 2. The roles are defined as follows:
Leader (DSO): As the dominant player in the game, the DSO first aggregates the initial electricity purchase and sale demand reported by each VPP. Subsequently, considering power market grid prices, system operational constraints, and the price response characteristics of VPPs, it formulates differentiated electricity purchase and sale prices for each VPP with the objective of maximizing its own net profit.
Followers (Multiple VPPs): Each VPP acts as an independent follower in the game. Upon receiving the transaction prices set by the DSO, each VPP optimizes the output schedules of its internal DERs and load response strategies based on its own resource endowments (such as DER output potential, energy storage status, and adjustable capacity of flexible loads). With the objective of minimizing its own operation cost, each VPP determines the final volume of electricity to be purchased from or sold to the DSO.
In this game process, the decision-making of the leader and followers exhibits a clear sequential hierarchy: the DSO determines the transaction prices first, followed by the response of the multiple VPPs who formulate their energy management strategies, thus constituting a typical Stackelberg game. Furthermore, as parallel followers, the VPPs make decisions simultaneously to pursue individual optimality without information exchange regarding their specific electricity volumes and DER output strategies, thereby forming a non-cooperative game relationship. These components together constitute a hybrid “leader–follower–non-cooperative” game system.
The Stackelberg game model for the DSO and multiple VPPs constructed in this work consists of three core components: players, strategy spaces, and utility functions, which are defined in detail as follows:
- (1)
Players: The core participants in the game are the leader (DSO) and the followers (multiple VPPs). The DSO, as a single leader, undertakes the functions of system dispatching and transaction pricing. The multiple VPPs, acting as multiple independent followers, each possess internal DERs (including distributed generators, energy storage systems, flexible loads, etc.) and possess the capability for independent decision-making regarding energy management.
- (2)
Strategies: A strategy refers to the decision variables adopted by each player to achieve their respective goals. For the leader (DSO), the core strategies are the electricity purchase price and sale price offered to the multiple VPPs. These prices must be set within a reasonable range constrained by the power market price interval and system supply–demand balance requirements. For the followers (VPPs), their core strategies encompass two aspects: first, the transaction volume with the DSO (i.e., the electricity sales volume for power-surplus VPPs and the electricity purchase volume for power-deficit VPPs); and second, the operational schedules of internal DERs (such as the output of distributed generators, the charging/discharging power of energy storage systems, and the load transfer of flexible loads). All strategy variables must satisfy the respective equipment constraints and operational rules of the VPPs.
- (3)
Utility Functions: A utility function represents the objective-oriented metric for each player. The DSO’s utility objective is to maximize its own net profit, which is achieved by formulating optimal transaction prices to balance the transaction revenue from multiple VPPs against the transaction cost within the power market. The multiple VPPs’ utility objective is to minimize their own comprehensive operation cost, which covers multi-dimensional expenditures such as energy purchase cost, fuel consumption cost, carbon trading cost, equipment operation and maintenance cost, and compensation cost. The utility functions of both parties will be described in detail in
Section 3 and
Section 4, respectively.
4. Stackelberg Game Model for Energy Management of the Follower VPPs
4.1. Strategy
The game strategy of a VPP consists of its operational schedule for each time interval, including the electricity sold to the DSO (), the electricity purchased from the DSO (), the output power of micro gas turbines (), the charging/discharging power of energy storage (ES) systems (), the power of flexible loads (), and the output power of renewable energy sources (photovoltaic systems () and wind turbines ()). These variables are denoted as , where Nj represents the set of distributed energy resources (DERs) contained in VPP j.
4.2. Utility Function
The VPP’s utility function in the game aims to minimize its operational cost, which comprises the electricity transaction cost (
), MT’s fuel cost (
), carbon trading cost (
), operation and maintenance (O&M) cost (
), revenue from renewable energy generation (
), and compensation cost for flexible load demand response (
). For an arbitrary VPP
j, its utility function is expressed as:
4.2.1. Electricity Transaction Cost
To maintain power balance, VPPs engage in transactions with the upper-level grid by purchasing electricity to meet load demand when its internal generation is insufficient, and selling surplus electricity to the grid when generation exceeds demand. Thus, the electricity transaction cost within the grid is expressed by:
where
denotes the scheduling coefficient of energy storage.
4.2.2. Fuel Cost
The micro gas turbine (MT), fueled by natural gas, exhibits variable-load efficiency characteristics. The relationship between its output power and fuel cost is formulated as a quadratic function [
28]:
where
,
and
are the cost coefficients of the MT.
4.2.3. Carbon Trading Cost
Carbon emissions are generally generated throughout the production, transportation, and consumption processes of various energy sources, including CO
2 emissions from the MTs fueled by natural gas, as well as CO
2 emissions associated with the operation of photovoltaics (PV), wind power, and energy storage systems. Accordingly, the equivalent carbon emissions can be formulated as follows:
where
,
,
, and
represent the output power of the MT, PV system, WT, and the charging/discharging power of energy storage in VPP j at time
t, respectively;
is the absolute transaction power of VPP j with the DSO at time
t;
,
,
,
, and
denote the CO
2 emission factors of the MT, PV, WT, ES and the absolute transaction power, respectively.
To reduce carbon emissions, carbon quotas are typically incorporated into the carbon trading mechanism as free carbon emission allowances allocated to market participants; any excess emissions must be purchased through the market [
29,
30].
Generally, there are two common methods for allocating carbon quotas: free allocation and paid allocation. Free allocation refers to granting a specific emission quota to the system in advance at no cost, in order to enhance its willingness to participate; whereas paid allocation requires the system to pay corresponding fees for its own carbon emissions. In this work, to encourage the VPP participation in the carbon market, free allocation is adopted, and carbon emission quotas are formulated as follows:
where
,
,
and
are the conversion coefficients of the carbon allowance allocation corresponding to the aforementioned energy generation and consumption processes, respectively.
The carbon trading mechanism requires energy suppliers to control their carbon emissions within their carbon allowances. Any emissions exceeding the allocated allowances must be offset by purchasing carbon credits from the market to avoid penalties. This work adopts a stepwise carbon trading mechanism, which sets price intervals based on the volume of carbon emissions: the unit price increases as the purchase volume rises. The stepwise carbon price is defined as:
where
k is the number of price steps;
is the carbon trading price for the
k-th step;
is the basic unit price of carbon trading. Accordingly, the carbon trading cost is calculated as:
where
L denotes the step length;
is the carbon allowance allocated to VPP
j, and
is the total carbon emissions of VPP
j.
4.2.4. Operation and Maintenance (O&M) Cost
In the integrated energy system, PV systems, WTs, and energy storage systems all incur corresponding O&M costs, which can be expressed as:
where
,
and
denotes the O&M cost coefficients for the PV system, WT and energy storage, respectively.
4.2.5. Compensation Cost
Since users have different willingness to shift or curtail their electrical loads, compensation is provided to incentivize user participation in flexible load curtailment. The compensation cost is given by:
where
represents the power of curtailable electrical loads in VPP
j at time
t; and
is the compensation cost coefficient of curtailed electrical loads.
4.2.6. Revenue from Renewable Energy Generation
To encourage renewable energy generation, certified “carbon assets” or Green Electricity Certificates (GECs) can be generated after verification by authorized institutions. These assets can be sold in the carbon emission trading market or the green electricity market to enterprises with carbon emission quota constraints, thereby generating revenue:
where
is the quantification coefficient for converting renewable energy generation into Green Electricity Certificates.
4.3. Strategy Space
In responding to the prices released by the DSO, the VPP must satisfy the power balance constraint and the operational constraints of each DER, while pursuing its own profit. The VPP’s strategy space is denoted as , j = 1, 2…, N.
4.3.1. Energy Balance Constraint
To achieve the collaborative optimization between the DSO and the VPP and maximize self-interests while meeting user demand, the energy balance constraint must be strictly satisfied as follows:
where
is the predicted load value of VPP j at time
t.
4.3.2. Energy Storage Constraints
The energy storage constraints cover the charging and discharging operational constraints of the electrical energy storage battery, which are given as follows:
where
is the state of charge (SOC) of energy storage in VPP j at time
t;
and
are the lower and upper limits of the SOC, respectively;
and
are the charging and discharging power of energy storage, respectively;
is the maximum energy capacity of the energy storage system (energy losses are neglected in these constraints); and
is the time interval from time
t to time
t + 1.
4.3.3. Power Upper/Lower Limit and Ramp Rate Constraints
To achieve economic dispatch and efficient operation for a system with diversified flexible resources, the following constraints must be satisfied.
where
is a binary variable:
= 1 indicates that VPP
j sells electricity to the DSO at time
t, while
= 0 indicates that VPP j purchases electricity from the DSO at time
t;
is the maximum transaction volume between VPP j and the DSO;
is the maximum output power of the MT;
and
are the downward and upward ramp rates of the MT, respectively;
and
are the maximum output power of the WT and PV at time
t, respectively, taken as the predicted wind and PV power value for VPP
j.
7. Conclusions
Addressing the critical issues of low-carbon economic optimization and collaborative management for multiple Virtual Power Plants (VPPs), this work proposes an optimal dispatch method based on a Stackelberg game. A bi-level game architecture is constructed with the DSO as the leader and multiple VPPs as followers, encompassing the complete research process of model formulation, constraint design, and algorithmic solution. The results indicate that the constructed Stackelberg game model effectively characterizes the interest interaction between the DSO and multiple VPPs. The leader (DSO) maximizes its own utility through a dynamic pricing strategy while providing scientific guidance for the collaborative operation of multiple VPPs. Subject to constraints such as energy balance, energy storage operation, power conversion, and flexible load response, the follower VPPs formulate energy management strategies to achieve individual benefit optimization, comprehensively considering multi-dimensional costs including energy transaction, fuel, carbon trading, operation and maintenance, and compensation, as well as new energy generation revenues. By efficiently solving the game model using the NCHHO algorithm and CPLEX solver, an equilibrium solution satisfying the interests of both parties is successfully obtained. Ultimately, while ensuring the safe and stable operation of the distribution system, the proposed method balances the profit between the DSO and VPPs and incentivizes renewable energy consumption and indirect carbon reduction. It achieves the dual goals of a low-carbon economy and collaborative management, providing a theoretical basis and technical reference for the large-scale low-carbon collaborative operation of multi-VPP clusters. Future work could further extend to the optimization of game models in scenarios involving high-penetration renewable energy and multi-energy coupling, integrate battery life-cycle degradation into the VPP utility function, and consider the impact of uncertainty factors on dispatch strategies to enhance the robustness and engineering applicability of the method. Moreover, we intend to further investigate low-carbon dispatch issues to achieve more substantial emission reductions.
Additionally, the limitations of the non-cooperative assumption should be acknowledged. In practice, VPPs might form coalitions to exert market power. Such cooperative behaviors could lead to higher clearing prices and altered dispatch strategies, potentially reducing the DSO’s revenue and system efficiency. Investigating the impact of such coalitions requires a cooperative game-theoretic framework, which is also a direction for future research.