Low-Carbon Economic Optimization and Collaborative Management of Virtual Power Plants Based on a Stackelberg Game

Yang, Bing; Zhou, Dongguo

doi:10.3390/en19081821

Open AccessArticle

Low-Carbon Economic Optimization and Collaborative Management of Virtual Power Plants Based on a Stackelberg Game

by

Bing Yang

¹ and

Dongguo Zhou

^2,*

¹

Dongfang Electric (Chengdu) Innovation Research, Co., Ltd., Chengdu 610218, China

²

School of Robotics, Wuhan University, Wuhan 430072, China

^*

Author to whom correspondence should be addressed.

Energies 2026, 19(8), 1821; https://doi.org/10.3390/en19081821

Submission received: 2 March 2026 / Revised: 31 March 2026 / Accepted: 5 April 2026 / Published: 8 April 2026

(This article belongs to the Special Issue Advanced Control and Operation of Distributed Energy Resources in Modern Power Systems)

Download

Browse Figures

Versions Notes

Abstract

To address the challenges of low-carbon economic optimization and collaborative management for multiple Virtual Power Plants (VPPs), this paper proposes a low-carbon economic optimization and collaborative management method based on a Stackelberg game framework. Firstly, a Stackelberg game model is constructed with the Distribution System Operator (DSO) as the leader and multiple VPPs as followers. The leader (DSO) guides the followers’ behavior through dynamic pricing strategies to maximize its own utility. Meanwhile, the followers (VPPs) develop energy management strategies to minimize their individual costs, taking into account factors such as energy transaction costs, fuel costs, carbon trading costs, operation and maintenance (O&M) costs, compensation costs, and renewable energy generation revenues. Furthermore, the strategy spaces of all participants are defined, and an optimization model is established subjected to constraints including energy balance, energy storage operation, power conversion, and flexible load response. The CPLEX solver and Nonlinear-based Chaotic Harris Hawks Optimization (NCHHO) algorithm are employed to solve the proposed game model. Simulation results demonstrate that the proposed method effectively facilitates collaboration between the DSO and multiple VPPs. While ensuring the safe operation of the system, it balances the profit between the DSO and VPPs, and incentivizes renewable energy consumption and indirect carbon reduction, thereby validating the effectiveness and superiority of the method and providing reliable technical support for the low-carbon collaborative operation of multiple VPPs.

Keywords:

virtual power plant; Stackelberg game; low-carbon economy; flexible load; collaborative management

1. Introduction

Driven by the global goals of carbon neutrality and carbon peaking, the energy system is undergoing a profound low-carbon transition. The large-scale grid integration of renewable energy and the clustered operation of distributed energy resources have emerged as pivotal trends in future energy development [1,2]. As a critical enabler for integrating distributed generators, energy storage systems, flexible loads, and other resources, Virtual Power Plants (VPPs) transform distributed energy from “disordered grid connection” to “ordered collaboration” by aggregating the regulation potential of scattered energy resources. This provides an effective pathway for enhancing the flexibility of power systems and promoting the accommodation of renewable energy [3,4]. With the continuous expansion in the number and scale of VPPs, the collaborative operation of multi-VPP clusters has increasingly become a critical research topic in power system dispatch and management [5,6].

As the core link connecting the power supply side and the demand side, the operational efficiency of a distribution system directly impacts the accommodation level of distributed energy and the overall system benefits. Distribution System Operators (DSOs), acting as the managers and dispatchers of distribution systems, are responsible for coordinating the operational behaviors of multiple VPPs, balancing supply and demand, reducing carbon emissions, and controlling operational costs [7]. However, each VPP operates as an independent market entity with the core objective of maximizing its own benefits. Its energy management strategies (such as generation schedule formulation, load regulation, and energy storage charging/discharging control) are not only constrained by internal resource endowments but also closely coupled with the decisions of other VPPs and the dispatching strategies of the DSO [8]. This complex interaction, characterized by multiple stakeholders, conflicting objectives and diverse constraints, renders traditional centralized dispatching methods inadequate for balancing the interests of all parties, resulting in limited feasibility and economic efficiency of dispatching schemes [9]. Furthermore, with the gradual maturation of carbon trading markets, carbon emission costs have evolved into a significant component of the energy system operational costs. Consequently, integrating carbon constraints into the collaborative dispatching process of multiple VPPs further accentuates the complexity of the optimization problem, which is driven by both economic and low-carbon objectives [10,11].

Currently, extensive research has been conducted on the optimal dispatching and collaborative management of VPPs. Regarding the optimal dispatching of a single VPP, existing studies primarily focus on the optimal allocation of internal resources. By establishing optimization models that incorporate factors such as the uncertainty of renewable energy output and energy storage operation constraints, these studies aim to minimize costs or maximize benefits [12,13]. Reference [14] constructs an intraday optimal dispatching model for VPPs considering wind power output fluctuations, which is solved using the particle swarm optimization algorithm, effectively improving the operational economy of VPPs. Reference [15] incorporates carbon trading costs into the VPP utility function and proposes a dispatching strategy that balances economic and low-carbon objectives. However, such studies typically overlook the interactive relationships between multiple VPPs and thus cannot adapt to the operational requirements of multi-VPP clusters. In the field of multi-VPP collaborative management, the energy resources among VPPs exhibit spatiotemporal complementarity [16]. Therefore, conducting collaborative optimization that accounts for multi-VPP clusters in a unified regional distribution network can achieve interest balance and optimization among VPPs, leading to better economic benefits. Reference [17] proposes a low-carbon optimal dispatching method for distribution systems with multiple VPPs, reducing system carbon emissions by coordinating the output plans of each VPP. Furthermore, distributed coordination methods achieve the alignment of local optimality and global optimization through information interaction and autonomous decision-making among multiple subjects. Reference [18] adopts the Alternating Direction Method of Multipliers (ADMM) to realize distributed optimization of multiple VPPs, balancing dispatching efficiency and privacy protection. Nevertheless, existing distributed methods mostly focus on economic objective optimization, with insufficient consideration of low-carbon constraints. Moreover, they fail to effectively characterize the interest game relationships among participants, resulting in inadequate fairness and feasibility of collaborative dispatching schemes.

As an effective tool for analyzing interest interactions among multiple entities, game theory provides a novel solution for the multi-VPP collaborative management [19]. In particular, the Stackelberg game, a non-cooperative game model, effectively coordinates interest conflicts between entities by characterizing the hierarchical “leader–follower” decision-making relationship [20]. In the power system field, the Stackelberg game has been widely applied to multi-entity collaboration problems, such as those involving power grids and distributed generators, as well as microgrids and load aggregators [6,7,12]. Reference [21] designs a transaction mechanism between electricity retailers and VPPs based on the Stackelberg game, enhancing the economic benefits of both parties. However, existing studies utilizing the Stackelberg game mostly target single VPPs or two-party interaction scenarios, lacking a systematic analysis of multi-VPP cluster collaboration. Furthermore, they do not fully integrate carbon trading mechanisms and complex constraints (such as energy storage operation and flexible load response), making it difficult to meet the practical requirements for the low-carbon economic collaborative operation of multiple VPPs [22,23].

Additionally, research on multi-VPP collaborative management has yielded substantial results, with scholars worldwide focusing on guiding distributed resources to participate in market games through carbon trading mechanisms [24,25,26]. However, the existing research lacks an in-depth investigation into the integration of stepwise carbon trading mechanisms and flexible load constraints within a multi-VPP Stackelberg game framework [27]. Specifically, the dynamic game relationship between carbon trading costs and adaptive load response strategies has been overlooked, resulting in sub-optimal economic and environmental synergies in low-carbon economic dispatch strategies. Therefore, this work proposes a Stackelberg game-based method for the low-carbon economic optimization and collaborative management of multiple VPPs.

The main research contents are as follows:

(1): A Stackelberg game model for multi-VPP energy management is constructed, clarifying the DSO as the leader and multiple VPPs as the followers, to characterize the interest interaction relationship between the two parties.
(2): A dynamic pricing game model for the leader (DSO) and an energy management game model for the followers (VPPs) are established, respectively. The VPP utility function comprehensively considers multi-dimensional factors such as energy transaction costs, fuel costs, and carbon trading costs. Furthermore, a strategy space subject to constraints regarding energy balance, energy storage operation, power conversion, and flexible load response is constructed.
(3): A solution approach integrating the Nonlinear-based Chaotic Harris Hawks Optimization (NCHHO) algorithm and CPLEX solver is introduced to efficiently obtain the equilibrium solution of the Stackelberg game, thereby achieving a win–win outcome for both the DSO and multiple VPPs.

The main innovations of this paper are reflected in three aspects: First, a multi-VPP Stackelberg game framework balancing the profit between the DSO and VPPs is constructed. By incorporating carbon trading costs into the VPP utility function, the collaborative optimization of low-carbon economic objectives is realized. Second, multi-constraint conditions such as energy storage operation and flexible load response are systematically considered, improving the description of the VPP strategy space and enhancing the engineering feasibility of dispatching schemes. Third, the NCHHO algorithm is successfully applied to the problem of optimization for multi-VPP collaborative dispatching.

The remainder of this paper is organized as follows: Section 2 provides a Stackelberg game model for energy management of multiple VPPs. The dynamic pricing of the leader and energy management of the follower VPPs are presented in Section 3 and Section 4, respectively. The Stackelberg game-based optimal scheduling model for multiple VPPs is described in Section 5. In Section 6, a case study is conducted, and the results are discussed. Finally, conclusions are drawn in Section 7.

2. Stackelberg Game Model for Energy Management of Multiple VPPs

In the scenario of clustered multiple-VPP operation, each VPP acts as an independent energy aggregation unit. The output of its internal Distributed Energy Resources (DERs) and load demand exhibit real-time dynamic fluctuations, frequently leading to an imbalance between internal power generation and consumption within a given time interval. To address this challenge, this work employs a multi-VPP method for energy management.

To facilitate orderly energy interaction between multiple VPPs and the distribution system, a transaction mechanism involving the Distribution System Operator (DSO) and multiple VPPs is designed, as illustrated in Figure 1. As the core dispatching and transaction entity, the DSO is responsible for formulating unified transaction prices for electricity purchase and sale. VPPs with power surplus can sell excess electricity to the DSO at the electricity selling price, thereby monetizing surplus energy. Conversely, power-deficit VPPs can purchase the required electricity from the DSO at the purchase price, ensuring a reliable power supply for their internal loads. Meanwhile, based on the aggregated power purchase and sale data reported by each VPP, the DSO conducts two-way transactions with the power market by leveraging the grid price (the price for purchasing electricity from the microgrid) and the grid feed-in price (the price for selling electricity to the microgrid) in the power market. The DSO generates revenue through the transaction price difference, forming a three-level closed-loop energy transaction system of “VPP-DSO-Power Market”.

Considering the distinct investment and operational entities of the DSO and VPPs, their divergent interest demands, and the DSO’s role as the distribution system manager, a leader–follower hierarchical Stackelberg game framework is constructed in this work, as shown in Figure 2. The roles are defined as follows:

Leader (DSO): As the dominant player in the game, the DSO first aggregates the initial electricity purchase and sale demand reported by each VPP. Subsequently, considering power market grid prices, system operational constraints, and the price response characteristics of VPPs, it formulates differentiated electricity purchase and sale prices for each VPP with the objective of maximizing its own net profit.

Followers (Multiple VPPs): Each VPP acts as an independent follower in the game. Upon receiving the transaction prices set by the DSO, each VPP optimizes the output schedules of its internal DERs and load response strategies based on its own resource endowments (such as DER output potential, energy storage status, and adjustable capacity of flexible loads). With the objective of minimizing its own operation cost, each VPP determines the final volume of electricity to be purchased from or sold to the DSO.

In this game process, the decision-making of the leader and followers exhibits a clear sequential hierarchy: the DSO determines the transaction prices first, followed by the response of the multiple VPPs who formulate their energy management strategies, thus constituting a typical Stackelberg game. Furthermore, as parallel followers, the VPPs make decisions simultaneously to pursue individual optimality without information exchange regarding their specific electricity volumes and DER output strategies, thereby forming a non-cooperative game relationship. These components together constitute a hybrid “leader–follower–non-cooperative” game system.

The Stackelberg game model for the DSO and multiple VPPs constructed in this work consists of three core components: players, strategy spaces, and utility functions, which are defined in detail as follows:

(1): Players: The core participants in the game are the leader (DSO) and the followers (multiple VPPs). The DSO, as a single leader, undertakes the functions of system dispatching and transaction pricing. The multiple VPPs, acting as multiple independent followers, each possess internal DERs (including distributed generators, energy storage systems, flexible loads, etc.) and possess the capability for independent decision-making regarding energy management.
(2): Strategies: A strategy refers to the decision variables adopted by each player to achieve their respective goals. For the leader (DSO), the core strategies are the electricity purchase price and sale price offered to the multiple VPPs. These prices must be set within a reasonable range constrained by the power market price interval and system supply–demand balance requirements. For the followers (VPPs), their core strategies encompass two aspects: first, the transaction volume with the DSO (i.e., the electricity sales volume for power-surplus VPPs and the electricity purchase volume for power-deficit VPPs); and second, the operational schedules of internal DERs (such as the output of distributed generators, the charging/discharging power of energy storage systems, and the load transfer of flexible loads). All strategy variables must satisfy the respective equipment constraints and operational rules of the VPPs.
(3): Utility Functions: A utility function represents the objective-oriented metric for each player. The DSO’s utility objective is to maximize its own net profit, which is achieved by formulating optimal transaction prices to balance the transaction revenue from multiple VPPs against the transaction cost within the power market. The multiple VPPs’ utility objective is to minimize their own comprehensive operation cost, which covers multi-dimensional expenditures such as energy purchase cost, fuel consumption cost, carbon trading cost, equipment operation and maintenance cost, and compensation cost. The utility functions of both parties will be described in detail in Section 3 and Section 4, respectively.

3. Stackelberg Game Model for Dynamic Pricing of the Leader (DSO)

3.1. Strategy

The strategy adopted by the DSO consists of the electricity purchase prices

λ_{t}^{D A, b}

and sale prices

λ_{t}^{D A, s}

formulated for each VPP at time t, denoted as

λ^{D A, b} = (λ_{1}^{D A, b}, λ_{2}^{D A, b}, \dots, λ_{T}^{D A, b})

and

λ^{D A, s} = (λ_{1}^{D A, s}, λ_{2}^{D A, s}, \dots, λ_{T}^{D A, s})

, respectively, where T represents the number of time intervals in a day.

3.2. Utility Function

The DSO’s utility function aims to maximize its net profit, which accounts for the costs and revenues from electricity transactions with both the power market and VPPs. The objective function is expressed as follows:

\max F^{D S O} = \sum_{t = 1}^{T} (λ_{t}^{W, s} P_{t}^{D S O, s} - λ_{t}^{W, b} P_{t}^{D S O, b} + λ_{t}^{D A, b} \sum_{j = 1}^{N} P_{j, t}^{V P P, b} - λ_{t}^{D A, b} \sum_{j = 1}^{N} P_{j, t}^{V P P, s})

(1)

where

λ_{t}^{W, s}

and

λ_{t}^{W, b}

are the grid sale price (price for selling to the power market) and grid purchase price (price for purchasing from the power market) at time t, respectively;

P_{j, t}^{V P P, b}

and

P_{j, t}^{V P P, s}

represent the electricity sold to and purchased from the DSO by VPP j at time t, respectively;

P_{t}^{D S O, s}

and

P_{t}^{D S O, b}

denote the electricity sold to and purchased from the power market by the DSO at time t, respectively; and N is the total number of VPPs.

To ensure the supply–demand balance among all VPPs,

P_{t}^{D S O, s}

and

P_{t}^{D S O, b}

are defined as follows:

\{\begin{cases} P_{t}^{D S O} = \sum_{j = 1}^{N} (P_{j, t}^{V P P, b} - P_{j, t}^{V P P, s}) \\ P_{t}^{D S O, b} = \max (0, P_{t}^{D S O}) \\ P_{t}^{D S O, s} = \min (0, - P_{t}^{D S O}) \end{cases}

(2)

where

P_{t}^{D S O}

is the total electricity transacted between the DSO and the power market after aggregating all the VPPs’ purchases and sales of electricity. A positive value of

P_{t}^{D S O}

indicates that the DSO purchases electricity from the power market, while a negative value indicates that the DSO sells electricity to the power market.

3.3. Strategy Space

To incentivize VPPs to participate in transactions with the DSO, the electricity purchase and sale prices formulated by the DSO must satisfy the following constraint:

λ_{t}^{W, s} \leq λ_{t}^{D A, s} \leq λ_{t}^{D A, b} \leq λ_{t}^{W, b}

(3)

In Equation (3), the DSO’s electricity purchase price

λ_{t}^{D A, b}

must not exceed the grid purchase price

λ_{t}^{W, b}

, and its electricity sale price

λ_{t}^{D A, s}

must not be lower than the grid sale price

λ_{t}^{W, s}

. This constraint ensures that VPPs will choose to transact with the DSO to maximize their own interests. Consequently, the DSO’s strategy space is thus defined by Equation (3), denoted as

Ω^{D S O}

.

4. Stackelberg Game Model for Energy Management of the Follower VPPs

4.1. Strategy

The game strategy of a VPP consists of its operational schedule for each time interval, including the electricity sold to the DSO (

P_{j, t}^{V P P, s}

), the electricity purchased from the DSO (

P_{j, t}^{V P P, b}

), the output power of micro gas turbines (

P_{i, t}^{M T}

), the charging/discharging power of energy storage (ES) systems (

P_{i, t}^{E S}

), the power of flexible loads (

P_{i, t}^{F l e x}

), and the output power of renewable energy sources (photovoltaic systems (

P_{i, t}^{P V}

) and wind turbines (

P_{i, t}^{W T}

)). These variables are denoted as

p_{j} = (P_{j, t}^{V P P, s}, P_{j, t}^{V P P, b}, {(P_{i, t}^{M T}, P_{i, t}^{E S}, P_{i, t}^{F l e x}, P_{i, t}^{P V}, P_{i, t}^{W T})}_{i \in N_{j}})

,

t = 1 : T

where N_j represents the set of distributed energy resources (DERs) contained in VPP j.

4.2. Utility Function

The VPP’s utility function in the game aims to minimize its operational cost, which comprises the electricity transaction cost (

C_{j}^{N E T}

), MT’s fuel cost (

C_{j}^{M T}

), carbon trading cost (

C_{j}^{C O 2}

), operation and maintenance (O&M) cost (

C_{j}^{O P}

), revenue from renewable energy generation (

C_{j}^{G r e e n}

), and compensation cost for flexible load demand response (

C_{i, t}^{F l e x}

). For an arbitrary VPP j, its utility function is expressed as:

\min C_{j}^{V P P} = C_{j}^{N E T} + C_{j}^{M T} + C_{j}^{C O 2} + C_{j}^{O P} + C_{j}^{F l e x} - C_{j}^{G r e e n}

(4)

4.2.1. Electricity Transaction Cost

To maintain power balance, VPPs engage in transactions with the upper-level grid by purchasing electricity to meet load demand when its internal generation is insufficient, and selling surplus electricity to the grid when generation exceeds demand. Thus, the electricity transaction cost within the grid is expressed by:

C_{j}^{N E T} = \sum_{t = 1}^{T} (λ_{t}^{D A, b} P_{j, t}^{V P P, b} - λ_{t}^{D A, s} P_{j, t}^{V P P, s} + λ_{E S S}^{N E T} {(P_{j, t}^{E S})}^{2})

(5)

where

λ_{E S S}^{N E T}

denotes the scheduling coefficient of energy storage.

4.2.2. Fuel Cost

The micro gas turbine (MT), fueled by natural gas, exhibits variable-load efficiency characteristics. The relationship between its output power and fuel cost is formulated as a quadratic function [28]:

C_{i, t}^{M T} = a_{i} {(P_{j, t}^{M T})}^{2} + b_{i} P_{j, t}^{M T} + c_{i}

(6)

where

a_{i}

,

b_{i}

and

c_{i}

are the cost coefficients of the MT.

4.2.3. Carbon Trading Cost

Carbon emissions are generally generated throughout the production, transportation, and consumption processes of various energy sources, including CO₂ emissions from the MTs fueled by natural gas, as well as CO₂ emissions associated with the operation of photovoltaics (PV), wind power, and energy storage systems. Accordingly, the equivalent carbon emissions can be formulated as follows:

E^{C O_{2}} = \sum_{t = 1}^{T} (η_{M T} P_{j, t}^{M T} + η_{P V} P_{j, t}^{P V} + η_{W T} P_{j, t}^{W T} + η_{E S} P_{j, t}^{E S} + η_{V P P} P_{j, t}^{V P P})

(7)

where

P_{j, t}^{M T}

,

P_{j, t}^{P V}

,

P_{j, t}^{W T}

, and

P_{j, t}^{E S}

represent the output power of the MT, PV system, WT, and the charging/discharging power of energy storage in VPP j at time t, respectively;

P_{j, t}^{V P P}

is the absolute transaction power of VPP j with the DSO at time t;

η_{M T}

,

η_{P V}

,

η_{W T}

,

η_{E S}

, and

η_{V P P}

denote the CO₂ emission factors of the MT, PV, WT, ES and the absolute transaction power, respectively.

To reduce carbon emissions, carbon quotas are typically incorporated into the carbon trading mechanism as free carbon emission allowances allocated to market participants; any excess emissions must be purchased through the market [29,30].

Generally, there are two common methods for allocating carbon quotas: free allocation and paid allocation. Free allocation refers to granting a specific emission quota to the system in advance at no cost, in order to enhance its willingness to participate; whereas paid allocation requires the system to pay corresponding fees for its own carbon emissions. In this work, to encourage the VPP participation in the carbon market, free allocation is adopted, and carbon emission quotas are formulated as follows:

E^{A l l o c} = \sum_{t = 1}^{T} (γ_{M T} P_{t}^{M T} + γ_{P V} P_{t}^{P V} + γ_{W T} P_{t}^{W T} + γ_{V P P} P_{t}^{V P P})

(8)

where

γ_{M T}

,

γ_{P V}

,

γ_{W T}

and

γ_{V P P}

are the conversion coefficients of the carbon allowance allocation corresponding to the aforementioned energy generation and consumption processes, respectively.

The carbon trading mechanism requires energy suppliers to control their carbon emissions within their carbon allowances. Any emissions exceeding the allocated allowances must be offset by purchasing carbon credits from the market to avoid penalties. This work adopts a stepwise carbon trading mechanism, which sets price intervals based on the volume of carbon emissions: the unit price increases as the purchase volume rises. The stepwise carbon price is defined as:

λ_{k}^{C O_{2}} = λ_{b a s e}^{C O_{2}} (\frac{1}{4} (k - 1) + 1)

(9)

where k is the number of price steps;

λ_{k}^{C O_{2}}

is the carbon trading price for the k-th step;

λ_{b a s e}^{C O_{2}}

is the basic unit price of carbon trading. Accordingly, the carbon trading cost is calculated as:

C_{j}^{C O 2} = \sum_{k = 1}^{K} λ_{k}^{C O_{2}} \min (\max ((E_{j}^{C O 2} - E_{j}^{A l l o c}) - L \times (k - 1), 0), L)

(10)

where L denotes the step length;

E_{j}^{A l l o c}

is the carbon allowance allocated to VPP j, and

E_{j}^{C O 2}

is the total carbon emissions of VPP j.

4.2.4. Operation and Maintenance (O&M) Cost

In the integrated energy system, PV systems, WTs, and energy storage systems all incur corresponding O&M costs, which can be expressed as:

C_{j}^{O P} = \sum_{t = 1}^{T} [λ^{P V} P_{j, t}^{P V} + λ^{W T} P_{j, t}^{W T} + λ^{E S} P_{j, t}^{E S}]

(11)

where

λ^{P V}

,

λ^{W T}

and

λ^{E S}

denotes the O&M cost coefficients for the PV system, WT and energy storage, respectively.

4.2.5. Compensation Cost

Since users have different willingness to shift or curtail their electrical loads, compensation is provided to incentivize user participation in flexible load curtailment. The compensation cost is given by:

C_{j}^{F l e x} = \sum_{t = 1}^{T} (λ_{c o s t}^{F l e x} P_{j, t}^{F l e x})

(12)

where

P_{j, t}^{F l e x}

represents the power of curtailable electrical loads in VPP j at time t; and

λ_{c o s t}^{F l e x}

is the compensation cost coefficient of curtailed electrical loads.

4.2.6. Revenue from Renewable Energy Generation

To encourage renewable energy generation, certified “carbon assets” or Green Electricity Certificates (GECs) can be generated after verification by authorized institutions. These assets can be sold in the carbon emission trading market or the green electricity market to enterprises with carbon emission quota constraints, thereby generating revenue:

C_{j}^{G r e e n} = λ^{g r e e n} \sum_{t = 1}^{T} (P_{j, t}^{P V} + P_{j, t}^{W T})

(13)

where

λ^{g r e e n}

is the quantification coefficient for converting renewable energy generation into Green Electricity Certificates.

4.3. Strategy Space

In responding to the prices released by the DSO, the VPP must satisfy the power balance constraint and the operational constraints of each DER, while pursuing its own profit. The VPP’s strategy space is denoted as

Ω_{j}^{V P P}

, j = 1, 2…, N.

4.3.1. Energy Balance Constraint

To achieve the collaborative optimization between the DSO and the VPP and maximize self-interests while meeting user demand, the energy balance constraint must be strictly satisfied as follows:

P_{j, t}^{V P P, b} - P_{j, t}^{V P P, s} + P_{j, t}^{M T} + P_{j, t}^{E S} + P_{j, t}^{F l e x} + P_{j, t}^{W T} + P_{j, t}^{P V} = P_{j, t}^{L D}

(14)

where

P_{j, t}^{L D}

is the predicted load value of VPP j at time t.

4.3.2. Energy Storage Constraints

The energy storage constraints cover the charging and discharging operational constraints of the electrical energy storage battery, which are given as follows:

P_{j, \min}^{E S} \leq P_{j, t}^{E S} \leq P_{j, \max}^{E S}

(15)

S_{j, \min}^{E S} = S_{j, t - 1}^{E S} - \frac{Δ t}{E_{j, \max}} P_{j, t}^{E S}

(16)

S_{j, \min}^{E S} \leq S_{j, t}^{E S} \leq S_{j, \max}^{E S}

(17)

S_{0}^{E S} = S_{T}^{E S}

(18)

where

S_{j, t}^{E S}

is the state of charge (SOC) of energy storage in VPP j at time t;

S_{j, \min}^{E S}

and

S_{j, \max}^{E S}

are the lower and upper limits of the SOC, respectively;

P_{j, \min}^{E S}

and

P_{j, \max}^{E S}

are the charging and discharging power of energy storage, respectively;

E_{j, \max}

is the maximum energy capacity of the energy storage system (energy losses are neglected in these constraints); and

Δ t

is the time interval from time t to time t + 1.

4.3.3. Power Upper/Lower Limit and Ramp Rate Constraints

To achieve economic dispatch and efficient operation for a system with diversified flexible resources, the following constraints must be satisfied.

0 \leq P_{j, t}^{V P P, s} \leq θ_{j, t} P_{j, \max}^{V P P}

(19)

0 \leq P_{j, t}^{V P P, b} \leq (1 - θ_{j, t}) P_{j, \max}^{V P P}

(20)

0 \leq P_{j, t}^{M T} \leq P_{j, \max}^{M T}

(21)

P_{j, d n}^{M T} \leq P_{j, t}^{M T} - P_{j, t - 1}^{M T} \leq P_{j, u p}^{M T}

(22)

0 \leq P_{j, t}^{W T} \leq P_{t, \max}^{W T}

(23)

0 \leq P_{j, t}^{P V} \leq P_{t, \max}^{P V}

(24)

where

θ_{j, t}

is a binary variable:

θ_{j, t}

= 1 indicates that VPP j sells electricity to the DSO at time t, while

θ_{j, t}

= 0 indicates that VPP j purchases electricity from the DSO at time t;

P_{j, \max}^{V P P}

is the maximum transaction volume between VPP j and the DSO;

P_{j, \max}^{M T}

is the maximum output power of the MT;

P_{j, d n}^{M T}

and

P_{j, u p}^{M T}

are the downward and upward ramp rates of the MT, respectively;

P_{t, \max}^{W T}

and

P_{t, \max}^{P V}

are the maximum output power of the WT and PV at time t, respectively, taken as the predicted wind and PV power value for VPP j.

5. Stackelberg Game-Based Optimal Scheduling Model for Multiple Virtual Power Plants

5.1. Stackelberg Game Model

According to the aforementioned analysis, the Stackelberg game model for the DSO and multiple VPPs is formulated as follows:

\begin{array}{l} \max_{λ^{D A, s}, λ^{D A, b}, p} F^{D S O} (λ^{D A, s}, λ^{D A, b}, p) \\ s . t . \{\begin{cases} (λ^{D A, s}, λ^{D A, b}) \in Ω^{D S O} \\ p_{j} = \underset{{\hat{p}}_{j}}{\arg \min} C_{j}^{V P P} (λ^{D A, s}, λ^{D A, b}, {\hat{p}}_{j}) \forall j \\ {\hat{p}}_{j} \in Ω_{j}^{V P P} \end{cases} \end{array},

(25)

where the specific forms of the objective function and constraints are consistent with the definitions provided in the aforementioned Sections.

In Equation (25), the DSO and VPPs formulate their strategies with the objectives of maximizing revenue and minimizing operational costs, respectively. The DSO’s revenue is related to the electricity purchase/sale prices it sets and the transaction volumes of the VPPs: a larger price difference between the purchase and sale prices, or a larger volume of electricity shared among VPPs, will lead to higher revenue for the DSO. However, the VPPs’ price-responsive behaviors also affect the DSO’s revenue: a higher purchase price will reduce the VPPs’ electricity purchase volume, while a lower sale price will decrease the VPPs’ electricity sale volume, resulting in a reduction in the total electricity exchanged among VPPs. Evidently, an interest game exists between the DSO and the VPPs. To maximize its own revenue, the DSO must account for the VPPs’ price-responsive behaviors and determine the optimal electricity pricing strategy by finding the Nash equilibrium solution.

5.2. Solution for Game Model

The solution process consists of two stages. In the first stage, the NCHHO algorithm [31] is used to solve the DSO’s objective function in Equation (1). In the second stage, the CPLEX solver in the YALMIP toolbox is utilized to calculate the total cost of the lower-level aggregated VPPs in Equation (4) under its multi-objective and multi-constraint conditions. The specific steps are described as follows:

(1): Data Input. Input the purchasing and selling prices of the distribution network, wind and solar power forecast data, and predicted load data. Set the constraints for the control variables.
(2): Population Initialization. Initialize the population by setting initial random variables, specifically the SO’s internal purchasing and selling prices, the quantities of electricity traded, and the SO’s internal energy storage capacity. Define the upper and lower bounds of the search space and the maximum number of iterations, iter = 30, and set the population size to 30.
(3): Position Initialization. Randomly initialize the positions of all individuals within the defined search space.
(4): Fitness Calculation. Calculate the SO’s profit and the VPP’s cost according to Equation (1) and Equation (4), respectively. The fitness value for each individual is based on this optimization.
(5): Evaluation and Update. For each individual, calculate its new fitness value and update the best individual (the best solution found so far) if a better one is found.
(6): Termination Check. Determine whether the search result meets the stopping criteria (e.g., reaching the maximum number of iterations). If met, output the optimal solution; otherwise, repeat steps (4) and (5) until the convergence condition is satisfied.

6. Case Study and Results

6.1. Case Configuration

The proposed algorithm was validated through modeling and analysis in MATLAB 2021b, aided by the YALMIP toolbox and the CPLEX solver. The case study involves a VPP cluster system designed in this work, which consists of three VPPs. Each VPP contains MT, energy storage systems (batteries), wind power and photovoltaics, with the set of distributed energy resources defined as N_j = 1. All data used in this work are derived from the simulation of a typical real-world scenario. Figure 3 illustrates the forecasted wind power, solar power and load demand for VPPs on a representative day. The technical parameters of each component are tabulated in Table 1, Table 2, Table 3, Table 4 and Table 5.

6.2. Results for the Bi-Level Game Optimization Strategy

Figure 4 shows the optimized electricity prices. It can be observed that the electricity purchase and sale prices formulated by the DSO satisfy the constraints defined in Equation (3). This ensures that VPPs choosing to transact with the DSO are instrumental in maximizing their own interests.

Figure 5 presents the results of electricity purchasing and selling volumes for both the DSO and the VPP cluster. It can be observed that, under price-based regulation, the transaction behaviors of the VPPs are driven by the objective of maximizing their own interests. Taking VPP1 as an example, electricity is purchased from the DSO during periods 1–7 when purchase prices are lower, while electricity is sold to the DSO during periods 7–14 when sale prices are higher. This phenomenon occurs because such a strategy not only minimizes the operational costs for the VPPs but also enhances the benefits for the DSO in its transactions with the upper-level power market.

Figure 6, Figure 7 and Figure 8 provide a detailed view of the internal power generation, energy storage dynamics, and electricity trading within the VPPs. It can be seen that, through the coordinated supplied by the MT, PV, WT, and energy storage, optimized scheduling both within the VPPs and across the distribution network effectively meets load demand and enhances power supply reliability. Taking VPP1 as an example, during time periods 1–2, the electricity purchase price is lower than the O&M costs of the WT; consequently, the system opts to purchase electricity directly from the DSO and utilizes the surplus to charge the energy storage. During time periods 3–7, the WT and the MT are dispatched to meet the user load demand. During time periods 8–13, driven by rising purchase and sale prices, PV and WT power are efficiently utilized, with the surplus electricity sold to the DSO to maximize profits. During time periods 14–24, characterized by high load demand, VPP1 balances the load using the output of PV, WT, storage, and the MT. Given the intermittent nature of WT and PV, their output is prioritized to meet load demand. Meanwhile, the MT, owing to its flexible scheduling characteristics, effectively compensates for the uncertainty of renewable energy output and enables on-demand power generation, as exemplified during time periods 3–7 and 17–23. Additionally, this work not only improves the interactivity of power exchange but also facilitates rapid matching of buyers and sellers during supply shortages or surpluses. This demonstrates the effectiveness of our approach.

Table 6 presents the operational cost of VPPs. It can be observed that operation and maintenance (O&M) costs account for a significant proportion of the total expenses. Fundamentally, this is attributed to the substantial output of renewable energy sources, such as wind power, which not only contributes to carbon emission reduction but also yields substantial carbon emission compensation through Green Electricity Certificates indirectly. This indicates that the proposed method, while satisfying its own economic interests, realizes low-carbon economic optimization through an energy management strategy subject to constraints including energy balance, energy storage operation, power generation and transaction, and flexible load response.

6.3. Comparison

6.3.1. Comparison with Fixed Electricity Prices

To further validate the effectiveness of the proposed model, two strategies are established for comparison analysis:

Strategy 1: This strategy serves as the baseline scenario. It does not utilize the dynamic pricing mechanism from the DSO. Instead, a fixed electricity price is uniformly applied for both purchasing and selling, corresponding to the standard grid tariff detailed in Table 7.

Strategy 2: This strategy implements the model proposed in this work, which is characterized by the adoption of dynamic prices formulated by the DSO.

Table 8 presents the economic benefits for the DSO and the VPPs under the two strategies. It can be observed that under Strategy 2, the DSO aims to maximize revenue by optimizing internal transaction electricity prices. Furthermore, since the purchase price set by the DSO does not exceed the grid electricity price, and the sale price is not lower than the feed-in tariff, the operation cost for the VPPs is reduced compared to Strategy 1.

Table 9 presents a detailed quantitative comparison of carbon emissions and revenues for each VPP under Strategy 1 and Strategy 2. It can be observed that the physical carbon emissions, which are mainly from the MTs fueled by natural gas, do not account for a large proportion. Despite variations in physical emissions, the revenues in Strategy 2 reflect the economic incentives provided by the Green Certificate mechanism. This indicates that the proposed model effectively incentivizes renewable energy consumption and indirect carbon reduction.

6.3.2. Comparison with HHO Optimization Method

To demonstrate the advantages of the method proposed in this work, its execution, adaptability, and global search capability were evaluated under the same game model. Representative results from running the NCHHO and the classic HHO algorithm [32] with the same population size and initial populations are illustrated in Figure 9. Specifically, both algorithms were executed for 10 independent runs to ensure statistical significance. Table 10 lists the statistical metrics, including the mean, standard deviation, and best fitness. These results indicate that NCHHO consistently outperforms the standard HHO in terms of solution quality, thereby confirming the effectiveness of the NCHHO method incorporated into the proposed model.

6.4. Sensitivity Analysis

6.4.1. Basis Unit Carbon Price

To verify the impact of carbon trading parameters on dispatch outcomes, a sensitivity analysis was performed by gradually increasing the basis unit price of carbon trading. Table 11 lists the revenues of the VPPs and DSO under different price levels. The results indicate that the relationship between the basis unit price of carbon trading and stakeholder profits is non-monotonic; neither excessively low nor high prices ensure optimal returns. Consequently, the optimal comprehensive profit is achieved within the price range of [0.00010, 0.00020]. Therefore, in this work,

λ_{b a s e}^{C O_{2}}

is set to 0.00015.

6.4.2. Free Allocation Coefficients

In order to evaluate the impact of the parameters of the stepwise carbon trading mechanism on the overall objective function, an optimal search for the parameters of free allocation coefficients is performed, using the NCHHO with a population size of 30. The upper and lower limits of the parameters are listed in the ‘Search Range’ column of Table 12. It should be noted that this experiment optimizes only the parameters associated with the carbon trading cost, while the optimal prices are inherited from the previous experiment (Strategy 2). Additionally, the baseline carbon price is set

λ_{b a s e}^{C O_{2}}

= 0.00015.

The optimal parameter values obtained by the NCHHO algorithm are presented in the ‘Optimal Value’ column in Table 12. Table 13 presents the revenues of the VPPs and DSO under the optimized carbon emission conversion parameters. It can be observed that, compared to the non-optimized scenario, the optimized parameters lead to a slight increase in the operational costs of each VPP, while the DSO’s revenue shows an increase. Furthermore, Table 14 lists the corresponding carbon emission conversion values for each part within the VPPs in detail. According to the conversion coefficients, the calculated carbon emissions for each VPP are shown in the “CO₂ emission” column. Evidently, after optimization, the adjusted conversion coefficients contribute to a reduction in carbon trading costs. This indicates that the optimization of carbon conversion coefficients effectively modulates VPP operational strategies. Although operational costs increase slightly, the reduction in carbon emissions and associated trading costs validates the effectiveness of the proposed strategy in balancing economic efficiency with environmental benefits.

6.5. Further Discussions

6.5.1. VPP with Non-Responsive Behavior

In real-world operations, a VPP might engage in non-responsive behavior to influence the DSO’s pricing strategy. To investigate this scenario, a comparative case where VPP1 operates independently of DSO price adjustments is designed, named Strategy 3, setting its prices based on the feed-in tariff, while the other two VPPs follow the DSO’s pricing scheme. In this case, the initial populations are inherited from the final populations obtained in the last iteration of Strategy 2. Table 15 illustrates the resulting economic benefits. The results show that the DSO achieves higher profits, and the operational costs of other VPP1 and VPP3 increase. This outcome indicates that VPP with non-responsive behavior can disrupt the balance of the Stackelberg game and lead to suboptimal outcomes for the system, thereby validating the effectiveness of the non-cooperative pricing mechanism in the proposed model.

6.5.2. VPP with No GECs

To verify the validity of the green certificate revenue, denoted as

C_{j}^{G r e e n}

, which is subtracted from total operational costs, Strategy 4 was designed by removing the green certificate revenue. The respective objective function is as follows:

\min C_{j}^{V P P} = C_{j}^{N E T} + C_{j}^{M T} + C_{j}^{C O 2} + C_{j}^{O P} + C_{j}^{F l e x}

(26)

Table 16 presents the benefits of VPPs and DSO. It can be observed that the DSO’s profit is significantly lower than that in Strategy 2 (which includes the component of Green Certificates), while the costs for each VPP have correspondingly increased, as detailed in Table 17. This demonstrates that the proposed method (incorporating Green Certificates) effectively improves the economic efficiency of the system.

7. Conclusions

Addressing the critical issues of low-carbon economic optimization and collaborative management for multiple Virtual Power Plants (VPPs), this work proposes an optimal dispatch method based on a Stackelberg game. A bi-level game architecture is constructed with the DSO as the leader and multiple VPPs as followers, encompassing the complete research process of model formulation, constraint design, and algorithmic solution. The results indicate that the constructed Stackelberg game model effectively characterizes the interest interaction between the DSO and multiple VPPs. The leader (DSO) maximizes its own utility through a dynamic pricing strategy while providing scientific guidance for the collaborative operation of multiple VPPs. Subject to constraints such as energy balance, energy storage operation, power conversion, and flexible load response, the follower VPPs formulate energy management strategies to achieve individual benefit optimization, comprehensively considering multi-dimensional costs including energy transaction, fuel, carbon trading, operation and maintenance, and compensation, as well as new energy generation revenues. By efficiently solving the game model using the NCHHO algorithm and CPLEX solver, an equilibrium solution satisfying the interests of both parties is successfully obtained. Ultimately, while ensuring the safe and stable operation of the distribution system, the proposed method balances the profit between the DSO and VPPs and incentivizes renewable energy consumption and indirect carbon reduction. It achieves the dual goals of a low-carbon economy and collaborative management, providing a theoretical basis and technical reference for the large-scale low-carbon collaborative operation of multi-VPP clusters. Future work could further extend to the optimization of game models in scenarios involving high-penetration renewable energy and multi-energy coupling, integrate battery life-cycle degradation into the VPP utility function, and consider the impact of uncertainty factors on dispatch strategies to enhance the robustness and engineering applicability of the method. Moreover, we intend to further investigate low-carbon dispatch issues to achieve more substantial emission reductions.

Additionally, the limitations of the non-cooperative assumption should be acknowledged. In practice, VPPs might form coalitions to exert market power. Such cooperative behaviors could lead to higher clearing prices and altered dispatch strategies, potentially reducing the DSO’s revenue and system efficiency. Investigating the impact of such coalitions requires a cooperative game-theoretic framework, which is also a direction for future research.

Author Contributions

Conceptualization, B.Y.; methodology, D.Z.; software, D.Z.; validation, B.Y.; formal analysis, D.Z.; investigation, B.Y.; resources, B.Y.; data curation, B.Y.; writing—original draft preparation, D.Z.; writing—review and editing, D.Z.; visualization, B.Y. and D.Z.; supervision, B.Y.; project administration, B.Y.; funding acquisition, B.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Bing Yang was employed by Dongfang Electric (Chengdu) Innovation Research, Co., Ltd. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Shu, Z.; Zhu, K.; Wang, C.; Shao, H.; Jia, K. Virtual power plants participating in day-ahead electricity market bidding strategy considering carbon trading. Electr. Power Eng. Technol. 2024, 43, 58–69. [Google Scholar]
Wu, Y.; Wu, J.; De, G. Research on trading optimization model of virtual power plant in medium- and long-term market. Energies 2022, 15, 759. [Google Scholar] [CrossRef]
Wang, J.; Fan, R.; Xiao, Y.; Han, X.; Li, Y. Low carbon optimal dispatch of coordinated source-grid-load-storage in new power system based on low-carbon demand response. J. Taiyuan Univ. Technol. 2024, 55, 46–56. [Google Scholar]
Hu, S.; Chen, Y.; Feng, J. A flexible interactive coordination control method of commercial virtual power plant based on WCVAR. Int. J. Electr. Power Energy Syst. 2024, 160, 110128. [Google Scholar] [CrossRef]
Li, P.; Liu, H.; Li, Y. Multi-park integrated energy microgrids hybrid game optimization strategy considering two-stage carbon trading. Trans. China Electrotech. Soc. 2025, 40, 4788–4803. [Google Scholar]
Li, Q.; Dong, F.; Zhou, G. Co-optimization of virtual power plants and distribution grids: Emphasizing flexible resource aggregation and battery capacity degradation. Appl. Energy 2025, 377, 124519. [Google Scholar] [CrossRef]
Zare, A.; Shafie-khah, M.; Siano, P.; Lazaroiu, G.C. A systematic review of virtual power plant configurations and their interaction with electricity, carbon, and flexibility markets. Renew. Sustain. Energy Rev. 2026, 226, 116448. [Google Scholar] [CrossRef]
Marzbani, F.; Osman, A.; Hassan, M. Advances in virtual power plant operations: A review of optimization models. IEEE Access 2025, 13, 131525–131548. [Google Scholar] [CrossRef]
Yu, S.; Fang, F.; Liu, Y.; Liu, J. Uncertainties of virtual power plant: Problems and countermeasures. Appl. Energy 2019, 239, 454–470. [Google Scholar] [CrossRef]
Wang, P.; Ge, Y.; Yu, N.; Lin, Q.; Chen, R.; Wang, J. Low-carbon optimal dispatch of virtual power plant based on time-of-use ladder carbon emission rights exchange mechanism. Syst. Sci. Control Eng. 2023, 11, 2180688. [Google Scholar]
Tian, X.; Zhang, X.; Qi, X.; Liu, F.; Peng, F.; Ju, L. A two-stage scheduling optimization model and benefit allocation strategy for virtual power plants considering demand-side tiered carbon trading. Distrib. Util. 2025, 42, 88–98. [Google Scholar]
Lu, M.; Lou, S.; Liu, J.; Wu, Y.; Wang, Z. Coordinated optimization of multi-type reserve in virtual power plant accommodated high shares of wind power. Proc. CSEE 2018, 38, 2874–2883. [Google Scholar]
Zhao, L.; Chang, W.; Yang, M.; Yang, Q.; Qin, G. Two-stage energy economic optimal dispatch of virtual power plant in deregulated electricity market. Electr. Power 2022, 55, 13–22. [Google Scholar]
Wang, C.; Tang, Z.; Wei, M.; Liu, X.; Cui, H. Coordinated and optimized scheduling of virtual power plants with wind power-waste incineration cogeneration considering uncertainty of source and load sides. J. Electr. Power Sci. Technol. 2024, 39, 232–241. [Google Scholar]
Wang, J.; He, S. Distributionally robust low-carbon scheduling model for virtual power plants considering emerging distributed resources and electricity carbon trading. Electr. Power Constr. 2025, 46, 13–26. [Google Scholar]
Li, X.; Zhao, D. Distributed coordinated optimal scheduling of multiple virtual power plants based on decentralized control structure. Trans. China Electrotech. Soc. 2023, 38, 1852–1863. [Google Scholar]
Li, H.; Li, Y.; Li, W. Distributed cooperative control for virtual power plants considering interaction of source, network and load. Electr. Drive 2019, 49, 72–77. [Google Scholar]
Chen, H.; Wang, Z.; Zhang, R.; Jiang, T.; Li, X.; Li, G. Decentralized optimal dispatching modeling for wind power integrated power system with virtual power plant. Proc. CSEE 2019, 39, 2615–2625. [Google Scholar]
Yang, J. Transaction decision optimization of new electricity market based on virtual power plant participation and Stackelberg game. PLoS ONE 2023, 18, e0284030. [Google Scholar] [CrossRef]
Zhang, H.; Zhang, S.; Cheng, H.; Zhang, X.; Gu, Q. A state-of-the art review on stackelberg game and its applications in power market. Trans. China Electrotech. Soc. 2022, 37, 3250–3262. [Google Scholar]
Zhou, L.; Song, X.; Yu, T. Energy trading optimization method for multi-microgrid systems based on stackelberg game. Acta Energiae Solaris Sin. 2025, 46, 642–649. [Google Scholar]
Wang, G.; Lin, Z.; Chen, Y. Carbon-billed future for virtual power plants: A comprehensive review. Renew. Sustain. Energy Rev. 2025, 217, 115719. [Google Scholar] [CrossRef]
Wang, X.; Chen, C.; Shi, Y.; Chen, Q. Multi-objective two-stage optimization scheduling algorithm for virtual power plants considering low carbon. Int. J. Low-Carbon Technol. 2024, 19, 773–779. [Google Scholar] [CrossRef]
Liu, W.; Li, Z.; Xing, X.; Chen, X.; Wang, Y.; Wang, X. Non-cooperative game optimization for virtual power plants considering carbon trading market. Energy 2025, 317, 134571. [Google Scholar] [CrossRef]
Liu, X.; Ni, Y.; Sun, Y.; Wang, J.; Wang, R.; Sun, Q. Multi-VPPs power-carbon joint trading optimization considering low-carbon operation mode. J. Energy Storage 2023, 83, 110786. [Google Scholar] [CrossRef]
Guo, K.; Zhao, J.; Li, H.; Xu, C.; Zhang, Y.; Lin, X. Low carbon economic dispatch of virtual power plants considering carbon trading and demand response. Distrib. Energy 2025, 10, 69–81. [Google Scholar]
Xu, D.; Li, M. A Stackelberg game model for the energy-carbon co-optimization of multiple virtual power plants. Inventions 2025, 10, 16. [Google Scholar] [CrossRef]
Hao, R.; Ai, Q.; Jiang, Z. Bi-level game strategy for multi-agent with incomplete information in regional integrated energy system. Autom. Electr. Power Syst. 2018, 42, 194–201. [Google Scholar]
Zhou, J.; Liang, C.; Shi, L.; Li, Y.; Liu, J.; Wu, F. Optimal scheduling of integrated energy system considering the ladder-type carbon trading mechanism. Electr. Power 2025, 58, 77–87. [Google Scholar]
Chen, J.; Hu, Z.; Chen, J.; Chen, Y.; Gao, M.; Lin, M. Optimal dispatch of integrated energy system considering loader-type carbon trading and flexible double response of supply and demand. High Volt. Eng. 2021, 47, 3094–3106. [Google Scholar]
Dehkordi, A.A.; Sadiq, A.S.; Mirjalili, S.; Ghafoor, K.Z. Nonlinear-based chaotic harris hawks optimizer: Algorithm and internet of vehicles application. Appl. Soft Comput. 2021, 109, 107574. [Google Scholar] [CrossRef]
Heidari, A.A.; Mirjalili, S.; Faris, H.; Aljarah, I.; Mafarja, M.; Chen, H. Harris hawks optimization: Algorithm and applications. Future Gener. Comput. Syst. 2019, 97, 849–872. [Google Scholar] [CrossRef]

Figure 1. Transaction relationship diagram.

Figure 2. Stackelberg game block diagram.

Figure 3. Forecasted wind power, solar power and load demand for VPPs on a representative day.

Figure 4. Electricity purchasing and selling prices.

Figure 5. Electricity purchasing and selling results.

Figure 6. The internal power generation, energy storage dynamics, and electricity trading within VPP1.

Figure 7. The internal power generation, energy storage dynamics, and electricity trading within VPP2.

Figure 8. The internal power generation, energy storage dynamics, and electricity trading within VPP3.

Figure 9. Best fitness curves of HHO and NCHHO algorithms.

Table 1. Unit power and equipment parameters.

Type	Power/MW		Coefficient CNY/(kW·h)	CO₂ Emission Cost Coefficient/kg/(kW·h)	Conversion Coefficient of Carbon Emission Allowances/kg/(kW·h)
Type	Minimum	Maximum	Coefficient CNY/(kW·h)	CO₂ Emission Cost Coefficient/kg/(kW·h)
WT	0	$P_{t, \max}^{W T}$ : Forecasted output	$λ^{W T}$ : 0.45	$η_{W T}$ : 76.6	$γ_{W T}$ : 43
PV	0	$P_{t, \max}^{P V}$ : Forecasted output	$λ^{P V}$ : 0.48	$η_{P V}$ : 132.5	$γ_{P V}$ : 78
VPP	--	$P_{j, \max}^{V P P}$ : 10	--	$η_{N E T}$ : 1303	$γ_{N E T}$ : 798
MT	--	--	--	$η_{M T}$ : 129.37	$γ_{M T}$ : 97.14
ESS	--	--	$λ^{E S S}$ : 0.5	$η_{E S S}$ : 91.30	--

Table 2. The parameter of utility function.

Parameter	a_e	b_e	c_e	$P_{j, \max}^{M T}$ /MW	$P_{j, d n}^{M T}$ /MW	$P_{j, u p}^{M T}$ /MW
VPP1	0.08	0.9	1.2	6	−3.5	3.5
VPP2	0.1	0.6	1	5	−3	3
VPP3	0.15	0.5	0.8	4	−2	2

Table 3. The parameters of the energy storage systems.

Parameter	$P_{j, \min}^{E S}$	$P_{j, \max}^{E S}$	$S_{j, \min}^{E S}$	$S_{j, \max}^{E S}$	$S_{0}^{E S}$	$E_{j, \max}$	$λ_{E S S}^{N E T}$
VPP1	−0.6 MW	0.6 MW	0.2	0.9	0.4	2	0.05 CNY/kWh
VPP2	−0.6 MW	0.6 MW	0.2	0.9	0.4	2	0.05 CNY/kWh
VPP3	−1.2 MW	1.2 MW	0.2	0.9	0.4	3	0.05 CNY/kWh

Table 4. Parameter of carbon trading mechanism.

Parameter	L/g	$λ_{b a s e}^{C O_{2}}$ /(CNY/g)	K	$λ^{g r e e n}$ /(CNY/g)
Value	4,000,000	0.00015	5	0.21

Table 5. Parameters of curtailable loads.

Type	Maximum Curtailable Load	$λ_{c o s t}^{F l e x}$ /CNY/(kW·h)
curtailable electric load	10% of the forecasted load	1.4

Table 6. The operational cost of VPPs.

	$C_{j}^{N E T}$ /CNY	$C_{j}^{M T}$ /CNY	$C_{j}^{F l e x}$ /CNY	$C_{j}^{O P}$ /CNY	$C_{j}^{C O 2}$ /CNY	$C_{j}^{G r e e n}$ /CNY	$C_{j}^{V P P}$ /CNY
VPP1	15,743.7	50,820.7	0	30,687.0	190.0	13,399.5	84,041.9
VPP2	1215.1	32,034.4	882.0	41,138.9	169.0	18,812.2	56,627.2
VPP3	9248.9	33,097.2	2902.6	49,978.2	242.5	22,102.7	73,366.7

Table 7. Day-ahead electricity prices for the distribution network.

Time-of-Use	Time	Selling Price	Purchasing Price
Peak	11:00–13:00 19:00–22:00	0.50	1.20
Flat	7:00–11:00 14:00–18:00 23:00–24:00	0.35	0.75
Valley	0:00–7:00	0.30	0.40

Table 8. Benefits of VPP and DSO under two strategies.

	Strategy 1	Strategy 2
VPP1	87,683.6	84,041.9
VPP2	60,020.2	56,627.2
VPP3	76,580.2	73,366.7
DSO	−49,314.8	−5770.0

Table 9. Results of carbon emission.

	Strategy 1					Strategy 2
	Carbon Emissions/kg					Carbon Emissions/kg
	VPP	MT	WT	PV	ES	VPP	MT	WT	PV	ES
VPP1	53.912	18.886	49.956	11.208	3.057	53.555	20.696	51.746	12.061	3.224
VPP2	21.711	12.217	93.157	2.420	1.176	46.450	9.394	82.010	7.571	1.200
VPP3	66.721	17.572	93.105	15.593	6.086	67.502	15.420	92.626	12.625	4.473

Table 10. Statistics of results HHO and NCHHO algorithms.

	Mean Fitness	Standard Deviation	Best Fitness
HHO	−2981.1	2281.6	32.9
NCHHO	−7656.3	4346.4	−2587.7

Table 11. Results of the basis unit carbon price on dispatch outcomes.

$λ_{b a s e}^{C O_{2}}$ (CNY/g)	VPP1	VPP2	VPP3	DSO
0.00005	82,711.6	54,712.2	73,082.9	−6706.5
0.00010	85,110.4	55,861.6	73,204.4	−5506.4
0.00015	84,041.9	56,627.2	73,366.7	−5770.0
0.00020	82,932.0	54,472.4	73,407.5	−6294.7
0.00025	81,402.9	53,467.5	71,574.3	−11,076.9
0.00030	84,274.8	52,136.4	74,016.8	−7171.1

Table 12. Parameter setting for the carbon trading mechanism.

Parameters/kg/(kW·h)		Combination of Coefficient Values/kg/(kW·h)	Search Range	Optimal Value
$η_{W T}$ : 76.6	$γ_{W T}$ : 43	33.6	[10, 70]	14.5664
$η_{P V}$ : 132.5	$γ_{P V}$ : 78	54.5	[10, 100]	71.3418
$η_{N E T}$ : 1303	$γ_{N E T}$ : 798	505	[100, 1000]	286.2096
$η_{M T}$ : 129.37	$γ_{M T}$ : 97.14	32.23	[10, 70]	53.3096
$η_{E S S}$ : 91.30	--	91.30	[30, 200]	30.0

Table 13. Benefits of VPP and DSO.

	Strategy 2	Optimal Results
VPP1	84,041.9	85,243.2
VPP2	56,627.2	56,642.5
VPP3	73,366.7	73,614.8
DSO	−5770.0	−3589.1

Table 14. The carbon emission conversion values of VPPs.

		$P_{j, t}^{V P P}$ /(kW·h)	$P_{j, t}^{M T}$ /(kW·h)	$P_{j, t}^{W T}$ /(kW·h)	$P_{j, t}^{P V}$ /(kW·h)	$P_{j, t}^{E S}$ /(kW·h)	CO₂ Emission /kg	$C_{j}^{C O 2}$ /CNY
Strategy 2	VPP1	53.5552	20.6954	51.7460	12.0611	3.2240	29,742.2	190.0
	VPP2	46.4493	9.3939	82.0104	7.5713	1.2000	43,841.5	169.0
	VPP3	67.5015	15.4204	92.6254	12.6251	4.4733	50,699.6	242.5
Optimized	VPP1	48.9512	21.7669	52.2521	11.8412	4.7053	17,993.4	105.7
	VPP2	54.9807	16.2895	79.0205	10.0736	2.8000	25,200.4	116.0
	VPP3	74.1794	14.6718	91.4336	16.5753	4.2454	29,307.4	154.1

Table 15. The benefits of VPPs and DSO.

	VPP1	VPP2	VPP3	DSO
Strategy 3	87,683.6	54,866.6	74,159.3	−3090.4

Table 16. The benefits of VPPs and DSO.

	VPP1	VPP2	VPP3	DSO
Strategy 4	96,727.4	76,043.0	91,390.4	−48,764.0

Table 17. The operational cost of VPPs.

	$C_{j}^{N E T}$ /CNY	$C_{j}^{M T}$ /CNY	$C_{j}^{F l e x}$ /CNY	$C_{j}^{O P}$ /CNY	$C_{j}^{C O 2}$ /CNY	$C_{j}^{V P P}$ /CNY
VPP1	18,556.8	49,642.7	0	28,337.9	190.0	96,727.4
VPP2	1978.9	32,206.7	882.0	40,795.0	180.4	76,043.0
VPP3	10,913.2	34,256.8	2749.4	43,228.5	242.5	91,390.4

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, B.; Zhou, D. Low-Carbon Economic Optimization and Collaborative Management of Virtual Power Plants Based on a Stackelberg Game. Energies 2026, 19, 1821. https://doi.org/10.3390/en19081821

AMA Style

Yang B, Zhou D. Low-Carbon Economic Optimization and Collaborative Management of Virtual Power Plants Based on a Stackelberg Game. Energies. 2026; 19(8):1821. https://doi.org/10.3390/en19081821

Chicago/Turabian Style

Yang, Bing, and Dongguo Zhou. 2026. "Low-Carbon Economic Optimization and Collaborative Management of Virtual Power Plants Based on a Stackelberg Game" Energies 19, no. 8: 1821. https://doi.org/10.3390/en19081821

APA Style

Yang, B., & Zhou, D. (2026). Low-Carbon Economic Optimization and Collaborative Management of Virtual Power Plants Based on a Stackelberg Game. Energies, 19(8), 1821. https://doi.org/10.3390/en19081821

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Low-Carbon Economic Optimization and Collaborative Management of Virtual Power Plants Based on a Stackelberg Game

Abstract

1. Introduction

2. Stackelberg Game Model for Energy Management of Multiple VPPs

3. Stackelberg Game Model for Dynamic Pricing of the Leader (DSO)

3.1. Strategy

3.2. Utility Function

3.3. Strategy Space

4. Stackelberg Game Model for Energy Management of the Follower VPPs

4.1. Strategy

4.2. Utility Function

4.2.1. Electricity Transaction Cost

4.2.2. Fuel Cost

4.2.3. Carbon Trading Cost

4.2.4. Operation and Maintenance (O&M) Cost

4.2.5. Compensation Cost

4.2.6. Revenue from Renewable Energy Generation

4.3. Strategy Space

4.3.1. Energy Balance Constraint

4.3.2. Energy Storage Constraints

4.3.3. Power Upper/Lower Limit and Ramp Rate Constraints

5. Stackelberg Game-Based Optimal Scheduling Model for Multiple Virtual Power Plants

5.1. Stackelberg Game Model

5.2. Solution for Game Model

6. Case Study and Results

6.1. Case Configuration

6.2. Results for the Bi-Level Game Optimization Strategy

6.3. Comparison

6.3.1. Comparison with Fixed Electricity Prices

6.3.2. Comparison with HHO Optimization Method

6.4. Sensitivity Analysis

6.4.1. Basis Unit Carbon Price

6.4.2. Free Allocation Coefficients

6.5. Further Discussions

6.5.1. VPP with Non-Responsive Behavior

6.5.2. VPP with No GECs

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI