Next Article in Journal
Impact of Triplen Harmonics Generated by Modern Non-Linear Loads on Neutral Conductor Overheating in Low-Voltage Smart Buildings
Previous Article in Journal
Research on Parameter Design and Control Method of Lightweight Converter Valve for Offshore Wind Power Transmission Based on Hybrid Topology
Previous Article in Special Issue
A Stackelberg Game Approach for Collaborative Operation and Interest Balancing in Community-Based Integrated Energy Microgrids
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Gradient Revision Method for Demand Response Stimulus Parameters of the Integrated Energy System

Engineering Research Center for Energy Generation & Grid of Ministry of Education, Xinjiang University, Urumqi 830017, China
*
Author to whom correspondence should be addressed.
Energies 2026, 19(7), 1742; https://doi.org/10.3390/en19071742
Submission received: 13 February 2026 / Revised: 19 March 2026 / Accepted: 23 March 2026 / Published: 2 April 2026
(This article belongs to the Special Issue Research on Operation Optimization of Integrated Energy Systems)

Abstract

Integrated Demand Response (IDR) enhances the operational flexibility of Integrated Energy Systems (IES) and promotes renewable energy integration. However, limited interaction between the Integrated Energy Operator (IEO) and users during actual energy transactions can lead to biases in IDR planning, compromising user response effectiveness. To address this, this paper proposes a method for revising IDR stimulus parameters in IES based on gradient descent within a Stackelberg game framework. First, an IDR model based on consumer psychology principles is constructed to establish an IES Stackelberg game, in which the IEO acts as the leader and the load aggregator acts as the follower. Subsequently, during the game, the IEO utilizes users’ energy consumption strategies to adjust the stimulus threshold parameters of the dead zone and saturation zone along the negative gradient direction, thereby updating its decision for the next round. Furthermore, a response adjustment mechanism is designed to refine the IDR plan, enhancing its feasibility. Finally, comparative analyses across diverse scenarios demonstrate that the proposed method reduces deviations in planned IDR, thereby enhancing the low-carbon performance and renewable energy integration capacity of IES.

1. Introduction

Driven by the imperative to address climate change and advance the global energy transition, energy systems are evolving toward clean, low-carbon, secure, and efficient operation. The development of multi-energy coupling technologies and the deep exploration of flexible demand-side resources are crucial for supply–demand regulation [1,2]. Integrated energy systems (IES) overcome the limitations of traditional single-energy systems by leveraging the complementary and mutual support among multiple energy sources, thereby enhancing overall energy utilization efficiency and system flexibility [3,4]. Integrated demand response (IDR), as a key demand-side management tool, promotes IES supply–demand balance by steering users to modify their energy consumption behaviors and unlocking adjustable load resources [5]. However, implementing IDR often impacts users’ energy comfort and experience. To mitigate this conflict, integrated energy operators (IEOs) develop targeted pricing and incentive strategies based on users’ distinct energy consumption characteristics and preferences, thereby enhancing the dispatchability of demand-side resources and load response effectiveness. Therefore, considering the bidirectional interaction between IEO and users, exploring how IES can accurately capture and utilize user energy characteristics to enhance IDR implementation effectiveness is crucial for improving IES economics and market-oriented development [6].
Currently, many scholars simulate the supply–demand interaction in energy markets by constructing Stackelberg game models. In reference [7], a bidirectional Stackelberg game model incorporating shared energy storage systems was proposed, employing the bisection method for solution. In reference [8], a Stackelberg game model was proposed to optimize the efficiency of integrated energy systems. This model utilizes the exergy properties of electrical and thermal energy and employs genetic algorithms to achieve multi-objective optimization. However, Stackelberg game models solved using iterative algorithms implicitly require users to possess high-frequency autonomous decision-making capabilities [9], which is not conducive to user participation in demand response and the market-oriented operation of demand response. Furthermore, the computational complexity and number of iterations required by traditional iterative algorithms to solve bilevel models are typically significantly higher than those for single-layer problems. Considering the actual supply–demand interactions and to reduce computational complexity for the solution, some scholars have transformed the two-layer model of principal-agent games into a single-layer model using Karush-Kuhn-Tucker (KKT) conditions [10,11]. However, solving the model using centralized algorithms requires complete information from the user side, which is similarly challenging in practice. Moreover, computational burden increases as the number of market participants grows. Consequently, some researchers have begun exploring how to construct more refined and complex user-side models. These models simulate real user feedback to assist operators in independently making precise decisions. In reference [12], a Stackelberg game model was developed based on demand response to simultaneously coordinate logistics operations, port energy scheduling, and vessel power requirements. In reference [13], a unified user satisfaction metric was proposed for cooling, heating, and electricity loads, based on a comprehensive consideration of their demand response. In reference [14], an EV charging/discharging incentive model that accounts for battery degradation was established, with the electric vehicle load aggregator positioned as a follower, to achieve refined energy management. While these studies examine the construction and solution of Stackelberg game models from various perspectives, they encounter challenges in simultaneously balancing the reduction in solution iterations and minimizing the requirement for user information or detailed user-side model construction.
Currently, IDR models that integrate users’ energy consumption characteristics can support decision-making for operators and electricity retailers, which has led scholars to conduct in-depth research on such models and classify them into price-based demand response (PBDR), incentive-based demand response (IBDR), and substitution-based demand response (SBDR) [15]. If the IEO can capture users’ energy consumption characteristics, it can design pricing strategies to steer IDR effectively. However, parameters reflecting users’ inherent response characteristics are difficult to acquire directly and thus require estimation using historical IDR data. In reference [16], support vector machines were applied to assess the potential of demand response. In reference [17], an LSTM-based method was proposed to recognize and predict user response behavior. In reference [18], a reinforcement learning model was presented to derive optimal incentive strategies based on user characteristics and historical data from the preceding week. Although these methods accurately estimate IDR parameters, IDR data is limited due to the ongoing development of services. This data scarcity makes it difficult for sample-dependent approaches to effectively estimate key parameters [19].
Consequently, researchers have turned to traditional statistical methods to estimate parameters related to user energy consumption characteristics in IDR models. In reference [20], a linear regression algorithm was utilized to estimate the parameters of the user response model, based on historical data within the IBDR framework. In reference [21], the least squares method was used to estimate user risk-sensitivity parameters for day-ahead demand response. In reference [22], a moving-window linear regression method utilizing maximum likelihood estimation was introduced for determining the unknown parameters of the IDR model. Existing traditional statistical methods have achieved success in improving the IDR model parameter estimation accuracy. However, these methods rely solely on historical IDR data. Consequently, they fail to reflect dynamic changes in user energy consumption characteristics during actual game interactions. This limitation often leads to systematic biases in IDR model parameters and affects the applicability of the model.
To address the problems mentioned as in Table 1, this study proposes a gradient descent-based integrated response stimulus threshold parameters revision method (GD-DRPR) for adjusting stimulus threshold parameters of the dead zone and saturation zone of the IDR model based on the principle of consumer psychology during the IES Stackelberg game. Compared to the KKT-based method, GD-DRPR does not require the IEO to have complete information about the load aggregator (LA). Unlike traditional distributed algorithms for solving Stackelberg game models, which often require numerous rounds of information exchange between the IEO and LA, GD-DRPR incorporates the demand response model as an approximate reaction function of the LA. This enables the IEO to make decisions by anticipating the LA’s responses, thereby improving the efficiency of transactions. Note that in previous studies, key parameters of the IDR model are usually estimated using historical data prior to the optimization or game process and remain fixed during optimization. GD-DRPR is not intended to improve the method of parameter estimation but focuses on revising original parameters during the game process based on user feedback.
The main contributions of this paper are summarized as follows:
  • This paper establishes an IES optimization model based on the Stackelberg game. As the leader, the IEO sets the energy selling price and demand response incentive compensation price according to the estimated response of the LA from the IDR model to maximize profits. As a follower, LA optimizes energy consumption strategies by integrating demand response resources to minimize costs.
  • This paper constructs a demand response model based on the principle of consumer psychology. It proposes the GD-DRPR by establishing an objective function based on the difference between expected demand response rates of the IEO and the LA. This approach improves the IEO’s decision-making within the IDR model and enables precise mobilization of LA resources. Furthermore, it reduces the number of interactions between the IEO and the LA, increases transaction efficiency, and lowers the barrier for user participation in IDR.
  • A response adjustment mechanism is designed. After the Stackelberg game concludes, the IEO verifies and adjusts the IDR plan with the LA to ensure its practical feasibility.
The remainder of this paper is organized as follows: Section 2 introduces the Stackelberg game framework. Section 3 presents the demand response model and GD-DRPR. Section 4 describes the IEO and LA models, along with the solution process for the Stackelberg game under GD-DRPR. In Section 5, a case study is conducted, and the results are discussed. Finally, Section 6 provides the conclusion.

2. Structure of IES and Stackelberg Game Framework

2.1. Structure of IES

The IES structure is shown in Figure 1. This system uses an IEO and an LA to coordinate transactions of electricity, gas, heat, and cooling between users and the upper-level energy network. The IEO is equipped with photovoltaic (PV), wind turbine (WT), combined heat and power (CHP), gas boiler (GB), electric boiler (EB), absorption chiller (AC), electric chiller (EC), electrical storage system (ESS), and heat storage system (HSS). The LA aggregates electrical, heat, and cooling loads. The IEO is located between the higher-level energy network and the LA, which manages the energy equipment within the IES and participates in market transactions. The IEO sets both the energy purchase price for LA and the incentive compensation price for demand response. Once LA accepts these prices, it aggregates flexible demand-side resources to provide IDR services. If the IEO cannot meet the LA’s energy demand with its own resources, it may purchase energy from the upper-level power grid or gas network. In this paper, the IEO purchases electricity from the upper-level power grid at time-of-use grid tariffs and natural gas from the upper-level gas network at a fixed price, while LA only purchases energy from the IEO.

2.2. Stackelberg Game Framework for IES Under GD-DRPR

The Stackelberg game model framework constructed in this paper is shown in Figure 2. The upper-layer model is the operational planning and pricing strategy decision-making model for the leader IEO, which determines the energy purchasing plan, equipment output plan, energy selling price, and incentive compensation price. The lower-layer model is the energy consumption decision-making model for the follower LA, which determines the energy purchasing plan and the expected IDR.
Unlike traditional IES Stackelberg game models, the interaction mechanism is enhanced by the GD-DRPR. In this framework, the lower-layer LA decision model optimizes its energy consumption strategy based on current prices and transmits this expected response plan to the IEO. Upon receiving the LA’s feedback, the IEO does not directly input this load data into the upper-layer pricing model. Instead, the IEO utilizes the LA’s energy consumption strategy as a reference to execute the GD-DRPR process, thereby revising the stimulus threshold parameters of the dead zone and saturation zone. Subsequently, these revised parameters are incorporated into the upper-layer IEO decision model to generate more precise pricing and incentive strategies for the next iteration. After the iterative game process converges, the IEO receives the confirmed response level from the LA and adjusts the energy procurement plan and equipment output plan based on the LA’s expected response.

3. IDR Model and Its Threshold Parameters Revision Method

3.1. A Demand Response Model Based on the Principle of Consumer Psychology

According to the principle of consumer psychology, consumer responses to economic stimulus exhibit significant differences when reaching distinct psychological thresholds. A demand response model was constructed based on this principle, shown in Figure 3. According to users’ demand response characteristics at different stimulus levels, this model categorizes the response process into three stages, including the dead zone, linear zone, and saturation zone [23]. IDR can be categorized into PBDR, SBDR, and IBDR, and aligns with the fundamental assumptions of the principle of consumer psychology. Therefore, the model is extended for application within the IDR framework [23,24]. The relationship between demand response stimulus levels r t x , y and demand response rates λ t x , y can be expressed as follows:
0 < r t , min x , y < r t , max x , y
λ t x , y = 0                              , 0 r t , min x , y r t , max x , y λ max x , y ( r t x , y r t , min x , y ) r t , max x , y r t , min x , y , r t , min x , y r t x , y r t , max x , y λ max x , y                          , r t , max x , y r t x , y
where r t , min x , y and r t , max x , y are the dead zone and saturation zone thresholds of demand response stimulus level at time t, respectively; λ max x , y is the maximum demand response rate; the superscript x denotes the type of responsive load, where x { TL , SL , CL } with TL, SL, and CL represent transferable load, substitutable load, and curtailable load, respectively; and the superscript y denotes the energy type of load, where y { e , h , c } with e, h, and c represent electrical, heat, and cooling loads, respectively.

3.2. IDR Model

3.2.1. PBDR Model

This paper temporarily disregards the transfer behavior of heat and cooling loads, focusing only on the transfer of electrical loads to construct the PBDR model. The IEO guides LA participation in peak shaving and valley filling by setting electricity prices, with the difference between the current price and the historical average price serving as the stimulus level for PBDR. Combining Equations (1) and (2), the PBDR model is expressed as follows:
P t TL , e = P t L , e λ t TL , e t T P t TL , e = 0
λ t TL , e = λ max TL , e ( U t TL , e , out U t TL , e , in ) ( r t TL , e r t , min TL , e ) r t , max TL , e r t , min TL , e r t TL , e = ( U t TL , e , out U t TL , e , in ) ( c t e c e ¯ ) r t , max TL , e r t TL , e r t , min TL , e U t TL , out U t TL , in = 0
where P t L , e is the baseline electric load at time t; P t TL , e is the shiftable electric load at time t; T is the total scheduling period; λ t TL , e is the rate of the PBDR for electric energy at time t; λ max TL , e is the maximum rate of the PBDR for electric energy; r t , max TL , e and r t , min TL , e are the dead zone and saturation zone thresholds for the stimulus level of the PBDR for electric energy at time t, respectively; c t e is the electricity selling price of the IEO at time t; c e ¯ is the historical average electricity selling price; U t TL , e , out is a binary variable indicating the state of shifting electric load from time t to other periods, taking a value of 1 if the load is shifted and 0 otherwise; and U t TL , e , in is a binary variable indicating the state of shifting electric load from other periods to time t, taking a value of 1 if the load is shifted and 0 otherwise.

3.2.2. SBDR Model

IEO enables substitutable loads to participate in SBDR by setting different energy prices, which steer the LA to adjust its energy consumption structure. The model considers only the energy substitution between electricity–heat and electricity–cooling loads, establishing SBDR stimulus levels based on the price differentials between electricity and heat, and between electricity and cooling [23]. By combining Equations (1) and (2), the SBDR model is as follows:
P t SL , e = η hte P t SL , h + η cte P t SL , c w max SL , e P t L , e P SL , e t w max SL , e P t L , e
P t SL , h = ( U t , eth SL U t , hte SL ) P t L , h λ t SL , h r t SL , h = ( U t , eth SL U t , hte SL ) ( c t h c t e ) r t , min SL , h r t SL , h r t , max SL , h λ t SL , h = λ max SL , h r t SL , h r t , max SL , h r t , min SL , h U t , hte SL U t , eth SL = 0
P t SL , c = ( U t , etc SL U t , c t e SL ) P t L , c λ t SL , c r t SL , c = ( U t , etc SL U t , c t e SL ) ( c t c c t e ) r t , min SL , c r t SL , c r t , max SL , c λ t SL , c = λ max SL , c r t SL , c r t , max SL , c r t , min SL , c U t , c t e SL U t , etc SL = 0
where w max SL , e is the upper limit proportion of the substitutable electric load; η hte and η cte are the energy substitution coefficients for heat-to-electrical and cooling-to-electrical, respectively; P t SL , e , P t SL , h and P t SL , c are the substitutable electrical, heat and cooling loads at time t, respectively; P t L , h and P t L , c are the baseline heat load and baseline cooling load at time t, respectively; U t , hte SL , U t , eth SL , U t , c t e SL and U t , etc SL are binary variables denote the states of heat-to-electrical, electrical-to-heat, cooling-to-electrical, and electrical-to-cooling load substitution at time t, taking a value of 1 if substitution occurs and 0 otherwise; r t SL , h , r t , min SL , h and r t , max SL , h are the stimulus level of the heat energy SBDR at time t and its dead zone and saturation zone thresholds, respectively; λ t SL , h and λ max SL , h are the rate of SBDR for heat energy at time t and its upper limit, respectively; r t SL , c , r t , min SL , c and r t , max SL , c are the stimulus level of the cooling energy SBDR at time t and its dead zone and saturation zone thresholds, respectively; λ t SL , c and λ max SL , c are the rate of SBDR for cooling energy at time t and its upper limit, respectively; c t h and c t c are the heating and cooling selling prices of the IEO at time t, respectively.

3.2.3. IBDR Model

IEO incentivizes curtailable loads to participate in IBDR by contracting with LA and setting the stimulus level as r t CL , y = c t CL , y , where c t CL , y is the incentive compensation price at time t. Combining Equations (1) and (2), the IBDR model is as follows:
P t CL , y = P t L , y λ t CL , y
λ t CL , y = λ max CL , y r t CL , y r t , max CL , y r t , min CL , y U t CL , y r min , t CL , y r t CL , y U t CL , y r t , max CL , y
where P t CL , y is the curtailable load of y class energy at time t; λ max CL , y denotes the upper limit of the IBDR rate for energy y; r t , max CL , y and r t , min CL , y denote the dead zone and saturation zone threshold values, respectively, for the stimulus level of IBDR at time t; and U t CL , y is a 0–1 variable indicating the load curtailment status at time t, where 1 signifies load curtailment, and 0 signifies no curtailment.

3.3. Revision Method for IDR Stimulus Threshold Parameters

3.3.1. GD-DRPR Process

The K t x , y = λ max x , y / ( r t , max x , y r t , min x , y ) is the slope of the linear region in Figure 3 and indicates user sensitivity to demand response stimulus level, while r t , max x , y and r t , min x , y are key parameters derived from historical data [19]. This historical derivation is often inaccurate because it fails to capture recent dynamics, including behavioral shifts, equipment updates, or schedule changes, and because the IEO has incomplete user information. To improve accuracy, r t , max x , y and r t , min x , y are revised by λ t , n IEO , x , y and λ t , n LA , x , y , where λ t , n IEO , x , y and λ t , n LA , x , y are the expected demand response rates of the IEO and the LA at time t. The objective function J is then formulated based on these parameters, Equation (2), and the least squares method:
min J ( r t , n , min x , y , r t , n , max x , y ) = [ λ max x , y ( r t , n x , y r t , n 1 , min x , y ) λ t , n LA , x , y ( r t , n 1 , max x , y r t , n 1 , min x , y ) ] 2
where n is the iteration step count for the inner iteration within GD-DRPR.
Furthermore, considering that the model depicted in Figure 3 provides only an approximate description of actual response behavior, an excessively rapid decrease in the objective function value during the incentive threshold parameters adjustment process may cause these parameters to deviate from the true response characteristics. Therefore, the GD-DRPR method is proposed based on the gradient descent approach.
This paper compares the expected post-response load from the IEO and the LA. The mean absolute percentage error (MAPE) is utilized as the convergence indicator for the GD-DRPR algorithm. To evaluate the effectiveness of revision for IDR stimulus threshold parameters, both MAPE and the root mean square error (RMSE) are employed as performance metrics. The expressions for these metrics are as follows:
D MAPE ( n ) = 100 % 3 T t = 1 T y { e , g , h } P t , n IEO , y P t , n LA , y P t , n LA , y
R MSE ( n ) = 1 3 T t = 1 T y { e , g , h } ( P t , n IEO , y P t , n LA , y ) 2
where P t , n IEO , y and P t , n LA , y are the expected post-response load of y class energy at time t during iteration n for IEO and LA, respectively.
GD-DRPR is a dual-layer iterative process. The outer layer models the game-theoretic interaction between IEO and LA. In the inner layer, IEO iteratively revises the IDR stimulus threshold parameter. The process proceeds as follows:
1.
Set the initial parameters as n = 0, m = 0, D MAPE ( n ) = 100 % and D ( m ) = 100 % , where m is the count of interactions between the IEO and LA, and D ( m ) is the MAPE calculated following each interaction. The IEO performs its initial decision-making based on the initial IDR stimulus threshold.
2.
Let m = m + 1. The LA executes a decision-making round according to the current price, subsequently providing the expected response rate λ t , n LA , x , y to the IEO.
3.
Let n = n + 1. The gradient of the current objective function is then calculated with the following formula:
J r t , n 1 , min x , y = M ( λ t , n LA , x , y λ max x , y )
J r t , n 1 , max x , y = M λ t , n LA , x , y
M = λ max x , y ( r t , n x , y r t , n 1 , min x , y ) λ LA , x , y ( r t , n 1 , max x , y r t , n 1 , min x , y )
4.
The IDR stimulus threshold parameters are updated along the negative gradient using the following formula:
r t , n , min x , y = r t , n 1 , min x , y + α M ( λ max x , y λ t , n LA , x , y )
r t , n , max x , y = r t , n 1 , max x , y + α M λ t , n LA , x , y
where α is the iterative step size.
5.
The IEO determines its pricing strategies based on the current IDR stimulus threshold parameters.
6.
Calculate D MAPE ( n ) for iteration n. If D MAPE ( n ) D MAPE ( n 1 ) , let r t , n , min x , y = r t , n 1 , min x , y , r t , n , max x , y = r t , n 1 , max x , y , and D ( m ) = D MAPE ( n ) . Otherwise, return to Step 3 to continue the inner loop.
7.
If D ( m ) D ( m 1 ) < 0.01 % , the algorithm concludes. Otherwise, pass the current pricing strategy to LA and return to Step 2 to begin a new outer loop.

3.3.2. Proof of Convergence for GD-DRPR

The GD-DRPR mentioned in this text involves inner and outer iterations. The outer iteration corresponds to traditional Stackelberg games of IES, while the inner iteration involves the adjustment of stimulus threshold parameters, thus requiring proof of the convergence of the inner iteration. This section utilizes the Lyapunov function to prove the convergence of the GD-DRPR inner iteration.
Theorem 1. 
Global convergence to the minimum point of the objective function  J ( r ) .
Proof of Theorem 1. 
Let r = [ r t , min x , y , r t , max x , y ] T , and let w = [ λ t LA , x , y λ max x , y , λ t LA , x , y ] T . Choose the Lyapunov function as V ( r ) = J ( r ) , and when V ( r ) = 0 , it corresponds to the global minimum point, then:
Δ V n = V n + 1 ( r n ) V n ( r n 1 )
Based on updated Formulas (13)–(15):
Δ V n = V n ( r n 1 ) [ ( 1 2 α w n 2 ) 2 1 ]
To ensure Δ V n 0 , it is necessary that:
1 2 α w n 2 ) 2 1 0
Solving this inequality gives:
0 α 1 w n 2
Under this condition, Δ V n 0 . Since w n does not change during each interaction between IEO and LA, when V ( r ) > 0 and 0 α 1 / w n 2 , Δ V n 0 and J ( r ) globally converges to the minimum point of the objective function. ☐

4. IES Stackelberg Game Model

4.1. IEO Pricing Decision Model

4.1.1. Objective Function

The IEO steers the LA’s response by setting energy selling prices and incentive compensation. The objective of optimization is to maximize the IEO’s daily revenue, formulated as follows:
max C IEO = C sell C buy C om C idr C CO 2
C sell = t = 1 T ( c t e P t IEO , e + c t h P t IEO , h + c t c P t IEO , c )
C buy = t = 1 T ( P t GRID p t GRID + G t GAS p t GAS )
C om = t = 1 T c CHP P t CHP + c GB H t GB + c EB H t EB + c AC Q t AC + c EC Q t EC + c ESS ( P t , dis ESS P t , char ESS ) + c HSS ( P t , dis HSS P t , char HSS )
where C IEO is the daily operational revenue of the IEO; C sell is the energy sales revenue of the IEO; and C buy , C om , C idr , and C CO 2 are the energy purchase cost, equipment maintenance cost, IDR incentive compensation cost, and carbon emission cost of the IEO, respectively. P t GRID and G t GAS are the electricity and gas purchased by IEO at time t, respectively; p t GRID and p t GAS are the electricity and gas purchase prices from the upper-level energy network, respectively. P t CHP is the electric energy output of the CHP at time t; H t GB and H t EB are the heat energy output of GB and EB at time t, respectively; Q t EC and Q t AC are the cooling energy output of EC and AC at time t, respectively; P t , char ESS and P t , dis ESS are the charging and discharging electric energy of the ESS at time t, respectively; P t , char HSS and P t , dis HSS are the charging and discharging heat energy of the HSS at time t, respectively; and c CHP , c GB , c EB , c AC , c EC , c ESS , and c HSS are the operating maintenance costs per unit output power for the CHP, GB, EB, AC, EC, ESS, and HSS, respectively.
For IBDR, IEO must simultaneously establish incentive compensation prices and IBDR plans, where the IDR compensation cost is as follows:
C idr = t = 1 T y { e , g , h } c t CL , y P t CL , y
Carbon emission allowances are allocated based on the baseline method. If emissions exceed the allocated allowance, additional allowances must be purchased [25]. The resulting carbon emission cost is calculated as follows:
C CO 2 = t = 1 T c re e GRID P t GRID + P t CHP ( e CHP δ CHP ) + H t GB ( e GB δ GB )
where e GRID is the carbon dioxide emission factor for the power grid; e CHP is the carbon dioxide emission factor for CHP; e GB is the carbon dioxide emission factor for GB; c re signifies the unit price of carbon allowances; δ CHP is the free carbon emission allowance coefficient for CHP; and δ GB is the carbon emission allowance coefficient for GB.

4.1.2. Constraints

The constraints of the IEO upper-layer pricing decision model must satisfy power balance constraints, operational constraints for equipment, price constraints, and demand response constraints as defined in Equations (3)–(9). Its power balance constraints are as follows:
P t GRID + P t WT + P t PV + P t CHP + P t , dis ESS P t cut P t , char ESS P t EC P t EB = P t IEO , e G t GAS = G t CHP + G t GB H t EB + H t GB + H t CHP + P t , dis HSS P t , char HSS H t AC = P t IEO , h Q t AC + Q t EC = P t IEO , c
P t IEO , e + P t TL , e + P t CL , e P t SL , e = P t L , e P t IEO , h + P t CL , h + P t SL , h = P t L , h P t IEO , c + P t CL , c + P t SL , c = P t L , c
where P t EB and P t EC are the electric power inputs to EB and EC at time t; P t WT and P t PV are the output powers of WT and PV at time t, respectively; P t cut is the electric energy curtailed during at time t; H t CHP is the heat energy output from CHP at time t; H t AC is the heat energy input to AC at time t; and G t CHP and G t GB are the natural gas inputs to CHP and GB, respectively.
Due to the similarities in their modeling, a unified model is adopted for both EES and HSSs, as presented in Equation (30).
S t B = ( 1 σ B ) S t 1 B + ( η B , char P t , char B P t , dis B η B , dis ) 0 P t , c B U t , char B P char , max B 0 P t , dis B U t , dis B P dis , max B U t , char B + U t , dis B = 0 μ min B S max B S t B μ max B S max B S 0 B = S T B = μ min B S max B t = 1 T ( P t , char B P t , dis B ) = 0
where S t B and S t 1 B are the stored energies of the energy storage equipment at time t and time t−1, respectively; σ B is the loss coefficient of the B-class storage equipment; η B , char and η B , dis are the B-class storage equipment charging and discharging efficiency, respectively; and B is the type of energy storage equipment, where B ESS , HSS . U t , char B and U t , dis B are 0–1 variables representing the charging and discharging states of the B-class storage equipment at time t, respectively; P char , max B and P dis , max B are the B-class storage equipment charging and discharging maximum power, respectively; S max B is the maximum capacity of the B-class storage equipment; μ min B and μ max B are the minimum and maximum capacity coefficients of the B class storage equipment, respectively; and S 0 B and S T B are the energy stored at the beginning and end of the scheduling cycle, respectively.
The operation constraints of the energy conversion equipment, including CHP, GB, EB, EC, and AC, are shown in Equations (31) and (32).
P t CHP = η CHP , e G t CHP H t CHP = η CHP , h ( 1 η CHP , e ) G t CHP H t GB = η GB G t GB H t EB = η EB P t EB Q t AC = η AC H t AC Q t EC = η EC P t EC
0 H t CHP H max CHP 0 H t GB H max GB 0 H t EB H max EB 0 Q t AC Q max AC 0 Q t E C Q max EC
where η CHP , e and η CHP , h are the electrical and heat energy conversion efficiency of CHP; η GB , η EB , η AC , and η EC are the energy conversion efficiencies of GB, EB, AC, EC, respectively; and H max CHP , H max GB , H max EB , Q max AC and Q max EC are the upper power limits of CHP, GB, EB, AC, and EC.
As shown in Equation (33), the operation of each energy conversion equipment must also satisfy ramping constraints.
P t CHP P t 1 CHP P max , R CHP H t GB H t GB H max , R GB H t EB H t 1 EB H max , R EB Q t AC Q t 1 AC Q max , R AC Q t EC Q t 1 EC Q max , R EC
where P max , R CHP , H max , R GB , H max , R EB , Q max , R AC , and Q max , R EC are the maximum ramp rate of CHP, GB, EB, AC, and EC.
As shown in Equation (34), the energy selling price and incentive compensation price established by the IEO must satisfy the upper and lower limits, as well as the average selling price constraint. This ensures that both energy selling prices and incentive compensation prices remain within a reasonable range, preventing extreme price scenarios [26].
c min y c t y c max y t = 1 T c t y T c av y c min CL , y c t CL . y c max CL , y
where c min y and c min CL , y are the minimum energy selling price and minimum incentive compensation price for the y class energy at time t, respectively; and c av y is the maximum average selling price for y class energy at time t. c max y and c max CL , y are the maximum energy selling price and maximum incentive compensation price for y class energy at time t, respectively.

4.2. LA Energy Consumption Decision Model

4.2.1. Objective Function

Following the modeling approach in [27], the satisfaction loss incurred by the LA due to demand response deviations from baseline load is quantified using the following function:
f t LA = ( a x , y P t x , y + b x , y P t x , y 2 )
where f t LA is the satisfaction loss cost at time t; and a x , y and b x , y denote the satisfaction loss parameters for the x-class demand response of y energy.

4.2.2. Constraints

LA adjusts electricity, heating, and cooling demand based on the energy sales and incentive compensation prices established by the IEO, aiming to minimize the comprehensive cost of LA’s economic efficiency and satisfaction. This cost, representing the sum of LA’s energy procurement cost, satisfaction loss cost, and IDR compensation, can be expressed as follows:
min C LA = t = 1 T ( f t LA + c t e P t LA , e + c t h P t LA , h + c t c P t LA , c C idr )
P t LA , e + P t TL , e + P t CL , e P t SL , e = P t L , e P t LA , h + P t CL , h + P t SL , h = P t L , h P t LA , c + P t CL , c + P t SL , c = P t L , c
Unlike the demand response constraints in the IEO pricing decision mode, the demand response constraints in the LA decision model only consider the power balance constraints of IBDR and SBDR, as well as the maximum response rate constraint.
t T P t TL , e = 0 P t SL , e = η hte P t SL , h + η cte P t SL , c 0 < P t x , y < λ max x , y P t L , y

4.3. Rescheduling Model for IEO Under a Response Adjustment Mechanism

In this paper, since the IDR model serves as a linear approximation of the user’s actual response characteristics, it does have certain limitations and may not fully reflect the actual user response patterns. Inaccuracies can become more pronounced near the thresholds of the dead zone and saturation regions. Consequently, following the conclusion of the game, the IEO is required to verify and adjust its IDR plan in consultation with the LA to ensure its feasibility. This process involves the IEO modifying its equipment output and energy procurement schedules based on the determined energy selling price, incentive compensation price, and the confirmed expected response load from LA. The objective function of this response adjustment model is presented in Equation (22), subject to the constraints defined in Equations (28)–(33).

4.4. Model Solution

In the constructed IES Stackelberg game model, the upper-layer IEO decision model is a mixed-integer nonlinear model.
Due to the presence of 0–1 variables in Equations (3), (6) and (7), the big-M method is employed for linearization. After the Stackelberg game model converges, the response adjustment model also be solved. The model solution process is illustrated in Figure 4.

5. Case Study

5.1. Basic Data

This section validates the rationality and effectiveness of the IDR and the proposed GD-DRPR method through case study simulations. The output prediction curves for PV and WT, along with the LA energy consumption load curve, are shown in Figure 5 [28]. The time-of-use tariffs and natural gas for the upper-level grid are listed in Table 2. The energy trading limits are detailed in Table 3. The parameters of each piece of equipment in IES are detailed in Table 4. All parameters are configured according to the Reference [29]. In the proposed model, the LA reports its demand response capability to the IEO. The maximum demand response rates and the threshold parameters of the dead zone and saturation zone are set with reference to [30], where the maximum demand response rates for PBDR, SBDR, and IBDR are 10%, 6%, and 5%, respectively. The simulation was performed in MATLAB (R2023a) using the Yalmip toolbox to call the commercial solver Gurobi 11.0.
To analyze the effectiveness of the GD-DRPR proposed in this article, nine scenarios were set up as shown in Table 5. Scenario 3 is the method proposed in this paper.

5.2. Analysis of IDR and IES Optimization Results

To analyze the optimization results of IDR and IES, this section compares Scenarios 1, 2, and 3. The optimization results for Scenarios 1, 2, and 3 are shown in Table 6. The IDR and pricing results for Scenario 3 are illustrated in Figure 6 and Figure 7.
As shown in Table 6, Scenario 2 demonstrates significant improvements following the implementation of IDR. Total IEO revenue increases by 42.14% relative to Scenario 1, and carbon emissions decrease by 2.71%. These results indicate that IDR effectively enhances both the economic efficiency and the low-carbon performance of integrated energy system operations.
Further analysis reveals that Scenario 3 delivers additional benefits. Building on Scenario 1, the operational revenue of the IEO rises by 13.29%, and carbon emissions are reduced by 8.83% compared to Scenario 2. This finding suggests that GD-DRPR can further amplify the advantages brought by IDR.
It should also be noted that, in Scenario 3 compared to Scenario 1, the IEO’s energy sales revenue increases by 3407 CNY. This increment is less than the 4643 CNY rise in LA costs. The difference is primarily attributed to the increase in the cost associated with LA satisfaction loss. This outcome reflects the IEO’s ability to steer LA participation in IDR and modify energy consumption behavior through its pricing mechanism. While IDR enhances system efficiency, it may impose burdens on users through diminished satisfaction, potentially deterring their long-term participation. If users disengage due to these costs, the market sustainability of the IDR mechanism could be severely compromised. Long-term contracts between IEO and LA should be established. Such contracts enable IEO to face energy market risks on behalf of LA, thereby cultivating trust and ensuring stable involvement. Concurrently, these contracts can regulate IEO pricing strategies to prevent excessive cost burdens on LAs, ensuring that economic incentives are aligned with sustainable system operations.
As shown in Figure 6, after implementing IDR, the peak-to-valley difference in electrical load decreased from 1134.79 kW to 872.56 kW, demonstrating a significant smoothing effect on the load curve. Combined with the optimized electricity price curve for Scenario 3 in Figure 7b, it can be observed that the electricity price exhibits a time-of-use pattern with peak-valley-flat characteristics. This means that PBDR can encourage users to shift part of their load from peak periods to off-peak periods, thereby achieving peak shaving and valley filling.
A comparison of Figure 6b and Figure 7a reveals a significant decline in heat load during the period from 23:00 to 8:00. During this interval, the heat selling price exceeds the electricity selling price. Under the SBDR, part of the heat load is shifted to electrical load, thereby realizing load substitution and reducing the difference between peak and valley values for both electrical and heat loads. This process further enhances the system’s capacity to integrate renewable energy. In the period from 10:00 to 22:00, the heat selling price is lower than the electricity selling price, which in turn induces a shift in electricity load to heat load as a result of price signals. Further analysis of Figure 6c and Figure 7a indicates that, from 10:00 to 22:00, the cooling selling price is also lower than the electricity selling price. This price relationship leads to a corresponding shift in electricity load to cooling load. During other time periods, the price difference between cooling and electricity sales remains relatively small, resulting in minimal load substitution toward cooling. In summary, the price differential among various energy sources plays a pivotal role in influencing the magnitude of response under substitution-based demand response. When the price difference is limited, the effect of load substitution remains insignificant. In contrast, a larger price differential strengthens the guiding effect of SBDR and results in more notable substitution effects.
Furthermore, as shown in Figure 7b, the incentive compensation price for electricity remains relatively high during periods when the electrical load increases following the response, while it is nearly zero during other intervals. Additionally, Figure 6 further demonstrates that the heat load and cooling load exhibit similar variation patterns. This is because the IEO possesses stronger profit motivation during peak load periods and, therefore, is less inclined to guide users to reduce load through incentive compensation when load demand can be met. This observation indicates that the IEO primarily opts for PBDR and SBDR, rather than IBDR.

5.3. Analysis of GD-DRPR and Response Adjustment Mechanism

On one hand, IBDR reduces the energy purchased by the LA from the IEO; on the other hand, the IEO must compensate the IBDR for the reduced energy. When the IEO considers energy sales revenue and aims to maximize daily operational income, it tends to sell more energy to increase profits, which discourages the IEO from formulating IBDR plans. Therefore, when the IEO prioritizes energy sales revenue, IBDR becomes less active, making it difficult to analyze. To facilitate a simultaneous analysis of the effects of GD-DRPR and response adjustment mechanisms on PBDR, SBDR, and IBDR, this section compares Scenarios 4, 5, 6, and 7, in which the IEO considers only operational costs.
The optimization results for Scenarios 4, 5, 6, and 7 are shown in Table 7. The adjustment amounts for the scheduling plans between Scenarios 5 and 7 compared to Scenarios 4 and 6, are illustrated in Figure 8. The IDR plan deviations for IEO in Scenarios 4 and 6 are depicted in Figure 9.
As shown in Table 7, both carbon emissions and IEO costs increase in Scenario 5 and Scenario 7 compared to Scenario 4 and Scenario 6. This is because the proposed method requires the IEO to estimate the LA’s response using the IDR model, rather than making decisions directly based on the LA’s energy consumption plan. As a result, the IEO must adjust the response plan through the response adjustment mechanism before rescheduling. Figure 8 shows that the maximum power adjustment in the rescheduling plan for Scenario 7 is 22.7 kW, which is significantly lower than the 293.79 kW observed in Scenario 5. Furthermore, Scenario 5 exhibits a 71 kg increase in carbon emissions and a 792 CNY increase in IEO costs compared to Scenario 4. Scenario 7, on the other hand, shows only a 2 kg increase in carbon emissions and an 8 CNY increase in IEO costs relative to Scenario 6. These results indicate that the rescheduling outcomes for Scenario 7 are closer to the original scheduling outcomes than those for Scenario 5.
In addition, it can be observed that smaller adjustments to the dispatch plan result in post-rescheduling outcomes that are more closely aligned with the pre-rescheduling results. Moreover, compared to Scenario 5, Scenario 7 reduces carbon emissions and IEO costs by 3.46% and 0.34%, respectively. This demonstrates that smaller adjustments to the dispatch plan yield more favorable results in terms of low-carbon performance and economic efficiency after rescheduling. In Figure 9, a response deviation greater than 0 indicates that the post-response load in the IDR plan is higher than the expected post-response load by LA. From 24:00 to 8:00, Scenario 4 shows a higher planned post-response electrical load and a lower planned post-response heat load, which is reflected in Scenario 5 as reduced electricity purchases in Figure 8a.
At the same time, the output of the CHP, which can simultaneously generate electricity and supply heat, mainly replaces the output of the GB, which can only supply heat. From 9:00 to 23:00, Scenario 4 exhibits a lower planned post-response electrical load and a higher planned post-response heat load, which is reflected in Scenario 5 as increased electricity purchases and reduced GB output in Figure 8a. Additionally, the output of the EC replaces that of the AC systems. Furthermore, in Scenario 4, the planned response-adjusted cooling load is significantly lower only during the periods from 7:00 to 8:00 and from 22:00 to 24:00, which is reflected in Scenario 5 as increased EC and AC outputs in Figure 8a. According to Table 1, in Scenario 4, the IEO increases electricity purchases during periods when the upper-level grid’s time-of-use price is higher, and decreases purchases when the price is lower, thereby increasing the IEO’s overall costs.
In Figure 9b, at 1:00, 7:00, and from 19:00 to 23:00, the planned response in Scenario 6 leads to a higher heat load, resulting in a reduction in EB or GB output in Scenario 7 that exceeds the corresponding increase in CHP output. At 8:00, the planned response results in lower heat load, leading to the replacement of GB output with EB and CHP. At 9:00, due to the lower planned post-response electrical load, the ESS in Scenario 7 charges to store excess electricity. Additionally, since deviations in other IDR planned values are relatively small, the subsequent changes in electricity purchases and equipment output are comparatively balanced, leading to only minimal differences between the results of Scenario 7 and Scenario 6.
Additionally, without introducing a response adjustment mechanism, the planned post-response load error results for Scenarios 4 and 6 are shown in Table 7. As indicated in Table 7, GD-DRPR reduced MAPE and RSME by 92.45% and 92.73%, respectively. In summary, GD-DRPR effectively corrects the IDR incentive threshold parameter, aligning the IDR plan formulated by the IEO more closely with actual user responses. This reduces the need for schedule adjustments caused by IDR plan deviations, thereby lowering system operating costs and validating the method’s effectiveness in enhancing the economic efficiency of the IES.

5.4. The Effect of the Satisfaction Loss Parameters on GD-DRPR

To evaluate the adaptability of GD-DRPR under varying degrees of user energy consumption characteristics, this section compares the parameter revision effects of IEO and LA during a single interaction process. The revised IDR stimulus threshold parameter from Scenario 6 is set as the initial value, while the user satisfaction loss parameters are allowed to fluctuate randomly around the initial value. Its maximum fluctuation rate gradually increased from 10% to 50%.
Figure 10 illustrates the effect of satisfaction loss parameters on GD-DRPR. The initial MAPE increases as the volatility of the user satisfaction loss parameters rises, demonstrating a clear positive correlation between the two. This is attributed to the gradient descent mechanism employed by GD-DRPR. Larger variations in user energy consumption characteristics lead to significant deviations in the IDR incentive threshold parameters, thereby necessitating more rounds of parameter correction to achieve convergence. Although the MAPE decreases significantly during the initial iterations, the rate of decline slows down after the third iteration. Furthermore, the MAPE exhibits peaks near specific fluctuation rates, indicating that user energy consumption characteristics are more sensitive within these ranges, exerting a more pronounced influence on the parameter adjustment process.

5.5. Convergence Analysis of GD-DRPRs

Through experiments, this paper finds that the proposed GD-DRPR algorithm performs well with a step size of 0.001; therefore α = 0.001 is set.
This section sets the step sizes to 0.3 α , 0.7 α , α , 1.5 α , 2 α , and 10 α to analyze the impact of different step sizes on algorithm convergence. Figure 11 shows the convergence curves for different step sizes, where the number of iterations refers to the inner iteration step n. It can be observed that the algorithm is sensitive to the step size. When the step size is large, the algorithm is prone to stopping early or even causes the objective function value to increase. When the step size is small, the algorithm converges slowly. When the step size is 0.001, the algorithm can converge relatively quickly.

5.6. Comparative Analysis of GD-DRPR and Other Methods

This section compares the differences between GD-DRPR and other traditional solution methods in terms of final optimization results and the number of game interactions. Traditional methods primarily fall into two categories: first, those that equivalently transform the two-layer model into a single-layer model for solution based on KKT conditions; second, those that employ iterative optimization algorithms for solution, with the adaptive genetic algorithm (AGA) serving as a representative example in this paper. Therefore, this section conducts a comparative analysis of Scenarios 3, 8, and 9, examining the solution methods for different Stackelberg game models.
To evaluate the adaptability of GD-DRPR under varying wind and solar penetration rates, the predicted wind and solar outputs were scaled proportionally, with penetration rates set at 40%, 50%, and 60%. The results for Scenarios 3, 8, and 9 under each penetration rate are compared in Table 8.
The KKT-based solution method requires the IEO to have complete information, including user energy consumption characteristics. Therefore, under different penetration rates, the revenue of the IEO for Scenario 8 in Table 8 consistently remains the highest. In both GD-DRPR and AGA, the IEO operates without complete information regarding the LA, which more accurately reflects real scenarios. As Table 8 illustrates, under these same conditions of incomplete information, the number of interactions in Scenario 9 is significantly higher than in Scenario 3. This demonstrates that GD-DRPR effectively minimizes the need for repeated gaming between the two parties and thereby enhances transaction efficiency. Moreover, the solving time in Scenario 3 is shorter than in Scenario 9, demonstrating that, despite the increased model complexity introduced by GD-DRPR, it still maintains a computational efficiency advantage over AGA. Meanwhile, the optimization results for Scenario 3 are generally closer to those of Scenario 8. This is primarily attributed to the fact that AGA is more prone to getting trapped in local optima during the search process. Considering that Scenario 8 represents the ideal case where the IEO possesses full information, its results serve as the theoretical upper bound. Therefore, the fact that Scenario 3 achieves performance close to this upper bound under conditions of incomplete information demonstrates the effectiveness and practicality of GD-DRPR.
As illustrated in Figure 11, the GD-DRPR algorithm converges at 17 inner iterations when an appropriate step size is applied. In conjunction with Table 8, it is evident that the IEO and LA models exchange information only four times, meaning the Stackelberg game process concludes in exactly three outer iterations. Since the inner iterations of the proposed approach are not reset at the onset of a new outer iteration cycle, the algorithm only needs to solve the IEO model approximately a dozen times and the LA model a few times. In contrast, the AGA method requires solving the IEO model several dozen times per iteration, depending on the configured population size. Consequently, when applied to large-scale complex systems and extended to multi-tier games, the proposed method not only shows lower computational complexity than conventional distributed algorithms but also effectively reduces the overhead communication among different agents.
Notably, when wind and photovoltaic penetration reach 50% and 60%, Scenario 3 exhibits superior performance regarding carbon emissions and power curtailment. This demonstrates that GD-DRPR enables the IEO to derive pricing strategies that align closely with real-world user behavior. Consequently, this effectively mobilizes demand-side resources, enhancing both the low-carbon operation and the renewable energy integration capability of the IES.

6. Conclusions

This paper addresses the issue that IEO struggles to formulate accurate IDR plans due to incomplete user information, leading to deviations in user demand response. It proposes the GD-DRPR and response adjustment mechanism, then establishes an IES Stackelberg game model based on this approach. Through case study verification and analysis, the following conclusions are drawn:
  • The proposed GD-DRPR reduces the MAPE of planned IDR loads by 92.45% and the RMSE by 92.73%, enabling rapid revision of stimulus threshold parameters in the dead zone and saturation zone of the IDR model.
  • The IES Stackelberg game model proposed in this paper, the IEO employs the IDR model to support decision-making. Compared to scenarios without GD-DRPR, carbon emissions were reduced by 13.29%, while the IEO’s total revenue increased by 8.83%, achieving synergistic improvements in both the economic efficiency and low-carbon performance of IES operations.
  • Compared to traditional Stackelberg game methods, GD-DRPR reduces the interaction frequency between IEO and LA, thereby improving transaction efficiency. This reduction in interactions also alleviates the decision-making burden on the LA, effectively lowering the market entry barriers for both users and LAs. The improved transaction efficiency facilitates the IEO’s participation in more complex and large-scale market environments.
The method proposed in this paper can serve as a reference for revising the demand response model and solving the Stackelberg game model. However, many issues still need to be addressed before its practical application. These mainly include:
  • The initial stimulus threshold parameters and the upper limits of users’ demand response are merely set within a reasonable range based on previous studies. The adaptability of GD-DRPR in complex real-world environments remains to be verified.
  • This study only considers a deterministic model. The actual performance of GD-DRPR under uncertain conditions needs to be validated.
In future work, we will conduct further research to address these two issues for further optimization.

Author Contributions

Conceptualization, K.Z. and L.X.; methodology, K.Z.; software, K.Z.; validation, K.Z., L.X. and Y.B.; formal analysis, Y.B.; investigation, Y.B.; resources, K.Z.; data curation, K.Z.; writing—original draft preparation, K.Z.; writing—review and editing, L.X.; visualization, Y.B.; supervision, L.X.; project administration, L.X.; funding acquisition, L.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Xinjiang Uygur Autonomous Region Natural Science Foundation Key Project (2024D01D05) and the National Natural Science Foundation of China (62463030).

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Nomenclature

Abbreviations Explanation
IESIntegrated energy system
IEOIntegrated energy operator
LALoad aggregator
IDRIntegrated demand response
GD-DRPRGradient descent-based integrated response stimulus threshold parameters revision method
PVPhotovoltaic
WTWind turbine
CHPCombined heat and power
GBGas boiler
EBElectric boiler
ACAbsorption chiller
ECElectric chiller
ESSElectrical storage system
HSSHeat storage system
PBDRPrice-based demand response
SBDRSubstitution-based demand response
IBDRIncentive-based demand response
Variables
c t y The energy selling price for the y class energy at time t
c t CL . y The incentive compensation price for the y class energy at time t
D MAPE ( n ) The mean absolute percentage error between the expected post-response load from the IEO and the LA in the inner iteration n
D ( m ) The mean absolute percentage error between the expected post-response load from the IEO and the LA in outer iteration m
P t L , e The baseline electric load at time t
P t L , h The baseline heat load at time t
P t L , c The baseline cooling load at time t
P t TL , e The shiftable electric load at time t
P t SL , y The substitutable load of y class energy at time t
P t CL , y The curtailable load of y class energy at time t
P t IEO , y The expected post-response load of y class energy at time t for IEO
P t LA , y The expected post-response load of y class energy at time t for LA
P t GRID The electricity purchased by IEO at time t
G t GAS The gas purchased by IEO at time t
P t CHP The electric energy output of the CHP at time t
P t EB The electric power inputs to EB at time t
P t EC The electric power inputs to EC at time t
P t WT The output powers of PV at time t
P t PV The output powers of WT at time t
P t cut The electric energy curtailed at time t
P t , char B The charging electric energy of the B-class storage equipment at time t
P t , dis B The discharging electric energy of the B-class storage equipment at time t
H t CHP The heat energy output from CHP at time t
H t GB The heat energy output of GB at time t
H t EB The heat energy output of EB at time t
H t AC The heat energy input to AC at time t
Q t AC The cooling energy output of AC at time t
Q t EC The cooling energy output of EC at time t
r t x , y The response stimulus level of class x load with class y energy at time t
r t , min x , y The dead zone thresholds for the response stimulus level of class x load with class y energy at time t
r t , max x , y The saturation zone thresholds for the response stimulus level of class x load with class y energy at time t
λ t x , y The demand response rate of class x load with class y energy at time t
λ max x , y The maximum demand response rate of class x load with class y energy at time t
λ t , n IEO , x , y The IEO’s expected demand response rate of class x load with class y energy at time t in iteration n
λ t , n LA , x , y The LA’s expected demand response rate of class x load with class y energy at time t in iteration n
Superscript
xThe type of responsive load, where x { TL , SL , CL } with TL, SL, and CL represent transferable load, substitutable load, and curtailable load, respectively
yThe energy type of load, where y { e , h , c } with e, h, and c represent electrical, heat, and cooling loads, respectively
BThe type of energy storage equipment, where B ESS , HSS
Subscript
tThe time values
nThe iteration step count for the inner iteration within GD-DRPR

References

  1. Ziemba, P.; Zair, A. Temporal Analysis of Energy Transformation in EU Countries. Energies 2023, 16, 7703. [Google Scholar] [CrossRef]
  2. Gao, C.; Lu, H.; Chen, M.; Chang, X.; Zheng, C. A low-carbon optimization of integrated energy system dispatch under multi-system coupling of electricity-heat-gas-hydrogen based on stepwise carbon trading. Int. J. Hydrogen Energy 2025, 97, 362–376. [Google Scholar] [CrossRef]
  3. Zeng, B.; Yang, X.; Hu, P.; Wang, Y.; Dong, H.; Gong, D.; Ye, X. Towards a digitally enabled intelligent coal mine integrated energy system: Evolution, conceptualization, and implementation. Sustain. Energy Technol. Assess. 2025, 73, 104128. [Google Scholar] [CrossRef]
  4. Hua, H.; Du, C.; Chen, X.; Kong, H.; Li, K.; Liu, Z.; Naidoo, P.; Lv, M.; Hu, N.; Fu, M.; et al. Optimal dispatch of multiple interconnected-integrated energy systems considering multi-energy interaction and aggregated demand response for multiple stakeholders. Appl. Energy 2024, 376, 124256. [Google Scholar] [CrossRef]
  5. Wang, H.; Zhang, J.; Wang, P.; Zhang, N. The role of demand-side flexibilities on low-carbon transition in power system: A case study of West Inner Mongolia, China. Renew. Energy 2025, 242, 122478. [Google Scholar] [CrossRef]
  6. Huang, L.; Liu, H.; Liu, B.; Ma, S.; Wang, N.; Xie, H.; He, Z. Demand response with incomplete information: A systematic review. Electr. Power Syst. Res. 2025, 246, 111720. [Google Scholar] [CrossRef]
  7. Wang, Y.; Jin, Z.; Liang, J.; Li, Z.; Dinavahi, V.; Liang, J. Low-carbon optimal scheduling of park-integrated energy system based on bidirectional Stackelberg-Nash game theory. Energy 2024, 305, 132342. [Google Scholar] [CrossRef]
  8. Huang, Y.; Wang, Y.; Liu, N. A two-stage energy management for heat-electricity integrated energy system considering dynamic pricing of Stackelberg game and operation strategy optimization. Energy 2022, 244, 122576. [Google Scholar] [CrossRef]
  9. Ali, A.O.; Elmarghany, M.R.; Abdelsalam, M.M.; Sabry, M.N.; Hamed, A.M. Closed-loop home energy management system with renewable energy sources in a smart grid: A comprehensive review. J. Energy Storage 2022, 50, 104609. [Google Scholar] [CrossRef]
  10. Li, X.; Wu, N.; Lei, L. Nash-Stackelberg-Nash three-layer mixed game optimal control strategy for multi-integrated energy systems considering multiple uncertainties. Energy 2025, 320, 135418. [Google Scholar] [CrossRef]
  11. Gao, D.; Wei, G.; Zhi, Y.; Yang, X. Optimal energy trading in rural micro-grids with variable ownership of photovoltaics and power stations: A Stackelberg game approach. Energy Build. 2025, 330, 115338. [Google Scholar] [CrossRef]
  12. Zhao, W.; Ma, K.; Yang, J.; Qu, Z.; Guo, S.; Qi, F.; Sun, W. A two-stage scheduling strategy integrated with Stackelberg game approach to coordinate seaport logistics operation and energy management. Electr. Power Syst. Res. 2025, 244, 111527. [Google Scholar] [CrossRef]
  13. Yang, P.; Jiang, H.; Liu, C.; Kang, L.; Wang, C. Coordinated optimization scheduling operation of integrated energy system considering demand response and carbon trading mechanism. Int. J. Electr. Power Energy Syst. 2023, 147, 108902. [Google Scholar] [CrossRef]
  14. Shi, S.; Ji, Y.; Zhu, L.; Liu, J.; Gao, X.; Chen, H.; Gao, Q. Interactive optimization of electric vehicles and park integrated energy system driven by low carbon: An incentive mechanism based on Stackelberg game. Energy 2025, 318, 134799. [Google Scholar] [CrossRef]
  15. Zhang, M.; Yang, J.; Yu, P.; Tinajero, G.D.A.; Guan, Y.; Yan, Q.; Zhang, X.; Guo, H. Dual-Stackelberg game-based trading in community integrated energy system considering uncertain demand response and carbon trading. Sustain. Cities Soc. 2024, 101, 105088. [Google Scholar] [CrossRef]
  16. Shi, R.; Jiao, Z. Individual household demand response potential evaluation and identification based on machine learning algorithms. Energy 2023, 266, 126505. [Google Scholar] [CrossRef]
  17. Liu, D.; Sun, Y.; Qu, Y.; Li, B.; Xu, Y. Analysis and Accurate Prediction of User’s Response Behavior in Incentive-Based Demand Response. IEEE Access 2019, 7, 3170–3180. [Google Scholar] [CrossRef]
  18. Lu, R.; Hong, S.H. Incentive-based demand response for smart grid with reinforcement learning and deep neural network. Appl. Energy 2019, 236, 937–949. [Google Scholar] [CrossRef]
  19. Zhou, Y.; Su, Y.; Yao, R.; Xu, Q.; Qin, D.; Zhang, N.; Wang, Y. Key Technologies and Research Prospects of Smart Meter Data Sharing. Proc. CSEE 2026, 46, 1364–1383. [Google Scholar]
  20. Li, P.; Wang, H.; Zhang, B. A Distributed Online Pricing Strategy for Demand Response Programs. IEEE Trans. Smart Grid 2019, 10, 350–360. [Google Scholar] [CrossRef]
  21. Khezeli, K.; Bitar, E. Risk-Sensitive Learning and Pricing for Demand Response. IEEE Trans. Smart Grid 2018, 9, 6000–6007. [Google Scholar] [CrossRef]
  22. Zheng, S.; Qi, Q.; Sun, Y.; Ai, X. Integrated demand response considering substitute effect and time-varying response characteristics under incomplete information. Appl. Energy 2023, 333, 120594. [Google Scholar] [CrossRef]
  23. Yan, H.; Hou, H.; Deng, M.; Si, L.; Wang, X.; Hu, E.; Zhou, R. Stackelberg game theory based model to guide users’ energy use behavior, with the consideration of flexible resources and consumer psychology, for an integrated energy system. Energy 2024, 288, 129806. [Google Scholar] [CrossRef]
  24. Li, L.; Fan, S.; Xiao, J.; Zhou, H.; Shen, Y.; He, G. Fair trading strategy in multi-energy systems considering design optimization and demand response based on consumer psychology. Energy 2024, 306, 132393. [Google Scholar] [CrossRef]
  25. Li, X.; Deng, J.; Liu, J. Energy–carbon–green certificates management strategy for integrated energy system using carbon–green certificates double-direction interaction. Renew. Energy 2025, 238, 121937. [Google Scholar] [CrossRef]
  26. Zhao, Y.; Wei, Y.; Tang, Y.; Guo, Y.; Sun, H. Multi-objective robust dynamic pricing and operation strategy optimization for integrated energy system based on stackelberg game. Int. J. Hydrogen Energy 2024, 83, 826–841. [Google Scholar] [CrossRef]
  27. Yan, M.; Li, H.; Wang, J.; He, Y. Optimal operation model of a park integrated energy system considering uncertainty of integrated demand response. Power Syst. Prot. Control 2022, 50, 163–175. [Google Scholar] [CrossRef]
  28. Hao, H.; Wu, H.; Chen, X.; Wu, X.; Yu, K. Double-layer Optimization of Integrated Energy Systems Considering Flexible Collaboration Between Energy Economy and User Satisfaction. Power Syst. Technol. 2024, 48, 4174–4188. [Google Scholar] [CrossRef]
  29. Nan, B.; Jiang, C.; Dong, S.; Xu, C. Day-ahead and Intra-day Coordinated Optimal Scheduling of Integrated Energy System Considering Uncertainties in Source and Load. Power Syst. Technol. 2023, 47, 3669–3683. [Google Scholar] [CrossRef]
  30. Cui, Y.; Wang, C.; Niu, Y.; Wang, K.; Yao, W.; Liu, C. Two-stage stochastic optimization decision of integrated energy system considering the uncertainty of integrated demand response. Power Syst. Technol. 2025, 49, 2232–2242. [Google Scholar] [CrossRef]
Figure 1. Structure and energy flow of IES.
Figure 1. Structure and energy flow of IES.
Energies 19 01742 g001
Figure 2. The framework of the Stackelberg game under GD-DRPR.
Figure 2. The framework of the Stackelberg game under GD-DRPR.
Energies 19 01742 g002
Figure 3. Demand response model based on the principle of consumer psychology.
Figure 3. Demand response model based on the principle of consumer psychology.
Energies 19 01742 g003
Figure 4. IES Stackelberg game solution and rescheduling flowchart.
Figure 4. IES Stackelberg game solution and rescheduling flowchart.
Energies 19 01742 g004
Figure 5. Prediction curves of day-ahead load and PV output.
Figure 5. Prediction curves of day-ahead load and PV output.
Energies 19 01742 g005
Figure 6. Comparison of load DR before and after: (a) electrical load; (b) heat load; (c) cooling load.
Figure 6. Comparison of load DR before and after: (a) electrical load; (b) heat load; (c) cooling load.
Energies 19 01742 g006
Figure 7. Prices set by IEO: (a) energy selling price; (b) demand response incentive compensation price.
Figure 7. Prices set by IEO: (a) energy selling price; (b) demand response incentive compensation price.
Energies 19 01742 g007
Figure 8. Adjusted scheduling plans for Scenarios 5 and 7: (a) Scenario 5; (b) Scenario 7.
Figure 8. Adjusted scheduling plans for Scenarios 5 and 7: (a) Scenario 5; (b) Scenario 7.
Energies 19 01742 g008
Figure 9. Demand response deviation under Scenario 4 and Scenario 6: (a) electricity load; (b) heat load; (c) cooling load.
Figure 9. Demand response deviation under Scenario 4 and Scenario 6: (a) electricity load; (b) heat load; (c) cooling load.
Energies 19 01742 g009
Figure 10. The effect of user satisfaction loss parameters on GD-DRPR.
Figure 10. The effect of user satisfaction loss parameters on GD-DRPR.
Energies 19 01742 g010
Figure 11. Comparison of convergence curves of step size.
Figure 11. Comparison of convergence curves of step size.
Energies 19 01742 g011
Table 1. Comparison of the research in this paper with related works.
Table 1. Comparison of the research in this paper with related works.
Refs.Solution AlgorithmParameter EstimationParameter Revision During
Optimization
CentralizedDecentralizedData-DrivenTraditional StatisticalYesNo
[7,8,12,13,14]
[10,11]
[16,17,18]
[20,21,22]
This paper
Table 2. The time-of-use tariffs and natural gas price.
Table 2. The time-of-use tariffs and natural gas price.
Energy TypeTimePrice (CNY/kWh)
Electricity13:00–16:00, 19:00–22:00 1.1398
08:00–13:00, 16:00–19:00, 22:00–24:000.7112
24:00–08:000.3815
Natural gasFull day0.35
Table 3. Energy trading limits.
Table 3. Energy trading limits.
Energy TypePrice Floor (CNY/kWh)Price Cap (CNY/kWh)Average Price Cap (CNY/kWh)
Electricity0.21.240.8
Heat0.20.60.5
Cooling0.20.60.46
Table 4. Equipment parameters for IES.
Table 4. Equipment parameters for IES.
ParametersNumerical ValueParametersNumerical Value
S max ESS /(kWh)800 η EC 3
S max HSS /(kWh)1000 P dis , max ESS , P char , max ESS /(kW)50, 50
σ ESS 0.0025 P dis , max HSS , P char , max HSS /(kW)50, 50
σ HSS 0.0017 Q max CHP /(kW)80, 800
μ min ESS , μ max ESS 0.1, 0.9 P max , R CHP /(kW/h)25
μ min HSS , μ max HSS 0.1, 0.9 Q min GB /(kW)80, 800
η ESS , char , η ESS , dis /(kW)0.95, 0.95 Q max , R GB /(kW/h)33.3
η HSS , char , η HSS , dis /(kW)0.9, 0.9 Q max EB /(kW)450
η CHP , e 0.33 Q max , R EB /(kW/h)100
η CHP , h 0.51 C max AC /(kW)150
η GB 0.9 C max , R AC /(kW/h)33.3
η EB 0.95 C max EC /(kW)250
η A C 0.8 C max , R EC /(kW/h)100
Table 5. Information for scenarios.
Table 5. Information for scenarios.
ScenarioIDRStackelberg GameResponse Adjustment MechanismObjectiveSolution Method
IEO RevenueIEO
Cost
GD-DRPRKKT-BasedAGA
1
2
3
4
5
6
7
8
9
Table 6. Comparison of results for Scenarios 1, 2, and 3.
Table 6. Comparison of results for Scenarios 1, 2, and 3.
ScenarioCarbon
Emissions/kg
C sell /CNY C IEO /CNY C LA /CNY
1553834,491865934,247
2538836,87712,30837,730
3489937,89813,45938,890
Table 7. Comparison of Results for Scenarios 4, 5, 6, and 7.
Table 7. Comparison of Results for Scenarios 4, 5, 6, and 7.
ScenarioCarbon
Emissions/kg
IEO Cost/CNYMAPERSME
4478823,8052.43%32.18
5485924,5970%0
6468924,5050.18%2.34
7469124,5130%0
Table 8. Comparison of GD-DRPR with other methods.
Table 8. Comparison of GD-DRPR with other methods.
Renewable Energy
Penetration
ScenarioInteraction CountRuntime/sCarbon
Emissions/kg
Curtailment Rate C IEO /CNY
40%3448841340%19,929
815642060%20,112
94377444600%19,905
50%3450125321.91%23,537
815725902.28%23,626
94479326712.40%23,105
60%3449719014.72%25,748
815619385.16%25,914
94783020305.37%25,589
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhou, K.; Xie, L.; Bian, Y. Gradient Revision Method for Demand Response Stimulus Parameters of the Integrated Energy System. Energies 2026, 19, 1742. https://doi.org/10.3390/en19071742

AMA Style

Zhou K, Xie L, Bian Y. Gradient Revision Method for Demand Response Stimulus Parameters of the Integrated Energy System. Energies. 2026; 19(7):1742. https://doi.org/10.3390/en19071742

Chicago/Turabian Style

Zhou, Kaiyu, Lirong Xie, and Yifan Bian. 2026. "Gradient Revision Method for Demand Response Stimulus Parameters of the Integrated Energy System" Energies 19, no. 7: 1742. https://doi.org/10.3390/en19071742

APA Style

Zhou, K., Xie, L., & Bian, Y. (2026). Gradient Revision Method for Demand Response Stimulus Parameters of the Integrated Energy System. Energies, 19(7), 1742. https://doi.org/10.3390/en19071742

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop