Game Approach to HDR-TS-PV Hybrid Power System Dispatching

Hot dry rock (HDR) power stations have the potential to serve as an energy storage system for large-scale photovoltaic (PV) plants. For flexible operation, thermal storage (TS) power stations are required to coordinate with HDR power stations. In this study, a hybrid power system is constructed by combining the HDR, TS, and PV plants. Game theory is then introduced into the optimal dispatch of the hybrid power system. Considering HDR, TS, and PV as players, non-cooperative and cooperative game dispatching models are established and verified by a case in the Gonghe basin of Qinghai. Finally, the stability of the coalitions and the rationality of allocation of the hybrid power system is verified, and the sensitivity of critical parameters is analyzed. The results demonstrate that the overall payoff of the hybrid power system is increased by 10.15%. The payoff of the HDR power station is increased by 16.5%. The TS power station has obtained 50% of the total extra profits. The PV plant reduces the impact on the grid to obtain the priority of grid connection. Based on these results, a theoretical basis can be provided for developing generation systems based on the HDR resources in the Gonghe Basin.


Introduction
With the increasing clean energy penetration, the power grid's security, stability, and economy have been severely affected by the intermittent and fluctuating nature of power sources. The large-scale transmission and consumption of clean energy face the challenge of a shortage of reliable power sources. This issue is especially prominent for some clean energy bases where clean energy production and transmission are the primary industry [1], such as the Gonghe Basin. This region has tens of millions of kilowatt-level clean energy bases with abundant wind, solar, hydropower, and geothermal power resources. However, because of the lack of flexible and stable power sources, it is difficult to expand the scale of clean energy development further.
Among the various types of clean energy sources, hot dry rock (HDR) power stations have the most stable and reliable operational characteristics [2]. The development of a more flexible operational mode to form a new type of energy storage system, coupled with the vast amount of thermal energy in the geothermal reservoir, would be instrumental to ensure the reliability of grid-connected photovoltaic (PV) plants and to support clean energy demand of buildings, which can meet the seasonal cooling and power demand of buildings and maximize the utilization of hot dry rock resources.
The above research can well solve the coordinated dispatching problem and improve the system economy, but the interaction of the hybrid power system participants is not entirely reflected [25]. Therefore, other scholars have discussed the hybrid power system's coordinated operation based on game theory [26]. In [27], game theory was introduced to model a hybrid power system comprising wind turbines, PV plants, and batteries to optimize the dispatching strategy of energy storage systems. The Stackelberg game was used to obtain the charging strategy of electric vehicles and the pricing strategy of charging stations [28]. Cooperative and non-cooperative game methods were used to study automatic distributed optimal economic of automatic generation control in multi-region power systems [29]. A non-cooperative game dispatching model composed of wind power, thermal power, and pumped storage power plants was established to optimize the combined operation effects in different seasons [30]. Through non-cooperative game theory, a power generation dispatch model of a hybrid power system containing wind, wind, and heat was established to maximize the overall profits in the long-distance transmission scenario [31]. A general framework for power system flexibility analysis based on cooperative game theory was proposed for risk design and energy policy design [32]. The potential cooperative behaviors of multiple grid-connected micro-grids were simulated to achieve higher efficiency and better economy [33]. In [34], the payoffs of wind turbines, thermal power stations, and electric vehicles in the power market were studied based on cooperative game theory, and the imputation of the cooperative game was rationally allocated based on the Shapley value. The profit allocation of the hybrid power system in the cooperative game was analyzed in [35]. The results show that the Shapley value is the most stable allocation strategy.
Previous studies have provided extensive theories and methods for the hybrid power system dispatching. However, the characteristics of HDR power stations observed in the energy extraction and conversion process lead to different operational strategies compared to thermal power stations and energy storage power stations. The existing coordinated dispatching models and payoff allocation mechanisms for multi-energy sources are not entirely suitable for HDR power stations. Accordingly, the coordinated operation model and payoff allocation strategy for HDR power stations require further research. Therefore, the major contributions of this paper are: (1) a new method for designing HDR power stations is presented to reduce the power fluctuations of PV plants and improve the reliability of the power grid; (2) a hybrid power system composed of HDR, TS, and PV plants is constructed, and the coordinated operational characteristics of HDR power stations in the hybrid power system are analyzed; (3) a game pattern with HDR power stations, TS power stations, and PV plants as players is proposed. The non-cooperative game's payoff function and the characteristic function of the cooperative game are then defined; and (4) the game dispatching models are simulated with the case of the Gonghe Basin of Qinghai. Finally, a sensitivity analysis of the two critical parameters of the allowed fluctuation rate, and the penalty coefficient is carried out.
The remainder of this paper is organized as follows. The architecture of the HDR-TS-PV hybrid power system and characteristics of the HDR power station are given in Section 2. The HDR-TS-PV game model is proposed in Section 3. Section 4 simulates a case based on actual data of the Gonghe Basin. An in-depth discussion of the simulation results is carried out in Section 5. Conclusions and further developments are discussed in Section 6.

Architecture of the Hybrid Power System
Conventional HDR power stations include geothermal mining cycles and ORC generators [36]. Because the extraction of geothermal energy from artificial reservoirs established by hydraulic fracturing requires a bottom hole pressure of 30 MPa to 40 MPa and a pipeline that is 3 km to 4 km deep [37] and considering the working fluid velocity and the rock elastic modulus, the dynamic response time of the geothermal mining cycle is especially high. Although the annual utilization time can exceed 8000 h, the flexibility of HDR power stations is not sufficient to fully exploit the energy storage effect of the reservoirs to provide reliable support for PV plants.
Therefore, it is necessary to design a TS power station that can be coordinated with HDR power stations to improve operational flexibility. The TS power station comprised a thermal energy storage exchanger (TESE), a thermal energy storage tank, an ORC generator, and an electric heating device. The heat-transfer oil (HTO) was used as the thermal energy storage medium. The TS power station can use the TESE to store the waste heat from the HDR power station and convert the surplus power from PV plants into thermal energy through the electric heating device.
Conversely, conventional battery energy storage stations cannot adapt to the Qinghai plateau's cold climate, have a short cycle life, and cannot directly use geothermal energy. Nevertheless, the TS power station is not affected by environmental conditions; it demonstrates the flexible operation of the battery energy storage stations and can store geothermal energy directly, making it suitable for a hybrid power system with an HDR power station. The hybrid power system of the Gonghe Basin was considered as a test case for this research. The region has a plateau continental climate with large temperature variations and experiences rapid changes in weather conditions throughout the day. Although solar energy resources are abundant, the local loads are insufficient and require a large amount of PV power to be transmitted for consumption. Therefore, the local government has formulated a policy that renewable energy power stations should be equipped with energy storage systems with a capacity of 5% to 10%. An HDR-TS-PV hybrid power system architecture was constructed with a typical 200 MWp PV plant equipped with an HDR power station and a TS power station, as shown in Figure 1.

Architecture of the Hybrid Power System
Conventional HDR power stations include geothermal mining cycles and ORC generators [36]. Because the extraction of geothermal energy from artificial reservoirs established by hydraulic fracturing requires a bottom hole pressure of 30 MPa to 40 MPa and a pipeline that is 3 km to 4 km deep [37] and considering the working fluid velocity and the rock elastic modulus, the dynamic response time of the geothermal mining cycle is especially high. Although the annual utilization time can exceed 8000 h, the flexibility of HDR power stations is not sufficient to fully exploit the energy storage effect of the reservoirs to provide reliable support for PV plants.
Therefore, it is necessary to design a TS power station that can be coordinated with HDR power stations to improve operational flexibility. The TS power station comprised a thermal energy storage exchanger (TESE), a thermal energy storage tank, an ORC generator, and an electric heating device. The heat-transfer oil (HTO) was used as the thermal energy storage medium. The TS power station can use the TESE to store the waste heat from the HDR power station and convert the surplus power from PV plants into thermal energy through the electric heating device.
Conversely, conventional battery energy storage stations cannot adapt to the Qinghai plateau's cold climate, have a short cycle life, and cannot directly use geothermal energy. Nevertheless, the TS power station is not affected by environmental conditions; it demonstrates the flexible operation of the battery energy storage stations and can store geothermal energy directly, making it suitable for a hybrid power system with an HDR power station.
The hybrid power system of the Gonghe Basin was considered as a test case for this research. The region has a plateau continental climate with large temperature variations and experiences rapid changes in weather conditions throughout the day. Although solar energy resources are abundant, the local loads are insufficient and require a large amount of PV power to be transmitted for consumption. Therefore, the local government has formulated a policy that renewable energy power stations should be equipped with energy storage systems with a capacity of 5% to 10%. An HDR-TS-PV hybrid power system architecture was constructed with a typical 200 MWp PV plant equipped with an HDR power station and a TS power station, as shown in Figure 1. As Figure 1 shows, the HDR power station was designed according to the temperature of the HDR source, which was calculated considering the optimal working fluid flow and the power generation efficiency given in [11], and its capacity was set at 6.6 MW. The TS power station was coordinated with the HDR power station, whose capacity was also  As Figure 1 shows, the HDR power station was designed according to the temperature of the HDR source, which was calculated considering the optimal working fluid flow and the power generation efficiency given in [11], and its capacity was set at 6.6 MW. The TS power station was coordinated with the HDR power station, whose capacity was also set at 6.6 MW. The HDR-TS-PV hybrid power system improves the operational flexibility and economy of the HDR power station through the adjustable TS power station and simultaneously enhances the reliability of the PV plant. In this hybrid power system, the HDR power station, TS power station, and PV plant are connected to the low voltage side of the same transformer. To coordinate the operation of the three elements, this study makes the following assumptions:

1.
The HDR power station is a standard double-well system [4]; 2.
The effects of network congestion and line capacity on the system are ignored; 3.
The TOU price is regulated by the power selling side.

Operating Characteristics of HDR Power Station in Hybrid Power System
The HDR power station, as shown in Figure 2, is composed of a geothermal mining cycle and ORC generator, coupled with a heater. The geothermal mining cycle consists of underground HDR fracture channels, a production well, a reinjection well, a circulating pump, and the heater. The brine is driven by the circulating pump from the reinjection well into the underground HDR fracture channels to absorb geothermal energy, then from the production well into the heater to release the heat and re-enter the reinjection well. power station, TS power station, and PV plant are connected to the low voltage side of the same transformer. To coordinate the operation of the three elements, this study makes the following assumptions: 1. The HDR power station is a standard double-well system [4]; 2. The effects of network congestion and line capacity on the system are ignored; 3. The TOU price is regulated by the power selling side.

Operating Characteristics of HDR Power Station in Hybrid Power System
The HDR power station, as shown in Figure 2, is composed of a geothermal mining cycle and ORC generator, coupled with a heater. The geothermal mining cycle consists of underground HDR fracture channels, a production well, a reinjection well, a circulating pump, and the heater. The brine is driven by the circulating pump from the reinjection well into the underground HDR fracture channels to absorb geothermal energy, then from the production well into the heater to release the heat and re-enter the reinjection well. The ORC generator includes the heater, a recuperator, a turbine, a condenser, a generator, and a pump. The liquid organic working fluid butane is driven by the pump to enter the heater for absorbing the heat released by the brine. The butane with the saturated vapor state then enters the turbine to expand and convert heat into mechanical energy. The residual energy still contained in the exhaust butane gas is transferred to the lowtemperature liquid butane through the recuperator. The exhaust butane gas finally condenses into the low-temperature liquid butane in the condenser and then enters the next cycle again. A more detailed discussion of the ORC could be found in [38].
Several days are required for the HDR power station to establish a stable geothermal mining cycle, and a similar period is required to adjust the geothermal working fluid pressure and flow rate, and the fluid used is usually brine. Therefore, compared to thermal power plants, the power ramp rate of conventional HDR power stations is low, and frequent start and stop cannot be achieved. In the actual operation, the HDR power station consistently maintains its geothermal mining cycle by alternately configuring two circulating pumps. This feature further urges the ORC generators to continuously generate electricity in order to fully exploit the geothermal energy for higher economic benefits. The continuous power generation model of a conventional HDR power station [12]   The ORC generator includes the heater, a recuperator, a turbine, a condenser, a generator, and a pump. The liquid organic working fluid butane is driven by the pump to enter the heater for absorbing the heat released by the brine. The butane with the saturated vapor state then enters the turbine to expand and convert heat into mechanical energy. The residual energy still contained in the exhaust butane gas is transferred to the low-temperature liquid butane through the recuperator. The exhaust butane gas finally condenses into the low-temperature liquid butane in the condenser and then enters the next cycle again. A more detailed discussion of the ORC could be found in [38].
Several days are required for the HDR power station to establish a stable geothermal mining cycle, and a similar period is required to adjust the geothermal working fluid pressure and flow rate, and the fluid used is usually brine. Therefore, compared to thermal power plants, the power ramp rate of conventional HDR power stations is low, and frequent start and stop cannot be achieved. In the actual operation, the HDR power station consistently maintains its geothermal mining cycle by alternately configuring two circulating pumps. This feature further urges the ORC generators to continuously generate electricity in order to fully exploit the geothermal energy for higher economic benefits. The continuous power generation model of a conventional HDR power station [12] is where P t Hc represents the output power, m t r is the mass flow of brine, c pr is the specific heat capacity of brine, and η P is the ORC generator's efficiency. T HDR and T Well represent the outlet temperature of the production well and the temperature of the reinjection well, respectively.
It should be noted that under the hybrid power system architecture shown in Figure 1, part of the continuous geothermal energy generated by the geothermal mining cycle can be stored in the thermal energy storage tank through the TESE so that the geothermal energy flow is decoupled from the electrical energy flow of the ORC generator. Therefore, the HDR power station can fully utilize the fast adjustment speed and wide range of operating conditions of the ORC generator to provide reserves without losing geothermal energy. The flexible power generation model of the HDR power station can be expressed as follows: where Equation (2) represents the model of the brine distribution, m t α represents the mass of the brine used for power generation, and m t β represents the brine used for thermal energy storage. Q t β is the thermal power input to the TESE in Equation (3), and T β represents the waste heat temperature of the brine after TESE. The output power of the HDR power station can then be described by Equation (4), where, P t H f represents the output power.

ElemenTS of the Game
Compared with other coordinated dispatching methods, game theory can better clarify the internal mechanisms of multi-agent competition and cooperation to quantify each party's benefits. The elements of the game include Player, Strategy, and Payoff.

Player
The HDR power station, TS power station, and PV plant are the three players, denoted by H, T, and P, respectively.

Strategy
The operational goal of the HDR power station is to obtain as much profit as possible to rapidly recoup the high costs of construction by changing the mass flow of brine to dispatch the power. Therefore, the strategy set of the HDR power station is where P install H represents the installed capacity of the HDR power station, and P min H is the output power that meets the minimum power generation requirements of the ORC generator, which is generally 10% of the installed capacity.
The TS power station primarily realizes the arbitrage of the electricity price peakvalley difference by dispatching purchased power and generating power. Therefore, the strategy set of the TS power station is where P t purch is the electric power purchased from the grid during the low TOU price period and limited by the maximum power P max heat of the electric heating device. P t T represents the power generation capacity of the TS power station, and its upper and lower limits are the installed capacity P install T and the ORC generator and the minimum power generation condition P min T , respectively. The operational goal of a PV plant is to formulate a credible output power plan based on the weather forecast data to increase the credible capacity connected to the grid as much as possible so as to reduce the loss of surplus power and penalty incurred by abandoning loads. its strategy set is where P t PV is the credible output power of the PV plant, and its corresponding installed capacity is the credible capacity of the PV plant, denoted by P PV . P install PV represents the installed capacity of the PV plant.

Payoff
All three players profit by selling electricity to the grid so that we can define the payoff vector of the game as I = (I H , I T , I P ), where I H , I T and I P represent the payoff of the HDR power station, TS power station, and PV plant, respectively. Each player has an independent and equal status and can either compete in a noncooperative game or try to increase profits through cooperation. The coordination dispatching models of the two types of games are given below.

HDR-TS-PV Non-Cooperative Game and Payoff Functions
If all three players are independent and try to maximize their profits, they naturally constitute a non-cooperative game of competition with each other. This paper first presents the payoff functions of the three players in non-cooperative games in order to compare and analyze the results of the subsequent cooperative game dispatching model.

Payoff Function of HDR Power Station
Owing to the continuity of the geothermal mining cycle, the brine mass flow of the HDR power station cannot be adjusted in the non-cooperative game in order to avoid heat loss. The flexibility of the ORC generator cannot be fully utilized, and only the continuous power generation mode can be adopted. Therefore, according to Equation (1), the payoff function of the HDR power station is

Payoff Function of TS Power Station
In the non-cooperative game, the TS power station purchases electricity from the power grid during the low TOU price period and converts it into thermal energy. During the high TOU price period, the TS power station generates electricity for arbitrage. The payoff function is The output power of the TS power station to the power grid can be modeled as where m t dc is the mass flow of the HTO during the power generation period and c po is the specific heat capacity of the HTO. T h and T l represent the temperature of the HTO used for power generation and its temperature after power generation, respectively.
The operation of the TS power station must also account for the following thermal storage state constraints and thermal energy store/release constraints: where S t h is the thermal storage energy in the tank, Q t c is the input thermal power when the TS power station is storing thermal energy, Q t dc is the released thermal power, η h is the thermal preservation coefficient of the thermal energy storage tank, η dc is the efficiency of the thermal release process, η e is the electric heating efficiency, and ∆τ is the time interval.

Payoff Function of PV Plant
The output power of the PV plants can be modeled using solar irradiance and the installed capacity [39]. According to the weather forecast, the PV plant obtains the predicted output power as the credible power generation in the first stage. The curtailed electricity and the abandoned loads are obtained from the actual solar irradiance in the second stage. This type of problem can usually be described by a Bayesian game model [34]: the two players of the game are the PV plants and nature, where the PV plants make decisions based on predicted values, and nature determines the joint probability of the actual solar irradiance. The payoff function of the PV plant is where k is the number of types of solar irradiance, π k is the probability of each type, P t PV represents the predicted value of the output power based on the solar irradiance in the non-cooperative game. P k,t PVcur and P k,t PVal are the curtailed power and the abandoned loads under each type, and c t ph is the penalty price for abandoned loads. The models of P t PV , P k,t PVcur , and P k,t PVal are P t PV = λ t PV P PV , where λ t PV is the power generation coefficient of the PV plant predicted according to the solar irradiance [39] and ξ k,t PV is the actual output power corresponding to the noncooperative part of the PV plant. ϕ is the power predicted error of the PV plant.
The penalty price for abandoning loads, c t ph is expressed as the product of the penalty coefficient for abandoning loads p and the TOU price c t e , namely,

HDR-TS-PV Cooperation Game Pattern
If the three players in the hybrid power system intend to form coalitions, such as coordinated dispatching with a PV plant, the HDR power station can obtain greater benefits by providing auxiliary services and by the coordinated dispatching with a TS power station, and the HDR power station can use the TOU prices for arbitrage. The three players participate in cooperation under these conditions. Due to the limitations caused by the efficiency of the ORC generator, it is difficult for the TS power stations to profit from the TOU prices. Therefore, the TS power stations need to cooperate with the HDR power station or the PV plant to profit from waste geothermal or surplus PV power with zero marginal costs. Concurrently, because of the requirements of power grid assessment for the PV curtailment power and the abandonment load, the PV plant also cooperates in order to reduce the loss of the curtailed power and the penalty for abandoning the load.
It is observed that through cooperation, the three players can obtain a greater payoff and allocate extra profits through energy settlement, which is only related to the cooperating players. Therefore, cooperation between HDR, TS, and PV can be modeled by the characteristic function game (CFG). CFG uses the tuple G = N, V to describe the cooperative game relationship, where N = {1, 2, · · · , n} is a set representing the number of players in the coalition. V represents the characteristic function that reflects the payoff of the coalition, which is represented by the mapping relationship V : F(N) → R . For any S i of S i ∈ N, V(S i ) or V S i represents the overall payoff of the S i coalition.
In · · · , S n } is the coalition structure and x = {x 1 , x 2 , · · · , x n } is the payoff vector. S n , x must meet the individual rationality where x i is the payoff received by the player i in the coalition, and I i represents the payoff of the player i in the non-cooperative game.

Characteristic Functions of HDR-TS-PV Cooperative Game
In the non-cooperative game, each player can only achieve their best profits through their independent dispatching strategies. By forming coalitions, players have more options for coordinated dispatching, thus expanding the strategy set. Based on the original profit model, HDR and TS power stations can also profit by providing reserves for the PV plant, but their primary strategies are to dispatch output power. Therefore, the strategy sets of the HDR and TS power stations are still Ω H and Ω T , respectively. The PV plant can reduce the curtailed power and penalty for abandoning the load by purchasing reserves; therefore, its strategy set also includes where R t represents the reserves purchased by the PV plant, and σ represents the power fluctuation rate allowed by the grid. In the case of purchasing the reserves, the constraint in Equation (22) always limits the PV plant's output power within a range of ±σ around the predicted value. Therefore, P t PV can be considered as the credible output power of the PV plant, and its corresponding installed capacity is the credible capacity of the PV plant, denoted by P PV . ξ t PV represents the actual output power corresponding to the credible capacity part of the PV plant.
Because the actual installed capacity of the PV plant may be greater than the reserve capacity provided by the HDR power station, the installed capacity can be divided into non-cooperative and cooperative parts in the cooperative game.
In a non-cooperative game, the players cannot use their respective resource characteristics and operating modes to achieve complementary advantages. Through cooperation, players can fully coordinate the dispatching of resources to maximize profits. The operating models and the profit models can be innovated to expand payoff sources, thereby increasing the overall payoff of the hybrid power system. The characteristic function of each coalition is described below.

Characteristic Function of HDR-TS Coalition
In the HDR-TS coalition, the TS power station stores part of the HDR geothermal mining cycle's thermal energy in the low TOU price periods and then reuses the stored thermal energy to generate power in the high TOU price periods for arbitrage. The characteristic function of the HDR-TS coalition is According to the flexible generation model of the HDR power station described in Equations (2)-(4), the heat exchange constraints for the TS power station to obtain geothermal energy from the HDR power station are where T c represents the thermal energy storage temperature of the HTO, Q t β represents the waste heat power of the HDR power station, and η ex is the heat exchange efficiency of the TESE. Additionally, the constraints of the optimization problem Equation (24) also include Equations (1), (12) and (14).

Characteristic Function of HDR-PV Coalition
In the HDR-PV coalition, the HDR station flexibly distributes the mass flow of the brine to provide reserves for the PV plant, which can then increase the overall payoff of the coalition by reducing the curtailment power and penalty for abandoning loads. The characteristic function of the HDR-PV coalition is where I P represents the payoff of the non-cooperative game part of the PV plant, which is calculated according to Equation (15). The reserve dispatching constraints for the HDR-PV coalition are where R k,t HDR represents the reserves provided by the HDR power station, and ∆m k,t α is the brine mass flow required when the reserves are dispatched.

Characteristic Function of TS-PV Coalition
In the TS-PV coalition, the TS power station converts the electricity purchased from the grid during the low TOU price periods and the surplus PV power into thermal energy for storage. During the high TOU price periods or shortage periods of the PV plant, the stored thermal energy is used to generate revenue and provide reserves for the PV plant. The TS power station achieves profitability through the zero-cost electricity generated by the surplus PV power. Simultaneously, the PV plant increases its credible capacity and reduces the penalty for abandoning loads by purchasing reserves, thus forming a cooperative game pattern. The characteristic function of the TS-PV coalition is The TS power station provides an up-reserve through the ORC generator to realize the profits and a down-reserve through electric heating devices to absorb surplus PV power. The reserve constraints of the TS-PV coalition are where R k,t TS represents the up-reserve provided by the TS power station, ∆m k,t dc is the mass flow of HTO that needs to be adjusted when the up-reserve is dispatched, and P k,t PVcur represents the surplus PV power consumed when the TS power station provides a downreserve for the PV plant. The constraints also include Equations (1) and (12).

Characteristic Function of HDR-TS-PV Coalition
When the HDR power station, TS power station, and the PV plant are coordinated for dispatch, the TS power station can store geothermal energy in the thermal energy storage tank for the subsequent generation, which improves the flexibility of the HDR power station and realizes the peak regulation of the geothermal energy. Both the HDR power stations and the TS power stations can simultaneously provide reserves for the plants through flexible output power dispatching strategies, thereby increasing the credible capacity of the PV plant, reducing the curtailment of PV power and the penalty for abandoning loads, and further improving the overall payoffs. The characteristic function of the HDR-TS-PV coalition is The constraints of the HDR-TS-PV coalition include the operating constraints, thermal storage state constraints, and reserve constraints of their respective systems, including Equations (1)-(4), (12), (25), (28)-(30), (32)-(34). They are not repeated here.

System Parameters
Based on the weather conditions in the Gonghe Basin of Qinghai and TOU price, a fictitious hybrid power system was constructed to verify the models and the methods proposed in this paper. In addition to abundant solar energy resources, the Gonghe Basin also has high-quality HDR resources [40,41]. The scenario diagram of the fictitious hybrid power system is shown in Figure 3. The solar irradiance and the TOU price curves are shown in Figure 4. power system is shown in Figure 3. The solar irradiance and the TOU price curves are shown in Figure 4.    The Latin hypercube method [42] is adopted to generate 10,000 scenarios using the actual solar irradiance data, which are finally reduced to 100 typical scenarios [43] as the types observed in nature. The system parameters are listed in Table 1.  The Latin hypercube method [42] is adopted to generate 10,000 scenarios using the actual solar irradiance data, which are finally reduced to 100 typical scenarios [43] as the types observed in nature. The system parameters are listed in Table 1.

System Parameters Values
Installed

Dispatching Results
Each payoff function and characteristic function is linear programming or mixedinteger linear programming, which was solved by software MATLAB2016b and CPLEX12.8. The results are shown in Table 2. It can be observed from Table 2 that the payoff of the TS power station is zero because it cannot profit in the non-cooperative game. The payoff of the HDR-PV coalition is lower than the sum of the two non-cooperative game payoffs, i.e., These results show that the HDR power station and the PV plant cannot form a stable coalition. Figure 5 shows the results for the coordinated operation of the HDR-PV coalition. The HDR power station provides reserves to increase the credible capacity of the PV plant by replacing geothermal energy with the curtailed power and the abandoned loads. However, the cost of abandoning geothermal energy is greater than the sum of the curtailed power and penalty for abandoning loads. Therefore, the coordinated operation of the HDR power station and PV plant cannot increase the overall payoff, making the HDR-PV coalition unstable. It can be observed from Error! Reference source not found. that the payoff of the TS power station is zero because it cannot profit in the non-cooperative game. The payoff of the HDR-PV coalition is lower than the sum of the two non-cooperative game payoffs, i.e., These results show that the HDR power station and the PV plant cannot form a stable coalition. Error! Reference source not found. shows the results for the coordinated operation of the HDR-PV coalition. The HDR power station provides reserves to increase the credible capacity of the PV plant by replacing geothermal energy with the curtailed power and the abandoned loads. However, the cost of abandoning geothermal energy is greater than the sum of the curtailed power and penalty for abandoning loads. Therefore, the coordinated operation of the HDR power station and PV plant cannot increase the overall payoff, making the HDR-PV coalition unstable. When the TS power station joins the HDR-PV coalition to form a grand coalition, the abandoned geothermal energy used to stabilize the fluctuation of the PV plant is reused by the TS power station to generate new profits. The results of the grand coalition cooperative game dispatching are shown in Figure 6. During the low TOU price period (0:00-7:00), both the HDR and TS power stations maintain minimum output power. The continuous geothermal energy is stored in the thermal energy storage tank of the TS power station while ensuring the safety of the geothermal mining cycle. During the PV generation period (8:00-18:00), the HDR and TS power stations provide reserves corresponding to the credible output power of the PV plant. The HDR power station is then operated at full power to maximize profits. The TS power station converts all the stored thermal energy into electricity during the high TOU price period (19:00-22:00).
In the end, the three players tend to form an HDR-TS-PV grand coalition. The overall payoff of the hybrid power system in each game form is shown in Table 3.
The coordinated dispatching results of the grand coalition obtained by the cooperative game dispatching model proposed in this paper are shown in Figure 7. storage tank of the TS power station while ensuring the safety of the geothermal mining cycle. During the PV generation period (8:00-18:00), the HDR and TS power stations provide reserves corresponding to the credible output power of the PV plant. The HDR power station is then operated at full power to maximize profits. The TS power station converts all the stored thermal energy into electricity during the high TOU price period (19:00-22:00). In the end, the three players tend to form an HDR-TS-PV grand coalition. The overall payoff of the hybrid power system in each game form is shown in Error! Reference source not found.. The coordinated dispatching results of the grand coalition obtained by the cooperative game dispatching model proposed in this paper are shown in Error! Reference source not found..

Stability Analysis of Coalitions
In general, the payoff allocation of the coalition is not unique, and the set of all the stable allocations is defined as a core. In this paper, we adopt the minimum core (ε-core) method to analyze the stability of the coalitions, namely where ) (G core is the core set of the coalition and  is the relaxation variable [44]. If the lower bound of  is equal to zero, then the ) (G core is non-empty before relaxation. Thus, the coalition is stable.
According to the overall payoff in Error! Reference source not found., three players

Stability Analysis of Coalitions
In general, the payoff allocation of the coalition is not unique, and the set of all the stable allocations is defined as a core. In this paper, we adopt the minimum core (ε-core) method to analyze the stability of the coalitions, namely where core(G) is the core set of the coalition and ε is the relaxation variable [44]. If the lower bound of ε is equal to zero, then the core(G) is non-empty before relaxation. Thus, the coalition is stable. According to the overall payoff in Table 3, three players can form a stable grand coalition with a non-empty core through Equations (7) and (36). The stable allocation set of the grand coalition is shown in Figure 8. method to analyze the stability of the coalitions, namely where ) (G core is the core set of the coalition and  is the relaxation variable [44]. If the lower bound of  is equal to zero, then the ) (G core is non-empty before relaxation. Thus, the coalition is stable. According to the overall payoff in Error! Reference source not found., three players can form a stable grand coalition with a non-empty core through Equations (7) and (36). The stable allocation set of the grand coalition is shown in Error! Reference source not found..

Allocation Strategy Based on Shapley Value
A stable coalition cannot guarantee the rationality of the payoff allocation. Therefore, it is necessary to study the fairness method of payoff allocation to provide a fair allocation strategy. The Shapley value is the fairest and most reasonable payoff allocation value among equal players [45]. It is based on marginal contributions and is widely used in cooperative games. Considering that there is no difference in status and weight between the HDR power station, the TS power station, and the PV plant, the Shapley value is adopted as the payoff allocation strategy. The simplified calculation [46] of the Shapley value can be expressed as where |S| is the number of elements in the subset S and V(S\{i}) represents the remaining players in the sub-coalition after excluding player i. Table 4 shows the allocation results of the three players based on the Shapley value. Further analysis of the stability of this allocation strategy through Equation (37) shows that the allocation obtained by the Shapley value is stable, as shown in Figure 9.

Sensitivity Analysis
The extra profits of the grand coalition generated by the cooperative game are closely related to the allowed fluctuation rate that the power grid can tolerate and the penalty coefficient for abandoning loads. Therefore, based on the preceding section, the allowed fluctuation rate and penalty coefficient are varied to further analyze the role of the HDR, PV, and TS plants in the grand coalition.
Under the parameters in Table 1, the allowed fluctuation rates are set at 0%, 1%, 2%, and 3%. Simultaneously, when the penalty coefficients are 2, 3, 4, and 5, respectively, the payoff of the grand coalition and the extra profits obtained by each player are calculated.

Sensitivity Analysis
The extra profits of the grand coalition generated by the cooperative game are closely related to the allowed fluctuation rate that the power grid can tolerate and the penalty coefficient for abandoning loads. Therefore, based on the preceding section, the allowed fluctuation rate and penalty coefficient are varied to further analyze the role of the HDR, PV, and TS plants in the grand coalition.
Under the parameters in Table 1, the allowed fluctuation rates are set at 0%, 1%, 2%, and 3%. Simultaneously, when the penalty coefficients are 2, 3, 4, and 5, respectively, the payoff of the grand coalition and the extra profits obtained by each player are calculated. Figure 10 shows the relationship between the overall payoffs, the allowed fluctuation rate, and the penalty coefficient. Error! Reference source not found. shows the relationship between the overall payoffs, the allowed fluctuation rate, and the penalty coefficient. In Error! Reference source not found., ①, ②, ③, and ④ correspond to the scenarios where the penalty coefficients p are equal to 2, 3, 4, and 5, respectively. The overall payoff of the coalition increases with the increasing allowed fluctuation rate, mainly due to an increase in the grid-connected power of the PV plant and the penalty reduction. The overall payoff of the coalition decreases with an increase in the penalty coefficient, and the payoff of the PV plant is affected directly by the penalty coefficient. However, the payoff of the HDR power station is almost constant, reflecting the characteristics of stability and continuity of HDR resources. The TS power station is mainly responsible for creating profits and saving penalties by supporting the grid connection of the PV plant.
Error! Reference source not found. shows the changes in the allocation of the extra profits to the HDR, PV, and TS power stations with the change in the allowed fluctuation In Figure 10, 1 , 2 , 3 , and 4 correspond to the scenarios where the penalty coefficients p are equal to 2, 3, 4, and 5, respectively. The overall payoff of the coalition increases with the increasing allowed fluctuation rate, mainly due to an increase in the grid-connected power of the PV plant and the penalty reduction. The overall payoff of the coalition decreases with an increase in the penalty coefficient, and the payoff of the PV plant is affected directly by the penalty coefficient. However, the payoff of the HDR power station is almost constant, reflecting the characteristics of stability and continuity of HDR resources. The TS power station is mainly responsible for creating profits and saving penalties by supporting the grid connection of the PV plant. Figure 11 shows the changes in the allocation of the extra profits to the HDR, PV, and TS power stations with the change in the allowed fluctuation rate. The overall extra profits decrease with an increase in the allowed fluctuation rate, which indicates that the extra profits of the coalition decrease with an increase in the ability of the power grid to withstand power fluctuation. These extra profits are mainly caused by the curtailment of energy and penalty cost generated by the PV plant. Simultaneously, the extra profits of the HDR power station do not change with the penalty coefficient changes and are not sensitive to the allowed fluctuation rates, indicating that the extra profit of the HDR power station is obtained by the peak regulation of geothermal energy, which is not affected by the PV plant.

Discussion
Further analysis of payoff and extra profits was carried out to reveal the role of the players in the hybrid power system and the source of the extra profits. It can be seen from Error! Reference source not found. that under the TOU price given in Error! Reference source not found., the arbitrage operation of the TS power station cannot be profitable due to the low power generation efficiency of the ORC generator. As a result, the TS power station can only benefit by abandoning heat and electricity with zero marginal cost through cooperating with HDR power station or PV plant. Therefore, its dispatching strategy is to always adopt a minimum generation operation in low TOU price periods and PV generation periods in order to store as much as possible the waste heat and surplus power, as shown in Error! Reference source not found.. It can be seen from the actual operation shown in Error! Reference source not found. that under this operating strategy, the TS power station can play a peak shaving role by storing geothermal energy, which realizes its payoff and makes the geothermal energy mainly generated in high TOU price periods, thereby creating extra profits. Simultaneously, the TS power station can recover the waste heat of the HDR power station when the down reserve is provided to the PV plant, further improving the system profits. As can be seen from the allocation results in Error! Reference source not found., the extra profits of the TS power station accounted for 50.04% of the total extra profits. This result further shows that the TS power station has the most considerable marginal contribution to the extra profits, reflecting the decisive role of the TS power station in the game relationship.
Due to its operating characteristics, the HDR power station cannot cooperate with the PV plant without abandoning heat, so it cannot provide auxiliary services for the PV plant alone. Since the TS power station can directly store geothermal energy, it can help the HDR power station realize arbitrage. Error! Reference source not found. shows that when the HDR power station and the TS power station cooperate, the payoff of the HDR-TS coalition increases by 8.04% relative to the non-cooperation operation, and the overall The variation trends of the extra profits obtained by the PV and TS power stations are similar. The higher the penalty coefficient, the more the extra profits that are allocated. The relationship between the penalty coefficient and extra profits indicates that the TS power station is the main contributor that supports the grid connection of the PV station, realizes the value of the surplus PV power, and reduces the penalty.
Analysis of Figures 10 and 11 shows that the extra profits in the grand coalition, generated by the cooperative game, result from three factors. First, owing to the peakregulating effect of the TS power station, the HDR generation period shifts from the low TOU price period to the high TOU price period. Second, through the flexible operation of the HDR and the TS power stations, a greater capacity of the PV station can track the dispatching curve, thus reducing the penalty. Third, the curtailed power and geothermal energy are stored and utilized by the TS power station.

Discussion
Further analysis of payoff and extra profits was carried out to reveal the role of the players in the hybrid power system and the source of the extra profits. It can be seen from Table 2 that under the TOU price given in Figure 3, the arbitrage operation of the TS power station cannot be profitable due to the low power generation efficiency of the ORC generator. As a result, the TS power station can only benefit by abandoning heat and electricity with zero marginal cost through cooperating with HDR power station or PV plant. Therefore, its dispatching strategy is to always adopt a minimum generation operation in low TOU price periods and PV generation periods in order to store as much as possible the waste heat and surplus power, as shown in Figure 7. It can be seen from the actual operation shown in Figure 6 that under this operating strategy, the TS power station can play a peak shaving role by storing geothermal energy, which realizes its payoff and makes the geothermal energy mainly generated in high TOU price periods, thereby creating extra profits. Simultaneously, the TS power station can recover the waste heat of the HDR power station when the down reserve is provided to the PV plant, further improving the system profits. As can be seen from the allocation results in Table 4, the extra profits of the TS power station accounted for 50.04% of the total extra profits. This result further shows that the TS power station has the most considerable marginal contribution to the extra profits, reflecting the decisive role of the TS power station in the game relationship.
Due to its operating characteristics, the HDR power station cannot cooperate with the PV plant without abandoning heat, so it cannot provide auxiliary services for the PV plant alone. Since the TS power station can directly store geothermal energy, it can help the HDR power station realize arbitrage. Table 3 shows that when the HDR power station and the TS power station cooperate, the payoff of the HDR-TS coalition increases by 8.04% relative to the non-cooperation operation, and the overall payoff of the grand coalition only increases by 1.95% relative to the HDR-TS coalition when the PV plant joins. The extra profits in Table 4 show that the HDR power station is allocated 42.02% of the overall extra profits. This result shows that the increased profits of the hybrid power system in the cooperative game mainly come from the peak shaving arbitrage of geothermal energy. Figures 10 and  11 also show that the HDR power station is not affected by grid parameters σ and p, which further clarifies that the payoff and extra profits of the HDR power station mainly come from its geothermal energy. Table 3 shows that when the PV plant and the TS power station cooperate, the overall payoff of the hybrid system only increased by 1.12%. In the grand coalition, the extra profits of the PV plant accounted for only 7.94%. It can be seen that the contribution of the PV plant to the extra profits of the hybrid system is small. The main interest of PV plants participating in the coordinated dispatch of the hybrid power system is to increase the credible capacity and reduce the impact on the grid so as to obtain the priority of grid connection. This result is also reflected in Figures 10 and 11. The increase in the allowed fluctuation rate and the penalty coefficient decrease indicates that the grid's ability to absorb fluctuating power has increased. A strong power grid increases the non-cooperative payoff of the PV plant and decreases the cooperative extra profits, thereby reducing the willingness of the PV plant to cooperate.
Moreover, since the temperature of the geothermal reservoir is a critical parameter that restricts the operating performance of the HDR power station, we conducted a preliminary simulation on the influence of the temperature change on the overall payoff of the hybrid power system. According to previous research [38], when the temperature is 170 • C, the energy efficiency of ORC is 12.7%. Simulation with these parameters shows that when the temperature drops to 170 • C, the overall payoff of the hybrid power system is 54,285 $/day, a drop of 12.28%. The credible capacity of PV plants participating in cooperative games is reduced by 28.78%. The preliminary results indicate that after the HDR power station has been operating for about 10 years, the three players' willingness to participate in the cooperative game will decrease. More in-depth research on the influence of more HDR power station parameters on the hybrid power system is needed in future research.

Conclusions
For the grid-connection requirements of large-scale PV plants, we first proposed the technical route of using HDR power stations to provide reserves for PV plants in this study. The operating characteristics of conventional HDR power stations were analyzed, and a scheme for using TS power stations to improve the operating flexibility of HDR power stations was proposed. In this study, a hybrid power system consisting of an HDR station, a TS station, and a PV plant was designed, and a coordinated dispatching method was provided based on game theory. First, taking HDR power station, TS power station, and PV plant as players, the non-cooperative game model and the characteristic function game model of the hybrid power system dispatching were established. The optimization of the dispatching models describes the decision-making processes of the players.
Consequently, the coordinated dispatching strategy of the players was obtained by calculating the optimal value of the payoff functions and the characteristic functions that can be transformed into linear optimization problems or mixed-integer linear optimization problems. The case analysis was based on actual data in the Gonghe Basin. The results show that only cooperation can maximize the overall payoff and the individual profits of the system. Due to heat loss, the HDR power station cannot form a coalition with the PV plant alone, reflecting the significant role of the TS power station in the system. Finally, the sensitivity analysis of the allowed fluctuation rate and the penalty coefficient shows that they significantly influence the overall payoff of the system and the operating strategies of each player. However, the profits of the HDR power station are not significantly affected by the two parameters, which reflects its stable characteristics. The main goal of PV plants in the cooperative game is to increase credible capacity and reduce the impact on the grid, and TS power station is the key to achieving extra profits for HDR power station and PV plant. This study provides an essential theoretical basis for developing power generation systems based on HDR geothermal energy in the Gonghe basin of Qinghai, China.
One limitation of this study is that the grid topology and capacity constraints are simplified when constructing the game dispatching model and calculating the payoff allocation of the hybrid power system. The influence of the grid structure on the hybrid power system will be further investigated in future studies.

Data Availability Statement:
The data that support the findings of this study are available from the corresponding author, [L.C.], upon reasonable request.