1. Introduction
Guided by the carbon peaking and carbon neutrality goals, the integration of electric vehicles (EVs) and power systems is advancing progressively. Vehicle-grid integration (VGI) is transitioning from technical demonstration to large-scale application, emerging as a crucial pathway toward a decarbonized and flexible energy system [
1,
2,
3]. Achieving efficient VGI fundamentally relies on accurate analysis of user load demand and flexible potential, along with the optimized deployment of charging infrastructure [
4]. These imperatives demand enhanced standards for the completeness, timeliness, and synergy of multi-source data. However, data originating from vehicles, charging infrastructure, traffic networks, power grids, and meteorological systems are inadequately integrated. Significant data silos persist, undermining the accuracy of user response prediction and the efficiency of resource scheduling, thereby forming a major bottleneck to VGI implementation. Overcoming data barriers and facilitating efficient multi-source data sharing have thus become urgent priorities for the scalable development of VGI.
1.1. Literature Review on Multi-Source Data Applications in VGI
Existing studies indicate that integrating multi-source data can significantly enhance the operational efficiency and scale of VGI.
In charging load forecasting, combining EV charging, grid load, and traffic network data can effectively predict the spatio-temporal distribution characteristics of urban fast-charging demand [
5] and enhance the accuracy of short-term regional charging load predictions [
6]. Incorporating road traffic, meteorological data, and user charging behavior further enhances the robustness of EV charging demand forecasting [
7,
8,
9]. For instance, [
7] reported a rise in forecasting accuracy from 78.1% to 88.7% in terms of R
2 by utilizing traffic flow, weather conditions, and user charging behavior. Moreover, compared with the Auto-Regressive Integrated Moving Average (ARIMA) method that relies exclusively on historical charging load, the Spatiotemporal Multi-graph Convolutional Networks (STMGCN) method, which integrates multi-source data such as time-series modeling and graph neural networks, achieved a reduction in Mean Absolute Error (MAE) from 72.236 kW to 53.287 kW [
8].
In analyzing user-side flexible potential, integrating user travel chains and charging behavior data enables the development of a user-side charging management framework, thereby harnessing VGI’s flexible potential [
10]. Price and smart charging strategies significantly influence user behavior. By analyzing relevant data, user responses to various price and incentive mechanisms can be predicted, enabling guidance toward coordinated charging and discharging during off-peak hours or increasing renewable energy consumption [
11].
In the field of charging station planning, careful consideration of factors such as extreme weather events, grid resilience, and traffic flow supports the development of economically efficient and resilient charging station layout methodologies [
12]. Furthermore, examining real-time traffic data and travel trajectories assists in identifying charging hotspot areas, thereby providing a scientific basis for optimal charging infrastructure siting [
13].
1.2. Research Gaps and Methodological Foundations
Although multi-source data fusion demonstrates significant advantages in domains such as load forecasting, behavior mining, and station planning, the above studies operate under the implicit assumption that such data are readily accessible and already integrated. In reality, however, VGI-related data remain dispersed across diverse entities, including EV data platforms, charging infrastructure data platforms, grid companies, transportation authorities, and meteorological agencies. This dispersion results in severe data silos, which impede the exploitation of multi-source data integration and underline the urgent need to address institutional barriers to data sharing in VGI.
The data silos issue fundamentally stems from a lack of benefit coordination and incentive mechanisms. Designing incentive-compatible mechanisms is essential in such complex, multi-agent scenarios. Traditional game models (e.g., Stackelberg, Nash equilibrium), which are mostly grounded in the assumption of perfect rationality and analyze static equilibria, fail to depict the long-term strategic evolution among stakeholders [
14,
15,
16]. Evolutionary game theory, with its foundation of bounded rationality and its tools of population dynamics and Evolutionary Stable Strategies (ESS), models how agents gradually adjust behaviors and how multi-source data integration can emerge. Thus, its application in this study is highly appropriate [
17].
While evolutionary game models provide a dynamic method for analyzing multi-agent interactions, existing research in this domain has been predominantly confined to tripartite frameworks, such as those applied in studies assessing the impacts of EV subsidies [
18,
19,
20]. The applicability of this approach is further demonstrated in real estate [
21,
22], medical data sharing [
23,
24], and environmental governance [
25]. Collectively, these studies affirm that evolutionary game theory can not only reveal sensitive thresholds in policy and market design but also support the development of well-balanced institutional frameworks. However, VGI with multi-source data integration involves multiple stakeholders, including data providers, data users, government, and service platforms. The tripartite game model fails to encompass all these key actors, necessitating a quadripartite model for a more realistic representation of the data value chain and the coordination mechanisms essential to overcoming data silos.
Multi-source data integration entails multiple stakeholders, including data providers such as EV and charging infrastructure data platforms, data users such as grid companies and aggregators, governments, and data service platforms. This diversity leads to divergent views on data valuation, cost–benefit allocation, and risk management, forming complex game-theoretic interactions. This necessitates the design of a multi-source data integration mechanism that aligns interests across all parties through incentive-compatible arrangements.
1.3. Research Contributions
The main contributions and innovations of this study are summarized as follows.
A quadripartite evolutionary game model applicable to the VGI data sharing ecosystem is developed, in which the decision-making interactions among data providers, data users, the government, and service platforms are incorporated into an analytical structure to comprehensively capture dynamic gaming relationships within the data value chain.
Key parameters and mechanism pathways driving system evolution are identified via simulation on the MATLAB platform, through which the influences of parameters such as subsidies, penalties, and costs on multi-source data integration are revealed, offering a theoretical basis for the design of incentive-compatible data integration mechanisms.
Data integration and pathway strategies for coordinating multi-stakeholder interests are proposed, based on clarified roles and interaction logics of various parties in breaking down data silos, with actionable intervention strategies provided to key entities such as the government to facilitate scalable VGI development.
2. Cost–Benefit Analysis of Participants in VGI
2.1. Cost–Benefit Analysis for Data Providers
Data providers are central stakeholders in VGI, bearing the costs of data collection, processing, and sharing. These providers encompass entities such as those supplying vehicle, charging/battery-swap, traffic, grid, and meteorological data. When actively engaged in data sharing, they gain benefits including direct revenue from data users, government subsidies, and enhanced industry reputation and influence. However, participation also entails costs related to system access and operational maintenance, as well as potential risks such as data privacy breaches and misuse. In terms of strategy selection, data providers face a binary choice between data silos and multi-source data integration. This strategic simplification allows for a clear delineation of the fundamental trade-offs between isolation and collaboration. The outcome of this strategic game is collectively influenced by factors such as the level of government subsidies, the robustness of the data assurance mechanism, and the security of the data service platform.
2.2. Cost–Benefit Analysis for Data Users
Data users, primarily comprising grid companies and EV aggregators, are key entities in the application of VGI data. By acquiring multi-source data from vehicles, charging infrastructure, traffic network, the grid, and meteorological sources through data service platforms, power grid companies can optimize the allocation of charging and swapping stations, improve load forecasting accuracy, and improve their ability to guide user behavior. This contributes to increased equipment utilization, peak shaving and valley filling, ensured safe power supply, and deferred infrastructure investments. EV aggregators, on the other hand, can leverage refined data to improve the dispatch of user-side resources and increase electricity market trading profits. The benefits for data users mainly consist of basic returns and the incremental value derived from data utilization, which is highly dependent on the completeness and accuracy of the acquired data. In practice, however, some data users may undervalue data, refuse to pay reasonable prices, or even engage in data misuse. Their costs include expenses related to data accessing, processing, and utilization of multi-source data, the levels of which may vary depending on the involvement with the data service platform. Their strategic options are framed as a binary decision to either adopt or not adopt the shared data. This dichotomous approach facilitates the analysis of the critical factors that drive adoption. When deciding between the strategies of using and not using, data users’ choices are influenced by multiple factors, including data cost, platform integration difficulty, government pressure, and the effectiveness of the data application.
2.3. Cost–Benefit Analysis for Government
The government plays a critical role in the VGI data-sharing ecosystem by establishing mechanisms, standards, and enforcing supervision. Through policies and regulations that govern the collection, transmission, and use of multi-source data, the government ensures the compliance and security of data flows. Additionally, by providing subsidies to data providers and implementing reward and punishment mechanisms for data service platforms, it fosters the healthy development of the data ecosystem. Regulatory benefits are multifaceted. On the one hand, effective regulation attracts industrial investment, stimulates technological innovation, boosts local industrial development, and promotes regional economic growth. On the other hand, it improves the quality of charging and swapping services, enhances public satisfaction, and strengthens social governance, thereby boosting government credibility and public service performance. Regulatory outcomes also provide support for subsequent policy adjustments and scientific research. The costs of regulation mainly consist of the administrative expenditures required to perform supervisory duties, with both subsidies and regulatory expenditures being higher under a strong regulatory scenario compared to a weak one. Furthermore, inadequate regulation could lead to secondary risks such as data misuse and loss of trust, undermining policy effectiveness. The government’s strategic posture is conceptualized as a choice between two discrete regulatory intensities: stringent and lax. This binary framing is instrumental for analyzing the balance between fiscal input, regulatory effectiveness, and societal returns.
2.4. Cost–Benefit Analysis for the Data Service Platform
The data service platform acts as a critical hub connecting supply and demand sides in VGI, responsible for integrating, managing, and servicing multi-source data. Its core mission is to aggregate diverse data resources, standardize their identifiers and storage formats, and establish efficient mechanisms for data sharing and invocation, thereby facilitating trusted exchange and in-depth utilization of data. The platform needs to coordinate the relationship between data providers and users to enable efficient matching and sustained collaboration in operation. However, in pursuit of profit maximization, the platform may overlook issues such as data quality and service fairness, which could adversely affect the sustainability of the data-sharing ecosystem. Its revenue primarily stems from government incentives and rewards, especially when it actively provides services and ensures high efficiency and security in data sharing, thereby gaining more policy support. In terms of costs, beyond investments in system construction and operational maintenance, the platform also faces government penalties or reputational damage if it provides passive services or operates under insufficient supervision. The platform’s operational mode is represented by a binary selection of active or passive service. This distinction captures the essential conflict between pursuing long-term ecosystem health and yielding to short-term cost-saving incentives, which is critical for maintaining its core role and sustainable operational capability within the VGI system.
2.5. Analysis of the Logical Relationships in the Game Model
The sustainable operation of multi-source data in VGI relies on the collaboration of four key parties, including data providers, data users, the government, and the data service platform. Data providers supply high-quality data in exchange for policy incentives. Data users access data through the platform to support decision-making and communicate demand feedback. The data service platform connects both supply and demand sides while ensuring secure data circulation. The government coordinates the overall framework through laws, regulations, and incentive policies to promote equitable sharing and orderly development. Together, these four parties form a closed-loop system interconnected by data flows and value flows, collectively sustaining an efficient, compliant, and sustainable VGI data ecosystem. Sustainable operation mechanism for multi-source data integration in VGI is shown in
Figure 1.
4. Case Study
During different stages of data sharing in VGI, the strategic choices of game participants exhibit distinct evolutionary trajectories. To delve deeper into how key parameters in the replication dynamic system influence the four parties’ strategic decisions, this paper employs a numerical simulation method to conduct simulations and sensitivity analysis. The strategy profile (1, 1, 1, 1), which represents the full cooperation of all participants, is established as the ideal evolutionary target trajectory, providing a reference for analyzing the system’s potential convergence toward an optimal stable equilibrium.
To verify the dynamic characteristics of the aforementioned quadripartite evolutionary game model and analyze its ESS, this section employs numerical simulation methods. All simulations are implemented in MATLAB R2021a under an academic license, utilizing the ode45 solver for numerical integration, with both the relative tolerance and absolute tolerance set to 1 × 10−8.
Since parameter assignment for VGI multi-source data integration involves commercial confidentiality, obtaining complete and authentic real-world data remains challenging. Therefore, the parameter settings in this research are primarily established with reference to a large city with 500,000 EVs. The values of core parameters are derived based on publicly available policies and regulations, relevant literature, and reasonable assumptions, aiming to validate the model’s effectiveness and analyze the system’s evolutionary dynamics. This approach ensures that the parameters are grounded in realistic operational logic and regulatory boundaries, thereby validating the model’s effectiveness and enabling a robust analysis of the system’s evolutionary dynamics. The simulation parameters are configured as presented in
Table 5. We have ensured dimensional consistency throughout the model by unifying the units of all parameters and verifying the balance of all dynamic equations.
4.1. Impact of Data Providers
The efficient operation of VGI relies on the active participation of data providers. To further verify the effectiveness and feasibility of data providers supplying multi-source data within VGI, this paper sets their strategy probabilities at 0 and 1, representing the two pure strategic states of maintaining data silos and pursuing multi-source data integration, respectively. This enables a clear examination of the system’s boundary conditions. All other parameters for this simulation are provided in
Table 5. On this basis, a three-dimensional simulation space is constructed with an initial state of (
y,
w,
z) = (0.1, 0.1, 0.1) to simulate the evolutionary processes of data users, government, and the data service platform under different initial strategy combinations. The simulation results are shown in
Figure 2.
The results demonstrate that when data providers choose to supply multi-source data (x = 1), data users tend to adopt shared data, the data service platform is more likely to select active service, and the government tends towards stringent regulation to achieve greater benefits. Conversely, when data providers only supply siloed data (x = 0), the strategies of the data users, government, and data service platform display greater exploratory behavior and uncertainty. In this scenario, the system explores a broader strategic area before eventually converging gradually to the equilibrium state (1, 1, 1, 1). This suggests that, under the combined influence of multiple factors such as data security, privacy protection, and cost–benefit distribution, the stable strategy for data providers is not unique. Furthermore, multi-source data integration can accelerate the system’s convergence speed, driving all parties to reach a stable equilibrium more quickly.
4.2. Impact Analysis of Costs
4.2.1. Data Costs and Leakage Risk Costs
To conduct a more comprehensive analysis of how data costs and leakage risk costs impact the system’s equilibrium, this paper performs a sensitivity analysis on the data costs of data providers with the initial state of (
x,
y,
w,
z) = (0.1, 0.1, 0.1, 0.1) and other variables listed in
Table 5. The results are presented in
Figure 3.
The simulation results reveal that under low-cost conditions (satisfying No. ⑭ in
Table A2), data providers exhibit a positive payoff (
Ex > 0) for multi-source data integration, leading them to adopt this strategy. In contrast, under high-cost conditions (satisfying No. ⑬ in
Table A2),
Ex becomes negative, inclining data providers to maintain data silos. At the critical cost
yuan/unit where
Ex = 0 (satisfying No. ① in
Table A3), the evolutionary trajectory of data providers remains constant.
Deviating from the critical cost increases the absolute value of
Ex, thereby accelerating convergence. For instance, when
Ex > 0, reducing the cost from 0.04 to 0.02 yuan/unit shortens the time for data providers to reach the stable state of 1 from 0.06 to 0.029. Similarly, under No. ⑬ in
Table A2, increasing the cost from 0.6 to 0.8 extends the convergence time (defined as the earliest time at which a strategy variable reaches and remains within tolerance 1 × 10
−9 of its equilibrium 0 or 1) from 0.044 to 0.93.
Simultaneously, data users exhibit faster convergence speeds under low-cost conditions, indicating that their strategies reach equilibrium more readily under low-cost constraints. In comparison, the government and the data service platform are relatively unaffected by variations in data costs and leakage risks, maintaining more stable convergence pathways.
These results demonstrate that reducing data provision costs and leakage risks can not only enhance data providers’ initiative but also accelerate the equilibrium process of the overall system. This insight provides valuable guidance for optimizing the VGI data-sharing environment.
4.2.2. Adoption Costs of the Data User
To systematically examine how data usage costs influence the strategies of data users, this study sets the initial states of the four parties to (
x,
y,
w,
z) = (0.1, 0.1, 0.1, 0.1) while keeping all other parameters constant. By adjusting the data costs
and
, changes in the evolutionary trajectories of each stakeholder are observed. The simulation results are presented in
Figure 4.
The results indicate that under low-cost conditions (satisfying No. ④ in
Table A2), data users obtain a positive return from data usage (
Ey > 0) and thus tend to adopt the data. In contrast, under high-cost conditions (satisfying No. ③ in
Table A2),
Ey becomes negative, leading them to reject data usage. At the critical cost
=
= 0.5 yuan/unit where
Ey = 0, the evolutionary trajectory remains unchanged. Evolutionary trajectories of the other parties are unaffected by changes in data cost.
Deviating from the critical cost increases the absolute value of Ey, thereby accelerating the convergence speed. For example, when Ey > 0, reducing the data usage cost from 0.4 to 0.2 shortens the convergence time of data users from 0.29 to 0.096. Similarly, when Ey < 0, increasing the cost from 0.8 to 1 reduces the convergence time from 0.23 to 0.078.
These findings suggest that reducing data usage costs can effectively incentivize data adoption among users and promote faster convergence of the overall system to equilibrium.
4.2.3. Regulation Costs of Government
To systematically examine how regulatory costs influence government decision-making, this study fixes all other parameters and sets the initial states of the four parties to (
x,
y,
w,
z) = (0.1, 0.1, 0.1, 0.1). By adjusting the government’s regulatory cost, the dynamic changes in the evolutionary trajectories of each party are observed. The simulation results are shown in
Figure 5.
The results indicate that under low regulatory cost conditions (satisfying No. ⑧ in
Table A2), the government obtains a positive benefit from stringent regulation (
Ew > 0) and thus tends to adopt this strategy. In contrast, under high regulatory costs (satisfying No. ⑦ in
Table A2),
Ew becomes negative, leading the government to prefer lax regulation. At the critical cost
= 500 thousand yuan where
Ew = 0, the evolutionary trajectory remains unchanged, which is consistent with the theoretical analysis in
Section 3.5. The strategies of the other parties show no significant changes.
Deviating from the critical regulatory cost increases the absolute value of Ew, thereby accelerating the convergence speed. For instance, when Ew > 0, reducing the stringent regulatory cost from 400 to 200 shortens the government’s convergence time from 0.23 to 0.076. Similarly, when Ew < 0, increasing the cost from 600 to 800 reduces the convergence time from 0.19 to 0.06.
These results suggest that lowering regulatory costs can effectively incentivize the government to implement stringent regulations, thereby promoting the construction of a multi-source data integration system.
4.2.4. Participation Costs of Platform
To comprehensively analyze the impact of participation costs on the decision-making behavior of the data service platform, this study keeps all other parameters constant and sets the initial states of the four agents to (
x,
y,
w,
z) = (0.1, 0.1, 0.1, 0.5). By adjusting the participation cost of the data service platform, the evolutionary trajectories of all four parties are simulated, as shown in
Figure 6. In this case,
z = 0.5 is chosen primarily to visually highlight the lag effects associated with the two critical points and to avoid a sharp decline of the platform’s strategy in the initial stage, which would otherwise make its steady-state probability nearly indistinguishable from zero.
As discussed in
Section 3.5, when
x = 0 and
x = 1, the data service platform’s participation cost corresponds to two critical points,
= 1140 yuan/unit and
= 2100 yuan/unit, respectively (satisfying the No. ⑥ and ⑦ in
Table A3, respectively). It should be noted that data providers maintain data silos in the early phase and subsequently offer multi-source data integration. When the participation cost lies between the two thresholds, changes in
x give rise to a hysteresis effect for the data service platform, whose probability toward active service first approaches zero and later converges to one. In the case where the critical value is
= 2100 yuan/unit, the platform’s probability initially declines and then stabilizes at 0.056.
Moreover, because data users operate downstream of the service platform, a lower participation cost allows the platform to converge more rapidly to active service. It is more proactive service provision reduces the net payoff difference for data users in deciding whether to adopt the data, resulting in a slightly slower convergence speed for the data users. These findings suggest that as the participation cost of the data service platform decreases, its willingness to provide active service increases significantly, thereby facilitating the formation of a multi-source data integration ecosystem for VGI.
4.3. Impact Analysis of Government Subsidies
To systematically examine the impact of government subsidies on the evolutionary process of the VGI, this paper conducts a sensitivity analysis on the subsidy levels allocated to data providers and the data service platform, with the initial state of (
x,
y,
w,
z) = (0.1, 0.1, 0.1, 0.1) and other variables listed in
Table 5. The results are shown in
Figure 7.
The simulation results indicate that under high subsidy conditions (satisfying No. ⑭ in
Table A2), data providers achieve a positive net benefit from multi-source data integration (
Ex > 0), leading them to adopt this strategy. In contrast, under low subsidy scenarios (satisfying No. ⑬ in
Table A2),
Ex becomes negative, inclining data providers toward maintaining data silos. At the critical subsidy
εS = 0.02 yuan/unit where
Ex = 0, the evolutionary trajectory remains unchanged.
Deviating from the critical subsidy enhances the absolute values of Ex and Ey, thereby accelerating the convergence speed. For instance, when Ex > 0 or Ey > 0, increasing the government subsidy from 0.1 to 40 reduces the convergence time for data providers from 0.29 to 0.06 and for data users from 0.028 to 0.018.
These suggest that in the VGI early stages, appropriately increasing government subsidies not only enhances data providers’ sharing willingness but also promotes coordination and cooperation among multiple stakeholders, laying a solid foundation for the healthy operation of the system. This finding provides valuable insights for formulating effective subsidy policies in VGI implementation.
4.4. Impact of Government Regulatory Intensity
Based on different orientations of government regulatory strategies, this study designs four policy scenarios for simulation analysis, with the initial system state set as (x, y, w, z) = (0.1, 0.1, 0.1, 0.1) and all other parameters held constant. The key policy parameters for each scenario are configured as follows.
A. Case 1 (Stringent Regulation): Adopts high-intensity subsidies and penalties, with εS = εP = φP = 1.5 yuan/unit.
B. Case 2 (Lax Regulation): Implements a low degree of policy intervention, with εS = εP = φP = 0.01 yuan/unit.
C. Case 3 (Platform-Driven Ecosystem): Primarily incentivizes the service platform, with εS = 0.05 yuan/unit, εP = 1.5 yuan/unit, φP = 0.05 yuan/unit.
D. Case 4 (Data Provider-Driven Ecosystem): Focuses on incentivizing data providers, with εS = 1.5 yuan/unit, εP = 0.01 yuan/unit, φP = 0.05 yuan/unit.
The evolutionary paths of the system under each scenario are illustrated in
Figure 8.
The simulation results indicate that under Cases 1 and 4, all four stakeholders converge relatively quickly to the stable state (1, 1, 1, 1). This suggests that higher subsidy or penalty intensities contribute to promoting cooperative equilibrium within the system, particularly through stronger regulatory pressure on data providers. Under Case 2 (Lax Regulation), due to insufficient incentives and constraints, the system satisfies the condition Ex < 0, ultimately converging to the data silo state (0, 1, 1, 1).
The outcome of Case 3 is more complex. The government’s payoff function is closely related to the strategy x of data providers. When x = 1, a critical condition emerges between the government’s incentives and constraints on the service platform, namely φP − εP = 0.3 yuan/unit. In the early evolutionary phase, data providers tend to maintain data silos (with a low x), gradually shifting toward multi-source data sharing thereafter. During this process, when φP − εP > 0.3 yuan/unit, changes in the data providers’ strategy x induce a lagged response in the government’s regulatory strategy w. The government initially strengthens supervision and subsequently relaxes it as system coordination improves. This phenomenon indicates that in an incentive design centered on the platform, policy effectiveness is constrained by the initial strategic choices of data providers, potentially leading to phased fluctuations during the evolutionary process.
5. Conclusions
Based on the current status of multi-source data integration in VGI and the demands of diverse stakeholders, this paper constructs a quadripartite evolutionary game model involving data providers, data users, government, and the data service platform. It systematically analyzes the benefit distribution relationships and cooperative game outcomes among these stakeholders. The main conclusions are summarized as follows.
First, the proposed quadripartite evolutionary game model effectively captures dynamic gaming behaviors within the data value chain. Through modeling and simulation, critical conditions influencing each stakeholder’s decision-making have been identified (see
Table A3), validating the model’s effectiveness in analyzing multi-source data sharing ecosystems in VGI. Key strategic thresholds were determined, including a data cost critical value of
yuan/unit and a government subsidy threshold of
εS = 0.02 yuan/unit.
Second, the study demonstrates that the four stakeholders can achieve an equilibrium through appropriate strategy selection and policy guidance, confirming that multi-source data integration in VGI is not a zero-sum game but possesses inherent potential for enhancing overall efficiency through collaboration. Data providers’ strategic choices significantly impact system convergence speed, with multi-source data integration strategies accelerating convergence toward the stable state (1, 1, 1, 1).
Third, government subsidies, reward-punishment mechanisms, and data costs play crucial roles in realizing multi-source data integration and accelerating convergence speed. Deviation from critical conditions can significantly enhance convergence speed. For instance, increasing government subsidies from 0.1 to 40 yuan/unit reduces the time for data providers to reach stability from 0.29 to 0.06.
Fourth, the analysis provides concrete and scenario-based pathways for governmental intervention. The simulation of policy scenarios reveals that a stringent regulatory approach and a data provider-driven incentive model are most effective in rapidly establishing system-wide cooperation. In contrast, a lax regulatory approach fails to overcome inertia, leading to entrenched data silos. These findings offer a quantitative basis for stage-specific policy design. Initial subsidies are crucial to lower entry barriers, while a well-calibrated, dynamic integration of subsidies and penalties is needed to guide stakeholders toward a stable, cooperative equilibrium. These findings offer a quantitative and model-grounded basis for the design of targeted incentive mechanisms in practice.
The proposed model offers substantial potential for future research extensions. Subsequent investigations could adapt the model structure to examine international VGI collaboration scenarios for sustainable development goals (e.g., affordable clean energy). Further development of the model could incorporate a broader range of policy instruments beyond the current focus on subsidies and penalties, including tax incentives and data property rights frameworks. As current parameters are derived from literature and theoretical assumptions, future research will focus on calibrating parameters with real-world data and extending the model to incorporate dynamic parameters and heterogeneous behavioral rules, thereby enhancing its predictive accuracy and policy relevance.