1. Introduction
In recent decades, the term “power grid” has evolved beyond the traditional “electrical grid”. The electrical grid typically refers to the physical infrastructure for electricity generation, transmission, and distribution, focusing on voltage levels, substations, and power flow topology. By contrast, the power grid represents a broader cyber–physical-energy system that integrates information, communication, control, distributed generation, and storage technologies [
1].
Modern power grids thus encompass multi-layer interactions among physical, cyber, and market elements, supporting advanced functionalities such as demand response, real-time control, and resilience enhancement [
2].
Table 1 summarizes the distinction between these two terms and related system types.
In this study, “power grid” specifically denotes a modern AC/DC hybrid system integrating renewable energy, HVDC transmission, and coordinated operation supported by communication and control layers. The traditional term “electrical grid” is used only when referring to the purely physical transmission and distribution network.
Driven by the imperative of a global transition toward a sustainable energy future, the architecture and operation of modern power systems are undergoing a paradigm shift [
3,
4]. A fundamental characteristic of this transformation is the large-scale integration of renewable energy sources (RES) such as hydropower, wind, and solar, which are often located in geographical areas remote from major load centers [
5]. To effectively bridge this spatial gap, High-Voltage Direct Current (HVDC) transmission has emerged as an indispensable technology, facilitating the secure and stable transfer of bulk power over long distances [
6]. This trend has led to the emergence of increasingly complex hybrid AC/DC power systems [
7,
8].
While this architecture offers considerable economic and environmental benefits, it also introduces a series of new and profound challenges to system stability [
9]. The progressive replacement of conventional synchronous generators with resources interfaced via power electronic converters leads to a systemic decline in system rotational inertia and fault levels [
10]. Not only is the inherent physical robustness of the grid eroded, but its natural ability to withstand and suppress disturbances is also weakened, resulting in more severe frequency and voltage deviations [
11,
12]. Consequently, accurately assessing the overall strength of such hybrid AC/DC grids is a critical yet challenging task. A realistic analysis must consider a composite of factors, including network structural integrity, stability margins, the AC system’s fault support capability for the DC system, and resilience against cascading failures [
13,
14]. However, the exploration of a unified indicator system that can holistically evaluate these interdependent aspects remains limited, highlighting a clear need for a more comprehensive approach. To address this, this paper proposes a grid strength assessment framework, constructed from the dual perspectives of structural strength metrics and fault withstand strength metrics. To fuse these heterogeneous metrics into a single, actionable composite index, a hybrid AHP-CRITIC weighting method is adopted to synergistically balance expert judgment with objective data-driven evidence [
15].
However, establishing a composite index to quantify grid strength is only the first step. For targeted reinforcement in grid planning and precise operational guidance, it is crucial to identify the key factors responsible for a low grid strength index [
16,
17]. Global sensitivity analysis (GSA) methods, particularly the variance-based Sobol’ method, are a recognized and reliable approach for such tasks [
18]. However, the Sobol’ method has inherent limitations: it cannot indicate whether an influence is positive or negative and lacks the capability to diagnose individual operating modes [
19]. Furthermore, a significant practical barrier hinders its direct application: the vast number of dynamic simulations required renders the process computationally infeasible for large-scale power systems [
20]. While data-driven surrogate models can address this computational hurdle, their common lack of interpretability, which undermines the trust of system operators, highlights the need for a solution that is not only fast but also transparent [
21].
To overcome this dual challenge of computational cost and lack of local interpretability [
19], this paper proposes a framework that couples a surrogate model with an explanation method. A Gaussian Process Regression (GPR) model is trained as a surrogate to rapidly and accurately approximate the system’s dynamic response. Upon this fast surrogate, SHapley Additive exPlanations (SHAP) is employed to address the interpretability challenge [
22]. Rooted in cooperative game theory, SHAP can decompose a specific prediction, attributing the outcome to the precise contribution of each input parameter. This delivers clear insights for both local, case-by-case diagnosis and global importance ranking [
23].
In addition, the increasing integration of renewable-based converters such as photovoltaic (PV) generation, battery energy storage systems (BESS), and wind energy conversion systems (WECS) has made HVDC–AC interaction more complex. In particular, the DC-DC modular multilevel converter (MMC) interface for utility-scale PV systems is emerging as a critical research focus [
24,
25], since these systems must maintain dynamic stability under weak AC grid conditions due to their high control bandwidth and converter-based dynamics [
26]. Therefore, the coupling behavior between strong HVDC transmission and weak AC networks with renewable infeed must be carefully analyzed in assessing overall grid strength.
Based on the existing literature, this paper introduces a structured framework to address the grid strength assessment challenges in complex hybrid AC/DC systems. Our methodology first constructs a composite grid strength index via the AHP-CRITIC method. Then, a GPR surrogate model is trained to simulate the system’s response, which provides a foundation for the SHAP analysis. Finally, SHAP is employed to identify critical operating parameters and diagnose specific scenarios. This framework is applied to a detailed model of the Southwest Power Grid in China to demonstrate its effectiveness and practical value.
In contrast to prior studies that focus on either isolated grid strength indices or traditional sensitivity analyses, this paper makes several distinct contributions:
A unified composite grid strength index is proposed that integrates both structural strength and fault withstand strength metrics, offering a holistic quantitative evaluation of hybrid AC/DC systems characterized by ‘strong HVDC and weak AC’ interactions.
A high-fidelity Gaussian Process Regression (GPR) surrogate model is developed to efficiently approximate system dynamic responses, overcoming the computational burden of large-scale transient simulations required in traditional sensitivity analysis.
The SHapley Additive exPlanations (SHAP) technique is innovatively applied to quantify both global and local importance of operating parameters, enabling interpretable insights into the key factors influencing grid strength.
The proposed GPR–SHAP framework is validated through a detailed case study of the Southwest China Power Grid, demonstrating its capability to provide transparent, data-driven guidance for operational decision-making. These contributions jointly enable the transition from purely experience-based grid management toward interpretable, data-informed stability enhancement.
The remainder of this paper is organized as follows:
Section 2 details the framework of the composite grid strength index.
Section 3 introduces the AHP-CRITIC index fusion method.
Section 4 describes the GPR-based SHAP methodology.
Section 5 presents the validation of the proposed method through targeted dynamic simulations on the Southwest Power Grid in China. Finally,
Section 6 concludes the paper.
2. Grid Strength Assessment Framework for Hybrid AC/DC Systems
The stability assessment of power systems necessitates a comprehensive consideration of multiple operational indices, which collectively reflect the system’s ability to withstand disturbances and maintain stable operation. The Southwest Power Grid in China serves as a prime example of a modern hybrid system defined by a “strong HVDC and weak AC” architecture. This structure is a direct consequence of its energy geography: vast renewable energy sources, including hydropower, wind, and solar, are located in remote western regions, and their generated power is transmitted to eastern load centers via multiple large-capacity, long-distance HVDC transmission corridors. Conversely, the AC network at the sending end, which collects this power, is characterized by high network impedance and a low Short-Circuit Ratio (SCR), rendering it electrically distant from major load centers.
Table 2 presents a comparison between strong HVDC and weak AC.
To address the stability challenges inherent to “strong HVDC, weak AC” systems, we propose an assessment framework constructed from two complementary perspectives: structural strength and fault withstand strength.
First, structural strength metrics quantify the grid’s inherent robustness. Specifically, we assess voltage support capability using the Multi-outfeed Effective Short-Circuit Ratio (MOESCR) and post-fault power redistribution capability with the DC Power Transfer Influence Factor (DCPTIF).
Second, fault withstand strength metrics gauge the system’s dynamic resilience to large disturbances. These include indicators for cascading failure resilience—the Load Shedding Ratio (LSR) and Line Capacity Failure Proportion (LCFP)—and metrics for the transient stability margin, namely the Transient Voltage Stability Index (TVSI), Transient Angle Stability Index (TASI), and Frequency Deviation Factor (FDF).
The structure of this framework is depicted in
Figure 1.
2.1. Structural Strength Metrics
To provide a holistic measure of the grid’s structural strength and to characterize the inherent robustness of the grid infrastructure, a set of indices is constructed to quantify two principal aspects: voltage support capability and active power transfer capability under faults.
2.1.1. Voltage Support Capability
The ability of the AC grid to maintain a stable voltage at High-Voltage Direct Current (HVDC) converter stations is a fundamental requirement for ensuring secure system operation. This is particularly critical in power systems with multiple electrically coupled HVDC links.
The International Council on Large Electric Systems (CIGRE) proposed the multi-infeed effective short-circuit ratio (MIESCR) as an index to describe the voltage support strength of an AC system to the converter stations of a multi-infeed DC system. The MIESCR can, to a certain extent, reflect the power stability of a multi-infeed DC system. A higher MIESCR value at a converter station indicates stronger voltage support from the AC system to that station.
The multi-outfeed effective short-circuit ratio for converter station
is defined as (1):
where
is the rated power of the
-th HVDC line;
is the three-phase short-circuit capacity at the AC bus of converter station
;
is the reactive power supplied by the AC filters and shunt capacitors at station
when its AC bus voltage is at the rated value.
is the multi-infeed interaction factor between converter station
and station
, expressed as (2):
where
is the self-disturbing bus and
is the observed bus;
is the voltage disturbance at the AC bus of converter station
, approximately 1%; and
is the resulting voltage change at the AC bus of converter station
. It is noteworthy that the calculation procedures for the multi-outfeed interaction factor (MOIF) and the multi-outfeed effective short-circuit ratio (MOESCR) are highly consistent with those for MIIF and MIESCR. The distinction is that MOIF and MOESCR are calculated on the rectifier side of the HVDC lines, rather than the inverter side. In this paper, MOESCR is adopted as one of the sub-criteria layer indices for grid structural strength.
2.1.2. Capability to Withstand Post-Fault Power Flow Redistribution
In a hybrid AC/DC system such as the Southwest Power Grid in China, a key aspect of system stability is the ability of the AC network to withstand the power surges and flow transfers caused by major contingencies like the blocking of an HVDC line.
To quantify the stress imposed on the AC system by a DC fault, the DC Power Transfer Influence Factor,
, is adopted. This factor simultaneously considers the impacts of large DC power transfers on the Southwest AC system from the perspectives of both transient stability and thermal stability. A higher
value indicates greater system vulnerability to DC transmission disturbances. The DC Power Transfer Influence Factor
is defined as (3):
where
is the additional transferred DC power carried by the AC corridor after an HVDC blocking event;
is the short-circuit capacity at both ends of the AC line;
is the actual power transmission on the AC line;
is the power limit of the AC line; and
is a scaling factor introduced for ease of result identification.
2.2. Fault Withstand Strength Metrics
To quantify the system’s ability to withstand and recover from severe disturbances, a set of indices is established to characterize its performance during two post-disturbance phases: quasi-steady-state and transient.
2.2.1. Cascading Failure Resilience Indicators
These indices measure the state of the power system after the transient dynamic processes have subsided, with a focus on its final operational capability and structural integrity. To quantitatively assess these characteristics, this study acquires key data through cascading failure simulations.
Since a cascading failure can be viewed as a sequence of system states, the calculation of its occurrence probability must consider not only the probability of each individual state but also the influence of preceding states on subsequent ones. The probability of a cascading failure is therefore inherently a conditional probability. Assuming a cascading failure process is composed of
sequential system states, denoted in chronological order as
,
, the occurrence probability of this event,
, can be obtained from the formula for conditional probability as (4):
where
is the probability of the initial system state
,
are the transition probabilities between the respective system states, which can be denoted as
,
. The formula for their calculation is detailed in [
27]. Thus, Equation (4) can be expressed as (5):
In this probability chain, the probability of each state transition
is closely related to the failure behavior of specific physical components. Therefore, to obtain accurate transition probabilities, it is necessary to establish appropriate fault models for critical components. Based on the data from [
28], the dynamic relationship between a line’s outage probability and its loading level is modeled using the piecewise function illustrated in
Figure 2.
In the figure, represents the upper limit of the normal power flow on the line, while is the ultimate transmission capacity limit. When the power flow on the line is greater than or equal to , the line is either tripped due to a short-circuit fault caused by thermal sag or removed from service after prolonged overloading, resulting in an outage probability of 1. represents the long-term statistical average of the line’s outage probability.
It should be noted that when a contingency such as a line outage occurs, the system transitions to a subsequent state and may consequently fragment into multiple electrical islands, leading to power imbalances. To maintain system stability, the simulation procedure performs an independent power flow analysis for each island. When a power imbalance is detected, the system first attempts to restore balance by adjusting generator set-points. If a power deficit persists after this redispatch, load shedding measures are implemented. For simplicity, load shedding is performed proportionally according to the magnitude of the load at each node. The adjusted load magnitude is calculated using the following Equation (6):
where
is the set of system load nodes,
is the load at node
,
is the set of system generator nodes, and
is the power output at node
. The adjusted load at each load node is seen (7):
Specifically, for any island containing no generation sources, all of its internal load is shed entirely. Based on this simulation framework, the following two core quasi-steady-state indices are defined.
From a physical operational perspective, the proportion of load lost relative to the initial total system load following a cascading failure can reflect the impact of the cascading process on the grid’s power supply efficiency. To quantitatively characterize this attribute, this paper defines the load shedding index as the percentage of the load shed during the cascading failure relative to the total initial system load, which is seen in (8):
where
is the amount of load shed due to the cascading failure, and
is the total system load prior to the event.
- 2.
Line Capacity Failure Proportion (LCFP)
To accurately assess the structural integrity of the network post-fault and to quantify the functional degradation of its power transmission capability, the Line Capacity Failure Proportion (LCFP) is defined. It represents the ratio of lost or compromised transmission capacity to the total pre-fault transmission capacity. Its formula is (9):
where
is the set of all transmission lines in the system before the disturbance;
represents the rated transmission capacity of a single line
; and
is the set of lines that are in an out-of-service state post-fault. A larger LCFP indicates lower structural integrity of the post-fault network.
2.2.2. Transient Stability Margin
These indices evaluate the system’s ability to maintain stability during the dynamic period following a disturbance. The core of the assessment is to determine whether the system can maintain synchronism and preserve voltage and angle stability throughout the electromechanical transient process.
Our methodology for assessing transient voltage stability is based on the multi-component transient voltage stability margin (TVSI) framework defined in [
29]. This framework decomposes the post-fault voltage trajectory into a vector
, thereby providing a multi-dimensional characterization of the stability response. In this paper, these three components are synthesized into a single composite index,
, through a weighted sum, which is seen in (10):
where the weights
are non-negative and sum to 1;
represents the transient voltage recovery capability,
denotes the post-fault steady-state level, and
reflects the level of transient oscillations. Their detailed definitions can be found in [
29]. A lower
value corresponds to a more favorable transient voltage performance.
- 2.
Transient Angle Stability Index (TASI)
Transient angle stability is evaluated using the TASI index, which quantifies the maximum post-fault rotor angle separation. Its mathematical expressions are (11) and (12):
where a negative value of
indicates transient instability. A higher
value signifies a more stable system.
- 3.
Frequency Deviation Factor (FDF)
When the sending and receiving systems are not part of the same synchronous grid, the impact of an HVDC line blocking fault on the frequencies of both systems must be considered. Due to the high penetration of hydropower in the Southwest Power Grid in China, its inertia is lower compared to systems dominated by thermal power, making this aspect particularly important. The Frequency Deviation Factor (FDF) quantifies the system’s ability to maintain frequency stability post-disturbance. A larger
β value indicates stronger frequency support capability. The FDF is defined as (13):
where
is the equivalent speed regulation of all generators (Hz/MW) and
is the system’s active load-frequency regulation coefficient (MW/Hz).
The relationship between the equivalent speed regulation
in (13) and the individual generator speed regulation
is given by (14):
where
is the rated power of generator
(MW);
is the nominal system frequency (Hz); and
is the angular speed regulation of generator
, defined as (15):
where
is the no-load steady-state speed,
is the full-load steady-state speed, and
is the rated speed.
indicates that a 5% frequency deviation results in a 100% change in the generator’s active power output.
The system’s active load-frequency regulation coefficient is defined as (16):
where
is the change in the total active load and
is the change in frequency.
3. Grid Strength Index Fusion Method Based on AHP-CRITIC
To address the susceptibility of subjective weighting methods to expert bias and the over-reliance of objective weighting methods on raw data, this paper employs the AHP-CRITIC method to determine the weights of each index and derive a corresponding composite score. The AHP-CRITIC method synergistically combines the advantages of both subjective and objective weighting. It utilizes the Analytic Hierarchy Process (AHP) to capture subjective judgments from experts based on their experience, while employing the CRiteria Importance Through Inter-criteria Correlation (CRITIC) method to perform objective weighting based on the intrinsic information within the data. For the evaluation indices selected in
Section 2, which reflect the stability of the power grid during operation, certain interrelationships are bound to exist. This justifies the introduction of the CRITIC method. The CRITIC method fundamentally operates by analyzing the objective relationships among indices, calculating the contrast intensity across different operating scenarios for the same index, and quantifying the correlations between different indices. This allows for the comprehensive determination of objective weights, yielding results that are more objective and reasonable.
The AHP-CRITIC procedure first employs AHP to calculate subjective weights, followed by the CRITIC method for objective weights. Finally, the composite weights are calculated. The specific steps are as follows:
The judgment matrix
is used to compare the relative importance of influencing factors at the same hierarchical level.
scale from 1 to 9 and their reciprocals is used for the comparison, as shown in (17) as below:
where
represents the importance of element
relative to element
at the same level, with values ranging from 1 to 9 and their reciprocals.
Based on the judgment matrix
, the product of the elements in each row,
, is calculated, and its
-th root,
is then determined in (18):
The vector is then normalized according to
to obtain the eigenvector of matrix A, which corresponds to the subjective weight vector as (19):
The maximum eigenvalue of the judgment matrix is calculated as (20):
Then perform a consistency check on the judgment matrix, which is seen in (21)
Generally,
is not equal to 0, so the consistency ratio
must be calculated to test for consistency, which is seen in (22):
When , the judgment matrix is considered to have satisfactory consistency. Otherwise, the values in the matrix must be readjusted, and the preceding steps must be repeated until the consistency check is passed.
- 2.
Calculate the objective weights
As the selected indices have different properties and units, with some being positively correlated with the objective layer and others negatively correlated, it is necessary to normalize the raw data.
For positive indices, this is achieved using (23):
For negative indices, this is achieved using (24):
where
is the normalized value of the data for each index, and
is the original index data. This results in an
standardized index matrix
which is seen in (25):
In the CRITIC method, the standard deviation of an index is used to represent the contrast among different scenarios. The standard deviation
can be calculated in (26). A larger standard deviation implies greater variation in the measured values, indicating that the data sample contains more information. Conflict refers to the varying degrees of positive or negative correlation among different indices. This characteristic can be quantified by calculating the correlation coefficient
using Equation (27).
where
is the mean of the elements in the
-th row of matrix
;
is the covariance between the
-th and
-th rows of matrix
; and
and
are the standard deviations of the
-th and
-th indices, respectively.
The amount of information contained in each index
is then calculated based on the standard deviation and the correlation coefficients, which is seen in (28):
where
quantifies the conflict between the
-th evaluation index and all other indices. A larger
signifies that the
-th index contains a greater amount of information, and thus its relative importance is higher. This value is then used to calculate the objective weight of the
-th index
, which is seen in (29):
Through the transformations above, the objective weight vector is obtained, which effectively corrects for both the variability and the conflict of the indices.
- 3.
Calculate the composite weight
To achieve a synergistic fusion of subjective and objective evaluation information, the Principle of Minimum Discrimination Information is introduced to integrate the weights derived from the AHP and CRITIC methods into a combined weight vector
. The composite weight of an index is given by (30):
where
is the composite weight of the
-th index.
5. Case Study and Results
To validate the effectiveness of the proposed grid strength assessment framework and to demonstrate its practical value, a comprehensive case study was conducted based on a detailed, large-scale model of the Southwest Power Grid in China for the year 2027, which has a total planned installed capacity of approximately 264 GW. Calculations were performed using the PSD-BPA software package (version 4.15), which is integrated into the grid’s servers. All simulations and data analyses for this study were conducted on a high-performance computing workstation equipped with an AMD Ryzen 9 8945HX processor (AMD, Santa Clara, CA, USA), 32 GB of RAM (Samsung, Seoul, South Korea), and an NVIDIA GeForce RTX 5060 graphics card with 8 GB of VRAM (NVIDIA, Santa Clara, CA, USA). This section first presents the composite weights obtained using the AHP-CRITIC index fusion method. Subsequently, a quantitative assessment of different operating scenarios is provided. Finally, a SHAP-based analysis is used to identify the key factors and geographical regions that influence the system’s transient stability.
5.1. Grid Strength Assessment Based on the Fused Index
5.1.1. Weight Calculation
The foundation of a composite assessment is the rational allocation of weights to the various indices within the framework. The AHP-CRITIC method proposed in this paper is used to acquire these weights by fusing expert knowledge with data-driven characteristics.
Subjective Weights from AHP: Through consultation with experts, the importance ranking of the aforementioned evaluation indices was determined, from highest to lowest, as follows: Transient Voltage Stability Index, Load Shedding Ratio, Transient Angle Stability Index, Line Capacity Failure Proportion, Multi-Outfeed Short-Circuit Ratio, Frequency Deviation Factor, and DC Power Transfer Influence Factor. The subjective weight values for each individual index were calculated as
Objective Weights from CRITIC: The objective weights determined by the CRITIC method are
Composite Weights: By combining the subjective and objective weights, the composite weight vector was obtained as V = (0.1316, 0.0760, 0.2044, 0.1612, 0.1660, 0.1255, 0.1352). The distribution of the composite weights mitigates the bias that can arise from relying solely on expert judgment or data-driven methods. This fusion provides a scientifically sound and reliable basis for the subsequent composite grid strength assessment.
5.1.2. Comparative Analysis of Different Operating Modes
Using the derived composite weights, this study assessed various operating scenarios based on the “summer peak load” condition for the year 2027.
Figure 3,
Figure 4 and
Figure 5 illustrate the maximum power angle separation, maximum frequency, and maximum nodal voltage during the transient stability process for five different scenarios following the same fault. The composite index scores for these five scenarios, denoted as A, B, C, D, and E, are 70.57, 65.48, 64.52, 51.14, and 43.69, respectively.
As can be observed from
Figure 3,
Figure 4 and
Figure 5, performance differences exist among the various operating scenarios, and a strong correlation is found between the scores and the time-domain simulation results. Among the five cases, scenario A exhibits the best transient, voltage, and frequency stability, which is consistent with its highest index score. Conversely, while scenario E has moderate frequency stability, it shows the poorest transient and voltage stability, thus receiving the lowest score. This confirms that the proposed composite index can serve as a reliable and effective reference for detailed grid strength analysis and provides a valuable tool for system grid strength assessment and control applications.
5.2. Identification of Key Parameters via SHAP
5.2.1. Development and Validation of GPR Agent Models
To assess the impact of regional generation and load on overall system stability, the analysis framework detailed in
Section 4 was implemented. Given that a robust analysis requires a large number of model evaluations, directly applying SHAP to full-scale power system simulations is computationally infeasible.
To overcome this challenge, a high-fidelity surrogate model based on Gaussian Process Regression (GPR) was first constructed to emulate the mapping between system inputs and the composite grid strength index. The GPR model aims to establish a mapping from the high-dimensional input space of system variables to the scalar output representing the composite grid strength index. The input vector X contains 10 key variables that characterize the generation (G) and load (L) levels in five critical regions of the Southwest Power Grid in China, specifically defined as
. The operating boundaries for each input variable were determined from historical data and planning studies, as detailed in
Table 3. The scalar output
of the model is the composite grid strength index.
A training dataset of 100 samples was randomly generated. For each sample, the indices mentioned in
Section 2 were calculated and then fused into a composite index. The GPR model was subsequently trained on this dataset. To verify the accuracy of the GPR simulator, its predictive performance was evaluated on an independent, unseen test set. As shown in
Figure 6, the predicted values from the GPR model are in close agreement with the true values from simulations, and nearly all true values fall within the 95% confidence interval of the predictions. To further ensure the robustness of the model evaluation, a repeated 5-fold cross-validation strategy was employed. The entire 5-fold cross-validation process was independently repeated 100 times with different random seeds, yielding a mean Absolute Percentage Error (MAPE) of 4.87%. This confirms that the GPR simulator can serve as a reliable and accurate proxy for the full system model.
5.2.2. Global and Local Feature Importance Analysis Based on SHAP
Once the GPR surrogate model was validated, it was used as a computationally efficient proxy to conduct an in-depth feature importance analysis using SHAP. The SHAP method can simultaneously provide a global perspective on system-wide importance and local explanations for specific operating conditions.
Global Feature Importance Analysis Based on SHAP
First, a global analysis was performed to identify the system variables that have the most significant impact on stability across the entire operational space.
Figure 7 illustrates the contribution of different input feature values to the output. Specifically, it shows how the value of an input feature influences whether the output is increased (SHAP value > 0) or decreased (SHAP value < 0). Each point represents the Shapley value for a feature in a specific instance; red indicates a high feature value, while blue indicates a low feature value. The features on the y-axis are ordered by increasing importance from bottom to top. This plot aggregates the local SHAP values from all samples, not only revealing the overall ranking of feature importance but also displaying the distribution, density, and direction of their impacts.
The global analysis unequivocally indicates that the load in the yc region and the load and generation in the cd region are the most critical factors influencing the grid strength index. As shown in
Figure 7, higher values of ycL, cdG, and cdL tend to push the grid strength index higher, confirming their dominant and predictable influence. Interestingly, the generation and load parameters from the same region exhibit nearly identical SHAP value distributions. This suggests that in certain key regions, the source-load dynamics are highly coupled and should be considered as a single entity for coordinated control.
Local Explanation for a Single Instance
Beyond global ranking, SHAP is also capable of diagnosing individual operating scenarios. To demonstrate this capability, a scenario with a low predicted grid strength index was selected from the test set.
Figure 8 presents the SHAP contribution plot for this single instance. The diagram decomposes the prediction, clearly showing how the value of each feature pushed the model’s output from the baseline value to the final prediction for this scenario.
The local diagnosis from
Figure 8 reveals that, although multiple factors were at play, the low generator output and low load in the cd region were the primary contributors to the negative outcome in this specific case, pulling the grid strength index downward. This provides a plausible hypothesis: to improve the stability of this particular state, the most effective initial measure would be to increase the generation output and load in the cd region.
Validation of the Hypothesis via Corrective Control Simulations
To verify the hypothesis derived from the local SHAP analysis, a control experiment was conducted for the specific scenario from the previous subsection, as shown in
Table 4. The results in the table show that adjusting generation and load in the cd region yields a significantly greater increase in the index compared to adjustments in the xc region, confirming that the grid strength index is indeed more sensitive to the generator output and load in the cd region. This evidence demonstrates that SHAP not only ranks overall feature importance but also identifies critical variables in specific scenarios, guiding operators to take effective control actions. This finding provides a crucial link between offline statistical planning and the real-world power system of Southwest China, confirming that this method can be reliably used as a planning tool to guide operational decisions and ensure the enhancement of grid stability.
6. Conclusions
This paper presented and verified a framework for composite grid strength assessment with interpretable feature importance analysis in complex hybrid AC/DC systems, an architecture that is increasingly characteristic of modern grids. The proposed methodology combines multi-criteria decision-making (AHP-CRITIC) for index fusion, a high-fidelity Gaussian Process Regression (GPR) surrogate model to overcome computational barriers, and the SHAP method to analyze the complex factors driving grid strength. Initially, a composite grid strength index was constructed, which synergistically integrates structural strength metrics and fault withstand strength metrics. A GPR-based surrogate model was then used to capture the complex nonlinear relationship between high-dimensional system parameters and this composite index. Finally, the SHAP method was applied to break down the complex dynamics of grid strength into a clear set of key influencing factors, ordered by their global significance and explained at the local, scenario-specific level.
The practical significance of this work for system planning and operations is substantial. The framework not only provides a composite grid strength assessment system but also offers operators highly relevant decision-making support by identifying key influencing factors. This enables them to design and verify control strategies with greater precision.
However, we acknowledge several limitations that pave the way for future research. The current GPR surrogate model’s inputs are primarily focused on generation and load levels, and its performance is inherently dependent on the quality and diversity of the training data. The model does not yet explicitly account for dynamic changes in network topology, such as line outages, which are critical determinants of grid strength.
Future work should therefore focus on addressing these limitations. A promising direction is the development of more sophisticated surrogate models, such as those based on Graph Neural Networks (GNNs), which can naturally encode topological information and adapt to its changes. This would enable the framework to assess a broader range of contingencies and provide more comprehensive stability insights. Additionally, future research could explore integrating a wider array of input features, including protection system parameters and control settings, to create a more holistic and robust assessment tool.