1. Introduction
1.1. Motivation
Power systems are evolving from relatively localized infrastructures into highly interconnected cyber–physical networks characterized by long-distance power transfers, multiple voltage levels, intensive inter-area exchanges, and frequent operating-point fluctuations. Under such conditions, a local disturbance seldom remains local. It can first alter the electrical environment of nearby devices, then accumulate as regional congestion or voltage stress, and finally propagate to the system level through critical transmission corridors and backbone substations. Weak-node identification should therefore be treated as a multi-scale vulnerability inference problem rather than as a purely local ranking task.
Weak-node identification has direct engineering relevance in preventive control, contingency screening, restoration planning, and online dispatch support. Nodes with high vulnerability scores are often associated with strong coupling to overloaded corridors, high voltage sensitivity, prominent responsibility in post-contingency redistribution, or persistent exposure to inter-area stress transmission. In practical dispatch environments, such nodes deserve prioritized monitoring and tighter security margins. However, many of them are not obvious from local observations alone, because an apparently moderate node may still be globally critical if it is embedded in a stressed region or connected to a transfer bottleneck.
1.2. Related Work
Early studies established the power-system vulnerability problem at the system level. Fouad et al. [
1,
2] formalized vulnerability as a dynamic-security concept, but their formulation did not provide a node-oriented ranking mechanism. Zhou et al. [
3] introduced artificial neural networks into security and vulnerability assessment, yet the model still operated on global operating states rather than explicit network structure. Driven by the rapid development of new energy sources, inter-regional direct current transmission, and the scaled deployment of pumped-storage capabilities, the future regional power-flow exchange form will be more complex [
4]. Mahadev and Christie [
5,
6] represented vulnerability and severity for static security assessment, but the framework remained centralized and indicator-driven.
Subsequent work enriched the physical interpretation of vulnerability. Yu and Singh [
7] incorporated protection failures into integrated vulnerability analysis, but the method was still scenario-enumerative and difficult to translate into fast online weak-node scoring. Doorman et al. [
8] analyzed the vulnerability of the Nordic power system using a realistic large-scale grid, but the outcome remained system-oriented rather than bus-oriented. Kamwa et al. [
9,
10] introduced fuzzy partitioning for dynamic vulnerability assessment, but the partition served assessment convenience rather than hierarchical representation learning. Wang et al. [
11] developed a fault-chain-based vulnerability assessment scheme for transmission networks, yet the method did not learn node embeddings from graph-structured data. Chen and Mili [
12] evaluated composite vulnerability to cascading failures probabilistically, but the resulting framework was not designed for dispatch-oriented weak-node ranking.
Complex-network studies further clarified an important limitation. Hines et al. [
13] showed that topological models alone may provide misleading information about electricity-infrastructure vulnerability when operating conditions are ignored. Pagani and Aiello [
14] reviewed the power grid as a complex network, but the dominant methodology was still based on handcrafted structural indicators rather than learnable multi-scale representations.
Neural-network-based security assessment also progressed substantially before graph learning became popular. Amjady and Ehsan [
15,
16] evaluated power-system reliability using artificial neural networks, but the model did not explicitly encode electrical connectivity. Amjady [
17] employed time-series and fuzzy neural networks for generation-adequacy assessment, yet the formulation still focused on system-level adequacy rather than node-level vulnerability localization. Bento [
18] proposed a physics-guided neural network for load-margin assessment, but the target remained operating-security margin estimation instead of weak-node identification.
The rise of graph learning created a more suitable technical basis for power-grid modeling. Wu et al. [
19] reviewed graph neural networks and clarified why local message passing is effective for structured data, but the review was not power-system-specific. Zhao et al. [
20] used structure-informed graph learning for online prediction of transient dynamics, but the task addressed system response forecasting rather than weak-node ranking. Gao et al. [
21] proposed a physics-guided graph convolutional neural network for optimal power flow, but the goal was an OPF surrogate instead of vulnerability inference. Xie et al. [
22,
23] studied grid vulnerability under extreme weather and forced oscillations in the
Energies literature, respectively, but these studies did not establish a hierarchical weak-node learning mechanism. Moshtagh et al. [
24] developed a topology-aware graph neural network for state estimation in PMU-unobservable systems, but the task remained state reconstruction. Guo et al. [
25] applied graph neural networks to production-cost minimization decomposition, but the framework targeted optimization acceleration rather than critical-node discovery. Wang et al. [
26] used knowledge graphs and graph neural networks for topology identification, but the objective was network-structure recovery rather than vulnerability ranking. Kfouri et al. [
27] employed graph neural networks for bad-data detection and identification in state estimation, but the emphasis was on data integrity rather than multi-scale weak-node assessment.
To make the literature positioning more focused, we now distinguish four groups of directly related methods. Classical centrality- and sensitivity-based methods provide transparent vulnerability indicators, but their scores are usually handcrafted and may not represent nonlinear regional accumulation. Physics-informed GNNs embed power-flow or operational constraints into graph learning, but most reported applications focus on OPF, state estimation, or transient-response prediction rather than dispatch-oriented weak-node ranking. Graph-transformer and attention-based methods can enlarge the receptive field, but they usually require substantial data and do not by themselves define power-grid-specific hierarchical labels. Graph-partition GNNs improve computational scalability, but graph cutting may weaken boundary and corridor information unless the partition hierarchy remains electrically coupled to the original network.
We also clarify the novelty boundary. The proposed method does not claim that GCN, attention, gating, or hierarchical pooling are new neural-network primitives. Existing GNN, GAT, GraphSAGE, GIN, ChebNet, graph-transformer, and hierarchical pooling models already provide powerful generic graph-learning operators. The scientific question addressed here is narrower and power-system-specific: how to define a weak-node target, an electrical hierarchy, and an interpretation route that connect bus-level rankings with branch vulnerability and corridor stress. Accordingly, this paper evaluates a mechanism-aware weak-node formulation rather than presenting a new universal GNN architecture.
These observations motivate a hierarchical formulation. Weak-node significance in an interconnected grid is rarely determined by one graph scale alone. Local evidence must be interpreted together with regional aggregation effects and backbone transfer constraints. A dispatch-oriented model should therefore evaluate each node through coupled local, regional, and global contexts instead of relying on a single flat graph or on disconnected partitions.
1.3. Manuscript Positioning and Main Contribution
The above literature reveals three unresolved issues directly related to weak-node identification. First, classical vulnerability studies and centrality or sensitivity indices are physically meaningful, but they mostly provide system-level indices, scenario-based assessments, or handcrafted rankings rather than learnable node-level representations. Second, modern graph-learning models such as GAT, GraphSAGE, GIN, ChebNet, graph transformers, and hierarchical pooling architectures provide generic message-passing or attention operators, but they do not by themselves define power-system weak-node labels or explain how bus vulnerability is coupled with branch and corridor stress. Third, physics-informed GNN studies in power systems mainly address OPF approximation, state estimation, topology identification, or dynamic-response prediction, whereas weak-node identification requires a target that is explicitly tied to structural, electrical, and operating-state vulnerability evidence. These gaps motivate a mechanism-aware formulation rather than a claim of inventing a new generic graph neural block.
Against this background, this paper is positioned as a mechanism-driven hierarchical extension of graph learning for weak-node identification. The fundamental difference from classical hierarchical GNNs is that the hierarchy is tied to electrical constraints: node-layer edges use admittance-type coupling, regional and backbone abstractions are induced by electrical interconnections and transfer corridors, and node scores are interpreted through branch-vulnerability and corridor-stress evidence. Thus, hierarchy is used to preserve the physical path of vulnerability propagation, namely local disturbance, regional accumulation, backbone transfer stress, and feedback to node criticality, instead of serving only as graph coarsening or computational compression.
The main contributions of this work are summarized as follows.
A mechanism-aware weak-node formulation is proposed for complex interconnected power grids. The model uses standard graph-learning operators, but the hierarchy is defined according to electrical coupling, supernode aggregation, branch vulnerability, and corridor-level risk evidence rather than generic graph pooling alone.
A validated local–global fusion implementation is provided. The reported numerical results are produced by a local encoder, a supernode/global encoder, assignment-based global feedback, and a 48 → 24 → 1 decoder. Attention, top-down gating, and more complex cross-scale weighting are now described only as extensible variants unless explicitly implemented.
A reproducible vulnerability-proxy learning protocol is introduced. The reported training objective is mean-square regression on the defined mechanism-based labels, while hierarchical-consistency and ranking-loss terms are treated as future extensions and are not used as evidence for the present results.
Based on the supplementary data-and-code package, the proposed method is validated on representative IEEE benchmark systems and analyzed in depth on the IEEE 145-bus case. The results show clear advantages in the medium-scale interconnected benchmark where inter-area coupling is particularly important, while the large-scale case is reported as a limitation rather than as universal generalization evidence.
1.4. Paper Organization
The remainder of this paper is organized as follows.
Section 2 formulates the weak-node identification problem and presents the mechanism-driven hierarchical graph-construction strategy.
Section 3 introduces the hierarchical representation-learning framework on the node, regional, and backbone layers.
Section 4 presents the validated local–global fusion model and extensible learning strategy.
Section 5 reports the benchmark configuration, comparative case studies, and engineering analysis.
Section 6 concludes the paper.
2. Problem Formulation and Mechanism-Driven Hierarchical Graph Construction
2.1. Weak-Node Identification as a Multi-Criteria Vulnerability Ranking Task
Consider a power grid under operating scenario . Let the system contain a set of buses, lines, transformers, and regional aggregation units. The task of weak-node identification is to assign each node a vulnerability score and then rank all nodes according to the predicted scores. The ranking should reflect the likelihood that a node acts as a structurally and operationally critical weak point under the given scenario.
Different from generic node classification, vulnerability ranking in a power grid is driven by both structural role and operating-state sensitivity. A desirable predictor should satisfy at least four requirements:
It should detect local stress, such as voltage deviation, loading imbalance, or weak neighborhood support.
It should capture regional aggregation effects, such as area-level congestion or concentrated stress in a station cluster.
It should reflect global transfer constraints, especially the influence of critical corridors and backbone interconnections.
It should preserve the ordering of high-risk nodes, rather than merely fitting their numerical scores in a mean-square sense.
These requirements imply that the desired predictor is inherently multi-scale.
2.2. Limitations of Single-Layer Graph Representations for Weak-Node Assessment
A single-layer graph representation models the grid as a graph , where all buses or substations appear at the same level. This representation is useful for local message passing but insufficient for two reasons.
First, the physical consequence of a disturbance is not confined to the immediate neighbors of the disturbed equipment. Post-contingency power redistribution may raise stress in one region, then shift transfer pressure into an adjacent corridor, and finally create a vulnerability cluster around a different set of nodes. Second, nodes with similar local neighborhoods may play substantially different roles when embedded in different higher-level structures. Therefore, two nodes that are locally similar can still exhibit markedly different global vulnerability.
2.3. Hierarchical Risk Propagation in Interconnected Power Grids
The supplementary data-and-code package and the accompanying case-study files indicate that vulnerability propagation in interconnected grids follows a hierarchical path rather than a simple additive aggregation. This path can be expressed as
“local equipment disturbance” → ”regional accumulation” → ”backbone transfer stress” → ”feedback to node vulnerability”.
This interpretation provides the conceptual basis for the hierarchical model proposed in this paper. The node layer captures immediate electrical interactions around equipment. The regional layer captures station-cluster or partition-level stress accumulation. The backbone/global layer captures long-range transfer coupling and corridor constraints. Weak-node identification then becomes the problem of adaptively fusing these three levels.
2.4. Scenario-Aware Construction of the Node-Layer Graph
For a given scenario ξ, the node layer is defined as
where
is the node set,
is the branch/transformer connection set,
is the scenario-dependent weighted adjacency matrix, and
is the node-feature matrix.
represents the node-layer power-grid diagram constructed under scenario ξ, used to describe the basic network structure and node connection relationships under this operating scenario.
The scenario-dependent edge weight is modeled as
where
is the availability coefficient of branch
under scenario
, and
is the basic electrical coupling coefficient.
represents the node-level weighted adjacency coefficient between node
and node
in scenario
, used to characterize the effective electrical coupling strength between them.
In the general formulation, can be determined from switch status, protection action, maintenance outage, or post-contingency line-state assessment, while can be computed from line impedance, transformer equivalent parameters, or admittance-based coupling. In the released implementation used for the benchmark study, the node-layer graph is instantiated with admittance-type edge weights derived from ‘pandapower’ network data, and in-service transformers are retained as strong interconnection links.
This formulation prevents the graph from degenerating into a purely topological object. Instead, the graph becomes a scenario carrier that reflects the effective propagation capability of electrical interactions.
2.5. Node-Feature Representation and Physical Interpretability
The general mechanism-aware formulation allows static topology features, operating-state features, and disturbance-response features to be combined. The current benchmark implementation in the supplementary data-and-code package adopts a six-dimensional feature vector that is traceable to the training scripts. For each node, the feature vector contains
Node degree;
Weighted betweenness centrality;
Sampled weighted average shortest-path length;
Weighted clustering coefficient;
Voltage-deviation magnitude;
Normalized active load.
Accordingly, the node-feature vector for node
i under scenario ξ is written as
where
denotes the input feature vector of node
in the node layer under scenario
,
denotes the degree of node
, reflecting its direct topological connectivity,
denotes the weighted betweenness centrality of node
, measuring its importance on weighted shortest paths,
denotes the sampled weighted average shortest-path length from node
to the other nodes,
denotes the weighted clustering coefficient of node
, characterizing the local connectivity density around the node,
denotes the voltage-deviation magnitude of node
, indicating its operating-state stress, and
denotes the normalized active load of node
.
The first four components describe the structural role of the node in the electrical interaction graph, while the last two components reflect operating-state stress. This compact feature set preserves physical interpretability.
2.6. Node-to-Region Assignment and Regional Abstraction
To lift local information to the regional scale, a node-to-region assignment matrix is introduced:
where
is the number of nodes and
is the number of regional units.
is the node-to-region assignment matrix that maps node-layer nodes to regional units.
Each row of indicates the region to which a node belongs. Depending on the engineering setting, the region can correspond to a station cluster, a voltage-level group, or a dispatch partition. Region-like supernodes are obtained by spectral clustering or size-constrained grouping, which yields an adaptive partition consistent with system scale.
2.7. Regional Feature Aggregation and Inter-Region Coupling
The regional feature matrix is obtained by average aggregation:
where
denotes the node-level hidden feature matrix and
is an all-one vector.
denotes the regional feature matrix obtained by aggregating node-level representations within each region.
The regional adjacency matrix is generated by lifting node-layer interactions:
where
denotes the regional adjacency matrix induced from node-layer interactions, and
denotes the node-layer adjacency matrix before regional lifting.
This formulation retains inter-region coupling without requiring handcrafted region-level edges. A dense set of cross-region electrical links in the node graph automatically induces strong edges in the regional graph.
2.8. Region-to-Backbone Aggregation and Global Representation
A second assignment matrix is used to aggregate regional units into the backbone/global layer:
where
is the number of backbone units, and
denotes the region-to-backbone assignment matrix that maps regional units to the backbone/global layer.
The corresponding global features and adjacency are
where
denotes the backbone/global feature matrix obtained by aggregating regional representations.
denotes the backbone/global adjacency matrix induced from inter-region couplings.
The backbone/global layer abstracts higher-level transfer channels and system-wide constraints. From a power-system perspective, this layer is essential because long-range vulnerability is usually governed by backbone corridors rather than by individual branches alone.
2.9. Mechanism-Driven Vulnerability Labels
For the paper formulation, the node target is interpreted as a multi-scale vulnerability quantity synthesized from local, regional, and global components:
where
are adaptive contribution coefficients,
denotes the overall vulnerability score of node
,
denotes the local vulnerability component of node
, reflecting risks caused by its immediate neighborhood,
denotes the regional vulnerability component of node
, reflecting risks inherited from its regional cluster, and
denotes the global vulnerability component of node
, reflecting risks associated with system-wide backbone conditions.
This decomposition is important because the learning target is not treated as an arbitrary label; it is treated as a reproducible mechanism-based proxy for dispatch-oriented weak-node ranking.
The practical label-generation procedure used for the benchmark data is summarized as follows. Step 1: a standard pandapower benchmark case is loaded and converted into an admittance-weighted graph in which in-service transmission lines and transformers define the effective electrical coupling. Step 2: node-level structural and operating indicators are computed, including degree, weighted betweenness centrality, sampled weighted average shortest-path length, weighted clustering coefficient, voltage-deviation magnitude, and normalized active load. Step 3: branch-vulnerability indicators are computed from normalized electrical-coupling and path/corridor-related quantities, and the branch-vulnerability score is stored as the branch target. Step 4: node vulnerability evidence is aggregated from its own structural-operating indicators, the vulnerability of incident or nearby branches, and the regional/corridor context induced by the graph hierarchy. Step 5: each component is min–max normalized within the case to avoid scale dominance, and the final reference node score is obtained by averaging or weighted synthesis of the normalized components.
Therefore, the reference ranking is a mechanism-based weak-node index derived from topology, electrical coupling, bus operating stress, and branch–corridor vulnerability evidence. It is not a label manually assigned by experts. It is also not a complete dynamic cascading-failure label; rather, it is a transparent and reproducible proxy designed for supervised weak-node ranking when full N − 1 or dynamic cascading simulations are not available for all benchmark systems.
For clarity, the current branch-vulnerability proxy is defined from normalized electrical-coupling and corridor-stress indicators, including path/corridor importance, line coupling strength, and loading- or stress-related quantities available from the processed benchmark data. The current node vulnerability proxy combines four categories of normalized evidence: local structural/electrical support, bus operating stress, incident or neighboring branch vulnerability, and regional/global aggregation induced by the hierarchy. These normalized components are synthesized to obtain the final node score used for supervised learning.
5. Case Studies and Experimental Analysis
5.1. Benchmark Systems and Data Sources
The supplementary data-and-code package contains multiple IEEE and PEGASE benchmark systems used to generate graph data. The cases explicitly listed in the hierarchical training script are summarized in
Table 1.
These systems provide a diverse range of topologies and scales. The IEEE 145-bus system plays a central role in this paper because the supplementary package contains node labels, branch labels, a topology figure, a detailed case report, and ranking tables for both nodes and lines.
5.2. Labels, Evaluation Metrics, and Baseline Methods
Node labels are stored as the final vulnerability target , while branch labels are stored as the branch-vulnerability target and its intermediate decomposition terms. The detailed branch ranking table confirms that several of the highest-risk branches are concentrated around a compact set of nodes, which supports the coupling hypothesis that motivates the hierarchical model.
The following metrics are used throughout the comparison:
MSE: score regression accuracy;
MAE: absolute deviation of predicted vulnerability scores;
Spearman coefficient: ranking consistency between predictions and mechanism-based labels;
Hit@3, Hit@5, Hit@10: overlap ratio between predicted and reference top- weak nodes.
These metrics are complementary. MSE and MAE evaluate numerical fidelity, while Spearman and Hit@ directly evaluate whether the model identifies the correct critical nodes.
Furthermore, we added a supplementary fixed-seed baseline comparison including GAT, GraphSAGE, GIN, ChebNet, TransformerConv, and two classical vulnerability proxies based on degree and betweenness. These baselines were trained or evaluated under the same processed feature-label protocol for the three representative systems. The results are reported in
Table 2. They show that modern GNN variants can be competitive, especially GIN and ChebNet on some metrics; therefore, we further narrow our claim. The present contribution is not a universal performance dominance over all graph models, but a mechanism-aware weak-node formulation and an interpretable local–global hierarchy whose strongest numerical evidence remains the IEEE 145-bus case and the node–line–corridor coupling analysis.
The expanded comparison indicates that no single model dominates all metrics and all cases. For example, ChebNet obtains a low MSE on case57 and GIN obtains a high Spearman coefficient on case145, whereas the hierarchical GNN gives the lowest MSE on case145 and remains directly tied to the proposed node–line–corridor interpretation. This result supports a more cautious conclusion: the proposed formulation is competitive and interpretable on the target medium-scale interconnected benchmark, but broader claims require further repeated-run studies, hyperparameter tuning, physics-informed baselines, and full ablation experiments.
A full expanded benchmark should include: (i) flat GCN, GAT, GraphSAGE, GIN, ChebNet, and graph-transformer models trained under the same feature and label protocol; (ii) physics-informed GNN variants with power-flow residuals or sensitivity constraints; (iii) classical degree, betweenness, electrical centrality, voltage-sensitivity, and contingency-based vulnerability indices; and (iv) ablations that remove the hierarchy, remove the supernode/global encoder, replace local–global fusion with local-only prediction, vary the region-construction strategy, and test any ranking or consistency loss separately.
Original GNN: a two-layer graph-convolution encoder on the full graph that decodes node and branch vulnerability directly.
Graph-partition GNN: a partition-based baseline that first divides the graph into local subgraphs and then trains local graph models independently, mainly reflecting the computational-partitioning route.
Hierarchical GNN: the proposed method that forms supernodes from lower-layer groups, performs local and global graph convolution, and fuses the resulting representations for node scoring.
5.3. Detailed IEEE 145-Bus Case Configuration
The case study reports that the selected IEEE 145-bus benchmark has 133 effective nodes and 358 branch samples in the processed experimental dataset. This processed case therefore serves as a medium-scale representative system where local, regional, and corridor-level effects can all be observed clearly.
The IEEE 145-bus case is particularly suitable for structural interpretation because the supplementary package provides synchronized evidence at three levels: node vulnerability labels, branch-vulnerability rankings, and a topology visualization. This combination makes it possible to examine whether the predicted weak nodes coincide with corridor-dominated high-risk regions rather than being isolated local anomalies.
5.4. Computational Considerations
For a three-layer hierarchy, the approximate per-layer message-passing complexity is proportional to , where is the hidden dimension. Since and are much smaller than , hierarchical encoding offers a computationally reasonable trade-off between expressive power and scalability. In practice, the local scripts either limit cluster size or set the number of supernodes adaptively according to network size, which keeps the upper-layer graphs compact.
5.5. Cross-System Comparative Results
The case study contains cross-method comparisons on three representative systems: IEEE 57-bus, IEEE 145-bus, and IEEE 300-bus. The results are listed in
Table 3.
Several important observations can be drawn from
Table 3. First, the hierarchical model delivers the clearest advantage on the medium-scale interconnected benchmark represented by case145. Its MSE is reduced substantially from 0.0161 to 0.0068 relative to the original GNN and from 0.0207 to 0.0068 relative to the graph-partition GNN, while the Spearman coefficient reaches 0.825. This result is highly relevant for the targeted engineering scenario of this paper, because medium-scale interconnected systems are exactly the systems where local interactions and inter-area couplings are both important.
Second, the improvement on case57 indicates that explicit scale separation is beneficial even when the graph itself is not extremely large. The model significantly improves ranking quality, reaching 70% Hit@10 and 60% Hit@5, which suggests that hierarchical structure enhances ranking stability even in relatively compact systems.
Third, the case300 results indicate that the present implementation still faces challenges on larger systems with weaker structural regularity or reduced label coverage. This is an important and honest observation. It shows that hierarchical graph learning is not a universal remedy by itself; performance still depends on training coverage, label quality, and the adequacy of the region/backbone abstraction.
Overall, the representative comparisons support a moderated claim: hierarchical modeling is particularly effective when weak-node significance is governed by cross-scale propagation rather than by purely local patterns, but the present evidence should not be interpreted as proof of universal large-scale generalization.
5.6. Structural Interpretation of the IEEE 145-Bus Case
The IEEE 145-bus system is the most informative case in the supplementary package and is therefore used for detailed structural analysis.
Figure 2 provides a topology-centered interpretation of the case145 benchmark. In the figure, the red nodes denote the top-10 weak buses in the mechanism-based node ranking, whereas the orange line segments denote the top-10 weak transmission lines in the branch-vulnerability ranking. The notable spatial overlap between the highlighted nodes and highlighted line segments indicates that weak nodes are not scattered randomly across the network. Instead, they concentrate around a limited number of corridor-dominated transfer paths, especially in the regions surrounding Buses 68, 71, 140, 141, 62, 65, 116, and 117. This visual evidence is important because it shows that the target of weak-node identification is physically coupled to branch-level transmission bottlenecks and regional stress accumulation.
Table 4 compares the top-ranked weak nodes in the reference mechanism labels and in the hierarchical prediction.
The most important observation is that the hierarchical model correctly identifies Bus 141 as the most critical weak node and retrieves several other structurally meaningful nodes, including Bus 68, Bus 123, Bus 135, and Bus 140. Compared with the original GNN, the hierarchical model exhibits a much stronger alignment with the corridor-centered high-risk structure visible in the branch ranking.
This result is not accidental. The highly ranked predicted nodes are not simply the nodes with strong local activity; they are nodes embedded in a stress-transmission structure. The success of the hierarchical model therefore supports the theoretical argument that node vulnerability depends on both local evidence and higher-layer context.
5.7. Node–Line Coupling Evidence and Representative Critical Buses
The branch ranking file provides strong support for the above interpretation. The top-10 branch vulnerabilities in the IEEE 145-bus case are listed in
Table 5.
The coupling between the node ranking and the branch ranking is striking. The reference weak nodes 141, 68, 140, 65, 123, 116, 117, 62, and 135 are tightly distributed around the high-risk branches 68–71, 140–141, 62–65, 141–142, 65–68, 62–116, and 116–117. This means that the weak-node ranking is not arbitrary. It is consistent with the actual transmission bottlenecks and the corridor-level stress pattern.
This evidence also explains why the hierarchical model outperforms the flat GNN on case145. A flat local GNN may detect heavily connected or locally unusual nodes, but it does not explicitly encode the fact that risk is concentrated around a set of correlated node–line structures organized by region and corridor. The hierarchical model, by contrast, can detect that the vulnerability of nodes such as 141 and 68 is amplified by their position in a broader propagation structure.
Representative critical buses further illustrate this point. Bus 141 is the top-ranked weak node in the reference set and also the top prediction of the hierarchical model. Its importance is strongly supported by the presence of top-ranked branches 140–141 and 141–142, indicating that it acts as a corridor-adjacent structural hub rather than a purely local anomaly. Bus 68 is directly tied to the highest-risk branch 68–71 and another top-ranked corridor 65–68, which shows that the model correctly recognizes nodes that belong to the dominant transmission chain. Bus 123 and Bus 135 also appear among the high-ranked predictions, and their relevance is consistent with the branch table, where branch 134–135 is highly vulnerable and branch 66–123 remains among the most critical corridors in the detailed report.
Taken together, the case145 evidence shows that the model is not merely fitting labels numerically. Instead, it is discovering a physically meaningful coupling pattern:
This node–line–corridor chain is exactly the vulnerability-propagation logic emphasized by the hierarchical technical route. Therefore, the case study provides not only predictive evidence but also mechanism-level validation.
5.8. Engineering Implications, Limitations, and Future Research Directions
The main advantage of the proposed method is that it transforms weak-node identification from a shallow neighborhood aggregation problem into a structured multi-level inference problem. From an engineering perspective, the model can distinguish at least three practically meaningful node types: nodes that are both locally stressed and globally important; nodes that are locally moderate but regionally stressed; and nodes that are locally and regionally moderate but globally dangerous because they lie on corridor-dominated transfer paths. This distinction is highly valuable for dispatch support because the third type is often the most dangerous and the least visible to heuristic or purely local methods.
The proposed method also differs conceptually from partition-based approaches. Partition-based GNNs improve computational tractability, but they may destroy the information that matters most for vulnerability analysis, namely boundary coupling, inter-area transfers, and cross-region feedback. The proposed hierarchical method compresses the graph without disconnecting the semantics. Higher-level graphs are learned abstractions of the original network and remain coupled to it through attention and feedback.
Although the case145 results are promising, the current results also reveal important limitations. The expanded baseline comparison shows that GIN, ChebNet, TransformerConv, GraphSAGE, and even simple degree-based indicators can be competitive on selected metrics. The main supported conclusion is narrower: the proposed local–global hierarchical formulation gives strong MSE performance and physically interpretable node–line–corridor evidence on the IEEE 145-bus benchmark. Full journal-scale validation still requires repeated-run statistics, systematic hyperparameter tuning, physics-informed GNN baselines, objective N − 1/cascading-failure labels, and component-level ablations.
Several extensions are especially promising for future work:
Physics-informed hierarchical learning: embed power-flow residuals, sensitivity equations, or contingency constraints into the loss function.
Temporal hierarchical graphs: replace static graphs with time-evolving hierarchical graphs to model stress accumulation and relaxation dynamics.
Adaptive region construction: learn the region/backbone hierarchy end-to-end rather than relying on spectral clustering only.
Joint node–line vulnerability learning: use a shared latent space so that node and branch rankings are optimized simultaneously.
Online deployment: integrate the framework with dispatch-support platforms for streaming evaluation under rolling scenarios.
6. Conclusions
This paper has presented a mechanism-aware hierarchical graph-learning framework for weak-node identification in complex interconnected power grids.
The results show that the hierarchical design is particularly effective on representative small- and medium-scale interconnected benchmarks. On the IEEE 145-bus case, the proposed method achieves the best MSE and ranking consistency among the compared methods and correctly highlights nodes strongly coupled with the highest-risk transmission lines. The case study confirms that the method can identify hidden critical nodes governed by inter-area propagation structures rather than merely by local anomalies.
In summary, the proposed framework provides a physically interpretable route for learning and analyzing a defined mechanism-based weak-node vulnerability proxy. The supplementary comparison with GAT, GraphSAGE, GIN, ChebNet, TransformerConv, and classical indicators improves the experimental transparency and shows that the method is competitive rather than universally dominant.