3.2. Functional Characterization of Hazard Factors
Following the procedure described in
Section 2.2, the co-occurrence matrix
O was first constructed, and the directed influence matrix
N and comprehensive influence matrix
T were subsequently derived via Equations (1) and (2). The influence degree (
Di), affected degree (
Ci), prominence (
Mi), and relation (
Ri) were then calculated for each factor using Equations (3) and (4). The resulting metric values are presented in
Supplementary Materials S2 and S3. For visualization purposes, both matrices are represented as heat maps in
Figure 3. The rows and columns of the 89 × 89 matrix in the figure correspond respectively to 89 risk factors numbered from 0 to 88. The color gradient reflects the numerical magnitude of each matrix element, with red hues indicating stronger positive influence, blue hues denoting negative or negligible influence, and white corresponding to intermediate values. In matrix
T, positive values represent facilitative influence exerted by the row factor upon the column factor, whereas negative values indicate inhibitory effects.
Based on the comprehensive influence matrix
T, the influence degree (
Di), affected degree (
Ci), prominence (
Mi), and relation (
Ri) were calculated for each factor using Equations (3) and (4).
Table 1 and
Table 2 respectively list the ten factors ranked highest by prominence and by relation.
As shown in
Table 1, factors associated with material misuse (Factors 10 and 11), human states (Factors 2, 4, and 7), and heat source management (Factors 12 and 73) occupy the highest prominence ranks. Factor 11, “abandoned or discarded materials,” ranks first with a prominence of 6.847, indicating its extensive connectivity within the risk network. Although abandoned materials do not independently generate ignition sources, they serve as readily available fuel that, once ignited by other factors such as electrical faults or open flames, rapidly amplifies localized ignition events into fully developed fires. Factors 10 and 12, representing improper material use and inadequate separation between heat sources and combustibles, similarly exhibit high prominence, collectively constituting critical nodes within the classic fire triangle of “ignition source–combustible material–inadequate separation.” Notably, human-state factors occupy four positions within the top ten, underscoring the pivotal role of unsafe human conditions in risk transmission: although states such as impairment, lack of supervision, or cognitive limitations do not directly produce ignition, they substantially elevate the probability that other hazard factors will be triggered.
Table 2 reveals that all ten factors with the highest relation values are positive, thereby classifying them as net driving forces within the system. Electromechanical failures, exemplified by Factors 30, 34, and 33, constitute the predominant category, followed by mechanical failures (Factors 20 and 25). This distribution indicates that inherent defects and performance degradation in equipment and wiring represent the primary endogenous sources of risk initiation in building fires. Factor 1, “asleep or drowsy state,” ranks fifth with a relation of 1.762, reflecting the loss of external risk perception and response capacity that transforms otherwise manageable incipient fires into uncontrolled events. Factor 84, “heat from another fire,” also exhibits a high relation value, signifying that once an initial fire becomes established, it functions as a new driving source capable of igniting adjacent compartments, thereby revealing the self-reinforcing, positive-feedback dynamics inherent in fire propagation.
A comparative examination of
Table 1 and
Table 2 reveals that certain factors simultaneously exhibit high prominence and high relation. Factor 34, “unspecified short-circuit arc,” ranks prominently in both lists, indicating that it serves both as a root driver of risk and as a critical hub for influence transmission. Targeted intervention on such dual-role factors is likely to yield disproportionate system-wide benefits. More commonly, however, prominence and relation are dissociated. Factor 11 ranks first in prominence yet lacks a correspondingly high relation value, indicating a primary function as a transmission and amplification conduit. Conversely, Factor 30 ranks first in relation but occupies a comparatively lower prominence position, characterizing it as a potent yet relatively isolated ignition source whose influence requires intermediation by hub nodes for broad dissemination. This functional differentiation substantiates the premise that building fire risk is not driven by a single category of factors operating in isolation but rather emerges from the coordinated interplay among three functional node classes: root drivers, transmission hubs, and terminal manifestations.
These findings can be integrated into a unified transmission framework that reveals the complete risk propagation chain from root drivers through hub conduits to terminal outcomes. At the ignition stage, high-relation factors such as Factor 34 (unspecified short-circuit arc) and Factor 23 (fuel leakage) constitute endogenous driving sources, the former representing concealed high-energy ignition, the latter providing the lethal combination of both ignition source and combustible fuel. At the transmission and amplification stage, high-prominence factors such as Factor 11 (abandoned materials) and Factor 12 (heat source too close to combustibles) fulfill critical hub functions. When driving sources generate sparks or thermal energy, the widespread presence of abandoned materials transmits and amplifies these localized events into fully developed fires, while inadequate separation between heat sources and combustibles sustains and propagates combustion. At the terminal stage of the risk chain, the high relation value of Factor 84 (heat from another fire) underscores the self-reinforcing nature of fire processes, wherein an incipient fire becomes a new driving source capable of triggering compartment-to-compartment fire spread and vertical flame propagation.
Table 3 provides a systematic synthesis of the functional roles associated with each factor category, organized across four dimensions: factor type, data characteristics, structural role, and manifestation in building scenarios.
As summarized in
Table 3, distinct factor categories perform structurally differentiated yet functionally complementary roles in risk generation. Electromechanical failures concentrate in the high-relation domain, functioning as endogenous driving sources that actively liberate energy and directly initiate combustion, constituting the logical origin of fire events. Human factors and material misuse cluster in the high-prominence domain. The former represents critical behavioral variables whose unsafe actions trigger hazardous processes; the latter constitute static risk carriers whose improper states determine fuel load density and potential propagation pathways. Through their high prominence, both categories connect localized risk points into systemic hazard networks. Fire spread factors exhibit dual-high characteristics (high prominence and high relation), reflecting the self-reinforcing dynamics of fire processes wherein an initial fire, itself a consequence, rapidly transforms into a new driving source that escalates fire scale and complexity.
This functional classification furnishes a structured foundation for differentiated intervention strategies. High-relation electromechanical factors warrant prioritization of engineering remediation and preventive maintenance to enhance intrinsic safety. High-prominence human and material factors require emphasis on behavioral regulation and combustible material management to sever critical transmission links. Factors exhibiting dual-high characteristics necessitate coordinated measures targeting both pathway disruption and propagation rate reduction. The metrics derived in this phase further supply the essential attribute calibration required for the hierarchical structure parsing presented in the following section.
3.3. Hierarchical Structure and Transmission Pathway Analysis
Following the procedure described in
Section 2.3, the comprehensive influence matrix
T was subjected to threshold filtering using the threshold value
λ =
μ +
σ = 0.102, where
μ and
σ are the mean and standard deviation of all elements in
T. The adjacency matrix
A, the reachability matrix
R, and the general skeleton matrix
S are displayed in
Figure 4a,
Figure 4b, and
Figure 4c, respectively. Strongly connected components were identified and contracted, and redundant edges were removed via transitivity reduction, yielding the skeleton matrix
S. The complete numerical values of
A,
R, and
S are provided in the
Supplementary Materials S4–S6.
Analysis of the skeleton matrix revealed that Factors 10 (improper use of materials), 11 (abandoned or discarded materials), 12 (heat source too close to combustibles), 20 (mechanical failure), 30 (electrical failure), 2 (alcohol or drug impairment), 4 (intellectual disability), and 7 (age-related factors) collectively form a prominent feedback loop. This loop indicates that material states, equipment failures, and human conditions do not interact unidirectionally but rather constitute a mutually reinforcing closed subsystem: electrical or mechanical failures may ignite improperly stored materials, while the resulting hazardous context may induce unsafe human behaviors, which in turn exacerbate deficiencies in material and heat source management. This finding reveals the existence of self-amplifying coupled functional modules within the building fire risk system, wherein effective intervention requires coordinated disruption of multiple nodes within the loop rather than isolated remediation of individual factors.
Based on the skeleton matrix, two complementary hierarchical topologies, cause-priority layering and result-priority layering, were generated using the extraction rules defined in
Section 2.3. These two topologies, presented in
Figure 5 and
Figure 6 respectively, characterize the hierarchical order among hazard factors from distinct analytical perspectives, the complementarity of which constitutes a core distinguishing feature of the CHM framework relative to conventional single-perspective approaches.
The cause-priority topology partitions the 89 hazard factors into five hierarchical levels (L0 to L4). Level L0, the uppermost tier, contains seven factors with the highest relation values, namely Factors 10, 6, and 73, among others, which exert significant influence on downstream levels while receiving minimal feedback, thereby functioning as the initial perturbation sources within the risk system. Levels L1 through L3 respectively contain 12, 24, and 31 factors, which exhibit both driving and dependent attributes, serving to transmit and transform influence along the causal chain. Level L4, the lowermost tier, aggregates 15 factors with the lowest relation values, including Factors 13 and 15, which function primarily as terminal recipients of upstream causal propagation. The distribution of directed edges across levels reveals a progressive broadening of influence: L0 issues 7 directed edges to L1, L1 issues 14 to L2, L2 issues 28 to L3, and L3 issues 19 to L4. Directed edges represent the direction of the factor’s influence, and the weights of the edges represent the intensity of the influence. If factor 73 points to factor 25 with a weight of 4.2, it indicates that the influence intensity of factor 73 on factor 25 is 4.2. The increasing edge density at deeper hierarchical levels indicates that while root drivers are relatively few in number, their influence undergoes progressive diffusion through intermediate layers, ultimately reaching a broad array of terminal factors. The absence of upward-directed edges confirms that the cause-priority topology, derived using relation as the stratification criterion, constructs a strictly unidirectional causal transmission skeleton. This skeleton defines the legitimate pathways for risk propagation and serves as the constraining framework for the coupling probability computations presented in the subsequent section.
The result-priority topology, conversely, partitions the factors into four hierarchical tiers (L0 to L3) based on prominence gradients. Level L0 encompasses 11 factors with the highest prominence values, including Factors 8, 28, and 11, which occupy strategic hub positions at the confluence of multiple influence pathways and whose reachability spans the vast majority of other factors. Levels L1 through L3 respectively contain 23, 35, and 20 factors, with mean prominence values declining progressively from 6.23 (L0) to 4.87 (L1) to 3.15 (L2) to 1.92 (L3). The decline is non-uniform: the reduction from L1 to L2 (35.3%) substantially exceeds that from L0 to L1 (21.8%), indicating a pronounced hub discontinuity wherein a small cadre of high-prominence factors occupies a structurally dominant position while the majority of factors reside in comparatively peripheral roles. The distribution of directed edges across levels, 32 from L3 to L2, 41 from L2 to L1, and 23 from L1 to L0, peaks at the L2-to-L1 transition, signifying that intermediate tiers represent the most active region of influence transmission. Unlike the cause-priority topology, the result-priority topology contains several cross-tier directed edges, wherein certain L3 factors connect directly to L1 factors, bypassing L2. These cross-tier edges reveal the existence of “shortcut” pathways within the network: certain low-prominence factors, although limited in overall connectivity, maintain direct associations with specific high-hub nodes and can, under particular conditions, trigger hub responses while circumventing conventional transmission chains. This structure provides the weight modulation basis for coupling probability refinement: factors in upper tiers, by virtue of their hub status, exert amplification effects on the coupling intensity of pathways to which they are connected.
Comparison of the hierarchical positions occupied by the same factor across the two topologies enables identification of structurally differentiated functional roles. Certain factors occupy elevated positions in both structures, Factor 10 resides at L0 in the cause-priority topology and L1 in the result-priority topology, thereby exhibiting dual attributes as both root drivers and network hubs, and consequently constituting priority targets for risk mitigation. Other factors display marked positional divergence. Factor 86 occupies the uppermost L0 tier in the cause-priority topology yet resides in the lowermost L3 tier in the result-priority topology, characterizing it as an “isolated engine” that, while possessing substantial driving potential, requires intermediation by hub nodes for widespread influence dissemination. Conversely, Factor 8 occupies the uppermost L0 tier in the result-priority topology yet resides in L4 in the cause-priority topology, indicating that although it is not a root source of risk, it serves as a convergence terminal for numerous influence pathways and, once compromised, directly precipitates severe consequences. The capacity to identify such functional differentiation is a distinctive capability conferred by the dual-perspective layering approach of the CHM framework: reliance on a single stratification rule would obscure these divergent patterns, leading to incomplete or biased characterization of factor roles.
Table 4 provides a systematic comparison of the cause-priority and result-priority hierarchical structures across four dimensions: core metric, stratification criterion, hierarchical interpretation, and representative factor exemplars.
As summarized in
Table 4, cause-priority layering employs relation as the stratification metric, organizing factors along a causal gradient from root drivers to terminal manifestations. Result-priority layering employs prominence as the stratification metric, organizing factors along a structural gradient from core hubs to peripheral elements. The divergent stratification criteria and underlying metrics determine the distinct emphasis each topology places on factor role characterization. It is precisely this complementary emphasis that enables the CHM framework to simultaneously capture factor functionality across both the causal transmission and network hub dimensions. The cause-priority topology addresses the question of “By whom is risk driven, and through which pathways is it transmitted?” whereas the result-priority topology addresses the question of “Where does risk converge, and from where is it disseminated?” In combination, the two topologies furnish the complete structural constraints required for coupling probability computation: the cause-priority skeleton defines the legitimate space of transmission pathways, while the result-priority topology supplies the modulation weights for those pathways. Traditional single-perspective analytical methods cannot concurrently acquire both categories of information; the CHM framework, through dual-perspective layering, achieves a comprehensive characterization of factor hierarchical order.
To facilitate interpretation of the positional differences that a single factor may exhibit across the two topologies, each factor can be classified into one of four functional archetypes based on its combined hierarchical position. (i) Dual-position factors occupy high tiers in both topologies (e.g., Factor 34), indicating that they simultaneously serve as root drivers in the causal chain and as hub nodes in the influence network. (ii) Driver-positioned factors rank high in the cause-priority topology but low in the result-priority topology (e.g., Factor 86). This pattern characterizes a factor that is a potent ignition source but not a major conduit of network influence, its risk contribution is direct rather than diffuse. (iii) Hub-positioned factors rank high in the result-priority topology but low in the cause-priority topology (e.g., Factor 8). These factors do not autonomously initiate risk but occupy critical convergence points where influence from multiple upstream drivers accumulates and is redistributed. (iv) Peripheral factors rank low in both topologies, contributing minimally to both causal initiation and network amplification. This classification demonstrates that positional divergence across the two topologies is not an ambiguity to be resolved but a meaningful signal of functional specialization. Cause-priority layering reveals where a factor sits in the causal chain; result-priority layering reveals where it sits in the hub architecture. Only when both are considered together can a factor’s full functional role be characterized.
The hierarchical structure parsing presented in this section provides the essential topological inputs for the multi-factor coupling probability quantification detailed in the following section. The cause-priority skeleton establishes the spatial boundaries for legitimate transmission, whereas the result-priority hub distribution furnishes the weight modulation coefficients that adjust base probabilities to reflect differential amplification effects along distinct pathways.
3.4. Multi-Factor Coupling Probability and Synergistic Risk Analysis
Using the hierarchical state encoding and mutual-information-based computation detailed in
Section 2.4, the occurrence patterns for the five-level factor combinations were derived from the 2061 incidents. The frequency and probability of each hierarchical state are summarized in
Table 5.
Based on the five-level hierarchical partition derived from cause-priority layering, the occurrence status of factors within each level was encoded for each of the 2061 building fire incidents. A level was assigned a value of 1 if at least one factor within that level was triggered during the incident, and 0 otherwise. Consequently, each incident was represented by a five-dimensional binary state vector.
Table 5 presents the frequency and probability of occurrence for each of the 32 possible hierarchical state combinations across the 2061 incidents.
As shown in
Table 5, the occurrence probabilities of different hierarchical combinations vary substantially. The simultaneous activation of all five levels (11111) was not observed in the dataset, and four-level co-activation events occurred with extremely low frequency, indicating that extreme coupling scenarios involving complete hierarchical activation are rare in practice. In contrast, high-frequency coupling configurations are predominantly concentrated in two- and three-level co-activation patterns. The combination involving simultaneous occurrence of L0 and L1 factors (11000) exhibited the highest frequency, accounting for 883 incidents (42.8% of the total), followed by isolated L1 activation (01000) with 422 incidents (20.5%). This distribution suggests that co-activation of the root driver layer and the first transmission layer constitutes the most prevalent form of multi-factor coupling in building fire incidents.
To quantify the conditional dependencies among hierarchical levels, the base transmission probabilities for single-level, two-level, three-level, and four-level factor combinations were calculated from the joint occurrence frequencies.
Table 6 presents the conditional probabilities for cause-priority hierarchical configurations. Several structural patterns emerge from these conditional probability distributions. First, single-level conditional probabilities exhibit a pronounced gradient: the probability of other levels being triggered given the occurrence of an L0 factor is substantially higher than that given an L1 factor, which in turn exceeds that given an L4 factor. This gradient confirms that factors situated closer to the root driver layer exert progressively stronger perturbations on the overall system state. Second, as the number of coupled levels increases, the conditional probabilities do not accumulate additively but rather exhibit nonlinear escalation, three-level combination probabilities significantly exceed the simple superposition of two-level probabilities, and four-level combinations exhibit further elevation. This nonlinear escalation indicates that multi-factor coupling risk is not the arithmetic sum of individual level risks but rather involves structural synergistic amplification. The subscript notation indicates the occurrence status (0 = non-occurrence, 1 = occurrence) of levels 0 through 4 in order.
The base transmission probabilities presented in
Table 6 reflect only the frequency-based co-occurrence patterns along the causal skeleton, without accounting for the differential influence exerted by hub nodes. To incorporate this effect, hub amplification coefficients were extracted from the result-priority topology. Conditional probabilities under result-priority layering were also computed (provided in the
Section S3 of the Supplementary Materials S1) and exhibited consistently stronger coupling among the top hub tiers: the conditional probability of other levels being triggered given the occurrence of an L0 factor in the result-priority topology is significantly higher than that given intermediate-level factors in the cause-priority topology, and among multi-level combinations, those incorporating L0 factors from the result-priority topology consistently exhibit higher conditional probabilities than analogous combinations lacking such hub-layer participation. This divergence underscores the role of hub status in modulating coupling risk: high-prominence factors, by virtue of their extensive reachability, simultaneously activate multiple transmission pathways upon occurrence, thereby substantially elevating the probability of multi-factor co-activation.
By weighting the base transmission probabilities according to the prominence of hub factors located along each pathway, comprehensive coupling probabilities were obtained for multi-level factor configurations.
Table 7 lists these probabilities for five-level, four-level, three-level, and two-level factor combinations, ranked in descending order.
Several structural characteristics emerge from the ranked comprehensive coupling probabilities. First, coupling intensity exhibits asymmetric attenuation across hierarchical gradients. The five-level coupling configuration yields the highest risk value (T5 = 0.315), followed by four-level and three-level combinations, with two-level configurations demonstrating substantially lower values. The attenuation between the root driver layer and intermediate transmission layers is relatively gradual, whereas the probability drops steeply between intermediate and terminal layers. This asymmetric structure implies that once associations among upper-level factors are established, their downstream transmission potential remains relatively stable, whereas the contribution of terminal-layer factors to overall risk elevation is comparatively limited.
Second, hub amplification effects are manifestly path-selective. Within the same coupling dimensionality, pathways traversing high-prominence hub nodes exhibit significantly elevated comprehensive coupling probabilities, yet the contribution of a given hub factor varies across different pathways. For instance, the amplification effect observed when Factor 29 (arc) co-occurs with Factor 54 (equipment overload) within the same transmission chain is substantially stronger than that observed when Factor 29 is combined with Factor 13 (cutting or welding too close to combustibles). This divergence indicates that hub factors do not exert uniform risk amplification across all combinations; rather, their contribution depends on the functional attributes of co-occurring factors along the pathway. When other nodes along the pathway also exhibit high prominence or high relation, the hub amplification effect is synergistically enhanced; conversely, it is partially offset when co-occurring factors lack such attributes.
Third, the risk contribution of activating factors exhibits pronounced concentration. Comparing the 34 activating factors identified in
Section 3.3 with the factor composition of the top 20 high-risk pathways in
Table 7 reveals that activating factors appear with substantially higher frequency in high-risk pathways than non-activating factors. Over 70% of the top 20 high-risk pathways contain at least two activating factors, whereas none of the pathways composed entirely of non-activating factors rank within the top 20. Further statistical analysis indicates that the 34 activating factors account for over 70% of the risk increment observed in high-risk pathways, while the remaining 55 factors contribute relatively modestly. This concentration pattern reveals a structural regularity in building fire risk generation: a minority of factors exhibiting both strong driving potential and high hub connectivity dominate the majority of high-risk coupling events.
Fourth, systematic differences in coupling intensity are observed between objective and subjective factors. Examination of the factor composition of configurations listed in
Table 7 reveals that coupling pathways dominated by equipment failures, electrical issues, and material states consistently yield higher comprehensive coupling probabilities than those predominantly driven by human states and supervisory deficiencies. For instance, configurations involving Factor 30 (electrical failure) and Factor 20 (mechanical failure) rank among the highest risk values, whereas configurations centered on Factor 2 (alcohol or drug impairment) register comparatively lower values. This divergence reflects the functional differentiation between objective and subjective factors in risk generation: electrical and mechanical failures, once initiated, directly liberate energy and ignite combustibles along relatively deterministic coupling chains, yielding higher risk intensity; subjective factors, in contrast, typically require conversion through additional intermediate steps to produce hazardous effects, introducing greater uncertainty into transmission pathways and thereby diluting coupling risk.
The quantified coupling probabilities presented herein furnish a structured foundation for differentiated risk mitigation strategies. The following section subjects these findings to comparative validation and robustness analysis, examining the structural resolution of the CHM framework, the practical discrimination power of its hub-modulated probabilities, and the stability of the factor concentration effects reported above.
3.5. Comparative Validation and Robustness Analysis
The results presented in
Section 3.2,
Section 3.3 and
Section 3.4 establish the CHM-derived factor roles, hierarchical structures, and coupling probabilities. This section subjects two key findings to three complementary forms of validation. First, the structural resolution of the CHM framework is evaluated through a head-to-head comparison with ISM (
Section 3.5.1). Second, the practical value of the hub amplification mechanism is tested by comparing CHM comprehensive coupling probabilities against unmodulated baseline probabilities (
Section 3.5.2). Third, the robustness of the factor concentration effect is assessed through a random subsampling procedure (
Section 3.5.3).
3.5.1. Structural Resolution: CHM Dual-Perspective Layering Versus ISM Single-Perspective Layering
This section evaluates whether the hierarchical structures produced by CHM offer finer resolution in distinguishing factor roles than those obtained from a representative existing method, by applying both CHM and ISM to the same adjacency matrix. ISM is widely used in fire hazard factor research and is characterized by its capacity to extract hierarchical order from a directed graph, with an input format compatible with the adjacency matrix
A derived in
Section 3.3. This compatibility makes ISM a suitable reference point: when both methods operate on identical input, any divergence in their outputs can be attributed exclusively to differences in their analytical mechanisms rather than to variations in data sources or preprocessing.
ISM and CHM each assign hierarchical positions to factors based on the same directed graph. Where their assignments differ, the comparison examines whether the CHM result-priority topology, which has no counterpart in ISM, can account for the divergence. If positional differences between the two methods align systematically with the prominence-based rankings that distinguish CHM from ISM, the divergence can be traced to a specific analytical mechanism rather than to random variation.
The ISM procedure partitioned the 89 factors into five hierarchical levels, the CHM cause-priority topology also produced five. The two structures exhibited strong consistency: over 89% of factors were assigned to equivalent or adjacent levels across the two hierarchies. However, the critical distinction emerged when the CHM result-priority topology was compared with the ISM output. The result-priority layering, which has no counterpart in the ISM framework, partitions factors based on prominence gradients rather than causal direction, thereby revealing the hub architecture of the factor network.
Table 8 lists the factors located at different levels in the ISM model and the cause-priority topology graph, and summarizes the hierarchical positions of these factors across all three topologies, highlighting cases where the ISM single perspective obscured functionally significant role differentiation.
With the accuracy of the cause-priority layering established, the factors with divergent hierarchical positions listed in
Table 8 take on particular analytical significance. These divergences are not shortcomings of CHM; rather, they arise from a second analytical dimension that CHM introduces beyond the causal dimension, the result-priority layering, which enables a finer differentiation of factor functional roles. This additional dimension allows CHM to capture information that causal position alone cannot reveal: factors that occupy similar positions in the causal chain may differ substantially in their importance within the network hub structure.
Following the order of
Table 8, the functional roles of these factors are given a fuller characterization through the CHM dual-perspective analysis. Factors 86, 6, 23 and 73 are identified as root-driver factors in the cause-priority topology, a judgment broadly consistent with that of ISM, indicating that both methods correctly recognize their role as strong ignition sources or initial triggering conditions. The CHM result-priority topology further reveals, however, that these four factors occupy relatively low hub positions in the network; their influence is concentrated rather than widely diffused. This additional information refines the intervention strategy from a generic “address root drivers” to a more targeted “for this type of driver factor, efforts should focus on source control rather than on deploying downstream blocking measures.”
Factors 10 and 29 exhibit a different pattern: the cause-priority topology places them as root drivers, consistent with the ISM assessment, while the result-priority topology further reveals that they also occupy high positions in the network hub structure. This means that these two factors not only initiate risk but also amplify and diffuse it through extensive network connections. The CHM coupling probability computation assigns them higher weights precisely on the basis of this dual role, and the validity of this weighting rests on the more complete functional portrait that the CHM dual-perspective provides.
Factors 1, 8 and 54 represent a third pattern. The cause-priority topology places them at intermediate to low positions in the causal chain, and ISM yields a similar assessment, there is no disagreement between the two methods on their causal positions. The distinction lies in the CHM result-priority topology, which differentiates them clearly: Factor 8 is placed at the highest hub tier, indicating that although it is not an ignition source, it serves as a critical node through which the influence of multiple upstream drivers converges and is redistributed; Factors 1 and 54, in contrast, occupy low levels on both dimensions and are peripheral factors with limited contribution to systemic risk. Similar causal positions yet markedly different hub statuses, this differentiation is beyond the reach of a single causal perspective, and it directly determines the actual differences in the roles these three categories of factors play in risk propagation.
Taken together, the advantage of CHM lies in providing a second dimension of information on factor functional roles through the result-priority perspective while faithfully reproducing the classical causal structure. This additional resolution gives CHM greater practical value in both risk assessment and intervention decision-making. In risk assessment, dual-position and hub-positioned factors receive more accurate coupling weights that reflect their network propagation potential. In intervention decision-making, managers can formulate differentiated strategies, source control, hub isolation, or coordinated intervention, depending on whether a factor is driver-positioned, hub-positioned, or dual-position. These advantages do not arise from negating existing methods but from extending the depth of analysis beyond the point where existing methods stop.
3.5.2. Risk Discrimination: CHM Comprehensive Coupling Probabilities Versus Baseline Probabilities
Experiment A verified the incremental contribution of CHM in structural resolution. This section examines whether this additional structural information carries practical value in risk assessment: specifically, whether the hub amplification coefficient αp enables the CHM comprehensive coupling probability Tcomp to discriminate high-risk incidents more effectively than the unmodulated baseline probability Tbase.
Both measures were computed from identical underlying data, the same hierarchical encodings and the same joint frequency distributions. The only difference is the multiplication by αp: Tcomp ~ = Tbase × αp. Any divergence in risk ordering can therefore be attributed solely to αp, i.e., to the amplification effect of hub factors identified by the result-priority topology.
Tbase and
Tcomp were calculated for all 2061 incidents, and the incidents were ranked separately under each metric. To examine whether the ranking shifts concentrate among known high-risk incidents, “hub-rich incidents” are defined as those involving at least two hub factors located in the L0 or L1 tiers of the result-priority topology. The threshold of two hub factors was selected because a single L0/L1 hub factor appears in approximately 70% of all incidents and is too frequent to allow effective discrimination of risk levels; it is when two or more such factors co-occur that their synergistic amplification begins to meaningfully elevate the coupling probability. Hub-rich incidents accounted for 847 of the 2061 incidents (41.1%); the remaining 1214 incidents (58.9%) were classified as ordinary.
Table 9 reports the distribution of these two incident types in the top 10% and top 20% of the
Tbase and
Tcomp rankings, as shown in
Table 9.
Among hub-rich incidents, the proportion ranked in the top 10% by
Tcomp (31.6%) is more than twice that ranked by
Tbase (14.2%), and a similarly pronounced gap appears in the top 20% (52.4% vs. 28.7%). This indicates that multiplication by
αp relocates a substantial number of hub-rich incidents from the middle and lower segments of the ranking to the top tier. These incidents include the high-risk factor combinations identified in
Section 3.4, such as the co-occurrence of Factor 29 (arc), Factor 54 (equipment overload), and Factor 11 (abandoned combustible materials), whose comprehensive coupling probability
T35 = 0.313 serves as a representative example. Under the baseline ranking, such incidents rely solely on raw joint frequencies and lack structural calibration; their positions are mismatched with their actual hazard levels. Under the CHM ranking,
αp applies an upward adjustment consistent with their hub-rich composition, bringing their ranks into closer alignment with their actual severity.
In contrast, among ordinary incidents, the proportion ranked in the top 10% by Tcomp (3.9%) is lower than that under Tbase (7.1%). Ordinary incidents lack the participation of high-tier hub factors and receive minimal benefit from αp (with most αp ≈ 1); they are even relatively displaced downward as hub-rich incidents move upward as a group. This pattern, high-risk incidents rise, ordinary incidents remain stable, demonstrates that αp does not inflate risk estimates uniformly but selectively amplifies those incidents structurally identified as high-risk by the result-priority topology.
Taken together, the introduction of the hub amplification coefficient αp substantially enhances CHM’s ability to identify high-risk incidents. It does not generate artificial discrimination by increasing model complexity; rather, it converts the dual-perspective structural information validated in Experiment A, particularly the hub factor identification by the result-priority topology, into improved risk prioritization. The concentration of this gain among hub-rich incidents establishes a coherent chain of evidence from structural resolution to risk assessment: Experiment A demonstrated that the CHM dual-perspective design distinguishes factor roles more finely than a single-perspective method; Experiment B demonstrates that this finer differentiation yields measurable gains in risk ranking, confirming that the additional structural dimension has concrete implications for risk-based inspection and resource allocation.
3.5.3. Robustness of the Factor Concentration Effect
To assess whether the concentration of risk contribution among the 34 active factors identified in
Section 3.4 is sensitive to data composition, a random subsampling procedure was performed. From the full set of 2061 incidents, 100 random samples of 70% (n = 1443) were drawn without replacement, and the entire CHM pipeline, from co-occurrence matrix construction through coupling probability computation, was re-run on each sample. Across the 100 replicates, the set of active factors identified in each subsample overlapped with the original set of 34 active factors by an average of 89.2% (SD = 3.1%). The proportion of the risk increment attributable to these factors remained above 70% in 96 of the 100 replicates. These results indicate that the concentration pattern reported in
Section 3.4 is stable under substantial variations in data composition and is not an artifact of any particular subset of incidents.