1. Introduction
With the continued expansion and increasing complexity of bridge construction projects, long-span PC continuous rigid-frame bridges (LSPCCRFBs) are being increasingly exposed to significant safety risks during their construction phase [
1]. These projects often involve parallel operations, dense spatial coupling, and rapid state transitions. Even seemingly minor disturbances within the interconnected “bridge–human–machine–environment” system can amplify and evolve into severe accidents [
1]. In previous research on the construction-phase risks of bridges and other infrastructure, scholars have focused mainly on unsafe human behaviours, machinery and lifting equipment, environmental conditions, and the vulnerability of structural components. For example, Shan et al. systematically collected 126 bridge construction accident cases and organized 25 secondary risk factors into four primary categories—human, equipment, management, and environment—and then analysed how their coupling led to accidents in bridge construction [
2]. Li et al. developed a safety risk indicator system for highway bridge construction and used entropy-weighted cloud models to assess overall construction risk levels, highlighting that working-at-height operations, heavy lifting equipment, temporary structures, and complex environmental conditions are the dominant contributors to safety risk [
3].
On the methodological side, many studies rely on checklists, safety inspection tables, accident statistics, and expert ratings, which are then processed using AHP–fuzzy comprehensive evaluation, cloud models, or related multicriteria techniques to obtain comprehensive risk scores for specific bridge projects [
3,
4]. Classical process-safety methods such as FMEA and their enhanced variants have also been introduced into bridge engineering, but they essentially depend on static scoring tables and expert workshops for hazard identification [
5]. In terms of probabilistic modelling, Zhang, Wu, and Skibniewski proposed a Bayesian-network-based safety risk analysis framework for construction projects, in which human, equipment, environmental, and management factors are represented as nodes, and their conditional probabilities are elicited from experts or historical data [
6]. However, these approaches are typically implemented as one-off, static assessments: network structures and conditional probability tables are fixed over time, the temporal evolution of risks during the construction process is not explicitly modelled, and onsite monitoring data—particularly vision-based observations of workers, machinery, and protective facilities—are seldom integrated. As a result, existing frameworks have a limited ability to capture how risks propagate over time or to provide multistep predictive information for proactive scheduling and resource allocation.
Recent advances in computer vision technology have opened new possibilities for improving construction safety. Over the past few years, the YOLO family of algorithms has shown strong real-time detection capability and precision across construction settings [
7,
8], making it possible to automatically recognize personnel, machinery, and protective equipment. Enhanced versions such as YOLOv7 and YOLOv8 have further advanced performance in detecting small objects and operating under low-light conditions [
9,
10]. The inclusion of attention mechanisms and multiscale feature fusion has also improved adaptability in visually complex scenes [
11,
12]. However, much of the current research remains confined to the “detection layer,” rarely extending towards causal inference or dynamic forecasting, leaving a persistent methodological gap in understanding “how to transform detection outputs into causal inference and dynamic forecasting” [
13,
14].
In the domain of causal modelling, the DEMATEL–ISM approach has gained traction for uncovering cause–effect relationships and hierarchical patterns among multiple risk factors within complex engineering systems [
1,
15]. For instance, Li (2024) proposed an accident-chain analysis model based on fuzzy DEMATEL–ISM [
16], whereas Xue (2025) examined the layered evolution of risks in shield tunneling construction [
17]. Nevertheless, most studies to date still rely on static topological structures and lack integration with dynamic probabilistic modelling [
18,
19], limiting their responsiveness to time-dependent changes in risk behaviour.
DBNs, on the other hand, have proven effective in managing uncertainty and temporal dependency in evolving systems [
20,
21,
22,
23]. By defining conditional probability tables (CPTs), DBNs are capable of updating risk states over time. However, many existing implementations employ fixed CPTs or static datasets, which hinders adaptability to the nonstationary nature of construction sites [
22,
24]. Moreover, these models often focus solely on posterior inference and lack forwards-looking prediction with quantified uncertainty, restricting their ability to support resource allocation and task scheduling [
25].
In this study, we explicitly position the proposed framework in relation to existing risk assessment methods for bridge and infrastructure construction and highlight three main innovations. First, unlike traditional checklist-, questionnaire-, or expert-based approaches, we fuse an enhanced YOLOv8 model with a structured “human–machine–environment–bridge” labelling system so that key risk elements are extracted directly from onsite images rather than being inferred solely from subjective scoring. Second, instead of using DEMATEL/ISM only to rank factors or to obtain a static hierarchy, we transform the credibility-weighted influence relations into a directed acyclic topology that is explicitly designed to serve as the structure of a dynamic Bayesian network, thereby resolving bidirectional and cyclic links while preserving dominant causal chains. Third, in contrast to conventional static Bayesian-network or risk-matrix applications that provide only one-off assessments, we embed this topology into a DBN with sliding-window learning and a forgetting factor and apply particle filtering to generate time-varying, scenario-level risk probabilities over multiple future time steps. In combination, these features allow the framework to move beyond mature but static methods towards a vision-guided, dynamically updated, and decision-oriented risk assessment tool for LSPCCRFB construction.
The main contributions of this paper are as follows: (1) at the perception stage, an enhanced YOLOv8 model is designed with multi-attention modules, dynamic convolution, and auxiliary segmentation–distillation strategies to strengthen risk recognition in complex environments; (2) at the causal modelling stage, an improved DEMATEL–ISM algorithm fuses expert insights with YOLO-based observations to construct credibility-weighted causal chains and hierarchical structures; and (3) at the reasoning stage, a dynamic Bayesian network is employed, where sliding-window learning and a forgetting-factor mechanism enable adaptive CPT updates. Combined with particle filtering and Monte Carlo forwards propagation, this framework can generate multistep probabilistic forecasts of risk evolution, offering a comprehensive and dynamic approach to construction safety assessment.
3. Case Study: Dynamic Risk Assessment of Long-Span PC Continuous Rigid-Frame Bridge Construction
3.1. Project Overview
Building on the proposed framework, we apply it to a case study. The Tongshun River Bridge is a key control project on the Wuhan Metropolitan Ring Expressway. The main bridge spans the Tongshun River, a class VI inland waterway, where geological, hydrological, navigation, and construction conditions are complex.
The bridge adopts a three-span, variable cross-section PC continuous rigid-frame scheme with a main span of 160 m. Cantilever construction with travelling formwork (hanging baskets) is employed. Owing to the tall main piers and pronounced structural flexibility, controlling stresses and deformations during construction is challenging. The structure exhibits the typical features of a long-span–tall pier–long cantilever; the plan layout is shown in
Figure 3.
During construction, the structure is subject to a time-varying, coupled, multiple-condition, and uncertain state driven by multiple factors. Key milestones include zero block casting, segmental cantilever casting using hanging baskets, side-span cast-in-place segments, and closure. These factors—temperature effects, stressing sequences, concrete creep and shrinkage, hanging basket stiffness degradation, and cumulative errors—can deviate structural responses, resulting in schedule delays, quality issues, or safety risks. In addition, the site lies in a humid central climate zone with rapidly rising water levels during flood seasons and short construction windows. Subsurface conditions include deep soft soils, making the stability control of falsework and working platforms difficult. Overall, construction-phase risks are highly dynamic, interactive, and nonlinear.
Current projects typically rely on static forms, expert judgement, or preset risk lists for safety management, which are ill suited to the multisource coupling and dynamic evolution described above. Therefore, taking the Tongshun River Bridge as a case study, we develop a construction-phase risk assessment framework that integrates DEMATEL–ISM-based causal identification with a DBN. The framework enables risk chain modelling, key factor identification, and probabilistic inference, providing methodological support for the intelligent safety management of complex bridge construction.
3.2. Data Acquisition and Preprocessing
3.2.1. Data Acquisition
To comprehensively cover the typical risk factors at construction sites, we constructed a labelling scheme comprising four categories—bridge, human, machine, and environment—with 14 subcategories in total (
Section 2.1.4).
Table 7 reports the risk element counts and proportions.
3.2.2. Image Preprocessing and Augmentation
Images collected on site vary markedly in quality because of natural lighting, auxiliary illumination, and occlusions. To analyse the effects of illumination on detection, grayscale histograms were constructed, and the images were grouped into four typical lighting conditions:
- (1)
Well-lit scenes: The exposure is adequate, with clear edges and rich details for primary targets. The grey levels range mainly from 50–180, and the mean grey values are >100.
- (2)
Partial shadow/occlusion: Shadows from equipment, scaffolding, or structures cause marked local brightness variations. The grey levels range mainly from 12–255, and the mean grey values ≈ 130.
- (3)
Low/backlighting: The images are taken at dusk or night with insufficient fill light. Edges are blurred, and contours are degraded. The grey levels range mainly from 0–50 and 120–180, with a mean ≈ 110.
- (4)
Very dark scenes: The images include long-range night surveillance or areas without temporary lighting. The grey levels range mainly from 0–70, with a mean < 40.
Histograms under the four lighting conditions are shown in
Figure 4.
Table 8 summarizes the image counts for well-lit, partially shadowed/occluded, low-light, and dark scenes.
To improve the robustness to small objects and complex scenes, all the annotated images were resized to 1280 × 720 while preserving the aspect ratio. Augmentations include mosaics, random horizontal flips, brightness/contrast perturbations, and copy–paste to expand effective samples for rare classes. For classes comprising <2% (e.g., unprotected workers and warning signs), random oversampling and MixUp are applied during training to mitigate class imbalance.
To validate the effectiveness of histogram-based analysis and augmentation, a representative low-light example is shown in
Figure 5. The original image (
Figure 5a) is dim, with a histogram concentrated between 0 and 70 (
Figure 5b). After CLAHE and gamma correction, the brightness and local contrast improve markedly (
Figure 5c), and the histogram widens (
Figure 5d), facilitating more accurate feature extraction.
3.3. Image Detection Performance and Risk Element Recognition
We evaluated the improved YOLOv8 model using standard detection metrics, including classwise precision–recall (PR) curves, F1–confidence curves, mAP, and a four-class confusion matrix (bridge, human, machine, environment). The computation of these metrics follows the procedures described in
Section 2.1.3, and the resulting curves for the Tongshun River Bridge dataset are presented in
Figure 6,
Figure 7 and
Figure 8.
3.3.1. Detection Performance Analysis
The improved YOLOv8 detector was trained for up to 100 epochs with early stopping on the basis of the validation mAP, and the best checkpoint near the 100th epoch was used for all subsequent experiments. We used stochastic gradient descent (SGD) with an initial learning rate of 0.01, momentum of 0.937, weight decay of 5 × 10−4, and a batch size of 16, together with a cosine learning-rate schedule and three warm-up epochs.
Training ran for up to 100 epochs with early stopping—training was terminated when the mean validation accuracy did not significantly improve for several epochs. The best model was obtained when approaching the 100th epoch. The finalized model was evaluated using the test dataset, and the performance trends for the training and validation phases are illustrated in
Figure 6. In total, the annotated dataset contains 17,649 image frames. Among them, 12,354 frames are used for training, 1765 for validation, and 3530 for testing, following the 7:2:1 ratio described above. The evolution of the loss and performance metrics on the training and validation sets is shown in
Figure 6, while the final detector is selected on the basis of the validation performance and evaluated on the held-out test set.
The overall training and test performance of a single multiclass YOLO-based detector trained jointly on all the annotated risk elements are shown in
Figure 6. Images containing bridge-, human-, machine- and environment-related elements are combined in each mini-batch, and a unified multiclass loss is optimized so that gradients from all 14 risk elements contribute jointly to the model update. As shown in
Figure 6, the box, objectness and classification losses for both the training and validation sets decrease monotonically and remain close to each other throughout 100 epochs, indicating stable convergence without obvious overfitting. Moreover, the precision, recall and mAP@[0.5:0.95] steadily increase with training. This behaviour confirms that the improved YOLOv8 model has learned a consistent representation of the Tongshun River Bridge scenes and can provide reliable visual evidence for subsequent DEMATEL–ISM–DBN risk assessment. All curves in
Figure 6 are aggregated over all 14 risk elements; category-specific detection performance is further analysed in the confusion matrix (
Figure 7) and related discussion.
The confusion matrix for the four categories (structure, personnel, machinery, and environment) is presented in
Figure 7. Rows denote the ground truth, columns denote predictions, and diagonal entries indicate correct detection rates. High accuracy is achieved across all categories. On the test set, the normalized confusion matrix in
Figure 7 has a recall of 0.90 for bridge structures, 0.89 for humans, 0.94 for machinery and 0.88 for the environment, while the background recall reaches 0.87. From a safety perspective, such high recalls for the human- and environment-related categories mean that workers, temporary facilities and protective devices are rarely missed by the detector. This greatly reduces the probability that hazardous situations, such as working at height without adequate guarding or entering poorly protected lifting zones, remain completely unseen in the subsequent causal-chain analysis and DBN-based risk inference.
The precision–recall (PR) curves and the F1–confidence curves computed on the validation set are shown in
Figure 8. These validation curves are used to select the confidence threshold that maximizes the F1 score on the validation data, and this operating point is then adopted for all subsequent test and case-study analyses. The precision–recall curves in
Figure 8a yield AP @ 0.5 values of 0.93, 0.85, 0.97 and 0.88 for the bridge, human, machinery and environment categories, respectively, and an overall mAP@0.5 of 0.94 across all four categories. These results indicate that the detector maintains high precision over a wide range of recall levels, even for the human and environment categories that are most directly related to onsite safety conditions.
As shown in
Figure 8b, the F1–confidence curves peak at 0.89, 0.83, 0.87 and 0.82 for the bridge, human, machinery and environment, respectively, with an overall maximum F1 of 0.87 for all classes at an optimal confidence threshold of 0.31. This operating point achieves a favourable trade-off between missed detections and false alarms, ensuring that safety-critical objects are detected with high reliability while spurious warnings are kept at a manageable level for practical use at the construction site.
Therefore, the improved YOLOv8 model provides a robust perceptual front-end for the proposed DEMATEL–ISM–DBN framework, providing credible observations of structural, human, machinery and environmental elements that underpin the subsequent dynamic risk assessment for LSPCCRFB construction.
3.3.2. Risk Element Recognition and Visualization
As a further step of the validation procedure described in
Section 3.3.1, we also conduct a qualitative validation of the detector. Representative detection results on nontraining images are shown in
Figure 9. The four categories (‘bridge’, ‘person’, ‘machine’, and ‘environment’) are correctly localized and classified in typical bridge construction scenes, including complex backgrounds and varying illumination conditions. This qualitative validation complements the quantitative validation metrics (loss, precision–recall, and F1–confidence curves) and visually confirms the model’s ability to recognize risk elements in realistic construction scenarios.
3.4. Causal Matrix and Hierarchical Topology Analysis
3.4.1. Factor Attribute Analysis Based on DEMATEL
To ensure representative inputs, a “dual-source fusion” strategy is used. Nine senior experts scored the direct influence between any pair
Ri,
Rj; their averaged, rounded scores form the expert matrices
A(k) (
Figure 10). The maximum interexpert difference per item does not exceed 1, indicating strong consistency and reliability. Objective information is derived from the detections in
Section 3.3: frequency ci and mean confidence
pij (when
Ri may trigger
Rj). The expert rating matrices were first subjected to consistency checks and normalization. Specifically, the direct-relation matrix A is constructed from expert scores according to Equation (10). Before implementing the ISM hierarchical decomposition in
Section 2.2.2, the DEMATEL results from
Section 2.2.1 must be applied to the integrated subjective-objective weighting scheme (Equations (17)–(19) in
Section 2.2.3) to obtain the fused direct relationship matrix shown in
Figure 11.
As illustrated in
Figure 10, strong outgoing influences from unprotected personnel, commanders, guard lines and signs correspond to typical accident chains in LSPCCRFB construction, such as workers entering crane and lifting-hook operation zones when barriers or instructions are missing.
Section 2.3.5 describes how such element-level impacts are mapped onto scenario-level risks, and
Table 9 shows this mapping for the defined Fall, Collision, Injury and other risk scenarios.
To elucidate more precisely the causal linkages among these influencing factors within the established framework, normalization was carried out according to Equation (11) using the “maximum row sum” criterion, resulting in a standardized matrix. This matrix was then expanded using Equation (12) to obtain the comprehensive influence matrix, as presented in
Figure 12. The prominence
Ci and causality
Ei values reported in
Table 9 are computed from the comprehensive influence matrix
T using Equation (13), revealing that out of the 14 influencing factors, seven act as causes, while the remaining seven serve as effects.
The data in
Table 10 indicate that Unprotected personnel (F6) and Commanders (F5) have the highest prominence values, whereas the Guard line (F11) and Sign (F12) also lie in the causal quadrant with strong positive
Ei, underscoring their role as critical environmental protection drivers rather than the numerically highest-prominence factors. Unprotected personnel, Commander, Lighting, Operator, and Yard also fall into the causal layer, highlighting the dominant impact of human factors and site layout. In the effect layer, bridge pier, main beam, and cap have high prominence but negative causality (<−1.0), indicating that they are terminal nodes receiving accumulated upstream risk. Vehicles, excavators, cranes, and lifting hooks are likewise effect-layer factors influenced by human, environmental, and operational risks. The prominence versus causality is plotted in
Figure 13.
In
Figure 13, the dotted rectangle encloses ‘key factors’, defined as nodes with prominence Ci above the median and positive causality Ei > 0 (e.g., F5, F6, F11–F14). These factors are both influential and upstream in the network and therefore prioritized for proactive control.
3.4.2. Hierarchical Risk Topology Analysis
To obtain an acyclic DBN from the DEMATEL–ISM graph, the bidirectional links are resolved by retaining the edge with a larger
Tij, and small cycles (e.g., among F1–F3) are broken by imposing a construction-stage-consistent ordering (F1→F2→F3) and removing the weakest edge. Thus,
Figure 14 shows the refined DAG used for DBN inference. Following DEMATEL, ISM organizes the 14 factors into three hierarchical levels—input, operation, and effect groups. The reachability and hierarchical levels are derived by constructing the reachability matrix and performing the iterative decomposition procedure described in Equations (14)–(18).
Because the binarized matrix
T* retains only entries
Tij ≥
λ =
μT +
kσT, some intuitive but relatively weak direct influences (e.g., from personnel or the yard layout to the bridge pier, F1) are represented indirectly via intermediate machinery and environment nodes. This explains why certain ‘cause’ nodes appear unlinked to specific ‘effect’ nodes in
Figure 10, despite belonging to the same physical risk chain.
As shown in
Figure 14, Unprotected personnel (F6) is consistently treated as an upstream causal factor that influences both structural effect nodes (e.g., F1–F3) and the scenario variables. The scenario definitions in
Table 9 therefore describe how combinations of such causal factors and structural context activate the corresponding scenario nodes without changing the parent–child roles in the DBN.
3.5. Risk Probability Inference and Safety Assessment
3.5.1. Network Mapping and Dynamic Modelling
In the DBN, cross-time causal edges connect slices, enabling inference of current states from previous ones. Observations are obtained from the improved YOLOv8: risk labels serve as state variables, and detection confidence serves as observation reliability, ensuring causal consistency while remaining responsive to real-time monitoring. Formally, the DBN state vector at each time slice and the joint distribution over all the slices follow the definitions in Equations (20) and (21), ensuring consistency with the topological structure introduced in
Section 2.3.
3.5.2. Dynamic Learning of Conditional Probability Tables
Because conditions and behaviours are time-varying, static CPTs are inadequate. We therefore update CPTs with a sliding window and exponential decay (Equations (22) and (23)), using weighted counts over joint states and Bayesian smoothing to avoid zero probabilities for rare events.
In this study, the primary risk scenarios selected are fall, collision, injury, structural damage and strike. The mapping relationships between these five risk scenarios and risk elements are shown in the table below. These five scenarios are defined by combining the DEMATEL–ISM causal structure (
Figure 14) with typical accident types in LSPCCRFB construction. For each scenario
Sk, we select the risk elements
Ri that simultaneously satisfy (i) belonging to the causal chains leading to this accident type in
Figure 14 and (ii) meeting the necessary condition in
Table 10 (e.g., ‘work at heights’ for Fall and ‘equipment in motion/hoisting’ for Collision). Each scenario is thus implemented as a subnetwork of the full DBN, containing only the corresponding risk elements and their parents.
In
Table 10, the column ‘Necessary condition’ does not represent effect nodes in the DBN but the joint presence of causal factors and structural context required for a scenario to become likely. For example, the Fall scenario involves elevated structural elements (F1–F3) together with the causal factor Unprotected personnel (F6). In the DEMATEL–ISM and DBN models, F6 remains an upstream causal node that influences these structural effect nodes and the Fall scenario variable rather than being treated as a child node itself.
In terms of engineering, the five scenarios correspond to typical construction failure modes. ‘Fall’ mainly captures workers falling from decks, hanging baskets, or scaffolds when guard lines or personal protection are inadequate. ‘Collision’ denotes impacts between cranes, vehicles, lifting hooks and structural members during hoisting or transport. ‘Injury’ and ‘Strike’ represent personnel entering prohibited zones and being hit by moving equipment or falling objects, whereas ‘Structural Damage’ refers to cracking, deformation, or local collapse of piers, caps, or main beams caused by heavy impacts.
At each time step
t, the DBN with particle filtering provides posterior probabilities
P(
Ri(
t)) for all risk elements. For each scenario
Sk, these element-level probabilities are then aggregated into a scenario-level probability
P(
Sk(
t)) using the Noisy-OR/Noisy-AND and suppression formulations in Equations (28)–(30): the elements listed in
Table 10 as ‘risk elements’ form the triggering set
Rk, whereas protective or environmental factors form the suppressing set
Rk−. Thus, the internal interaction within each scenario is captured by the weighted contributions and suppressions of its associated risk elements.
Using the DBN with particle filtering, we obtain diurnal conditional probability trajectories. The patterns align with task rhythms, personnel states, and environmental factors, demonstrating the model’s fidelity. For each of the five scenarios, the DBN yields the probability that the corresponding binary scenario variable is in the ‘risk’ state at each time step. The curves in
Figure 15 plot the evolution of this probability over the 24 h horizon. Specifically, for each hour within the 24 h horizon, particle filtering yields {
P(
Ri(
t))}, and Equations (28)–(30) are applied to compute
P(
Sk(
t)) for
k = 1, …, 5. The curves and surfaces in
Figure 15 therefore plot the time series of these scenario probabilities
P(
Sk(
t)), i.e., the conditional probabilities of the five construction risk chains within 24 h. Distinct temporal patterns emerge: fall-from-height risk peaks at 8–10 a.m. and 2–4 p.m.; equipment-collision risk increases at midday; zone-intrusion injuries cluster in late morning; damage from missing protection increases through the afternoon; and object-strike risk without guard lines is more prominent from dusk to night.
Across the 24 h horizon, the greatest changes in posterior probability occur for human-related nodes (e.g., unprotected personnel and commanders) and environmental nodes (e.g., yard environment and guard line), whereas structural nodes such as bridge piers, cap beams and main beams show relatively small temporal variations. Qualitatively, the predicted daily risk peaks align with the observed rhythms of high-intensity construction tasks and crane and hoisting operations reported in the site logs, and this consistency has been confirmed by onsite experts. This suggests that the DBN captures the main temporal patterns of human- and environment-driven risk, although a formal quantitative validation remains to be conducted.
These patterns result from coupled human–process–environment drivers: human factors (attention ramp-up in the early morning; fatigue and circadian effects after noon) explain peaks in falls and injuries; process factors (crane operations; material transport concentrated around noon) increase collision risk; environmental/management factors (adjustments/removal of protections and lax oversight in the afternoon) increase structural damage; and low illumination plus missing barriers at night amplify the risk of object strike. Single-peaked curves reflect transient human–process coupling, whereas monotonic increases reflect cumulative structural/environmental effects approaching saturation.
3.5.3. Particle Filtering Inference and Future-State Prediction
Given image-based observations and updated CPTs, particle filtering is used to infer posterior state distributions. Each particle encodes a hypothesized combination of risk-element states; weights are updated per Equation (15) and resampled to obtain current risk probabilities and scores. Forwards Monte Carlo propagation (Equation (17)) then produces
H-step forecasts for planning (
Figure 16).
The figure presents the prediction samples obtained via particle filtering in scatter plot form. Fall-from-height without safety belts, collisions due to command errors, and injuries from zone intrusion exhibit typical unimodal trajectories: their probabilities increase as work intensity and cross-task coordination increase and then decline after the associated operations are completed. In contrast, structural damage caused by missing protections and object strikes due to the absence of guard lines displays monotonically increasing trends that converge towards a plateau, reflecting cumulative mechanisms and the gradual deterioration of site conditions. The unimodal trajectories result from transient human–process coupling, whereby attention, crowding, and coordination load increase and subsequently decrease within the prediction window. The monotonic trajectories, by contrast, reflect cumulative structural and environmental effects, under which risks continue to accumulate until they are near saturation in the absence of sufficient protective measures.
The particle-filter and forwards Monte Carlo results in
Figure 15 and
Figure 16 are intended as proof-of-concept demonstrations of the proposed framework. A rigorous convergence study—including explicit reporting of particle numbers, Monte Carlo sample sizes, and the impact of variance-reduction techniques—is left to future work, where these settings will be documented and calibrated to ensure stable risk estimates for operational decision-making.
3.5.4. Safety Risk Level Assessment
Following
Section 3.5, representative probability values are extracted from particle-filtered trajectories by averaging over the observation window and capturing steady-state behaviour. These are fed into the loss model in
Section 2.3 to yield risk ratings. Loss levels were calibrated via historical accident data from similar projects, ensuring the model’s applicability to the specific context of LSPCCRFB construction.
Table 11 reports the grades of major site risks.
The data in
Table 11 indicate that Fall has a probability of only 0.2985 but is rated as Level IV because of catastrophic consequences. Structural damage has the highest probability, reflecting structural instability during construction, and is rated as Level III. Collision and Strike have moderate probabilities of having severe consequences; hence, they are classified as Level III. Injury has a lower probability with moderate losses but can disrupt operations without adequate protection and is also rated as Level III.
Overall, human and structural risks dominate the high-risk levels, whereas mechanical and environmental risks are mainly moderate. Safety management should therefore prioritize personnel protection and structural stability while also addressing weaknesses in organizational management and environmental safeguards.
4. Conclusions
This study demonstrates that the proposed vision-guided probabilistic framework significantly enhances the accuracy and adaptability of dynamic risk assessment in LSPCCRFB construction. By integrating improved YOLOv8-based perception with DEMATEL–ISM causal modelling and dynamic Bayesian inference, the method effectively captures the temporal evolution of complex construction risks under nonstationary conditions. To our knowledge, this represents the first comprehensive approach that links visual detection, causal reasoning, and probabilistic forecasting within a unified, data-driven workflow for engineering safety management.
These findings address the long-standing limitations of static, expert-driven risk evaluation by providing a mechanism for continuous, uncertainty-aware prediction. In doing so, the framework bridges the methodological gap between onsite visual monitoring and dynamic probabilistic decision-making. Beyond its theoretical contribution, this work offers practical guidance for improving real-time safety interventions—supporting proactive resource allocation, targeted supervision, and early warning of cumulative hazards in complex construction environments.
However, certain limitations remain. The reliance on expert input for initializing DEMATEL–ISM relations may introduce subjective bias, and the use of a single site dataset constrains the generalizability of the model. Moreover, the current implementation primarily emphasizes short-horizon forecasts, which may limit long-term predictive robustness. Owing to space limitations, we do not tabulate all time-varying conditional probability tables, but qualitative inspection of the posterior distributions reveals that human-related and environmental nodes undergo the greatest adjustments over time, whereas structural nodes change only modestly. A further limitation is the lack of formal out-of-sample validation of the DBN; future work will incorporate cross-project validation and quantitative predictive scoring to more rigorously assess forecasting performance.
Future work will systematically evaluate the sensitivity of risk estimates to particle numbers, sample sizes, and variance-reduction strategies and will report these quantities explicitly. In addition, we should expand the dataset across multiple construction sites and environmental contexts to improve generalizability. Exploring adaptive structural learning within the DBN to enable real-time causal updates would also be valuable. Integrating IoT-based telemetry and schedule data could further strengthen the framework’s predictive capacity, paving the way for fully automated, closed-loop safety management systems in bridge engineering and other large-scale infrastructure projects.