You are currently viewing a new version of our website. To view the old version click .
Journal of Marine Science and Engineering
  • Article
  • Open Access

20 February 2025

Data-Driven Analysis of the Causal Chain of Waterborne Traffic Accidents: A Hybrid Framework Based on an Improved Human Factors Analysis and Classification System with a Bayesian Network

,
,
,
,
and
1
China Waterborne Transport Research Institute, Beijing 100088, China
2
Guangxi Key Laboratory of ITS, Guilin University of Electronic Technology, Guilin 541004, China
3
School of Architecture and Transportation Engineering, Guilin University of Electronic Technology, Guilin 541004, China
4
Department of Infrastructure, Guilin University of Electronic Technology, Guilin 541004, China
This article belongs to the Special Issue Sustainable and Efficient Maritime Operations

Abstract

In the context of economic globalization, waterborne transportation plays an important role in international trade and logistics. However, waterborne traffic accidents pose a severe threat to life, property safety, and the environment. To gain a deeper understanding of the causal mechanisms behind waterborne traffic accidents, we conducted a data-driven analysis of the causal chain of waterborne traffic accidents. By constructing a hybrid framework integrating an improved HFACS (Human Factors Analysis and Classification System) with a Bayesian network model, we conducted a multi-dimensional analysis of accident causes. The constructed model was quantitatively analyzed by applying genie software to the accident samples collected from the China MSA. The results indicate that there are 12, 3, 6, 2, 4, and 7 causal chains leading to collisions, contact, fires/explosions, windstorm accidents, sinking, and other types of accidents, respectively. These research results can serve as a reference for the enhancement of the safety of waterborne transportation.

1. Introduction

With the acceleration of economic globalization and the expansion of global trade, waterborne transportation has assumed an irreplaceably important role. This form of transportation offers significant advantages, such as allowing for high carrying capacities, low costs, and high long-haul transport efficiency. Nevertheless, accidents often affect the waterborne transportation system, particularly since the 1990s, when changes in marine shipping and the international environment sharply highlighted the importance of shipping safety []. These accidents pose significant threats to lives, property, and the environment [].
Waterborne traffic accidents are not caused by a single factor but occur as the result of the interaction of multiple factors. Therefore, it is especially important to analyze the causal chain of waterborne traffic accidents so as to reduce the likelihood of their occurrence. The HFACS (Human Factors Analysis and Classification System) model, an effective system for analyzing and classifying human factors contributing to accidents, can help identify and categorize human factors that lead to accidents. Furthermore, the HFACS can integrate various methods to explore the causal relationships between factors leading to accidents []. As a probabilistic graphical model, a Bayesian network can learn the probabilistic dependencies between variables by means of statistical data. By constructing a Bayesian network model, one can identify and analyze the influences of various factors in accidents, providing a new method for accident risk analysis and prediction []. Therefore, constructing a hybrid framework combining the HFACS and Bayesian networks, especially in a data-based setting, is a feasible and useful method of comprehensively understanding the mechanisms of accident occurrence, providing a scientific basis for accident prevention and quick responses.

2. Literature Review

2.1. Identification of Causality of Waterborne Accidents

Research on the causality of waterborne traffic accidents has made remarkable progress. Such studies, through the comprehensive application of various methods and theories, have deeply analyzed the root causes of accidents. Wang et al. [] conducted an in-depth analysis of multiple factors affecting the severity of marine accidents, including human, vessel, environmental, and management factors. They used global accident data and statistical models to reveal the independent and interactive effects of these factors on accident severity. Liu et al. [] employed machine learning techniques to comprehensively analyze the causes of marine accidents in China’s coastal waters. They utilized algorithms, including decision trees and random forests, and neural networks to uncover the complex relationships between human, vessel, environmental, and management factors and accident severity. Sun et al. [] employed complex network theory to analyze the causes of waterborne traffic accidents. They constructed a causal network model, analyzed its characteristics, and identified critical causal factors, such as human negligence and management loopholes. Wu et al. [] analyzed ship collision accidents in the Yangtze River that occurred from 2013 to 2017 through text-mining techniques, identifying 33 causal features, covering human, vessel, environmental, and management factors. They used a Bayesian network model and text-mining methods to predict marine risks and discovered that human factors were the primary causes. Using GIS technology, Wang et al. [] explored the spatial distribution patterns of global maritime accidents, including accident frequency and severity, through density analysis and clustering analysis. Ho Namgung et al. [] developed a collision risk inference system for Maritime Autonomous Surface Ships (MASSs) adhering to COLREGs. Addressing gaps in existing methods, their innovative approach defines avoidance actions based on danger levels, sets risk indices using expanded ship domains, extracts key factors from overlapping domains, and employs an adaptive neuro-fuzzy system for learning. This system enhances collision warning timing and positioning, offering more decision-making time for prevention.
The HFACS model, an effective tool in the field of aviation safety, has been applied to the waterborne transportation field by many scholars. Kaptan et al. [] applied the HFACS to the maritime sector to uncover the root causes and causal chains of waterborne traffic accidents. Yildiz et al. [] applied the HFACS-PV method to explore HOFs in maritime accidents. Their analysis revealed this method’s applicability to collisions, groundings, sinkings, and other types of accidents. Wang et al. [] used the HFACS-FCMs model to investigate the human factors in ship grounding accidents. They found that inadequate ship safety management organizations constitute the most significant factor in grounding accidents, followed by organizational influences and unsafe behavior antecedents. Yıldırım et al. [] utilized the HFACS and statistical methods to analyze human factors in collisions and groundings. GIS and FTA of Black Sea accidents revealed that COLREG violations and communication lapses were the primary causes of collision, while grounding accidents stemmed from watchkeeper errors and inadequate bridge resource management communication. Huang [] utilized the HFACS framework combined with expert scoring methods and gray theory to analyze the human factors in marine traffic accidents. The HFACS-MTAI system achieved precise identification, systematic classification, and quantitative ranking of human factors. Chen et al. [] analyzed the causes of marine traffic accidents using the HFACS-MA framework to comprehensively assess human and organizational factors. Through case studies and public accident reports, they validated this framework’s effectiveness and identified key causal factors. Chauvin et al. [] improved the HFACS model using the classification tree method to analyze the causal factors at each level of the collision accidents.
The Bayesian network model has also been widely applied in analyzing the causes of waterborne traffic accidents. Fan et al. [] employed a data-driven Bayesian network model to focus on human factors, exploring their impact on the probability and outcomes of various waterborne traffic accidents when interacting with non-human factors. Antão et al. [] utilized Bayesian Belief Network (BBN) models to investigate the impact of human error on coastal shipping accidents under various sea conditions. Their findings revealed that human error and variations in risk acceptance and perception among the crews of different vessel types had significant influences. Wang et al. [] used a Bayesian network model to investigate the severity of global waterborne transportation accidents in order to reveal the complex relationships between weather, ship characteristics, human factors, and accident severity. Meng et al. [] adopted a data-driven Bayesian network model integrating physical knowledge to explore risk factors influencing ship collision accidents. The research results demonstrated the model’s effectiveness in identifying and analyzing key risk factors of ship collisions. Fan et al. [] employed a Bayesian network approach to explore risk factors influencing maritime transportation accidents. They constructed a network model reflecting the interdependencies among the influencing factors. Tian et al. [] utilized a Bayesian network model to investigate ship collision accidents recorded by the Zhejiang Maritime Safety Administration of China. Their research revealed that human factors such as improper lookout, inadequate collision assessments, and improper collision avoidance measures are the primary causes of accidents and can be used to predict accident probabilities. Hänninen et al. [] conducted a statistical analysis of ship collisions in the Gulf of Finland and utilized a Bayesian network to investigate the causal relationships between human factors and the final collision outcomes. Meng et al. [] employed a combined N-K model and Bayesian networks to assess coupled risks in Chinese ship collisions. They found that multifactor coupling was more effective than bifactor coupling, with human and management factors being vital and ship/environment factors’ impacts becoming more significant with varying probabilities.
Wang et al. [] integrated the HFACS with a BN to investigate the root causes of marine accidents, emphasizing the importance of human and organizational factors. Their results showed that combining the HFACS with a BN can effectively uncover complex human factor chains and organizational deficiencies in marine accidents. Rostamabadi et al. [] developed an FBN-HFACS model to analyze human and organizational factors (HOFs) in process accidents. The model serves as a robust tool for accident prevention and safety management. Jiang et al. [] applied the HFACS, Bayesian network, and path analysis models to analyze the causal pathways of waterborne traffic accidents. Their research revealed the key causal pathways for accidents such as collisions, sinkings, and contacts. Li et al. [] studied the impact of HOFs on ship collision accidents using the HFACS and a Bayesian network model. The HFACS model identified multi-level causes of accidents, while the Bayesian network revealed the probabilistic dependencies between causes and identified critical causal pathways. Their study revealed that crew operational errors and equipment failures were the primary causes of ship collision accidents. Wang et al. [] employed the HFACS-BN model to explore HOFs in collisions between merchant ships and fishing vessels. Their study revealed the interactions between various human and organizational factors in collision accidents and their degrees of influence on accident occurrence. Özkan et al. [] analyzed nearly two decades of ship accidents in the Black Sea region using the HFACS and a Bayesian network model. They found that the frequent accidents in the coastal areas of the Black Sea were primarily due to crew operational errors, equipment failures, and inadequate management.

2.2. Research Gap Analysis

Scholars in the field of causation analysis have conducted extensive research on waterborne traffic accidents. Nevertheless, despite significant progress in this field, there is still room for improvement. Firstly, the research conducted so far is mainly based on data from a specific region or a single type of waterborne traffic accident, and the applicability of the aforementioned method under different circumstances requires further validation. Furthermore, waterborne traffic accidents often involve the intricate interplay of multiple complex factors, and current research efforts in deciphering these interactions are still inadequate. Lastly, the tendency to use single-framework methodologies in existing research has somewhat limited the breadth and depth of inquiry.
Therefore, we conducted a data-driven analysis of the causal chains of waterborne traffic accidents, which can be applied more generally to different situations. Firstly, a hybrid framework integrating an improved HFACS and a Bayesian network model was constructed to conduct a multi-dimensional analysis of accident causes. Secondly, based on the constructed framework, we analyzed the underlying causes from multiple dimensions and conducted a chain analysis of the causal factors, quantitatively assessing the correlations between factors and their degrees of influence on accident occurrence. Thirdly, we identified the global causal chain of waterborne traffic accidents for different types of accidents, providing information that can serve as a general reference for practical waterborne traffic accident prevention.

3. Data-Driven Hybrid Framework Based on the HFACS and a Bayesian Network

3.1. Basic Data-Driven Analysis Framework

In this paper, we propose a data-driven hybrid framework that can be used to analyze the causal chain of waterborne traffic accidents in order to attain a better understanding of the interaction mechanisms between potential causal factors in the waterborne transportation system. The basic framework is shown in Figure 1. As Figure 1 shows, the main procedure includes the following steps.
Figure 1. Illustration of the basic data-driven analysis framework.
Step 1: Initialize the accident dataset. Add all the accident samples collected at the beginning of the procedure into the accident dataset one by one.
Step 2: Calculate the number of occurrences for causal factors. The number of occurrences for causal factors is calculated according to the novel MTAACS (Maritime Traffic Accident Analysis and Classification System, an improved Classification method based on HFACS) proposed in this paper.
Step 3: For every causal factor pair, conduct a correlation analysis. This analysis is conducted using the chi-square test. If the causal factor pair passes the test and the pair is the last factor pair, then proceed to Step 4; otherwise, test the next causal factor pair.
Step 4: Construction and application of Bayesian networks. This step includes five sub-steps, including constructing a conditional probability table, identifying key factors, performing a sensitivity analysis of the key causes, identifying cause paths, and conducting a global cause chain analysis.
Step 5: Update the accident dataset and return to Step 2 if there are any new accident data; if not, output the analysis result.
With regard to the field of exploring analytical frameworks for the causes of waterborne traffic accidents, our study significantly distinguishes itself from the existing research through our innovative combination of the Human Factors Analysis and Classification System with a Bayesian network model integrated in a data-driven framework, which should renew the analysis results according to the updating of the datasets. Different from traditional research methods, which tend to be confined to single-dimensional analyses focusing solely on human factors or technical faults, the framework constructed in this study systematically identifies and categorizes human factors in accidents using the HFACS. These factors comprehensively encompass multiple dimensions, such as individuals, organizations, and the environment, thereby providing more comprehensive and in-depth insights. Furthermore, we introduce an advanced probabilistic graphical model of the Bayesian network. This model not only effectively handles complex dependencies between variables but also quantifies the uncertainty of accident causes through probabilistic reasoning, more accurately capturing the ambiguity and dynamic characteristics of accident causation. This provides solid data support for formulating accident prevention strategies.

3.2. Data Collection and Basic Analysis

To ensure the accuracy and representativeness of the data, we primarily sourced our information from the water traffic accident investigation reports publicly released on the official website of the China Maritime Safety Administration from June 2015 to February 2023. These reports comprehensively document various types of accidents, including collisions, contact, fires/explosions, windstorms, sinkings, etc. However, upon conducting an in-depth analysis of the causal chains of various accidents, it was observed that there were relatively few sample data for strandings and contact. Given the impact of sample size on research accuracy, strandings and contact were excluded in this study. The sample data for the other types of accidents are relatively abundant in the publicly available reports, sufficient to provide the necessary information for an in-depth analysis and understanding of their causal chains.
After systematic data collection and screening, we gathered a total of 756 investigative reports on water traffic accidents along China’s coast, serving as the core data foundation for this research. For collision accidents, each vessel was considered an independent research sample. We classified several common types of accidents and recorded their respective occurrence frequencies, as shown in Table 1.
Table 1. Description and frequency of accident types.
The statistical data in Figure 2 clearly show that there are significant differences in the proportions of different types of accidents in the overall dataset. Collision accidents, other accidents, and sinking accidents account for a relatively high percentage, indicating that these types of accidents are more likely to occur. In contrast, the proportions of contact accidents, fire/explosion accidents, and windstorm accidents are relatively low, suggesting that these types of accidents are less likely to occur. This data pattern provides us with an intuitive overview of accident occurrences.
Figure 2. Accident-type distribution chart.

3.3. Improved HFACS (MTAACS) for Causes of Waterborne Traffic Accidents

In the process of accident analysis, the HFACS model logically provides a basic framework for the causal chain as well as a fundamental pathway with which to identify the root causes of accidents, namely, management factors: direct cause–indirect cause (precursor)–root cause. Although the HFACS model provides a systematic logical framework for causal linkages, there are several limitations to its direct application to classifying factors. Firstly, the factor hierarchy set in the HFACS model may overlook the unique nature of causal structures in waterborne transportation accidents. Secondly, given that the HFACS model lacks clarity in terms of accident classification—a feature that is particularly crucial for the analysis of waterborne traffic accidents in this paper, wherein causal analysis of various accidents is the key research content—it is necessary to incorporate a specific “accident level” as an indispensable part of the framework structure to explicitly classify various accident types. Furthermore, at the “preconditions for unsafe behavior” level, the influence of the external environment must be analyzed in addition to internal factors. It is worth noting that the impact of adverse external environments does not necessarily have a direct causal relationship with upper-level factors such as “adverse organizational influences” and “inadequate supervision”. Lastly, the HFACS model assumes there is a linear causal relationship between adjacent levels, which may be different from the nonlinear causal relationships in waterborne traffic accidents. This complexity necessitates a more nuanced examination of the interactions and associations between factors in the analysis of waterborne traffic accidents.
Thus, in this paper, we propose a novel Maritime Traffic Accident Analysis and Classification System (MTAACS) based on an improved HFACS. The proposed system fully accounts for the unique characteristics and complexities of waterborne traffic accidents. It categorizes the causative factors of waterborne traffic accidents into five hierarchical levels: organizational influences, inadequate supervision, preconditions for unsafe behaviors, unsafe behaviors, and the accident layer. Factors within the same level are considered to be in parallel relationships, implying there are no direct causal linkages. This arrangement is designed to avoid oversimplifying the complexity of waterborne traffic accidents and allows analysts to comprehensively consider the interactions between various factors at the same level. Concurrently, the relationships between factors at adjacent levels are described as nonlinear. This indicates that a single factor may simultaneously impact multiple lower-level factors or that multiple factors may collectively act on the same lower-level factor. This nonlinear relationship can more accurately reflect the complexity and diversity of the various factors involved in waterborne traffic accidents, providing a deeper and more comprehensive analytical perspective for accident prevention.
Based on waterborne traffic accident investigation reports, causative factors relevant to each dimension were extracted. The accident risk causative factors extracted in this study stem from descriptions given in accident investigation reports and can be classified into 38 factors, as shown in Table 2 and Figure 3.
Table 2. Integrated factors causing water transportation accidents.
Figure 3. Statistical chart regarding the layered causes of water traffic accidents.

3.4. Correlation Analysis for Causal Factor Pairs

In analyzing the causal factor chain of waterborne traffic accidents, structural learning becomes a crucial initial stage, with the core objective of clarifying the topological structure of the Bayesian network. As the foundation for constructing the Bayesian network model, structural learning determines how various nodes in the model are interconnected and ascertains their dependency relationships.
In this process, the constraint-based search method plays a decisive role. In this method, various techniques from statistics and information theory are employed, such as the chi-square test, mutual information, and the G2 test. In this study, the chi-square test was selected as the core analytical tool for structural learning. By analyzing the statistical correlations in accident data through the chi-square test, potential links between different causal factors were identified, providing a basis for constructing the topological structure of the Bayesian network model.
Taking C2 (insufficient safety awareness) and D1 (improper lookout) as examples, the chi-square test was applied to examine whether there was a significant statistical correlation between these two factors. The results of applying the chi-square test to analyze the potential link between C2 (insufficient safety awareness) and D1 (improper lookout) are shown in Figure 4. This figure indicates that the expected count (116.19) was significantly greater than 5, and the total sample size reached 964, satisfying the conditions for conducting a Pearson’s chi-squared test. The test results show that the p-value is less than 0.05. According to conventional statistical norms, this result indicates a rejection of the null hypothesis (i.e., C2 and D1 are independent of each other) and the acceptance of the alternative hypothesis, indicating that there is a statistically significant relationship between C2 (insufficient safety awareness) and D1 (improper lookout).
Figure 4. Chi-square test result chart for C2 × D1.

3.5. Construction of Bayesian Network for Causes of Waterborne Traffic Accidents

3.5.1. Basic Concepts and Formulas

A Bayesian network is a probabilistic graphical model used to represent dependencies between variables. It is presented in the form of a Directed Acyclic Graph (DAG), a graphical structure that enables an intuitive understanding and analysis of causal relationships and conditional dependencies among variables. In the model, nodes represent random variables, while directed edges represent the interrelationships between variables, and their relationships are quantified through conditional probabilities. The construction and analysis of Bayesian network (BN) models primarily involve the following three steps. The first step is to determine the BN’s structure. In the context of complex networks, we integrated data-driven approaches with expert knowledge, employing structural learning methods and optimizing based on statistical test results and prior information to accommodate multi-node scenarios. Secondly, the BN parameters must be determined. Given the scale of the network, in this study, we trained the parameters based on sample data to ensure that parameter estimation was both efficient and robust. Finally, an analysis of the BN, particularly a sensitivity analysis, was conducted. Through this analysis, significant precursor variables influencing the target nodes can be identified, thereby allowing the extraction of key causal factors and pathways for water transportation accidents and providing scientific support for accident prevention strategies.
The theoretical basis of Bayesian networks lies primarily in probability theory and graph theory. The basic concepts and formulas for the construction of a Bayesian network are as follows.
(1)
Prior Probability
Prior probability is the initial assessment of the probability of a random event or a variable taking on a specific value before the observation of any relevant data or evidence. In Bayesian statistics and Bayesian networks, prior probability reflects an initial belief about the state of an event or a variable based on historical data, expert knowledge, or background information.
(2)
Posterior Probability
Posterior probability is the reassessment of the probability of a random event or a variable taking on a specific value after the observation of new data or evidence. Posterior probability is calculated based on Bayes’ theorem, which combines the prior probability (the probability before the observation of new data) and the likelihood function (the probability of observing the data given that the event has occurred). Posterior probability represents an updated belief about the state of an event or a variable given the new evidence or observations.
(3)
Conditional Probability
The conditional probability formula is a fundamental concept in probability theory that describes the probability of one event occurring given that another event has already occurred. The conditional probability formula is defined as follows: Let A and B be two events, and P(B) > 0 (i.e., the probability of event B occurring is not zero). Then, the probability of event A occurring given that event B has already occurred is denoted as P(A|B).
(4)
Joint Probability
The joint probability formula describes the probability of two or more events occurring simultaneously. For two events A and B, the joint probability formula is as follows:
P(AB) = P(A,B)
However, if events A and B are independent, their joint probability can be obtained by multiplying their individual probabilities:
P(AB) = P(A) ∙ P(B)
In Bayesian networks, joint probabilities are not usually given directly but instead calculated indirectly through conditional probability tables. These conditional probability tables describe the probability of a child node taking on each possible value given the values of its parent nodes.
(5)
Total Probability Formula
The total probability formula is an important theorem in probability theory that describes the probability of a complex event occurring in terms of the probabilities of a series of simpler events and their conditional probabilities.
Let the full sample space be S. If the events E 1 ,   E 2 ,   E 3 ,   E n form a complete event group (i.e., they are mutually exclusive and exhaustive, and E 1 E 2 E 3 E n = S ) and P E i > 0 ( i = 1 , 2 , 3 , , n ) , then for any event A, we have
P A = P A | E 1 P E 1 + P A | E 2 P E 2 + + P A | E n P E n = i n P A | E i P E i
(6)
Bayes’ Formula
Bayes’ Theorem provides a formula that describes the relationship between two conditional probabilities.
Let the full sample space be S and A be an event within S. If E 1 ,   E 2 ,   E 3 ,   E n is a complete event group (i.e., they are mutually exclusive and exhaustive, and E 1 E 2 E 3 E n = S ) and P(A) > 0, P( E i ) > 0 ( i = 1 ,   2 ,   3 , , n ) , then Bayes’ Theorem states that
P E i | A = P A | E i P E i i n P A | E i P E i
After the topological structure of the Bayesian network is determined through structural learning methods, the next crucial step is to solve for the network parameters. In this process, the Maximum Likelihood Estimation (MLE) method, a commonly used statistical approach, is widely applied in parameter learning for Bayesian networks.
Based on the available accident data, MLE was used to compute the likelihood function of the data and find the parameters that maximize this function to estimate the conditional probability table (CPT) for each node. The results of parameter estimation provide the CPTs for each node in the Bayesian network model, representing the probability distribution of each node given its parent nodes’ states. Further analysis of these parameters can reveal the dependency relationships between different factors and their degrees of influence.

3.5.2. Bayesian Network for Causes of Water Traffic Accidents

Based on the established conditional probability table (CPT), we could obtain the Bayesian network for the causes of water traffic accidents using GeNle software Academic 4.1. The network is shown in Figure 5. Therein, five different colors means different categories of factor levels. The blue and yellow bars are calculated based on the provided data, with blue indicating the probability of the accident’s cause occurring and yellow indicating the probability of not occurring.
Figure 5. Chart depicting the Bayesian network calculation results for causes of water traffic accidents.

4. Analysis of the Causal Chain of Waterborne Traffic Accidents

4.1. Identification of Key Factors in Waterborne Traffic Accidents

Identifying and analyzing the key factors are crucial for understanding the causes of waterborne traffic accidents, predicting accident risks, and formulating effective preventive measures and emergency response strategies. Based on the node probabilities presented in Figure 5, the top three factors with higher occurrence probabilities in each level were extracted and compiled into a table of key factors, as shown in Figure 6.
Figure 6. Top three causes with the highest occurrence probabilities at each level.
Based on Figure 6, at the organizational management level, inadequate safety management execution (A5), improper staffing (A1), and inadequate education and training (A2) were identified as the three core factors affecting waterborne traffic safety. In particular, the high incidence of inadequate safety management execution emphasizes a significant lack of safety management. Together, these three factors constitute potential root causes of waterborne traffic safety accidents, creating hidden dangers and increasing risks during ship operations.
At the unsafe-supervision level, inadequate equipment provisioning and maintenance (B1), a lack of supervision and guidance (B2), and improper navigation area selection and operational management (B4) are the three most problematic areas. These deficiencies not only have a direct impact on decision making and execution at the next level but also indirectly amplify safety risks regarding waterborne traffic.
At the preconditions-for-unsafe-behaviors level, insufficient safety awareness (C2); inadequate theoretical knowledge, work experience, skill levels, and other competencies (C1); and external factors such as excessive wind, waves, and currents (C7) were recognized as the primary factors inducing waterborne traffic accidents. These preconditions foster the occurrence of unsafe behaviors, increasing the likelihood of accidents.
At the unsafe-behavior level, improper lookout (D1), misjudgment of hazards (D3), and failure to take early action (D9) are the behaviors that most directly lead to accidents. These behaviors involve misjudgments and faulty decisions made by crew members during navigation, highlighting the importance of enhancing crew safety awareness and operational skills.

4.2. Sensitivity Analysis of Causes of Waterborne Traffic Accidents

Sensitive factors can be identified through sensitivity analysis of the Bayesian network. The aim of this analysis is to determine the degrees of influence of network parameters on the posterior probability of specific target nodes, thereby identifying the key factors contributing to various types of accidents. The core algorithm of Bayesian network sensitivity analysis involves calculating the differential of the target node’s posterior probability with respect to network parameters, thus quantifying the impact of parameter changes on the posterior probability. In GeNle, by setting the state of the target node and conducting sensitivity analysis, the importance of upper-level factors with respect to lower-level factors can be visually displayed. Combined with the calculation results for the sensitivity parameters, further quantitative analysis can be conducted.
Taking a windstorm accident as an example, we set E4 (windstorm accidents) in the accident layer as the target node and conducted a sensitivity analysis as shown in Figure 7. During this process, the colors of the Bayesian network nodes were changed to visually indicate the locations of sensitive factors. Red nodes represent the locations of relevant parameters, with darker shades indicating the higher sensitivity of the associated parameters. In Figure 7, it is evident that insufficient theoretical knowledge, work experience, skill levels, and other competencies (C1); excessive wind, waves, and currents (C7); and inadequate typhoon prevention measures (D12) are the factors most relevant to windstorm accidents. The causal strength of each factor needs to be combined with sensitivity parameters for quantitative assessment. GeNle provided the maximum, minimum, and average values of the sensitivity of each node’s relevant parameters. Among them, the maximum value is consistent with the shade of the associated node’s color, indicating that the maximum value of the sensitivity analysis is the primary criterion for assessing the sensitivity of associated nodes.
Figure 7. Sensitivity analysis network diagram for wind disaster accidents.
Through the sensitivity analysis of various factors contributing to waterborne traffic accidents, sensitivity parameter data were obtained, as shown in Figure 8, Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13, and Figure 14 lists the top three sensitivity factors for different types of accidents. These parameters reveal the relative importance of causal factors in different types of accidents.
Figure 8. Sensitivity analysis results for collision accidents.
Figure 9. Sensitivity analysis results for contact accidents.
Figure 10. Sensitivity analysis results for fire/explosion accidents.
Figure 11. Sensitivity analysis results for windstorm accidents.
Figure 12. Sensitivity analysis results for sinking accidents.
Figure 13. Sensitivity analysis results for other accidents.
Figure 14. Top three sensitivity factors for different types of accidents.
Based on Figure 8 and Figure 14, for collision accidents, the analysis results show that insufficient safety awareness (C2) has the largest absolute value for the sensitivity parameter, indicating that a crew’s level of safety awareness has the most significant impact on the occurrence of collision accidents. Therefore, enhancing crew safety awareness is of great significance for reducing collision accidents.
According to Figure 9 and Figure 14, for contact accidents, improper collision avoidance behavior (D4) has the largest absolute value for the sensitivity parameter among the causal factors. This result indicates that during navigation, proper collision avoidance behavior is crucial in preventing contact accidents.
Based on Figure 10 and Figure 14, for fire/explosion accidents, the absolute value of the sensitivity parameter for improper equipment use (D7) is the highest, indicating that improper use of equipment is one of the primary contributors to fire/explosion accidents. Therefore, enhancing the standardization and safety of equipment use is crucial for reducing fire/explosion accidents.
According to Figure 11 and Figure 14, for windstorm accidents, the absolute value of the sensitivity parameter for inadequate typhoon prevention measures (D12) is the highest, indicating that the adequacy of typhoon prevention measures taken by ship staff directly relates to the incidence of windstorm accidents under severe weather conditions such as typhoons. Therefore, strengthening the formulation and implementation of typhoon prevention measures for ships is crucial for reducing the risk of windstorm accidents.
As shown in Figure 12 and Figure 14, for sinking accidents, the absolute value of the sensitivity parameter for improper emergency measures (D11) is the highest, indicating that the adequacy of emergency measures taken by ship staff in emergency situations plays a decisive role in preventing sinking accidents. Therefore, improving crews’ emergency response capabilities and formulating effective emergency measures are important ways of reducing sinking accidents.
As shown in Figure 13 and Figure 14, for other types of accidents, insufficient safety awareness (C2) has the largest absolute value for the sensitivity parameter, which again emphasizes the universal importance of enhancing crews’ safety awareness for reducing waterborne traffic accidents.

4.3. Cause Paths of Waterborne Traffic Accidents

After deeply exploring the key and sensitive factors of waterborne traffic accidents, the next step was to construct the corresponding cause paths. Based on the results of the sensitivity analysis, a rigorous screening process was conducted for the causal factors in the chain analysis model of waterborne traffic accident causes. Specifically, the causal factors that had insignificant impacts on various accidents (i.e., those with node sensitivity parameters less than 0.01) were eliminated.
However, in the analysis of causal factors for contact accidents, a special situation was identified: the sensitivity of most causal factors was less than 0.01. To comprehensively and thoroughly analyze the cause chains of contact accidents, a special strategy was adopted: when constructing the cause paths, the three nodes with the highest sensitivity at each level were selected for focused analysis. The results are shown in Table 3 and Figure 15. In Figure 15, five different colors with numbers mean different categories of factor levels.
Table 3. Cause paths for different types of accidents.
Figure 15. Cause paths of different types of accidents.
As evident in Table 3 and Figure 15, the number of cause paths for various types of waterborne traffic accidents varies: there are 12 cause paths for collision accidents, while there are 3 for contact accidents, 6 for fire/explosion accidents, 2 for windstorm accidents, 4 for sinking accidents, and 7 for other types of accidents.
The cause paths of collision accidents show complex diversity, encompassing the intertwined influences of human errors (such as improper lookout and operation) and environmental factors (such as insufficient visibility and complex current conditions). In contrast, the cause paths of contact accidents are relatively concise, centered on improper collision avoidance behavior while also being influenced by certain environmental factors. The cause paths of fire/explosion accidents reveal potential risks across multiple links, ranging from equipment failures to improper human operations, demonstrating these accidents’ unique complexity. The cause paths of windstorm accidents are closely related to extreme weather conditions, especially insufficient wind resistance capabilities of ships under severe weather conditions. The cause paths of sinking accidents involve improper cargo stowage and ship operational performance issues. The cause paths of other accident categories are more extensive and complex, reflecting the diversity and complexity of the causes of waterborne traffic accidents.

4.4. Global Causal Chain Analysis of Waterborne Traffic Accidents

Using the causal reasoning method of Bayesian networks, we analyzed the global causal chains of the causal factors of waterborne traffic accidents. Via the causal reasoning method, the target node (i.e., the occurrence status of different accident types) was set as a known condition, and the posterior probabilities of evidence nodes were calculated to infer the possible causes of an accident.
Taking windstorm accidents (E4) as an example, the occurrence status of the target node was set to 1, and the posterior probabilities of its four parent nodes—failure to travel at a safe speed, improper lookout, improper collision avoidance behavior, and inadequate typhoon prevention measures—were derived. The results show that the occurrence probability for inadequate typhoon prevention measures (D12) was the highest, reaching 69%. Further analysis of the three other parent nodes of inadequate typhoon prevention measures—a lack of theoretical knowledge/work experience/skills/other abilities, insufficient safety awareness, and excessive wind/waves/currents—revealed that the posterior probability of excessive wind, waves, and currents (C7) was relatively high. Using the same method, a global causal chain analysis was conducted for other types of accidents, yielding the global causal chains for all the accident types, as shown in Table 4.
Table 4. Global causal chains for different types of accidents.
The results given above have the following implications.
(1)
Collision accidents occur frequently due to a complex array of causes. These include improper staffing; improper selection of navigation areas/improper operation and management; insufficient theoretical knowledge, work experience, skill levels, and other abilities; and improper lookout. Measures including enhancing crew safety training, optimizing navigation environments, improving crew management systems, and enhancing vessel lookout capabilities are essential to prevent collision accidents.
(2)
Contact accidents are mainly caused by inadequate implementation of safety management, a lack of supervision and guidance, insufficient safety awareness, and improper lookout. Based on these results, strengthening crew training (especially regarding the standardized process of safety management), enhancing supervision normalization, and enhancing vessel lookout capabilities are crucial to reduce the probability of contact accidents.
(3)
Fire/explosion accidents often result from inadequate implementation of safety management, improper equipment allocation and maintenance, insufficient safety awareness, improper use of equipment, etc. To prevent these accidents, regular inspections and maintenance of fire-fighting and explosion-proof equipment are essential. Additionally, strict fire source management systems must be implemented.
(4)
Windstorm accidents frequently occur due to crew members’ lack of theoretical knowledge, work experience, and skills, leading to ineffective supervision, guidance, and inadequate plans under severe weather conditions. Hence, improving crew competence, enhancing decision making under extreme conditions, and implementing scientific navigation plans are crucial for safe navigation during wind disasters.
(5)
Sinking accidents are mainly caused by improper staffing; improper selection of navigation areas/improper operation and management; insufficient theoretical knowledge, work experience, skill levels, etc.; and misjudgment of danger. To prevent these accidents, crew safety training should be enhanced, navigation environments should be optimized, and the supervision of ship maintenance should be strengthened. Other accidents often result from various combinations of factors, such as inadequate education/training, leading to ineffective safety management and improper equipment handling.
Furthermore, among all kinds of the accidents investigated in this paper, human errors such as improper lookout and misjudgment of hazards are influential factors which may lead to the occurrence of the waterborne accidents.

5. Conclusions

Water traffic accidents not only result in casualties and losses but also pollute waterways and damage ecosystems. In this study, we focused on investigating the causal chains of waterborne traffic accidents in a data-driven context, delving into their underlying factors and chain reactions. Based on China’s nationwide investigative reports on waterborne traffic accidents, we systematically extracted the causal factors leading to accidents and constructed a hybrid framework based on an improved HFACS and a Bayesian network. Through quantitative analyses, including chi-square tests and sensitivity analyses, the causal chains of different types of accidents were thoroughly explored, revealing the fundamental mechanisms behind accident occurrences. The results indicate that there are 12, 3, 6, 2, 4, and 7 causal chains leading to collisions, contact, fires/explosions, windstorm accidents, sinking, and other types of accidents, respectively. These research results can serve as a reference for enhancing the safety of waterborne transportation.
Future research will focus on the following: (1) comparative analysis with different maritime regions, so as to explore the similarities and differences in causal chains across different maritime environments; (2) the application of new technologies, such as artificial intelligence, to deepen our understanding of the causal chains of waterborne traffic accidents. More specifically, with the advent of MASs (Maritime Autonomous Ships), advanced sensors and algorithms have significant potential to reduce human errors, in turn enhancing the safety and reliability of water transportation. Additionally, MASs, equipped with AI and machine learning, exhibit improved emergency response capabilities, enabling autonomous judgment and timely action in critical situations. However, as MASs evolve, it is imperative to strengthen regulation and legal frameworks and establish safety standards, operational norms, and robust monitoring mechanisms to ensure their safe and reliable operation. We aim to explore these technological advancements in accident prevention, monitoring, and emergency responses, thereby driving the development of water transportation safety management towards greater intelligence and automation.

Author Contributions

Conceptualization, X.Y.; Methodology, Q.X.; Software, Y.Y.; Formal analysis, J.W.; Data curation, Q.W.; Writing—original draft, X.Y.; Writing—review & editing, H.Z. and Q.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of China (52362055), Guangxi Science, and the Technology Major Program of China (Grant No. AA23062053).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare they have no conflicts of interest.

References

  1. Formela, K.; Weintrit, A.; Neumann, T. Overview of definitions of maritime safety, safety at sea, navigational safety and safety in general. Int. J. Mar. Navig. Saf. Sea Transp. 2019, 13, 285–290. [Google Scholar] [CrossRef]
  2. Peng, Z.; Jiang, Z.; Chu, X.; Ying, J. Spatiotemporal Distribution and Evolution Characteristics of Water Traffic Accidents in Asia since the 21st Century. J. Mar. Sci. Eng. 2023, 11, 2112. [Google Scholar] [CrossRef]
  3. Wu, B.; Kou, L.; Ma, Q. Research on HFACS based on accident causality diagram. Open J. Saf. Sci. Technol. 2017, 7, 77–85. [Google Scholar] [CrossRef][Green Version]
  4. Zhang, G.; Thai, V.V. Expert elicitation and Bayesian Network modeling for shipping accidents: A literature review. Saf. Sci. 2016, 87, 53–62. [Google Scholar] [CrossRef]
  5. Wang, H.; Liu, Z.; Wang, X.; Graham, T.; Wang, J. An analysis of factors affecting the severity of marine accidents. Reliab. Eng. Syst. Saf. 2021, 210, 107513. [Google Scholar] [CrossRef]
  6. Liu, K.; Yu, Q.; Yuan, Z.; Yang, Z.; Shu, Y. A systematic analysis for maritime accidents causation in Chinese coastal waters using machine learning approaches. Ocean Coast. Manag. 2021, 213, 105859. [Google Scholar] [CrossRef]
  7. Sun, J.; Li, M.; Xiu, X. Analysis of the Causes of Water Transportation Accidents Based on Complex Network Theory. J. Dalian Marit. Univ. 2023, 49, 80–90+160. [Google Scholar]
  8. Wu, Y.; Jiang, F.; Yao, H.; Huang, M.; Ma, Q. Analysis of Causal Factors and Risk Prediction for Inland River Ship Collision Accidents Based on Text Mining. J. Transp. Inf. Saf. 2018, 36, 8–18. [Google Scholar]
  9. Wang, H.; Liu, Z.; Liu, Z.; Wang, X.; Wang, J. GIS-based analysis on the spatial patterns of global maritime accidents. Ocean Eng. 2022, 245, 110569. [Google Scholar] [CrossRef]
  10. Namgung, H.; Kim, J.S. Collision risk inference system for maritime autonomous surface ships using COLREGs rules compliant collision avoidance. IEEE Access 2021, 9, 7823–7835. [Google Scholar] [CrossRef]
  11. Kaptan, M.; Sarıalioğlu, S.; Uğurlu, Ö.; Wang, J. The evolution of the HFACS method used in analysis of marine accidents: A review. Int. J. Ind. Ergon. 2021, 86, 103225. [Google Scholar] [CrossRef]
  12. Yildiz, S.; Uğurlu, Ö.; Wang, J.; Loughney, S. Application of the HFACS-PV approach for identification of human and organizational factors (HOFs) influencing marine accidents. Reliab. Eng. Syst. Saf. 2021, 208, 107395. [Google Scholar] [CrossRef]
  13. Wang, Q.; Sha, Z.; Zhang, J.; Ma, J. Human Factor Analysis of Ship Grounding Accidents Based on the HFACS-FCMs Model. J. Shandong Jiaotong Univ. 2024, 32, 103–109+123. [Google Scholar]
  14. Yıldırım, U.; Başar, E.; Uğurlu, Ö. Assessment of collisions and grounding accidents with human factors analysis and classification system (HFACS) and statistical methods. Saf. Sci. 2019, 119, 412–425. [Google Scholar] [CrossRef]
  15. Huang, H. Analysis and Application of Human Factors in Marine Traffic Based on the HFACS Method. Shandong Ind. Technol. 2018, 05, 216–218+238. [Google Scholar]
  16. Chen, S.; Wall, A.; Davies, P.; Yang, Z.; Wang, J.; Chou, Y.H. A Human and Organizational Factors (HOFs) analysis method for marine casualties using HFACS-Maritime Accidents (HFACS-MA). Saf. Sci. 2013, 60, 105–114. [Google Scholar] [CrossRef]
  17. Chauvin, C.; Lardjane, S.; Morel, G.; Clostermann, J.P.; Langard, B. Human and organizational factors in maritime accidents: Analysis of collisions at sea using the HFACS. Accid. Anal. Prev. 2013, 59, 26–37. [Google Scholar] [CrossRef] [PubMed]
  18. Fan, S.; Blanco-Davis, E.; Yang, Z.; Zhang, J.; Yan, X. Incorporation of human factors into maritime accident analysis using a data-driven Bayesian network. Reliab. Eng. Syst. Saf. 2020, 203, 107070. [Google Scholar] [CrossRef]
  19. Antão, P.; Soares, C.G. Analysis of the influence of human errors on the occurrence of coastal ship accidents in different wave conditions using Bayesian Belief Networks. Accid. Anal. Prev. 2019, 133, 105262. [Google Scholar] [CrossRef]
  20. Wang, L.; Yang, Z. Bayesian network modelling and analysis of accident severity in waterborne transportation: A case study in China. Reliab. Eng. Syst. Saf. 2018, 180, 277–289. [Google Scholar] [CrossRef]
  21. Meng, X.; Li, H.; Zhang, W.; Zhou, X.Y.; Yang, X. Analyzing risk influencing factors of ship collision accidents: A data-driven Bayesian network model integrating physical knowledge. Ocean Coast. Manag. 2024, 256, 107311. [Google Scholar] [CrossRef]
  22. Fan, S.; Yang, Z.; Blanco-Davis, E.; Zhang, J.; Yan, X. Analysis of maritime transport accidents using Bayesian networks. Proc. Inst. Mech. Eng. Part O J. Risk Reliab. 2020, 234, 439–454. [Google Scholar] [CrossRef]
  23. Tian, Y.; Qiao, H.; Hua, L.; Yan, S.; Zhang, Q. Bayesian Network Model for Maritime Ship Collisions and Its Application. J. Nav. Univ. Eng. 2023, 35, 28–33. [Google Scholar]
  24. Hänninen, M.; Kujala, P. The effects of causation probability on the ship collision statistics in the Gulf of Finland. Mar. Navig. Saf. Sea Transp. 2010, 4, 79–84. [Google Scholar]
  25. Meng, X.; Li, H.; Zhang, W.; Zhou, X.Y.; Yang, X. Analyzing ship collision accidents in China: A framework based on the NK model and Bayesian networks. Ocean Eng. 2024, 309, 118619. [Google Scholar] [CrossRef]
  26. Wang, H.; Jin, G. Cause analysis of marine accidents based on HFACS and Bayesian networks. In Proceedings of the International Conference on Smart Transportation and City Engineering, Chongqing, China, 16–18 December 2023; Volume 13018, pp. 1022–1030. [Google Scholar]
  27. Rostamabadi, A.; Jahangiri, M.; Zarei, E.; Kamalinia, M.; Banaee, S.; Samaei, M.R. A novel fuzzy bayesian network-HFACS (FBN-HFACS) model for analyzing human and organization factors (HOFs) in process accidents. Process Saf. Environ. Prot. 2019, 132, 59–72. [Google Scholar] [CrossRef]
  28. Jiang, Y.; Wan, Z.; Chen, J. Path Analysis of the Causes of Coastal Water Transportation Accidents in China. J. Dalian Marit. Univ. 2024, 50, 76–84. [Google Scholar]
  29. Li, Y.; Cheng, Z.; Yi, T.; Fan, X.; Wu, B. Use of HFACS and Bayesian network for human and organizational factors analysis of ship collision accidents in the Yangtze River. Marit. Policy Manag. 2022, 49, 1169–1183. [Google Scholar] [CrossRef]
  30. Wang, H.; Chen, N.; Wu, B.; Soares, C.G. Human and organizational factors analysis of collision accidents between merchant ships and fishing vessels based on HFACS-BN model. Reliab. Eng. Syst. Saf. 2024, 249, 110201. [Google Scholar] [CrossRef]
  31. Özkan, U.; Serdar, Y.; Sean, L.; Wang, J.; Kuntchulia, S.; Sharabidze, I. Analyzing Collision, Grounding, and Sinking Accidents Occurring in the Black Sea Utilizing HFACS and Bayesian Networks. Risk Anal. Off. Publ. Soc. Risk Anal. 2020, 40, 2610–2638. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.