Probabilistic Modeling of Maritime Accident Scenarios Leveraging Bayesian Network Techniques

: This paper introduces a scenario evolution model for maritime accidents, wherein Bayesian networks (BNs) were employed to predict the most probable causes of distinct types of maritime incidents. The BN nodes encompass factors such as accident type, life loss contingency, accident severity, quarter and time period of the accident, and type and gross tonnage of the involved ships. An analysis of 5660 global maritime accidents spanning the years 2005 to 2020 was conducted. Using Netica software, a tree augmented network (TAN) model was constructed, thus accounting for interdependencies among risk-influencing factors. To confirm these results, a validation process involving sensitivity analysis and historical accident records was performed. Following this, both forward causal inference and reverse diagnostic inference were carried out on each node variable to scrutinize the accident development trend and evolution process under preset conditions. The findings suggest that the model was competent in effectively predicting the likelihood of various accident scenarios under specific conditions, as well as extrapolating accident consequences. Forward causal reasoning unveiled that general cargo ships with a gross tonnage of 1–18,500 t were most prone to experiencing collision and stranding/grounding accidents in the first quarter. Reverse diagnostic reasoning indicated that, in the early morning hours, container ships, general cargo ships, and chemical ships with a tonnage of 1–18,500 t were less likely to involve life loss in the event of collision accidents.


Introduction
Intricately variable and multifaceted climatic conditions, endemic to an expansive marine environment, have perpetually underscored maritime transport as a vocation of considerable risk. Incidents disrupting maritime transit, which encompass a spectrum of occurrences from vessel collisions to groundings, and from onboard fires to devastating explosions, bear the hallmark of low frequency yet are marked by their profoundly destructive aftermath [1]. Upon the unfortunate manifestation of a maritime accident, a tsunami of undesirable outcomes typically ensues, including, notably, substantial financial loss, a tragic toll of human casualties, or even the insidious onset of extensive environmental pollution [2]. As an imperative and fundamental step toward mitigating the inherent perils of maritime navigation, a comprehensive exploration into the multifactorial etiology of these traffic mishaps proves indispensable [3][4][5]. Moreover, it becomes increasingly pivotal to meticulously undertake a dynamic risk evaluation, focusing on the myriad facets of maritime operations [6][7][8]. Complementing this, the development and implementation of robust predictive models, which can potentially forecast the calamitous conjunction of human fatality and its probability in the event of an accident, could contribute significantly toward minimizing future maritime disasters [9][10][11][12].
Over the years, an impressive corpus of scholarly efforts has been devoted to enhancing our understanding of maritime traffic safety, including explorations into accident causation analysis [3][4][5]13], accident consequence assessment [14][15][16], and accident loss computation [17,18]. These endeavors have given rise to an array of innovative evaluation methodologies. In one notable study, Hu et al. [3] skillfully harnessed the capabilities of the Human Failure Analysis and Classification System (HFACS) in tandem with structural equation modeling (SEM) to disentangle an intricate web of causal factors underpinning marine traffic accidents (MTAs). Chou et al. [4], in a synergistic integration of technologies, amalgamated the Automatic Identification System, Geographic Information System, and an electronic chart (e-chart) to scrutinize the interplay between environmental factors, geographical locations, and the common causes of marine mishaps. By overlaying vessel traffic flows, accident sites, and environmental data on a shared e-chart, their research unfurled valuable insights into port authorities when streamlining ship traffic flow and curtailing the prevalence of marine accidents in the vicinity of ports. Meanwhile, Xue et al. [5] proffered a comprehensive analytical framework for investigating the peculiarities and causative factors of ship accidents, utilizing a decade's worth of historical data that were harvested from the capriciously fluctuating backwater expanse of the Three Gorges Reservoir region. Their extensive work yielded a thorough summary and visualization of vessel accident categories and severity, involved vessel types, spatial-temporal distribution characteristics, and vessel accident loss, along with the underlying causes and lessons gleaned from pertinent accidents that were achieved through a rigorous statistical and comparative analysis of historical data. Elsewhere, Fu et al. [13] engineered a bivariate probit model to delve into an array of 311 Arctic ship accidents spanning from 1998 to 2017. Their study brought to the fore influential factors such as gross tonnage, ship type, ship age, accident type, accident year, accident location, wind, and sea ice as the primary contributors to accident severity. Simultaneously, their research unveiled an intriguing negative correlation between serious accidents and those resulting in pollution. As research on maritime traffic accidents has illuminated a gamut of potential causative factors, the increased granularity of available accident data has spurred a growing number of scholars to concentrate on the ramifications of these mishaps, specifically on the evaluation of accident consequences and loss computation. Such undertakings have risen to prominence, particularly in the eyes of managers concerned with incidents that yield significant economic damage and human casualties. For instance, Chen et al. [15] presented an evidence-based Fuzzy Bayesian network methodology to erect probabilistic models of marine accidents, thereby enabling the appraisal of accidents that were likely to spawn severe consequences. In a similar vein, Ventikos and Giannopoulos [16] introduced a criterion to assess the risks and repercussions within the maritime transport sector from a societal perspective, thereby formulating a novel framework for the marine risk assessment, which facilitated a comparison of disparate accident scales and characteristics, while accurately mirroring the risk threshold society was prepared to tolerate. Chen et al. [17] pioneered an enhanced entropy weight-TOPSIS model to furnish a holistic analysis and appraisal of the marine total loss incidents, encompassing a global scope from 1998 to 2018. These studies, though highly impactful, predominantly undertake analyses either from the standpoint of accident causation or the evaluation of accident consequences. Rarely do these scholarly pursuits straddle both domains in a bidirectional inquiry.
In the realm of accident scenario analysis, methodological constructs like event tree analysis and accident tree analysis are frequently utilized in the assembly of traffic accident scenario evolution models [19][20][21][22][23]. However, the breadth of most accident cause analyses often overshadows their specificity, impeding their ability to yield targeted recommendations to forestall analogous events [24]. To bridge this gap, scholars could employ a Bayesian network-based maritime accident scenario modeling approach. Bayesian networks stand as a form of a probabilistic graphical model, which is deftly equipped to encapsulate and deliberate over uncertain knowledge and nebulous relationships among variables. This versatile modeling approach, designed to embrace the labyrinthine and dynamic character of maritime activities, excels at discerning the contributory factors that precipitate maritime accidents [3,5]. Employing a synergistic blend of historical data and expert acumen, this model could approximate both the likelihood of an accident's occurrence and the potential fallout arising from a range of accident scenarios [14,16]. Bayesian networks (BN) find broad application in confronting uncertain multi-factor causality inference, accident causation analysis, and scenario prediction, making them invaluable tools in road and waterway transportation sectors [25][26][27][28][29][30][31][32]. Various scholars have employed these tools in diverse studies: Zou and Yue [33] melded the probabilistic risk analysis with the BN theory to explore the origins of road traffic accidents; Yuan et al. [34] constructed a scenario-derived prediction model for the repercussions of fire accidents in oil and gas storage and transportation emergency processes, leveraging a defuzzification method and a dynamic BN model. Other researchers, such as Zhao et al. [35], have used the ISM-BN model to assess the impact of varying factors on maritime safety, successfully pinpointing the critical risk components for different accident types. Afenyo et al. [36] utilized a BN model to sketch an Arctic shipping accident scenario and illuminated the crucial causative elements of a potential accident scenario. Similarly, Jiang et al. [37] proposed a Bayesian network-based risk analysis strategy to evaluate maritime accidents along the 21st-century Maritime Silk Road (MSR), identifying the principal influencing factors that could bolster accident prevention measures and ensure maritime transportation safety and sustainability. In a more focused study, Si et al. [38] employed a BN structure learning algorithm that paired the kernel density estimation with a model weighted average strategy to dissect the causative elements of container ship collisions, basing their analysis on a limited set of container ship collision sample data. Other studies like Fan et al. [39] and Hänninen et al. [40] proposed similar Bayesian network-based risk analysis approaches to understand the contributing factors to maritime transport accidents, with the latter focusing more on maritime safety management and its relationship with maritime traffic safety. Despite these successes, these aforementioned studies suffer from a triad of limitations: (1) a paucity of sample data from maritime accidents, (2) a labor-intensive and time-consuming data collection process, and (3) the inherent difficulty of obtaining accident loss records. Summarily, while waterway transportation research has honed its focus on accident causality reasoning and accident causation analysis, there remains a conspicuous void in the research landscape pertaining to accident scenario modeling.
In light of this, this paper aimed to build a BN model for the evolution of maritime accident scenarios using global maritime accident data. These data derived from the Global Integrated Shipping Information System (GISIS) and established by the International Maritime Organization (IMO) have been widely used by scholars in maritime accident studies [41][42][43][44][45][46][47]. The novelty of this research lies in the use of a BN-based approach to model maritime traffic accident scenarios. This is a unique method of analyzing the causes of maritime traffic accidents through performing dynamic risk assessments on shipping activities and predicting the probability of accident occurrence and its consequences. This innovative approach enables the identification and simulation of influencing factors across a range of accident scenarios, providing an intricate understanding of the complexities associated with maritime traffic accidents.
This study provides comprehensive analysis and valuable insights into 5660 global maritime accidents from 2005 to 2020. The accident data were well sampled, non-manually collected, open, and, more importantly, provided a high number of data fields in relation to accident losses. This made it possible to compensate for data limitations that have existed in previous studies. This study had two main contributions. First, a tree augmented network (TAN) model was developed to construct BN and train the data, and a data-driven BN-based method was proposed that could effectively predict the probability and consequences of accidents. Second, the proposed model was able to predict the causal factors that were most likely to lead to specific accident consequences; this could help maritime stakeholders implement effective preventive measures to improve maritime transportation safety.
The rest of this article is structured as follows. Section 2 briefly introduces the structure and construction method of BN and further introduces the method TAN driven by the data. Section 3 builds the TAN model based on the data of 5660 maritime accidents and carries out sensitivity analysis and simulation verification on the built model. Section 4 uses the two-way reasoning ability of the TAN model to predict the accident chain and analyze the accident causes. Finally, the fifth part summarizes the full text.

BN Structure Learning-TAN
BN is a directed acyclic graph (DAG) that is composed of nodes and directed edges and is widely employed to illustrate the interdependence and strength of associations between variables. As shown in Figure 1 denotes the set of nodes in the network, ∈ denotes the random variable in the domain of the definition of this restriction, and denotes a set of directed edges in this network. The network represents the interrelationship between variables through vectorial arcs, with the intensity of each association specified by a table of conditional probabilities. There are two primary approaches to the generation of BN structures: (1) the expert knowledge method and (2) the data-driven method. In the expert knowledge method, the BN structure was built by subjectively evaluating the causal relationships between variables. Conversely, the data-driven method was employed to uncover the interdependence between variables based on the learning algorithm of the BN model and data correlations. In this study, since sufficient sample data were collected, the data-driven method was used to construct the BN structure.
Data-driven Bayesian approaches could be classified into three main categories: (1) the naive Bayesian network (NBN), (2) the augmented naive Bayesian network (ABN), and (3) the tree augmented network (TAN). Among these, TAN learning effectively combined the simplicity and robustness of NBN computation with its ability to characterize interaction dependencies among variables, thus providing insights into the key factors leading to the outcomes of specific accidents [48]. Therefore, this paper employed the datadriven TAN approach to construct the BN structure.
BN encodes the joint probability distribution over a set of random variables . We let = { 1 , ⋯ , }, where denoted the number of influencing factors, where 1 , ⋯ represent the influencing factors, and is a class variable (accident type). It was established that the set of parent nodes of in was empty, meaning ∏ = ∅. Moreover, ∏ had at most one other node in addition to that could have an associated edge pointing to it. The joint probability density distribution adhered to the following equation: In the process of learning the TAN structure, Chow and Liu [49] proposed an approach to optimize and construct the BN structure using the conditional mutual information of each attribute pair. This function was defined as: where denotes the conditional mutual information; is the i-th state of the influencing factor ; and is the i-th state of the influencing factor .

Data Collection
This paper utilized the Marine Casualties and Incidents (MCI) database in GISIS, which is managed by IMO [50]. GISIS is a comprehensive, global maritime information system. In accordance with IMO regulations, every country with sovereignty over its territorial sea is required to report maritime accidents that occur within its waters to the IMO. The MCI database contains two types of information in relation to global maritime accidents: first, the factual data gathered from various sources, and second, detailed data obtained from casualty investigation reports submitted to the IMO.
The MCI database houses global maritime accident data dating back to 1973. Between 1973 and 2000, the annual number of recorded maritime accidents was quite limited. From 2001 onward, the number of accidents documented in the MCI database has been more consistent. However, the accident timestamps during 2001-2004 are only accurate to the day, which is not sufficient for studying the specific time periods in which these accidents occurred. Consequently, low-quality data from the early years have been excluded, and a total of 5660 maritime accidents recorded from 2005 to 2020 were utilized to construct the BN model.

Node Variable Definitions
Based on the literature's studies on maritime accident factor analysis [1,43,51,52], there are 16 primary factors that contribute to maritime accidents, including the ship type, hull type, ship's age, length, gross tonnage, operation, voyage segment, ship's speed, condition, equipment or device condition, ship's design, interaction information, weather conditions, ocean conditions, time period, and channel traffic condition. Combining these factors with the information available in the MCI database, seven node variables for the BN model were selected; these included the accident quarter, accident period, accident type, ship type involved, total tonnage of the ship involved, life loss contingency, and accident severity.
Given the requirement of discrete variables for BN nodes, it was necessary to discretize continuous variables in the accident statistics. The division of accident occurrence quarters into the first quarter (January, February, and March), second quarter (April, May, and June), third quarter (July, August, and September), and fourth quarter (October, November, and December) were conducted. The categorization of accident periods was made during dawn (0:00-5:59), early morning (6:00-8:59), morning (9:00-11:59), noon (12:00-13:59), afternoon (14:00-16:59), early evening (17:00-19:59), and evening (20:00-23:59). To discretize the gross tonnage of the ships involved, the collected data and the centroid clustering (CC) algorithm were utilized for their classification. The CC algorithm, which uses the minimization error sum of squares as the objective function, was employed and terminated when the number of iterations reached a preset maximum of 5000 iterations. The optimal classification results yielded four groups based on the gross tonnage of the ships involved: (1-18,500 t), (18,501-57,500 t), (57,501-120,000 t), and (120,001-403,342 t). Among these, 403,342 t represented the maximum total tonnage of the ships involved in the collected data.
Furthermore, in this paper, we classified non-routine accidents, such as missing ships, life-saving equipment accidents, and numerous accident types with irregular or rare records, accounting for no more than 5% as "others" [53]. Multipurpose ships, tugboats, supply, and offshore vessels, unspecified ship types, and other ship types represented no more than 10% were categorized as "others" [53]. Table 1 presents the names, classifications, frequency of occurrence, and percentages of each discrete variable category. Specifically, the "quarter of accident" is a variable divided into four categories, corresponding to the four quarters of the year. Category "a" represents accidents that occurred in the first quarter (January to March), with a frequency of 1539, accounting for 27.19% of the total occurrences. Similarly, "b" is for the second quarter (April to June), "c" for the third quarter (July to September), and "d" for the fourth quarter (October to December), each with their respective frequencies and percentages. The "ship type" is a variable that has seven categories. For instance, "a" represents general cargo ships, which were involved in accidents 989 times, making up 17.47% of the total occurrences. "b" stands for bulk carriers, "c" for container ships, "d" for chemical tankers/oil tankers, "e" for passenger ships, "f" for fishing ships, and "g" for others. Each category has its corresponding frequencies and percentages of occurrence. The "accident type" is a variable that categorizes the types of accidents that occur. For example, category "a" denotes collisions, which occurred 1016 times, representing 17.95% of the total accidents. Similarly, "b" stands for stranding/grounding, "c" for fire/explosions, "d" for capsize, "e" for machinery damage, "f" for contact, and "g" for others, each with their respective frequencies and percentages. Each of the remaining variables in Table 1 followed a similar pattern, wherein specific categories were defined for each variable, along with their frequency of occurrence and corresponding percentages.

TAN Modeling
Based on the data processing results, the relationship between the six influencing factors and accident consequences was examined. Netica software with a "learning network" function was employed to develop a TAN model, which was grounded on Equation (2), ensuring that all connections between nodes were meaningful. The BN qualitative structure was trained by data, followed by a rigorous review conducted by domain experts to confirm the significance of the links between these nodes. In this study, no changes were made during the finetuning process, as all the interrelationships suggested by the data were in alignment with reality. The initial structure of TAN, which is depicted in Figure 2, was based on the data-driven TAN training results that showcased the realistic correlations between variables. The numbers depicted in Figure 2 represent the initial results of the TAN model. For instance, if the type of accident is divided into seven categories, the initial proportion of each category after initialization is approximately 14.3%. Therefore, the sum of the proportions for all categories would equate to 100%. This explanation is applicable to all other variables depicted in Figure 2 as well.
Utilizing the TAN model, Netica software employed basis functions to create a structure learning module and a parameter learning module, which automatically learned the conditional probability table (CPT) parameters from the sample dataset. The construction of TAN and the obtainment of CPT facilitated the calculation of the posterior probability of each variable. The statistical results of these probabilistic variables were instrumental in the analysis of maritime safety considerations and the facilitation of accident prevention. Figure 3 presents the TAN results for the random variables of interest.

Sensitivity Analysis
In the Netica software, the accident type was selected as the target node, and sensitivity analysis on this node was conducted to identify the factors with the greatest influence on the target node within the TAN model.
The mutual information value represents the sensitivity level between two random variables; a higher value indicates the greater sensitivity of the influencing factor to the target node and, conversely, its lower sensitivity. The sensitivity analysis function in Netica software was used to calculate the mutual information value, percentage, and variance for each influencing factor and accident type, as displayed in Table 2. According to Table 2, the accident consequence and accident severity were the factors most sensitive to the accident type performance, with mutual information values of 0.14246 and 0.14033, respectively; these were notably higher than those of the other four factors. These results revealed how accident consequence and accident severity were the two most intuitive factors for determining the type of accident, followed by ship type, gross tonnage of the ship, time period, and quarter. Concerning the pivotal factors impacting a variety of accident types, the subsequent step entailed a discernment of how these factors, or the states of these factors, influence the intended accident category. This was conducted by calculating the joint probability of each factor and the "accident category", as depicted in Table 3. Table 3, the state of each factor that exerted the most significant impact on an accident category is disclosed (in bold value). For instance, in the "life loss contingency" category, when in the state "life loss", the highest likelihood was for accident type "collision" (22.2%), whereas in the state "no life loss", there was the lowest probability to be "collision" (7.9%). In the "severity of accident" category, state "serious accidents" demonstrated the highest probability for accident type "stranding/grounding" (23.5%), while state "particularly serious accidents" exhibited the lowest probability for accident type "stranding/grounding" (7.09%). Looking at the "ship type", type "bulk carrier" showed the highest likelihood for accident type "collision" (25.9%), whereas type "fishing ship" indicated the lowest probability for accident type "collision" (8.53%), but had the highest probability to be in "fire/explosion" (21.9%) and "capsize" (12.4%). Although "gross tonnage" and "quarter of accident" showed little difference in the probability of influencing "accident type", the probability of "collision" was the highest. In addition, both "collision" and "capsize" showed the highest probability at nighttime. This analysis underscores the influence that a particular state had on a single factor in an accident category. Additionally, it demonstrates how different states of a single factor contributed to the probability of a specific accident category. Generally, more attention should be paid to those conditions that display high probabilities of accidents due to the state of the single factor under an accident type.

Model Validation
To validate the effectiveness of the TAN model, three offshore accident cases from 2021 were randomly selected, each with varying accident consequences and severities, and were labeled as events 1, 2, and 3. The case data were input into the model for scenario analysis, and Table 4 presents the relevant data information for these accident cases. Based on the data from three randomly selected events, the probability of the known nodes, such as the quarter, time period, vessel type, accident consequence, accident severity, and gross tonnage of the vessel, was set to 100%. The types and probabilities of the predicted accidents were then observed. As illustrated in Table 4, the probability of other accident types occurring in event 1 was 75.1%; the probability of stranding/grounding in event 2 was 38.0%; and the probability of collision in event 3 was 44.4%. When compared to the original data's accident types, the predicted accident types for the three events matched, indicating that the model's predictions were accurate to some extent. Since the occurrence probability of other accident types in the original data was significantly higher than that of collision and stranding/grounding, the data-driven TAN model's simulation results demonstrated better performance in predicting the occurrence probability of other accident types (e.g., Figure 4a) and for average results in predicting collision and stranding/grounding accidents (e.g., Figure 4b,c).

Results and Discussion
The TAN model has the ability to reason bidirectionally and can help explain the most likely scenarios that are associated with a specific accident type. The data-driven TAN-based model examined the correlations between various influencing factors of maritime accidents and accident types, as well as accident consequences. This analysis enabled the prediction of the likelihood of various accident scenarios and the extrapolation of accident consequences under specific conditions.

Accident Chain Forecast
After the manipulation of the TAN model using Netica software, the relationships between the influencing factors and the accident type, including life loss contingency and the probability of each node, were obtained. By adjusting the placement bar of a single node or multiple nodes, the target node's probability trend was observed; therefore, a judgment could be formed of the potential trends and consequences of the accident.
The parameters of the conditions for maritime accidents were first simulated by changing a single node and observing the changes in the target node. When changing the ship type, more significant changes occurred in the probability of each accident type. For example, when the ship type was set to a chemical ship, the probability of a fire/explosion accident type increased significantly. When the ship type was a bulk carrier, the probability of the collision accident type increased notably. This study showed that different ship types could lead to significant differences in the occurrence of accident types. Additionally, the ship's gross tonnage and the accident's quarter and time also impacted the accident type.
Since the accident type was influenced by the joint decision of several nodes, the influence of a single node on the accident type was more one-sided. Therefore, the accident quarter was set to the first quarter (with the variable node's confidence bar set to 100%), the ship type was set to a general cargo ship, and the gross tonnage to 1-18,500 t, as shown in Figure 5. The change in the accident type and accident severity node probability from the early morning to evening is shown in Figure 6. As seen in Figure 6, among the types of maritime accidents throughout the day, the probability of a fire/explosion on ships was low, except for the afternoon time period, which was 18.2%; the probabilities of capsize, machinery damage, and contact were also low below 10%. Among other accidents, the probabilities of ship collision and stranding/grounding accidents were significantly higher at around 20%. Additionally, it was observed that the occurrence probability of stranding/grounding accidents was significantly higher during the dawn and evening than in other periods. Figure 5. Analysis of the accident chain with the characteristics "first quarter", "general cargo", and "gross ship tonnage set to 1-18,500 t". Unlike previous studies, this paper focused specifically on the question of whether or not the consequences of an accident could involve a loss of human life when an accident occurred under this scenario. As shown in Figure 7, the change in the probability of the "life loss contingency" node from the early morning to the evening showed that the probability of an accident consequence that did not involve loss of life was much higher than the probability of an accident consequence that involved loss of life throughout the day in this scenario. The results of the study indicate that the probability of potential loss of life is low for all accident types in this scenario. Therefore, accidents involving collisions and fires/explosions do not necessarily result in loss of life outcomes either and may need to be combined with additional accident causation in order to obtain more reliable conclusions. In summary, the highest probabilities of collision and stranding/grounding occurred at dawn, with the accident severity for this appearing particularly serious. A collision was most likely to occur at noon, with high accident severity. Particularly severe collision and stranding/grounding accidents were more likely to occur in the evening. It is worth mentioning that although the probabilities of collision and stranding/grounding of ships were higher in this scenario, the probability of life loss was relatively low, and the accident consequences were less affected by the time of the accidents.

Accident Cause Analysis
In this study, the bidirectional reasoning of the TAN model was employed, allowing for both causal reasoning and diagnostic reasoning. The application of causal reasoning aided in the prediction of accident chains, while diagnostic reasoning assisted in the analysis of the accident causes. By determining the type, consequence, and severity of an accident, a more intuitive comprehension of the causes and mechanisms behind maritime accidents could be achieved.
As demonstrated in Figure 8, certain patterns were identified from the research data. For instance, when the accident type was classified as a collision, with severity as a general accident and no involvement of life loss, there was a significantly higher probability of container ships, general cargo ships, and chemical ships that were involved compared to other ship types. This indicates a necessity for focusing on the safety measures of these types of vessels, given their higher likelihood of being involved in collision accidents. The results also revealed a correlation between ship tonnage, time of the accident, and frequency of collision accidents. Ships with a tonnage between 1 and 18,500 t were more prone to collisions during dawn hours. A plausible explanation for this might be the combined influence of lower visibility conditions, potential crew fatigue, and less active navigation during these hours. Furthermore, the data suggest that accidents involving container ships, general cargo ships, and chemical ships of such tonnage typically have a lower probability of causing life loss. This might be attributed to the relative ease with which personnel can escape from smaller ships in distress or potentially the higher success rate of rescue operations due to the manageability of these smaller vessels.
These findings provide essential insights into maritime accident patterns. By identifying specific circumstances and ship types that are associated with a higher risk of accidents, it could be possible to develop more targeted safety protocols and preventive measures. It also highlights the usefulness of predictive models, such as the TAN model, for risk management in the maritime industry.
In conclusion, these findings emphasize the intricate nature of maritime accidents and the numerous variables involved. Through the bidirectional reasoning of the TAN model, a more thorough understanding of these accidents could be obtained, potentially leading to the development of more effective accident prevention strategies and, ultimately, the enhancement of maritime safety.

Conclusions
Ship safety has always been a major concern in the maritime transportation industry. In this paper, a TAN model for a maritime traffic accident risk assessment was constructed to analyze the relationship between the consequences of maritime accidents and various influencing factors and to use model simulation to analyze how different risk factors can affect different types of maritime accidents.
The TAN model was constructed based on data from a total of 5660 maritime accidents from 2005 to 2020. In addition to other accident types, the accident type with the highest probability of occurrence among maritime traffic accidents included collision, followed by stranding/grounding, and then fire/explosion.
The sensitivity analysis and simulation validation of the constructed model showed that accident consequences and accident severity are the two most intuitive factors when determining the type of accident occurrence, followed by the ship type, gross tonnage of the ship, time period, and season. The constructed model effectively predicted the likelihood of various accident scenarios and accident consequence projections under specific conditions.
According to the causal reasoning analysis of the TAN model and under the conditions of "first quarter", "general cargo ship", and "ship's gross tonnage of 1-18,500 t," the probability of ship collision and stranding/grounding accidents was higher, while the probability of life loss was relatively low, and the consequences of this accident were less affected by the time of the accident. According to the analysis of the model's diagnostic reasoning, in the general collision accident chain without loss of life, container ships, general cargo ships, and chemical ships were the main types of ships involved in such accidents. Ships with a tonnage of 1-18,500 t were more likely to have such accidents during the dawn; however, their probability of causing loss of life was lower. These findings carry significant implications for enhancing safety measures in the maritime transportation industry. By understanding the frequency, severity, and common conditions of various types of accidents, stakeholders could develop more targeted and effective accident prevention strategies.
Despite utilizing a substantial volume of publicly available accident data to achieve reliable predictive outcomes, we acknowledge the limitations of our study. It is plausible that the introduction of more variables could alter these results. Future work should focus on expanding this model to include additional variables such as environmental factors (weather conditions, sea state), ship design and maintenance factors, human factors (crew experience, fatigue), and others that could impact the risk and consequences of maritime accidents. Additionally, more in-depth research should be carried out to investigate the different patterns of accidents associated with different types of ships at various times of day to refine preventative measures accordingly.