Next Article in Journal
From Expectation to Experience: Understanding Public Acceptance of AI-Enabled Autonomous Shuttle Services in Seoul
Previous Article in Journal
A Dempster-Shafer Theory-Based Multi-Criteria Decision Model for Evaluating Sustainable Third-Party Logistics Providers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Socio-Technical Drivers of Casualty Severity in Commercial–Fishing Vessel Collisions: A Bayesian Network Analysis

1
Faculty of Maritime and Transportation, Ningbo University, Ningbo 315832, China
2
Institute of Intelligent Transportation Systems, Zhejiang University, Hangzhou 310058, China
3
Barcelona School of Nautical Studies, Universitat Politècnica de Catalunya (UPC-BarcelonaTech), 08034 Barcelona, Spain
4
Collaborative Innovation Center of Modern Urban Traffic Technologies, Southeast University, Nanjing 211189, China
5
Ningbo University Sub-Centre, National Traffic Management Engineering & Technology Research Centre, Ningbo 315832, China
*
Author to whom correspondence should be addressed.
Sustainability 2026, 18(10), 4648; https://doi.org/10.3390/su18104648
Submission received: 8 March 2026 / Revised: 27 April 2026 / Accepted: 29 April 2026 / Published: 7 May 2026
(This article belongs to the Section Hazards and Sustainability)

Abstract

This study examines the probabilistic patterns associated with casualty severity in collisions between commercial and fishing vessels in China’s coastal waters. Using 137 official accident investigation reports from 2013 to 2022, a structured dataset capturing vessel characteristics, environmental conditions, and human liability factors was constructed. A Tree-Augmented Bayesian Network (TAN-BN) was developed to model the probabilistic interactions among these variables and to identify the key drivers of casualty severity. Sensitivity analysis based on mutual information indicates that fishing vessel length is the most influential factor affecting casualty outcomes (MI = 0.322), followed by time of occurrence, wind speed, visibility, and season. Scenario analysis using MPE indicates that severe casualty scenarios are associated with adverse temporal and environmental conditions such as nighttime, poor visibility, and open-water environments, while liability-specific analysis further shows that collisions attributed primarily to commercial vessel errors are most likely to result in 4–10 casualties. The results highlight the structural vulnerability of small fishing vessels and the critical role of environmental exposure in heterogeneous vessel encounters. This study provides an interpretable probabilistic framework for examining casualty severity patterns in maritime collisions and offers risk-informed insights for improving sustainable maritime safety management in mixed-traffic coastal waters.

1. Introduction

Maritime transportation serves as the backbone of global trade, orchestrating the movement of the vast majority of international merchandise [1]. The maritime industry encompasses a wide range of vessels, from colossal container ships to modest fishing vessels, each navigating the complex network of seafaring routes. Nevertheless, collisions between commercial and fishing vessels represent a particularly acute challenge within this maritime milieu. These incidents are not merely isolated events but expose significant vulnerabilities in existing maritime safety protocols and navigation practices. Therefore, understanding the dynamics and underlying causes of such collisions requires more than routine analysis; it calls for a systematic approach to mitigate future risks and enhance maritime safety management, and support the safe and sustainable coexistence of coastal navigation and fishing activities.
Fishing activity also plays a vital role in global food supply and economic development. However, commercial fishing remains one of the most hazardous occupations worldwide, with high fatality rates reported in the fisheries sector [2]. The safety risks associated with fishing vessels are often linked to relatively limited structural protection, unsafe crew practices, and difficulties in implementing and enforcing safety regulations [3]. As a result, fishing vessel accidents frequently lead to serious consequences, including property loss and casualties [4].
The contrast between commercial and fishing vessels is pronounced, as each adheres to distinct operational paradigms. Commercial vessels are generally designed for transport efficiency and operate along prescribed shipping routes, whereas fishing vessels often display flexible and irregular navigational patterns shaped by fishing operations and working domains [5]. These differences increase encounter complexity and collision risk in shared coastal waters [5,6]. Fishing vessels are also generally more vulnerable in accidents because accident severity has been shown to be associated with vessel size, vessel age, and weather-related conditions such as wind speed [7,8].
The convergence of these disparate operational modes within shared maritime spaces creates favorable conditions for accidents, further exacerbated by variable weather, uneven access to navigational technology, and human error.
Collisions between commercial and fishing vessels can result in severe consequences, including vessel damage, sinking, and serious threat to crew safety. The resulting casualties remain a major concern within the maritime industry, as reflected in global statistics. According to the International Labour Organization, the annual mortality rate for fishermen worldwide is approximately 80 per 100,000 individuals [7]. A substantial proportion of these casualties arises from ship collisions, particularly those involving commercial and fishing vessels [9].
The complexity inherent in maritime collision scenarios, particularly those involving commercial and fishing vessels, poses significant challenges to conventional analytical methodologies. Traditional linear or deterministic models, while informative, often fail to capture the intricate interplay among dynamic and multi-dimensional factors. This limitation highlights the need for more advanced and flexible approaches capable of analyzing the multifactorial nature of maritime accidents.
Collision accidents between merchant and fishing vessels refer to traffic accidents that occur on water, involving vessels in the transportation industry and those engaged in fishing operations [6]. Collisions between commercial and fishing vessels typically arise from a combination of human, vessel-related, and environmental factors [10]. The complexity and unpredictability of these elements make experimental replication of accidents impractical, creating major challenges for causal analysis. Moreover, the influence of each factor can vary considerably, while maritime accident data often exhibit randomness, multi-dimensionality, and ambiguity. Recent Bayesian analyses of fishing vessel accidents [11] have revealed that vessel characteristics, human errors, and environmental conditions exert heterogeneous influences on accident severity, underscoring the effectiveness of probabilistic modeling in maritime casualty assessment. Therefore, employing robust probabilistic models to identify key associated factors and support more targeted risk mitigation strategies is essential.
China’s rapid economic growth has intensified maritime activity, positioning the country as a major commercial and fishing nation. Its coastal waters host both dense shipping routes and rich fishing grounds, where operational overlaps are common. The intersection of commercial navigation lanes and traditional fishing areas amplifies environmental complexity and substantially heightens the risk of vessel collisions.
To address these challenges, this study employs a Tree-Augmented Bayesian Network (TAN-BN) to characterize the probabilistic dependency patterns among vessel, human liability, temporal, and environmental factors in a complex socio-technical setting. Compared with conventional factor-ranking methods, Bayesian networks are better suited to maritime accident analysis because they can capture uncertainty, interdependence, and conditional relationships among vessel, human, and environmental factors. Methods such as DEMATEL and HFACS are useful for identifying and ranking influential factors [12], and CASMET provides a structured framework for casualty analysis [13]; however, these approaches are less capable of representing conditional probabilistic relationships and conducting inference across multiple interacting variables under uncertainty. Moreover, TAN-BN was adopted because it relaxes the strong conditional independence assumption of naive Bayes while remaining more parsimonious and interpretable than a fully unrestricted Bayesian network. This makes Bayesian networks particularly suitable for casualty analysis in commercial–fishing vessel collisions, where severe outcomes usually emerge from combinations of factors and where probabilistic inference is needed to interpret scenario-specific risk configurations. Although the empirical analysis is restricted to China’s coastal waters, the analytical framework may provide a useful reference for similar report-based maritime safety studies. However, its applicability to other regions or transport contexts would require separate validation with external data.
While TAN structures and mutual information analysis have been widely used in risk studies, their application in this study is directed at a more specific question that has received limited attention in the existing literature, namely how casualty severity emerges in collisions between commercial and fishing vessels. Most previous studies have concentrated on collision occurrence, navigational risk, or individual accident cases, whereas this study focuses on casualty outcomes and the interplay among vessel, environmental, temporal, and human liability factors behind them. In addition, the analysis is based on a database of 137 official accident reports rather than AIS trajectories or a small number of representative cases, which makes it possible to incorporate contextual information such as defect conditions, liability attribution, and accident circumstances. The contribution of this study is therefore analytical and empirical rather than methodological: it uses an established but appropriate probabilistic framework to generate new evidence on casualty severity patterns in a specific and underexamined collision context. More specifically, the study contributes by (1) positioning casualty severity, rather than collision occurrence alone, as the analytical target; (2) focusing on the asymmetric socio-technical setting of commercial–fishing vessel collisions; and (3) revealing interpretable dependency patterns from structured official investigation reports that are not readily observable from descriptive summaries or occurrence-oriented studies alone. The rest of this paper is organized as follows: Section 2 presents a comprehensive literature review, summarizing current research on collisions between commercial and fishing vessels, data sources and their applications, and key influencing factors. Section 3 outlines the research methodology. Section 4 describes the model development process, including data acquisition, factor selection, and construction of the TAN-BN model. Section 5 presents and discusses the findings, beginning with an analysis of the TAN-BN results, followed by a sensitivity analysis based on the mutual information (MI) index and a comparison of factor influences under different casualty scenarios using the Most Probable Explanation (MPE) approach. Finally, Section 6 concludes the study and summarizes the main insights.

2. Literature Review

2.1. Collision Between Commercial and Fishing Vessels

Existing studies relevant to maritime collisions and casualty outcomes can be broadly grouped into four streams: (1) descriptive and case-based analyses of commercial–fishing vessel collisions, (2) statistical and econometric studies of accident severity and casualty outcomes, (3) Bayesian-network-based studies of maritime accident mechanisms, and (4) human and organizational factor studies. Although these streams have each contributed useful insights, they differ in analytical focus, data basis, and explanatory power. More importantly, they do not address equally well the specific question examined in this study, namely how casualty severity is associated with interacting socio-technical factors in collisions between commercial and fishing vessels. A first stream consists of descriptive, legal, and regional studies of collisions between commercial and fishing vessels. These studies are valuable because they document the operational tensions that arise when large, route-following commercial ships interact with smaller, task-oriented, and often less protected fishing vessels in shared waters [5,6,9,14,15,16,17,18,19,20,21,22,23]. They have improved understanding of accident characteristics, local navigational conditions, and region-specific operational problems. However, much of this work remains descriptive, geographically localized, or based on individual cases. As a result, although such studies are useful for identifying recurrent practical issues, they are less effective in clarifying whether the same factors remain important across a broader set of collision events and different coastal settings. In other words, this literature is strong in contextual detail but weaker in cross-case synthesis and comparative explanation. A second stream addresses accident severity and casualty outcomes more broadly in maritime transportation. These studies provide a more systematic basis for understanding accident consequences by showing that severity is associated with interacting vessel, environmental, operational, and crew-related factors [7,8,24,25]. For example, Wang et al. [24] found that accident severity is related to accident type, weather and sea conditions, ship type, ship age, and crew-related deficiencies, while Wang et al. [25] showed that accident severity depends not only on accident circumstances but also on vessel characteristics, water depth, distance from shore, and seafarer-related factors. Zhou et al. [26] further highlighted the value of modeling casualty outcomes explicitly through a Bayesian-network-based framework. This body of work is important because it shifts attention from accident occurrence alone to the consequences of accidents. However, most of these studies examine maritime accidents in general rather than the specific collision context of commercial–fishing vessel encounters. As a result, they do not fully resolve whether the determinants of casualty severity in this highly asymmetric collision class differ from those identified in broader maritime accident datasets. A third stream focuses on probabilistic and Bayesian approaches to maritime safety analysis [27,28,29,30,31]. These studies have shown that Bayesian methods are well suited to accident analysis because they can represent uncertainty, conditional dependence, and interactions among vessel, environmental, and human-related factors. Foundational applications emphasized systemic and organizational risk factors [27], while later studies expanded toward accident-type analysis, accident severity analysis, and data-driven quantification of maritime risk [28,29,30,31]. This stream demonstrates clear methodological value, especially for situations in which accident outcomes do not arise from isolated factors but from combinations of conditions. Nevertheless, most existing Bayesian studies either address maritime accidents at a broad level, focus on occurrence risk rather than casualty severity, or use data sources that do not preserve the same degree of contextual detail as official investigation reports. Thus, while Bayesian methods are clearly established in maritime safety research, their use has less often been directed toward casualty severity in commercial–fishing vessel collisions using multi-case official report data with explicit contextual coding. A fourth stream examines human and organizational factors in maritime accidents [18,32,33,34,35]. This literature has consistently shown that unsafe acts, organizational deficiencies, communication failures, and deficiencies in decision-making or watchkeeping are central to accident development. Review studies such as Wu et al. [32] demonstrate that human and organizational factors have become a major direction in maritime safety research, while HFACS-based and -related empirical studies [18,33,34,35] provide more structured interpretations of accident pathways. This stream is especially useful in explaining how accidents develop. However, it also has two limitations in relation to the present study. First, many such studies focus on causal pathways or responsibility structures rather than on casualty severity as the main analytical outcome. Second, human-factor information in accident reports is often rich narratively but difficult to code in a fully standardized way across many cases, which complicates direct cross-case quantitative modeling. Accordingly, this literature is highly relevant to interpretation, but less often provides an integrated probabilistic account of how human-related factors combine with vessel, environmental, and temporal conditions to shape casualty severity outcomes. Taken together, the existing literature has substantially improved understanding of maritime collisions, accident causation, severity, and organizational risk. However, it remains uneven in relation to the specific problem addressed here. First, many studies on commercial–fishing vessel collisions are geographically narrow or case-based, which limits the generalizability of their conclusions. Second, a substantial portion of the literature focuses on collision occurrence, legal responsibility, or qualitative causal analysis rather than on casualty severity as the primary analytical target. Third, even where probabilistic approaches are used, relatively few studies combine multiple official investigation reports with an interpretable analytical framework that preserves contextual information on vessel characteristics, environmental exposure, accident circumstances, and liability attribution. The present study is positioned in relation to these limitations. It does not claim methodological novelty; rather, it applies an established TAN-BN framework to examine casualty severity in a specific and underexamined collision context using structured information extracted from official accident investigation reports.
Commercial–fishing vessel collisions have attracted sustained attention because they often occur in operationally complex waters where dense traffic routes intersect with fishing grounds. Existing studies show that this collision type is characterized by asymmetry in vessel size, maneuverability, operating purpose, and navigational behavior, which creates a distinctive socio-technical risk environment.
Several studies have examined the geographic distribution and contextual features of such accidents. Zhang et al. [14] identified a high incidence of collisions involving commercial and fishing vessels in global accident statistics, while Przywarty et al. [15] and Oh et al. [16] analyzed fishing vessel collisions in the Polish Exclusive Economic Zone and Korean waters, respectively. These studies improve understanding of accident characteristics in particular regions, but they are primarily descriptive and do not fully resolve how casualty severity is shaped by interacting factors across multiple cases.
Other studies have adopted broader data-driven approaches. Jiang et al. [31] developed a Bayesian network model using a large global maritime accident database and found that location, ship type, age, gross tonnage, and deadweight tonnage are important risk-influencing factors. Li et al. [17] also used a TAN-BN framework at the global scale and showed that ship operation, voyage segment, ship speed, sea conditions, wind, ship type, and human factors strongly affect collision-related risk. More recently, Zhou et al. [26] explicitly addressed maritime casualty analysis from a global perspective, further demonstrating the value of probabilistic modeling for consequence-oriented accident analysis. However, these studies remain broad in accident scope and do not focus specifically on casualty severity in commercial–fishing vessel collisions.
Human and organizational analyses provide another important perspective. Wang et al. [35] examined 443 historical merchant–fishing vessel collision accidents in China using an HFACS-BN framework and identified 56 human and organizational factors, including several factors specific to this collision type. Ma et al. [18] similarly used HFACS, DEMATEL, and fuzzy cognitive mapping to identify key human-related contributors such as lookout negligence, poor safety management practices, and failure to take effective avoidance action. These studies strengthen understanding of human and organizational pathways, but they focus mainly on causal structures or accident development paths rather than on casualty severity outcomes.
Case-based and region-specific studies remain valuable because they reveal operational detail that large databases may overlook. Tang et al. [19], Zhang [20], Su [21], Wu et al. [22], Yang et al. [23], and Yi [9] provide insights into particular accidents, local navigational conditions, and legal or managerial issues in China’s coastal waters. However, conclusions from single accidents or limited sea areas are often difficult to generalize because marine conditions, traffic composition, and fishing practices differ substantially across regions.
With respect to data sources, AIS-based studies can capture trajectories, encounter geometry, and collision location, but they often lack direct information on weather conditions, vessel deficiencies, organizational context, and responsibility attribution. This is particularly problematic in commercial–fishing vessel collisions because smaller fishing vessels may have incomplete AIS coverage or inconsistent AIS usage. By contrast, official accident investigation reports provide richer contextual information, including accident sequence, vessel condition, weather and sea conditions, and liability assessment. Recent work has also shown the value of insurance claim data and narrative accident records for severity analysis in fishing vessel safety research [36,37,38].
Overall, the literature confirms that commercial–fishing vessel collisions are an important but still insufficiently synthesized research area. Existing studies have improved understanding of accident occurrence, human and organizational contributors, and regional collision patterns. Yet three shortcomings remain: (1) many studies are geographically narrow or case-based; (2) casualty severity is less frequently modeled as the primary outcome; and (3) few studies combine multi-report official accident data with an interpretable probabilistic framework specifically tailored to commercial–fishing vessel collisions. More specifically, the gap addressed in this study is not the absence of probabilistic approaches in maritime safety or accident severity research. Rather, it lies in the comparatively limited quantitative analysis of casualty severity in commercial–fishing vessel collisions based on multiple official investigation reports that preserve contextual information on vessel characteristics, environmental exposure, accident circumstances, and liability attribution. This gap is important because collisions between commercial and fishing vessels occur within a highly asymmetric socio-technical setting: the two vessel types differ substantially in size, maneuverability, operational purpose, and structural vulnerability, and these differences may shape casualty outcomes in ways not fully captured by broader maritime accident datasets. Accordingly, the present study focuses specifically on casualty severity in this collision context and applies a Tree-Augmented Bayesian Network to examine the associated probabilistic dependency patterns.

2.2. Selection of Influencing Factors

The international literature suggests that casualty severity in maritime accidents is shaped by interacting environmental, vessel-related, temporal, and human/organizational factors. Accordingly, the variables selected in this study are intended to represent not all possible contributors, but rather the key socio-technical dimensions that can be coded consistently from official accident reports and that are well supported by prior research.
Environmental conditions are among the most widely documented contributors to maritime risk and severity. Studies of marine accident severity have shown that wind, sea state, current, and visibility affect both accident development and accident consequences by altering maneuverability, perception, and response capacity [24,25]. In the fishing vessel literature, Rezaee et al. [37] and Zhang et al. [39] found that harsh weather conditions increase incident rates relative to fishing activity levels, especially for smaller vessels. Recent Bayesian severity studies likewise confirm that visibility, wind, and location are important contextual variables in consequence-oriented accident analysis. Therefore, wind speed [37,40], visibility [28,40,41], and location [30,41] are retained in this study as core environmental indicators.
Human and organizational factors remain central in the maritime safety literature [42,43]. Broad reviews and empirical studies consistently show that accident development often involves deficiencies in voyage preparation [30,38,44] (e.g., inadequate voyage planning), the judgment and decision-making phase [42,43,45] (e.g., poor hazard judgment, improper lookout), navigation maneuvering [42,46] (e.g., failure to maintain a safe speed, delayed action, incorrect vessel handling), and the management process [30,43,45]. In merchant–fishing vessel collisions specifically, HFACS-based studies also indicate that unsafe acts, inadequate organizational processes, and communication failures play major roles. In the present study, human factors are operationalized through a liability-based variable derived from official accident reports. This is necessarily a simplified representation, but it remains useful because responsibility attribution in investigation reports captures which side’s operational errors were judged to be dominant in the accident sequence.
Vessel characteristics are also important because they influence both exposure and vulnerability. Previous studies have reported associations between severity outcomes and vessel size, age, defects, and technical condition. For example, Wang et al. [24] found that ship type and age were significant in marine accident severity, while Uğurlu et al. [38] showed that fishing vessel accidents are significantly related to vessel length and age. In commercial–fishing vessel collisions, the asymmetry between large commercial vessels and relatively small fishing vessels is especially relevant because it affects impact consequences, survivability, and post-collision rescue conditions. On this basis, the present study includes commercial vessel ship length [29,47], ship gross tonnage [29,48,49], ship age [47,48,50], and ship defects [2,30,51].
Temporal and contextual variables are also supported by prior research. Time of day [8,52,53] and season [8,47,52] may shape traffic density, visibility, fatigue, fishing behavior, and weather exposure. Prior studies have shown that nighttime operations, seasonal conditions, and adverse weather frequently co-occur with severe maritime outcomes, while recent fishing vessel research also suggests that the effects of such variables may vary across operational contexts and incident types. Therefore, time and season are retained as auxiliary contextual factors that may help explain how casualty patterns differ across accident scenarios.
In summary, the factor set used in this study reflects a synthesis of the broader international literature and the practical constraints of accident report coding. It combines environmental exposure, vessel vulnerability, temporal context, and human liability information into one operational framework. This choice is consistent with current maritime safety research, which increasingly views accident outcomes as emergent properties of socio-technical systems rather than as the result of isolated single factors.
While the preceding review positions the study within the broader state of the art, the purpose of Table 1 is narrower: it summarizes the factor-level literature used to support variable selection for the empirical model.
Based on the above literature, the gap addressed in this study is not that probabilistic approaches have never been used in maritime safety or accident severity research. Rather, the gap lies in the comparatively limited quantitative analysis of casualty severity in commercial–fishing vessel collisions using multiple official investigation reports that preserve contextual information on vessel characteristics, environmental exposure, accident circumstances, and liability attribution. This gap is relevant because collisions between commercial and fishing vessels occur in a highly asymmetric socio-technical setting: the two vessel types differ substantially in size, maneuverability, operational purpose, and vulnerability, which may shape casualty outcomes differently from those observed in broader maritime accident datasets. Accordingly, the contribution of the present study is not the introduction of a new Bayesian technique, but the generation of interpretable empirical evidence on how casualty severity is associated with interacting vessel, environmental, temporal, and liability-related factors in this specific collision context. The present study therefore focuses specifically on casualty severity in this collision context and applies a Tree-Augmented Bayesian Network to examine the associated probabilistic dependency patterns.

3. Materials and Methods

The data used in this study are drawn from accident reports issued by the China Maritime Safety Administration. These reports provide detailed records, including weather conditions at the time of the accident and determinations of human liability. Using these data, we apply data mining techniques to construct a TAN-BN model. The model is then subjected to sensitivity analysis, and the MPE is used to infer plausible causes of casualties. Further methodological details are provided in the remainder of Section 3.

3.1. TAN-BN Structure Construction

For the analysis of risk-influencing factors (RIFs), two common approaches can be used to learn a BN structure. The first approach relies on expert knowledge to specify causal relationships. The second approach is data-driven and infers dependencies among RIFs using structure-learning algorithms within the BN framework [54]. This study developed the BN structure using the latter data-driven approach.
First, raw maritime accident reports were manually coded, yielding a database comprising 137 reports involving 274 ships. The resulting database provides an empirical basis for data-driven risk analysis, although the sample size remains moderate relative to the dimensionality of the final discretized TAN-BN. Following Friedman et al. [55], we adopted a Tree-Augmented Bayesian Network as a parsimonious and interpretable probabilistic structure for analyzing dependency patterns among accident-related factors. TAN captures strong attribute dependencies by connecting them in a tree, thereby improving classification performance over standard naive Bayes without incurring prohibitive computational complexity.
Formally, a BN encodes a joint probability distribution over a set of random variables U represented by an annotated directed acyclic graph (DAG). Let U = A 1 , A n , C , where n is the number of RIFs, A 1 ,   A n denote the RIF variables, and C is the class variable (accident type).
Consider a graph structure where the class variable is the root, i.e., Π C = ( Π C represents the set of the parent elements of C in U ), and every RIF has the class variable as its only parent, i.e., A i = { C }   for   1 i n . BN defines the unique joint probability distribution over U as expressed by the following formula:
P A 1 , A n , C = P ( C ) i = 1 n   P A i C
The DAG on A 1 , A n is a tree if A i contains only one parent except for a variable with no parent (called the root). There exists a function π that defines a tree on A 1 , A n , if there is an i such that π ( i ) = 0 (i.e., the root of the tree), and there is no sequence i 1 , i k that makes a π ( i j ) = i j + 1   for   i j < k   and   π i k = i 1 (no cycle). Such a function defines a tree network where A i = C , A π ( i ) if π ( i ) > 0 , and A i = { C }   if     π ( i ) = 0 .
Learning the TAN structure is an optimization problem. The solution to this problem is by the use of conditional mutual information between attributes [56]. The function can be defined as
I P A i , A j C = a i i , a j i , c i   P a i i , a j i , c i l o g P a i i , a j i c i P a i i c i P a j i c i
I P indicates conditional mutual information, a i i is the i th state of RIF A i , a j i is the i th state of RIF A j , and c i is the i th state of accident type. The optimization problem, that is, learning the TAN structure, is finding a defined function tree of π A 1 , A n , maximizing logarithmic likelihood.

3.2. Expectation–Maximization Parameter Learning

After constructing the Bayesian network, the next step is parameter learning. For each node, a conditional probability distribution must be estimated. When learning parameters, one can utilize compiled data or rely on empirical knowledge from experts to specify conditional probability distributions. However, expert knowledge may sometimes be imprecise, biased, and subjective, which can reduce the accuracy of the network structure. Therefore, in this study, we opted to specify the conditional probability distribution by learning from accident report data to ensure the accuracy of the network structure.
The Expectation–Maximization (EM) algorithm is a classical method for maximum-likelihood estimation with incomplete or latent data, first formalized by Dempster et al. [55]. It iteratively estimates the probability parameters in the Bayesian network, enabling the model to maximize the likelihood function based on the observed data.
The EM algorithm consists of two main steps:
E-Step: In the E-step, the posterior probability of the hidden variable is calculated based on the current parameter estimation. The posterior probability of all hidden variables is calculated for each sample data based on the current parameters. Given the observed data x = ( x ( 1 ) , x 2 , , x m ) and the current parameter θ, calculate the expectation of the log-likelihood function for the complete data z = ( z ( 1 ) , z 2 , , z m ) . The joint probability distribution is p x , z θ ; the conditional probability distribution is p ( z x , θ ) . For this purpose, the expectation of the log-likelihood function is defined as
Q i z ( i ) : = p z ( i ) x ( i ) ; θ
M-Step: The posterior probability of the hidden variable calculated in the E-step is used to update the parameter estimate. For each variable node and edge node, the parameter’s value is re-estimated according to the posterior probability obtained in step E using the maximum-likelihood estimation or Bayesian estimation method. This step can be done by maximizing the likelihood function or adding a prior distribution for regularization. Maximizing the expectation of the log-likelihood function under complete data, that is, selecting some parameter θ , satisfies
θ : = a r g   m a x θ   i   z ( i )   Q i z ( i ) l o g   p x ( i ) , z ( i ) ; θ Q i z ( i )
Compared with gradient learning and count learning, the EM algorithm has broader applicability, especially when the data is incomplete or contains some unknown parameters. Although the convergence rate of the EM algorithm is relatively slow, it is still widely used in practical applications and can give more accurate results [57]. Therefore, in this paper, we use the EM parameter learning method.

3.3. Mutual Information

In Bayesian networks, MI measures the statistical dependence between two variables. It quantifies the reduction in uncertainty of one variable when the value of another variable is known, indicating the degree of information sharing between them. The mutual information index is calculated as follows:
E ( F ) = i = 1 n   P F = F i l o g   P F = F i
where E represents entropy, and P F = F i represents the probability of a random event in a specific state space. According to the first two parameters, E ( F ) can be obtained under the condition that the probability of a random event is known.
E ( F / T ) = n   E F / T = T s P T = T s
The MI index is calculated by comparing the entropy of the random event to the entropy of the target variable. A higher ratio indicates a stronger correlation between the accident outcome and the causative event.
M I = E ( F ) E ( F / T )
The greater the value of MI, the stronger the correlation between the two variables and the higher the degree of information sharing. When the MI is 0, there is no correlation between the two variables.
MI in Bayesian networks provides the ability to measure the correlation of variables, infer conditional independence, learn network structure, and select features. By interpreting mutual information results, the relationships between variables in Bayesian networks can be better understood and interpreted to support probabilistic inference, prediction, and decision-making [58].

3.4. Most Probable Explanation

To examine the connections between the relevant nodes in the BN and identify the most likely states within the nodes, the BN model can provide the MPE based on the determined number of casualties caused by the accident.
The explanation of MPE is as follows: the given set of evidential variables E solves for the maximum probability assignment of O = X/E. The formal description is as follows: Given E = e, calculate o = a r g m a x p O = o E = e . O = X/E represents the variables in the variable set X except for the evidence variable set E.
The goal of MPE inference is to determine the underlying variable’s value that maximizes the given observation’s conditional probability. It aims to find the most likely explanation, which is the combination of potential variable values with the highest probability for a given observation [59].
An optimal estimate of the unobserved variable can be obtained by calculating the MPE. This estimation is useful for scenario-based probabilistic interpretation and decision support.

3.5. Research Workflow

To improve the transparency and reproducibility of the study, the overall data-processing and analytical procedure is summarized in Table 2. The table outlines the main steps from accident report collection and case screening to variable coding, model construction, parameter learning, and subsequent probabilistic analyses.
The workflow summarized in Table 2 provides an overview of how the raw accident reports were transformed into a structured analytical dataset and subsequently used for TAN-BN modeling and inference. The main methodological components are described in detail in the following section.

4. Model Construction

4.1. Dataset

The accident data in this paper is drawn from the accident report published on the official website of the Maritime Safety Administration of the People’s Republic of China (https://www.msa.gov.cn/html/cnmsa/hxaq/sgjx/index.html) (accessed on 5 April 2026). Due to the frequent occurrence of maritime accidents, we follow the “Statistical Measures for Water Traffic Accidents” as the reference standard and include only cases classified as “general grade” or above. Minor accidents are excluded. During the statistical period, no major catastrophic incidents were reported, so none are included. The accident type analyzed is collisions between commercial and fishing vessels. The spatial distribution of these collisions in China’s coastal waters is shown in Figure 1. The study area is bounded by the following latitudes and longitudes: (17°58.7′ N, 108°12.1′ E/41°30.4′ N, 108°12.1′ E/41°30.4′ N, 124°48.8′ E/17°58.7′ N, 124°48.8′ E).
Drawing on expert opinion and the relevant literature, we identify key factors influencing the number of casualties in commercial–fishing vessel collisions. The indicators considered include commercial vessel age, gross tonnage, and documented defects; fishing vessel length, age, and documented defects; and environmental and contextual factors such as visibility, location, wind speed, season, time, and human liability. The variables included in the model were determined with reference to both the existing literature and the information available in the official accident reports. Candidate factors were first identified from prior studies on maritime accident severity, fishing vessel safety, and collision analysis, and were then screened according to whether they could be coded consistently across cases. The final set of variables was selected to represent the main socio-technical dimensions associated with casualty severity, including vessel characteristics, environmental conditions, temporal context, and human-related responsibility. As such, these variables should be regarded as operational indicators derived from the available reports rather than as a complete representation of all possible accident contributors.
From 2013 to 2022, 274 vessels were involved in these accidents. Each accident report provides an overview of the accident and investigation; general information about the vessel, crew, and shipping company; weather and sea conditions; the navigation environment of the accident area; analysis and identification of the accident’s essential elements; accident sequence; damage assessment; causal analysis; and attribution of responsibility. Using the 2021 collision accident between “DL ACACIA” and “Guiqin Fishery 20216” as an example, the following information can be extracted:
(1)
Number of casualties: 7.
(2)
Accident location (latitude/longitude): 20°53.509′ N/114°23.303′ E.
(3)
Date and time (season): 3 January 2021, at 05:05 h.
(4)
Wind speed and visibility: northeast wind at 6–7 Beaufort scale, gusts up to 8, visibility ranging from 12 to 18 km, with good visibility.
(5)
Vessel technical parameters: See Table 3.
(6)
Determination of human liability: Primary and secondary human liabilities are established in Chapter 10 of the accident report, in the section that assesses human liability.
The number of casualties, the key outcome in this study, is determined according to the Statistical Measures for Water Traffic Accidents (Ministry of Transport, 2014) [60]. Casualties are divided into the following groups: no casualties (less than three severe injuries), 1–3 casualties in accidents (every three additional severe injuries is considered an increase in the number of casualties), 4–10 casualties in accidents (every three additional severe injuries is considered an increase in the number of casualties), and more than 10 casualties in an accident (each additional three severe injuries is considered one additional death). By comprehensively analyzing these influencing factors, this study investigates the relationships between each factor and the number of casualties in collisions between commercial and fishing vessels.
Table 4 presents the node variables, descriptions, values, and frequencies for the selected variables. For commercial vessels, gross tonnage and age are classified according to standards issued by the China Maritime Safety Administration. For fishing vessels, length and age follow the same regulatory sources. For example, ship age is classified with reference to the Ministry of Transport of the People’s Republic of China Decree No. 13, 2021, and the corresponding vessel tonnage and length are specified in the relevant documents. In this paper, gross tonnage reflects the size of commercial vessels, and length reflects the size of fishing vessels. Visibility and wind speed are defined following IMO (2014) standards [41]. The human dimension is represented only through a liability-based indicator derived from the official reports. Accordingly, the model captures only a limited report-based aspect of human-related responsibility rather than the full multi-dimensional structure of human and organizational factors. Other indicators are determined based on the literature summarized in Table 1. Table 5 provides further explanations of variables marked with an asterisk (“*”) in Table 4.

4.2. Constructing the TAN Network

Netica version 5.18 (Norsys Software Corp., Vancouver, BC, Canada; https://www.norsys.com) was used to construct the Tree-Augmented Bayesian Network (TAN-BN) in this study. The analyses were conducted on a personal computer running a 64-bit Windows operating system with an AMD Ryzen 7 7840H processor and 32 GB RAM. Its built-in structure-learning capability was employed to learn the TAN structure from the coded accident dataset, with casualties specified as the target node and 12 attribute nodes representing vessel-related, environmental, temporal, and human liability factors, as shown in Table 6. The model was used as an explanatory probabilistic tool to characterize dependency patterns associated with casualty severity in commercial–fishing vessel collisions. During network construction, the default TAN-learning settings provided by Netica were adopted, and the structure was learned automatically based on the relationships in the dataset. The qualitative component of the TAN for casualty probability was established according to the procedure summarized in Table 6. Finally, all learned edges were reviewed by domain experts to assess their plausibility, and the resulting network structure is presented in Figure 2. In the figure, the target node is shown in yellow, the attribute nodes are shown in blue, and the arrows indicate directional relationships among nodes.
In this study, the TAN structure was used to represent both the direct relationship between the target variable (casualties) and each explanatory factor, and the dependency relationships among explanatory variables themselves. Compared with a naive Bayes structure, TAN allows each attribute node to have the target node and at most one additional attribute node as parents. This makes the model more suitable for the present study, because vessel, environmental, temporal, and human liability factors in maritime collision accidents are not fully independent. Therefore, the learned TAN structure provides a more realistic and interpretable representation of the probabilistic dependency patterns associated with casualty severity.

4.3. Bayesian Network Model EM Parameter Learning

This study utilizes sample data to estimate the parameters for each node. The sample data consists of observational data collected from accident reports, which provide objective and reliable parameter estimates. Each node’s conditional probability tables were developed in Netica.
The Expectation–Maximization (EM) algorithm is used to learn the conditional probability parameters of the Bayesian network under the established TAN structure. The main role of EM is to iteratively optimize the conditional probability tables of each node so that the likelihood of the observed accident data is maximized. This process improves the consistency between the network parameters and the empirical data, and provides a quantitative basis for the subsequent marginal probability analysis, mutual information calculation, and MPE inference.
After using the EM algorithm, the hidden variables are estimated by iteratively estimating the posterior probability of the numbers and the maximum likelihood of the parameters. The result after parameter learning is shown in Figure 3.

5. Result and Discussion

5.1. Marginal Probability Distribution of Key Variables

Using a TAN-BN model developed from 137 investigation reports on collisions between commercial and fishing vessels, this study identifies the key socio-technical factors associated with casualty severity, examines their marginal probability distributions and relative statistical associations, and further characterizes the typical risk scenarios and factor configurations underlying different casualty outcomes through MPE inference. In this way, the analysis provides new empirical evidence on how casualty severity is structured in this specific collision class, rather than merely applying a Bayesian model to reproduce already well-established patterns.
Figure 3 presents the marginal probability distribution of each variable node in the TAN-BN model, calculated using EM-based parameter learning on the dataset. These values represent the overall likelihood of each state for a given factor, independent of any observed evidence or conditional settings. The model indicates that, among collisions between commercial and fishing vessels of general grade or above in China’s coastal waters, the probability of an accident resulting in 1 to 3 casualties (each additional three serious injuries is counted as one casualty) is the highest at 54%, while the probability of more than 10 casualties (each additional three serious injuries is considered 1 additional casualty) is relatively low at 2.92%.
(1)
Nodal analysis of hull factors
According to the marginal distributions obtained from the TAN-BN model, 10.9% of commercial vessels involved in the dataset were more than 20 years old, and 47.4% had a gross tonnage exceeding 3000 tons. The proportion of commercial vessels with documented technical or structural defects was 6.57%. In contrast, among fishing vessels involved in collisions, 23.4% were over 20 years old and 59.8% had a length greater than 24 m. The proportion of defective fishing vessels was 8.03%. These marginal probabilities describe the overall distribution of vessel characteristics in the dataset, independent of casualty outcomes or other conditional influences.
(2)
Nodal analysis of environmental factors
The probabilistic profile of the model reveals that 72.2% of collision cases occurred during nighttime hours, indicating a strong temporal bias toward non-daylight operations. Additionally, 69.3% of the accidents took place under moderate wind conditions (Beaufort 4–6), and 68.6% occurred during good visibility. These results suggest that many collisions between commercial and fishing vessels occur under typical operating conditions rather than extreme weather or poor visibility.
Seasonally, only 15.3% of collisions were associated with summer months, which may reflect the influence of fishing moratorium policies that reduce vessel density during this period. However, these proportions are derived from the model’s marginal distributions and do not, by themselves, imply causal relationships with casualty severity.
(3)
Nodal analysis of human liability factors
The accident reports indicate that both commercial and fishing vessel crews were involved in different forms of human-related error across the recorded collision events, including deficiencies in lookout, judgment, maneuvering, and voyage management. However, because such details are not described in a sufficiently standardized form across all reports, the present study does not model them as separate variables. Instead, it uses a liability-based categorical indicator to denote whether primary human-related responsibility in the official report was attributed mainly to the commercial vessel, the fishing vessel, or both parties.

5.2. Sensitivity Analysis Based on Mutual Information

Based on the marginal probabilities derived from the TAN-BN model, commercial vessels were identified as the primarily liable party in 50.4% of cases, fishing vessels in 26.3% of cases, and both parties in 23.3% of cases. These probabilities should not be interpreted as a complete characterization of human causation. Rather, they describe the distribution of reported liability configurations within the dataset and provide a simplified way to retain one human-related dimension in the probabilistic analysis.
To assess the relative statistical association of each input variable with casualty outcomes, a sensitivity analysis was conducted based on the MI index. Within the Bayesian network framework, MI measures the reduction in uncertainty of the target variable (casualties) given knowledge of a predictor variable’s state. This metric offers a theoretically grounded and model-consistent approach for assessing the relative importance of contributing factors in the probabilistic inference process.
Figure 4 illustrates the MI values between each risk factor and the casualty outcome node. The node sizes are scaled proportionally to the magnitude of shared information. Among all variables, the length of the fishing vessel (F Length) shows the strongest association with casualty severity, exhibiting the highest MI score (0.322), followed by time of day, wind speed, visibility, and season. These five variables represent the most significant contributors to the variation in casualty levels.
A closer examination of the results reveals that vessel-related characteristics of fishing vessels—including length, age, and structural integrity—consistently exerted a stronger influence on casualty outcomes than the corresponding attributes of commercial vessels. This suggests that, in collision scenarios, fishing vessels constitute the more vulnerable party, as their structural limitations exacerbate the consequences of collision impact. The model captures this asymmetry by allocating greater informational weight to fishing vessel-related variables.
In contrast, the liability-based human-factor variable exhibits a relatively low mutual information value in the marginal sense. This should not be interpreted as evidence that human factors are unimportant in maritime collisions. Rather, it reflects the fact that the present model represents the human dimension only through a simplified liability-based proxy and not through a richer set of directly coded human and organizational variables. The result therefore suggests that the marginal explanatory role of the liability variable is limited within this coding framework, while broader human-factor mechanisms may still be embedded indirectly in other contextual or operational patterns represented in the model.
Overall, the MI-based sensitivity analysis underscores the structural vulnerability and environmental exposure of fishing vessels as primary determinants of casualty severity in commercial–fishing vessel collisions. Within the present dataset, these findings suggest that risk mitigation strategies in China’s mixed commercial–fishing coastal waters should focus not only on navigational behavior but also on improving the design, robustness, and protective measures of small fishing vessels operating in complex traffic environments. From a practical perspective, the MI results suggest that safety interventions should prioritize the factors most strongly associated with casualty severity, especially the structural vulnerability of fishing vessels and the heightened exposure associated with nighttime operation, poor visibility, and adverse wind conditions.

5.3. Analysis of External Environmental Influences Under the MPE Model

To examine which combinations of factor states are most strongly associated with different levels of collision severity within the learned network, MPE inference was applied within the TAN-BN model. MPE identifies the combination of variable states that maximizes the joint probability of the entire network under a given evidence condition—in this case, a fixed number of casualties.
Two scenarios were selected for comparative analysis:
(1)
Low-severity outcome (C1): no casualties and fewer than three serious injuries;
(2)
High-severity outcome (C4): more than ten casualties.
The resulting most probable configurations for each scenario are illustrated in Figure 5 and Figure 6, with a summary provided in Table 7. Key patterns identified from the results are summarized as follows.
These scenario-specific risk profiles indicate systematic probabilistic patterns across meteorological, temporal, spatial, and operational dimensions. They show how different reported conditions and liability configurations are associated with different casualty outcomes within the dataset, although they should not be interpreted as direct evidence of causal mechanisms.
(1)
Wind Conditions: In both low- and high-casualty scenarios, wind speeds predominantly fall within Beaufort 3–6 (light-to-moderate breeze). Notably, severe winds (Beaufort ≥ 7) are rarely observed in the MPE configurations, suggesting that operational restrictions commonly imposed on both commercial and fishing vessels during extreme weather effectively reduce exposure to hazardous conditions. Thus, while moderate winds do not necessarily prevent collisions, they may still pose dynamic challenges for vessel control and response time, particularly in confined waters.
(2)
Visibility: A clear divergence emerges with respect to visibility. In low-severity incidents (no casualties or ≤3 serious injuries), good visibility (≥5 nautical miles) is the most likely condition. In contrast, high-casualty events (≥10 casualties) are strongly associated with poor visibility. This pattern underscores visibility as a critical risk amplifier: even in the presence of human error, adequate visual conditions may allow for last-minute evasive actions, whereas reduced visibility severely impairs situational awareness and collision avoidance, significantly increasing casualty risk.
(3)
Time of day: Nighttime is a dominant factor in both configurations, indicating a persistent vulnerability during dark hours. This aligns with known challenges such as reduced visual detection capability, circadian fatigue, and diminished crew alertness. Its consistent presence, particularly in high-severity cases, reinforces the role of nighttime as a key compounding factor, especially when combined with poor visibility or complex traffic environments.
(4)
Seasonality: The seasonal distribution reveals a notable divergence: low-casualty incidents are most probable in autumn, whereas high-casualty events peak in spring. Both seasons are periods of intensified fishing activity along China’s coast, but spring may entail additional risks due to transitional weather, increased vessel traffic density from seasonal fisheries, and potential mismatches in crew readiness or vessel preparedness after winter layup. This indicates that severe collisions are influenced not only by traffic intensity but also by seasonal operational stressors.
(5)
Location: Spatial context plays a decisive role in outcome severity. Collisions resulting in no or minimal casualties are most likely to occur in designated shipping lanes or regulated waterways, where navigational rules are enforced, traffic flow is structured, and vessel behavior is more predictable. Conversely, open waters are the predominant location for fatal collisions. These areas typically lack formal traffic separation schemes, suffer from weaker surveillance and enforcement, and involve greater uncertainty in fishing vessel movements—factors that collectively reduce collision avoidance effectiveness and increase the likelihood of catastrophic outcomes.
These MPE-based scenario patterns indicate that maritime safety management may benefit from more targeted, scenario-specific interventions, such as enhanced monitoring and warning measures during nighttime, low-visibility, and open-water operations.

5.4. Impact Analysis of Human Liability Under the MPE Model

The MPE-based analysis not only reveals the influence of human liability on casualty levels but also facilitates further exploration of the environmental context in which each party’s errors are most likely to occur. Figure 7 and Figure 8 present the MPE outcomes under two conditions: HE1 (commercial vessels as the primary liable party) and HE2 (fishing vessels as the primary liable party).
When human error is attributed to crews of commercial vessels (HE1), the MPE model, conditioned on selecting HE1 as the primary liability factor, reveals a highly consistent scenario characterized by winter conditions (S4 = 100%), nighttime operations (T2 = 100%), good visibility (≥5 nautical miles = 100%), moderate-to-strong winds (WS2 = 100%), and navigation within traffic-regulated shipping lanes (L1 = 100%). This configuration is associated with large-scale casualties (C3 = 100%). In other words, when commercial vessels are assigned primary liability, even clear but dark and windy winter nights in congested channels significantly heighten collision risk. This elevated risk likely results from the combined effects of high vessel momentum, constrained maneuvering space, and reduced crew vigilance under heavy operational workload.
In contrast, when fishing vessel crews are identified as primarily liable (HE2), the MPE solution (conditioned on HE2) shifts markedly toward less severe outcomes: no casualties (C1 = 100%) or at most 1–3 casualties (C2 = 66.8%). These incidents occur under similar environmental and spatial conditions—winter, night, good visibility, and regulated shipping lanes—but are associated with calmer-to-moderate wind conditions (WS1). This suggests that, although fishing vessels operate in the same high-density, clear-night winter environments, their lower mass, reduced speed, and typically more cautious operational profiles contribute to mitigated casualty severity.
Collectively, these liability-specific risk profiles indicate that casualty outcomes are not determined solely by vessel type or the attribution of fault, but rather by the interaction of human error with seasonal, temporal, meteorological, and spatial factors. Commercial vessels emerge as “high-risk agents” when operational errors coincide with winter darkness and elevated wind conditions in narrow, high-traffic corridors. In contrast, fishing vessels, even when committing errors in comparable settings, rarely result in large-scale loss events, underscoring the moderating role of vessel dynamics and operational behavior in accident consequences. The liability-specific results further suggest that prevention measures should be differentiated across vessel types, with particular attention to bridge watchkeeping, lookout management, and collision avoidance practices on commercial vessels operating in mixed-traffic waters.

5.5. Discussions

The directed relationships identified by the TAN-BN model represent learned dependency patterns among key factors, allowing for a clearer interpretation of how these variables are statistically associated within the dataset. Several directional dependencies merit particular attention. First, the learned link between “season” and “human liability” indicates that the distribution of liability attribution varies across seasonal contexts in the observed cases. This pattern may be related to seasonal differences in weather, visibility, traffic conditions, and operational context, although the present model does not establish these factors as direct causal mechanisms of human error. In addition, seasonal contexts may coincide with different navigational conditions, such as ice formation, dense fog, or intensified fishing activities, further compounding operational risks.
Second, the learned dependency between “wind speed” and “fishing vessel length” indicates that smaller fishing vessels are more frequently represented in higher-wind accident contexts within the dataset. This pattern may be consistent with the greater operational vulnerability often reported for smaller vessels under adverse conditions. Third, the learned dependency between “season” and “visibility” indicates that visibility conditions vary across seasonal settings in the observed cases. This pattern is consistent with broader seasonal variation in meteorological conditions, although the present model does not identify a direct causal pathway from season to collision severity. Collectively, these learned relationships provide an interpretive description of how environmental and operational variables are statistically patterned in the accident dataset.
The sensitivity analysis based on the mutual information (MI) index further identified five factors showing relatively strong associations with casualty severity, namely fishing vessel length, time of day, wind speed, visibility, and season, all of which correspond to prominent arcs in the TAN-BN network. Among these, fishing vessel length demonstrated the strongest mutual information with casualty outcomes, indicating that it is the variable most strongly associated with casualty severity in the present model. One possible interpretation is that smaller fishing vessels are more frequently associated with severe casualty outcomes, a pattern that is consistent with lower structural protection and survivability in collision contexts, although the present analysis does not test these mechanisms directly. This association may also be related to the greater exposure of smaller vessels to adverse operating conditions such as strong winds and rough seas. This interpretation is also broadly consistent with the learned dependency between wind speed and fishing vessel length in the network.
Time of day also shows a relatively strong association with collision severity, which is consistent with prior research [7,52]. In the present dataset, more severe outcomes are more frequently linked to nighttime conditions, a pattern that may be related to fatigue, reduced vigilance, or greater operational complexity during night navigation. This pattern may also be consistent with the greater operational difficulty of maintaining situational awareness and navigation discipline during nighttime operations, although the model itself does not isolate these possible explanations directly.
Consistent with earlier studies [40,61], wind speed emerged as a critical factor affecting collision outcomes. This association may be related to the greater handling difficulty often reported under strong-wind and rough-sea conditions, especially for smaller fishing vessels. Similarly, visibility demonstrated a high MI value, corroborating findings from previous research [37,61]. This pattern may be consistent with the reduced detectability of approaching vessels under low-visibility conditions, although the present analysis does not isolate this operational mechanism directly.
The seasonal association observed in this study may also reflect broader traffic dynamics along China’s coastline, a pattern that is consistent with earlier studies [46,52]. One possible interpretation is that seasonal variation in fishing activity and vessel traffic density may help explain why casualty patterns differ across the year. In this sense, season should be understood as a contextual variable that co-varies with other operational and environmental conditions, rather than as an independently identified causal factor.
Regarding human liability, the analysis suggests that its association with casualty severity varies across environmental and operational contexts. Commercial-vessel-liability configurations are more frequently associated with severe casualty outcomes when they coincide with winter darkness, high winds, or congested waterways. Conversely, fishing-vessel-liability configurations under similar conditions are more often associated with less severe outcomes, a pattern that may be related to differences in vessel characteristics and operating conditions. This asymmetry suggests that differences in vessel characteristics and operational contexts may help explain the observed variation in casualty severity. Although the MI value for human liability was not among the most prominent, this may result from the structural definition of nodes within the model. Even after disaggregating human factors into multiple variables, the same five dominant factors persisted. The relatively low MI for human liability may be related to its dependence on broader seasonal and environmental contexts, suggesting that its association with casualty severity is not expressed as a uniformly strong marginal pattern.
The practical implications of this study should be interpreted directly in relation to the probabilistic patterns identified by the TAN-BN model. First, because fishing vessel length showed the strongest association with casualty severity in the mutual information analysis, the results suggest that casualty-reduction efforts should give particular attention to the structural vulnerability of fishing vessels, especially smaller or less protected vessels. In practical terms, this points to the importance of strengthening survivability-oriented safety management for fishing vessels operating in mixed-traffic waters, including closer attention to lifesaving equipment, hull condition, emergency preparedness, and safety inspection.
Second, because the model indicates that more severe casualty outcomes are more strongly associated with nighttime operation, poor visibility, and adverse wind conditions, the findings support more targeted risk management under these conditions. This may include strengthened lookout requirements, more proactive warning and communication measures, and intensified monitoring of collision-prone areas during periods of reduced visibility and nighttime navigation.
Third, the MPE results show that severe casualty configurations are more likely to occur in open-water settings than in regulated waterways. This suggests that collision-risk prevention should not focus only on traffic-regulated routes, but should also strengthen surveillance, encounter warning, and risk communication in open-water zones where commercial navigation and fishing activity overlap and vessel behavior may be less predictable.
Fourth, the liability-specific MPE analysis indicates that severe casualty scenarios are more frequently associated with configurations in which commercial vessels are the primary liable party. Although this should not be interpreted as a causal judgment beyond the official reports themselves, it does suggest the need for stronger bridge watchkeeping, lookout discipline, and collision avoidance practice among commercial vessel crews operating in mixed commercial–fishing traffic environments.
Overall, these recommendations are intended as risk-informed implications derived from probabilistic model results rather than as direct evidence-based prescriptions validated through intervention analysis. They reflect the main empirical patterns identified in the analysis—namely fishing vessel vulnerability, adverse environmental exposure, nighttime operational risk, and open-water high-severity configurations—and indicate where safety management efforts may be more effectively prioritized. From a scientific-contribution perspective, the value of this study lies not in methodological innovation, but in the extraction of domain-specific knowledge from a structured set of official investigation reports. The analysis moves beyond descriptive summaries by showing how casualty severity in commercial–fishing vessel collisions is probabilistically associated with a distinct combination of fishing vessel vulnerability, environmental exposure, temporal conditions, and liability-related operational context. In this sense, the study contributes new empirical understanding of consequence formation in a collision class that has more often been examined from the perspective of occurrence, legal responsibility, or individual cases than from casualty severity itself.
The main strength of the study lies in combining multiple official accident reports with a probabilistic framework that captures interdependence among vessel, environmental, temporal, and human liability factors. Compared with single-case analysis or AIS-only data, this approach provides richer contextual insight into casualty-related patterns. At the same time, several limitations should be acknowledged. The dataset is restricted to China’s coastal waters, the sample size remains moderate relative to the model dimensionality, and human factors are represented through a simplified liability-based proxy. In addition, TAN-BN supports association and conditional dependency analysis rather than formal causal inference. Despite these limitations, the results offer useful implications for the quantitative investigation of socio-technical drivers of casualty severity and for the design of more targeted maritime safety interventions.

6. Conclusions

This study examined casualty severity in collisions between commercial and fishing vessels in China’s coastal waters using 137 official accident reports and a TAN-BN framework. In doing so, it addressed the research question by identifying the socio-technical factors most strongly associated with casualty severity, revealing their probabilistic dependency patterns, and characterizing the typical configurations associated with different casualty outcomes.
The results indicate that fishing vessel length shows the strongest association with casualty severity, followed by time of day, wind speed, visibility, and season. Overall, more severe casualty outcomes were more frequently associated with smaller fishing vessels, nighttime conditions, adverse weather, and reduced visibility. The scenario-based MPE analysis further shows that severe casualty configurations are more likely when human liability is primarily attributed to commercial vessel crews, whereas fishing vessel liability is more often associated with less severe outcomes. Taken together, these findings suggest that casualty severity in commercial–fishing vessel collisions is strongly associated with the combined presence of vessel vulnerability, environmental exposure, and liability-related operational conditions.
From a practical perspective, this study provides evidence for improving maritime safety governance in shared coastal waters, particularly through better protection of vulnerable fishing vessels, enhanced night-navigation management, and stronger precautionary measures under adverse weather conditions. Within the scope of China’s coastal waters, these findings are relevant to the resilience and sustainability of coastal maritime socio-technical systems in which commercial shipping and fishing activities coexist.
Nevertheless, the results should be interpreted with caution: the dataset is restricted to China’s coastal waters, the human liability variable is a simplified proxy for human factors, and the probabilistic model supports association and conditional dependency analysis rather than formal causal inference Accordingly, the findings related to human liability indicate associations between reported responsibility configurations and casualty severity, but they do not provide a comprehensive explanation of underlying human causation. In addition, the TAN-BN structure imposes a simplified tree-structured dependency pattern and may not fully represent more complex interactions among variables. The learned dependency structure and conditional probabilities may also be influenced by sample size and variable discretization, particularly for low-frequency state combinations. Because the dataset is geographically limited to China’s coastal waters, the findings should be interpreted as context-specific rather than universally generalizable. Future research could extend the dataset, refine human-factor representation, and incorporate additional validation and uncertainty assessment.

Author Contributions

Conceptualization, H.Z. (Hongzhu Zhou), Y.Z., Y.W. and P.Z.; methodology, H.Z. (Hongzhu Zhou), M.G., Y.Z., F.W., H.Z. (Hongxia Zhou) and P.Z.; software, H.Z. (Hongzhu Zhou), Y.Z. and P.Z.; validation, H.Z. (Hongxia Zhou), Y.Z. and P.Z.; formal analysis, H.Z. (Hongzhu Zhou), Y.Z., F.W., H.Z. (Hongxia Zhou), Y.W., H.Z. (Hongzhu Zhou) and P.Z.; investigation, H.Z. (Hongzhu Zhou), Y.Z., F.W. and H.Z. (Hongxia Zhou); resources, P.Z.; data curation, H.Z. (Hongzhu Zhou), Y.Z. and F.W.; writing—original draft preparation, H.Z. (Hongzhu Zhou), Y.Z. and P.Z.; writing—review and editing, M.G., Y.W. and P.Z.; visualization, H.Z. (Hongzhu Zhou) and Y.Z.; supervision, P.Z.; project administration, P.Z.; funding acquisition, P.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the National Natural Science Foundation of China (52272334, 61074142), Key R&D Program of Zhejiang Province (2024C01180), Ningbo International Science and Technology Cooperation Project (2023H020), EC H2020 Project (690713) and National Key Research and Development Program of China (2017YFE0194700).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

We would like to thank the National “111” Centre on Safety and Intelligent Operation of Sea Bridges (D21013), the Zhejiang 2011 Collaborative Innovation Center for Port Economy, and the Donghai Academy of Ningbo University for the financial support in publishing this paper. The authors would like to thank the K.C. Wong Magna Fund in Ningbo University for sponsorship.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BNBayesian Network
TANTree-Augmented Network
EMExpectation–Maximization
RIFsRisk-Influencing Factors
E-StepExpected Step
M-StepMaximization Step
MPEMost Probable Explanation
MIMutual Information

References

  1. Home | UN Trade and Development (UNCTAD). Available online: https://unctad.org/ (accessed on 5 April 2026).
  2. Wróbel, K.; Montewka, J.; Kujala, P. Towards the Assessment of Potential Impact of Unmanned Vessels on Maritime Transportation Safety. Reliab. Eng. Syst. Saf. 2017, 165, 155–169. [Google Scholar] [CrossRef]
  3. Obeng, F.; Domeh, D.; Khan, F.; Bose, N.; Sanli, E. An Operational Risk Management Approach for Small Fishing Vessel. Reliab. Eng. Syst. Saf. 2024, 247, 110104. [Google Scholar] [CrossRef]
  4. Weng, J.; Li, G. Economic Loss Analysis of Fishing Boat Collisions Considering Spatial-Temporal Interaction Effects. J. Navig. 2020, 73, 1069–1086. [Google Scholar] [CrossRef]
  5. Zhu, C.; Lei, J.; Wang, Z.; Zheng, D.; Yu, C.; Chen, M.; He, W. Risk Analysis and Visualization of Merchant and Fishing Vessel Collisions in Coastal Waters: A Case Study of Fujian Coastal Area. J. Mar. Sci. Eng. 2024, 12, 681. [Google Scholar] [CrossRef]
  6. Zhang, H.; Chen, B.; Zhao, Q.; Yu, J.; Fang, Z. Identification of Risk Key Factors and Prevention Strategies for Collision Accidents between Merchant and Fishing Vessels in China Waters Based on Complex Network. Ocean Eng. 2024, 307, 118148. [Google Scholar] [CrossRef]
  7. Jin, D. The Determinants of Fishing Vessel Accident Severity. Accid. Anal. Prev. 2014, 66, 1–7. [Google Scholar] [CrossRef]
  8. Wang, F.; Du, W.; Feng, H.; Ye, Y.; Grifoll, M.; Liu, G.; Zheng, P. Identification of Risk Influential Factors for Fishing Vessel Accidents Using Claims Data from Fishery Mutual Insurance Association. Sustainability 2023, 15, 13427. [Google Scholar] [CrossRef]
  9. Yi, Z. Study on Collision Between Fishing Vessels and Merchant Ships Within the China Coastal Waters. Master’s Thesis, World Maritime University, Dalian, China, 2015. Available online: https://commons.wmu.se/cgi/viewcontent.cgi?article=1125&context=msem_dissertations (accessed on 5 April 2026).
  10. Liu, B.; Guo, X.; Dong, C. Research on the Characteristics and Causes of Collision Accidents between Merchant Ships and Fishing Vessels in China’s Coastal Waters. Appl. Sci. 2026, 16, 268. [Google Scholar] [CrossRef]
  11. Ye, Y.; Zheng, P.; Wang, Q.; Wong, S.C.; Xu, P. Modeling Economic Loss Associated with Fishing Vessel Accidents: A Bayesian Random-Parameter Generalized Beta of the Second Kind Model with Heterogeneity in Means. Anal. Methods Accid. Res. 2025, 46, 100384. [Google Scholar] [CrossRef]
  12. Guo, M.; Chen, M.; Yuan, L.; Zhang, Z.; Lv, J.; Cai, Z. Investigation of Ship Collision Accident Risk Factors Using BP-DEMATEL Method Based on HFACS-SCA. Reliab. Eng. Syst. Saf. 2025, 257, 110875. [Google Scholar] [CrossRef]
  13. Bhardwaj, U.; Teixeira, A.P.; Guedes Soares, C. Casualty Analysis Methodology and Taxonomy for FPSO Accident Analysis. Reliab. Eng. Syst. Saf. 2022, 218, 108169. [Google Scholar] [CrossRef]
  14. Zhang, Y.; Sun, X.; Chen, J.; Cheng, C. Spatial Patterns and Characteristics of Global Maritime Accidents. Reliab. Eng. Syst. Saf. 2021, 206, 107310. [Google Scholar] [CrossRef]
  15. Przywarty, M.; Jarząbek, D.; Gucma, L. Model of Navigational Safety of Fishing Vessels In Polish EEZ. Annu. Navig. 2014, 21, 133–142. [Google Scholar] [CrossRef]
  16. Oh, J.-H.; Kim, K.; Jeong, J.-S. A Study on the Risk Analysis Based on the Trajectory of Fishing Vessels in the VTS Area. Int. J. e-Navig. Marit. Econ. 2015, 2, 38–46. [Google Scholar] [CrossRef]
  17. Li, H.; Çelik, C.; Bashir, M.; Zou, L.; Yang, Z. Incorporation of a Global Perspective into Data-Driven Analysis of Maritime Collision Accident Risk. Reliab. Eng. Syst. Saf. 2024, 249, 110187. [Google Scholar] [CrossRef]
  18. Ma, L.; Ma, X.; Du, Q.; Zhang, R. Investigation of the Severity of Maritime Accidents Considering the Interaction between Human Factors and Operating Conditions: A Case Study on Collision Accidents in China. Reliab. Eng. Syst. Saf. 2026, 265, 111533. [Google Scholar] [CrossRef]
  19. Tang, H.Q.; Huang, H.L. Analysis of a Collision Accident between a Commercial Ship and a Fishing Ship. Pearl River Water Transp. 2015, 92–95. [Google Scholar] [CrossRef]
  20. Zhang, X.H. Analysis on the Bilateral Liabilities of a Fishing Vessel Related Collision Accident and Associated Countermeasures. China Marit. Saf. 2016, 35–36. [Google Scholar] [CrossRef]
  21. Su, L.S. Analysis of Commercial and Fishing Vessel Collision Accidents Based on the Yellow and Bohai Sea Areas. China Water Transp. 2019, 33–34. [Google Scholar] [CrossRef]
  22. Wu, B.; Wang, Y.; Zhang, J.; Savan, E.E.; Yan, X. Effectiveness of Maritime Safety Control in Different Navigation Zones Using a Spatial Sequential DEA Model: Yangtze River Case. Accid. Anal. Prev. 2015, 81, 232–242. [Google Scholar] [CrossRef]
  23. Yang, J.; Sun, Y.; Song, Q.; Ma, L. Laws and Preventive Methods of Collision Accidents between Merchant and Fishing Vessels in Coastal Area of China. Ocean Coast. Manag. 2023, 231, 106404. [Google Scholar] [CrossRef]
  24. Wang, H.; Liu, Z.; Wang, X.; Graham, T.; Wang, J. An Analysis of Factors Affecting the Severity of Marine Accidents. Reliab. Eng. Syst. Saf. 2021, 210, 107513. [Google Scholar] [CrossRef]
  25. Wang, H.; Liu, Z.; Wang, X.; Huang, D.; Cao, L.; Wang, J. Analysis of the Injury-Severity Outcomes of Maritime Accidents Using a Zero-Inflated Ordered Probit Model. Ocean Eng. 2022, 258, 111796. [Google Scholar] [CrossRef]
  26. Zhou, K.; Xing, W.; Wang, J.; Li, H.; Yang, Z. A Data-Driven Risk Model for Maritime Casualty Analysis: A Global Perspective. Reliab. Eng. Syst. Saf. 2024, 244, 109925. [Google Scholar] [CrossRef]
  27. Trucco, P.; Cagno, E.; Ruggeri, F.; Grande, O. A Bayesian Belief Network Modelling of Organisational Factors in Risk Analysis: A Case Study in Maritime Transportation. Reliab. Eng. Syst. Saf. 2008, 93, 845–856. [Google Scholar] [CrossRef]
  28. Wang, L.; Yang, Z. Bayesian Network Modelling and Analysis of Accident Severity in Waterborne Transportation: A Case Study in China. Reliab. Eng. Syst. Saf. 2018, 180, 277–289. [Google Scholar] [CrossRef]
  29. Fan, S.; Blanco-Davis, E.; Yang, Z.; Zhang, J.; Yan, X. Incorporation of Human Factors into Maritime Accident Analysis Using a Data-Driven Bayesian Network. Reliab. Eng. Syst. Saf. 2020, 203, 107070. [Google Scholar] [CrossRef]
  30. Li, H.; Ren, X.; Yang, Z. Data-Driven Bayesian Network for Risk Analysis of Global Maritime Accidents. Reliab. Eng. Syst. Saf. 2023, 230, 108938. [Google Scholar] [CrossRef]
  31. Jiang, H.; Zhang, J.; Wan, C.; Zhang, M.; Guedes Soares, C. A Data-Driven Bayesian Network Model for Risk Influencing Factors Quantification Based on Global Maritime Accident Database. Ocean Coast. Manag. 2024, 259, 107473. [Google Scholar] [CrossRef]
  32. Wu, B.; Yip, T.L.; Yan, X.; Guedes Soares, C. Review of Techniques and Challenges of Human and Organizational Factors Analysis in Maritime Transportation. Reliab. Eng. Syst. Saf. 2022, 219, 108249. [Google Scholar] [CrossRef]
  33. Puisa, R.; Lin, L.; Bolbot, V.; Vassalos, D. Unravelling Causal Factors of Maritime Incidents and Accidents. Saf. Sci. 2018, 110, 124–141. [Google Scholar] [CrossRef]
  34. Coraddu, A.; Oneto, L.; Navas de Maya, B.; Kurt, R. Determining the Most Influential Human Factors in Maritime Accidents: A Data-Driven Approach. Ocean Eng. 2020, 211, 107588. [Google Scholar] [CrossRef]
  35. Wang, H.; Chen, N.; Wu, B.; Guedes Soares, C. Human and Organizational Factors Analysis of Collision Accidents between Merchant Ships and Fishing Vessels Based on HFACS-BN Model. Reliab. Eng. Syst. Saf. 2024, 249, 110201. [Google Scholar] [CrossRef]
  36. Zhou, H.; Wang, F.; Hu, W.; Grifoll, M.; Liu, J.; Du, W.; Zheng, P. A Novel Framework for Identifying Major Fishing Vessel Accidents and Their Key Influencing Factors. Sustainability 2024, 16, 8246. [Google Scholar] [CrossRef]
  37. Rezaee, S.; Pelot, R.; Ghasemi, A. The Effect of Extreme Weather Conditions on Commercial Fishing Activities and Vessel Incidents in Atlantic Canada. Ocean Coast. Manag. 2016, 130, 115–127. [Google Scholar] [CrossRef]
  38. Uğurlu, F.; Yıldız, S.; Boran, M.; Uğurlu, Ö.; Wang, J. Analysis of Fishing Vessel Accidents with Bayesian Network and Chi-Square Methods. Ocean Eng. 2020, 198, 106956. [Google Scholar] [CrossRef]
  39. Ye, Y.; Liu, M.; Meng, F.; Wong, S.C.; Gao, X.; Yang, Z. Temporal Instability Analysis of Fatal Commercial Fishing Vessel Incidents: A Correlated Random-Parameter Model with Heterogeneity in Means. Reliab. Eng. Syst. Saf. 2026, 274, 112423. [Google Scholar] [CrossRef]
  40. Cao, Y.; Wang, X.; Wang, Y.; Fan, S.; Wang, H.; Yang, Z.; Liu, Z.; Wang, J.; Shi, R. Analysis of Factors Affecting the Severity of Marine Accidents Using a Data-Driven Bayesian Network. Ocean Eng. 2023, 269, 113563. [Google Scholar] [CrossRef]
  41. International Maritime Organization (IMO). Casualty-Related Matters: Reports on Marine Casualties and Incidents; IMO Report: London, UK, 2014. [Google Scholar]
  42. Antão, P.; Soares, C.G. Analysis of the Influence of Human Errors on the Occurrence of Coastal Ship Accidents in Different Wave Conditions Using Bayesian Belief Networks. Accid. Anal. Prev. 2019, 133, 105262. [Google Scholar] [CrossRef] [PubMed]
  43. Obeng, F.; Domeh, V.; Khan, F.; Bose, N.; Sanli, E. Capsizing Accident Scenario Model for Small Fishing Trawler. Saf. Sci. 2022, 145, 105500. [Google Scholar] [CrossRef]
  44. Li, Y.; Cheng, Z.; Yip, T.L.; Fan, X.; Wu, B. Use of HFACS and Bayesian Network for Human and Organizational Factors Analysis of Ship Collision Accidents in the Yangtze River. Marit. Policy Manag. 2022, 49, 1169–1183. [Google Scholar] [CrossRef]
  45. Graziano, A.; Teixeira, A.P.; Guedes Soares, C. Classification of Human Errors in Grounding and Collision Accidents Using the TRACEr Taxonomy. Saf. Sci. 2016, 86, 245–257. [Google Scholar] [CrossRef]
  46. Akyuz, E. A Hybrid Accident Analysis Method to Assess Potential Navigational Contingencies: The Case of Ship Grounding. Saf. Sci. 2015, 79, 268–276. [Google Scholar] [CrossRef]
  47. Liu, K.; Yu, Q.; Yuan, Z.; Yang, Z.; Shu, Y. A Systematic Analysis for Maritime Accidents Causation in Chinese Coastal Waters Using Machine Learning Approaches. Ocean Coast. Manag. 2021, 213, 105859. [Google Scholar] [CrossRef]
  48. Yu, Q.; Teixeira, Â.P.; Liu, K.; Rong, H.; Guedes Soares, C. An Integrated Dynamic Ship Risk Model Based on Bayesian Networks and Evidential Reasoning. Reliab. Eng. Syst. Saf. 2021, 216, 107993. [Google Scholar] [CrossRef]
  49. Zhang, D.; Yan, X.P.; Yang, Z.L.; Wall, A.; Wang, J. Incorporation of Formal Safety Assessment and Bayesian Network in Navigational Risk Estimation of the Yangtze River. Reliab. Eng. Syst. Saf. 2013, 118, 93–105. [Google Scholar] [CrossRef]
  50. Ung, S.T. Navigation Risk Estimation Using a Modified Bayesian Network Modeling-a Case Study in Taiwan. Reliab. Eng. Syst. Saf. 2021, 213, 107777. [Google Scholar] [CrossRef]
  51. Hänninen, M.; Kujala, P. Bayesian Network Modeling of Port State Control Inspection Findings and Ship Accident Involvement. Expert Syst. Appl. 2014, 41, 1632–1646. [Google Scholar] [CrossRef]
  52. Khan, B.; Khan, F.; Veitch, B. A Dynamic Bayesian Network Model for Ship-Ice Collision Risk in the Arctic Waters. Saf. Sci. 2020, 130, 104858. [Google Scholar] [CrossRef]
  53. Khan, R.U.; Yin, J.; Mustafa, F.S.; Liu, H. Risk Assessment and Decision Support for Sustainable Traffic Safety in Hong Kong Waters. IEEE Access 2020, 8, 72893–72909. [Google Scholar] [CrossRef]
  54. Yang, Z.; Yang, Z.; Yin, J. Realising Advanced Risk-Based Port State Control Inspection Using Data-Driven Bayesian Networks. Transp. Res. Part A Policy Pract. 2018, 110, 38–56. [Google Scholar] [CrossRef]
  55. Friedman, N.; Geiger, D.; Goldszmidt, M. Bayesian Network Classifiers. Mach. Learn. 1997, 29, 131–163. [Google Scholar] [CrossRef]
  56. Chow, C.; Liu, C. Approximating Discrete Probability Distributions with Dependence Trees. IEEE Trans. Inf. Theory 1968, 14, 462–467. [Google Scholar] [CrossRef]
  57. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data via the EM Algorithm. J. R. Stat. Soc. B 1977, 39, 1–22. [Google Scholar] [CrossRef]
  58. Do, C.B.; Batzoglou, S. What Is the Expectation Maximization Algorithm? Nat. Biotechnol. 2008, 26, 897–899. [Google Scholar] [CrossRef] [PubMed]
  59. Costello, D.; Xu, H. Generating Certification Evidence for the Certification of Collision Avoidance in Autonomous Surface Vessels. Marit. Policy Manag. 2024, 51, 1497–1516. [Google Scholar] [CrossRef]
  60. Ministry of Transport of the People’s Republic of China. Statistical Measures for Water Traffic Accidents; Ministry of Transport of the People’s Republic of China: Beijing, China, 2014. Available online: https://xxgk.mot.gov.cn/gz/202112/W020220809342434281375.pdf (accessed on 5 April 2026).
  61. Zhao, Z.; Liu, X.; Feng, L.; Grifoll, M.; Feng, H. Causation Analysis of Marine Traffic Accidents Using Deep Learning Approaches: A Case Study from China’s Coasts. Systems 2025, 13, 284. [Google Scholar] [CrossRef]
Figure 1. Accident location map.
Figure 1. Accident location map.
Sustainability 18 04648 g001
Figure 2. Structure of the Bayesian network model (C denotes commercial vessels, and F denotes fishing vessels).
Figure 2. Structure of the Bayesian network model (C denotes commercial vessels, and F denotes fishing vessels).
Sustainability 18 04648 g002
Figure 3. The final TAN Bayesian network model (C denotes commercial vessels, and F denotes fishing vessels).
Figure 3. The final TAN Bayesian network model (C denotes commercial vessels, and F denotes fishing vessels).
Sustainability 18 04648 g003
Figure 4. Mutual information is shared with the target node (node size proportional to MI value; C denotes commercial vessels, and F denotes fishing vessels).
Figure 4. Mutual information is shared with the target node (node size proportional to MI value; C denotes commercial vessels, and F denotes fishing vessels).
Sustainability 18 04648 g004
Figure 5. MPE mode (no casualties and less than 3 serious injuries; the gray node represents the target variable; C denotes commercial vessels, and F denotes fishing vessels).
Figure 5. MPE mode (no casualties and less than 3 serious injuries; the gray node represents the target variable; C denotes commercial vessels, and F denotes fishing vessels).
Sustainability 18 04648 g005
Figure 6. In MPE mode (>10 casualties; the gray node represents the target variable; C denotes commercial vessels, and F denotes fishing vessels).
Figure 6. In MPE mode (>10 casualties; the gray node represents the target variable; C denotes commercial vessels, and F denotes fishing vessels).
Sustainability 18 04648 g006
Figure 7. MPE mode (Selection HE1 for liability; the red dashed circle highlights the selected evidence or key inferred state; C denotes commercial vessels, and F denotes fishing vessels).
Figure 7. MPE mode (Selection HE1 for liability; the red dashed circle highlights the selected evidence or key inferred state; C denotes commercial vessels, and F denotes fishing vessels).
Sustainability 18 04648 g007
Figure 8. MPE mode (Selection HE2 for liability; the red dashed circle highlights the selected evidence or key inferred state; C denotes commercial vessels, and F denotes fishing vessels).
Figure 8. MPE mode (Selection HE2 for liability; the red dashed circle highlights the selected evidence or key inferred state; C denotes commercial vessels, and F denotes fishing vessels).
Sustainability 18 04648 g008
Table 1. Contributing factors to accidents in the literature.
Table 1. Contributing factors to accidents in the literature.
FactorsIndicatorsReferences
Environmental factorsVisibilityCao et al. [40]; Wang et al. [28]; IMO 2014 [41]
LocationIMO 2014 [41]; Li et al. [30]
Wind SpeedCao et al. [40]; Rezaee et al. [37]
Human
factors
Voyage PreparationLi et al. [30]; Li et al. [44]; Uğurlu et al. [38]
Judgment and Decision-MakingAntao et al. [42]; Obeng et al. [43]; Graziano et al. [45]
Navigation ManeuveringAntao et al. [42]; Akyuz et al. [46]
ManagementLi et al. [30]; Obeng et al. [43]; Graziano et al. [45]
Vessel
factors
DefectsLi et al. [30]; Wróbel et al. [2]; Hänninen et al. [51]
LengthLiu et al. [47]; Fan et al. [29]
AgeLiu et al. [47]; Yu et al. [48]; Ung S T [50]
Gross TonnageFan et al. [29]; Yu et al. [48]; Zhang et al. [49]
Other
factors
SeasonLiu et al. [47]; Khan et al. [53]; Wang et al. [8]
TimeKhan et al. [53]; Wang et al. [8]
Table 2. Data-processing and analytical workflow.
Table 2. Data-processing and analytical workflow.
Step No.StageProcedureOutput
Step 1Data sourceCollect official accident investigation reports.Raw accident report source
Step 2Data extractionManually extract relevant information, build an Excel database, and retain only commercial–fishing collision cases.Final case dataset
Step 3Variable codingCode vessel, environmental, temporal, liability, and casualty variables.Structured analytical variables
Step 4DiscretizationStandardize and discretize variables.Model-ready dataset
Step 5TAN modelingConstruct the TAN-BN structure.Probabilistic network structure
Step 6Parameter learningEstimate conditional probabilities using EM.Trained TAN-BN model
Step 7Probability analysisConduct marginal probability analysis.Marginal distributions
Step 8MI analysisCalculate mutual information values.Key associated factors
Step 9MPE inferencePerform MPE analysis under selected conditions.Scenario configurations
Step 10InterpretationInterpret findings and derive implications.Practical conclusions
Table 3. Technical parameters of vessels.
Table 3. Technical parameters of vessels.
ParametersVessel AVessel B
Vessel’s nameDL ACACIAGuiqin Fishery 20216
Type of shipBulk carrierStern trawler
Hull materialSteelWood
Gross ton44,603 ton235 ton
Length224.76 m30.62 m
Ship age10 years7 years
DefectNo defectsNo defects
Table 4. Descriptive statistics of node variables (In the category column of the table, C denotes commercial vessels, and F denotes fishing vessels).
Table 4. Descriptive statistics of node variables (In the category column of the table, C denotes commercial vessels, and F denotes fishing vessels).
Node VariablesDescriptionsValueRatio
C Gross Tonnage
(tons)
[0–500)CGT120.5%
[500–3000)CGT232.1%
3000CGT347.4%
C Age
(years)
[0–20)CA188.9%
20CA211.1%
C Defects *defectiveYes6.5%
defect-freeNo93.5%
F Length
(meters)
[0–12)FL118.9%
[12–24)FL221.3%
24FL359.8%
F Age
(years)
[0–20)FA176.6%
20FA223.4%
F Defects *defectiveYes8.1%
defect-freeNo91.9%
SeasonspringS127.2%
summerS215.3%
autumnS330.6%
winterS426.9%
TimedaylightT127.8%
nightT272.2%
Visibility
(nm)
5good68.6%
[2–5)moderate17.6%
[0–2)poor13.8%
Wind Speed
(Beaufort scale)
Level 0–3WS121.1%
Level 4–6WS269.3%
7 and aboveWS39.6%
Locationwaterway or shipping routeL166.4%
in anchorageL210.3%
open waterL323.3%
Human Liability *commercial vesselsHE150.5%
fishing vesselsHE226.2%
equivalentHE323.3%
Casualties *no casualtiesC120.4%
1–3 casualtiesC254.1%
4–10 casualtiesC322.6%
more than 10 casualtiesC42.9%
* Variables marked with an asterisk (*) are further explained in Table 5.
Table 5. Detailed explanation of each indicator (In the category column of the table, C denotes commercial vessels, and F denotes fishing vessels).
Table 5. Detailed explanation of each indicator (In the category column of the table, C denotes commercial vessels, and F denotes fishing vessels).
Node VariablesDetailed Description
C Defects
  • Including main engine failure, electrical system failure, aging ship’s equipment, ships without valid certificates, illegal modifications, steering and maneuvering system problems, seawater system failures, crewing problems, etc.
F Defects
  • Including mechanical failures, aging ship’s equipment, ships without valid certificates, illegal conversions, design and manufacturing defects, crewing problems, electrical and communications problems, fuel and supply problems, etc.
Human Liability
  • This variable is coded according to the primary liability attribution reported in the official accident investigation and indicates whether human-related responsibility was mainly assigned to the commercial vessel, the fishing vessel, or both parties. It should be understood as a simplified report-based proxy for the dominant side of attributed human-related responsibility in the accident sequence, rather than as a comprehensive human-factors construct. Although the reports often contain richer narrative information on lookout deficiencies, judgment and decision-making problems, maneuvering errors, communication failures, and planning shortcomings, such information is not recorded in a sufficiently standardized and comparable way across all cases to support consistent multi-dimensional coding in the present model. Therefore, this variable retains only one human-related dimension of the accident reports and the corresponding findings should be interpreted within that limited scope.
Casualties
  • There were no casualties (less than 3 serious injuries), 1–3 casualties, 4–10 casualties, or more than 10 casualties (every three additional severe injuries is considered an increase in the number of casualties).
Table 6. TAN network construction procedure.
Table 6. TAN network construction procedure.
Step No.Procedure
Step 1Select casualties as the target variable, and C age, C gross tonnage, C defects, F length, F age, F defects, visibility, location, wind speed, season, time, and human liability as attributes.
Step 2Calculate the conditional mutual information between every two attributes to define their relationship.
Step 3Establish an undirected graph containing attributes and conditional mutual information.
Step 4Establish a spanning tree with maximum weight, ordering edge weights from large to small, and then choose the largest weights between all pairs of attributes.
Step 5Choose an attribute node as the target node which has the greatest spanning tree. In addition, each edge’s direction must be determined. Convert an undirected graph into a directed graph.
Step 6Construct the TAN model by adding the target variable (i.e., casualties) to the tree, and adding the arc from the target variable to each attribute. The TAN model is shown in Figure 2.
Table 7. Results in MPE mode (100% confidence).
Table 7. Results in MPE mode (100% confidence).
FactorsNo Casualties10 or More Casualties
Wind speedLevel 3–6Level 3–6
Visibilitygoodpoor
Timenight/daylightnight
Seasonautumnspring
Locationwaterway or shipping routeopen water
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhou, H.; Zhou, Y.; Wang, F.; Zhou, H.; Wang, Y.; Grifoll, M.; Zheng, P. Socio-Technical Drivers of Casualty Severity in Commercial–Fishing Vessel Collisions: A Bayesian Network Analysis. Sustainability 2026, 18, 4648. https://doi.org/10.3390/su18104648

AMA Style

Zhou H, Zhou Y, Wang F, Zhou H, Wang Y, Grifoll M, Zheng P. Socio-Technical Drivers of Casualty Severity in Commercial–Fishing Vessel Collisions: A Bayesian Network Analysis. Sustainability. 2026; 18(10):4648. https://doi.org/10.3390/su18104648

Chicago/Turabian Style

Zhou, Hongzhu, Yinjie Zhou, Fang Wang, Hongxia Zhou, Yibing Wang, Manel Grifoll, and Pengjun Zheng. 2026. "Socio-Technical Drivers of Casualty Severity in Commercial–Fishing Vessel Collisions: A Bayesian Network Analysis" Sustainability 18, no. 10: 4648. https://doi.org/10.3390/su18104648

APA Style

Zhou, H., Zhou, Y., Wang, F., Zhou, H., Wang, Y., Grifoll, M., & Zheng, P. (2026). Socio-Technical Drivers of Casualty Severity in Commercial–Fishing Vessel Collisions: A Bayesian Network Analysis. Sustainability, 18(10), 4648. https://doi.org/10.3390/su18104648

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop