Integrated Modeling of Maritime Accident Hotspots and Vessel Traffic Networks in High-Density Waterways: A Case Study of the Strait of Malacca
Abstract
1. Introduction
1.1. Problem Statement and Motivation
1.2. Research Questions
- How can historical accident patterns be systematically integrated with real-time vessel traffic dynamics to enhance spatial risk assessment?
- What are the underlying risk mechanisms and associations shaping amplification at specific maritime network nodes and routes, and how can these be interpreted for proactive intervention?
- How can expert maritime domain knowledge be effectively integrated with data-driven insights through fuzzy logic approaches to improve both prediction accuracy and decision interpretability?
1.3. Research Contributions
Delineation of Novelty and Methodological Advancement
2. Literature Review
2.1. Traditional Accident Hotspot Analysis
2.2. AIS Data and Vessel Traffic Network Modeling
2.3. Complex Network Theory in Maritime Analysis
2.4. Formal Safety Assessment and Risk Integration
2.5. Machine Learning and Spatial Analysis in Maritime Domains
2.6. Risk Factors and Causation Analysis
2.7. Research Gap
- Static hotspot analysis methods such as KDE can identify where risks are concentrated but lack explanatory power regarding why risks occur.
- Dynamic AIS-based network analyses capture how vessels interact in real time but lack grounding in long-term accident outcomes.
3. Methodology
- -
- AIS trajectory data (49,964 vessels, 2023–2024)
- -
- Historical accident records (71 incidents)
- -
- Environmental data: ERA5 (ECMWF, Reading, UK; Version [e.g., 5/‘HRES’/release month-year]) and GEBCO_[Year/Release] (British Oceanographic Data Centre, Liverpool, UK)
- -
- Geospatial reference data
- -
- Trajectory skeletonization using Douglas-Peucker algorithm
- -
- Enhanced DBSCAN clustering with directional/speed filters
- -
- Network consolidation via Fréchet distance merging
- -
- Final network: 267 nodes, 263 directed edges
- -
- Environmental risk indicators (weight: 0.25)
- -
- Traffic complexity indicators (weight: 0.35)
- -
- Historical accident patterns (weight: 0.40)
- -
- FAHP integration based on expert consensus
- -
- Spatial risk distribution mapping
- -
- Network centrality analysis
- -
- Integrated risk scoring
- -
- Uncertainty quantification using Monte Carlo methods

3.1. Data Preparation and Preprocessing
- AIS Data:
- 2.
- Historical Accident Records:
- 3.
- Multi-Source Environmental Data Integration
- -
- Visibility: The Strait is highly susceptible to seasonal haze and dense fog, which are consistently cited as primary contributors to multi-vessel collisions in this high-traffic area.
- -
- Current Shear: Due to the Strait’s complex bathymetry, strong current shear (spatial change in current speed/direction) frequently occurs at narrow chokepoints. This significantly complicates ship handling and is a major factor in groundings and contacts.
- -
- Topographic adjustment: Elevation-dependent wind and precipitation correction
- -
- Land–sea transition modeling: Coastal boundary layer effects on visibility and wind patterns
- -
- Bathymetric wave refraction: Shallow water wave transformation using linear wave theory
- -
- Cross-validation between satellite and in situ observations (correlation > 0.85 required)
- -
- Temporal consistency checks (outlier detection using 3-sigma rule)
- -
- Physical plausibility constraints (e.g., wave height vs. wind speed relationships)
- 4.
- Geospatial Data: Coastlines, ports, islands, and maritime boundaries were extracted from vector datasets: Natural Earth (NACIS, Milwaukee, WI, USA); OpenStreetMap (OpenStreetMap Foundation, Cambridge, UK).
- -
- Seasonal stratification: Ensures representation of monsoon seasons, typhoon periods, and calm weather conditions
- -
- Rare event handling: Includes major incidents (oil spills, multi-vessel collisions) occurring once every 2–3 years
Software and Versions
3.2. Traffic Network Construction
3.3. Multi-Factor Risk Assessment
FAHP Workflow and Expert Elicitation
3.4. Risk–Network Coupling and Comprehensive Assessment
Context-Adaptive Weight Adjustment
3.5. Uncertainty Analysis and Confidence Intervals
- High-risk areas: 0.78 [0.71, 0.85]
- Medium-risk areas: 0.45 [0.39, 0.52]
- Low-risk areas: 0.18 [0.12, 0.24]
3.6. Local Environmental Risk Modeling
3.6.1. Visibility Micro-Climatology
- -
- Land–sea thermal gradients: Seasonal monsoon transitions can generate 15–20 °C differences between sea surface temperature and adjacent land, enhancing fog formation.
- -
- Topographic channeling: The mountainous Malaysian coastline creates localized airflow convergence zones, which act as preferential fog corridors.
- -
- Industrial pollution interaction: Emissions from major ports reduce visibility by 30–50% within a 5 km radius, further amplifying the occurrence of restricted-visibility conditions.
3.6.2. Current Shear Modeling
- -
- Monsoon-driven flows: Seasonal reversals generate current differences of 2–3 knots within 1 km.
- -
- Tidal amplification: Shallow regions amplify tidal currents up to fourfold.
- -
- Interaction effects: Opposing winds increase effective wave height by approximately 50%.
3.7. Associational and Quasi-Causal Analysis
3.7.1. Causal Graph Construction
- -
- Weather conditions → Visibility/Sea State → Navigation Difficulty → Human Error → Accidents
- -
- Traffic density → Vessel interactions → Collision probability → Accidents
- -
- Network centrality → Traffic convergence → Interaction frequency → Accidents
3.7.2. Associational Effect Identification
- -
- Traffic density → Accident risk: β = 0.34 [0.28, 0.40], p < 0.001
- -
- Visibility → Navigation difficulty: β = −0.67 [−0.74, −0.60], p < 0.001
- -
- Network centrality → Traffic convergence: β = 0.89 [0.83, 0.95], p < 0.001
3.7.3. Counterfactual Prediction
- -
- Speed limits reduce collision risk by 23% [CI: −0.31, −0.15] through lower vessel interactions.
- -
- Weather routing reduces accidents by 18% [CI: −0.25, −0.11] by avoiding visibility-related navigation difficulty.
- -
- Enhanced VTS information services reduce risk by 14% [CI: −0.21, −0.07] through improved situational awareness.
3.7.4. Counterfactual Analysis
- -
- Chain: Adverse weather → Reduced visibility → Navigation difficulty → Human error → Accident
- -
- Finding: Each 1 km reduction in visibility increased accident probability by 0.67 [0.60, 0.74], p < 0.001.
- -
- Policy: Sensor-based visibility enhancement (radar, lidar) outperforms simple speed restrictions.
- -
- Chain: Network bottlenecks → Traffic convergence → Vessel interactions → Accident
- -
- Finding: Risk scales non-linearly with density (Risk ~ Density^1.73). Each additional vessel within 1 km increased accident odds by 1.34 × [1.28, 1.41], p < 0.001.
- -
- Policy: Temporal separation schemes are more effective than spatial routing adjustments.
- -
- Chain: Past accidents → Risk perception → Cautious behavior, but infrastructure/design unchanged → Accident recurrence
- -
- Finding: Previous accidents increased subsequent risk by 2.1 × [1.7, 2.6] within 5 km, p < 0.001.
- -
- Policy: Hotspot remediation requires physical infrastructure improvements, not merely warnings.
4. Case Study and Experiment Design
4.1. Traffic Network Extraction
- -
- Consolidated network: 267 nodes, 263 directed edges
- -
- Underlying trajectory segments: 34,638 (for detailed analysis)
- -
- Average betweenness centrality: 0.089
- -
- Traffic flow distribution: Northbound (32.1%), Southbound (31.0%), East–West (36.9%)
- -
- Vessel type distribution: Oil tankers (45.4%), Container ships (21.5%), General cargo (22.6%), Others (10.5%)
4.2. Accident Hotspot Identification
- Hotspot 1: Near 1.25° N, 103.95° E, with the highest KDE density (>0.85), corresponding to a congested port approach.
- Hotspot 2: Near 1.75° N, 101.5° E, with KDE density of 0.62, associated with multiple collision events.
4.3. Integrated Risk Mapping
4.4. High-Risk Route Identification
- Route R01: High KDE accident density combined with high betweenness centrality, classified as “very high risk.”
- Route R47: Located in shallow waters with high traffic density, classified as “high risk.”
- Route R78: A tanker–container vessel crossing point with frequent adverse weather exposure, classified as “high risk.”
- -
- Environmental factors: 25.0% average weight
- -
- Traffic complexity: 35.0% average weight
- -
- Historical accidents: 40.0% average weight
- -
- Singapore Strait approaches: 28.6%
- -
- Central Malacca region: 21.4%
- -
- Northern approaches: 18.9%
- -
- Southern exits: 16.8%
- -
- Coastal waters: 14.3%
4.5. Quantitative Evaluation
- -
- Mean AUC: 0.78 ± 0.058 (CV = 7.4%)
- -
- Mean PR-AUC: 0.65 ± 0.052 (CV = 8.0%)
- -
- Stability Coefficient: 0.924 (excellent stability)
- -
- Performance Range: [0.72, 0.84]
- -
- All improvements significant at p < 0.01 level
- -
- Bootstrap iterations: 1000
- -
- Confidence intervals: Bias-corrected bootstrap method
- -
- Validation dataset: 24 incidents (2024)
- -
- AUC: 0.78 [95% CI: 0.74–0.82] vs. best baseline 0.64 [95% CI: 0.59–0.69], difference = 0.14 (Wilcoxon signed-rank test, p < 0.001, n = 1000 bootstrap iterations)
- -
- PR-AUC: 0.65 [95% CI: 0.61–0.69] vs. best baseline 0.49 [95% CI: 0.44–0.54], difference = 0.16 (33% relative improvement, p < 0.001)
- -
- IoU: 0.35 [95% CI: 0.31–0.39] vs. best baseline 0.29 [95% CI: 0.25–0.33], difference = 0.06 (21% relative improvement, p < 0.01)
- -
- Spearman correlation: 0.61 [95% CI: 0.54–0.68] vs. best baseline 0.31 [95% CI: 0.24–0.38], difference = 0.30 (97% relative improvement, p < 0.001)
- -
- Kendall’s tau: 0.50 [95% CI: 0.43–0.57] vs. best baseline 0.27 [95% CI: 0.20–0.34], difference = 0.23 (85% relative improvement, p < 0.001)
4.5.1. Grid-Level Logistic Regression Baseline
- -
- Accident Density (RKDE) and Betweenness Centrality (RCentrality) are the strongest positive predictors of accident risk, with the highest Odds Ratios (implied by the largest positive coefficients of 1.25 and 0.88, respectively), confirming the dual importance of historical incidents and structural traffic flow.
- -
- Traffic Density (RTraffic), Visibility Anomaly (RTraffic), and Current Shear (RCurrent Shear) also show statistically significant positive associations with risk, indicating that even simple models can capture the influence of instantaneous environmental and traffic factors.
| Covariate | Coefficient (β) | Cluster-Robust Std. Err. (CRSE) | Significance | 
|---|---|---|---|
| KDE Density () | 1.25 | (0.15) | *** | 
| Betweenness Centrality () | 0.88 | (0.11) | *** | 
| Traffic Density () | 0.45 | (0.09) | ** | 
| Visibility Anomaly () | 0.32 | (0.07) | * | 
| Current Shear () | 0.21 | (0.05) | * | 
| Intercept () | −5.67 | (0.22) | *** | 
| Model Metrics | |||
| PR-AUC | 0.62 | ||
| AUC | 0.75 | ||
| Observations (Grid Cells) | 34,638 | 
4.5.2. Diagnostic Evaluation: Brier Score and Calibration
- Brier Score (BS) and Calibration:
- 2.
- Residual Analysis:
4.6. Sensitivity Analysis
- -
- Alternative Clustering: Replacing DBSCAN with HDBSCAN (Hierarchical DBSCAN).
- -
- Alternative Weighting: Replacing FAHP weights with Entropy Weighting and Equal Weighting schemes.
- -
- Alternative Centrality: Replacing standard Betweenness Centrality with Current-Flow Betweenness.
4.7. Model Transferability Assessment
4.7.1. Validation Sites
- -
- Singapore Strait (Similar Characteristics): High traffic density (>2100 vessels/day), narrow channels (minimum 2.8 km), and comparable vessel composition (45% tankers, 25% containers).
- -
- Dover Strait (different context): European regulatory environment, dominance of regional flag vessels (65%), and distinct fog-prone, moderate wave climate.
- -
- Bosphorus Strait (extreme constraints): Geographically complex S-shaped channel, mixed passenger–cargo traffic, and strong currents up to 7 knots.
4.7.2. Transfer Learning Process
4.7.3. Results
4.7.4. Parameter Sensitivity Analysis Across Domains
- -
- Highly transferable: network centrality measures, KDE bandwidth principles, and FAHP hierarchy structure.
- -
- Moderately transferable: FAHP weights (affected by cultural/regulatory differences), environmental thresholds, and traffic normalization scales.
- -
- Domain-specific: DBSCAN clustering parameters (channel width dependence), vessel-type risk coefficients, and seasonal variation patterns.
4.8. Multi-Level Validation Strategy
4.8.1. Temporal Validation
4.8.2. Spatial Cross-Validation
4.8.3. Bootstrap Confidence Intervals
4.8.4. Sensitivity and Robustness Analysis
4.8.5. External Validation
- -
- Singapore Maritime Authority (2023–2024, 23 incidents): AUC = 0.73, PR-AUC = 0.58.
- -
- ReCAAP Piracy Database (2023–2024, 31 incidents): AUC = 0.69, PR-AUC = 0.54.
- -
- Lloyd’s List Casualty Database (2023–2024, 18 incidents): AUC = 0.81, PR-AUC = 0.67.
4.8.6. Cross-Validation Results
4.9. Counterfactual Simulation and Intervention Prediction
5. Discussion
5.1. Risk Amplification Mechanisms: Key Findings
- Environmental-Mediated Pathway (β = −0.67, p < 0.001): Reduced visibility conditions (fog, heavy precipitation) increase navigation difficulty, with each 1 km reduction in visibility increasing accident probability by 0.67 times. This pathway accounts for an estimated 34% of observed risk variance through mediation analysis.
- Network Convergence Pathway (β = 0.89, p < 0.001): Traffic bottlenecks with high betweenness centrality (>0.15) create interaction hotspots. Risk scales non-linearly with vessel density (Risk ∼ Density^1.73), with each additional vessel within 1 km increasing accident odds by 1.34 × [1.28, 1.41].
- Historical Persistence Pathway (β = 2.1, p < 0.001): Past incidents increase subsequent risk by 2.1 × within 5 km radius, suggesting infrastructure or procedural factors that perpetuate risk rather than random occurrence.
5.2. Methodological Innovations and Why They Matter
- Route-aware trajectory clustering (“skeletonization → improved DBSCAN → Fréchet merge”) produces a high-resolution, interpretable traffic network rather than a diffuse density map. This enables edge-level risk scoring and targeted interventions on specific segments.
- FAHP translates expert judgments into stable multi-criteria weights (Environmental 0.25; Traffic 0.35; Accident 0.40), yielding a risk surface that is accurate yet explainable to operators.
- Risk–network fusion links grid-level risk to edges/nodes, enabling comparative ranking of routes by both exposure (flow) and topological vulnerability (betweenness).
5.3. Practical Implications for Operations and Policy
5.4. Robustness Checks and Uncertainty
5.5. Limitations
Geographic Transferability Constraints
5.6. Generalizability and Future Extensions
Specific Limitations and Mitigation Strategies
- Temporal Coverage: 1-year validation period provides adequate temporal coverage but longer-term validation would strengthen generalizability. Future work should extend to multi-year validation.
- Sample Size: 71 incidents provide adequate statistical power for spatial analysis (post hoc power analysis: 0.84) but limit rare event modeling capability.
- Geographic Transferability: Model parameters calibrated for Strait of Malacca conditions require re-validation for other waterways with different traffic patterns and environmental conditions.
5.7. Scalability and Implementation Considerations
5.7.1. Computational Scalability
5.7.2. Implementation Architecture
5.7.3. Computational Efficiency Analysis
6. Conclusions and Future Work
6.1. Core Contributions and Novelty
6.2. Validation and Robustness
6.3. Practical Utility and Simulation
6.4. Limitations and Future Work
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| Vector of static FAHP base weights. | |
| Vector of standardized, real-time context indices (e.g., traffic, weather). | |
| Dynamic weighting sensitivity parameter ( used for case study). | |
| Dynamic weighting activation threshold (e.g., ). | |
| Dynamic weighting deactivation threshold ( for hysteresis). | 
Appendix A. FAHP Detailed Matrices
| Factor | |||||
|---|---|---|---|---|---|
| (1, 1, 1) | (1/3, 1/2, 2/3) | (2/3, 1, 3/2) | (3/2, 2, 5/2) | (2, 3, 4) | |
| (3/2, 2, 3) | (1, 1, 1) | (3/4, 1, 5/4) | (3/2, 2, 3) | (5/2, 3, 7/2) | |
| (2/3, 1, 3/2) | (4/5, 1, 4/3) | (1, 1, 1) | (3/2, 2, 5/2) | (2, 3, 4) | |
| (2/5, 1/2, 2/3) | (1/3, 1/2, 2/3) | (2/5, 1/2, 2/3) | (1, 1, 1) | (3/2, 2, 3) | |
| (1/4, 1/3, 1/2) | (2/7, 1/3, 2/5) | (1/4, 1/3, 1/2) | (1/3, 1/2, 2/3) | (1, 1, 1) | 
| Risk Factor | Defuzzified Weight () | Normalized Weight () | 
|---|---|---|
| 0.285 | 0.354 | |
| 0.245 | 0.304 | |
| 0.170 | 0.211 | |
| 0.065 | 0.081 | |
| 0.039 | 0.048 | |
| Sum | 0.804 | 1.000 | 
| Consistency Ratio (CR) | 0.089 | 
Appendix B. Context-Adaptive Weighting Pseudocode
| Algorithm A1. Adaptive Weights Adjustment. Note: The operator ‘ * ’ denotes element-wise multiplication. The hysteresis mechanism (lines 3–6) prevents rapid switching between static and dynamic states, enhancing system stability. | 
| Input: base weights w_{base}, standardized context indices z_{context}, α = 0.3, τ_{on} = 0.8, τ_{off} = 0.7 1: if max(z_{context}) ≥ τ_{on} then 2: w_{dynamic} ← normalize(w_{base} * (1 + α * z_{context})) 3: else if max(z_{context}) ≤ τ_{off} then 4: w_{dynamic} ← w_{base} (Hysteresis: revert to static base weights) 5: else 6: w_{dynamic} ← w_{previous} (Hysteresis: maintain current dynamic state) 7: return w_{dynamic} | 
Appendix C. Environmental Inputs & Validation
Appendix C.1. Temporal and Spatial Alignment
Appendix C.2. Current Shear Validation (Model Output vs. Field Observations)
| Metric | Current Shear (Calculated) vs. Observation (m/s/km) | 
|---|---|
| Root Mean Square Error (RMSE) | (e.g., 0.085) | 
| Mean Absolute Error (MAE) | (e.g., 0.052) | 
| Bias | (e.g., 0.011) | 
| Correlation Coefficient (R) | (e.g., 0.92) | 

Appendix D. DAG & Calibration
Appendix D.1. Association Directed Acyclic Graph (DAG)

Appendix D.2. Calibration Plot

Appendix D.3. Brier Score Summary
| Model | Brier Score (BS) | Interpretation | 
|---|---|---|
| Proposed Integrated Model | 0.075 | Superior Calibration | 
| Baseline 1 (KDE Only) | 0.120 | Poor Calibration | 
| Baseline 2 (Network Only) | 0.105 | Moderate Calibration | 
| Baseline 3 (Logistic Reg.) | 0.098 | Acceptable Calibration | 
Appendix E. Structural Robustness Checks
| Robustness Check | Comparison Methods | Key Metric (PR-AUC) | Result/Finding | 
|---|---|---|---|
| E1. Clustering Method | HDBSCAN vs. DBSCAN (Improved) | 0.68 vs. 0.78 | Improved DBSCAN yields higher PR-AUC due to better noise handling and directional filtering. | 
| E2. Weighting Scheme | FAHP vs. Entropy vs. Equal Weighting | 0.78 vs. 0.65 vs. 0.59 | FAHP provides the highest performance, confirming the benefit of incorporating expert knowledge. | 
| E3. Centrality Metric | Betweenness vs. Current-Flow Betweenness | 0.78 vs. 0.72 | Standard provides a more discriminative factor for accident risk segmentation. | 
Appendix F. Learning Curves
- -
- The model’s performance converges at approximately 80% of training data, confirming sufficient sample size;
- -
- The temporal validation approach is robust, as evidenced by stable performance metrics;
- -
- The model exhibits good sample efficiency, achieving operational performance with limited data.


References
- Zhang, B.; Hirayama, K.; Ren, H.; Wang, D.; Li, J. Ship anomalous behavior detection using clustering and deep recurrent neural network. J. Mar. Sci. Eng. 2023, 11, 763. [Google Scholar] [CrossRef]
- Rong, H.; Teixeira, A.P.; Soares, C.G. Data mining approach to shipping route characterization and anomaly detection based on AIS data. Ocean Eng. 2020, 198, 106936. [Google Scholar] [CrossRef]
- Anderson, T.K. Kernel density estimation and K-means clustering to profile road accident hotspots. Accid. Anal. Prev. 2009, 41, 359–364. [Google Scholar] [CrossRef]
- Xie, Z.; Yan, J. Kernel density estimation of traffic accidents in a network space. Comput. Environ. Urban Syst. 2008, 32, 396–406. [Google Scholar] [CrossRef]
- Erdogan, S.; Yilmaz, I.; Baybura, T.; Gullu, M. Geographical information systems aided traffic accident analysis: A case study in Afyonkarahisar, Turkey. Accid. Anal. Prev. 2008, 40, 174–181. [Google Scholar] [CrossRef]
- Zheng, M.; Xie, X.; Jiang, Y.; Xu, L.; Wang, F. Optimizing kernel density estimation bandwidth for road traffic accident hazard identification: A case study of the City of London. Sustainability 2024, 16, 6969. [Google Scholar] [CrossRef]
- Zhang, R.; Liu, Y.; Wang, Z.; Li, H.; Chen, J. Innovative prediction and causal analysis of accident vehicle towing probability using gradient boosting on road traffic scene data. Accid. Anal. Prev. 2025, 211, 107909. [Google Scholar] [CrossRef] [PubMed]
- Tian, J.; Li, Z.; Zhuang, S.; Xi, J.; Li, M. Grading of traffic interruptions in highways to Tibet based on the entropy weight–TOPSIS method and fuzzy C-means clustering Algorithm. Appl. Sci. 2024, 14, 9094. [Google Scholar] [CrossRef]
- Cai, X.; Li, Z.; Qiao, W.; Sun, Y.; Zhao, T. Research on road traffic safety risk assessment based on radar video integrated sensors. Promet–Traffic Transp. 2025, 37, 523–545. [Google Scholar] [CrossRef]
- Zhang, S.; Yu, X.; Mao, H.; Yao, H.; Li, P. Evaluating expressway safety based on fuzzy comprehensive evaluation with AHP–entropy method: A case study in Zhejiang, China. Systems 2023, 11, 496. [Google Scholar] [CrossRef]
- Bokau, J.R.K.; Samad, R.; Park, Y.; Kim, D. From managing risk to reality: Maritime safety in Makassar Port, Indonesia using FRAM and AIS data analysis. Ocean Eng. 2025, 339, 122012. [Google Scholar] [CrossRef]
- Fan, H.; Jia, H.; He, X.; Lyu, J. Navigating uncertainty: A dynamic Bayesian network-based risk assessment framework for maritime trade routes. Reliab. Eng. Syst. Saf. 2024, 250, 110311. [Google Scholar] [CrossRef]
- Xu, C.; Gao, J.; Zuo, F.; Ozbay, K. Estimating urban traffic safety and analyzing spatial patterns through the integration of city-wide near-miss data: A New York City case study. Appl. Sci. 2024, 14, 6378. [Google Scholar] [CrossRef]
- Wen, T.; Gao, Q.; Chen, Y.; Cheong, K.H. Exploring the vulnerability of transportation networks by entropy: A case study of Asia–Europe maritime transportation network. Reliab. Eng. Syst. Saf. 2022, 226, 108578. [Google Scholar] [CrossRef]
- Mou, F.; Fan, Z.; Li, X.; Wang, L.; Li, X. A Method for Clustering and Analyzing Vessel Sailing Routes Efficiently from AIS Data Using Traffic Density Images. J. Mar. Sci. Eng. 2024, 12, 75. [Google Scholar] [CrossRef]
- Liu, Z.; Zhang, B.; Zhang, M.; Wang, H.; Fu, X. A quantitative method for the analysis of ship collision risk using AIS data. Ocean Eng. 2023, 272, 113906. [Google Scholar] [CrossRef]
- Liu, Z.; Yang, H.; Xiong, F.; Xu, Z.; Gan, L.; Yan, T.; Shu, Y. Research on the Optimization of Ship Trajectory Clustering Based on the OD–Hausdorff Distance and DBSCAN. J. Mar. Sci. Eng. 2024, 12, 1398. [Google Scholar] [CrossRef]
- Yang, Y.; Liu, Y.; Li, G.; Zhang, Z.; Liu, Y. Harnessing the power of machine learning for AIS data-driven maritime research: A comprehensive review. Transp. Res. Part E Logist. Transp. Rev. 2024, 183, 103426. [Google Scholar] [CrossRef]
- Barabási, A.L.; Albert, R. Emergence of scaling in random networks. Science 1999, 286, 509–512. [Google Scholar] [CrossRef]
- Ducruet, C.; Notteboom, T. The worldwide maritime network of container shipping: Spatial structure and regional dynamics. Glob. Netw. 2012, 12, 395–423. [Google Scholar] [CrossRef]
- Kaluza, P.; Kölzsch, A.; Gastner, M.T.; Blasius, B. The complex network of global cargo ship movements. J. R. Soc. Interface 2010, 7, 1093–1103. [Google Scholar] [CrossRef]
- Peng, P.; Yang, Y.; Lu, F.; Cheng, S.; Mou, N.; Yang, R. Modelling the competitiveness of the ports along the Maritime Silk Road with big data. Transp. Res. Part A Policy Pract. 2018, 118, 852–867. [Google Scholar] [CrossRef]
- Zhang, D.; Tao, J.; Wan, C.; Huang, L.; Yang, M. Resilience analysis of maritime transportation networks: A systematic review. Transp. Saf. Environ. 2024, 6, tdae009. [Google Scholar] [CrossRef]
- Goodarzi, H.A.; Jabbarzadeh, A.; Fahimnia, B.; Paquet, M. Evaluating the sustainability and resilience of an intermodal transport network leveraging consolidation strategies. Transp. Res. Part E Logist. Transp. Rev. 2024, 188, 103616. [Google Scholar] [CrossRef]
- International Maritime Organization (IMO). Formal Safety Assessment (FSA)—Cruise Ships; Report Submitted by Denmark, MSC 83/INF.2; IMO: London, UK, 2007. [Google Scholar]
- Kristiansen, S. Maritime Transportation: Safety Management and Risk Analysis; Butterworth-Heinemann: Oxford, UK, 2013. [Google Scholar]
- Montewka, J.; Hinz, T.; Kujala, P.; Matusiak, J. Probability modelling of vessel collisions. Reliab. Eng. Syst. Saf. 2010, 95, 573–589. [Google Scholar] [CrossRef]
- Silveira, P.A.M.; Teixeira, A.P.; Soares, C.G. Use of AIS data to characterise marine traffic patterns and ship collision risk off the coast of Portugal. J. Navig. 2013, 66, 879–898. [Google Scholar] [CrossRef]
- Ma, Q.; Tang, H.; Liu, C.; Zhang, M.; Zhang, D.; Liu, Z.; Zhang, L. A big data analytics method for the evaluation of maritime traffic safety using automatic identification system data. Ocean Coast. Manag. 2024, 251, 107077. [Google Scholar] [CrossRef]
- Durlik, I.; Wróbel, K.; Montewka, J. Artificial Intelligence in Maritime Transportation: A Review. Appl. Sci. 2024, 14, 8420. [Google Scholar] [CrossRef]
- Rong, H.; Teixeira, A.P.; Soares, C.G. Ship collision avoidance behaviour recognition and analysis based on AIS data. Ocean Eng. 2022, 245, 110479. [Google Scholar] [CrossRef]
- Ripley, B.D. Modelling spatial patterns. J. R. Stat. Soc. Ser. B Methodol. 1977, 39, 172–212. [Google Scholar] [CrossRef]
- Rong, H.; Teixeira, A.P.; Guedes Soares, C. Spatial correlation analysis of near-ship collision hotspots with local maritime traffic characteristics. Reliab. Eng. Syst. Saf. 2021, 209, 107463. [Google Scholar] [CrossRef]
- Liu, J.; Shi, G.; Zhu, K. Vessel trajectory prediction model based on AIS sensor data and adaptive chaos differential evolution support vector regression (ACD-SVR). Appl. Sci. 2019, 9, 2983. [Google Scholar] [CrossRef]
- Wu, X.; Rahman, A.; Zaloom, V.A. Study of travel behavior of vessels in narrow waterways using AIS data. Ocean Eng. 2018, 147, 399–413. [Google Scholar] [CrossRef]
- Zhang, W.; Goerlandt, F.; Montewka, J.; Kujala, P. A method for detecting possible near miss ship collisions from AIS data. Ocean Eng. 2015, 107, 60–69. [Google Scholar] [CrossRef]
- Weng, J.; Meng, Q.; Qu, X. Vessel Collision Frequency Estimation in the Singapore Strait. J. Navig. 2012, 65, 207–221. [Google Scholar] [CrossRef]
- Goerlandt, F.; Montewka, J. Maritime transportation risk analysis: Review and analysis in light of some foundational issues. Reliab. Eng. Syst. Saf. 2015, 138, 115–134. [Google Scholar] [CrossRef]
- Li, S.; Meng, Q.; Qu, X. An overview of maritime waterway quantitative risk assessment models. Risk Anal. 2012, 32, 496–512. [Google Scholar] [CrossRef] [PubMed]
- Qu, X.; Meng, Q.; Suyi, L. Ship collision risk assessment for the Singapore Strait. Accid. Anal. Prev. 2011, 43, 2030–2036. [Google Scholar] [CrossRef]




| Category | Parameter | Value | Method/Source | 
|---|---|---|---|
| Study Area | Geographic Bounds | 100.7° E–104.3° E, 1.0° N–6.8° N | Strait of Malacca main channel | 
| Study Area | Spatial Resolution | 1 km × 1 km | Grid-based analysis | 
| Study Area | Temporal Period | 2023–2024 (2 years) | Temporal holdout validation | 
| Dataset | AIS Trajectories | 49,964 vessels | Automatic Identification System | 
| Dataset | Accident Records | 71 incidents (47 + 24) | Multi-source integration | 
| Dataset | Training/Validation | 2023/2024 split | Zero information leakage | 
| Clustering | DBSCAN Epsilon | 1500 m | Grid search optimization | 
| Clustering | Min Samples | 5 points | Sensitivity analysis | 
| Clustering | Direction Filter | ±30° tolerance | Consistency checking | 
| Clustering | Speed Range | 0.5–30 knots | Vessel capability limits | 
| Risk Assessment | KDE Bandwidth | 30 km | Cross-validation (Selected via 5-fold Spatial CV based on max Log-Likelihood) | 
| Risk Assessment | Expert Panel | 10 professionals | Maritime domain experts | 
| Risk Assessment | FAHP Weights | 0.25/0.35/0.40 | Expert consensus (CR = 0.089) | 
| Network | Final Nodes | 267 clusters | Traffic convergence zones | 
| Network | Final Edges | 263 connections | Directed route segments | 
| Risk Category (Weight) | Sub-Factor | Local Weight | Global Weight | 
|---|---|---|---|
| Environmental Risk (0.25) | Wind Speed | 0.28 | 0.070 | 
| Environmental Risk (0.25) | Wave Height | 0.24 | 0.060 | 
| Environmental Risk (0.25) | Visibility | 0.22 | 0.055 | 
| Environmental Risk (0.25) | Others | 0.26 | 0.065 | 
| Traffic Risk (0.35) | Vessel Density | 0.32 | 0.112 | 
| Traffic Risk (0.35) | Network Centrality | 0.24 | 0.084 | 
| Traffic Risk (0.35) | Vessel Diversity | 0.18 | 0.063 | 
| Traffic Risk (0.35) | Others | 0.26 | 0.091 | 
| Accident Risk (0.40) | Historical Density | 0.45 | 0.180 | 
| Accident Risk (0.40) | Severity Weight | 0.28 | 0.112 | 
| Accident Risk (0.40) | Others | 0.27 | 0.108 | 
| Metric | Proposed FAHP | KDE Only | Network Only | DBSCAN Only | p-Value | 
|---|---|---|---|---|---|
| AUC | 0.78 (0.74–0.82) | 0.64 (0.59–0.69) | 0.58 (0.52–0.64) | 0.61 (0.56–0.66) | <0.001 | 
| PR-AUC | 0.65 (0.61–0.69) | 0.49 (0.44–0.54) | 0.42 (0.37–0.47) | 0.46 (0.41–0.51) | <0.001 | 
| Route ID | Start Latitude | Start Longitude | End Latitude | End Longitude | Risk Score | Accident Count 2023–24 | Main Risk Factors | Intervention Priority | 
|---|---|---|---|---|---|---|---|---|
| R01 | 1.25° N | 103.95° E | 1.31° N | 103.88° E | 0.847 | 8 | Traffic convergence, Port proximity | Immediate | 
| R47 | 1.75° N | 101.50° E | 1.82° N | 101.45° E | 0.782 | 6 | Shallow waters, High density | High | 
| R78 | 2.15° N | 102.30° E | 2.22° N | 102.35° E | 0.756 | 5 | Weather exposure, Crossing point | High | 
| R23 | 1.89° N | 103.12° E | 1.95° N | 103.18° E | 0.723 | 4 | Current shear, Visibility | Medium | 
| R56 | 3.45° N | 100.98° E | 3.52° N | 101.05° E | 0.698 | 3 | Narrow channel, Tidal effects | Medium | 
| Rank | Node ID | Latitude | Longitude | Betweenness Centrality | Daily Traffic | Risk Level | 
|---|---|---|---|---|---|---|
| 1 | SB_059 | 1.354° N | 103.847° E | 0.245 | 1247 | Very High | 
| 2 | SB_023 | 2.187° N | 102.456° E | 0.198 | 892 | High | 
| 3 | SB_078 | 1.789° N | 101.234° E | 0.176 | 756 | High | 
| 4 | SB_145 | 3.456° N | 100.987° E | 0.134 | 634 | Medium | 
| 5 | SB_089 | 2.098° N | 103.234° E | 0.127 | 578 | Medium | 
| 6 | SB_156 | 4.123° N | 101.567° E | 0.098 | 445 | Medium | 
| 7 | SB_034 | 1.987° N | 102.789° E | 0.087 | 389 | Low | 
| 8 | SB_167 | 3.234° N | 100.678° E | 0.076 | 334 | Low | 
| 9 | SB_201 | 2.456° N | 103.456° E | 0.065 | 278 | Low | 
| 10 | SB_112 | 1.678° N | 102.123° E | 0.054 | 223 | Very Low | 
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, S.; Cai, X.; Qiao, J.; Yang, J.-B. Integrated Modeling of Maritime Accident Hotspots and Vessel Traffic Networks in High-Density Waterways: A Case Study of the Strait of Malacca. J. Mar. Sci. Eng. 2025, 13, 2052. https://doi.org/10.3390/jmse13112052
Chen S, Cai X, Qiao J, Yang J-B. Integrated Modeling of Maritime Accident Hotspots and Vessel Traffic Networks in High-Density Waterways: A Case Study of the Strait of Malacca. Journal of Marine Science and Engineering. 2025; 13(11):2052. https://doi.org/10.3390/jmse13112052
Chicago/Turabian StyleChen, Sien, Xuzhe Cai, Jiao Qiao, and Jian-Bo Yang. 2025. "Integrated Modeling of Maritime Accident Hotspots and Vessel Traffic Networks in High-Density Waterways: A Case Study of the Strait of Malacca" Journal of Marine Science and Engineering 13, no. 11: 2052. https://doi.org/10.3390/jmse13112052
APA StyleChen, S., Cai, X., Qiao, J., & Yang, J.-B. (2025). Integrated Modeling of Maritime Accident Hotspots and Vessel Traffic Networks in High-Density Waterways: A Case Study of the Strait of Malacca. Journal of Marine Science and Engineering, 13(11), 2052. https://doi.org/10.3390/jmse13112052
 
        

 
                                                
 
       