Conflict-Based Models for Real-Time Crash Risk Assessment: A State-of-the-Art Review

Jackai II, Isaac Ndumbe; Tezong Feudjio, Steffel Ludivin; Ndingwan, Tevoh Lordswill; Dindze, Olive Dubila; Usami, Davide Shingo; Gonzalez-Hernandez, Brayan; Persia, Luca

doi:10.3390/futuretransp6030107

Open AccessSystematic Review

Conflict-Based Models for Real-Time Crash Risk Assessment: A State-of-the-Art Review

by

Isaac Ndumbe Jackai II

^1,*

,

Steffel Ludivin Tezong Feudjio

¹

,

Tevoh Lordswill Ndingwan

¹

,

Olive Dubila Dindze

¹,

Davide Shingo Usami

¹

,

Brayan Gonzalez-Hernandez

²

and

Luca Persia

¹

Centre of Research for Transport and Logistics, Sapienza University of Rome, Via Eudossiana 18, 00184 Rome, Italy

²

Department of Civil and Environmental Engineering (DICEA), University of Naples Federico II, Via Claudio 21, 80125 Naples, Italy

^*

Author to whom correspondence should be addressed.

Future Transp. 2026, 6(3), 107; https://doi.org/10.3390/futuretransp6030107

Submission received: 31 March 2026 / Revised: 5 May 2026 / Accepted: 11 May 2026 / Published: 18 May 2026

Download

Browse Figures

Review Reports Versions Notes

Abstract

Real-time crash risk assessment is a key component of proactive road safety management, enabling the identification of hazardous conditions within short temporal intervals before crashes occur. Traditional crash-based models are unsuitable for such applications due to the rarity, reporting delay, and stochastic nature of crash data. Traffic conflicts, capturing near-miss interactions between road users, provide a practical alternative for real-time safety analysis. Over the past decade, numerous modelling approaches have been developed to translate conflict information into crash risk estimates; however, the literature remains fragmented and lacks a unified analytical synthesis. This review presents a state-of-the-art, model-centric analysis of conflict-based approaches, classifying them into five paradigms: statistical/regression-based, Bayesian, extreme value theory (EVT), machine learning (ML), and hybrid models. Beyond classification, the study conducts a structured cross-paradigm comparison across key dimensions, including conflict representation, data characteristics, temporal modelling, uncertainty treatment, validation strategies, computational complexity, and operational readiness. The paradigms are further interpreted through the complementary lenses of conflict frequency and severity. The review identifies key research gaps, including fragmented conflict definitions, challenges in modelling rare and extreme events, incomplete treatment of uncertainty and spatiotemporal dynamics, and limitations in validation, transferability, and deployment. Emerging research directions include standardized and adaptive conflict indicators, EVT–machine learning integration, integrated uncertainty-aware frameworks, advanced spatiotemporal modelling, transferable models, and scalable real-time implementation. By combining structured evidence mapping and cross-paradigm synthesis, this study supports model selection, development, and deployment for dynamic crash risk assessment.

Keywords:

real-time; traffic conflicts; conflict-based modelling

1. Introduction

Proactive road safety analysis aims to identify and mitigate hazardous conditions before crashes occur; however, crash-based models, which rely on rare and often underreported events that must accumulate over extended periods to support analysis [1,2] are inherently unsuitable for such applications, particularly in real-time contexts. This has led to an increasing reliance on traffic conflicts as surrogate safety indicators, which capture high-frequency interactions among road users that are associated with elevated crash likelihood and allow for more timely safety assessment [3,4,5].

The rapid development of sensing technologies, particularly video analytics and trajectory data extraction, has substantially expanded the scope of conflict-based modelling. Recent studies increasingly rely on high-resolution trajectory data derived from UAVs, computer vision, and sensor fusion systems to capture detailed spatiotemporal interactions among road users [6,7,8,9]. Early approaches primarily relied on statistical and regression formulations to model conflict frequency and severity [3,4,10]. More recent work has incorporated temporal structures and short-term updating mechanisms, including cycle-level modelling and sliding time windows enabling risk estimation over short intervals rather than purely static representations [11,12,13]. In parallel, Bayesian frameworks have been increasingly adopted to explicitly account for uncertainty, heterogeneity, and data limitations in dynamic traffic environments [12,14,15].

A major methodological development in the field has been the adoption of extreme value theory (EVT), which focuses on modelling rare but safety-critical conflict events. EVT-based approaches provide a probabilistic framework linking the tails of conflict distributions to crash likelihood, and have demonstrated strong potential for short-term crash risk estimation across different traffic environments [5,6,14,16,17,18]. At the same time, machine learning methods have gained prominence due to their ability to capture complex nonlinear relationships in high-dimensional data, particularly when using trajectory and video-based inputs [7,9,19,20]. While these approaches often achieve strong predictive performance, their interpretability and explicit treatment of uncertainty remain limited in many applications, although recent studies have begun to address these limitations through explainable AI and probabilistic deep learning frameworks [7,8].

More recently, hybrid modelling strategies have emerged, combining probabilistic and data-driven techniques to leverage complementary strengths. Examples include the integration of EVT with machine learning for crash risk forecasting [21], Bayesian–EVT frameworks for multi-level conflict modelling [22], machine learning–Bayesian spatial frameworks [23], and hierarchical hybrid models that jointly address occurrence and frequency of conflicts [24,25]. Such developments reflect a growing emphasis on balancing predictive performance, interpretability, uncertainty quantification, and operational feasibility.

Despite this rapid methodological evolution, the literature remains fragmented, with limited efforts to systematically compare modelling paradigms across key analytical dimensions. Existing studies differ substantially in terms of data sources (e.g., trajectory-based vs. aggregated traffic states), temporal representation (instantaneous, short-horizon, or cycle-level), conflict indicators (e.g., TTC, PET, MTTC, or alternative measures), and modelling objectives (frequency, severity, or risk evolution), making direct comparison challenging [8,12,13].

This review addresses these limitations by providing a structured, model-centric synthesis of conflict-based approaches for real-time crash risk assessment. The literature is organized into five major modelling paradigms: statistical and regression-based models, Bayesian frameworks, EVT-based approaches, machine learning methods, and hybrid models. This classification enables a systematic comparison of their underlying assumptions and methodological characteristics.

Given the evolving and interdisciplinary nature of the field, many studies combine elements from different modelling paradigms. Accordingly, the classification adopted in this review allows for overlap across paradigms, such that individual studies may be associated with more than one methodological category when they incorporate multiple modelling components. Hybrid models are, however, distinguished as a separate category, referring specifically to studies that explicitly integrate two or more paradigms within a unified modelling framework. In this sense, the classification is interpreted as heuristic rather than strictly mutually exclusive, reflecting the inherent methodological convergence and overlap that characterize the literature. To move beyond descriptive categorisation, the review adopts a structured cross-paradigm framework that examines models across multiple analytical dimensions organized into two complementary groups: (i) model-intrinsic methodological characteristics, which describe how models are formulated and evaluated, and (ii) application, data, and deployment characteristics, which describe how models are applied in real-world traffic environments. This distinction enables a clearer separation between methodological capability and practical applicability.

Beyond descriptive classification, the review makes three main contributions that address key gaps within the context of conflict-based real-time and short-term crash risk assessment. First, it advances a model-centric perspective that shifts the focus from application-specific studies to underlying modelling paradigms and their analytical logic. Second, it provides a structured cross-paradigm comparison across both methodological and application-oriented dimensions, enabling a systematic evaluation of the relative strengths, limitations, and deployment potential of different approaches. Third, it introduces a conceptual distinction between conflict frequency and conflict severity, treating them as complementary and cross-cutting dimensions of real-time crash risk, and using this lens to interpret model behaviour across paradigms.

The remainder of this paper is structured as follows (see Figure 1). Section 2 outlines the systematic review methodology. Section 3 presents the analytical synthesis of the literature, including a model-centric comparison across paradigms and a frequency–severity interpretive framework. Section 4 discusses key research gaps and emerging directions. Finally, Section 5 concludes the paper and provides actionable recommendations for research and policymakers.

2. Methodology

This study adopts the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines [26] to ensure a transparent, systematic, and reproducible review process. The objective is to comprehensively identify and synthesize studies examining the use of traffic conflicts as surrogate safety indicators for real-time and short-term crash risk assessment, with particular emphasis on modelling approaches.

A structured literature search was conducted across Scopus, Web of Science, and ProQuest, selected for their broad and multidisciplinary coverage of transportation, engineering, and data-driven research. These databases were accessed between 6 January and 28 March 2026.

Initial queries based on core terms (traffic conflict, conflict-based, real-time, and short-term) were progressively expanded to include related expressions such as surrogate safety, crash precursors, near-miss, dynamic, prediction, forecast, online, and near real-time. This expansion accounts for variations in terminology across studies describing similar concepts. Boolean operators (OR, AND) and keyword combinations were refined for each database to maximize retrieval efficiency while minimizing irrelevant results.

To ensure relevance and consistency, the search was restricted to peer-reviewed journal articles published in English between 2016 and 2026, thereby focusing on recent methodological developments, including the emergence of machine learning, extreme value theory, and hybrid modelling approaches. While this restriction may exclude earlier foundational work, it allows the review to reflect the current state of the art.

The study selection process consisted of four sequential stages:

Identification: The initial search yielded 853 records (233 from Scopus, 212 from Web of Science, and 408 from ProQuest). These records were consolidated into a unified dataset, and duplicates were removed to prevent bias arising from overlapping database indexing.
Screening: Titles and abstracts were reviewed to exclude studies not directly related to traffic conflicts, surrogate safety indicators, or real-time crash risk assessment. This step ensured conceptual alignment with the objectives of the review by filtering out studies focused solely on crash data or unrelated traffic phenomena.
Eligibility: Of the remaining 75 studies, 19 were excluded due to inaccessibility or invalid DOI. Full-text articles were assessed to determine methodological relevance. Studies were retained only if they (i) explicitly addressed real-time or short-term conflict-based modelling, and (ii) provided sufficient methodological detail to support meaningful comparison. Studies were excluded if they focused only on historical crash-frequency modelling, did not use traffic conflicts or surrogate safety indicators, lacked real-time or short-term relevance, were not peer-reviewed journal articles, were not written in English, or did not provide sufficient methodological detail for comparison.
Inclusion: A total of 56 studies were retained. To enhance coverage, a snowballing approach [27] was applied by examining reference lists and citations of eligible studies, identifying 14 additional relevant articles. This resulted in a final sample of 70 studies included in the review.

The screening and eligibility assessment were conducted by two independent reviewers to minimize subjective bias. Each reviewer independently evaluated titles, abstracts, and full-text articles against the predefined inclusion criteria. Discrepancies in study selection were discussed and resolved through consensus. This procedure ensured consistency in study selection and reduced the risk of individual reviewer bias.

To enhance reproducibility, the inclusion and exclusion criteria were defined a priori and applied consistently across all stages of the review. A standardized screening protocol was used to guide the evaluation of each record, including explicit criteria related to study scope, methodological relevance, and data adequacy. Prior to full screening, a pilot screening phase was conducted on a subset of studies to align reviewer interpretation of the criteria. Ambiguous cases were re-evaluated through joint discussion to ensure consistent application of the criteria.

Given the heterogeneity of modelling approaches, data sources, and evaluation metrics, a qualitative, model-centric synthesis was adopted. A quantitative meta-analysis was not appropriate due to the absence of standardized outcome measures and the substantial methodological diversity across studies. Instead, studies were categorized based on modelling paradigms and analyzed across key analytical dimensions. Classification was based on the different methodological approaches identified within each study, allowing for overlap where hybrid or multi-method structures were present. Accordingly, the classification is interpreted as heuristic and non-exclusive, reflecting the inherently hybrid and evolving nature of modelling approaches rather than rigid categorical boundaries.

The overall study identification and selection process is summarized in Figure 2.

3. Results and Analytical Synthesis

3.1. Descriptive Analysis

A summary of the reviewed studies is presented in Table 1. Excluding the two methodological references [26,27] and the three review/synthesis papers [1,28,29], the analytical dataset comprises 67 model-focused studies. Because several studies combine multiple modelling logics, the paradigm categories are non-mutually exclusive; thus, counts across categories exceed the number of unique studies. Machine learning and deep learning approaches appear most frequently, with 37 studies, followed by Bayesian probabilistic approaches (18 studies) and EVT-based models (18 studies), then statistical and regression-based models (17 studies) and hybrid frameworks (14 studies).

In line with the classification framework outlined in Section 1, studies may be associated with multiple paradigms when they incorporate different methodological components. Hybrid models are, however, identified more selectively, referring only to studies that explicitly integrate two or more paradigms within a unified modelling framework. Accordingly, hybrid studies are also reflected within their constituent paradigms (e.g., statistical, Bayesian, EVT, or machine learning) where applicable, while being distinguished as a separate category to represent their integrated design. This treatment ensures consistency between the paradigm counts and the underlying methodological composition of the reviewed studies. In particular, Bayesian methods used solely for parameter estimation within another modelling framework (e.g., EVT) are not treated as separate paradigms, whereas time-series formulations (e.g., ARIMA) are classified under the statistical/regression-based paradigm as extensions for modelling temporal dependence.

The distribution of studies across paradigms is further clarified in Figure 3 through the UpSet representation, which highlights both exclusive and overlapping modelling structures. The results indicate that a substantial proportion of studies are concentrated within single paradigms, particularly machine learning (28 studies) and statistical/regression-based approaches (10 studies), while a smaller but meaningful subset adopts dual or hybrid configurations. EVT-based approaches appear only occasionally as standalone models (three studies) and are more frequently integrated with other paradigms, reflecting their primary role in modelling extreme risk rather than general conflict occurrence. Similarly, Bayesian approaches appear as standalone models in four studies and more commonly intersect with EVT (four studies in B+E) and, to a lesser extent, machine learning (two studies in hybrid B+M), supporting uncertainty quantification and probabilistic inference within more complex modelling pipelines. Hybrid approaches (14 studies) remain limited but represent the most methodologically integrated designs, combining complementary strengths across paradigms. Among these, the most common configurations are Bayesian–EVT integrations (six studies), followed by EVT–machine learning (two studies), Bayesian–machine learning (two studies), statistical–EVT (two studies), statistical–machine learning (one study), and a single three-way integration (Bayesian–EVT–machine learning).

Overall, the descriptive results indicate three main patterns: first, machine learning approaches dominate the recent literature numerically; second, probabilistic, EVT-based, and hybrid models remain important because they address uncertainty, rare events, and integrated risk representation; third, methodological sophistication does not necessarily imply operational maturity, since deployment also depends on data availability, computational feasibility, interpretability, validation depth, and transferability. These patterns motivate the cross-paradigm analytical synthesis in Section 3.2 and the frequency–severity interpretation developed in Section 3.3.

3.2. Model-Centric Synthesis

Organizing the reviewed studies into five modelling paradigms provides a useful classification of the literature, consistent with prior reviews [1,28]. However, because several studies combine multiple modelling logics, this classification is interpreted as heuristic rather than strictly mutually exclusive. To move beyond descriptive categorisation, the following synthesis compares the five paradigms across nine analytical dimensions: conflict representation and modelling logic; application context and traffic environment; data requirements and input characteristics; sample size and dataset scale; uncertainty and heterogeneity modelling; temporal modelling and prediction horizon; model performance and validation strategy; computational complexity and implementation; operational readiness and transferability. These dimensions are summarized in Table 2 and Table 3, where model-intrinsic methodological characteristics and application/deployment characteristics are synthesized into structured comparative frameworks.

3.2.1. Conflict Representation and Modelling Logic

The five paradigms differ fundamentally in how they define and operationalise traffic conflicts. Statistical and regression-based models typically rely on threshold-based surrogate safety indicators, where conflicts are converted into counts, rates, or severity classes. Ref. [3] modelled rear-end conflicts at signalized intersections using cycle-level TTC-based counts, testing thresholds from 0.5 s to 3.0 s and adopting TTC ≤ 1.5 s as the main safety performance threshold. Ref. [4] extended this logic using TTC ≤ 2.5 s, MTTC ≤ 2.5 s, and DRAC ≥ 1.5 m/s², while ref. [11] compared conflict rate indicators using TTC ≤ 3.0 s, MTTC ≤ 3.0 s, and DRAC ≥ 1.5 m/s². These models therefore follow a count-based or censored rate logic and are mainly suited to representing conflict occurrence within short operational intervals.

This threshold-based logic has also been adapted to more diverse conflict types and road environments, including bicycle–vehicle conflicts [50], left-turn conflicts reconstructed from vehicle trajectories [51], rear-end conflicts at unsignalized intersections [7,30], vehicle–pedestrian interactions [31], and sharp curve traffic conflicts [32]. In freeway and heterogeneous traffic settings, conflict representation has been extended through severity indices, PET categories, macroscopic safety measures, and context-specific indicators [33,34,35,52]. More recent studies have also introduced composite or dependence-based indicators, including video-based TTC–ML representations in weaving areas [53], temporal margin and behavioural feature indicators for vehicle–bicycle conflicts [36], and vine copula-based spatial dependence modelling of ramp area conflict risk [13].

Bayesian models often use similar conflict indicators but embed them within probabilistic structures. Refs. [4,10,11] retained TTC-, MTTC-, and DRAC-based conflict definitions but estimated conflict risk through full Bayesian or Bayesian Tobit models. Bayesian formulations therefore do not necessarily redefine conflicts; rather, they reinterpret conflict occurrence probabilistically by estimating posterior distributions of risk, parameters, and heterogeneity. This logic is extended in spatial and hierarchical settings by [13,23,33,43].

EVT-based models represent a distinct modelling logic because they focus on extreme conflicts rather than all observed conflicts. Refs. [2,15] modelled extremes of surrogate safety indicators using BM–GEV or POT–GPD frameworks. Refs. [5,16,44] further developed Bayesian or hierarchical EVT formulations. In this paradigm, the analytical focus shifts from the frequency of all conflicts to the tail behaviour of severe interactions that are more closely associated with crash occurrence.

Machine learning models adopt a more data-driven representation. Rather than relying only on predefined TTC or PET thresholds, ML models learn conflict patterns from trajectory, video, LiDAR, connected vehicle, or communication-based data. Refs. [9,37,54] used high-dimensional data to predict conflict or risk states. LSTM and GRU models have been used to capture sequential pedestrian–vehicle conflict evolution [55,56], while GNN-based models represent spatiotemporal interaction structures in mixed traffic [19]. Other ML applications include urban freeway and congested highway rear-end conflict prediction [57,58], rear-end conflict identification at unsignalized intersections [7], tunnel risk modelling [59], roundabout conflict analysis [20], video-based conflict detection using spatial-temporal analytics [54], deep learning with LTE access data [60], and reinforcement learning-based collision risk assessment [61].

Hybrid models combine these logics. Refs. [24,38] integrated EVT with time-series forecasting, ref. [23] combined machine learning-derived predictors with Bayesian spatial Poisson modelling, and refs. [16,22] combined Bayesian and EVT components within hierarchical rare-event frameworks. Recent statistical–machine learning integrations further demonstrate how interpretable statistical structures can be combined with data-driven prediction modules for real-time conflict assessment [39]. Hybrid approaches therefore attempt to represent conflict occurrence, severity, uncertainty, and temporal evolution within a single modelling structure.

3.2.2. Application Context and Traffic Environment

A key but often underexplored dimension in conflict-based modelling is the application context, referring to the type of road facility, traffic conditions, and operational environment in which models are developed and applied. Across the reviewed studies, clear differences emerge in how modelling paradigms align with specific traffic contexts.

Statistical and regression-based models are predominantly applied to signalized intersections and controlled urban environments, where structured traffic flow and well-defined signal cycles support aggregation-based models [3,4,35]. Their use has been extended to heterogeneous traffic conditions, including weak lane discipline environments and mixed traffic involving bicycles and pedestrians [31,50,52], although such applications often require context-specific calibration.

Bayesian models expand this applicability to network-level and spatially distributed systems, including connected and autonomous vehicle (CAV) environments and merging areas [23,43]. Their ability to incorporate spatial dependence and hierarchical structure makes them particularly suitable for multi-site and system-level safety assessment, although their application remains concentrated in relatively structured datasets.

EVT-based models are most commonly applied in safety-critical contexts, including intersections, highways, and hazardous locations where extreme conflicts are observable [2,45]. They are especially relevant for risk-sensitive applications, such as safest route identification and high-risk location screening, but their reliance on extreme observations can limit applicability in low-conflict or data-sparse environments.

Machine learning models exhibit the broadest application coverage, spanning intersections, freeways, tunnels, roundabouts, and mixed traffic environments [9,20,59]. Their flexibility allows them to adapt to data-rich environments, including video analytics, LiDAR, and connected vehicle systems, making them well suited for complex and high-density traffic conditions. However, their performance remains strongly dependent on data availability and quality.

Hybrid models are primarily applied in complex, data-rich, and emerging environments, including network-level systems, multi-modal traffic, and CAV contexts [23,24]. These models are particularly relevant for integrated safety assessment and real-time forecasting, where multiple dimensions of risk (frequency, severity, uncertainty) must be jointly modelled. Nevertheless, their application remains limited to a relatively small number of studies due to high data and computational requirements.

Overall, application context acts as a structuring constraint on model selection: simpler models dominate structured and data-limited environments, while advanced and hybrid approaches are concentrated in complex, data-rich, and emerging traffic systems.

3.2.3. Data Requirements and Input Characteristics

The paradigms also differ substantially in their data requirements. Statistical and regression-based models are generally compatible with aggregated traffic variables, cycle-level data, and structured surrogate safety indicators. Refs. [3,4,11] used signal cycle-level indicators and traffic flow variables, while ref. [35] proposed a macroscopic framework based on PSD ≤ 1 and time spent in conflict. Ref. [52] further demonstrated how conflict-based safety evaluation can be adapted to heterogeneous and weak lane discipline traffic using context-sensitive indicators. These models are therefore suitable when only aggregated or moderately detailed trajectory-derived data are available.

Bayesian models require similar inputs when used as extensions of regression models but can also incorporate richer hierarchical, spatial, or multivariate data structures. Refs. [13,40] used copula-based dependence structures, while ref. [23] incorporated machine learning-derived features into a Bayesian spatial Poisson model for large-scale prediction. Ref. [43] applied a Bayesian hierarchical approach in connected and autonomous vehicle merging areas, demonstrating that Bayesian models can accommodate spatial, interaction-level, and site-specific data.

EVT models require high-resolution conflict observations because their main target is the tail of the conflict distribution. This makes them dependent on the reliable detection of severe conflicts or extreme surrogate safety values. Refs. [2,5,15,16] relied on sufficiently detailed conflict measurements to estimate tail distributions. EVT applications in safest route and hazardous location identification further demonstrate the need for high-frequency conflict data across space and time [45,46].

Machine learning models are the most data intensive. They typically require high-dimensional trajectory, video, LiDAR, connected vehicle, or multi-sensor inputs. Refs. [9,37,53,54] demonstrated the use of video and trajectory-based data; ref. [47] used LiDAR data with Bayesian deep learning; ref. [6] used real-world pre-crash trajectories, and ref. [62] used connected vehicle information for real-time longitudinal conflict risk prediction. Ref. [36] further illustrated how temporal margins and behavioural features can enrich early risk assessment in vehicle–bicycle interactions. These models benefit from rich data but are more sensitive to data quality, sensor coverage, labelling consistency, and distribution shift.

Hybrid models typically require the richest data structures because they combine several modelling components. Refs. [21,24] used AI-based video analytics with EVT and forecasting structures, ref. [23] used large-scale traffic conflict features with Bayesian spatial modelling, and ref. [48] developed a pedestrian-focused ML-based real-time crash risk forecasting framework. These models therefore require not only detailed data but also compatibility between statistical, probabilistic, and learning-based modules.

3.2.4. Sample Size and Dataset Scale

The required dataset scale increases from statistical models to Bayesian, EVT, ML, and hybrid approaches. Statistical and regression-based models can operate with relatively small to medium datasets, particularly when conflicts are aggregated by cycle, interval, or facility segment. Refs. [3,4,34,35] indicated that interpretable models can be built from structured conflict counts, PET classes, or macroscopic conflict exposure indicators. However, their reliability remains sensitive to the number of observed conflicts, the selected thresholds, and the representativeness of the collection period, as highlighted by [63,64].

Bayesian models can be useful in sparse data settings because prior distributions and hierarchical structures help stabilize inference. Refs. [3,4,11] illustrated the use of Bayesian formulations for cycle-level conflict rates, while refs. [23,43] demonstrated that Bayesian spatial and hierarchical models can borrow information across locations or contexts. However, Bayesian models still require sufficient data for convergence, posterior stability, and the credible estimation of heterogeneity.

EVT models have a specific data scale requirement: they need enough extreme observations rather than merely a large number of ordinary conflicts. Refs. [2,5,15,16,44] demonstrated the potential of EVT, but the reliability of GEV or POT–GPD estimates depends heavily on threshold selection, block definition, and the number of severe conflict observations. This makes EVT powerful for crash risk inference but vulnerable in data-sparse or low-conflict environments.

Machine learning models generally require larger datasets than statistical and Bayesian models. Deep learning models such as those used by [9,19,55,56,60,65] rely on sufficient labelled data to learn nonlinear and temporal patterns. Ref. [66] specifically highlights class imbalance as a major issue in traffic conflict prediction. Transfer learning and meta-learning approaches attempt to reduce the burden of large training datasets, as illustrated by [67,68], but these remain emerging rather than standard practice.

Hybrid models typically require medium to large datasets because they must support multiple analytical components. The models mentioned in refs. [21,24,38] require sufficient video-derived observations for both EVT and forecasting components, ref. [23] requires large-scale data to support ML and Bayesian spatial modelling. Their data demand is therefore high, particularly when rare-event, spatial, and temporal components are jointly modelled.

3.2.5. Uncertainty and Heterogeneity Modelling

Uncertainty treatment is one of the clearest differences across paradigms. Statistical and regression-based models handle uncertainty mainly through distributional assumptions, goodness-of-fit diagnostics, and random parameter extensions. Poisson and Negative Binomial models capture variability and overdispersion in conflict counts [3], while Tobit models address censoring in conflict rate data [4]. Ref. [34] used random parameter ordered logit models with heterogeneity in means, ref. [35] applied grouped random parameter Tobit models, and ref. [12] used Poisson-lognormal-Lindley distributions to address overdispersion and excess zero cycles.

Bayesian models provide the strongest formal treatment of uncertainty. Posterior distributions allow model parameters, conflict rates, and crash risk estimates to be interpreted probabilistically. Refs. [4,11] used Bayesian Tobit models to estimate censored conflict rates, while refs. [23,43] used Bayesian spatial and hierarchical structures to account for unobserved heterogeneity. Bayesian EVT models further extend uncertainty modelling to rare and severe conflicts, as shown by [5,15,16,18,44].

EVT models address uncertainty through tail distribution estimation and extreme value parameters. They are particularly strong in modelling uncertainty associated with rare events, but parameter stability depends on the availability of extreme observations and the selected threshold. Refs. [5,15,16] improved this by combining EVT with Bayesian inference. However, standard EVT applications may still be sensitive to threshold selection and limited tail samples, as also discussed by [29].

Machine learning models traditionally provide weaker explicit uncertainty treatment. Most ML models generate point predictions or classifications without formal predictive uncertainty. Recent work has begun to address this limitation through Bayesian deep learning [47], uncertainty-aware spatiotemporal learning [8], and imbalance-aware methods such as weighted losses and resampling [66]. Nevertheless, uncertainty propagation remains less mature in ML than in Bayesian and Bayesian–EVT frameworks.

Hybrid models offer the potential for more complete uncertainty treatment because they can combine probabilistic components with data-driven learning. Refs. [8,16,19,23,24] illustrated different ways of combining rare-event modelling, uncertainty estimation, statistical structure, and nonlinear prediction. However, uncertainty is not always propagated across all stages of the hybrid pipeline, making this a key remaining challenge.

3.2.6. Temporal Modelling and Prediction Horizon

Temporal modelling ranges from short aggregation windows to dynamic forecasting. Statistical and regression-based models usually operate over short intervals such as signal cycles, fixed time windows, or lane-level aggregation intervals. Refs. [3,4,11] linked cycle-level conflicts to traffic variables such as volume, queue length, shock-wave characteristics, and platoon ratio. Ref. [33] used 30 s freeway lane-level intervals, while ref. [35] computed conflict exposure in 60 m × 1 s spatiotemporal windows. Ref. [12] further incorporated autoregressive dependence across adjacent signal cycles. Spatiotemporal conflict risk evolution has also been explicitly analyzed using trajectory-based approaches [41].

Bayesian models extend temporal modelling through dynamic updating, time-varying parameters, and hierarchical temporal structures. Refs. [15,16] modelled evolving crash risk using dynamic Bayesian EVT frameworks, while ref. [11] incorporated temporal correlation into Bayesian Tobit conflict rate models. Ref. [12] provided empirical evidence of temporal correlation in severe conflicts, supporting the need for dynamic modelling. Reference ref. [47] also used Bayesian deep learning for cycle-level prediction using LiDAR data, and ref. [18] developed conditional Bayesian POT models for short-term crash risk forecasting.

EVT models are increasingly dynamic but remain uneven in temporal capability. Refs. [2,15,16] demonstrated real-time or dynamic EVT applications, while ref. [24] combined GEV theory with ARIMA for short-term forecasting. Refs. [18,21,38] further extended EVT into forecasting-oriented settings. Comparative forecasting work also shows how near-future crash prediction can be evaluated across alternative model families [69]. Nevertheless, many EVT applications still focus on short-window extreme event estimation rather than continuous multi-step forecasting.

Machine learning models are generally the strongest in temporal representation. LSTM and GRU models capture sequence dependence in pedestrian–vehicle conflict evolution [55,56], while GNN-based approaches capture spatiotemporal interactions [19]. Connected vehicle and trajectory-based studies also support real-time prediction at fine temporal scales [6,32,62,65]. Video-based weaving area analysis and communication data-based deep learning further illustrate how short-horizon conflict prediction can be implemented using dynamic traffic data streams [53,60]. However, many ML studies still focus on short-term classification or prediction rather than long-horizon forecasting.

Hybrid models provide the most integrated temporal structures by combining sequence learning, dynamic updating, and time-series forecasting. Refs. [24,38] combined EVT and time-series models, while ref. [21] proposed a bi-level real-time forecasting framework. Ref. [18] supports short-term forecasting using conditional EVT, and ref. [25] further developed dynamic short-term crash risk prediction using a novel conflict indicator in emerging mixed traffic flow. These approaches are promising for real-time crash risk forecasting, although continuous deployment and long-horizon forecasting remain limited.

3.2.7. Model Performance and Validation Strategy

Validation practices differ substantially across paradigms. Statistical and regression-based models are usually evaluated through goodness-of-fit and comparative statistical criteria. Ref. [3] used AIC, scaled deviance, Pearson χ², parameter significance, and Durbin–Watson tests. Refs. [4,11] used DIC, posterior significance, and convergence diagnostics for Bayesian Tobit formulations. Ref. [40] used Kendall’s tau, while ref. [34] used log-likelihood, AIC, BIC, odds ratios, and coefficient significance. Ref. [35] added stronger predictive validation using a 70/30 split, five-fold cross-validation, RMSE, MAE, MSE, R², and predicted versus observed comparisons.

Bayesian models rely on both predictive and Bayesian-specific validation. The authors of refs. [4,11] used DIC, convergence diagnostics, and posterior significance. Bayesian EVT applications assess posterior inference, model comparison, and tail adequacy [5,15,16,44]. Spatial and hierarchical models also involve comparison across locations or structures, as shown by [23,43].

EVT validation focuses on tail distribution adequacy and crash risk consistency. Common validation approaches include goodness-of-fit tests, tail diagnostics, comparison of predicted and observed crash risk, and sensitivity to thresholds. Ref. [49] concluded that EVT-based models can outperform traditional surrogate-based approaches in capturing the relationship between conflicts and crashes. However, transfer validation across sites remains limited, and EVT results remain sensitive to threshold choice and the availability of extreme observations.

Machine learning validation relies mainly on predictive performance metrics, including accuracy, precision, recall, F1 score, AUC, cross-validation, and benchmarking. Refs. [66,70,71] used comparative ML evaluation frameworks. Earlier ML-based freeway rear-end collision risk modelling also provides evidence on the predictive use of learning algorithms in real-time safety assessment [57]. Ref. [6] strengthened validation by using real-world pre-crash trajectory data, while refs. [67,68] explicitly addressed transferability through transfer learning and meta-learning. However, validation remains heterogeneous, and only a minority of studies test robustness under distribution shift.

Hybrid validation combines methods from multiple paradigms. Refs. [23,24] used combinations of statistical diagnostics, probabilistic evaluation, and predictive metrics such as RMSE, MAE, or AUC. Ref. [39] further illustrated how integrated statistical–ML frameworks require validation strategies that account for both interpretable model structure and predictive performance. Hybrid models therefore offer broader validation possibilities, but their evaluation is also more complex because each module may require separate diagnostics.

3.2.8. Computational Complexity and Implementation

The paradigms show clear differences in computational burden. Statistical and regression-based models are generally the least demanding. Their reliance on aggregated variables, count models, Tobit models, or random parameter extensions make them relatively easy to implement and interpret. This is evident in cycle-level and macroscopic applications by [3,4,12,34,35]. Their main implementation challenge lies not in computation but in threshold selection, data aggregation, and model specification.

Bayesian models are more computationally intensive because they require posterior estimation, convergence checking, and often MCMC-based inference. The models in [4,10,11,15,16,18] demonstrated the value of Bayesian inference, but these models require more careful calibration and diagnostic assessment. Their implementation burden increases further in spatial, hierarchical, or Bayesian–EVT formulations [23,43].

EVT models require specialized statistical calibration. Their computation is not always excessive, but implementation is sensitive to threshold selection, block definition, tail fit diagnostics, and parameter stability. Refs. [2,5,15,16] demonstrate EVT’s methodological rigour, while refs. [45,46] show its use in route and hazardous location applications. However, EVT implementation requires expertise in extreme value modelling and sufficient extreme observations.

Machine learning models require substantial computational resources, especially when using deep learning, video analytics, LiDAR, GNNs, reinforcement learning, or transfer learning. The models in [9,19,47,54,60,61,65] illustrate the computational demands associated with high-dimensional data streams and complex architectures. These models may be powerful but require training infrastructure, labelled data, tuning procedures, and often GPU-level computation.

Hybrid models are the most complex to implement because they combine several computational layers. Refs. [16,21,23,24,25,38,39] indicate that hybrid models require coordination between EVT, Bayesian inference, machine learning, statistical modelling, and forecasting components. Their complexity can improve predictive capacity but creates challenges in calibration, interpretation, reproducibility, and real-time deployment.

3.2.9. Operational Readiness and Transferability

Operational readiness depends not only on predictive performance but also on interpretability, data availability, computational cost, and transferability. Statistical and regression-based models are the most operationally ready because they are interpretable, relatively simple, and compatible with aggregated data. Refs. [3,4,35,64,71] demonstrate that these models can support short interval monitoring and practical safety assessment. The practical scope of such models is broadened by applications to heterogeneous weak lane discipline traffic, bicycle–vehicle conflicts, pedestrian interactions, sharp curves, and unsignalized intersections [30,31,32,50,52]. However, their transferability remains limited by context-specific thresholds and facility-specific calibration.

Bayesian models offer strong decision-support value because they provide uncertainty-aware outputs. This is useful for operational safety systems where confidence in risk estimates matters. The models in [4,11,23,43,47] demonstrate this potential. However, their operational use is constrained by computational complexity, convergence requirements, and the need for expert model specification.

EVT models are operationally valuable for high-risk event detection, crash risk inference, safest route analysis, and hazardous location identification. Refs. [2,15,16,45,46,49] indicate that EVT provides a theoretically grounded bridge between conflicts and crash risk. Yet EVT deployment remains constrained by sensitivity to thresholds, the need for sufficient extreme events, and limited transfer validation.

Machine learning models have strong potential for real-time prediction because they can process high-dimensional streaming data and capture nonlinear interaction patterns. Applications using video analytics, LiDAR, trajectories, connected vehicles, and communication-based traffic data demonstrate this potential [6,9,37,47,54,60,62]. However, operational readiness is reduced by interpretability limitations, data requirements, robustness issues, and transferability concerns. The directions taken in refs. [67,68] directly address these limitations through transfer learning and meta-learning, but such approaches remain emerging.

Hybrid models represent the most conceptually comprehensive but least operationally mature paradigm. They can jointly model frequency, severity, uncertainty, and temporal evolution, as illustrated by [16,21,22,23,24,38,39,48]. However, their deployment is limited by high complexity, calibration burden, interpretability challenges, and the difficulty of integrating multiple modules into real-time traffic management systems. Therefore, although hybrid models are theoretically promising, they should not be described as operationally superior without stronger evidence of real-world deployment and transfer validation.

3.3. Frequency and Severity Dimensions

In addition to the nine comparison dimensions, the reviewed paradigms can be interpreted through the complementary lenses of conflict frequency and conflict severity. This distinction is not a separate methodological dimension in the same sense as data requirements, uncertainty, validation, or deployment; rather, it is a cross-cutting conceptual lens that clarifies the functional role of each paradigm in real-time crash risk assessment.

Conflict frequency refers to how often conflicts occur within a defined time interval, road segment, signal cycle, or traffic state. Statistical and regression-based models are most directly aligned with this perspective because they model conflict counts, rates, or probabilities using threshold-based indicators such as TTC, PET, MTTC, and DRAC [3,4,35]. Bayesian models extend this frequency-oriented logic by incorporating posterior uncertainty, spatial variation, temporal dependence, and heterogeneity into conflict rate estimation [10,11,23].

Conflict severity refers to the intensity or crash relevance of a conflict, particularly when interactions approach extreme or safety-critical conditions. EVT-based models are most explicitly severity-oriented because they focus on the tail of the conflict distribution and model extreme values of surrogate indicators such as minimum TTC or PET exceedances [2,5,15]. ML models also contribute to severity modelling by learning risk scores or conflict escalation patterns from high-dimensional trajectory, video, LiDAR, connected vehicle, or communication-based traffic data [6,9,47,60]. However, their severity representation is often data-driven rather than explicitly grounded in physical thresholds.

Hybrid models provide the strongest integration of frequency and severity because they combine occurrence/exposure modelling with probabilistic, extreme value, statistical, and learning-based components [22,23,24,39,48]. This makes them particularly suitable for short-term crash risk forecasting, although their operational use remains limited by computational complexity and calibration requirements (see Table 4).

Thus, the frequency–severity distinction helps explain why no single paradigm is universally superior. Statistical and Bayesian models are better suited to monitoring conflict occurrence and rate variation; EVT models are better suited to estimating crash-relevant extreme risk; ML models are better suited to learning complex interaction patterns, and hybrid models attempt to combine these roles within integrated forecasting frameworks.

4. Research Gaps and Emerging Directions

Despite significant progress in conflict-based real-time crash risk modelling, the reviewed literature reveals persistent methodological and practical limitations. These gaps are evident across all modelling paradigms and highlight critical opportunities for future research. To provide clarity and analytical coherence, the discussion is organized into two parts: (i) key research gaps and (ii) emerging directions aimed at addressing these limitations.

4.1. Research Gaps

4.1.1. Fragmented Conflict Definitions and Limitations in Modelling Rare and Extreme Events

A fundamental limitation across the literature is the absence of standardized definitions for traffic conflicts. Most studies rely on surrogate safety indicators such as TTC, PET, and DRAC [3,28], whose effectiveness depends heavily on threshold selection and context-specific calibration [49]. Differences in thresholds, aggregation strategies, and interaction definitions lead to inconsistent conflict representations and limit comparability across studies. While recent efforts introduce continuous safety measures [33] and interaction-based indicators [31], these approaches do not fully resolve this fragmentation.

This limitation directly affects the modelling of rare and extreme events. Although traffic conflicts provide higher frequency observations than crashes, not all conflicts are equally informative. Many statistical and machine learning models treat conflicts uniformly, without distinguishing between ordinary and extreme interactions. EVT-based approaches explicitly model extremes [2,29], but their effectiveness depends on threshold selection and sufficient tail data. Machine learning models, in contrast, face challenges related to class imbalance and rare-event prediction [66], often leading to biased predictions toward non-critical events.

Overall, the lack of consistent and severity-sensitive conflict definitions constrains both the identification of crash-relevant events and the robustness of rare-event modelling.

4.1.2. Incomplete Treatment of Uncertainty and Fragmented Spatiotemporal Modelling

Uncertainty and spatiotemporal dynamics are unevenly addressed across modelling paradigms. Bayesian frameworks provide formal probabilistic inference [10,18], whereas most regression and machine learning models rely on point estimates without explicitly quantifying prediction uncertainty. Moreover, multiple sources of uncertainty including data noise, model specification, and environmental variability are rarely distinguished or consistently propagated through modelling pipelines.

At the same time, temporal and spatiotemporal dependencies are not fully integrated. Early models rely on aggregated conflict counts and assume temporal independence, while more recent studies incorporate temporal correlation [11,12] and spatiotemporal dynamics [19,41]. However, these features remain inconsistently represented across paradigms. Statistical and EVT-based models often lack rich temporal representations, whereas machine learning approaches, despite strong spatiotemporal capabilities, may lack interpretability and probabilistic structure.

This fragmented treatment limits the ability of existing models to reliably capture dynamic risk evolution under uncertainty.

4.1.3. Limited Validation and Transferability

Validation practices remain highly heterogeneous across conflict-based crash risk modelling studies, limiting the robustness and generalizability of reported findings. Many models are developed and evaluated using single-site or context-specific datasets, which constrains their applicability to broader traffic environments and operational conditions.

Across modelling paradigms, validation approaches differ substantially. Machine learning studies typically rely on internal validation techniques such as cross-validation and performance metrics (e.g., accuracy, precision, recall, AUC) [66], which assess predictive performance but do not guarantee transferability beyond the training context. Bayesian models employ posterior predictive checks and model comparison criteria [72], providing a more rigorous probabilistic assessment, yet these evaluations are also commonly conducted within the same dataset.

A key limitation is the limited use of external validation across independent sites, time periods, or traffic conditions. As a result, model performance often remains context-dependent, and the ability to generalize across heterogeneous environments is not systematically assessed. This issue is further compounded by the absence of standardized benchmark datasets and evaluation protocols, which restricts reproducibility and hinders direct comparison across studies.

Overall, the lack of consistent and externally validated evaluation frameworks limits confidence in model robustness and constrains the practical deployment of conflict-based crash risk models across diverse traffic settings.

4.1.4. Barriers to Scalability and Real-Time Deployment

Despite their explicit focus on real-time applications, many conflict-based crash risk models face significant challenges in scalability and operational deployment. These challenges arise from the combined effects of computational complexity, data requirements, and system integration constraints.

The nature of these limitations varies across modelling paradigms. Bayesian and hybrid models, while offering rich representations of uncertainty and risk, are often computationally intensive due to iterative inference procedures and multi-stage modelling structures. Machine learning approaches, particularly deep learning models, require large volumes of high-resolution data such as trajectory, LiDAR, or video data and substantial computational resources for training and real-time inference.

In addition to computational challenges, deployment is constrained by data availability, sensing infrastructure, and system interoperability. Many existing studies rely on high-quality datasets that may not be readily available in real-world settings, limiting the transfer of these models to operational environments. Furthermore, integrating modelling frameworks into existing traffic management systems remains complex, requiring robust data pipelines, real-time processing capabilities, and compatibility with intelligent transportation system architectures.

Although some studies demonstrate promising applications, including real-time warning systems [42] and safety-oriented routing strategies [45,46], large-scale and continuous deployment remains limited. This gap highlights the need for modelling approaches that are not only methodologically advanced but also computationally efficient, data-accessible, and compatible with real-world operational constraints.

4.2. Emerging Research Directions

4.2.1. Standardized and Adaptive Conflict Representations

To address inconsistencies in conflict definitions, future research should prioritize the development of standardized yet adaptive conflict indicators. Existing studies highlight the sensitivity of conflict-based models to the choice of surrogate measures and thresholds, as well as to data collection strategies and modelling assumptions [28,49,64]. Such dependencies limit comparability across studies and reduce the reliability of derived risk estimates.

Recent work has explored data-driven threshold selection within EVT frameworks and the development of context-aware indicators that adapt to varying traffic conditions [14,18].

In parallel, trajectory-based and interaction-driven formulations are gaining traction as alternatives to fixed threshold definitions, enabling a more continuous and behaviourally grounded representation of traffic interactions [31,33]. Advancing this direction requires not only methodological innovation but also the establishment of benchmark datasets and standardized evaluation protocols to support reproducibility, cross-study comparison, and model transferability.

4.2.2. EVT–Machine Learning Integration for Rare-Event Modelling

A key emerging direction is the integration of extreme value theory with machine learning to jointly model rare and high-risk traffic events. This approach leverages the theoretical strength of EVT in representing tail behaviour alongside the predictive flexibility of machine learning in capturing complex, nonlinear patterns.

Recent hybrid EVT–ML frameworks demonstrate strong potential for short-term crash risk forecasting, particularly in data-rich environments where both extreme event modelling and pattern recognition are required [23,24,38,39]. At the same time, the literature increasingly emphasizes the importance of addressing class imbalance and rare-event prediction challenges, which can bias models toward non-critical outcomes. Emerging solutions include imbalance-aware learning strategies, synthetic data generation, and simulation-enhanced training approaches, which aim to improve robustness under highly skewed safety datasets [66]. Further research is needed to refine these integrated frameworks, particularly for real-time and streaming applications.

4.2.3. Integrated Uncertainty-Aware and Spatiotemporal Modelling Frameworks

Future research should move toward integrated frameworks that jointly capture uncertainty and spatiotemporal dynamics in conflict-based safety modelling. Bayesian approaches provide a strong foundation for uncertainty quantification through probabilistic inference and hierarchical modelling [4,10,44], while recent advances in uncertainty-aware machine learning, including Bayesian deep learning, extend these capabilities to high-dimensional and real-time settings [8,47].

In parallel, the increasing availability of high-resolution trajectory and sensor data has enabled more sophisticated modelling of temporal dependencies and spatial interactions using sequence learning and graph-based approaches [19,41,65]. Bayesian models have also incorporated temporal correlation and heterogeneity in conflict rates [11], offering complementary perspectives on dynamic safety processes.

However, uncertainty and spatiotemporal dynamics are still rarely integrated within a unified framework. Advancing this direction requires the development of computationally efficient models capable of jointly representing temporal evolution, spatial interactions, and multiple sources of uncertainty, including both epistemic and aleatory components. Such integration is essential for improving model reliability, interpretability, and real-time decision support.

4.2.4. Transferability, Benchmarking, and Generalizable Modelling

Improving the transferability and generalizability of conflict-based models remains a critical research priority. Most existing models are developed and validated within specific traffic contexts, limiting their applicability across different environments and reducing confidence in their broader use [71].

Emerging approaches such as transfer learning and meta-learning offer promising solutions by enabling models to adapt to new conditions with limited additional data [67,68]. However, systematic evaluation of transferability remains limited, and the absence of standardized benchmark datasets further hinders consistent comparison across modelling approaches.

Future research should focus on establishing common evaluation frameworks, cross-site validation protocols, and shared datasets to support reproducibility and robust performance assessment. Strengthening these aspects is essential for moving from context-specific models toward scalable and widely applicable safety assessment tools.

4.2.5. Scalable Real-Time Deployment and Unified Hybrid Systems

Advancing conflict-based safety modelling requires not only methodological innovation but also scalable real-time implementation and system integration. The increasing availability of high-resolution data from trajectory tracking, video analytics, and connected vehicle systems has created new opportunities for real-time safety assessment [21,33,37]. Several studies demonstrate the feasibility of applications such as real-time crash risk forecasting, dynamic monitoring, and warning systems [21,42], as well as safety-oriented routing strategies [45,46].

At the same time, there is growing interest in unified hybrid modelling frameworks that combine statistical, Bayesian, EVT-based, and machine learning approaches to capture frequency, severity, uncertainty, and temporal dynamics within a single modelling pipeline [17,21,23,24,39]. These approaches offer a pathway toward more comprehensive and operationally relevant safety models.

However, challenges remain in achieving computational efficiency, ensuring data reliability, and integrating models within intelligent transportation systems and connected vehicle ecosystems. Future work should therefore focus on efficient real-time inference, edge computing solutions, and seamless system integration, enabling the deployment of robust, interpretable, and scalable safety modelling frameworks in real-world environments.

5. Conclusions

This review provides a model-centric synthesis of conflict-based approaches for real-time crash risk assessment by organizing the literature into five modelling paradigms: statistical/regression-based, Bayesian, EVT-based, machine learning, and hybrid approaches. This classification enables a structured comparison of how different modelling traditions conceptualize and operationalize traffic conflicts. The analysis suggests that no single paradigm is universally optimal; rather, each captures distinct dimensions of crash risk. Statistical and Bayesian models tend to emphasize conflict frequency and support interpretable inference, EVT-based approaches focus on extreme interactions that may be more closely associated with crash occurrence, while machine learning models are well suited to capturing complex, high-dimensional interaction patterns. Hybrid frameworks offer the potential to integrate these complementary strengths within a unified modelling structure.

From a practical perspective, model selection should be guided by data availability, prediction objectives, and operational constraints. In data-limited contexts, statistical and Bayesian models provide relatively interpretable and computationally efficient solutions, making them suitable for exploratory analysis and policy-oriented applications. EVT-based approaches are particularly relevant when the objective is to characterize rare and safety-critical events. In contrast, machine learning models are more appropriate in data-rich environments where high-resolution trajectory or sensor data are available, enabling short-term prediction of complex interaction patterns. Hybrid frameworks are most applicable in advanced settings where multiple modelling objectives—such as prediction, uncertainty representation, and interpretability—need to be addressed simultaneously, although their implementation typically requires greater data availability and computational resources.

For researchers, several directions warrant further investigation. These include the development of standardized yet adaptive conflict indicators, improved modelling of rare and extreme events, and the systematic incorporation of uncertainty quantification across modelling pipelines. In addition, greater emphasis on cross-site validation and benchmarking using heterogeneous datasets is needed to assess model robustness and transferability. For policymakers and practitioners, effective deployment depends on sustained investment in sensing infrastructure, data integration, and computational capacity. The integration of conflict-based models into traffic management and intelligent transportation systems has the potential to support real-time monitoring and proactive safety interventions, although practical implementation challenges remain.

Overall, effective real-time safety modelling requires balancing interpretability, predictive performance, and operational feasibility. Model selection should therefore be informed by the specific application context, including the required prediction horizon, acceptable levels of uncertainty, and available data infrastructure. By providing a comparative framework and a frequency–severity perspective on crash risk, this review contributes to more informed model selection and supports the development of scalable, reliable, and data-driven traffic safety management systems.

Author Contributions

Conceptualization and writing—original draft preparation, I.N.J.II; methodology, writing—review and editing, S.L.T.F., T.L.N. and O.D.D.; supervision, writing—review and editing, B.G.-H., D.S.U. and L.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Hossain, M.; Abdel-Aty, M.; Quddus, M.A.; Muromachi, Y.; Sadeek, S.N. Real-time crash prediction models: State-of-the-art, design pathways and ubiquitous requirements. Accid. Anal. Prev. 2019, 124, 66–84. [Google Scholar] [CrossRef]
Zheng, L.; Sayed, T. A novel approach for real time crash prediction at signalized intersections. Transp. Res. Part C Emerg. Technol. 2020, 117, 102683. [Google Scholar] [CrossRef]
Essa, M.; Sayed, T. Traffic conflict models to evaluate the safety of signalized intersections at the cycle level. Transp. Res. Part C Emerg. Technol. 2018, 89, 289–302. [Google Scholar] [CrossRef]
Guo, Y.; Sayed, T.; Essa, M. Real-time conflict-based Bayesian Tobit models for safety evaluation of signalized intersections. Accid. Anal. Prev. 2020, 144, 105660. [Google Scholar] [CrossRef]
Ali, Y.; Haque, M.; Mannering, F. A Bayesian generalised extreme value model to estimate real-time pedestrian crash risks at signalised intersections using artificial intelligence-based video analytics. Anal. Methods Accid. Res. 2023, 38, 100264. [Google Scholar] [CrossRef]
Chen, K.; Li, Z.; Liu, P.; Xu, C.; Wang, Y. Real-time lane-changing crash prediction model at the individual vehicle level using real-world trajectories prior to crashes. Transp. Res. Part C Emerg. Technol. 2025, 176, 105171. [Google Scholar] [CrossRef]
Nasr, H.A.; Jin, J.; Huang, H.; Eljailany, H.A. Real-Time Risk Identification of Rear-End Conflicts at Unsignalized Intersections. Systems 2025, 13, 827. [Google Scholar] [CrossRef]
Zhao, C.; Li, M.; Liu, J.; Zhang, Z.; Niu, S.; Song, D. Uncertainty-aware spatiotemporal interaction learning for pre-conflict risk evolution with a risk-increase prior. Accid. Anal. Prev. 2026, 228, 108379. [Google Scholar] [CrossRef]
Formosa, N.; Quddus, M.; Ison, S.; Abdel-Aty, M.; Yuan, J. Predicting real-time traffic conflicts using deep learning. Accid. Anal. Prev. 2020, 136, 105429. [Google Scholar] [CrossRef] [PubMed]
Essa, M.; Sayed, T. Full Bayesian conflict-based models for real time safety evaluation of signalized intersections. Accid. Anal. Prev. 2018, 129, 367–381. [Google Scholar] [CrossRef]
Guo, Y.; Sayed, T.; Liu, P.; Wu, Y.; Yue, Q.; Guo, S. Modeling temporal correlation and heterogeneity in real-time conflict rates using Bayesian Tobit models for signalized intersections. Accid. Anal. Prev. 2024, 202, 107552. [Google Scholar] [CrossRef]
Yue, Q.; Guo, Y.; Sayed, T.; Liu, P.; Lyu, H. Understanding temporal correlation in severe traffic conflicts: Evidence from cycle-level at signalized intersections. Transp. Lett. 2026, 1–16. [Google Scholar] [CrossRef]
Gu, R.; Sze, N. A vine copula-based analysis of spatial dependence of traffic conflict risk at highway ramp areas. Accid. Anal. Prev. 2026, 225, 108330. [Google Scholar] [CrossRef]
Fu, C.; Sayed, T. A multivariate method for evaluating safety from conflict extremes in real time. Anal. Methods Accid. Res. 2022, 36, 100244. [Google Scholar] [CrossRef]
Fu, C.; Sayed, T. Bayesian dynamic extreme value modeling for conflict-based real-time safety analysis. Anal. Methods Accid. Res. 2022, 34, 100204. [Google Scholar] [CrossRef]
Fu, C.; Sayed, T. Dynamic Bayesian hierarchical peak over threshold modeling for real-time crash-risk estimation from conflict extremes. Anal. Methods Accid. Res. 2023, 40, 100304. [Google Scholar] [CrossRef]
Niu, D.; Sayed, T. Bayesian forecasting of short-term crash risk with conditional extreme value models: A comparison between one-stage and two-stage approaches. Anal. Methods Accid. Res. 2025, 48, 100409. [Google Scholar] [CrossRef]
Niu, D.; Sayed, T. Short-term conflict-based crash risk forecasting: A Bayesian conditional peak-over-threshold approach. Anal. Methods Accid. Res. 2025, 46, 100385. [Google Scholar] [CrossRef]
Liu, Z.; Zou, G.; Wang, T.; Tu, M.; Wang, H.; Li, Y. Learning and predicting traffic conflicts in mixed traffic: A spatiotemporal graph neural network with manifold similarity learning. Expert Syst. Appl. 2025, 309, 131183. [Google Scholar] [CrossRef]
Duan, Y.; Lin, Z.; Wang, Y.; Bai, Q. Analysis of Traffic Conflicts at Roundabout Entrances and Exits—A Machine Learning Approach for Enhanced Safety. Promet-Traffic Transp. 2025, 37, 947–962. [Google Scholar] [CrossRef]
Hussain, F.; Ali, Y.; Li, Y.; Haque, M. A bi-level framework for real-time crash risk forecasting using artificial intelligence-based video analytics. Sci. Rep. 2024, 14, 4121. [Google Scholar] [CrossRef]
Fu, C.; Lu, Z.; Liu, H.; Wang, X.; Ou, J.; Bai, W. Real-Time Safety Evaluation at Signalized Intersections: Hierarchical Bayesian Extreme Value Theory Models Based on Different Conflict Types. J. Adv. Transp. 2025, 2025, 6554672. [Google Scholar] [CrossRef]
Li, D.; Fu, C.; Sayed, T.; Wang, W. An integrated approach of machine learning and Bayesian spatial Poisson model for large-scale real-time traffic conflict prediction. Accid. Anal. Prev. 2023, 192, 107286. [Google Scholar] [CrossRef]
Hussain, F.; Ali, Y.; Li, Y.; Haque, M. Real-time crash risk forecasting using Artificial-Intelligence based video analytics: A unified framework of generalised extreme value theory and autoregressive integrated moving average model. Anal. Methods Accid. Res. 2023, 40, 100302. [Google Scholar] [CrossRef]
Fu, C.; Lu, Z.; Liu, H.; Wumaierjiang, A. Dynamic short-term crash risk prediction from traffic conflicts at signalized intersections with emerging mixed traffic flow: A novel conflict indicator. Accid. Anal. Prev. 2025, 217, 108065. [Google Scholar] [CrossRef]
Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G.; The PRISMA Group. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. J. Clin. Epidemiol. 2009, 62, 1006–1012. [Google Scholar] [CrossRef]
Wohlin, C. Guidelines for snowballing in systematic literature studies and a replication in software engineering. In Proceedings of the 18th International Conference on Evaluation and Assessment in Software Engineering, London, UK, 13–14 May 2014. [Google Scholar] [CrossRef]
Zheng, L.; Sayed, T.; Mannering, F. Modeling traffic conflicts for use in road safety analysis: A review of analytic methods and future directions. Anal. Methods Accid. Res. 2021, 29, 100142. [Google Scholar] [CrossRef]
Ali, Y.; Haque, M.; Mannering, F. Assessing traffic conflict/crash relationships with extreme value theory: Recent developments and future directions for connected and autonomous vehicle and highway safety research. Anal. Methods Accid. Res. 2023, 39, 100276. [Google Scholar] [CrossRef]
Cao, Q.; Zhao, Z.; Zeng, Q.; Wang, Z.; Long, K. Real-Time Vehicle Trajectory Prediction for Traffic Conflict Detection at Unsignalized Intersections. J. Adv. Transp. 2021, 2021, 8453726. [Google Scholar] [CrossRef]
Wang, T.; Ge, Y.-E.; Wang, Y.; Chen, W.; Fu, Q.; Niu, Y. A novel model for real-time risk evaluation of vehicle–pedestrian interactions at intersections. Accid. Anal. Prev. 2024, 206, 107727. [Google Scholar] [CrossRef]
Li, H.; Zhang, X. Vehicle trajectory-based prediction of traffic conflicts on sharp horizontal curves. Traffic Inj. Prev. 2025, 1–9. [Google Scholar] [CrossRef] [PubMed]
Hu, Y.; Li, Y.; Huang, H.; Lee, J.; Yuan, C.; Zou, G. A high-resolution trajectory data driven method for real-time evaluation of traffic safety. Accid. Anal. Prev. 2022, 165, 106503. [Google Scholar] [CrossRef]
Islam, Z.; Abdel-Aty, M.; Goswamy, A.; Abdelraouf, A.; Zheng, O. Effect of signal timing on vehicles’ near misses at intersections. Sci. Rep. 2023, 13, 9065. [Google Scholar] [CrossRef]
Gore, N.; Chauhan, R.; Easa, S.; Arkatkar, S. Traffic conflict assessment using macroscopic traffic flow variables: A novel framework for real-time applications. Accid. Anal. Prev. 2023, 185, 107020. [Google Scholar] [CrossRef] [PubMed]
Shen, S.; Hashimoto, M.; Oikawa, S.; Matsui, Y.; Hirose, T. Temporal Margins and Behavioral Features for Early Risk Assessment in Left-Turn Vehicle and Bicycle Conflicts at Signalized Intersections. Machines 2025, 13, 709. [Google Scholar] [CrossRef]
Ma, F.; Wang, X.; Yang, W. Real-time accident risk identification for freeway weaving segments based on video analytics. Measurement 2025, 242, 115783. [Google Scholar] [CrossRef]
Howlader, M.; Haque, M. Opposing-through crash risk forecasting using artificial intelligence-based video analytics for real-time application: Integrating generalized extreme value theory and time series forecasting models. Accid. Anal. Prev. 2025, 218, 108073. [Google Scholar] [CrossRef]
Fu, C.; Liu, J.; Liu, H.; Wang, X.; Lu, Z.; Ou, J.; Bai, W. Real-Time Traffic Conflict Prediction at Intersections: A Novel Approach Integrating Statistical Models and Machine Learning. J. Adv. Transp. 2025, 2025, 2239983. [Google Scholar] [CrossRef]
Hu, Y.; Li, Y.; Yuan, C.; Huang, H. Modeling conflict risk with real-time traffic data for road safety assessment: A copula-based joint approach. Transp. Saf. Environ. 2022, 4, tdac017. [Google Scholar] [CrossRef]
Hu, Y.; Li, Y.; Huang, H. Spatio-temporal dynamic change mechanism analysis of traffic conflict risk based on trajectory data. Accid. Anal. Prev. 2023, 191, 107203. [Google Scholar] [CrossRef]
Hu, C.; Xiong, C.; Guo, F.; Lee, J.; Yang, W.; Guo, Z. Effectiveness and Optimal Location of Real-Time Traffic Conflict Risk Warning System for Rural Unsignalized Intersections: A Driving Simulation Study. J. Adv. Transp. 2022, 2022, 2613465. [Google Scholar] [CrossRef]
Lou, Y.; Zhu, J. Real-time traffic conflict identification using Bayesian Hierarchical Approach at Merging Area in the Environment of Connected and Autonomous Vehicles. In Proceedings of the 2023 7th International Conference on Transportation Information and Safety (ICTIS), Xi’an, China, 4–6 August 2023; pp. 2007–2012. [Google Scholar] [CrossRef]
Ali, Y.; Washington, S.; Haque, M. Estimating real-time crash risk at signalized intersections: A Bayesian Generalized Extreme Value approach. Saf. Sci. 2023, 164, 106181. [Google Scholar] [CrossRef]
Ghoul, T.; Sayed, T.; Fu, C. Dynamic identification of short-term and longer-term hazardous locations using a conflict-based real-time extreme value safety model. Anal. Methods Accid. Res. 2023, 37, 100262. [Google Scholar] [CrossRef]
Ghoul, T.; Sayed, T.; Fu, C. Real-time safest route identification: Examining the trade-off between safest and fastest routes. Anal. Methods Accid. Res. 2023, 39, 100277. [Google Scholar] [CrossRef]
Wu, P.; Wei, W.; Zheng, L.; Hu, Z.; Essa, M. Cycle-level traffic conflict prediction at signalized intersections with LiDAR data and Bayesian deep learning. Accid. Anal. Prev. 2023, 192, 107268. [Google Scholar] [CrossRef]
Hussain, F.; Li, Y.; Haque, S.M.M. Machine learning-based real-time crash risk forecasting for pedestrians. Commun. Transp. Res. 2025, 5, 100224. [Google Scholar] [CrossRef]
Chen, K.; Xu, C.; Liu, P.; Li, Z.; Wang, Y. Evaluating the performance of traffic conflict measures in real-time crash risk prediction using pre-crash vehicle trajectories. Accid. Anal. Prev. 2024, 203, 107640. [Google Scholar] [CrossRef]
Darzian Rostami, A.; Katthe, A.; Sohrabi, A.; Jahangiri, A. Predicting Critical Bicycle-Vehicle Conflicts at Signalized Intersections. J. Adv. Transp. 2020, 2020, 8816616. [Google Scholar] [CrossRef]
Ma, Y.; Zhu, J. Left-turn conflict identification at signal intersections based on vehicle trajectory reconstruction under real-time communication conditions. Accid. Anal. Prev. 2021, 150, 105933. [Google Scholar] [CrossRef] [PubMed]
Patel, H.; Gore, N.; Easa, S.; Arkatkar, S. Novel Traffic Conflict-Based Framework for Real-Time Traffic Safety Evaluation Under Heterogeneous and Weak Lane-Discipline Traffic. Transp. Res. Rec. J. Transp. Res. Board 2024, 2678, 118–134. [Google Scholar] [CrossRef]
Xia, Y.; Qin, Y.; Li, X.; Xie, J. Risk Identification and Conflict Prediction from Videos Based on TTC-ML of a Multi-Lane Weaving Area. Sustainability 2022, 14, 4620. [Google Scholar] [CrossRef]
Singh, A.; Dass, S. Spatial–Temporal Video Analysis for Advanced Traffic Conflict Detection and Risk Assessment Using Yolov8 and Attention-Enhanced Safety Metrics. Int. J. Intell. Transp. Syst. Res. 2025, 23, 2252–2272. [Google Scholar] [CrossRef]
Zhang, S.; Abdel-Aty, M.; Cai, Q.; Li, P.; Ugan, J. Prediction of pedestrian-vehicle conflicts at signalized intersections based on long short-term memory neural network. Accid. Anal. Prev. 2020, 148, 105799. [Google Scholar] [CrossRef]
Zhang, S.; Abdel-Aty, M.; Wu, Y.; Zheng, O. Modeling pedestrians’ near-accident events at signalized intersections using gated recurrent unit (GRU). Accid. Anal. Prev. 2020, 148, 105844. [Google Scholar] [CrossRef] [PubMed]
Ma, X.; Yu, Q.; Liu, J. Modeling Urban Freeway Rear-End Collision Risk Using Machine Learning Algorithms. Sustainability 2022, 14, 12047. [Google Scholar] [CrossRef]
An, X.; Wu, X.; Liu, W.; Cheng, R. Real-time rear-end conflict prediction on congested highways sections using trajectory data. Chaos Solitons Fractals 2024, 187, 115391. [Google Scholar] [CrossRef]
Jin, J.; Li, J.; Tian, S.; Ye, Q. Zone-specific real-time traffic conflict risk modeling for freeway tunnels: A CrossTabNet approach. Accid. Anal. Prev. 2025, 223, 108274. [Google Scholar] [CrossRef]
Somnathe, A.T.; Selvi, V.T.; Stepha, N.G.; Kingslin, M.T.; Inamdar, F.M.; Reddy, P.C.S. An Intelligent Traffic Conflict Prediction Using Deep Learning with Long-Term Evolution Access Data. In Proceedings of the 2025 3rd International Conference on Smart Systems for applications in Electrical Sciences (ICSSES), Tumakuru, India, 21–22 March 2025; pp. 1–6. [Google Scholar] [CrossRef]
Sheikh, M.S.; Peng, Y. Assessment of Rear-End Collision Risk Based on a Deep Reinforcement Learning Technique: A Break Reaction Assessment Approach. IEEE Access 2025, 13, 20171–20190. [Google Scholar] [CrossRef]
Li, Y.; Li, H.; Fujiwara, A.; Zhang, J. Real-Time Prediction of Longitudinal Traffic Conflict Risk using Connected Vehicle and Deep Learning Approach. Int. J. Intell. Transp. Syst. Res. 2025, 24, 11–22. [Google Scholar] [CrossRef]
Katrakazas, C.; Quddus, M.; Chen, W.-H. A Simulation study of predicting real-time conflict-prone traffic conditions. IEEE Trans. Intell. Transp. Syst. 2018, 19, 3196–3207. [Google Scholar] [CrossRef]
Orsini, F.; Gastaldi, M.; Rossi, R. Conflict-Based Real-Time Road Safety Analysis: Sensitivity to Data Collection Duration and its Implications for Model Resilience. Transp. Res. Rec. J. Transp. Res. Board 2024, 2678, 460–472. [Google Scholar] [CrossRef]
Zhang, G.; Jin, J.; Chang, F.; Huang, H. Real-time traffic conflict prediction at signalized intersections using vehicle trajectory data and deep learning. Int. J. Transp. Sci. Technol. 2025, 20, 82–96. [Google Scholar] [CrossRef]
Formosa, N.; Quddus, M.; Man, C.K.; Timmis, A. Appraising Machine and Deep Learning Techniques for Traffic Conflict Prediction with Class Imbalance. Data Sci. Transp. 2023, 5, 4. [Google Scholar] [CrossRef]
Hou, Q.; Yang, Y.; Liang, J.; Huo, X.; Leng, J. A deep transfer learning approach for Real-Time traffic conflict prediction with trajectory data. Accid. Anal. Prev. 2025, 214, 107966. [Google Scholar] [CrossRef]
Wei, W.; Zheng, L.; El Esawey, M. Model-Agnostic Meta-Learning-Based Real-Time Traffic Conflict Prediction with Limited Sample at Heterogeneous Signalized Intersections. IEEE Trans. Intell. Transp. Syst. 2026, 27, 4811–4823. [Google Scholar] [CrossRef]
Cai, B.; Di, Q. Different Forecasting Model Comparison for Near Future Crash Prediction. Appl. Sci. 2023, 13, 759. [Google Scholar] [CrossRef]
Orsini, F.; Gecchele, G.; Gastaldi, M.; Rossi, R. Real-time conflict prediction: A comparative study of machine learning classifiers. Transp. Res. Procedia 2021, 52, 292–299. [Google Scholar] [CrossRef]
Orsini, F.; Gecchele, G.; Rossi, R.; Gastaldi, M. A conflict-based approach for real-time road safety analysis: Comparative evaluation with crash-based models. Accid. Anal. Prev. 2021, 161, 106382. [Google Scholar] [CrossRef]
Zheng, L.; Hu, Z.; Sayed, T. Traffic Conflict Prediction at Signal Cycle Level Using Bayesian Optimized Machine Learning Approaches. Transp. Res. Rec. J. Transp. Res. Board 2023, 2677, 183–195. [Google Scholar] [CrossRef]

Figure 1. Methodological framework.

Figure 2. Summary flow diagram of PRISMA approach.

Figure 3. Summary of studies per model paradigm.

Table 1. Summary of studies reviewed.

Modelling Paradigm	No. of Studies	Representative Approaches	Study IDs
Statistical/regression-based	17	Poisson, NB, Tobit, copula, macroscopic conflict models, ARIMA/time-series	[3,4,13,24,30,31,32,33,34,35,36,37,38,39,40,41,42]
Bayesian probabilistic	18	Bayesian Tobit, hierarchical Bayesian, spatial Bayesian models, Bayesian deep learning	[2,4,5,10,11,12,15,16,17,18,22,23,43,44,45,46,47,48]
EVT-based	18	GEV, POT-GPD, conditional EVT, multivariate EVT, EVT time-series	[2,5,6,14,15,16,17,18,21,22,24,25,38,44,45,46,48,49]
Machine learning/deep learning	37	DNN, CNN, LSTM, GRU, GNN, RL, transfer learning, meta-learning, video analytics	[7,8,9,19,20,21,23,25,30,32,35,39,47,48,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72]
Hybrid approaches	14	Bayesian–EVT, EVT–ML, ML–Bayesian, statistical–ML, multi-stage frameworks	[2,5,16,21,22,23,24,25,38,39,45,46,47,48]

Table 2. Model-intrinsic methodological comparison across paradigms.

Model	Conflict Representation and Modelling Logic	Uncertainty and Heterogeneity	Temporal Modelling	Validation Strategy	Computational Complexity
Statistical/Regression	Threshold-based (TTC, PET, MTTC, DRAC); conflict counts/rates; Poisson/NB/Tobit	Overdispersion (NB); censoring (Tobit); random parameters (limited–moderate use)	Short fixed windows (≈30–120 s; cycle-level dominant); temporal dependence rarely explicit	AIC, BIC, deviance, χ²; sensitivity analysis; limited predictive validation	Low
Bayesian	Probabilistic/hierarchical extensions of threshold indicators; latent variable structures	Full posterior inference (MCMC); spatial + temporal heterogeneity (moderate–strong use)	Dynamic updating; autoregressive/state-space; time-varying parameters (increasing but not universal)	DIC; posterior checks; convergence diagnostics; occasional out-of-sample validation	Moderate–High
EVT	Extreme value focus (min TTC, PET exceedances); GEV/POT frameworks	Tail uncertainty; parameter sensitivity; Bayesian EVT (applied in a subset of studies)	Sliding windows; time-varying EVT; forecasting applications limited to a minority of studies	KS, AD tests; QQ-plots; tail fit diagnostics; crash-consistency checks	Moderate
ML/Deep Learning	Data-driven; trajectory/video/CV-based interactions; learned risk scores (no fixed thresholds)	Limited explicit uncertainty; ensembles, MC dropout, Bayesian DL (applied in a minority)	Strong temporal modelling (LSTM, GRU, GNN); real-time sequence learning widely adopted	Accuracy, precision, recall, F1, AUC; cross-validation; benchmarking; limited robustness testing	High
Hybrid (EVT + Bayesian + ML)	Integrated: frequency + severity + learned patterns; multi-stage or multi-model frameworks	Combined uncertainty (Bayesian + EVT + ML); often partial propagation across stages	Multi-scale (time-series + sequence learning + dynamic EVT); forecasting-oriented	Combined statistical diagnostics + predictive metrics; occasional cross-site validation	Very High

TTC = Time to Collision; PET = Post-Encroachment Time; MTTC = Modified Time to Collision; DRAC = Deceleration Rate to Avoid Collision; NB = Negative Binomial; MCMC = Markov Chain Monte Carlo; DIC = Deviance Information Criterion; GEV = Generalized Extreme Value; POT = Peak Over Threshold; KS = Kolmogorov–Smirnov; AD = Anderson–Darling; DL = Deep Learning; LSTM = Long Short-Term Memory; GRU = Gated Recurrent Unit; GNN = Graph Neural Network; MC = Monte Carlo; AIC = Akaike Information Criterion; BIC = Bayesian Information Criterion; AUC = Area Under the Curve; AR = Autoregressive; ARIMA = Autoregressive Integrated Moving Average.

Table 3. Data, application and deployment comparison.

Model	Application Context	Data Type and Composition	Sample Size (Typical Range)	Prediction Window	Dataset Scale and Tools	Operational Readiness	Strengths	Limitations
Statistical/Regression	Primarily signalized intersections (dominant); some freeway and heterogeneous traffic applications	Aggregated counts + trajectory-derived indicators	10³–10⁵ observations	Cycle-level and short-term (≈30 s–2 min dominant)	Loop detectors; basic trajectory extraction tools	High	Interpretable; low computational cost; effective in structured and data-limited contexts	Threshold sensitivity; weak temporal modelling; limited severity representation; context-specific calibration
Bayesian	Intersections + emerging network-level and CAV contexts	Aggregated + trajectory + spatial features	10⁴–10⁶ observations	Short-term to near-term (cycle-level up to few minutes ahead)	Trajectory data; CV data; probabilistic modelling frameworks	Moderate	Explicit uncertainty quantification; robust with sparse/noisy data; captures heterogeneity	High computational cost; complex specification; convergence requirements; scalability limitations
EVT	Safety-critical environments (intersections, highways, hotspots)	High-frequency trajectory or video-derived indicators	Moderate overall datasets, but effective sample = tail observations (often limited)	Short-term extreme risk estimation (event-driven; seconds–minutes)	Video analytics; trajectory extraction tools	Moderate	Strong theoretical linkage to crash risk; effective rare-event modelling; severity-focused	Requires sufficient extreme events; sensitive to threshold selection; limited modelling of non-extreme conditions
ML/Deep Learning	All environments; strongest in data-rich settings (video, LiDAR, CV, complex traffic)	High-dimensional multi-source data (trajectory, video, LiDAR, CV)	10⁵–10⁷ observations	Short-term prediction (seconds–minutes); limited multi-step forecasting	Large-scale datasets; AI/video analytics platforms; LiDAR; CV systems	Moderate–Low	High predictive performance; captures nonlinear and spatiotemporal interactions; adaptable	High data demand; low interpretability; sensitivity to data quality; transferability challenges
Hybrid (EVT + Bayesian + ML)	Complex, multi-source, network-level and emerging (CAV, mixed traffic)	Multi-source integrated (trajectory + video + spatial + CV data)	10⁵–10⁷ observations	Short-term + near-term forecasting (seconds to several minutes ahead)	Advanced multi-source analytics platforms	Emerging	Integrates frequency, severity, and uncertainty; strongest modelling capability; supports forecasting	Very high complexity; difficult calibration; scalability and interpretability challenges; limited real-world deployment

LiDAR = Light Detection and Ranging; CV = Connected Vehicle; CAV = Connected and Autonomous Vehicle.

Table 4. Comparative analysis of modelling paradigms based on conflict frequency and severity representation.

Model (Representative Studies)	Conflict Frequency Representation	Conflict Severity Representation	Dominant Orientation	Underlying Mechanism
Statistical/Regression [3,13,35]	Explicit: aggregated counts, rates, or probabilities over intervals (TTC, PET, DRAC thresholds); Poisson/NB/Tobit formulations	Implicit: threshold-defined severity classes only; no continuous severity modelling	Strongly frequency-oriented	Count-based stochastic modelling (Poisson/NB/Tobit; threshold-driven indicators)
Bayesian [4,10,11,18,23]	Probabilistic frequency estimation with posterior distributions; accounts for spatial/temporal heterogeneity	Partial: via hierarchical structures or Bayesian–EVT extensions; severity not primary target	Frequency-oriented with uncertainty	Hierarchical probabilistic inference (posterior distributions; MCMC; latent structures)
EVT [2,5,15,16,24]	Not explicitly modelled (focus not on counts)	Explicit: tail modelling of extreme conflicts (min TTC, PET exceedances) via GEV/POT	Strongly severity-oriented	Tail distribution modelling (extreme value theory; GEV/POT; rare-event inference)
ML/DL [9,19,47,66]	Implicit: learned from trajectory patterns, class frequencies, or exposure proxies	Explicit but data-driven: continuous risk scores, conflict likelihood, or escalation patterns	Primarily severity-oriented in data-driven form, while also implicitly capturing frequency patterns	Nonlinear pattern learning (LSTM, GRU, GNN; high-dimensional feature extraction)
Hybrid [21,23,39,48]	Explicit + implicit: combines exposure, counts, and learned frequency components	Explicit: integrates EVT-based severity with ML/Bayesian risk estimation	Fully integrated frequency–severity	Multi-stage hybrid modelling (statistical + EVT + ML + Bayesian inference)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jackai II, I.N.; Tezong Feudjio, S.L.; Ndingwan, T.L.; Dindze, O.D.; Usami, D.S.; Gonzalez-Hernandez, B.; Persia, L. Conflict-Based Models for Real-Time Crash Risk Assessment: A State-of-the-Art Review. Future Transp. 2026, 6, 107. https://doi.org/10.3390/futuretransp6030107

AMA Style

Jackai II IN, Tezong Feudjio SL, Ndingwan TL, Dindze OD, Usami DS, Gonzalez-Hernandez B, Persia L. Conflict-Based Models for Real-Time Crash Risk Assessment: A State-of-the-Art Review. Future Transportation. 2026; 6(3):107. https://doi.org/10.3390/futuretransp6030107

Chicago/Turabian Style

Jackai II, Isaac Ndumbe, Steffel Ludivin Tezong Feudjio, Tevoh Lordswill Ndingwan, Olive Dubila Dindze, Davide Shingo Usami, Brayan Gonzalez-Hernandez, and Luca Persia. 2026. "Conflict-Based Models for Real-Time Crash Risk Assessment: A State-of-the-Art Review" Future Transportation 6, no. 3: 107. https://doi.org/10.3390/futuretransp6030107

APA Style

Jackai II, I. N., Tezong Feudjio, S. L., Ndingwan, T. L., Dindze, O. D., Usami, D. S., Gonzalez-Hernandez, B., & Persia, L. (2026). Conflict-Based Models for Real-Time Crash Risk Assessment: A State-of-the-Art Review. Future Transportation, 6(3), 107. https://doi.org/10.3390/futuretransp6030107

Article Menu

Conflict-Based Models for Real-Time Crash Risk Assessment: A State-of-the-Art Review

Abstract

1. Introduction

2. Methodology

3. Results and Analytical Synthesis

3.1. Descriptive Analysis

3.2. Model-Centric Synthesis

3.2.1. Conflict Representation and Modelling Logic

3.2.2. Application Context and Traffic Environment

3.2.3. Data Requirements and Input Characteristics

3.2.4. Sample Size and Dataset Scale

3.2.5. Uncertainty and Heterogeneity Modelling

3.2.6. Temporal Modelling and Prediction Horizon

3.2.7. Model Performance and Validation Strategy

3.2.8. Computational Complexity and Implementation

3.2.9. Operational Readiness and Transferability

3.3. Frequency and Severity Dimensions

4. Research Gaps and Emerging Directions

4.1. Research Gaps

4.1.1. Fragmented Conflict Definitions and Limitations in Modelling Rare and Extreme Events

4.1.2. Incomplete Treatment of Uncertainty and Fragmented Spatiotemporal Modelling

4.1.3. Limited Validation and Transferability

4.1.4. Barriers to Scalability and Real-Time Deployment

4.2. Emerging Research Directions

4.2.1. Standardized and Adaptive Conflict Representations

4.2.2. EVT–Machine Learning Integration for Rare-Event Modelling

4.2.3. Integrated Uncertainty-Aware and Spatiotemporal Modelling Frameworks

4.2.4. Transferability, Benchmarking, and Generalizable Modelling

4.2.5. Scalable Real-Time Deployment and Unified Hybrid Systems

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI