Next Article in Journal
Improving Raman-Based Models for Real-Time Monitoring the CHO Cell Culture Process with Effective Variable Selection Strategies
Previous Article in Journal
Analysis of Spatiotemporal Gait Variables before and after Unilateral Total Knee Arthroplasty
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identifying High-Risk Patterns in Single-Vehicle, Single-Occupant Road Traffic Accidents: A Novel Pattern Recognition Approach

Research Division Transportation System Planning, Institute for Spatial Planning, Vienna University of Technology, Karlsgasse 11, A-1040 Vienna, Austria
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(19), 8902; https://doi.org/10.3390/app14198902
Submission received: 30 August 2024 / Revised: 29 September 2024 / Accepted: 30 September 2024 / Published: 2 October 2024
(This article belongs to the Section Transportation and Future Mobility)

Abstract

Despite various interventions in road safety work, fatal and severe road traffic accidents ( R T A s ) remain a significant challenge, leading to human suffering and economic costs. Understanding the multicausal nature of R T A s , where multiple conditions and factors interact, is crucial for developing effective prevention measures in road safety work. This study investigates the multivariate statistical analysis of co-occurring conditions in R T A s , focusing on single-vehicle accidents with single occupancy and personal injury on Austrian roads outside built-up areas from 2012 to 2019. The aim is to detect recurring combinations of accident-related variables, referred to as blackpatterns ( B P s ), using the Austrian R T A database. This study proposes Fisher’s exact test to estimate the relationship between an accident-related variable and fatal and severe R T A s (severe casualties). In terms of pattern recognition, this study develops the maximum combination value ( M C V ) of accident-related variables, a procedure to search through all possible combinations of variables to find the one that has the highest frequency. The accident investigation proceeds with the application of pattern recognition methods, including binomial logistic regression and a newly developed method, the PATTERMAX method, created to accurately detect and analyse variable-specific B P s in R T A data. Findings indicate significant B P s contributing to severe accidents. The combination of binomial logistic regression and the PATTERMAX method appears to be a promising approach to investigate severe accidents, providing both insights into detailed variable combinations and their impact on accident severity.

1. Introduction

1.1. Relevance and Problem Statement

Road traffic accidents ( R T A s ) with personal injuries result in substantial material and immaterial costs. According to the Austrian Accident Cost Accounting from 2022, the economic costs of a single fatal R T A are estimated at 4,801,407 Euros, with accidents resulting in severe injuries costing 593,479 Euros each [1]. Despite various interventions, fatal R T A s remain a significant challenge worldwide. Austria experienced a peak in fatal R T A s in 1972, with 2948 fatalities. Since then, numerous safety interventions, such as speed limits and mandatory seatbelt use, have significantly reduced the number of fatal accidents [2]. However, Austria still ranks 11th in the EU with 47 traffic fatalities per million inhabitants in 2019 [3]. The Austrian Ministry of the Interior [4] identifies several major accident causes, including speeding, distraction, and priority violations. These causes are determined subjectively by police officers at the scene, leading to potential biases. Besides the definition of accident causes, road safety work also focuses on the identification of accident blackspots. Blackspots are road sections where accidents frequently occur. Identifying these points is crucial for implementing targeted safety measures. However, going beyond the definition of a major accident cause and the identification of blackspots, this study aims to identify blackpatterns ( B P s ), which we define as recurring combinations of accident-related variables [5]. We conduct a detailed examination of recorded accident conditions, regardless of the officially designated accident cause. Understanding the multicausal nature of R T A s , where multiple conditions and factors interact, is crucial for developing effective prevention measures. This study addresses the gap in multivariate statistical investigation and pattern analysis approaches of R T A s , proposing that accidents are influenced by a complex interplay of driver, vehicle, roadway, and situational variables. Single-vehicle accidents involving a single occupant and personal injury were chosen for this study to eliminate the complexity introduced by interactions between multiple vehicles and drivers. By focusing solely on these incidents, we aim to isolate and analyse the factors contributing to severe outcomes without the confounding variables associated with vehicle-to-vehicle crashes. This approach allows for a more controlled examination of the conditions leading to personal injury, ensuring that this study captures the direct relationships between driver, vehicle, roadway, and environmental factors without external influences from other traffic participants. It is also important to note that the PATTERMAX method proposed in this paper has already been rigorously compared with other pattern recognition techniques in previous research. In [5], PATTERMAX was systematically evaluated against Bayesian networks, decision trees, and logistic regression in terms of its ability to identify complex multivariate accident patterns. The results of that analysis demonstrated that PATTERMAX excels in detecting high-dimensional, non-linear interactions between accident-related variables, making it particularly suited for the analysis of road traffic accidents. Given that this validation has already been carried out, the current study focuses on applying PATTERMAX to the specific context of single-vehicle accidents rather than revalidating it. Future work may further expand the scope by applying the method to different accident types and datasets.

1.2. Literature Review

R T A s are a significant public health concern, influenced by a complex interplay of factors. Various studies emphasise the need for a multidimensional approach to understand and prevent R T A s . One such study reviewed various data sources and techniques for accident analysis, emphasising the benefits of combining multiple analytical methods [6]. Another study employed system dynamics to model the complexity of R T A s , highlighting the importance of considering non-linear interactions between variables [7]. A multidimensional and multi-period analysis of road safety, incorporating various criteria such as human factors, accident causes, and road characteristics, was proposed in previous research [8]. The multifactorial nature of accidents involving human, vehicular, and environmental elements has also been reviewed in several studies [9]. Furthermore, black spot identification methods that couple statistical analysis with accident severity indices have been discussed as a more reliable approach for road safety assessments [10]. Traditional methods, such as generalized logistic regression and classification trees, have been widely used to identify combinations of factors leading to fatal accidents [11]. Researchers have applied association rule mining to reveal complex interactions between human, vehicle, road, and environmental factors in multi-fatality crashes [12]. A novel matched crash vs. non-crash approach for analysing severe crash patterns on multilane highways has also been introduced [13]. Logistic regression models have been developed to estimate fatality and major injury probabilities in single-vehicle accidents, with the major injury model showing better explanatory power [14]. More recent advancements in machine learning have introduced techniques that enhance the precision and scope of R T A analysis. Machine learning methods such as random forests, support vector machines (SVM), and deep learning models are now being applied to traffic accident data to capture non-linear and complex relationships between variables. For instance, random forests have proven effective for accident severity prediction by leveraging ensemble learning, which boosts accuracy by combining multiple decision trees [14]. Similarly, SVMs have been useful for classifying accident severity outcomes by optimizing the margin between different classes for improved classification accuracy [15]. Deep learning approaches, such as neural networks, have shown promise in real-time prediction of accident severity and in identifying complex patterns that are less apparent through traditional methods [16]. In addition, unsupervised learning methods, like clustering and dimensionality reduction techniques, including principal component analysis (PCA), are being increasingly used to explore latent structures within complex accident datasets [17]. These methods complement traditional approaches and provide a more detailed understanding of accident dynamics, enabling better accident prediction and risk factor identification. When analysing R T A data, one of the key objectives is to quantify the influence of accident-related variables on the severity of injuries. Several studies have identified factors that influence both accident severity and frequency. Critical factors such as collision type, road configuration, vehicle type, driver characteristics, and environmental conditions have been highlighted in research [15,16]. Specifically, motorcycles, male drivers, elderly drivers, nighttime driving, high-speed roads, and unlit roads have been pointed out as significant risk factors associated with higher accident severity [16]. Furthermore, studies have examined the relationship between safety devices and injury outcomes. For instance, safety devices, narrow impact zones, ejection, airbag deployment, and higher speeds have been strongly correlated with more severe injuries [17]. Additional research has emphasised the critical role of airbag deployment, vehicle extrication, ejection, travel speed, and alcohol involvement in determining injury severity [18,19]. Multiple driver errors have also been shown to result in more severe crashes [20]. Moreover, factors such as environmental conditions, vehicle type, protective devices, and time of day significantly impact accident severity, as discussed in other studies [21,22]. Understanding these variables is crucial for conducting exploratory data analyses and developing effective road safety measures. These insights provide a solid foundation for identifying critical risk factors and implementing targeted interventions in traffic safety [23]. Further studies highlight the importance of comprehensive data analysis in developing effective road safety strategies [10,24,25,26]. A systems approach focusing on the entire road transport system rather than just individual behaviour has been advocated by researchers [27]. The effectiveness of various interventions, including educational, engineering, and multifaceted approaches, has been demonstrated in improving pedestrian safety [28]. It has been found that legislation combined with strong enforcement or as part of a multifaceted approach is most effective in low- and middle-income countries [29]. Moreover, the importance of awareness creation, strict implementation of traffic rules, and scientific engineering measures to prevent R T A s has been stressed [30]. The dynamic interactions between various factors in analyzing R T A s and developing more effective safety measures underscore the necessity of comprehensive, multidimensional approaches to R T A prevention.

1.3. Research Question and Scope

R T A s remain a significant challenge, with single-vehicle accidents accounting for a substantial portion of fatalities in Europe [14]. Within the scope of a multidimensional approach to R T A analysis, this study investigates how multivariate and recurrent B P s in single-vehicle accidents can be identified, as well as their significance for severe and fatal accidents (referred to as severe casualties). This study aims to represent driver, vehicle, roadway, and situational variables and their correlations with accident severity using advanced statistical methods. Additionally, it identifies significant B P s among these variables. These patterns provide a deeper understanding of accident circumstances and highlight potential areas for targeted safety interventions. By combining descriptive statistics, binomial logistic regression, and innovative methods like the PATTERMAX method, this study seeks to detect recurring patterns that contribute to severe accidents and evaluate their frequency and impact. This research is intended not only to improve road safety measures but also to facilitate the development of more precise prevention strategies that target the most hazardous accident B P s . Therefore, this paper addresses the following research question: How can multivariate and recurrent variable-specific blackpatterns ( B P s ) in single-vehicle, single-occupant road traffic accidents with personal injury be accurately identified and analysed, and what is their significance in mitigating severe and fatal accidents?

2. Methods

Figure 1 illustrates the key steps in the methodological approach used in this study. It begins with data preparation focused on single-vehicle, single-occupant accidents, followed by descriptive analyses using Fisher’s exact test and the phi coefficient to identify significant relationships between variables and accident severity. The maximum combination value (MCV) is then calculated to identify frequent co-occurrences of accident-related variables. Binomial logistic regression is applied to model the impact of these variables on severe casualties. Finally, the PATTERMAX method is employed to detect B P s , which are ranked using the blackpattern impact score ( B I S ) to prioritise high-risk combinations for targeted road safety interventions. The following chapters present these methodological steps in detail, providing a comprehensive explanation of each stage.

2.1. Data Preparation for Pattern Recognition

Between 2012 and 2019, 303,700 R T A s occurred on the Austrian road network. 110,666 road accidents occurred outside built-up areas, while 193,034 accidents occurred within built-up areas. This study focuses on single-vehicle accidents with single occupancy that occurred outside built-up areas between 2012 and 2019 ( n = 20,293). The chosen sample amounts to 7% of all R T A s with a personal injury in Austria between 2012–2019 ( n = 303,700). Within the period under review, 110,666 accidents with personal injury occurred outside the built-up area, of which the extracted sample comprises 18%. The selection of these specific accidents allows for an analysis that is not confounded by the presence of multiple vehicles or individuals, which could otherwise complicate the already complex nature of road traffic accidents. By isolating these accidents, this study can more effectively identify and examine the underlying BPs and factors contributing to severe outcomes, making this sample particularly valuable for targeted analysis. The data preparation involves creating a binary RTA database with over 150 accident-related variables. Figure 2 illustrates the extracted RTA data sample in relation to all recorded RTAs between 2012–2019 in Austria.

2.2. Accident-Related Variables

After recoding all accident-related characteristics and setting up a binary accident database, the next step in data preparation foresees the assignment of each binary variable to one of the following categories: driver-related variables (54 variables), vehicle-related variables (32 variables), roadway-related variables (50 variables), and situation-related variables (22 variables). Table 1 illustrates the categorisation scheme for the 158 analysed accident-related variables.
We aim to quantify each accident-related variable’s impact on the degree of injury. Therefore, the dependent variable shall combine severe injury and fatalities within the category of severe casualties. Regarding the Austrian Road Safety Strategy 2021–2030 [31], it is equally important to reduce fatalities and the number of severe injuries. Also, both categories (severe and fatal accidents) entail high economic costs and human suffering. These premises lead to the following classification of the degree of injury:
  • Casualties: minor injury, severe injury, death at accident site, death within 30 days,
  • Severe casualties: severe injury, death at accident site, death within 30 days.
Thus, the degree of injury comprises two categories within this study. The resulting dependent variable is severe casualties. This classification corresponds to the definition within the Handbook of Transportation System Planning [32] (p. 73).

2.3. Descriptive Analyses

Initial analyses include calculating conditional and joint probabilities, applying Fisher’s exact test, and estimating the phi coefficient for each accident-related variable in relation to severe casualties, treating severe casualties as the dependent variable. A bootstrap resampling method is used for robust parameter estimation, and a maximum combination value ( M C V ) is calculated as a key indicator for B P detection. This value indicates how often a specific variable co-occurs with one or more accident-related variables. Each accident-related variable is broken down into a contingency table, where the rows represent the accident variable and the columns represent the outcomes: casualty and severe casualty. The frequency n i j represents the number of occurrences where the accident variable takes the value x i and the outcome is either casualty or severe casualty, with severe casualty being treated as the dependent variable. The conditional probability P of an event A given another event B is denoted as P = A B :
P = P ( A B ) P ( B )
Here, P = ( A B ) is the joint probability of A and B , and P(B) is the probability of B . In the context of this analysis, A represents a specific accident variable, and B represents the outcome severe casualties. Fisher’s exact test calculates the exact probability of observing the distribution in the contingency table. This is particularly useful for small sample sizes or when examining the relationship between an accident variable and severe casualties. The phi coefficient is a measure of association between each accident-related variable and the outcome severe casualties. The probability P of observing this particular table is calculated using the hypergeometric distribution:
P = a + b a c + d c n a + c
where
  • a + b a is the binomial coefficient, calculated as a + b ! a ! × b ! ,
  • c + d c is the binomial coefficient for the second row,
  • n a + c is the binomial coefficient for the total table, where n = a + c + b + d .
As a next step, we apply Bootstrap resampling to estimate robust confidence intervals for the parameters. The 95% confidence intervals indicate that certain variables consistently contribute to severe accidents, reinforcing the findings from the Fisher’s test. As a first step towards pattern recognition, we want to identify the M C V , which tells us how often a specific variable co-occurs with one or more accident-related variables. Let D = D 1 , D 2 , , D n be a dataset with n entries. Each entry D i consists of a set of binary variables x 1 , x 2 , , x m where each x j can be either 0 or 1. The goal is to find the combination of variables that maximizes the occurrence of a specific outcome, Y , which could be severe accidents, for instance. To define the combination of variables that includes x j , let C = x j 1 , x j 2 , , x j k be a combination of x j with k other variables, where x j 1 , x j 2 , , x j k are selected from the full set x 1 , x 2 , , x m . The frequency F ( C ) of each combination C is defined as the number of entries D i , where all variables in C take the value 1.
F C = i = 1 n I ( C , D i )
where the indicator I C , D i is defined as follows:
I C , D i = 1     i f   x j 1 = 1 ,   x j 2 = 1 ,   , x j k = 1   i n   D i ,   0     o t h e r w i s e                                                                                            
The M C V is the combination C * that includes x j and maximises the frequency F ( C ) in relation to a specific outcome Y = 1 :
M C V = max C x 1 , x 2 , , x m , x j C F ( C | Y = 1 )
The M C V approach involves searching through all possible combinations that include the specific variable x j , calculating the frequency with which these combinations occur when a specific outcome Y = 1 is observed, and identifying the combination with the highest frequency. The M C V method analyses how frequently a particular variable occurs in combination with one or more other variables, identifying the most common combination in which the variable appears.

2.4. Binomial Logistic Regression

This study employs several pattern recognition methods. To investigate to what extent accident-related variable affects the probability of severe casualties, we apply binomial logistic regression, with severe casualties as the dependent variable. The logistic regression model is crucial for understanding how different accident-related variables, such as speeding, alcohol use, or road conditions, contribute to the probability of severe casualties. By examining these relationships, the model helps identify key factors that increase the risk of severe accidents.
log P ( Y = 1 ) P ( Y = 0 ) = β 0 + β 1 X 1 + β 2 X 2 + + β k X k
where
  • P ( Y = 1 ) is the probability of the outcome being severe casualty,
  • P ( Y = 0 ) is the probability of the outcome being a non-severe casualty,
  • log P Y = 1 P Y = 0 is the log-odds of the outcome occurring (severe casualties),
  • β 0 is the intercept term, representing the log-odds of severe casualties when all predictors X 1 , X 2 , , X k are zero,
  • β 1 , β 2 , , β k are coefficients associated with each accident-related predictor variable, X 1 , X 2 , , X k . These coefficients indicate the strength and direction of the relationship between each variable and the likelihood of severe casualties.

2.5. PATTERMAX Method

The developed PATTERMAX method analyses the frequencies of variable combinations ( B P s ) and examines their association strength with severe casualties. The R T A dataset D consists of n entries, where each entry is a sequence of x binary variables (0 s and 1 s). We aim to calculate the frequency of each B P , i.e., each identical sequence of 0 s and 1 s of length m in the dataset D . We define the B P of length m as a string of m binary variables, where B P = ( p 1 , p 2 , . . . , p m ) , with p i being either 0 or 1. To calculate the frequency F ( B P , D ) of B P in the dataset D , we use the PATTERMAX method, which proceeds as follows:
F B P , D = i = 1 n j = 1 x m + 1 I ( B P , D i j : j + m )
where
  • n is the number of entries in the dataset D ,
  • x is the number of binary variables in each entry,
  • D i represents the i-th entry in the dataset D ,
  • j is the position in the entry D i where the B P is checked,
  • I ( B P , D i j : j + m is an indicator function that returns 1 if the substring B P exactly matches D i from position j to j + m 1 , and 0 otherwise.
This formula describes the PATTERMAX method for calculating the frequency of the B P in the dataset D . To verify if the B P matches at a specific position, we iterate over each entry in the dataset, over all positions in the entry, and use the indicator function. The sum over all entries and positions returns the total frequency of B P in D . After identifying B P s using the PATTERMAX method, each generated B P is further examined using Fisher’s exact test to determine the p -value that quantifies the strength of the association between the B P and severe casualties. p F i s h e r ( B P ) represents the p -value obtained from Fisher’s exact test for B P . This step ensures that the B P s identified are not only frequent but also statistically significant in their relationship to severe accidents.

2.6. Blackpattern Impact Analysis

To calculate a blackpattern impact score ( B I S ), we combine four components: frequency of the B P   F B P , D , the statistical association between the B P and severe causalities (measured by ( p F i s h e r B P ) , the strength of this association (measured by the phi coefficient ϕ B P ), and the logistic regression coefficients β i corresponding to the variables in B P . These components are integrated into a comprehensive B I S to prioritize the identified B P s . This approach enables a precise assessment of the B P s concerning severe accidents by considering both their frequency and the strength of their association with severe casualties, thereby identifying B P s that are both frequent and impactful.
B I S B P = F ( B P , D ) × ϵ ϕ B P × l o g p F i s h e r B P × i = 1 k ϵ β i
where
  • β i represents the logistic regression coefficient for each variable V i in the B P ,
  • F B P , D is the frequency of the B P ,
  • ϕ B P is the phi coefficient, which measures the strength of the association between the B P and the outcome,
  • p F i s h e r B P is the p -value from Fisher’s exact test, indicating the statistical significance of the association between the B P and the outcome.
To amplify the influence of highly significant B P s (with very small p -values), the negative logarithm of p F i s h e r B P is used. The transformation l o g p F i s h e r B P converts very small p -values into larger positive numbers. This ensures that B P s with strong statistical significance have a greater impact on the B I S . Both the logistic regression coefficients β i and the phi coefficient ϕ B P represent the strength of association. Small coefficients or ϕ -values might otherwise have a minimal effect on the B I S . The exponential transformation ϵ β i and ϵ ϕ P magnifies these values, particularly when they are small. This emphasizes the contribution of B P s where the variables have a stronger association with the outcome.
The blackpattern impact analysis allows to identify B P s that are not only common and impactful but also statistically significant in their relationship with severe casualties. This approach provides a comprehensive and nuanced prioritization of B P s , ensuring that our analysis highlights the most relevant and meaningful B P s for further investigation or intervention. Table 2 illustrates the features of the blackpattern impact analysis that must be considered when interpreting the retrieved B I S .

3. Results

3.1. Descriptive Analyses Results

Descriptive statistics reveal the frequency and probability of each variable in severe and fatal accidents. Significant relationships between variables and accident severity are identified using Fisher’s exact test and the phi coefficient. Also, we generate the presented M C V . We conduct descriptive analyses for each variable within our defined categories (driver, vehicle, roadway, and situation). Detailed analysis results can be found in Appendix A.
Figure 3 presents the retrieved phi coefficients between various driver-related variables and the dependent variable, severe casualties. The phi coefficient measures the strength and direction of association between binary variables. Positive values indicate a direct association, where the presence of the variable correlates with an increased likelihood of severe casualties, while negative values indicate an inverse relationship. The figure’s colour gradient indicates the strength and direction of the correlation for each variable with severe casualties, where red represents positive values and blue represents negative values. Driving in parallel shows the highest positive phi coefficient (0.604), indicating a strong positive correlation with severe casualties. This suggests that when drivers engage in this behaviour, the likelihood of severe accidents significantly increases. No safety belt applied (0.240) and male drivers (0.133) also exhibit notable positive associations with severe casualties, reinforcing well-established road safety insights that males and lack of seatbelt use are high-risk factors. Age-related factors, such as drivers aged 64 and older (0.082), 45 to 54 years (0.046), and 55 to 64 years (0.044), also show moderate positive correlations with severe casualties, suggesting older age groups are more vulnerable to severe outcomes in accidents. Other factors like hitting a tree (0.062) and fatigue (0.030) also display positive correlations, indicating that these environmental and driver-related conditions contribute to more severe accident outcomes. In contrast, female drivers (−0.133), drivers aged 19 to 24 years (−0.085), and those with a probationary driving license (−0.065) show negative correlations with severe casualties, indicating a lower likelihood of severe injuries for these groups compared to others. Risky behaviours such as speeding (−0.011) and hitting a stationary vehicle (−0.005) show slight negative correlations, which may suggest that while these actions are dangerous, they are not as strongly associated with severe casualties in this dataset.
Figure 4 presents the phi coefficients between various vehicle-related variables and the dependent variable, severe casualties. Higher engine power correlates positively with severe casualties, as seen with vehicles having 110+ kW engine power (0.053) and 90–110 kW engine power (0.039). This suggests that vehicles with more powerful engines are more likely to be involved in accidents resulting in severe injuries. Similarly, vehicle fire (0.035) shows a positive association. Interestingly, vehicle kilometrage between 150,000 to 200,000 km (0.010) and 0–24 kW engine power (0.006) also present positive associations, suggesting that vehicles with higher mileage and very low engine power might also contribute to accident severity. The variable airbag not deployed shows the most substantial negative correlation (−0.149), suggesting that in cases where the airbag does not deploy, the likelihood of severe casualties is lower. This does not mean that the absence of airbag deployment directly reduces the risk of injury; rather, it reflects the fact that airbags are typically designed to deploy only in high-impact crashes. In lower-impact accidents, where the airbag does not activate, the injuries tend to be less severe. Therefore, the negative correlation likely indicates that accidents where airbags are not deployed are generally less severe and thus less likely to result in severe casualties. This interpretation aligns with the purpose of airbags, which are activated in the most dangerous collisions to prevent serious injury. Vehicles with 24–90 kW engine power (−0.066) and blue-coloured vehicles (−0.021) also exhibit negative associations with severe casualties. Additionally, variables like technical defects (−0.004), insufficient load securing (−0.008), and vehicle kilometrage between 15,000 and 75,000 km (−0,010) present slight negative correlations, suggesting a reduced risk of severe injuries in accidents involving vehicles with these characteristics.
Figure 5 presents the phi coefficients between various roadway-related variables and the dependent variable, severe casualties. As before, the phi coefficient indicates the strength and direction of the association between the roadway conditions and the likelihood of severe accidents. Positive values (red) suggest that the presence of a certain condition increases the likelihood of severe casualties, while negative values (blue) indicate that the condition is inversely related to severe outcomes. Dry roads show the highest positive correlation with severe casualties (0.095), suggesting that accidents occurring on dry roads are more likely to result in severe injuries. This could be due to higher speeds and less cautious driving on dry roads. Similarly, straight roads (0.040) and other roads (0.037) exhibit positive correlations, potentially because drivers may underestimate the risks on straightforward routes or less-frequented roads. Infrastructure elements like galleries (0.026), tunnels (0.022), and bridges (0.022) also show positive associations, indicating that these roadway types might pose a higher risk of severe accidents. Speed limits appear in both positive and negative associations, with the 100 km/h speed limit (0.019) showing a slight positive correlation, suggesting that accidents at this speed are more likely to be severe. In contrast, lower and higher speed limits, such as 60 km/h (−0.002), 120 km/h (−0.004), and 130 km/h (−0.006), are negatively correlated, which could reflect more controlled or lower risk driving behaviours at these speeds. Wet roads (−0.270) and wintry conditions (−0.090) show the strongest negative correlations with severe casualties. This may be due to more cautious driving in adverse weather conditions, as drivers tend to reduce speed and drive more carefully in slippery conditions, leading to less severe accidents. Similarly, curves (−0.042) and middle separation (−0.019) display negative correlations, suggesting that these road features may promote more cautious driving behaviour, thereby reducing the likelihood of severe accidents. Road types such as tunnels, bridges, and straight roads, along with dry conditions, seem to contribute more to severe outcomes, while adverse weather and curved roads tend to reduce the risk, possibly due to more cautious driving behaviours. These insights are valuable for road safety planning and interventions, as they suggest where targeted efforts can be made to reduce accident severity based on the road environment.
Figure 6 shows the phi coefficients between situation-related variables and severe casualties. Positive phi values (red) suggest that certain conditions are associated with a higher likelihood of severe accidents, while negative values (blue) indicate an inverse relationship. Clear or overcast weather (0.053) and the period 12 a.m. to 6 a.m. (0.051) exhibit the highest positive correlations, indicating that accidents occurring during these conditions are more likely to result in severe casualties. This might be due to higher speeds during clear weather, as drivers feel more confident under such conditions. Similarly, the period from 6 p.m. to 12 a.m. (0.023) shows a moderate positive correlation, possibly reflecting increased accident severity during evening hours when visibility may decrease, but drivers may still be inclined to drive at high speeds. Seasonal factors such as summer (0.025) and autumn (0.023) also show mild positive correlations, suggesting that accidents during these times of the year are more likely to result in severe outcomes. This could be related to higher traffic volumes during vacation seasons or more frequent long-distance travel. Glare from the sun (0.010) contributes to a smaller positive correlation, which might be due to reduced visibility affecting driver reactions. On the other hand, winter (−0.580) shows the most significant negative correlation with severe casualties. This strong inverse relationship suggests that accidents occurring in winter conditions are less likely to result in severe injuries, likely due to slower driving speeds and increased caution on icy or snow-covered roads. Similarly, snow (−0.067) and rain (−0.019) show negative correlations, which further supports the idea that adverse weather conditions encourage safer driving behaviour, leading to less severe accidents. Variables such as fog (−0.004), hail or freezing rain (−0.007), and limited visibility (−0.008) exhibit slight negative correlations, indicating that these conditions may reduce the risk of severe casualties. This might be because drivers are more cautious and reduce speed when faced with these challenging conditions. The analysis of situation-related variables suggests that clear weather and certain times of the day (e.g., nighttime) are more strongly associated with severe casualties, whereas winter conditions and precipitation tend to reduce the severity of accidents.

3.2. Logistic Regression Analysis Results

Binomial logistic regression shows the strength of relationships between accident-related variables and severe accidents, identifying high-risk variables with significant odds ratios. The logistic regression analysis in Table 3 reveals several key variables that significantly increase the likelihood of severe R T A s . One of the most influential factors is the non-use of a safety belt, which has the highest odds ratio ( e x p ( β ) = 5.015) among the variables analysed, indicating that drivers not wearing a seatbelt are over five times more likely to be involved in a severe accident. Other critical factors include young drivers, particularly those aged 16 to 18, who have an odds ratio of 2.317, and those aged 19 to 24, with an odds ratio of 2.101, reflecting a significantly higher risk for these age groups. Environmental and situational factors also play a substantial role. Driving during early morning hours (12 a.m. to 6 a.m.) increases the likelihood of severe accidents by 35.9% ( e x p ( β ) = 1.359), likely due to factors such as fatigue and reduced visibility. Road conditions, such as driving on a wet road or under wintry conditions, also contribute to higher accident severity, with odds ratios of 1.261 and 1.462, respectively. The presence of specific road features like curves, intersections, and tunnels significantly increases the risk, with tunnels showing an odds ratio of 1.674 and curves 1.198, indicating these features are critical risk factors. Vehicle-related factors also influence the severity of accidents. Vehicles with engine power between 24 and 90 kW show a 19,2 % higher likelihood of severe accidents, while certain actions like sudden braking or hitting an obstacle on the road increase the risk significantly, with odds ratios of 2,0 and 3.394, respectively. Interestingly, hitting a guardrail is associated with a lower likelihood of severe accidents, with an odds ratio of 0.731, suggesting that this might serve as a mitigating factor under certain conditions. The analysis also highlights the significant impact of alcohol, which nearly doubles the likelihood of severe accidents ( e x p ( β ) = 1.916), underscoring the critical danger posed by impaired driving. Additionally, vehicle-related variables like the colour green and the absence of airbag deployment are associated with higher risks, with odds ratios of 1.317 and 2.233, respectively.
When performing multiple logistic regression, some variables may be excluded from the final model, resulting in no regression coefficient being assigned to them. This exclusion occurs because the statistical model deems these variables to have an insignificant or non-contributory effect on the outcome, often due to multicollinearity, lack of variability, or because their contribution is already captured by other variables in the model.

3.3. PATTERMAX Method Results

The PATTERMAX method reveals critical B P s in the data, indicating combinations of factors that significantly contribute to severe accidents (Table 4). One of the most prominent B P s identified is the combination of a 130 km/h speed limit, driving on a highway, drifting to the right, and being a male driver. This B P is statistically significant with a p -value of 0.001 and a phi coefficient of 0.027, occurring 44 times in the dataset. This suggests that this specific combination of factors is strongly associated with severe accidents. Another significant B P involves a 100 km/h speed limit on a country road, with left drift and male drivers, showing an even stronger correlation with severe casualties ( p = 0.000, ϕ = 0.032) and a frequency of 41 occurrences. This B P underscores the heightened risk associated with country roads, particularly when combined with drifting and male drivers. Additional B P s include scenarios where male drivers on country roads, particularly under conditions of fatigue or without wearing a safety belt, show a strong association with severe accidents. For example, the combination of a 100 km/h speed limit, left drift, a male driver, and no safety belt applied is highly significant ( p = 0.000, ϕ = 0.031), though it occurs less frequently, with 10 recorded instances. This indicates that, although less common, this particular combination of factors leads to particularly severe outcomes. Other B P s highlight the risk posed by wet roads and darkness. A B P involving a 100 km/h speed limit on a country road, a wet road surface, a male driver aged 25–34, and a right drift shows a strong association with severe accidents ( p = 0.001, ϕ = 0.027). Similarly, driving in darkness on country roads with right drift and male drivers also presents a significant risk ( p = 0.003, ϕ = 0.026).

3.4. Blackpattern Impact Analysis Results

The blackpattern impact analysis results in Table 5 highlight the varying influence of different combinations of variables on the likelihood of severe R T A s , with the B I S providing a quantitative measure of their overall effect. In cases where a B P generated by the PATTERMAX method includes variables without a regression coefficient, we assign a value of zero ( β = 0 ) to these variables. By setting the coefficient to zero, we ensure that the variable neither positively nor negatively influences the B I S , reflecting the fact that the variable does not significantly impact the likelihood of severe outcomes according to our logistic regression model. The B P with the highest B I S involves a 100 km/h speed limit on a country road, left drift, a male driver, and the absence of a safety belt, which has a significant B I S of 982.9. This high B I S reflects the strong influence of not wearing a seatbelt, which substantially increases the likelihood of severe accidents, as indicated by the high regression coefficient ( β = 1.612). The combination of a 100 km/h speed limit, country road, and a male driver, whether drifting left or right, consistently yields high B I S (e.g., 804.7 and 167.6), indicating that these factors together significantly elevate the risk of severe accidents. A speed limit of 130 km/h on a highway with right drift and a male driver result in a relatively high B I S of 628.4 which still presents a notable risk. This B P highlights that while speed and road type are important, the absence of additional high-risk behaviours like seatbelt non-use somewhat mitigates the overall risk. B P s involving female drivers or those with a speed limit of 80 km/h on a country road with right drift show even lower B I S (e.g., 50.1 and 38.3), reflecting the reduced likelihood of severe outcomes compared to more dangerous combinations. This suggests that gender and lower speed limits contribute to safer outcomes, although they are not completely devoid of risk. The B I S also underscores the combined risk posed by fatigue and specific road conditions (e.g., 167.6 and 37.1).

4. Discussion

The findings from this study underscore the complex and multivariate nature of RTAs, particularly single-vehicle, single-occupant accidents outside built-up areas in Austria. By applying statistical methods such as binomial logistic regression and the PATTERMAX method, we have identified significant B P s that consistently correlate with severe casualties. These BPs provide critical insights into how specific combinations of driver-related, vehicle-related, roadway-related, and situational factors contribute to the severity of accidents. The multivariate approach used here echoes the work of [6], who found that considering multiple interacting factors is crucial for understanding R T A risk. One of the key observations from the logistic regression analysis is the substantial impact of not wearing a seatbelt, which emerged as the most influential variable, increasing the likelihood of severe accidents by over five times. This finding aligns with existing literature, such as [33], which consistently highlights the protective benefits of seatbelt usage in preventing severe injuries and fatalities. Seatbelt non-use remains one of the most critical behavioural risk factors in R T A s , as demonstrated in several international studies [34,35], which also emphasise the need for stricter enforcement of seatbelt laws to mitigate injury severity. Similarly, the significant influence of young drivers, particularly those aged 16 to 24, on accident severity is consistent with previous research that points to younger drivers’ higher propensity for risky behaviours, such as speeding and distracted driving [36]. Studies by [37] also note the higher incidence of severe accidents among this demographic due to inexperience and impulsive driving behaviours.
The PATTERMAX method further refines our understanding by identifying specific combinations of variables that, when occurring together, significantly increase the likelihood of severe outcomes. For instance, B P s involving high-speed limits, rural roadways, and male drivers frequently result in severe accidents, especially when compounded by factors such as driver fatigue or adverse weather conditions. This mirrors findings from studies like [38] that show how rural roads, higher speeds, and male drivers increase accident risk, particularly in environments with poor weather or lighting conditions. The importance of considering such multivariate patterns in road safety interventions is also emphasised in work by [39], who recommend tailored safety measures for rural roads with high-speed limits. Moreover, the B P impact analysis introduces a novel way of quantifying the combined effect of these variables, offering a clear prioritisation of the most dangerous combinations. This is particularly useful for designing targeted interventions that can address the most critical risks. For instance, the combination of a 100 km/h speed limit, a country road, left drift, and a male driver not wearing a seatbelt was identified as having the highest B I S , making it a prime target for road safety campaigns and enforcement measures. This is consistent with recommendations from [40], who suggests that high-risk locations and behaviours must be prioritised for intervention based on empirical accident data. Similarly, [41] highlights the effectiveness of focusing on specific behavioural interventions, such as enforcing speed limits and seatbelt use, especially in rural areas, to reduce severe accidents.
In conclusion, this study highlights the importance of multivariate approaches in road safety research and supports the growing body of literature that calls for a comprehensive, data-driven approach to accident prevention. Future research could benefit from incorporating more advanced machine learning techniques, as these methods are particularly well-suited to detecting complex patterns in large datasets, as demonstrated by [42]. Emerging technologies like digital twin theory for autonomous vehicle testing [43] and advanced trajectory extraction methods under challenging environmental conditions [44] could provide new avenues for refining accident prediction models and improving safety in more complex scenarios. Integrating these techniques into the B P analysis framework could enhance the accuracy and applicability of the results for road safety policy.

5. Conclusions

This study successfully identifies and quantifies the most significant B P s associated with severe single-vehicle, single-occupant R T A s on Austrian roads. By focusing on accidents occurring outside built-up areas, we were able to analyse specific high-risk combinations of variables, such as driver behaviour, vehicle characteristics, roadway conditions, and situational factors, that contribute to the severity of accidents. The use of binomial logistic regression and the PATTERMAX method provides a robust framework for understanding the complex, multicausal interactions that lead to severe accident outcomes. It is important to note that this paper is methodologically and hermeneutically driven. The primary focus is on the development, validation, and application of the B P detection method rather than on prescribing specific road safety interventions. As such, while the results offer valuable insights into high-risk combinations of factors, explicit recommendations for road safety measures are not the central aim of this study. Instead, the emphasis is on providing a versatile toolset that can be used by researchers and policymakers to further explore and address severe R T A s within their respective contexts. However, the findings should be considered within the context of several limitations. First, this study is based on a dataset of single-vehicle accidents, which excludes vehicle-to-vehicle crashes, potentially limiting the generalisability of the results to other types of RTAs. Second, the analysis relies on available data, which may not fully capture certain behavioural and environmental factors, such as driver distraction or precise weather conditions at the time of the accident. The lack of real-time behavioural data also means that some underlying causes of accidents, like driver fatigue or attention lapses, could not be directly analysed. Additionally, while the B P approach is effective for identifying multivariate risk patterns, it is important to acknowledge that more advanced machine learning techniques could further enhance predictive accuracy by capturing non-linear relationships between variables.
Despite these limitations, this study provides valuable insights for road safety interventions specific to the Austrian context. The identified B P s highlight the importance of addressing high-risk combinations of variables, such as high-speed limits, lack of safety equipment (e.g., seatbelt use), and rural road conditions. Policymakers can use these findings to design targeted interventions, such as stricter seatbelt enforcement, road infrastructure improvements, and the adjustment of speed limits based on road type and risk profile. Furthermore, public awareness campaigns aimed at high-risk groups, such as younger drivers or those driving in adverse conditions, could be crucial in mitigating the risks highlighted by the B P analysis.
Future research should expand the scope of this analysis to include other accident types and integrate additional data sources, such as real-time behavioural and environmental data, to develop more comprehensive accident prediction models. The integration of more advanced machine learning techniques, such as neural networks or random forests, could also be explored to enhance the detection of complex patterns. By doing so, we can refine our understanding of the factors contributing to severe accidents and improve the effectiveness of prevention strategies. Ultimately, the insights gained from this study provide a solid foundation for improving road safety and reducing the human and economic costs associated with severe R T A s on Austrian roads.

Author Contributions

Conceptualization, T.F.; methodology, T.F.; validation, T.F. and G.H.; formal analysis, T.F.; investigation, T.F.; data curation, T.F.; writing—original draft preparation, T.F.; writing—review and editing, T.F. and G.H.; visualization, T.F.; supervision, G.H. All authors have read and agreed to the published version of the manuscript.

Funding

Open Access Funding by TU Wien.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Descriptive Analysis Results

The driver-related outcomes in Table A1 reveal that male drivers are significantly more likely to be involved in severe accidents compared to female drivers, with a probability of 12.11% versus 4.79%, respectively. Age also plays a crucial role, with younger drivers aged 19 to 24 and older drivers aged 64 and above showing a higher likelihood of being involved in severe accidents. The analysis indicates that the absence of a driving license and probationary driving licenses are associated with increased accident severity, although their impact is relatively lower compared to other factors. Impairment due to alcohol, distraction, and fatigue are highlighted as significant contributors to severe accidents, but among these, fatigue shows a particularly strong correlation. The table also underscores the critical impact of not wearing a seatbelt, which is strongly associated with severe casualties, as evidenced by the highest phi coefficient in the analysis. Various driving manoeuvres, such as skidding, hitting a tree, and sudden braking, also exhibit significant relationships with accident severity, with some manoeuvres like hitting a tree being particularly indicative of severe outcomes. The M C V suggests that certain variables, like the absence of a seatbelt, tend to co-occur with other risk factors more frequently in severe accidents, further emphasising their role in contributing to accident severity.
Table A1. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by driver-related variables. n = 20,293 (3431 are severe casualties).
Table A1. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by driver-related variables. n = 20,293 (3431 are severe casualties).
VariableCasualties
n
Severe
Casualties
n
P (X ∩ SC)
%
Fisher’s
Exact Test
p
Phi
Coefficient
ϕ
MCV
n
SexMale11,576245812.11%0.0000.133817
Female87069724.79%0.000−0.1331.132
Unknown sex111----
Age class16 to 18 years14651620.80%0.000−0.044171
19 to 24 years65478063.97%0.000−0.0851.132
25 to 34 years43236973.43%0.120−0.011830
35 to 44 years24884682.31%0.0080.019432
45 to 54 years21804762.35%0.0000.046382
55 to 64 years14043231.59%0.0000.044212
64 and older18784992.46%0.0000.082303
Unknown age class8-----
* DLNo driving licence356940.46%0.0200.03415
Probationary driving licence28053031.49%0.000−0.065391
ImpairmentAlcohol28584812.37%0.934−0.001246
Distraction23694312.12%0.0790.01293
Fatigue15183171.56%0.0000.030134
Health432910.45%0.0210.01638
Drugs66150.07%0.2470.0093
Medicines50100.05%0.5700.0042
Excitation720.01%0.3370.0061
Driving manoeuvresSpeeding36085792.85%0.136−0.011131
Skidding18232391.18%0.000−0.03280
Hitting an obstacle next to road15122801.38%0.0860.01235
Hitting the guardrail13781810.89%0.000−0.02737
Hitting a tree12173181.57%0.0000.06223
Misconduct by pedestrians503790.39%0.505−0.00512
Hit and run371530.26%0.186−0.01022
Sudden braking149110.05%0.002−0.0229
Overtaking147260.13%0.8340.0028
Cutting curves128270.13%0.1940.0094
Hitting an obstacle on the road11760.03%0.001−0.0247
Changing lanes5890.04%1.000−0.0023
Inadequate safety distance3870.03%0.8280.0021
Reverse driving2660.03%0.4290.0062
Phoning2570.03%0.1750.0101
Turning around2240.02%0.7800.0013
Fall from the vehicle22110.05%0.0000.0292
Getting in lane1840.02%0.5290.0041
Disregarding driving direction1620.01%1.000−0.0031
Priority violation1540.02%0.3020.0071
Driving towards left-hand side of road930.01%0.1840.0091
Forbidden overtaking820.01%0.6300.0041
Hitting a moving vehicle800.00%0.367−0.0092
Disregarding driving ban520.01%0.2010.0101
Driving in parallel510.00%1.0000.6041
Opening the vehicle door520.01%0.2010.0101
Hitting a stationary vehicle300.00%1.000−0.0051
Wrong-way driver100.00%1.000−0.0031
Disregarding red light100.00%1.000−0.0031
Dangerous stopping and parking00----
Disregarding turning ban00----
Missing indication of direction change00----
Driving against one-way00----
** STDriving without mandatory light00----
No safety belt applied14016993.44%0.0000.24060
* DL: Driving licence; ** ST: Safety Settings.
Table A2 presents a comprehensive analysis of vehicle-related variables and their association with the severity of accidents. Engine power is a notable factor, with vehicles having higher engine power (over 110 kW) showing a higher probability of severe casualties, as indicated by the phi coefficient of 0.053 and a significant p -value of 0.000. This suggests that vehicles with greater engine power are more likely to be involved in severe accidents. In contrast, vehicles with lower engine power (24–90 kW) demonstrate a negative correlation with accident severity, as reflected by a negative phi coefficient (−0.066). Vehicle colours appear to play a neglectable role as the correlations are weak and not statistically significant. The table also highlights the impact of vehicle safety features on accident outcomes. Cases where the airbag did not deploy are strongly associated with severe casualties, as evidenced by a phi coefficient of −0.149, making it one of the most critical factors in the analysis. Other variables, such as technical defects and insufficient vehicle security, are less prevalent but still present some level of risk, particularly vehicle fires, which have a phi coefficient of 0.035.
Table A2. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by vehicle-related variables. n = 20,293 (3431 are severe casualties).
Table A2. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by vehicle-related variables. n = 20,293 (3431 are severe casualties).
VariableCasualties
n
Severe
Casualties
n
P (X ∩ SC)
%
Fisher’s
Exact Test
p
Phi
Coefficient
ϕ
MCV
n
Engine power (kW)0–24 kW1130.01%0.4110.0062
24–90 kW15,4122.39311.79%0.000−0.066975
90–11019284132.04%0.0000.039201
110+19474482.21%0.0000.053256
Kilometrage
(km)
0 to 15.000156240.12%0.662−0.00413
15.000 to 75.000605890.44%0.154−0.01051
75.000 to 100.000387700.34%0.541.00433
100.000 to 150.0006631040.51%0.428−0.00644
150.000 to 200.0009421760.87%0.1410.01056
Vehicle
colour
Beige1830.01%1.0000.0005
Blue31664782.36%0.003−0.021868
Brown193350.17%0.6370.00352
Bronze100.00%1.000−0.0031
Dark3060.03%0.6260.0036
Yellow129180.09%0.408−0.00637
Gold1830.01%1.0000.0005
Grey27024622.28%0.7840.002770
Green12192621.29%0.0000.031281
Bright820.01%0.6300.0042
Orange130240.12%0.6470.00341
Red22723811.88%0.857−0.001602
Black39816523.21%0.334−0.007958
Silver7161360.67%0.1270.011146
Purple4980.04%1.000−0.00111
White19073231.59%0.9770.000497
Others110.00%0.1690.0161
Vehicle
safety
Insufficient vehicle security1660.03%0.0400.0152
Insufficient load securing600.00%0.598−0.0081
Technical defects102150.07%0.682−0.0046
Vehicle fire18110.05%0.0000.0351
Airbag not deployed8.1388194.04%0.000−0.149975
Table A3 provides a detailed analysis of roadway-related variables and their impact on the severity of single-vehicle accidents that took place outside built-up areas in Austria between 2012 and 2019.
Table A3. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by roadway-related variables. n = 20.293 (3.431 are severe casualties).
Table A3. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by roadway-related variables. n = 20.293 (3.431 are severe casualties).
VariableCasualties
n
Severe
Casualties
n
P (X ∩ SC)
%
Fisher’s
Exact Test
p
Phi
Coefficient
ϕ
MCV
n
Speed limit
(km/h)
Driving ban22703801.87%0.833−0.002350
5110.00%0.1690.0161
10100.00%1.000−0.0031
20200.00%1.000−0.0041
30173330.16%0.4790.00513
404080.04%0.5330.0046
50505710.35%0.095−0.01256
60334550.27%0.877−0.00243
7014212181.07%0.108−0.011321
8012311920.95%0.225−0.009222
90300.00%1.000−0.0051
10012,292214810.58%0.0080.0192.232
1103540.02%0.502−0.00610
120200.00%1.000−0.0041
13019833211.58%0.377−0.006488
Road
type
Highway25934172.05%0.239−0.008488
Expressway595800.39%0.024−0.01682
Country road14,457241611.91%0.247−0.0082.232
Other roads22204632.28%0.0000.037248
Intersection439620.31%0.125−0.01162
Roundabout68160.08%0.1460.01011
Road
characteristics
Deceleration lane1020.01%0.6810.0021
Acceleration lane310.00%0.4260.0051
One-way144330.16%0.0540.01426
Construction site157210.10%0.286−0.00810
Cycle path400.00%1.000−0.0061
Crosswalk300.00%1.000−0.0061
Pedestrian and cycle path1020.01%0.6810.0023
Parking lane700.00%0.610−0.0081
Secondary lane510.00%1.0000.0011
Hard shoulder4590.04%0.5510.0047
Banquet123220.11%0.7290.00222
Straight road11,507209510.32%0.0000.0402.232
Tunnel89260.13%0.0040.0228
Gallery1580.04%0.0010.0261
Rest area2660.03%0.4290.0062
Traffic island81180.09%0.2330.0094
Underpass3270.03%0.4760.0053
Middle separation7771040.51%0.008−0.019137
Bridge157410.20%0.0030.0227
Curve8.39912646.23%0.000−0.0421.437
Narrow lane3080.04%0.1490.0103
Entry or exit57170.08%0.0190.0185
Tram or bus station820.01%0.6300.0041
Road
condition
Dry road10,441212610.48%0.0000.0952.232
Wet road57058724.30%0.000−0.271.225
Sand or grit on the road297480.24%0.809−0.00256
Wintry conditions37713701.82%0.000−0.090938
Other conditions (oil, soil)95170.08%0.7960.00216
TL *Traffic light in full operation2920.01%0.213−0.0104
* TL: Traffic lights.
One of the most significant findings is the relationship between speed limits and accident severity. Accidents occurring in areas with a 100 km/h speed limit show a higher probability of severe casualties, with a phi coefficient of 0.019 and a significant p -value of 0.008, indicating a moderate positive correlation. Similarly, roads with a 130 km/h speed limit also show a notable frequency of severe accidents, although the correlation is slightly weaker. The type of road is another critical factor, with accidents on country roads being particularly severe, as these roads account for the highest number of severe casualties, although the phi coefficient suggests only a weak correlation. Additionally, certain road characteristics, such as curves and straight roads, are strongly associated with severe accidents. Curves, in particular, have a significant negative phi coefficient (−0.042), indicating a strong correlation with accident severity. In contrast, straight roads, despite their higher overall accident frequency, show a positive phi coefficient (0.040), suggesting that while they are common sites for accidents, the severity is more strongly associated with other variables like speed or road conditions. The analysis also reveals that road conditions significantly impact accident severity, with dry roads being the most common setting for severe accidents, supported by a high phi coefficient (0.095). However, wet and wintry conditions also play a significant role, as indicated by negative phi coefficients, showing that these conditions are associated with less severe outcomes compared to dry conditions.
Table A4 provides an analysis of situation-related variables and their impact on the severity of single-vehicle accidents. The analysis highlights several critical situation-related factors that influence the severity of single-vehicle accidents. Time of day emerges as a significant variable, with accidents occurring between 12 a.m. and 6 a.m. showing a higher probability of severe casualties, indicated by a phi coefficient of 0.051 and a significant p -value of 0.000. This suggests that early morning hours are particularly dangerous, likely due to factors such as reduced visibility, fatigue, or lower traffic volumes leading to higher speeds. In contrast, the period from 12 p.m. to 6 p.m., although still significant, shows a negative correlation with accident severity, indicating fewer severe outcomes during daylight hours. The day of the week also plays a role, with accidents from Monday to Thursday slightly more likely to result in severe casualties compared to those occurring from Friday to Sunday. However, the correlation is weak, as reflected by the small phi coefficient (−0.025). Seasonal variation is evident, with summer showing a slightly higher likelihood of severe accidents, as suggested by a phi coefficient of 0.025. This could be attributed to increased travel and higher speeds during warmer weather. Winter, on the other hand, despite the challenging driving conditions, shows a negative correlation with severe outcomes, which may be due to more cautious driving during adverse weather conditions. Weather conditions have a notable impact, with clear or overcast weather being strongly associated with severe casualties, as indicated by a phi coefficient of 0.053. This finding may be counterintuitive, but it suggests that drivers might be less cautious during clear conditions, leading to higher speeds and more severe accidents. Snowy conditions, however, show a significant negative correlation with severe casualties, likely reflecting more careful driving in such conditions. Light conditions further influence accident severity, with darkness being associated with a higher likelihood of severe accidents, as shown by a phi coefficient of 0.044. This is consistent with the increased risks associated with driving at night, such as reduced visibility and driver fatigue.
Table A4. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by situation-related variables. n = 20,293 (3431 are severe casualties).
Table A4. Single-vehicle accidents with single occupation and personal injury that occurred outside built-up areas between 2012 and 2019 in Austria, broken down by situation-related variables. n = 20,293 (3431 are severe casualties).
VariableCasualties
n
Severe
Casualties
n
P (X ∩ SC)
%
Fisher’s
Exact Test
p
Phi
Coefficient
ϕ
MCV
n
Time12 a.m. to 6 a.m.33677133.51%0.0000.051245
6 a.m. to 12 p.m.62838894.38%0.000−0.049586
12 p.m. to 6 p.m.59159564.71%0.070−0.013578
6 p.m. to 12 a.m.47288734.30%0.0010.023368
WD *Mon to Thu11,13117888.81%0.000−0.025586
Fri to Sun916216438.10%0.0000.025430
SeasonSpring42797743.81%0.0210.016435
Summer48218964.42%0.0000.025578
Autumn48028854.36%0.0010.023394
Winter63918764.32%0.000−0.58586
Weather
condition
Clear or overcast weather15,541279713.78%0.0000.053586
Rain3.0134582.26%0.007−0.019110
Hail, freezing rain124170.08%0.398−0.00712
Snow19131750.86%0.000−0.067147
Fog6361020.50%0.588−0.00437
High wind377520.26%0.113−0.01117
Light
condition
Daylight11,54617908.82%0.000−0.043586
Dusk or dawn16042661.31%0.753−0.003111
Darkness6.82813116.46%0.0000.044368
Artificial light571930.46%0.730−0.00315
Limited visibility700.00%0.610−0.0081
Glare from the sun109240.12%0.1560.0108
* WD: Weekday.

References

  1. Herry Consult; KFV. Unfallkostenrechnung Straße 2022 (UKR 2022). Forschungsarbeiten des österreichischen Verkehrssicherheitsfonds; Bundesministerium für Klimaschutz, Umwelt, Energie, Mobilität, Innovation und Technologie (BMK): Vienna, Austria, 2022. [Google Scholar]
  2. European Commission: Directorate-General for Mobility and Transport; CE Delft. Handbook on the External Costs of Transport; Version 2019–1.1; Publications Office of the European Union: Luxembourg, 2020. [Google Scholar]
  3. European Commission. Archive: Road Safety Statistics—Characteristics at National and Regional Level. 18 December 2019. Available online: https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Road_safety_statistics__characteristics_at_national_and_regional_level&oldid=463733 (accessed on 8 August 2021).
  4. Bundesministerium für Inneres (BMI). Straßenverkehrstote in Österreich. 2020. Available online: https://www.bmi.gv.at/202/Verkehrsangelegenheiten/unfallstatistik_vorjahr.aspx (accessed on 7 July 2021).
  5. Fian, T. From Blackspots to Blackpatterns: Pattern Recognition with Road Traffic Accident Data. Ph.D. Thesis, Vienna University of Technology, Vienna, Austria, 2021. [Google Scholar] [CrossRef]
  6. Gutierrez-Osorio, C.; Pedraza, C.A. Modern data sources and techniques for analysis and forecast of road accidents: A review. J. Traffic Transp. Eng. 2020, 7, 432–446. [Google Scholar] [CrossRef]
  7. Kizito, A.; Semwanga, A.R. Modeling the complexity of road accidents prevention: A system dynamics approach. Int. J. Syst. Dyn. Appl. 2020, 9, 24–41. [Google Scholar] [CrossRef]
  8. Martins, M.A.; Garcez, T.V. A multidimensional and multi-period analysis of safety on roads. Accid. Anal. Prev. 2021, 162, 106401. [Google Scholar] [CrossRef] [PubMed]
  9. Khalsa, A. Study on road traffic accidents and prevention in India: A review. Int. J. Res. Appl. Sci. Eng. Technol. 2019, 7, 1683–1685. [Google Scholar] [CrossRef]
  10. Karamanlis, I.; Nikiforiadis, A.; Botzoris, G.N.; Kokkalis, A.; Basbas, S. Towards sustainable transportation: The role of black spot analysis in improving road safety. Sustainability 2023, 15, 14478. [Google Scholar] [CrossRef]
  11. Reeves, K.; Chandan, J.S.; Bandyopadhyay, S. Using statistical modelling to analyze risk factors for severe and fatal road traffic accidents. Int. J. Inj. Control. Saf. Promot. 2019, 26, 364–371. [Google Scholar] [CrossRef]
  12. Gu, C.; Xu, J.; Gao, C.; Mu, M.; E, G.; Ma, Y. Multivariate analysis of roadway multi-fatality crashes using association rules mining and rules graph structures: A case study in China. PLoS ONE 2022, 17, e0276817. [Google Scholar] [CrossRef]
  13. Pande, A.; Abdel-Aty, M.A. A novel approach for analyzing severe crash patterns on multilane highways. Accid. Anal. Prev. 2009, 41, 985–994. [Google Scholar] [CrossRef]
  14. Alin, D.; Cofarue, C.; Popesku, M.V. Investigation of single vehicle accidents severity by using a probabilistic approach. Mobil. Veh. Mech. 2023, 49, 39–53. [Google Scholar] [CrossRef]
  15. Cioca, L.; Ivașcu, L. Risk indicators and road accident analysis for the period 2012–2016. Sustainability 2017, 9, 1530. [Google Scholar] [CrossRef]
  16. Shibani, A.; Pervin, M.S.A. Analysis of traffic accident severity on Great Britain roadways and junctions. Int. J. Built Environ. Asset Manag. 2016, 2, 37–66. [Google Scholar] [CrossRef]
  17. Ju, Y.H.; Sohn, S.Y. Quantification method analysis of the relationship between occupant injury and environmental factors in traffic accidents. Accid. Anal. Prev. 2011, 43, 342–351. [Google Scholar] [CrossRef]
  18. Yaman, T.T.; Bilgiç, E.; Esen, M.F. Analysis of traffic accidents to identify factors affecting injury severity with fuzzy and crisp techniques. In Proceedings of the International Conference on Intelligent and Fuzzy Systems, Istanbul, Türkiye, 21–23 July 2020. [Google Scholar] [CrossRef]
  19. Yaman, T.T.; Bilgiç, E.; Esen, M.F. Analysis of traffic accidents with fuzzy and crisp data mining techniques to identify factors affecting injury severity. J. Intell. Fuzzy Syst. 2021, 42, 575–592. [Google Scholar] [CrossRef]
  20. Shaon, M.R.; Qin, X. Crash data-based investigation into how injury severity is affected by driver errors. Transp. Res. Rec. 2020, 2674, 452–464. [Google Scholar] [CrossRef]
  21. Gilani, V.; Hosseinian, S.M.; Ghasedi, M.; Nikookar, M. Data-driven urban traffic accident analysis and prediction using logit and machine learning-based pattern recognition models. Math. Probl. Eng. 2021, 2021, 9974219. [Google Scholar] [CrossRef]
  22. Sohn, S.Y.; Shin, H. Pattern recognition for road traffic accident severity in Korea. Ergonomics 2001, 44, 107–117. [Google Scholar] [CrossRef]
  23. Khyara, H.; Amine, A.; Nassih, B. Dependent and independent variables for exploratory analysis of road traffic accidents. In Proceedings of the International Conference on Industrial Engineering and Operations Management, Manila, Philippines, 6–9 May 2023. [Google Scholar] [CrossRef]
  24. Athiappan, K.; Karthik, C.; Rajalaskshmi, M.; Subrata, C.; Rabiei-Dastjerdi, H.; Liu, Y.; Fernández-Campusano, C.; Gheisari, M. Identifying influencing factors of road accidents in emerging road accident blackspots. Adv. Civ. Eng. 2022, 2022, 9474323. [Google Scholar] [CrossRef]
  25. Theofilatos, A.; Yannis, G. A review of the effect of traffic and weather characteristics on road safety. Accid. Anal. Prev. 2014, 72, 244–256. [Google Scholar] [CrossRef]
  26. Aziz, S.; Ram, S. A meta-analysis of the methodologies practiced worldwide for the identification of Road accident black spots. Transp. Res. Procedia 2022, 62, 790–797. [Google Scholar] [CrossRef]
  27. Khorasani-Zavareh, D. System versus traditional approach in road traffic injury prevention: A call for action. J. Inj. Violence Res. 2011, 3, 61. [Google Scholar] [CrossRef][Green Version]
  28. Rezapur-Shahkolai, F.; Afshari, M.; Doosti-Irani, A.; Bashirian, S.; Maleki, S. Interventions to prevent road traffic injuries among pedestrians: A systematic review. Int. J. Inj. Control. Saf. Promot. 2022, 29, 533–549. [Google Scholar] [CrossRef] [PubMed]
  29. Staton, C.A.; Vissoci, J.R.; Gong, E.; Toomey, N.; Wafula, R.B.; Abdelgadir, J.; Zhou, Y.; Liu, C.; Pei, F.; Zick, B.; et al. Road traffic injury prevention initiatives: A systematic review and metasummary of effectiveness in low and middle income countries. PLoS ONE 2016, 11, e0144971. [Google Scholar] [CrossRef]
  30. Gopalakrishnan, S. A public health perspective of road traffic accidents. J. Fam. Med. Prim. Care 2012, 1, 144–150. [Google Scholar] [CrossRef] [PubMed]
  31. KFV; FGM. Austrian Road Safety Strategy 2021–2030; Bundesministerium für Klimaschutz: Vienna, Austria, 2021. [Google Scholar]
  32. Cerwenka, P.; Hauger, G.; Hörl, B.; Klamer, M. Handbook of Transportation System Planning; Austrian Art and Culture Publishers: Vienna, Austria, 2007. [Google Scholar]
  33. European Transport Safety Council. RoadPol Operation Reveals Persistent Seatbelt Violations across Europe. 18 April 2024. Available online: https://etsc.eu/roadpol-operation-reveals-persistent-seatbelt-violations-across-europe/ (accessed on 29 September 2024).
  34. World Health Organization. Global Status Report on Road Safety 2018; World Health Organization: Geneva, Switzerland, 2018. [Google Scholar]
  35. Kargar, S.; Ansari-Moghaddam, A.; Ansarim, H. The prevalence of seat belt use among drivers and passengers: A systematic review and meta-analysis. J. Egypt Public Health Assoc. 2023, 98, 14. [Google Scholar] [CrossRef]
  36. Clarke, D.D.; Ward, P.; Bartle, C. Young driver accidents in the UK: The influence of age, experience, and time of day. Accid. Anal. Prev. 2006, 38, 871–878. [Google Scholar] [CrossRef]
  37. Williams, A.F. Teenage drivers: Patterns of risk. J. Saf. Res. 2003, 34, 5–15. [Google Scholar] [CrossRef]
  38. Dumbaugh, E.; Li, W. Designing for the safety of pedestrians, cyclists, and motorists in urban environments. J. Am. Plan. Assoc. 2010, 76, 283–298. [Google Scholar] [CrossRef]
  39. Abdel-Aty, M.; Radwan, A.E. Modeling traffic accident occurrence and involvement. Accid. Anal. Prev. 2000, 32, 633–642. [Google Scholar] [CrossRef]
  40. Hauer, E. Observational Before-After Studies in Road Safety: Estimating the Effect of Highway and Traffic Engineering Measures on Road Safety; Pergamon: Oxford, UK, 1997. [Google Scholar]
  41. Elvik, R. The Handbook of Road Safety Measures; Emerald Group Publishing Limited: Bentley, UK, 2009. [Google Scholar]
  42. Liu, X.; Abdel-Aty, M. Real-time crash risk prediction on arterials based on LSTM neural network model. Accid. Anal. Prev. 2019, 124, 27–37. [Google Scholar] [CrossRef]
  43. Meng, Z.; Zhao, S.; Chen, H.; Hu, M.; Tang, Y.; Song, Y. The vehicle testing based on digital twins theory for autonomous vehicles. IEEE J. Radio Freq. Identif. 2022, 6, 710–714. [Google Scholar] [CrossRef]
  44. Chen, X.; Wei, C.; Yang, Y.; Luo, L.; Biancardo, S.A.; Mei, X. Personnel trajectory extraction from port-like videos under varied rainy interferences. IEEE Trans. Intell. Transp. Syst. 2024, 25, 6567–6579. [Google Scholar] [CrossRef]
Figure 1. Methodological flowchart.
Figure 1. Methodological flowchart.
Applsci 14 08902 g001
Figure 2. Development of road traffic accidents (RTAs) in Austria from 2012–2019. Own compilation based on RTA data from Statistics Austria.
Figure 2. Development of road traffic accidents (RTAs) in Austria from 2012–2019. Own compilation based on RTA data from Statistics Austria.
Applsci 14 08902 g002
Figure 3. Phi coefficient of driver-related variables.
Figure 3. Phi coefficient of driver-related variables.
Applsci 14 08902 g003
Figure 4. Phi coefficient of vehicle-related variables.
Figure 4. Phi coefficient of vehicle-related variables.
Applsci 14 08902 g004
Figure 5. Phi coefficient of roadway-related variables.
Figure 5. Phi coefficient of roadway-related variables.
Applsci 14 08902 g005
Figure 6. Phi coefficient of situation-related variables.
Figure 6. Phi coefficient of situation-related variables.
Applsci 14 08902 g006
Table 1. Categorisation scheme for accident-related variables.
Table 1. Categorisation scheme for accident-related variables.
DriverVehicleRoadwaySituation
  • Sex
  • Age Class
  • Driving licence
  • Impairment
  • Driving manoeuvres
  • Safety settings
  • Engine power
  • Kilometrage
  • Vehicle colour
  • Vehicle safety settings
  • Speed limits
  • Road characteristics
  • Traffic light
  • Road types
  • Road surface conditions
  • Daytime
  • Weekday
  • Meteorological seasons
  • Weather conditions
  • Light conditions
Table 2. Features of the blackpattern impact score (BIS).
Table 2. Features of the blackpattern impact score (BIS).
BIS FeaturesDescription
High FrequencyBlackpatterns that occur frequently in the dataset are prioritized.
High ImpactBlackpatterns with variables that have a strong influence on severe casualties are emphasized.
Strong AssociationBlackpatterns that are statistically significant in their association with severe casualties are given higher priority.
Table 3. Logistic regression analysis results.
Table 3. Logistic regression analysis results.
VariableRegression Coefficient βStandard Error SEMpexp(β)
No safety belt applied1.6120.0620.0005.015
Gallery1.5220.5890.0104.583
Vehicle fire1.3940.5410.0104.029
Hitting an obstacle on the road1.2220.4260.0043.394
Age class 16 to 180.8400.1040.0002.317
Airbag not deployed0.8030.0460.0002.233
Bridge0.7730.1970.0002.166
Age class 19 to 240.7430.0570.0002.101
Sudden braking0.6930.3240.0322.000
Alcohol0.6500.0620.0001.916
Hit and run0.5520.1610.0011.737
Tunnel0.5150.2580.0461.674
One-way0.5070.2190.0201.660
Age class 25 to 340.4920.0570.0001.635
Male driver0.4910.0450.0001.634
Intersection0.4500.1480.0021.569
Other road variables0.3970.0820.0001.487
Wintry conditions0.3800.0700.0001.462
Hitting a tree0.3650.0750.0001.441
Age class 35 to 440.3080.0650.0001.361
0 a.m. to 6 a.m.0.3070.0580.0001.359
Vehicle colour: green0.2750.0780.0001.317
County road0.2470.0620.0001.280
Dry road0.2320.0470.0001.261
Curve0.1800.0430.0001.198
Engine power 24–90 kW0.1750.0460.0001.192
Probationary driving licence0.1660.0780.0331.181
Darkness0.1650.0490.0011.180
Drifting left0.1470.0410.0001.158
Speed limit 100 km/h0.1140.0460.0131.120
Hitting a guardrail−0.3130.0910.0010.731
Speed limit 50 km/h−0.3290.1440.0220.719
Constant−9.2850.6110.000
Table 4. Blackpatterns showing a significant relationship with the target variable severe casualties, and a positive phi coefficient; n = 20,293 single-vehicle accidents with single occupation and personal injury occurring outside the built-up area on the Austrian road network (3431 are severe casualties).
Table 4. Blackpatterns showing a significant relationship with the target variable severe casualties, and a positive phi coefficient; n = 20,293 single-vehicle accidents with single occupation and personal injury occurring outside the built-up area on the Austrian road network (3431 are severe casualties).
BP IDBP VariablesFisher’s
Exact Test p
Phi
Coefficient ϕ
Frequency
n
BP1speed limit 130 km/h, highway, right drift, male driver0.0010.02744
BP2speed limit 100 km/h, country road, left drift, male driver0.0000.03241
BP3speed limit 100 km/h, country road, curve, left drift, male driver0.0110.02030
BP4country road, right drift, female driver0.0420.01528
BP5speed limit 100 km/h, country road, left drift, male driver, fatigue0.0010.02820
BP6speed limit 130 km/h, highway, drifting right, male driver, fatigue0.0400.01516
BP7speed limit 100 km/h, country road, wet road, age 25–34, right drift, male driver0.0010.02712
BP8speed limit 100 km/h, country road, left drift, male driver, no safety belt applied0.0000.03110
BP9speed limit 100 km/h, country road, darkness, right drift, male driver0.0030.02610
B10speed limit 80 km/h, country road, right drift, male driver0.0160.02010
Table 5. Blackpattern impact analysis results.
Table 5. Blackpattern impact analysis results.
BP IDBP
Frequency
n
BP Fisher’s Exact Test
p
BP Phi
Coefficient
ϕ
BP Variables and Their Regression Coefficients βBIS
BP1440.0010.027Speed limit 130 km/hHighwayRight driftMale driver 628.4
0000.491
BP2410.0010.032Speed limit 100 km/hCountry roadLeft driftMale driver 804.7
0.1140.2470.1470.491
BP3300.0110.020Speed limit 100 km/hCountry roadcurveLeft driftMale driver 194.9
0.1140.2470.1800.1470.491
BP4280.0420.015Country roadRight driftFemale driver 50.1
0.24700
BP5200.0010.028Speed limit 100 km/hCountry roadLeft driftMale driverFatigue 167.6
0.1140.2470.1470.4910
BP6160.0400.015Speed limit 130 km/hHighwayRight driftMale driverFatigue 37.1
0000.4910
BP7120.0010.027Speed limit 100 km/hCountry roadWet roadAge 25–34Right driftMale driver141.8
0.1140.24700.49200.491
BP8100.0000.031Speed limit 100 km/hCountry roadLeft driftMale driverNo safety belt 982.9
0.1140.2470.1470.4911.612
BP9100.0030.026Speed limit 100 km/hCountry roadDarknessRight driftMale driver 71.6
0.1140.2470.16500.491
BP10100.0160.020Speed limit 80 km/hCountry roadRight driftMale driver 38.3
00.24700.491
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fian, T.; Hauger, G. Identifying High-Risk Patterns in Single-Vehicle, Single-Occupant Road Traffic Accidents: A Novel Pattern Recognition Approach. Appl. Sci. 2024, 14, 8902. https://doi.org/10.3390/app14198902

AMA Style

Fian T, Hauger G. Identifying High-Risk Patterns in Single-Vehicle, Single-Occupant Road Traffic Accidents: A Novel Pattern Recognition Approach. Applied Sciences. 2024; 14(19):8902. https://doi.org/10.3390/app14198902

Chicago/Turabian Style

Fian, Tabea, and Georg Hauger. 2024. "Identifying High-Risk Patterns in Single-Vehicle, Single-Occupant Road Traffic Accidents: A Novel Pattern Recognition Approach" Applied Sciences 14, no. 19: 8902. https://doi.org/10.3390/app14198902

APA Style

Fian, T., & Hauger, G. (2024). Identifying High-Risk Patterns in Single-Vehicle, Single-Occupant Road Traffic Accidents: A Novel Pattern Recognition Approach. Applied Sciences, 14(19), 8902. https://doi.org/10.3390/app14198902

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop