Next Article in Journal
An Integrated Procedure for Ex-Ante Evaluations of Refurbishment Costs in Healthcare Facilities
Previous Article in Journal
Sustainable Development in the Economic, Environmental, and Social Fields of Ecuadorian Universities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling Collision Probability on Freeway: Accounting for Different Types and Severities in Various LOS

1
Jiangsu Key Laboratory of Urban ITS, Jiangsu Province Collaborative Innovation Center of Modern Urban Traffic Technologies, Southeast University, Nanjing 210096, China
2
Department of Civil Engineering, The University of British Columbia, Vancouver, BC V6T 1Z4, Canada
3
School of Automobile and Traffic Engineering, Hefei University of Technology, Hefei 230009, China
4
Civil Aviation College, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
*
Author to whom correspondence should be addressed.
Sustainability 2020, 12(18), 7386; https://doi.org/10.3390/su12187386
Submission received: 20 July 2020 / Revised: 26 August 2020 / Accepted: 3 September 2020 / Published: 9 September 2020
(This article belongs to the Section Sustainable Transportation)

Abstract

:
In this study, collision-related data were collected on the I-880 freeway of California in the United States from 2006 to 2011. Our objective was to study the collision probability of different collision types and severities in different traffic states. The traffic states were divided by the traditional level of service (LOS) method. Various Bayesian conditional logit models have been established to analyze the relationship between the collision probability of different collision patterns and LOSs. The results showed that LOS A had the best safety performance associated with all of the collision types and severities, LOS C had the worst safety performance associated with hit object collisions, LOS D had the worst safety performance associated with sideswipe collisions and rear end collisions, and LOS F had the worst safety performance associated with injury collisions. The five-stage Bayesian random parameter sequential logit model was established to quantify the effects of different variables on the collision probability of various collision types and severities. In addition to LOS, the visibility, road surface, weather, ramp, and number of lanes had significant effects on different collision types and severities.

1. Introduction

With the widespread use of freeway traffic surveillance systems, researchers have started using high-resolution dynamic traffic flow data to identify traffic conditions before collision occurrences. Numerous studies have developed real-time collision probability models for estimating the relative probability of collisions, given dynamic traffic flow data [1,2,3,4,5]. These studies have generally used a case-controlled study design structure, in which the traffic conditions before collisions were considered collision cases, while those under collision-free conditions were considered control cases. With the case-controlled dataset, researchers have developed real-time collision probability models to analyze the relationship between the probability of a collision and the traffic-related variables, including geometric design factors, environment factors, traffic flow factors, crash characteristic factors, driver behavior factors, and control strategy factors on a freeway.
Numerous researchers have studied the spatio-temporal evolution of traffic flows by dividing the traffic flows into different states. However, relatively few studies have investigated the collision types under different traffic conditions. Thus, it is necessary to explore the collision mechanism of different types and severities in various traffic flow states. In previous studies, it has been proven that there is a significant difference in safety performance for different collision types and severities in various traffic flow states [6,7,8].
In this study, the traffic flow is separated into six states by level of service (LOS). The main purpose is to identify the relationship between the LOS and different collision types and severities, and explore how contributing factors affect collision risks for different types and severities. The collision-related data were collected on the I-880 freeway of California in the United States from 2006 to 2011. The Bayesian conditional logit models have been established to analyze the statistical relationship between the collision probability of different collision patterns and LOSs. The five-stage Bayesian random parameter sequential logit model was established to quantify the effects of various variables on the collision probability of different types and severities. This research can help traffic management personnel better understand which LOS is more dangerous for different collision types and severities and realize the contributing factors of different collision types and severities in different LOSs. The results can be applied to reduce the collision probability of different types and severities in different LOSs.

2. Literature Review

Although most studies have explored the collision mechanism without considering the traffic flow states [1,2,3,4,5], some researchers analyzed the collision types and severities in different traffic flow states. In early studies, Golob et al. separated the traffic flow into various states. The researchers indicated that the traffic flow states with high densities could increase the probability of property damage only and multi-vehicle collisions, while the traffic flow states with low densities could increase the probability of single-vehicle and injury collisions [6]. Subsequently, Golob et al. separated the traffic flow into eight states and analyzed the relationship between collision types and traffic states. The results indicated that there is a significant difference in collision characteristics associated with various traffic states. However, there is insufficient qualitative analysis of the contributing factors in various traffic states [7].
Recently, Li et al. found that the speed-related variables can significantly affect the collision probability of different traffic flow states on a freeway. According to the speeds upstream and downstream of a crash, the traffic flow is separated into four states: back of queue, congested traffic, front of queue, and free flow. The results showed that the variation of speeds could increase the probability of a collision in free flow conditions, while the coefficient of speed variation could increase the probability of a collision in back of queue and congested traffic [8]. Subsequently, Li et al. found that there was a significant relationship between rear end collisions and the magnitude of lengthwise traffic variations, while sideswipe collisions were significantly related to the traffic variation between adjacent lanes on a freeway [9]. Wang et al. analyzed the short-term variation and spatial–temporal characteristics of traffic flow by sideswipe collisions. The results implied that the occurance of sideswipe collisions was significantly related to occupancy, average flow, and speed variance [10]. Kwak et al. defined the traffic flow states by uncongested and congested conditions. The results indicated that there was a significant difference of collision probability by different traffic states and road types [11]. Xu et al. applied four traffic states defined by four-phase traffic theory. The preliminary analysis showed that collision probability, as well as collision severities and types, were significantly affected by traffic flow states. Nonlinear canonical correlation analysis was applied to analyze the collision mechanism. The results showed that the contributing factors leading to the occurance of collisions were significantly different for varying traffic flow states [12]. Xu et al. developed collision probability models to explore the relationship between the probability of collisions and various traffic states separated by the three-phase traffic theory. The study implied that some transitional states were more dangerous than free flow, such as the transitional state from synchronized flow to free flow and the transitional state from wide moving jams to synchronized flow [13].
Numerous studies have focused on how traffic flow operates in different traffic states. The evolution of traffic dynamics on freeways is complex, and the formation of various traffic flow states is influenced by a set of factors. Therefore, traffic flow is classified into different states, typically based on traffic flow characteristics such as speed, flow rate, and density. Hall et al. separated the traffic flow into three states [14]. Wu separated the traffic flow into four states [15]. Kerner divided the traffic flow into three phases [16]. In this study, LOS is applied to separate the traffic states, which is one of the common methods for identifying traffic states. In LOS theory, traffic flow is separated into six states, according to density [17].

3. Data Sources

Crash data, environment data, geometric design data, and traffic flow data were collected from the I-880 freeway in California, United States between 2006 and 2011. The freeway is 34 miles in length and located between the cities of Oakland and San Jose. There are 119 loop detector stations and three weather stations along the freeway.
A total of 9919 collisions were reported and used for further data analysis. For every collision, to avoid the uncertainty of occurrence time, the collision-related data were collected from 5 min to 10 min before the occurrence time of the reported collision. In previous studies, this method has been proven to be effective [4,13]. Previous studies suggested that the statistical power is negligible by using a control-to-case ratio beyond 4:1 [13]. Thus, a control-to-case ratio of 4:1 was used in this study. For each collision case, the authors randomly selected four paired observations of the non-collision traffic data on the basis of three matching factors, including the time, the location, and the weather [13]. For example, collision No. 67 occurred at post-mile 3.95 at 15:00 on 9 November 2009. Traffic data taken at the nearest detector station from 2:50 p.m. to 2:55 p.m. on 9 November 2009 were included in the collision cases as an observation. Then, the paired collision-free traffic data taken at the same loop detector station during the same period on four randomly selected collision-free days in the same weather conditions were used as four observations in the non-collision cases. In this study, the severity of collision was divided into injury or fatal collisions and property damage only (PDO) collisions.
As shown in Table 1, the boundary values of density at different LOSs are presented. In this study, according to the LOS on the freeway, the traffic flow states were divided into six states. In addition, the statistical results in Table 2 show that the number of different collision types and severities in various LOSs are quite different.

4. Methods

In this study, the Bayesian conditional logit model was built to analyze the relative safety performance of different collision types and severities without considering other traffic-related factors in different LOSs. A five-stage Bayesian random parameter sequential logit model was applied to quantify the effects of various variables on the collision probability of different types and severities.

4.1. Bayesian Conditional Logit Model

In previous studies, the conditional logit model has already been used to analyze the safety performance of different traffic states [18,19]. The calculation method has been written as follows:
y i j k   Bernoulli ( p i j k )
P ( y i t k ) = 1 1 + exp α i + k = 1 K β k x i j k
where xijk is the kth unmatched factor for the jth sample or control in the ith matched sample. Therefore, X = {xijk} consists of all samples, and all matched samples are controlled. The value of i is from 1 to I. The value of j is from 1 to J. The value of k is from 1 to K. I denotes the number of matched samples; J represents the number of controls in every matched sample; and K represents the number of contributing factors. αi is the effect of matching factors on the probability of collision occurance for each matched sample; β k represents the estimated value of contributing factors; and xk is the unmatched contributing factors.
To account for the selection bias introduced by the matched case–control design, a conditional likelihood needed to be developed. The conditional probability that the first vector of the explanatory variables xi0 in the ith matched set corresponds to the case, conditional on xi0, xi1, …, xiJ being the vectors of explanatory variables in the ith matched set, is given as
P i c = exp k = 1 K β k x i 0 k exp k = 1 K β k x i 0 k + j = 1 J exp k = 1 K β k x i j k
Thus, the likelihood function of the conditional logit can be written as [15]
f Y β = i = 1 I f y i 0 = 1 β = i = 1 I P i c = exp i = 1 I k = 1 K β k x i 0 k i = 1 I log j = 0 J exp k = 1 K β k x i j k
The Bayesian inference method has been applied for this model using Markov Chain Monte Carlo (MCMC) methods, because there is a significant advantage of this method in that all parameters in the model have a prior distribution. The posterior distribution of parameters has been expressed as
f β Y = f Y , β f Y = f Y β π β f Y , β d β f Y β π β
where f (β |Y) is a posterior joint probability distribution (JPD) associate with parameter β, based on data set Y; f(Y, β) is a JPD associate with parameter β and data set Y; f(Y|β) denotes the probability conditional associated with parameter β; and π(β) is a prior distribution associated with parameter β. The non-informative prior distribution in this method has been written as
β   ~ Normal ( 0 K , 10 6 I K )
where 0K represents a K × 1 vector of zeros and IK represents a K × K matrix. Finally, the posterior JPD f(β |Y) has been written as
f β Y f Y β π β = i = 1 I f y i 0 = 1 β × k = 1 K N β k μ k , k exp i = 1 I k = 1 K β k x i 0 k i = 1 N log j = 0 J exp k = 1 K β k x i j k 1 2 k = 1 K β k 2 10 6

4.2. Bayesian Random Parameter Sequential Logit Model

In previous studies, the ordered logit model was one of the popular methods used to analyze collision severities. However, there are some limitations of this method for analyzing collision severities as follows:
  • There is a hypothesis of this method that the parameter estimates of different collision types and severities are the same [20,21]. However, compared to the ordered logit model, the sequential logit model can explain the difference of various contributing factors across different collision types and severities [20,21].
  • In addition, the sequential logit model explains the correlation of collision probability between different collision types and severities [22,23]. The expressions of collision probability by different collision types and severities have been calculated by Equations (8) through (13), respectively.
  • Moreover, collisions were affected by various traffic-related factors [24,25,26]. Thus, there is an unobserved heterogeneity in the sequential logit model [27,28,29]. The contributing factors in this study can not explain all of the variance in collision types and severities. The unobserved heterogeneity in models can result in inconsistent and biased estimation [30,31,32]. To overcome the limitation of unobserved heterogeneity in the sequential logit model, random parameters were applied in this study.
Therefore, the five-stage Bayesian random parameters sequential logit model was applied to calculate the collision probability of different severities and types. As shown in Figure 1, four Bayesian random parameters binary logit models were built from Stage 1 to Stage 2. Subsequently, three Bayesian random parameters binary logit models were built at the Stage 5. These Bayesian random parameters binary logit models formed the whole five-stage Bayesian random parameters sequential logit model.
Specifically, the Stage 1 model calculated the collision probability (PCollision) without considering different collision types and severities. The Stage 2 model calculated the probability of a hit object collision (PCollision×PHO) without considering collision severities. The Stage 3 model calculated the probability of a sideswipe collision (PCollision×(1-PHO) ×PSW) without considering collision severities. The Stage 4 model calculated the probability of a rear end collision (PCollision×(1-PHO) ×(1-PSW) ×PRE) without considering collision severities. Finally, three collision severity models were established at Stage 5. The injury probability of a hit object collision is PHO_I, the injury probability of a sideswipe collision is PSW_I, and the injury probability of a rear end collision is PRE_I. Finally, the absolute probability of collision by different types and severities have been given as follows:
P(Hit object collision with injury) = P(Collision)
× P(Hit object collision|Collision) × P(injury collision|Hit object collision) = PCollision × PHO × PHO_I
P(Hit object collision without injury) = P(Collision) × P(Hit object collision|Collision)
× P(PDO collision丨Hit object collision) = PCollision × PHO × (1 − PHO_I)
P(Sideswipe collision with injury) = P(Collision) × P(Non-Hit object collision
丨Collision) × P(Sideswipe collision丨Non-Hit object collision) × P(injury collision
丨Sideswipe collision) = PCollision×(1 − PHO) × PSW × PSW_I
P(Sideswipe collision without injury) = P(Collision) × P(Non-Hit object
collision|Collision) × P(Sideswipe collision|Non-Hit object collision) × P(PDO
collision|Sideswipe collision) = PCollision × (1 − PHO) × PSW×(1 − PSW_I)
P(Rear end collision with injury) = P(Collision) × P(Non-Hit object collision|Collision)
× P(Non-Sideswipe collision|Non-Hit object collision) × P(Rear end collision|Non-
Sideswipe collision) × P(injury collision|Rear end collision) = PCollision × (1−PHO) × (1−PSW)
× PRE × PRE_I
P(Rear end collision without injury) = P(Collision) × P(Non-Hit object
collision|Collision) × P(Non-Sideswipe collision|Non-Hit object collision) × P(Rear end
collision|Non-Sideswipe collision) × P(PDO collision|Rear end collision) = PCollision ×
(1−PHO) × (1−PSW) × PRE × (1−PRE_I)
The specification of the basic Bayesian random parameters binary logit model has already been introduced in previous studies [33,34].

5. Results and Discussion

To analyze the relative safety performance of various collision types and severities between different LOSs, the Bayesian conditional logit model was used in Section 5.1. In the Bayesian conditional logit models, only LOSs were regarded as candidate variables without considering other variables.
To quantify the effects of various variables on the collision probability of different collision types and severities in various LOSs, the five-stage Bayesian random parameter sequential logit model was applied in Section 5.2. In addition to LOS variables, five other candidate variables were also considered in the five-stage Bayesian random parameter sequential logit model.

5.1. Safety Performance of LOS by Different Collision Types and Severities

According to the LOS on the freeway, the Bayesian conditional logit model was used to analyze the relative safety performance of different collision types and severities without considering other traffic-related factors in different LOSs. There are five indicator variables in this model, including LOS B, LOS C, LOS D, LOS E, and LOS F. LOS A was considered as the reference level. Therefore, the purpose of this section is to explore the relative safety performance between LOS A and other LOSs. Other traffic flow variables were not included in this model, because LOSs were highly correlated with the traffic flow variables [34].
The process of MCMC chains for this model was composed of a total of 10,000 iterations, 4000 burn-in iterations, and three parallel MCMC chains for Bayesian inference [30,31]. The results of the Bayesian conditional logit models are shown in Table 3. The results show that LOSs significantly affect the collision probability for different types and severities. The 95% credible interval for each parameter in Table 3 indicates that the LOSs significantly affect the collision probability of different types and severities. The odds ratio for each variable was used to quantify the safety performance of each LOS.
Specifically, as shown in Table 3 for hit object collisions, the results suggest that the odds ratios of LOS B and LOS C were significantly greater than LOS A, and the odds ratios of LOS D, LOS E, and LOS F were not significantly greater than LOS A. Accordingly, LOS A was the safest traffic state according to the lowest hit object collision probability. However, LOS C had the highest hit object collision likelihood and was 3.319 times higher than LOS A. The hit object collision probability of LOS B was 1.878 times higher than LOS A, but lower than LOS C. In previous studies, the results indicated that a hit object collision was more likely to occur in traffic flow states with low density [6,7]. In this study, LOS C had a higher density than LOS A and LOS B. Thus, LOS C was the most dangerous for hit object collisions in all LOSs. The analysis of LOS C can also be applied to the results of LOS B.
The results of sideswipe collisions are shown in Table 3. Three odds ratios were significantly greater than LOS A, including LOS B, LOS C, and LOS D. Two odds ratios were not significantly greater than LOS A, including LOS E and LOS F. The highest sideswipe collision probability was for LOS D, followed by LOS C and LOS B. LOS A had the lowest sideswipe collision probability. Specifically, the sideswipe collision probability of LOS B was 2.130 times higher than LOS A, the sideswipe collision probability of LOS C was 3.349 times higher than LOS A, and the sideswipe collision probability of LOS D was 6.279 times higher than LOS A. LOS D had the highest density, followed by LOS C, LOS B, and LOS A. There were more and more lane-changing behaviors in traffic flow with the density increasing. More lane-changing behaviors can increase the risk of a sideswipe collision [16]. Thus, the highest sideswipe collision probability was for LOS D, followed by LOS C, LOS B, and LOS A.
In the Bayesian conditional logit model for rear end collisions, the results showed that there were some significant differences in different LOSs. All other LOSs were more dangerous than LOS A for rear end collisions. The highest rear end collision probability was for LOS D, followed by LOS C, LOS F, LOS E, and LOS B. Specifically, the rear end collision probability of LOS B was 2.995 times higher than LOS A, the rear end collision probability of LOS C was 5.032 times higher than LOS A, the rear end collision probability of LOS D was 6.078 times higher than LOS A, the rear end collision probability of LOS E was 4.053 times higher than LOS A, and the rear end collision probability of LOS F was 4.795 times higher than LOS A. The reason why LOS D had the highest rear end collision probability is similar to the sideswipe collisions above. In addition, although LOS E and LOS F had higher densities than LOS D, there was less space in LOS E and LOS F for vehicles to change lanes. Thus, LOS D had the highest rear end collision probability. Although LOS C was still in free flow, some transitional states from free flow to congested flow started to emerge with sudden reductions in speed. This is the reason why LOS C had the second-highest rear end collision probability.
As shown in Table 3, the estimation results of the Bayesian conditional logit model for injury collisions show that the LOS significantly affected the probability of injury collision occurrences. LOS F had the highest injury collision probability, followed by LOS D, LOS C, LOS B, and LOS E. Specifically, the injury collision probability of LOS B was 2.838 times higher than LOS A, the injury collision probability of LOS C was 3.923 times higher than LOS A, the injury collision probability of LOS D was 4.468 times higher than LOS A, the injury collision probability of LOS E was 2.494 times higher than LOS A, and the injury collision probability of LOS F was 5.098 times higher than LOS A. It has been proven that a higher density can lead to injury collisions [6,7]. Thus, LOS F had the highest injury collision probability. In LOS D and LOS C, more transitional states from free flow to congested flow started to emerge with sudden reductions in speed. Due to LOS D having a higher density than LOS C, LOS D had the second-highest injury collision probability, followed by LOS C.

5.2. The Sequential Logit Model for Collision Types and Severities

The five-stage Bayesian random parameter sequential logit model was established to quantify the relationship between LOS and collision probability by different types and severities. Specifically, Section 5.2.1 was used to explore the relationship between LOS and collision types. In this section, Stage 1 is used to predict the collision likelihood. Stages 2–4 are used to predict the hit object collision probability, sideswipe collision probability, and rear end collision probability, respectively. Section 5.2.2 was used to explore the relationship between LOS and collision severities. In this section, Stage 5 is used to predict the injury probability for different collision types. To avoid the correlation between traffic flow variables and LOS, only LOS was considered in the models. In addition to LOS, as shown in Table 4, five other candidate variables including visibility, road surface, weather, ramp, and number of lanes were also taken into consideration at every stage. The simulation method of this section is similar to that of Section 4.1.

5.2.1. Sequential Model for Collision Types

Table 5 presents the results of the collision probability for different types, from Stage 1 to Stage 4. As shown at Stage 1, low visibility significantly increases collision probability. LOS A has a random negative coefficient, indicating that the collision probability decreases in LOS A. In previous studies, it has been proven that free flow has the best safety performance [15] because the driver has sufficient time to adopt emergency measures in LOS A, with less flow and more space. LOS C has a positive correlation effect on the collision probability. Although vehicles are still in free flow in LOS C, the space between vehicles becomes smaller than in LOS A and LOS B. In previous studies, it has been demonstrated that more drivers will take advantage of higher speeds under uncongested conditions [35,36]. Thus, higher speed and smaller space can lead to less response time for drivers to take emergency measures.
As shown in Table 5, the hit object collision probability was calculated for Stage 2. This model has six significant variables, as shown in Table 5, including number of lanes, visibility, road surface, LOS B, LOS C, and LOS D. The results indicated that the hit object collision probability increases with less lanes, low visibility, and a worse road surface. Specifically for these reasons, less lanes can lead to less space between vehicles. Low visibility can result in less response time for drivers to take emergency measures. Vehicles need longer braking distances in worse road surface conditions. In addition, the hit object collision probability decreases in LOS B, LOS C, and LOS D.
The Stage 3 model was established to predict the likelihood of a sideswipe collision. This model has seven significant variables, as shown in Table 5, including number of lanes, visibility, ramp, LOS A, LOS B, LOS C, and LOS D. Number of lanes, visibility and ramp have random negative coefficients, indicating that the sideswipe collision probability increases with less lanes, low visibility, and non-ramp conditions. In LOS A, LOS B, LOS C, and LOS D situations, the sideswipe collision probability decreases. The results indicated that low occupancy can decrease the sideswipe collision probability because there is more space for drivers to take emergency measures in low occupancy conditions.
The Stage 4 model was established to predict rear end collision likelihood. This model has six significant variables, as shown in Table 5, including number of lanes, visibility, road surface, LOS A, LOS C, and LOS D. More lanes, high visibility, and a worse road surface have positive effects on rear end collision probability. For LOS A, LOS C, and LOS D, the rear end collision probability increases.

5.2.2. Sequential Model for Collision Severities by Different Types

Table 6 presents the results of the probability of collision severities by different types at Stage 5. It was found that there are significant differences in the contributing factors of the estimation results.
For hit object collision, the results indicate that four variables can significantly affect the severity of hit object collisions. All of the significant variables have negative effects on the injury probability of hit object collisions, specifically few number of lanes, non-ramp segments, normal weather, and low visibility can increase the injury probability of hit object collisions. The results indicate that higher speeds in non-ramp segments, normal weather conditions, fewer number of lanes, and less response time in low-visibility conditions can increase the injury probability of hit object collisions. In addition, it was found that LOS has no effects on the severity of hit object collisions.
For sideswipe collisions, the results imply that four variables can significantly affect the severity of sideswipe collisions. Specifically, fewer number of lanes, non-ramp segments, and low visibility can increase the injury probability of sideswipe collisions. The results are similar to the hit object collisions above. Moreover, a PDO sideswipe collision is more likely to occur in LOS D.
For rear end collisions, the results indicate that five variables can significantly affect the severity of rear end collisions. Specifically, few number of lanes, non-ramp segments, low visibility, and normal road surfaces can increase the injury probability of rear end collisions. Furthermore, a PDO rear end collision is more likely to occur in LOS A.

6. Conclusions

In this study, the main purpose was to identify the relationship between LOS and different collision types and severities, and explore how contributing factors affect collision risks for different types and severities. The collision-related data were obtained from the I-880 freeway, which is located in California, United States. The time interval was from 2006 to 2011. The Bayesian conditional logit model was built to analyze the relative safety performance of different collision types and severities without considering other traffic-related factors in different LOSs. A five-stage Bayesian random parameter sequential logit model was applied to quantify the effects of various variables of the collision probability of different types and severities.
Specifically, as shown in Figure 2, the results of the Bayesian conditional logit models in Table 3 indicate that LOS A is the safest traffic state for different collision types and severities. LOS C has the worst safety performance associated with hit object collisions, and the hit object collision probability in LOS C is 3.319 times higher than the one in LOS A. LOS D has the worst safety performance associated with sideswipe collisions and rear end collisions, the sideswipe collision probability and the rear end collision probability in LOS D is 6.279 and 6.078 times higher than the one in LOS A. LOS F has the worst safety performance associated with injury collisions, and the injury collision probability in LOS F is 5.098 times higher than the one in LOS A, because an injury collision is more likely to occur in traffic flow states with high occupancy [6,7].
The results of the sequential logit model showed that weather variables, road variables, and LOS had significant effects on different collision types and severities. It was found that fewer lanes and low visibility can both increase the injury probability of hit object collisions, sideswipe collisions, and rear end collisions. Ramp segments could decrease the injury probability of hit object collisions and sideswipe collisions. Normal weather conditions could increase the injury probability of hit object collisions and rear end collisions. Normal road surfaces could increase the injury probability of rear end collisions.
This research can help transportation professionals better understand which LOS is more dangerous for different collision types and severities, and realize the contributing factors of different collision types and severities in different LOSs. The results can be applied to reduce the collision probability of different types and severities in different LOSs.
However, there are still some issues that need to be studied in the future. Firstly, more divided methods of traffic flow states should be adapted, such as three-phase theory. Second, more traffic variables should be used in the models, such as driver behavior and geometric design. Finally, the transferability of the models in this study still needs to be verified in the future.

Author Contributions

Conceptualization, B.Y., Y.W.; methodology, B.Y.; software, J.B.; validation, B.Y., J.B., W.Z.; formal analysis, B.Y.; investigation, W.Z.; resources, W.Z.; data curation, Y.W.; writing—original draft preparation, B.Y.; writing—review and editing, B.Y.; visualization, J.B.; supervision, B.Y.; project administration, W.Z.; funding acquisition, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was sponsored by the Projects of the National Natural Science Foundation of China (71701046; 51878236), the Foundation for Jiangsu key Laboratory of Traffic and Transportation Security (TTS2020-03), and the Talent Research Start-Up Fund of Nanjing University of Aeronautics and Astronautics under grant (1007-YAH20100).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yang, B.; Liu, Z.-H.; Chan, C.-Y.; Xu, C.; Guo, Y. Identifying the crash characteristics on freeway segments based on different ramp influence areas. Traffic Inj. Prev. 2019, 20, 386–391. [Google Scholar] [CrossRef]
  2. Abdel-Aty, M.; Pemmanaboina, R. Calibrating a Real-Time Traffic Crash-Prediction Model Using Archived Weather and ITS Traffic Data. IEEE Trans. Intell. Transp. Syst. 2006, 7, 167–174. [Google Scholar] [CrossRef]
  3. Hossain, M.; Muromachi, Y. Understanding Crash Mechanism and Selecting Appropriate Interventions for Real-Time Hazard Mitigation on Urban Expressways. Transp. Res. Rec. 2011, 2213, 53–62. [Google Scholar] [CrossRef]
  4. Yang, B.; Guo, Y.; Xu, C. Analysis of freeway secondary crashes with a two-Step method by loop detector data. IEEE Access 2019, 7, 22884–22890. [Google Scholar] [CrossRef]
  5. Giovanny, P.; Blanca, A.; Camino, G.; Francisco, A. Influential Factors on Injury Severity for Drivers of Light Trucks and Vans with Machine Learning Methods. Sustainability 2020, 12, 1324. [Google Scholar]
  6. Golob, T.F.; Recker, W.W.; Alvarez, V.M. Freeway safety as a function of traffic flow. Accid. Anal. Prev. 2004, 36, 933–946. [Google Scholar] [CrossRef]
  7. Golob, T.F.; Recker, W.W. A method for relating type of crash to traffic flow characteristics on urban freeways. Transp. Res. Part A Policy Pract. 2004, 38, 53–80. [Google Scholar] [CrossRef]
  8. Li, Z.; Wang, W.; Chen, R.; Liu, P.; Xu, C. Evaluation of the Impacts of Speed Variation on Freeway Traffic Collisions in Various Traffic States. Traffic Inj. Prev. 2013, 14, 861–866. [Google Scholar] [CrossRef]
  9. Li, Z.; Wang, W.; Chen, R.; Liu, P. Conditional inference tree-based analysis of hazardous traffic conditions for rear-end and sideswipe collisions with implications for control strategies on freeways. IET Intell. Transp. Syst. 2014, 8, 509–518. [Google Scholar] [CrossRef]
  10. Wang, J.; Zhang, Q.; Wang, Y.; Weng, J.; Yan, X. Analysis of Sideswipe Collision Precursors Considering the Spatial-Temporal Characteristics of Freeway Traffic. J. Transp. Eng. 2016, 142, 04016064. [Google Scholar] [CrossRef]
  11. Kwak, H.-C.; Kho, S. Predicting crash risk and identifying crash precursors on Korean expressways using loop detector data. Accid. Anal. Prev. 2016, 88, 9–19. [Google Scholar] [CrossRef]
  12. Xu, C.; Wang, W.; Liu, P.; Zhang, F. Development of a Real-Time Crash Risk Prediction Model Incorporating the Various Crash Mechanisms Across Different Traffic States. Traffic Inj. Prev. 2014, 16, 28–35. [Google Scholar] [CrossRef]
  13. Xu, C.; Liu, Z.-H.; Wang, W.; Li, Z. Safety performance of traffic phases and phase transitions in three phase traffic theory. Accid. Anal. Prev. 2015, 85, 45–57. [Google Scholar] [CrossRef]
  14. Hall, F.; Hurdle, V.; Banks, J. Synthesis of recent work on the nature of speed-flow and flow-occupancy (or density) relationships on freeways. Transp. Res. Rec. 1992, 1365, 12–18. [Google Scholar]
  15. Wu, N. A new approach for modeling of Fundamental Diagrams. Transp. Res. Part A Policy Pract. 2002, 36, 867–884. [Google Scholar] [CrossRef]
  16. Kerner, B.S. Empirical macroscopic features of spatial-temporal traffic patterns at highway bottlenecks. Phys. Rev. E 2002, 65, 1–30. [Google Scholar] [CrossRef] [PubMed]
  17. Transportation Research Board. Highway Capacity Manual; Transportation Research Board of the National Academies: Washington, DC, USA, 2010. [Google Scholar]
  18. Bruce, N.; Pope, D.; Stanistreet, D. Quantitative Methods for Health Research; Wiley: Hoboken, NJ, USA, 2008. [Google Scholar]
  19. Xiao, G.; Wang, Z. Empirical Study on Bikesharing Brand Selection in China in the Post-Sharing Era. Sustainability 2020, 12, 3125. [Google Scholar] [CrossRef] [Green Version]
  20. Yamamoto, T.; Shankar, V.N. Bivariate ordered-response probit model of driver’s and passenger’s injury severities in collisions with fixed objects. Accid. Anal. Prev. 2004, 36, 869–876. [Google Scholar] [CrossRef]
  21. Yamamoto, T.; Hashiji, J.; Shankar, V.N. Underreporting in traffic accident data, bias in parameters and the structure of injury severity models. Accid. Anal. Prev. 2008, 40, 1320–1329. [Google Scholar] [CrossRef] [PubMed]
  22. Lord, D.; Mannering, F.L. The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives. Transp. Res. Part A Policy Pract. 2010, 44, 291–305. [Google Scholar] [CrossRef] [Green Version]
  23. Mothafer, G.I.; Yamamoto, T.; Shankar, V.N. Evaluating crash type covariances and roadway geometric marginal effects using the multivariate Poisson gamma mixture model. Anal. Methods Accid. Res. 2016, 9, 16–26. [Google Scholar] [CrossRef]
  24. Milton, J.C.; Shankar, V.N.; Mannering, F.L. Highway accident severities and the mixed logit model: An exploratory empirical analysis. Accid. Anal. Prev. 2008, 40, 260–266. [Google Scholar] [CrossRef] [PubMed]
  25. Xu, C.; Li, H.; Zhao, J.; Chen, J.; Wang, W. Investigating the relationship between jobs-housing balance and traffic safety. Accid. Anal. Prev. 2017, 107, 126–136. [Google Scholar] [CrossRef] [PubMed]
  26. Mannering, F.L.; Shankar, V.N.; Bhat, C.R. Unobserved heterogeneity and the statistical analysis of highway accident data. Anal. Methods Accid. Res. 2016, 11, 1–16. [Google Scholar] [CrossRef]
  27. Venkataraman, N.; Shankar, V.; Ulfarsson, G.F.; Deptuch, D. A heterogeneity-in-means count model for evaluating the effects of interchange type on heterogeneous influences of interstate geometrics on crash frequencies. Anal. Methods Accid. Res. 2014, 2, 12–20. [Google Scholar] [CrossRef]
  28. Venkataraman, N.; Ulfarsson, G.F.; Shankar, V.N. Extending the Highway Safety Manual (HSM) framework for traffic safety performance evaluation. Saf. Sci. 2014, 64, 146–154. [Google Scholar] [CrossRef]
  29. Guo, Y.; Liu, P.; Wu, Y.; Yang, M. Traffic Conflict Model Based on Bayesian Multivariate Poisson-lognormal Normal Distribution. China J. Highw. Transp. 2018, 31, 101–109. [Google Scholar]
  30. Guo, Y.; Li, Z.; Liu, Z.-H.; Wu, Y. Modeling correlation and heterogeneity in crash rates by collision types using full bayesian random parameters multivariate Tobit model. Accid. Anal. Prev. 2019, 128, 164–174. [Google Scholar] [CrossRef]
  31. Guo, Y.; Li, Z.; Sayed, T. Analysis of Crash Rates at Freeway Diverge Areas using Bayesian Tobit Modeling Framework. Transp. Res. Rec. 2019, 2673, 652–662. [Google Scholar] [CrossRef]
  32. Guo, Y.; Osama, A.; Sayed, T. A cross-comparison of different techniques for modeling macro-level cyclist crashes. Accid. Anal. Prev. 2018, 113, 38–46. [Google Scholar] [CrossRef]
  33. Rubin, G.P. Minor surgery. A text and atlas. J. S. Brown. 280 × 255 mm. Pp. 220 + xii. Illustrated and colour. 1986. London: Chapman and Hall. £45.00. BJS 1987, 74, 438. [Google Scholar] [CrossRef]
  34. Xu, C.; Wang, Y.; Liu, P.; Wang, W.; Bao, J. Quantitative risk assessment of freeway crash casualty using high-resolution traffic data. Reliab. Eng. Syst. Saf. 2018, 169, 299–311. [Google Scholar] [CrossRef]
  35. Qin, X.; Wang, K.; Cutler, C.E. Analysis of Crash Severity Based on Vehicle Damage and Occupant Injuries. Transp. Res. Rec. 2013, 2386, 95–102. [Google Scholar] [CrossRef] [Green Version]
  36. Zhang, X.; Xu, J.; Liang, Q.; Ma, F. Modeling Impacts of Speed Reduction on Traffic Efficiency on Expressway Uphill Sections. Sustainability 2020, 12, 587. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The framework of the five-stage sequential logit model.
Figure 1. The framework of the five-stage sequential logit model.
Sustainability 12 07386 g001
Figure 2. The collision probability of different types and severities in various LOSs.
Figure 2. The collision probability of different types and severities in various LOSs.
Sustainability 12 07386 g002
Table 1. Boundary values of density at different levels of service (LOSs).
Table 1. Boundary values of density at different levels of service (LOSs).
LOSBoundary Value of Density (Vehicle/km/Lane)
LOS A≤18
LOS B18–29
LOS C29–42
LOS D42–56
LOS E56–72
LOS F>72
Table 2. The number of collisions by different types and severities.
Table 2. The number of collisions by different types and severities.
LOSHit Object CollisionSideswipe CollisionRear end CollisionInjury CollisionTotal
LOS A13641888463423868326
LOS B54139593199811
LOS C3182381130505
LOS D52813443169
LOS E66341347
LOS F613371961
Total14662156581327909919
Table 3. The results of Bayesian conditional logit models.
Table 3. The results of Bayesian conditional logit models.
VariablesMeanMC Error2.50%97.50%Odds Ratio
Hit Object Collision
LOS B1.0570.2020.6561.4542.878
LOS C1.4630.2620.9371.9564.319
LOS D1.0400.668−0.3732.2562.829
LOS E1.1830.604−0.0432.3313.264
LOS F1.0231.121−1.1423.2702.782
LOS A *
Sideswipe Collision
LOS B1.1410.1260.8981.3793.130
LOS C1.4700.1571.1741.7844.349
LOS D1.9850.2871.4432.5477.279
LOS E0.4840.655−0.9081.6681.623
LOS F0.7370.821−0.9832.2902.090
LOS A *
Rear end Collision
LOS B1.3850.0651.2641.5153.995
LOS C1.7970.0851.6281.9636.032
LOS D1.9570.1361.692.2237.078
LOS E1.6230.2421.1532.0895.053
LOS F1.7570.3531.0782.4485.795
LOS A *
Injury Collision
LOS B1.3450.1021.1481.5413.838
LOS C1.5940.1321.3291.8574.923
LOS D1.6990.2161.2732.1185.468
LOS E1.2510.3720.4971.9773.494
LOS F1.8080.5270.782.8286.098
LOS A *
* denotes the reference level.
Table 4. Candidate variables.
Table 4. Candidate variables.
Candidate VariablesExplanation
ViVisibility (mile)
We1 = worse weather conditions; 0 = normal weather conditions;
Rs1 = worse road surface; 0 = normal road surface
Ra1 = ramp segment; 0 = non-ramp segment
NlNumber of lanes
LOS A1 = LOS A; 0 = otherwise
LOS B1 = LOS B; 0 = otherwise
LOS C1 = LOS C; 0 = otherwise
LOS D1 = LOS D; 0 = otherwise
LOS E1 = LOS E; 0 = otherwise
LOS F1 = LOS F; 0 = otherwise
Table 5. Results of sequential model from Stage 1 to Stage 4.
Table 5. Results of sequential model from Stage 1 to Stage 4.
VariablesMeanMC Error2.50%Median97.50%
Stage 1
Vi−0.1110.014−0.136−0.126−0.001
LOS A−0.0170.003−0.025−0.0180.000
LOS C0.0490.0090.0130.0510.082
Stage 2
Nl−0.1460.008−0.196−0.150−0.073
Vi−0.1530.002−0.163−0.155−0.141
Rs0.2640.0230.0080.2870.458
LOS B−0.2110.020−0.414−0.169−0.049
LOS C−0.2790.023−0.390−0.335−0.016
LOS D−0.6220.061−1.078−0.539−0.054
Stage 3
Nl−0.0330.002−0.061−0.031−0.017
Ra−0.2550.041−0.655−0.135−0.001
Vi−0.1060.002−0.119−0.107−0.087
LOS A−0.0450.002−0.059−0.046−0.015
LOS B−0.1370.015−0.241−0.164−0.033
LOS C−0.2720.019−0.398−0.301−0.019
LOS D−0.4690.056−0.908−0.614−0.010
Stage 4
Nl0.1700.0050.0970.1810.198
Vi0.1960.0030.1510.1990.208
Rs0.1580.0130.0290.1650.268
LOS A0.0300.0030.0050.0290.059
LOS C0.2010.0220.0860.1360.372
LOS D0.1260.0110.0110.1090.261
Table 6. Results of sequential model for Stage 5.
Table 6. Results of sequential model for Stage 5.
VariablesMeanMC Error2.50%Median97.50%
Hit Object Collision
Nl−0.0480.008−0.073−0.057−0.001
Ra−0.0620.013−0.130−0.059−0.006
We−0.0950.012−0.129−0.110−0.029
Vi−0.0400.003−0.052−0.041−0.016
Sideswipe Collision
Nl−0.0550.006−0.068−0.059−0.011
Ra−0.0790.014−0.168−0.072−0.004
Vi−0.1000.010−0.120−0.111−0.020
LOS D−0.3670.051−0.479−0.431−0.050
Rear end Collision
Nl−0.0730.004−0.102−0.075−0.036
We−0.0740.008−0.140−0.068−0.002
Vi−0.0490.002−0.069−0.050−0.034
Rs−0.1150.023−0.348−0.067−0.017
LOS A−0.0210.002−0.036−0.022−0.003

Share and Cite

MDPI and ACS Style

Yang, B.; Wu, Y.; Zhang, W.; Bao, J. Modeling Collision Probability on Freeway: Accounting for Different Types and Severities in Various LOS. Sustainability 2020, 12, 7386. https://doi.org/10.3390/su12187386

AMA Style

Yang B, Wu Y, Zhang W, Bao J. Modeling Collision Probability on Freeway: Accounting for Different Types and Severities in Various LOS. Sustainability. 2020; 12(18):7386. https://doi.org/10.3390/su12187386

Chicago/Turabian Style

Yang, Bo, Yao Wu, Weihua Zhang, and Jie Bao. 2020. "Modeling Collision Probability on Freeway: Accounting for Different Types and Severities in Various LOS" Sustainability 12, no. 18: 7386. https://doi.org/10.3390/su12187386

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop