Next Article in Journal
Environmental Scenario Analysis on Natural and Social-Ecological Systems: A Review of Methods, Approaches and Applications
Next Article in Special Issue
Calibrating the Wiedemann 99 Car-Following Model for Bicycle Traffic
Previous Article in Journal
Accelerating Climate Service Development for Renewable Energy, Finance and Cities
Previous Article in Special Issue
Surrogate Safety Measures from Traffic Simulation: Validation of Safety Indicators with Intersection Traffic Crash Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Feasibility of Stochastic Models for Evaluation of Potential Factors for Safety: A Case Study in Southern Italy

by
Giuseppe Guido
1,
Sina Shaffiee Haghshenas
1,
Sami Shaffiee Haghshenas
1,
Alessandro Vitale
1,
Vittorio Astarita
1,* and
Ashkan Shafiee Haghshenas
2
1
Department of Civil Engineering, University of Calabria, Via Bucci, 87036 Rende, Italy
2
W Booth School of Engineering Practice & Technology, McMaster University, Main St W 1280, Hamilton, ON L8S 4L8, Canada
*
Author to whom correspondence should be addressed.
Sustainability 2020, 12(18), 7541; https://doi.org/10.3390/su12187541
Submission received: 31 July 2020 / Revised: 9 September 2020 / Accepted: 11 September 2020 / Published: 12 September 2020

Abstract

:
There is no definite conclusion about what the main variables that play a fundamental role in road safety are. Therefore, the identification of significant factors in road accidents has been a primary concern of the transportation safety research community for many years. Every accident is influenced by multiple variables that, in a given time interval, concur to cause a crash scenario. Information coming from crash reports is very useful in traffic safety research, and several reported crash variables can be analyzed with modern statistical methods to establish whether a classification or clustering of different crash variables is possible. Hence, this study aims to use stochastic techniques for evaluating the role of some variables in accidents with a clustering analysis. The variables that are considered in this paper are light conditions, weekday, average speed, annual average daily traffic, number of vehicles, and type of accident. For this purpose, a combination of particle swarm optimization (PSO) and the genetic algorithm (GA) with the k-means algorithm was used as the machine-learning technique to cluster and evaluate road safety data. According to a multiscale approach, based on a set of data from two years of crash records collected from rural and urban roads in the province of Cosenza, 154 accident cases were accurately investigated and selected for three categories of accident places, including straight, intersection, and other, in each urban and rural network. PSO had a superior performance, with 0.87% accuracy on urban and rural roads in comparison with GA, although the results of GA had an acceptable degree of accuracy. In addition, the results show that, on urban roads, social cost and type of accident had the most and least influence for all accident places, while, on rural roads, although the social cost was the most notable factor for all accident places, the type of accident had the least effect on the straight sections and curves, and the number of vehicles had the least influence at intersections.

1. Introduction

Traffic accidents are in the top 10 major causes of injuries and deaths [1], representing, with the associated economic loss, an important socioeconomic concern. For these reasons, the identification of the significant factors associated with such crashes has become of major interest and a great challenge in transportation safety research. There are many problems related to the inadequacy and difficulty of obtaining data for analysis as well as the limited crash counts. Moreover, another challenge with such studies is that there may potentially be redundant information and correlations among different candidate factors. Consequently, considerable efforts have to be made in the field of accident analysis to address these problems properly and effectively.
Traditionally, in the scientific literature, statistical models based on crash frequency are a common tool for estimating the safety performance functions of many transportation systems. Count regression models are very often applied in road safety studies dealing with the crash-frequency context [2]. Among them there are Poisson or negative binomial (NB) models [3,4,5,6,7] and zero-inflated Poisson or NB models [5,8,9]. These safety models offer, as an advantage, the opportunity to correlate the specific roadway characteristics and expected crash frequencies. On the contrary, their limits are associated with strong parametric modeling assumptions and omitted variable bias, reducing the prediction suitability. In order to overcome the limits of traditional safety models, several researchers introduced heterogeneity models [10,11,12]. Random parameter models belong to this category of models [13,14,15]; in them, model parameters are allowed to vary from site to site, improving statistical inference and the predictive capability of such models [12]. In addition to traditional and heterogeneity safety models, researchers have proposed many nonparametric and artificial intelligence models for the study of crash injury patterns. Artificial intelligence approaches have been proven to be more efficient and effective in the modeling process [16]. These models are data driven and include decision trees (DT) [17,18,19], support vector machine (SVM) [20,21], and random forest (RF) [22]. The SVM has been utilized for the classification of crash injury severity [23,24] and outperforms the popular ordered probit model for the prediction of injury severity and factor impact assessment [23]. Thanks to the big-data era, optimization efficiency has become a primary concern for the analysis procedure. For these reasons, in the scientific literature, there is a need to develop novel algorithms to handle the traffic crash records efficiently.
The main aim of this paper is to propose a hybrid model combining particle swarm optimization (PSO) and genetic algorithm (GA) with k-means to cluster and to evaluate significant factors affecting road safety. The paper is organized as follows. A literature review of the different model categories and approaches is provided in the next section. Section 2 describes the methodological approach and the theoretical fundaments of particle swarm optimization (PSO) and the genetic algorithm (GA). The case study and the data description are provided in Section 3. In Section 4 and Section 5, the application of PSO and GA algorithms have been described. Section 6 and Section 7 refer to the discussion of results and conclusions.

2. Methodology

To cluster a set of unpredicted factors for road safety evaluation, two different stochastic techniques were applied: particle swarm optimization (PSO) and genetic algorithm (GA) with k-means. In both cases, as in several data mining applications, the optimization problem considered is the minimization of Euclidean distance among the different groups of factors. The results obtained by applying the two different techniques were compared in terms of the performance and accuracy of the determined solutions. As a final result, the selected factors were used to create a basic road safety pattern with its temporal and spatial property and dimension.

2.1. Genetic Algorithm (GA)

In artificial intelligence, the genetic algorithm (GA), an evolutionary algorithm, is a powerful tool in the evaluation of complex systems. It was introduced and developed by John Holland and his colleagues during the two decades from 1960 [25]. GA works based upon a stochastic search technique and GA’s performance is inspired by biological evolution. It has been widely applied in various fields of engineering, including computer sciences [26,27], transportation, and geotechniques [28,29,30,31] as well as mathematics [32]. There are three main operators in the process of GA, which include reproduction, crossover, and mutation rules. Generally speaking, at first, in the process of GA, the initial population is randomly generated as a set of number strings (representation of solutions). Then, the goodness and efficiency of each solution is evaluated by the fitness function (objective function), and the best population (solutions) is selected as generators (parents) based upon a reproduction operator to generate new generations (new solutions) based upon the crossover operator. According to the mutation operator, a random change emerges in the next generation, whereby this new generation has higher fitness values in comparison with the old generation. This loop continues and the accuracy of the process is investigated; if it reaches a desired optimization, the loop is stopped [33]. The basic form of the flowchart of GA is shown in Figure 1.

2.2. Particle Swarm Optimization (PSO)

In the modern world, most phenomena and problems deal with complex and nonlinear systems; therefore, they cannot be easily solved. Hence, using artificial intelligence techniques is imperative [35,36,37,38,39,40,41]. Metaheuristic algorithms, a branch of artificial intelligence techniques, are the precise scientific tools to solve uncertain systems in a wide range of sciences and industries [42,43,44,45,46,47,48,49]. Particle swarm optimization (PSO) is one of the notable metaheuristic algorithms that is appropriate for dealing with nonlinear relationships and complex problems. Recently, it has been used successfully to solve complex engineering problems determined by many parameters [28,50,51]. PSO was introduced by Kennedy [52,53]. PSO is inspired by swarm intelligence (SI) and the social behavior of birds and fish in finding enough food in a group. In the process of PSO, a set of solutions, which is a set of particles, is randomly generated; in fact, each solution is a particle. This algorithm works in a D-dimensional community or search space and each particle can affect other neighboring particles or the general particles. Each particle has a velocity vector and, for determining the best optimal answer (new position), two factors of personal best position (Pbest) and global best position (Gbest) are considered [54,55]. Figure 2 shows the movement of particles and their new positions, and the updated velocity vector and position are shown in Equations (1) and (2).
V i ( k + 1 ) = w V i k + c 1 r 1 · ( p b e s t i X i k ) + c 2 r 2 · ( g b e s t X i k )
X i ( k + 1 ) = X i k + V i k
where i = (1,2,3,…,N) and N is population size (particle). Vi(k+1) and Vi(k) are the new velocity vector and old velocity vector of the ith particle, respectively. Xi(k+1) and Xi(k) represent the new position and old position of the ith particle. Furthermore, K = (1,2,3,…) is the number of iterations. w is the inertia weight that is in the interval of 0.4 to 0.9, and it plays a key role in controlling the effect of the old velocity vector on the new velocity vector; r1 and r2 are random numbers that are in the interval of 0 and +1. C1 and C2 are the acceleration constants that should be evaluated by experts and based upon Equation (3). In fact, C1 and C2 are known as the individual learning and social learning factors, respectively [56,57]. Figure 3 shows the movement and modified position of a particle. The particles continue to search for the best position (solution) and, when they find the best position, the algorithm process is stopped. It is worth mentioning that PSO can be applied to solve continuous and discrete problems.
c 1 + c 2 4

2.3. The Optimization Function and the Correlation Analysis Method

As mentioned before, the fitness function contributes greatly to the determination of the solution. Hence, Lloyd’s algorithm (k-means clustering) is considered the fitness function for two metaheuristic algorithms according to the problem statement in this research work. Equation (4) represents Lloyd’s algorithm, which is a common and popular clustering algorithm in data mining [59].
O b j . F u n c t i o n = i = 1 n m i n 1 j k   d ( x i , m j )
where xi and mj are the data set and center of each cluster, respectively, and j = (1,2,3,…,k) indicates the number of the cluster. Furthermore, d is the Euclidean distance between each member and the center of each cluster. It is worth mentioning that two aims are considered (for the Lloyd’s algorithm) the fitness function: these are the distance of the centers of the clusters from each other and also the distance between the members of a cluster, which should become maximization and minimization, respectively.
Before modeling, correlation analysis of the data set is imperative because a high correlation between data sets can have a negative impact on the performances and results of clustering algorithms. Hence, although the dataset was considered by experts in this study, a correlation analysis was conducted. Pearson’s correlation coefficient, a suitable and common correlation analysis, was used for the evaluation of the data set. Mathematical relations are shown in Equations (5) to (8), in which ρ(r) is known as Pearson’s correlation coefficient for two independent parameters (X and Y). SPDxy is the covariance of X and Y. SSX and SSY indicate the standard deviation of X and Y, respectively [60,61,62]. It is worth mentioning that the value of ρ(r) is in the interval of −1 and +1, in which the absolute values of the coefficient demonstrate degrees of correlation and the signs only represent a direct (+) and reverse (−) relation between two independent parameters.
ρ = r = S P D x y S S X · S S Y
S P D x y = x y ( x ) ( y ) n
S S X = i n x i 2 ( x i ) 2 n
S S Y = i n y i 2 ( y i ) 2 n

3. Data Collection and Preparation

3.1. Crash, Traffic, and Speed Data

The crash data used in this research have been derived from the large database of Istituto Nazionale di Statistica (ISTAT). Each crash record was obtained from the collaboration of a multiplicity of entities: ISTAT, Automobile Club Italia (ACI), the Ministry of the Interior, the traffic police department, the Carabinieri, the provincial police, the municipal police, the offices of statistics of the provincial capitals, and the statistical offices of some provinces that have signed an agreement with ISTAT. The database contains details on road accidents that were reported by a police authority, occurred in the national territory over a calendar year, and caused injuries to people (death or injury). There is information on crash dynamics and location, on vehicles, and on the individuals involved, but it does not include property damage only (PDO) events in accordance with current Italian legislation. Indeed, Italian legislation defines road accidents as crashes only when they result in at least one injury [63].
In this research, the road accidents that were selected occurred during two years (2017 and 2018) on the main rural and urban roads of the province of Cosenza (Figure 4). A total of 154 crash events were elaborated, considering 77 accidents on rural roads and 77 on urban roads. In order to analyze and compare homogeneous road accident samples by applying PSO and GA techniques, two samples of 77 crashes for both urban and rural cases were extracted from an accident database. These crashes have been distinguished and grouped by territorial area (urban and rural) and homogeneous geometric elements (intersection, straight, curve, other, etc.). The ISTAT database fields considered in this study are summarized in Table 1.
The fields of the ISTAT database were matched up with ad hoc traffic surveys on the same rural and urban roads considered in order to derive average vehicular speeds and average traffic volumes. The surveys were conducted for one month in October 2019 with the support of Bluetooth radar sensors, which give output not only on traffic volumes but also vehicular speed values. In particular, radar sensors were located on the road sections with observed crashes. From an evaluation of traffic-volume and speed-value statistical trends over a 10-year period, and considering the social, economic, demographic, and travel demand characteristics of the study area, traffic volumes and vehicular speed values were considered invariant over the last five years. An important aspect for the present research was the definition of a segment surrounding each count station, where it could be assumed that homogeneous flow and speed conditions were present for the entire length. If the sensor was positioned on a link with homogeneous geometric characteristics greater than 2 miles in length, a circular buffer 2 miles in diameter around the location of the radar sensor (1 mile upstream and downstream) was defined [64]. The geometric homogeneity of a road segment was defined, taking into account the number of lanes, lane and shoulder width, speed limit, median type, and median width.

3.2. Correlation Data Analysis

Six unpredicted and effective variables in accidents, including the daylight, weekday, average speed, annual average daily traffic, number of vehicles, and type of accident were considered for urban and rural roads. As described in Section 2.3, during the first step, 77 accident cases for urban and rural roads related to the six mentioned variables were normalized between 0 and 1 and then correlation analyses were conducted.
The results of the correlation analysis for urban and rural roads are shown in Table 2 and Table 3.
It should be noted that when |ρ| > 0.85 is established, the correlation between two parameters was considered strong and was not appropriate for clustering analysis. According to the results of Table 3 and Table 4, the maximum correlation was −0.378 between the average speed and annual average daily traffic in the urban areas; they have a reverse relation with each other. On the other hand, there was the maximum correlation between the number of vehicles and type of accident of 0.396, with a direct relation between the two in rural areas. Hence, it can be concluded that all parameters in this study have a suitable correlation with each other, which is in full agreement with other studies on road safety [63].

4. Particle Swarm Optimization Modeling for Urban and Rural Area

Before modeling, some control parameters of the PSO algorithm should be determined at the first step. In order to reach a high level of precision in the performance of the algorithm, it is necessary to determine suitable control parameters, which contribute greatly to the increase of the convergence of the algorithm. There are no determined relations; in fact, these parameters were determined based upon trial and error and experts’ opinions [65,66]. The personal learning coefficient (C1) and global learning coefficient (C2) were considered equal to 2. The minimum acceptance precision, εL = 0.00001, and the maximum number of iterations and population size (swarm size) were considered 300 and 50, respectively. It is worth mentioning that these control parameters can be considered the same for urban and rural situations because the number and type of dataset are the same. Then, modeling was carried out by three classes (clusters) in a Matlab software environment. Figure 5 shows the best cost of performance of the model of the PSO algorithm for urban and rural areas.
According to Figure 5, the convergence trends of the algorithm for both areas shows that the performance of the algorithm was reliable because, after the 76th and 39th iterations, the convergence trends remained constant until the 300th iteration for urban and rural areas, respectively. Optimization and clustering of accident cases by the PSO algorithm for urban and rural areas are shown in Table 4 and Table 5, respectively.
According to the obtained results in Table 4 and Table 5, the Euclidean distance of each accident case from each class (cluster) was calculated based upon the six unpredicted and effective variables in accidents and the lowest distance shows which accident case belongs to which class, and the results are shown in the column of the recognized class. For example, the minimum Euclidean distance in the second case in urban areas was 0.05 from the first class, which shows that this accident case belongs to the first class. In rural areas, the 47th accident case with the minimum Euclidean distance of 0.899 from the third class shows that this case belongs to the third class. Then, the results of the column of the recognized class were compared with the column for the actual type of accident place. Ten cases were wrongly recognized from 77 accident cases for both areas. For instance, in urban areas, the minimum Euclidean distance of the 40th case was calculated as 0.021, which belongs to the first class, although this accident case occurred at the intersection in the actual type. In addition, the 70th case should be recognized at the intersection (second class), although, due to the minimum Euclidean distance, it was recognized in the third class. Generally, the accuracy of the algorithm’s performance was 0.87% for both urban and rural networks, and it should be noted that clustering of accident cases for both areas by model had an acceptable degree. The impacts of six unpredicted and effective variables on each category of accident places based upon the Euclidean distance are shown in Table 6.
According to the obtained results, daylight had the most influence on accident cases on straight roads for urban and rural areas (0 and 0.016, respectively) because, in both of them, daylight had the lowest Euclidean distance from straight in comparison with other effective variables. On the other hand, annual average daily traffic plays a key role in accident cases at intersections in urban areas, with the lowest Euclidean distance of 0.324, while the type of accident was the most important variable of accident cases at the intersection, with the lowest Euclidean distance of 0 among six variables in rural areas. Finally, the type of accident and the day of the week had the most impact on accident cases in the third category of accident places (other) in comparison with other variables with the lowest Euclidean distance of 0 in urban and rural areas.

5. Genetic Algorithm Modeling for Urban and Rural Areas

As mentioned before, the control parameters of the algorithm play a key role in the faster convergence of the algorithm. Hence, according to experts’ experiences and the number of data sets in this study, some of these parameters were selected, namely, the minimum acceptance precision, εL = 0.00001, population size (PS) = 250, maximum number of iterations = 250, and crossover and mutation rate (MR), which were 0.8 and 0.02, respectively [67,68]. It should be noted that these values are dimensionless. After determining the control parameters, the models were constructed with a pseudocode of GA in a Matlab environment. With regards to the data set and problem statement in this research work, the number of classes was considered to be three (clusters) for the urban and rural areas dataset. The performance of models according to the best cost is demonstrated in Figure 6.
According to Figure 6, the convergence trends achieved the desired precision level and it was fixed from the 94th to the 300th iteration for the urban dataset, and the convergence trend was fixed from the 102nd to the 300th iteration for the rural dataset. Hence, it can be concluded that the convergence trends of the algorithm had acceptable degrees and speed for urban and rural situations. Table 7 and Table 8 show the optimization and clustering of accident cases by the GA algorithm for the urban and rural areas, respectively.
The optimum partitions of three classes (clusters) were calculated for the urban and rural areas dataset, with these values being based upon the Euclidean distance and being dimensionless. For instance, in the first accident case of the urban area, the minimum Euclidean distance was 0.327, which belongs to the first class, and the results were in full agreement with the actual type of accident place. However, for the 37th accident case of the urban area, the minimum of Euclidean distance belonged to the second class, with 0.168 among three classes, so it did not match with the real case. In the rural area, the model could classify the second accident case in the third class, so it did match with the actual type of accident place. In addition, in the 63rd case of a rural area, the minimum Euclidean distance was 0.234 for the second class among the three classes that the model had an error for in this case because it was “straight” in the real case. Generally, the model of the GA algorithm can classify the urban and rural areas dataset with 82% and 81% accuracies, respectively. Table 9 shows the values of the Euclidean distance of each effective variable from the center of each class by GA.
The obtained results in Table 9 show that daylight had the most impact for “straight” in the urban and rural areas in comparison with other variables because it had the lowest Euclidean distance equal to zero. The annual average daily traffic and type of accident were the most important variables for the intersection, with 0.326 and 0 in urban and rural areas, respectively. Finally, the type of accident was the most effective factor, with a Euclidean distance of 0.001 for other sections in the urban area, while the number of vehicles contributes greatly for other sections in rural areas in comparison with other variables.

6. Results and Discussion

As mentioned before, the aim of the present research work is to propose the feasibility of stochastic models for the evaluation of potential factors for safety. Owing to the ambiguity and uncertainty present in the investigative process of road safety, two stochastic techniques have been proposed for clustering and evaluating safety. Using a data mining approach, two models were developed for evaluating potential safety factors, using 77 accident cases in urban and rural areas in Cosenza in southern Italy. A comparison was made between the obtained results of two algorithms’ performances, based upon their accuracies, in Figure 7. Consequently, it was found that, although the PSO algorithm provided more accurate clustering in comparison with GA, both stochastic techniques were reliable system modeling techniques for clustering and evaluating potential safety factors.
Furthermore, the two stochastic techniques applied to the case study underlined several common issues.
Models of both algorithms recognized that daylight had the most impact on the “straight” in urban and rural areas and that these results were in full agreement with real conditions of the straight place in two areas of Cosenza.
Annual average daily traffic was considered the most important factor for the “intersection” place only in urban areas by models of PSO and GA. However, these models considered that the type of accident was the most effective variable in rural areas.
Furthermore, in comparison to the Euclidean distances of six variables, both algorithms recognized that the type of accident was the most effective variable for “other” in urban areas.
The unique difference between the two approaches is that, in rural areas, the PSO approach considered the weekday as the most important variable for “other”, while the number of vehicles was recognized by the GA model. The reason could be that both PSO and GA start from nearby solutions but, then, PSO converges very quickly while GA struggles up to a great number of iterations and then converges to a solution nearer to PSO, but this solution is still not better than the one provided by PSO.
It should be noted that these two models based upon the determined control parameters and obtained results are case-specific and cannot be used directly for evaluating the safety of transportation systems of other cities. In addition, these techniques are not appropriate for the evaluation of incomplete data.

7. Conclusions

This paper presented two stochastic techniques to cluster and evaluate transportation safety. For this purpose, a combination of PSO and the genetic algorithm (GA) with the k-means algorithm were used as well as six notable unpredicted variables in accidents, including the daylight, weekday, average speed, annual average daily traffic, number of vehicles, and type of accident, which were considered for urban and rural areas in Cosenza in southern Italy. Some 77 cases of accidents for urban and rural areas were selected and all the cases were classified into three classes. The obtained results were validated based upon three types of accident places, including straight, curve, and intersection, in each urban and rural network. Although both algorithms had a suitable capability to classify case accidents, the PSO algorithm had a superior performance, with 0.86% and 0.87% accuracies on urban and rural roads, respectively, in comparison with GA. In addition, the obtained results of the PSO algorithm show that daylight was the most important parameter to evaluate safety for the straight section in urban and rural areas. In addition, the annual average daily traffic and type of accident played a key role in the safety of intersections in urban and rural areas, respectively. Finally, the type of accident and weekday had the most impact on safety in other sections of urban and rural areas.
For future research work, it is recommended to investigate the effectiveness of other artificial intelligence techniques to evaluate transportation safety, which is worth studying, and to extend the study to a five-year reporting period. In addition, the evaluation of the effects of geometric issues (inconsistencies) or pavement deterioration on safety analysis can be considered another important factor in rural environments in future works. Furthermore, it will be advisable to evaluate homogeneous road segments without reported crashes and to compare them with the used dataset to better understand the analyzed phenomenon by coupling artificial intelligence (AI) and machine learning (ML) techniques with other classic techniques, such as the naïve Bayesian (NB) algorithm and structural equation models (SEM).

Author Contributions

All authors have read and agreed to the published version of the manuscript. G.G., S.S.H. (Sina Shaffiee Haghshenas) and S.S.H. (Sami Shaffiee Haghshenas) were responsible for conceptualization and methodology. G.G. and A.V. analyzed the study context and extracted the dataset. A.V., V.A., and A.S.H. performed supervision, review and editing. S.S.H. (Sina Shaffiee Haghshenas) and S.S.H. (Sami Shaffiee Haghshenas) carried on the statistical analysis.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Blas, E.; Kurup, A.S. (Eds.) Equity, Social Determinants and Public Health Programmes; World Health Organization: Geneva, Switzerland, 2010. [Google Scholar]
  2. Li, Y.; Ma, D.; Zhu, M.; Zeng, Z.; Wang, Y. Identification of significant factors in fatal-injury highway crashes using genetic algorithm and neural network. Accid. Anal. Prev. 2018, 111, 354–363. [Google Scholar] [CrossRef] [PubMed]
  3. Jovanis, P.P.; Chang, H.L. Modeling the relationship of accidents to miles traveled. Transp. Res. Board 1986, 1068, 42–51. [Google Scholar]
  4. Poch, M.; Mannering, F. Negative binomial analysis of intersection-accident frequencies. J. Transp. Eng. 1996, 122, 105–113. [Google Scholar] [CrossRef] [Green Version]
  5. Shankar, V.; Milton, J.; Mannering, F. Modeling accident frequencies as zero-altered probability processes: An empirical inquiry. Accid. Anal. Prev. 1997, 29, 829–837. [Google Scholar] [CrossRef]
  6. Milton, J.; Mannering, F. The relationship among highway geometrics, traffic-related elements and motor-vehicle accident frequencies. Transportation 1998, 25, 395–413. [Google Scholar] [CrossRef]
  7. Hadi, M.A.; Aruldhas, J.; Chow, L.; Wattleworth, J.A. Estimating safety effects of cross-section design for various highway types using negative binomial regression. Transp. Res. Rec. 1995, 1500, 169–177. [Google Scholar]
  8. Miaou, S.P. The relationship between truck accidents and geometric design of road sections: Poisson versus negative binomial regressions. Accid. Anal. Prev. 1994, 26, 471–481. [Google Scholar] [CrossRef] [Green Version]
  9. Lee, J.; Mannering, F. Impact of roadside features on the frequency and severity of run-off-roadway accidents: An empirical analysis. Accid. Anal. Prev. 2020, 34, 149–161. [Google Scholar] [CrossRef]
  10. Lord, D.; Mannering, F. The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives. Transp. Res. Part A Policy Pract. 2010, 44, 291–305. [Google Scholar] [CrossRef] [Green Version]
  11. Mannering, F.L.; Shankar, V.; Bhat, C.R. Unobserved heterogeneity and the statistical analysis of highway accident data. Anal. Methods Accid. Res. 2016, 11, 1–16. [Google Scholar] [CrossRef]
  12. Mannering, F. Temporal instability and the analysis of highway accident data. Anal. Methods Accid. Res. 2018, 17, 1–13. [Google Scholar] [CrossRef]
  13. Park, E.S.; Lord, D. Multivariate Poisson-lognormal models for jointly modeling crash frequency by severity. Transp. Res. Rec. 2007, 2019, 1–6. [Google Scholar] [CrossRef] [Green Version]
  14. El-Basyouny, K.; Sayed, T. Accident prediction models with random corridor parameters. Accid. Anal. Prev. 2009, 41, 1118–1123. [Google Scholar] [CrossRef] [PubMed]
  15. Anastasopoulos, P.C.; Mannering, F.L. A note on modeling vehicle accident frequencies with random-parameters count models. Accid. Anal. Prev. 2009, 41, 153–159. [Google Scholar] [CrossRef] [PubMed]
  16. Abduljabbar, R.; Dia, H.; Liyanage, S.; Bagloee, S.A. Applications of artificial intelligence in transport: An overview. Sustainability 2019, 11, 189. [Google Scholar] [CrossRef] [Green Version]
  17. Karlaftis, M.G.; Golias, I. Effects of road geometry and traffic volumes on rural roadway accident rates. Accid. Anal. Prev. 2002, 34, 357–365. [Google Scholar] [CrossRef]
  18. Abdel-Aty, M.; Keller, J. Exploring the overall and specific crash severity levels at signalized intersections. Accid. Anal. Prev. 2005, 37, 417–425. [Google Scholar] [CrossRef]
  19. Park, Y.J.; Saccomanno, F.F. Collision frequency analysis using tree-based stratification. Transp. Res. Rec. 2005, 1908, 121–129. [Google Scholar] [CrossRef]
  20. Yuan, F.; Cheu, R.L. Incident detection using support vector machines. Transp. Res. Part C Emerg. Technol. 2003, 11, 309–328. [Google Scholar] [CrossRef]
  21. Yao, B.; Hu, P.; Zhang, M.; Jin, M. A support vector machine with the tabu search algorithm for freeway incident detection. Int. J. Appl. Math. Comput. Sci. 2014, 24, 397–404. [Google Scholar] [CrossRef] [Green Version]
  22. Siddiqui, C.; Abdel-Aty, M.; Huang, H. Aggregate nonparametric safety analysis of traffic zones. Accid. Anal. Prev. 2012, 45, 317–325. [Google Scholar] [CrossRef] [PubMed]
  23. Li, Z.; Liu, P.; Wang, W.; Xu, C. Using support vector machine models for crash injury severity analysis. Accid. Anal. Prev. 2012, 45, 478–486. [Google Scholar] [CrossRef]
  24. Chen, C.; Zhang, G.; Qian, Z.; Tarefder, R.A.; Tian, Z. Investigating driver injury severity patterns in rollover crashes using support vector machine models. Accid. Anal. Prev. 2016, 90, 128–139. [Google Scholar] [CrossRef] [PubMed]
  25. Holland, J.H. Adaptation in Natural and Artificial Systems; University of Michigan Press: Ann Arbor, MI, USA, 1975. [Google Scholar]
  26. Arbis, D.; Dixit, V.V. Game theoretic model for lane changing: Incorporating conflict risks. Accid. Anal. Prev. 2019, 125, 158–164. [Google Scholar] [CrossRef] [PubMed]
  27. Kim, Y.H.; Yoon, Y.; Geem, Z.W. A comparison study of harmony search and genetic algorithm for the max-cut problem. Swarm Evol. Comput. 2019, 44, 130–135. [Google Scholar] [CrossRef]
  28. Aryafar, A.; Mikaeil, R.; Shafiee Haghshenas, S.; Shafiei Haghshenas, S. Utilization of soft computing for evaluating the performance of stone sawing machines, Iranian Quarries. Int. J. Min. Geo-Eng. 2018, 52, 31–36. [Google Scholar]
  29. Fakharian, P.; Naderpour, H.; Haddad, A.; Rafiean, A.H.; Eidgahee, D.R. A proposed model for compressive strength prediction of FRP-confined rectangular column in terms of Genetic expression Programming (GEP). Concr. Res. 2018, 11, 5–18. [Google Scholar]
  30. Haghshenas, S.S.; Faradonbeh, R.S.; Mikaeil, R.; Haghshenas, S.S.; Taheri, A.; Saghatforoush, A.; Dormishi, A. A new conventional criterion for the performance evaluation of gang saw machines. Measurement 2019, 146, 159–170. [Google Scholar] [CrossRef]
  31. Pinto, B.Q.; Ribeiro, C.C.; Rosseti, I.; Noronha, T.F. A biased random-key genetic algorithm for routing and wavelength assignment under a sliding scheduled traffic model. J. Glob. Optim. 2020, 77, 949–973. [Google Scholar] [CrossRef]
  32. Luan, J.; Yao, Z.; Zhao, F.; Song, X. A novel method to solve supplier selection problem: Hybrid algorithm of genetic algorithm and ant colony optimization. Math. Comput. Simul. 2019, 156, 294–309. [Google Scholar] [CrossRef]
  33. Hosseini, S.M.; Ataei, M.; Khalokakaei, R.; Mikaeil, R.; Haghshenas, S.S. Study of the effect of the cooling and lubricant fluid on the cutting performance of dimension stone through artificial intelligence models. Eng. Sci. Technol. Int. J. 2020, 23, 71–81. [Google Scholar] [CrossRef]
  34. Salemi, A.; Mikaeil, R.; Haghshenas, S.S. Integration of Finite Difference Method and Genetic Algorithm to Seismic analysis of Circular Shallow Tunnels (Case Study: Tabriz Urban Railway Tunnels). KSCE J. Civ. Eng. 2017, 22, 1978–1990. [Google Scholar] [CrossRef]
  35. Slowik, A.; Kwasnicka, H. Nature inspired methods and their industry applications—Swarm intelligence algorithms. IEEE Trans. Ind. Inform. 2018, 14, 1004–1015. [Google Scholar] [CrossRef]
  36. Dulebenets, M.A. A Comprehensive Evaluation of Weak and Strong Mutation Mechanisms in Evolutionary Algorithms for Truck Scheduling at Cross-Docking Terminals. IEEE Access 2018, 6, 65635–65650. [Google Scholar] [CrossRef]
  37. Brezočnik, L.; Fister, I.; Podgorelec, V. Swarm intelligence algorithms for feature selection: A review. Appl. Sci. 2018, 8, 1521. [Google Scholar] [CrossRef] [Green Version]
  38. Anandakumar, H.; Umamaheswari, K. A bio-inspired swarm intelligence technique for social aware cognitive radio handovers. Comput. Electr. Eng. 2018, 71, 925–937. [Google Scholar] [CrossRef]
  39. Zhao, X.; Wang, C.; Su, J.; Wang, J. Research and application based on the swarm intelligence algorithm and artificial intelligence for wind farm decision system. Renew. Energy 2019, 134, 681–697. [Google Scholar] [CrossRef]
  40. Dulebenets, M.A. An Adaptive Island Evolutionary Algorithm for the berth scheduling problem. Memetic Comput. 2020, 12, 51–72. [Google Scholar] [CrossRef]
  41. Kandiri, A.; Golafshani, E.M.; Behnood, A. Estimation of the compressive strength of concretes containing ground granulated blast furnace slag using hybridized multi-objective ANN and salp swarm algorithm. Constr. Build. Mater. 2020, 248, 118676. [Google Scholar] [CrossRef]
  42. Mikaeil, R.; Haghshenas, S.S.; Haghshenas, S.S.; Ataei, M. Performance prediction of circular saw machine using imperialist competitive algorithm and fuzzy clustering technique. Neural Comput. Appl. 2018, 29, 283–292. [Google Scholar] [CrossRef]
  43. Mikaeil, R.; Haghshenas, S.S.; Hoseinie, S.H. Rock penetrability classification using artificial bee colony (ABC) algorithm and self-organizing map. Geotech. Geol. Eng. 2018, 36, 1309–1318. [Google Scholar] [CrossRef]
  44. Mikaeil, R.; Haghshenas, S.S.; Ozcelik, Y.; Gharehgheshlagh, H.H. Performance evaluation of adaptive neuro-fuzzy inference system and group method of data handling-type neural network for estimating wear rate of diamond wire saw. Geotech. Geol. Eng. 2018, 36, 3779–3791. [Google Scholar] [CrossRef]
  45. Mikaeil, R.; Haghshenas, S.S.; Sedaghati, Z. Geotechnical risk evaluation of tunneling projects using optimization techniques (case study: The second part of Emamzade Hashem tunnel). Nat. Hazards 2019, 97, 1099–1113. [Google Scholar] [CrossRef]
  46. Mikaeil, R.; Beigmohammadi, M.; Bakhtavar, E.; Haghshenas, S.S. Assessment of risks of tunneling project in Iran using artificial bee colony algorithm. SN Appl. Sci. 2019, 1, 1711. [Google Scholar] [CrossRef] [Green Version]
  47. Dormishi, A.; Ataei, M.; Mikaeil, R.; Khalokakaei, R.; Haghshenas, S.S. Evaluation of gang saws’ performance in the carbonate rock cutting process using feasibility of intelligent approaches. Eng. Sci. Technol. Int. J. 2019, 22, 990–1000. [Google Scholar] [CrossRef]
  48. Faradonbeh, R.S.; Haghshenas, S.S.; Taheri, A.; Mikaeil, R. Application of self-organizing map and fuzzy c-mean techniques for rockburst clustering in deep underground projects. Neural Comput. Appl. 2020, 32, 8545–8559. [Google Scholar]
  49. Fiorini Morosini, A.; Shaffiee Haghshenas, S.; Shaffiee Haghshenas, S.; Geem, Z.W. Development of a Binary Model for Evaluating Water Distribution Systems by a Pressure Driven Analysis (PDA) Approach. Appl. Sci. 2020, 10, 3029. [Google Scholar] [CrossRef]
  50. Sarkar, S.; Vinay, S.; Raj, R.; Maiti, J.; Mitra, P. Application of optimized machine learning techniques for prediction of occupational accidents. Comput. Oper. Res. 2019, 106, 210–224. [Google Scholar] [CrossRef]
  51. Liu, Y.; Zou, B.; Ni, A.; Gao, L.; Zhang, C. Calibrating microscopic traffic simulators using machine learning and particle swarm optimization. Transp. Lett. 2020, 1–13. [Google Scholar] [CrossRef]
  52. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the ICNN’95-International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; pp. 1942–1948. [Google Scholar]
  53. Poli, R.; Kennedy, J.; Blackwell, T. Particle swarm optimization. Swarm Intell. 2007, 1, 33–57. [Google Scholar] [CrossRef]
  54. Frank, L.R.; Ferreira, Y.M.; Julio, E.P.; Ferreira, F.H.C.; Dembogurski, B.J.; Silva, E.F. Multilayer Perceptron and Particle Swarm Optimization Applied to Traffic Flow Prediction on Smart Cities. In International Conference on Computational Science and Its Applications; Springer: Cham, Switzerland, 2019; pp. 35–47. [Google Scholar]
  55. Chen, L.; Monteiro, T.; Wang, T.; Marcon, E. Design of shared unit-dose drug distribution network using multi-level particle swarm optimization. Health Care Manag. Sci. 2019, 22, 304–317. [Google Scholar] [CrossRef] [PubMed]
  56. Liu, P.; Xie, M.; Bian, J.; Li, H.; Song, L. A Hybrid PSO–SVM Model Based on Safety Risk Prediction for the Design Process in Metro Station Construction. Int. J. Environ. Res. Public Health 2020, 17, 1714. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Tharwat, A.; Elhoseny, M.; Hassanien, A.E.; Gabel, T.; Kumar, A. Intelligent Bézier curve-based path planning model using Chaotic Particle Swarm Optimization algorithm. Clust. Comput. 2019, 22, 4745–4766. [Google Scholar] [CrossRef]
  58. Noori, A.M.; Mikaeil, R.; Mokhtarian, M.; Haghshenas, S.S.; Foroughi, M. Feasibility of intelligent models for prediction of utilization factor of TBM. Geotech. Geol. Eng. 2020, 38, 3125–3143. [Google Scholar] [CrossRef]
  59. Lloyd, S. Least squares quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef]
  60. Feng, X.; Li, S.; Yuan, C.; Zeng, P.; Sun, Y. Prediction of slope stability using naive Bayes classifier. KSCE J. Civ. 2018, 22, 941–950. [Google Scholar] [CrossRef]
  61. Hosseini, S.M.; Ataei, M.; Khalokakaei, R.; Mikaeil, R.; Haghshenas, S.S. Investigating the Role of the Cooling and Lubricant Fluids on the Performance of Cutting Disks (Case Study: Hard Rocks). Rud. Geološko Naft. Zb. 2019, 34. [Google Scholar]
  62. Pirouz, B.; Shaffiee Haghshenas, S.; Shaffiee Haghshenas, S.; Piro, P. Investigating a serious challenge in the sustainable development process: Analysis of confirmed cases of COVID-19 (new type of coronavirus) through a binary classification using artificial intelligence and regression analysis. Sustainability 2020, 12, 2427. [Google Scholar] [CrossRef] [Green Version]
  63. Mussone, L.; Bassani, M.; Masci, P. Analysis of factors affecting the severity of crashes in urban road intersections. Accid. Anal. Prev. 2019, 103, 112–122. [Google Scholar] [CrossRef]
  64. Dutta, N.; Fontaine, M.D. Improving freeway segment crash prediction models by including disaggregate speed data from different sources. Accid. Anal. Prev. 2019, 132, 1–16. [Google Scholar] [CrossRef]
  65. Dormishi, A.R.; Ataei, M.; Khaloo Kakaie, R.; Mikaeil, R.; Shaffiee Haghshenas, S. Performance evaluation of gang saw using hybrid ANFIS-DE and hybrid ANFIS-PSO algorithms. J. Min. Environ 2020, 10, 543–557. [Google Scholar]
  66. Shaffiee Haghshenas, S.; Pirouz, B.; Shaffiee Haghshenas, S.; Pirouz, B.; Piro, P.; Na, K.S.; Geem, Z.W. Prioritizing and Analyzing the Role of Climate and Urban Parameters in the Confirmed Cases of COVID-19 Based on Artificial Intelligence Applications. Int. J. Env. Res. Public Health 2020, 17, 3730. [Google Scholar] [CrossRef] [PubMed]
  67. Mikaeil, R.; Bakhshinezhad, H.; Haghshenas, S.S.; Ataei, M. Stability analysis of tunnel support systems using numerical and intelligent simulations (case study: Kouhin Tunnel of Qazvin-Rasht Railway). Rud. Geološko Naft. Zb. 2019, 34, 1–10. [Google Scholar] [CrossRef] [Green Version]
  68. Mirjalili, S. Genetic algorithm. In Evolutionary Algorithms and Neural Networks; Springer: Cham, Switzerland, 2019; pp. 43–55. [Google Scholar]
Figure 1. The basic form of the flowchart of the genetic algorithm GA [34].
Figure 1. The basic form of the flowchart of the genetic algorithm GA [34].
Sustainability 12 07541 g001
Figure 2. New velocity and position vectors of a set of particles in a search space.
Figure 2. New velocity and position vectors of a set of particles in a search space.
Sustainability 12 07541 g002
Figure 3. Concept of the modification of the movement of a particle by the particle swarm optimization (PSO) algorithm [58].
Figure 3. Concept of the modification of the movement of a particle by the particle swarm optimization (PSO) algorithm [58].
Sustainability 12 07541 g003
Figure 4. Map of Cosenza province with accident locations (source: CRISC, Calabria).
Figure 4. Map of Cosenza province with accident locations (source: CRISC, Calabria).
Sustainability 12 07541 g004
Figure 5. The performance of the PSO algorithm based upon the best cost of fitness function for urban and rural areas.
Figure 5. The performance of the PSO algorithm based upon the best cost of fitness function for urban and rural areas.
Sustainability 12 07541 g005
Figure 6. The performance of the PSO algorithm based upon the best cost of fitness function for urban and rural areas.
Figure 6. The performance of the PSO algorithm based upon the best cost of fitness function for urban and rural areas.
Sustainability 12 07541 g006
Figure 7. Comparison of measured accuracies of two algorithms’ performances for urban and rural areas.
Figure 7. Comparison of measured accuracies of two algorithms’ performances for urban and rural areas.
Sustainability 12 07541 g007
Table 1. Istituto Nazionale di Statistica (ISTAT) database fields considered.
Table 1. Istituto Nazionale di Statistica (ISTAT) database fields considered.
Data Field TypeData FieldDescription
Human characteristicDriver genderMale or female
Vehicle characteristicVehicle typeCar, motorcycle, truck, and other
Road environmentRoad typeNational rural road, provincial rural road, national and provincial rural road in urban context, and urban road
Other environmentLight conditionsDaylight and nighttime
Day of the week Weekday and weekend
Location environmentMacroarea locationUrban and rural
Accident characteristicNumber of vehiclesNumber of vehicles involved
Accident natureWay out, collision with an accidental obstacle, side collision, front-side collision, rear-end collision, head-on collision, pedestrian collision, impact with parked vehicle, impact with stopped vehicle, fall from vehicle, and sudden braking
Accident severityInjuries and deaths
Table 2. Pearson’s correlation coefficient for the urban area dataset.
Table 2. Pearson’s correlation coefficient for the urban area dataset.
DaylightWeekdayAverage SpeedAnnual Average Daily TrafficNumber of VehiclesType of Accident
Daylight1
Weekday−0.3091
Average speed−0.1240.2621
Annual average daily traffic0.042−0.246−0.3781
Number of vehicles−0.012−0.1120.1480.1041
Type of accident−0.035−0.0290.060.0280.1821
Table 3. Pearson’s correlation coefficient for rural area dataset.
Table 3. Pearson’s correlation coefficient for rural area dataset.
DaylightWeekdayAverage SpeedAnnual Average Daily TrafficNumber of VehiclesType of Accident
Daylight1
Weekday−0.3271
Average speed−0.0780.1181
Annual average daily traffic0.1580.1090.3631
Number of vehicles−0.0490.2150.090.0531
Type of accident−0.2040.279−0.080.0230.3961
Table 4. Optimization and clustering of accident cases in urban area by PSO.
Table 4. Optimization and clustering of accident cases in urban area by PSO.
Case #Optimum PartitionRec. ClassActual Type of Accident PlaceCase #Optimum PartitionRec. ClassActual Type of Accident Place
The First ClassThe Second ClassThe Third ClassThe First ClassThe Second ClassThe Third Class
10.3271.0511.0331Straight400.0211.0141.0161Intersection
20.0501.0171.0161Straight410.1071.0201.0141Straight
31.1311.4980.4053Other420.4711.0961.1141Straight
41.4720.9371.6922Intersection430.2271.0541.0631Straight
51.4240.8691.6642Intersection440.9971.3211.3981Straight
60.2781.0721.0881Straight450.0211.0141.0161Straight
71.0231.4360.1893Other460.3891.1011.1251Straight
80.0501.0171.0161Straight470.2361.0441.0481Straight
90.0501.0171.0161Straight481.2731.5391.6511Straight
100.3581.0651.0431Straight490.2551.0431.0261Straight
110.7661.2381.2591Straight500.4711.0961.1141Straight
121.0010.1851.3592Intersection511.4140.9211.0142Intersection
130.2781.0721.0881Straight521.0220.2351.3602Straight
140.3171.0681.0631Straight531.0831.4700.3013Straight
151.4850.9521.7272Intersection541.4970.9791.7492Intersection
160.2171.0301.0181Straight551.1051.4840.4913Other
171.6031.1121.8232Intersection561.0100.1931.3552Intersection
180.2051.0471.0521Straight571.5861.7921.2243Straight
191.0010.1851.3592Intersection581.0230.2341.3592Straight
200.0501.0171.0161Straight591.4001.1131.4982Intersection
210.2981.0521.0331Straight600.2051.0471.0521Intersection
220.2051.0471.0521Straight610.4711.0961.1141Intersection
231.0101.3371.4291Straight621.0551.0790.4103Other
240.2171.0291.0221Straight631.4811.1321.0753Straight
250.0511.0161.0201Straight640.2051.0471.0521Straight
260.3921.1031.1291Straight650.2051.0471.0521Straight
270.0211.0141.0161Straight660.0191.0141.0141Straight
280.6981.2171.2521Straight670.4711.0961.1141Straight
290.2361.0341.0161Straight681.4210.8861.6872Intersection
301.0061.4280.1693Other690.1891.0391.0521Straight
310.7231.2381.2831Straight701.2200.6931.5442Intersection
320.4031.1071.1231Straight710.5071.1201.1481Straight
330.5161.1091.1151Straight720.2051.0471.0521Straight
340.4711.0961.1141Straight730.2551.0431.0261Straight
350.1071.0201.0141Straight740.0501.0121.0131Straight
360.0211.0141.0161Straight750.2051.0471.0521Straight
370.1661.0001.3591Straight760.0501.0121.0131Straight
381.0191.3311.3991Intersection771.0230.2261.3582Intersection
390.4711.0961.1141Intersection
Table 5. Optimization and clustering of accident cases in rural area by PSO.
Table 5. Optimization and clustering of accident cases in rural area by PSO.
Case #Optimum PartitionRec. ClassActual Type of Accident PlaceCase #Optimum PartitionRec. ClassActual Type of Accident Place
The First ClassThe Second ClassThe Third ClassThe First ClassThe Second ClassThe Third Class
10.1321.0111.1511Straight401.4030.9641.3162Intersection
21.4051.7110.7273Other411.1341.5010.7283Other
30.1521.0111.1521Straight420.0301.0071.1531Straight
40.1321.0111.1511Straight430.0731.0121.1561Straight
51.4221.7250.7753Other440.2351.0301.1841Straight
60.2731.0451.1961Straight450.2351.0301.1841Straight
71.4361.0290.7813Other460.1921.0271.1631Straight
81.0091.4220.5703Other471.7401.4050.8993Other
91.4161.7230.7423Other481.0110.1831.2412Straight
100.1281.0191.2401Straight490.2441.0351.1881Straight
110.1281.0191.2401Straight501.0300.2441.2692Straight
120.1161.0001.2391Straight510.1261.0141.1541Straight
131.3920.9951.2352Intersection520.2731.0451.1961Straight
140.2351.0301.1841Straight531.0881.4660.6643Other
150.2351.0301.1841Straight541.0360.2151.2492Straight
161.4241.7280.7813Other551.3940.9921.2372Intersection
171.7281.3920.8543Other561.3890.9841.2362Intersection
180.2351.0301.1841Straight571.4361.0290.7813Other
191.0121.4220.5723Other580.2351.0301.1841Straight
200.0821.0061.1531Straight590.2351.0301.1841Straight
210.1321.0111.1511Straight600.1321.0111.1511Straight
220.1521.0111.1521Straight610.2351.0301.1841Straight
231.0091.4200.5693Other621.4361.0290.7813Other
240.0301.0071.1531Straight631.0300.2441.2692Straight
250.0301.0071.1531Straight640.9861.3931.2391Intersection
260.3841.0571.1881Other650.0301.0071.1531Straight
271.7281.3900.8543Intersection660.3211.0461.1751Straight
280.9921.3921.2341Other670.2611.0231.1631Straight
290.2351.0301.1841Straight680.1771.0241.1601Straight
300.0821.0061.1531Straight691.0931.4700.6703Other
310.4281.0731.2001Straight701.4251.0240.7423Intersection
321.0300.2441.2692Straight710.0301.0071.1531Straight
331.7281.3900.8543Other720.1771.0241.1601Straight
340.2351.0301.1841Straight730.3151.0601.2081Straight
350.1311.0081.1511Straight741.7371.3980.8633Other
360.2441.0351.1881Straight750.3821.0671.1911Straight
371.4090.9681.3152Intersection761.3981.0091.2402Intersection
380.2351.0301.1841Straight770.1771.0241.1601Straight
391.0001.4190.5733Other
Table 6. Euclidean distance of each effective variable from the center of each class in urban and rural areas for PSO.
Table 6. Euclidean distance of each effective variable from the center of each class in urban and rural areas for PSO.
Urban Area
DaylightWeekdayAverage Speed (km/h)Annual Average Daily Traffic (Vehicle/Day)Number of VehiclesType of Accident
Straight0.0000.9910.7990.3090.4111.000
Intersection1.0000.8520.7320.3240.3691.000
Other0.0920.9630.7030.2580.3080.000
Rural Area
DaylightWeekdayAverage Speed (km/h)Annual Average Daily Traffic
(Vehicle /Day)
Number of VehiclesType of Accident
Straight0.0161.0000.7690.2830.1471.000
Intersection0.0421.0000.7650.2080.1030.000
Other0.4010.0000.7110.2290.1190.603
Table 7. Optimization and clustering of accident cases in urban area by GA.
Table 7. Optimization and clustering of accident cases in urban area by GA.
Case #Optimum PartitionRec. ClassActual Type of Accident PlaceCase #Optimum PartitionRec. ClassActual Type of Accident Place
The First ClassThe Second ClassThe Third ClassThe First ClassThe Second ClassThe Third Class
10.3271.0511.0331Straight400.0211.0141.0161Intersection
20.0501.0181.0161Straight410.1071.0211.0141Straight
31.1311.4980.4063Other420.4711.0961.1131Straight
41.4720.9341.6892Intersection430.2271.0541.0631Straight
51.4240.8671.6612Intersection440.9961.3201.3971Straight
60.2781.0731.0881Straight450.0211.0141.0161Straight
71.0231.4360.1933Other460.3891.1011.1241Straight
80.0501.0181.0161Straight470.2361.0441.0471Straight
90.0501.0181.0161Straight481.2731.5371.6491Straight
100.3581.0661.0431Straight490.2551.0441.0271Straight
110.7661.2361.2571Straight500.4711.0961.1131Straight
121.0010.1881.3572Intersection511.4141.0140.9173Intersection
130.2781.0731.0881Straight521.0220.2381.3582Straight
140.3171.0681.0631Straight531.0831.4710.3033Straight
151.4840.9491.7232Intersection541.4960.9761.7452Intersection
160.2171.0311.0181Straight551.1051.4830.4893Other
171.6031.1081.8202Intersection561.0100.1961.3522Intersection
180.2051.0481.0521Straight571.5851.7901.2223Straight
191.0010.1881.3572Intersection581.0230.2371.3562Straight
200.0501.0181.0161Straight591.1131.3971.4961Intersection
210.2981.0531.0331Straight600.2051.0481.0521Intersection
220.2051.0481.0521Straight610.4711.0961.1131Intersection
231.0101.3361.4281Straight620.4101.0791.0541Other
240.2171.0301.0221Straight631.4811.1321.0723Straight
250.0511.0161.0201Straight640.2051.0481.0521Straight
260.3921.1031.1281Straight650.2051.0481.0521Straight
270.0211.0141.0161Straight660.0201.0151.0141Straight
280.6981.2161.2511Straight670.4711.0961.1131Straight
290.2371.0341.0161Straight681.4200.8831.6842Intersection
301.0061.4290.1723Other690.1891.0401.0511Straight
310.7231.2371.2811Straight701.2190.6921.5412Intersection
320.4031.1071.1231Straight710.5071.1191.1471Straight
330.5161.1091.1141Straight720.2051.0481.0521Straight
340.4711.0961.1131Straight730.2551.0441.0271Straight
350.1071.0211.0141Straight740.0501.0121.0131Straight
360.0211.0141.0161Straight750.2051.0481.0521Straight
371.0000.1681.3562Straight760.0501.0121.0131Straight
381.0191.3301.3981Intersection771.0230.2291.3562Intersection
390.4711.0961.1131Intersection
Table 8. Optimization and clustering of accident cases in rural area by GA.
Table 8. Optimization and clustering of accident cases in rural area by GA.
Case #Optimum PartitionRecognized ClassActual Type of Accident PlaceCase #Optimum PartitionRecognized ClassActual Type of Accident Place
The First ClassThe Second ClassThe Third ClassThe First ClassThe Second ClassThe Third Class
10.1331.0191.0541Straight401.4140.9641.1552Intersection
21.4161.6540.4413Other411.1361.4510.8903Other
30.1531.0191.0561Straight420.0211.0101.0581Straight
40.1331.0191.0541Straight430.0711.0161.0611Straight
51.4331.6650.5233Other440.2331.0271.0961Straight
60.2721.0431.1071Straight450.2331.0271.0961Straight
71.4350.9291.1482Other460.1931.0351.0651Straight
81.0091.3590.7723Other471.7491.3310.9523Other
91.4271.6700.4533Other481.0110.2251.3212Straight
101.0190.1851.3242Straight490.2431.0321.0991Straight
111.0190.1851.3242Straight501.0300.2341.3552Straight
121.0000.1461.3222Straight510.1281.0211.0561Straight
131.0121.3950.8393Intersection520.2721.0431.1071Straight
140.2331.0271.0961Straight531.0891.4120.8413Other
150.2331.0271.0961Straight541.0370.2651.3312Straight
161.4351.6680.5293Other551.0081.3970.8393Intersection
171.7371.3250.9013Other561.0001.3890.8403Intersection
180.2331.0271.0961Straight571.4350.9291.1482Other
191.0121.3590.7753Other580.2331.0271.0961Straight
200.0781.0101.0601Straight590.2331.0271.0961Straight
210.1331.0191.0541Straight600.1331.0191.0541Straight
220.1531.0191.0561Straight610.2331.0271.0961Straight
231.0091.3570.7733Other621.4351.1480.9293Other
240.0211.0101.0581Straight631.0300.2341.3552Straight
250.0211.0101.0581Straight641.0020.8441.3932Intersection
260.3881.0721.0941Other650.0211.0101.0581Straight
271.7371.3220.9023Intersection660.3241.0601.0791Straight
281.0091.3950.8363Other670.2651.0351.0681Straight
290.2331.0271.0961Straight680.1761.0321.0621Straight
300.0781.0101.0601Straight691.0941.4160.8463Other
310.4311.0891.1061Straight701.4250.9351.1142Intersection
321.0300.2341.3552Straight710.0211.0101.0581Straight
331.7371.3220.9023Other720.1761.0321.0621Straight
340.2331.0271.0961Straight730.3151.0581.1181Straight
350.1321.0161.0561Straight741.7461.3330.9103Other
360.2431.0321.0991Straight750.3851.0821.0951Straight
371.4200.9731.1522Intersection761.0250.8450.9292Intersection
380.2331.0271.0961Straight770.1761.0321.0621Straight
391.0001.3530.7763Other
Table 9. Euclidean distance of each effective variable from the center of each class in urban and rural areas for GA.
Table 9. Euclidean distance of each effective variable from the center of each class in urban and rural areas for GA.
Urban Area
DaylightWeekdayAverage Speed (km/h)Annual Average Daily Traffic (Vehicle/Day)Number of VehiclesType of Accident
Straight0.0000.9910.7990.3090.4111.000
Intersection1.0000.8500.7300.3260.3700.999
Other0.0960.9610.7020.2600.3090.001
Rural Area
DaylightWeekdayAverage Speed (km/h)Annual Average Daily Traffic
(Vehicle /Day)
Number of VehiclesType of Accident
Straight0.0001.0000.7710.2870.1461.000
Intersection0.0460.9040.7950.2200.1010.000
Other0.7060.2420.6930.2390.1360.814

Share and Cite

MDPI and ACS Style

Guido, G.; Haghshenas, S.S.; Haghshenas, S.S.; Vitale, A.; Astarita, V.; Haghshenas, A.S. Feasibility of Stochastic Models for Evaluation of Potential Factors for Safety: A Case Study in Southern Italy. Sustainability 2020, 12, 7541. https://doi.org/10.3390/su12187541

AMA Style

Guido G, Haghshenas SS, Haghshenas SS, Vitale A, Astarita V, Haghshenas AS. Feasibility of Stochastic Models for Evaluation of Potential Factors for Safety: A Case Study in Southern Italy. Sustainability. 2020; 12(18):7541. https://doi.org/10.3390/su12187541

Chicago/Turabian Style

Guido, Giuseppe, Sina Shaffiee Haghshenas, Sami Shaffiee Haghshenas, Alessandro Vitale, Vittorio Astarita, and Ashkan Shafiee Haghshenas. 2020. "Feasibility of Stochastic Models for Evaluation of Potential Factors for Safety: A Case Study in Southern Italy" Sustainability 12, no. 18: 7541. https://doi.org/10.3390/su12187541

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop