4.1. Model Construction
This study used the libSVM toolbox in Matlab to build the SVM model. The SVM model is a generalized linear classifier for binary data classification in the supervised learning method. It is given input data and a learning objective in the classification problem , where each sample of input data constitutes the feature space, , and the learning objective is a binary variable , representing positive class and negative class, and the original data of the two types are separated by the optimal hyperplane.
The objective function and constraints of the support vector machine are as follows:
In these formulas, C is the penalty parameter, which is used to control the weight between two items in the objective function, is the slack variable, which is the allowable deviation of the corresponding data point from the separated interface, and is more significant than zero.
In constructing the SVM kernel function, the twenty-three subfactors screened out according to the traffic accident data of Jingshi Road are used as input variables. The number of samples was randomly divided into a training set and a test set in the proportion of 7:3. According to the Jingshi Road traffic accident data, the sample size in the feature space was inseparable. Hence, this article used the radial basis function (RBF) to map the feature space to the high-dimensional feature space to find the optimal hyperplane.
The radial basis function equation is shown in the following equation [
28]:
The kernel decision function formula:
In the process of SVM model training, the k-fold cross-validation (K-CV) method was selected to verify the model. The specific steps of K-CV are: Firstly, divide the total sample into k parts randomly, take one of them as the test set evaluation model and the rest of the k − 1 parts as the training set training model, and repeat this step k times. Each time, take a different subset as the test set. Finally obtain k other models and k scores. The performance of K models was integrated to evaluate the advantages and disadvantages of the model in the current problem, and the model with the best evaluation effect was selected as the verification model. In this paper, the K-CV fold was set as three folds. Then, the particle swarm search method was used to search for excellent nuclear parameters and select the optimal parameter combination.
The basic formulas of the particle swarm search are as follows:
In these formulas, vi is the speed of the particle, rand() is a random number between (0, 1), xi is the current position of the particle, c1 and c2 are the learning factors, is the individual optimal solution, and is the optimal global solution.
In addition to the above two standard formulas, the particle swarm also introduces an inertia factor, namely
w; the formula is as follows:
After the introduction of w, w can change linearly in the process of PSO search, or it can change dynamically according to a specific measure function of PSO performance to obtain a better optimization effect than a fixed value.
In this paper, the particle swarm parameters are all set as general values, that is,
c1 =
c2 = 2, the initial value of
w was set as 0.9 [
29], and the selection range of
c and
was 2
−8~2
8. After repeated debugging, the optimal parameter combination
g = 4.3677 of the kernel function and penalty parameter
C = 47.6773 were finally determined (
Figure 2,
Figure 3 and
Figure 4).
As seen in
Table 4, the accuracy of the training set and test set for minor accidents was higher than that of severe accidents. This phenomenon occurred because the number of minor accidents (990 cases) was about twice that of serious accidents (582 cases), which made the former more conducive to model training and testing, thus resulting in a reduction in the accuracy of the training set and test set of serious accident types. The accuracy of the training set was 74.9%, the accuracy of the test set was 69.1%, and the overall accuracy was about 70%. The specific reason was that some types of data accounted for too many samples, leading to a high probability of selecting this type in the model cross-validation, which was not conducive to sample classification and led to an accuracy of the model remaining of about 70%.
The classification accuracy of the model is as follows:
Table 4.
Model accuracy.
| Training Set Accuracy | Test Set Accuracy |
---|
Minor accident | 75.5% | 73.7% |
Serious accident | 73.4% | 54.0% |
Entirety | 74.9% | 69.1% |
4.2. SVM-RFE Sorting
After the SVM model completes the classification, RFE ranks the importance of features representing subfactors. SVM-RFE is a sequential backward selection algorithm based on the maximum interval principle of SVM.
The basic steps are as follows:
Train SVM on the initial feature set;
Formulate the weight vector and feature ranking according to the following formulas:
Find the features with the lowest ranking and exclude them;
Update the feature data set, and N = N − 1;
Repeat the above steps until the data set is empty.
Input the
C and
g obtained above into the SVM-RFE model. SVM-RFE selects a subfeature with the lowest contribution weight to the overall SVM through each iteration and obtains the importance ranking of all subfeatures through multiple iterations, as shown in
Table 5.
4.3. Discussion
Table 5 shows that summer was the most likely to cause casualties in the four seasons. The probability of losses caused by traffic accidents in spring was the smallest, and the possibility of deaths caused by traffic accidents in autumn and winter was in the middle. In hot and humid environments, drivers are prone to irritability, listlessness, inattention, and other emotions during driving, which constitute a significant potential safety hazard. Secondly, drivers are more susceptible to emotional control and rash judgment in a hot and dry environment. So, drivers should be careful to take proper rest to avoid fatigue while driving in summer and maintain their emotional stability. Because of the high temperatures in summer, many abnormalities can occur during the driving process, such as spontaneous combustion of tires, oil pipe bursts, etc. Therefore, it is necessary to strictly implement the relevant vehicle maintenance rules and regulations in summer, especially for passenger vehicles and dangerous chemical transport vehicles. In addition, the high temperatures in summer will make the pavement asphalt soften, which hurts vehicle braking performance. Therefore, it is vital for regular inspection and maintenance of road surfaces in summer.
Regarding time,
Table 5 shows that traffic accidents were more likely to cause casualties during daytime hours. Compared with other types of roads, roads with urban traffic flow adapts to the tidal phenomenon, and traffic outflows are concentrated during commuting. Because of the enormous traffic base during the day, the incidence of traffic accidents is higher, and the probability of casualties is higher.
Table 5 shows that traffic accidents between moving and stationary vehicles were more likely to cause casualties in accidental collisions. This phenomenon occurs because stationary vehicle drivers need time to start and react before and during clashes. Drivers who are unskilled or insensitive in operation will need a longer operation time, which increases the possibility of vehicle collision and casualties. The probability of losses caused by traffic accidents between people and vehicles is the smallest. That is because the traffic control measures of urban roads are more detailed, and drivers will pay more attention to pedestrians in the driving process because of the influence of the driving environment, which reduces the occurrence of collision accidents between people and vehicles.
The analysis of the accident location from
Table 5 concluded that casualties were more likely to occur when traffic accidents occurred on main roads. The main reason is that China’s current urban trunk road speed limit value-setting standard system has not been developed to perfection. Usually, the design speed is the speed limit standard. The difference between the design and operating speed has not been considered. Moreover, the road capacity will improve significantly due to the upgrading of urban road infrastructure. At this time, if the design speed is still taken as the road speed limit value without synchronous adjustment, the speeding phenomenon of some sections in good running condition and without monitoring will seriously threaten traffic safety. Additionally, the wonder of low-speed driving on the monitored road section will lead to traffic congestion and a waste of road resources, which improve the probability of traffic accidents on the main road. Therefore, in the urban road speed limit setting, the whole urban trunk road can be divided into several sections according to the different road alignments. Particular areas with significant changes in road alignment and road environment can be distinguished from the general roads. The speed limit value can be set according to the situation of the section. It is important to compare and adjust the speed limit value of each road section and set up a reasonable speed limit transition zone between different road sections [
30]. In addition, it can be seen from
Table 5 that the possibility of casualties in traffic accidents near buildings was minimal. When pedestrians or vehicles pass near buildings, these buildings obstruct their sight, so they cannot make judgments and are more alert and less likely to engage in impulsive behaviors.
In the judgment of weather influencing factors,
Table 5 showed that traffic accidents were more likely to cause casualties on rainy days. Factors such as slippery ground, low visibility, and the interference of rain sound are critical factors that induce traffic accidents on rainy days. See
Section 4.4 for a detailed analysis. Traffic accidents in foggy weather have the lowest probability of casualties. When driving in foggy weather, drivers are usually more vigilant. They pay more attention to changes in the driving environment, especially on roads with complex changes.
As seen in
Table 5, traffic accidents between nonmotor and motor vehicles were the most likely to cause casualties among vehicle types. Mainly because compared with motor vehicles, nonmotor vehicles generally have the characteristics of relatively high speed, greater flexibility, and inconvenience in control and interference sensitivity, and their own violation cost is lower. Rider protection measures and vehicle performance are generally in a weak position. When two types of vehicles collide, the possibility of casualties from nonmotor cars is high. Moreover, some nonmotor vehicle riders lack safety awareness and often take dangerous driving behaviors according to their wishes. For example, nonmotor vehicles often share lanes with motor vehicles, leading to poor traffic order, declining road traffic efficiency, and frequent collisions between two cars. Such accidents often cause more casualties and property losses. It is necessary to strengthen road control and realize the time–space isolation of intensive conflict sets (such as setting nonbarriers for signal lights, etc.) to reduce accidents. In addition, diversified traffic safety, civilized travel propaganda, and traffic-emergency behavioral education can be carried out according to different age groups of traffic participants [
31]. In addition, it can be seen from
Table 5 that the possibility of casualties between nonmotor vehicles was the smallest because the power of nonmotor vehicles is insufficient, the speed is not fast, and the nonmotor cars are generally small in size. They could operate flexibly to avoid accidents effectively.
4.4. Analysis of Traffic Accidents on Rainy Days
It is known from the above that traffic accidents on rainy days were the most likely to cause casualties. Therefore, this study analyzed the distribution of traffic accidents on rainy days and formulated preventive measures according to the analysis results.
4.4.1. Characteristics Analysis of Accidents on Rainy Days
One hundred twenty-four traffic accidents on rainy days were screened out from the traffic accident data of Jingshi Road in Jinan city from 2020 to 2021. The vehicle types distribution of traffic accidents on rainy days is shown in
Figure 5.
As shown in
Table 6, cars and minibuses were vehicle types that caused accidents easily on rainy days. The number of accidents for minibuses accounted for 41%, and the number for cars accounted for 30%. This phenomenon was related to the composition of traffic flow. In the traffic flow of urban roads, the proportion of minibuses and cars is high, leading to these two models frequently becoming involved in traffic accidents. Additionally, in the case of heavy rain, the drainage of some urban roads is insufficient. There will be a water film between the road surface and the tire, which reduces the friction between the road surface and the tires. Therefore, the “water skiing” phenomenon will occur when the vehicle speed is too fast, resulting in more traffic accidents. The number of casualties in “car” accidents was 24, a ratio of 64.9% to the number of accidents; the number of casualties in “minibus” accidents was 7, with a balance of 13.8%. The latter casualty rate is much lower than the former [
32]. The reason is that compared to car drivers, minibus drivers have higher driving skills and safety awareness and better emergency handling capabilities for accidents, reducing the number of accident casualties. Combined with the number of accidents and casualties, the car was the most likely to cause casualties on rainy city roads.
- 2.
Grade distribution of traffic accidents on rainy days
According to relevant traffic accident regulations, rainy traffic accidents were divided into minor, general, and significant.
It can be seen in
Table 7 that the number of minor accidents was the largest among traffic accidents on rainy days, accounting for 32.3%. This phenomenon is because the selected road section was the urban road. Compared with other types of roads, urban roads have a relatively complete traffic infrastructure and a lower speed limit, thus reducing the severity of traffic accidents. In addition,
Table 6 presents that the types of vehicles with frequent traffic accidents on rainy days were cars and minibuses. The impact range of the accident is easy to control when these two types of vehicles have a traffic accident, reducing the severity of the casualties.
- 3.
Time distribution of traffic accidents on rainy days
It can be seen in
Table 8 that the periods of frequent accidents on rainy days were 8:00–12:00 and 18:00–22:00. The number of accidents accounted for 20.2% and 27.4% of the total number of rainy day accidents in 2020 and 2021, respectively. Fewer vehicles and pedestrians are on the road during this period than during the peak period, and the freedom of cars and pedestrians increases. The traffic police are also slightly less strict during this period. At the same time, the rain interferes with the drivers’ vision. The above reasons will ultimately increase the potential for accidents. Casualties mainly occurred during the periods of 4:00–6:00, 8:00–10:00, and 18:00–20:00. The number of casualties within 8:00–12:00 and 18:00–22:00 accounted for 25.4% and 28.8% of rainy day accident casualties, respectively. According to the above, these two time periods were also accident-prone periods on rainy days. Deaths between 4:00–6:00 accounted for 11.9% of rain-related accident casualties. As the period is early in the morning, vehicles and pedestrians are sparse, vehicles are free and fast, and the impact of the vehicles in the event of an accident is more significant, resulting in more severe accident casualties. From the above analysis, we can see that 8:00–12:00 and 18:00–22:00 were the most likely times to lead to traffic accidents with casualties (
Figure 6).
4.4.2. Cause of Accidents on Rainy Days
The wiper can only keep the vehicle windshield clear and ensure the drivers’ front line of sight when driving in rainy weather, which leads to the drivers’ poor judgments on the side of the vehicle. After rainfall, dirty water will accumulate on the road, covering the road markings and signs and therefore causing the driver to make a misjudgment. The rain curtain and noise caused by rainfall will affect the drivers’ sight distance and judgment ability when heavy rain occurs.
- 2.
Influence of road-surface-gathered water
As mentioned above, ponding will form on the road after rainfall. When driving, the water between the vehicle tires and the road surface cannot be eliminated. The water will also exert pressure on the tire when the vehicles impose stress on the road surface, which causes a layer of a water film between the two. The appearance of water film will reduce road friction and the grip effect. Therefore, the possibility of the vehicle going out of control will increase when the cars adopt sharp braking and turning. In rainstorm weather, excessive water accumulation will appear on the road. There may be water in the exhaust pipe or brake friction plate when the vehicle passes through the ponding section, which will lead to the dangerous situation of vehicle ignition failure or brake failure, and further increase the probability of traffic accidents.
- 3.
Increase in vehicle braking distance
As mentioned in 2, rainy weather would reduce road friction and adhesion coefficient, leading to a change in vehicle braking distance. The braking distance formula is:
where
is the speed,
is the adhesion coefficient, and
represents the upslope and slope.
As expressed in the formula, in the case of the same speed and uphill and downhill conditions, the smaller the friction force, the larger the braking distance. So, the reduction of friction will increase the probability of traffic accidents.
- 4.
Reduced environmental visibility
On rainy days, rainfall intensity will affect the incidence of traffic accidents because ambient visibility gradually decreases as rainfall increases. Additionally, because most of our roads are made of asphalt, when driving at high speed, the wheels will roll on the water film, thus splashing water droplets and generating water mist of different sizes around the vehicle, which reduce visibility. In the driving process, low visibility will lead to a decrease in the driver’s sight distance and the decrease in the visibility of signs and markings. Driving safety will be affected when the visibility drops to 1000 m.
4.4.3. Governance Measures
It is suggested that drivers pay attention to weather changes and take preventive measures according to weather forecasts. Specifically, they should strengthen vehicle maintenance, check/replace wipers frequently, pay attention to tire wear, and ensure that the vehicle remains in good technical condition. Moreover, the correct use of lights in every unexpected situation is also advised to prevent transmitting wrong information to the surrounding cars. At the same time, drivers should improve their driving skills and emergency handling ability to avoid traffic accidents and casualties.
- 2.
Strengthen the drainage capacity of roads
Road surface ponding has a tremendous negative impact on vehicles driving on rainy days. Taking reasonable measures to reduce road surface ponding can effectively reduce traffic accidents on rainy days.
Road drainage mainly includes road surface drainage and road internal drainage. As for the road surface drainage, the drainage capacity of the drainage pipe network can be enhanced by broadening the diameter of the drainage pipe and increasing the outlet and the jet groove. As for the internal drainage of the road surface, new materials can be selected to enhance the drainage capacity of the road, such as adding recycled carbon fiber materials to concrete. This new material ensures the road drainage capacity and road firmness.
- 3.
Policy management
Government managers should carry out a reasonable regulatory policy for the change of visibility and adhesion coefficient of sight distance markings caused by the decrease in visibility and pavement friction coefficient on rainy days. For example, lane management is carried out to limit lane change and reduce vehicle conflicts according to the road traffic volume distribution. At the same time, traffic flow characteristics also significantly impact the probability of road conflict risk, and government managers should consider the differences between main lanes and lane differences to reduce the occurrence of road conflict [
33]. In addition, the traffic management department can also evaluate road traffic conditions, and this assessment uses the following conflict indexes to establish a model: mean time to correct (MTTC), postclosure time (PET), and deceleration rate to void a crash (DRAC). This model predicts real-time safety indicators such as joint return level, concomitant return level, and collision risk, reducing road conflicts [
34]. If some roads are crowded and congested, vehicles on the streets can drive to the normal streets according to the guidance methods, such as map navigation and traffic police command, to reduce traffic congestion.
- 4.
Autopilot technology
Automated driving technology is more efficient and safer than the traditional artificial driving mode. Therefore, connected and autonomous vehicles (CAVs) have always been vital research directions in road traffic safety operations. Some scholars focused on the interaction between autonomous driving trajectory planning and mixed traffic flow. It presented a reinforcement-learning modeling approach to optimize the safety of mixed traffic flow on a single-vehicle road controlled by a signalized intersection [
6]. Government managers can develop a coordination policy based on “catch-up” to gradually increase the penetration rate of the CAV market to reduce traffic oscillation and improve road capacity [
35]. In addition, a two-level optimization method for CAVs can optimize vehicle scheduling and then carry out optimal trajectory control to reduce vehicle delay and alleviate traffic conflicts [
36].