1. Introduction
Weather events are arbitrary and this makes it difficult to incorporate weather parameters into transportation models. The weather parameters include snowfall, temperature, precipitation, rainfall, smoke, wind speed, slit, hail, fog, lightning, etc. The weather has a significant impact on both roadway and its operation. The roadway impacts include obstruction in the lane change, significant damage to infrastructure, reduced visibility, etc. On the other hand, reduction in capacity and speed, increased delay, restrictions and closures, are the effects on traffic operation. Extreme weather events like lightning and extreme temperatures are responsible for the loss of communication/power and failure in traffic control. Further, choice of mode, driver behaviour and adaptability during different weather conditions influence the weather impact on mobility, safety and journey time.
Many researchers have worked on the analysis of the weather impacts on the variations of traffic volumes. In Scotland, Hassan and Barker [
1] studied the changes in the average weekday traffic activity. In abnormal conditions of extreme temperatures, seasonally changed daylight hours and precipitation, they observed that variations in activity were less than 3% compared to normal conditions. While the snow was lying on the ground, there was an average decrease of 10% in traffic activity during weekdays. Angel and Sando [
2] found that mean travel speeds have been reduced during rainfall events on Florida highways including the fact that increasing the intensity of rain caused greater reductions in speed. Furthermore, the results indicate that the hour of the day is a factor affecting the reduction of speed and traffic volumes by motorists during rainy conditions. This trend is more significant during peak hours. Singhal and Camille [
3] reported that weather event affects transit ridership most during the weekday evening peak time (16:00–18:00) followed by midday peak duration (10:00–16:00), and least the morning peak period (06:00–10:00).
Traffic stream variations have been observed due to severe winter weather conditions in Canada and the northern regions of the USA [
4,
5,
6,
7,
8,
9]. Datla and Sharma [
4] focused on the effects caused by winter climate conditions (snowfall and cold temperatures) on total traffic flow in the Alberta Highway network, Canada. They reported that significant changes in traffic variations were observed during the winter season, and the extent of change relies upon the locations of highway site, type of highway, the day of the week, the time of the day, and severity of weather conditions. The results indicated that the minimum decreases in traffic activity, up to 14%, were observed on commuter type highways due mainly to colder temperatures. On the other hand, the highest reductions, up to 31%, were experienced on recreational type highways. The effect of cold on traffic volume reduction raged from 6–10% during peak hours in commuter highways. In most cases, the reduction during off-peak hours ranging from 10–15% is higher than peak hours’ reduction. On the contrary, in the case of recreational type highways, the peak hours’ reduction ranging from 30–58% is higher than the off-peak hours that ranged from 18–30%.
Most of the studies in this regard are limited to total traffic only. Research on truck traffic variations related to extreme winter conditions is valuable but scarce. An investigation on truck traffic pattern changes could be valuable to provide input to support purposes such as designing changes in highway and developing sustainable winter highway maintenance programs, geometric design of highway, highway life cycle analysis, and so forth. This study focuses on the possible truck traffic volume variations at six different highway sites in the Alberta highway network in Canada for the study period from 2005 to 2009. An investigation of the impact of snowy days on daily truck traffic volume factor is carried out using data collected from the different type of highway facilities. In Canada, Roh and Sharma [
10] studied variations in highway truck traffic triggered by serious winter weather conditions. In their study, the traffic volume factor (
yi) was estimated with Equation (1) by considering snowfall, temperature, and expected daily volume factor (EDVF) as independent variables in the dummy variable regression model. It is revealed that the EDVF variable plays an important role to explain the variation of the estimated traffic volume because it contains systematic changes in historical data.
Roh and Sharma [
11] conducted a similar study on passenger car and truck traffic using the same study site with this research. They proposed another linear model written in Equation (2) to quantify the effect of snow and temperature on traffic volume changes.
In the model, temperature and snowfall were taken as continuous independent variables. The model was specified to capture the interaction effect of snowfall and temperature by cooperating with a term composed of the product of two variables. However, still, the model contains the same independent variable,
EDVF, and the term explains most of the variations in traffic volume changes. Therefore, the sensitivity of the effect of snowfall and temperature on traffic variations is not captured well and thus novel statistical approach is recommended to overcome the disadvantage of the models proposed in past studies. Based on these drawbacks of the past studies, in this research, the objective is to focus on the effects of snow and temperature on traffic volume changes with the methodological help of Maximal Information Coefficient (MIC), which stems from the maximal information-based nonparametric exploration (MINE) statistics created by Reshef and Reshef [
12]. MIC measures the strength of the relationship between two variables X and Y and has been utilized as a non-parametric statistical method to recognize a novel relationship among huge information collected.
2. Literature Review
This section explains prior research in the field of climate effects on traffic. Studies related to the impact of weather events such as rain, snow and temperature on traffic volume have been released with some significant results. In the research conducted by Keay and Simmond [
13], an examination regarding the impact of climate factor on traffic volume at Melbourne, Australia indicated rainfall is a factor that is correlated significantly with traffic volume changes. The amounts of decrease caused by rainfall in traffic volume were found to be 2.11% in spring and 1.35% in winter respectively. There were traffic volume reductions of 2–3% for the precipitation that ranged from 2 mm to 10 mm; the highest reduction is 3.43% for the rainfall ranging from 2–5 mm in spring. Samba [
14] investigated seven locations in Virginia and Minnesota and found that snow triggered traffic volume reduction up to 70% while the rain made 20% or less reduction. The amount of decrease in traffic volume fluctuated according to the time of day. During morning peak hours, little decreases were observed. Conversely, during early evening hours and afternoon peak hours, significant amounts of reductions ranging from 17–76% are observed. In the study of Sando [
2] for the Florida roads, it was observed that the average amount of decrease in speed due to rain event ranged from 1.5 to 3.5 mph for the control section of I-95, and it ranged from 1.8–2.5 mph for the control section of I-295. Hourly traffic volumes were also influenced by wet weather conditions. Average traffic demand is decreased by 2.5–10.7% under wet weather conditions along the I-95 and by 5.5–12.5% along the I-295.
Knapp and Smithson [
5] identified that the average reductions in traffic ranged from 16–47% for various storm occasions. Hanbali and Kuemmel [
15] have gathered information at 11 areas in the United States to investigate the impact of winter storm using the three months of data in 1991 and found that traffic volume is decreased on a weekday by 7–17% with light snow, but with substantial snow by 53%, and on ends of the week by 19–31% and 56%, respectively. Maze [
16] found a solid relationship between perceivability, wind speed and decrease in traffic volume in his study on Interstate Highway 35 in Iowa during snowy days. He found a 20% lessening in activity amid cold days with great perceivability and low wind speed. The decreases are around 80% when wind speed is high (64.37 km/h), with perceivability under 0.40 km.
In the study by Datla and Sharma [
4], reductions in traffic volume have been indicated clearly with detailing on an hourly basis. In their model (see Equation (3)), the independent variable Expected Volume Factor (EVF) contribution was significantly higher as compared to snow and cold while estimating the daily traffic volume. Roh et al. [
10,
11,
17] also used similar models (see Equations (1) and (2)) and it was found that the EDVF can explain about 78% of classified traffic variation. These studies along with other similar studies as shown in
Table 1 fail to explain the relationship between the traffic variation, snowfall and temperature. Roh’s [
17] work revealed that the linear association between snowfall and truck traffic volume is very low particularly in the case of a commuter type of road, although a linear model structure (see Equations (1) and (2)) was proposed in his study. Therefore, it would be worthwhile to detect a new relationship between weather variables and traffic volume variations with consideration of road type.
where,
VFWDYCG is the daily volume factor of a particular day group “D” in the week “W” of a year “Y” for the PTC site “C” that belongs to road type “G”.
EVFWDCG is the average
VFWDYCG calculated using all available years of data.
Unlike previous studies, we attempted to explore the dependence of traffic volume on snowfall and cold temperature, by the use of Maximal Information Coefficient (MIC) and MINE Statistics. Reshef et al. [
12] developed the MINE statistics and applied them to many data sets in diverse fields. In their study, they could identify some strong relationships that were unidentified by other correlation parameters like Pearson, Spearman, Kraskov Mutual Information, Principal curve-based estimator, etc. Later, MINE statistics were widely used in many other studies. Omar [
18] identified many linear and nonlinear relationships in his cluster analysis of left ventricular and left atrial mechanical deformation parameters. Valenzo [
19] utilized MIC in the estimation of Brain–heart linear and nonlinear coupling. Zhouzhou [
20] used MIC to allow the computer to automatically identify some simple shapes of the final forms (spatial layouts) of the design (such as a circle or a U shape) and generate the correlation of each shape with the corresponding design performance.
3. MIC, MINE Concept
Reshef et al. [
12] proposed a novel coefficient known as the Maximal Information Coefficient (MIC) to measure the relationship between two variables. MIC identifies both functional and non-functional relationships for a bivariate data set. MIC is a score ranging from 0 to 1. If the value is close to 1, a functional relationship can be established between the two variables and MIC value roughly equal to the coefficient of determination (
R2) for that functional relationship. MIC satisfies two heuristic properties: generality and equitability. This means MIC is capable of suggesting all types of functional relationships and it gives a similar score to relations of different types at similar noise levels.
Table 2 [
12] represents some sample MIC values with varying noise levels for several functional relationships. Additionally to MIC, Reshef et al. [
12,
21] developed other statistics, which are collectively referred to as Maximal information-based Nonparametric Exploration (MINE) statistics. These statistics when used collectively help us in detecting and categorizing relationships. In their study [
12], MIC and other MINE statistics were applied to data sets for global health, yeast gene expression profiles, 2008 major-league baseball season, and bacteria levels in the human gut microbiota and consequently identified known and novel relationships.
MIC explores the relationship between two variables. A grid is drawn on the bi-variate scatterplot to illustrate the relationship [
4]. Additional statistics such as Maximum Asymmetric Score (MAS), Maximum Edge Value (MEV) and Maximum Cell Number (MCN) can be computed to determine the functional relationship between two variables. MAS represents the deviation from monotonicity. MEV suggests the closeness to a function, whereas MCN indicates the complexity of the closest function.
5. Methodology
This study focuses on exploring the new relationship between truck traffic volume and weather parameters during weekdays with the help of the Maximum correlation coefficient (MIC). Daily volume factor (ratio of daily volume to average annual daily traffic) is used for analysis instead of absolute volume to take care of yearly variation in traffic volume. The variance in truck traffic volume factors (TRVF) for a specific day in a week was found to be lower when compared to variance TRVFs taken for all the days (weekday and weekend inclusive). Therefore, weekday volume is considered for analysis. The analysis is performed in two stages:
Six different highways in Canada have been considered for analysis, ED7, FM6, LE5, LV4, RD3, and VI8, using the data from 2005–2009. Across all the highways, all snowy days (days on which snow was greater than 0 cm) have been identified initially. We generated a new set of variables that relate snow (S
i), temperature (T
i) and truck volume factor (TRVF
i) of the ith day of a week to the values (S
i-7, T
i-7, TRVF
i-7) in the previous week. The notations have been diagrammatically represented in
Figure 1. Three new variables: Snow difference, Temperature difference, and Truck Volume Factor (TRVF) Difference are defined as follows:
- 1.
Snow Difference:
It is the difference in “snowfall on day of observation” and “snowfall on same weekday in the previous week”.
where
Si = Snow on the ith day
Si-7 = Snow on (i-7)th day (effectively same weekday in the previous week)
- 2.
Temperature Difference:
It is the difference in “Temperature on day of observation” and “Temperature on same weekday in the previous week”.
where
Ti = Temperature on ith day
Ti-7 = Temperature on (i-7)th day (effectively same weekday in the previous week)
- 3.
TRVF Difference:
It is the difference in “Truck traffic volume factor” on day of observation” and “Truck traffic volume factor” on the same weekday in the previous week”.
where
TRVFi = Truck traffic volume factor on ith day
TRVFi-7 = Truck traffic volume factor on (i-7)th day (effectively same weekday in the previous week)
For all the snowy days in each highway, the above-mentioned variable values were calculated, and based on their values, we categorized them into nine cases as defined below:
C1
Number of cases (days on which) when ∆Si > 0 (snowfall increases), ∆Ti > 0 (temperature increases) and ∆TRVFi > 0 (traffic count increases)
C2
Number of cases (days on which) when ∆Si > 0 (snowfall increases), ∆Ti > 0 (temperature increases) and ∆TRVFi < 0 (traffic count decreases)
C3
Number of cases (days on which) when ∆Si < 0 (snowfall decreases), ∆Ti > 0(temperature increases) and ∆TRVFi > 0 (traffic count increases)
C4
Number of cases (days on which) when ∆Si < 0 (snowfall decreases), ∆Ti > 0 (temperature increases) and ∆TRVFi < 0 (traffic count decreases)
C5
Number of cases (days on which) when ∆Si < 0 (snowfall decreases), ∆Ti < 0(temperature decreases) and ∆TRVFi > 0 (traffic count increases)
C6
Number of cases (days on which) when ∆Si < 0 (snowfall decreases), ∆Ti < 0 (temperature decreases) and ∆TRVFi < 0 (traffic decrease)
C7
Number of cases (days on which) when ∆Si > 0 (snowfall increases), ∆Ti < 0 (temperature decreases) and ∆TRVFi > 0 (traffic count increases)
C8
Number of cases (days on which) when ∆Si > 0 (snowfall increases), ∆Ti < 0 (temperature decreases) and ∆TRVFi < 0 (traffic count decreases)
C9
Number of cases for which, ∆Si = 0 or ∆Ti = 0
Table 4. presents a summary of these case definitions. Among these nine cases, MINE analysis was performed on potential cases to derive meaningful interpretations.
Figure 2 presents the different combinations of these eight case scenarios.
6. Results and Analysis
The results indicate a higher number of cases favoring C8 in most of the highways (See
Table 5). This hypothesizes that, during snowy days, when snowfall is higher and the temperature is lower, the truck volume experiences reduction. Past studies reports this is true in the case of total traffic, however, this effect has never been strong in the case of truck traffic. It is also observed that most of the cases belong to C1, C2, C7, and C8. These four categories have a common attribute that increases in snowfall. The higher number of cases for C1 than C2 implies that an increase in temperature caused an increase in traffic volume. Even though it is evident that snow has an impact on traffic reduction, it is important to determine the strength of the correlation between snow and truck traffic.
To further explore the C8 cases, MINE.r application in R software [
17] was used over C8 cases for all the highways, with snow as an independent variable and truck volume factor (TRVF) as a dependent variable.
Table 6 presents the results from MINE analysis on snow and truck volume factor.
Results in
Table 6 suggest that there is no linear relationship between snowfall and truck traffic volume as opposed to the model presented in Equations (1) and (2). From the sample values of MINE statistics mentioned in Reshef’s [
8] study, it may be understood that the closest function that can be mapped to MIC of 0.412 would be that of a circle for the site LV4. Since such a function does not have practical significance (as it would predict two-volume factors for single snow value), it is desired to further conduct hourly analysis to explore a new relationship between snowfall and truck volume.