1. Introduction
Annually, about 50% of vehicle occupants who are killed in vehicle crashes in the U.S. are unrestrained drivers and passengers [
1]. This is despite the importance of buckling up, as seatbelt usage could reduce front seat fatalities by as much as 60% [
2]. If all front-seat occupants in the US used their seatbelt, this is expected to have a 41% reduction in fatalities [
3]. Although seatbelt use in the U.S. has reached about 89% [
4], the number of buckled drivers in Wyoming is only about 79%. Over the years, an increase in the number of vehicle miles traveled per year has underlined the importance of identifying factors that impact seatbelt use.
Buckling up is mainly a result of factors such as attitudes and behaviors of drivers, which might result from various individual-specific demographic and background characteristics. Some drivers view buckling up as an inconvenience, whereas others might view that as a discomfort with no reward. Given the importance of buckling up, extensive efforts have been already made to identify factors in seatbelt choice, and how those factors increase the propensity of seatbelt usage. For instance, self-reported seatbelt use among front-seat passengers was evaluated in past studies [
5]. It was found that attitudes and the subjective norm have a positive relation with the intention of seatbelt use for both urban and rural roads.
Various factors have been linked to the choice of not buckling up in the literature reviewed. For instance, the psychological consideration of seatbelt use was evaluated in terms of experience and cognition [
6]. It was found that the decision of not buckling up might be related to the belief that it is irrational to bear the slight cost of buckling up for the reward of partial protection. Additionally, when drivers experience repeated safe trips, they might believe that crashes occasionally happen to others, which might contribute to the lack of seatbelt usage. For instance, the psychological aspect of driving indicates that each safe trip reinforces the non-use of seatbelts [
6].
A past study was conducted to investigate seatbelt wearing compliance across road users in Malaysia. Gender, time of the day and type of vehicles were some of the factors impacting seatbelt use [
7]. It should be noted that the study was cross-sectional, and just a descriptive summary of recorded observations was presented. The impact of demographic factors on seatbelt use by injured adults in crashes was reported [
8]. The data used in that study were from injured adults who were admitted to a trauma center. Standard logistic regression was used for the purpose of the analysis. The results indicated that those drivers making more yearly income, female and white drivers are more likely to have their seatbelts on while involved in crashes.
Occupation, education, driver age, gender, type and make of vehicles, road surface condition and type of roadway were some of the factors which were found to impact the likelihood of seatbelt use in another study [
9]. The collected data included a total of 1427 motor vehicles, and a descriptive summary of the data was presented in the study. In another study, occupant seatbelt use in Ghanaian University campus was recorded [
10]. The data were collected by an unobstructive survey by collecting the information from 5589 vehicles. It was found that vehicle type and gender were two factors that impact the usage of seatbelts. In that study, the chi-squared test was used to establish the relationship between seatbelt use and vehicle type. The results of a study conducted in Tennessee implied that seatbelt users can be heavily impacted by type, sex and belted driver status [
11].
A comprehensive evaluation of the front seat passenger seatbelt use was evaluated in another study [
12]. Various techniques such as the mixed model, accounting for scale and taste heterogeneity, were considered and compared. It was found that time of travel, vehicle type, and drivers’ gender are some of factors impacting the choice of passengers’ seatbelt use. Additionally, other studies used seatbelt datasets as a case study to evaluate the applicability of various advanced statistical techniques in modeling individual-specific observations [
13,
14,
15].
Given the sparse literature on the choice of seatbelt use, and the continuing rise in the number of vehicles, there is a need for safety engineers and decision makers to understand factors impacting the choice of seatbelt use so that appropriate countermeasures could be taken to tackle them. Despite the importance of buckling up, the majority of past studies mainly used traditional methods, which cannot account for the data heterogeneity, and consequently might result in biased or even erroneous point estimates.
Although the mixed model gives a higher flexibility to the estimation of model parameters, compared with the standard logit model, the mixed model might still be considered as a poor option for some drivers based on their gender: for example, they care less or more about seatbelt use. Here, to capture the possible heterogeneity based on the observed variables, the mixed model is extended to allow the means of the random coefficients to depend on other observed characteristics of drivers.
The focus of this study is only on seatbelt use in the state of Wyoming, located in the western and mountainous area in the U.S. To answer the question of factors impacting the choice of seatbelt use, the data regarding various sociodemographic characteristics of drivers and environmental and vehicle characteristics were used for the analysis. It should be reiterated that various vehicles and environmental characteristics were incorporated into the analysis as they could account for unseen factors that were not recorded at the time of the crashes. For instance, vehicle type might provide information regarding the incomes or personalities of drivers.
2. Data
The dataset used in this study was collected in 2019 in Wyoming. The observers, who collected the data, were trained before data collection to be conformed to the criteria highlighted for the state observational seatbelt issued in 2011 by the National Highway Traffic Safety Administration [
16]. The trained observers were placed in 289 sites within 17 counties in Wyoming, where they collected data related to a total of 18,286 vehicles.
The dependent variable in this study corresponds to the status of drivers’ seatbelt use at the time of data collection. The seatbelt usage in the state varies from a low of 63.5% belted in Sweetwater County to a high of 97.8% belted in Niobrara County. The dataset contains information regarding the drivers’ characteristics such as gender and various weather and roadway characteristics that are collected by the observers.
A set of dummy variables indicating various drivers, vehicles and weather characteristics were created and used. While creating dummy variables, we made sure that the n-1 rule for the number of dummy variables is fulfilled. The data were collected by the observers within the span of two hours from 7:30 a.m. to 5:30 p.m. (e.g., 7:30–9:30 a.m. to 3:30–5:30 p.m.). Various dummy variables of vehicle types include auto, sport utility vehicle (SUV), van and pickup truck.
Weather condition was recorded at the time of data collection by the observers. Those include weather conditions such as rain, snow, heavy rain or occasional rain. For instance, as can be seen from
Table 1, the majority of the data were collected in rural areas (mean = 1.7), and a significant proportion of the vehicles were pickup trucks (mean = 0.41).
3. Methodology
The logit model is a traditional method, which has been mostly used to model the choice of seatbelt use [
17]. However, one of the main shortcomings of the standard logit model is its inability to capture the possible unobserved heterogeneity across individual observations.
The random parameter binary logit model, on the other hand, assumes that the coefficients vary across individual drivers based on some continuous distribution [
18]. Here, the interests of the random parameters model are the first moment of the distribution, or the means of the parameters, and finding the second moment of distribution, or standard deviation, for capturing the unobserved factors.
This study evaluates the choice among drivers for a binary alternative of buckling up or not. To explain the mixed model, first we present the formulation of the standard logit model, and then its modification to the mixed model estimation.
The mixed model is a generalization of the standard logit model, by allowing the preference parameter to vary across each individual [
18]. The standard logit model could be written as [
19]:
where
is observed characteristics of observer
i,
is the i.i.d error term and
is a fixed coefficient. Now, the mixed logit could be extended from the standard logit model by letting
to vary across individuals of the population based on the continuous density of
f (
), where
is some parametric distribution.
It should be noted that the above formulation assumes constant random
βs for all observations based on preassigned continuous distribution. The above discussion could be modified to relax the restriction by considering the heterogeneity in means as follows [
20]:
where
is the vector of observation
i deviating from the mean, being based on an observed attribute. Additionally,
is assumed to be normally distributed.
This simulated maximum likelihood (SML) is used for the parameters’ estimates of the model. This is similar to the maximum likelihood with the difference that for estimating the random parameter coefficients, random draws are considered. The inference of the mixed model could be written as:
From the above,
has a conditional density of
, and density of
. After assuming the random parameter follows normal distribution as
, we have:
where
, and
includes parameters
of preassigned distribution, and the second part of the above equation is probability distribution function (PDF) of normal distribution. As the above equation has no close-form solution, an approximation of
could be obtained by random draws and the results would be summarized as follows:
The maximum likelihood, then, is employed to maximize the log of the above expression as [
21]:
To summarize the implementation of the model parameters, the estimation of the mixed model with heterogeneity in taste is:
where
is the initial values of random parameters, which might be set by employing the standard logistic regression,
is a matrix including the random parameters to be estimated,
is a covariate of a parameter that changes the mean of a random parameter, and
is a matrix of parameters to be estimated. On the other hand,
are random draws based on the normal distribution or random numbers based on some prime values, Halton draws.
For instance, if the mean of random parameters of gender is changed based on the mean of vehicle type, then is vehicle type. is a parameter which is estimated for a random parameter, when heterogeneity is considered, e.g., gender and vehicle type. It should be noted for just considering the standard mixed model, items of in Equation (7) would be ignored.
4. Results
The mixed model with heterogeneity in taste, standard mixed model, and standard logit model are considered and presented in
Table 2. The results of the best performed model, or the mixed model with heterogeneity in taste are presented in three subsections: fixed, random parameter and random parameters with heterogeneity in taste.
4.1. Fixed Effects
Types of vehicles including van and SUV, time of the data collection of 3:30–5:30, and number of lanes were some of the factors that found to be fixed across the population which are outlined in the next paragraphs.
4.1.1. Vehicle Types
Various types were converted into dummy variables, and those variables were found to be important to be considered in the final analysis. It was found that the drivers who drive SUVs and vans are less likely to be unbuckled, compared with other vehicles such as passenger cars. It should be noted that the effect of the SUV predictor on the reduction in being unbuckled is slightly higher,
, than the van,
. The findings somehow oppose the previous study that teenagers who drive SUVs and vans are less likely to be buckled up [
22]. However, that study only considered the data related to fatally injured teenage drivers.
4.1.2. Time of Data Collection: 3:30–5:30
We found that the drivers who drive their vehicles from 3:30 to 5:30 are more likely to be unbuckled, compared with drivers in other times. The result is somehow in line with the previous study that the later time of a day is negatively correlated with seatbelt usage [
23].
4.1.3. Number of Lanes
Number of lanes is another factor that was found to have a fixed effect on the seatbelt usage. The results indicate that when there are more lanes, drivers are more likely to wear their seatbelts. The impact might be related to the fact that often the roadways with less traffic and lanes are associated with lower enforcement, or possibly while traveling on roadways with lower traffic, drivers might think it is ok to be unbuckled.
4.2. Random Parameters
This section presents random parameters that were found to change across the population without any observed heterogeneity. The way that the random sampling is incorporated in the analysis is by the cross product of standard deviation (SD) of random parameters by Halton draws and summing the two matrices by the initial value of random parameters, as shown in Equation (8):
where
is the points’ estimates of random parameters, and
L is a diagonal matrix including starting values of the random parameters’ standard deviation. The values of random parameters estimate, and their standard deviations are estimated by the maximum likelihood process, and the
p-value is estimated from coefficients and standard error. Recall that for this random parameters’ deterministic taste, variations are constant across the population and do not change based on other observed variables.
4.2.1. Gender
The predictor of drivers’ gender indicates whether an observed driver was male as 1 or female as 2. The estimated normal distribution has a mean of −0.535 and standard deviation of 1.088. The cumulative density function (CDF) has an associated estimate of 0.69, indicating that almost 70% of male drivers are unbuckled compared with their female counterparts. This is in accordance with previous studies, which found that male drivers are more likely to be unbuckled compared with their female counterparts [
16]. However, that study did not account for the heterogeneity across observations.
4.2.2. Pickup Truck
This attribute highlights whether the drivers were driving a pickup as 1, versus others as 0. This random parameter has a mean of −0.13 and SD of 1.702, associated with CDF of 0.531. The results indicated that a slight majority of non-pickup drivers are more likely to be unbuckled. The result is in line with the previous studies that the pickup truck drivers have a lower seatbelt use compared with van and auto vehicles [
4].
4.2.3. Time of Data Collection: 1:30–3:30
The predictor of time of a day indicates whether drivers were observed to be buckled or unbuckled during 1:30–3:30 as 1, versus others. The estimated normal distribution has a mean of 0.43 with a standard deviation of 0.830. The associated CDF is 0.303, indicating that about 70% of the drivers during the times of 1:30–3:30 are likely to be unbuckled.
4.2.4. Day of a Week
The predictor of day of a week indicates whether seatbelt status was collected during weekdays as 1 versus weekend as 0. Almost 100% of the drivers during weekends were buckled up. The results are in line with the previous study that seatbelt usage is higher in weekends compared with weekdays [
24]. However, that study did not account for heterogeneity in the dataset. Results of a higher seatbelt usage might be attributed to a higher presence of cops on the roads or the fact that some drivers are accompanied by family members.
5. Heterogeneity in Tastes
The observed heterogeneity could be considered in the random parameter model by the inclusion of individual specific covariates [
21,
25]. The way that the heterogeneity in tastes work is similar to the standard random model (see Equation (2)) with the difference that the cross product of matrix parameter and vector of covariates of another of the observed variables,
, is included in the equation (see Equation (7)).
The following sections present the parameters that heterogeneity in tastes were considered. Three random parameters were found to vary based on the means of other variables. Those random parameters include roadway classification, vehicle registration and sunny weather condition. The following sections outline those variables. It should be noted that all variables were tested for the consideration of heterogeneity in means of random parameters, but only the below variables were considered due to their significance.
5.1. Road Classification
The means of this random parameter of road classification variy based on the tastes’ means of gender, day of a week and road classifications. This random parameter has a mean of −1.92 and SD of 0.887, with CDF of 0.98. This indicated that 98% of the drivers in urban areas are expected to be unbuckled.
5.1.1. ~
It is found that the mean of random parameter of road classification changes based on the mean of variable of gender. This indicated that the heterogeneity of road classification is varied based on gender across the population for status of seatbelt use. In other words, the higher the value of gender category of females (), the lower the impact of road classification of rural areas () and, consequently, the less likely the drivers are to be wearing their seatbelts. The impact of the gender mean on road classification is expected, as female drivers are less likely to drive in rural areas compared with their male counterparts, and, consequently, are more likely to wear their seatbelts.
5.1.2. ~
The mean of the day of the week is another factor that is found to impact the road classification taste heterogeneity. The results indicated that the weekday as 1 () increases the means of road classification as drivers are possibly more likely to use rural areas during the weekdays for careers. Additionally, the rural road classification () decreases the odds of being unbuckled.
5.2. Vehicle Registration
It is found that road classification and pickup trucks are determinants of vehicle registration, and they play a significant role in drivers’ decision-making process. The heterogeneity in mean of vehicle registration is varied based on the mean of two variables. This random parameter has a mean of −0.63, and SD of 1.128, with a CDF of 0.71. This indicates that 71% of drivers, with non-Wyoming vehicle registration, are more likely to be unbuckled.
This might be related to the fact that the out-of-state travelers are less likely to be buckled compared with the state drivers that drive for work or other nearby businesses. The results are somehow in line with the previous studies that seatbelt usage changes based on various traveling conditions [
26].
Additionally, as discussed by the previous study, the impact might be mainly related to the level of perceived risk for traveling condition. For instance, when state drivers, due to being experienced, consider the risk of traveling to be high due to nighttime or adverse weather conditions, they are more likely to be buckled.
5.2.1. ~
Rural types of road classification are found to increase the mean of non-Wyoming vehicle registration (). Consequently, the lower mean of vehicle registration results in a lower CDF and lower number of unbuckled non-Wyoming drivers.
5.2.2. ~
The type of vehicle, (pickup truck) , is found to impact the mean of vehicle registration (). In other words, driving a pickup truck was found to decrease the mean of non-Wyoming vehicle registration. Consequently, that effect decreases the impact of the non-Wyoming vehicle registration and the likelihood of being unbuckled.
5.2.3. ~
The time of 1:30–3:30 is found to be the only indicator that significantly impacts the heterogeneity in the mean of random parameters of sunny weather conditions. Sunny-weather condition has a mean of 0.43 and SD of 0.830, with CDF of 0.30. The value of CDF indicates that in 30% of non-sunny condition cases, the drivers are unbuckled. The impact might result from the fact that the drivers consider non-sunny weather conditions to be riskier, and are thus less likely to be unbuckled. The uncertainty in the significance of SD of sunny weather conditions should be taken into consideration (p-value = 0.1).
5.3. Models Comparison
For comparison purposes, in addition to the mixed model with heterogeneity in tastes, models including the standard mixed model, and the standard logit model were included (see
Table 2). From
Table 2, the mixed model with heterogeneity in taste (Akaike information criterion (AIC) = 17,976) has a better fit compared with the standard mixed model (AIC = 18,023) and the standard logit model (AIC = 18,071).
Based on the results, accounting for heterogeneity is important, moving to the more advanced technique of mixed model with heterogeneity in taste from the standard logit model. That is expected as the choice of seatbelt use involved a more complex relationship across various variables. AIC and Bayesian information criterion (BIC) were used due to reliabilities by penalizing for the number of included predictors.
A question arises: Why does the mixed model with heterogeneity in taste perform better compared with the standard mixed model? This might result from complexity of the choice that drivers make regarding the seatbelt usage, which the mixed model could only account for inefficiently. This complexity might end up in error terms, and decrease the goodness of fit, if they are not accounted for. This could be observed in the variation of the point estimates, and even the significance of the variables.
This is despite the fact that for fixed parameters in standard mixed, and mixed models, accounting for heterogeneity in taste, point estimates of fixed effects are almost identical. The standard deviation for the standard mixed model is significant for some random variables, e.g., sunny weather conditions, whereas they are not for the mixed model with heterogeneity in taste. That implies that those coefficients varying across observations based on some observed attributes, and thus the constant distribution mean might provide biased point estimates regarding the impacts of those variables.
Finally, considering the standard mixed model and mixed model with heterogeneity in tastes, the percentage of unbuckled male drivers changes from 69% to 83% for the mixed model with heterogeneity in taste and the standard mixed model, respectively. This is an indication that there are differences in the magnitudes of effects for various approaches.
6. Discussion
A lack of seatbelt usage is a major safety concern for policy makers. Thus, this study employed three methods of the standard logit, mixed model and the mixed model with heterogeneity in taste to explore the determinants of drivers’ choice of bucking up in a less-biased way. The response variable is whether drivers were buckled, as a reference category, or not. In the mixed approach, heterogeneity is accounted for by assuming that parameters (or the tastes of the individuals) vary across the population based on some pre-specified continuous distribution, which defines continuous segmentation of preferences.
Although the standard mixed model helps to capture the unobserved heterogeneity, the mixed model, accounting for the heterogeneity in tastes, could also help to capture observed heterogeneity [
27]. Based on the obtained results, we found that moving from the standard logit model to the standard mixed model and the mixed model, with heterogeneity in taste, allows new insights to emerge, and also results in an enhancement of the model’s overall fit.
The results indicated that the choice of seatbelt usage is a complex task, which depend on a number of factors. For instance, vehicle types, time of a day and number of lanes were some of the fixed factors that were found to impact the drivers’ choice of seatbelt usage. We found that SUV and vans drivers are less likely to be unbuckled. That result is against the previous study that those drivers are less likely to be buckled up [
22]. Again, the reason for the impacts might be due to many unseen factors that were not recorded at the time of data collection.
For instance, the reason why SUV users have higher seatbelt use might be due to a possible reason that those vehicles are primarily used by families. Another reason could be that factor highlights drivers with other specific characteristics that were not recorded at the time of data collection. For instance, it was highlighted in the literature that 80% of SUV owners live in urban areas [
28]. Additionally, it was discussed that women carry out one-third of all SUV purchases [
28], and that SUV drivers are less likely to be frustrated and those are overrepresented across highly educated individuals with higher income [
29]. More studies are needed to highlight the underlying factors behind the impact.
In addition, although the impact of gender was found to be random, the distribution of that variable highlighted the fact that male drivers are less likely to be buckled. The impact might be due to risk aversion between genders as various genders are expected to have different approaches toward risk-taking behaviors. The variation across genders in risk-taking behavior were explored in different contexts, e.g., investment [
30]. The difference has been linked to cognitive and noncognitive traits on which men and women differ [
31]. However, it should be noted that the impact for gender in risk-taking behaviors in the majority of past studies was considered as fixed.
Few variables were found to have unobserved heterogeneities in their means across the drivers’ population. Those include gender, time of a day, day of a week and type of vehicles. Most of the outcomes are in line with the previous studies. For instance, as expected, females in general are more inclined to be buckled [
32].
However, it is found that for some variables, heterogeneities are not constant across the population, and their means change across the population based on some observed heterogeneities. These include sunny weather conditions, road classification and vehicle registration. For instance, observed heterogeneity exists in the road classification: the heterogeneity of the mean of road classification is a function of gender and day of the week. Variation across gender in terms of the use of various road types and the complexity of travel behavior were discussed in the literature [
33]. It was found that women have lower levels of individual access to urban opportunities.
Finally, it should be reiterated that the inclusion of various environmental and vehicle types was considered in the analysis as they account for unobserved factors that the observers failed to record. For instance, the choice of drivers in buckling up in various days or environmental conditions or even type of vehicles they own, could provide information regarding the psychology and drivers’ attributes that we could not, or at least, would be very challenging, to record.
7. Recommendations
The findings of this study have important implications for the policy makers in the state. The results show that the governmental bodies should pay more attention to the underlying impact of the discussed variables. For instance, it was found that various demographic and environmental conditions impact the use of seatbelts. It is expected that the impacts of those factors are related to the psychology of drivers and how drivers are influenced by various environmental and drivers-related characteristics.
For instance, the fact that drivers in rural areas are more likely to be unbuckled is expected to be related to the drivers with non-Wyoming plate registration. This could be an indication that travelers from outside the state that come to Wyoming for a possible reason of leisure, are more prone to be unbuckled. Therefore, more attention should be given to the seatbelt use of outside-the-state travelers by placing billboards about seatbelt use at state borders. The importance of the role of communication by factors such as billboard, and media were discussed in the past study [
34].
One of the most effective population-based interventions for improving seatbelt use is seatbelt laws and the related enforcement [
35]. For instance, the literature review highlighted the importance of primary enforcement of seatbelt laws, where the officers stop drivers who are not wearing seatbelt and ticket them compared with the time the ticket issuance is after vehicle was stopped for other reasons [
36]. Thus, the highway patrol in the state could have an influential impact on the use of seatbelts, especially by targeting the right group.
Additionally, research suggests that enhanced enforcement through communication and outreach is associated with higher seatbelt usage [
37]. Educational programs are recommended for nonresident travelers of the state, especially regarding the danger of driving in a mountainous area with adverse conditions such as Wyoming.
Educational programs are especially important as it is expected that many road users might not be fully aware of the importance of the seatbelt use in prevention of crash fatalities and injuries. More education programs and training are needed to provide the drivers with the information regarding the importance of seatbelt use. The importance of education programs has been discussed extensively in the past studies. For instance, the impact of early elementary school safety education was evaluated and found to be successful on family seatbelt use [
38].
Educational programs are especially important as many drivers might not be fully aware of the importance of seatbelt use in reduction in crash severity. That could especially be employed by the DOT while training and issuing the driver’s license.
It is expected that the seatbelt policy and program initiatives will be associated with a reduction in occupant fatalities, especially in Wyoming, with one of the highest fatalities rates in the states. Especially the changes of regulations are important as highlighted that the importance of governmental legislation within each state in the reduction in related seatbelt injuries [
39]. More studies are needed to highlight the effectiveness of various possible countermeasures.