Next Article in Journal
Developing a Framework for the Sustainability Assessment of Urban Transportation and Its Implementation
Previous Article in Journal
Cargo Bikes and Van Deliveries in Rome: A Comparative Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling the Severity of Crashes in Rainy Weather by Driver Gender and Crash Type

1
Department of Civil Engineering, Shahid Bahonar University of Kerman, Kerman 7616914111, Iran
2
Department of Civil, Environmental and Construction Engineering, Texas Tech University, Lubbock, TX 79409, USA
3
Department of Civil, Construction, and Environmental Engineering, University of Delaware, Newark, DE 19716, USA
4
Department of Civil, Construction, and Environmental Engineering, University of Alabama at Birmingham, Birmingham, AL 35294, USA
5
Department of Civil Engineering, Sharif University of Technology, Tehran 1458889694, Iran
6
Department of Civil and Environmental Engineering, State College, The Pennsylvania State University, University Park, PA 16801, USA
7
Department of Civil and Environmental Engineering, Tarbiat Modares University, Tehran 1411944961, Iran
*
Author to whom correspondence should be addressed.
Future Transp. 2025, 5(4), 146; https://doi.org/10.3390/futuretransp5040146
Submission received: 31 August 2025 / Revised: 4 October 2025 / Accepted: 13 October 2025 / Published: 16 October 2025

Abstract

Rainy weather conditions can have significant impact on the severity and frequency of traffic crashes. This study investigated factors that influence the severity of vehicle crashes during rainy weather in California. Data from 23,242 rain-related crashes in California were taken from the Highway Safety Information System (HSIS) database. The data was divided into 12 groups based on driver gender (male and female) and crash type (six categories: rear-end, hit object, sideswipe, overturned, head-on, and broadside). Each group was assigned a logistic regression model for crash severity (Property Damage Only (PDO) vs. injuries or fatalities (NotPDO)) yielding 12 models for various combinations of driver gender and crash types. Results indicate that factors such as the number of vehicles involved, vehicle manufacturing year, annual average daily traffic (AADT), road topography, season of crash, number of lanes, and driver age group all significantly influenced crash severity across various scenarios. These findings provide detailed insights into how various factors contribute to crash severity in different scenarios, allowing policymakers to develop targeted interventions. Policymakers can utilize the findings of this study to implement targeted measures in areas with high frequencies of specific crash types, particularly during adverse environmental conditions.

1. Introduction

Traffic crashes represent a persistent and significant global public health challenge, resulting in millions of fatalities, injuries, and disabilities each year [1]. In the United States, these incidents are a leading cause of death, particularly among young and middle-aged individuals, highlighting the critical need for effective mitigation strategies [2]. California, with its high population density and heavy traffic volume, faces particularly acute challenges in managing road safety, making it a key region for targeted research [3]. A thorough analysis of the factors that influence crash occurrence and severity is therefore essential for developing effective, tailored interventions.
Weather conditions are well-established contributors to traffic accidents, with rainy weather posing an especially significant risk. Recent data from the National Highway Traffic Safety Administration [NHTSA] (2022) indicates that adverse weather conditions account for a significant proportion of crashes, with rain being a particularly common factor [4]. This highlights the importance of understanding the underlying mechanisms that increase accident risk and severity during rainy conditions [5]. Weather-related accidents are more likely to result in severe outcomes than those that occur in clear conditions, making this a critical area of focus [6]. Additionally, the increased mental and physical effort required while driving in adverse weather has been shown to compromise driver performance and safety [7].
Numerous factors have been identified as influencing crash severity in adverse weather. These include reduced visibility, changes in road surface friction, and heightened driver stress [8,9]. Peak traffic hours, unsafe driving behaviors, and even the lack of seatbelt use have been linked to greater crash severity in rainy conditions [6]. While roadway characteristics, such as wet surfaces and narrow segments, increase the risks of severe collisions, particularly in head-on crashes [10,11]. Furthermore, some studies have investigated the impacts of specific weather events, such as heavy rain, on crash risk [12]. Similarly, motorcycle crash studies have shown that temporal and seasonal variations (e.g., time-of-day and season combinations) play a critical role in shaping crash severity outcomes, highlighting the complex interactions between environment and driver behavior [13]. However, these studies often examine such factors in isolation, limiting our understanding of their complex interplay.
Individual driver characteristics, particularly gender, also influence crash dynamics. Research suggests that female drivers may sustain more severe injuries in certain crashes compared to their male counterparts [14,15,16]. Moreover, older female drivers are more likely to suffer severe injuries during adverse weather conditions, demonstrating a need for nuanced analyses that consider both gender and age-related factors [17,18,19]. In addition to the influence of gender, crash type is a substantial determinant of severity. Studies highlight the vulnerability of motorcyclists in multi-vehicle collisions during rain, and others focus on the increased risks associated with different collision types, such as rear-end, sideswipe, and head-on crashes, under wet conditions [20,21]. Recent disaggregated modeling of rear-end collisions further demonstrates that driver sex, age, area type, and lighting conditions significantly affect injury severity, underscoring the importance of incorporating demographic and environmental heterogeneity in crash severity research [22]. These patterns underscore the need to account for demographic and environmental heterogeneity in severity analyses.
Recent studies continue to investigate how traffic exposure, weather conditions, and crash characteristics interact to influence injury outcomes. For instance, one study using a correlated random parameters logit model with heterogeneity in means showed that real-time weather conditions significantly shift the risk of injury severity, while traffic volume can have a mitigating effect [23]. Another study identified persistent sex-based differences in injury risk even after accounting for vehicle mass and crash configuration, suggesting that some disparities remain despite controls [24]. Additional evidence suggests that weather conditions influence the severity of injuries in truck-involved crashes [25] and that sex-based differences vary significantly across different types of crashes [26].
While extensive research has examined the individual impacts of driver gender, crash type, and environmental factors, a significant gap remains in understanding their combined effects under high-risk conditions such as rainy weather. Comprehensive reviews (e.g., [8]) confirm that the general link between weather and crashes has been widely studied; however, dominant approaches often rely on aggregated models in which weather is treated as just one of many predictors for a general crash population. This tendency, along with studies that focus on specific crash types without adequately controlling for gender or environmental factors [27] or examine gender differences without considering crash types and road conditions [28], assumes stability across contexts and overlooks crucial interactions. Consequently, we still lack evidence on how crash type relates to driver gender in rainy conditions and which crash types disproportionately affect different gender groups.
The novelty of this study lies in directly challenging that aggregated paradigm. Rather than asking the broad question, “What is the effect of rain on crash severity?”, we ask the more actionable one: “Given that it is raining, how do the determinants of crash severity differ across scenarios jointly defined by driver gender and crash type?” By adopting a stratified analytical framework, we move beyond average effects to uncover hidden heterogeneity in risk patterns and identify high-risk scenarios and their contributing factors, thereby enabling targeted, evidence-based interventions.
Previous studies have acknowledged the role of adverse weather but often incorporate it as one of many independent variables within a single comprehensive model. While informative, this approach assumes that the effects of other critical factors—such as driver characteristics, vehicle type, or road geometry—remain consistent across clear and rainy conditions. Such assumptions may mask important interactions and lead to generalized findings that lack situational specificity.
Accordingly, this research departs from the traditional approach by defining rainy weather as the scope condition for the entire analysis. By isolating the high-risk environment of rain-related crashes and employing a stratified design that disaggregates data into 12 scenarios based on driver gender and crash type, we reveal how the significance of risk factors changes across specific contexts. This framework provides a more granular understanding of crash severity, moving beyond “what” factors are significant to “for whom” and “under what collision type”.
Therefore, the purpose of this study is to fill this gap by examining the combined effects of driver gender, crash type, and environmental factors on crash severity in rainy weather in California. Using Highway Safety Information System (HSIS) data and stratified binary logistic regression, the study addresses policy-relevant questions, such as whether pavement type (PCC vs. AC) differentially affects severity for male and female drivers in head-on crashes, or whether older drivers face greater risk in sideswipe crashes during rain. The findings will provide transportation agencies with evidence to create targeted, data-driven interventions and educational campaigns for specific driver groups, ultimately supporting safer and more efficient safety policies during adverse weather conditions.
Given our policy-oriented goal of achieving interpretable, scenario-specific effects, we employ stratified binary logistic regression, which provides transparent odds ratios for each predictor while avoiding the black-box complexity of machine-learning models and the parameter proliferation required by mixed logit models.
A detailed methodology follows, describing the HSIS dataset and the binary logistic regression approach. Next, we define all study variables (dependent and independent). The Results section reports findings from 12 stratified models, identifying significant predictors of crash severity by driver gender and crash type. The Conclusion synthesizes key findings, notes limitations, and offers targeted policy recommendations.

2. Materials and Methods

2.1. Research Framework

The overall research framework, from data extraction to final analysis, is visually summarized in Figure 1. This flowchart illustrates the step-by-step process employed in the study. The process begins with the extraction of data from the California HSIS, which is then filtered to isolate crashes that occur under rainy weather conditions. Subsequently, the filtered dataset is stratified based on two key criteria: driver gender (male/female) and six distinct crash types. This stratification yields 12 subgroups, for each of which a separate binary logistic regression model is developed. Each model utilizes a set of predictors (independent variables) to analyze crash severity (the dependent variable), which is defined as property damage only (PDO) versus Injury/Fatality (NotPDO). The final output is the identification of significant predictors for each specific scenario, providing targeted insights for developing effective infrastructure, policy, and educational safety interventions.

2.2. Binary Logistic Regression

Binary Logistic Regression is a statistical method that predicts or explains binary outcomes. It is used in various fields where linear regression is not suitable. This technique is used when the dependent variable is inherently binary, with only two possible values. The objective of binary logistic regression is therefore to estimate the probability that an instance belongs to a specific category, given one or more predictors [29]. The Binary Logistic Regression model is expressed mathematically as:
P Y = 1 | X =   1 1 + e β 0 + β 1 x 1 + β 2 x 2 + + β n x n
In this formula, P(Y = 1∣X) represents the probability that the dependent variable Y equals 1 given the independent variables X. The terms β 1 ,   β 2 ,   ,   β n are the coefficients that represent the relationship between the independent variables and the log-odds of the outcome [30]. The logistic function converts the linear combination of input variables into a probability value ranging from 0 to 1, making it suitable for classification problems, risk assessment, and decision-making with dichotomous outputs [31]. Because of the model’s interpretability and flexibility, which enable the modeling of both nominal and interval independent variables, it has received a lot of attention from researchers [32].
In this study, Binary Logistic Regression was used to analyze crash severity in relation to the driver’s gender and the type of crash. The data were divided into 12 subgroups based on two genders and six types of crash: Rear-End, Hit Object, Sideswipe, Overturned, Head-On, and Broadside. In the context of each group, a logistic regression analysis was used to model crash severity, with PDO and NotPDO as the two outcome categories.
This approach is ideal for this analysis for several reasons. First, the binary logistic regression model is well-suited to the scale of outcome variables PDO and NotPDO [33]. Second, by developing separate models for each gender-crash type combination, we can capture variations in the effects of various inputs on crash severity across each category. This stratified approach provides greater precision and targeted insights than a single, overall model. Likewise, logistic regression can analyze both categorical and continuous predictors, making it suitable for analyzing various factors associated with the occurrence of crashes [34]. Finally, logistic regression does not require normality of predictor distributions, making it more accurate for analyzing real-world crash data, where distributions are typically not normal [35].
The selection of binary logistic regression was a strategic choice guided by the primary objective of this study, which is to produce interpretable and actionable insights for safety practitioners and policymakers. While alternative methods like machine learning (e.g., Random Forest, Gradient Boosting) may yield higher predictive accuracy, they often operate as black boxes, making it difficult to quantify the specific impact of individual predictors. In contrast, the coefficients and odds ratios generated by logistic regression provide a clear and direct measure of how each factor influences the likelihood of a severe crash, which is essential for evidence-based decision-making.
Furthermore, regarding heterogeneity, rather than relying on more complex models such as mixed or random-parameters logit to capture unobserved differences, we address heterogeneity explicitly through full data stratification. We estimate 12 separate models, one for each gender–crash type combination, so that every parameter can vary freely across scenarios. This design is intentional, not redundant: our objective is not a single parsimonious predictor but a systematic comparison of risk patterns across distinct subgroups. A single model, even with numerous interaction terms, would average effects and obscure nuanced (and sometimes opposing) influences across contexts. The stratified framework enhances transparency and policy relevance, and we synthesize the findings in an integrated heatmap that summarizes the outcomes of all 12 models.
The evaluation criteria for goodness of fitting logistic regression models in this study include Deviance, AIC, McFadden R2, R2ML, R2CU, and AUC. The following summarizes the evaluation criteria.
  • Deviance: A reduction in Deviance indicates an improvement in model fit. Models with lower Deviance values are considered to have a better fit to the data.
  • Akaike Information Criterion (AIC): AIC is used for model comparison. Lower AIC values indicate a better model fit, while being simpler.
  • McFadden R2: This criterion assesses the overall quality of the model. Higher McFadden R2 values indicate a better fit.
  • R2ML and R2CU: These two criteria are also used to evaluate the quality of model fit. R2ML is based on maximum likelihood, and R2CU is calculated based on the Cox-Snell criterion. Higher values indicate a better fit.
  • Area Under the ROC Curve (AUC): The AUC is a metric used to evaluate the model’s ability to correctly identify different classes. AUC values range from 0.5 (random) to 1 (perfect discrimination). Higher AUC values indicate better model performance.

2.3. HSIS Data

The HSIS is a database that contains motor vehicle crash data, roadway characteristics, and traffic statistics from various states of the United States of America [36]. The primary purpose of HSIS is to facilitate highway safety research and provide data on traffic crashes to address safety issues. The collected data encompasses a wide range of crashes, roadways, and traffic variables that are crucial for identifying safety risks and designing interventions to reduce the number of crashes and fatalities.
The selection of the HSIS database was deliberate, as its unique characteristics make it particularly suitable for achieving the objectives of this research. Firstly, HSIS provides high-quality, integrated data that links crash records with detailed roadway characteristics (e.g., surface type, topography) and traffic volume inventories (e.g., AADT). This linkage is crucial for a comprehensive, multifaceted analysis and is often lacking in standalone crash databases. Secondly, the database offers richness in its variables, containing the specific and well-defined data points required for our stratified analysis, including weather conditions, manner of collision, and detailed driver demographics. Finally, the large sample size of crash records available for California ensures that even after filtering for rainy conditions and stratifying into 12 subgroups, each subgroup retains a sufficient number of observations for a statistically robust regression analysis, which would not be feasible with a smaller, less comprehensive dataset.
California has participated in the HSIS program since 1991. It has provided a considerable amount of high-quality data, improving the system’s functionality [36]. The HSIS dataset for California is quite extensive, consisting of inventories of roadways and intersections, crash characteristics, and vehicle involvement specifics [36]. The HSIS dataset for California has been refined over the years with the goal of enhancing both data quality and accessibility.
California presents a particularly compelling context for studying rain-related crashes. The state’s Mediterranean climate is characterized by long dry seasons followed by distinct wet periods. This pattern often leads to the first flush phenomenon, where initial rainfall mixes with accumulated oil and road debris, creating exceptionally slick pavement conditions [37]. Furthermore, because rainfall is relatively infrequent in much of California, rain events produce marked operational impacts, lower speeds and capacity, and higher headways, indicating region-specific responses to wet conditions, which has been linked to a disproportionate increase in crash frequency during rainfall events. This unique environmental and behavioral context makes California a critical region for this analysis [38].
It is essential to note that this analysis is based on data from California, a state with distinct climate patterns, traffic culture, and roadway infrastructure, and therefore, the findings may not be directly generalizable to other regions. However, this focused approach allows for a detailed, context-specific case study that can serve as a valuable benchmark for similar research elsewhere.
The WEATHER1 variable in the HSIS dataset was used to filter for crashes under rainy conditions. It is important to note that this variable is categorical (Raining) and does not provide further granularity regarding the intensity of the rain (e.g., light, moderate, heavy). Therefore, the analysis encompasses all crashes recorded under this general weather condition.
As presented in Table 1, which includes 20 randomly selected records, the data prepared for this research pertains to crashes that occurred in California during rainy weather between 2015 and 2017. These crashes are categorized as “PDO”, meaning property damage only, and “NotPDO”, for other crashes with injuries or fatalities. Other valuable information in the dataset includes time of day, weather conditions, and the type of road surface at the time of the crash. It also provides information on the number of vehicles, the number of lanes, and the type of road surface.
Moreover, Table 2 displays the frequency of each level of the variables used in the research. For instance, Table 2 indicates that there are 8168 observations where the driver’s gender is female (F) and 15,074 observations where the driver’s gender is male (M). The table also displays the frequency of each severity level, road surface type, light condition, number of vehicles involved, number of lanes, terrain, Annual Average Daily Traffic (AADT), urban-rural classification, and season.
The HOUR Class variable was created to categorize the time of the crash into five distinct periods, capturing different traffic patterns throughout the day. The classes are defined as follows:
  • Class 1: Early Morning (00:01–06:00)
  • Class 2: Midday (10:01–16:00)
  • Class 3: Night (20:01–24:00)
  • Class 4: Morning Peak (06:01–10:00)
  • Class 5: Evening Peak (16:01–20:00)

3. Results

This section presents the results of fitting logistic regression models to analyze the severity of crashes in rainy weather, using various combinations of driver gender and crash types. The evaluation criteria for goodness of fit include Deviance, AIC, McFadden R2, R2ML, R2CU, and AUC. Table 3 summarizes the goodness-of-fit criteria for logistic regression models of crash severity in rainy weather, by gender and crash type.
Rear-End crash models for male and female drivers have lower McFadden R2 values (0.006 and 0.007) and AUC (0.559). This indicates a weak fit for these models. These results could be attributed to the complexity and high variability of factors that affect Rear-End collisions.
Models for Hit_Object crashes for both genders have slightly higher McFadden R2 and AUC values compared to other models (0.018 and 0.576 for males, 0.019 and 0.583 for females). This indicates a relatively better fit for these models.
Notable McFadden R2 (0.047) and AUC (0.638) values are found in Sideswipe crashes model for males. These results suggest a relatively good fit in identifying the severity of Sideswipe crashes. The McFadden R2 and AUC values are slightly lower (0.010 and 0.565) for females.
The greatest McFadden R2 and AUC values are found in models for Head-On collisions for both male and female drivers (0.145 and 0.748 for women, 0.137 and 0.709 for males). This indicates a better fit and higher accuracy of these models in identifying severity.
Models for Broadside crashes demonstrate moderate performance. Male drivers have a McFadden R2 of 0.020 and an AUC of 0.597, while female drivers have lower values (0.009 and 0.558, respectively). This suggests that the model predicts Broadside crash severity more accurately for male drivers compared to female drivers.
The models for Overturned crashes perform similarly for both genders, with a slightly better result for female drivers. Male drivers have a McFadden R2 of 0.021 and an AUC of 0.572, while female drivers have values of 0.030 and 0.571, respectively. This indicates a moderate fit for both genders, with a slightly better performance in predicting Overturned crash severity for female drivers.
The results of the goodness-of-fit evaluation for the logistic regression models indicate that the model’s accuracy and performance vary depending on the combinations of driver gender and crash types. Models for Head-On crashes perform best for both genders, while models for Rear-End crashes need improvement. These analyses can serve as a basis for developing strategies to improve road safety during rainy weather conditions.
Table A1 in the Appendix A presents the results from 12 logistic regression models used to analyze the severity of crash types.
To provide a comprehensive and intuitive summary of the findings from the 12 logistic regression models, a heatmap of the coefficient estimates was generated (Figure 2). This visualization allows for the direct comparison of predictor effects across all scenarios. Each row corresponds to an explanatory variable, and each column to a specific model. The color scale indicates the sign and magnitude of the coefficient estimates (red = positive effect, blue = negative effect, white ≈ zero), while non-significant predictors (p > 0.05) are marked with a cross (×). This graphical representation facilitates the identification of overarching patterns, variable consistency, and context-dependent effects that would be difficult to discern from tabular data alone.
Model 1 (Male, Rear-End Crash): The results of the first model indicate that road surface type (Surface Type: PCC) and road topography (Terrain: M) have a positive and significant effect on crash severity. Additionally, the season of occurrence (Season: Winter) and driver age group (Age Group: +65) have significant effects. These findings suggest that road conditions and the season of occurrence can impact crash severity.
Model 2 (Male, Hit Object Crash): In the second model, the number of vehicles involved (No. Vehicles: 2, No. Vehicles: 3, and No. Vehicles: +3) has a negative and significant effect on crash severity. Also, road topography (Terrain: M) and driver age group (Age Group: 25–65) have a significant impact. These findings suggest that an increase in the number of vehicles involved, along with the type of topography, can lead to a decrease in crash severity.
Model 3 (Female, Rear-End Crash): The third model’s findings indicate that road surface type (Surface Type: PCC) and road topography (Terrain: M) have a positive and significant effect on crash severity. Additionally, daily traffic volume (AADT: 125k–175k, AADT: 175k–250k, and AADT: +250k) has a significant impact. These findings indicate that road conditions and traffic volume can affect crash severity.
Model 4 (Male, Sideswipe Crash): The fourth model shows that the number of vehicles involved (No. Vehicles: 3 and No. Vehicles: +3) has a negative and significant effect on reducing crash severity. This model also accounts for road surface type (PCC) and road topography (Terrain: M and Terrain: R). The results show that concrete roads and flat areas can help reduce crash severity.
Model 5 (Female, Hit Object Crash): In the fifth model, vehicle manufacturing year (Vehicle Year) and number of vehicles involved (No. Vehicles: 2 and No. Vehicles: 3) have a significant impact on crash severity. Newer vehicles (with a more recent manufacturing year) are less likely to be involved in severe crashes due to improved safety technologies. Similarly, an increase in the number of vehicles involved is associated with a decrease in crash severity.
Model 6 (Male, Broadside Crash): In the sixth model, the time of crash occurrence (Hour class) and ambient light (Light) have a significant impact on crash severity. Crash severity is higher at night (Hour class: 4 and Hour class: 5) and in poor lighting conditions (Light: Daylight). These results show that lighting conditions and the time of crash occurrence can have a significant impact on crash severity.
Model 7 (Female, Broadside Crash): The results show that the number of road lanes (No. Lanes: 6–7 and No. Lanes: +8) and the time class of crash occurrence (Hour class: 4) have a significant effect on crash severity. More lanes and nighttime crashes are associated with increased crash severity. These findings indicate that roads with more lanes and specific time conditions can lead to increased crash severity. Additionally, daily traffic volume (AADT) contributes to an increase in crash severity.
Model 8 (Male, Overturned Crash): The time class of crash occurrence (Hour class: 3) has a negative and significant effect on crash severity. This indicates that midday crashes are less severe. Additionally, road surface type (Surface Type: PCC) has a significant effect on reducing crash severity. These results show that concrete roads can help reduce crash severity.
Model 9 (Female, Overturned Crash): The vehicle manufacturing year (Vehicle Year: +2015) and the number of vehicles involved (No. Vehicles: 2) have negative and significant effects on crash severity. Newer vehicles have improved safety technologies, and crashes involving fewer vehicles are less severe.
Model 10 (Male, Head-On Crash): In the tenth model, the time class of crash occurrence (Hour class: 3) and the number of vehicles involved (No. Vehicles: 2, No. Vehicles: 3, and No. Vehicles: +3) have a significant effect on crash severity. While midday Crashes involving fewer vehicles are less severe.
Model 11 (Female, Head-On Crash): The results show that daily traffic volume (AADT: 125k–175k) has a negative impact on crash severity. This suggests that reducing roadway traffic volumes has a substantial impact on reducing crash severity.
Model 12 (Female, Head-On Crash): In the twelfth model, the time class of crash occurrence (Hour class: 2, Hour class: 3, Hour class: 5) and the number of vehicles involved (No. Vehicles: 2, No. Vehicles: 3, and No. Vehicles: +3) have significant effects on crash severity. Crash occurrences during nighttime hours and those involving fewer vehicles are associated with decreased crash severity. Additionally, the season of occurrence (Season: Winter) has a positive and significant effect on crash severity.

4. Discussion

This study employed a stratified modeling approach to identify the nuanced factors that influence crash severity during rainy weather. The results, summarized visually in Figure 2, not only identify significant predictors but also highlight their high degree of context-dependency. Some of the key findings are as follows:
The results show that crashes with multiple vehicles are consistently associated with reduced severity compared to single-vehicle crashes. Both No. Vehicles: 2 and No. Vehicles: 3 were significant in five of the twelve models, with negative coefficients, while No. Vehicles: 3+ displayed the strongest reductions. This suggests that crashes involving more vehicles tend to distribute the crash energy across multiple impacts, thereby lowering the severity of individual injuries. Prior studies have similarly reported that chain-reaction or multi-vehicle crashes are associated with less severe outcomes because impact forces are distributed across multiple vehicles [39,40]. Moreover, such crashes often occur under congested, lower-speed conditions.
Vehicle Manufacturing Year (Vehicle Year): The effect of vehicle model year on crash severity was inconsistent. Although newer vehicles (2015 and later) are expected to reduce injury severity through advanced safety features, the results show mixed outcomes, with coefficients ranging from negative to positive across different models. In some cases, newer vehicles were associated with slightly higher odds of severe outcomes, which may reflect risk compensation, where drivers take greater risks due to a perceived sense of safety, or the greater likelihood of newer vehicles being used on higher-speed facilities [41,42].
Time of Crash (Hour class): The timing of a crash showed a significant influence on injury severity. Nighttime crashes were generally associated with more severe outcomes, reflecting the combined effects of reduced visibility, fatigue, and a higher likelihood of risky behaviors such as impaired driving. Morning and evening peaks also showed positive associations with severity in some subgroups, suggesting that congestion does not always mitigate crash outcomes when high exposure and aggressive driving behaviors are present. These results are consistent with earlier findings that adverse conditions such as rain and low visibility increase crash severity in multivehicle events [10] and that precipitation-related crashes, especially under poor visibility, elevate both crash frequency and severity [6]. More recent analyses also reinforce that real-time weather and temporal variations interact to significantly influence crash severity outcomes [23].
Type of Road Surface (Surface Type): PCC was significant in three models, with one negative and two small positive coefficients, indicating no consistent advantage over AC. The effects were modest, suggesting surface type is a secondary factor in severity outcomes. This aligns with recent findings that skid resistance, rather than pavement type itself, is the more critical determinant of injury severity under wet conditions [43].
Annual Average Daily Traffic (AADT): The strongest effect in results appears at the very highest volumes. AADT ≥ 250,000 was significant in three models, consistently yielding positive coefficients, indicating higher injury severity. A likely mechanism is the combination of short headways, greater speed variability, and complex operations (such as weaving/heavy-vehicle mix), which compresses reaction time and increases impact energy [44]. By contrast, mid-range volumes (e.g., 125k–175k) trend lower or mixed, pointing to a non-linear volume–severity relationship.
Ambient Light (Light): Lighting conditions did not emerge as significant predictors of injury severity in this study. The Daylight indicator (with dark conditions as the baseline) was not statistically significant in any model. This contrasts with prior evidence linking low-light conditions to higher severity [6,23], suggesting that in these models, the effects of lighting are largely absorbed by other covariates.
Road Topography (Terrain): The rolling terrain showed a slight reduction in severity, with small negative effects observed in two models. Mountain terrain appeared in four models, with mixed and near-zero coefficients, indicating a minimal systematic impact. Previous studies, however, report higher severity on mountainous roads due to steep grades, sharp curves, and limited sight distance [45]. The difference may be because variables such as AADT and crash type already capture much of the risk usually attributed to topography.
Season of Crash (Season): Seasonal effects appeared in only a few models. Summer was significant in one model, and Winter in two models, both with positive coefficients, indicating a slight increase in severity. The limited presence and modest effect sizes suggest that the season itself is not a strong predictor; rather, its influence likely reflects associated factors such as weather, daylight hours, and surface conditions.
Number of Lanes (No. Lanes): Lane count showed consistent negative effects at the higher categories. Roadways with 6–7 lanes were significant in two models, and those with 8 or more lanes were significant in three models, all of which were associated with reduced severity compared to the baseline. This may reflect that wider roads often operate at lower effective speeds under congestion, and crash energy is more dispersed across multiple lanes. Prior studies have also found that multilane facilities are linked with lower injury severity due to similar operational characteristics [46].
Driver Age Group (Age Group): Both 25–65 and 65+ were significant in two models each, with negative coefficients (larger in magnitude for the 65+ group), indicating lower severity compared to the ≤24 group. This concentrates higher severities among the youngest drivers, while middle-aged and older drivers show lower severity in these models. Prior work similarly reports elevated risk for young drivers and shows that age effects attenuate once crash type, exposure, and speed environment are controlled [47,48].
The multifaceted nature of crash severity, as highlighted by this research, calls for a similarly multifaceted approach to policy development. Severity is highest on very high-exposure corridors (AADT ≥ 250,000), so network operations need to emphasize speed harmonization and variable limits, ramp metering, queue warning, and stronger weaving and merge control on these links. Because higher lane counts (6–7 and 8+) are associated with lower severity, preserving cross-section quality, such as shoulders, separation, and lane discipline, as well as active speed management, needs to be prioritized over capacity expansion, framed as a safety measure.
Findings by crash configuration suggest that countermeasures should focus on preventing the most severe outcomes. Since multi-vehicle crashes are generally less severe, policies need to focus on the most injurious patterns: single-vehicle, head-on, and rollover crashes. Key countermeasures include median and roadside barriers, centerline and shoulder rumble strips, curve speed management, and improved delineation with high-friction spot treatments at high-risk locations. Where the models indicate increased risk by hour class, operations need to be timed accordingly, with targeted enforcement, quick incident response, and speed management during those times.
With respect to road users and vehicles, the younger age group carries the highest severity. Education and enforcement efforts should be targeted at young drivers through refinements to graduated licensing, telematics-based feedback, and location-focused campaigns near schools and universities. Vehicle model year effects are mixed, which implies that technology alone does not guarantee lower severity. Incentives for active safety features, such as automatic emergency braking, electronic stability control, and lane departure warning, need to be paired with behavioral programs and real-time operational strategies on high-exposure corridors, and their effectiveness needs to be validated through ongoing evaluation.

5. Conclusions

This study employed stratified logistic models, stratified by driver gender and crash type, to investigate factors associated with injury severity in rainy conditions. The results show clear, context-dependent patterns rather than uniform effects. Crashes involving multiple vehicles were consistently less severe than single-vehicle events, with two- and three-vehicle crashes being significant in several models, and crashes involving three or more vehicles showing the strongest reductions. Very high exposure was the most consistent risk signal: facilities with an annual average daily traffic of 250,000 or above were linked to higher severity, whereas higher lane counts, six to seven and eight or more, were associated with lower severity. Seasonal indicators appeared only sparingly, and vehicle model year effects were mixed.
These findings point to targeted operational and engineering responses. On very high-volume corridors, network operations need to emphasize speed harmonization, ramp metering, queue warning, and stronger weaving and merge control. To mitigate the most severe outcomes, countermeasures should prioritize single-vehicle, head-on, and overturn crashes through the use of median and roadside barriers, centerline and shoulder rumble strips, curve speed management, and enhanced delineation at high-risk sites. Education and enforcement efforts should focus on the youngest drivers, who exhibit the highest severity, through refinements to graduated licensing, telematics-based feedback, and location-focused campaigns near schools and universities. Given the mixed model year results, incentives for active safety technologies should be paired with behavioral programs and real-time operational strategies on high-exposure links, and their benefits should be verified through ongoing evaluation.
Overall, policy making needs to concentrate on the conditions where the models provide the strongest and most reliable signals: very high traffic exposure, lane configuration that supports safe operations, prevention of the most severe crash types, and risk among the youngest drivers. Focusing resources on these leverage points offers the clearest path to practical reductions in crash severity during rainy weather.
While this study offers valuable insights, several limitations should be acknowledged. Our analysis is based on data from California (2015–2017), which may limit its applicability to other regions with different traffic patterns, road conditions, and weather. Additionally, our definition of “rainy weather” was constrained by the categorical nature of the HSIS data. Future studies could achieve a more nuanced understanding by integrating granular meteorological data to explore the differential effects of light versus heavy rainfall on crash severity. Unobserved variables, such as specific driver behaviors or vehicle maintenance, which are not captured in the HSIS database, may also influence the results. The broad categorization of crash types and our binary outcome measure (PDO vs. NotPDO) limit our ability to analyze subtle patterns and injury severities. Future research could address these points by utilizing data from diverse geographic regions and employing advanced statistical methods, such as machine learning algorithms. Incorporating more driver-specific information through surveys or direct observation studies, analyzing injury severity as a continuous outcome, and conducting longitudinal studies to evaluate policy effectiveness would further advance our understanding of crash severity during rainy conditions.
Finally, a key methodological consideration is the potential for temporal instability in the estimated parameters. By pooling data from 2015 to 2017, our models implicitly assume that the relationships between predictor variables and crash severity are stable over this period. However, a significant body of literature demonstrates that the effects of various factors can change over time due to shifts in vehicle technology, driver behavior, economic conditions, or policy [49,50]. A formal statistical investigation of temporal instability for each of our 12 subgroups was beyond the scope of this study. Nonetheless, we acknowledge this as a limitation and suggest that it represents a critical point for future research. Future work could apply appropriate statistical tests to explore how these scenario-specific risk patterns evolve over longer time horizons.

Author Contributions

Conceptualization, S.N. and M.S.; methodology, S.N. and M.S.; validation, S.N., M.S., M.B. and M.P.; formal analysis, S.N. and M.S.; data curation, S.N., M.S., E.R. and M.J.H.Z.; writing—original draft preparation, S.N., M.S., E.R. and M.L.; writing—review and editing, S.N., M.S., E.R., M.L., M.B., M.P., M.J.H.Z. and A.M.; visualization, S.N. and M.S.; supervision, S.N. and A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the Highway Safety Information System (HSIS), with details available at https://rosap.ntl.bts.gov/view/dot/77184 (accessed on 25 August 2024). The processed data and analysis codes generated for this research are not publicly archived but are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Detailed Regression Model Results

Table A1. Summary of Logistic Regression Models Examining Factors Influencing Crash Severity in Rainy Weather.
Table A1. Summary of Logistic Regression Models Examining Factors Influencing Crash Severity in Rainy Weather.
ModelTermEstimateStd. ErrorStatisticp-Value
1(Intercept)0.4090.0994.1150.000
Surface Type: PCC0.1840.0622.9730.003
Terrain: M0.4300.1323.2620.001
Terrain: R−0.0850.067−1.2590.208
AADT: 125k–175k0.1230.0851.4540.146
AADT: 175k–250k0.0110.0760.1500.881
AADT: +250k0.1600.0911.7490.080
Season: Spring0.0810.0850.9500.342
Season: Summer0.0780.1690.4640.643
Season: Winter0.1910.0742.5810.010
Age Group: 25–65−0.1340.071−1.8820.060
Age Group: +65−0.2520.126−2.0020.045
2(Intercept)1.2300.09512.9230.000
No. Vehicles: 2−0.4580.096−4.7460.000
No. Vehicles: 3−0.8110.203−4.0020.000
No. Vehicles: +3−1.7650.332−5.3100.000
No. Lanes: 6–7−0.0270.107−0.2570.797
No. Lanes: +8−0.2780.083−3.3320.001
Terrain: M−0.3140.104−3.0150.003
Terrain: R−0.1510.079−1.9030.057
Age Group: 25–65−0.1730.075−2.3120.021
Age Group: +65−0.3120.176−1.7740.076
3(Intercept)0.1760.0732.3950.017
Surface Type: PCC0.2100.0782.6840.007
Terrain: M0.3560.1622.2000.028
Terrain: R−0.0440.085−0.5210.603
AADT: 125k–175k0.2290.1022.2390.025
AADT: 175k–250k0.2500.0962.6090.009
AADT: +250k0.2830.1142.4750.013
4(Intercept)2.6480.6234.2500.000
No. Vehicles: 2−0.8910.610−1.4600.144
No. Vehicles: 3−1.9110.615−3.1060.002
No. Vehicles: +3−2.0130.631−3.1910.001
No. Lanes: 6–7−0.4310.163−2.6440.008
No. Lanes: +80.0490.1810.2720.786
Surface Type: PCC−0.3060.099−3.0870.002
Terrain: M−0.5420.174−3.1200.002
Terrain: R−0.2080.104−2.0050.045
AADT: 125k–175k−0.1160.158−0.7330.463
AADT: 175k–250k−0.4220.165−2.5660.010
AADT: +250k−0.0950.184−0.5180.604
Age Group: 25–650.0340.1120.3060.760
Age Group: +650.4270.2251.9000.057
5(Intercept)0.6740.1753.8450.000
Vehicle Year: 2000–20050.3660.1412.5940.009
Vehicle Year: 2005–20100.3550.1362.6160.009
Vehicle Year: 2010–20150.4270.1492.8640.004
Vehicle Year: +20150.5200.2002.6000.009
No. Vehicles: 2−0.3970.143−2.7800.005
No. Vehicles: 3−0.7960.309−2.5740.010
No. Vehicles: +3−0.8790.499−1.7610.078
No. Lanes: 6–7−0.2440.135−1.8140.070
No. Lanes: +8−0.3910.113−3.4700.001
Terrain: M−0.2450.140−1.7510.080
Terrain: R−0.2480.104−2.3930.017
Season: Spring−0.0320.134−0.2380.812
Season: Summer0.5500.2632.0940.036
Season: Winter0.0040.1170.0320.975
Age Group: 25–65−0.2190.096−2.2880.022
Age Group: +65−0.2610.257−1.0130.311
6(Intercept)0.3830.2041.8720.061
Hour class: 20.5820.2961.9650.049
Hour class: 30.0830.2270.3640.716
Hour class: 40.7430.2882.5800.010
Hour class: 50.4640.2302.0200.043
Light: Daylight−0.3730.210−1.7780.075
AADT: 125k–175k0.2320.1671.3920.164
AADT: 175k–250k0.2760.1471.8780.060
AADT: 250k+0.5180.1872.7640.006
7(Intercept)0.2360.1721.3700.171
Hour class: 20.1950.1781.0970.273
Hour class: 3−0.0170.206−0.0810.936
Hour class: 40.4220.2012.0960.036
Hour class: 5−0.0410.192−0.2160.829
No. Lanes: 6–7−0.6380.184−3.4770.001
No. Lanes: +8−0.4960.214−2.3170.021
AADT: 125k–175k−0.0550.208−0.2640.792
AADT: 175k–250k−0.0850.216−0.3930.694
AADT: +250k0.5550.2622.1180.034
8(Intercept)0.1120.2810.3980.690
Hour class: 2−0.3650.296−1.2340.217
Hour class: 3−0.8180.337−2.4240.015
Hour class: 4−0.4300.320−1.3450.179
Hour class: 5−0.3320.312−1.0650.287
Surface Type: PCC0.2340.1541.5250.127
9(Intercept)0.3400.2011.6910.091
Vehicle Year: 2000–2005−0.1750.247−0.7090.478
Vehicle Year: 2005–2010−0.0770.265−0.2900.772
Vehicle Year: 2010–2015−0.4230.302−1.4000.161
Vehicle Year: +2015−0.8940.363−2.4620.014
No. Vehicles: 2−1.2320.480−2.5660.010
10(Intercept)1.6220.4823.3650.001
Hour class: 2−0.1610.361−0.4460.656
Hour class: 30.9590.3962.4190.016
Hour class: 4−0.1130.407−0.2780.781
Hour class: 5−0.0070.388−0.0180.986
No. Vehicles: 2−1.8580.432−4.3020.000
No. Vehicles: 3−1.9000.547−3.4730.001
No. Vehicles: +3−3.9910.737−5.4180.000
Age Group: 25–65−0.5290.277−1.9100.056
Age Group: +65−1.1480.527−2.1780.029
11(Intercept)−0.3800.159−2.3990.016
AADT: 125k–175k−1.6570.634−2.6130.009
AADT: 175k–250k−0.1310.363−0.3600.719
AADT: +250k−0.3130.633−0.4950.621
12(Intercept)−0.6200.850−0.7300.466
Vehicle Year: 2000–20050.3320.5330.6230.533
Vehicle Year: 2005–2010−0.5260.562−0.9350.350
Vehicle Year: 2010–2015−0.3810.558−0.6830.494
Vehicle Year: +2015−0.9580.665−1.4420.149
Hour class: 21.2560.6961.8040.071
Hour class: 31.8960.7582.5020.012
Hour class: 40.3510.7530.4660.642
Hour class: 51.2160.7071.7200.085
No. Vehicles: 2−1.3850.529−2.6210.009
No. Vehicles: 3−1.7170.727−2.3620.018
No. Vehicles: +3−3.6181.175−3.0780.002
Season: Spring0.4030.4990.8070.420
Season: Summer1.0960.6001.8250.068
Season: Winter0.8530.4192.0380.042

References

  1. World Health Organization (WHO). Global Status Report on Road Safety 2018; WHO: Geneva, Switzerland, 2018; ISBN 978-92-4-156568-4.
  2. National Highway Traffic Safety Administration. Traffic Safety Facts: 2019 Data; National Highway Traffic Safety Administration: Washington, DC, USA, 2021.
  3. California Office of Traffic Safety. California Highway Safety Plan; California Office of Traffic Safety: Elk Grove, CA, USA, 2022.
  4. National Highway Traffic Safety Administration. Weather-Related Crash Statistics; National Highway Traffic Safety Administration: Washington, DC, USA, 2022.
  5. Abdel-Aty, M.; Pande, A. Identifying Crash Propensity Using Specific Traffic Speed Conditions. J. Saf. Res. 2005, 36, 97–108. [Google Scholar] [CrossRef]
  6. Qiu, L.; Nixon, W.A. Effects of Adverse Weather on Traffic Crashes: Systematic Review and Meta-Analysis. Transp. Res. Rec. 2008, 2055, 139–146. [Google Scholar] [CrossRef]
  7. Andrey, J.; Mills, B.; Vandermolen, J. Weather Information and Road Safety; Institute for Catastrophic Loss Reduction: Toronto, ON, Canada, 2001. [Google Scholar]
  8. Theofilatos, A.; Yannis, G. A Review of the Effect of Traffic and Weather Characteristics on Road Safety. Accid. Anal. Prev. 2014, 72, 244–256. [Google Scholar] [CrossRef]
  9. Zhu, X.; Yuan, Y.; Hu, X.; Chiu, Y.C.; Ma, Y.L. A Bayesian Network Model for Contextual versus Non-Contextual Driving Behavior Assessment. Transp. Res. Part C Emerg. Technol. 2017, 81, 172–187. [Google Scholar] [CrossRef]
  10. Jung, S.; Qin, X.; Noyce, D. Injury Severity of Multivehicle Crash in Rainy Weather. J. Transp. Eng. 2012, 138, 50–59. [Google Scholar] [CrossRef]
  11. Shankar, V.; Mannering, F.; Barfield, W. Effect of Roadway Geometrics and Environmental Factors on Rural Freeway Accident Frequencies. Accid. Anal. Prev. 1995, 27, 371–389. [Google Scholar] [CrossRef]
  12. Chung, E.; Ohtani, O.; Warita, H.; Kuwahara, M.; Morita, H. Effect of Rain on Travel Demand and Traffic Accidents. In Proceedings of the 8th International IEEE Conference on Intelligent Transportation Systems, Vienna, Austria, 13–16 September 2005; IEEE: New York, NY, USA, 2005; pp. 1080–1083. [Google Scholar]
  13. Naseralavi, S.; Soltanirad, M.; Igene, M.; Jimee, K.; Ranjbar, E. Determining Factors Affecting Motorcycle Crash Severity Based on Time of Day and Season Combinations. ENG Trans. 2025, 6, 1–11. [Google Scholar] [CrossRef]
  14. Bose, D.; Segui-Gomez, M.; Crandall, J.R. Vulnerability of Female Drivers Involved in Motor Vehicle Crashes: An Analysis of US Population at Risk. Am. J. Public Health 2011, 101, 2368–2373. [Google Scholar] [CrossRef]
  15. Ulfarsson, G.F.; Mannering, F.L. Differences in Male and Female Injury Severities in Sport-Utility Vehicle, Minivan, Pickup and Passenger Car Accidents. Accid. Anal. Prev. 2004, 36, 135–147. [Google Scholar] [CrossRef] [PubMed]
  16. Billah, K.; Sharif, H.O.; Dessouky, S. How Gender Affects Motor Vehicle Crashes: A Case Study from San Antonio, Texas. Sustainability 2022, 14, 7023. [Google Scholar] [CrossRef]
  17. Islam, S.; Mannering, F. Driver Aging and Its Effect on Male and Female Single-Vehicle Accident Injuries: Some Additional Evidence. J. Saf. Res. 2006, 37, 267–276. [Google Scholar] [CrossRef]
  18. Hill, J.D.; Boyle, L.N. Driver Stress as Influenced by Driving Maneuvers and Roadway Conditions. Transp. Res. Part F Traffic Psychol. Behav. 2007, 10, 177–186. [Google Scholar] [CrossRef]
  19. Naseralavi, S.S.; Soltanirad, M.; Mazaheri, A. Modeling the Severity of Motor-Vehicle Accidents by Different Age Groups of the Driver Using Multinomial Logit Regression. Q. J. Transp. Eng. 2023, 14, 3061–3083. [Google Scholar] [CrossRef]
  20. Abdel-Aty, M.; Ekram, A.-A.; Huang, H.; Choi, K. A Study on Crashes Related to Visibility Obstruction Due to Fog and Smoke. Accid. Anal. Prev. 2011, 43, 1730–1737. [Google Scholar] [CrossRef] [PubMed]
  21. Chen, C.; Zhang, G.; Tarefder, R.; Ma, J.; Wei, H.; Guan, H. A Multinomial Logit Model-Bayesian Network Hybrid Approach for Driver Injury Severity Analyses in Rear-End Crashes. Accid. Anal. Prev. 2015, 80, 76–88. [Google Scholar] [CrossRef] [PubMed]
  22. Naseralavi, S.; Soltanirad, M.; Baghersad, M.; Ranjbar, E.; Jimee, K.; Mazaheri, A. Exploring the Impact of Driver Sex, Driver Age, Area Type, and Lighting Conditions on Rear-End Collision Severity. Comput. Res. Prog. Appl. Sci. Eng. 2025, 11, 1–16. [Google Scholar] [CrossRef]
  23. Pathivada, B.K.; Banerjee, A.; Haleem, K. Impact of Real-Time Weather Conditions on Crash Injury Severity in Kentucky Using the Correlated Random Parameters Logit Model with Heterogeneity in Means. Accid. Anal. Prev. 2024, 196, 107453. [Google Scholar] [CrossRef] [PubMed]
  24. Brumbelow, M.L.; Jermakian, J.S. Injury Risks and Crashworthiness Benefits for Females and Males: Which Differences Are Physiological? Traffic Inj. Prev. 2022, 23, 11–16. [Google Scholar] [CrossRef]
  25. Uddin, M.; Huynh, N. Injury Severity Analysis of Truck-Involved Crashes under Different Weather Conditions. Accid. Anal. Prev. 2020, 141, 105529. [Google Scholar] [CrossRef]
  26. Craig, M.J.; Liu, C.; Zhang, F.; Enriquez, J. Sex-Based Differences in Odds of Motor Vehicle Crash Injury Outcomes. Accid. Anal. Prev. 2024, 195, 107100. [Google Scholar] [CrossRef]
  27. Ye, F.; Lord, D. Comparing Three Commonly Used Crash Severity Models on Sample Size Requirements: Multinomial Logit, Ordered Probit and Mixed Logit Models. Anal. Methods Accid. Res. 2014, 1, 72–85. [Google Scholar] [CrossRef]
  28. Kim, J.-K.; Ulfarsson, G.F.; Kim, S.; Shankar, V.N. Driver-Injury Severity in Single-Vehicle Crashes in California: A Mixed Logit Analysis of Heterogeneity Due to Age and Gender. Accid. Anal. Prev. 2013, 50, 1073–1081. [Google Scholar] [CrossRef] [PubMed]
  29. Peng, C.-Y.J.; Lee, K.L.; Ingersoll, G.M. An Introduction to Logistic Regression Analysis and Reporting. J. Educ. Res. 2002, 96, 3–14. [Google Scholar] [CrossRef]
  30. Hosmer, D.W., Jr.; Lemeshow, S.; Sturdivant, R.X. Applied Logistic Regression, 3rd ed.; Wiley: Hoboken, NJ, USA, 2013; ISBN 978-0-470-58247-3. [Google Scholar]
  31. Harrell, F.E., Jr. Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis, 2nd ed.; 2015 edition; Springer: Cham, Switzerland, 2015; ISBN 978-3-319-19424-0. [Google Scholar]
  32. Kleinbaum, D.G.; Klein, M. Logistic Regression: A Self-Learning Text, 3rd ed.; Springer: New York, NY, USA, 2010; ISBN 978-1-4419-1741-6. [Google Scholar]
  33. Sperandei, S. Understanding Logistic Regression Analysis. Biochem. Med. 2014, 24, 12–18. [Google Scholar] [CrossRef]
  34. Menard, S. Applied Logistic Regression Analysis; SAGE: Newbury Park, CA, USA, 2002; ISBN 978-0-7619-2208-7. [Google Scholar]
  35. Bagley, S.C.; White, H.; Golomb, B.A. Logistic Regression in the Medical Literature: Standards for Use and Reporting, with Particular Attention to One Medical Domain. J. Clin. Epidemiol. 2001, 54, 979–985. [Google Scholar] [CrossRef] [PubMed]
  36. Federal Highway Administration (FHWA). HSIS Guidebook—California; FHWA: Washington, DC, USA, 2022.
  37. Stenstrom, M.K.; Kayhanian, M. First Flush Phenomenon Characterization; California Department of Transportation, Division of Environmental Analysis: Sacramento, CA, USA, 2005; CTSW-RT-05-73-02.6.
  38. Eisenberg, D. The mixed effects of precipitation on traffic crashes. Accid. Anal. Prev. 2004, 36, 637–647. [Google Scholar] [CrossRef]
  39. Chen, F.; Song, M.; Ma, X. Investigation on the Injury Severity of Drivers in Rear-End Collisions between Cars Using a Random Parameters Bivariate Ordered Probit Model. Int. J. Environ. Res. Public Health 2019, 16, 2632. [Google Scholar] [CrossRef]
  40. Wang, C.; Chen, F.; Zhang, Y.; Cheng, J. Analysis of Injury Severity in Rear-End Crashes on an Expressway Involving Different Types of Vehicles Using Random-Parameters Logit Models with Heterogeneity in Means and Variances. Transp. Lett. 2023, 15, 742–753. [Google Scholar] [CrossRef]
  41. Yu, M.; Zheng, C.; Ma, C. Analysis of Injury Severity of Rear-End Crashes in Work Zones: A Random Parameters Approach with Heterogeneity in Means and Variances. Anal. Methods Accid. Res. 2020, 27, 100126. [Google Scholar] [CrossRef]
  42. National Highway Traffic Safety Administration. How Vehicle Age and Model Year Relate to Driver Injury Severity in Fatal Crashes; National Highway Traffic Safety Administration: Washington, DC, USA, 2013.
  43. Alnawmasi, N.; Alamri, M.; Jashami, H. Assessing Injury-Severity Outcomes in Wet-Pavement Conditions: Exploring the Impact of Pavement Skid Resistance through Temporal Analysis with Random Parameters and Heterogeneity in Means and Variances. Transp. Res. Rec. 2025, 2679, 50–77. [Google Scholar] [CrossRef]
  44. Bogue, S.; Paleti, R.; Balan, L. A modified rank ordered logit model to analyze injury severity of occupants in multivehicle crashes. Anal. Methods Accid. Res. 2017, 14, 22–40. [Google Scholar] [CrossRef]
  45. Islam, S.; Jones, S.L.; Dye, D. Comprehensive analysis of single- and multi-vehicle large truck at-fault crashes on rural and urban roadways in Alabama. Accid. Anal. Prev. 2014, 67, 148–158. [Google Scholar] [CrossRef]
  46. Sharafeldin, M.; Farid, A.; Ksaibati, K. Injury severity analysis of rear-end crashes at signalized intersections. Sustainability 2022, 14, 13858. [Google Scholar] [CrossRef]
  47. Adanu, E.K.; Lidbe, A.; Tedla, E.; Jones, S. Factors associated with driver injury severity of lane-changing crashes involving younger and older drivers. Accid. Anal. Prev. 2021, 149, 105867. [Google Scholar] [CrossRef] [PubMed]
  48. Lee, D.; Guldmann, J.-M.; von Rabenau, B. Impact of Driver’s Age and Gender, Built Environment, and Road Conditions on Crash Severity: A Logit Modeling Approach. Int. J. Environ. Res. Public Health 2023, 20, 2338. [Google Scholar] [CrossRef] [PubMed]
  49. Behnood, A.; Mannering, F.L. The Temporal Stability of Factors Affecting Driver-Injury Severities in Single-Vehicle Crashes: Some Empirical Evidence. Anal. Methods Accid. Res. 2015, 8, 7–32. [Google Scholar] [CrossRef]
  50. Al-Bdairi, N.S.S.; Behnood, A. Assessment of Temporal Stability in Risk Factors of Crashes at Horizontal Curves on Rural Two-Lane Undivided Highways. J. Saf. Res. 2021, 76, 205–217. [Google Scholar] [CrossRef]
Figure 1. Conceptual framework of the research methodology.
Figure 1. Conceptual framework of the research methodology.
Futuretransp 05 00146 g001
Figure 2. Heatmap of regression coefficients for crash severity.
Figure 2. Heatmap of regression coefficients for crash severity.
Futuretransp 05 00146 g002
Table 1. Twenty Random Records from the Prepared Dataset from HSIS.
Table 1. Twenty Random Records from the Prepared Dataset from HSIS.
Driver
Sex
Vehicle
Year
Hour
Class
SeverityLightNo.
Vehicles
No.
Lanes
Surface
Type
TerrainAADTRural/
Urban
SeasonCrash
Type
Age
Group
M2000–20055PDODark2+8PCCF125k–175kUWinterHit_Object25–65
F2000–20053PDODark3+8PCCF+250kUSpringRear_End25–65
F+20155PDODark3+8PCCF+250kUWinterRear_End25–65
F+20152NotPDODaylight2−5ACR−125kUWinterRear_End25–65
M2010–20153NotPDODark+3+8PCCF175k–250kUFallSideswipe25–65
F−20002NotPDODaylight16–7ACF−125kUFallHit_Object+65
M+20154PDODaylight2+8ACR−125kRSpringBroadside−25
F+20154NotPDODaylight1−5ACR−125kRFallHit_Object25–65
F2005–20102NotPDODaylight2−5ACF−125kRFallRear_End+65
M2000–20054PDODaylight3−5ACR−125kUSpringSideswipe25–65
M2010–20153PDODark3+8ACF175k–250kUSpringRear_End25–65
M2000–20055PDODark1−5ACF−125kRFallHit_Object−25
F2000–20055NotPDODaylight2−5ACF−125kUWinterSideswipe25–65
F−20002PDODaylight2−5ACM−125kRFallSideswipe25–65
M2005–20101PDODark1+8PCCR−125kRWinterHit_Object25–65
F2010–20155PDODark2+8PCCF175k–250kUWinterRear_End25–65
M−20004PDODaylight2+8PCCF175k–250kUWinterRear_End25–65
M2005–20104PDODark1+8PCCM−125kUWinterHit_Object−25
M+20154PDODark2+8PCCF+250kUWinterSideswipe25–65
M−20002PDODaylight36–7ACR−125kUWinterBroadside25–65
Table 2. Frequency of Each Level of Variables for the Prepared Dataset.
Table 2. Frequency of Each Level of Variables for the Prepared Dataset.
VariableFrequency
Driver SexF: 8168M: 15,074
Vehicle Year−2000: 34662000–2005: 52162005–2010: 59222010–2015: 5631+2015: 3007
Hour Class1: 37362: 65613: 32994: 42745: 5372
SeverityNotPDO: 8544PDO: 14,698
LightDark: 11,420Daylight: 11,822
No. Vehicles1: 64302: 11,9363: 3606+3: 1270
No. Lanes−5: 71046–7: 4131+8: 12,007
Surface TypeAC: 10,042PCC:13,200
TerrainF: 14,707M: 2467R: 6068
AADT−125K: 9910125k–175k:4050125k–175k: 4050175k–250k: 6265+250k: 3017
Rural/UrbanR: 4164U:19,078
SeasonFall: 4607Spring: 5431Summer: 734Winter: 12,470
Crash TypeBroadside: 2028Rear End: 8975Hit Object: 6517Sideswipe: 4316Head On: 600Overturned: 806
Age Group−25: 558425–65: 16,417+65: 1241
Table 3. Goodness of Fit Evaluation Criteria for Logistic Regression Models.
Table 3. Goodness of Fit Evaluation Criteria for Logistic Regression Models.
ModelDriver SexCrash TypenDevianceAICMcFadden R2R2MLR2CUAUC
1MRear_End55367180.4117204.4110.0060.0080.0110.559
2MHit_Object43115288.7385308.7380.0180.0220.0310.576
3FRear_End34394555.0684569.0680.0070.0090.0120.559
4MSideswipe29403215.3363243.3360.0470.0530.0770.638
5FHit_Object22062884.0482918.0480.0190.0250.0340.583
6FSideswipe13761670.8681688.8680.0100.0120.0180.565
7MBroadside13271803.1421823.1420.0200.0270.0360.597
8FBroadside701957.696969.6960.0090.0130.0170.558
9MOverturned563763.230775.2300.0210.0290.0380.572
10MHead_On397450.951470.9510.1370.1640.2250.709
11FOverturned243309.678317.6780.0300.0390.0530.571
12FHead_On203228.520258.5200.1450.1740.2380.748
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Naseralavi, S.; Soltanirad, M.; Ranjbar, E.; Lucero, M.; Baghersad, M.; Piri, M.; Hassan Zada, M.J.; Mazaheri, A. Modeling the Severity of Crashes in Rainy Weather by Driver Gender and Crash Type. Future Transp. 2025, 5, 146. https://doi.org/10.3390/futuretransp5040146

AMA Style

Naseralavi S, Soltanirad M, Ranjbar E, Lucero M, Baghersad M, Piri M, Hassan Zada MJ, Mazaheri A. Modeling the Severity of Crashes in Rainy Weather by Driver Gender and Crash Type. Future Transportation. 2025; 5(4):146. https://doi.org/10.3390/futuretransp5040146

Chicago/Turabian Style

Naseralavi, Saber, Mohammad Soltanirad, Erfan Ranjbar, Martin Lucero, Mahdi Baghersad, Mehran Piri, Mohammad Javad Hassan Zada, and Akram Mazaheri. 2025. "Modeling the Severity of Crashes in Rainy Weather by Driver Gender and Crash Type" Future Transportation 5, no. 4: 146. https://doi.org/10.3390/futuretransp5040146

APA Style

Naseralavi, S., Soltanirad, M., Ranjbar, E., Lucero, M., Baghersad, M., Piri, M., Hassan Zada, M. J., & Mazaheri, A. (2025). Modeling the Severity of Crashes in Rainy Weather by Driver Gender and Crash Type. Future Transportation, 5(4), 146. https://doi.org/10.3390/futuretransp5040146

Article Metrics

Back to TopTop