1. Introduction
Road traffic crashes are a major global public health and socioeconomic challenge, causing approximately 1.19 million deaths and millions of injuries annually [
1]. Low- and middle-income countries (LMICs) bear a disproportionate burden, accounting for over 90% of fatalities despite having fewer vehicles. Urban intersections are among the most hazardous locations due to the convergence of vehicles, pedestrians, and cyclists, creating multiple conflict points and a high potential for severe crashes [
2,
3]. Understanding the determinants of crash severity at intersections is critical for improving road safety and achieving Sustainable Development Goals.
In LMICs, intersections often feature mixed traffic, where motorized and non-motorized road users share limited space with minimal separation [
4,
5]. This heterogeneity increases crash frequency and severity, particularly in cities with unplanned growth, informal traffic behaviour, and limited pedestrian facilities [
6,
7,
8]. Vulnerable road users, such as pedestrians and cyclists, are disproportionately affected, and intersections in these contexts are often poorly designed or inadequately controlled, exacerbating the risk of severe collisions [
9].
Crash severity is influenced by a combination of human, environmental, and infrastructure-related factors. Driver characteristics, including age, gender, and risk-taking behaviour, are well-established predictors [
10,
11]. Environmental conditions, such as lighting, weather, and pavement state, also play a critical role [
12,
13,
14]. Road infrastructure elements including lane width, shoulder provision, medians, signage, and intersection geometry can mitigate or exacerbate crash severity [
6,
15,
16,
17,
18]. Poorly designed intersections lacking signals, markings, or channelization have consistently been associated with higher severity outcomes.
Interactions between environmental and geometric factors further influence crash risk. For example, inadequate lighting or wet pavement may amplify the negative effects of narrow lanes or insufficient shoulders [
19,
20,
21]. Most evidence comes from high-income countries with homogeneous traffic, limiting applicability in LMIC contexts where lane discipline is weak and traffic is heterogeneous [
22,
23]. Empirical studies that jointly assess infrastructure and environmental determinants in mixed-traffic LMIC intersections remain scarce [
24,
25], highlighting the need for context-specific analyses.
This study addresses this gap by examining the impact of road infrastructure and environmental factors on crash severity at intersections in Cameroon, a representative LMIC urban setting with mixed traffic [
26]. Using police-reported crash data collected over four and a half years, binary probit and logistic regression models estimate the likelihood of severe crashes based on roadway and environmental attributes [
27,
28]. The findings aim to inform policymakers and urban planners on targeted interventions, such as lane redesign, surface maintenance, and signal optimization, to reduce crash severity. The remainder of the paper is organized as follows:
Section 2 reviews relevant literature on infrastructure and environmental determinants;
Section 3 presents the methodology and data;
Section 4 discusses the results; and
Section 5 concludes with policy implications and recommendations for improving intersection safety in LMIC cities.
4. Data
4.1. Study Area and Context
This study focuses on Douala and Yaoundé, Cameroon’s largest cities and primary economic hubs. Both cities are undergoing rapid urbanization, leading to increasing travel demand, high traffic volumes, and recurrent congestion. These conditions, coupled with mixed traffic flows dominated by cars, motorcycles, and pedestrians, contribute to a high incidence of road crashes.
Travel demand in the two cities is substantial but varies in composition. In Yaoundé, approximately 8 million daily trips are generated, with cars accounting for 48.4%, pedestrians for 35%, motorcycles for 14.2%, and buses for 2.4% [
67]. In Douala, about 6.9 million daily trips are generated, of which 17% are by car, 35% by pedestrians, 44% by motorcycles, 1% by bus, and 3% by other modes [
26].
Despite these estimates, comprehensive and regularly updated traffic data remain unavailable. Permanent traffic sensors are lacking, and systematic data collection at intersections or along corridors is rarely conducted.
The overall transport and mobility system is inefficient, characterized by limited public transport options, irregular and uncomfortable services, severe congestion, and rising pollution levels [
67]. Road safety is a pressing concern, with crashes causing thousands of deaths and serious injuries annually in both cities [
26,
67]. The road network consists of arterial roads, secondary roads, and local streets, but official statistics on traffic flow, vehicle composition, and intersection characteristics are largely missing.
4.2. Data Collection and Cleaning
Four years and six months of crash data from Douala and Yaoundé were used in this study, covering the period from January 2021 to December 2022 for Douala and from January 2023 to June 2025 for Yaoundé. The data were obtained from the General Delegation for National Security (DGSN) of Cameroon, specifically from the central police headquarters of both cities. Crashes were reported to the police, who prepared official crash reports that were subsequently compiled into Excel spreadsheets by road safety experts.
A two-stage approach was adopted to address missing values. First, missing information on intersection location was completed by searching for the intersection names provided in the police reports on web-based mapping platforms such as Google Maps and Google Earth, from which geographic coordinates were extracted. Second, missing details on road characteristics, such as the presence of shoulders or delineation, were retrieved from the same sources. Road variables were then categorized for analysis: some were binary (e.g., presence or absence of shoulders), while others were classified into multiple categories (e.g., shoulder width).
After removing non-essential records, specifically crashes that occurred outside intersections or at locations that could not be clearly identified, the final dataset comprised 1242 crashes. The dataset was divided into two subsets: severe crashes and non-severe crashes.
4.3. Data Description
The dataset comprised 1242 intersection crashes, of which 173 (13.9%) were classified as severe and 1069 (86.1%) as non-severe. Severe crashes included possible, minor, incapacitating, and fatal injuries, while non-severe crashes consisted of property-damage-only (PDO) events. Overall, PDO crashes represented 86.1% of the sample, while injury and fatal crashes accounted for 13.9%.
In addition to severity, the dataset included environmental and road-related variables. For example, crash time was categorized by time of day, and street lighting conditions were coded as present or absent, with 15.3% of crashes occurring at intersections without lighting. Each record in the dataset represented a unique intersection crash, with its corresponding attributes systematically classified on a categorical or binary basis.
In SPSS, the dependent variable was defined as crash severity (1 = severe, 0 = non-severe), while the independent variables were those listed in
Table 1.
Table 2 summarizes the descriptive statistics of the dataset.
6. Discussions
The binary probit and logistic regression analyses provide complementary insights into the determinants of crash severity at intersections in Yaoundé and Douala. Both models consistently identified time interval, midblock configuration, and number of lanes as significant predictors (
p-value < 0.05), with shoulder rumble strips and poor road conditions emerging as marginally significant (
p-value < 0.10). The probit model quantified changes in the probability of severe crashes, whereas the logistic model provided odds ratios that are more intuitive for safety policy and intervention design [
27,
69]. This dual modelling approach thus combines the interpretive strength of marginal probabilities with the communicative clarity of odds ratios, bridging empirical prediction with policy translation, an essential step for evidence-based safety management in rapidly motorising LMIC contexts.
Time of day emerged as a major factor. The probit model showed that late-night/early-morning crashes (00:00–05:59) increased severity probability by 13.5%, while midday, afternoon, and evening periods reduced it by 10.2%, 9.5%, and 11.1%, respectively. The logistic model confirmed this, showing late-night crashes were 2.57 times more likely to be severe, while midday, afternoon, and evening crashes reduced odds by 58%, 52%, and 56%, respectively. These findings reinforce global evidence linking night-time crashes with higher severity due to reduced visibility, fatigue, or risky behaviours such as speeding and alcohol use [
12,
31]. Comparable studies in HICs [
11,
73] also report elevated night-time risk, yet LMIC settings often experience amplified effects due to weaker enforcement and limited illumination, highlighting how contextual risk multipliers shape crash severity outcomes. This pattern suggests that targeted night-time enforcement and roadway illumination programs could yield measurable reductions in severe crash probability, directly translating the estimated marginal effects into operational safety strategies.
Midblock design also significantly influenced severity. The probit model indicated that centre lines reduced severity probability by 3.9% compared to physical separation, while central hatching reduced it by 14.8% (though not statistically significant). Logistic results echoed this, with centre lines reducing severity odds by 71% (
OR = 0.295,
p-value < 0.05). This suggests that less complex road designs may lower severity by reducing speeds and simplifying manoeuvres, consistent with prior studies [
22]. Similar relationships were found by [
11] in Florida and [
70] in Ghana where simplified intersection geometry and delineation improved crash outcomes. These parallels reinforce that while geometric countermeasures are universally beneficial, their effectiveness depends on implementation fidelity and driver adaptation to mixed-traffic conditions. This also contributes theoretically by supporting the geometric risk–behavioural adaptation nexus, which posits that design simplicity can modulate human error probability under mixed-traffic dynamics.
The number of lanes was the most influential geometric factor. Probit results showed single-lane and two-lane roads increased severity probability by 21.5% and 9.1%, respectively, compared to three-lane roads. Logistic regression confirmed this, showing 5.38 and 3.90 times higher odds of severe crashes on single- and two-lane roads. These results align with earlier research demonstrating that narrower facilities constrain manoeuvring space and exacerbate collision impact [
11,
42]. In LMIC contexts, such as Ghana [
70] and Ethiopia [
31], this pattern is often compounded by mixed traffic and poor delineation, further elevating crash risk. In practical terms, the 21.5% marginal increase in severity probability offers a quantifiable design threshold for prioritising lane expansion or shoulder widening at high-risk intersections. These findings suggest that intersection expansion or lane channelisation could be cost-effective interventions, particularly when guided by marginal effect estimates that quantify how each geometric constraint translates into crash severity probability.
Shoulder rumble strips and road conditions showed marginal protective effects. Rumble strips reduced severity probability by 16.9% (probit) and odds by 29.9% (
OR = 0.701), while poor road conditions reduced severity probability by 9.4% (probit) and odds by 58.4% (
OR = 0.416). These findings suggest rumble strips may help drivers correct deviations, while poor surfaces might encourage cautious driving. However, the weak significance signals the need for more data to confirm these trends [
41]. Comparable protective effects of rumble strips have been reported in the U.S. [
74], while contrasting findings in Malaysia [
41] indicate that road surface quality interacts strongly with enforcement levels and driver caution. Hence, these marginally significant variables may represent latent behavioural adaptations rather than direct engineering effects. Nonetheless, their inclusion broadens the model’s explanatory scope, suggesting that even subtle tactile feedback mechanisms can induce corrective driver behaviour under uncertain roadway conditions.
By contrast, delineation, street lighting, crosswalks, and sidewalks were not statistically significant. This may reflect limited variability in the dataset (e.g., most intersections had lighting) or insufficient statistical power due to the imbalance between severe and non-severe crashes (13.9% severe). Similar findings have been noted in LMIC settings, where the effects of environmental factors are highly context-dependent [
51]. In contrast, HIC studies often report stronger effects for such [
11,
73] suggesting that contextual moderation through enforcement, driver compliance, and infrastructure quality should be modelled explicitly in future research. This underscores that environmental variables, particularly lighting condition and time of day, operate synergistically: while lighting was not directly significant, its influence is captured indirectly through the pronounced night-time effect, reinforcing the critical role of visibility in shaping crash outcomes.
From a theoretical perspective, the convergence of probit and logit estimates supports behavioural adaptation and geometric risk theory, suggesting that crash severity results from the joint influence of infrastructure-induced exposure and driver decision-making constraints. The quantified marginal effects offer direct translation into safety design, e.g., a 21.5% increase in severity probability for single-lane approaches could justify re-channelisation or added shoulders at high-volume intersections. This study therefore advances theoretical understanding by empirically demonstrating how structural road geometry interacts with temporal exposure to produce non-linear severity effects in mixed-traffic systems.
Policy-wise, these findings reinforce the importance of integrating marginal effect outputs into prioritisation tools for intersection upgrades. Decision-makers can use the probability-based estimates to rank sites by expected severity reduction potential rather than frequency alone a shift consistent with the Safe System approach promoted in both HIC and LMIC policy frameworks. Such translation of model outputs into design priorities marks a methodological contribution, as it demonstrates how econometric evidence can directly inform spatially targeted interventions and cost-effective policy sequencing.
From a modelling standpoint, the refined model’s improved AIC (321.78 vs. 481.91) and acceptable deviance ratio (≈1.63) illustrate that simplification enhanced fit while maintaining theoretical coherence. The logistic model’s modest pseudo-R2 values (Cox & Snell R2 = 0.066, Nagelkerke R2 = 0.118) are consistent with expectations for crash-severity data, confirming that explanatory clarity and parsimony were prioritised over overfitting. Future model improvements could incorporate hierarchical structures, exposure measures, or Bayesian updating to capture unobserved heterogeneity and strengthen predictive accuracy. In addition, testing interaction terms (e.g., time interval × number of lanes) could reveal compound risk dynamics, while adopting mixed-effects or penalised regression frameworks may enhance stability in small or imbalanced samples.
Overall, the consistency across models reinforces confidence in the key results while underscoring the need for richer datasets and advanced methods to capture the complex interplay of infrastructure, behavioural, and environmental factors. By combining robust model specification with interpretable marginal effects, this study contributes a replicable framework for LMIC crash-severity analysis that informs both academic theory and practical design guidance. In essence, it bridges the gap between quantitative inference and actionable safety design, aligning empirical modelling with real-world decision-making.
7. Conclusions
This study analysed 1242 intersection crashes in Douala and Yaoundé (2021–2025) using binary probit and logistic regression models to identify determinants of crash severity. Both models consistently highlighted time interval, midblock design, and number of lanes as significant predictors, with shoulder rumble strips and road condition showing marginal effects. The consistency between models strengthens the robustness of the findings and demonstrates that even with modest pseudo-R2 values, well-specified models can yield meaningful and policy-relevant insights. Together, these models provide complementary interpretive and predictive perspectives where the probit model quantifies risk changes in probability terms and the logistic model contextualises them within policy-relevant odds ratios.
The results demonstrate that single-lane roads substantially increase severity risk (probit: 21.5% probability; logistic: OR = 5.38), while late-night and early-morning crashes (00:00–05:59) are particularly dangerous (probit: 13.5%; logistic: OR = 2.57). Centre lines were associated with safer outcomes compared to physical separation (probit: –3.9%; logistic: OR = 0.295). Rumble strips and poor road conditions also appeared to reduce severity, although these effects were only marginally significant (probit: –16.9%; logistic: OR = 0.701 and probit: –9.4%; logistic: OR = 0.416, respectively) and require further study. Environmental factors, specifically lighting condition and time of day emerged as critical contextual moderators of severity outcomes. While lighting itself was statistically insignificant, its interplay with night-time crashes indicates that inadequate illumination magnifies crash severity by reducing visibility and driver response time, reinforcing the importance of continuous lighting and targeted night-time enforcement.
Collectively, these results highlight that geometric design and temporal exposure remain the most influential determinants of crash severity, yet their impacts are modulated by environmental quality. Poor illumination, uneven surfaces, and faded markings likely exacerbate crash risks during high-vulnerability periods such as late-night travel. This aligns with prior LMIC studies [
31,
70,
75] and indicates that environmental improvements, better lighting, resurfacing, and delineation maintenance should complement geometric and behavioural interventions. Thus, addressing temporal and geometric risks without improving the environmental context may yield only partial safety benefits.
These findings highlight the central role of road geometry and temporal factors in shaping crash outcomes in LMIC urban contexts, while also pointing to concrete strategies for intervention. From a policy perspective, translating the estimated marginal effects into design priorities offers clear pathways for action. Road widening and traffic calming on narrow roads, improved night-time visibility, and the adoption of simpler midblock designs such as centre lines emerge as practical measures to reduce crash severity. The modest yet consistent protective effects of rumble strips suggest that low-cost tactile and auditory countermeasures can meaningfully mitigate driver error. By translating statistical results into actionable guidance, this study provides policymakers in Cameroon and comparable LMICs with evidence-based directions for reducing crash severity. Moreover, strengthening data systems and advancing modelling approaches remain critical to developing more precise, context-sensitive strategies that align with global road safety goals. This synthesis of marginal effects and model outcomes thus offers both strategic and operational contributions to road safety management.
The study also has limitations. The probit model showed signs of overdispersion, and the logistic model’s modest explanatory power (Nagelkerke R2 = 0.118) suggests unobserved factors such as vehicle speed, driver behaviour, and traffic volume remain influential. Additionally, environmental exposure variables such as rainfall intensity, illumination quality, and intersection visibility were not directly measured, which may have understated their influence on crash severity. Data imbalance (13.9% severe crashes) constrained model sensitivity, especially for rare categories, while quasi-complete separation required the exclusion of paved shoulder variables. Nonetheless, these limitations point to promising avenues for future model enrichment rather than undermining the interpretive strength of current findings.
From a methodological standpoint, the refined model’s parsimony over the initial model indicates that simplified structures with theoretically grounded predictors may enhance interpretability and external validity, even at the cost of marginally lower
pseudo-R2 values [
27,
71,
76]. This emphasizes that model refinement should prioritise explanatory clarity and policy applicability over raw fit metrics. Such parsimony ensures that model outputs remain actionable, scalable, and transferable across urban LMIC contexts.
Future research should integrate richer datasets including driver, vehicle, and exposure variables, apply interaction terms (such as time × lanes), and adopt advanced methods such as penalized regression or machine learning to improve predictive accuracy [
30,
51,
77]. Incorporating continuous environmental indicators (e.g., light intensity, surface friction, and weather data) could further clarify how environmental degradation amplifies or mitigates crash severity risks. In addition, it would also be valuable to integrate traffic conflict analysis in place of, or alongside, crash data for earlier detection of risk patterns, supporting a proactive approach to safety analysis and complementing traditional crash-based modelling. These extensions will deepen both theoretical understanding and practical usability of crash-severity models in resource-limited contexts.
This study contributes both empirical and methodological value: empirically, by quantifying how geometric and temporal features interact with environmental conditions to influence crash severity; and methodologically, by demonstrating that balanced probit–logit modelling provides interpretable, transferable insights even in data-constrained LMIC contexts. Ultimately, the study provides a replicable analytical framework that advances both theory and practice linking statistical modelling to targeted, evidence-based road safety design and policy implementation.