Analysis of Injury Severity in Elderly Pedestrian Traffic Accidents Based on XGBoost
Abstract
Featured Application
Abstract
1. Introduction
2. Data and Methods
2.1. Data Sources and Preprocessing
2.2. Model Construction and Evaluation Metrics
2.3. SHAP Attribution Analysis Method
3. Results
3.1. Model Evaluation Results
3.2. Global Prediction Results
3.3. Local Prediction Results
4. Discussion
4.1. Global Results Analysis
4.2. Local Results Analysis
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Azami-Aghdash, S.; Aghaei, M.H.; Sadeghi-Bazarghani, H. Epidemiology of road traffic injuries among elderly people; a systematic review and meta-analysis. Bull. Emerg. Trauma 2018, 6, 279. [Google Scholar] [CrossRef]
- Furtado, B.M.A.S.M.; Lima, A.C.B.d.; Ferreira, R.C.G. Road traffic accidents involving elderly people: An integrative review. Rev. Bras. Geriatr. E Gerontol. 2019, 22, e190053. [Google Scholar] [CrossRef]
- World Health Organization. World Health Statistics 2025: Monitoring Health for the SDGs, Sustainable Development Goals; World Health Organization: Geneva, Switzerland, 2025. [Google Scholar]
- Fang, T.; Xu, F.; Zou, Z. Causal Factors in Elderly Pedestrian Traffic Injuries Based on Association Analysis. Appl. Sci. 2025, 15, 1170. [Google Scholar] [CrossRef]
- Nikolaou, P.; Dimitriou, L. Evaluation of road safety policies performance across Europe: Results from benchmark analysis for a decade. Transp. Res. Part A Policy Pract. 2018, 116, 232–246. [Google Scholar] [CrossRef]
- Levi, S.; De Leonardis, D.; Antin, J.; Angel, L. Identifying Countermeasure Strategies to Increase Safety of Older Pedestrians; National Highway Traffic Safety Administration: Washington, DC, USA, 2013.
- Sánta, E.; Szűcs, P.; Patocskai, G.; Lakatos, I. Prevalence and Characteristics of Traffic Accidents Endangering Vulnerable Pedestrians in Hungary. Eng. Proc. 2024, 79, 94. [Google Scholar]
- Hilbe, J.M. Negative Binomial Regression; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
- Famoye, F. On the bivariate negative binomial regression model. J. Appl. Stat. 2010, 37, 969–981. [Google Scholar] [CrossRef]
- Yang, S.; Berdine, G. The negative binomial regression. Southwest Respir. Crit. Care Chron. 2015, 3, 50–54. [Google Scholar]
- Hensher, D.A.; Greene, W.H. The mixed logit model: The state of practice. Transportation 2003, 30, 133–176. [Google Scholar] [CrossRef]
- Demaris, A. Logit modeling: Practical Applications; Sage: London, UK, 1992; Volume 86. [Google Scholar]
- Hausman, J.; McFadden, D. Specification tests for the multinomial logit model. Econom. J. Econom. Soc. 1984, 52, 1219–1240. [Google Scholar] [CrossRef]
- Naghawi, H. Negative binomial regression model for road crash severity prediction. Mod. Appl. Sci. 2018, 12, 38. [Google Scholar] [CrossRef]
- Sze, N.-N.; Wong, S. Diagnostic analysis of the logistic model for pedestrian injury severity in traffic crashes. Accid. Anal. Prev. 2007, 39, 1267–1278. [Google Scholar] [CrossRef]
- Xie, Y. Values and limitations of statistical models. Res. Soc. Stratif. Mobil. 2011, 29, 343–349. [Google Scholar] [CrossRef]
- Sarrias, M. Individual-specific posterior distributions from Mixed Logit models: Properties, limitations and diagnostic checks. J. Choice Model. 2020, 36, 100224. [Google Scholar] [CrossRef]
- Hossain, M.Z. A review on some alternative specifications of the logit model. J. Bus. Econ. Res. (JBER) 2009, 7, 15–24. [Google Scholar] [CrossRef]
- Rosenthal, H. The limitations of log-linear analysis. Contemp. Sociol. 1980, 9, 207–212. [Google Scholar] [CrossRef]
- Obasi, I.C.; Benson, C. Evaluating the effectiveness of machine learning techniques in forecasting the severity of traffic accidents. Heliyon 2023, 9, e18812. [Google Scholar] [CrossRef] [PubMed]
- AlMamlook, R.E.; Kwayu, K.M.; Alkasisbeh, M.R.; Frefer, A.A. Comparison of machine learning algorithms for predicting traffic accident severity. In Proceedings of the 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), Amman, Jordan, 9–11 April 2019; pp. 272–276. [Google Scholar]
- Infante, P.; Jacinto, G.; Afonso, A.; Rego, L.; Nogueira, V.; Quaresma, P.; Saias, J.; Santos, D.; Nogueira, P.; Silva, M. Comparison of statistical and machine-learning models on road traffic accident severity classification. Computers 2022, 11, 80. [Google Scholar] [CrossRef]
- Yan, M.; Shen, Y. Traffic accident severity prediction based on random forest. Sustainability 2022, 14, 1729. [Google Scholar] [CrossRef]
- Wang, J.; Ma, S.; Jiao, P.; Ji, L.; Sun, X.; Lu, H. Analyzing the risk factors of traffic accident severity using a combination of random forest and association rules. Appl. Sci. 2023, 13, 8559. [Google Scholar] [CrossRef]
- Yang, J.; Han, S.; Chen, Y. Prediction of traffic accident severity based on random forest. J. Adv. Transp. 2023, 2023, 7641472. [Google Scholar] [CrossRef]
- Pérez-Sala, L.; Curado, M.; Tortosa, L.; Vicent, J.F. Deep learning model of convolutional neural networks powered by a genetic algorithm for prevention of traffic accidents severity. Chaos Solitons Fractals 2023, 169, 113245. [Google Scholar] [CrossRef]
- Shaik, M.E.; Islam, M.M.; Hossain, Q.S. A review on neural network techniques for the prediction of road traffic accident severity. Asian Transp. Stud. 2021, 7, 100040. [Google Scholar] [CrossRef]
- Rahim, M.A.; Hassan, H.M. A deep learning based traffic crash severity prediction framework. Accid. Anal. Prev. 2021, 154, 106090. [Google Scholar] [CrossRef] [PubMed]
- Habibzadeh, M.; Ayar, P.; Mirabimoghaddam, M.H.; Ameri, M.; Sadat Haghighi, S.M. Analysis of the severity of accidents on rural roads using statistical and artificial neural network methods. J. Adv. Transp. 2023, 2023, 8089395. [Google Scholar] [CrossRef]
- Yang, Y.; Wang, K.; Yuan, Z.; Liu, D. Predicting freeway traffic crash severity using XGBoost-Bayesian network model with consideration of features interaction. J. Adv. Transp. 2022, 2022, 4257865. [Google Scholar] [CrossRef]
- Li, K.; Xu, H.; Liu, X. Analysis and visualization of accidents severity based on LightGBM-TPE. Chaos Solitons Fractals 2022, 157, 111987. [Google Scholar] [CrossRef]
- Zahid, M.; Habib, M.F.; Ijaz, M.; Ameer, I.; Ullah, I.; Ahmed, T.; He, Z. Factors affecting injury severity in motorcycle crashes: Different age groups analysis using Catboost and SHAP techniques. Traffic Inj. Prev. 2024, 25, 472–481. [Google Scholar] [CrossRef]
- Xue, Z.; Yao, T. Enhancing occluded pedestrian re-identification with the MotionBlur data augmentation module. Mechatron. Intell. Transp. Syst 2024, 3, 73–84. [Google Scholar] [CrossRef]
- Guo, M.; Yuan, Z.; Janson, B.; Peng, Y.; Yang, Y.; Wang, W. Older pedestrian traffic crashes severity analysis based on an emerging machine learning XGBoost. Sustainability 2021, 13, 926. [Google Scholar] [CrossRef]
- Wu, S.; Yuan, Q.; Yan, Z.; Xu, Q. Analyzing accident injury severity via an extreme gradient boosting (XGBoost) model. J. Adv. Transp. 2021, 2021, 3771640. [Google Scholar] [CrossRef]
- Ma, J.; Ding, Y.; Cheng, J.C.; Tan, Y.; Gan, V.J.; Zhang, J. Analyzing the leading causes of traffic fatalities using XGBoost and grid-based analysis: A city management perspective. IEEE Access 2019, 7, 148059–148072. [Google Scholar] [CrossRef]
- Jamal, A.; Zahid, M.; Tauhidur Rahman, M.; Al-Ahmadi, H.M.; Almoshaogeh, M.; Farooq, D.; Ahmad, M. Injury severity prediction of traffic crashes with ensemble machine learning techniques: A comparative study. Int. J. Inj. Control Saf. Promot. 2021, 28, 408–427. [Google Scholar] [CrossRef]
- Chen, H.; Chen, H.; Liu, Z.; Sun, X.; Zhou, R. Analysis of factors affecting the severity of automated vehicle crashes using XGBoost model combining POI data. J. Adv. Transp. 2020, 2020, 8881545. [Google Scholar] [CrossRef]
- Jiang, F.; Ma, J. A comprehensive study of macro factors related to traffic fatality rates by XGBoost-based model and GIS techniques. Accid. Anal. Prev. 2021, 163, 106431. [Google Scholar] [CrossRef]
- Parsa, A.B.; Movahedi, A.; Taghipour, H.; Derrible, S.; Mohammadian, A.K. Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. Accid. Anal. Prev. 2020, 136, 105405. [Google Scholar] [CrossRef] [PubMed]
- Laphrom, W.; Se, C.; Champahom, T.; Jomnonkwao, S.; Wipulanusatd, W.; Satiennam, T.; Ratanavaraha, V. XGBoost-SHAP and unobserved heterogeneity modelling of temporal multivehicle truck-involved crash severity patterns. Civ. Eng. J. 2024, 10, 1890–1908. [Google Scholar] [CrossRef]
- Ristić, B.; Bogdanović, V.; Stević, Ž. Urban evaluation of pedestrian crossings based on Start-Up Time using the MEREC-MARCOS Model. J. Urban Dev. Manag. 2024, 3, 34–42. [Google Scholar] [CrossRef]
- Li, Z. Extracting spatial effects from machine learning model using local interpretation method: An example of SHAP and XGBoost. Comput. Environ. Urban Syst. 2022, 96, 101845. [Google Scholar] [CrossRef]
- Uddin, M.; Ahmed, F. Pedestrian injury severity analysis in motor vehicle crashes in Ohio. Safety 2018, 4, 20. [Google Scholar] [CrossRef]
- Islam, M. An exploratory analysis of the effects of speed limits on pedestrian injury severities in vehicle-pedestrian crashes. J. Transp. Health 2023, 28, 101561. [Google Scholar] [CrossRef]
- Hussain, Q.; Feng, H.; Grzebieta, R.; Brijs, T.; Olivier, J. The relationship between impact speed and the probability of pedestrian fatality during a vehicle-pedestrian crash: A systematic review and meta-analysis. Accid. Anal. Prev. 2019, 129, 241–249. [Google Scholar] [CrossRef] [PubMed]
- Droździel, P.; Tarkowski, S.; Rybicka, I.; Wrona, R. Drivers’ reaction time research in the conditions in the real traffic. Open Eng. 2020, 10, 35–47. [Google Scholar] [CrossRef]
- Törnros, J. Effect of driving speed on reaction time during motorway driving. Accid. Anal. Prev. 1995, 27, 435–442. [Google Scholar] [CrossRef]
- Fisa, R.; Musukuma, M.; Sampa, M.; Musonda, P.; Young, T. Effects of interventions for preventing road traffic crashes: An overview of systematic reviews. BMC Public Health 2022, 22, 513. [Google Scholar] [CrossRef] [PubMed]
- Febres, J.D.; Mariscal, M.Á.; Herrera, S.; García-Herrero, S. Pedestrians’ injury severity in traffic accidents in Spain: A pedestrian actions approach. Sustainability 2021, 13, 6439. [Google Scholar] [CrossRef]
- Matsui, Y.; Oikawa, S. Effect of A-Pillar Blind Spots on a Driver’s Pedestrian Visibility during Vehicle Turns at an Intersection. Stapp Car Crash J. 2024, 68, 14–30. [Google Scholar] [CrossRef]
- Zhang, G.; Yau, K.K.; Zhang, X. Analyzing fault and severity in pedestrian–motor vehicle accidents in China. Accid. Anal. Prev. 2014, 73, 141–150. [Google Scholar] [CrossRef]
- Haleem, K.; Alluri, P.; Gan, A. Analyzing pedestrian crash injury severity at signalized and non-signalized locations. Accid. Anal. Prev. 2015, 81, 14–23. [Google Scholar] [CrossRef]
- Wilmut, K.; Purcell, C. Why are older adults more at risk as pedestrians? A systematic review. Hum. Factors 2022, 64, 1269–1291. [Google Scholar] [CrossRef]
- Asadamraji, M.; Saffarzadeh, M.; Ross, V.; Borujerdian, A.; Ferdosi, T.; Sheikholeslami, S. A novel driver hazard perception sensitivity model based on drivers’ characteristics: A simulator study. Traffic Inj. Prev. 2019, 20, 492–497. [Google Scholar] [CrossRef]
Variable Category | Variable Name | Variable Classification |
---|---|---|
Elderly Pedestrian Factors | Age | 0: 60–64 years; 1: 65–69 years; 2: 70–74 years; 3: 75–79 years; 4: 80 years and above |
Height | 0: Less than 150 cm; 1: 150–154 cm; 2: 155–159 cm; 3: 160–164 cm; 4: 165–169 cm; 5: 170 cm and above | |
Weight | 0: Less than 50 kg; 1: 50–59 kg; 2: 60–69 kg; 3: 70–79 kg; 4: 80 kg and above | |
Gender | 0: Male; 1: Female | |
Health Status | 0: Healthy; 1: Diseased (e.g., hypertension, diabetes) | |
Main Injury Location | 0: Head; 1: Neck; 2: Trunk; 3: Limbs | |
Awareness of Vehicle | 0: Unaware of the vehicle; 1: Aware of the vehicle; 2: Unknown if aware of the vehicle (including cases of death or when the pedestrian cannot provide information) | |
Driver Factors | Age | 0: 18–25 years; 1: 26–35 years; 2: 36–45 years; 3: 46–55 years; 4: 56–65 years; 5: 65 years and above |
Gender | 0: Male; 1: Female | |
Driving Experience | 0: 1–5 years; 1: 6–10 years; 2: 11–20 years; 3: Over 20 years | |
Driver Status | 0: Fatigued driving; 1: Drunk driving; 2: Unwell driving; 3: Normal driving | |
Health Status | 0: Healthy; 1: Unhealthy | |
Awareness of Pedestrian | 0: Aware before collision; 1: Aware after collision; 2: Unaware; 3: Unknown if aware (including cases of death or when the driver cannot provide information) | |
Vehicle Factors | Vehicle Type | 0: Small sedan; 1: Small passenger vehicle; 2: Large bus and truck; 3: Other |
Collision Instant Speed | 0: Below 10 km/h; 1: 10–20 km/h; 2: 20–30 km/h; 3: 30–40 km/h; 4: 40–50 km/h; 5: 50–60 km/h; 6: 60–70 km/h; 7: 70–80 km/h; 8: 80 km/h and above | |
Collision Direction | 0: Head-on collision; 1: Side collision; 2: Rear-end collision | |
Road Factors | Road Type | 0: Expressway; 1: National road; 2: Ordinary road section; 3: Provincial road; 4: County road; 5: Rural road |
Road Condition | 0: Dry; 1: Damp; 2: Pooled water; 3: Icy; 4: Muddy; 5: Other | |
Traffic Control | 0: No traffic control; 1: Traffic signal lights; 2: Zebra crossing; 3: Other warning signs | |
Road Surface Material | 0: Asphalt road; 1: Cement road; 2: Gravel road; 3: Other | |
Signs and Markings | 0: Clear; 1: Fuzzy; 2: Missing | |
Road Structure | 0: Straight road; 1: Curved road; 2: Intersection; 3: Crossroad; 4: Ramp | |
Traffic Volume on the Accident Section | 0: Very heavy traffic; 1: Heavy traffic; 2: Light traffic with sufficient distance between vehicles; 3: Light traffic | |
Environmental Factors | Weather | 0: Sunny; 1: Rainy; 2: Snowy; 3: Hazy; 4: Hail; 5: Overcast; 6: Other |
Lighting Conditions | 0: Daytime; 1: Nighttime with street lighting; 2: Nighttime without street lighting; 3: Dawn; 4: Dusk | |
Workday | 0: Yes; 1: No | |
Morning and Evening Rush Hours | 0: Yes; 1: No | |
Holidays | 0: Yes; 1: No | |
Visibility | 0: Below 50 m; 1: 50–100 m; 2: 100–200 m; 3: Above 200 m | |
Blind Spot | 0: Yes; 1: No |
Parameter | Description | Setting Value |
---|---|---|
n_estimators | Number of weak classifiers | 200 |
max_depth | Maximum tree depth | 6 |
learning_rate | Learning rate | 0.08 |
subsample | Row sampling ratio | 0.8 |
colsample_bytree | Feature sampling ratio | 0.8 |
min_child_weight | Minimum leaf node weight | 3 |
gamma | Minimum loss reduction for node split | 0.1 |
reg_alpha | L1 regularization weight | 0.1 |
reg_lambda | L2 regularization weight | 1 |
objective | Classification objective function | multi:softmax |
random_state | Random seed | 42 |
Accident Category | Accuracy | Precision | Recall | F1 Score |
---|---|---|---|---|
Minor Injury | 0.86 | 0.83 | 0.89 | 0.86 |
Severe Injury | 0.88 | 0.81 | 0.88 | 0.84 |
Fatality | 0.84 | 0.84 | 0.84 | 0.84 |
Macro Average | 0.86 | 0.83 | 0.87 | 0.85 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, H.; Liang, G. Analysis of Injury Severity in Elderly Pedestrian Traffic Accidents Based on XGBoost. Appl. Sci. 2025, 15, 9909. https://doi.org/10.3390/app15189909
Wang H, Liang G. Analysis of Injury Severity in Elderly Pedestrian Traffic Accidents Based on XGBoost. Applied Sciences. 2025; 15(18):9909. https://doi.org/10.3390/app15189909
Chicago/Turabian StyleWang, Hongxiao, and Guohua Liang. 2025. "Analysis of Injury Severity in Elderly Pedestrian Traffic Accidents Based on XGBoost" Applied Sciences 15, no. 18: 9909. https://doi.org/10.3390/app15189909
APA StyleWang, H., & Liang, G. (2025). Analysis of Injury Severity in Elderly Pedestrian Traffic Accidents Based on XGBoost. Applied Sciences, 15(18), 9909. https://doi.org/10.3390/app15189909