Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach
Abstract
:1. Introduction
- (1)
- Cost-sensitive classification: apply cost-sensitive loss function in the evaluation of the learned system with different costs for different types of misclassification errors. For example, Riccardi et al. [15] proposed cost-sensitive AdaBoost for ordinal regression. The problem of cost-sensitive classification is how to determine the cost matrix without priori knowledge of the ordinal classification.
- (2)
- Ordinal binary decomposition: decompose the ordinal target variable into several binary variables, which are then estimated by single or multiple models. Our new ordinal classification method falls in this category. The problem of existing ordinal binary decomposition methods is the violation of rank monotonicity or rank consistency. Related methods and their drawbacks are introduced in the method section later in more detail.
- (3)
- Threshold model: extension of the regression model in which distances among the ordered classes are not pre-defined but estimated by finding the optimal thresholds dividing classes [16]. Li and Lin [17] proposed a general reduction framework to transform ordinal regression as a series of binary classification sub-problems and demonstrated that many threshold models and ordinal binary decomposition methods are equivalent.
- (1)
- To the authors’ knowledge, this is the first paper applying ordinal classification machine learning to predict traffic crash injury severity using real-world crash data.
- (2)
- We propose an ordinal classification machine learning method that satisfies rank monotonicity and rank consistency and takes advantage of probability calibration and the movement of optimal probability threshold to generate superior classification results compared to existing ordinal classification algorithms.
- (3)
- We test six severity category-combination strategies and find the best three-class combination plan.
2. Data Description
3. Methodology
3.1. Imbalanced Data Preprocessing
3.2. Ordinal Classification
3.2.1. Cumulative Binary Decomposition
3.2.2. One-vs-All Binary Decomposition
3.2.3. Existing Drawbacks
3.2.4. Proposed Method
3.3. Machine Learning Algorithms
3.4. Cross-Validation and Evaluation Metrics
3.5. Statistical Significance Test
- is the scores difference of two algorithms for the first fold of the i-th 2-fold cross-validation;
- is the scores difference of two algorithms for the second fold of the i-th 2-fold cross-validation;
- is the mean of scores difference for the first 2-fold cross-validation.
4. Results
4.1. Comparison of Classifiers
4.2. Comparison of Category-Combination and Sampling
4.3. Feature Importance
4.4. Comparison of Ordinal Classifications
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A
| No. | Variables | Code | Value | Severity | ||||
|---|---|---|---|---|---|---|---|---|
| NIC | COP | OVI | SI | KSI | ||||
| 1 | occupant: occupant type | 1 | Driver | 0 | 27,964 | 10,989 | 1904 | 461 | 
| 2 | Passenger | 80,474 | 13,674 | 4209 | 809 | 199 | ||
| 4 | Bicyclist | 0 | 0 | 2 | 0 | 0 | ||
| 5 | Other | 0 | 4 | 0 | 1 | 0 | ||
| 2 | seat: seating position | 0 | Other occupants | 1022 | 81 | 37 | 16 | 3 | 
| 1 | Driver | 42,623 | 34,360 | 12,728 | 2220 | 581 | ||
| 2~6 | Passengers | 33,206 | 6849 | 2301 | 449 | 68 | ||
| 7 | Station wagon rear | 1402 | 164 | 61 | 15 | 1 | ||
| 8 | Truck/van rear | 1121 | 102 | 44 | 8 | 2 | ||
| 9 | Position unknown | 1100 | 86 | 29 | 6 | 5 | ||
| 3 | collision: type of collision | 10 | Head-on | 883 | 1045 | 588 | 277 | 106 | 
| 11 | Sideswipe | 17,943 | 4244 | 1717 | 271 | 51 | ||
| 12 | Rear end | 43,690 | 23,685 | 3632 | 472 | 95 | ||
| 13 | Broadside | 5460 | 4436 | 1563 | 317 | 62 | ||
| 14 | Hit object | 9572 | 6093 | 4962 | 832 | 232 | ||
| 15 | Overturned | 1531 | 1829 | 2520 | 502 | 105 | ||
| 16 | Auto-pedestrian | 192 | 16 | 22 | 1 | 0 | ||
| 17 | Other | 981 | 187 | 147 | 37 | 9 | ||
| 4 | factor: first associated factor | 10 | Vehicle code violation | 3338 | 1981 | 2762 | 671 | 186 | 
| 14 | Vision obscurement | 100 | 63 | 39 | 1 | 1 | ||
| 15 | Inattention | 1438 | 747 | 462 | 68 | 4 | ||
| 16 | Stop and go traffic | 5056 | 2358 | 301 | 39 | 5 | ||
| 17 | Enter/leave ramp | 2163 | 994 | 310 | 32 | 7 | ||
| 18 | Previous collision | 928 | 493 | 154 | 37 | 10 | ||
| 19 | Unfamiliar with road | 119 | 35 | 36 | 11 | 1 | ||
| 20 | Defect vehicle equipment | 259 | 158 | 132 | 20 | 6 | ||
| 21 | Uninvolved Vehicle | 612 | 345 | 190 | 25 | 0 | ||
| 22 | Other | 327 | 198 | 135 | 27 | 10 | ||
| 23 | None apparent | 65,694 | 33,904 | 10,586 | 1765 | 427 | ||
| 24 | Runaway vehicle | 100 | 49 | 14 | 0 | 0 | ||
| 5 | cause: primary collision factor | 1 | Under influence of alcohol | 3594 | 2044 | 2478 | 648 | 146 | 
| 2 | Following too closely | 3990 | 1382 | 476 | 77 | 17 | ||
| 3 | Failure to yield | 3197 | 2162 | 695 | 139 | 27 | ||
| 4 | Improper turn | 9711 | 6417 | 4527 | 741 | 227 | ||
| 5 | Speeding | 41,831 | 23,529 | 4710 | 681 | 137 | ||
| 6 | Other violations | 18,151 | 6108 | 2314 | 428 | 106 | ||
| 6 | road: roadway class | 1 | Urban freeways | 58,265 | 28,482 | 9088 | 1273 | 255 | 
| 2 | Urban freeways < 4 lanes | 259 | 91 | 49 | 9 | 1 | ||
| 3 | Urban two-lane roads | 1398 | 969 | 289 | 73 | 17 | ||
| 4 | Urban multilane divided non-freeways | 4133 | 3376 | 810 | 121 | 21 | ||
| 5~11 | Others | 16,419 | 8724 | 4964 | 1238 | 366 | ||
| 7 | eject: ejected from | 0 | Not ejected | 79,883 | 40,309 | 13,382 | 1969 | 400 | 
| 1 | Fully ejected | 24 | 760 | 1500 | 658 | 223 | ||
| 2 | Partially ejected | 4 | 68 | 134 | 52 | 33 | ||
| 3 | Unknown | 563 | 505 | 184 | 35 | 4 | ||
| 8 | object: first object struck | 1~7 | Bridge structure | 281 | 229 | 181 | 44 | 19 | 
| 10~15 | Pole or sign post | 1573 | 1003 | 828 | 138 | 26 | ||
| 16~24, 27, 30 | Concrete barrier | 5646 | 4761 | 3840 | 611 | 176 | ||
| 25~26 | Water /drainage ditch | 122 | 116 | 116 | 28 | 9 | ||
| 28~29, 40 | Plants | 279 | 223 | 227 | 69 | 12 | ||
| 41~43 | Temporary barricades | 1416 | 296 | 286 | 53 | 18 | ||
| 44~46 | Overturned/crash- cushion | 993 | 1213 | 1864 | 377 | 62 | ||
| 51 | Call box | 53 | 23 | 13 | 6 | 2 | ||
| 98~99 | Unkown or no object involved | 486 | 191 | 225 | 42 | 18 | ||
| 100 | Vehicle | 69,495 | 33,449 | 7517 | 1339 | 317 | ||
| 9 | vehicles: number of vehicles | 1 | Total vehicle number = 1 | 10,087 | 7272 | 6971 | 1240 | 304 | 
| 2 | Total vehicle number = 2 | 56,147 | 24,906 | 6116 | 1159 | 268 | ||
| 3 | Total vehicle number > 2 | 14,240 | 9464 | 2113 | 315 | 88 | ||
| 10 | alcohol: alcohol involved | 1 | Yes | 6327 | 3183 | 1209 | 214 | 32 | 
| 2 | No | 74,147 | 38,459 | 13,991 | 2500 | 628 | ||
| 11 | motor: motorcycle involved | 1 | Yes | 78,400 | 40,480 | 14,780 | 2635 | 642 | 
| 2 | No | 1517 | 738 | 292 | 56 | 15 | ||
| 12 | drv_gender: driver’s gender | 1 | Female | 32,269 | 19,975 | 5481 | 792 | 185 | 
| 2 | Male | 48,205 | 21,667 | 9719 | 1922 | 475 | ||
| Total | 80,474 | 41,642 | 15,200 | 2714 | 660 | |||
References
- Farid, A.; Ksaibati, K. Modeling two-lane highway passing-related crashes using mixed ordinal probit regression. J. Transp. Eng. Part A. Syst. 2020, 146, 04020092. [Google Scholar] [CrossRef]
- Rezapour, M.; Wulff, S.S.; Molan, A.M.; Ksaibati, K. Application of Bayesian ordinal logistic model for identification of factors to traffic barrier crashes: Considering roadway classification. Transp. Lett. 2021, 13, 308–314. [Google Scholar] [CrossRef]
- Cerwick, D.M.; Gkritza, K.; Shaheed, M.S.; Hans, Z. A comparison of the mixed logit and latent class methods for crash severity analysis. Anal. Methods Accid. Res. 2014, 3–4, 11–27. [Google Scholar] [CrossRef]
- Haghighi, N.; Liu, X.C.; Zhang, G.; Porter, R.J. Impact of roadway geometric features on crash severity on rural two-lane highways. Accid. Anal. Prev. 2018, 111, 34–42. [Google Scholar] [CrossRef] [PubMed]
- Iranitalab, A.; Khattak, A. Comparison of four statistical and machine learning methods for crash severity prediction. Accid. Anal. Prev. 2017, 108, 27–36. [Google Scholar] [CrossRef] [PubMed]
- Chang, L.-Y.; Wang, H.-W. Analysis of traffic injury severity: An application of non-parametric classification tree techniques. Accid. Anal. Prev. 2006, 38, 1019–1027. [Google Scholar] [CrossRef]
- Abdel-Aty, M.A.; Abdelwahab, H.T. Predicting injury severity levels in traffic crashes: A modeling comparison. J. Transp. Eng. 2004, 130, 204–210. [Google Scholar] [CrossRef]
- Delen, D.; Sharda, R.; Bessonov, M. Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks. Accid. Anal. Prev. 2006, 38, 434–444. [Google Scholar] [CrossRef]
- Alkheder, S.; Taamneh, M.; Taamneh, S. Severity prediction of traffic accident using an artificial neural network. J. Forecast. 2017, 36, 100–108. [Google Scholar] [CrossRef]
- Savolainen, P.T.; Mannering, F.L.; Lord, D.; Quddus, M.A. The statistical analysis of highway crash-injury severities: A review and assessment of methodological alternatives. Accid. Anal. Prev. 2011, 43, 1666–1676. [Google Scholar]
- Yasmin, S.; Eluru, N.; Ukkusuri, S.V. Alternative ordered response frameworks for examining pedestrian injury severity in New York City. J. Transp. Saf. Secur. 2014, 6, 275–300. [Google Scholar] [CrossRef]
- Taylor, S.G.; Russo, B.J.; James, E. A comparative analysis of factors affecting the frequency and severity of freight-involved and non-freight crashes on a major freight corridor freeway. Transp. Res. Rec. 2018, 2672, 49–62. [Google Scholar] [CrossRef]
- Chang, L.-Y.; Chien, J.-T. Analysis of driver injury severity in truck-involved accidents using a non-parametric classification tree model. Saf. Sci. 2013, 51, 17–22. [Google Scholar] [CrossRef]
- Gutierrez, P.A.; Perez-Ortiz, M.; Sanchez-Monedero, J.; Fernandez-Navarro, F.; Hervas-Martinez, C. Ordinal regression methods: Survey and experimental study. IEEE Trans. Knowl. Data Eng. 2016, 28, 127–146. [Google Scholar] [CrossRef] [Green Version]
- Riccardi, A.; Fernández-Navarro, F.; Carloni, S. Cost-sensitive AdaBoost algorithm for ordinal regression based on extreme learning machine. IEEE Trans. Cybern. 2014, 44, 1898–1909. [Google Scholar] [CrossRef] [PubMed]
- Verwaeren, J.; Waegeman, W.; Baets, B.D. Learning partial ordinal class memberships with kernel-based proportional odds models. Comput. Stat. Data Anal. 2012, 56, 928–942. [Google Scholar] [CrossRef]
- Niu, Z.; Zhou, M.; Wang, L.; Gao, X.; Hua, G. Ordinal regression with multiple output CNN for age estimation. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 4920–4928. [Google Scholar]
- Jeong, H.; Jang, Y.; Bowman, P.J.; Masoud, N. Classification of motor vehicle crash injury severity: A hybrid approach for imbalanced data. Accid. Anal. Prev. 2018, 120, 250–261. [Google Scholar] [CrossRef]
- Basso, F.; Basso, L.J.; Bravo, F.; Pezoa, R. Real-time crash prediction in an urban expressway using disaggregated data. Transp. Res. Part C Emerging Technol. 2018, 86, 202–219. [Google Scholar] [CrossRef]
- Drosou, K.; Georgiou, S.; Koukouvinos, C.; Stylianou, S. Support vector machines classification on class imbalanced data: A case study with real medical data. J. Data Sci. 2014, 12, 727–754. [Google Scholar] [CrossRef]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Frank, E.; Hall, M. A simple approach to ordinal classification. In Proceedings of the 2001 European Conference on Machine Learning, Freiburg, Germany, 5–7 September 2001; pp. 145–156. [Google Scholar]
- Cheng, J.; Wang, Z.; Pollastri, G. A neural network approach to ordinal regression. In Proceedings of the 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, China, 1–8 June 2008; pp. 1279–1284. [Google Scholar]
- Beckham, C.; Pal, C. A simple squared-error reformulation for ordinal classification. arXiv 2020, arXiv:1612.00775. [Google Scholar]
- Cao, W.; Mirjalili, V.; Raschka, S. Rank consistent ordinal regression for neural networks with application to age estimation. Pattern Recognit. Lett. 2020, 140, 325–331. [Google Scholar] [CrossRef]
- Collell, G.; Prelec, D.; Patil, K.R. A simple plug-in bagging ensemble based on threshold-moving for classifying binary and multi-class imbalanced data. Neurocomputing 2018, 275, 330–340. [Google Scholar] [CrossRef]
- Platt, J.C. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Advances in Large Margin Classifiers; MIT Press: Cambridge, MA, USA, 1999; pp. 61–74. [Google Scholar]
- Zadrozny, B.; Elkan, C. Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers. In Proceedings of the Eighteenth International Conference on Machine Learning, San Francisco, CA, USA, 28 June–1 July 2001; pp. 609–616. [Google Scholar]
- Zadrozny, B.; Elkan, C. Transforming classifier scores into accurate multi-class probability estimates. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, AB, Canada, 23–26 July 2002; pp. 694–699. [Google Scholar]
- Sarkar, S.; Vinay, S.; Raj, R.; Maiti, J.; Mitra, P. Application of optimized machine learning techniques for prediction of occupational accidents. Comput. Oper. Res. 2019, 106, 210–224. [Google Scholar] [CrossRef]
- Rahim, M.A.; Hassan, H.M. A deep learning based traffic crash severity prediction framework. Accid. Anal. Prev. 2021, 154, 106090. [Google Scholar] [CrossRef] [PubMed]
- Yang, C.; Chen, M.; Yuan, Q. The application of XGBoost and SHAP to examining the factors in freight truck-related crashes: An exploratory analysis. Accid. Anal. Prev. 2021, 158, 106153. [Google Scholar] [CrossRef] [PubMed]
- Guo, M.; Yuan, Z.; Janson, B.; Peng, Y.; Yang, Y.; Wang, W. Older pedestrian traffic crashes severity analysis based on an emerging machine learning XGBoost. Sustainability 2021, 13, 926. [Google Scholar] [CrossRef]
- Parsa, A.B.; Movahedi, A.; Taghipour, H.; Derrible, S.; Mohammadian, A. Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. Accid. Anal. Prev. 2020, 136, 105405. [Google Scholar] [CrossRef] [PubMed]
- Chen, C.; Zhang, G.; Qian, Z.; Tarefder, R.A.; Tian, Z. Investigating driver injury severity patterns in rollover crashes using support vector machine models. Accid. Anal. Prev. 2016, 90, 128–139. [Google Scholar] [CrossRef] [PubMed]
- Li, X.; Lord, D.; Zhang, Y.; Xie, Y. Predicting motor vehicle crashes using support vector machine models. Accid. Anal. Prev. 2008, 40, 1611–1618. [Google Scholar] [CrossRef]
- Bergstra, J.; Yamins, D.; Cox, D.D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 16–21 June 2013; pp. 115–123. [Google Scholar]
- Dietterich, T.G. Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput. 1998, 10, 1895–1923. [Google Scholar] [CrossRef] [PubMed] [Green Version]








| Class Prediction Based on | Cumulative Binary Decomposition | One-vs-All Binary Decomposition | 
|---|---|---|
| Class Probability | Frank’s method [22] | Beckham1 and Beckham2 [24] | 
| Cumulative Probability | Cheng’s method [23] | This paper | 
| Classifier | Precision (%) | Macro-Average | ||||
|---|---|---|---|---|---|---|
| NIC | COP | OVI | SI | KSI | ||
| MLP | 99.1 | 56.4 | 36.5 | 0.4 | 1.9 | 38.9 | 
| XGBoost | 99.3 | 64.9 | 23.4 | 1.9 | 4.5 | 38.8 | 
| SVM | 100 | 64.7 | 12.7 | 0 | 0 | 35.5 | 
| Combination | Precision (%) | Macro-Average | ||
|---|---|---|---|---|
| Class 1 | Class 2 | Class 3 | ||
| 1 | 98.0 | 73.7 | 0.0 | 57.2 | 
| 2 | 98.4 | 73.1 | 4.8 | 58.8 | 
| 3 | 99.3 | 62.6 | 33.6 | 65.2 | 
| 4 | 97.7 | 34.6 | 0.0 | 44.1 | 
| 5 | 98.3 | 25.0 | 6.9 | 43.4 | 
| 6 | 99.9 | 1.1 | 6.1 | 35.7 | 
| Combination | Group | Class 1 | Class 2 | Class 3 | Macro-Average | 
|---|---|---|---|---|---|
| 1 | Precision (%) | 97.8 | 71.0 | 22.7 | 63.9 | 
| Recall (%) | 83.5 | 95.6 | 7.2 | 62.1 | |
| F1 (%) | 90.1 | 81.5 | 11.0 | 60.9 | |
| 2 | Precision (%) | 98.0 | 67.8 | 27.5 | 64.4 | 
| Recall (%) | 83.3 | 92.6 | 21.4 | 65.8 | |
| F1 (%) | 90.1 | 78.3 | 24.1 | 64.2 | |
| 3 | Precision (%) | 99.3 | 62.6 | 33.6 | 65.2 | 
| Recall (%) | 82.4 | 75.4 | 68.2 | 75.4 | |
| F1 (%) | 90.1 | 68.4 | 45.0 | 67.8 | |
| 4 | Precision (%) | 97.1 | 23.1 | 36.4 | 52.2 | 
| Recall (%) | 91.2 | 66.4 | 5.6 | 54.4 | |
| F1 (%) | 94.1 | 34.3 | 9.7 | 46.0 | |
| 5 | Precision (%) | 97.1 | 11.8 | 36.7 | 48.5 | 
| Recall (%) | 90.7 | 54.8 | 18.4 | 54.6 | |
| F1 (%) | 93.8 | 19.4 | 24.5 | 45.9 | |
| 6 | Precision (%) | 97.0 | 21.6 | 27.3 | 48.6 | 
| Recall (%) | 98.4 | 18.6 | 8.7 | 41.9 | |
| F1 (%) | 97.7 | 20.0 | 13.2 | 43.6 | 
| Variable | Definition | Combination | |||||
|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | ||
| occupant | Occupant type | 0.13 | 0.25 | 0.19 | 0.07 | 0.07 | 0.04 | 
| seat | Seating position | 0.06 | 0.06 | 0.07 | 0.05 | 0.06 | 0.06 | 
| collision | Collision type | 0.06 | 0.06 | 0.07 | 0.07 | 0.07 | 0.07 | 
| factor | First associated factor | 0.05 | 0.04 | 0.05 | 0.06 | 0.06 | 0.06 | 
| cause | Primary collision cause | 0.05 | 0.03 | 0.04 | 0.06 | 0.05 | 0.05 | 
| road | Roadway class | 0.05 | 0.03 | 0.04 | 0.06 | 0.05 | 0.07 | 
| eject | Ejected from | 0.10 | 0.09 | 0.11 | 0.02 | 0.11 | 0.09 | 
| object | First object struck | 0.06 | 0.06 | 0.05 | 0.05 | 0.07 | 0.07 | 
| vehicles | number of vehicles | 0.11 | 0.10 | 0.06 | 0.07 | 0.08 | 0.12 | 
| alcohol | alcohol involved | 0.01 | 0.01 | 0.01 | 0.05 | 0.01 | 0.02 | 
| gender | driver’s gender | 0.02 | 0.01 | 0.01 | 0.06 | 0.01 | 0.02 | 
| drv_safe | Driver safety equipment | 0.05 | 0.06 | 0.06 | 0.10 | 0.07 | 0.06 | 
| occ_age | Occupant’s age | 0.06 | 0.04 | 0.04 | 0.01 | 0.06 | 0.07 | 
| drv_age | Driver’s age | 0.04 | 0.03 | 0.03 | 0.01 | 0.04 | 0.04 | 
| motor | Motorcycle involved | 0.02 | 0.01 | 0.01 | 0.11 | 0.01 | 0.02 | 
| occ_safe | Occupant safety equipment | 0.06 | 0.06 | 0.06 | 0.06 | 0.07 | 0.06 | 
| veh_year | Vehicle model year | 0.08 | 0.07 | 0.11 | 0.11 | 0.11 | 0.08 | 
| Method | Evaluation Metric | Class | Macro-Average | ||
|---|---|---|---|---|---|
| 1 | 2 | 3 | |||
| Nominal classification | Precision (%) | 98.0 | 67.8 | 27.5 | 64.4 | 
| Recall (%) | 83.3 | 92.6 | 21.4 | 65.8 | |
| F1 (%) | 90.1 | 78.3 | 24.1 | 64.2 | |
| Beckham1 | Precision (%) | 92.9 | 73.6 | 22.7 | 63.0 | 
| Recall (%) | 85.6 | 83.9 | 21.8 | 63.8 | |
| F1 (%) | 89.1 | 78.4 | 22.3 | 63.2 | |
| Beckham2 | Precision (%) | 95.6 | 75.7 | 0.00 | 57.1 | 
| Recall (%) | 84.4 | 86.9 | 0.00 | 57.1 | |
| F1 (%) | 89.7 | 80.9 | 0.00 | 56.9 | |
| Frank | Precision (%) | 97.6 | 68.2 | 31.6 | 65.8 | 
| Recall (%) | 83.6 | 92.1 | 23.6 | 66.4 | |
| F1 (%) | 90.1 | 78.4 | 27.0 | 65.1 | |
| Cheng | Precision (%) | 96.0 | 70.3 | 29.2 | 65.2 | 
| Recall (%) | 84.4 | 88.9 | 24.1 | 65.8 | |
| F1 (%) | 89.8 | 78.5 | 26.4 | 64.9 | |
| This paper | Precision (%) | 94.0 | 68.9 | 41.2 | 68.0 | 
| Recall (%) | 85.2 | 86.4 | 21.3 | 64.3 | |
| F1 (%) | 89.4 | 76.7 | 28.1 | 64.7 | |
| Method A | Method B | t | p-Value | 
|---|---|---|---|
| This paper | Nominal classification | 24.99 | 0.000 | 
| Beckham1 | 15.82 | 0.000 | |
| Beckham2 | 12.68 | 0.000 | |
| Frank | 4.89 | 0.005 | |
| Cheng | 16.58 | 0.000 | 
| Method | Evaluation Metric | Class | Macro-Average | ||
|---|---|---|---|---|---|
| 1 | 2 | 3 | |||
| T1 = 0.5 T2 = 0.5 | Precision (%) | 86.4 | 81.5 | 15.8 | 61.2 | 
| Recall (%) | 88.2 | 77.4 | 27.3 | 64.3 | |
| F1 (%) | 87.3 | 79.4 | 20.0 | 62.2 | |
| T1 = 0.43 T2 = 0.33 | Precision (%) | 94.0 | 68.9 | 41.2 | 68.0 | 
| Recall (%) | 85.2 | 86.4 | 21.3 | 64.3 | |
| F1 (%) | 89.4 | 76.7 | 28.1 | 64.7 | |
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. | 
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhu, S.; Wang, K.; Li, C. Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach. Int. J. Environ. Res. Public Health 2021, 18, 11564. https://doi.org/10.3390/ijerph182111564
Zhu S, Wang K, Li C. Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach. International Journal of Environmental Research and Public Health. 2021; 18(21):11564. https://doi.org/10.3390/ijerph182111564
Chicago/Turabian StyleZhu, Shengxue, Ke Wang, and Chongyi Li. 2021. "Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach" International Journal of Environmental Research and Public Health 18, no. 21: 11564. https://doi.org/10.3390/ijerph182111564
APA StyleZhu, S., Wang, K., & Li, C. (2021). Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach. International Journal of Environmental Research and Public Health, 18(21), 11564. https://doi.org/10.3390/ijerph182111564
 
         
                                                

 
       