Next Article in Journal
Alterations in Serotonin Neurotransmission in Hyperdopaminergic Rats Lacking the Dopamine Transporter
Previous Article in Journal
HIF-2α Controls Expression and Intracellular Trafficking of the α2-Subunit of Na,K-ATPase in Hypoxic H9c2 Cardiomyocytes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predictive Modeling and Integrated Risk Assessment of Postoperative Mortality and Pneumonia in Traumatic Brain Injury Patients through Clustering and Machine Learning: Retrospective Study

1
Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon 24253, Republic of Korea
2
Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Republic of Korea
3
Department of Neurosurgery, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon 24253, Republic of Korea
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work as the first authors.
These authors contributed equally to this work.
Biomedicines 2023, 11(11), 2880; https://doi.org/10.3390/biomedicines11112880
Submission received: 20 September 2023 / Revised: 15 October 2023 / Accepted: 19 October 2023 / Published: 24 October 2023
(This article belongs to the Section Biomedical Engineering and Materials)

Abstract

:
This study harnessed machine learning to forecast postoperative mortality (POM) and postoperative pneumonia (PPN) among surgical traumatic brain injury (TBI) patients. Our analysis centered on the following key variables: Glasgow Coma Scale (GCS), midline brain shift (MSB), and time from injury to emergency room arrival (TIE). Additionally, we introduced innovative clustered variables to enhance predictive accuracy and risk assessment. Exploring data from 617 patients spanning 2012 to 2022, we observed that 22.9% encountered postoperative mortality, while 30.0% faced postoperative pneumonia (PPN). Sensitivity for POM and PPN prediction, before incorporating clustering, was in the ranges of 0.43–0.82 (POM) and 0.54–0.76 (PPN). Following clustering, sensitivity values were 0.47–0.76 (POM) and 0.61–0.77 (PPN). Accuracy was in the ranges of 0.67–0.76 (POM) and 0.70–0.81 (PPN) prior to clustering and 0.42–0.73 (POM) and 0.55–0.73 (PPN) after clustering. Clusters characterized by low GCS, small MSB, and short TIE exhibited a 3.2-fold higher POM risk compared to clusters with high GCS, small MSB, and short TIE. In summary, leveraging clustered variables offers a novel avenue for predicting POM and PPN in TBI patients. Assessing the amalgamated impact of GCS, MSB, and TIE characteristics provides valuable insights for clinical decision making.

1. Introduction

Traumatic brain injury (TBI) is a global health concern that exacts a heavy toll on lives and well-being across all age groups. The grim statistics reveal its pervasive impact: approximately 4.48 million people have succumbed to trauma-related incidents, constituting 8% of all global fatalities, with TBI accounting for around 2 million of these tragic deaths [1,2,3]. Of particular concern, about 5% of TBI cases necessitate surgical intervention, often marked by elevated postoperative mortality rates. Furthermore, the duration of ventilator support following surgery has been closely associated with patient outcome [1].
The intricate web of connections between the brain and various bodily organs, including the central nervous system and the gastrointestinal tract, is well documented and commonly referred to as the “brain–gut axis” [4,5]. Recent research has unveiled a new axis—the “lung–brain axis” [6,7]. Within the lungs, the exchange of O2 and CO2 between atmospheric air and the bloodstream is a vital process. Remarkably, the brain, which is the body’s oxygen powerhouse, orchestrates this exchange, with blood CO2 levels playing a pivotal role. However, lung function is intricately linked to cerebrovascular health [8] and can be compromised by conditions such as pneumonia, neurogenic pulmonary edema, and acute dyspnea syndrome. Importantly, these respiratory complications are frequently encountered in patients recovering from brain injuries, often stemming from inhalation or ventilator support, and are among the most prevalent respiratory challenges observed alongside acute respiratory failure [9].
Numerous factors exert a profound influence on the outcomes of TBI cases. Swift transfers to healthcare facilities equipped with neurosurgical expertise have been associated with more favorable prognoses [10]. The severity of the injury itself is a critical determinant of mortality rates [11], with prolonged dependence on mechanical ventilation further complicating matters. Notably, ventilator-associated pneumonia emerges as a significant contributor to mortality among patients with cerebral hemorrhage [12], accentuating the peril of extended ventilator reliance in this population. Additionally, individuals grappling with severe brain injuries face an elevated risk of pulmonary aspiration [13]. Postoperative pneumonia (PPN) ranks as the third most prevalent complication in the realm of surgical procedures [14]. PPN not only prolongs hospital stays by an average of 7 to 9 days but also incurs escalated medical expenditures [15,16,17]. These multifaceted factors collectively contribute to the occurrence and severity of PPN.
Machine learning (ML), a subfield of artificial intelligence (AI), possesses the prowess to analyze copious amounts of patient data, discern intricate patterns, and provide accurate predictions and risk assessments—a capability with far-reaching implications in the medical landscape, including predictive analytics [18,19,20,21,22]. Clustering, an indispensable data mining technique, enables the classification of data groups based on shared characteristics, thereby identifying representative central points within these groups [23,24]. In this study, we harness the potential of machine learning to predict postoperative mortality (POM) and PPN in TBI patients. Key determinants, including the severity of TBI denoted by Glasgow Coma Scale (GCS) scores, midline brain shift, and temporal features such as the time elapsed between injury and emergency room admission, serve as foundational variables. In an innovative twist, these three variables are subjected to clustering, amalgamating them into a single novel variable. This novel variable is then examined to explore its impact on predicting the occurrence of POM and PPN.

2. Materials and Methods

2.1. Study Design

This retrospective study aimed to predict and analyze the impact of three important factors on postoperative mortality and postoperative pneumonia (PPN) in patients undergoing surgical intervention for TBI: the time from injury to hospital admission, the severity of TBI as measured by GCS score, and midline shift of the brain. Additionally, this study investigated the impact of clustering variables derived from these factors on prediction and outcome. The analysis included patient cohort data from January 2012 to July 2022.

Ethical Approval/Informed Consent

Prior to commencement, this study received approval from the Clinical Research Ethics Committee of Chuncheon Sacred Hospital (IRB No. 2022-09-007). All research activities adhered strictly to the ethical principles outlined in the Helsinki Declaration. Given that the clinical data utilized were obtained after the conclusion of patient treatment, informed consent was waived for this study, which involved vulnerable participants, specifically individuals with traumatic brain injuries.

2.2. Primary Outcomes

The primary outcomes for this study included postoperative mortality (POM) and postoperative pneumonia (PPN). POM was defined as death occurring prior to discharge following surgery, while PPN was diagnosed based on the presence of pneumonia or pulmonary infiltrations evident in chest X-rays, coupled with fever (>38 °C), abnormal white blood cell counts (<4000/mL or >12,000/mL), or positive blood culture findings.

2.3. Data Sources and Extraction

Data were sourced from the clinical data repository of Hallym University Medical Center, encompassing data from five hospitals collected over a decade. This comprehensive database houses an array of medical records, prescription information, laboratory test results, and radiological findings. Its extensive data repository allows for the retrieval and analysis of patient data, considering demographic attributes, prescriptions, diagnostic examinations, and treatment details. In addition to structured data, the clinical data repository also contains unstructured textual information, including patient notes, test results, and medication administration records. Several previous studies have successfully utilized data from this repository [19,25,26,27,28,29,30]. Patient selection for analysis was based on a history of TBI and subsequent surgical treatment, as identified through emergency room (ER) records. A list of keywords associated with trauma is provided in Appendix A.

2.4. Participants

This study included patients who had undergone surgical intervention for TBI and met the criteria for prediction and analysis as described above. Patients with preoperative pneumonia or incomplete data were excluded from the analysis.

2.5. Primary Predictors and Other Features

The main predictors in this study included three essential variables: the time between injury and emergency room (TIE) admission (days), the Glasgow Coma Scale (GCS) score, and the size of midline brain shift (mm), all derived from evaluations performed in the ER. In addition to the above three, we considered a comprehensive set of 33 features, including patient attributes and pre- and postoperative data, to enable early prediction. These included age; sex; surgery duration; obesity (defined as body mass index ≥ 30 kg/m2); alcohol and tobacco use; imaging findings; presence of skull fracture; various types of hemorrhage (intracerebral, subdural, epidural, intraventricular, and subarachnoid); American Society of Anesthesiologists physical status; blood and fluid volumes administered during surgery; urine output during surgery; estimated blood loss during surgery; packed red blood cells transfused during surgery; fresh frozen plasma transfused and platelet count; preoperative blood urea nitrogen, creatinine, albumin, sodium, and potassium levels; total duration of ventilator support; ratio of arterial oxygen partial pressure to inspired oxygen fraction (PO2/FiO2 < 300); and the nature of the initial surgical procedure (burr hole, craniectomy, craniotomy, cranioplasty or endovascular coiling, cooperative surgery).

2.6. Data Preprocessing and Machine Learning

2.6.1. Data Preprocessing

The data were categorized into continuous and categorical variables. Continuous data were standardized by centering on the mean and scaling to unit variance [31]. Notably, this study encountered an imbalanced target variable distribution for both POM and PPN. Cases without POM or PPN were more prevalent than those with such outcomes, a disparity known to impact classification accuracy [32]. To mitigate this imbalance, we applied the synthetic minority oversampling technique (SMOTE) [33], which generates new data for the minority class using the k-nearest neighbors algorithm. Datasets were prepared separately for POM and PPN and subsequently divided into training and test sets in an 8:2 ratio. The distribution of POM and PPN instances was evenly randomized between training and test sets.

2.6.2. Machine Learning

Five distinct machine learning algorithms were employed to predict both POM and PPN. These algorithms encompassed logistic regression, random forest, light gradient boosting machine, multilayer perceptron, and support vector machine [34,35,36,37,38]. Additionally, predictions for POM and PPN were made using the balanced random forest algorithm without SMOTE. A balanced random forest randomly undersamples each bootstrap sample to balance it [39]. The construction of prediction models involved training each algorithm with a designated training dataset. Model performance was evaluated using five metrics: area under the receiver operating characteristic curve (AUROC), recall, precision, F1 score, and accuracy. Bootstrapping (n = 1000) was applied to calculate 95% confidence intervals (CIs). We originally built and evaluated a model that included three main predictors and all other variables. After clustering, we built and evaluated a model that included the clustering variable and all other variables instead of the three main variables.

2.6.3. Feature Importance

The importance of features in predicting both POM and PPN was assessed using mutual information (MI), a widely recognized metric in machine learning and feature selection tasks [40,41,42]. It quantifies the dependency or information shared between two random variables, making it a useful measure for assessing the relevance of a feature with respect to the target variable. In the context of feature importance, mutual information can be used to rank or select features based on their relevance to the target variable. Higher mutual information values indicate stronger relationships between a feature and the target [43].

2.7. Clustering

To generate a novel predictor, we adopted the agglomerative (bottom-up) hierarchical clustering technique, utilizing the primary predictors as the foundation for the clustering process. Agglomerative hierarchical clustering employs a bottom-up methodology, whereby clusters encompass sub-clusters that progressively branch into additional sub-clusters, thereby constructing a hierarchical structure. The process commences by assigning individual objects to distinct clusters. Subsequent iterations involve the amalgamation (merging) of the closest pair of clusters based on specific similarity criteria. This iterative procedure persists until all data points are encompassed within a single cluster [44]. We analyzed the distance between clusters using a dendrogram. Machine learning and clustering were performed using Anaconda (Python version 3.7; Anaconda, Austin, TX, USA). Each cluster was assigned a unique identifier (cluster number) for identification purposes. To ensure transparency and interpretability, the cluster number was added as an additional categorical feature for each patient in the dataset.

2.8. Statistics

In the realm of descriptive statistics, continuous features were represented by medians alongside interquartile ranges, while categorical features were denoted by numbers accompanied by corresponding percentages. Disparities between groups were expressed as absolute standardized differences. To assess the relationship between clustering and POM as well as PPN, odds ratios and their corresponding 95% confidence intervals were computed through logistic regression. Adjusted odds ratios were obtained using backward elimination. Statistical analyses were conducted using SPSS statistical software (version 26.0; IBM Corp., Armonk, NY, USA). Hypothesis testing was conducted in a two-sided manner, with a Bonferroni correction applied (alpha = 0.05/2 = 0.025).

3. Results

In the current study, a total of 720 patients were initially considered for inclusion, and 103 patients were subsequently excluded as outlined in Figure 1. Among the remaining 617 patients, 141 (22.9%) unfortunately experienced postoperative mortality, and 185 (30.0%) experienced PPN. Detailed characteristics and perioperative data are comprehensively presented in Table 1. Among the 36 features examined, the ASD for each feature concerning postoperative mortality ranged from 0.0 to 0.8.

3.1. Prediction of POM and PPN

The dataset for training comprised 432 patients, while the test dataset included 185 patients. The performance metrics for predicting POM and PPN are summarized in Table 2 and Table 3, respectively.

3.1.1. Performance of POM and PPN Predictions

The first evaluation included a total of 36 features listed in Table 1. For POM prediction, the models achieved AUROC values ranging from 0.63 to 0.72, precision values spanning from 0.38 to 0.47, recall values varying from 0.43 to 0.82, accuracy scores ranging from 0.67 to 0.76, and F1 scores between 0.42 and 0.54. In the case of PPN prediction, the models attained AUROC values within the range of 0.68 to 0.78, precision values from 0.5 to 0.68, recall values spanning 0.54 to 0.76, accuracy scores ranging from 0.7 to 0.81, and F1 scores between 0.55 and 0.69.

3.1.2. Feature Importance

In the context of predicting both POM and PPN, the Glasgow Coma Scale (GCS) exhibited rankings ranging from 7 to 11 for POM and 2 to 25 for PPN. Similarly, the TIE displayed rankings between 10 and 16 for POM and 10 and 18 for PPN. Furthermore, the midline shift of the brain (MSB) revealed rankings spanning from 20 to 33 for POM and 18 to 30 for PPN. For a comprehensive overview of the importance of GCS, TIE, and MSB with respect to the prediction algorithms for POM and PPN, please refer to Table 3. Additionally, detailed information about the importance of other features is provided in Appendix B.

3.2. Clustering

The dendrogram, akin to Figure 2, served as a vital tool in determining the optimal number of clusters. Through careful consideration of the number of features employed in clustering, the distances observed between clusters on the dendrogram, and the overall clinical distribution, we determined that the most suitable number of clusters was five. The distribution of patients is depicted graphically in Figure 3.
The number of patients in each of clusters 0 to 4 was 148, 11, 206, 126, and 126, respectively. The number of deaths in each of clusters 0 to 4 was 19 (12.8%), 0 (0%), 20 (9.7%), 49 (38.9%), and 53 (42.1%), respectively. The number of patients experiencing PPN in each of clusters 0 to 4 was 47 (31.8%), 0 (0%), 34 (16.5%), 47 (37.3%), and 57 (45.2%), respectively.
The following is an analysis of the characteristics observed within each cluster:
  • Cluster 0 predominantly comprises individuals with high GCS scores ranging from 10 to 15, accompanied by low MSB values spanning from 0 to 7.1 and minimal TIE ranging from 0 to 5.
  • Cluster 1 is characterized by elevated GCS scores, ranging from 8 to 15, yet exhibits variability in both MSB (ranging from 0 to 18.1) and TIE (ranging from 27 to 64).
  • Cluster 2 generally consists of individuals with high GCS scores, ranging from 8 to 15, paired with low TIE values ranging from 0 to 21. However, the MSB values in this cluster vary between 2.0 and 23.3.
  • Cluster 3 is distinguished by low GCS scores spanning from 3 to 10, coupled with low MSB values ranging from 0 to 9.7 and minimal TIE ranging from 0 to 5.
  • Cluster 4 features individuals with low GCS scores (ranging from 3 to 10), minimal TIE values (ranging from 0 to 1), and notably high MSB values, which fall within the range of 6.5 to 31.9.

3.3. Prediction of POM and PPN with Clustering Feature Added

3.3.1. Performance of POM and PPN Predictions

The model including clustering features was evaluated with 34 features, excluding GCS, TIE, and MSB. The prediction results of POM and PPN with clustering results added as features in addition to the basic features are summarized in Table 4. The models of POM prediction achieved AUROC values ranging from 0.46 to 0.75, precision values ranging from 0.42 to 0.73, recall values ranging from 0.47 to 0.76, accuracy values ranging from 0.42 to 0.73, and F1 scores ranging from 0.33 to 0.67. The models of PPN prediction achieved AUROC values ranging from 0.65 to 0.8, precision values ranging from 0.68 to 0.81, recall values ranging from 0.61 to 0.77, accuracy values ranging from 0.55 to 0.73, and F1 scores ranging from 0.53 to 0.72.

3.3.2. Feature Importance

Within the context of predicting both POM and PPN, the cluster feature exhibited rankings within the range of 5 to 24 for POM and 14 to 37 for PPN. Further details on feature importance, categorized by prediction algorithm for both POM and PPN, can be found in Appendix C.

3.4. Odds Ratio of Cluster Variable for POM and PPN

In our analysis, the cluster variable was employed as a categorical factor in logistic regression. Within this framework, we designated cluster 0 as the reference group, distinguished by generally high GCS scores, minimal time elapsed between ER admission and trauma, and limited midline brain shifts. Cluster 3 had a 3.2-fold higher risk of POM than cluster 0. The findings, comprising both unadjusted and adjusted odds ratios for the cluster variable with respect to occurrences of POM and PPN, are succinctly presented in Table 5.

4. Discussion

In this study, we used machine learning to predict postoperative mortality and pneumonia in patients undergoing surgery for TBI. We also investigated the impact of GCS score, MSB, and TIE on prediction. Additionally, we investigated the impact of new variables that cluster these variables on the prediction and risk of POM and PPN. Our analysis included data from a large patient cohort of 617 patients from January 2012 to July 2022. We observed that 22.9% of patients experienced postoperative mortality, while 30.0% experienced postoperative pneumonia. According to the algorithms, the AUCROC for POM and PPN prediction before including the cluster variables was 0.63–0.72 and 0.68–0.78, respectively, and after including the cluster variables, it was 0.46–0.75 and 0.65–0.80, respectively. When calculating the odds ratio according to the new cluster variables, cluster 3 (low GCS score, small midline shift, short time from injury to ER) had a 3.2-fold higher risk of postoperative mortality than cluster 0 (high GCS score, small midline shift, short time from injury to ER).
Abujaber et al., reported the prediction of in-hospital mortality in patients with TBI [45]. They achieved an optimal performance with an accuracy of 95.6% and AUROC of 96%. Midline shift was one of the important features. Hsu et al., predicted the in-hospital mortality rate of TBI patients using GCS score, injury severity scale, and blood pressure, emphasizing the importance of GCS [46]. Lee et al., also predicted the early death of TBI patients using a machine learning model. It is also well known that general anesthesia, surgery, and TBI all affect pneumonia. There have been several studies that have predicted pneumonia in patients who have undergone surgery and TBI. Some studies have used machine learning to predict pneumonia after surgery [47,48,49]. However, there have been no studies on postoperative pneumonia in patients who have undergone surgery due to TBI to our knowledge. There is one reported study on the prediction of pneumonia after TBI. However, previous studies on mortality or pneumonia prediction for TBI included non-surgical patients, so their study population is different from ours.
Our findings follow previous studies highlighting the importance of rapid transfer to a hospital with neurosurgical facilities, high GCS score, and small midline shift of the brain in achieving better outcomes for TBI patients. [50,51,52]. These features have been reported as important factors in the mortality and pneumonia incidence of TBI patients in previous studies using machine learning. GCS score and midline shift were included as important factors [53]. GCS score and MSB may be important factors in hospital-acquired death or early death in TBI patients, and TIE may be an important factor in hospital-acquired pneumonia in TBI patients.
Although we can know the importance of variables in the prediction of machine learning, we cannot know how the variables work in some studies [45,53]. Even if we know how one variable works, we cannot know the effect of integrated information [46]. Clustering variables can have a positive impact on prediction by capturing complex relationships and patterns within data. Depending on the clustering technique used, the resulting clusters themselves can serve as interpretable features [54]. This can facilitate a deeper understanding of how patient subgroups relate to POM and PPN risk. Clustering allowed us to create composite features based on the original predictors. These composite features could represent interactions and combinations of variables that are relevant for predicting postoperative outcomes but may not be explicit in the individual predictors. Clustering can be seen as a form of dimensionality reduction. By grouping similar patients together, we reduced the complexity of the data. This simplification might help the models focus on the most informative aspects of the data, potentially reducing noise. Clustering enabled the stratification of patients into distinct groups based on similarities in their clinical profiles. This grouping might uncover specific subpopulations within the dataset, each with its own risk profile for postoperative outcomes [55]. The inclusion of cluster assignments as categorical features provided a way to interpret model results in the context of these patient subgroups. This can be valuable for clinical decision making. The clustering method can guide doctors in risk assessment, patient stratification, and the development of tailored treatment and monitoring strategies for TBI patients undergoing surgery. We verified the effect of clusters using logistic regression analysis and showed that there were differences in postoperative mortality risk between clusters. This will allow clinicians to assess patients more easily. In a previous study, the time from injury to the emergency room was the most important factor [56]. In our study, we also found that this factor was important in several models, but it was not the most important factor. In our study, we included surgical patients, so anesthesia and surgical factors may have played a role.
This study has the following strengths: First, it provides a foundation for the outcomes by analyzing a significant amount of patient data from 617 patients over a period of 10 years through comprehensive data analysis. Second, it is partially consistent with previous studies that emphasize the importance of factors such as rapid transportation to the emergency room, GCS score, and midline shift of the brain, which has clinical implications for the management of TBI patients. Third, it shows the potential of AI and ML in the medical field by using machine learning algorithms to predict POM and PPN. Fourth, it is innovative to use clustering to generate new variables based on the time from injury to the emergency room and the severity of TBI. This approach allows for a more nuanced understanding of patient subgroups and related risks. However, this study has several limitations. First, this study is retrospective, meaning it relies on past data. This design can introduce bias and limitations related to data quality and the uncontrollability of variables. Second, the study results are based on data from a specific patient population within a specific healthcare system. As healthcare practices and patient demographics can vary, it may be difficult to generalize the results to a wider population. Third, this study uses multiple machine learning algorithms, but the algorithm selection can affect the results. The performance of an ML model can vary depending on the specific algorithm selected. Fourth, external validation of the prediction model on a separate dataset is essential to assess the generalizability and robustness of the results. This study does not mention external validation. Fifth, we did not explicitly determine cutoff values for feature importance, and we used all available features for model development. We are aware of the limitations of the cutoff value determination process. It is important to note that it can be difficult to determine optimal cutoff values for individual algorithms, and there may not be a single solution that applies to all cases. However, feature optimization is needed for performance improvement in future studies.

5. Conclusions

This study harnessed the power of machine learning to predict POM and PPN in patients with TBI. Utilizing the GCS score, MSB, and TIE as foundational predictor variables, we embarked on an innovative predictive journey by incorporating clustered variables. These newly derived variables, crafted through advanced clustering techniques, encapsulate critical information from the original predictors, enabling a comprehensive assessment of the amalgamated risk posed by multiple contributing factors within each patient. For future investigations, it is imperative to broaden the scope by encompassing data from diverse healthcare institutions, thus enhancing the generalizability of our clustering-derived variables through rigorous external validation processes. This collaborative effort will advance our understanding of postoperative outcomes in TBI patients and contribute to more effective clinical decision making.

Author Contributions

Conceptualization, Y.-S.K. and H.-J.C.; methodology, J.-J.L., J.-H.K. and H.-J.C.; software, K.-M.C.; validation, Y.-S.K.; formal analysis, Y.-S.K.; investigation, J.-H.K.; resources, J.-J.L. and H.-J.C.; data curation, J.-H.K. and K.-M.C.; writing—original draft preparation, Y.-S.K. and J.-H.K.; writing—editing, all authors; visualization, J.-H.K.; supervision, H.-J.C.; project administration, K.-M.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved by the Clinical Research Ethics Committee of Chuncheon Sacred Hospital (No. 2022-05-012).

Informed Consent Statement

Not applicable.

Data Availability Statement

Restrictions apply to the availability of these data. Data were obtained from Hallym Medical Center and are available from the clinical data warehouse of Hallym Medical Center with the permission of Hallym Medical Center.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A

Table A1. Trauma types and the words describing them.
Table A1. Trauma types and the words describing them.
Word
Slip DownSlip down
Fall DownFall down, Falling, Fall off
Pedestrian Traffic AccidentsPedestrian, out car
Motorcycle AccidentsMotorcycle
BumpCollision, Crash, Bump
Bicycle AccidentsBicycle, Cycle
Car AccidentsIn car, Car, Bus, Truck, Driving, Vehicle

Appendix B

Table A2. Feature importance scores according to the algorithm for predicting postoperative mortality and pneumonia.
Table A2. Feature importance scores according to the algorithm for predicting postoperative mortality and pneumonia.
Postoperative MortalityPostoperative Pneumonia
LRRFLGBMMLPSVCBRFLRRFLGBMMLPSVCBRF
Age00.0530.08600.11300.1490.0650.100.0330.059
Male00.090.480.84900.3760.2580.1750.0120.390.1380.139
Duration of surgery0.0040.090.09700.2160.1420.0860.33900.1120.0470.26
TIE0.0680.1820.1260.1430.1620.2720.0490.1650.0860.1030.0770.19
Cooperative surgery0.7721.0060.7040.8540.1070.4920.1390.3780.3830.9570.2440.134
Obesity0000.0140.13100.0360.0630000.039
Alcohol0.0140.0840.190.08600.00500.190.04600.0080
Smoking00.056000.1240.0150.0920.0200.01700.072
Midline shift0.13100.0090.03800.100.0260.06900.0020
Skull fracture0.1890.08600.1470.1550.11800.033000.0450.086
Intracerebral hemorrhage0.0610.08600.050.0070.12200.07000.0010.092
Subdural hemorrhage0.6130.8540.4950.7090.0710.3030.1970.420.2860.430.1010.431
Epidural hemorrhage0.2010.2090.0720.0340.1630.20300000.0880.068
Intraventricular hemorrhage00.12200.0250.0100.0560.0240.082000.018
Subarachnoid hemorrhage00.07200.0140.0180.00500.1320.0740.0730.0830
ASA PS0.3080.4180.3750.3420.0080.2070.1090.2010.2840.3130.1280.004
Administered blood0.1370.2410.0670.110.5160.4550.1080.2750.1720.0330.1780.432
Administered fluid0.0640.2010.03100.1280.3990.150.2170.1640.0560.1880.275
Urine output00.0770.0920.030.2220.2270.0940.1830.0040.0350.0930.191
Estimated blood loss0.290.2860.0490.1090.3290.3780.2450.2220.1130.0760.1340.305
Intraoperative PRCs0.1310.2960.10.2160.3040.5660.1090.3560.120.0230.1330.462
Intraoperative FFP0.350.4060.2070.160.1160.2920.1110.2610.110.1160.2990.395
Intraoperative PC0.7760.6180.6150.9660.1740.3460.2890.2790.5180.95400.312
Blood urea nitrogen00.04200000.0160.03000.0780.129
Creatinine00.0560.01700.03400000.010.0430
Albumin00000000.00700.07100.141
Sodium0.0130.190000.1390.13400.072000
Potassium0.05300.0420.0350.0580.09200.030.015000.119
GCS score0.2010.2290.1230.2040.210.2890.1910.2750.0120.0310.070.504
Total duration of ventilator care0.2270.4520.2970.1780.2970.4860.5930.6450.550.3230.4211.021
O2/FiO2 <30000.0190.070.0520.1080.1250.0610.1070.0540.0480.1210
Burr hole0.1250.080.0670.0190.2560.1820.1020.2150.0310.0210.0380.087
Craniectomy0.5770.7130.2350.1440.3250.4640.1210.4070.0760.2770.040.215
Craniotomy0.0480.2280.0730.0630.2090.17700.0970.0010.1020.0160
Cranioplasty0000.0200.0050.02600.0830.00500
Coiling0.080.0270.0870.1380.0910000.0620.0340.0120.068
ASA PS, American Society of Anesthesiologists physical status; BRF, balanced random forest; FFP, fresh frozen plasma; GCS, Glasgow Coma Scale; LGBM, light gradient boosting machine; LR, logistic regression; MLP, multilayer perceptron; PC, platelet concentration; RF, random forest; PRCs, packed red blood cells; SVC, support vector machine; TIE, time between traumatic injury and emergency room.

Appendix C

Table A3. Feature importance scores according to the algorithm for predicting postoperative mortality and pneumonia when a cluster feature is added.
Table A3. Feature importance scores according to the algorithm for predicting postoperative mortality and pneumonia when a cluster feature is added.
Postoperative MortalityPostoperative Pneumonia
LRRFLGBMMLPSVCBRFLRRFLGBMMLPSVCBRF
Age000.0100.05500.0710.003000.0140
Male0.230.2980.5080.5730.2590.210.12200.20.4430.0320.031
Duration of surgery0.0140.1160.02600.2950.2260.0760.1650.0710.1010.0690.4
TIE0.1160.1470.0580.0870.0540.2290.0850.2620.1290.1090.0580.191
Cooperative surgery0.8450.2390.5640.660.3280.4210.4520.041.0330.8380.4230.146
Obesity0000.00300.0040.012000.0750.0140
Alcohol0.1550.0130.05600.1630.1330.140.010.1790.2700.002
Smoking0.0360.0040.0250000.0630.0220.004000.159
Midline shift0.1510.11300.0150.0590.0240.03700.164000.062
Skull fracture0.30.190.03300.2130.2260.1760.051000.0860.011
Intracerebral hemorrhage0.0490.0300.070.0830.0470.0240000.0640
Subdural hemorrhage1.0040.4870.5540.4540.4120.4050.2720.1480.9010.5620.4550.191
Epidural hemorrhage0.0340.160.23100.0690.1790.090.03500.0050.0270.013
Intraventricular hemorrhage00.0090.0310.04800.0980.0050000.0540
Subarachnoid hemorrhage00.0940000.01400.0850.040.1270.0220.078
ASA PS0.2030.1640.3140.2750.280.220.2970.0880.3460.2480.3230.064
Administered blood0.130.3370.140.090.3440.4080.2640.2830.1850.0170.1030.295
Administered fluid0.0940.1370.070.1190.1440.320.2190.1710.080.0820.2090.276
Urine output0.0640.056000.160.10.0740.1610.05300.0640.115
Estimated blood loss0.1270.2210.0820.1350.2790.4670.1470.1550.13800.2580.234
Intraoperative PRCs0.1330.1760.1230.1560.1520.4220.2180.2780.1620.0230.1120.307
Intraoperative FFP0.190.1930.2470.1290.2440.5430.1270.1650.19100.3550.087
Intraoperative PC0.6450.2540.3380.6530.430.4990.71400.9250.860.3360.203
Blood urea nitrogen000000.070.1360.0210.0180.0120.0760
Creatinine00.02100.0250000000.0570
Albumin0000.02400.07700.146000.0190.168
Sodium0.020.0440.1390.0790.0140.0040.1290.00600.01300.067
Potassium0.0950.01500.0180.0430.03900.0450000
GCS score0.190.2610.1090.1730.2730.3150.1920.2060.070.0130.180.172
Total duration of ventilator care0.2260.390.2550.1620.3520.5820.630.7370.4420.2890.460.733
O2/FiO2 <3000.160.052000.0360.02700.050.3970.1210.0650.114
Burr hole0.0980.1480.0130.0550.230.2270.1290.1620.03500.0670.161
Craniectomy0.6460.7980.2350.0850.2910.3030.380.1380.2020.4140.2590.134
Craniotomy0.0340.2060.2590.0530.1720.2660.090.025000.0830.017
Cranioplasty0.0630.0070.091000.0240.06400.048000.072
Coiling0.0280000.0170.0430.0040.0300.0010.0290
0.2160.1560.0130.2760.3360.1700.10.0040.0170.0320.149
ASA PS, American Society of Anesthesiologists physical status; BRF, balanced random forest; FFP, fresh frozen plasma; GCS, Glasgow Coma Scale; LGBM, light gradient boosting machine; LR, logistic regression; MLP, multilayer perceptron; PC, platelet concentration; RF, random forest; PRCs, packed red blood cells; SVC, support vector machine; TIE, time between traumatic injury and emergency room.

References

  1. Demlie, T.A.; Alemu, M.T.; Messelu, M.A.; Wagnew, F.; Mekonen, E.G. Incidence and predictors of mortality among traumatic brain injury patients admitted to Amhara region Comprehensive Specialized Hospitals, northwest Ethiopia, 2022. BMC Emerg. Med. 2023, 23, 55. [Google Scholar] [CrossRef] [PubMed]
  2. James, S.L.; Theadom, A.; Ellenbogen, R.G.; Bannick, M.S.; Montjoy-Venning, W.; Lucchesi, L.R.; Abbasi, N.; Abdulkader, R.; Abraha, H.N.; Adsuar, J.C. Global, regional, and national burden of traumatic brain injury and spinal cord injury, 1990–2016: A systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 2019, 18, 56–87. [Google Scholar] [CrossRef] [PubMed]
  3. Kamal, V.K.; Agrawal, D.; Pandey, R.M. Epidemiology, clinical characteristics and outcomes of traumatic brain injury: Evidences from integrated level 1 trauma center in India. J. Neurosci. Rural. Pract. 2016, 7, 515–525. [Google Scholar] [CrossRef] [PubMed]
  4. Cryan, J.F.; Dinan, T.G. Mind-altering microorganisms: The impact of the gut microbiota on brain and behaviour. Nat. Rev. Neurosci. 2012, 13, 701–712. [Google Scholar] [CrossRef]
  5. Needham, B.D.; Kaddurah-Daouk, R.; Mazmanian, S.K. Gut microbial molecules in behavioural and neurodegenerative conditions. Nat. Rev. Neurosci. 2020, 21, 717–731. [Google Scholar] [CrossRef]
  6. Bajinka, O.; Simbilyabo, L.; Tan, Y.; Jabang, J.; Saleem, S.A. Lung-brain axis. Crit. Rev. Microbiol. 2022, 48, 257–269. [Google Scholar] [CrossRef]
  7. Li, C.; Chen, W.; Lin, F.; Li, W.; Wang, P.; Liao, G.; Zhang, L. Functional Two-Way Crosstalk Between Brain and Lung: The Brain–Lung Axis. Cell. Mol. Neurobiol. 2022, 43, 991–1003. [Google Scholar] [CrossRef]
  8. Ziaka, M.; Exadaktylos, A. Brain–lung interactions and mechanical ventilation in patients with isolated brain injury. Crit. Care 2021, 25, 358. [Google Scholar] [CrossRef]
  9. Mascia, L.; Zavala, E.; Bosma, K.; Pasero, D.; Decaroli, D.; Andrews, P.; Isnardi, D.; Davi, A.; Arguis, M.J.; Berardino, M. High tidal volume is associated with the development of acute lung injury after severe brain injury: An international observational study. Crit. Care Med. 2007, 35, 1815–1820. [Google Scholar] [CrossRef]
  10. Zafrullah Arifin, M.; Gunawan, W. Analysis of presurgery time as a prognostic factor in traumatic acute subdural hematoma. J. Neurosurg. Sci. 2013, 57, 277–280. [Google Scholar]
  11. Alagoz, F.; Yildirim, A.; Sahinoglu, M.; Korkmaz, M.; Secer, M.; Celik, H.; Yel, C.; Guvenc, Y.; Uckun, O.; Narin, F. Traumatic acute subdural hematomas: Analysis of outcomes and predictive factors at a single center. Turk. Neurosurg. 2017, 27, 187–191. [Google Scholar] [CrossRef] [PubMed]
  12. Xue, L.Y.; Gaowa, S.; Wang, W.; Zhao, F.; Zhu, H.C.; Yu, X.Y.; Gong, Y. Ventilator-associated pneumonia in patients with cerebral hemorrhage: Impact on mortality and microbiological characterization. Med. Clínica 2020, 154, 400–405. [Google Scholar] [CrossRef] [PubMed]
  13. Lee, K.; Rincon, F. Pulmonary Complications in Patients with Severe Brain Injury. Crit. Care Res. Pract. 2012, 2012, 207247. [Google Scholar] [CrossRef] [PubMed]
  14. Kazaure, H.S.; Martin, M.; Yoon, J.K.; Wren, S.M. Long-term results of a postoperative pneumonia prevention program for the inpatient surgical ward. JAMA Surg. 2014, 149, 914–918. [Google Scholar] [CrossRef]
  15. Chastre, J.; Fagon, J.-Y. Ventilator-associated pneumonia. Am. J. Respir. Crit. Care Med. 2002, 165, 867–903. [Google Scholar] [CrossRef]
  16. Rello, J.; Ollendorf, D.A.; Oster, G.; Vera-Llonch, M.; Bellm, L.; Redman, R.; Kollef, M.H. Epidemiology and outcomes of ventilator-associated pneumonia in a large US database. Chest 2002, 122, 2115–2121. [Google Scholar] [CrossRef]
  17. Warren, D.K.; Shukla, S.J.; Olsen, M.A.; Kollef, M.H.; Hollenbeak, C.S.; Cox, M.J.; Cohen, M.M.; Fraser, V.J. Outcome and attributable cost of ventilator-associated pneumonia among intensive care unit patients in a suburban medical center. Crit. Care Med. 2003, 31, 1312–1317. [Google Scholar] [CrossRef]
  18. Javaid, M.; Haleem, A.; Pratap Singh, R.; Suman, R.; Rab, S. Significance of machine learning in healthcare: Features, pillars and applications. Int. J. Intell. Netw. 2022, 3, 58–73. [Google Scholar] [CrossRef]
  19. Kim, J.H.; Kim, Y.; Yoo, K.; Kim, M.; Kang, S.S.; Kwon, Y.-S.; Lee, J.J. Prediction of Postoperative Pulmonary Edema Risk Using Machine Learning. J. Clin. Med. 2023, 12, 1804. [Google Scholar] [CrossRef]
  20. Peng, X.; Zhu, T.; Chen, G.; Wang, Y.; Hao, X. A multicenter prospective study on postoperative pulmonary complications prediction in geriatric patients with deep neural network model. Front. Surg. 2022, 9, 976536. [Google Scholar] [CrossRef]
  21. Xue, B.; Li, D.; Lu, C.; King, C.R.; Wildes, T.; Avidan, M.S.; Kannampallil, T.; Abraham, J. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw. Open 2021, 4, e212240. [Google Scholar] [CrossRef]
  22. Xue, Q.; Wen, D.; Ji, M.-H.; Tong, J.; Yang, J.-J.; Zhou, C.-M. Developing machine learning algorithms to predict pulmonary complications after emergency gastrointestinal surgery. Front. Med. 2021, 8, 655686. [Google Scholar] [CrossRef] [PubMed]
  23. Baek, M.S.; Kim, J.H.; Kwon, Y.S. Cluster analysis integrating age and body temperature for mortality in patients with sepsis: A multicenter retrospective study. Sci. Rep. 2022, 12, 1090. [Google Scholar] [CrossRef] [PubMed]
  24. Madhulatha, T.S. An overview on clustering methods. arXiv 2012, arXiv:1205.1117. [Google Scholar] [CrossRef]
  25. Kim, J.H.; Kim, N.; Lee, S.K.; Kwon, Y.S. Effect of Pregnancy on Postoperative Nausea and Vomiting in Female Patients Who Underwent Nondelivery Surgery: Multicenter Retrospective Cohort Study. Int. J. Environ. Res. Public. Health 2022, 19, 5132. [Google Scholar] [CrossRef]
  26. Kim, J.H.; Lim, M.S.; Choi, J.W.; Kim, H.; Kwon, Y.S.; Lee, J.J. Comparison of the Effects of Sugammadex, Neostigmine, and Pyridostigmine on Postoperative Nausea and Vomiting: A Propensity Matched Study of Five Hospitals. J. Clin. Med. 2020, 9, 3477. [Google Scholar] [CrossRef]
  27. Kim, J.H.; Lim, M.S.; Lee, S.H.; Kwon, Y.S.; Lee, J.J.; Sohn, J.H. Consideration of Migraines Among Risk Factors for Postoperative Nausea and Vomiting. J. Clin. Med. 2020, 9, 3154. [Google Scholar] [CrossRef] [PubMed]
  28. Kwon, Y.S.; Baek, M.S. Development and Validation of a Quick Sepsis-Related Organ Failure Assessment-Based Machine-Learning Model for Mortality Prediction in Patients with Suspected Infection in the Emergency Department. J. Clin. Med. 2020, 9, 875. [Google Scholar] [CrossRef]
  29. Kwon, Y.S.; Choi, J.W.; Lee, H.S.; Kim, J.H.; Kim, Y.; Lee, J.J. Effect of a Preoperative Proton Pump Inhibitor and Gastroesophageal Reflux Disease on Postoperative Nausea and Vomiting. J. Clin. Med. 2020, 9, 825. [Google Scholar] [CrossRef]
  30. Kwon, Y.S.; Kim, J.H.; Hwang, S.M.; Choi, J.W.; Kang, S.S. Comparison of the Effect of Sugammadex and Pyridostigmine on Postoperative Catheter-Related Bladder Discomfort: A Retrospective Matched Cohort Analysis. Medicina 2022, 58, 590. [Google Scholar] [CrossRef]
  31. sklearn.preprocessing.StandardScaler. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html (accessed on 30 January 2023).
  32. Tyagi, S.; Mittal, S. Sampling approaches for imbalanced data classification problem in machine learning. In Proceedings of the ICRIC 2019: Recent Innovations in Computing, Jammu-Kashmir, India, 8–9 March 2019; Springer: Cham, Switzerland, 2020; pp. 209–221. [Google Scholar]
  33. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  34. LightGBM. Available online: https://lightgbm.readthedocs.io/en/v3.3.2/ (accessed on 30 January 2023).
  35. sklearn.neural_network.MLPClassifier. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.neural_network.MLPClassifier.html (accessed on 30 January 2023).
  36. sklearn.linear_model.LogisticRegression. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html (accessed on 30 January 2023).
  37. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  38. Evgeniou, T.; Pontil, M. Support vector machines: Theory and applications. In Advanced Course on Artificial Intelligence; Springer: Berlin/Heidelberg, Germany, 1999; pp. 249–257. [Google Scholar]
  39. BalancedRandomForestClassifier. Available online: https://imbalanced-learn.org/stable/references/generated/imblearn.ensemble.BalancedRandomForestClassifier.html (accessed on 30 January 2023).
  40. Fang, L.; Zhao, H.; Wang, P.; Yu, M.; Yan, J.; Cheng, W.; Chen, P. Feature selection method based on mutual information and class separability for dimension reduction in multidimensional time series for clinical data. Biomed. Signal Process. Control 2015, 21, 82–89. [Google Scholar] [CrossRef]
  41. Liang, J.; Hou, L.; Luan, Z.; Huang, W. Feature Selection with Conditional Mutual Information Considering Feature Interaction. Symmetry 2019, 11, 858. [Google Scholar] [CrossRef]
  42. Zhao, F.; Zhao, J.; Niu, X.; Luo, S.; Xin, Y. A Filter Feature Selection Algorithm Based on Mutual Information for Intrusion Detection. Appl. Sci. 2018, 8, 1535. [Google Scholar] [CrossRef]
  43. Gajowniczek, K.; Wu, J.; Gupta, S.; Bajaj, C. HOFS: Higher order mutual information approximation for feature selection in R. SoftwareX 2022, 19, 101148. [Google Scholar] [CrossRef]
  44. Sasirekha, K.; Baby, P. Agglomerative hierarchical clustering algorithm—A review. Int. J. Sci. Res. Publ. 2013, 83, 83. [Google Scholar]
  45. Abujaber, A.; Fadlalla, A.; Gammoh, D.; Abdelrahman, H.; Mollazehi, M.; El-Menyar, A. Prediction of in-hospital mortality in patients with post traumatic brain injury using National Trauma Registry and Machine Learning Approach. Scand. J. Trauma Resusc. Emerg. Med. 2020, 28, 44. [Google Scholar] [CrossRef]
  46. Hsu, S.-D.; Chao, E.; Chen, S.-J.; Hueng, D.-Y.; Lan, H.-Y.; Chiang, H.-H. Machine Learning Algorithms to Predict In-Hospital Mortality in Patients with Traumatic Brain Injury. J. Pers. Med. 2021, 11, 1144. [Google Scholar] [CrossRef]
  47. Song, X.; Li, H.; Chen, Q.; Zhang, T.; Huang, G.; Zou, L.; Du, D. Predicting pneumonia during hospitalization in flail chest patients using machine learning approaches. Front. Surg. 2022, 9, 1060691. [Google Scholar] [CrossRef]
  48. Song, Y.; Liu, J.; Lei, M.; Wang, Y.; Fu, Q.; Wang, B.; Guo, Y.; Mi, W.; Tong, L. An External-Validated Algorithm to Predict Postoperative Pneumonia Among Elderly Patients with Lung Cancer After Video-Assisted Thoracoscopic Surgery. Front. Oncol. 2021, 11, 777564. [Google Scholar] [CrossRef] [PubMed]
  49. Xiang, G.; Dong, X.; Xu, T.; Feng, Y.; He, Z.; Ke, C.; Xiao, J.; Weng, Y.-M. A nomogram for prediction of postoperative pneumonia risk in elderly hip fracture patients. Risk Manag. Healthc. Policy 2020, 13, 1603–1611. [Google Scholar] [CrossRef] [PubMed]
  50. Marino, M.A.; Siddiqi, I.; Maniakhina, L.; Burton, P.M.; Reier, L.; Duong, J.; Miulli, D.E. Neurosurgical Outcomes in Severe Traumatic Brain Injuries Between Service Lines: Review of a Single Institution Database. Cureus 2023, 15, e37445. [Google Scholar] [CrossRef]
  51. Okidi, R.; Ogwang, D.M.; Okello, T.R.; Ezati, D.; Kyegombe, W.; Nyeko, D.; Scolding, N.J. Factors affecting mortality after traumatic brain injury in a resource-poor setting. BJS Open 2019, 4, 320–325. [Google Scholar] [CrossRef] [PubMed]
  52. Pastor, I.S.; Dumbravă, L.P.; Siserman, C.; Stan, H.; Para, I.; Florian, I. Predictive factors of 30-day mortality in patients with traumatic subdural hematoma. Exp. Ther. Med. 2021, 22, 757. [Google Scholar] [CrossRef]
  53. Lee, S.H.; Lee, C.H.; Hwang, S.H.; Kang, D.H. A Machine Learning–Based Prognostic Model for the Prediction of Early Death After Traumatic Brain Injury: Comparison with the Corticosteroid Randomization After Significant Head Injury (CRASH) Model. World Neurosurg. 2022, 166, e125–e134. [Google Scholar] [CrossRef]
  54. Kauffmann, J.; Esders, M.; Ruff, L.; Montavon, G.; Samek, W.; Müller, K.-R. From clustering to cluster explanations via neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2022, 1–15. [Google Scholar] [CrossRef]
  55. Balaji, K. Machine learning algorithm for feature space clustering of mixed data with missing information based on molecule similarity. J. Biomed. Inform. 2022, 125, 103954. [Google Scholar] [CrossRef]
  56. Abujaber, A.; Fadlalla, A.; Gammoh, D.; Al-Thani, H.; El-Menyar, A. Machine Learning Model to Predict Ventilator Associated Pneumonia in patients with Traumatic Brain Injury: The C.5 Decision Tree Approach. Brain Inj 2021, 35, 1095–1102. [Google Scholar] [CrossRef]
Figure 1. Flow chart.
Figure 1. Flow chart.
Biomedicines 11 02880 g001
Figure 2. Dendrogram.
Figure 2. Dendrogram.
Biomedicines 11 02880 g002
Figure 3. Clustering distribution of patients who underwent surgery due to traumatic brain injury. ER, emergency room; GCS, Glasgow Coma Scale.
Figure 3. Clustering distribution of patients who underwent surgery due to traumatic brain injury. ER, emergency room; GCS, Glasgow Coma Scale.
Biomedicines 11 02880 g003
Table 1. The patient characteristics and perioperative data.
Table 1. The patient characteristics and perioperative data.
Postoperative SurvivalPostoperative DeathASD
Age, years49 (60–73)50.5 (61–75)0.1
Male345 (72.5)111 (78.7)0.1
Duration of surgery, hours64 (100–150)79 (110–149)0.1
TIE0 (0–4.8)0 (0–2.4)0.6
Cooperative surgery10 (2.1)1 (0.7)0.1
Obesity 10 (2.1)4 (2.8)0
Alcohol222 (46.6)57 (40.4)0.1
Smoking149 (31.3)42 (29.8)0
Midline shift, mm2 (5.9–10.5)3.5 (8.8–15.2)0.4
Skull fracture180 (37.8)60 (42.6)0.1
Intracerebral hemorrhage89 (18.7)25 (17.7)0
Subdural hemorrhage317 (66.6)123 (87.2)0.5
Epidural hemorrhage168 (35.3)23 (16.3)0.4
Intraventricular hemorrhage14 (2.9)13 (9.2)0.3
Subarachnoid hemorrhage89 (18.7)50 (35.5)0.4
ASA PS3 (3–4)3 (3–4)0.1
Administered blood0 (200–862.5)500 (1100–1880)0.8
Administered fluid800 (1800–2800)2100 (3100–4300)0.8
Urine output100 (300–600)175 (360–675)0.2
Estimated blood loss200 (600–1500)1000 (2000–3000)0.8
Intraoperative PRCs0 (1–3)2 (4–6)0.8
Intraoperative FFP0 (0–2)0 (2–3)0.7
Intraoperative PC0 (0–0)0 (0–0)0.2
Blood urea nitrogen, mg/dL11.5 (14.5–17.8)12.5 (14.6–19.7)0.2
Creatinine, mg/dL0.7 (0.8–1)0.7 (0.9–1.1)0.3
Albumin, g/dL3.9 (4.2–4.4)3.8 (4.1–4.4)0.2
Sodium, mmol/L137 (139–141)135 (138–141)0.1
Potassium, mmol/L3.4 (3.7–4)3.1 (3.5–3.8)0.3
GCS score8 (13–15)4 (6–10)1.1
Total duration of ventilator care0 (2–10)3 (6–11)0.2
O2/FiO2 < 30092 (19.3)67 (47.5)0.6
Burr hole103 (21.6)3 (2.1)0.7
Craniectomy155 (32.6)103 (73)0.9
Craniotomy203 (42.6)28 (19.9)0.5
Cranioplasty5 (1.1)1 (0.7)0
Coiling6 (1.3)5 (3.5)0.2
ASA PS, American Society of Anesthesiologists physical status; ASD, absolute standardized difference; FFP, fresh frozen plasma; GCS, Glasgow Coma Scale; PC, platelet concentration; PRCs, packed red blood cells; TIE, time between traumatic injury and emergency room.
Table 2. AUROC, precision, recall, accuracy, and F1 scores for prediction of postoperative mortality and pneumonia in patients undergoing surgery with traumatic brain injury.
Table 2. AUROC, precision, recall, accuracy, and F1 scores for prediction of postoperative mortality and pneumonia in patients undergoing surgery with traumatic brain injury.
ModelAUROC
(95% CI)
Precision
(95% CI)
Recall
(95% CI)
Accuracy
(95% CI)
F1 Score
(95% CI)
POMLogistic Regression0.71 (0.61–0.81)0.46 (0.31–0.63)0.64 (0.45–0.81)0.75 (0.68–0.83)0.54 (0.36–0.67)
Random Forest0.63 (0.53–0.73)0.41 (0.24–0.61)0.43 (0.25–0.61)0.73 (0.66–0.81)0.42 (0.26–0.57)
Light GBM0.65 (0.55–0.75)0.45 (0.26–0.64)0.46 (0.29–0.65)0.75 (0.67–0.82)0.46 (0.29–0.61)
Multilayer Perceptron0.69 (0.59–0.79)0.47 (0.29–0.65)0.57 (0.39–0.75)0.76 (0.68–0.83)0.52 (0.35–0.65)
SVM0.64 (0.54–0.74)0.38 (0.24–0.54)0.54 (0.36–0.72)0.7 (0.62–0.78)0.45 (0.29–0.59)
Balanced Random Forest0.72 (0.63–0.81)0.39 (0.27–0.52)0.82 (0.68–0.96)0.67 (0.58–0.74)0.53 (0.4–0.65)
PPNLogistic Regression0.78 (0.7–0.86)0.68 (0.53–0.83)0.7 (0.55–0.84)0.81 (0.74–0.88)0.69 (0.56–0.8)
Random Forest0.74 (0.65–0.82)0.62 (0.46–0.78)0.65 (0.5–0.8)0.77 (0.71–0.85)0.63 (0.5–0.75)
Light GBM0.73 (0.64–0.82)0.57 (0.42–0.72)0.68 (0.52–0.82)0.75 (0.67–0.82)0.62 (0.48–0.74)
Multilayer Perceptron0.69 (0.6–0.78)0.57 (0.4–0.73)0.57 (0.41–0.72)0.74 (0.67–0.81)0.57 (0.42–0.7)
SVM0.68 (0.59–0.77)0.56 (0.39–0.72)0.54 (0.37–0.7)0.73 (0.65–0.81)0.55 (0.41–0.67)
Balanced Random Forest0.72 (0.64–0.8)0.5 (0.37–0.64)0.76 (0.61–0.89)0.7 (0.62–0.77)0.6 (0.48–0.71)
Prediction of POM and PPN using 36 features. AUROC, area under the receiver operating characteristic curve; CI, confidence interval; GBM, gradient boosting machine; POM, postoperative mortality; PPN, postoperative pneumonia; SVM, support vector machine.
Table 3. Feature importance of GCS, TIE, and MSB according to algorithm for prediction of POM and PPN.
Table 3. Feature importance of GCS, TIE, and MSB according to algorithm for prediction of POM and PPN.
VariableLogistic RegressionRandom ForestLight GBMMultilayer PerceptronSVMBalanced Random Forest
GCS0.201 (10)0.229 (11)0.123 (11)0.204 (7)0.21 (9)0.289 (12)
POMTIE0.068 (17)0.182 (16)0.126 (10)0.143 (12)0.162 (13)0.272 (13)
MSB0.131 (13)0 (33)0.009 (26)0.038 (20)0 (32)0.1 (24)
GCS0.191 (6)0.275 (9)0.012 (25)0.031 (20)0.07 (17)0.504 (2)
PPNTIE0.049 (22)0.165 (18)0.086 (12)0.103 (10)0.077 (16)0.19 (13)
MSB0 (27)0.026 (28)0.069 (18)0 (29)0.002 (27)0 (30)
Prediction of POM and PPN using 36 features. GBM, gradient boosting machine; GCS, Glasgow Coma Scale; MSB, midline shift of the brain; TIE, time between traumatic injury and emergency room; POM, postoperative mortality; PPN, postoperative pneumonia; SVM, support vector machine.
Table 4. AUROC, precision, recall, accuracy, and F1 scores for prediction of postoperative mortality and pneumonia in patients undergoing surgery with traumatic brain injury when a cluster feature is added.
Table 4. AUROC, precision, recall, accuracy, and F1 scores for prediction of postoperative mortality and pneumonia in patients undergoing surgery with traumatic brain injury when a cluster feature is added.
ModelAUROC
(95% CI)
Precision
(95% CI)
Recall
(95% CI)
Accuracy
(95% CI)
F1 Score
(95% CI)
POMLogistic Regression0.72 (0.62–0.82)0.64 (0.54–0.75)0.67 (0.57–0.77)0.66 (0.55–0.76)0.6 (0.49–0.7)
Random Forest0.46 (0.31–0.62)0.42 (0.25–0.59)0.47 (0.29–0.66)0.42 (0.26–0.58)0.33 (0.19–0.48)
Light GBM0.68 (0.5–0.86)0.46 (0.29–0.65)0.5 (0.32–0.68)0.54 (0.35–0.73)0.46 (0.28–0.64)
Multilayer Perceptron0.75 (0.68–0.82)0.73 (0.65–0.81)0.76 (0.68–0.83)0.73 (0.65–0.8)0.67 (0.59–0.75)
SVM0.55 (0.4–0.68)0.44 (0.27–0.58)0.48 (0.3–0.62)0.47 (0.31–0.61)0.39 (0.23–0.52)
Balanced Random Forest0.72 (0.62–0.82)0.64 (0.54–0.75)0.67 (0.57–0.77)0.66 (0.55–0.76)0.6 (0.49–0.7)
PPNLogistic Regression0.77 (0.69–0.85)0.77 (0.69–0.86)0.72 (0.64–0.82)0.69 (0.6–0.78)0.67 (0.57–0.76)
Random Forest0.65 (0.5–0.8)0.68 (0.51–0.81)0.61 (0.46–0.76)0.55 (0.39–0.7)0.53 (0.37–0.69)
Light GBM0.7 (0.55–0.84)0.68 (0.53–0.81)0.62 (0.46–0.78)0.57 (0.4–0.73)0.54 (0.37–0.71)
Multilayer Perceptron0.8 (0.73–0.86)0.81 (0.73–0.88)0.77 (0.69–0.84)0.73 (0.65–0.81)0.72 (0.63–0.79)
SVM0.68 (0.52–0.79)0.68 (0.54–0.78)0.61 (0.46–0.73)0.56 (0.43–0.68)0.53 (0.38–0.66)
Balanced Random Forest0.77 (0.69–0.85)0.77 (0.69–0.86)0.72 (0.64–0.82)0.69 (0.6–0.78)0.67 (0.57–0.76)
Prediction of POM and PPN using 34 features. AUROC, area under the receiver operating characteristic curve; CI, confidence interval; GBM, gradient boosting machine; POM, postoperative mortality; PPN, postoperative pneumonia; SVM, support vector machine.
Table 5. Odds ratios of the cluster variable for POM and PPN occurrence.
Table 5. Odds ratios of the cluster variable for POM and PPN occurrence.
Postoperative MortalityPostoperative Pneumonia
Unadjusted ORpAdjusted ORpUnadjusted ORpAdjusted ORp
Cluster 0reference reference reference reference
Cluster 10 (0–0)>0.9990 (0–0)>0.9990 (0–0)>0.9990 (0–0)>0.999
Cluster 20.73 (0.37–1.42)0.3550.82 (0.36–1.88)0.6410.42 (0.26–0.7)0.0010.52 (0.28–0.97)0.040
Cluster 34.32 (2.37–7.87)<0.0013.2 (1.54–6.64)0.0021.28 (0.78–2.11)0.3360.72 (0.39–1.33)0.290
Cluster 44.93 (2.71–8.96)<0.0012.32 (1.09–4.92)0.0291.78 (1.08–2.91)0.0221.07 (0.58–1.97)0.820
OR, odds ratio.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kim, J.-H.; Chung, K.-M.; Lee, J.-J.; Choi, H.-J.; Kwon, Y.-S. Predictive Modeling and Integrated Risk Assessment of Postoperative Mortality and Pneumonia in Traumatic Brain Injury Patients through Clustering and Machine Learning: Retrospective Study. Biomedicines 2023, 11, 2880. https://doi.org/10.3390/biomedicines11112880

AMA Style

Kim J-H, Chung K-M, Lee J-J, Choi H-J, Kwon Y-S. Predictive Modeling and Integrated Risk Assessment of Postoperative Mortality and Pneumonia in Traumatic Brain Injury Patients through Clustering and Machine Learning: Retrospective Study. Biomedicines. 2023; 11(11):2880. https://doi.org/10.3390/biomedicines11112880

Chicago/Turabian Style

Kim, Jong-Ho, Kyung-Min Chung, Jae-Jun Lee, Hyuk-Jai Choi, and Young-Suk Kwon. 2023. "Predictive Modeling and Integrated Risk Assessment of Postoperative Mortality and Pneumonia in Traumatic Brain Injury Patients through Clustering and Machine Learning: Retrospective Study" Biomedicines 11, no. 11: 2880. https://doi.org/10.3390/biomedicines11112880

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop