Mortality Prediction of Patients with Subarachnoid Hemorrhage Using a Deep Learning Model Based on an Initial Brain CT Scan

García-García, Sergio; Cepeda, Santiago; Müller, Dominik; Mosteiro, Alejandra; Torné, Ramón; Agudo, Silvia; de la Torre, Natalia; Arrese, Ignacio; Sarabia, Rosario

doi:10.3390/brainsci14010010

Open AccessArticle

Mortality Prediction of Patients with Subarachnoid Hemorrhage Using a Deep Learning Model Based on an Initial Brain CT Scan

by

Sergio García-García

^1,*,†

,

Santiago Cepeda

^1,†

,

Dominik Müller

²

,

Alejandra Mosteiro

³,

Ramón Torné

³

,

Silvia Agudo

¹,

Natalia de la Torre

¹,

Ignacio Arrese

¹ and

Rosario Sarabia

¹

Neurosurgery Department, Rio Hortega University Hospital, 47012 Valladolid, Spain

²

IT-Infrastructure for Translational Medical Research, University of Augsburg, 86159 Augsburg, Germany

³

Neurosurgery Department, Hospital Clinic de Barcelona, 08036 Barcelona, Spain

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Brain Sci. 2024, 14(1), 10; https://doi.org/10.3390/brainsci14010010

Submission received: 5 November 2023 / Revised: 10 December 2023 / Accepted: 21 December 2023 / Published: 22 December 2023

(This article belongs to the Special Issue Clinical Application of Neuroimaging in Cerebral Vascular Diseases)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Background: Subarachnoid hemorrhage (SAH) entails high morbidity and mortality rates. Convolutional neural networks (CNN) are capable of generating highly accurate predictions from imaging data. Our objective was to predict mortality in SAH patients by processing initial CT scans using a CNN-based algorithm. Methods: We conducted a retrospective multicentric study of a consecutive cohort of patients with SAH. Demographic, clinical and radiological variables were analyzed. Preprocessed baseline CT scan images were used as the input for training using the AUCMEDI framework. Our model’s architecture leveraged a DenseNet121 structure, employing transfer learning principles. The output variable was mortality in the first three months. Results: Images from 219 patients were processed; 175 for training and validation and 44 for the model’s evaluation. Of the patients, 52% (115/219) were female and the median age was 58 (SD = 13.06) years. In total, 18.5% (39/219) had idiopathic SAH. The mortality rate was 28.5% (63/219). The model showed good accuracy at predicting mortality in SAH patients when exclusively using the images of the initial CT scan (accuracy = 74%, F1 = 75% and AUC = 82%). Conclusion: Modern image processing techniques based on AI and CNN make it possible to predict mortality in SAH patients with high accuracy using CT scan images as the only input. These models might be optimized by including more data and patients, resulting in better training, development and performance on tasks that are beyond the skills of conventional clinical knowledge.

Keywords:

subarachnoid hemorrhage; convolutional neural networks; artificial intelligence; mortality; prognosis; CT scan

1. Introduction

Subarachnoid hemorrhage (SAH) is a devastating form of hemorrhagic stroke with an incidence of 6–8 persons per 100,000 inhabitants per year and a higher incidence in specific regions such as Japan, Finland or Indiana [1]. Around 70–80% of spontaneous SAHs are caused by the rupture of an intracranial aneurysm, known as aneurysmal SAH (aSAH) [2]. Despite its low incidence, aSAH is a major burden for healthcare systems due to its high mortality and morbidity rates despite optimal treatment [3].

In a modern series, 30-day mortality rates range between 27% and 44%. Little improvement has been achieved in the last decade despite extensive efforts to treat its causes or understand the pathophysiology of the many and treacherous complications that may arise along its course [4,5,6]. Predictors of in-hospital mortality include the admission clinical grade, rebleeding, delayed cerebral ischemia, treatment-related ischemia and intraventricular hemorrhage [4,7]. Early brain injury due to the initial hemorrhagic insult and aneurysm rebleeding account for most fatalities [4]. Therefore, efforts have been addressed to prevent aSAH by controlling vascular risk factors and to prevent rebleeding by granting an early exclusion of the aneurysm. The latter is the epitome of medical debate; optimal timing and the best therapeutic approach are fiercely discussed [8,9,10]. However, survival and functional results have scarcely improved during these first decades of the century. The accurate prediction of outcomes in patients with moderate to poor grades remains a challenge.

Accurate predictions in the medical field often require a large amount of data from large cohorts of patients. Although patient data are increasingly accessible, managing such complex information has led to the development of modern predictive algorithms and models based on artificial intelligence (AI). Convolutional neural networks (CNNs), a form of deep learning (DL), mimic the entangled and complex system of connections existing in biological neural structures. Nodes are organized into layers and are interconnected to generate and spread output signals resulting from multiple interlinked activation functions. CNNs can modify their behavior as they learn from their training. In addition, CNNs might consider features or variables otherwise ignored by the observer. CNNs have shown excellent performance in accurately predicting various targeted variables in the medical field based on different imaging modalities [11,12]. Some known risk factors for in-hospital mortality associated with aSAH can be identified from the initial CT scan (blood amount, intraventricular hemorrhage, edema, ischemic changes, etc.) [4,7,13,14]. This is advantageous as a single sequence of images acquired upon admission can provide most of the relevant information necessary to predict a patient’s course.

In this clinical investigation, we sought to design, create and evaluate a model based on a CNN applied to initial CT scans to predict the mortality of patients admitted to the hospital with a SAH at three months.

2. Materials and Methods

The present investigation was conducted following the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) [15] and the Checklist for Artificial Intelligence in Medical Imaging (CLAIM) [16] guidelines. The study protocol was approved by the Institutional Review Board (22-PI180).

2.1. Study Population

This was a retrospective, non-interventional study. The clinical records of a consecutive cohort of patients diagnosed with SAH admitted to our institution (Hospital Universitario Rio Hortega, Valladolid, Spain) between 2011 and 2022 were retrospectively reviewed. Additionally, a consecutive series of patients from another institution (Hospital Clinic de Barcelona, Barcelona, Spain) with the same inclusion criteria was included to test the robustness of the algorithm. Therefore, the inclusion criteria comprised aneurysmal and non-aneurysmal (perimesencephalic) spontaneous SAH diagnoses based on compatible clinical signs and a positive CT scan as well as known survival status at three months. Aneurysmal and non-aneurysmal etiologies were, respectively, established by a positive or negative AngioCT and/or digital subtraction angiography. Patients whose CT scans were acquired later than 24 h from the onset of symptoms or those that could not properly be processed were excluded.

2.2. Variables

Demographic data such as age, sex, cardiovascular risk factors (smoking, hypertension, diabetes, dyslipidemia and family history of SAH), the admission clinical severity scales (World Federation of Neurosurgical Societies (WFNS) and Hunt and Hess (HH) grading scales), modified Fisher (mF) scale and mortality at 3 months were obtained from the clinical records of included patients [17,18,19].

2.3. Image Acquisition

CT scans from the institutional and external cohorts were, respectively, acquired using Phillips Ingenuity CT (Koninklijke, The Netherlands) and Siemens Somaton CT (Munich, Germany) scanners (Supplementary Table S1).

2.4. Image Preprocessing

CT images were sourced using the Digital Imaging and Communications in Medicine (DICOM) format. An initial step involved transformation into the Neuroimaging Informatics Technology Initiative (NIfTI) format and employing the dicom2niix tool v1.0.20220720 (https://github.com/rordenlab/dcm2niix/releases/tag/v1.0.20220720 accessed on 18 January 2023).

To avoid negative Hounsfield units (Hus), we implemented an intensity normalization through a lossless transformation to Cormack units using the Clinical Toolbox for SPM (https://github.com/neurolabusc/Clinical accessed on 20 January 2023).

Subsequently, we conducted a brain extraction procedure using the Brain Extraction Tool (BET) from the Functional MRI of the Brain Software Library (FSL) v6.0 (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/BET/UserGuide accessed on 20 January 2023). The final step involved registration to a CT template image with dimensions of 1 × 1 × 1 mm and a 193, 229 and 193 size by applying diffeomorphic registrations using symmetric normalization (SyN) from the Advanced Normalization Tools program (https://github.com/ANTsX/ANTs accessed on 21 January 2023).

2.5. Neural Network

In our research, the AUCMEDI (Automated Classification of Medical Images) framework (https://frankkramer-lab.github.io/aucmedi/ accessed on 10 January 2023) [20] was used to instruct a deep neural network to differentiate between two patient outcomes: survival and death.

2.6. Architecture

DenseNet121, a derivative of the Dense Convolutional Network (DenseNet), was implemented [21]. This architecture was selected after trying other CNNs (DenseNet201, DenseNet161, VGG19 and ResNext) because of its higher efficiency in terms of CPU requirements, image size and results. DenseNet121 stands out due to its dense connectivity pattern, its computational efficiency and minimal memory usage, which stems from the reutilization of features. It was selected for its proficiency in extracting intricate and hierarchical features from input images, a critical component in medical image analysis (Figure 1).

2.7. Activation Output

For our binary classification task, the softmax function was used as the final activation function. The softmax function converts output logits into probabilities by normalizing them into a probability distribution so the sum of the output probabilities equals 1. The class with the higher probability is chosen as the output prediction. Although softmax is often associated with multiclass classification problems, it is equally applicable to binary classification. The model predicts the class with the higher probability, so this might provide relevant insights to understand the probability of an outcome. Therefore, one key advantage of using softmax over sigmoid in binary classification is its interpretability as it offers a confidence level associated with the prediction that can be used to support a given clinical statement.

2.8. Class Imbalance and Loss Function

We initially calculated the class weights to be implemented by the categorical focal loss function. Class weights were computed using n_samples/(n_classes × bincount(y)), inspired by the work of King et al. [22]. Focal loss function prioritizes instances that are harder to classify and downplays simpler examples, thereby guiding the model to concentrate more on a balanced set of challenging samples. This method has demonstrated its robustness to class imbalance across different datasets and tasks, thereby enhancing the model’s performance [23].

2.9. Data Augmentation

In our CNN model, we employed several image augmentation techniques to increase the diversity and robustness of our training dataset. These techniques included mirroring (reflecting images across their vertical or horizontal axis), rotation (adjusting images by a certain degree around the center point), scaling (changing the size of the images) and elastic transformation (locally distorting the image by randomly displacing each pixel to simulate natural variations). As suggested by Isensee et al., augmentation techniques are highly efficient procedures to increase the potential generalization of a model as they allow for a better performance with unseen data [24].

2.10. Callbacks

In our model, we employed several callbacks, including EarlyStopping, ModelCheckpoint and ReduceLROnPlateau. EarlyStopping is used to halt training when a monitored metric has stopped improving, preventing overfitting and saving computational resources. ModelCheckpoint allows for the saving of the model after each epoch, ensuring the retention of the best performing model. On the other hand, ReduceLROnPlateau lowers the learning rate when a metric has ceased to improve, optimizing the model’s ability to find the global minimum and enhance the training performance. These strategies work together to mitigate overfitting and reduce unnecessary training time.

2.11. Transfer Learning

Transfer learning is a machine learning (ML) approach that applies a pre-trained model to a new, but related, task. This strategy bolsters learning efficiency, especially when data for the new task are limited. The model retains or “freezes” the learned weights from the prior task while fine-tuning the classification layer to the new task. After several epochs, the model is fully unfrozen for additional fine-tuning, thereby conserving computational resources and training time. Transfer learning was conducted for 10 epochs using the Adam optimizer with an initial learning rate of 1 × 10⁻⁴ and a batch size of 4 for DenseNet121.

2.12. Explainable Artificial Intelligence

We employed gradient-weighted class activation mapping (Grad-CAM) for explainable artificial intelligence [25]. This technique provides visual elucidations for decisions made by CNNs. It uses the gradients of any targeted concept flowing into the final convolutional layer to generate a coarse localization map that emphasizes the crucial regions in the image for a prediction of the result. The provided heat map offers insights into interpretability and helps to identify potential dataset bias.

2.13. Metadata

A second CNN predictive model was developed that incorporated baseline CT scan images and admission-related clinical information as the input. The clinical data were limited to the variables available upon admission (age, sex, hypertension, WFNS grade, acute hydrocephalus, etc.) and demonstrated a statistically significant association with mortality. This model was created to determine if the addition of clinical information could enhance the performance of the image-based model.

2.14. Statistics

Excel (Microsoft, Redmon, WA, USA; version 16.16.4) and SPSS Statistics (IBM, Armonk, NY, USA; version 24) were implemented to run conventional statistical methods. The distribution of continuous variables was assessed using a normality test. Categorical variables were expressed as frequencies and percentages. The categorical variables were compared using chi-squared and Fisher’s exact tests. The association between mortality and the continuous variables was analyzed using a Student’s t-test or Wilcoxon U test. Univariate analysis was performed to study the association of clinical variables with mortality at three months. The performance of the CNN was evaluated using the metrics typically implemented in DL methods such as sensibility, specificity, accuracy, F1 score and the area under the curve (AUC) for the receiver operating characteristic (ROC) curve [26].

3. Results

A total of 219 patients met the inclusion criteria for the study (Figure 2). Among them, 47.5% (104/219) were males and the mean age was 58 (SD = 13.06). A perimesencephalic pattern on the initial CT scan was observed in 37 patients (16.9%). In 42 cases, the initial arteriography did not detect the presence of an aneurysm; out of these, 39 cases (17.8%) were confirmed as idiopathic SAH.

Aneurismatic SAH was reported in 180 patients, with 222 aneurysms and 36 cases (20%) of multiple aneurysms. The mean WFNS and HH on admission were, respectively, 2.5 (SD = 1.6) and 2.2 (SD = 1.6), with a mode of 2 in both cases. The mean mF scale was 3.3 (SD = 0.9) and the mode was 4. For aSAH, 91 (50.5%) patients were surgically treated, 72 (40%) were endovascularly treated and 17 (9%) were not treated due to brain death signs prior to it being possible to provide any effective treatment. In 54.6% of treated patients, the aneurysm was excluded in the first 24 h after the diagnosis. Rebleeding occurred in 15 patients and only 4 of them survived. In the sample of 219 patients, the mean stay was 24 days and the mortality rate was 28.5% (Table 1).

Among the patients with SAH, mortality was significantly superior in older individuals (61.2 vs. 56.5 years old; F = 5.12; t = 2.48; p = 0.014), female patients (35.6% vs. 23.1%; X² = 4.14; p = 0.042) and patients with hypertension (40.6% vs. 21.1%; X² = 9.81; p = 0.002), intraparenchymal hematoma (48.4% vs. 21.9%; X² = 15.24; p < 0.001) and acute hydrocephalus (42.7% vs. 19.8%; X² = 13.04; p < 0.001). Patients with higher grades from the modified Fisher (X² = 39.9; p < 0.001), WFNS (X² = 46.9; p < 0.001) and HH (X² = 48.6; p < 0.001) scales experienced higher mortality rates (Figure S1). All these variables were included as metadata in the CNN model based on baseline CT scan images and clinical information. Other cardiovascular risk factors like diabetes, dyslipidemia or smoking were not associated with a higher risk of mortality. Subdural hematoma or seizures on admission were not associated with mortality.

The highest grades on the WFNS, HH and mF scales demonstrated a strong association with mortality. Remarkably, mF grades 3 or 4 proved to be a strong risk factor for mortality compared with mF grade 1 or 2 (odds ratio of 21.7 (p = 0.003; 95% confidence interval: 2.91–161.71)). The results for other variables are shown in Table 2.

CNN algorithms were developed, trained, validated and tested in this study. Among the models created, the one exclusively based on the initial CT scan demonstrated the best performance. Optimal performance was achieved during the final epoch, with the following metrics: sensitivity = 0.75 (SD = 0.025; 95% CI = 0.716–0.786); specificity = 0.75 (SD = 0.025; 95% CI = 0.716–0.786); accuracy = 0.74 (SD = 0); F1 score = 0.72 (SD = 0.025; 95% CI = 0.615–0.829); and AUC (area under the curve) = 0.82 (SD = 0). The inclusion of additional clinical metadata in the model did not significantly enhance its performance. The best F1 score obtained with the combined model was as follows: sensitivity = 0.75 (SD = 0.025; 95% CI = 0.716–0.786); specificity = 0.75 (SD = 0.025; 95% CI = 0.716–0.786); accuracy = 0.74 (SD = 0); F1 score = 0.74 (SD = 0.077; 95% CI = 0.663–0.817); and AUC = 0.80 (SD = 0). The results are presented in Table 3 and depicted in Figure 2, Figure 3 and Figure 4 (Supplementary Table S2).

4. Discussion

In this investigation, we retrospectively reviewed all consecutive cases of SAH patients admitted to our institution and we validated our results with an external cohort from another center. Images and data were preprocessed and used to train a CNN to predict mortality in a test cohort of patients. The results demonstrated that a CNN predictive algorithm exclusively based on the initial CT outperformed a combination of images and clinical data. The results of this image-based algorithm proved the ability of the CNN to establish solid predictions using medical images as the input. We aimed to develop an innovative, open-source classification model that could readily be tested on diverse datasets. To accomplish this, we chose to utilize a standardized framework (AUCMEDI). Our methodology included preprocessing techniques like resampling, clipping and intensity normalization to minimize potential image variability. Additionally, we employed image augmentation to mitigate the risk of overfitting and to enhance the model’s efficacy on previously unseen datasets. A transfer learning approach was adopted, leveraging the pre-trained models to provide well-established and effective weights, thereby boosting the model’s performance. Lastly, the entire pipeline was not only fully open but also comprehensively documented, ensuring its availability and ease of implementation on new datasets in the future. To the best of our knowledge, this study represents the first successful development of an image-based CNN algorithm that accurately predicts mortality in patients with SAH.

Several studies have demonstrated the ability of DL models to identify abnormalities such as hemorrhages, fractures, strokes and edemas from head CT scans [27,28]. These investigations require extensive labelled image datasets as inputs to build the model [27]. Different approaches have been used for this purpose, from classifying slices as pathologic or normal to the automatic segmentation of abnormal areas [28,29,30,31]. Using AI models to accomplish iterative and tedious tasks such as blood segmentation is a significant advancement that reduces working times and allows large samples of patients to be to processed, increasing the statistical power of the clinical investigation. However, in most of the available scientific papers regarding SAH or brain hemorrhages, the automated processes are limited to feature extractions. These features are then used with conventional statistical methods or in ML algorithms, but a fully automated pipeline capable of accurately predicting a clinical outcome from raw images is lacking for SAH. In this sense, our DL model represents a further leap forward.

Regarding SAH, efforts have also focused on aneurysm detection. Using different modalities of images (CT, DSA or MRI) and approaches (stand-alone AI or AI supporting a clinician), several reports have demonstrated the ability of AI to assist in aneurysm detection [32]. Bo et al. demonstrated the utility of a DL-based model to assist radiologists in the detection of intracranial aneurysms using AngioCT [33]. Increasing the reliability, particularly the specificity, of these automated models could allow for the future screening of intracranial aneurysms in large populations in a context in which human intervention could be relegated to supervision and the final confirmation of positive results.

Mortality and outcome predictions have classically relied on risk factors and clinical and radiological scales [34]. Advanced methods of data processing represent a great opportunity to exploit the information patients harbor early on admission. Therefore, studies have implemented ML methods to extract the best from features with known implications on the final outcome. Dengler et al. compared the performance of ML methods on outcome predictions for aSAH patients and established clinico-radiological scores [35]. The authors found that GCS and age were the most relevant features for outcome predictions and that ML methods were not superior to conventional scores [35]. In a study based on clinical features and ML methods, Toledo et al. achieved an AUC for the ROC curve of 0.85 in a decision tree built with Fisher and WFNS scales to predict functional outcomes [36]. Lo et al. used an extensive database to create an predictive algorithm for outcomes [37]. This model was based on multiple demographic and clinical variables that were used as the input for a Bayesian CNN with fuzzy logic inferences [37]. The AUC for the ROC curve was 0.85. However, many of the features that fed the algorithm were not present on admission; therefore, an early prediction of patient outcomes was not feasible [37]. Our model has the ability to make predictions without any clinical input or need for expert assessments, which has potential for automatization, generalization and applicability in primary and secondary centers referring patients to tertiary hospitals.

Although it remains challenging to speculate about the potential clinical applications of this model beyond the current level of evidence, this prognostic information complements other well-known predictive factors, aiding physicians in daily decision-making for critically ill patients. Thus, we believe that certain potential applications may emerge. These include improving communication among healthcare teams, supporting the information conveyed to families, aiding in decisions related to end-of-life care, the withdrawal of invasive treatments, the implementation of rescue therapies and assisting in determining the optimal timing of treatment. Although our approach serves as an initial step in this direction, it requires further development and validation to be decisive in such a critical, intricate and ethically sensitive subject as mortality prediction. At this juncture, prior to subjecting our model to a comprehensive validation process using larger and independent datasets, it would be reckless to regard the predictions of our model as an absolute and reliable truth and guide the clinical management of these patients based solely on their varying probabilities of survival. For instance, based on a high probability of death provided by our model, clinical decisions could be skewed, resulting in prematurely discontinuing the best available treatment for a patient and presumably leading to a self-fulfilling prophecy. Therefore, the ethical challenges involved in the personalized prognostication of life-threatening conditions like SAH, particularly in terms of interpreting and conveying the inherent uncertainty to those making decisions on behalf of patients, must be considered. In this scenario, the advent of artificial-intelligence-assisted prognostication calls for a contemporary and enduring framework [38]. Such a framework should ensure that physicians, patients and their families are provided with reassurance amidst the uncertainties surrounding the unfathomable question of life and death in critically ill patients.

A paradoxical finding of our research was the null improvement of the predictive model with the addition of clinical metadata with an otherwise proven association with mortality. It was hypothesized that the CNN would extract information from the images beyond human capacity, but we also expected that the clinical data would improve the model. The image-based model was likely able to estimate the quantity and distribution of blood as well as detect signs of brain damage such as edema and the herniation and effacement of basal cisterns. Many of these radiological signs are known factors of a poor clinical grade on admission and have previously been correlated with mortality [4,13,14]. Previous works in other areas have highlighted how the clinical information adds up to an image-based model, while other groups have demonstrated exactly the opposite [39,40,41]. These conflicting experiences might be due to differences in the targeted prediction, the architecture of the NN or even how the clinical information is introduced into the model. It is also possible that baseline CT scans harbor highly valuable information that significantly impacts the outcome. This would challenge the idea of the influence of clinical management and delayed cerebral lesions on SAH mortality and emphasize the relevance of initial damage on the final outcome.

Limitations

In addition to the retrospective nature of the sample, which might have led to the unnoticed loss of some patients, the present study harbored some limitations. First, the predictions were based on a ground truth, which originated from the results of our practice in this case. Mortality rates, causes of death, risk factors, treatment choices and overall management may vary amongst institutions. This flaw can only be tackled if larger training cohorts from different institutions representing different management protocols are used to build the model. Efforts should be made in this regard to aim for a predictive tool that can be applied to as many healthcare contexts as possible. Second, CT scanner protocols and manufacturers might change the information the model extracts from the image and, consequently, the class assigned to a particular case. However, the methodology we implemented to harmonize CT scans was designed to minimize variability in the imaging data to improve the robustness of the training as well as the accuracy and reliability of our model. Third, it can be argued that perimesencephalic SAH and aSAH are completely different diseases with vast differences in clinical evolution and outcomes; therefore, they should not be mixed in a mortality prediction study. Trained specialists in neurovascular emergencies might correctly identify a SAH as perimesencephalic with a rapid view of the CT scan and short assessment of the patient. However, one of the main applications of an image-based model such as the one herein presented is to support clinicians with their decisions, especially in non-tertiary centers where knowledge about alarm signs and prognosis might be scarce. Fourth, although the size of the training cohort was deemed to be sufficient for the construction of a precise predictive model, we acknowledge that larger samples are often preferred. As previously stated, we implemented various image augmentation techniques to increase the diversity and robustness of our training dataset. These techniques are highly effective in mitigating issues associated with smaller sample sizes and enhancing the generalizability of a model [24]. Finally, one of the main problems of DL is that the algorithm does not disclose what their decisions were based on; in other words, we cannot fully explain why the model classifies a given case into a specific class. Efforts are being made to unlock the black box that DL methods often represent. These efforts are referred as Explainable Artificial Intelligence or XAI. In DL, XAI methods are mainly post hoc, meaning that the trained model is analyzed to find learned associations [42]. Our team is currently working on visual activation maps or saliency maps based on gradient-weighted class activation mapping (Grad-CAM), which are graphic representations of the areas of an image that are important for the model to make a decision or classify a case into a group [25,43] (Figure 5 and Figure 6). The use of saliency maps will assuredly contribute to improving our understanding of prognostic model’s performance and aid in thoroughly examining each misclassified case. In this sense, saliency maps are poised to contribute to the development of future research lines. For example, if we can establish that the maps of most survivors share common patterns, this may provide valuable insights into the understanding of factors impacting the patient’s status on admission. Conversely, the maps of non-survivors may exhibit specific patterns on the initial CT scan, which could signify whether there is a critical sign demanding urgent attention or an ominous sign that would potentially render all our efforts futile.

Future research will seek to further validate the present algorithm and apply it to classification tasks such as the differentiation of perimesencephalic SAH from aSAH and the prediction of complication occurrence (vasospasm, shunt-dependent hydrocephalus, delayed cerebral ischemia, etc.).

5. Conclusions

DL algorithms based on initial CT scans allowed us to provide accurate predictions of mortality for SAH patients. The limited improvement seen with the addition of clinical information suggested that many factors influencing patient outcomes are present in the early stages of the disease and could be identified from the initial CT scan. AI predictive models are a promising tool that could significantly improve the understanding of, and decision-making process in, complex pathologies like SAH. However, further optimization of these models through the inclusion of more data and patients is necessary to enhance their performance on complex tasks that are beyond the potential of conventional clinical knowledge.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/brainsci14010010/s1. Figure S1: Distribution of mortality according to risk factors: (A) sex; (B) age; (C) hypertension (HT); (D) intraparenchymal hematoma (IPH); (E) acute hydrocephalus; (F) WFNS; and (G) modified Fisher (mF); Table S1: CT scan protocol parameters. CTDIvol: Computed Tomography Dose Index volume. Table S2: Performance of Neural Networks Algorithms. Results for classes. FDR: False Discovery Rate; FN: False Negative; FP: False Positive; R: Rate; TN: True Negative; TP: True Positive.

Author Contributions

Conceptualization, S.G.-G. and S.C.; methodology, S.G.-G., S.C. and D.M.; software, S.C. and D.M.; validation, S.G.-G., S.C. and D.M.; formal analysis, S.G.-G., S.C. and D.M.; investigation, S.G.-G., A.M., R.T. and I.A.; resources, S.G.-G., S.C. and R.S.; data curation, S.G.-G., A.M., S.A., N.d.l.T.; writing—original draft preparation, S.G.-G.; writing—review and editing, all authors; supervision, I.A. and R.S.; project administration, S.G.-G. and S.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Hospital Universitario Río Hortega (approved code 22-PI180; approved date 12 December 2022).

Informed Consent Statement

Patient consent was waived due to the non-interventional and retrospective nature of the study.

Data Availability Statement

The source code for the AUCMEDI framework can be found at https://github.com/frankkramer-lab/aucmedi. Additionally, the pipeline utilized in our study is publicly available at https://github.com/smcch/Subarachnoid_Hemorrhage_segmentation_and_mortality_prediction. These repositories provide access to the respective source codes, enabling researchers and interested individuals to explore and utilize the frameworks and pipelines implemented in our study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Linn, F.H.; Rinkel, G.J.; Algra, A.; van Gijn, J. Incidence of subarachnoid hemorrhage: Role of region, year, and rate of computed tomography: A meta-analysis. Stroke 1996, 27, 625–629. [Google Scholar] [CrossRef] [PubMed]
Mensing, L.A.; Vergouwen, M.D.I.; Laban, K.G.; Ruigrok, Y.M.; Velthuis, B.K.; Algra, A.; Rinkel, G.J.E. Perimesencephalic Hemorrhage: A Review of Epidemiology, Risk Factors, Presumed Cause, Clinical Course, and Outcome. Stroke 2018, 49, 1363–1370. [Google Scholar] [CrossRef] [PubMed]
GBD 2019 Stroke Collaborators. Global, regional, and national burden of stroke and its risk factors, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 2021, 20, 795–820. [Google Scholar] [CrossRef] [PubMed]
Stienen, M.N.; Germans, M.; Burkhardt, J.K.; Neidert, M.C.; Fung, C.; Bervini, D.; Zumofen, D.; Rothlisberger, M.; Marbacher, S.; Maduri, R.; et al. Predictors of In-Hospital Death After Aneurysmal Subarachnoid Hemorrhage: Analysis of a Nationwide Database (Swiss SOS [Swiss Study on Aneurysmal Subarachnoid Hemorrhage]). Stroke 2018, 49, 333–340. [Google Scholar] [CrossRef] [PubMed]
Nieuwkamp, D.J.; Setz, L.E.; Algra, A.; Linn, F.H.; de Rooij, N.K.; Rinkel, G.J. Changes in case fatality of aneurysmal subarachnoid haemorrhage over time, according to age, sex, and region: A meta-analysis. Lancet Neurol. 2009, 8, 635–642. [Google Scholar] [CrossRef] [PubMed]
Lovelock, C.E.; Rinkel, G.J.; Rothwell, P.M. Time trends in outcome of subarachnoid hemorrhage: Population-based study and systematic review. Neurology 2010, 74, 1494–1501. [Google Scholar] [CrossRef] [PubMed]
Mayfrank, L.; Hutter, B.O.; Kohorst, Y.; Kreitschmann-Andermahr, I.; Rohde, V.; Thron, A.; Gilsbach, J.M. Influence of intraventricular hemorrhage on outcome after rupture of intracranial aneurysm. Neurosurg. Rev. 2001, 24, 185–191. [Google Scholar] [CrossRef]
Catapano, J.S.; Labib, M.A.; Srinivasan, V.M.; Nguyen, C.L.; Rumalla, K.; Rahmani, R.; Cole, T.S.; Baranoski, J.F.; Rutledge, C.; Chapple, K.M.; et al. Saccular aneurysms in the post-Barrow Ruptured Aneurysm Trial era. J. Neurosurg. 2021, 137, 148–155. [Google Scholar] [CrossRef]
Spetzler, R.F.; McDougall, C.G.; Zabramski, J.M.; Albuquerque, F.C.; Hills, N.K.; Russin, J.J.; Partovi, S.; Nakaji, P.; Wallace, R.C. The Barrow Ruptured Aneurysm Trial: 6-year results. J. Neurosurg. 2015, 123, 609–617. [Google Scholar] [CrossRef]
Molyneux, A.; Kerr, R.; Stratton, I.; Sandercock, P.; Clarke, M.; Shrimpton, J.; Holman, R.; International Subarachnoid Aneurysm Trial Collaborative Group. International Subarachnoid Aneurysm Trial (ISAT) of neurosurgical clipping versus endovascular coiling in 2143 patients with ruptured intracranial aneurysms: A randomised trial. Lancet 2002, 360, 1267–1274. [Google Scholar] [CrossRef]
Ngiam, K.Y.; Khor, I.W. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. 2019, 20, e262–e273. [Google Scholar] [CrossRef] [PubMed]
Garcia-Garcia, S.; Garcia-Galindo, M.; Arrese, I.; Sarabia, R.; Cepeda, S. Current Evidence, Limitations and Future Challenges of Survival Prediction for Glioblastoma Based on Advanced Noninvasive Methods: A Narrative Review. Medicina 2022, 58, 1746. [Google Scholar] [CrossRef] [PubMed]
Helbok, R.; Kurtz, P.; Vibbert, M.; Schmidt, M.J.; Fernandez, L.; Lantigua, H.; Ostapkovich, N.D.; Connolly, S.E.; Lee, K.; Claassen, J.; et al. Early neurological deterioration after subarachnoid haemorrhage: Risk factors and impact on outcome. J. Neurol. Neurosurg. Psychiatry 2013, 84, 266–270. [Google Scholar] [CrossRef] [PubMed]
Lagares, A.; Jimenez-Roldan, L.; Gomez, P.A.; Munarriz, P.M.; Castano-Leon, A.M.; Cepeda, S.; Alen, J.F. Prognostic Value of the Amount of Bleeding After Aneurysmal Subarachnoid Hemorrhage: A Quantitative Volumetric Study. Neurosurgery 2015, 77, 898–907; discussion 907. [Google Scholar] [CrossRef] [PubMed]
Vandenbroucke, J.P.; von Elm, E.; Altman, D.G.; Gotzsche, P.C.; Mulrow, C.D.; Pocock, S.J.; Poole, C.; Schlesselman, J.J.; Egger, M.; Initiative, S. Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): Explanation and elaboration. Epidemiology 2007, 18, 805–835. [Google Scholar] [CrossRef] [PubMed]
Mongan, J.; Moy, L.; Kahn, C.E., Jr. Checklist for Artificial Intelligence in Medical Imaging (CLAIM): A Guide for Authors and Reviewers. Radiol. Artif. Intell. 2020, 2, e200029. [Google Scholar] [CrossRef]
Frontera, J.A.; Claassen, J.; Schmidt, J.M.; Wartenberg, K.E.; Temes, R.; Connolly, E.S., Jr.; MacDonald, R.L.; Mayer, S.A. Prediction of symptomatic vasospasm after subarachnoid hemorrhage: The modified fisher scale. Neurosurgery 2006, 59, 21–27; discussion 21–27. [Google Scholar] [CrossRef]
Teasdale, G.M.; Drake, C.G.; Hunt, W.; Kassell, N.; Sano, K.; Pertuiset, B.; De Villiers, J.C. A universal subarachnoid hemorrhage scale: Report of a committee of the World Federation of Neurosurgical Societies. J. Neurol. Neurosurg. Psychiatry 1988, 51, 1457. [Google Scholar] [CrossRef]
Hunt, W.E.; Hess, R.M. Surgical risk as related to time of intervention in the repair of intracranial aneurysms. J. Neurosurg. 1968, 28, 14–20. [Google Scholar] [CrossRef]
Müller, D.; Hartmann, D.; Soto-Rey, I.; Kramer, F. (Eds.) Abstract: AUCMEDI2023; Springer Fachmedien Wiesbaden: Wiesbaden, Germany, 2023. [Google Scholar]
Huang, G.; Liu, Z.; Pleiss, G.; Maaten, L.V.; Weinberger, K.Q. Convolutional Networks with Dense Connectivity. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 8704–8716. [Google Scholar] [CrossRef]
King, G.; Zeng, L. Logistic Regression in Rare Events Data. Political Anal. 2001, 9, 137–163. [Google Scholar] [CrossRef]
Yeung, M.; Sala, E.; Schonlieb, C.B.; Rundo, L. Unified Focal loss: Generalising Dice and cross entropy-based losses to handle class imbalanced medical image segmentation. Comput. Med. Imaging Graph. 2022, 95, 102026. [Google Scholar] [CrossRef] [PubMed]
Isensee, F.; Jaeger, P.F.; Kohl, S.A.A.; Petersen, J.; Maier-Hein, K.H. nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 2021, 18, 203–211. [Google Scholar] [CrossRef]
Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Int. J. Comput. Vis. 2020, 128, 23. [Google Scholar] [CrossRef]
Muller, D.; Soto-Rey, I.; Kramer, F. Towards a guideline for evaluation metrics in medical image segmentation. BMC Res. Notes 2022, 15, 210. [Google Scholar] [CrossRef] [PubMed]
Chilamkurthy, S.; Ghosh, R.; Tanamala, S.; Biviji, M.; Campeau, N.G.; Venugopal, V.K.; Mahajan, V.; Rao, P.; Warier, P. Deep learning algorithms for detection of critical findings in head CT scans: A retrospective study. Lancet 2018, 392, 2388–2396. [Google Scholar] [CrossRef] [PubMed]
Heit, J.J.; Coelho, H.; Lima, F.O.; Granja, M.; Aghaebrahim, A.; Hanel, R.; Kwok, K.; Haerian, H.; Cereda, C.W.; Venkatasubramanian, C.; et al. Automated Cerebral Hemorrhage Detection Using RAPID. AJNR Am. J. Neuroradiol. 2021, 42, 273–278. [Google Scholar] [CrossRef] [PubMed]
Mansour, R.F.; Aljehane, N.O. An optimal segmentation with deep learning based inception network model for intracranial hemorrhage diagnosis. Neural Comput. Appl. 2021, 33, 12. [Google Scholar] [CrossRef]
Thanellas, A.; Peura, H.; Lavinto, M.; Ruokola, T.; Vieli, M.; Staartjes, V.E.; Winklhofer, S.; Serra, C.; Regli, L.; Korja, M. Development and External Validation of a Deep Learning Algorithm to Identify and Localize Subarachnoid Hemorrhage on CT Scans. Neurology 2023, 100, e1257–e1266. [Google Scholar] [CrossRef]
Rajagopal, M.; Buradagunta, S.; Almeshari, M.; Alzamil, Y.; Ramalingam, R.; Ravi, V. An Efficient Framework to Detect Intracranial Hemorrhage Using Hybrid Deep Neural Networks. Brain Sci. 2023, 13, 400. [Google Scholar] [CrossRef]
Din, M.; Agarwal, S.; Grzeda, M.; Wood, D.A.; Modat, M.; Booth, T.C. Detection of cerebral aneurysms using artificial intelligence: A systematic review and meta-analysis. J. Neurointerv. Surg. 2023, 15, 262–271. [Google Scholar] [CrossRef] [PubMed]
Bo, Z.H.; Qiao, H.; Tian, C.; Guo, Y.; Li, W.; Liang, T.; Li, D.; Liao, D.; Zeng, X.; Mei, L.; et al. Toward human intervention-free clinical diagnosis of intracranial aneurysm via deep neural network. Patterns 2021, 2, 100197. [Google Scholar] [CrossRef] [PubMed]
de Winkel, J.; Cras, T.Y.; Dammers, R.; van Doormaal, P.J.; van der Jagt, M.; Dippel, D.W.J.; Lingsma, H.F.; Roozenbeek, B. Early predictors of functional outcome in poor-grade aneurysmal subarachnoid hemorrhage: A systematic review and meta-analysis. BMC Neurol. 2022, 22, 239. [Google Scholar] [CrossRef] [PubMed]
Dengler, N.F.; Madai, V.I.; Unteroberdorster, M.; Zihni, E.; Brune, S.C.; Hilbert, A.; Livne, M.; Wolf, S.; Vajkoczy, P.; Frey, D. Outcome prediction in aneurysmal subarachnoid hemorrhage: A comparison of machine learning methods and established clinico-radiological scores. Neurosurg. Rev. 2021, 44, 2837–2846. [Google Scholar] [CrossRef] [PubMed]
De Toledo, P.; Rios, P.M.; Ledezma, A.; Sanchis, A.; Alen, J.F.; Lagares, A. Predicting the outcome of patients with subarachnoid hemorrhage using machine learning techniques. IEEE Trans. Inf. Technol. Biomed. 2009, 13, 794–801. [Google Scholar] [CrossRef] [PubMed]
Lo, B.W.; Macdonald, R.L.; Baker, A.; Levine, M.A. Clinical outcome prediction in aneurysmal subarachnoid hemorrhage using Bayesian neural networks with fuzzy logic inferences. Comput. Math. Methods Med. 2013, 2013, 904860. [Google Scholar] [CrossRef] [PubMed]
Lissak, I.A.; Edlow, B.L.; Rosenthal, E.; Young, M.J. Ethical Considerations in Neuroprognostication Following Acute Brain Injury. Semin. Neurol. 2023, 43, 758–767. [Google Scholar] [CrossRef] [PubMed]
Pinto, A.; McKinley, R.; Alves, V.; Wiest, R.; Silva, C.A.; Reyes, M. Stroke Lesion Outcome Prediction Based on MRI Imaging Combined with Clinical Information. Front. Neurol. 2018, 9, 1060. [Google Scholar] [CrossRef]
Ningrum, D.N.A.; Yuan, S.P.; Kung, W.M.; Wu, C.C.; Tzeng, I.S.; Huang, C.Y.; Li, J.Y.; Wang, Y.C. Deep Learning Classifier with Patient’s Metadata of Dermoscopic Images in Malignant Melanoma Detection. J. Multidiscip. Healthc. 2021, 14, 877–885. [Google Scholar] [CrossRef]
Mitani, A.; Huang, A.; Venugopalan, S.; Corrado, G.S.; Peng, L.; Webster, D.R.; Hammel, N.; Liu, Y.; Varadarajan, A.V. Detection of anaemia from retinal fundus images via deep learning. Nat. Biomed. Eng. 2020, 4, 18–27. [Google Scholar] [CrossRef]
Van der Velden, B.H.M.; Kuijf, H.J.; Gilhuijs, K.G.A.; Viergever, M.A. Explainable artificial intelligence (XAI) in deep learning-based medical image analysis. Med. Image Anal. 2022, 79, 102470. [Google Scholar] [CrossRef] [PubMed]
Windisch, P.; Weber, P.; Furweger, C.; Ehret, F.; Kufeld, M.; Zwahlen, D.; Muacevic, A. Implementation of model explainability for a basic brain tumor detection using convolutional neural networks on MRI slices. Neuroradiology 2020, 62, 1515–1518. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Diagram depicting the workflow from DICOM raw images to classification results provided by the algorithm. Image preprocessing, data augmentation, neural network architecture and output classification function are herein represented.

Figure 2. Flowchart describing the screened, included and excluded patients for the institutional and external cohort of patients whose images and data were used to create, train and evaluate the model.

Figure 3. Performance of the image-based neural network algorithm. Each metric is represented for each class by its correspondent bar and numeric value within.

Figure 4. Performance evaluation: (A) confusion matrix of the CNN considering “dead” as a positive result for the test; (B) receiver operating characteristic curve of the CNN.

Figure 5. Baseline CT scan (upper row) and gradient-weighted class activation mapping or Grad-CAM (lower row) for three patients (A–C) from the test cohort who were alive three months after suffering a subarachnoid hemorrhage. Saliency maps highlighted regions in red that were more significant when classifying patients; in this case, into the group of patients who survived the event. Thus, it was possible to create a visual depiction of the process the model followed to allocate patients into each class. These maps highlighted supratentorial brain areas and seemed to disregard hemorrhages, except when they followed a perimesencephalic pattern. (A) A 60-year-old male who suffered a perimesencephalic SAH whose angio-MR and angiography were negative. (B) A 43-year-old male who was diagnosed with a SAH caused by a right middle cerebral artery aneurysm. He was admitted in good condition (WFNS 1) and was surgically treated and discharged without major neurological deficits on postoperative day 19. (C) A 75-year-old female who suffered a SAH and was admitted to the hospital with a WFNS grade 4. The left posteroinferior cerebellar artery aneurysm was coiled. The patient survived the event, but was still severely impaired at the three-month follow-up (modified Rankin Scale: 4).

Figure 6. Baseline CT scan (upper row) and gradient-weighted class activation mapping or Grad-CAM (lower row) for three patients (A–C) from the test cohort who died as a result of a subarachnoid hemorrhage (SAH). These maps visually illustrate the areas the model considered to allocate patients into the “dead” group. Grad-CAM maps show that posterior fossa and intraventricular and cisternal blood might be relevant areas or items to consider in order to classify patients as dead. (A) A 51-year-old male who suffered a SAH due to the rupture of a left middle cerebral artery aneurysm. The patient initially presented with a WFNS grade 2, but abruptly deteriorated to a WFNS grade 5 requiring emergent surgical treatment. The patient died on postoperative day 56 as a consequence of both systemic and neurological complications. (B) A 70-year-old female who was diagnosed with a SAH caused by a right posterior communicating artery aneurysm who died 40 days after her admission due to a combination of factors, including delayed cerebral ischemia, meningitis and pneumonia. (C) A 78-year-old male with a SAH caused by an anterior communicating artery who was admitted to the hospital with a WFNS grade 5 and an mF grade 4 who died the next day after the event.

Table 1. Characterization of the sample according to analyzed variables.

Variable	Mean/Mode	Number	Percentage
Female		115	52.5%
Age	57.9 (SD = 13.06) years
RISK FACTORS
HT		96	43.8%
Tobacco		90	41%
Smoker, Female		40	34.8% of females
HT + Tobacco		36	16.5% of total
Diabetes		20	9%
Dyslipidemia		81	37%
Familial History		3	1.5%
Idiopathic SAH		39	18%
Aneurysmal SAH		180	82%
Multiple		36	20%
Anterior Circulation		174	97%
Aneurysm Diameter	7.9 mm (SD = 5.6)
TREATMENT
Surgical		91	50.5%
Endovascular		72	40%
No Treatment		17	9%
Timing of Treatment
Ultra Early (<24 h)		114	70%
Early (24–72 h)		33	20%
Delayed (>72 h)		16	10%
ADMISSION
Hunt and Hess	2.2/2
I		103	47%
II		45	20.5%
III		10	4.5%
IV		17	8%
V		44	20%
WFNS	2.5/2
I		93	42.5%
II		46	21%
III		8	3.5%
IV		26	12%
V		46	21%
Modified Fisher	3.2/4
I		15	7%
II		25	11.5%
III		35	16%
IV		144	65.5%
Intraparenchymal Hematoma		63	35%
Subdural Hematoma		9	5%
COMPLICATIONS
Acute Hydrocephalus		96	44%
Shunt-Dependent Hydrocephalus		38	17.5%
Seizure		37	17%
Epilepsy		12	6.5%
Symptomatic Vasospasm		38	17.5%
Delayed Cerebral Ischemia		52	23.5%
Length of Stay	24 days
OUTCOME
mRS at 3 Months	3/6
0		46	21%
1		37	17%
2		16	7.5%
3		19	8.5%
4		15	7%
5		23	10.5%
6		63	28.5%
Mortality		63	28.5%

HT: hypertension; mRS: modified Rankin Scale; WFNS: World Federation of Neurological Societies.

Table 2. Odds ratios for variables demonstrating statistically significant association with mortality.

Variable	Reference	Degrees of Freedom	p-Value	OR	95% CI
Sex (Male)	Male	1	0.042	0.54	0.30–0.98
Age	1		0.014	1.03	1.01–1.05
Hypertension	Yes	1	0.002	2.55	1.41–4.63
Intraparenchimatous Hematoma	Yes	1	<0.001	3.34	1.79–6.22
Acute Hydrocephalus	Yes	1	<0.001	3.01	1.64–5.55
WFNS	1	4	<0.001
WFNS 2			0.007	3.67	1.44–9.42
WFNS 3			0.033	5.60	1.14–27.40
WFNS 4			0.001	5.83	2.05–16.62
WFNS 5			<0.001	17.50	6.99–43.77
Hunt and Hess	1	4	<0.001
HH 2			0.002	4.51	1.72–11.86
HH 3			0.047	5.00	1.02–24.51
HH 4			0.02	4.63	1.27–16.87
HH 5			<0.001	25.00	8.99–69.56
Modified Fisher, Dichotomized	<2	1	<0.001
mF > 2			0.003	21.70	2.91–161.72

Table 3. Performance of neural networks algorithms. Results are presented as the average of both analyzed classes (dead or alive at three-month follow-up).

Model	Image-Based Neural Network Performance				IMAGE- and Metadata-Based Neural Network Performance
Epoch	Best AUC	Best F1	Best Loss	Last	Best AUC	Best F1	Best Loss	Last
Metric	Best AUC	Best F1	Best Loss	Last	Best AUC	Best F1	Best Loss	Last
TP	14.5	15	15	16	16	16.5	15.5	15
TN	14.5	15	15	16	16	16.5	15.5	15
FP	7	6.5	6.5	5.5	5.5	5	6	6.5
FN	7	6.5	6.5	5.5	5.5	5	6	6.5
Sensitivity	0.53	0.61	0.74	0.75	0.60	0.75	0.69	0.50
Specificity	0.53	0.61	0.74	0.75	0.60	0.75	0.69	0.50
Precision	0.56	0.63	0.70	0.72	0.75	0.73	0.68	0.35
FP Rate	0.47	0.39	0.26	0.25	0.40	0.25	0.31	0.50
FN Rate	0.47	0.39	0.26	0.25	0.40	0.25	0.31	0.50
FDR	0.44	0.37	0.30	0.28	0.25	0.27	0.32	0.15
Accuracy	0.67	0.70	0.70	0.74	0.74	0.77	0.72	0.70
F1	0.51	0.61	0.69	0.72	0.60	0.74	0.68	0.41
AUC	0.72	0.74	0.73	0.82	0.78	0.80	0.78	0.35

FDR: false discovery rate; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

García-García, S.; Cepeda, S.; Müller, D.; Mosteiro, A.; Torné, R.; Agudo, S.; de la Torre, N.; Arrese, I.; Sarabia, R. Mortality Prediction of Patients with Subarachnoid Hemorrhage Using a Deep Learning Model Based on an Initial Brain CT Scan. Brain Sci. 2024, 14, 10. https://doi.org/10.3390/brainsci14010010

AMA Style

García-García S, Cepeda S, Müller D, Mosteiro A, Torné R, Agudo S, de la Torre N, Arrese I, Sarabia R. Mortality Prediction of Patients with Subarachnoid Hemorrhage Using a Deep Learning Model Based on an Initial Brain CT Scan. Brain Sciences. 2024; 14(1):10. https://doi.org/10.3390/brainsci14010010

Chicago/Turabian Style

García-García, Sergio, Santiago Cepeda, Dominik Müller, Alejandra Mosteiro, Ramón Torné, Silvia Agudo, Natalia de la Torre, Ignacio Arrese, and Rosario Sarabia. 2024. "Mortality Prediction of Patients with Subarachnoid Hemorrhage Using a Deep Learning Model Based on an Initial Brain CT Scan" Brain Sciences 14, no. 1: 10. https://doi.org/10.3390/brainsci14010010

APA Style

García-García, S., Cepeda, S., Müller, D., Mosteiro, A., Torné, R., Agudo, S., de la Torre, N., Arrese, I., & Sarabia, R. (2024). Mortality Prediction of Patients with Subarachnoid Hemorrhage Using a Deep Learning Model Based on an Initial Brain CT Scan. Brain Sciences, 14(1), 10. https://doi.org/10.3390/brainsci14010010

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Mortality Prediction of Patients with Subarachnoid Hemorrhage Using a Deep Learning Model Based on an Initial Brain CT Scan

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Population

2.2. Variables

2.3. Image Acquisition

2.4. Image Preprocessing

2.5. Neural Network

2.6. Architecture

2.7. Activation Output

2.8. Class Imbalance and Loss Function

2.9. Data Augmentation

2.10. Callbacks

2.11. Transfer Learning

2.12. Explainable Artificial Intelligence

2.13. Metadata

2.14. Statistics

3. Results

4. Discussion

Limitations

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI