Optimization of Large Vessel Occlusion Detection in Acute Ischemic Stroke Using Machine Learning Methods

Tarkanyi, Gabor; Tenyi, Akos; Hollos, Roland; Kalmar, Peter Janos; Szapary, Laszlo

doi:10.3390/life12020230


by Gabor Tarkanyi ¹, Akos Tenyi ², Roland Hollos ², Peter Janos Kalmar ¹ and Laszlo Szapary ¹,*

¹ Department of Neurology, Medical School, University of Pécs, 7624 Pécs, Hungary

² Smart Data Group, E-Group ICT Software Zrt., 1027 Budapest, Hungary

* Author to whom correspondence should be addressed.

Life 2022, 12(2), 230; https://doi.org/10.3390/life12020230

Submission received: 3 December 2021 / Revised: 27 January 2022 / Accepted: 31 January 2022 / Published: 3 February 2022

(This article belongs to the Special Issue Challenges in Stroke Care)


Abstract

The early detection of large-vessel occlusion (LVO) strokes is increasingly important as these patients are potential candidates for endovascular therapy, the availability of which is limited. Prehospital LVO detection scales mainly contain symptom variables only; however, recent studies revealed that other types of variables could be useful as well. Our aim was to comprehensively assess the predictive ability of several clinical variables for LVO prediction and to develop an optimal combination of them using machine learning tools. We retrospectively analysed data from a prospectively collected multi-centre stroke registry. Data on 41 variables were collected and divided into four groups (baseline vital parameters/demographic data, medical history, laboratory values, and symptoms). Following the univariate analysis, the LASSO method was used for feature selection to select an optimal combination of variables, and various machine learning methods (random forest (RF), logistic regression (LR), elastic net method (ENM), and simple neural network (SNN)) were applied to optimize the performance of the model. A total of 526 patients were included. Several neurological symptoms were more common and more severe in the group of LVO patients. Atrial fibrillation (AF) was more common, and serum white blood cell (WBC) counts were higher in the LVO group, while systolic blood pressure (SBP) was lower among LVO patients. Using the LASSO method, nine variables were selected for modelling (six symptom variables, AF, chronic heart failure (CHF), and WBC count). When applying machine learning methods and 10-fold cross-validation using the selected variables, all models proved to have an AUC between 0.736 (RF) and 0.775 (LR), similar to the performance of the National Institutes of Health Stroke Scale (AUC: 0.790). Our study highlights that, although certain neurological symptoms have the best ability to predict an LVO, other variables (such as AF and CHF in medical history and WBC counts) should also be included in multivariate models to optimize their efficiency.

Keywords:

acute ischemic stroke; large-vessel occlusion; prehospital care; stroke scales; machine learning

1. Introduction

Large-vessel occlusion (LVO) is present in 20–40% of acute ischemic stroke (AIS) cases, resulting in more severe symptoms and worse outcomes if not treated urgently [1]. In addition to well-established intravenous thrombolysis (IVT), experience using endovascular thrombectomy (EVT) to treat AIS patients with LVO is increasing [2]. However, the number of EVT-capable institutions, so-called comprehensive stroke centres (CSC), is limited [3]. The reliable detection of an LVO is currently only possible using radiological methods, primarily computed tomography angiography (CTA), which is mostly available in hospitals only [4].

Regarding patient pathways, two approaches have emerged. According to the first approach, AIS patients should first be transported to the nearest IVT-capable primary stroke center (PSC). If the presence of an LVO is confirmed, the patient is referred and transported to a CSC for EVT (drip-and-ship approach). In these cases, IVT could be started as soon as possible; however, the time spent in the PSC and the time of transportation may significantly delay the administration of EVT [5]. It should also be considered that IVT is only moderately effective if an LVO is present [6]. The second approach is to transport patients with a high likelihood of LVO directly to a CSC (mothership approach). This may slightly delay the start of the IVT due to the longer transportation time; however, it could significantly reduce the time to EVT administration [5].

One of the current major limitations of applying the mothership approach routinely is the lack of easy-to-perform and sufficiently reliable prehospital methods to identify LVO [7]. Current stroke scales primarily focus on the assessment of clinical symptoms; however, other factors such as atrial fibrillation (AF) in medical history or systolic blood pressure (SBP) may also have good predictive value [8,9]. The aim of our study was to comprehensively assess the associations between clinical symptoms, medical history variables, vital parameters, laboratory values and the presence of LVO in AIS, and to develop an optimal combination of them using machine learning tools and methods.

2. Methods

2.1. Study Cohort

A cross-sectional, observational study was performed based on a prospective registry of consecutive AIS patients presenting up to 4.5 h after symptom onset at the CSC of three university hospitals in Hungary (Figure S1 of the Supplementary Materials) between November 2017 and July 2019. Data on medical history were collected from past medical documentation and based on personal interview with the patient and relatives upon arrival to the emergency department (ED) when possible. Baseline vital parameters and laboratory values were measured as a part of standard care. On admission, stroke symptoms and severity were assessed using the National Institutes of Health Stroke Scale (NIHSS). Detailed information on the registry is available in the Supplementary Materials.

2.2. Outcome

Our outcome of interest was the presence of LVO on the on-admission CTA scan. Acute occlusions of the internal carotid artery (ICA), M1, M2 and M3 segments of the middle cerebral artery (MCA), A1 and A2 segments of the anterior cerebral artery (ACA), P1 and P2 segments of the posterior cerebral artery (PCA), basilar artery (BA), vertebral artery (VA) and tandem occlusions were considered LVOs according to Rennert et al. [10]. Scans were evaluated by trained neuroradiologists (who were blinded to clinical parameters) as a part of standard care. Patients who did not undergo CTA were excluded.

2.3. Statistical Analysis

Continuous variables were presented as mean and standard deviation (SD) or as median and interquartile range (IQR). Normality was assessed using the Shapiro–Wilk test and visually, based on Q–Q plots and histograms. Categorical variables were presented as counts and percentages. In the univariate analysis, continuous variables were compared using a t-test or the Mann–Whitney U test. Categorical data were compared using the Pearson χ² test or the Fisher exact test where appropriate. Receiver operating characteristic (ROC) curve analysis was used to assess the ability of variables and models to discriminate the presence of an LVO. The optimal cut-off score was calculated using the Youden J index.
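As an illustration of the ROC analysis and Youden J index described above, the following minimal Python sketch computes the AUC as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, and picks the cut-off that maximises J = sensitivity + specificity − 1. The function names (`roc_auc`, `youden_cutoff`) and the toy scores are assumptions made for this example; they are not taken from the study, which used SPSS and R.

```python
from typing import List, Tuple


def roc_auc(scores: List[float], labels: List[int]) -> float:
    """AUC via pairwise comparison: P(positive score > negative score), ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def youden_cutoff(scores: List[float], labels: List[int]) -> Tuple[float, float, float, float]:
    """Return (cutoff, J, sensitivity, specificity) for the rule 'score >= cutoff is positive'."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best = None
    for c in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= c)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < c)
        sens = tp / n_pos
        spec = tn / n_neg
        j = sens + spec - 1.0
        # Strict '>' keeps the lowest cut-off when several cut-offs tie on J.
        if best is None or j > best[1]:
            best = (c, j, sens, spec)
    return best


# Toy severity scores and LVO labels, invented for illustration only.
toy_scores = [2, 4, 5, 7, 9, 10, 12, 15]
toy_labels = [0, 0, 0, 1, 0, 1, 1, 1]

toy_auc = roc_auc(toy_scores, toy_labels)
toy_cutoff, toy_j, toy_sens, toy_spec = youden_cutoff(toy_scores, toy_labels)
```

On a real cohort, the pairwise loop would normally be replaced by a rank-based routine from a statistics package; the quadratic version is used here only because it makes the definition explicit.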

2.4. Data Analysis

Data on 41 variables were collected and used for the modelling task. During pre-processing, variables were excluded from the analysis if they had (i) more than 20% missing values (body temperature, SpO₂), (ii) a correlation larger than 0.9 with another variable (Hgb), and/or (iii) near-zero variance (extinction). Rows with missing values were omitted from the analysis. Variables were further processed with the Yeo–Johnson transformation to reduce skewness in laboratory variables, and were centered and scaled to put them on a common scale for machine learning (ML) modeling. SMOTE (synthetic minority oversampling technique) resampling was used to balance the LVO and non-LVO groups. Grid search was used to select optimal hyperparameters for the models. For final model validation, a randomly selected hold-out test cohort consisting of 20% of the patient population was used. To assess the generalizability of the models, 10-fold cross-validation was used.
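The transformation and scaling steps can be sketched in a few lines of Python. This is a simplified illustration, not the authors' pipeline: the Yeo–Johnson parameter `lmbda` is fixed by hand here, whereas preprocessing software normally estimates it from the data, and the WBC-like values are invented.

```python
import math
from statistics import mean, stdev
from typing import List


def yeo_johnson(x: float, lmbda: float) -> float:
    """Yeo-Johnson power transform of a single value for a given lambda."""
    if x >= 0:
        if abs(lmbda) < 1e-12:
            return math.log1p(x)
        return ((x + 1.0) ** lmbda - 1.0) / lmbda
    if abs(lmbda - 2.0) < 1e-12:
        return -math.log1p(-x)
    return -(((-x + 1.0) ** (2.0 - lmbda)) - 1.0) / (2.0 - lmbda)


def center_and_scale(values: List[float]) -> List[float]:
    """Subtract the mean and divide by the sample standard deviation."""
    mu = mean(values)
    sd = stdev(values)
    return [(v - mu) / sd for v in values]


# Invented, right-skewed laboratory-style values (not registry data).
toy_wbc = [6.1, 7.4, 8.0, 9.3, 15.8]

# lambda = 0 is an assumption for this sketch; it reduces to log(1 + x) for x >= 0.
transformed = [yeo_johnson(v, 0.0) for v in toy_wbc]
scaled = center_and_scale(transformed)
```

With `lmbda = 1` the transform leaves values unchanged, which is a convenient sanity check when wiring it into a larger pipeline.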

Four covariate groups were created based on the nature of variables including 6 baseline and demographic variables, 9 medical history variables with yes/no values, 10 laboratory variables with numeric values and 14 symptom-related variables with values on an ordinal scale. The predictive ability of these groups of variables was measured using binary logistic regression analysis and ROC analysis was performed based on probability values.

Feature selection was carried out using least absolute shrinkage and selection operator (LASSO) regression to determine the optimal combination of variables to predict LVO [11]. For further ML modeling, only the selected variables were used as covariates. The performance of three ML models (logistic regression, random forest, and neural network) and the elastic net method was compared with each other and with a logistic regression model with NIHSS as the only covariate, using the area under the ROC curve (AUC) statistic (see Figure S2 in the Supplementary Materials). For neural network modeling, a multi-layer perceptron was used with one hidden layer of four neurons. Analysis was carried out in SPSS (version 26, IBM, New York, NY, USA) and R using the caret ML library [12,13].
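To show how an L1 (LASSO) penalty performs variable selection for a binary outcome, the sketch below fits an L1-penalised logistic regression with proximal gradient descent (soft-thresholding after each gradient step). It is a swapped-in, standard-library-only solver written for illustration and is not the caret-based procedure used in the study; `lasso_logistic`, `soft_threshold`, the penalty `lam`, the step size and the toy data are all assumptions of this example. In the toy data, the first column is informative and the second is pure noise by construction.

```python
import math
from typing import List, Tuple


def soft_threshold(v: float, t: float) -> float:
    """Proximal operator of t * |v|: shrink towards zero, clip to exactly zero."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def lasso_logistic(X: List[List[float]], y: List[int], lam: float,
                   step: float = 0.5, n_iter: int = 500) -> Tuple[float, List[float]]:
    """Minimise mean log-loss + lam * ||beta||_1 (intercept unpenalised)."""
    n, p = len(X), len(X[0])
    intercept, beta = 0.0, [0.0] * p
    for _ in range(n_iter):
        grad0, grad = 0.0, [0.0] * p
        for row, target in zip(X, y):
            err = sigmoid(intercept + sum(b * v for b, v in zip(beta, row))) - target
            grad0 += err
            for j in range(p):
                grad[j] += err * row[j]
        intercept -= step * grad0 / n
        # Gradient step on the smooth loss, then soft-threshold for the L1 term.
        beta = [soft_threshold(b - step * g / n, step * lam) for b, g in zip(beta, grad)]
    return intercept, beta


# Toy data: column 0 drives the outcome (75% vs 25% positives), column 1 is noise.
toy_X: List[List[float]] = []
toy_y: List[int] = []
for x1, outcomes in ((1.0, [1, 1, 1, 0]), (-1.0, [0, 0, 0, 1])):
    for x2 in (1.0, -1.0):
        for outcome in outcomes:
            toy_X.append([x1, x2])
            toy_y.append(outcome)

_, beta_weak = lasso_logistic(toy_X, toy_y, lam=0.05)    # keeps column 0, drops column 1
_, beta_strong = lasso_logistic(toy_X, toy_y, lam=0.30)  # penalty large enough to drop both
```

The design point worth noting is that the penalty sets coefficients to exactly zero rather than merely shrinking them, which is what makes LASSO usable as a feature selector. In practice the penalty strength would be tuned, for example by cross-validation.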

3. Results

A total of 646 patients were screened during the study period, 526 (81.4%) of whom underwent CTA imaging and were finally included in the analysis (46.2% female). The mean age of the study cohort was 68 ± 13 years; 227 patients had LVO (43.2%). The baseline characteristics of the study cohort and the ability of the variables to distinguish an LVO are presented in Table 1. NIHSS had the best discriminative ability with an AUC of 0.783 (95% CI: 0.742–0.824); the optimal cut-off value of NIHSS to detect an LVO was ≥9 points (sensitivity: 70.9%; specificity: 72.6%). The prevalence of several symptoms and the severity of symptoms were higher among LVO patients (Table 2). The distribution of LVO location was as follows: 54 (23.8%) ICA, 74 (32.6%) MCA M1, 52 (22.9%) MCA M2, 4 (1.8%) MCA M3, 2 (0.9%) ACA, 1 (0.4%) PCA, 12 (5.3%) BA, 11 (4.8%) VA, and 17 (7.5%) tandem occlusions. The etiology of LVO strokes was more commonly cardioembolism and less commonly small-vessel disease, as compared to non-LVO cases (Table 1).
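The proportions reported above can be re-derived from the stated counts. The short Python check below uses only numbers given in the text; the helper name `pct` is an assumption of this example.

```python
# Counts as reported in the Results paragraph above.
screened = 646
included = 526
lvo_total = 227
lvo_sites = {
    "ICA": 54, "MCA M1": 74, "MCA M2": 52, "MCA M3": 4, "ACA": 2,
    "PCA": 1, "BA": 12, "VA": 11, "tandem": 17,
}


def pct(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place, as in the text."""
    return round(100.0 * part / whole, 1)


included_pct = pct(included, screened)     # share of screened patients with CTA
lvo_pct = pct(lvo_total, included)         # LVO prevalence in the analysed cohort
site_pct = {site: pct(n, lvo_total) for site, n in lvo_sites.items()}
sites_sum = sum(lvo_sites.values())        # should equal the LVO total
```

The site counts add up to the 227 LVO cases, and each recomputed percentage matches the value printed in the paragraph.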

Regarding predefined covariate groups, the combination of symptoms had the best ability to discriminate an LVO (AUC: 0.779 on the hold-out set and 0.785 after 10-fold cross-validation; p < 0.001 for both), followed by medical history (AUC: 0.602 and 0.686; p < 0.001), laboratory values (AUC: 0.637 and 0.641; p < 0.001) and baseline and demographic parameters (AUC: 0.599 and 0.567; p < 0.001). NIHSS had an AUC of 0.783, and 0.790 after cross-validation (p < 0.001).

Data Driven Analysis

The results of the covariate group analysis showed that, beyond a combination of symptoms (NIHSS items), further variables could have potential discriminative power for LVO, especially among the anamnestic (medical history) and laboratory variables. Thus, we explored the potential of a mixed-covariate model to discriminate LVO patients using data-driven analysis and a variable selection process (see Supplementary Figure S2).

In the initial dataset, there was a relatively high amount of missing data (4% of the dataset), which was mainly missing at random (see Supplementary Table S1 and related comments) and was concentrated in a few variables. Our analysis showed that imputing missing values would negatively affect the performance of the final models (see Supplementary Materials); thus, patients with missing values were omitted from the analysis and a two-step approach was followed to maximize sample size for modelling. After preprocessing the dataset, all samples with missing values were omitted (leaving n = 293) and LASSO regression was used to select the variables most predictive of LVO. Then, the final data-driven analysis was carried out using the original dataset, filtering only to these selected variables, and omitting patients with missing values (leaving n = 483). During feature selection, a total of nine variables were selected for subsequent ML modelling (six symptom variables: language, facial palsy, LOC questions, visual field disturbance, gaze palsy and upper limb weakness; two medical history variables: atrial fibrillation (AF) and chronic heart failure (CHF); and one laboratory value: white blood cell (WBC) count).
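The two-step complete-case approach described above can be sketched as follows. The example only illustrates why restricting the missing-value filter to the selected variables recovers more patients; the column names (including `crp`, which is hypothetical), the toy rows and the "selected" list are invented, and the selection step itself is stubbed out rather than computed.

```python
from typing import Dict, List, Optional, Sequence

Row = Dict[str, Optional[float]]


def complete_cases(rows: List[Row], columns: Sequence[str]) -> List[Row]:
    """Keep only the rows with no missing value in the given columns."""
    return [r for r in rows if all(r[c] is not None for c in columns)]


# Four invented patients; None marks a missing value.
toy_rows: List[Row] = [
    {"facial_palsy": 1, "af": 1, "wbc": 9.1, "crp": None},
    {"facial_palsy": 0, "af": 0, "wbc": 7.2, "crp": 3.0},
    {"facial_palsy": 2, "af": None, "wbc": 8.8, "crp": 5.5},
    {"facial_palsy": 1, "af": 0, "wbc": 6.4, "crp": None},
]
all_columns = ["facial_palsy", "af", "wbc", "crp"]

# Step 1: complete cases over ALL candidate variables feed the feature selection.
selection_set = complete_cases(toy_rows, all_columns)

# Stand-in for the selection result; in the study this came from LASSO regression.
selected = ["facial_palsy", "af", "wbc"]

# Step 2: go back to the original rows and require completeness only on the selected variables.
modelling_set = complete_cases(toy_rows, selected)
```

In this toy case the modelling set grows from one row to three, mirroring the increase from 293 to 483 patients reported in the text.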

Including the selected variables, four ML tools were applied: random forest (RF), logistic regression (LR), elastic net method (ENM), and simple neural network (SNN). The calculated AUC values on the hold-out set and after 10-fold cross-validation were 0.986 and 0.736 for the RF model, 0.816 and 0.775 for the LR, 0.813 and 0.773 for ENM and 0.808 and 0.772 for SNN.
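One rough way to read these figures is to look at the difference between the hold-out AUC and the cross-validated AUC for each model. The Python snippet below tabulates that gap from the values reported above; treating the gap as an indicator of over-optimism on the hold-out set is an interpretation offered here, not a metric computed in the study.

```python
# (hold-out AUC, 10-fold cross-validated AUC) as reported in the text.
auc = {
    "RF": (0.986, 0.736),
    "LR": (0.816, 0.775),
    "ENM": (0.813, 0.773),
    "SNN": (0.808, 0.772),
}

# Difference between hold-out and cross-validated performance per model.
gap = {model: round(holdout - cv, 3) for model, (holdout, cv) in auc.items()}

best_cv_model = max(auc, key=lambda m: auc[m][1])   # highest cross-validated AUC
largest_gap_model = max(gap, key=lambda m: gap[m])  # biggest drop after cross-validation
```

The random forest shows by far the largest drop, while the three other models lose roughly 0.04 each and end up within 0.003 of one another after cross-validation.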

4. Discussion

Our study has highlighted that the severity of certain neurological symptoms may have the best ability to predict an LVO, but our results also pointed out that other variables (notably, AF or CHF in medical history and on-admission WBC values) also have good predictive ability.

The clinical presentation of LVO in AIS is highly dependent on the site of occlusion [10]. Currently, NIHSS is the “gold-standard” for stroke severity assessment and has the best ability to detect LVOs—the previously reported AUC values were similar to our findings [7]. Despite the wide spectrum of symptoms assessed in NIHSS, it still occasionally fails to detect and assess posterior territory strokes appropriately. For short stroke scales, the challenge is to examine the full spectrum of symptoms corresponding to different vascular territory strokes without the process becoming too complicated. The results of a retrospective study suggested that cortical symptoms are better predictors of LVO than motor symptoms, but their combination has the highest accuracy [14]. Our findings showed that upper and lower extremity weakness had the best discriminative abilities, followed by gaze disturbance and facial palsy. However, it should be noted that the majority of the LVO cases in our study involved anterior circulation; therefore, the findings should be interpreted accordingly.

The use of ML methods to optimize prediction models is emerging in the field of stroke research to maximize the predictive performance of variable combinations [15]. Based on the previously mentioned findings, it is not surprising that feature selection using the LASSO method in our study mainly selected symptom variables (both motor and cortical symptoms) for modeling. The selected symptoms represent a wide spectrum of LVOs in various vascular regions, as they occur in both anterior and posterior territory strokes. In addition, variables that had a strong association with the presence of LVO in the univariate analysis were selected, notably AF, CHF, and WBC count. In a recent article by Wang et al. using a similar approach, a set of variables was initially selected based on research in the literature and clinical relevance for subsequent feature selection [15]. In contrast, in our study, we included all variables that were available in adequate quality from a multi-center registry. However, after feature selection, it appeared in both studies that, although symptoms provide the backbone of the models, other types of variables may be important factors and should be included as well.

Including these variables, all applied ML tools performed well on the hold-out set (AUC > 0.800); however, after 10-fold cross-validation, the performance of each decreased markedly: the AUC values of three models (LR, ENM and SNN) ranged from 0.772 to 0.775, while the RF lagged with an AUC of 0.736. The study by Wang et al. applied a similar approach to optimize LVO prediction, and their results regarding the performance of ML tools were quite similar. The ability of stroke scales to detect LVO has also generally been reported around this range in previous retrospective and prospective studies [7,8,15].

Over recent years, a plethora of LVO detection methods have been developed and examined. For a tool to be applicable for prehospital use, several criteria must be met, such as high diagnostic accuracy, easy and fast application, user-friendliness, and cost-effectiveness [16]. The NIHSS may be too complex for routine prehospital use; therefore, the use of shorter scales is warranted at the cost of some reduction in accuracy. It should also be noted that some symptoms are not easily examinable by non-neurologists, such as gaze disturbance and visual field loss, two symptoms that were also selected for modelling in our study and, therefore, may limit prehospital applicability [17]. However, the inclusion of non-symptom variables is not common in LVO scales yet.

Regarding patient history and clinical parameters, a study has found that a history of AF and SBP ≤ 170 mm Hg are independent predictors of LVO in AIS, and these correlations were also confirmed by our results [9]. There have been some attempts to attach AF to various scales with heterogeneous results. A retrospective analysis has shown no improvement in the accuracy of four broadly used short stroke scales when AF was added as an element [18]. In contrast, another study found that adding AF to the Los Angeles Motor Scale (LAMS) could significantly improve its ability to detect LVOs [19]. In addition, several recently created LVO scales include AF as a variable [20,21]. The utility of including SBP in stroke scales is much less studied. A prospective observational study demonstrated that SBP may help to identify patients potentially eligible for EVT [22]. Chronic heart failure is an independent risk factor of stroke, and other diseases should be considered (such as AF, CAD and valvular disease) that are predisposing factors for CHF and AIS [23]. The association between CHF and the presence of LVO probably represents a wide spectrum of confounding and additive conditions. Therefore, CHF might be interchangeable or combinable with the aforementioned cardiac diseases. Future studies may use a combined variable containing all predisposing cardiac diseases at once.

Despite the amount of biomarker research in the field of AIS, so far, only a few markers that are potentially suitable for LVO detection have been identified. Our group has previously found an association between WBC counts and the presence of LVO, which is also confirmed by the current investigation; however, the study populations partially overlapped [24]. Other studies have revealed independent associations between protein markers (such as serum troponin and D-dimer) and the presence of LVO [25,26]. However, to date, they are not routinely used for screening in the prehospital setting.

Univariate analyses in our study revealed that the strength of associations between most variables and LVO is mild to moderate, probably because the associations are affected by many known and unknown confounding factors (e.g., LVO location regarding symptoms). It is also clear that a combination of variables with such specificity cannot exceed a certain accuracy. The study highlighted that machine learning tools are extremely useful to reduce the dimensions of large datasets, and to assess and optimize predictive ability. However, the results should also be approached and interpreted from clinical and practical aspects, since the heterogeneity of clinical presentations may limit the clinical utility of these methods.

Molecular biomarkers supporting the clinical care of stroke, especially its classification and objective monitoring, are not yet available. A better understanding of the biochemical and pathophysiological pathways and processes associated with LVO is needed to identify more specific biomarkers. Screening for a large number of potential biomarkers, i.e., the “omics” approach, and the combined analysis of multi-omic data, including proteomic, metabolomic and, more recently, glycomic data, is a particularly promising solution for identifying new biomarkers. Extended stroke registers and multi-omic databases combining clinical and biomedical data are needed, together with data analysis platforms that facilitate the organization and analysis of large amounts of data with modern machine learning methods, to identify new, complex biomarkers that support stroke typing and therapy monitoring [27,28].

It should also be noted that the definition of LVO is quite heterogeneous, and previous studies and clinical trials have used various criteria for LVO classification [10]. Mechanical thrombectomy (MT) cannot be performed in some cases that are radiologically considered as cases of LVO. However, from a clinical aspect, the 2019 AHA/ASA stroke guidelines recommend considering MT in a wide spectrum of LVO cases. In the case of distal occlusions (e.g., MCA M2 and M3) and occlusions in the posterior circulation, the decision to indicate MT should be made on a case-by-case basis, weighing the potential costs and benefits [29]. Consequently, future studies should aim not only to detect LVO but also to identify eligibility for MT early on.

Anterior and posterior circulation territory occlusions and strokes may show quite different clinical appearances and have different predisposing factors [30,31]. The NIHSS also investigates more anterior territory stroke symptoms and, thus, occasionally fails to correctly assess the severity of posterior strokes [32]. Because we aimed to create a universal LVO detection model in our study, we considered all types of LVO. However, for future studies, it may be worthwhile to optimize the prediction of anterior and posterior circulation LVOs separately in a similar way using ML methods, due to the aforementioned differences. Another possible direction is to apply a method optimized for anterior circulation LVOs first, followed by a method optimized for posterior circulation LVOs.

The main strength of our study is the comprehensive assessment of real-life, prospectively collected data from multiple centers using novel statistical methods that are not extensively used in medical research yet. However, our study also has some limitations. Firstly, the cross-sectional design only allows associations to be assessed, not causality. It is important to emphasize that potentially important variables may not have been included in the analyses for multiple reasons (e.g., a large amount of missing data, or variables not being available in the stroke registry), which could have caused bias. In this study, we used 10-fold cross-validation to estimate the generalizability and the true accuracy of the models; however, validation using an external dataset is needed to clinically validate our findings. Finally, ML tools function best when applied to large datasets (“big data”), which our dataset did not necessarily match.

5. Conclusions

The need for accurate LVO detection scales is emerging. A novel approach for this could be the machine-learning-based development of prediction models. Our study confirmed this, highlighting that neurological symptoms are the most useful to increase the accuracy of prediction models, but other types of variables (certain medical history data, and laboratory values) should also be included to maximize efficiency.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/life12020230/s1, Figure S1: Location of participating centres and distribution of patient enrolment; Table S1: Proportions of missing values; Figure S2: Chart of analysis workflow; Figure S3: Comparison of the distribution of variables between samples with- and without missing values; Table S2: Comparison of variables distributions of samples of the original dataset vs dataset used for variable selection; Table S3: Comparison of variables distributions of samples of the original dataset vs dataset used for model comparison after feature selection; Figure S4: Effect of missing value imputation methods (predictive mean matching (PMM), midas touch, random forest, CART, random sampling, omitting missing values) on the performance of predicting LVO (measured by AUC) of the different imputation methodologies

Author Contributions

G.T. designed the study, performed the literature search and data acquisition, interpreted the results and wrote the manuscript. A.T. designed the study, performed statistical analysis, interpreted the results and wrote the manuscript. R.H. performed statistical analysis and interpreted the results. P.J.K. performed data acquisition. L.S. designed the concepts of the study, interpreted the data, reviewed and approved the manuscript. L.S. is the guarantor. All authors have read and agreed to the published version of the manuscript.

Funding

In this study we used data from the STAY ALIVE Acute Stroke Registry, the operation of which was funded by the Economic Development and Innovation Operative Programme Grant (GINOP 2.3.2-15-2016-00048). Statistical analysis was carried out in cooperation with E-Group ICT Software Zrt. as a part of the InnoHealth Datalake project (GINOP 2.2.1-15-2017-00067). None of the authors received personalized funding for this work.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and was approved by the Hungarian Medical Research Council (35403–2/2017/EKU).

Informed Consent Statement

Written informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to patient privacy considerations (HIPAA).

Acknowledgments

We would like to thank Istvan Szegedi (Department of Neurology, University of Debrecen, Hungary) and Adam Annus (Department of Neurology, University of Szeged, Hungary) for their contribution to the data acquisition for the STAY ALIVE Stroke Registry, which was one of the foundations of our project.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Lakomkin, N.; Dhamoon, M.; Carroll, K.; Singh, I.P.; Tuhrim, S.; Lee, J.; Fifi, J.T.; Mocco, J. Prevalence of large vessel occlusion in patients presenting with acute ischemic stroke: A 10-year systematic review of the literature. J. NeuroInterv. Surg. 2018, 11, 241–245.
2. Goyal, M.; Menon, B.K.; Van Zwam, W.H.; Dippel, D.W.J.; Mitchell, P.J.; Demchuk, A.M.; Dávalos, A.; Majoie, C.B.L.M.; Van Der Lugt, A.; De Miquel, M.A.; et al. Endovascular thrombectomy after large-vessel ischaemic stroke: A meta-analysis of individual patient data from five randomised trials. Lancet 2016, 387, 1723–1731.
3. Aguiar de Sousa, D.; von Martial, R.; Abilleira, S.; Gattringer, T.; Kobayashi, A.; Gallofré, M.; Fazekas, F.; Szikora, I.; Feigin, V.; Caso, V.; et al. Access to and delivery of acute ischaemic stroke treatments: A survey of national scientific societies and stroke experts in 44 European countries. Eur. Stroke J. 2019, 4, 13–28.
4. Almekhlafi, M.; Kunz, W.; Menon, B.; McTaggart, R.; Jayaraman, M.; Baxter, B.; Heck, D.; Frei, D.; Derdeyn, C.; Takagi, T.; et al. Imaging of Patients with Suspected Large-Vessel Occlusion at Primary Stroke Centers: Available Modalities and a Suggested Approach. Am. J. Neuroradiol. 2019, 40, 396–400.
5. Romoli, M.; Paciaroni, M.; Tsivgoulis, G.; Agostoni, E.C.; Vidale, S. Mothership versus Drip-and-Ship Model for Mechanical Thrombectomy in Acute Stroke: A Systematic Review and Meta-Analysis for Clinical and Radiological Outcomes. J. Stroke 2020, 22, 317–323.
6. Saqqur, M.; Uchino, K.; Demchuk, A.M.; Molina, C.A.; Garami, Z.; Calleja, S.; Akhtar, N.; Orouk, F.O.; Salam, A.; Shuaib, A.; et al. Site of Arterial Occlusion Identified by Transcranial Doppler Predicts the Response to Intravenous Thrombolysis for Stroke. Stroke 2007, 38, 948–954.
7. Smith, E.E.; Kent, D.M.; Bulsara, K.R.; Leung, L.Y.; Lichtman, J.H.; Reeves, M.J.; Towfighi, A.; Whiteley, W.; Zahuranec, D.B. Accuracy of Prediction Instruments for Diagnosing Large Vessel Occlusion in Individuals with Suspected Stroke: A Systematic Review for the 2018 Guidelines for the Early Management of Patients with Acute Ischemic Stroke. Stroke 2018, 49, e111–e122.
8. Vidale, S.; Agostoni, E. Prehospital stroke scales and large vessel occlusion: A systematic review. Acta Neurol. Scand. 2018, 138, 24–31.
9. Inoue, M.; Noda, R.; Yamaguchi, S.; Tamai, Y.; Miyahara, M.; Yanagisawa, S.; Okamoto, K.; Hara, T.; Takeuchi, S.; Miki, K.; et al. Specific Factors to Predict Large-Vessel Occlusion in Acute Stroke Patients. J. Stroke Cerebrovasc. Dis. 2018, 27, 886–891.
10. Rennert, R.C.; Wali, A.R.; Steinberg, J.A.; Santiago-Dieppa, D.R.; Olson, S.E.; Pannell, J.S.; Khalessi, A.A. Epidemiology, natural history, and clinical presentation of large vessel ischemic stroke. Neurosurgery 2019, 85, S4–S8.
11. Tibshirani, R. Regression Shrinkage and Selection Via the Lasso. J. R. Stat. Soc. Ser. B 1996, 58, 267–288.
12. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018; Available online: https://www.R-project.org/ (accessed on 12 December 2019).
13. Kuhn, M. A Short Introduction to the Caret Package; R Foundation for Statistical Computing: Vienna, Austria, 2015; Volume 1, pp. 1–10.
14. Beume, L.-A.; Hieber, M.; Kaller, C.P.; Nitschke, K.; Bardutzky, J.; Urbach, H.; Weiller, C.; Rijntjes, M. Large Vessel Occlusion in Acute Stroke. Stroke 2018, 49, 2323–2329.
15. Wang, J.; Zhang, J.; Gong, X.; Zhang, W.; Zhou, Y.; Lou, M. Prediction of large vessel occlusion for ischaemic stroke by using the machine learning model random forests [published online ahead of print, 2021 Oct 26]. Stroke Vasc Neurol 2021, svn-2021-001096.
16. van Meenen, L.C.C.; van Stigt, M.N.; Siegers, A.; Smeekes, M.D.; van Grondelle, J.A.; Geuzebroek, G.; Marquering, H.A.; Majoie, C.B.; Roos, Y.B.; Koelman, J.H.; et al. Detection of large vessel occlusion stroke in the prehospital setting: Electroencephalography as a potential triage instrument. Stroke 2021, 52, e347–e355.
17. Purrucker, J.C.; Härtig, F.; Richter, H.; Engelbrecht, A.; Hartmann, J.; Auer, J.; Hametner, C.; Popp, E.; Ringleb, P.A.; Nagel, S.; et al. Design and validation of a clinical scale for prehospital stroke recognition, severity grading and prediction of large vessel occlusion: The shortened NIH Stroke Scale for emergency medical services. BMJ Open 2017, 7, e016893.
18. Grewal, P.; Lahoti, S.; Aroor, S.; Snyder, K.; Pettigrew, L.C.; Goldstein, L.B. Effect of known atrial fibrillation and anticoagulation status on the prehospital identification of large vessel occlusion. J. Stroke Cerebrovasc. Dis. 2019, 28, 104404.
19. Narwal, P.; Chang, A.D.; Mac Grory, B.; Jayaraman, M.; Madsen, T.; Paolucci, G.; Cutting, S.; Burton, T.; Dakay, K.; Schomer, A.; et al. The Addition of Atrial Fibrillation to the Los Angeles Motor Scale May Improve Prediction of Large Vessel Occlusion. J. Neuroimaging 2019, 29, 463–466.
20. Wang, J.; Gong, X.; Zhong, W.; Zhou, Y.; Lou, M. Novel Prehospital Triage Scale for Detecting Large Vessel Occlusion and Its Cause. J. Am. Heart Assoc. 2021, 10, e021201.
21. Ohta, T.; Nakahara, I.; Matsumoto, S.; Kondo, D.; Watanabe, S.; Okada, K.; Fukuda, M.; Masahira, N.; Tsuno, T.; Matsuoka, T.; et al. Optimizing in-hospital triage for large vessel occlusion using a novel clinical scale (GAI2AA). Neurology 2019, 93, e1997–e2006.
22. Rodríguez-Pardo, J.; Riera-López, N.; Fuentes, B.; de Leciñana, M.A.; Secades-García, S.; Álvarez-Fraga, J.; Carneado-Ruiz, J.; Díaz-Guzmán, J.; Egido-Herrero, J.; Gil-Núñez, A.; et al. Prehospital selection of thrombectomy candidates beyond large vessel occlusion: M-DIRECT scale. Neurology 2020, 94, e851–e860.
23. Kim, W.; Kim, E.J. Heart Failure as a Risk Factor for Stroke. J. Stroke 2018, 20, 33–45.
24. Tarkanyi, G.; Karadi, Z.N.; Szabo, Z.; Szegedi, I.; Csiba, L.; Szapary, L. Relationship between leukocyte counts and large vessel occlusion in acute ischemic stroke. BMC Neurol. 2020, 20, 440.
25. Chang, A.; Ricci, B.; Mac Grory, B.; Cutting, S.; Burton, T.; Dakay, K.; Jayaraman, M.; Merkler, A.; Reznik, M.; Lerario, M.; et al. Cardiac Biomarkers Predict Large Vessel Occlusion in Patients with Ischemic Stroke. J. Stroke Cerebrovasc. Dis. 2019, 28, 1726–1731.
26. Ramos-Pachón, A.; López-Cancio, E.; Bustamante, A.; de la Ossa, N.P.; Millán, M.; Hernández-Pérez, M.; Garcia-Berrocoso, T.; Cardona, P.; Rubiera, M.; Serena, J.; et al. D-Dimer as Predictor of Large Vessel Occlusion in Acute Ischemic Stroke. Stroke 2021, 52, 852–858.
27. Montaner, J.; Ramiro, L.; Simats, A.; Tiedt, S.; Makris, K.; Jickling, G.C.; Debette, S.; Sanchez, J.-C.; Bustamante, A. Multilevel omics for the discovery of biomarkers and therapeutic targets for stroke. Nat. Rev. Neurol. 2020, 16, 247–264.
Csecsei, P.; Várnai, R.; Nagy, L.; Kéki, S.; Molnár, T.; Illés, Z.; Farkas, N.; Szapáry, L. L-arginine pathway metabolites can discriminate paroxysmal from permanent atrial fibrillation in acute ischemic stroke. Ideggyogy Szle 2019, 72, 79–88. [Google Scholar] [CrossRef]
Powers, W.J.; Rabinstein, A.A.; Ackerson, T.; Adeoye, O.M.; Bambakidis, N.C.; Becker, K.; Biller, J.; Brown, M.; Demaerschalk, B.M.; Hoh, B.; et al. Guidelines for the Early Management of Patients with Acute Ischemic Stroke: 2019 Update to the 2018 Guidelines for the Early Management of Acute Ischemic Stroke: A Guideline for Healthcare Professionals from the American Heart Association/American Stroke Association. Stroke 2019, 50, e344–e418. [Google Scholar] [CrossRef]
Jia, B.; Ren, Z.; Mokin, M.; Burgin, W.S.; Bauer, C.T.; Fiehler, J.; Mo, D.; Ma, N.; Gao, F.; Huo, X.; et al. Current Status of Endovascular Treatment for Acute Large Vessel Occlusion in China: A Real-World Nationwide Registry. Stroke 2021, 52, 1203–1212. [Google Scholar] [CrossRef]
Hendrix, P.; Killer-Oberpfalzer, M.; Broussalis, E.; Melamed, I.; Sharma, V.; Mutzenbach, S.; Pikija, S.; Collins, M.; Lieberman, N.; Hecker, C.; et al. Mechanical Thrombectomy for Anterior versus Posterior Circulation Large Vessel Occlusion Stroke with Emphasis on Posterior Circulation Outcomes [published online ahead of print, 2021 Nov 8]. World Neurosurg. 2021, S1878-8750(21)01698-3. [Google Scholar] [CrossRef]
Schneck, M.J. Current Stroke Scales May Be Partly Responsible for Worse Outcomes in Posterior Circulation Stroke. Stroke 2018, 49, 2565–2566. [Google Scholar] [CrossRef]

Table 1. Baseline characteristics of the cohort according to the presence of LVO.

	LVO Present (n = 227)	LVO Absent (n = 299)	p Value	AUC (95% CI)
Demographic characteristics
Age, years, median (IQR)	68 (61–79)	69 (59–77)	0.231	0.524 (0.467–0.582)
Gender, female, % (n)	49.8 (113)	43.5 (130)	0.151	0.530 (0.474–0.587)
Elapsed times
Onset-to-ER assessment time, min, median (IQR)	83 (58–124)	88 (59–135)	0.110	-
ER assessment-to-CTA time, min, median (IQR)	14 (6–23)	17 (6–32)	0.043	-
Parameters on admission
NIHSS score on admission, median (IQR)	12 (8–16)	6 (4–9)	<0.001	0.783 (0.742–0.824)
On admission SBP, mmHg, median (IQR)	160 (140–178)	169.5 (145–185)	0.005	0.420 (0.365–0.474)
On admission DBP, mmHg, median (IQR)	86 (78–99)	90 (80–100)	0.034	0.456 (0.401–0.511)
Heart rate, 1/min, median (IQR)	82 (72–93)	80 (71–92)	0.251	0.533 (0.477–0.589)
SpO₂, %, median (IQR)	97 (96–98)	97 (96–99)	0.025	0.447 (0.345–0.550)
Body temperature, °C, median (IQR)	36.4 (36.0–36.5)	36.5 (36.2–36.6)	0.008	0.372 (0.270–0.474)
BMI, kg/m², median (IQR)	25.78 (23.34–30.12)	26.72 (23.46–31.21)	0.125	0.447 (0.392–0.502)
Laboratory parameters
Blood glucose, mmol/L, median (IQR)	6.90 (5.91–8.28)	6.50 (5.60–8.30)	0.084	0.548 (0.495–0.602)
INR, ratio, median (IQR)	1.03 (0.96–1.10)	1.00 (0.95–1.05)	<0.001	0.587 (0.534–0.640)
CRP, mg/L, median (IQR)	3.30 (1.50–7.20)	2.98 (1.55–5.80)	0.262	0.540 (0.486–0.595)
WBC, 10⁹/L, median (IQR)	8.62 (6.88–10.62)	7.94 (6.55–9.61)	0.005	0.583 (0.530–0.636)
Platelet, 10⁹/L, median (IQR)	233.5 (195–271)	224 (186–267)	0.078	0.532 (0.479–0.586)
Haematocrit, %, median (IQR)	40.0 (37.6–42.8)	41.1 (38.0–44.0)	0.034	0.449 (0.396–0.503)
Haemoglobin, g/L, median (IQR)	138 (126–146)	141 (130–152)	0.005	0.433 (0.380–0.486)
Creatinine, µmol/L, median (IQR)	82 (69–99)	83 (69–101)	0.561	0.485 (0.431–0.539)
BUN, mmol/L, median (IQR)	6.26 (4.80–8.19)	6.10 (4.68–7.63)	0.173	0.527 (0.473–0.581)
AST, U/L, median (IQR)	20 (16–24)	20 (16–25)	0.480	0.476 (0.422–0.530)
ALT, U/L, median (IQR)	15 (11–22)	16 (12–22.5)	0.381	0.466 (0.412–0.520)
Presence of vascular risk factors
Smoking, % (n)	34.9 (66)	31.4 (85)	0.424	0.517 (0.460–0.574)
Hypertension, % (n)	81.4 (180)	80.4 (234)	0.768	0.496 (0.439–0.553)
Diabetes mellitus, % (n)	21.5 (47)	28.6 (82)	0.069	0.475 (0.418–0.531)
Hyperlipidaemia, % (n)	59.2 (125)	58.3 (161)	0.840	0.495 (0.438–0.552)
Atrial fibrillation, % (n)	35.8 (78)	17.5 (50)	<0.001	0.590 (0.533–0.647)
Coronary artery disease, % (n)	29.6 (64)	21.9 (61)	0.051	0.535 (0.478–0.592)
Chronic heart failure, % (n)	17.9 (39)	8.9 (25)	0.002	0.549 (0.492–0.606)
Previous stroke/TIA, % (n)	21.0 (46)	23.2 (66)	0.564	0.494 (0.438–0.551)
Malignancy, % (n)	15.6 (33)	11.7 (33)	0.217	0.520 (0.462–0.577)
Etiology (TOAST), % (n)			<0.001
Large-artery atherosclerosis	26.4 (60)	27.8 (83)
Cardioembolism	51.1 (116)	20.7 (62)
Small vessel disease	0 (0)	21.7 (65)
Other determined origin	0.4 (1)	5.0 (15)
Undetermined etiology	22.0 (50)	24.7 (74)

Abbreviations: LVO, large-vessel occlusion; AUC, area under the curve; CI, confidence interval; IQR, interquartile range; ER, emergency room; CTA, CT angiography; NIHSS, National Institutes of Health Stroke Scale; SBP, systolic blood pressure; DBP, diastolic blood pressure; BMI, body mass index; INR, International Normalized Ratio; CRP, C-reactive protein; WBC, white blood cell; BUN, blood urea nitrogen; AST, aspartate aminotransferase; ALT, alanine aminotransferase; TIA, transient ischemic attack; TOAST, Trial of Org 10172 in Acute Stroke Treatment.
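As a reading aid for the AUC column, the following minimal Python sketch shows how the AUC of a single yes/no variable follows from its prevalence in the two groups. It assumes the variable is scored 1 when present and 0 when absent; the table does not state how the per-variable AUCs were computed, so this is an illustration and not the authors' procedure. The function names `binary_auc` and `pairwise_auc` are hypothetical. For such a score, the AUC equals the probability that a randomly chosen LVO patient scores higher than a randomly chosen non-LVO patient, plus half the probability of a tie, which simplifies to 0.5 + (prevalence in LVO − prevalence in non-LVO)/2.

```python
def binary_auc(prev_lvo: float, prev_no_lvo: float) -> float:
    """AUC of a yes/no predictor scored 1 when present and 0 when absent.

    prev_lvo    -- proportion with the factor among LVO patients
    prev_no_lvo -- proportion with the factor among non-LVO patients
    """
    for p in (prev_lvo, prev_no_lvo):
        if not 0.0 <= p <= 1.0:
            raise ValueError("prevalences must be proportions in [0, 1]")
    # P(LVO patient scores higher) + 0.5 * P(tie) simplifies to this form.
    return 0.5 + (prev_lvo - prev_no_lvo) / 2.0


def pairwise_auc(prev_lvo: float, prev_no_lvo: float) -> float:
    """The same quantity written out as pair probabilities (Mann-Whitney form)."""
    wins = prev_lvo * (1.0 - prev_no_lvo)  # factor present in LVO, absent in non-LVO
    ties = prev_lvo * prev_no_lvo + (1.0 - prev_lvo) * (1.0 - prev_no_lvo)
    return wins + 0.5 * ties


# Prevalences copied from Table 1 (LVO present, LVO absent), as proportions.
rows = {
    "Atrial fibrillation": (0.358, 0.175),
    "Malignancy": (0.156, 0.117),
}

for name, (p_lvo, p_no_lvo) in rows.items():
    print(f"{name}: AUC ~ {binary_auc(p_lvo, p_no_lvo):.2f}")
```

For atrial fibrillation this gives about 0.59, in line with the tabulated 0.590. Some other rows differ from the sketch by roughly 0.01; the sketch does not account for those differences, which may reflect missing values or rounding of the tabulated percentages.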

Table 2. Distribution of symptom severity and prevalence as a function of LVO.

Symptoms (NIHSS Items)	Points, LVO Present	Points, LVO Absent	Points, p Value	Presence, LVO Present	Presence, LVO Absent	Presence, p Value	AUC (95% CI)
1A. Level of consciousness (LOC)	0 (0–0)	0 (0–0)	0.003	12.8%	5.4%	0.003	0.537 (0.487–0.587)
1B. LOC questions	1 (0–2)	0 (0–1)	<0.001	56.4%	33.1%	<0.001	0.638 (0.589–0.686)
1C. LOC commands	0 (0–2)	0 (0–0)	<0.001	47.1%	24.7%	<0.001	0.618 (0.569–0.667)
2. Gaze	0 (0–2)	0 (0–0)	<0.001	46.3%	15.1%	<0.001	0.666 (0.617–0.714)
3. Visual fields	0 (0–2)	0 (0–0)	<0.001	47.6%	21.4%	<0.001	0.632 (0.583–0.681)
4. Facial palsy	2 (1–2)	1 (0–2)	<0.001	85.9%	70.9%	<0.001	0.644 (0.597–0.692)
5. Arm weakness	3 (1–4)	1 (0–2)	<0.001	91.2%	72.6%	<0.001	0.738 (0.695–0.782)
6. Leg weakness	3 (1–3)	1 (0–2)	<0.001	83.3%	64.9%	<0.001	0.717 (0.671–0.762)
7. Limb ataxia	0 (0–0)	0 (0–0)	0.001	7.0%	17.4%	<0.001	0.450 (0.401–0.499)
8. Sensory deficit	0 (0–1)	0 (0–1)	0.688	26.9%	30.1%	0.418	0.492 (0.442–0.542)
9. Language/aphasia	1 (0–2)	0 (0–1)	<0.001	56.8%	37.1%	<0.001	0.634 (0.586–0.683)
10. Dysarthria	0 (0–1)	0 (0–1)	0.893	37.0%	38.1%	0.792	0.497 (0.447–0.547)
11. Extinction/inattention	0 (0–0)	0 (0–0)	0.001	9.7%	2.7%	0.001	0.535 (0.485–0.585)

Abbreviations: LVO, large-vessel occlusion; NIHSS, National Institutes of Health Stroke Scale; AUC, area under the curve; CI, confidence interval.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tarkanyi, G.; Tenyi, A.; Hollos, R.; Kalmar, P.J.; Szapary, L. Optimization of Large Vessel Occlusion Detection in Acute Ischemic Stroke Using Machine Learning Methods. Life 2022, 12, 230. https://doi.org/10.3390/life12020230

