Next Article in Journal
Extracellular Vesicles in Inner Ear Therapies—Pathophysiological, Manufacturing, and Clinical Considerations
Previous Article in Journal
A Review of Ophthalmic Complications in Inflammatory Bowel Diseases
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Answering Clinical Questions Using Machine Learning: Should We Look at Diastolic Blood Pressure When Tailoring Blood Pressure Control?

1
Department of Internal Medicine, Hypertension and Vascular Diseases, Medical University of Warsaw, Banacha 1a, 02-097 Warsaw, Poland
2
Department of Information and Knowledge Engineering, Faculty of Informatics and Statistics, Prague University of Economics and Business, W. Churchill Sq. 4, 120 00 Prague, Czech Republic
3
Department of Physiology and Pathophysiology, Wroclaw Medical University, Chałubińskiego 10, 50-368 Wroclaw, Poland
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
J. Clin. Med. 2022, 11(24), 7454; https://doi.org/10.3390/jcm11247454
Submission received: 11 November 2022 / Revised: 6 December 2022 / Accepted: 13 December 2022 / Published: 15 December 2022
(This article belongs to the Section Cardiology)

Abstract

:
Background: The guidelines recommend intensive blood pressure control. Randomized trials have focused on the relevance of the systolic blood pressure (SBP) lowering, leaving the safety of the diastolic blood pressure (DBP) reduction unresolved. There are data available which show that low DBP should not stop clinicians from achieving SBP targets; however, registries and analyses of randomized trials present conflicting results. The purpose of the study was to apply machine learning (ML) algorithms to determine, whether DBP is an important risk factor to predict stroke, heart failure (HF), myocardial infarction (MI), and primary outcome in the SPRINT trial database. Methods: ML experiments were performed using decision tree, random forest, k-nearest neighbor, naive Bayesian, multi-layer perceptron, and logistic regression algorithms, including and excluding DBP as the risk factor in an unselected and selected (DBP < 70 mmHg) study population. Results: Including DBP as the risk factor did not change the performance of the machine learning models evaluated using accuracy, AUC, mean, and weighted F-measure, and was not required to make proper predictions of stroke, MI, HF, and primary outcome. Conclusions: Analyses of the SPRINT trial data using ML algorithms imply that DBP should not be treated as an independent risk factor when intensifying blood pressure control.

1. Introduction

Diastolic blood pressure (DBP) is often a neglected parameter when making decisions concerning the optimization of blood pressure (BP) control aimed at reducing the risk of cardiovascular (CV) events. Most of the trials and recommendations focus on systolic blood pressure (SBP), leaving DBP values as a subject for discussion [1,2]. However, discussion concerning the role of DBP and the DBP-related J-curve is still relevant. Previously, it was shown that lower DBP may be related to the increased rate of myocardial infarction in hypertensive subjects [3]. The results of other studies addressing this issue confirm that intensive DBP reduction may increase the risk of coronary artery disease [4]. As demonstrated by the results of interventional clinical trials, low DBP may be related to increased CV risk; however, it cannot be interpreted as an independent CV risk factor [5,6]. Contrary to the results of clinical trials obtained using multiple statistical approaches, data from registries [7,8] and, importantly, common clinical concerns seem to point in another direction. Furthermore, the concern that lower DBP may be harmful can promote treatment inertia, resulting in poor BP control at an individual and population level.
As the problem of DBP relevance continues to remain a hot topic in hypertension, an alternative methodological approach incorporating artificial intelligence methods was used to strengthen the data that low DBP should not be the reason to abandon low SBP targets during treatment of hypertension. Machine learning (ML) methods focus on the prediction of an outcome based on variables, while statistical methods focus on the inference between variables and outcomes. Therefore, machine learning methods have different goals than conventional statistics, and might be more suitable to answer the question posed above.
The purpose of the current study was to apply machine learning algorithms to determine whether DBP is an important risk factor to predict stroke, heart failure (HF), myocardial infarction (MI), and primary cardiovascular outcome in the Systolic Blood Pressure Intervention Trial (SPRINT) database.

2. Methods

The SPRINT was a randomized, multi-center open-label trial aimed to verify the hypothesis that a lower SBP target would reduce clinical events [9]. The SPRINT included 9361 patients aged 50 and older with a screening SBP of 130 to 180 mmHg. Patients were included in the trial if they had at least one of the following: an increased CV disease (CVD) risk, defined as a history of clinical or subclinical CVD; chronic kidney disease (estimated glomerular filtration rate 20–60 mL min−1 1.73 m−2); 10-year Framingham CVD risk of 15% or higher; or if they were 75 or older. Patients were excluded from the SPRINT if they had diabetes mellitus, heart failure, or a previous stroke. The SPRINT showed that achieving SBP of less than 120 mmHg, rather than 140 mmHg, contributed to a reduced risk of fatal and nonfatal major cardiovascular events and overall mortality. Primary outcome, myocardial infarction, acute coronary syndrome, stroke, congestive heart failure, and cardiovascular death were significantly reduced in the intensive BP (SBP < 120 mmHg) management arm compared with the standard management arm (SBP < 150 mmHg) (hazard ratio 0.75, 95% CI 0.64–0.89; p < 0.0001). The rationale, protocol, and results of the trial were published and widely discussed elsewhere [9,10].
We used the SPRINT trial data obtained from the NHLBI Biologic Specimen and Data Repository Information Coordinating Center (accession number HLB02021921a). The data are available upon reasonable request. The authors of this manuscript have no right to share the data. This manuscript does not necessarily reflect the opinions or views of the SPRINT Research Group or the NHLBI. The analysis was approved by the Ethics Committee at the Medical University of Warsaw.
The study rationale was based on the assumption that adding DBP to a ML model will not change model performance in classifying whether the outcome (stroke, MI, HF, primary outcome) is present or absent. Consequently, it would imply that the role of DBP is not relevant in predicting the events. Machine learning (ML) experiments were performed using the RapidMiner data science software platform [11]. The variables—age, sex, history of clinical cardiovascular disease, chronic kidney disease, allocation to the treatment arm, smoking status, heart rate, in-trial SBP, and in-trial DBP—which were used previously in a multivariate Cox model, were chosen out of the SPRINT database [5,6]. All experiments were performed separately for target (outcome) variables: stroke, myocardial infarction (MI), heart failure exacerbation (HF), and primary composite outcome (primary). The primary composite outcome was defined as the occurrence of a myocardial infarction, acute coronary syndrome other than myocardial infarction, stroke, acute decompensated heart failure, or death from a cardiovascular cause.
Individual phases of the ML experiments were performed as follows (Figure 1). Firstly, data were retrieved from the database. Then, the role of all variables was specified. As the values of target variables were imbalanced, the data was up-sampled using the synthetic minority oversampling technique (SMOTE) algorithm. The SMOTE algorithm increases the number of minority-class examples by artificially creating new examples similar to those existing in the original data [12]. In the up-sampled dataset, cases with DBP < 70 mmHg were selected. All further steps were performed for unselected (i.e., all cases) and selected (i.e., cases with DBP < 70 mmHg) populations. Each of the experiments was run separately for each outcome—primary, stroke, MI, and HF. Tenfold cross-validation was used to train and test the classification models. The training consisted of running a particular ML algorithm. We used decision tree (with gain ratio as the splitting criterion, maximum depth 10 and pruning), random forest (100 trees with gain ratio as the splitting criterion, maximum depth 10 and pruning), k-nearest neighbor classifier (with k = 5, weighted voting and mixed Euclidean distance), naive Bayesian classifier, multi-layer perceptron (with single hidden layer), and logistic regression. The performance of each classifier was evaluated using overall accuracy (denoted as accuracy in Table 1 and Table 2), the AUC for the detection of positive outcomes (denoted as AUC in Table 1 and Table 2), the mean of the F-measure for positive and negative outcomes (denoted as mF1 in Table 1 and Table 2), and the weighted F-measure for positive and negative outcomes (denoted as wF1 in Table 1 and Table 2). These are standard measures used to evaluate classification models [13,14]. The accuracy, mF1, and wF1 measures were used to evaluate the model as a whole, while the AUC was used to evaluate the performance of the model when predicting the presence of a corresponding diagnosis. Each of the experiments was performed with and without the DBP variable as the predictor.

Statistical Analysis

This is a retrospective analysis of the SPRINT data. The performance of each machine learning algorithm for each of the outcomes (primary, stroke, MI, and HF), expressed by accuracy, AUC, mF1, and wF1 measures, was compared when taking into account the presence or absence of the DBP variable as the predictor. The paired two-sided t-test was used to compare the measures of the classifier performance with and without DBP as the predictor. The Welch t-test was also used for the comparison of classifiers’ performance in the selected and unselected population. The differences were considered significant at p < 0.05. All statistical computations were performed in an R 4.1.0 environment [15]. Standard ‘ggplot2’ and ‘tidyvers’ packages were used [16,17].

3. Results

The baseline characteristic of the study population was presented in detail in the original SPRINT report [9]. Tables S1 and S2 (Supplemental Material) present patients’ baseline characteristics and results in the intensive and standard treatment arms. The performance of each of the classifiers in the detection of analyzed outcomes is presented in Table 1 and Table 2. In the analysis of the unselected population, there were no significant differences in the performance of all of the classifiers when DBP was included as the predictor (the same was true for the opposite case with all outcome variables examined) (Figure 2). In the subpopulation of patients with DBP < 70 mmHg, the same relation occurred as mentioned above (Figure 3). There was also no difference detected in the performance of the classifiers when the selected and unselected populations were compared for all outcome variables, and when DBP was included and excluded as the variable (Figure 4). There were however significant differences when evaluating the particular machine learning algorithms. Regardless of the target class and experimental setting (excluding or including DBP, when using the whole population or the selected population with DBP < 70 mmHg), the multi-layer perceptron was always the best model and DT was the least effective one (Figure 5).
The results of the ML experiments performed show that variables were properly selected for the analysis with good goals of the prediction measures (Table 1 and Table 2). Models selected for the experiments showed consistent results of the performance measures (Table 1 and Table 2).

4. Discussion

The main finding of the study is that adding DBP as a factor does not affect the classifier performance in the prediction of stroke, MI, HF, and primary CV outcomes in the high-CV-risk populations of the SPRINT. DBP as a factor also does not improve the performance of the ML models. Even in the subpopulation of patients with DBP < 70 mmHg, adding DBP to the ML models did not improve the classifier performance. The foregoing is indirect evidence that DBP is not an important risk factor to predict CV outcomes. The question remains whether such a generalization might be made. Observational studies show the J-curve relation of DBP and cardiovascular events, especially in respect of the population with coronary artery disease [18,19,20]. A recent analysis of the EPHESUS randomized study in patients with coronary artery disease and heart failure showed an increased risk for all-cause death, cardiovascular death, and cardiovascular hospitalization in the subpopulation of subjects without reperfusion and with DBP lower than 70 mmHg [21]. The SPRINT secondary analysis also demonstrated there was a J-shaped association between the follow-up DBP and the composite cardiovascular outcome, regardless of intensive or standard treatment [22]. Additionally, although the authors of the study do no exclude reverse causality with respect to low DBP, such results are in line with the intuition of the practitioner associating reduced perfusion of organs with clinical outcomes. The results of analysis of other randomized studies do not confirm that rationale [23,24,25]. Recent studies using the mendelian randomization method [26,27] showed a linear, but not J-curved, relation between MI and DBP, and did not prove causality between DBP and CV events. The results of the current study reinforce those findings. In the currently published analysis of the SPS3 trial performed by Shihab et al. [28], the authors found that even in patients with low baseline DBP and previous stroke, intensive SBP lowering did not increase the risk of stroke. The commented paper has been acknowledged in recent research showing adequate perfusion of the brain in MRI in patients with low DBP values [29]. Our findings support those of Shihab et al. [28], as DBP in the chosen ML models did not improve stroke predictions.
A different methodological approach is applied in machine learning analysis in contrast to statistical analysis. ML prediction results are now widely used in consumer and medical decision-making algorithms, showing high efficiency. In the field of cardiology and imaging medicine, practitioners use ML algorithms [30,31,32,33,34]; yet, clinical study data are not often analyzed in that way [35,36]. It has been changing, though. Currently, new research results based on analysis of the SPRINT and ACCORD trial data incorporating ML methods demonstrated potential for the individualization of hypertension treatment [37].
Analysis performed previously by our team concerning the relationship between low DBP and the risk of cardiovascular events using a classic statistical approach [5,6,38] provided evidence supporting the findings of other researches. However, the current analysis might more effectively finally answer the question posed in the introduction. We believe that ML methods may add value or change researchers’ perspectives in cases where contrary results appear in various classical statistical methods. Nevertheless, we are far from treating ML methods as giving definite answers. As ML and AI methods are now increasingly used to develop new ways of diagnosing and evaluating clinical results, a critical approach to this methodology is necessary. Recommendations summarizing how to conduct and report the course of an experiment involving these methods have recently been published [39]. Since the authors of the present manuscript agree with this new approach, answers to the questions suggested in the recommendations are presented below in Table 3.
In the current study, experiments were performed using the chosen ML classifiers of various types. Other ML methods such as support vector machine (SVM) methods did not perform better than those chosen, but the future use of alternative classifiers such as the (artificial neural network (ANN) or the XBoost regressor does not exclude obtaining different results. However, it is worth mentioning that the study did not focus on finding the best classifier to detect CV outcomes. The objective of the experiments was to explore the role of DBP as a variable, and it was found that it is not a significant factor. Interestingly, adding DBP as a factor did not change the effect of predictions using both the best- and the worst-performing classifiers.
The current study has some limitations. Firstly, it was based on a single randomized study. Although it included 9361 patients, the study population did not cover the full spectrum of CVD risks, as patients with diabetes were excluded. Research using more complex databases and different variables is needed to fully establish the role of DBP as a CVD risk factor. As mentioned in the Methods section, variables selected for the current experiments were the same as those used previously in the multivariate Cox model [5,6]. If different variables were used, we could not exclude the possibility of a positive impact of DBP on outcomes (this was not the case in our study, though). In that case, the positive impact would be an effect of the different variable selection and not an effect of the ML method used. Current experiments were also performed with the additional risk factor of increased pulse pressure. No differences were noted in the study results after the inclusion of this parameter, and therefore the data are not presented in the tables and figures.
In the original database, a significant imbalance in the count of positive and negative outcomes was noted. Therefore, automatically performed up-sampling (SMOTE) was applied. Consequently, ML experiments were performed on the up-sampled SPRINT database which—even though the SMOTE procedure is designed for such purposes—may affect study results.
In conclusion, the study shows that results of the randomized clinical trials may be analyzed using ML methods and such analysis may have clinical implications. Our analysis using various types of ML models shows that adding DBP to the model does not improve model performance. This may imply that DBP should not be treated as an independent risk factor in predicting CV outcomes. The methodological approach which was used in the current study may be implemented in other cases.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jcm11247454/s1, Table S1: Baseline characteristics of the Systolic Blood Pressure Intervention Trial (SPRINT) participants, according to the treatment group; Table S2: The Systolic Blood Pressure Intervention Trial (SPRINT) outcomes.

Author Contributions

Conceptualization, M.S., P.B. and J.L.; methodology, M.S. and P.B.; Software, M.S. and P.B.; validation, J.L. and K.P.; formal analysis, P.B. and M.S.; investigation, M.S., P.B. and J.L.; resources, P.S.; data curation, P.S. and K.P.; writing—original draft preparation, M.S.; writing—review and editing, M.S., J.L., K.P., A.S. and B.P.; visualization, K.P.; supervision, J.L. and A.S.; funding acquisition, A.S. and B.P. All authors have read and agreed to the published version of the manuscript.

Funding

The current analyses were performed in cooperation with the consortium HeartBIT_4.0– Application of Innovative Medical Data Science technologies for heart diseases. This project was funded from the European Union’s Horizon 2020 research and innovation program under grant agreement number 857446.

Institutional Review Board Statement

It is a secondary analysis of the data from SPRINT. SPRINT was conducted according to the guidelines of the Declaration of Helsinki and approved by the institutional review board of each study site involved.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the SPRINT study.

Data Availability Statement

Raw data are available upon reasonable request from NHLBI.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Williams, B.; Mancia, G.; Spiering, W.; Agabiti Rosei, E.; Azizi, M.; Burnier, M.; Clement, D.L.; Coca, A.; de Simone, G.; Dominiczak, A.; et al. 2018 ESC/ESH Guidelines for the management of arterial hypertension. Eur. Heart J. 2018, 39, 3021–3104. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Sobieraj, P.; Lewandowski, J.; Sinski, M. Determination of the optimal on-treatment diastolic blood pressure range using automated office measurements in patients without cardiovascular disease. Pol. Arch. Int. Med. 2021, 131, 249–256. [Google Scholar] [CrossRef] [PubMed]
  3. Stewart, I.M. Relation of reduction in pressure to first myocardial infarction in patients receiving treatment for severe hypertension. Lancet 1979, 1, 861–865. [Google Scholar] [CrossRef] [PubMed]
  4. Alderman, M.H.; Ooi, W.L.; Madhavan, S.; Cohen, H. Treatment-induced blood pressure reduction and the risk of myocardial infarction. JAMA 1989, 262, 920–924. [Google Scholar] [CrossRef] [PubMed]
  5. Sobieraj, P.; Lewandowski, J.; Sinski, M.; Symonides, B.; Gaciong, Z. Low Diastolic Blood Pressure is Not Related to Risk of First Episode of Stroke in a High-Risk Population: A Secondary Analysis of SPRINT. J. Am. Heart Assoc. 2019, 8, e010811. [Google Scholar] [CrossRef] [Green Version]
  6. Sobieraj, P.; Lewandowski, J.; Sinski, M. Low on-treatment diastolic blood pressure is not independently associated with increased cardiovascular risk: An analysis of the SPRINT trial. Eur. Heart J. 2019, 40, 2094–2095. [Google Scholar] [CrossRef]
  7. Vidal-Petiot, E.; Greenlaw, N.; Ford, I.; Ferrari, R.; Fox, K.M.; Tardif, J.C.; Tendera, M.; Parkhomenko, A.; Bhatt, D.L.; Steg, P.G. Relationships Between Components of Blood Pressure and Cardiovascular Events in Patients with Stable Coronary Artery Disease and Hypertension. Hypertension 2018, 71, 168–176. [Google Scholar] [CrossRef] [Green Version]
  8. Lip, S.; Tan, L.E.; Jeemon, P.; McCallum, L.; Dominiczak, A.F.; Padmanabhan, S. Diastolic Blood Pressure J-Curve Phenomenon in a Tertiary-Care Hypertension Clinic. Hypertension 2019, 74, 767–775. [Google Scholar] [CrossRef]
  9. Wright, J.T., Jr.; Williamson, J.D.; Whelton, P.K.; Snyder, J.K.; Sink, K.M.; Rocco, M.V.; Reboussin, D.M.; Rahman, M.; Oparil, S.; Lewis, C.E.; et al. A Randomized Trial of Intensive versus Standard Blood-Pressure Control. N. Engl. J. Med. 2015, 373, 2103–2116. [Google Scholar] [CrossRef]
  10. Ambrosius, W.T.; Sink, K.M.; Foy, C.G.; Berlowitz, D.R.; Cheung, A.K.; Cushman, W.C.; Fine, L.J.; Goff, D.C., Jr.; Johnson, K.C.; Killeen, A.A.; et al. The design and rationale of a multicenter clinical trial comparing two strategies for control of systolic blood pressure: The Systolic Blood Pressure Intervention Trial (SPRINT). Clin. Trials 2014, 11, 532–546. [Google Scholar] [CrossRef]
  11. Rapid Miner Data Science Platform. Available online: https://rapidminer.com/platform/educational (accessed on 7 March 2022).
  12. Chawla, N.; Bowyer, K.; Hall, L.; Kegelmeyer, W. SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  13. Lever, J.; Krzywinski, M.; Altman, N. Classification evaluation. Nat. Methods 2016, 13, 603–604. [Google Scholar] [CrossRef]
  14. Szakonyi, B.; Vassányi, I.; Schumacher, E.; Kósa, I. Efficient methods for acute stress detection using heart rate variability data from Ambient Assisted Living sensors. Biomed. Eng. Online 2021, 20, 73. [Google Scholar] [CrossRef] [PubMed]
  15. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Available online: https://www.R-project.org/ (accessed on 15 March 2022).
  16. Systolic Blood Pressure Intervention Trial (SPRINT) Protocol. Available online: https://www.sprinttrial.org/public/Protocol_Current.pdf (accessed on 15 March 2022).
  17. Kassambara, A. Drawing Survival Curves Using ‘ggplot2’. Available online: https://CRAN.R-project.org/package=survminer (accessed on 15 March 2022).
  18. Cruickshank, J.M.; Thorp, J.M.; Zacharias, F.J. Benefits and potential harm of lowering high blood pressure. Lancet 1987, 1, 581–584. [Google Scholar] [CrossRef]
  19. Farnett, L.; Mulrow, C.D.; Linn, W.D.; Lucey, C.R.; Tuley, M.R. The J-curve phenomenon and the treatment of hypertension. Is there a point beyond which pressure reduction is dangerous? JAMA 1991, 265, 489–495. [Google Scholar] [CrossRef]
  20. Messerli, F.H.; Mancia, G.; Conti, C.R.; Hewkin, A.C.; Kupfer, S.; Champion, A.; Kolloch, R.; Benetos, A.; Pepine, C.J. Dogma disputed: Can aggressively lowering blood pressure in hypertensive patients with coronary artery disease be dangerous? Ann. Intern. Med. 2006, 144, 884–893. [Google Scholar] [CrossRef]
  21. Böhm, M.; Ferreira, J.P.; Mahfoud, F.; Duarte, K.; Pitt, B.; Zannad, F.; Rossignol, P. Myocardial reperfusion reverses the J-curve association of cardiovascular risk and diastolic blood pressure in patients with left ventricular dysfunction and heart failure after myocardial infarction: Insights from the EPHESUS trial. Eur. Heart J. 2020, 41, 1673–1683. [Google Scholar] [CrossRef]
  22. Khan, N.A.; Rabkin, S.W.; Zhao, Y.; McAlister, F.A.; Park, J.E.; Guan, M.; Chan, S.; Humphries, K.H. Effect of Lowering Diastolic Pressure in Patients With and Without Cardiovascular Disease: Analysis of the SPRINT (Systolic Blood Pressure Intervention Trial). Hypertension 2018, 71, 840–847. [Google Scholar] [CrossRef]
  23. Beddhu, S.; Chertow, G.M.; Cheung, A.K.; Cushman, W.C.; Rahman, M.; Greene, T.; Wei, G.; Campbell, R.C.; Conroy, M.; Freedman, B.I.; et al. Influence of Baseline Diastolic Blood Pressure on Effects of Intensive Compared With Standard Blood Pressure Control. Circulation 2018, 137, 134–143. [Google Scholar] [CrossRef]
  24. Ilkun, O.L.; Greene, T.; Cheung, A.K.; Whelton, P.K.; Wei, G.; Boucher, R.E.; Ambrosius, W.; Chertow, G.M.; Beddhu, S. The Influence of Baseline Diastolic Blood Pressure on the Effects of Intensive Blood Pressure Lowering on Cardiovascular Outcomes and All-Cause Mortality in Type 2 Diabetes. Diabetes Care 2020, 43, 1878–1884. [Google Scholar] [CrossRef]
  25. Zhang, W.; Zhang, S.; Deng, Y.; Wu, S.; Ren, J.; Sun, G.; Yang, J.; Jiang, Y.; Xu, X.; Wang, T.D.; et al. Trial of Intensive Blood-Pressure Control in Older Patients with Hypertension. N. Engl. J. Med. 2021, 385, 1268–1279. [Google Scholar] [CrossRef] [PubMed]
  26. Arvanitis, M.; Qi, G.; Bhatt, D.L.; Post, W.S.; Chatterjee, N.; Battle, A.; McEvoy, J.W. Linear and Nonlinear Mendelian Randomization Analyses of the Association Between Diastolic Blood Pressure and Cardiovascular Events: The J-Curve Revisited. Circulation 2021, 143, 895–906. [Google Scholar] [CrossRef] [PubMed]
  27. Malik, R.; Georgakis, M.K.; Vujkovic, M.; Damrauer, S.M.; Elliott, P.; Karhunen, V.; Giontella, A.; Fava, C.; Hellwege, J.N.; Shuey, M.M.; et al. Relationship Between Blood Pressure and Incident Cardiovascular Disease: Linear and Nonlinear Mendelian Randomization Analyses. Hypertension 2021, 77, 2004–2013. [Google Scholar] [CrossRef] [PubMed]
  28. Shihab, S.; Boucher, R.E.; Abraham, N.; Wei, G.; Beddhu, S. Influence of Baseline Diastolic Blood Pressure on the Effects of Intensive Systolic Blood Pressure Lowering on the Risk of Stroke. Hypertension 2022, 79, 785–793. [Google Scholar] [CrossRef] [PubMed]
  29. Dolui, S.; Detre, J.A.; Gaussoin, S.A.; Herrick, J.S.; Wang, D.J.J.; Tamura, M.K.; Cho, M.E.; Haley, W.E.; Launer, L.J.; Punzi, H.A.; et al. Association of Intensive vs Standard Blood Pressure Control With Cerebral Blood Flow: Secondary Analysis of the SPRINT MIND Randomized Clinical Trial. JAMA Neurol. 2022, 79, 380–389. [Google Scholar] [CrossRef] [PubMed]
  30. Avanzo, M.; Wei, L.; Stancanello, J.; Vallières, M.; Rao, A.; Morin, O.; Mattonen, S.A.; El Naqa, I. Machine and deep learning methods for radiomics. Med. Phys. 2020, 47, e185–e202. [Google Scholar] [CrossRef]
  31. Zhang, F. Application of machine learning in CT images and X-rays of COVID-19 pneumonia. Medicine 2021, 100, e26855. [Google Scholar] [CrossRef]
  32. Tokodi, M.; Schwertner, W.R.; Kovács, A.; Tősér, Z.; Staub, L.; Sárkány, A.; Lakatos, B.K.; Behon, A.; Boros, A.M.; Perge, P.; et al. Machine learning-based mortality prediction of patients undergoing cardiac resynchronization therapy: The SEMMELWEIS-CRT score. Eur. Heart J. 2020, 41, 1747–1756. [Google Scholar] [CrossRef] [Green Version]
  33. Cohen-Shelly, M.; Attia, Z.I.; Friedman, P.A.; Ito, S.; Essayagh, B.A.; Ko, W.Y.; Murphree, D.H.; Michelena, H.I.; Enriquez-Sarano, M.; Carter, R.E.; et al. Electrocardiogram screening for aortic valve stenosis using artificial intelligence. Eur. Heart J. 2021, 42, 2885–2896. [Google Scholar] [CrossRef]
  34. Hannun, A.Y.; Rajpurkar, P.; Haghpanahi, M.; Tison, G.H.; Bourn, C.; Turakhia, M.P.; Ng, A.Y. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nat. Med. 2019, 25, 65–69. [Google Scholar] [CrossRef]
  35. Sidey-Gibbons, J.A.M.; Sidey-Gibbons, C.J. Machine learning in medicine: A practical introduction. BMC Med. Res. Methodol. 2019, 19, 64. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Niedziela, J.T.; Cieśla, D.; Wojakowski, W.; Gierlotka, M.; Dudek, D.; Witkowski, A.; Zdrojewski, T.; Lesiak, M.; Buszman, P.; Gąsior, M. Is neural network better than logistic regression in death prediction in patients after ST-segment elevation myocardial infarction? Kardiol. Pol. (Pol. Heart J.) 2021, 79, 1353–1361. [Google Scholar] [CrossRef] [PubMed]
  37. Oikonomou, E.K.; Spatz, E.S.; Suchard, M.A.; Khera, R. Individualising intensive systolic blood pressure reduction in hypertension using computational trial phenomaps and machine learning: A post-hoc analysis of randomised clinical trials. Lancet Digit. Health 2022, 4, e796–e805. [Google Scholar] [CrossRef] [PubMed]
  38. Sinski, M.; Sobieraj, P.; Lewandowski, J. Not afraid of low diastolic blood pressure anymore? Aging 2019, 11, 9229–9230. [Google Scholar] [CrossRef] [PubMed]
  39. van Smeden, M.; Heinze, G.; Van Calster, B.; Asselbergs, F.W.; Vardas, P.E.; Bruining, N.; de Jaegere, P.; Moore, J.H.; Denaxas, S.; Boulesteix, A.L.; et al. Critical appraisal of artificial intelligence-based prediction models for cardiovascular disease. Eur. Heart J. 2022, 43, 2921–2930. [Google Scholar] [CrossRef]
Figure 1. Flowchart presenting ML experiment phases. DBP—diastolic blood pressure, SMOTE—synthetic minority oversampling technique.
Figure 1. Flowchart presenting ML experiment phases. DBP—diastolic blood pressure, SMOTE—synthetic minority oversampling technique.
Jcm 11 07454 g001
Figure 2. Comparison of the performance of the classifiers when DBP was included (DBP (+)) and excluded (DBP (−)) as the predictor presented as boxplots for all outcome variables examined in the unselected population. AUC—area under curve, mF1—mean F-measure, wF1—mean weighted F-measure, no differences detected for all of the comparisons.
Figure 2. Comparison of the performance of the classifiers when DBP was included (DBP (+)) and excluded (DBP (−)) as the predictor presented as boxplots for all outcome variables examined in the unselected population. AUC—area under curve, mF1—mean F-measure, wF1—mean weighted F-measure, no differences detected for all of the comparisons.
Jcm 11 07454 g002
Figure 3. Comparison of the performance of the classifiers when DBP was included (DBP (+)) and excluded (DBP (−)) as the predictor presented as boxplots for all outcome variables examined in the selected population of subjects with DBP < 70 mmHg. AUC—area under curve, mF1—mean F-measure, wF1—mean weighted F-measure, no differences detected for all of the comparisons.
Figure 3. Comparison of the performance of the classifiers when DBP was included (DBP (+)) and excluded (DBP (−)) as the predictor presented as boxplots for all outcome variables examined in the selected population of subjects with DBP < 70 mmHg. AUC—area under curve, mF1—mean F-measure, wF1—mean weighted F-measure, no differences detected for all of the comparisons.
Jcm 11 07454 g003
Figure 4. Comparison of the performance of the classifiers in the selected (selection) and unselected (no selection) population presented as boxplots for all of outcome variables when diastolic blood pressure was included and excluded as the predictor. AUC—area under curve, mF1—mean F-measure, wF1—mean weighted F-measure, no differences detected for all of the comparisons.
Figure 4. Comparison of the performance of the classifiers in the selected (selection) and unselected (no selection) population presented as boxplots for all of outcome variables when diastolic blood pressure was included and excluded as the predictor. AUC—area under curve, mF1—mean F-measure, wF1—mean weighted F-measure, no differences detected for all of the comparisons.
Jcm 11 07454 g004
Figure 5. ROC curves of the classifiers detecting of primary outcome when DBP was included as the risk factor. DT—decision tree Jcm 11 07454 i001, KNN—k-nearest neighbor classifier Jcm 11 07454 i002, LR—logistic regression Jcm 11 07454 i003, MLP—multi-layer perceptron Jcm 11 07454 i004, NB—naive Bayesian classifier Jcm 11 07454 i005, RF—random forest Jcm 11 07454 i006.
Figure 5. ROC curves of the classifiers detecting of primary outcome when DBP was included as the risk factor. DT—decision tree Jcm 11 07454 i001, KNN—k-nearest neighbor classifier Jcm 11 07454 i002, LR—logistic regression Jcm 11 07454 i003, MLP—multi-layer perceptron Jcm 11 07454 i004, NB—naive Bayesian classifier Jcm 11 07454 i005, RF—random forest Jcm 11 07454 i006.
Jcm 11 07454 g005
Table 1. Performance of the classifiers for different outcomes, excluding and including diastolic blood pressure in the model for the whole population.
Table 1. Performance of the classifiers for different outcomes, excluding and including diastolic blood pressure in the model for the whole population.
Model Excluding DBPModel Including DBP
ML AlgorithmAccuracy (%)AUC (%)mF1 (%)wF1 (%)Accuracy (%)AUC (%)mF1 (%)wF1 (%)
PRIMARY OUTCOME
DT63.261.958.762.363.659.059.362.7
RF64.169.159.564.361.368.155.461.5
KNN64.467.461.864.164.567.462.164.3
NB65.968.662.765.465.668.362.865.2
MLP68.973.466.368.569.373.666.668.9
LR65.668.361.764.965.768.461.865.1
STROKE
DT64.357.254.762.864.355.257.165.0
RF66.976.363.270.064.372.658.266.1
KNN73.079.270.673.074.881.072.874.9
NB66.068.961.065.265.768.560.965.0
MLP75.081.772.374.875.782.273.175.5
LR66.870.060.765.766.970.060.965.9
MYOCARDIAL INFARCTION
DT65.362.860.164.465.061.159.864.1
RF65.272.161.468.363.970.861.268.7
KNN70.075.167.469.970.876.168.370.7
NB68.368.964.267.767.668.563.867.1
MLP72.677.469.272.273.678.470.473.2
LR67.368.061.866.467.368.061.866.4
HEART FAILURE
DT69.571.065.769.066.268.464.566.8
RF73.279.569.572.971.878.267.571.5
KNN74.279.771.974.274.280.472.074.2
NB70.175.066.669.769.875.166.869.6
MLP78.084.275.777.979.185.376.978.9
LR70.775.966.670.271.076.467.070.4
AUC—area under curve, DT—decision tree, KNN—k-nearest neighbor classifier, LR—logistic regression, mF1—mean F—measure, MLP—multi-layer perceptron, NB—naive Bayesian classifier, RF—random forest, wF1—mean weighted F-measure.
Table 2. Performance of the classifiers for different outcomes, excluding and including diastolic blood pressure in the model for the population with DBP < 70 mmHg.
Table 2. Performance of the classifiers for different outcomes, excluding and including diastolic blood pressure in the model for the population with DBP < 70 mmHg.
Model Excluding DBPModel Including DBP
ML AlgorithmAccuracy (%)AUC (%)mF1 (%)wF1 (%)Accuracy (%)AUC (%)mF1 (%)wF1 (%)
PRIMARY OUTCOME
DT61.163.260.461.061.161.560.060.9
RF65.070.864.165.062.069.461.463.0
KNN63.368.662.863.363.068.262.663.0
NB64.268.463.764.264.268.463.664.1
MLP68.274.567.768.268.575.767.868.4
LR64.367.963.464.264.368.063.464.2
STROKE
DT59.960.556.058.158.556.157.762.1
RF68.178.967.272.664.776.564.367.3
KNN72.180.171.575.572.680.172.272.8
NB64.369.863.066.264.269.862.864.0
MLP77.084.376.580.277.784.877.077.7
LR65.769.963.866.766.069.964.065.7
MYOCARDIAL INFARCTION
DT63.563.861.763.262.860.360.662.5
RF68.173.966.668.364.472.063.265.8
KNN66.974.166.367.068.074.967.568.2
NB65.869.664.665.765.069.363.764.8
MLP72.479.871.772.474.180.973.474.1
LR65.568.363.865.265.568.463.765.2
HEART FAILURE
DT67.668.767.567.658.556.558.159.1
RF70.879.970.870.969.478.369.369.4
KNN71.979.272.172.170.978.271.071.1
NB68.875.668.768.968.875.268.768.8
MLP76.283.676.276.377.284.277.277.2
LR71.076.670.871.071.076.670.971.0
AUC—area under curve, DT—decision tree, KNN—k-nearest neighbor classifier, LR—logistic regression, mF1—mean F—measure, MLP—multi-layer perceptron, NB—naive Bayesian classifier, RF—random forest, wF1—mean weighted F-measure.
Table 3. Questions about artificial intelligence-based prediction of cardiovascular disease.
Table 3. Questions about artificial intelligence-based prediction of cardiovascular disease.
Is AI Needed to Solve the Targeted Medical Problem?
In the Current Study ML Methods Were Used to Solve the Problem of the DBP Relevance as an Independent Risk Factor. Previous Analysis Using Classical Methods Gave Ambiguous Results.
How does the AI prediction model fit in the existing clinical workflow?
The current analysis is a secondary analysis of the clinical trial. It does not apply to any specific existing clinical workflow.
Are the data for prediction model development and testing representative for the targeted patient population and intended use?
The SPRINT population was representative for the population of patients with high cardiovascular risk requiring antihypertensive treatment.
Is the (time)point of prediction clear and aligned with the feature measurements?
The current analysis is a secondary analysis of the clinical trial.
Is the outcome variable labelling procedure reliable, replicable and independent?
It is secondary analysis of the clinical trial. We used labeling of the original SPRINT database.
Was the sample size sufficient for AI prediction model development and testing?
The sample size of the analysis was determined by the sample size of the SPRINT. The environment used for the analysis did not reveal sample size issues. A posteriori sample size calculation was not performed.
Is optimism of predictive performance of the AI prediction model avoided?
The performance of AI prediction models was tested.
Was the AI model performance evaluated beyond simple classification statistics?
The created models were evaluated using 10-fold cross-validation. Various quality criteria was computed. Overall accuracy, mF1, and wF1 measures were used to evaluate the model as a whole, AUC was used to evaluate the model when predicting the diagnosis.
Were the relevant reporting guidelines for AI prediction model studies followed?
The topics included in the TRIPOD statement were addressed in the manuscript.
Is algorithmic (un)fairness considered and appropriately addressed?
SPRINT focuses on the population of patients with high cardiovascular risk requiring antihypertensive treatment and the results were biased towards this group of patients.
Is the developed AI prediction model open for use, further testing,
critical appraisal and updating, and use in daily practice?
Standard data mining platform RapidMiner was used so it is easy to perform retest.
Are the presented relations between individual features and the outcome
not overinterpreted?
Conclusions were drawn with caution.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Siński, M.; Berka, P.; Lewandowski, J.; Sobieraj, P.; Piechocki, K.; Paleczny, B.; Siennicka, A. Answering Clinical Questions Using Machine Learning: Should We Look at Diastolic Blood Pressure When Tailoring Blood Pressure Control? J. Clin. Med. 2022, 11, 7454. https://doi.org/10.3390/jcm11247454

AMA Style

Siński M, Berka P, Lewandowski J, Sobieraj P, Piechocki K, Paleczny B, Siennicka A. Answering Clinical Questions Using Machine Learning: Should We Look at Diastolic Blood Pressure When Tailoring Blood Pressure Control? Journal of Clinical Medicine. 2022; 11(24):7454. https://doi.org/10.3390/jcm11247454

Chicago/Turabian Style

Siński, Maciej, Petr Berka, Jacek Lewandowski, Piotr Sobieraj, Kacper Piechocki, Bartłomiej Paleczny, and Agnieszka Siennicka. 2022. "Answering Clinical Questions Using Machine Learning: Should We Look at Diastolic Blood Pressure When Tailoring Blood Pressure Control?" Journal of Clinical Medicine 11, no. 24: 7454. https://doi.org/10.3390/jcm11247454

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop