Submit to this Journal Review for this Journal Propose a Special Issue

Article Menu

Share Help Cite Discuss in SciProfiles

Open AccessArticle

Peer-Review Record

Evaluation of Machine Learning and Traditional Statistical Models to Assess the Value of Stroke Genetic Liability for Prediction of Risk of Stroke Within the UK Biobank

Healthcare 2025, 13(9), 1003; https://doi.org/10.3390/healthcare13091003

by Gideon MacCarthy¹ and Raha Pazoki^1,2,*

Reviewer 1:

Nagaraj Naik

Reviewer 2: Anonymous

Reviewer 3: Anonymous

Healthcare 2025, 13(9), 1003; https://doi.org/10.3390/healthcare13091003

Submission received: 12 February 2025 / Revised: 18 April 2025 / Accepted: 19 April 2025 / Published: 26 April 2025

(This article belongs to the Special Issue Advances in Prediction, Prevention and Precision Medicine for Myocardial Infarction and Stroke)

Round 1

Reviewer 1 Report

Comments and Suggestions for Authors

Create a workflow diagram illustrating the inputs and output for machine learning models.

Justify why machine learning outperformed statistical analysis.

Explain the selection of the following models: Cox Proportional Hazards (CoxPH), Gradient Boosting Model (GBM), Decision Tree (DT), and Random Forest (RF). Clarify why more advanced machine learning models were not used

Provide a detailed conclusion summarizing key findings and their implications.

Avoid using the word "our" in the manuscript, specifically in line 486.

Incorporate previous studies’ results in the table with proper citations to strengthen the discussion.

In table 4, why Random Forest (RF) performed worse than Decision Tree (DT) in this case (RF = 65, DT = 67), despite RF typically outperforming DT.

In Table 4, justify why some p-values are marked as REF or NaN and explain their significance.

Figure 3: Add missing subcaptions and provide a clear description of panels A, B, and C.

Improve readability in Section 4.

Author Response

Please see the attached. Thank you.

Author Response File: Author Response.pdf

Reviewer 2 Report

Comments and Suggestions for Authors

This study investigates the potential added predictive value of incorporating genome-wide stroke genetic liability into traditional and machine learning models for stroke risk prediction. Following major comments:

Was it appropriate to exclude persons with non-European heritage, and would this have limited how broadly the results could be applied?
Did preliminary tests or literature references sufficiently support the selection of parameters for machine learning models (e.g., number of trees in RF, learning rate in GBM)? The disease categorization job has a number of papers, including doi: 10.1371/journal.pone.0268555.
Do the authors explain why, in contrast to the Cox model, introducing genetic liability has no effect on machine learning models?

Comments on the Quality of English Language

The English could be improved to more clearly express the research.

Author Response

Please see the attachment. Thank you.

Author Response File: Author Response.pdf

Reviewer 3 Report

Comments and Suggestions for Authors

Report is attached

Comments for author File: Comments.pdf

Comments on the Quality of English Language

Report is attached

Author Response

Please see the attachment. Thank you.

Author Response File: Author Response.pdf

Round 2

Reviewer 1 Report

Comments and Suggestions for Authors

I am pleased to inform you that the authors have incorporated the suggested revisions. Accept in present form

Comments on the Quality of English Language

Please check the grammer.

Author Response

Thank you for pointing this out. We have accordingly revised the manuscript.

Article Menu

Evaluation of Machine Learning and Traditional Statistical Models to Assess the Value of Stroke Genetic Liability for Prediction of Risk of Stroke Within the UK Biobank

Further Information

Guidelines

MDPI Initiatives

Follow MDPI