Next Article in Journal
Endoscopic Management of Post-Bariatric Surgery Complications: Diagnostic Work-Up and Innovative Approaches for Leak, Fistula, and Stricture Management
Previous Article in Journal
Diagnostic Limitations, Patient Characteristics, and Confounding Factors Impacting Neurotologic Lesion Imaging: A Systematic Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Reproducible Post-Valve-Replacement EHR Cohort for Comparative AI Studies

Innovation Center Computer Assisted Surgery (ICCAS), Leipzig University, Semmelweisstrasse 14, 04103 Leipzig, Germany
*
Author to whom correspondence should be addressed.
Diagnostics 2026, 16(3), 447; https://doi.org/10.3390/diagnostics16030447 (registering DOI)
Submission received: 13 November 2025 / Revised: 23 January 2026 / Accepted: 28 January 2026 / Published: 1 February 2026
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

Background/Objectives: Valve replacement (VR) patients are at high risk of postoperative complications, but reproducible Electronic Health Record (EHR) benchmarks for evaluating sequential AI models in this setting are lacking. We develop a reproducible pipeline that extracts two EHR datasets from MIMIC-IV (a general-purpose and a predictive benchmark dataset) capturing perioperative histories, high-resolution time-series, and clinically motivated outcome labels. Methods: The cohort comprises 3890 VR patients with clinician-guided feature selection across diagnoses, procedures, laboratory measurements, medications, and physiological monitoring. As an exemplary use case, we define ICU readmission at first ICU discharge as a surrogate for postoperative risk and derive a predictive benchmark under strict label-leakage control. We then compare a Transformer model trained on tokenized longitudinal EHR sequences with Transformer and XGBoost baselines trained on aggregated feature statistics, and assess performance differences using paired statistical tests across validation splits. Results: ICU readmission stratified in-hospital and 100-day outcomes, including mortality, complications, and rehospitalization, confirming the clinical relevance of the prediction target. The sequential Transformer achieved 0.87 AUROC and 0.69 AUPRC. Corrected resampled t-tests confirm improved performance over the non-sequential Transformer, while the comparison with XGBoost indicates a favorable trend without conclusive evidence. Conclusions: Our findings suggest that leveraging longitudinal EHR sequences yields higher predictive performance than static feature summaries for postoperative risk prediction. The publicly released preprocessing pipeline and cohort-construction code enable researchers with MIMIC-IV access to reproduce the datasets and provide a robust benchmark for developing and comparing time-series models in post-valve replacement care.
Keywords: data preprocessing; machine learning; electronic health records (EHR); personalized medicine; perioperative care data preprocessing; machine learning; electronic health records (EHR); personalized medicine; perioperative care

Share and Cite

MDPI and ACS Style

Blattmann, M.; Katalinic, M.; Lindenmeyer, A.; Franke, S.; Neumuth, T.; Schneider, D. A Reproducible Post-Valve-Replacement EHR Cohort for Comparative AI Studies. Diagnostics 2026, 16, 447. https://doi.org/10.3390/diagnostics16030447

AMA Style

Blattmann M, Katalinic M, Lindenmeyer A, Franke S, Neumuth T, Schneider D. A Reproducible Post-Valve-Replacement EHR Cohort for Comparative AI Studies. Diagnostics. 2026; 16(3):447. https://doi.org/10.3390/diagnostics16030447

Chicago/Turabian Style

Blattmann, Malte, Mika Katalinic, Adrian Lindenmeyer, Stefan Franke, Thomas Neumuth, and Daniel Schneider. 2026. "A Reproducible Post-Valve-Replacement EHR Cohort for Comparative AI Studies" Diagnostics 16, no. 3: 447. https://doi.org/10.3390/diagnostics16030447

APA Style

Blattmann, M., Katalinic, M., Lindenmeyer, A., Franke, S., Neumuth, T., & Schneider, D. (2026). A Reproducible Post-Valve-Replacement EHR Cohort for Comparative AI Studies. Diagnostics, 16(3), 447. https://doi.org/10.3390/diagnostics16030447

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop