1. Introduction
Metabolic syndrome—characterized by central obesity, atherogenic dyslipidemia, hypertension, and impaired fasting glucose—has spread as a global epidemic, with recent estimates suggesting that more than 1.5 billion adults now meet diagnostic criteria [
1]. In 2025, the European Atherosclerosis Society moved beyond viewing it as a risk factor cluster and reframed it as a staged systemic metabolic disorder, with heart failure and advanced coronary artery disease as the most severe downstream consequences [
2]. The link with acute coronary disease is well established; between 37% and over 60% of patients presenting with acute coronary syndromes meet criteria for the syndrome [
3,
4], a co-prevalence consistent with the role of insulin resistance, endothelial dysfunction, and a prothrombotic milieu in driving plaque rupture [
5].
However, the long-term prognostic role of metabolic syndrome after STEMI remains less clear. A recent meta-analysis of eleven studies confirmed an association with major adverse cardiovascular events (MACE) but found the evidence for long-term mortality inconsistent [
6], and individual cohorts have produced divergent results [
7,
8]. Two methodological problems likely contribute to this picture. First, follow-up has typically been short, as most published studies report one- to three-year outcomes, with only a handful extending beyond five years; yet Dohi and colleagues showed that mortality curves for MetS and non-MetS patients begin to separate only after five years, suggesting that shorter studies systematically underestimate the long-term burden [
9]. Second, the obesity paradox—the consistent observation that obese STEMI patients have lower long-term mortality than their normal weight counterparts [
10]—is rarely accounted for in MetS analyses, despite the fact that most MetS patients in Western cohorts are overweight or obese.
Heart failure has received the least attention among post-STEMI endpoints in this literature, even though it has become increasingly relevant as primary PCI has improved early survival and patients live long enough for chronic complications to emerge. Metabolic syndrome is biologically plausible as a driver of post-infarction heart failure, as insulin resistance impairs myocardial glucose uptake, forcing a shift to fatty acid oxidation with mitochondrial dysfunction, and AGE–RAGE-mediated signaling promotes interstitial fibrosis [
11,
12]. Superimposed on the ischemic remodeling that follows STEMI, these mechanisms may produce a synergistic trajectory toward heart failure that has not been examined over a ten-year horizon.
We previously followed a prospective cohort of 531 STEMI patients treated with primary PCI for four years and reported that metabolic syndrome was independently associated with MACE (HR 1.83) but not with mortality [
13]. The present study aimed to extend that follow-up to ten years in 506 patients with complete outcome data, using hospitalization for heart failure as the new primary endpoint. Three additional questions are addressed: whether MetS independently predicts ten-year all-cause mortality and MACE; whether obesity modifies the MetS–mortality relationship in the directions suggested by the obesity paradox literature; and whether the binary MetS diagnosis or the cumulative number of fulfilled criteria is the more informative individual-level prognostic signal, evaluated using an explainable machine learning approach (XGBoost with SHAP attribution and a Random Survival Forest sensitivity analysis).
2. Materials and Methods
2.1. Study Population
Between December 2009 and June 2010, 531 consecutive patients with ST-segment elevation myocardial infarction (STEMI) underwent primary percutaneous coronary intervention (pPCI) at the University Clinical Centre of Serbia in Belgrade and were enrolled in this prospective cohort. The four-year outcomes of this cohort have been reported previously [
13]; the present analysis extends the follow-up to ten years.
Patients were eligible if they presented within 12 h of chest pain onset, had ST-segment elevation in at least two contiguous leads, and underwent successful pPCI of the infarct-related artery (TIMI 3 flow at the end of the procedure). All patients received aspirin, clopidogrel, and unfractionated heparin before the procedure and were managed according to the 2008 ESC STEMI guidelines [
14]. Procedures were performed via the femoral route using a standardized technique, and a coronary stent was implanted in every patient. Patients were excluded if they had received thrombolysis prior to admission, had a TIMI flow < 3 at the end of the procedure, were younger than 18 years, and were unable to provide informed consent.
Of the 531 enrolled patients, 24 were lost to follow-up during the ten-year observation period (alive but unreachable by telephone or at their registered home addresses), and one additional patient was excluded because follow-up duration could not be determined. The final analytical sample comprised 506 patients (
Figure S1, Supplementary Material, CONSORT flow diagram).
The protocol was approved by the Institutional Review Board of the Faculty of Medicine at the University of Belgrade (no. 29/XI-20, November 2013) and was conducted in accordance with the Declaration of Helsinki. All patients provided written informed consent after clinical stabilization.
2.2. Data Collection
Baseline data were collected prospectively by trained research staff who were blinded to the study aims and outcome status. Demographics, anthropometric measurements (height, weight, waist circumference, body mass index), cardiovascular risk factors, prior cardiovascular history, comorbidities, and clinical presentation data (Killip class, symptom-to-balloon time, infarct territory, culprit artery) were recorded using standardized instruments.
Serum creatine kinase (CK) and CK-MB were serially measured during the first 48 h after symptom onset, and peak values were recorded. Fasting glucose, HDL cholesterol, and triglycerides were measured from a fasting blood draw obtained on the morning of the final inpatient day. Left ventricular ejection fraction (LVEF) was determined by transthoracic echocardiography during the index admission, in accordance with ACC/AHA/ASE guidelines [
15]. The SYNTAX score was calculated from the diagnostic coronary angiogram by two interventional cardiologists blinded to clinical data, with disagreements resolved by consensus.
2.3. Definition of Metabolic Syndrome
Metabolic syndrome (MetS) was defined by the AHA/NHLBI criteria [
16], which require at least three of the following five components: abdominal obesity (waist circumference ≥ 102 cm in men, ≥88 cm in women); elevated triglycerides (≥1.7 mmol/L or lipid-lowering therapy); low HDL cholesterol (<1.03 mmol/L in men, <1.29 mmol/L in women, or therapy); arterial hypertension (≥130/85 mmHg or antihypertensive therapy); and elevated fasting glucose (≥5.6 mmol/L or antidiabetic therapy).
MetS status was assessed only at the index hospitalization, using fasting laboratory values from the final inpatient blood draw. Patients were classified as MetS-positive (n = 216, 42.7%) or MetS-negative (n = 290, 57.3%). The number of criteria met (range 0–5) was retained as a continuous variable for the dose–response and machine learning analyses.
2.4. Follow-Up and Outcome Definitions
Follow-up visits were scheduled at 1, 6, 12, 24, 36, 48, 60, 96, and 120 months after the index event. Each visit included a clinical examination, electrocardiography, and a review of current medications. Patients who missed scheduled visits were contacted by telephone, either directly or through a family member or general practitioner. For patients who died during follow-up, hospital records and necropsy data were reviewed.
The three primary endpoints were all-cause mortality, MACE, and hospitalization for heart failure. MACE was defined as the composite of cardiovascular death, non-fatal myocardial infarction, new revascularization (PCI or CABG), and stroke or transient ischemic attack; the time of the earliest component event was used in time-to-event analyses. Hospitalization for heart failure was defined as the first post-discharge admission with a primary diagnosis of acute decompensated heart failure, supported by clinical signs and symptoms, elevated natriuretic peptides, and/or radiographic evidence of pulmonary congestion. Heart failure events that occurred during the index STEMI hospitalization were not counted as follow-up events; only the first post-discharge episode was included in the time-to-event analysis. Secondary endpoints were the individual components of MACE, analyzed separately, and target vessel revascularization. New-onset diabetes during follow-up—recorded as an exploratory outcome—was defined as a new clinical diagnosis of type 2 diabetes documented at any follow-up visit in a patient who had no diabetes at the index event.
Follow-up time was calculated in months from the index pPCI procedure. For patients without an event, follow-up was administratively censored at 120 months.
2.5. Statistical Analysis
Continuous variables were expressed as mean ± SD, and categorical variables as counts (percentages). Between-group comparisons used Student’s t-test for continuous variables and the χ2 test (or Fisher’s exact test when expected cell counts were below 5) for categorical variables. Within-group changes in medication use between the 1-month and 60-month follow-ups were assessed with McNemar’s test. Statistical significance was defined as two-sided p < 0.05. The number of missing values is reported separately for each variable; missing values were not imputed for descriptive comparisons, and the multivariable Cox models used complete case analysis. Descriptive analyses and Cox regression were performed in IBM SPSS Statistics version 20 (IBM, Armonk, NY, USA).
Cumulative event rates were estimated using the Kaplan–Meier method (1 − S(t)) and compared with the log-rank test. Number-at-risk tables are shown beneath each survival curve. Independent predictors of each primary endpoint were identified using multivariable Cox proportional-hazards regression, adjusted for sex, age, LVEF, previous myocardial infarction, Killip class ≥ 2, MetS status, and multivessel coronary disease—variables selected a priori on the basis of clinical relevance and the prior literature. The proportional hazards assumption was assessed graphically using log-minus-log plots and statistically using Schoenfeld residuals. Hazard ratios are reported with 95% confidence intervals. The SYNTAX score was not included in the primary models to avoid collinearity with multivessel disease but was examined separately by Spearman correlation and SYNTAX tertile survival analysis.
Two additional sensitivity analyses were performed for the heart failure endpoint to account for the competing risk of death. The cumulative incidence function was estimated using the Aalen–Johansen method, and a Fine–Gray subdistribution hazard Cox model was fitted with inverse probability of censoring weights, following Geskus [
17]. Bootstrap 95% confidence intervals (1000 resamples) are reported for the cumulative incidence at ten years.
The dose–response relationship between the number of MetS criteria met and each outcome was assessed using Spearman’s rank correlation, both for the full cohort and the subset of MetS-positive patients (MetS criteria range: 3–5). For the obesity paradox analysis, obesity was defined using the same waist circumference thresholds as those used in the MetS definition, and the interaction between MetS and obesity was examined using a stratified Kaplan–Meier analysis. New-onset diabetes was compared across MetS groups using a χ2 test.
2.6. Machine Learning Analysis
To complement the Cox regression and provide patient-level interpretation, four machine learning classifiers were trained to predict each ten-year binary outcome using a feature set of 16 clinical variables available at or shortly after the index hospitalization: logistic regression (with L2 regularization, C = 0.1), random forest (300 trees, max depth 5), gradient boosting (200 estimators, max depth 3, learning rate 0.05) and an XGBoost equivalent gradient boosting model (500 estimators, max depth 4, learning rate 0.03, row and column subsampling 0.8). Models were evaluated using 10-fold stratified cross-validation. Missing feature values were imputed with the median within each training fold to prevent data leakage. Discrimination was assessed using AUC-ROC, and calibration was assessed using the Brier score and graphical calibration curves; observed event rates were also compared across quartiles of predicted probability. All ML analyses were implemented using scikit-learn 1.8, NumPy, pandas, and SciPy.
Feature importance was quantified in two complementary ways. For the gradient boosting models, the mean decrease in impurity (Gini importance) was averaged across the random forest and gradient boosting models. For the XGBoost equivalent model, SHAP (SHapley Additive exPlanations) values were computed using the TreeSHAP algorithm from the ‘shap’ Python library (version 0.46) on a stratified random subsample of 200 patients (random seed 42). To assess the robustness of feature rankings, 95% confidence intervals for each feature’s mean absolute SHAP value were derived from 1000 bootstrap resamples.
To address the binary outcome simplification in the SHAP analysis, survival-aware sensitivity analysis was performed using a Random Survival Forest [
18], with 200 trees, a minimum of 10 samples per leaf, and √
p random feature sampling at each split. Discrimination was assessed using Harrell’s concordance index in 10-fold cross-validation, and feature importance was quantified by permutation; each feature was permuted in turn, and the resulting drop in the c-index was recorded.
2.7. Reproducibility and Data Availability
All random number generators used in the analysis were seeded with random_state = 42. The analytical dataset contains direct patient identifiers and cannot be shared publicly under institutional policy and the Serbian Law on Personal Data Protection (Official Gazette RS, No. 87/2018). De-identified summary statistics and the complete analysis code are available from the corresponding author upon reasonable request.
4. Discussion
We followed 506 STEMI patients treated with primary PCI for ten years, stratified by AHA/NHLBI metabolic syndrome status. The principal findings are summarized as follows. Metabolic syndrome was not an independent predictor of all-cause or cardiovascular mortality at ten years, despite numerically higher rates in the MetS-positive group. However, it was an independent predictor of MACE (HR 1.473, p = 0.028), driven primarily by recurrent myocardial infarction and target-vessel revascularization. Most importantly, MetS was a strong independent predictor of hospitalization for heart failure (cause-specific HR 2.86, Fine–Gray subdistribution HR 2.61, both p < 0.005), with reduced LVEF as the only co-significant predictor. The excess mortality risk associated with MetS was selectively attenuated by obesity, providing empirical support for an obesity paradox specific to the MetS phenotype in STEMI. Two methodologically independent machine learning attribution frameworks—TreeSHAP for a binary classification XGBoost model and permutation importance for a survival aware Random Survival Forest—converged on the cumulative number of MetS criteria, rather than the binary diagnosis, as the leading individual-level contributor to heart failure risk in this cohort.
The independent association between MetS and ten-year MACE (HR 1.473) extends our four-year analysis from the same cohort, which yielded a stronger HR of 1.834 [
13]. The attenuation over time is biologically coherent, as MetS predominantly accelerates early atherosclerotic progression and inflammatory restenosis, mechanisms that are most active in the first five years after revascularization, while age and left ventricular function increasingly dominate the prognosis later. The mechanism is supported by direct anatomical evidence—MetS-positive patients had a significantly higher SYNTAX score (19.10 ± 6.76 vs. 17.34 ± 6.83,
p = 0.006), with a strong positive Spearman correlation between SYNTAX and ten-year MACE (ρ = 0.337,
p < 0.001). This is consistent with the well-established role of insulin-resistance-driven inflammation, NLRP3 inflammasome activation, and a prothrombotic state in promoting plaque vulnerability and in-stent restenosis [
5]. The convergence of two ML frameworks reinforces this conclusion: TreeSHAP ranked age and SYNTAX score as the dominant patient-level features for MACE (mean |SHAP| 0.36 and 0.34), and the Random Survival Forest reached the same conclusion via permutation importance. For ischemic recurrence, anatomical complexity of established coronary disease—which MetS helps create—is the proximate determinant; MetS itself contributes to MACE primarily by promoting more severe baseline anatomy.
The failure of MetS to independently predict mortality at ten years—despite numerically higher rates in the MetS-positive group (29.6% vs. 22.7%)—is among the most contentious findings in the literature, and our result aligns with the majority of published cohorts. Won et al. found no significant effect on long-term survival in 1100 STEMI patients with drug-eluting stents [
7], while Geraiely et al., in a propensity score-matched analysis of 2651 pPCI patients, found no association between MetS and one-year mortality [
8]. Dohi et al. observed mortality curves diverging only after five years [
9], a pattern which was also visible in our data, though not reaching statistical significance, raising the possibility that even longer follow-up could ultimately reveal a mortality signal.
Several non-mutually exclusive hypotheses may help explain this null mortality finding. Medication confounding is plausible: the MetS-negative group experienced a sharper decline in statin use from 84.5% at month 1 to 74.1% at month 60 (McNemar
p < 0.001) than the MetS-positive group. The FAST-STEMI registry showed that statin discontinuation within six months after STEMI was independently associated with more than a twofold increase in cardiovascular and all-cause mortality [
19], so suboptimal long-term adherence in the MetS-negative group may have selectively eroded their survival advantage. The independent predictors of mortality in our Cox model—age, LVEF, previous MI, and Killip class—are acute hemodynamic and structural parameters that dominate prognosis irrespective of metabolic status; this is consistent with the CAMI registry XGBoost/SHAP analysis [
20] and with our own SHAP results, in which age (mean |SHAP| 0.62) contributed more than any other feature to mortality. We emphasize that these are interpretive hypotheses consistent with our observations rather than causal conclusions, which an observational design cannot establish.
The obesity paradox offers a second, complementary explanation. Among non-obese patients, MetS-positive status was associated with significantly higher ten-year mortality (42.9% vs. 21.1%,
p = 0.008), whereas among obese patients the difference disappeared (26.5% vs. 23.2%,
p = 0.529). The 21.8 percentage point excess mortality among non-obese MetS-positive patients fell to 3.3 points in the obese stratum. The 2023 meta-analysis by Wang et al. confirmed lower in-hospital and long-term mortality among obese STEMI patients [
21], and Şaylık et al. reached the same conclusion in a comprehensive meta-analysis of ACS [
22]. Our data add a previously unreported dimension: the obesity paradox in STEMI is not uniform but is specifically operative among patients with MetS. The lean MetS phenotype—characterized by visceral adiposity, severe insulin resistance, and unfavorable adipokine profiles despite an unremarkable BMI—may carry a disproportionate share of the prognostic burden of metabolic dysregulation in this stratum. The likely biological mediator is adiponectin, whose levels are relatively preserved in obese MetS patients, which exerts direct cardioprotective and anti-inflammatory effects [
23]. Crucially, the paradox was selective, as MetS retained meaningful excess risk for both MACE (excess +16.4% non-obese vs. +12.2% obese) and heart failure (+13.7% vs. +11.6%) in both obesity strata, so the survival benefit of obesity attenuates the inflammatory pathways driving early mortality without reducing the anatomical or metabolic cardiomyopathy pathways.
The strongest finding of this study is that MetS nearly triples the cause-specific hazard of heart failure hospitalization (HR 2.86, 95% CI 1.57–5.22, p < 0.001). This conclusion is robust in competing risk treatment: the Fine–Gray subdistribution hazard model yielded an essentially identical HR of 2.61 (95% CI 1.44–4.75, p = 0.002). The Aalen–Johansen cumulative incidence at ten years was 16.7% in MetS-positive and 5.9% in MetS-negative patients, with the naive Kaplan–Meier estimator overestimating absolute incidence by 2.9 and 0.8 percentage points, respectively. The strength and persistence of this association—in patients with virtually identical admission LVEF (49.6% vs. 49.7%, p = 0.974)—implicates pathophysiological mechanisms beyond the acute infarction and points to the metabolic cardiomyopathy that characterizes the MetS phenotype.
The pathophysiology is well characterized. The primary mediator is insulin resistance, which impairs myocardial glucose uptake and forces cardiomyocytes to shift to fatty acid oxidation—a less metabolically efficient substrate that increases oxygen consumption, generates reactive oxygen species, and progressively impairs mitochondrial function [
24]. Using
18F-FDG PET-MR in 821 asymptomatic individuals from the PESA cohort, Succurro et al. directly demonstrated that MetS traits are associated with a significantly lower insulin-stimulated myocardial glucose metabolic rate—the earliest detectable stage of metabolic cardiomyopathy [
25]. At the molecular level, MetS induces NLRP3 inflammasome activation, NF-κB signaling, and AGE–RAGE-mediated myocardial fibrosis [
5,
25], collectively promoting interstitial fibrosis, cardiomyocyte hypertrophy, microvascular dysfunction, and the clinical phenotype of HFpEF. The significantly higher rate of new-onset diabetes during follow-up in MetS-positive patients (7.9% vs. 2.8%,
p = 0.016) likely contributes to the late-divergence pattern of heart failure incidence visible after month 72 in our Kaplan–Meier curves.
Three methodologically independent attribution frameworks identified the cumulative number of MetS criteria—rather than the binary diagnosis—as one of the top three individual-level drivers of heart failure risk. In the TreeSHAP analysis of the XGBoost binary classifier, the number of MetS criteria ranked second only to LVEF (mean |SHAP| 0.32 vs. 0.48; bootstrap 95% CIs cleanly separated from the rest of the feature set) and was approximately fivefold higher than the contribution of the binary MetS diagnosis (0.07). To address the binary outcome simplification of the SHAP analysis, a survival-aware Random Survival Forest with permutation importance was used: in this analysis, the number of MetS criteria emerged as the top-ranked feature for heart failure (Δc-index = +0.035), surpassing LVEF, age, and SYNTAX score, with the RSF achieving the highest cross-validated discrimination of any ML model (Harrell’s c-index 0.843). The averaged Gini importance across the random forest and gradient boosting classifiers ranked the cumulative criteria count third for heart failure (Gini 0.118, 95% CI 0.092–0.148), behind LVEF and peak CK only, and approximately fourfold higher than the binary MetS classification.
The convergence across three distinct frameworks—together with the consistency with the Cox HR of 2.86—is compatible with a continuous burden interpretation of metabolic risk rather than a binary-threshold interpretation of heart failure risk after STEMI; the binary MetS diagnosis ranked outside the top five features in all three frameworks. To our knowledge, this is the first patient-level, model-agnostic demonstration of the continuous nature of the MetS–HF dose–response relationship in a long-term STEMI cohort. We emphasize, however, that MetS status was assessed only at admission, so our findings reflect the prognostic value of the baseline metabolic profile rather than evidence that prospective modification of individual MetS components reduces heart failure incidence—a question that requires longitudinal MetS exposure data and dedicated interventional trials.
The clinical implications of this study are actionable but constrained by the limits of an observational design. The identification of MetS as an independent predictor of both MACE and heart failure over ten years supports systematic metabolic assessment of all STEMI survivors using AHA/NHLBI criteria before discharge. The obesity paradox has a specific implication for risk stratification, where algorithms based on the binary MetS classification will overestimate mortality risk in obese MetS-positive patients and, more critically, underestimate it in lean MetS-positive patients, who accounted for 42.9% of ten-year mortality in our cohort. The pattern we observed in the non-obese stratum is consistent with the lean cardiometabolic phenotype described in the obesity paradox literature; lean MetS-positive patients may warrant closer follow-up in dedicated studies, although our subgroup numbers were small and this observation should be regarded as exploratory. Convergent ML evidence that the number of metabolic criteria, rather than the binary diagnosis, drives the heart failure signal is consistent with the hypothesis that patients with more concurrent metabolic abnormalities at baseline carry incrementally higher long-term heart failure risk; whether prospective modification of individual MetS components reduces this risk cannot be answered by our data and warrants dedicated interventional studies. The present cohort predates the routine secondary-prevention use of SGLT2 inhibitors and GLP-1 receptor agonists; whether these therapies, which have since acquired class 1A indications for HFpEF in the 2023 ESC Heart Failure guidelines [
26], modify ten-year heart failure incidence in STEMI survivors with high baseline metabolic burden remains a hypothesis for dedicated randomized trials.
This study has several limitations. It is a single-center, prospective, observational cohort study at a Serbian tertiary academic institution, limiting generalizability to other populations and healthcare systems; bare-metal stents predominated, reflecting the 2009–2010 standard of care, and contemporary drug-eluting stents would likely attenuate the revascularization component of MACE. MetS was defined only at admission—the acute phase of STEMI can transiently elevate glucose, triglycerides, and blood pressure, potentially overclassifying some patients—and longitudinal MetS exposure could not be quantified; consequently, our findings on the cumulative number of MetS criteria as a predictor of heart failure should be interpreted as reflecting the prognostic value of the baseline metabolic profile, not as evidence that modifying individual components during follow-up reduces incidence. Medication adherence data were available only at months 1 and 60, so continuous adherence and the time-dependent introduction of novel agents (P2Y12 inhibitors, SGLT2 inhibitors, GLP-1 receptor agonists) represent residual confounding inherent to a long-term observational design. Several potentially relevant predictors were not captured (serial LVEF, NT-proBNP, diastolic function indices, adipokines), and the subgroup meeting all five MetS criteria (n = 19) was too small to support firm conclusions about the extreme end of metabolic burden. With respect to the survival analyses, the Fine–Gray sensitivity analysis confirmed that competing-risk effects modestly inflate naive heart failure incidence estimates without altering the principal conclusion. The machine learning analysis was performed on a single cohort with stratified 10-fold internal cross-validation but without external geographic validation; the models should therefore be interpreted as tools for feature attribution and hypothesis generation rather than deployable clinical risk calculators. Finally, no formal multiplicity correction was applied across the family of secondary endpoints, which may inflate the type I error for those comparisons; the three primary endpoints (mortality, MACE, heart failure) were pre-specified and should be interpreted with the highest weight.