1. Introduction
Acute pyelonephritis remains one of the most common infectious admissions to internal medicine and infectious-disease wards across Europe, accounting for a substantial fraction of unscheduled antimicrobial prescribing and hospital occupancy [
1,
2]. Although the syndrome is broadly recognizable, its clinical heterogeneity—from young women with uncomplicated infection to elderly polymorbid patients with urological obstruction—translates into widely varying microbiology, therapeutic adequacy, and outcomes [
2]. In recent years, the increasing prevalence of multidrug-resistant (MDR) uropathogens has further complicated empirical decision-making, especially in centers serving older populations with frequent healthcare contact [
1,
3].
Romania consistently reports some of the highest rates of antimicrobial resistance in the European Union. Surveillance data from the European Centre for Disease Prevention and Control (ECDC) place Romania among the leading countries for resistance among
Enterobacterales, including extended-spectrum β-lactamase (ESBL) production and carbapenem non-susceptibility [
3]. These patterns are most pronounced in tertiary hospitals concentrated in larger urban areas such as Timișoara, where referral pathways gather complex cases, surgical complications, and patients with recurrent antibiotic exposure. The interface between community-onset and healthcare-associated pyelonephritis is, therefore, particularly fluid in this setting and warrants explicit examination [
3,
4,
5].
Several patient-level risk factors for MDR uropathogens have been described, including prior antibiotic use, recent hospitalization, indwelling urinary devices, nephrolithiasis/urolithiasis, diabetes mellitus, renal impairment, immunosuppression, and prior urinary tract instrumentation [
4,
5]. These variables are clinically important because they capture different mechanisms of resistance risk: antibiotic selection pressure, healthcare-associated colonization, impaired host defense, urinary stasis or obstruction, and procedure-related introduction of resistant flora. However, most published predictive models have been developed in Western European or North American populations, where baseline resistance rates differ substantially from those observed in South-Eastern Europe [
4,
6,
7,
8,
9]. The transferability of such models to Romanian clinical practice has not been systematically evaluated, and locally calibrated tools are scarce. This gap is particularly relevant for inpatient pyelonephritis, where the empirical regimen chosen at admission is often the single strongest determinant of subsequent clinical trajectory [
5,
10,
11].
Beyond the simple presence of MDR organisms, growing attention is being paid to the architecture of resistance—the way in which different resistance phenotypes co-occur within a single isolate [
6]. Co-resistance patterns shape both the probability that a given empirical regimen will be active and the selection pressure exerted by re-treatment failures. Analyses based on Jaccard similarity and pairwise odds ratios have begun to map these clusters in
Enterobacterales globally, identifying particularly tight links between ESBL production, third-generation cephalosporin resistance, and fluoroquinolone resistance [
6,
7]. Reproducing such analyses in local cohorts can directly inform empirical-therapy escalation rules and stewardship feedback to prescribers [
7,
10].
In addition to microbiological characterization, clinical decision support increasingly relies on structured risk scores that integrate readily available bedside information [
8,
9]. Although several scores exist for community-acquired UTI and complicated UTI in the outpatient setting, few have been specifically derived for hospitalized acute pyelonephritis with MDR as the primary endpoint [
9,
11]. A pragmatic, transparent, points-based score with internal validation could complement local antibiograms by helping clinicians stratify patients at admission and prioritize broad-spectrum empirical coverage for those most likely to harbor MDR pathogens, while sparing low-risk patients from unnecessary broad-spectrum exposure [
8,
10,
11,
12,
13,
14].
Against this background, the present study had three a priori objectives. First, we aimed to describe the MDR landscape and pathogen distribution among adults hospitalized with acute pyelonephritis at a tertiary university hospital in Western Romania between March 2022 and March 2025 [
3,
15]. Second, we sought to identify independent clinical predictors of MDR using multivariable logistic regression, with particular attention to modifiable exposures, such as prior antibiotic use, hospitalization, and urinary catheterization/instrumentation, as well as immunosuppression and nephrolithiasis/urolithiasis [
4,
5]. Third, we derived and internally validated a bedside risk score—PYELO-MDR-Risk—and explored its calibration, decision-relevant cut-points, and behavior across clinically meaningful subgroups, framing the analysis within the broader context of antimicrobial stewardship in Eastern European pyelonephritis care [
9,
10].
3. Results
During the 36-month enrolment window, 147 adult admissions met the initial syndromic criteria for acute pyelonephritis. Eighteen were excluded after detailed chart review (
n = 9 polymicrobial cultures with >2 organisms,
n = 5 perinephric abscesses requiring percutaneous drainage,
n = 3 ultimately reclassified as lower urinary tract infection without parenchymal involvement, and
n = 1 incomplete follow-up), leaving 129 patients in the final analytic cohort. Of these, 54 (41.9%) had MDR pyelonephritis and 75 (58.1%) had non-MDR pyelonephritis. The mean age of the cohort was 59.6 ± 16.4 years, and 94 (72.9%) patients were female. Overall median hospital length of stay was 11 days (IQR 8–14), and in-hospital mortality was 5/129 (3.9%) (
Table 1).
Patients in the MDR group were, on average, about 7 years older than those with non-MDR pyelonephritis (63.4 ± 14.7 vs. 56.8 ± 17.3 years;
p = 0.022) and carried a significantly higher comorbidity burden, with median Charlson scores of 4 versus 2 (
p = 0.003). Diabetes mellitus and renal impairment/renal-failure status at admission were both significantly more common in the MDR group (38.9% vs. 22.7%,
p = 0.046; and 25.9% vs. 12.0%,
p = 0.043, respectively), in keeping with the enrichment of resistant pathogens in patients with metabolic and nephrological vulnerability. Immunosuppression was numerically more frequent among MDR cases (14.8% vs. 6.7%;
p = 0.129), and prior urinary tract instrumentation within 90 days was significantly enriched in the MDR group (22.2% vs. 9.3%;
p = 0.042). Nephrolithiasis/urolithiasis was also more frequent among MDR cases but did not reach statistical significance (20.4% vs. 12.0%;
p = 0.193). The strongest single discriminators between groups were exposure variables: prior antibiotic use within 90 days was reported in 61.1% of MDR cases, compared with 18.7% of non-MDR cases (
p < 0.001), and recent hospitalization showed a comparable gradient (35.2% vs. 10.7%,
p < 0.001). A history of recurrent urinary tract infection (≥3 episodes in the preceding year) was almost threefold more frequent in MDR patients (44.4% vs. 14.7%,
p < 0.001), reinforcing the role of prior healthcare contact as a clinical proxy for resistance carriage. From a severity standpoint, MDR cases presented with greater inflammatory activation, with a mean baseline CRP of 142.6 ± 58.3 mg/L versus 117.4 ± 51.8 mg/L (
p = 0.012) and a significantly higher median procalcitonin (3.7 vs. 1.9 ng/mL;
p = 0.008), suggesting either a higher initial bacterial load or a delayed control of infection. Although concomitant bacteremia was almost twice as frequent in the MDR group (31.5% vs. 17.3%), this difference reached only borderline statistical significance (
p = 0.062) given the modest sample size (
Table 2).
The pathogen distribution was dominated by
Enterobacterales, which collectively accounted for 102/129 (79.1%) isolates and provided the substrate for the resistance phenotype analyses described later.
Escherichia coli remained the single most common organism (72/129, 55.8%) but was significantly less prevalent among MDR cases (40.7%) than among non-MDR cases (66.7%;
p = 0.003), reproducing a well-recognized pattern in which the relative contribution of
E. coli decreases as antimicrobial selection pressure increases. Conversely,
Klebsiella pneumoniae was numerically over-represented in the MDR group (24.1% vs. 12.0%), although the difference reached only borderline significance (
p = 0.075), likely reflecting power constraints inherent to the cohort size.
Pseudomonas aeruginosa, which is intrinsically less susceptible to many first-line agents, was almost four times more frequent among MDR cases (14.8% vs. 4.0%;
p = 0.027), highlighting the need to consider antipseudomonal coverage in selected high-risk admissions.
Proteus mirabilis showed an even split between MDR and non-MDR groups (7.4% vs. 5.3%;
p = 0.722), consistent with its modest contribution to overall resistance burden in this cohort.
Enterococcus spp. (almost exclusively
E. faecalis with one
E. faecium isolate) accounted for 8.5% of cultures, with no vancomycin-resistant strains detected during the study period. The very small mixed/other category (3.9%) had a balanced distribution across groups and was retained for completeness without further subgroup analysis. Taken together, these proportions describe a pyelonephritis ecology dominated by a shifting balance between
E. coli and
K. pneumoniae, with
P. aeruginosa as a clinically important MDR-enriched minority (
Table 3).
Stratification of the 102
Enterobacterales isolates by acquisition setting revealed a strikingly different resistance landscape between healthcare-associated pyelonephritis (HCA-PN,
n = 44) and community-acquired pyelonephritis (CA-PN,
n = 58). ESBL production was identified in 43.2% of HCA-PN isolates compared with 19.0% of CA-PN isolates (
p = 0.007), and the corresponding figures for third-generation cephalosporin resistance were 40.9% versus 15.5% (
p = 0.003). These two phenotypes closely track one another and reflect a shared underlying mechanism, suggesting that empirical ceftriaxone monotherapy is becoming an unreliable first-line option for HCA-PN in our setting. Fluoroquinolone resistance was even more prevalent in HCA-PN (54.5% vs. 29.3%;
p = 0.011), a worrying observation given that oral ciprofloxacin remains a frequently chosen step-down agent for both inpatient and outpatient pyelonephritis. The proportion of isolates resistant to trimethoprim–sulfamethoxazole exceeded 50% in the HCA-PN group, effectively excluding this agent as an empirical option. Piperacillin–tazobactam resistance affected one-quarter of HCA-PN isolates, raising legitimate concerns about its role as a workhorse broad-spectrum agent in patients with healthcare-related risk factors. Carbapenem non-susceptibility, although relatively uncommon overall, was almost exclusively confined to HCA-PN (13.6% vs. 1.7%;
p = 0.018), a finding consistent with regional ECDC surveillance and underscoring the necessity of carbapenem-sparing strategies even in this low-prevalence niche. Fosfomycin and nitrofurantoin resistance rates were numerically higher in HCA-PN. However, they did not reach statistical significance, supporting the continued utility of these agents for selected step-down indications, particularly when isolates remain susceptible (
Table 4).
Clinical outcomes diverged sharply between MDR and non-MDR groups, both for direct measures of treatment response and for downstream resource utilization. The mean hospital length of stay was 13.7 ± 4.6 days in MDR pyelonephritis versus 8.9 ± 3.2 days in non-MDR cases (
p < 0.001), corresponding to an excess of approximately 4.8 hospital days per MDR admission. This gap was paralleled by a near-doubling of the median time to defervescence (76.3 vs. 41.8 h;
p < 0.001), suggesting that the LOS difference reflects genuinely slower biological resolution rather than purely administrative delays. Time-to-effective therapy was almost three times longer in MDR cases (28.4 vs. 9.7 h;
p < 0.001), and adequate empirical coverage at admission was achieved in only 29.6% of MDR cases compared with 85.3% of non-MDR cases (
p < 0.001), highlighting the fundamental mismatch between routine empirical regimens and resistant pathogen profiles. ICU transfer occurred in 16.7% of MDR patients and 5.3% of non-MDR patients (
p = 0.038), and septic shock and in-hospital mortality were each three- to fivefold higher in the MDR group. However, these endpoints did not reach formal statistical significance because of the limited number of severe events. Thirty-day all-cause readmission was 2.5 times more frequent in MDR survivors (20.4% vs. 8.0%;
p = 0.038), reflecting incomplete eradication, recurrent infection, and the cumulative comorbidity burden of this population. Total antibiotic duration was correspondingly longer in MDR cases (median 14 vs. 10 days;
p < 0.001), with implications for both selection pressure and toxicity exposure (
Table 5).
After multivariable adjustment, four exposures retained independent associations with MDR pyelonephritis at the conventional 0.05 threshold, and an additional three covariates approached significance. They were retained for clinical interpretability and inclusion in the candidate score. Antibiotic exposure within 90 days emerged as by far the strongest predictor, increasing the odds of MDR roughly sixfold (aOR 5.7, 95% CI 2.4–13.6;
p < 0.001), in line with the well-described selection pressure exerted by even brief antibiotic courses on the urinary microbiome. A history of recurrent UTI was associated with a more than threefold increase in the odds of MDR (aOR 3.4, 1.4–8.2;
p = 0.006), independent of comorbidity, suggesting that recurrent infection itself—rather than its associated treatment—functions as a marker of cumulative resistance acquisition. Recent hospitalization carried a similar magnitude of risk (aOR 3.1, 1.2–8.0;
p = 0.020), reproducing the consistent finding from European surveillance that even short hospital stays alter the urinary flora for several months. Renal impairment/renal-failure status at presentation remained independently associated with MDR (aOR 2.4, 1.0–6.2;
p = 0.046), likely reflecting both prior antibiotic exposures associated with recurrent UTI in this group and impaired renal clearance of antimicrobials. Urinary catheterization within 30 days, diabetes, and age each approached but did not reach statistical significance after adjustment. Immunosuppression, prior urinary tract instrumentation, and nephrolithiasis/urolithiasis were then assessed in the expanded candidate model. Prior instrumentation retained a positive but attenuated association (aOR 2.3, 0.8–6.7;
p = 0.116), whereas immunosuppression (aOR 1.8, 0.5–6.0;
p = 0.334) and nephrolithiasis/urolithiasis (aOR 1.5, 0.5–4.4;
p = 0.474) did not add independent discrimination after adjustment for stronger exposure variables. Together, these covariates explained a substantial proportion of the variance in MDR status (Nagelkerke R
2 = 0.46), with adequate calibration (Hosmer–Lemeshow
p = 0.451) and good discrimination (AUC = 0.84), supporting their use as the building blocks of a transparent bedside score (
Table 6).
Patterns of empirical prescribing differed between MDR and non-MDR cases in clinically meaningful ways. Ceftriaxone monotherapy, the historic backbone of empirical pyelonephritis therapy, was used somewhat less frequently in patients who turned out to have MDR pyelonephritis (35.2% vs. 50.7%; p = 0.082), suggesting some degree of clinician anticipation of resistance based on bedside risk factors. Conversely, piperacillin–tazobactam and carbapenems were preferred in approximately 39% of MDR admissions (combined), against 21% in non-MDR cases, although neither comparison individually reached significance, again reflecting limited power. Despite these adjustments, the most striking divergence was in the timing of correct coverage: only 25.9% of MDR cases received an active agent within 12 h, compared with 68.0% of non-MDR cases (p < 0.001), and more than half of MDR patients waited longer than 24 h for an active drug (p < 0.001). De-escalation within 72 h was achievable in only 22.2% of MDR cases, compared with 52.0% of non-MDR cases (p < 0.001), and the corresponding figures for the intravenous-to-oral switch within 72 h were 14.8% versus 45.3% (p < 0.001). Together, these data show that even when broad-spectrum agents were chosen empirically for MDR pyelonephritis, the lag in confirming activity against the index isolate translated into protracted intravenous therapy, delayed transition to ward-friendly oral options, and prolonged occupancy of inpatient beds, all of which compound the direct microbiological consequences of resistance.
Figure 1 maps the architecture of co-resistance in our
Enterobacterales subset and illustrates two clinically relevant clusters. The strongest single co-occurrence was between ESBL production and third-generation cephalosporin resistance, with a Jaccard index of 0.78 and a pairwise OR of 18.4, a finding consistent with the underlying mechanistic link between these two phenotypes; in practice, this means that 78% of isolates with either trait carried both, and that detection of ESBL by phenotypic testing essentially predicts cephalosporin failure. A second tightly linked cluster was formed by ESBL production and fluoroquinolone resistance (Jaccard 0.62; OR 6.8) and, in turn, by fluoroquinolone resistance and trimethoprim–sulfamethoxazole resistance (Jaccard 0.43; OR 4.1), confirming that loss of fluoroquinolone activity in our cohort almost always travels with loss of two additional commonly prescribed oral agents. Carbapenem non-susceptibility, in contrast, behaved as a relatively isolated phenotype (Jaccard ≤ 0.18 against any other phenotype; OR ≤ 3.6), consistent with the view that carbapenemase-producing isolates form a small, mechanistically distinct subset rather than a smooth extension of broader resistance gradients. Piperacillin–tazobactam resistance occupied an intermediate position, with moderate links to aminoglycoside (Jaccard 0.41) and ESBL (Jaccard 0.34) phenotypes, suggesting that its empirical use should be informed by the local prevalence of these neighboring resistances. The overall pattern argues for an integrated empirical strategy in which ESBL and fluoroquinolone risk are evaluated jointly rather than independently when MDR pyelonephritis is suspected.
Table 7 summarizes the derivation of the PYELO-MDR-Risk score, a points-based instrument that translates the multivariable model in
Table 5 into a bedside tool. The largest coefficient by far was that for prior antibiotic exposure (β = 1.74), which translated into 3 score points and reflects an adjusted odds ratio of 5.7. Recurrent UTI and recent hospitalization each contributed 2 points, mirroring their adjusted odds ratios of approximately 3.4 and 3.1. Urinary catheterization within 30 days, which had an adjusted OR of 2.6 but did not reach statistical significance in the regression model (
p = 0.082), was retained as a 2-point variable because of its strong biological rationale and its consistent direction of effect across bootstrap resamples. The remaining variables—renal impairment/renal-failure status at admission, diabetes mellitus, and age ≥ 65 years—contributed 1 point each, reflecting smaller adjusted odds ratios in the 2.0–2.4 range. Immunosuppression, prior urinary tract instrumentation, and nephrolithiasis/urolithiasis were considered for score inclusion but did not improve bootstrap-corrected discrimination or calibration and were therefore excluded from the final point system. The resulting 0–12 score showed apparent discrimination (AUC 0.86), which fell only modestly to 0.84 after optimism correction across 1000 bootstrap resamples, suggesting limited overfitting given the cohort size. Calibration was excellent: the calibration slope of 0.96 was very close to the ideal value of 1, the Brier score of 0.142 indicated overall accurate probability estimates, and the Hosmer–Lemeshow test (
p = 0.624) found no significant departure from the predicted probability across deciles. At the prespecified cut-point of ≥4 points, the score showed balanced sensitivity (78.4%) and specificity (79.2%), with a negative predictive value of 83.6%—a property particularly useful when the goal is to identify low-risk patients in whom narrow-spectrum empirical therapy can be safely prioritized. Three risk strata (0–3, 4–6, and ≥7 points) corresponded to observed MDR proportions of approximately 12%, 52%, and 84%, providing intuitive thresholds for empirical-therapy decisions (
Table 8).
Subgroup analyses provided reassurance about the generalizability of the antibiotic-exposure effect across clinically meaningful strata. The point estimate of the adjusted odds ratio was greater than 3.5 in every subgroup tested, with all 95% confidence intervals excluding unity. Effect modification was tested formally using multiplicative interaction terms; none reached the 0.10 threshold. The largest numerical differences were observed in patients aged ≥65 years (aOR 7.4 vs. 3.6 in younger patients; p_interaction = 0.27) and in those with renal impairment/renal-failure status at admission (aOR 9.1 vs. 4.6; p_interaction = 0.34). Although these heterogeneities did not reach statistical significance, they are biologically plausible and raise the possibility that the score may be even more discriminatory in older patients with reduced renal function than overall figures suggest. Importantly, the effect of antibiotic exposure was preserved both in patients with bacteremia (aOR 6.7) and in those without (aOR 5.1), and across both
E. coli (aOR 4.4) and non-
E. coli (aOR 9.8) aetiologies, suggesting that the predictive value of recent antibiotic use is not driven by a single pathogen subgroup. Sex showed no meaningful interaction (
p = 0.62), although the male subgroup had wider confidence intervals reflecting smaller numbers (
n = 35). Taken together, these analyses argue that the PYELO-MDR-Risk score, which leans heavily on antibiotic exposure, can be applied with similar confidence across the principal demographic and clinical strata represented in this cohort, and that interpretation does not require subgroup-specific recalibration in the population studied (
Table 9).
Time-to-event analyses provided a complementary perspective on the impact of MDR by treating recovery itself as a continuous, censored outcome rather than collapsing it into binary success or failure. In univariable Cox analysis, MDR pyelonephritis was associated with a substantially slower transition to clinical stability (HR 0.42, 95% CI 0.28–0.62; p < 0.001), corresponding to roughly a halving of the instantaneous probability of stabilization at any given hour. The effect was attenuated but not abolished after adjustment for bacteremia, comorbidity, time-to-effective therapy, and diabetes (aHR 0.51, 95% CI 0.33–0.78; p = 0.002), supporting an independent biological signal beyond the indirect pathway through delayed coverage. Time-to-effective therapy beyond 24 h emerged as an independent driver of slower recovery (aHR 0.62; p = 0.045), confirming the prognostic relevance of empirical-therapy timeliness. Median time to clinical stability was 4.7 days in MDR cases compared with 2.6 days in non-MDR cases, a clinically meaningful gap of approximately two hospital days. The proportional hazards assumption was supported globally (Schoenfeld, p = 0.41). For the secondary endpoint of 30-day all-cause readmission, the Fine–Gray competing-risks model treating in-hospital death as a competing event yielded a sub-distribution hazard ratio of 2.84 (95% CI 1.05–7.69; p = 0.040) for MDR versus non-MDR, demonstrating that the higher readmission rate among MDR survivors persists after appropriate handling of competing mortality. Together, these analyses confirm that MDR is a robust, independently relevant signal for both delayed in-hospital recovery and increased post-discharge healthcare utilization.
Figure 2 displays the discrimination of the PYELO-MDR-Risk score against a parsimonious clinical baseline. The score achieved an area under the ROC curve of 0.84 (95% CI 0.77–0.91), substantially exceeding the baseline clinical model that used age, sex, and Charlson Comorbidity Index alone (AUC 0.71, 95% CI 0.62–0.79); the difference was statistically significant on DeLong’s test (
p = 0.004). At the operationally relevant cut-point of ≥4 points, the score yielded a sensitivity of 78.4% and a specificity of 79.2%, with a positive predictive value of 73.3% and a negative predictive value of 83.6%, balancing the competing demands of broad-spectrum stewardship and adequate empirical coverage of high-risk patients. The relative position of the two curves is informative across the entire operating range: the PYELO-MDR-Risk curve dominates the clinical-baseline curve at every false-positive threshold below approximately 0.6, indicating uniformly better classification at the sensitivity values most relevant to inpatient empirical decision-making. The shaded area between the two curves represents the additional information captured by the bedside exposure variables (recent antibiotic use, recurrent UTI, hospitalization, catheter, and renal impairment) over and above the demographic-and-comorbidity baseline. From a clinical-utility standpoint, the chosen ≥4-point cut-off lies on the steep part of the ROC curve, where each additional point of sensitivity is purchased at a relatively small specificity cost, supporting its adoption as the default threshold for stratifying empirical-therapy decisions in this population.
Figure 3 visualizes the calibration of the PYELO-MDR-Risk score across deciles of predicted risk. Observed proportions of MDR within each decile closely follow the diagonal of perfect calibration throughout the entire risk spectrum from approximately 4% to 86% predicted risk, with no systematic over- or underestimation at either tail. The cubic-smoothed line lies almost exactly on the identity line, and 95% confidence intervals for each decile cross the diagonal in nine of ten bins. Quantitatively, the calibration slope was 0.96, very close to the ideal value of 1; the calibration-in-the-large was −0.04; the Brier score was 0.142; and the Hosmer–Lemeshow goodness-of-fit test was non-significant (χ
2 = 6.2, df = 8,
p = 0.624). The pale histogram at the foot of the figure illustrates the distribution of predicted risks across the cohort and shows a clearly bimodal pattern, with one peak around 0.10 (corresponding to the low-risk stratum, score 0–3) and a second around 0.55 (corresponding to the intermediate-to-high-risk strata, score 4–6 and ≥7). This bimodality suggests that the score naturally partitions patients into two clinically meaningful groups—those in whom narrow-spectrum empirical therapy is likely to suffice and those in whom broader-spectrum coverage should be considered—with relatively few patients falling in the ambiguous intermediate-risk zone. Together with the discrimination metrics in
Figure 2, these calibration properties support the use of the score as a transparent and well-behaved bedside instrument in this population.