1. Introduction
Acid–base changes during dialysis can be interpreted through two complementary frameworks. The traditional Henderson–Hasselbalch (HH) model relates pH to the ratio of bicarbonate (HCO
3−) and carbon dioxide tension (PCO
2) and has long underpinned clinical reasoning, treatment of acid base disorders, and dialysate prescriptions. In contrast, Stewart’s physicochemical approach [
1] treats HCO
3− as a dependent variable and posits three independent determinants of pH: the strong ion difference (SID), PCO
2, and the total concentration of non-volatile weak acids (ATOT). Abnormality of one or more of the independent variables underlies all acid–base disturbances [
2]. Adjustment of the independent variables is the essence of all therapeutic interventions, because none of the “dependent variables” [e.g., pH, Base Excess (BE), HCO
3−] can be changed primarily or individually. Instead, the dependent variables change, all of them simultaneously, if, and only if, one or more of the independent variables change [
3]. In this view, pH changes arise from shifts in strong ions and weak acids rather than from “adding bicarbonate” per se.
Several studies have applied the Stewart framework in dialysis and critical illness. Libório et al. showed that in maintenance HD, acidosis was mainly attributable to unmeasured anions, hyperchloremia, and hyperphosphatemia, with chloride and unidentified anions accounting for nearly 90% of the metabolic component [
4]. Prior comparative work has reported ~28% disagreement between Stewart-based and traditional classifications when the anion gap is left uncorrected for albumin. Lee et al. observed that in HD patients with metabolic acidosis, the strong ion gap (SIG) was consistently elevated and better captured unmeasured anions than conventional AG [
5]. However, correcting the anion gap for albumin (AGc) materially improves its performance, and some studies show comparable discrimination between AGc and SIG. Moreover, there is no consistent evidence that Stewart-derived indices outperform traditional metrics in predicting clinical outcomes (e.g., mortality) [
6].
In critically ill dialysis patients, chloride balance was highlighted as a key determinant of acidosis correction, with the relationship between plasma and dialysate chloride influencing the degree of improvement [
7]. Constable and Kurtz et al. emphasized that while the Stewart approach is chemically rigorous, it does not surpass the Henderson–Hasselbalch framework in predictive accuracy—particularly in open systems like hemodialysis, where bicarbonate flux is the dominant driver [
8,
9].
Stewart theory predicts that the alkalinizing effect is mediated primarily by the sodium cation, which raises SID, while the administered HCO3− equilibrates with CO2 and water, with excess CO2 subsequently eliminated. Thus, the rise in pH reflects a change in strong-ion composition, not an independent action of bicarbonate. This distinction is directly relevant to dialysis, where patients are exposed to large bicarbonate fluxes and dialysate chloride is simultaneously lower; both processes can shift SID and ATOT, while PCO2 often changes little across a session.
Despite several physiochemical analyses in dialysis, no prior study has directly evaluated, in the same cohort, how well the Stewart model predicts both absolute pH and dialysis-induced ΔpH compared with the Henderson–Hasselbalch framework, nor whether Figge/Constable-based pH predictions systematically deviate from measured values. This gap leaves uncertainty about the practical performance of the Stewart model under real-world bicarbonate-based dialysis conditions, which our analysis addresses through paired HD data, cross-modal comparisons, and Bland–Altman evaluation.
Accordingly, we designed two complementary analyses. First, a paired Δ-analysis in hemodialysis (HD) examined whether ΔpH was explained more accurately by the Henderson–Hasselbalch (HH) framework (ΔHCO3−, ΔPCO2) or by a Stewart model (ΔSIDa, ΔATOT, ΔPCO2).
Second, a cross-sectional level analysis pooled pre-HD, post-HD, and peritoneal dialysis (PD) observations to compare HH-based models against a Stewart model for predicting absolute pH.
In parallel, we decomposed the biochemical drivers of alkalinization, quantifying the specific contributions of chloride shifts (reflected in SIDa), changes in weak acids (ATOT; albumin and phosphate), and the burden of unmeasured anions (AGc/SIG), in the context of relatively stable PCO2 during dialysis.
2. Results
The study cohort comprised 94 patients, including 54 males (mean age 66.6 ± 12.2 years) and 40 females (mean age 68.6 ± 10.5 years), with an overall mean age of 67.5 ± 11.5 years. ANOVA showed no significant difference in age between males and females (F = 0.704,
p = 0.404). Patients on PD (n = 41, mean age 67.8 ± 12.5 years) and HD (n = 53, mean age 67.2 ± 10.8 years) also had comparable ages, with no significant difference by dialysis modality (F = 0.062,
p = 0.804). The distribution of dialysis modality was balanced across gender groups (42.6% of males and 45.0% of females on PD), and Chi-Square analysis confirmed the absence of association between sex and dialysis modality (χ
2 = 0.054,
p = 0.816). No significant differences were observed between males and females in biochemical or acid–base parameters, indicating that sex did not influence systemic electrolyte balance or acid–base homeostasis in this cohort (
Supplementary Table S1).
2.1. Analysis of the Whole Study Group
Across the three groups (PD, pre-HD, post-HD), acid–base parameters and several electrolytes differed significantly (
Table 1). The lowest pH was observed pre-HD (7.31), accompanied by reduced bicarbonate (~19 mmol/L), whereas post-HD showed clear alkalinization (pH 7.42, HCO
3− ~25 mmol/L; both ANOVA
p < 0.001). PD patients exhibited intermediate values (pH 7.39, HCO
3− 23 mmol/L). PCO
2 remained unchanged across groups (~40 mmHg,
p = 0.94).
During HD, SIDa rose slightly (+1.39 mmol/L, p = 0.025) and ATOT remained stable (–0.40 mmol/L, p = 0.44), changes insufficient to explain the pH shift. As expected, potassium was higher pre-dialysis and decreased significantly after HD (p < 0.01). Chloride was lower in PD than pre-HD and declined modestly post-HD, while phosphate decreased sharply after HD. Albumin increased after HD, consistent with hemoconcentration.
Post hoc (Tukey) testing showed that pre-HD values differed significantly from both PD and post-HD for most parameters (pH, HCO
3−, SIDe, SIG, AGc). Post-HD values converged toward PD for pH and bicarbonate but remained distinct in terms of lower SIG/AGc, indicating a greater burden of unmeasured anions in PD (
Supplementary Tables S2 and S3).
2.1.1. Regression Analysis of Absolute Measured pH Values Across Modalities
In the cross-sectional level analysis (all observations pooled, n = 147), the Stewart model using SIDa, ATOT, and PCO2 explained only 36% of the variance in measured pH (R2 = 0.362, adjusted R2 = 0.349; SEE = 0.061). In contrast, the Henderson–Hasselbalch framework, incorporating HCO3− and PCO2, accounted for 96% of the variance (R2 = 0.961, adjusted R2 = 0.960; SEE = 0.015).
2.1.2. Bland–Altman Analysis of Stewart-Predicted pH
We applied the Figge electroneutrality equation [
5] to predict pH from SIDa, ATOT and PCO
2 in the whole group and the subgroups.
We found that it systematically overestimated the measured pH across all modalities, as is shown with the Bland–Altman analysis below. In the overall cohort (n = 147), the mean bias was +0.111 pH units. In PD patients the bias was +0.137; pre-HD, the bias was +0.141 and the post-HD group the bias was +0.06 (
Figure 1).
Thus, the Figge approach consistently predicted higher pH values than measured, with the largest overestimation observed before HD, slightly less in PD, and the lowest in post-dialysis samples. Importantly, in all subgroups the 95% limits of agreement remained entirely positive, indicating a systematic upward bias. For sensitivity analysis we also applied the simplified Constable equation [
6], which produced a slightly higher systematic bias (+0.112 pH units), indicating that both Stewart-based formulations overestimate pH values.
Each panel depicts the agreement between measured arterial pH and pH predicted using the Figge–Fencl Stewart-based formulation in all patients (Overall) and in the subgroups of peritoneal dialysis (PD), hemodialysis before treatment (HD-PRE), and after treatment (HD-POST). The thick dashed line indicates the mean bias, and the thin dashed lines represent the 95% limits of agreement (LoA = bias ± 1.96 SD). Numerical annotations show the bias, LoA range, and sample size (N) for each comparison. The overall bias was +0.111 pH units (LoA −0.009 to +0.231; N = 147), higher in HD-PRE (+0.141) and PD (+0.137), and lower in HD-POST (+0.060). A positive bias indicates systematic overestimation of alkalinization by the Figge–Fencl equation relative to measured values.
2.1.3. Strong Ion Gap or Corrected Anion Gap
In the regression analysis we observed a strong linear relationship between AGc and SIG (AGc = 7.94 + 0.97·SIG (R
2 = 0.86,
p < 0.0001, N = 147)) (
Figure 2). The slope was close to 1.0, indicating parallel changes in the two variables across the cohort. The intercept was approximately 8 mEq/L.
2.2. Analysis of Changes in the HD Group (Pre/Post)
Beyond cross-sectional comparisons, we analyzed paired pre–post changes within the hemodialysis (HD) cohort to identify which modeling framework best describes ΔpH. Post-HD, pH increased by 0.11, base excess rose by 7.04, and bicarbonate increased by 5.67, consistent with effective alkali delivery (
Table 2). In contrast, SIDa changed only marginally (by +1.39) because opposing shifts largely canceled each other out: chloride declined (which tends to raise SIDa) while potassium also declined substantially (which tends to lower SIDa). ATOT remained relatively stable, reflecting a compensatory pattern of increased albumin (hemoconcentration) and decreased phosphate (dialytic clearance). As expected, PCO
2 showed no significant net change during HD, consistent with steady CO
2 content in the dialysate. Ordinary least-squares regression on paired ΔpH (
Table 3) favored the Henderson framework (ΔHCO
3 + ΔPCO
2, R
2 = 0.903, adj. R
2 = 0.899), while the Stewart model (ΔSIDa, ΔATOT, ΔPCO
2) explained about half of the variance (R
2 = 0.514, adj. R
2 = 0.485).
Regarding the three linear models for ΔpH, the analysis clearly favors the Henderson model (ΔHCO3− + ΔPCO2), which demonstrated the highest R2/adjusted R2, the lowest SEE, and the most favorable AIC/BIC values, indicating superior fit and efficiency. The ΔHCO3-only model underfits (low R2, higher SEE, worse AIC/BIC), confirming that bicarbonate alone does not capture inter-patient variability in ΔpH. The Stewart-only model performs moderately (R2 = 0.485) but is decisively outperformed by Henderson on all criteria.
Coefficient patterns are physiologically coherent. In the Henderson model, ΔHCO3 is strongly positive and ΔPCO2 strongly negative—both highly significant—consistent with the Henderson–Hasselbalch framework. In the Stewart model, ΔSIDa is a positive predictor (significant), ΔPCO2 remains negative (significant), and ΔATOT is non-significant—compatible with the small, opposing albumin/phosphate changes within a single HD session.
In this cohort, mean ΔSIDa was +1.39 mmol/L, whereas ΔHCO3 averaged +5.67 mmol/L with ΔPCO2 +0.38 mmHg. Using the fitted coefficients, the Stewart slope for ΔSIDa (0.009 pH units per mmol/L) implies an expected ΔpH of only ~0.012—far below the observed effect. By contrast, the Henderson model coefficients (0.0233 for ΔHCO3 and −0.011 for ΔPCO2) yield a predicted ΔpH ≈ 0.126 for ΔHCO3 = +5.6 mmol/L and ΔPCO2 = +0.38 mmHg, closely matching the observed mean ΔpH of +0.116.
3. Discussion
In this cohort of 41 PD patients and 53 HD patients, we analyzed 147 blood gas samples to compare the explanatory performance of the “Boston” and the Stewart frameworks across modalities and within paired HD sessions. Hemodialysis produced a characteristic alkalinization with substantial increase in HCO3− accompanied by a fall in chloride and a reduction in unmeasured anions, yielding a mean ∆pH of ~0.11.
Importantly, ATOT remained essentially unchanged across dialysis because hemoconcentration-related increases in albumin were offset by dialytic phosphate removal. Consequently, changes in weak-acid charge cannot account for the rise in pH.
The mean SIDa increased modestly by +1.39 mmol/L—far too little to explain the magnitude of the pH or bicarbonate change. Mechanistically, sodium remained relatively stable during HD (+0.43), while the decrease in chloride (ΔCl− = −2.34) was partially neutralized in the SIDa equation by a concomitant fall in potassium (−1.53); these opposing charge shifts blunted the net change in SIDa. The fall in chloride and potassium is consistent with the dialysate composition with higher HCO3− and lower Cl− and K+ than plasma.
Moreover, PCO2 in blood mirrored the near-constant CO2 content maintained in the dialysate, so neither ATOT nor PCO2 shifted in a way that could drive the observed alkalinization within a strictly Stewart formulation.
To probe the unmeasured-anion component, we examined SIG and AGc in the whole group. According to our data the relationship between the two of them was AGc = 8 + 0.97·SIG (
Figure 2). In his paper in 1999, Constant supports that SIG is more specific in detecting the presence of unmeasured strong ions in plasma than the anion gap [
8]. Similar results were reported by Lee et al. [
5]. In our analysis AGc and SIG track the same physiology—unmeasured anions—but on slightly different zero points. The perfect slope (~1) indicates they move together almost identically, while the ~8 mEq/L intercept reflects definitional choices (albumin/phosphate/organic anions for AGc vs. organic anions in SIG). Practically, AGc and SIG are interchangeable proxies for the unmeasured-anion burden. While SIDa rose only slightly, both SIG and AGc declined substantially, indicating clearance of unmeasured anions. This is consistent with removal of uremic organic acids—e.g., indoxyl sulfate, hippurate, p-cresyl sulfate, urate, ketoacids.
From a Stewart perspective, the pH rise is captured by a pronounced increase in SIDe and a fall in SIG; critically, the SIDe increase is driven almost entirely by the bicarbonate gain. In parallel, the relatively low chloride in the dialysate (due to displacement by the bicarbonate load) corrects the hyperchloremic fraction of acidosis, further contributing to the net alkalinization.
From the Henderson–Hasselbalch viewpoint, the interpretation is straightforward: exogenous bicarbonate enters from the dialysate and pH increases with PaCO2 essentially unchanged. Our paired analysis showed that the observed rise in pH was almost completely explained by the increase in plasma bicarbonate.
Both frameworks therefore describe the same physical process from different angles: the dialysate is relatively rich in HCO3−, poor in Cl−, and effectively devoid of organic anions. The small acetate/citrate load is titrated by bicarbonate to yield a stable CO2 concentration in the effluent, while the corresponding anions are cleared. Thus, HCO3− accumulation, Cl− displacement, and organic-anion removal occur simultaneously. This explains why SIDa—which excludes HCO3−—underestimates the alkalinizing effect, whereas SIDe and SIG, which include HCO3−, track the process more faithfully.
A conceptual paradox emerges here. In Stewart’s approach, bicarbonate is a dependent variable, yet in practice the model requires HCO3− (via SIDe) to reveal the fall in SIG. Without the bicarbonate term, the physicochemical model would miss the contribution of organic-anion removal—an effect that, conversely, is readily detected by the cAG. Thus, while Stewart’s approach elegantly emphasizes independent strong ions and weak acids, its practical application in HD paradoxically leans on bicarbonate to uncover the exact changes in SIG.
In its conventional closed-system form (using only SIDa, ATOT, and PCO2 as independent variables), the Stewart model underestimated the alkalinization observed during HD, because these variables changed very little. In contrast, when the analysis was extended to an open-system context that also considers the exogenous bicarbonate load (through SIDe), the rise in pH was fully captured. At the same time, the fall in SIG/AGc reflected removal of unmeasured organic anions. Thus, using SIDe and SIG rather than SIDa alone allows the Stewart framework to describe the alkalinizing effect of dialysis, showing that the apparent mismatch comes from the restricted closed-system implementation, which cannot account for bicarbonate infusion. These findings also align with prior critiques of the closed-system Stewart formulation. Constable emphasized that SIDa cannot capture alkalinization when bicarbonate is added or removed and that SIDe/SIG—because they incorporate HCO3− and weak-acid charge—provide a more complete representation of pH under such conditions. Similarly, Rastegar noted that in clinical settings where changes in bicarbonate dominate the acid–base response, the Henderson–Hasselbalch framework remains more predictive than a Stewart model restricted to SIDa, ATOT and PCO2. Our results in dialysis, an archetypal open system with substantial bicarbonate gain and organic-anion clearance, are fully consistent with these observations.
With our analysis we also found that the Figge–Constable equations systematically overestimated measured pH in the overall cohort, most notably pre-HD, with smaller bias post-HD. If the problem were primarily due to differences in ion measurement techniques that have changed since the time of Figge and Constable, the bias would have been consistent pre- and post-dialysis, since sodium and chloride were measured using the same contemporary methods on the same analyzer. The reduction in bias after dialysis, when mean pH increased from 7.31 to 7.42, suggests that methodological factors cannot explain the mismatch. Rather, the shift toward alkalemia likely altered the effective dissociation behavior of albumin and phosphate, bringing their apparent Ka values closer to those assumed by the Figge–Constable model. In addition, dialysis reduces the burden of organic anions, thereby decreasing the requirement for counterbalancing strong cations and simplifying the ionic milieu in a way that further enhances model performance. Taken together, these findings suggest that the choice of dissociation constants may contribute substantially to prediction error, and that reevaluating the Ka values under the dynamic pH and organic anion conditions encountered during hemodialysis may improve the accuracy of the Stewart approach.
Our findings are consistent with those of Hernández et al., who, in a cohort of 35 HD patients, found that the Henderson-based framework provided a closer fit to measured pH [
10]. Consistent results were also observed in the intensive care setting: in a study of 172 patients, the standard base excess (SBE) model was a superior predictor of acidemia and alkalemia compared with the apparent SID-based model [
11], while in another ICU cohort the bicarbonate–cAG approach showed diagnostic performance comparable to that of the Stewart framework [
12].
Although our study focuses on chronic dialysis, broader evidence suggests that SIG may have clinical relevance beyond dialysis. In patients with AKI and metabolic acidosis, SIG has been independently associated with both short- and long-term mortality [
13], underscoring its prognostic value and supporting its use in general nephrology practice.
With regard to the PD group, patients exhibited higher AGc and SIG values than HD patients, indicating a greater burden of unmeasured anions in PD. This observation aligns with the findings of Klabosh et al., who demonstrated that metabolic acidosis in PD is characterized by an increased SIG compared with healthy controls [
14].
This study has several strengths. First, it uniquely combines paired pre/post hemodialysis data with a peritoneal dialysis cohort, allowing direct comparison of acid–base responses across modalities. Second, it applies both Henderson–Hasselbalch and Stewart–Figge frameworks in parallel, not only at absolute levels but also in Δ-models, providing a rigorous head-to-head assessment of predictive performance. Finally, the dataset is clinically relevant, reflecting routine dialysis practice with contemporary bicarbonate-based solutions, and the results directly inform the ongoing debate over the utility of physicochemical versus traditional models in real-world dialysis physiology.
Limitations include the single-cohort design and modest sample size. Dialysate composition (HCO3−/Cl−) was uniform rather than randomized, and we did not directly quantify the full panel of organic anions. Additionally, the generalizability of our findings may be influenced by inter-center variability in dialysate composition. Differences in bicarbonate concentration, chloride content, potassium bath, or acetate/citrate buffering can modify the relative contributions of HCO3− gain, chloride displacement, and organic-anion clearance to pH changes; thus, the exact magnitude of the effects observed here may differ in dialysis units using alternative prescriptions. Nevertheless, the conceptual distinction between closed- and open-system Stewart interpretation, and the superior predictive performance of the Henderson–Hasselbalch framework under bicarbonate-based dialysis, should remain broadly applicable
4. Materials and Methods
4.1. Study Population, Dialysis Solutions and Sampling
We enrolled 53 chronic hemodialysis (HD) patients with paired arterial blood gases and blood chemistry immediately before and after routine sessions (106 observations total) and 41 stable peritoneal dialysis (PD) patients sampled cross-sectionally (41 observations). Accordingly, cross-sectional (“level”) analyses comprised 147 observations (pre-HD, post-HD, PD).
Dialysis treatments were performed with a Fresenius 4008S machine (Fresenius Medical Care, Bad Homburg, Germany). The standard dialysate composition was sodium 138 mEq/L, potassium 2.0 mEq/L, calcium 3.0 mEq/L, magnesium 1.0 mEq/L, chloride 108 mEq/L, acetate 2.7 mEq/L, and bicarbonate 32 mEq/L, with minor adjustments according to individual clinical requirements.
Peritoneal dialysis was performed with bicarbonate-buffered bicaVera solutions (Fresenius Medical Care, Bad Homburg, Germany) and, when indicated, with icodextrin (Extraneal, Baxter Healthcare, Deerfield, IL, USA) for long dwell exchanges. After mixing of the dual chambers, the final composition of bicaVera was sodium 134 mEq/L, calcium 1.75 mmol/L, magnesium 0.5 mmol/L, chloride 101 mEq/L, bicarbonate 34 mEq/L, and glucose at 1.5%, 2.3%, or 4.25% (corresponding to 83, 126, or 236 mmol/L). Icodextrin solution contained sodium 133 mEq/L, calcium 1.75 mmol/L, magnesium 0.25 mmol/L, chloride 96 mEq/L, lactate 40 mEq/L, and 7.5% icodextrin as the osmotic agent.
Patients continued on their individualized PD and HD prescriptions without modification on the day of measurement. The study was conducted in accordance with the principles of the Declaration of Helsinki. All participants provided written informed consent prior to enrollment. As all laboratory tests were part of routine clinical care and no additional interventions were performed, the requirement for formal ethics committee approval was waived by the local institutional review board.
4.2. Measurements and Derived Variables
Measured variables included pH, PCO2 (mmHg), HCO3− (mEq/L), Na+, K+, ionized Ca2+ (iCa), ionized Mg2+ (iMg), Cl− (all in mEq/L), lactate (mmol/L), albumin (g/L), and phosphate (P) (mmol/L).
We implemented the Figge/Stewart framework [
2,
3,
8,
15,
16]. The equations used were defined as follows:
Weak acid anions:
where [Alb
−] is albumin charges and [Pi
−] is phosphate charges; [Alb
−] = [Alb]·(0.123·pH − 0.631) and [P
−] = [P]·(0.309·pH − 0.469), with [Alb] in g/L and [P] in mmol/L.
Apparent strong-ion difference:
Effective strong-ion difference:
Stewart-predicted pH (Figge electroneutrality equation):
S = 0.0301 mmol/L/mmHg and pK
a = 6.10.
The Constable simplified equation, which is derived from Equation (5), was solved numerically for each sample to obtain the Constable-predicted pH:
where K′
l is the apparent equilibrium constant for the Henderson–Hasselbalch equation, S is the solubility of CO
2 in plasma, PCO
2 is arterial carbon dioxide tension, K
a is the dissociation constant for weak acids, [ATOT] is the total concentration of weak acids and [SID
+] is the strong-ion difference SIDa.
Anion gap was calculated as
and corrected for the effect of abnormal albumin concentration with the formula
4.3. Statistical Analysis
For paired HD Δ-models, the outcome was ΔpH (post − pre). We fitted a Henderson–Hasselbalch model (ΔpH defined by ΔHCO3− and ΔPCO2) and a Stewart model (ΔpH defined by ΔSIDa, ΔATOT and ΔPCO2).
For cross-sectional level models (all observations; n = 147), the outcome was pH, modeled as HH model (pH defined by HCO3− and PCO2) and Stewart model (pH defined by SIDa, ATOT and PCO2). All models used ordinary least squares, and prespecified performance metrics were R2/adjusted R2.
All statistical analyses were performed using IBM SPSS Statistics, version 26 (IBM Corp., Armonk, NY, USA). As SPSS does not provide a built-in function for Bland–Altman analysis, we generated the corresponding plots using R code version 4.3.2 (R Foundation for Statistical Computing, Vienna, Austria) obtained with the assistance of ChatGPT (OpenAI GPT-5.1 model; OpenAI, San Francisco, CA, USA).
Model assumptions for linear regression were systematically assessed. Normality of ΔpH and model residuals was verified using Shapiro–Wilk tests (p = 0.87) and Q–Q plot inspection, confirming an approximately normal distribution without need for transformation. Homoscedasticity and independence of residuals were evaluated graphically and showed no violations. Multicollinearity was minimal (VIF < 2 for all predictors). Influential observations were assessed with Cook’s distance; although two data points exceeded the conventional 4/n threshold, they reflected physiologically plausible changes and were retained. No outliers were removed.