1. Introduction
Hyperglycaemia and hypertension co-occur in 40–60% of individuals with type 2 diabetes and jointly define much of the excess cardiovascular risk attributed to the metabolic syndrome. The biological mechanisms linking the two conditions are well characterised and bidirectional. Hyperglycaemia promotes endothelial dysfunction through advanced glycation end-product accumulation, oxidative stress, and impaired nitric oxide bioavailability, thereby elevating peripheral vascular resistance and systolic blood pressure [
1,
2]. In the reverse direction, hypertension activates the renin–angiotensin–aldosterone system (RAAS), promotes insulin resistance through reduced skeletal-muscle perfusion, and may directly impair pancreatic beta-cell function via angiotensin II signalling [
3,
4]. Despite these well-characterised pathways, whether the net population-level relationship is predominantly forward (glucose → BP), reverse (BP → glucose), or driven by shared upstream factors remains contested.
Conventional observational studies rely on multivariable regression, which adjusts for measured confounders but cannot distinguish forward from reverse causation. Mendelian randomisation (MR) studies have provided some evidence for a glucose-to-BP effect [
5,
6]; however, genetic instruments for complex metabolic traits carry limitations including horizontal pleiotropy and limited power to detect modest effects in the reverse direction. Importantly, MR instruments capture lifelong genetic effects, whereas the clinically relevant question often concerns short-to-medium-term phenotypic relationships. Our framework complements genetic approaches by isolating these phenotypic effects using multiple non-genetic causal inference methods, each with distinct identifying assumptions.
The present study applies such a triangulated framework to the National Health and Nutrition Examination Survey (NHANES), a nationally representative, cross-sectional programme with a sufficient sample size and phenotypic breadth to support multiple causal inference methods. We construct a directed acyclic graph (DAG) grounded in domain knowledge, test its implied conditional independencies empirically, and then estimate the causal effects using structural equation modelling (SEM), propensity score matching (PSM), inverse probability weighting (IPW), and augmented IPW (AIPW). Robustness of the conclusions is evaluated through the E-value analysis and Rosenbaum sensitivity bounds, and the structural path estimates are externally validated against the Framingham Heart Study data.
The novelty of this work lies not in the clinical observation that hyperglycaemia and hypertension co-occur, which is well established, but in the specific integration of DAG-based causal identification, latent-variable SEM decomposition of the metabolic syndrome confounding, doubly robust counterfactual estimation, and formal sensitivity analysis within a single analytical pipeline. While individual components of this framework have been applied in other settings, their combined deployment for a bidirectional cardiometabolic question is, to our knowledge, novel and provides a methodological template that may be applied to other interacting chronic conditions where causal direction is clinically consequential.
The primary objective of this study is to test whether genuine bidirectional causal effects exist between glycaemic status and blood pressure after accounting for the shared metabolic confounders. Secondary objectives include: (a) quantifying the magnitude of each directional effect using complementary methods, (b) assessing the robustness of observed effects to unmeasured confounding, and (c) validating the causal structure in an independent cohort. Our primary hypotheses are: (H1) hyperglycaemia exerts a positive causal effect on blood pressure after adjustment for metabolic-syndrome confounders, although the net direction of this direct effect may be attenuated or reversed once shared metabolic syndrome variance is partialled out in structural models; and (H2) hypertension exerts a positive causal effect on fasting plasma glucose independently of adiposity and inflammation. We further hypothesise that the BP → glucose direction is of greater magnitude, consistent with the dominant RAAS-mediated mechanism.
2. Materials and Methods
2.1. Data Source and Study Population
NHANES is a complex multistage probability survey conducted by the U.S. National Centre for Health Statistics [
7] that combines interviews, physical examinations, and biological specimen collection. We pooled data from survey cycles spanning 1999–2023. Participants were included if they were aged 18–80 years, had a valid Mobile Examination Center (MEC) weight, and had non-missing systolic blood pressure and at least one glycaemic measure (FPG or HbA1c). Exclusion criteria were age <18 or >80 years, missing age or sex, missing or zero survey sampling weights, pregnancy (self-reported), and absence of both fasting glucose and HbA1c measurements. After applying these criteria, the analytic sample comprised 55,386 adults. Of these, 25,689 individuals had complete metabolic panels—comprising fasting plasma glucose (FPG), HbA1c, HOMA-IR, BMI, waist circumference, systolic and diastolic blood pressure, total and HDL cholesterol, and C-reactive protein—and were therefore included in the SEM analyses. To quantify potential selection bias from complete-case analysis, we compared the complete-case subsample with excluded participants: the groups did not differ materially in age (mean 49.1 vs. 48.3 years) or sex distribution (52.1% vs. 53.6% female), although excluded participants were somewhat more likely to be non-fasting at examination and had slightly lower income-to-poverty ratios. These differences are consistent with the fasting subsample protocol rather than systematic selection.
For the purposes of phenotypic stratification, participants were classified into four mutually exclusive groups: neither hyperglycaemia nor hypertension (n = 36,554), hyperglycaemia-only (FPG ≥ 126 mg/dL or HbA1c ≥ 6.5%; n = 1303); hypertension-only (SBP ≥ 140 mmHg, DBP ≥ 90 mmHg, or currently on antihypertensive medication; n = 15,629), and both conditions (n = 1900). These groups served as the basis for descriptive analyses and propensity score procedures but were not used as analytical categories in the SEM, which modelled continuous and latent constructs throughout.
2.2. Directed Acyclic Graph Construction and Testing
A directed acyclic graph was constructed a priori based on established physiological relationships, prior epidemiological literature [
8], and biological plausibility constraints. The DAG encodes assumptions about which variables are common causes (confounders), mediators, colliders, or instruments in the glucose-BP system [
9]. Key nodes included age, sex, BMI, waist circumference, HOMA-IR, FPG, HbA1c, total and HDL cholesterol, systolic BP, diastolic BP, CRP, and antihypertensive medication use. Directed edges were drawn according to temporally plausible biological mechanisms: for example, adiposity (BMI, waist) precedes insulin resistance (HOMA-IR), which in turn influences both glycaemic status and BP through distinct pathways. The resulting DAG is displayed in
Figure 1. An overview schematic of the full analytical pipeline—data harmonisation, DAG conditional-independence testing, structural equation modelling, propensity-score matching, IPW with the doubly robust AIPW extension, sensitivity analyses (E-value, Rosenbaum bounds), and Framingham external validation—is provided as
Figure S1 in Supplement S3.
Once the DAG was specified, its implied conditional independencies were derived algorithmically using the d-separation criterion [
10]. Fifty-one such implied independencies were identified and subjected to empirical testing through partial correlation analysis, conditioning on the appropriate adjustment sets as prescribed by the DAG. A dependency was considered empirically inconsistent with the DAG if the conditional partial correlation differed significantly from zero (two-sided
p < 0.05 after Bonferroni correction). An important caveat in large samples is that trivially small partial correlations can achieve statistical significance; we therefore report both
p-values and effect sizes (partial r) in
Section 3.2 and interpret practical significance as |partial r| > 0.05. Of the 51 independencies tested, only 5 were consistent with the DAG structure. While this high rejection rate partly reflects the statistical power of
n > 25,000 to detect negligible associations, it also indicates the genuine residual complexity in the metabolic network. These findings motivated a latent-variable SEM representation that could accommodate the rich intercorrelation among observed indicators rather than assuming their conditional independence.
2.3. Structural Equation Modelling
SEM was implemented as a two-component model comprising a measurement sub-model (confirmatory factor analysis) and a structural sub-model specifying directional paths among latent constructs. Three latent variables were specified based on domain knowledge: (i) MetS, a metabolic syndrome severity factor with BMI, waist circumference, and HOMA-IR as indicators; (ii) Glycaemic, a glycaemic status factor with FPG and HbA1c as indicators; and (iii) BPState, a blood pressure state factor with SBP and DBP as indicators. Age and sex were included as observed exogenous covariates with direct paths to all three latent constructs.
The structural sub-model encoded the hypothesised bidirectional relationship as a pair of cross-lagged paths between Glycaemic and BPState, while MetS was positioned as a common upstream cause of both. Identification of bidirectional paths from cross-sectional data is a recognised challenge. The model was identified through three classes of constraints: (a) scaling restrictions fixing one indicator loading per factor to unity; (b) zero covariances among structural disturbances of distinct latent variables, implying that all shared variance between Glycaemic and BPState passes through either MetS or the direct cross-paths; and (c) the constraint that MetS influences glycaemic and haemodynamic outcomes only through latent factors, not through individual indicator-level paths. Under these restrictions the structural model satisfies the rank condition for identification. We acknowledge that alternative equivalent models exist; the chosen specification is defended on theoretical grounds (biological plausibility of the MetS common-cause structure) rather than purely statistical criteria. All parameters were estimated using maximum likelihood with robust standard errors (MLR) to accommodate non-normality of the metabolic indicators.
The primary equations of the structural sub-model, expressed in standardised form, are as follows. For the glycaemic latent variable:
and for the blood pressure state latent variable:
The corresponding fitted SEM path diagram is shown in
Figure 2. where
η denotes standardised latent factor scores,
β and
γ are standardised path coefficients, and
ζ are structural disturbances assumed to be uncorrelated with exogenous variables. Model fit was evaluated using the comparative fit index (CFI), Tucker–Lewis index (TLI), root mean square error of approximation (RMSEA), and standardised root mean square residual (SRMR). Conventional thresholds of CFI/TLI > 0.90 and RMSEA < 0.08 were applied, with the recognition that large-sample chi-square statistics are routinely inflated and should not be interpreted in isolation.
2.4. Propensity Score Matching
PSM was applied separately for each hypothesised causal direction to estimate average treatment effects on the treated (ATT). For the glucose → BP direction, treatment was defined as FPG ≥ 126 mg/dL (fasting hyperglycaemia), and the outcome was systolic blood pressure. For the BP → glucose direction, treatment was defined as SBP ≥ 140 mmHg, DBP ≥ 90 mmHg, or current antihypertensive medication, and the outcome was fasting plasma glucose. In both analyses, propensity scores were estimated using logistic regression with age, sex, BMI, waist circumference, HOMA-IR, total cholesterol, HDL cholesterol, CRP, and smoking status as covariatesselection was based on DAG adjustment sets to block backdoor paths without conditioning on colliders or mediators.
Nearest-neighbour matching without replacement was performed using a calliper of 0.2 standard deviations of the logit-transformed propensity score [
11]. Covariate balance was assessed using standardised mean differences (SMD), with SMD < 0.1 considered indicative of adequate balance [
12]. For the glucose → BP analysis, 1771 matched pairs were retained from 1947 treated cases and 30,575 controls, achieving balance on all 14 covariates. For the BP → glucose analysis, 2841 matched pairs were retained from 5172 treated cases and 10,949 controls, with 12 of 13 covariates meeting the balance criterion. Love plots displaying pre- and post-matching SMD distributions are shown in
Figure 3 and
Figure 4.
Post-matching treatment effects were estimated using paired
t-tests on the matched sample, with 95% confidence intervals derived by bootstrap resampling (1000 iterations). Overlap in propensity score distributions between treated and control groups before and after matching is illustrated in
Figure 5.
2.5. Inverse Probability Weighting and Doubly Robust Estimation
To complement PSM with population average treatment effect (ATE) estimates, IPW was applied using stabilised weights constructed from the same propensity score models [
13]. Extreme weights were trimmed at the 1st and 99th percentiles to reduce the variance inflation.
Doubly robust augmented IPW (AIPW) estimation was additionally implemented, combining the propensity score model with an outcome regression model to achieve consistent ATE estimates if either, but not necessarily both, models are correctly specified [
14]. The outcome models were fitted using gradient-boosted regression trees (scikit-learn GradientBoostingRegressor) with the following hyperparameters selected via 5-fold cross-validation: learning rate 0.05, maximum tree depth 4, 500 estimators, and minimum 20 samples per leaf. Early stopping on validation loss was used to prevent overfitting. Standard errors for all weighted estimators were obtained via the sandwich variance estimator with 500 iteration bootstrap verification.
2.6. Sensitivity Analyses
Two complementary sensitivity analyses were conducted to assess the robustness to unmeasured confounding. First, E-values were calculated for each primary effect estimate using the method of VanderWeele and Ding [
15]. The E-value quantifies the minimum strength of association, on the risk ratio scale, that an unmeasured confounder would need to have with both the treatment and the outcome to fully explain away the observed effect. Second, Rosenbaum’s sensitivity analysis for matched studies was performed, estimating the maximum degree of the unobserved selection bias Γ that the design could tolerate while maintaining statistical significance at
p < 0.05 [
16]. Higher Γ values indicate greater robustness. Together, these approaches provide explicit bounds on the magnitude of unmeasured confounding required to invalidate the study’s conclusions.
2.7. External Validation
To evaluate whether the SEM structural path directions obtained from NHANES generalise to an independent cohort, the same measurement and structural model were re-fitted on the Framingham Heart Study data (n = 4240 participants with complete panels). An important limitation is that Framingham lacks several NHANES variables (HbA1c, HOMA-IR, waist circumference, CRP), so the measurement model was adapted to use the available subset of indicators. Consequently, the bidirectional Glycaemic ↔ BPState paths could not be directly replicated in the latent-variable framework. Framingham provides longitudinal phenotyping with repeated measurements of glucose, blood pressure, and anthropometry, enabling a partial check on the cross-sectional NHANES results. The primary criterion for validation was directional consistency of those standardised path coefficients estimable in both datasets; formal equivalence testing was not pursued given the different survey designs and demographic compositions of the cohorts.
2.8. Statistical Software
All analyses were conducted in Python 3.11. Structural equation modelling used the semopy 2.3.x library, following the estimation approach described by Rosseel [
17] for latent variable models. Propensity score estimation and matching followed the preprocessing framework of Ho et al. [
18], implemented using scikit-learn 1.7x, scipy 1.16x, and custom routines. IPW and AIPW were implemented following the counterfactual framework of Imbens and Rubin [
9]. DAG construction and d-separation testing drew on the formal theory of Spirtes et al. [
8]. Sensitivity analyses followed the formula of VanderWeele and Ding [
15] and Rosenbaum [
16]. All code and analytic scripts are provided in the
Supplementary Materials.
4. Discussion
This study presents a comprehensive causal inference framework for interrogating the bidirectional relationship between blood glucose dysregulation and blood pressure elevation in a large, nationally representative population sample. The triangulation of evidence across five methodological approaches—DAG testing, SEM, PSM, IPW/AIPW, and sensitivity analysis—consistently supports the existence of genuine causal effects in both directions, with the BP → glucose direction showing larger, more robust, and more precisely estimated effects than the forward glucose → BP pathway.
An essential interpretive point is that the different methods estimate fundamentally different causal parameters, which partly explains the variation in effect magnitudes across approaches. PSM estimates the average treatment effect on the treated (ATT), which is the expected effect among those with the condition. IPW estimates the average treatment effect (ATE) across the entire population. SEM estimates the structural path coefficients in a latent-variable framework, conditioning on shared metabolic variance. The convergence of all three approaches on a significant bidirectional relationship, despite their different targets of inference and identifying assumptions, constitutes the strongest form of triangulated evidence available from observational data.
The SEM findings deserve particular interpretive attention. The negative sign of the Glycaemic → BPState path (β = −0.1506) within the SEM, at odds with the positive PSM ATT in the same direction, arises from the conditioning structure of the model: once MetS severity is held constant as a latent confounder, the residual association between the glycaemic burden and blood pressure state becomes modestly negative. This is a formal consequence of conditioning on a common cause when both outcomes are components of that common cause and is analogous to the M-bias and collider-stratification phenomena extensively discussed in the causal inference literature [
20,
21]. The PSM estimate, which matches on observed covariates without imposing a structural model, yields the more intuitively interpretable positive ATT of 1.76 mmHg. Both estimates are formally valid within their respective frameworks; the discrepancy underscores the importance of specifying the causal quantity of interest before choosing an analytical method.
The BP → glucose effect magnitude (ATT = 6.55 mg/dL by PSM, ATE = 6.15 mg/dL by AIPW) is clinically substantial. Hypertension activates the RAAS, and angiotensin II has been shown to impair insulin-stimulated glucose uptake in skeletal muscle, reduce pancreatic beta-cell perfusion, and promote hepatic glucose production through aldosterone-mediated pathways [
3,
22]. These mechanisms predict precisely the magnitude of effect we observe: a 6–7 mg/dL elevation in FPG is consistent with the known shift toward impaired fasting glucose in hypertensive populations seen in prospective cohort studies [
23,
24] and aligns with the broader metabolic syndrome framework in which visceral adiposity [
25] and clustered cardiometabolic risk [
26] jointly drive both glycaemic and haemodynamic dysregulation. Intervention evidence supports this direction: in the NAVIGATOR trial, valsartan (an ARB) reduced the incidence of diabetes by 14% [
27]; in the HOPE trial, ramipril reduced new-onset diabetes by 34% [
28]; and the ALLHAT trial showed differential diabetes incidence across antihypertensive classes, with thiazides and beta-blockers increasing risk relative to ACE inhibitors. These trial findings align with our observational estimate that hypertension causally elevates glucose levels. Notably, our doubly robust AIPW estimate (6.15 mg/dL) represents a lower bound on the true ATE under the assumption of a correctly specified propensity score or outcome model, providing stronger causal guarantees than either PSM or IPW alone.
The cardiovascular consequences of hyperglycaemia extend beyond blood pressure to encompass direct cardiac injury. Recent evidence indicates that acute and chronic hyperglycaemia promotes myocardial oxidative stress, fibrosis, and diastolic dysfunction through mechanisms that overlap with the vascular pathways examined here [
29,
30]. Furthermore, hyperglycaemia has been associated with adverse cerebrovascular outcomes and increased stroke severity [
31] and with broader multi-organ complications that compound cardiovascular risk [
32]. These findings reinforce the clinical importance of the glucose → BP pathway identified in our study, even though its magnitude is smaller than the reverse direction: the vascular and cardiac consequences of even modest glycaemic elevations may be clinically significant in the long term.
The sensitivity analyses reveal an important asymmetry in the robustness of the two causal directions. The glucose → BP effect (E-value 1.40, Γ = 1.8) is relatively sensitive to unmeasured confounding: a moderately strong unobserved confounder—such as an unmeasured component of insulin resistance not captured by HOMA-IR—could plausibly attenuate this estimate. The BP → glucose effect (E-value 1.64, Γ = 2.5) is substantially more robust, requiring a stronger unmeasured confounder to explain away. This asymmetry is consistent with the view that RAAS-mediated glucose dysregulation is a more direct biological process than the vascular consequences of mild hyperglycaemia, which may operate through slower mechanisms such as endothelial dysfunction accumulation and arterial stiffening.
Our DAG testing results, showing 46 of 51 implied independencies rejected, warrant careful interpretation. The DAG should be understood as a useful approximation of the causal structure rather than a validated representation of the true data-generating process. A high rejection rate does not necessarily invalidate the DAG as a causal model; rather, it indicates that the true data-generating process has more edges (direct causal effects) than our a priori structure assumed or that unmeasured common causes induce residual correlations that appear as direct effects. The five consistent independencies—including the HOMA_IR–SEX conditional independence and the BMI–HDL conditional independence—provide confirmatory evidence for specific causal assumptions that are theoretically motivated and empirically supported; however, the overall pattern underscores that any tractable DAG is necessarily a simplification of the complex metabolic network.
Several limitations merit acknowledgment. First, NHANES is cross-sectional, precluding the direct observation of temporal precedence between glucose and blood pressure changes. While PSM and IPW provide a formal causal framework, identifying bidirectional effects from cross-sectional data requires a strong assumption: that the observational distribution at time of measurement reflects a system in quasi-equilibrium; that is, that the metabolic relationships have reached a stable steady state at the time of measurement, such that cross-sectional associations approximate the long-run causal structure. This assumption, while common in structural equation modelling of chronic disease [
9,
20], is ultimately untestable and represents the principal threat to causal interpretation. Longitudinal data with repeated measurements would provide stronger causal evidence, and the Framingham validation using partially repeated data is a step in this direction. Second, the propensity score models, though comprehensive, cannot account for unmeasured confounders such as diet quality, physical activity intensity, sleep quality, or medication non-adherence patterns, all of which may jointly influence glucose and blood pressure. The E-value analyses provide explicit bounds on how large these unmeasured effects would need to be to overturn our conclusions. Third, the SEM model fit was below conventional thresholds, which, while attributable to large-sample sensitivity, indicates that the structural model is a simplification of the true data-generating process. The structural path estimates should be interpreted as confirmatory tests of specific hypotheses rather than as a complete characterisation of the metabolic network. Accordingly, the standardised path coefficients reported in
Table 3 should be read primarily as direction-of-effect indicators rather than as well-identified point estimates: their magnitudes carry limited precision, and quantitative interpretation of any single path coefficient should be tempered by the converging PSM, IPW and AIPW estimates that target the same causal contrasts under different identifying assumptions. We therefore base substantive conclusions about the direction and approximate size of the bidirectional glucose–BP effects on the triangulated evidence rather than on individual SEM coefficients in isolation.
The methodological contribution of this work lies in its demonstration that a principled causal inference framework—moving from DAG specification through multiple estimation methods to explicit sensitivity analysis—can be applied to large epidemiological datasets to yield interpretable, robust causal estimates. This template may be applicable to other bidirectional relationships in chronic disease epidemiology, including the obesity–depression cycle, the sleep–metabolic syndrome relationship, and the chronic kidney disease–hypertension feedback loop, though its performance in those settings would need to be evaluated empirically. In each of these cases, the question of causal direction has direct implications for intervention design: if A causes B, targeting A should reduce B; if the relationship is bidirectional, simultaneous targeting of both may be more effective than sequential treatment.
For clinical practice, our findings suggest that hypertension management may have underappreciated benefits for glycaemic control beyond the glucose-lowering properties of specific antihypertensive agents such as ACE inhibitors and ARBs that block RAAS [
27,
28]. Conversely, glycaemic optimisation may modestly reduce blood pressure through attenuation of advanced glycation end-product-mediated vascular stiffening, though the effect magnitude is smaller and more sensitive to residual confounding. These findings support integrated cardiometabolic risk management strategies that treat glucose and blood pressure as jointly determined outcomes rather than independent therapeutic targets.