2. Methods
2.1. Study Overview
We conducted a retrospective, single-center observational study to evaluate the association between PFO detected on routine TTE and cerebral WMH identified on brain MRI. The study was designed to assess whether literature-derived associations between PFO and WMH are reproducible in routine clinical practice and to determine whether these signals can be translated into patient-level risk estimates.
To address this objective, we implemented a PAMAP framework that integrates evidence from published studies with real-world clinical data. The framework was structured to (i) align exposure definition with routine clinical phenotyping (TTE-detected PFO patients), (ii) incorporate literature-derived effect estimates as fixed model components, and (iii) evaluate transportability of these associations at both the cohort and patient levels prior to any local model updating.
All analyses were conducted using a predefined workflow, with subsequent steps including evidence assembly and tiering, model specification and locking, transportability testing, and minimal refit. This retrospective observational study was reviewed and approved by the Institutional Review Board of JSC “Medicine”. The requirement for informed consent was waived due to the retrospective nature of the study, which involved analysis of existing de-identified clinical and imaging data and posed minimal risk to participants.
2.2. Study Population
We used data from a retrospective, single-center, case–control study that included patients admitted to our institution between January 2015 and December 2022 as the local transport cohort. A schematic of the patient selection process is shown in
Figure 1, and detailed inclusion and exclusion criteria are summarized in
Supplement Section S1. The PFO group consisted of consecutive patients aged 18–80 years with a PFO confirmed by transthoracic echocardiography. Patients with PFO who did not undergo brain magnetic resonance imaging (MRI) were excluded. The control group included consecutive patients without PFO who were evaluated or hospitalized during the same period, all of whom underwent both TTE and brain MRI. After applying the inclusion and exclusion criteria, the two cohorts were matched using propensity scores to reduce baseline differences. Demographic data, presenting symptoms, comorbidities, and medical histories were retrospectively reviewed and obtained from the institutional digital health records.
2.3. Outcomes, Definitions, and Statistical Analysis
The primary MRI endpoint for PAMAP transport analyses was any WMH, defined as Fazekas grade ≥ 1 on FLAIR. The Fazekas grading system classifies white-matter hyperintensities on MRI according to lesion extent and confluence, ranging from grade 0 (absence of lesions) to grade 3 (large confluent abnormalities involving extensive white-matter regions). In the present study, WMH positivity was defined as Fazekas grade ≥ 1. MRI interpretation was performed independently and blinded to echocardiographic findings, including PFO status, to reduce observational bias during WMH assessment.
Descriptive analyses used the matched cohort of 149 patients (112 controls and 37 PFO-positive patients). Continuous variables are presented as mean ± SD and were compared with Welch’s t test; categorical variables were compared with Fisher’s exact test. All p values are two-sided and should be interpreted descriptively given the retrospective matched design.
2.4. Imaging and Echocardiography Standards
MRI outcomes were defined according to STRIVE criteria and graded by the Fazekas scale. Echocardiographic parameters, including diastolic function indices, followed ASE recommendations. Detailed TTE and MRI acquisition protocols are provided in
Supplement Sections S2 and S3. These harmonized standards were used to align the literature-trained model with the institutional validation cohort.
2.5. PAMAP Framework Overview
The Practice-Anchored, Literature-Trained, Mode-Aware Predictive (PAMAP) framework is a prespecified 4-step analytical approach designed to translate literature-derived associations into clinically interpretable, patient-level risk estimates. It aligns exposure definitions with routine clinical phenotyping (TTE-detected PFO), incorporates meta-analytic effect estimates as fixed model components, and evaluates transportability of these associations to real-world cohorts.
The model defines the exposure as PFO detected on routine TTE—the phenotype cardiologists use in real-world care—rather than research-only shunt quantification. This aligns the analytical framework with the exposure used in our single-center dataset.
2.5.1. Systematic Assembly and Tiering of PFO–MRI Literature
To establish a transparent, evidence-based foundation, we systematically collected and organized published studies examining the relationship between PFO and WMH/WMLs.
A structured search was performed using PubMed and Scopus databases (search window through March 2025: 1997–2025, reflecting the period from early mechanistic studies (Knauth et al., 1997 [
3]) through modern quantitative MRI analyses (Niiyama et al., 2024 [
10]) with combinations of the following terms: “patent foramen ovale” OR “right-to-left shunt” OR “atrial septal aneurysm,” AND “white-matter lesions” OR “leukoaraiosis” OR “white matter hyperintensities” OR “magnetic resonance imaging” OR “stroke” OR “migraine.” Inclusion required human studies reporting both PFO detection method (TTE, TEE, or TCD) and MRI brain findings (qualitative or quantitative). Abstract screening and full-text review were performed independently by two reviewers, with consensus adjudication.
Studies were screened and categorized into three evidence tiers according to methodological rigor and population characteristics:
Tier 1—Mechanistic or research-grade: Controlled or mechanistic studies using contrast-enhanced transesophageal (TEE) or transcranial Doppler (TCD) shunt quantification and standardized MRI lesion scoring.
Tier 2—Clinical or hospital-based: Consecutive symptomatic cohorts (e.g., stroke, migraine, or unexplained neurological symptoms) assessed by TEE/TTE and clinical MRI.
Tier 3—Population or low-intensity detection: Community-based or registry cohorts, typically using non-contrast TTE with simplified WMH assessment.
For each study, key parameters were extracted: patient type, imaging modality, PFO detection method, and reported effect sizes (e.g., odds ratios or mean WMH burden differences). Effect sizes were standardized and aggregated by tier to describe the consistency and directionality of the PFO–WMH relationship across different study designs. This structured tiering enabled the integration of heterogeneous evidence into a unified analytical framework while making the results interpretable for clinicians.
2.5.2. Literature-Trained and Locked Coefficient Building Based upon the Tiered Synthesis
Literature-anchored model—termed The PAMAP specification—was developed. The PAMAP framework was designed to represent mechanistically distinct but clinically recognizable pathways by which PFO may contribute to cerebral MRI abnormalities. Rather than developing a de novo local model. PAMAP coefficients were grounded in meta-analytic evidence, ensuring that estimated relationships reflected previously validated literature rather than single-center statistical fitting.
2.5.3. Mode-Aware Risk Decomposition
PAMAP risk was partitioned into three interpretable modes that correspond to clinically meaningful mechanisms:
H-mode (Shunt/Hypoxemia): Capturing right-to-left shunt burden, transient pressure gradients, and oxygen desaturation linked to PFO patency. To represent the shunt/hypoxemia (H) mode, we synthesized count-eligible studies [
1,
4,
5,
6] using a random-effects meta-analysis to derive a pooled odds ratio for the association between PFO and WMH/WMLs. This pooled estimate was then locked as the H-mode coefficient without further tuning before testing on the institutional cohort.
E-mode (Embolic Context): Reflecting embolic potential from paradoxical embolism or coexisting venous thromboembolism traversing the PFO [
2,
11].
A-mode (Atrial/Diastolic Stress): Encompassing atrial enlargement, diastolic dysfunction, or atrial cardiomyopathic remodeling that may coexist with or be potentiated by PFO physiology [
9,
10].
Each mode incorporated literature-derived variables (e.g., shunt size, migraine or stroke phenotype, left atrial strain, E/e′ ratio). By combining a structured evidence synthesis (Step 1) with a clinically interpretable, literature-anchored model (Step 2), this approach bridges research findings with bedside decision support. Thus, it enables clinicians to understand PFO-related MRI findings through mechanistic and hemodynamic lenses—without requiring complex statistical retraining or purely data-driven inference. Model coefficients were locked before local validation to preserve external validity, ensuring that subsequent analyses tested the transportability of literature-based associations rather than overfitting to local data.
After locking the PAMAP specification to published meta-analytic evidence, we evaluated how well the expected prevalence and directionality of PFO-related MRI findings translated to the institutional cohort. This step assessed average-level (group-wise) model transportability, meaning whether the general relationships observed in the literature could be reproduced in real-world patients before any local model adjustment. Two complementary tests were performed:
Cohort-level comparison: Expected versus observed prevalence of white-matter hyperintensities (WMH; primary endpoint defined as Fazekas grade ≥ 1) was compared between our institutional cohort and the pooled reference value derived from the random-effects meta-analysis. Expected prevalence in the PFO-positive group was calculated from the control-group prevalence (p0) and the pooled odds ratio using p1 = (OR × p0)/(1 − p0 + OR × p0). Observed values were derived from the matched TTE-detected PFO cohort.
Subgroup-Level Concordance: Differences in mean WMH burden and selected echocardiographic markers (e.g., left atrial size, E/e′ ratio, and PTFV1 when available) were evaluated between PFO-positive and PFO-negative groups.
We quantified the directional consistency (same versus opposite direction) and the magnitude ratio (observed effect ÷ expected effect) to determine the degree of alignment with the published evidence.
By doing so, we tested whether literature-derived trends—such as higher WMH burden in PFO-positive patients—remained evident when applied to a routine, TTE-based clinical population.
Following average-level validation, individual-level translation was evaluated to test whether PAMAP could reliably predict MRI outcomes for individual patients:
Pre-Refit Evaluation: The model’s predicted probability of WMH was compared against observed MRI findings. Calibration was quantified using the Brier score, calibration slope, and observed-to-expected ratio, while discrimination was assessed by the area under the receiver-operating characteristic curve (AUC).
Minimal refit and post-refit evaluation: We first assessed calibration of the locked model using intercept, slope, Brier score, and AUC. We then performed a parsimonious refit that re-estimated only the pre-specified Age, H, E, and A terms; no new variables or interaction terms were introduced. Post-refit metrics were compared to determine whether limited local updating improved fit while preserving the literature-anchored structure.
2.6. Use of Generative Artificial Intelligence
Generative artificial intelligence (GenAI) tools were used exclusively to assist with language refinement, editorial organization of manuscript text, and preparation of the graphical abstract. No GenAI tools were used for study design, data collection, data curation, statistical analysis, neuroimaging interpretation, echocardiographic interpretation, model development, or generation of scientific conclusions. All scientific content, methodological decisions, data analyses, interpretations, and conclusions were independently developed, reviewed, and approved by the authors.
4. Discussion
Meta-analysis remains essential for establishing whether an association is present; however, it is inherently limited for clinical decision-making because it yields pooled effect estimates rather than calibrated probabilities at the patient level. In the context of PFO and WMH, this limitation is particularly relevant given the heterogeneity of populations, diagnostic approaches, and clinical phenotypes.
The PAMAP framework addresses this translational gap by using pooled evidence as an anchor while preserving mechanistic interpretability through its component domains (age, shunt, embolic, and atrial modes). Rather than replacing evidence synthesis, PAMAP extends it by testing whether literature-derived signals transport to routine TTE-defined PFO and by converting pooled associations into calibrated, patient-level risk estimates.
In this study, the literature-derived shunt signal remained applicable at both the cohort and individual levels, supporting the premise that heterogeneous evidence can be translated into clinically meaningful predictions when appropriately structured. Importantly, the persistence of the shunt/hypoxemia component after minimal refit suggests that right-to-left shunting represents a stable and transferable determinant of WMH risk, even in routine clinical phenotypes defined by standard TTE. Clinically, at the bedside, PFO should be interpreted using this structured approach, incorporating (i) shunt evidence, (ii) embolic plausibility, and (iii) atrial/diastolic dysfunction features. PAMAP, as an instrument, operationalizes this by combining H–E–A domains into a unified probability estimate, enabling transition from PFO presence to quantified, patient-specific WMH risk. Its applicability, not re-estimation alone, is key to converting meta-analytic signals into actionable clinical tools.
4.1. Pathophysiological Context
Clinical symptoms were nonspecific in both groups and should be interpreted cautiously, given the retrospective, descriptive design. In contrast, the most reproducible signal in this study was the imaging phenotype itself—WMH—which represents cumulative cerebral injury related to small-vessel disease, embolic phenomena, and impaired perfusion, and is associated with subsequent stroke, cognitive decline, and functional impairment [
14].
In the setting of PFO, several interacting mechanisms may contribute to this phenotype. Right-to-left shunting provides a direct pathway for paradoxical embolism or microembolization into the cerebral circulation. In parallel, shunt-related hypoxemia may impair cerebrovascular autoregulation and reduce perfusion reserve, further predisposing to white-matter injury.
Importantly, the observed differences in atrial size and diastolic indices suggest an accompanying atrial–hemodynamic substrate consistent with early atrial cardiomyopathy [
15]. Such a substrate may promote blood stasis within the left atrium, increasing susceptibility to embolic phenomena even in the absence of clinically overt atrial fibrillation.
Taken together, these findings support a multi-component model in which shunt physiology, atrial remodeling, and cerebral perfusion abnormalities converge to produce the WMH phenotype. This integrated framework is consistent with the structure of the PAMAP model and provides biological plausibility for the observed transportability of the PFO–WMH association in routine clinical practice.
4.2. Cardiac Structural and Echocardiographic Correlates
Although PFO in this study was identified using routine TTE, its association with WMH suggests that even hemodynamically mild forms may be accompanied by intermittent interatrial flow and subtle abnormalities in diastolic function. These findings point to a potential overlap between PFO and an underlying atrial–hemodynamic phenotype.
Compared with controls, patients with PFO demonstrated higher left atrial and left ventricular indices, including LAD, LAVI, LVMI, and pulmonary artery pressure, although values remained within reference ranges. The consistent directionality of these differences supports the presence of early atrial remodeling.
Taken together, these observations are consistent with an emerging model in which PFO interacts with an atrial cardiomyopathy, rather than functioning solely as an isolated anatomical conduit [
15]. This interpretation aligns with the broader concept of a cardio-cerebral axis linking atrial dysfunction, shunt physiology, and cerebral white-matter injury.
4.3. Modeling Framework and Transportability
A central challenge in clinical translation is determining whether associations derived from published studies apply to local patient populations and can inform individual risk. Meta-analyses provide pooled effect estimates across studies, whereas single-center analyses reflect local experience; however, these approaches operate in parallel and do not directly address whether literature-derived signals are transferable to routine clinical practice.
To bridge this gap, we adopted a literature-trained approach in which the effect size linking PFO to WMH was derived from prior studies and incorporated as a fixed (“locked”) coefficient. By design, this coefficient was not re-estimated using local data, allowing for direct testing of whether the published association remains valid in a real-world TTE-defined cohort.
This framework enables two complementary and clinically interpretable transportability assessments. First, average-level calibration compares the expected WMH prevalence—derived from the locked effect size—with the observed prevalence in the local cohort. Second, patient-level calibration evaluates agreement between predicted and observed outcomes using standard performance metrics, including calibration intercept and slope, Brier score, and the area under the receiver operating characteristic curve (AUC). Together, these steps provide a structured approach to determine whether literature-derived associations can be translated into clinically meaningful, patient-level risk estimates.
4.4. Clinical Interpretation
Our findings indicate that patients seen in TTE routine with PFO often already show MRI evidence of subclinical cerebral injury. In this context, the most consistent and reproducible signal was the WMH phenotype itself, rather than any specific symptom profile (but dizziness), emphasizing the importance of imaging-defined disease burden in individual risk assessment.
The PAMAP framework provides several clinically relevant advantages. First, it enables patient-level risk estimation by integrating routinely available variables, including age, PFO status on TTE, and echocardiographic indices. Second, the model preserves physiological interpretability by partitioning risk into shunt-related, embolic, and atrial components, thereby linking prediction to underlying mechanisms rather than treating risk as a purely statistical construct.
Third, the framework demonstrated applicability within a routine-care cohort, with close agreement between expected and observed WMH prevalence and acceptable calibration, suggesting that literature-derived associations can be meaningfully applied in real-world settings. At the same time, the need for external validation remains, particularly across diverse populations and imaging practices.
Finally, PAMAP is aligned with the phenotype clinicians actually encounter—PFO detected incidentally on standard TTE—thereby avoiding the selection biases inherent to research cohorts defined by TEE or TCD. In this way, PAMAP complements rather than replaces meta-analysis: whereas meta-analysis defines the existence of an association, this framework operationalizes that evidence into a clinically interpretable and transportable estimate of individual risk.
4.5. Clinical Implications
The central clinical question was whether the published PFO–WMH association retains relevance in routine cardiology practice. The PAMAP framework addresses this by combining baseline WMH prevalence with a small set of clinically accessible variables—age, PFO status on TTE, atrial rhythm context, and echocardiographic indices—to generate absolute risk estimates at the individual level.
This approach enables two complementary use cases. In settings without local outcome data, the locked model provides a direct translation of published evidence into bedside risk estimation. In settings with local data, limited recalibration using the same prespecified variables allows for refinement without departing from the original evidence structure.
By design, this framework shifts the role of meta-analysis from a terminal summary measure to an applicable component of possible clinical prediction. The locked model tests whether the literature signal survives in routine care, whereas the minimal refit quantifies the degree of local adaptation required. Because the variable set is fixed and mechanistically interpretable, the model can be implemented across a variety of clinical environments without sacrificing consistency with the underlying evidence base.
Future studies should evaluate whether mechanism-based phenotyping using the H–E–A framework may help identify patient subgroups with differing biological pathways or phenotypes of cerebrovascular vulnerability. In particular, prospective investigations are needed to determine whether patients characterized by prominent shunt physiology, embolic susceptibility, or atrial–hemodynamic substrate demonstrate differential risk trajectories or derive benefit from intensified surveillance, rhythm monitoring, medical therapy, or potentially PFO closure. However, the present findings are observational and hypothesis-generating and should not be interpreted as evidence supporting therapeutic intervention or closure strategies at this stage.
4.6. Study Limitations
1. This was a retrospective, single-center study, and PFO classification was based on routine transthoracic echocardiography color Doppler rather than on contrast-enhanced transesophageal echocardiography or transcranial Doppler, which may have led to underdetection of small or transient shunts. The relatively small number of PFO-positive patients limited statistical precision and increased the risk of overinterpretation, particularly for subgroup analyses, patient-level calibration, and the parsimonious refit. Accordingly, the present study should be interpreted as an exploratory, proof-of-concept transportability analysis rather than a definitive predictive modeling study.
2. Selection bias also represents an important limitation. Nearly half of initially identified PFO-positive patients did not undergo brain MRI and were therefore excluded, potentially enriching the analyzed cohort for individuals with neurological symptoms or greater clinical concern. Consequently, the observed WMH prevalence may not reflect the broader population of patients with incidentally detected PFO in routine clinical practice.
3. White matter hyperintensities were assessed using visual Fazekas grading rather than quantitative volumetric measures. In addition, the literature-derived shunt coefficient was based on a limited number (just 4) of currently available heterogeneous studies spanning different populations, imaging protocols, and ascertainment strategies, resulting in substantial statistical heterogeneity. Therefore, the pooled estimate should be interpreted as an approximate transportability anchor rather than a universally stable biological effect size.
4. The present analysis represents an internal assessment of transportability within a single clinical setting rather than formal external validation across independent cohorts**. Despite propensity matching, residual confounding remains plausible, particularly from vascular risk burden, referral patterns, shunt magnitude, subclinical atrial cardiomyopathy, and occult atrial fibrillation, which may influence both PFO status and WMH burden. Consequently, although the H–E–A framework is mechanistically informed and biologically grounded, the present observational analysis does not establish causality. Rather, the framework is intended to provide a clinically interpretable structure for contextualizing PFO-associated cerebral vulnerability and translating literature-derived associations into patient-level risk estimates.
5. A younger-age sensitivity analysis was considered because younger patients may represent a phenotype less confounded by age-related vascular disease and therefore more reflective of shunt-related mechanisms. However, the limited number of PFO-positive patients within age-stratified subgroups substantially reduced statistical stability and interpretability. Future multicenter studies with larger cohorts should specifically evaluate younger populations to better define the mechanistic contribution of PFO-associated shunting to WMH burden.
The limitations above underscore the need for prospective, multicenter studies with standardized imaging protocols, quantitative WMH assessment, and broader phenotypic and age characterizations to confirm the generalizability and clinical utility of the framework.