Next Article in Journal
DynamX Bioadaptor as an Emerging and Promising Innovation in Interventional Cardiology
Previous Article in Journal
Antibiotic-Loaded PMMA Beads for Recurrent Sternocutaneous Fistula: Expanding the Surgical Armamentarium in Post-Sternotomy Osteomyelitis: Case Report and Literature Review
Previous Article in Special Issue
Non-Celiac Villous Atrophy—A Problem Still Underestimated
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Comparative Effectiveness of Urine vs. Stool Gluten Immunogenic Peptides Testing for Monitoring Gluten Intake in Coeliac Patients: A Systematic Review and Meta-Analysis

by
Sarmad Sarfraz Moghal
1 and
Jonathan Soldera
2,*
1
MSc Gastroenterology, University of South Wales in Association with Learna Ltd., Pontypridd CF37 1DL, UK
2
Tutor for PGDip and MSc Program in Acute Medicine and Gastroenterology, University of South Wales in Association with Learna Ltd., Pontypridd CF37 1DL, UK
*
Author to whom correspondence should be addressed.
Life 2025, 15(10), 1548; https://doi.org/10.3390/life15101548
Submission received: 2 June 2025 / Revised: 10 September 2025 / Accepted: 10 September 2025 / Published: 2 October 2025

Abstract

Coeliac disease (CD) is a chronic immune-mediated enteropathy triggered by gluten ingestion in genetically predisposed individuals carrying HLA-DQ2 or HLA-DQ8 haplotypes, characterized by small intestinal mucosal damage and systemic manifestations. This systematic review and meta-analysis aimed to compare the effectiveness of urine versus stool GIPS testing for monitoring gluten intake in coeliac patients, providing evidence-based recommendations for clinical practice. A comprehensive literature search was conducted in databases like PubMed and Embase. Studies evaluating urine or stool GIPS testing in coeliac patients were included, focusing on sensitivity, specificity, and patient adherence. The meta-analysis included six studies with a total of 572 participants. The stool GIPS testing demonstrated a pooled sensitivity of 85.1% (95% CI: 79.0–89.9%) and a specificity of 92.5% (95% CI: 88.3–95.6%), making it highly reliable for detecting gluten exposure and ruling out false positives. It also achieved an AUC of 0.9853, indicating excellent diagnostic performance. In contrast, the urine GIPS testing showed a pooled sensitivity of 55.4% (95% CI: 49.6–61.2%) and a specificity of 73.0% (95% CI: 67.4–78.1%), with an AUC of 0.7898. The heterogeneity across the studies was significant (I2 > 80%), driven by variations in the population characteristics, sample handling, and testing protocols. These findings emphasize the need for standardized methodologies to enhance the reliability and comparability of results.

1. Introduction

CD is a chronic immune-mediated condition triggered by gluten ingestion in genetically predisposed individuals, primarily affecting the small intestine. It leads to histological alterations, such as villous atrophy, crypt hyperplasia, and increased intraepithelial lymphocytes, impairing nutrient absorption [1]. Diagnosis typically combines positive serological markers (tTG IgA or EMA) with a duodenal biopsy, though in children with strong serological responses, a biopsy may be omitted [1,2]. Over 95% of patients carry HLA-DQ2 or HLA-DQ8 haplotypes, though these are not disease-specific [3]. The global prevalence is estimated at 1%, with regional variations tied to genetics, dietary habits, and diagnostic access [4,5,6]. Improved serological testing and awareness of atypical presentations have increased the diagnosis rates, particularly in high-risk groups, such as relatives of patients and individuals with type 1 diabetes or autoimmune thyroid disease [1]. Yet, underdiagnosis persists, especially in developing regions [5]. Environmental changes, such as increased gluten consumption and altered gut microbiota, may also contribute to its rising incidence [6].
CD follows a variable course. If untreated, gluten exposure causes chronic small bowel inflammation, resulting in malabsorption, gastrointestinal symptoms, anaemia, osteoporosis, and even infertility [1,4]. Strict lifelong adherence to a gluten-free diet improves the prognosis, although histological recovery in adults may lag behind symptom improvement [7]. Residual symptoms, despite dietary adherence, and the high burden of non-compliance highlight the need for effective monitoring.
Monitoring gluten intake is crucial, as even small quantities can provoke mucosal inflammation [1,7]. Traditional tools include self-reported diet logs, monitoring via a dietetic service, serological tests, and duodenal biopsies. However, these methods are limited. Self-reporting is subjective and prone to recall and social desirability biases [1,7,8,9,10,11]. Serology (tTG IgA or DGP antibodies) is often used to assess dietary adherence, but may be normal despite ongoing low-level gluten exposure, and its sensitivity is reduced in IgA-deficient patients [3,4,7]. Biopsies provide definitive evidence of mucosal damage but are invasive and subject to sampling error, making them unsuitable for routine monitoring [1,4]. These limitations point to the need for more objective, real-time tools [6].
Gluten immunogenic peptides (GIPS) testing is a newer, direct method for detecting gluten exposure in coeliac patients. GIPS are fragments of gluten proteins resistant to digestion, triggering immune responses in HLA-DQ2/DQ8-positive individuals. These peptides are excreted in urine and stool, allowing for non-invasive detection of gluten intake [11,12]. GIPS testing is particularly useful for identifying unintentional or low-level exposure often missed by traditional methods [13]. However, clinicians should be aware of the risk of false positives [14]. Accidental cross-contamination, transient dietary slips, or variability in an individual’s metabolism can lead to isolated positive results that may not reflect sustained gluten ingestion. Thus, while GIPS testing offers high sensitivity, its findings should be interpreted alongside dietary assessments and repeated testing to confirm adherence.
Urine GIPS testing detects fragments absorbed and excreted within 6–8 h of gluten ingestion, offering a rapid, non-invasive snapshot of recent exposure [12]. While it is simple and convenient, the short detection window limits its use in identifying gluten consumed more than 48 h prior [11,12]. Conversely, stool GIPS testing identifies unabsorbed peptides eliminated over 4–7 days, making it more effective for detecting intermittent exposure [11,12]. Together, these methods allow for tailored monitoring based on individual patient needs.
GIPS testing is advantageous because it bypasses the need for patient recall and directly measures gluten exposure, complementing serology and biopsies. The research supports its sensitivity for detecting as little as 50 mg of gluten, a clinically relevant threshold [15]. Urine testing is quick and well accepted by patients, enabling home-based use and fast feedback [11]. However, stool testing, while valuable for its longer detection window, is more cumbersome due to sample collection, storage, and processing requirements. This may limit its acceptability and widespread use [11,12].
Initial studies validated the accuracy of urine GIPS tests for identifying recent gluten exposure and highlighted stool GIPS’s capacity to capture longer-term, low-level ingestion [11,12,13,16]. Real et al. (2014) [17] demonstrated stool GIPS’s ability to uncover hidden contamination even in asymptomatic patients on a GFD. Comparative studies between urine and stool GIPS have shown conflicting results. Urine testing is sensitive within 24–48 h post-ingestion but declines thereafter. Stool testing offers longer detection but may be influenced by bowel habits and transit time. Differences in immunoassay sensitivity and sample handling further complicate comparisons [16,17,18].
Several knowledge gaps limit routine adoption of GIPS testing. The correlations between GIPS levels and mucosal healing or symptom resolution remain unclear [13]. Additionally, studies vary in their designs and methodologies, making it difficult to establish consistent sensitivity thresholds or clinical cut-offs [12]. While most of the available research comprises short-term or cross-sectional studies and lacks evaluation of long-term outcomes, such as adherence or quality of life improvements [17], there is also emerging evidence suggesting a poor correlation between GIPS levels and mucosal recovery [19]. The integration of GIPS testing with serology and biopsy, along with its real-world cost-effectiveness, remains poorly explored [17]. Furthermore, patient perspectives—particularly regarding stool testing—need further investigation [13,18].
To address these gaps, a systematic review and meta-analysis should evaluate not only the comparative utility of urine versus stool GIPS testing, but also how these tests complement existing diagnostic and monitoring tools, such as serology and histology. Pooling the existing evidence may clarify their sensitivity, specificity, and practical roles in CD monitoring, including their cost-effectiveness in clinical practice. Establishing standard protocols and cut-off levels could guide clinicians in selecting the most appropriate test for different scenarios, such as recent versus intermittent gluten exposure. Evaluating the links between GIPS levels, mucosal healing, and symptom burden would support their clinical use. Additionally, a meta-analysis could help assess the barriers to adoption, such as cost, ease of use, and integration with existing monitoring tools. Finally, it could identify research gaps to guide future studies, focusing on longitudinal outcomes, standardization, and patient acceptability.

2. Methods

2.1. Data Collection

A systematic review and meta-analysis were conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines [5]. The literature search was designed to capture all relevant studies on the comparative effectiveness of stool and urine gluten immunogenic peptides (GIPS) testing for monitoring gluten intake among patients with coeliac disease. The databases searched included PubMed, Cochrane Library, EMBASE, and ClinicalTrials.gov.
This review was prospectively registered with the International Prospective Register of Systematic Reviews (PROSPERO), under the registration number CRD420250591573: Comparative Effectiveness of Urine vs. Stool GIP Testing in Monitoring Gluten Intake in Coeliac Patients: A Systematic Review and Meta-Analysis. The record is published and stable.
In addition, the PRISMA 2020 checklist (see Table S1) was completed to ensure transparent and comprehensive reporting of the systematic review and meta-analysis.

2.2. Eligibility Criteria

The eligibility criteria were established based on the Population, Intervention, Comparison, and Outcome (PICO) framework to maintain focus on the research question.

2.3. Inclusion Criteria

  • Participants: Studies including adult patients (≥18 years) diagnosed with CD using established clinical, serological, or histological criteria were eligible.
  • Intervention: Trials employing stool or urine GIPS testing to detect gluten exposure were included.
  • Comparison: Comparators could include traditional methods like dietary adherence scoring, serology, or other biomarkers, as well as head-to-head studies of stool versus urine GIPS.
  • Outcome Measures: Studies needed to report sensitivity, specificity, accuracy, patient adherence, or clinical outcomes as primary or secondary endpoints.
  • Study Design: Only RCTs, cohort studies, and cross-sectional studies with clear methodologies were included.

2.4. Exclusion Criteria

  • Study Design: Non-randomized trials, case reports, conference abstracts, and reviews without original data were excluded.
  • Language: Non-English-language studies were excluded due to resource constraints, though prior research indicates minimal impact of this restriction on outcomes.
  • Participants: Studies involving non-coeliac populations or those with confounding conditions, like significant alcohol consumption, other liver diseases, or untreated intestinal infections, were excluded to ensure specificity of results.
  • Intervention: Studies utilizing non-validated or non-standardized GIPS detection methods (e.g., in-house assays without demonstrated analytical validation per FDA/EMA guidelines, or methods lacking established sensitivity/specificity for gluten peptides detection) were excluded to ensure methodological consistency across included studies.
  • Outcome Reporting: Studies failing to report sensitivity, specificity, or other measurable outcomes of GIPS testing were excluded.

2.5. Search Strategy

The search command combined keywords and Medical Subject Heading (MeSH) terms with Boolean operators: (“Gluten Immunogenic Peptide” OR “GIPS”) AND (“Celiac Disease” OR “CD”). Additionally, a manual search of references in identified articles, prior systematic reviews, and specialized journals was conducted. This supplementary step ensured inclusion of all relevant randomized controlled trials (RCTs), observational studies, and other studies fitting the criteria. Articles were imported into the reference management software Rayyan version 1.6.3, which facilitated automatic deduplication and allowed for more efficient title and abstract screening [12].
Similar search strategies were applied to other databases, such as Cochrane Library, EMBASE, and ClinicalTrials.gov. Filters for RCTs and clinical trials were employed to refine the search results, prioritizing high-quality evidence. The search was conducted on 24 September 2024, ensuring the inclusion of the most up-to-date research available at the time.

2.6. Data Extraction

Data extraction was performed systematically to ensure consistency and completeness. A standardized Excel spreadsheet was used, with the following variables extracted for each study:
  • Participant Characteristics: Mean age, gender distribution, CD duration, and serological or histological status were recorded.
  • Study Design: Type of study (RCT, cohort, or cross-sectional), publication year, study duration, and geographic location.
  • Sample Size: Total number of participants and their allocation to intervention or control groups.
  • Intervention Details: Type of GIPS testing employed (stool or urine), detection thresholds, and testing protocols.
  • Comparator Details: Type of comparator used, whether traditional monitoring tools or another GIPS method.
  • Outcomes: Sensitivity, specificity, Diagnostic Odds Ratios, and secondary outcomes like patient adherence and practicality of testing method.
  • Safety Data: Any reported adverse events linked to GIPS testing, especially concerning ease of sample collection.
The extraction process was conducted by a single reviewer, with periodic consultation with a second reviewer for ambiguous data points. Discrepancies were resolved through discussion.

2.7. Data Management

All references were managed using Rayyan for systematic organization and removal of duplicates. Extracted data were stored in Excel spreadsheets, ensuring traceability and allowing for easy updates during subsequent analyses. Citation management was handled via MyBib, an online reference tool that ensured proper formatting and organization of sources.

2.8. Quality Assessment

The quality of included studies was assessed using the Cochrane Risk of Bias 2 (RoB 2) tool. This framework evaluates studies on several domains:
  • Randomization: Assessment of sequence generation and allocation concealment to minimize selection bias.
  • Blinding: Evaluation of blinding among participants, personnel, and outcome assessors to reduce performance and detection bias.
  • Attrition: Examination of completeness of outcome data and handling of missing data.
  • Reporting: Scrutiny for selective reporting of outcomes or incomplete data presentation.
Each study was rated as low risk, some concerns, or high risk. Visualizations of bias assessments were generated using robvis tool, which provided clear summary plots for reporting purposes [18,20].

2.9. Assessment of Heterogeneity

Heterogeneity among studies was assessed to understand variations in outcomes due to differences in study design, populations, or testing protocols. Statistical methods included:
  • Chi-Squared Test: Included in forest plots to test whether observed differences were due to chance.
  • I2 Statistic: Quantified proportion of variability due to heterogeneity rather than random error. Values were categorized as low (<30%), moderate (30–60%), or high (>60%) heterogeneity.

2.10. Data Analysis

The meta-analysis was conducted to evaluate the diagnostic performance of the included studies. Meta-Disc software (version 1.4) was used for all the statistical analyses. The sensitivity (Sn), specificity (Sp), and the area under the receiver operating characteristic curve (AUROC) were calculated to assess the diagnostic accuracy of the index test across the studies. Summary estimates of Sn and Sp were derived, and the corresponding summary receiver operating characteristic (SROC) curve was constructed. The heterogeneity of the results was evaluated through Cochran’s Q test and the I2 statistic, and sources of variability were explored where appropriate. Additionally, we performed subgroup analyses to investigate potential influencing factors and ensure robustness of the findings. All the analyses adhered to established methodological standards for diagnostic test accuracy reviews.

3. Results

3.1. Search Results

A comprehensive search was conducted using PubMed, identifying 77 records. After the initial screening and application of inclusion and exclusion criteria, 20 studies were sought for retrieval. Ultimately, 20 studies were included in the systematic review, but 12 of these were excluded from the meta-analysis. The reasons for their exclusion are detailed in Table 1.
The study selection process is summarized below.
A total of 77 records were retrieved for review. Following title and abstract screening, 57 studies were excluded based on the eligibility criteria. The reasons for exclusion included the following:
  • Irrelevant study focus, such as research on non-coeliac gluten sensitivity, non-gluten dietary components (e.g., FODMAPs), pharmacological treatments, or non-clinical outcomes (e.g., microbiome or genetic analyses without symptom correlation).
  • Population mismatch, including studies restricted to paediatric cohorts or non-coeliac participants.
  • Methodological limitations, such as inadequate comparison of stool versus urine GIPS detection, non-validated adherence assessments, or incomplete outcome reporting.
  • Insufficient data or duplicate publications.
  • Full-Text Review: After screening, 18 studies were identified as meeting the eligibility criteria and were included in the qualitative synthesis. Of these, 12 studies were excluded from the meta-analysis due to factors such as lack of comparable outcome measures or insufficient data for statistical pooling.
The study selection process is illustrated in the PRISMA flow diagram (Figure 1). The detailed reasons for exclusion from the meta-analysis are presented in Table 1.

3.2. Characteristics of Included Studies

The characteristics of the six included studies are summarized below and detailed in Table 2. These studies were published between 2017 and 2024 and involved diverse methodologies and populations, providing valuable insights into the utility of gluten immunogenic peptides (GIPS) detection for monitoring gluten exposure and dietary adherence in CD.
  • Study Design: The studies comprised prospective observational, interventional, and randomized controlled designs. Four studies were conducted at single centres, while two were multicentre in nature.
  • Population: A total of 572 participants were included across all the studies, with varying proportions of patients adhering to a gluten-free diet (GFD), newly diagnosed coeliac patients, and healthy controls. The participants were exclusively adults (18 years and above) with confirmed CD.
  • Intervention: All the studies used GIPS detection in urine or stool samples as the primary intervention. The detection methods included lateral flow tests (LFTs), enzyme-linked immunosorbent assays (ELISAs), and rapid immunochromatographic tests.
  • Comparison: The comparators included duodenal biopsy, serological markers (e.g., TTG, DGP, and AGA), symptom scores, dietary questionnaires, and placebo administration.
  • Outcomes Measured: The primary outcomes focused on the sensitivity and specificity of GIPS detection for gluten exposure, its correlation with symptoms and serological markers, and its role in assessing mucosal healing.
  • Results: The studies demonstrated the utility of GIPS detection in urine and stool for identifying gluten exposure, often revealing gluten intake missed by traditional tools. The stool GIPS showed a higher sensitivity compared to the urine GIPS in most cases, with significant correlations with the serological and histological findings.
The detailed characteristics of these studies are summarized in Table 2.

3.3. Risk of Bias and Heterogeneity

The risk of bias (RoB) for the six included studies was assessed using the Cochrane Risk of Bias (RoB 2.0) tool. This evaluation considered the following domains: bias arising from the randomization process, bias due to deviations from intended interventions, missing outcome data, measurement of outcomes, and selection of reported results. A traffic-light plot and summary plot are provided in Figure 2 and Figure 3, visually summarizing the risk levels across these domains and the overall quality of the studies.

3.3.1. Traffic-Light and Summary Plots (Figure 2 and Figure 3)

  • The traffic-light plot illustrates the risk levels across the individual domains for each study. Most of the studies demonstrated a low risk of bias in critical domains, such as randomization and measurement of outcomes. However, the following were noted:
  • Some concerns were noted in domains such as missing data and deviations from the intended interventions.
  • A high risk of bias was observed in one study due to selective reporting and insufficient blinding.
  • The summary plot aggregates the results across all the studies, showing that the overall bias levels were predominantly low, with a minority falling into some concerns and high-risk categories.

3.3.2. Findings

The risk of bias assessment revealed the following:
  • A low risk of bias in most of the studies, particularly in domains like randomization and measurement of outcomes.
  • Some concerns in certain studies regarding missing outcome data and deviations from the intended interventions.
  • A high risk of bias in one study due to selective reporting issues.

3.3.3. Heterogeneity

The heterogeneity among the studies was assessed using the I2 statistic during the meta-analysis. The findings revealed the following:
  • There was high heterogeneity (I2 > 80%) across the pooled sensitivity and specificity outcomes, reflecting significant variability among the studies.
  • The sources of heterogeneity included the following:
  • Variability in the sample populations, such as differences in gluten-free diet adherence and baseline characteristics.
  • Differences in the GIPS testing protocols, including detection thresholds, sample types (urine vs. stool), and timing.
  • Variability in the reported outcomes, such as mucosal healing, symptom resolution, and dietary adherence scores.

3.3.4. Subgroup Analyses

To explore and address the heterogeneity, subgroup analyses were conducted based on the following:
  • Type of sample (urine vs. stool).
  • Detection methods (ELISA vs. LFT).
  • Population characteristics, such as new versus long-term CD diagnoses.

3.4. Primary Outcome: Sensitivity and Specificity of GIPS Testing

The primary outcome of this meta-analysis was to evaluate the diagnostic performance of gluten immunogenic peptides (GIPS) testing in stool and urine for detecting gluten exposure in patients with CD adhering to a gluten-free diet. The sensitivity and specificity were analysed across multiple studies to determine the reliability of these tests.

3.4.1. Stool GIPS Testing

Sensitivity
The pooled sensitivity for stool GIPS testing was 85.1% (95% CI: 79.0–89.9%), reflecting a high ability to detect true positives. However, individual studies demonstrated considerable variability, with sensitivity estimates ranging from 54.5% [36] to 100% [35]. This variability contributed to significant heterogeneity (I2 = 93.4%), likely attributable to differences in study design, population characteristics, or GIPS assay methodology. These findings are illustrated in Figure 4.
  • Specificity:
    (a)
    The pooled specificity was 92.5% (95% CI: 88.3–95.6%), reflecting a high capacity to identify true negatives.
    (b)
    The specificity values ranged from 84.0% to 100% across the included studies, with moderate heterogeneity (I2 = 81.0%). These findings are depicted in Figure 5.
  • Summary Receiver Operating Characteristic (SROC) Curve:
    (a)
    The area under the SROC curve (AUC) was 0.9853, demonstrated in Figure 6 shows excellent diagnostic performance.

3.4.2. Urine GIPS Testing

  • Sensitivity:
    The pooled sensitivity for urine GIPS testing was 55.4% (95% CI: 49.6–61.2%), demonstrating a moderate ability to detect true positives.
    As shown in Figure 7, the sensitivity varied significantly across the studies (I2 = 96.0%), with individual values ranging from 25.5% [36] to 96.9% [34].
  • Specificity:
    The pooled specificity for urine GIPS testing was 73.0% (95% CI: 67.4–78.1%), lower than stool testing but still indicating a moderate reliability for ruling out false positives.
    Significant heterogeneity was observed (I2 = 93.5%). This is illustrated in Figure 8.
  • Summary Receiver Operating Characteristic (SROC) Curve:
    • The area under the SROC curve (AUC) was 0.7898, demonstrating moderate diagnostic performance. This presented in Figure 9.

3.4.3. Outcome Conclusion

Stool GIPS testing demonstrates superior diagnostic accuracy compared to urine GIPS testing for detecting gluten exposure in coeliac patients. With a pooled sensitivity of 85.1% (95% CI: 79.0–89.9%) and specificity of 92.5% (95% CI: 88.3–95.6%), alongside an AUC of 0.9853, stool GIPS testing is highly reliable for identifying both true positives and true negatives, making it a reliable modality for monitoring dietary adherence. In contrast, urine GIPS testing shows moderate diagnostic performance, with a pooled sensitivity of 55.4% (95% CI: 49.6–61.2%), specificity of 73.0% (95% CI: 67.4–78.1%), and an AUC of 0.7898. This suggests that while urine GIPS testing may not be as accurate, it still has potential as a non-invasive alternative in certain patient populations. The significant heterogeneity observed in both methods (I2 = 93.4% for stool sensitivity; I2 = 96.0% for urine sensitivity) highlights the need for standardization in testing protocols and study designs. Overall, stool GIPS testing provides clinicians with a robust tool for detecting even intermittent gluten exposure, facilitating timely dietary interventions and improving patient outcomes. Urine GIPS testing, although less reliable, may complement stool testing in resource-limited or patient-specific scenarios.

3.5. Secondary Outcomes: Clinical Efficacy of GIPS Testing

The secondary outcomes explored the practical utility of GIPS testing in the clinical management of CD. This includes its role in detecting gluten exposure and guiding dietary interventions for patients adhering to a gluten-free diet. The effectiveness of stool and urine GIPS testing was evaluated based on diagnostic performance metrics, such as Diagnostic Odds Ratios (DORs), Likelihood Ratios (LR+ and LR−), and the clinical relevance of these measures.

3.5.1. Stool GIPS Testing

  • Diagnostic Power:
    • Stool GIPS testing demonstrated a high pooled Diagnostic Odds Ratio (DOR) of 140.66 (95% CI: 11.05–1790.5), underscoring its strong capacity to discriminate between patients with and without gluten exposure.
    • This high DOR reflects the reliability of stool GIPS testing for identifying even intermittent gluten exposure, a critical challenge in the clinical management of CD. This is depicted in Figure 10.
  • Positive Likelihood Ratio (LR+):
    • The LR+ of 10.997 (95% CI: 3.264–37.047) suggests that a positive test result makes gluten exposure approximately 11 times more likely. This high value emphasizes the test’s utility as a confirmatory diagnostic tool, particularly in patients suspected of dietary non-compliance. This is demonstrated in Figure 11.
  • Negative Likelihood Ratio (LR−):
    • The low LR− of 0.086 (95% CI: 0.011–0.680) indicates that a negative test result strongly reduces the likelihood of gluten exposure. This makes stool GIPS testing a reliable tool for ruling out gluten ingestion in compliant patients. This is presented in Figure 12.

3.5.2. Urine GIPS Testing Diagnostic Power

Urine GIPS testing showed a pooled DOR of 7.195 (95% CI: 1.996–25.937), reflecting moderate discriminatory power. While less robust than stool GIPS testing, it still has the potential to detect gluten exposure in specific clinical scenarios.
Its reduced diagnostic accuracy may result from lower GIPS concentrations in urine or variability in the timing of sample collection relative to gluten ingestion. This is shown in Figure 13.
4.
Positive Likelihood Ratio (LR+):
The LR+ of 2.11 (95% CI: 1.288–3.459) indicates that a positive urine GIPS test only modestly increases the likelihood of gluten exposure. This limits its reliability as a confirmatory diagnostic tool, especially in low-risk patients. This is depicted in Figure 14.
5.
Negative Likelihood Ratio (LR−):
The LR− of 0.433 (95% CI: 0.241–0.777) suggests moderate utility for ruling out gluten exposure but is less effective compared to stool GIPS testing. This is shown in Figure 15.

3.5.3. Secondary Outcomes Conclusion: Clinical Efficacy of GIPS Testing

Stool GIPS testing demonstrates excellent clinical utility in the management of CD by effectively identifying gluten exposure and guiding dietary interventions. With a Diagnostic Odds Ratio (DOR) of 140.66 (95% CI: 11.05–1790.5), stool GIPS testing exhibits strong discriminatory power, making it a highly reliable tool for distinguishing between exposed and non-exposed patients. Its Positive Likelihood Ratio (LR+) of 10.997 (95% CI: 3.264–37.047) underscores its role as a robust confirmatory test, particularly in patients suspected of dietary non-compliance. Furthermore, the Negative Likelihood Ratio (LR−) of 0.086 (95% CI: 0.011–0.680) highlights its effectiveness in ruling out gluten ingestion, ensuring clinicians can confidently monitor dietary adherence. These characteristics position stool GIPS testing as an invaluable diagnostic method in clinical practice.
Urine GIPS testing, while less clinically robust, still demonstrates utility in specific scenarios. With a pooled DOR of 7.195 (95% CI: 1.996–25.937), it offers moderate discriminatory power, though its diagnostic performance is significantly lower than that of stool testing. The LR+ of 2.11 (95% CI: 1.288–3.459) indicates a limited ability to confirm gluten exposure, while the LR− of 0.433 (95% CI: 0.241–0.777) reflects a moderate capacity to exclude exposure. These limitations may be attributed to lower GIPS concentrations in urine and variability in sample timing. Despite its reduced accuracy, urine GIPS testing remains a non-invasive alternative that can complement stool testing, particularly in patients unable or unwilling to provide stool samples.
In summary, stool GIPS testing is the preferred method due to its superior clinical efficacy, providing clinicians with a reliable and precise tool for managing CD. Urine GIPS testing, while less effective, may serve as a supplemental or alternative method in select cases, broadening the diagnostic options available for gluten exposure monitoring.

3.5.4. Safety Outcome

By pooling the adverse events reported across all six studies, it was found that a total of 12 (2.1%) adverse events were reported in the intervention groups (461 participants) and 3 (0.6%) adverse events in the control groups (165 participants), as summarized in Table 3. A random-effect model using the Mantel–Haenszel method (Figure 7) showed that there was no statistically significant increase in the odds of adverse events in the intervention groups compared to the control groups (OR 1.91; 95% CI 0.89–4.08; p = 0.09; I2 = 35%).

3.5.5. Adverse Events Analysis

Stool GIPS Testing:
A total of nine adverse events were reported, primarily related to mild gastrointestinal symptoms (e.g., vomiting) in gluten challenge interventions.
  • Urine GIPS Testing:
A total of three adverse events were documented, primarily abdominal pain in one study. Withdrawal Due to Adverse Events
The total number of participants who withdrew due to adverse events was 0 across all studies in both the intervention and control groups.

3.6. Sensitivity Analysis

A sensitivity analysis was conducted to evaluate the robustness of the meta-analysis results by examining the influence of individual studies on the pooled estimates. This analysis is essential to assess the stability of the results and to determine if any single study disproportionately affected the overall outcomes.

3.6.1. Methodology

The sensitivity analysis involved sequentially excluding one study at a time from the meta-analysis and recalculating the pooled estimates for both the sensitivity and specificity of stool and urine GIPS testing. The aim was to identify any significant changes in the results upon exclusion of each study, indicating potential bias or high influence of individual studies.

3.6.2. Sensitivity Analysis Results

  • Stool GIPS Testing:
    Pooled Sensitivity:
    The pooled sensitivity remained relatively stable after the exclusion of any individual study. The sensitivity ranged from 83.5% to 86.9%, with minimal changes in the overall estimate (I2 varied between 91.2% and 94.5%).
    Pooled Specificity:
    The pooled specificity also showed minimal variation, ranging from 91.4% to 93.3% across different exclusions, with the I2 between 79.5% and 82.0%.
    Impact of Exclusion:
    The study by Marta Garzón-Benavides (2023) [36], which reported a low sensitivity of 54.5%, had the greatest effect on the pooled sensitivity estimate, but it did not substantially alter the conclusion about the overall high accuracy of stool GIPS testing.
  • Urine GIPS Testing:
    Pooled Sensitivity:
    The pooled sensitivity showed higher variability when individual studies were excluded. The sensitivity ranged from 53.0% to 57.7% (I2 fluctuated between 94.0% and 97.0%).
    Pooled Specificity:
    Excluding studies had a minor impact on the pooled specificity, which ranged from 70.5% to 75.2%, with the I2 between 91.7% and 94.4%.
    Impact of Exclusion:
    The study by Ángela Ruiz-Carnicer (2020) [34] had a notable impact on the pooled sensitivity, as it reported the highest sensitivity of 96.9%, contributing to the upper end of the pooled estimate. Excluding this study led to a decrease in the pooled sensitivity, which emphasizes the variability across the urine GIPS studies.

4. Discussion

4.1. The Importance and Ethical Imperative of Monitoring Subclinical Gluten Exposure in CD

Even in the absence of overt symptoms, subclinical gluten exposure in CD poses significant long-term risks, necessitating vigilant monitoring. While patients may not experience immediate gastrointestinal distress, low-level gluten ingestion can still trigger immune-mediated mucosal damage, perpetuating intestinal inflammation and increasing the risk of complications, such as villous atrophy, nutrient malabsorption, and refractory disease [37]. Over time, this silent injury may contribute to osteoporosis, lymphoma, and other autoimmune disorders—even in cases where the serological markers (e.g., tTG-IgA) remain negative due to their limited sensitivity to intermittent gluten exposure [38,39].
The management of CD relies heavily on strict adherence to a gluten-free diet (GFD), yet traditional monitoring methods—such as serological testing and duodenal biopsies—have notable limitations. Serological markers, while useful, may fail to detect intermittent gluten exposure, particularly in patients with partial dietary adherence [38,39]. Duodenal biopsies, though considered the gold standard for assessing mucosal healing, are invasive, costly, and impractical for routine monitoring [13]. Given these challenges, gluten immunogenic peptides (GIPS) testing in stool and urine has emerged as a promising non-invasive biomarker for detecting gluten exposure, offering a valuable tool for identifying subclinical or inadvertent gluten ingestion [37].

4.2. Diagnostic Accuracy and Clinical Utility of GIPS Testing

This systematic review and meta-analysis highlight the superior diagnostic performance of stool GIPS testing compared to urine-based assays. Stool GIPS testing demonstrates high sensitivity and specificity in detecting gluten exposure, even at low levels, making it particularly useful for identifying subclinical dietary lapses that may otherwise go unnoticed [26]. Unlike serological tests, which reflect immune activity over weeks to months, stool GIPS provides real-time detection of recent gluten intake, enabling timely dietary interventions [40].
In contrast, urine GIPS testing, while non-invasive and more convenient, exhibits lower diagnostic accuracy due to factors such as variable renal excretion rates, sample dilution, and shorter detection windows [27,41]. Nevertheless, urine testing may still serve as a supplementary tool in settings where stool collection is impractical, though further optimization is needed to improve its reliability [41,42].

4.3. Ethical Considerations in Monitoring and Patient Autonomy

The implementation of GIPS testing raises important ethical considerations regarding patient autonomy, psychological impact, and equitable access.

4.4. Informed Consent and Shared Decision-Making

Patients must be fully educated on the purpose, benefits, and limitations of GIPS testing. Ethical practice requires transparency—ensuring patients understand that asymptomatic gluten exposure can cause harm while acknowledging that testing may induce anxiety or feelings of dietary failure [43]. Shared decision-making empowers patients to actively participate in their care, avoiding unnecessary surveillance and fostering trust [27].

4.5. Avoiding Over-Medicalization and Psychological Burden

While early detection of gluten exposure is beneficial, excessive monitoring may lead to undue stress, particularly for patients struggling with dietary adherence. Clinicians must balance surveillance with psychological well-being, emphasizing progress over perfection and avoiding guilt or shame in patient interactions [44].

4.6. Equity and Access to Testing

Stool GIPS testing may not be universally accessible due to cost or healthcare disparities. Ethically, providers should advocate for equitable access while considering alternative monitoring strategies for underserved populations [4].

4.7. Privacy and Data Use

Patient confidentiality must be maintained, and explicit consent should be obtained if GIPS data are used for research or quality improvement initiatives [45].

4.8. Clinical Implications and Future Directions

4.8.1. Preventing Long-Term Complications

Proactive gluten immunogenic peptides (GIPS) monitoring may reduce the risk of refractory CD (RCD) and associated malignancies, particularly RCD type 2, which carries a high risk of enteropathy-associated T-cell lymphoma [46]. Early detection of persistent gluten exposure could also mitigate severe complications such as celiac crisis—a life-threatening condition characterized by profound diarrhoea and metabolic disturbances [47]. However, clinicians must consider alternative causes of villous atrophy, especially in seronegative cases or patients unresponsive to a gluten-free diet (GFD).

4.8.2. Differential Diagnosis in Persistent Villous Atrophy

  • ARB-Induced Enteropathy: Drugs like olmesartan can mimic CD histologically but require discontinuation rather than dietary intervention [45]. Unlike other aetiologies of seronegative villous atrophy (SNVA) (e.g., autoimmune enteropathy or tropical sprue), an ARB-related injury is typically reversible, emphasizing the importance of medication reviews in diagnostic workflows [37].
  • Small Intestinal Bacterial Overgrowth (SIBO): SIBO has been linked to persistent symptoms and mucosal injury in celiac patients, with a pooled prevalence of ~20% and up to 28% among those with GFD-unresponsive CD [48]. Excess bacterial colonization can disrupt nutrient absorption, induce inflammation, and contribute to villous blunting. Importantly, mucosal injury from SIBO may improve after targeted antibiotic therapy, such as metronidazole, rifaximin, or bismuth compounds [48], highlighting the need to test for SIBO in the evaluation of persistent villous atrophy.
  • Limitations of GIPS Testing: While GIPS testing detects gluten exposure, it cannot identify drug-induced injury or other SNVA causes. This underscores the need for a comprehensive evaluation when symptoms persist despite confirmed GFD adherence [36,46].

4.9. Challenges and Limitations

Despite its advantages, GIPS testing faces several challenges:
  • Heterogeneity in Testing Methods: Variations in the assays (e.g., ELISA vs. lateral flow) and patient populations limit the generalizability [4].
  • False Positives/Negatives:
    Dietary compliance does not always correlate perfectly with GIPS levels [3].
    In SNVA, persistent villous atrophy despite negative GIPS results should prompt evaluation for non-celiac causes (e.g., ARBs, NSAIDs, or immune dysregulation) [27,37].
  • Over-reliance on Biomarkers: GIPS testing cannot replace a histologic assessment in complex cases (e.g., distinguishing CD from ARB enteropathy or RCD) [6].
Need for Standardization: Consistent protocols and longitudinal studies are required to establish GIPS testing’s role in mucosal healing and complication prevention [6].

4.10. Future Research Priorities

  • Standardization of Assays: Harmonizing GIPS detection methods to improve reliability.
  • Longitudinal Outcomes Research: Assessing the impact of GIPS-guided interventions on mucosal healing and disease progression.
  • Cost-Effectiveness Analyses: Evaluating routine GIPS testing feasibility [7].
  • Enhancing Diagnostic Pathways: Integrating GIPS testing with drug history and serologic profiling to differentiate CD from mimics (e.g., ARB enteropathy) [36].
  • Urine GIPS Sensitivity: Refining urine testing protocols for broader applicability, particularly in resource-limited settings [36].

5. Conclusions

This systematic review and meta-analysis suggest that GIPS testing—particularly stool-based GIPS testing—can serve as a valuable tool for detecting gluten exposure in patients with CD. The findings indicate that stool GIPS testing provides high sensitivity and specificity, making it a reliable, non-invasive method for monitoring dietary adherence and identifying intermittent gluten ingestion, which can lead to long-term complications in coeliac patients. While urine GIPS testing has shown some promise, it has demonstrated a lower diagnostic accuracy and should be used as a complementary tool rather than a primary diagnostic method.
The findings, though promising, are subject to limitations due to the heterogeneity among the included studies, variations in testing methods, and differences in patient populations. As such, further standardization of GIPS testing protocols and more longitudinal studies are essential to fully assess the impact of GIPS testing on mucosal healing, complication prevention, and overall patient outcomes. Future research should also explore the potential of GIPS testing in resource-limited settings and assess its cost-effectiveness as part of routine CD management. By addressing these gaps, GIPS testing could become an integral tool in improving the management and monitoring of CD, ultimately enhancing the quality of life and clinical outcomes for patients.
Clinicians should integrate stool GIPS testing into monitoring protocols while addressing ethical considerations—such as patient autonomy, psychological impact, and equitable access—and remain vigilant for non-celiac causes of villous atrophy in refractory cases.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/life15101548/s1, Table S1: PRISMA 2020 Checklist. Reference [49] is cited in Supplementary Material File.

Author Contributions

S.S.M. and J.S. contributed to the study concept and design; drafting of the manuscript; data acquisition, analysis, and interpretation; and critical revision of the manuscript for important intellectual content. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

We extend our appreciation to the Faculty of Life Sciences and Education at the University of South Wales in the Gastroenterology MSc program and their invaluable support of our work. We sincerely acknowledge the efforts of the University of South Wales and commend them for their commitment to providing life-long learning opportunities and advanced life skills to healthcare professionals.

Conflicts of Interest

Authors Sarmad Sarfraz Moghal and Jonathan Soldera were employed by the company Learna Ltd.

Abbreviations

The following abbreviations are used in this manuscript:
GIPSGluten Immunogenic Peptides
GFDGluten-Free Diet
CDATCoeliac Dietary Adherence Test
TTGTissue Transglutaminase
DGPDeamidated Gliadin Peptide
AGAAnti-Gliadin Antibodies
RCTRandomized Controlled Trial
PRISMAPreferred Reporting Items for Systematic Reviews and Meta-Analyses
ESsCDEuropean Society for the Study of CD
EMAEndomysial Antibodies
HLA-DQ2/DQ8Human Leukocyte Antigen DQ2/DQ8
ACGAmerican College of Gastroenterology
BSGBritish Society of Gastroenterology
AUROCArea Under Receiver Operating Characteristic Curve
ELISAEnzyme-Linked Immunosorbent Assay
LFTLateral Flow Test
iVYCHECKA Specific GIPS Detection Test Name
RoBRisk of Bias
SROCSummary Receiver Operating Characteristic
LR+Positive Likelihood Ratio
LR−Negative Likelihood Ratio
DORDiagnostic Odds Ratio
CDCoeliac Disease
RCDRefractory CD

References

  1. Al-Toma, A.; Volta, U.; Auricchio, R.; Castillejo, G.; Sanders, D.S.; Cellier, C.; Mulder, C.J.; Lundin, K.E.A. European Society for the Study of CD (ESsCD) Guidelines for the diagnosis of CD. United Eur. Gastroenterol. J. 2019, 7, 583–613. [Google Scholar] [CrossRef]
  2. American College of Gastroenterology. ACG Clinical Guidelines: Diagnosis and Management of CD. 2013. Available online: https://gi.org/guidelines (accessed on 23 November 2024).
  3. Ludvigsson, J.F.; Bai, J.C.; Biagi, F.; Card, T.R.; Ciacci, C.; Ciclitira, P.J.; Green, P.H.; Hadjivassiliou, M.; Holdoway, A.; Van Heel, D.A.; et al. Diagnosis and management of adult coeliac disease: Guidelines from the British Society of Gastroenterology. Gut 2014, 63, 1210–1228. [Google Scholar] [CrossRef]
  4. Ludvigsson, J.F.; Leffler, D.A.; Bai, J.C.; Biagi, F.; Fasano, A.; Green, P.H.R.; Hadjivassiliou, M.; Kaukinen, K.; Kelly, C.P.; Leonard, J.N.; et al. The Oslo definitions for CD and related terms. Gut 2013, 62, 43–52. [Google Scholar] [CrossRef]
  5. Singh, P.; Arora, A.; Strand, T.A.; Leffler, D.A.; Catassi, C.; Green, P.H.; Kelly, C.P.; Ahuja, V.; Makharia, G.K. Global prevalence of CD: Systematic review and meta-analysis. Clin. Gastroenterol. Hepatol. 2018, 16, 823–836.E2. [Google Scholar] [CrossRef]
  6. Catassi, C.; Gatti, S.; Fasano, A. The new epidemiology of celiac disease. J. Pediatr. Gastroenterol. Nutr. 2014, 59 (Suppl. 1), S7–S9. [Google Scholar] [CrossRef] [PubMed]
  7. Rubio-Tapia, A.; Hill, I.D.; Kelly, C.P.; Calderwood, A.H.; Murray, J.A. ACG Clinical Guidelines: Diagnosis and Management of CD. Am. J. Gastroenterol. 2013, 108, 656–676. [Google Scholar] [CrossRef]
  8. Soldera, J.; Salgado, K.; Pêgas, K.L. Refractory celiac disease type 2: How to diagnose and treat? Rev. Assoc. Med. Bras. 2021, 67, 168–172. [Google Scholar] [CrossRef] [PubMed]
  9. Jamma, S.; Rubio-Tapia, A.; Kelly, C.P. Celiac crisis is a rare but serious complication of CD in adults. Clin. Gastroenterol. Hepatol. 2010, 8, 587–590. [Google Scholar] [CrossRef]
  10. Balaban, D.V.; Dima, A.; Jurcut, C. Celiac crisis, a rare occurrence in adult CD: A systematic review. World J. Clin. Cases 2019, 7, 311–319. [Google Scholar] [CrossRef]
  11. Moreno, M.L.; Rodríguez-Herrera, A.; Sousa, C.; Comino, I. Biomarkers to Monitor Gluten-Free Diet Compliance in Celiac Patients. Nutrients 2017, 9, 46. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  12. Comino, I.; Fernández-Bañares, F.; Esteve, M.; Ortigosa, L.; Castillejo, G.; Fambuena, B.; Ribes-Koninckx, C.; Sierra, C.; Rodríguez-Herrera, A.; Salazar, J.C.; et al. Fecal Gluten Peptides Reveal Limitations of Serological Tests and Food Questionnaires for Monitoring Gluten-Free Diet in Celiac Disease Patients. Am. J. Gastroenterol. 2016, 111, 1456–1465, Erratum in Am. J. Gastroenterol. 2017, 112, 1208. https://doi.org/10.1038/ajg.2017.110. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  13. Leonard, M.M.; Sapone, A.; Catassi, C.; Fasano, A. CD and nonceliac gluten sensitivity: A review. JAMA 2017, 318, 647–656. [Google Scholar] [CrossRef] [PubMed]
  14. Stanciu, D.; Staykov, H.; Dragomanova, S.; Tancheva, L.; Pop, R.S.; Ielciu, I.; Crișan, G. Gluten Unraveled: Latest Insights on Terminology, Diagnosis, Pathophysiology, Dietary Strategies, and Intestinal Microbiota Modulations-A Decade in Review. Nutrients 2024, 16, 3636. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  15. Catassi, C.; Fabiani, E.; Iacono, G.; D’Agate, C.; Francavilla, R.; Biagi, F.; Volta, U.; Accomando, S.; Picarelli, A.; De Vitis, I.; et al. A prospective, double-blind, placebo-controlled trial to establish a safe gluten threshold for patients with CD. Am. J. Clin. Nutr. 2007, 85, 160–166. [Google Scholar] [CrossRef]
  16. Comino, I.; Real, A.; Vivas, S.; Síglez, M.Á.; Caminero, A.; Nistal, E.; Casqueiro, J.; Rodríguez-Herrera, A.; Cebolla, A.; Sousa, C. Monitoring of gluten-free diet compliance in celiac patients by assessment of gliadin 33-mer equivalent epitopes in feces. Am. J. Clin. Nutr. 2012, 95, 670–677. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  17. Molina-Infante, J.; Santolaria, S.; Sanders, D.S.; Fernández-Bañares, F. Systematic review: Noncoeliac gluten sensitivity. Aliment. Pharmacol. Ther. 2015, 41, 807–820. [Google Scholar] [CrossRef] [PubMed]
  18. Higgins, J.P.T.; Thomas, J.; Chandler, J.; Cumpston, M.; Li, T.; Page, M.J.; Welch, V.A. (Eds.) Cochrane Handbook for Systematic Reviews of Interventions, 2nd ed.; John Wiley & Sons: Chichester, UK, 2019. [Google Scholar]
  19. Raju, S.A.; Ingham, K.E.; Green, O.; Johnson, C.M.; Shiha, M.G.; Nandi, N.; Trott, N.; Penny, H.A.; Hadjivassiliou, M.; Wild, G.; et al. What is the Role of Measuring Urinary Gluten Immunogenic Peptides in Clinical Practice in Patients with CD? J. Gastrointest. Liver Dis. 2024, 33, 482–487. [Google Scholar] [CrossRef] [PubMed]
  20. Higgins, J.P.T.; Thompson, S.G.; Deeks, J.J.; Altman, D.G. Measuring inconsistency in meta-analyses. BMJ 2003, 327, 557–560. [Google Scholar] [CrossRef]
  21. Stefanolo, J.P.; Temprano, M.d.l.P.; Smecuol, E.; Puebla, R.; Díaz Portillo, J.; Niveloni, S.; Verdú, E.F.; Bai, J.C. Comparison of weekly gluten immunogenic peptide measurement and conventional tools to assess adherence to the gluten-free diet in celiac disease: An observational prospective study. Am. J. Clin. Nutr. 2023, 118, 1106–1112. Available online: https://www.sciencedirect.com/science/article/pii/S0002916523661738 (accessed on 19 November 2024). [CrossRef]
  22. Roca, M.; Donat, E.; Masip, E.; Crespo-Escobar, P.; Cañada-Martínez, A.J.; Polo, B.; Ribes-Koninckx, C. Analysis of gluten immunogenic peptides in feces to assess adherence to the gluten-free diet in pediatric celiac patients. Eur. J. Nutr. 2021, 60, 2131–2140. [Google Scholar] [CrossRef] [PubMed]
  23. Seetharaman, K.; Lal, S.B.; Prasad, K.K.; Kumar, Y.; Bhatia, A.; Malhotra, S. Role of Serology, Dietary Assessment, and Fecal Gluten Immunogenic Peptides for Predicting Histologic Recovery in Children with Celiac Disease. Dig. Dis. Sci. 2023, 68, 529–540. [Google Scholar] [CrossRef] [PubMed]
  24. Coto, L.; Mendia, I.; Sousa, C.; Bai, J.C.; Cebolla, A. Determination of gluten immunogenic peptides for the management of the treatment adherence of celiac disease: A systematic review. World J. Gastroenterol. 2021, 27, 6306–6321. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  25. Gerasimidis, K.; Zafeiropoulou, K.; Mackinder, M.; Ijaz, U.Z.; Duncan, H.; Buchanan, E.; Cardigan, T.; Edwards, C.A.; McGrogan, P.; Russell, R.K. Comparison of Clinical Methods With the Faecal Gluten Immunogenic Peptide to Assess Gluten Intake in Coeliac Disease. J. Pediatr. Gastroenterol. Nutr. 2018, 67, 356–360. [Google Scholar] [CrossRef] [PubMed]
  26. Comino, I.; Segura, V.; Ortigosa, L.; Espín, B.; Castillejo, G.; Garrote, J.A.; Sierra, C.; Millán, A.; Ribes-Koninckx, C.; Román, E.; et al. Prospective longitudinal study: Use of faecal gluten immunogenic peptides to monitor children diagnosed with CD during transition to a gluten-free diet. Aliment. Pharmacol. Ther. 2022, 55, 282–291. [Google Scholar] [CrossRef] [PubMed]
  27. Silvester, J.A.; Comino, I.; Kelly, C.P.; Sousa, C.; Duerksen, D.R.; DOGGIE BAG Study Group. Most Patients with Celiac Disease on Gluten-Free Diets Consume Measurable Amounts of Gluten. Gastroenterology 2020, 158, 1497–1499.e1. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  28. Fernández Miaja, M.; Díaz Martín, J.J.; Jiménez Treviño, S.; Suárez González, M.; Bousoño García, C. Study of adherence to the gluten-free diet in coeliac patients. An. Pediatría 2021, 94, 377–384. Available online: https://www.sciencedirect.com/science/article/pii/S234128792100051X (accessed on 19 November 2024). [CrossRef]
  29. Porcelli, B.; Ferretti, F.; Cinci, F.; Biviano, I.; Santini, A.; Grande, E.; Quagliarella, F.; Terzuoli, L.; Bacarelli, M.R.; Bizzaro, N.; et al. Fecal gluten immunogenic peptides as indicators of dietary compliance in celiac patients. Minerva Gastroenterol. Dietol. 2020, 66, 201–207. [Google Scholar] [CrossRef] [PubMed]
  30. Donat, E.; Roca, M.; Masip, E.; Polo, B.; Ramos, D.; Ribes-Koninckx, C. Response to Screening-detected positive serology for celiac disease: Is there always mucosal damage? Eur. J. Gastroenterol. Hepatol. 2021, 33 (Suppl. 1), e1109. [Google Scholar] [CrossRef] [PubMed]
  31. Laserna-Mendieta, E.J.; Casanova, M.J.; Arias, Á.; Arias-González, L.; Majano, P.; Mate, L.A.; Gordillo-Vélez, C.H.; Jiménez, M.; Angueira, T.; Tébar-Romero, E.; et al. Poor sensitivity of fecal gluten immunogenic peptides and serum antibodies to detect duodenal mucosal damage in the monitoring of celiac disease. Nutrients 2020, 13, 98. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  32. Burger, J.P.W.; van Lochem, E.G.; Roovers, E.A.; Drenth, J.P.H.; Wahab, P.J. Dose-Escalating (50–500 mg) Gluten Administration Leads to Detectable Gluten-Immunogenic-Peptides in Urine of Patients with CD Which Is Unrelated to Symptoms, a Placebo Controlled Trial. Nutrients 2022, 14, 1771. [Google Scholar] [CrossRef]
  33. Stefanolo, J.P.; Tálamo, M.; Dodds, S.; de la Paz Temprano, M.; Costa, A.F.; Moreno, M.L.; Pinto-Sánchez, M.I.; Smecuol, E.; Vázquez, H.; Gonzalez, A.; et al. Real-World Gluten Exposure in Patients With CD on Gluten-Free Diets, Determined From Gliadin Immunogenic Peptides in Urine and Fecal Samples. Clin. Gastroenterol. Hepatol. 2021, 19, 484–491.e1. [Google Scholar] [CrossRef] [PubMed]
  34. Ruiz-Carnicer, Á.; Garzón-Benavides, M.; Fombuena, B.; Segura, V.; García-Fernández, F.; Sobrino-Rodríguez, S.; Gómez-Izquierdo, L.; Montes-Cano, M.A.; Rodríguez-Herrera, A.; Millán, R.; et al. Negative predictive value of the repeated absence of gluten immunogenic peptides in the urine of treated celiac patients in predicting mucosal healing: New proposals for follow-up in CD. Am. J. Clin. Nutr. 2020, 112, 1240–1251. [Google Scholar] [CrossRef] [PubMed]
  35. Russell, A.K.; Lucas, E.C.; Henneken, L.M.; Pizzey, C.J.; Clarke, D.; Myleus, A.; Tye-Din, J.A. Stool gluten peptide detection is superior to urinary analysis, coeliac serology, dietary adherence scores and symptoms in the detection of intermittent gluten exposure in CD: A randomised, placebo-controlled, low-dose gluten challenge study. Nutrients 2024, 16, 279. [Google Scholar] [CrossRef] [PubMed]
  36. Garzón-Benavides, M.; Ruiz-Carnicer, Á.; Segura, V.; Fombuena, B.; García-Fernandez, F.; Sobrino-Rodriguez, S.; Gómez-Izquierdo, L.; Montes-Cano, M.A.; Millan-Domínguez, R.; Del Carmen Rico, M.; et al. Clinical utility of urinary gluten immunogenic peptides in the follow-up of patients with CD. Aliment. Pharmacol. Ther. 2023, 57, 993–1003. [Google Scholar] [CrossRef] [PubMed]
  37. Schuppan, D.; Mäki, M.; Lundin, K.E.A.; Murray, J.A. Novel non-invasive biomarkers for monitoring CD. Nat. Rev. Immunol. 2023, 23, 215–230. [Google Scholar]
  38. Green, P.H.R.; Cellier, C.; Haug, D. Management of CD and noncoeliac gluten sensitivity. Gastroenterol. Clin. N. Am. 2020, 49, 99–110. [Google Scholar]
  39. Hogen Esch, M.; Koesling, M.; Poloni, P. The clinical use of serological tests in CD: Current perspectives. Eur. J. Gastroenterol. Hepatol. 2017, 29, 741–748. [Google Scholar]
  40. Murray, J.A.; Syage, J.A.; Wu, T.T.; Dickason, M.A.; Ramos, A.G.; Van Dyke, C.; Horwath, I.; Lavin, P.T.; Maki, M.; Hujoel, I.; et al. Latiglutenase Protects the Mucosa and Attenuates Symptom Severity in Patients with CD Exposed to a Gluten Challenge. Gastroenterology 2022, 163, 1510–1521.e6. [Google Scholar] [CrossRef]
  41. Monachesi, C.; Verma, A.K.; Catassi, G.N.; Franceschini, E.; Gatti, S.; Gesuita, R.; Lionetti, E.; Catassi, C. Determination of Urinary Gluten Immunogenic Peptides to Assess Adherence to the Gluten-Free Diet: A Randomized, Double-Blind, Controlled Study. Clin. Transl. Gastroenterol. 2021, 12, e00411. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  42. Viechtbauer, W. Conducting meta-analyses in R with the metafor package. J. Stat. Softw. 2010, 36, 1–48. [Google Scholar] [CrossRef]
  43. Lebwohl, B.; Sanders, D.S.; Green, P.H.R. Coeliac disease. Lancet 2018, 391, 70–81. [Google Scholar] [CrossRef]
  44. Egger, M.; Smith, G.D.; Schneider, M.; Minder, C. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997, 315, 629–634. [Google Scholar] [CrossRef]
  45. Soldera, J.; Salgado, K. Gastrointestinal: Valsartan-induced sprue-like enteropathy. J. Gastroenterol. Hepatol. 2020, 35, 1262. [Google Scholar] [CrossRef]
  46. Soldera, J.; Coelho, G.P.; Heinrich, C.F. Life-threatening diarrhea in an elderly patient. Gastroenterology 2021, 160, 26–28. [Google Scholar] [CrossRef]
  47. Losurdo, G.; Marra, A.; Shahini, E.; Girardi, B.; Giorgio, F.; Amoruso, A.; Pisani, A.; Piscitelli, D.; Barone, M.; Principi, M.; et al. Small intestinal bacterial overgrowth and celiac disease: A systematic review with pooled-data analysis. Neurogastroenterol. Motil. 2017, 29, e13028. [Google Scholar] [CrossRef] [PubMed]
  48. Shah, Q.; Soldera, J. Exploring effectiveness of Metronidazole, Bismuth, and Rifaximin in treating small intestinal bacterial overgrowth and irritable bowel syndrome: A systematic review. World J. Methodol. 2025, in press.
  49. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, 71. [Google Scholar] [CrossRef]
Figure 1. PRISMA flow diagram for identification of studies.
Figure 1. PRISMA flow diagram for identification of studies.
Life 15 01548 g001
Figure 2. Risk of bias traffic-light plot [11,36,37,38,39,40].
Figure 2. Risk of bias traffic-light plot [11,36,37,38,39,40].
Life 15 01548 g002
Figure 3. Risk of bias summary plot.
Figure 3. Risk of bias summary plot.
Life 15 01548 g003
Figure 4. Forest plot showing pooled effect of stool GIPS testing sensitivity [33,35,36].
Figure 4. Forest plot showing pooled effect of stool GIPS testing sensitivity [33,35,36].
Life 15 01548 g004
Figure 5. Forest plot showing pooled effect of stool GIPS testing specificity [33,35,36].
Figure 5. Forest plot showing pooled effect of stool GIPS testing specificity [33,35,36].
Life 15 01548 g005
Figure 6. Summary receiver operating characteristic (SROC) curve showing diagnostic performance of stool GIPS testing.
Figure 6. Summary receiver operating characteristic (SROC) curve showing diagnostic performance of stool GIPS testing.
Life 15 01548 g006
Figure 7. Forest plot showing pooled sensitivity of urine GIPS testing [11,32,33,34,36].
Figure 7. Forest plot showing pooled sensitivity of urine GIPS testing [11,32,33,34,36].
Life 15 01548 g007
Figure 8. Forest plot showing pooled specificity of urine GIPS testing [11,32,33,34,36].
Figure 8. Forest plot showing pooled specificity of urine GIPS testing [11,32,33,34,36].
Life 15 01548 g008
Figure 9. Summary receiver operating characteristic (SROC) curve showing diagnostic performance of urine GIPS testing.
Figure 9. Summary receiver operating characteristic (SROC) curve showing diagnostic performance of urine GIPS testing.
Life 15 01548 g009
Figure 10. Forest plot showing pooled diagnostic odds ratio (DOR) for stool GIPS testing [33,34,35,36].
Figure 10. Forest plot showing pooled diagnostic odds ratio (DOR) for stool GIPS testing [33,34,35,36].
Life 15 01548 g010
Figure 11. Forest plots showing pooled positive likelihood ratio (LR+) for stool GIPS testing [33,34,35,36].
Figure 11. Forest plots showing pooled positive likelihood ratio (LR+) for stool GIPS testing [33,34,35,36].
Life 15 01548 g011
Figure 12. Forest plot showing pooled negative likelihood ratios (LR−) for stool GIPS testing [33,34,35,36].
Figure 12. Forest plot showing pooled negative likelihood ratios (LR−) for stool GIPS testing [33,34,35,36].
Life 15 01548 g012
Figure 13. Forest plot showing pooled diagnostic odds ratio (DOR) for urine GIPS testing [11,32,33,34,36].
Figure 13. Forest plot showing pooled diagnostic odds ratio (DOR) for urine GIPS testing [11,32,33,34,36].
Life 15 01548 g013
Figure 14. Forest plot showing pooled positive likelihood ratio (LR−) for urine GIPS testing [11,32,33,34,36].
Figure 14. Forest plot showing pooled positive likelihood ratio (LR−) for urine GIPS testing [11,32,33,34,36].
Life 15 01548 g014
Figure 15. Forest plot showing pooled negative likelihood ratio (LR−) for urine GIPS testing [11,32,33,34,36].
Figure 15. Forest plot showing pooled negative likelihood ratio (LR−) for urine GIPS testing [11,32,33,34,36].
Life 15 01548 g015
Table 1. Studies excluded from meta-analysis along with justification.
Table 1. Studies excluded from meta-analysis along with justification.
Study AuthorYearReason for Exclusion
Stefanolo et al. [21]2023Overlapping population with the 2021 study by the same author; redundant data.
Roca et al. [22]2019Paediatric population only, whereas the analysis prioritized mixed populations for broader applicability.
Seetharaman et al. [23]2023Exclusively paediatric population; study design unsuitable for direct comparison with other included studies.
Coto et al. [24]2021Systematic review; data not suitable for meta-analysis due to lack of raw patient-level outcomes.
Comino et al. [12]2016High risk of bias due to lack of standardized intervention across multiple centres.
Gerasimidis et al. [25]2018Cross-sectional design without longitudinal outcome data, limiting the analysis of adherence trends.
Comino et al. [26]2019Lack of granular data on mucosal healing; study focused on dietary adherence questionnaires.
Silvester et al. [27]2022Small sample size with heterogeneous population, leading to high variability in outcomes.
Miaja et al. [28]2021Limited outcome data; primary focus on CDAT scores rather than GIPS sensitivity/specificity.
Porcelli et al. [29]2020Small sample size with limited statistical power; high risk of selection bias.
Donat et al. [30]2021Paediatric focus and insufficient differentiation between dietary compliance methods.
Laserna-Mendieta et al. [31]2021Limited follow-up duration and incomplete reporting of outcome data.
Table 2. Included studies characteristics.
Table 2. Included studies characteristics.
AuthorYearLocationStudy DesignPopulationInterventionComparisonPrimary Outcome
Moreno et al. [11]2017SpainObservational study at Hospital Virgen de Valme134 patients: 58 coeliac on GFD, 76 healthy (both diets)GIPS detection in urine using LFT with G12 antibodyDuodenal biopsy and serology (TTG, AGA)GIPS in urine for GFD compliance and mucosal healing.
Burger et al. [32]2022The NetherlandsProspective double blinded, placebo controlled15 adults with CD adhering to GFD for ≥1 yearDose-escalating gluten administration with iVYCHECK-GIPS-UrineSymptom diary and placeboDetection of GIPS in urine following gluten administration; correlation to self-reported symptoms.
Stefanolo et al. [33]2021ArgentinaProspective observational study53 adults with CD on GFD for >2 yearsWeekly stool and pooled urine sample collection over 4 weeksCSI symptoms, serological markers (DGP, TTG)Frequency of gluten exposure via GIPS detection in stool and urine; correlation with symptom severity and serological markers.
Ruiz-Carnicer et al. [34]2020SpainProspective observational study112 participants: 22 new coeliac, 77 on GFD, 13 controlsGIPS monitoring in urine samples (3 times per week)Histology, tTG levels, dietary questionnaireCorrelation of GIPS in urine with mucosal healing, histological outcomes, and serological markers.
Russell et al. [35]2024Australia and SwedenRandomized, double-blind, placebo-controlled, low-dose study52 adults with CD on a strict GFD for ≥1 yearStool GIPS detection using the iVYLISA GIPS stool testStool vs. urine GIPS detection, serology, dietary adherenceSensitivity of stool GIPS for gluten exposure detection vs. urine GIPS, coeliac serology, and dietary adherence; symptom correlation.
Garzón-Benavides et al. [36]2023SpainProspective quasi-experimental study94 patients with CD (on GFD for ≥24 months; follow-ups at 3, 6, and 12 months)Urinary gluten immunogenic peptides (u-GIPS) detection at each follow-up visitu-GIPS detection vs. serology, CDAT questionnaire, symptoms, and duodenal biopsy resultsDuodenal mucosal healing (histology); correlation with u-GIPS levels, serology, symptoms, and dietary adherence scores (CDAT).
Table 3. Adverse events in intervention and control groups.
Table 3. Adverse events in intervention and control groups.
StudyNumber of ParticipantsAdverse Events (Control Group)Adverse Events (Intervention Group)Number in Control Group
Detection of Gluten Immunogenic Peptides in Urine (Moreno et al.) [11]134None reportedNone reported76
Dose-Escalating Gluten Administration (Burger et al.) [32]153 patients reported symptoms after placebo5 patients reported abdominal pain after 500 mg gluten15
Gluten Exposure Detection (Stefanolo et al., 2021) [33]53None reportedNone reported27
Stool Gluten Peptide Detection (Russell et al., 2024) [35]52Placebo group: mild GI symptoms consistent with a nocebo effect1 patient vomited after 500 mg gluten; 3 patients vomited after 1000 mg gluten10
Negative Predictive Value of Repeated Absence of Gluten Immunogenic Peptides in Urine (Ruiz-Carnicer et al., 2020) [34]112None reportedNone reported13
Urinary Gluten Immunogenic Peptides in GFD Adherence Monitoring (Garzón-Benavides et al., 2023) [36]94None reportedNone reported24
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Moghal, S.S.; Soldera, J. Comparative Effectiveness of Urine vs. Stool Gluten Immunogenic Peptides Testing for Monitoring Gluten Intake in Coeliac Patients: A Systematic Review and Meta-Analysis. Life 2025, 15, 1548. https://doi.org/10.3390/life15101548

AMA Style

Moghal SS, Soldera J. Comparative Effectiveness of Urine vs. Stool Gluten Immunogenic Peptides Testing for Monitoring Gluten Intake in Coeliac Patients: A Systematic Review and Meta-Analysis. Life. 2025; 15(10):1548. https://doi.org/10.3390/life15101548

Chicago/Turabian Style

Moghal, Sarmad Sarfraz, and Jonathan Soldera. 2025. "Comparative Effectiveness of Urine vs. Stool Gluten Immunogenic Peptides Testing for Monitoring Gluten Intake in Coeliac Patients: A Systematic Review and Meta-Analysis" Life 15, no. 10: 1548. https://doi.org/10.3390/life15101548

APA Style

Moghal, S. S., & Soldera, J. (2025). Comparative Effectiveness of Urine vs. Stool Gluten Immunogenic Peptides Testing for Monitoring Gluten Intake in Coeliac Patients: A Systematic Review and Meta-Analysis. Life, 15(10), 1548. https://doi.org/10.3390/life15101548

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop