Article

Consistent Major Differences in Sex- and Age-Specific Diagnostic Performance among Nine Faecal Immunochemical Tests Used for Colorectal Cancer Screening

1 German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Division of Preventive Oncology, 69120 Heidelberg, Germany
2 German Cancer Research Center (DKFZ), Division of Clinical Epidemiology and Aging Research, 69120 Heidelberg, Germany
3 German Cancer Research Center (DKFZ), Division of Biostatistics, 69120 Heidelberg, Germany
4 German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
5 Medical Faculty Heidelberg, University of Heidelberg, 69120 Heidelberg, Germany
* Author to whom correspondence should be addressed.
These authors contributed equally to this paper.
Cancers 2021, 13(14), 3574; https://doi.org/10.3390/cancers13143574
Submission received: 10 May 2021 / Revised: 9 July 2021 / Accepted: 14 July 2021 / Published: 16 July 2021

Simple Summary

We evaluated the performance of nine faecal immunochemical tests (FITs) among participants of screening colonoscopy. A total of 216 cases of advanced neoplasia (AN, colorectal cancer or advanced adenoma) and 300 randomly selected participants without AN were included. Diagnostic performance for detection of AN was assessed by sex and age (50–64 vs. 65–79 years), for each of the nine FITs individually and for all FITs combined. Major differences in diagnostic performance by sex and age were consistently seen across the nine FIT brands. Sensitivities were consistently lower, and specificities were consistently higher, for females as compared with males. Positive predictive values were similar between both sexes, but negative predictive values were higher for females. A negative FIT is less reliable in ruling out AN among men than among women and among older than among younger participants.

Abstract

Evidence on diagnostic performance of faecal immunochemical tests (FITs) by sex and age is scarce. We aimed to evaluate FIT performance for detection of advanced colorectal neoplasia (AN) by sex and age across nine different FIT brands in a colonoscopy-controlled setting. The faecal samples were obtained from 2042 participants of colonoscopy screening. All eligible cases with AN (n = 216) and 300 randomly selected participants without AN were included. Diagnostic performance for detection of AN was assessed by sex and age (50–64 vs. 65–79 years) for each of the nine FITs individually and for all FITs combined. Sensitivity was consistently lower, and specificity was consistently higher, for females as compared with males (pooled values at original FIT cutoffs, 25.7% vs. 34.6%, p = 0.12 and 96.2% vs. 90.8%, p < 0.01, respectively). Positive predictive values (PPVs) were similar between both sexes, but negative predictive values (NPVs) were consistently higher for females (pooled values, 91.8% vs. 86.6%, p < 0.01). Sex-specific cutoffs attenuated differences in sensitivities but increased differences in predictive values. According to age, sensitivities and specificities were similar, whereas PPVs were consistently lower and NPVs were consistently higher for the younger participants. A negative FIT is less reliable in ruling out AN among men than among women and among older than among younger participants. Comparisons of measures of diagnostic performance among studies with different sex or age distributions should be interpreted with caution.

1. Introduction

Worldwide, colorectal cancer (CRC) accounts for approximately 1 million new cases among men and for approximately 800,000 new cases among women annually [1]. Faecal immunochemical tests (FITs) are widely recommended [2,3] and used [4,5] for population-wide screening and early detection of CRC and its precancerous lesions. The diagnostic performance of quantitative FITs has been assessed in many studies and has been summarized in meta-analyses [6,7]; however, evidence on FIT performance according to sex and age derived from colonoscopy-controlled studies is scarce and limited to only a few FIT brands [8]. Furthermore, it is unclear whether differences exist for detection of advanced colorectal neoplasia (AN) by sex or by age across different FIT brands.
We aimed to evaluate the diagnostic performance of a large number of different quantitative FITs according to sex and to age, using faecal samples obtained from individuals undergoing colonoscopy screening in Germany.

2. Materials and Methods

The Standards for the Reporting of Diagnostic Accuracy Studies (STARD) [9] and the standard for Faecal Immunochemical Tests for Haemoglobin Evaluation Reporting (FITTER) [10] were followed.

2.1. Study Design and Population

This analysis was carried out following a direct comparison and combination of nine quantitative FITs for detection of AN, details of which have been published previously [11,12]. Briefly, this study is based on the BLITZ study, which has been running since 2005 with the aim of collecting blood and stool samples from average-risk individuals before they undergo colonoscopy screening, for evaluation of novel non-invasive CRC screening tests. Study participants are informed and recruited during their preparatory visit (typically 1 week before colonoscopy) in cooperating gastroenterology practices in southwest Germany.
The study was approved by the Ethics Committee of the University of Heidelberg (178/2005) and those of the state chambers of physicians of Baden-Wuerttemberg (M118-05-f), Rhineland-Palatinate (837.047.06(5145)) and Saarland (217/13). The BLITZ study was registered in the German Clinical Trials Register (DRKSID: DRKS00008737). Written informed consent was obtained from each study participant. Further information about the BLITZ study has been provided elsewhere [13,14,15].

2.2. Selection of Study Participants

A total of 2042 participants, who were recruited until 2010 and who provided faecal samples in stool containers (60 mL), were eligible for this project. After excluding participants <50 or ≥80 years of age (n = 52), with inflammatory bowel disease (n = 10), history of previous colorectal neoplasia (n = 39), colonoscopy in the past 5 years (n = 114), stool sample collection not prior to colonoscopy (n = 75), incomplete colonoscopy (n = 8), or inadequate bowel cleansing (n = 77), 1667 participants fulfilled the inclusion criteria for this analysis. All participants diagnosed with AN, i.e., either CRC or advanced adenoma (AA, defined as adenomas with either size ≥1 cm, villous/tubulovillous components, or high-grade dysplasia), who provided enough faeces for the evaluation of 9 FITs were included (n = 216). For one FIT (immoCARE-C), the analyses were based on one fewer AN case (n = 215), because one FIT measurement was missing. To save resources and capacities, 300 participants without CRC or AA were randomly selected and included for specificity calculations.
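The participant flow can be checked with simple arithmetic. The following sketch only restates the counts reported in this section; the variable names and shortened labels are illustrative, and it assumes that the exclusion categories do not overlap.

```python
# Exclusion counts as reported in Section 2.2 (labels shortened for illustration).
recruited = 2042
exclusions = {
    "age <50 or >=80 years": 52,
    "inflammatory bowel disease": 10,
    "previous colorectal neoplasia": 39,
    "colonoscopy in the past 5 years": 114,
    "stool sample not collected before colonoscopy": 75,
    "incomplete colonoscopy": 8,
    "inadequate bowel cleansing": 77,
}

# Participants fulfilling the inclusion criteria.
eligible = recruited - sum(exclusions.values())

# Samples entering the FIT measurements: all included AN cases plus the random sample without AN.
an_cases = 216
controls = 300
analysed = an_cases + controls

print(eligible, analysed)
```

The two totals correspond to the 1667 eligible participants and the 516 faecal samples measured with the nine FITs.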

2.3. Data/Sample Collection and Processing

Participants were asked to collect a faecal sample before starting bowel preparation for colonoscopy, to store the sample in a freezer (or, if not possible, in a refrigerator), and to bring it in a temperature-isolated bag to the gastroenterology practice on the day of the scheduled colonoscopy. In the practices, the samples were kept at −20 °C and sent on dry ice to a central laboratory, and afterwards to the German Cancer Research Center (DKFZ, Heidelberg) for final storage at −80 °C. Although this preanalytical sample procedure differs from the recommended faecal sampling procedure (i.e., filling the faecal sampling tubes directly with fresh stool), we have found in a recent retrospective analysis that estimates of diagnostic performance of FITs remained fairly stable even after long-term frozen storage and repeat thawing and freezing cycles [16].
The screening colonoscopy was performed by experienced colonoscopists who were unaware of the FIT result. Afterwards, colonoscopy (and histology) reports were collected, and relevant data were extracted by trained medical data officers who were likewise blinded to any FIT result.

2.4. Test Analysis

Faecal samples were thawed in 2016 in order to measure different FITs in parallel, as previously described [11,12]. Overall, 516 faecal samples from average-risk participants of screening colonoscopy were measured using nine quantitative FITs from seven manufacturers. All FITs were approved for use in Germany. Detailed test characteristics are shown in Table S1. Before filling the single faecal sample collection tubes, the stool within each container was mixed to reduce heterogeneity in faecal haemoglobin distribution [17]. All nine FITs were evaluated simultaneously under the same preanalytical and analytical conditions: stool specimens were extracted for the nine FITs using the special sampling tubes that had been designed to transfer a defined amount of faeces into a haemoglobin-stabilizing buffer of the tube. Afterwards, the tubes were shaken and kept at ambient temperature (range 20–24 °C) until they were blindly measured on the next day. Further detailed information on test analysis has been published previously [11,12].

2.5. Statistical Analysis

All quantitative faecal haemoglobin measurements were converted to the same, and directly comparable, unit of µg haemoglobin per gram faeces (µg/g) [18].
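As a minimal sketch of this conversion, a buffer concentration in ng haemoglobin per mL can be turned into µg haemoglobin per g faeces by multiplying by the buffer volume and dividing by the mass of faeces sampled. The function name and the device values in the example are hypothetical and are not taken from Table S1.

```python
def hb_ug_per_g(hb_ng_per_ml: float, buffer_volume_ml: float, faeces_mass_mg: float) -> float:
    """Convert a buffer haemoglobin concentration (ng/mL) to µg Hb per g faeces.

    ng/mL * mL gives ng of haemoglobin in the tube; dividing by mg of faeces
    gives ng/mg, which is numerically equal to µg/g.
    """
    if buffer_volume_ml <= 0 or faeces_mass_mg <= 0:
        raise ValueError("buffer volume and faeces mass must be positive")
    return hb_ng_per_ml * buffer_volume_ml / faeces_mass_mg


# Hypothetical device: 10 mg of faeces sampled into 2.0 mL of buffer.
example = hb_ug_per_g(hb_ng_per_ml=100.0, buffer_volume_ml=2.0, faeces_mass_mg=10.0)
print(example)
```

Because sampled mass and buffer volume differ between devices, the same ng/mL reading can correspond to different µg/g values, which is why a common unit is needed before thresholds are compared across brands.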
Sensitivities, specificities, positive and negative predictive values (PPVs and NPVs) with their 95% confidence intervals (CIs) were calculated for detection of AN (either CRC or AA) by sex and by age (50–64 and 65–79 years). The analyses were conducted at thresholds recommended by the manufacturers (=original thresholds, range 2–17 µg/g) and at thresholds yielding an equal overall specificity of 95%. In addition, positivity rates with their 95% CIs were computed. Due to the overrepresentation of participants with AN (all AN cases, n = 216) in comparison to the participants without AN (random sample, n = 300) by design, PPVs, NPVs and positivity rates were derived from weighted analyses. Weights were calculated by dividing original fractions, which were observed among the 1667 eligible study participants, by sampling fractions for inclusion in the FIT evaluation. This way, positivity rates, PPVs, and NPVs reflect the prevalence of AN, sex, and age distribution observed in the cohort of eligible participants (n = 1667) who fulfilled the inclusion criteria. Testing for statistical differences by sex and by age was conducted using logistic regression models. For positivity rates, PPVs, and NPVs, a weighted logistic regression model was fitted. p-values and CIs were based on the Wald test.
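The weighting idea can be illustrated with a simplified sketch that re-weights by AN status only; the analysis described above additionally reflects the sex and age distribution of the cohort. The cohort sizes (230 participants with AN and 1667 − 230 = 1437 without) come from the text, whereas the function name and the 2×2 counts are hypothetical and are not study results.

```python
def weighted_predictive_values(tp, fn, fp, tn, cohort_cases, cohort_noncases):
    """Re-weight sampled 2x2 counts to cohort proportions, then derive PPV, NPV, positivity."""
    # Weights are proportional to (original fraction / sampling fraction) per stratum.
    w_case = cohort_cases / (tp + fn)
    w_non = cohort_noncases / (fp + tn)
    wtp, wfn = tp * w_case, fn * w_case
    wfp, wtn = fp * w_non, tn * w_non
    total = wtp + wfn + wfp + wtn
    return {
        "ppv": wtp / (wtp + wfp),
        "npv": wtn / (wtn + wfn),
        "positivity": (wtp + wfp) / total,
        "weighted_total": total,
    }


# Hypothetical counts among 216 sampled AN cases and 300 sampled participants without AN.
result = weighted_predictive_values(tp=65, fn=151, fp=15, tn=285,
                                    cohort_cases=230, cohort_noncases=1437)
print(result)
```

Without the weights, the oversampling of AN cases would inflate PPVs and positivity rates and deflate NPVs relative to what would be expected in the screening cohort.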
Generalized estimating equations (GEE) logistic regression models were used to derive pooled estimates including 95% CIs of the various measures of diagnostic performance by sex and by age across the nine FITs and to test for the associations of age and sex with diagnostic performance, taking FIT effects and dependency of observations within the same individuals into account. Statistical testing by sex and by age was conducted using the Wald test.
To assess the overall diagnostic performance within the clinically relevant segment of ≥80% specificity, partial areas under the curves (AUCs) were calculated in such a way that they became 50% for nondiscriminant and 100% for perfectly discriminating tests. Derivation of 95% CIs and testing for statistical significance of differences in partial AUCs were done using 2000 bootstrap replicates.
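A rescaling with these properties is the McClish standardization, which the pROC package offers through its `partial.auc.correct` option; whether exactly this option was used here is an assumption. The sketch below applies the standardization to a raw partial area over the specificity range 80–100%; the function name is illustrative.

```python
def standardized_partial_auc(raw_pauc: float, min_specificity: float = 0.80) -> float:
    """McClish-standardized partial AUC over specificities from min_specificity to 1.

    Returns 0.5 for a nondiscriminant test and 1.0 for a perfect test.
    """
    width = 1.0 - min_specificity      # false-positive-rate range 0..width
    max_area = width                   # perfect test: sensitivity 1 over the whole range
    min_area = width ** 2 / 2.0        # chance diagonal: triangle under sensitivity = FPR
    return 0.5 * (1.0 + (raw_pauc - min_area) / (max_area - min_area))


chance = standardized_partial_auc(0.02)   # raw area under the diagonal for width 0.2
perfect = standardized_partial_auc(0.20)  # raw area of a perfect test for width 0.2
print(chance, perfect)
```

On this scale, partial AUCs can be read like conventional AUCs, which makes differences of a few percentage points between groups or FIT brands easier to interpret.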
Two-sided p-values below 0.05 were considered statistically significant. The analyses of partial AUCs were conducted using R (version 3.6.0, R Core Team, Vienna, Austria) with the R package ‘pROC’ (version 1.16.2), whereas all other analyses and statistical tests were conducted using SAS Enterprise Guide (version 7.1, Cary, NC, USA).

3. Results

3.1. Study Population

The characteristics of all 1667 eligible participants of colonoscopy screening are shown in Table 1. The sample included approximately equal numbers of women and men. The age distribution was similar among both sexes, and most participants were between 50 and 64 years old. Advanced neoplasia was detected in 230 participants. Among these, colorectal cancer and advanced adenoma were the most advanced findings at screening colonoscopy for 16 (1.0%) and 214 (12.8%) participants, respectively. The overall AN prevalence was 13.8% in the total study population. It was higher among men (17.3%) than among women (10.0%) and in the age group 65–79 (17.6%) than in the age group 50–64 (11.3%).

3.2. Diagnostic Performance by Sex

Across all FITs and all assessed thresholds, sensitivities were consistently lower and specificities were consistently higher among females than among males (Table 2). At original threshold values, substantial differences in measures of diagnostic performance were observed among the different single FIT brands. However, when threshold values were adjusted to yield identical levels of overall specificity (to enhance the comparability between the FITs), no meaningful differences were observed among the different FIT brands. At original thresholds, pooled sensitivities were 25.7% vs. 34.6% (p = 0.12) and pooled specificities were 96.2% vs. 90.8% (p = 0.005) for females and males, respectively. Similar sex differences were observed at thresholds adjusted to yield equal overall specificities (95%) across the FITs. PPVs were similar between both sexes, but NPVs were consistently higher for females (pooled values 91.8% vs. 86.6%, p < 0.01) (Table 3). Differences in sensitivities diminished when using sex-specific cutoffs, whereas differences in PPVs increased. Pooled sex-specific differences in sensitivity, specificity, PPV, NPV, and positivity at original thresholds are summarized in Figure 1A.
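How a cutoff can be adjusted to a target specificity, overall or separately by sex, can be sketched as follows. This is an illustration under stated assumptions rather than the procedure used in the study: a result is counted as positive if it is at or above the cutoff, only observed values are considered as candidate cutoffs, and the haemoglobin values (µg/g) are invented.

```python
def cutoff_for_specificity(non_an_values, target_specificity=0.95):
    """Return the lowest observed value c such that calling results >= c positive
    yields a specificity of at least target_specificity among participants
    without AN, or None if no observed value qualifies."""
    n = len(non_an_values)
    if n == 0:
        raise ValueError("need at least one measurement")
    for c in sorted(set(non_an_values)):
        true_negatives = sum(v < c for v in non_an_values)
        # Small tolerance guards against floating-point rounding at the boundary.
        if true_negatives / n >= target_specificity - 1e-12:
            return c
    return None


# Hypothetical haemoglobin concentrations among participants without AN.
women_non_an = [0.0] * 15 + [1, 2, 3, 5, 12]
men_non_an = [0.0] * 12 + [1, 2, 4, 6, 9, 14, 20, 35]

women_cutoff = cutoff_for_specificity(women_non_an)
men_cutoff = cutoff_for_specificity(men_non_an)
print(women_cutoff, men_cutoff)
```

In this invented example, reaching the same specificity requires a higher cutoff among men, and a higher cutoff in turn lowers sensitivity, which is the mechanism by which sex-specific cutoffs reduce the sex difference in sensitivity.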
The overall diagnostic performance, measured by the partial AUC (between 80% and 100% specificity), showed no clinically relevant difference between men and women (Table S2). For four of the nine FITs, partial AUCs were slightly higher for men (up to 3.1%), whereas for the other five FITs, the partial AUCs were slightly higher for women (up to 2.4%), but none of these small differences reached statistical significance.

3.3. Diagnostic Performance by Age

Pooled age-specific differences in sensitivity, specificity, PPV, NPV, and positivity are summarized in Figure 1B. Differences in sensitivity and specificity between younger (50–64 years) and older (65–79 years) study participants were generally small (Table 4) and less consistent than differences according to sex. Similar results were observed at thresholds adjusted to yield an overall specificity of 95%, and none of these differences between both age groups was statistically significant.
PPVs were consistently higher among the older age group as compared with the younger age group (Table 5), but differences were not statistically significant. At original thresholds, pooled PPVs were 37.6% and 51.0% (p = 0.16) for the younger and older age groups, respectively. NPVs were consistently lower (by about 5%) for the older age group, and pooled estimates were statistically significantly different (p = 0.02) between both age groups, across all thresholds. Differences in NPVs were slightly larger when age-specific cutoffs were used (about 6%).
Partial AUCs were slightly higher (up to 2.7%) for the younger study group for five of the nine FITs, and for the other four FITs these estimates were slightly higher (up to 3.8%) for the older study group, but none of these differences was statistically significant (Table S2).

4. Discussion

In this study, we assessed the diagnostic performance for detection of AN of nine different quantitative FITs according to sex and age, using stool samples collected from average-risk participants of screening colonoscopy. Even when adjusting FIT cutoffs to yield equal specificity in the entire study population, pooled sensitivities were consistently higher, whereas pooled specificities were statistically significantly lower among males as compared with females. Pooled PPVs were very similar between both sexes. By contrast, pooled NPVs were statistically significantly lower for males, suggesting that a negative FIT is less reliable in ruling out AN among men than among women. When using sex-specific cutoffs with respect to specificities, differences in sensitivities by sex diminished, whereas differences in PPVs and NPVs became greater. According to age, pooled sensitivities and specificities were very similar between both age groups, but pooled PPVs were consistently higher and pooled NPVs were statistically significantly lower for the older age group.
To the best of our knowledge, this is the first study to assess diagnostic performance of several different FIT brands in parallel for detection of AN by sex among participants of colonoscopy screening. There are only a few previous studies that have investigated FIT performance for AN detection with respect to sex [19,20,21,22]. Each of these studies assessed only one specific FIT brand and consistently reported higher sensitivity for men than for women, although the magnitude of the difference varied among studies. Specificities were generally lower among men, but sex differences were generally smaller for specificity, varying only by a few percent units. It was unclear, however, to what extent differences in the magnitude of sex- and age-specific variations were due to differences in study populations and age groups or to differences in FIT brands assessed in these studies. Our study demonstrates across several different FIT brands consistently higher sensitivities (by 3–13% units) along with consistently lower specificities (by 2–10% units) for men as compared with women at original cutoffs. Differences persisted when adjusting to equal specificity in the entire study population, but diminished when using equal specificities among women and among men, respectively.
It remains to be investigated by future studies if age and sex are similarly, or possibly more strongly, associated with FIT performance among symptomatic patients than among screening participants. Symptomatic patients may comprise a very heterogeneous group and it is conceivable that differences in performance characteristics vary in strength or even direction across heterogeneous symptomatic groups (e.g., those reporting abdominal pain vs. change in bowel habits). In both symptomatic and screening populations, it should be considered that age and sex may interact or be associated with other covariates potentially influencing FIT results. For example, intake of proton pump inhibitors (PPIs) has been suggested to be associated with reduced accuracy of FIT by some [23,24] but not all [25] studies. Furthermore, interactions with sex or intake of other drugs have been suggested [26].
The reasons for the higher sensitivity and lower specificity of FITs among men than among women at equal cutoffs remain to be fully explored. Possible reasons may include the higher proportion of AN located in the distal colon and rectum, which is more frequently detected by FIT than proximal AN [27,28], higher rates of aspirin use for cardiovascular prevention [29], and a shorter colonic transit time [30] that may be associated with less Hb degradation prior to defecation. The higher positivity rate and the lower NPV among men might also be partly explained by the higher prevalence of AN among men than among women (17.3% versus 10.0% in our study population).
We are aware of only three previous studies that assessed the FIT performance for AN detection among participants of colonoscopy screening according to age [20,22,31]. Again, each of these studies assessed only one specific FIT brand. Furthermore, they were conducted in very different study populations, used different age categorizations and yielded inconsistent results. In our study, no consistent differences in sensitivity and specificity were found across nine different FIT brands evaluated in parallel in the same study population. However, PPVs consistently tended to be higher and NPVs consistently tended to be lower among older as compared with younger age groups. Given the lack of differences in sensitivity and specificity, the differences in PPVs and NPVs most likely are due to differences in prevalence of AN, which is higher in older than in younger participants of colonoscopy screening (17.6% versus 11.3% in our study population).
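The dependence of predictive values on prevalence follows directly from Bayes' theorem and can be illustrated with a short sketch. The two prevalence values are those reported for the age groups in our study population; the sensitivity and specificity are hypothetical round numbers chosen for illustration and are not estimates from this study.

```python
def predictive_values(sensitivity: float, specificity: float, prevalence: float):
    """Return (PPV, NPV) for a test applied at the given prevalence."""
    tp = sensitivity * prevalence
    fn = (1.0 - sensitivity) * prevalence
    fp = (1.0 - specificity) * (1.0 - prevalence)
    tn = specificity * (1.0 - prevalence)
    return tp / (tp + fp), tn / (tn + fn)


# Hypothetical test characteristics, held equal in both age groups.
SENSITIVITY = 0.30
SPECIFICITY = 0.93

ppv_younger, npv_younger = predictive_values(SENSITIVITY, SPECIFICITY, prevalence=0.113)
ppv_older, npv_older = predictive_values(SENSITIVITY, SPECIFICITY, prevalence=0.176)
print(round(ppv_younger, 3), round(npv_younger, 3))
print(round(ppv_older, 3), round(npv_older, 3))
```

With identical sensitivity and specificity, the group with the higher prevalence has the higher PPV and the lower NPV, which is the pattern observed by age.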
Finally, but importantly, eight out of nine FITs yielded very similar overall measures of diagnostic performance, as quantified by partial AUCs. Slightly lower partial AUCs were observed only for QuikRead go iFOBT, but this observation was caused by the lower analytical working limit being unusually high (15 µg/g) for this FIT. Furthermore, partial AUCs were very similar between sexes and age groups for each of the nine FIT brands. Therefore, no clinically relevant differences in overall diagnostic performance by sex and age were observed between FITs from different manufacturers.
A major strength of our study is the parallel measurement of the faecal haemoglobin concentration across nine different quantitative FITs under the same preanalytical and analytical test requirements in a colonoscopy-controlled study setting. The few previous studies that assessed the diagnostic performance according to sex [19,20,21,22] and age [20,22] included only a single FIT brand each. A further strength is that stool samples were collected from participants of colonoscopy screening prior to bowel preparation and the samples were stored in the same manner until parallel laboratory test analysis. Furthermore, the results of colonoscopy screening with adequate bowel preparation served as a reference standard to calculate diagnostic performance. In order to enhance comparability of diagnostic performance by sex and age across different FITs, we adjusted the cutoffs to yield equal overall specificities.
Our study also has limitations. Although more than 2000 participants of colonoscopy screening were recruited, the limited overall number of AN cases (n = 216) and randomly selected controls (n = 300) did not allow for in-depth analyses for each sex- and age-specific subset, for example, stratified by adenoma location. Despite the limited numbers, the pooled results of the GEE model revealed statistically significant differences in overall specificity, NPV, and positivity rate. The suggested differences in sensitivities warrant further research with larger case numbers. Future studies should also investigate potential mechanisms by which FIT sensitivity varies across groups of participants, for example, prevalence of anemia according to age and sex.

5. Conclusions

In conclusion, we observed consistently higher sensitivities and lower specificities among males than among females across a number of different FITs and a broad range of threshold values. Furthermore, the analyses among men yielded consistently higher positivity rates, comparable PPVs, and lower NPVs than among women. According to age, no major differences in sensitivity and specificity were observed, but positive and negative predictive values differed, probably reflecting differences in AN prevalence between age groups. A negative FIT is less reliable for ruling out AN among men than among women and among older than among younger individuals. Further studies should address if, and to what extent, the sex- and age-specific differences might be relevant for the design of screening offers and interpretation of FIT results in various groups of screening participants, for example, by using sex- and age-specific cutoffs. These questions could, for example, be addressed in comprehensive modelling studies for which our results provide important input information. The diagnostic performance of FITs should be interpreted and compared with caution among studies with different sex or age distributions.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/cancers13143574/s1, Table S1: Test characteristics, Table S2: Partial area under the curve (% (95% CI)) for detection of advanced neoplasia by sex and by age.

Author Contributions

Conceptualization, A.G. and H.B.; methodology, A.G. and T.H. (Thomas Hielscher); formal analysis, A.G. and T.H. (Thomas Hielscher); data curation, A.G. and H.B.; writing—original draft preparation, A.G. and T.N.; writing—review and editing, H.B., T.N., E.A., K.W., T.H. (Thomas Heisser), P.S.-K. and M.H.; visualization, A.G.; supervision, H.B.; project administration, H.B.; funding acquisition, H.B. All authors have read and agreed to the published version of the manuscript.

Funding

The BLITZ study was partly funded by grants from the German Research Council (DFG, grant number BR1704/16-1). This work was partly funded by the German Federal Ministry of Education and Research (BMBF, grant number 01GL1712). The funder had no role in the study’s design, conduct, interpretation and reporting.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Ethics Committee of University of Heidelberg (178/2005) and those of the state chambers of physicians of Baden-Wuerttemberg (M118-05-f), Rhineland-Palatinate (837.047.06(5145)) and Saarland (217/13).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data supporting reported results can be made available upon reasonable request.

Acknowledgments

The authors thank Katarina Cuk for her excellent help in planning and conducting the project. They also thank Sabine Eichenherr, Romana Kimmel and Ulrike Schlesselmann for their excellent work in laboratory preparation of the stool samples and Volker Herrmann for his help in preparing the project.

Conflicts of Interest

The authors declare no conflict of interest. The funder had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2018, 68, 394–424.
  2. European Colorectal Cancer Screening Guidelines Working Group: European guidelines for quality assurance in colorectal cancer screening and diagnosis: Overview and introduction to the full Supplement publication. Endoscopy 2012, 45, 51–59.
  3. Wolf, A.M.D.; Fontham, E.T.H.; Church, T.R.; Flowers, C.R.; Guerra, C.E.; LaMonte, S.J.; Etzioni, R.; McKenna, M.T.; Oeffinger, K.C.; Shih, Y.-C.T.; et al. Colorectal cancer screening for average-risk adults: 2018 guideline update from the American Cancer Society. CA Cancer J. Clin. 2018, 68, 250–281.
  4. Schreuders, E.H.; Ruco, A.; Rabeneck, L.; Schoen, R.E.; Sung, J.J.Y.; Young, G.; Kuipers, E.J. Colorectal cancer screening: A global overview of existing programmes. Gut 2015, 64, 1637–1649.
  5. Senore, C.; Basu, P.; Anttila, A.; Ponti, A.; Tomatis, M.; Vale, D.B.; Ronco, G.; Soerjomataram, I.; Primic-Žakelj, M.; Riggi, E.; et al. Performance of colorectal cancer screening in the European Union Member States: Data from the second European screening report. Gut 2018, 68, 1232–1244.
  6. Gies, A.; Bhardwaj, M.; Stock, C.; Schrotz-King, P.; Brenner, H. Quantitative fecal immunochemical tests for colorectal cancer screening. Int. J. Cancer 2018, 143, 234–244.
  7. Imperiale, T.F.; Gruber, R.N.; Stump, T.E.; Emmett, T.W.; Monahan, P.O. Performance Characteristics of Fecal Immunochemical Tests for Colorectal Cancer and Advanced Adenomatous Polyps: A Systematic Review and Meta-analysis. Ann. Intern. Med. 2019, 170, 319–329.
  8. Selby, K.; Levine, E.H.; Doan, C.; Gies, A.; Brenner, H.; Quesenberry, C.; Lee, J.K.; Corley, D.A. Effect of Sex, Age, and Positivity Threshold on Fecal Immunochemical Test Accuracy: A Systematic Review and Meta-analysis. Gastroenterology 2019, 157, 1494–1505.
  9. Bossuyt, P.M.; Reitsma, J.B.; Bruns, E.D.; Gatsonis, A.C.; Glasziou, P.; Irwig, L.; Lijmer, J.G.; Moher, D.; Rennie, D.; De Vet, H.C.W.; et al. STARD 2015: An updated list of essential items for reporting diagnostic accuracy studies. BMJ 2015, 351, h5527.
  10. Fraser, C.G.; Allison, J.E.; Young, G.P.; Halloran, S.P.; Seaman, H.E. Improving the reporting of evaluations of faecal immunochemical tests for haemoglobin. Eur. J. Cancer Prev. 2015, 24, 24–26.
  11. Gies, A.; Cuk, K.; Schrotz-King, P.; Brenner, H. Direct Comparison of Diagnostic Performance of 9 Quantitative Fecal Immunochemical Tests for Colorectal Cancer Screening. Gastroenterology 2018, 154, 93–104.
  12. Gies, A.; Cuk, K.; Schrotz-King, P.; Brenner, H. Combination of Different Fecal Immunochemical Tests in Colorectal Cancer Screening: Any Gain in Diagnostic Performance? Cancers 2019, 11, 120.
  13. Hundt, S.; Haug, U.; Brenner, H. Comparative evaluation of immunochemical fecal occult blood tests for colorectal adenoma detection. Ann. Intern. Med. 2009, 150, 162–169.
  14. Brenner, H.; Tao, S.; Haug, U. Low-Dose Aspirin Use and Performance of Immunochemical Fecal Occult Blood Tests. JAMA 2010, 304, 2513–2520.
  15. Brenner, H.; Tao, S. Superior diagnostic performance of faecal immunochemical tests for haemoglobin in a head-to-head comparison with guaiac based faecal occult blood test among 2235 participants of screening colonoscopy. Eur. J. Cancer 2013, 49, 3049–3054.
  16. Gies, A.; Niedermaier, T.; Weigl, K.; Schrotz-King, P.; Hoffmeister, M.; Brenner, H. Effect of long-term frozen storage and thawing of stool samples on faecal haemoglobin concentration and diagnostic performance of faecal immunochemical tests. Clin. Chem. Lab. Med. 2019, 58, 390–398.
  17. Rosenfield, R.E.; Kochwa, S.; Kaczera, Z.; Maimon, J. Nonuniform Distribution of Occult Blood in Feces. Am. J. Clin. Pathol. 1979, 71, 204–209.
  18. Fraser, C.G.; Allison, E.J.; Halloran, S.P.; Young, G. A Proposal to Standardize Reporting Units for Fecal Immunochemical Tests for Hemoglobin. J. Natl. Cancer Inst. 2012, 104, 810–814.
  19. Brenner, H.; Haug, U.; Hundt, S. Sex Differences in Performance of Fecal Occult Blood Testing. Am. J. Gastroenterol. 2010, 105, 2457–2464.
  20. Bakker, C.A.K.-D.; Jonkers, D.M.; Sanduleanu-Dascalescu, S.; De Bruïne, A.P.; Meijer, G.A.; Janssen, J.B.; Van Engeland, M.; Stockbrügger, R.W.; Masclee, A.A. Test Performance of Immunologic Fecal Occult Blood Testing and Sigmoidoscopy Compared with Primary Colonoscopy Screening for Colorectal Advanced Adenomas. Cancer Prev. Res. 2011, 4, 1563–1571.
  21. Grobbee, E.J.; Wieten, E.; Hansen, B.E.; Stoop, E.M.; De Wijkerslooth, T.R.; Lansdorp-Vogelaar, I.; Bossuyt, P.M.; Dekker, E.; Kuipers, E.J.; Spaander, M.C.W. Fecal immunochemical test-based colorectal cancer screening: The gender dilemma. United Eur. Gastroenterol. J. 2017, 5, 448–454.
  22. Brenner, H.; Qian, J.; Werner, S. Variation of diagnostic performance of fecal immunochemical testing for hemoglobin by sex and age: Results from a large screening cohort. Clin. Epidemiol. 2018, 10, 381–389.
  23. Alonso, L.R.; Moranta, F.R.; Arajol, C.; Gilabert, P.; Serra, K.; Martin-Cardona, A.; Ibáñez-Sanz, G.; Moreno, V.; Guardiola, J. Proton pump inhibitors reduce the accuracy of faecal immunochemical test for detecting advanced colorectal neoplasia in symptomatic patients. PLoS ONE 2018, 13, e0203359.
  24. Ibáñez-Sanz, G.; Milà, N.; de la Peña-Negro, L.C.; Garcia, M.; Vidal, C.; Rodríguez-Alonso, L.; Binefa, G.; Rodríguez-Moranta, F.; Moreno, V. Proton-pump inhibitors are associated with a high false-positivity rate in faecal immunochemical testing. J. Gastroenterol. 2021, 56, 42–53.
  25. Chandrapalan, S.; Hee, S.W.; Widlak, M.M.; Farrugia, A.; Alam, M.T.; Smith, S.; Arasaradnam, R.P. Performance of the faecal immunochemical test for the detection of colorectal neoplasms and the role of proton pump inhibitors in their diagnostic accuracy. Colorectal Dis. 2021.
  26. Arnal, D.M.J.; Garcia Mateo, S.; Hermoso-Duran, S.; Abad, D.; Carrera-Lasfuentes, P.; Velazquez-Campoy, A.; Abian Franco, O.; Lanas, A. False-positive fecal immunochemical test results in colorectal cancer screening and gastrointestinal drug use. Int. J. Colorectal Dis. 2021.
  27. Brenner, H.; Niedermaier, T.; Chen, H. Strong subsite-specific variation in detecting advanced adenomas by fecal immunochemical testing for hemoglobin. Int. J. Cancer 2017, 140, 2015–2022.
  28. Niedermaier, T.; Weigl, K.; Hoffmeister, M.; Brenner, H. Diagnostic performance of flexible sigmoidoscopy combined with fecal immunochemical test in colorectal cancer screening: Meta-analysis and modeling. Eur. J. Epidemiol. 2017, 32, 481–493. [Google Scholar] [CrossRef]
  29. Brenner, H.; Calderazzo, S.; Seufferlein, T.; Ludwig, L.; Dikopoulos, N.; Mangold, J.; Böck, W.; Stolz, T.; Eisenbach, T.; Block, T.; et al. Effect of a Single Aspirin Dose Prior to Fecal Immunochemical Testing on Test Sensitivity for Detecting Advanced Colorectal Neoplasms: A Randomized Clinical Trial. JAMA 2019, 321, 1686–1692. [Google Scholar] [CrossRef]
  30. Sadik, R.; Abrahamsson, H.; Stotzer, P.-O. Gender Differences in Gut Transit Shown with a Newly Developed Radiological Procedure. Scand. J. Gastroenterol. 2003, 38, 36–42. [Google Scholar] [CrossRef]
  31. Kim, N.H.; Park, J.H.; Park, D.I.; Sohn, C.I.; Choi, K.; Jung, Y.S. The fecal immunochemical test has high accuracy for detecting advanced colorectal neoplasia before age 50. Dig. Liver Dis. 2017, 49, 557–561. [Google Scholar] [CrossRef]
Figure 1. Diagnostic performance parameters by sex (panel A) and age (panel B) for detection of advanced neoplasia pooled across 9 FIT brands at original positivity thresholds.
Table 1. Characteristics of the study population.
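
| Characteristic | | n | Col % | FIT Measurements: n | FIT Measurements: Row % |
|---|---|---|---|---|---|
| Sex | Women | 806 | 48.4 | | |
| | Men | 861 | 51.6 | | |
| Age | 50–64 | 1014 | 60.8 | | |
| | 65–79 | 653 | 39.2 | | |
| Advanced neoplasia | Yes | 230 | 13.8 | 216 * | 93.9 * |
| | Colorectal cancer | 16 | 1.0 | 16 * | |
| | Advanced adenoma | 214 | 12.8 | 200 * | |
| | No | 1437 | 86.2 | 300 ** | 20.9 ** |
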
* All participants with sufficient stool for conducting 9 FITs. ** Random sample.
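
The row percentages in Table 1 show that nearly all participants with advanced neoplasia, but only about one in five without, had FIT measurements, so cases are overrepresented in the FIT subsample. The sketch below recomputes the two row percentages from the table counts and derives inverse-sampling-fraction weights. Weighting of this kind is a common approach and is an assumption here, not a description of the authors' procedure; the function names are illustrative.

```python
# Counts from Table 1: (participants in the screening cohort, participants with FIT measurements).
TABLE_1_COUNTS = {
    "advanced neoplasia": (230, 216),
    "no advanced neoplasia": (1437, 300),
}


def sampling_fraction(n_cohort: int, n_measured: int) -> float:
    """Share of a cohort stratum that has measurements for all nine FITs."""
    return n_measured / n_cohort


def sampling_weight(n_cohort: int, n_measured: int) -> float:
    """Inverse of the sampling fraction (assumed weighting scheme, for illustration only)."""
    return n_cohort / n_measured


for stratum, (n_cohort, n_measured) in TABLE_1_COUNTS.items():
    fraction = sampling_fraction(n_cohort, n_measured)
    weight = sampling_weight(n_cohort, n_measured)
    print(f"{stratum}: row % = {100 * fraction:.1f}, weight = {weight:.2f}")
```

Under this assumed scheme, each sampled participant without advanced neoplasia would stand for roughly five cohort members, and each sampled case for roughly one.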
Table 2. Sensitivity and specificity for detection of advanced neoplasia by sex.
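
| FIT Brand | Sensitivity (%): Female | Sensitivity (%): Male | Diff. | p | Specificity (%): Female | Specificity (%): Male | Diff. | p |
|---|---|---|---|---|---|---|---|---|
| At original thresholds recommended by the manufacturers | | | | | | | | |
| IDK Hb ELISA | 40.5 | 49.3 | −8.8 | 0.22 | 90.3 | 80.7 | +9.6 | 0.02 |
| QuantOn Hem | 36.5 | 48.6 | −12.1 | 0.09 | 89.0 | 82.1 | +6.9 | 0.09 |
| immoCARE-C | 33.8 | 41.1 | −7.3 | 0.29 | 92.9 | 86.9 | +6.0 | 0.09 |
| CAREprime | 28.4 | 38.0 | −9.6 | 0.16 | 95.5 | 86.9 | +8.6 | 0.01 |
| RIDASCREEN Hb | 32.4 | 43.0 | −10.6 | 0.13 | 94.8 | 86.2 | +8.6 | 0.01 |
| Eurolyser FOB test | 17.6 | 25.4 | −7.8 | 0.20 | 98.7 | 95.2 | +3.5 | 0.09 |
| OC-Sensor | 17.6 | 23.9 | −6.3 | 0.28 | 99.4 | 95.9 | +3.5 | 0.08 |
| QuikRead go iFOBT | 14.9 | 25.4 | −10.5 | 0.08 | 98.7 | 94.5 | +4.2 | 0.06 |
| SENTiFIT-FOB Gold | 17.6 | 23.9 | −6.3 | 0.28 | 98.7 | 93.8 | +4.9 | 0.04 |
| GEE-Model | 25.7 | 34.6 | −8.9 | 0.12 | 96.2 | 90.8 | +5.4 | 0.005 |
| At adjusted thresholds yielding 95% specificity among all study participants | | | | | | | | |
| IDK Hb ELISA | 25.7 | 33.8 | −8.1 | 0.22 | 96.8 | 93.1 | +3.7 | 0.15 |
| QuantOn Hem | 24.3 | 27.5 | −3.2 | 0.62 | 95.5 | 94.5 | +1.0 | 0.69 |
| immoCARE-C | 27.0 | 34.0 | −7.0 | 0.29 | 96.8 | 93.1 | +3.7 | 0.15 |
| CAREprime | 18.9 | 28.2 | −9.3 | 0.14 | 98.1 | 91.7 | +6.4 | 0.02 |
| RIDASCREEN Hb | 25.7 | 36.6 | −10.9 | 0.11 | 96.1 | 93.8 | +2.3 | 0.36 |
| Eurolyser FOB test | 21.6 | 28.2 | −6.6 | 0.30 | 98.1 | 91.7 | +6.4 | 0.02 |
| OC-Sensor | 20.3 | 28.9 | −8.6 | 0.17 | 97.4 | 92.4 | +5.0 | 0.06 |
| QuikRead go iFOBT | 14.9 | 25.4 | −10.5 | 0.08 | 98.7 | 94.5 | +4.2 | 0.06 |
| SENTiFIT-FOB Gold | 23.0 | 28.9 | −5.9 | 0.35 | 98.1 | 91.7 | +6.4 | 0.02 |
| GEE-Model | 22.4 | 30.2 | −7.8 | 0.18 | 97.3 | 93.0 | +4.3 | 0.04 |
| At adjusted thresholds yielding 95% specificity among women and among men, respectively | | | | | | | | |
| IDK Hb ELISA | 28.4 | 24.6 | 3.8 | 0.55 | 94.8 | 95.2 | −0.4 | 0.89 |
| QuantOn Hem | 29.7 | 25.4 | 4.3 | 0.49 | 94.8 | 95.2 | −0.4 | 0.89 |
| immoCARE-C | 32.4 | 23.4 | 9.0 | 0.16 | 94.8 | 95.2 | −0.4 | 0.89 |
| CAREprime | 31.1 | 23.9 | 7.2 | 0.26 | 94.8 | 95.2 | −0.4 | 0.89 |
| RIDASCREEN Hb | 32.4 | 24.6 | 7.8 | 0.22 | 94.8 | 95.2 | −0.4 | 0.89 |
| Eurolyser FOB test | 25.7 | 26.1 | −0.4 | 0.95 | 96.1 | 95.2 | +0.9 | 0.68 |
| OC-Sensor | 27.0 | 26.1 | 0.9 | 0.88 | 94.8 | 95.2 | −0.4 | 0.89 |
| QuikRead go iFOBT | 14.9 | 25.4 | −10.5 | 0.08 | 98.7 | 95.1 | +3.6 | 0.09 |
| SENTiFIT-FOB Gold | 24.3 | 23.9 | 0.4 | 0.95 | 96.8 | 95.2 | +1.6 | 0.48 |
| GEE-Model | 27.3 | 24.8 | 2.5 | 0.66 | 95.6 | 95.2 | +0.4 | 0.82 |
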
Abbreviations: CI, confidence interval; FIT, faecal immunochemical test; GEE, generalized estimating equations; Hb, haemoglobin. Bold numerals: statistically significant differences (p < 0.05).
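
Table 2 reports results at thresholds adjusted to yield 95% specificity. One simple way to obtain such a threshold is to take an empirical quantile of the haemoglobin concentrations measured among participants without advanced neoplasia. The sketch below assumes that approach; how the adjusted thresholds were actually derived is not stated in this section, the function names are illustrative, and the concentrations are synthetic values in arbitrary units, not study data.

```python
def threshold_for_specificity(hb_without_an: list[float], target_percent: int = 95) -> float:
    """Smallest observed Hb value c such that at least target_percent % of
    participants without AN have Hb <= c (a result counts as positive if Hb > c)."""
    ordered = sorted(hb_without_an)
    n = len(ordered)
    # Integer ceiling of target_percent * n / 100 avoids floating-point rounding.
    needed = max(1, (target_percent * n + 99) // 100)
    return ordered[needed - 1]


def specificity(hb_without_an: list[float], cutoff: float) -> float:
    """Proportion of participants without AN whose result is negative (Hb <= cutoff)."""
    negatives = sum(1 for hb in hb_without_an if hb <= cutoff)
    return negatives / len(hb_without_an)


# Synthetic Hb concentrations for 20 participants without AN (hypothetical, not study data).
synthetic_hb = [0.0] * 12 + [1.5, 2.0, 3.1, 4.8, 7.9, 12.4, 25.0, 88.0]

cutoff = threshold_for_specificity(synthetic_hb)
print(f"cutoff = {cutoff}, specificity = {specificity(synthetic_hb, cutoff):.2f}")
```

With tied concentrations at the cutoff, the achieved specificity can exceed the target, which is one reason a test can stay above 95% specificity after adjustment.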
Table 3. Positive (PPV) and negative (NPV) predictive values for detection of advanced neoplasia by sex.
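
| FIT Brand | PPV (% (95% CI)): Female | PPV (% (95% CI)): Male | Diff. | p | NPV (% (95% CI)): Female | NPV (% (95% CI)): Male | Diff. | p |
|---|---|---|---|---|---|---|---|---|
| At original thresholds recommended by the manufacturers | | | | | | | | |
| IDK Hb ELISA | 31.9 | 35.1 | −3.2 | 0.75 | 92.8 | 88.0 | 4.8 | 0.10 |
| QuantOn Hem | 27.4 | 36.7 | −9.3 | 0.37 | 92.3 | 88.1 | 4.2 | 0.16 |
| immoCARE-C | 35.5 | 39.7 | −4.2 | 0.73 | 92.3 | 87.3 | 5.0 | 0.09 |
| CAREprime | 41.6 | 38.6 | 3.0 | 0.83 | 92.0 | 86.7 | 5.3 | 0.08 |
| RIDASCREEN Hb | 41.1 | 40.1 | 1.0 | 0.94 | 92.3 | 87.5 | 4.8 | 0.10 |
| Eurolyser FOB test | 63.9 | 56.9 | 7.0 | 0.75 | 91.1 | 85.6 | 5.5 | 0.06 |
| OC-Sensor | 76.6 | 57.1 | 19.5 | 0.41 | 91.2 | 85.5 | 5.7 | 0.06 |
| QuikRead go iFOBT | 57.9 | 51.3 | 6.6 | 0.77 | 90.9 | 85.5 | 5.4 | 0.08 |
| SENTiFIT-FOB Gold | 63.1 | 47.4 | 15.7 | 0.47 | 91.1 | 85.2 | 5.9 | 0.05 |
| GEE-Model | 44.2 | 44.3 | −0.1 | 1.00 | 91.8 | 86.6 | 5.2 | 0.01 |
| At adjusted thresholds yielding 95% specificity among all study participants | | | | | | | | |
| IDK Hb ELISA | 46.8 | 50.9 | −4.1 | 0.80 | 92.1 | 87.0 | 5.1 | 0.08 |
| QuantOn Hem | 37.1 | 51.8 | −14.7 | 0.36 | 91.8 | 86.2 | 5.6 | 0.05 |
| immoCARE-C | 48.2 | 50.4 | −2.2 | 0.89 | 92.2 | 87.1 | 5.1 | 0.07 |
| CAREprime | 51.8 | 40.7 | 11.1 | 0.56 | 91.5 | 85.9 | 5.6 | 0.05 |
| RIDASCREEN Hb | 42.3 | 55.2 | −12.9 | 0.41 | 92.0 | 87.6 | 4.4 | 0.12 |
| Eurolyser FOB test | 55.3 | 41.1 | 14.2 | 0.44 | 91.8 | 85.9 | 5.9 | 0.04 |
| OC-Sensor | 46.4 | 44.3 | 2.1 | 0.91 | 91.6 | 86.1 | 5.5 | 0.06 |
| QuikRead go iFOBT | 55.8 | 48.1 | 7.7 | 0.73 | 91.2 | 85.8 | 5.4 | 0.06 |
| SENTiFIT-FOB Gold | 56.9 | 41.3 | 15.6 | 0.39 | 91.9 | 86.0 | 5.9 | 0.04 |
| GEE-Model | 47.6 | 46.9 | 0.7 | 0.96 | 91.8 | 86.4 | 5.4 | 0.005 |
| At adjusted thresholds yielding 95% specificity among women and among men, respectively | | | | | | | | |
| IDK Hb ELISA | 37.8 | 50.9 | −13.1 | 0.41 | 92.2 | 85.7 | 6.5 | 0.03 |
| QuantOn Hem | 38.8 | 52.8 | −14.0 | 0.37 | 92.3 | 85.9 | 6.4 | 0.03 |
| immoCARE-C | 41.0 | 49.4 | −8.4 | 0.59 | 92.6 | 85.6 | 7.0 | 0.02 |
| CAREprime | 41.0 | 50.2 | −9.2 | 0.52 | 92.5 | 85.6 | 6.9 | 0.02 |
| RIDASCREEN Hb | 41.1 | 50.9 | −9.8 | 0.53 | 92.6 | 85.7 | 6.9 | 0.02 |
| Eurolyser FOB test | 42.4 | 51.7 | −8.3 | 0.58 | 92.0 | 86.0 | 6.0 | 0.04 |
| OC-Sensor | 36.6 | 51.7 | −15.1 | 0.34 | 92.1 | 86.0 | 6.1 | 0.04 |
| QuikRead go iFOBT | 55.8 | 51.6 | 4.2 | 0.85 | 91.2 | 85.9 | 5.3 | 0.07 |
| SENTiFIT-FOB Gold | 45.6 | 49.0 | −3.4 | 0.84 | 91.9 | 85.6 | 6.3 | 0.03 |
| GEE-Model | 40.9 | 50.9 | −10.0 | 0.44 | 92.1 | 85.8 | 6.3 | 0.001 |
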
Abbreviations: CI, confidence interval; FIT, faecal immunochemical test; GEE, generalized estimating equations; Hb, haemoglobin. Bold numerals: statistically significant differences (p < 0.05).
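
Predictive values depend on the prevalence of advanced neoplasia as well as on sensitivity and specificity. The sketch below applies Bayes' theorem to the pooled (GEE-Model) female sensitivity and specificity at original thresholds from Table 2. Sex-specific prevalences are not reported in these tables, so the prevalence values are assumptions: 13.8% is the overall prevalence from Table 1 and the others are hypothetical. The output is therefore not expected to reproduce Table 3, and the function name is illustrative.

```python
def predictive_values(sensitivity: float, specificity: float, prevalence: float) -> tuple[float, float]:
    """Return (PPV, NPV) via Bayes' theorem; all arguments are proportions in [0, 1]."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos), true_neg / (true_neg + false_neg)


# Pooled (GEE-Model) values for females at original thresholds, taken from Table 2.
SENSITIVITY, SPECIFICITY = 0.257, 0.962

# 0.138 is the overall AN prevalence from Table 1; the other prevalences are hypothetical.
for prevalence in (0.05, 0.10, 0.138, 0.20):
    ppv, npv = predictive_values(SENSITIVITY, SPECIFICITY, prevalence)
    print(f"prevalence {prevalence:.1%}: PPV {ppv:.1%}, NPV {npv:.1%}")
```

At fixed sensitivity and specificity, NPV falls and PPV rises as prevalence increases, so differences in predictive values between groups can reflect differences in prevalence as well as in test accuracy.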
Table 4. Sensitivity and specificity for detection of advanced neoplasia by age.
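
| FIT Brand | Sensitivity (% (95% CI)): 50–64 Years | Sensitivity (% (95% CI)): 65–79 Years | Diff. | p | Specificity (% (95% CI)): 50–64 Years | Specificity (% (95% CI)): 65–79 Years | Diff. | p |
|---|---|---|---|---|---|---|---|---|
| At original thresholds recommended by the manufacturers | | | | | | | | |
| IDK Hb ELISA | 44.0 | 48.6 | −4.6 | 0.50 | 88.4 | 81.5 | 6.9 | 0.10 |
| QuantOn Hem | 43.1 | 45.8 | −2.7 | 0.69 | 86.2 | 84.9 | 1.3 | 0.75 |
| immoCARE-C | 37.0 | 40.2 | −3.2 | 0.64 | 90.6 | 89.1 | 1.5 | 0.67 |
| CAREprime | 32.1 | 37.4 | −5.3 | 0.42 | 91.2 | 91.6 | −0.4 | 0.90 |
| RIDASCREEN Hb | 35.8 | 43.0 | −7.2 | 0.28 | 93.4 | 86.6 | 6.8 | 0.05 |
| Eurolyser FOB test | 21.1 | 24.3 | −3.2 | 0.58 | 96.1 | 98.3 | −2.2 | 0.29 |
| OC-Sensor | 21.1 | 22.4 | −1.3 | 0.81 | 97.2 | 98.3 | −1.1 | 0.55 |
| QuikRead go iFOBT | 22.0 | 21.5 | 0.5 | 0.93 | 96.1 | 97.5 | −1.4 | 0.53 |
| SENTiFIT-FOB Gold | 22.9 | 20.6 | 2.3 | 0.67 | 95.6 | 97.5 | −1.9 | 0.40 |
| GEE-Model | 29.1 | 30.9 | −1.8 | 0.73 | 94.3 | 93.8 | 0.5 | 0.76 |
| At adjusted thresholds yielding 95% specificity among all study participants | | | | | | | | |
| IDK Hb ELISA | 28.4 | 33.6 | −5.2 | 0.41 | 95.6 | 94.1 | 1.5 | 0.57 |
| QuantOn Hem | 24.8 | 28.0 | −3.2 | 0.59 | 95.6 | 94.1 | 1.5 | 0.57 |
| immoCARE-C | 28.7 | 34.6 | −5.9 | 0.35 | 95.6 | 94.1 | 1.5 | 0.57 |
| CAREprime | 22.9 | 27.1 | −4.2 | 0.48 | 94.5 | 95.8 | −1.3 | 0.61 |
| RIDASCREEN Hb | 30.3 | 35.5 | −5.2 | 0.41 | 95.6 | 94.1 | 1.5 | 0.57 |
| Eurolyser FOB test | 23.9 | 28.0 | −4.1 | 0.48 | 95.0 | 95.0 | 0.0 | 0.98 |
| OC-Sensor | 25.7 | 26.2 | −0.5 | 0.94 | 96.1 | 93.3 | 2.8 | 0.27 |
| QuikRead go iFOBT | 22.0 | 21.5 | 0.5 | 0.93 | 96.1 | 97.5 | −1.4 | 0.53 |
| SENTiFIT-FOB Gold | 23.9 | 29.9 | −6.0 | 0.32 | 94.5 | 95.8 | −1.3 | 0.61 |
| GEE-Model | 25.6 | 29.4 | −3.8 | 0.49 | 95.4 | 94.9 | 0.5 | 0.80 |
| At adjusted thresholds yielding 95% specificity among younger and older participants, respectively | | | | | | | | |
| IDK Hb ELISA | 30.3 | 21.5 | 8.8 | 0.14 | 95.0 | 95.0 | 0.0 | 0.98 |
| QuantOn Hem | 25.7 | 27.1 | −1.4 | 0.81 | 95.0 | 95.0 | 0.0 | 0.98 |
| immoCARE-C | 29.6 | 28.0 | 1.6 | 0.80 | 95.0 | 95.0 | 0.0 | 0.98 |
| CAREprime | 22.9 | 28.0 | −5.1 | 0.39 | 95.0 | 95.0 | 0.0 | 0.98 |
| RIDASCREEN Hb | 31.2 | 22.4 | 8.8 | 0.15 | 95.0 | 95.0 | 0.0 | 0.98 |
| Eurolyser FOB test | 23.9 | 33.6 | −9.7 | 0.11 | 95.0 | 95.0 | 0.0 | 0.98 |
| OC-Sensor | 27.5 | 23.4 | 4.1 | 0.48 | 95.0 | 95.0 | 0.0 | 0.98 |
| QuikRead go iFOBT | 22.0 | 21.5 | 0.5 | 0.93 | 96.1 | 97.5 | −1.4 | 0.53 |
| SENTiFIT-FOB Gold | 23.9 | 34.6 | −10.7 | 0.08 | 95.0 | 95.0 | 0.0 | 0.98 |
| GEE-Model | 26.3 | 26.7 | −0.4 | 0.95 | 95.2 | 95.2 | 0.0 | 0.97 |
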
Abbreviations: CI, confidence interval; FIT, faecal immunochemical test; GEE, generalized estimating equations; Hb, haemoglobin. Bold numerals: statistically significant differences (p < 0.05).
Table 5. Positive (PPV) and negative (NPV) predictive value for detection of advanced neoplasia by age.
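
| FIT Brand | PPV (% (95% CI)): 50–64 Years | PPV (% (95% CI)): 65–79 Years | Diff. | p | NPV (% (95% CI)): 50–64 Years | NPV (% (95% CI)): 65–79 Years | Diff. | p |
|---|---|---|---|---|---|---|---|---|
| At original thresholds recommended by the manufacturers | | | | | | | | |
| IDK Hb ELISA | 31.7 | 35.4 | −3.7 | 0.70 | 92.6 | 88.4 | 4.2 | 0.15 |
| QuantOn Hem | 26.7 | 37.5 | −10.8 | 0.26 | 92.2 | 88.2 | 4.0 | 0.17 |
| immoCARE-C | 32.3 | 43.1 | −10.8 | 0.35 | 92.0 | 87.8 | 4.2 | 0.14 |
| CAREprime | 31.5 | 49.4 | −17.9 | 0.15 | 91.4 | 87.6 | 3.8 | 0.20 |
| RIDASCREEN Hb | 40.3 | 40.9 | −0.6 | 0.96 | 92.0 | 87.9 | 4.1 | 0.15 |
| Eurolyser FOB test | 41.8 | 76.5 | −34.7 | 0.07 | 90.6 | 86.3 | 4.3 | 0.14 |
| OC-Sensor | 54.7 | 78.3 | −23.6 | 0.21 | 90.8 | 86.1 | 4.7 | 0.11 |
| QuikRead go iFOBT | 42.8 | 65.9 | −23.1 | 0.22 | 90.7 | 85.7 | 5.0 | 0.09 |
| SENTiFIT-FOB Gold | 42.9 | 67.2 | −24.3 | 0.19 | 90.8 | 85.7 | 5.1 | 0.08 |
| GEE-Model | 37.6 | 51.0 | −13.4 | 0.16 | 91.5 | 87.1 | 4.4 | 0.02 |
| At adjusted thresholds yielding 95% specificity among all study participants | | | | | | | | |
| IDK Hb ELISA | 44.4 | 55.0 | −10.6 | 0.48 | 91.2 | 86.9 | 4.3 | 0.13 |
| QuantOn Hem | 41.8 | 50.4 | −8.6 | 0.58 | 90.8 | 85.9 | 4.9 | 0.10 |
| immoCARE-C | 44.1 | 55.5 | −11.4 | 0.45 | 91.3 | 87.0 | 4.3 | 0.14 |
| CAREprime | 33.3 | 57.7 | −24.4 | 0.13 | 90.5 | 85.9 | 4.6 | 0.13 |
| RIDASCREEN Hb | 46.0 | 56.1 | −10.1 | 0.50 | 91.4 | 87.2 | 4.2 | 0.14 |
| Eurolyser FOB test | 36.6 | 54.2 | −17.6 | 0.26 | 90.7 | 86.0 | 4.7 | 0.12 |
| OC-Sensor | 44.6 | 45.2 | −0.6 | 0.97 | 91.0 | 85.5 | 5.5 | 0.07 |
| QuikRead go iFOBT | 40.8 | 64.3 | −23.5 | 0.21 | 90.6 | 85.3 | 5.3 | 0.08 |
| SENTiFIT-FOB Gold | 34.2 | 60.2 | −26.0 | 0.10 | 90.6 | 86.4 | 4.2 | 0.16 |
| GEE-Model | 40.5 | 54.9 | −14.4 | 0.25 | 90.9 | 86.2 | 4.7 | 0.02 |
| At adjusted thresholds yielding 95% specificity among younger and older participants, respectively | | | | | | | | |
| IDK Hb ELISA | 43.0 | 54.3 | −11.3 | 0.26 | 91.4 | 84.9 | 6.5 | 0.03 |
| QuantOn Hem | 39.6 | 53.3 | −13.7 | 0.38 | 90.9 | 85.9 | 5.0 | 0.09 |
| immoCARE-C | 41.9 | 53.9 | −12.0 | 0.44 | 91.4 | 86.0 | 5.4 | 0.07 |
| CAREprime | 35.7 | 54.1 | −18.4 | 0.24 | 90.6 | 86.0 | 4.6 | 0.13 |
| RIDASCREEN Hb | 43.7 | 48.6 | −4.9 | 0.76 | 91.5 | 85.1 | 6.4 | 0.03 |
| Eurolyser FOB test | 36.6 | 58.7 | −22.1 | 0.15 | 90.7 | 87.0 | 3.7 | 0.21 |
| OC-Sensor | 40.7 | 49.5 | −8.8 | 0.58 | 91.1 | 85.2 | 5.9 | 0.05 |
| QuikRead go iFOBT | 40.8 | 64.3 | −23.5 | 0.21 | 90.6 | 85.3 | 5.3 | 0.08 |
| SENTiFIT-FOB Gold | 36.6 | 59.4 | −22.8 | 0.14 | 90.7 | 87.1 | 3.6 | 0.23 |
| GEE-Model | 39.9 | 54.3 | −14.4 | 0.26 | 91.0 | 85.8 | 5.2 | 0.01 |
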
Abbreviations: CI, confidence interval; FIT, faecal immunochemical test; GEE, generalized estimating equations; Hb, haemoglobin. Bold numerals: statistically significant differences (p < 0.05).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gies, A.; Niedermaier, T.; Alwers, E.; Hielscher, T.; Weigl, K.; Heisser, T.; Schrotz-King, P.; Hoffmeister, M.; Brenner, H. Consistent Major Differences in Sex- and Age-Specific Diagnostic Performance among Nine Faecal Immunochemical Tests Used for Colorectal Cancer Screening. Cancers 2021, 13, 3574. https://doi.org/10.3390/cancers13143574
