Next Article in Journal
The Conceptual Independence of Health Status, Respiratory Symptoms and Dyspnea in Chronic Obstructive Pulmonary Disease in Real Clinical Practice
Next Article in Special Issue
The Association of suPAR with Cardiovascular Risk Factors in Young and Healthy Adults
Previous Article in Journal
Alternate Antimicrobial Therapies and Their Companion Tests
Previous Article in Special Issue
Concentrations of Serum Brain Injury Biomarkers Following SARS-CoV-2 Infection in Individuals with and without Long-COVID—Results from the Prospective Population-Based COVI-GAPP Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning in Antibody Diagnostics for Inflammatory Bowel Disease Subtype Classification

by
Christiane Sokollik
1,†,
Aurélie Pahud de Mortanges
2,†,
Alexander B. Leichtle
3,4,
Pascal Juillerat
5,6 and
Michael P. Horn
3,* on behalf of the Swiss IBD Cohort Study Group
1
Division of Pediatric Gastroenterology, Hepatology and Nutrition, University Children’s Hospital, Inselspital, University of Bern, 3010 Bern, Switzerland
2
ARTORG Center for Biomedical Engineering Research, University of Bern, 3010 Bern, Switzerland
3
Department of Clinical Chemistry, Inselspital, Bern University Hospital, University of Bern, 3010 Bern, Switzerland
4
Center for Artificial Intelligence in Medicine (CAIM), University of Bern, 3010 Bern, Switzerland
5
Department of Gastroenterology, Clinic for Visceral Surgery and Medicine, Inselspital, Bern University Hospital, University of Bern, 3010 Bern, Switzerland
6
Crohn’s and Colitis Center, Gastroenterology Beaulieu SA, 1004 Lausanne, Switzerland
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Diagnostics 2023, 13(15), 2491; https://doi.org/10.3390/diagnostics13152491
Submission received: 19 June 2023 / Revised: 21 July 2023 / Accepted: 25 July 2023 / Published: 26 July 2023
(This article belongs to the Special Issue Highlights in Swiss Laboratory Medicine 2023)

Abstract

:
Antibody testing in inflammatory bowel disease (IBD) can add to diagnostic accuracy of the main subtypes Crohn’s disease (CD) and ulcerative colitis (UC). Whether modern modeling techniques such as supervised and unsupervised machine learning are of value for finer distinction of subtypes such as IBD-unclassified (IBD-U) is not known. We determined the antibody profile of 100 adult IBD patients from the Swiss IBD cohort study with known subtype (50 CD, 50 UC) as well as of 76 IBD-U patients. We included ASCA IgG and IgA, p-ANCA, MPO- and PR3-ANCA, and xANCA measurements for computing different antibody panels as well as machine learning models. The AUC of an optimized antibody panel was 85% (95%CI, 78–92%) to distinguish CD from UC patients. The antibody profile of IBD-U patients was closely related to UC. No specific antibody profile was predictive for IBD-U nor for re-classification. The panel diagnostic was in favor of UC reclassification prediction with a correct assignment rate of 69.2–73.1% depending on the cut-off applied. Supervised machine learning could not distinguish between CD, UC, and IBD-U. More so, unsupervised machine learning suggested only two distinct clusters as a likely number of IBD subtypes. Antibodies in IBD are supportive in confirming clinical determined subtypes CD and UC but have limited capacity to predict IBD-U and reclassification during follow-up. In terms of antibody profiles, IBD-U is not a distinct subtype of IBD.

1. Introduction

The term inflammatory bowel disease (IBD) summarizes a spectrum of chronic diseases characterized by recurrent episodes of intestinal inflammation. There are two main subtypes: Crohn’s disease (CD) and ulcerative colitis (UC). In recent years, it was proposed to better describe the continuum within IBD by dividing the classification more precisely [1,2]. Herewith, special attention is paid to CD patients with isolated colonic disease location [3,4,5] as well as patients where no classification is possible [6,7]. For the latter patient group, the term IBD-unclassified (IBD-U) is established. In different cohorts, 6–13% of IBD patients are labeled as IBD-U [8,9]. Clinically, there is evidence that a finer distinction of IBD subtypes may be important for prognosis and management strategies [5,9]. However, it is still not clear whether IBD-U is a distinct disease entity of IBD or a milder and earlier stage of CD or UC. Cleynen et al. developed a CD versus UC genetic risk score, which placed colonic CD as well as colonic IBD-U between ileal CD and UC [1]. On the other hand, antibody testing placed colonic CD closer to CD and IBD-U closer to UC [5,7].
Previously, we have shown that antibody-based panel diagnostics was superior to single antibody testing in distinguishing between CD and UC in pediatric IBD patients [10]. In this study, we performed antibody diagnostics in a cohort of adult IBD patients with known CD and UC as well as on IBD-U patients. We validated the classification power of antibodies employing modern modeling including supervised and unsupervised machine learning. Furthermore, we tested different modeling approaches in IBD-U patients without and with reclassification during follow-up.

2. Materials and Methods

2.1. Study Population and Design

We enrolled 176 IBD patients (50 CD, 50 UC, and 76 IBD-U patients) of the Swiss IBD Cohort Study (SIBDCS). The SIBDCS prospectively follows IBD patients with yearly-standardized follow-ups, which combine clinical data collection and bio-sampling [11,12]. Collected clinical data include sex as well as age at diagnosis, enrollment, follow-up, and serum sampling. The diagnosis of IBD was based according to international standards on a combination of clinical, biochemical, stool, endoscopic, and histological examinations [13]. The Montreal classification was used for clinical phenotyping including IBD-U, and the UC nomenclature was used for disease location of IBD-U patients [14,15]. IBD-U patients had no definitive histological or other evidence, which was in favor of either CD or UC. Patients with isolated colonic disease were subordinated to the CD group. A subset of 20 IBD patients with a definite diagnosis at the last follow-up was reviewed by an independent gastroenterologist (F.B.) for the appropriateness of the classification or reclassification during follow-up using available endoscopic, histological, and radiological reports. He was blinded to the diagnoses documented in the SIBDCS database.
All sera included in the analyses were sampled after inclusion into the study. In the subgroup of IBD-U patients who were re-classified during follow up, serum sampling always took place before reclassification. At time of serum sampling, all patients had already received treatment at the discretion of their physician.
The Swiss IBD cohort study has been approved by the local ethics committee of each participating center (institutional review board No. EK-1316, approved on 5 February 2007 and KEK of Canton of Zurich, 2018–02068 on 9 March 2020). Patients gave written informed consent for inclusion in the SIBDCS. The study was conducted in accordance with the Helsinki declaration.

2.2. Analysis of Antibodies

All sera were analyzed in a blinded fashion without knowledge of patient diagnosis or other clinical information. We determined the antibodies as previously described [10]. All tests were performed according to the manufacturer’s instructions and were carried out in our diagnostic routine laboratory. Commercial tests and normal reference values as used in routine diagnostic were applied: ASCA IgA and IgG < 7 U/mL (EliA, Thermo Fisher Diagnostics (Reinach, Switzerland)); PR3-ANCA < 5.0 U/mL, MPO-ANCA < 6.0 U/mL (both CLIA QuantaFlash, INOVA Dx (San Diego, CA, USA)); indirect immunofluorescence titers of xANCA, cANCA, and pANCA (EtOH fixed neutrophil granulocytes, INOVA Dx); as well as ANA < 1:80 (HEp-2; INOVA Dx). All atypical pANCA reactions (defined as pANCA reaction on EtOH fixed neutrophil granulocytes without MPO-ANCA) were set to negative, if the ANA titer was higher than the ANCA titer, as the pANCA reaction was disturbed by ANA.

2.3. Statistical Analysis

All analyses were performed using SAS software, version 9.3 (SAS Institute, Cary, NC, USA) or R, version 4.2.2 (The R Foundation, Vienna, Austria). A p-value < 0.05 was considered statistically significant for all analyses.
Univariate analysis was performed using Wilcoxon rank score test for continuous variables and Chi square or Fisher’s exact test for categorical variables. Sensitivity and specificity were calculated for the antibody panels in order to predict the disease subtype.
A quasi-exhaustive logistic regression approach on a predefined set of antibody data was applied to find the best discriminative model for CD and UC. Based on the Akaike Information Criterion (AIC, Supplementary Figure S1), an efficient branch-and-bound algorithm was used for the exhaustive search for the best subsets of the variables [16]. The dichotomous diagnosis class (CD vs. UC) was used as dependent and the antibodies as independent variables (R v.4.2.2), and the model space was assessed using ‘leaps’-based (package ‘leaps_3.0′) wrapper functions. ROC curves were drawn with the pROC package (v. 1.9.1), and the computation of optimal cutoffs was done with Youden’s J statistic (package pROC).
Using Bayesian Model Averaging (BMA) for binomial logistic regression, we extracted the variables with the highest predictive importance for the distinction between UC and CD according to their inclusion probability into the generated models. In this analysis, patients who were reclassified from IBD-U to UC or CD by the end of the follow-up period were also included. Bayesian Model Averaging was conducted with the BMA package.
After investigating prediction of IBD patients for two classes, we also conducted a multinomial logistic regression with a three-class outcome, UC, CD, and IBD-U. Additionally, an XGBoost algorithm was trained for the three-class disease prediction to explore potential added value of more advanced machine learning techniques over logistic regression. For both three-class prediction models, the data were randomly split into training and test subsets in a 66:34 ratio, to ensure adequate representation of all three classes in both the training and the test subsets derived from our relatively small overall data set. Both models were trained with 5-fold internal cross-validation and created with the caret package.
To scrutinize whether a three-class model of IBD (CD vs. UC vs. IBD-U) is supported by the underlying data, we conducted k-means clustering, an unsupervised learning technique, in which n observations are partitioned into a pre-specified number of k clusters according to the nearest cluster mean. In concordance to the three-class hypothesis, we conducted k-means clustering with k = 3. We also sought to determine the number of clusters suggested by the data using the “elbow method” (based on within-cluster-sum of squared errors) as well as by the “silhouette method” (based on the silhouette coefficient). Additionally, general agglomerative hierarchical clustering was done applying Ward’s minimum variance method. Here the objective function, depending on which clusters are merged, is the error sum of squares. A corresponding heatmap of the clustering was created.
K-means clustering and ward hierarchical clustering were performed using the stats package, and pheatmap was used for heatmap creation.

3. Results

3.1. Antibody Status and Panel Diagnostic in Adult CD and UC Patients

An overview of clinical data and single antibody results of CD and UC patients are provided in Supplementary Tables S1 and S2.
ASCA IgA and IgG antibodies were highly specific for CD (p < 0.001 and p < 0.001, respectively) whereas xANCA and PR3-ANCA were specific for UC (p < 0.001 and p = 0.006, respectively). The positivity of cANCA and (atypical) pANCA did not discriminate between the CD and UC. None of the analyzed serum samples was positive for MPO-ANCA.
We first applied our previously described antibody panel [10] (named panel BeLu) for its classification power in adult IBD patients. The panel BeLu correctly assigned 37/50 CD patients (74%) and 32/50 UC patients (64%), respectively. This adds up to 69/100 (69%) correct assignments with an AUC of 78% (95%CI, 69–87%) (Figure 1, red line).
Next, we used all determined antibodies as the input for computing an optimized panel for adult patients (named panel SIBDCS). The resulting model selected only the positivity status of PR3-ANCA, xANCA, ASCA IgA, and ASCA IgG. Intercept and the factors of the single antibody results are shown in Table 1. Using this panel, the correct assignment rate increased in comparison to the panel BeLu to 77/100 (77%) with an AUC of 85% (95%CI, 78–92%) (Figure 1, blue line). Three different cut-offs, 0.61, 0.62, and 0.65, were calculated to be optimal for the distinction between CD and UC depending on the priority which subtype should be predicted. By adapting the predictor cut-off from 0.61 to 0.62 and 0.65, the specificity for the classification of CD increased from 88% (95%CI, 78–96%) to 92% (95%CI, 84–98%) and 94% (95%CI, 88–100%), respectively. However, this adaptation lowered the sensitivity for CD from 66% (95%CI, 52–78%) to 62% (95%CI, 48–76%) and 60% (95%CI, 46–74%), respectively.

3.2. Antibody Status and Panel Diagnostics in IBD-U Patients

Next, we evaluated the predictive capacity of antibodies and derived panels for the classification of IBD-U patients. We determined the same antibodies as above in 76 IBD-U patients of the SIBDCS (Table 2 and Table 3). Furthermore, 46/76 (60.5%) of IBD-U patients were negative for PR3-ANCA as well as ASCA. In general, the antibody status of IBD-U patients was closely related to UC with a high number of xANCA (46/76 (60.5%) vs. 25/50 (50%) UC patients, respectively) and PR3-ANCA (20/76 (26.3%) vs. 12/50 (24%), respectively) positivity as well as a low number and low titer of ASCA positivity (14/76 (18.4) vs. 34/50 (68%) CD patients). Independently of the applied panel—BeLu or SIBDSC—the panel diagnostic favored UC classification in most IBD-U patients as well (Figure 2a,b).
During follow-up, 26/76 (34.2%) of the IBD-U patients were reclassified. Of these 26 patients, 8 reclassified to CD (30.8%) and 18 to UC (69.2%). Furthermore, 50 IBD-U patients were not reclassified during follow-up time of mean 6.5 y (IQR 3.9–12.1 y).
The reclassification to CD was associated with ASCA-IgA positivity in 2/8 patients (25.0%), and the reclassification to UC was associated with xANCA positivity in 11/18 (61.1%) and PR3-ANCA positivity in 7/18 (38.9%) patients.
The panel BeLu correctly assigned 19/26 (73.1%) of the reclassified IBD-U patients: 5/8 (62.5%) of those who reclassified to CD and 14/18 (77.8%) of those who reclassified to UC (Figure 2a). The panel SIBDCS correctly predicted the reclassified subtype in 18/26 (69.2%) IBD-U patients when using the cut-off of 0.65 (sensitivity 60% and specificity 94%) (Figure 2b). The panel assigned correctly 1/8 (12.5%) who reclassified to CD and 17/18 (94.4%) who reclassified to UC. When applying the lower cut-offs of 0.62 or 0.61, the accuracy for CD remained at 1/8 patients, but the correct prediction rate for UC dropped to 15/18 patients. For reclassification of the IBD-U patients, both panels performed with similar accuracy.

3.3. Supervised and Unsupervised Machine Learning for Bi- and Multiclass Prediction Models (CD vs. UC and CD vs. UC vs. IBD-U)

After developing the logistic regression models, BeLu and SIBDCS, we investigated in different exploratory analyses whether modern machine learning techniques offer added value. As an extension of the above-presented logistic regression-based models, we conducted BMA, as model averaging procedures provide better predictive performance in the presence of model uncertainty [17]. In the analysis, more than 60 models were generated, showing an overall accuracy of 74.6% (sensitivity: 88.2%, specificity: 58.6%) for the distinction between UC and CD when calculated from the whole dataset without holdouts. The parameters “xANCA (0/1)”, “PR3-ANCA (0/1)”, and “any ASCA (0/1)” were most frequently included in models and could therefore be assumed to hold the most predictive power (Figure 3).
After exploring various models with the binary outcome CD vs. UC, we were also interested in multinomial approaches classifying CD vs. UC vs. IBD-U, following the hypothesis that IBD-U might be its own class, distinct from CD and UC. Both three-class multinomial logistic regression as well as XGBoost prediction algorithm did not offer added classification performance advantages compared to the two class exhaustive logistic regression models presented above. Multinomial logistic regression yielded an overall accuracy of 65% for the training subset with sensitivities for the three classes ranging from 39–77% and specificities of 72–88%. On the testing subset, the performance dropped to 48% (sensitivities: 39–63%, specificities: 63–79%). Similarly, XGBoost algorithm yielded an out-of-fold prediction accuracy of 76% (sensitivities: 58–87%, specificities: 82–92%) and a test accuracy of 70% (sensitivities: 52–63%, specificities: 74–81%).
The three-cluster k-means clustering showed a low overall accuracy of 41%, exhibiting specificities between 57% and 90% and sensitivity from 0% to 64% for the three classes CD, UC, and IBD-U. The sensitivity for IBD-U was 0%. This difficulty of separating IBD-U cases is demonstrated in Figure 4, where the visualization of the clustering showed that CD cases separate rather distinctly, while UC and IBD-U cases exhibit a large overlap. Visualization of the optimal number of clusters through the “elbow method” (based on Within-Cluster-Sum of Squared Errors) as well as through the “silhouette method” (based on the silhouette coefficient) indicates that the true number of clusters is more likely to be two than three (Supplementary Figure S2A,B). Ward hierarchical clustering and the corresponding heatmap also indicated poor clustering based on a three-class diagnosis-based model (Figure 5A,B).

4. Discussion

Our study was set up to test the predictive value of antibody diagnostics for classification in IBD. Employing our previously established panel BeLu [10], we reached an AUC 77% in an adult cohort for correct classification of CD and UC. We could even improve the AUC to 85% with an optimized panel SIBDCS, which only used the positivity or negativity of four widely available antibodies (PR3-ANCA, xANCA, ASCA IgA and ASCA IgG).
We did not find a specific antibody profile to reliably classify IBD-U patients. Our results of supervised machine learning point in the direction that antibody testing is valuable for binomial distinction in CD and UC but lacks capability for finer tuning. Furthermore, unsupervised machine learning suggested that the true underlying number of clusters generatable by antibody testing is more likely to be two than three. Following this, it can be deduced that, indeed, IBD-U is from an antibody point of view not a distinct disease entity of IBD.
In our cohort, 26/76 (34.2%) IBD-U patients were reclassified during follow-up: 8/26 (30.8%) to CD and 18/26 (69.2%) to UC. These rates are in the same range as described in previous cohorts studied with reclassification rates between 21 and 54%. Of the reclassified patients, 28–83% were reclassified to CD and 17–71% to UC [2,7,9,18]. The wide range of the reclassification rates may be explained by selection bias of the cohorts including age of diagnosis or length of follow-up. For example, Rinawi et al. [18] reported a median follow-up at reclassification to CD of 9.4 years, whereas most other studies had a much shorter median follow-up time. The unbalanced distribution between CD and UC reclassification with a preferable reclassification to UC has to be seen as a reflection of natural history but also may argue in favor of IBD-U as an early stage of IBD [7,9].
There are two previous studies investigating the predictive value of antibody profiles for reclassification [19,20]. In their studies, Joossens et al. and Birimberg-Schwartz et al. assessed ASCA, pANCA (whereas pANCA also included atypical pANCA and xANCA), and their combinations. They found that pANCA negativity in combination with ASCA negativity was most prevalent in IBD-U and associated with the least probability of reclassification during follow-up. In our study, we found that PR3-ANCA negativity in combination with ASCA negativity was the most prevalent antibody combination in IBD-U patients independent of whether they later were reclassified or not. These findings may therefore reflect a true IBD-U characteristic. However, they could also be conditioned by biased physicians, who unconsciously add antibody results in their classification algorithms.
Fitting with our previous observation that PR3-ANCA positivity is predictive of UC in pediatric IBD patients [10], PR3-ANCA positivity was associated with reclassification to UC. In addition, our Bayesian model averaging identified PR3-ANCA to be used in most models for classification of CD vs. UC. In the same line, we could show for panel diagnostics a low sensitivity for reclassification to CD and a better performance for correct prediction of reclassification to UC in IBD-U patients.
Reflecting all results, antibody testing supports the distinction into two subtypes of IBD. One argument to focus on finer distinction of subtypes was, for example, the introduction of biological treatments with a benefit of early anti-TNF alpha treatment in CD patients [21]. However, with newer biologicals, which are efficiently independent of the IBD subtype, this argument may again fade into the background. Indeed, clinical factors (e.g., age or frailty of the patient, comorbidities, genetic background) as well as predictive molecular markers specific to the therapeutic agent will become more important for the choice of treatment [22].
We acknowledge the small sample size as the main limitations of our study. This prohibited deeper analysis of isolated colonic CD patients, which we included in the CD group. A strength is the inclusion of all IBD-U patients from the SIBDCS, which has a good representation of IBD in Switzerland. The use of well-established antibody markers, which can be determined in most immunology routine laboratories, ameliorates the utility of antibody diagnostics. However, a general challenge in antibody testing is the analytical method. For example, Mahler et al. showed for PR3-ANCA a positive rate of 88/283 UC samples with chemiluminescence and of 17/283 with ELISA [23].

5. Conclusions

Our study shows that antibody diagnostics are helpful for CD vs. UC classification but have limited potential in finer subtyping of IBD or predicting reclassification of IBD-U patients. New modeling techniques based on antibodies support the classification in two main IBD subtypes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics13152491/s1,. Table S1: Clinical characteristics of adult CD and UC patients; Table S2: antibody status of CD vs. UC patients; Figure S1: Akaike Information Criterion (AIC) profile plot. The horizontal line delineates models that are less than 2 AIC units away from the best model (on the very left). In total, 376 model combinations were assessed. Figure S2A,B: Visualization of the number of clusters suggested by the data. Both plots indicate that the true underlying number of clusters is more likely to be two than three. The optimal number of clusters found with this method is nine-though this is clinically not applicable.

Author Contributions

The whole study was conceived and designed by C.S. and M.P.H.; statistical analyses were performed by A.B.L., A.P.d.M. and M.P.H.; A.B.L. and A.P.d.M. developed the machine learning models; C.S., A.B.L., A.P.d.M., P.J. and M.P.H. were involved in the interpretation of the results and discussion; C.S. and M.P.H. drafted the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Swiss IBD cohort study [project no. 177523].

Institutional Review Board Statement

The Swiss IBD cohort study has been approved by the local ethics committee of each participating center (institutional review board No. EK-1316, approved on 5 February 2007 and KEK of Canton of Zurich, 2018–02068 on 9 March 2020). The study was conducted in accordance with the Helsinki declaration.

Informed Consent Statement

Written informed consent has been obtained while the patients have been included in the Swiss IBD Cohort Study.

Data Availability Statement

The data underlying this article can be shared on reasonable request to the corresponding author.

Acknowledgments

Thanks to all contributors and patients of the SIBDCS, F. Bravo for verifying the diagnoses, as well as M. Hurni, J. Schlatter, T. Momot, and O. Schaerer for performing the antibody routine tests.

Conflicts of Interest

MPH received speaker’s honoraria from Phadia ThermoFisher Scientific and RUWAG Handels AG (Distributor of INOVA Reagents). PJ received speaker’s honoraria and consulting fees from AbbVie, Arena, Amgen, BMS, Ferring, Gilead, Janssen, Elli Lilly, MSD, Pfizer, Pierre Fabre, Roche, Sandoz, Takeda, Tillots, UCB Pharma, and Vifor Pharma.

Abbreviations

ASCA: anti-Saccharomyces cerevisiae antibodies; cANCA, cytoplasmic anti-neutrophil cytoplasmic antibodies; CD, Crohn’s disease; IBD, inflammatory bowel disease; MPO-ANCA, myeloperoxidase-specific anti-neutrophil cytoplasmic antibodies; pANCA, perinuclear anti-neutrophil cytoplasmic antibodies; PR3-ANCA, proteinase-3-specific anti-neutrophil cytoplasmic antibodies; UC, ulcerative colitis; xANCA, atypical anti-neutrophil cytoplasmic antibodies.

References

  1. Cleynen, I.; Boucher, G.; Jostins, L.; Schumm, L.P.; Zeissig, S.; Ahmad, T.; Andersen, V.; Andrews, J.M.; Annese, V.; Brand, S.; et al. Inherited determinants of Crohn’s disease and ulcerative colitis phenotypes: A genetic association study. Lancet 2016, 387, 156–167. [Google Scholar] [CrossRef] [Green Version]
  2. Birimberg-Schwartz, L.; Zucker, D.M.; Akriv, A.; Cucchiara, S.; Cameron, F.L.; Wilson, D.C.; Lazowska, I.; Yianni, L.; Paul, S.P.; Romano, C.; et al. Development and Validation of Diagnostic Criteria for IBD Subtypes Including IBD-unclassified in Children: A Multicentre Study From the Pediatric IBD Porto Group of ESPGHAN. J. Crohn’s Colitis 2017, 11, 1078–1084. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Hedrick, T.L.; Friel, C.M. Colonic crohn disease. Clin. Colon Rectal Surg. 2013, 26, 84–89. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Subramanian, S.; Ekbom, A.; Rhodes, J.M. Recent advances in clinical practice: A systematic review of isolated colonic Crohn’s disease: The third IBD? Gut 2017, 66, 362–381. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Berger, T.D.; Lee, H.M.; Padmanaban, L.R.; Wine, E.; Yerushalmy-Feler, A.; Hojsak, I.; Kazeka, D.; Serban, D.E.; Yogev, D.; Ledder, O.; et al. Clinical Features and Outcomes of Paediatric Patients With Isolated Colonic Crohn Disease. J. Pediatr. Gastroenterol. Nutr. 2022, 74, 258–266. [Google Scholar] [CrossRef] [PubMed]
  6. Everhov, A.H.; Sachs, M.C.; Malmborg, P.; Nordenvall, C.; Myrelid, P.; Khalili, H.; Elmberg, M.; Ekbom, A.; Askling, J.; Jakobsson, G.; et al. Changes in inflammatory bowel disease subtype during follow-up and over time in 44,302 patients. Scand. J. Gastroenterol. 2019, 54, 55–63. [Google Scholar] [CrossRef] [Green Version]
  7. Chandradevan, R.; Hofmekler, T.; Mondal, K.; Harun, N.; Venkateswaran, S.; Somineni, H.K.; Ballengee, C.R.; Kim, M.O.; Griffiths, A.; Noe, J.D.; et al. Evolution of Pediatric Inflammatory Bowel Disease Unclassified (IBD-U): Incorporated With Serological and Gene Expression Profiles. Inflamm. Bowel Dis. 2018, 24, 2285–2290. [Google Scholar] [CrossRef]
  8. Prenzel, F.; Uhlig, H.H. Frequency of indeterminate colitis in children and adults with IBD—A metaanalysis. J. Crohn’s Colitis 2009, 3, 277–281. [Google Scholar] [CrossRef]
  9. Burisch, J.; Zammit, S.C.; Ellul, P.; Turcan, S.; Duricova, D.; Bortlik, M.; Andersen, K.W.; Andersen, V.; Kaimakliotis, I.P.; Fumery, M.; et al. Disease course of inflammatory bowel disease unclassified in a European population-based inception cohort: An Epi-IBD study. J. Gastroenterol. Hepatol. 2019, 34, 996–1003. [Google Scholar] [CrossRef]
  10. Horn, M.P.; Peter, A.M.; Righini Grunder, F.; Leichtle, A.B.; Spalinger, J.; Schibli, S.; Sokollik, C. PR3-ANCA and panel diagnostics in pediatric inflammatory bowel disease to distinguish ulcerative colitis from Crohn’s disease. PLoS ONE 2018, 13, e0208974. [Google Scholar] [CrossRef] [Green Version]
  11. Pittet, V.; Juillerat, P.; Mottet, C.; Felley, C.; Ballabeni, P.; Burnand, B.; Michetti, P.; Vader, J.P.; Swiss IBD Cohort Study Group. Cohort profile: The Swiss Inflammatory Bowel Disease Cohort Study (SIBDCS). Int. J. Epidemiol. 2009, 38, 922–931. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Pittet, V.; Michetti, P.; Mueller, C.; Braegger, C.P.; von Kanel, R.; Schoepfer, A.; Macpherson, A.J.; Rogler, G. Cohort Profile Update: The Swiss Inflammatory Bowel Disease Cohort Study (SIBDCS). Int. J. Epidemiol. 2019, 48, 385–386f. [Google Scholar] [CrossRef] [PubMed]
  13. Gomollon, F.; Dignass, A.; Annese, V.; Tilg, H.; Van Assche, G.; Lindsay, J.O.; Peyrin-Biroulet, L.; Cullen, G.J.; Daperno, M.; Kucharzik, T.; et al. 3rd European Evidence-based Consensus on the Diagnosis and Management of Crohn’s Disease 2016: Part 1: Diagnosis and Medical Management. J. Crohn’s Colitis 2017, 11, 3–25. [Google Scholar] [CrossRef] [Green Version]
  14. Silverberg, M.S.; Satsangi, J.; Ahmad, T.; Arnott, I.D.; Bernstein, C.N.; Brant, S.R.; Caprilli, R.; Colombel, J.F.; Gasche, C.; Geboes, K.; et al. Toward an integrated clinical, molecular and serological classification of inflammatory bowel disease: Report of a Working Party of the 2005 Montreal World Congress of Gastroenterology. Can. J. Gastroenterol. 2005, 19 (Suppl. A), 5A–36A. [Google Scholar] [CrossRef]
  15. Satsangi, J.; Silverberg, M.S.; Vermeire, S.; Colombel, J.F. The Montreal classification of inflammatory bowel disease: Controversies, consensus, and implications. Gut 2006, 55, 749–753. [Google Scholar] [CrossRef] [Green Version]
  16. Miller, A.J. Selection of Subsets of Regression Variables. J. R. Stat. Soc. 1984, 147, 36. [Google Scholar] [CrossRef]
  17. Raftery, A.E.; Madigan, D.; Hoeting, J.A. Bayesian Model Averaging for Linear Regression Models. J. Am. Stat. Assoc. 1997, 92, 179–191. [Google Scholar] [CrossRef]
  18. Rinawi, F.; Assa, A.; Eliakim, R.; Mozer-Glassberg, Y.; Nachmias Friedler, V.; Niv, Y.; Rosenbach, Y.; Silbermintz, A.; Zevit, N.; Shamir, R. The natural history of pediatric-onset IBD-unclassified and prediction of Crohn’s disease reclassification: A 27-year study. Scand. J. Gastroenterol. 2017, 52, 558–563. [Google Scholar] [CrossRef]
  19. Joossens, S.; Reinisch, W.; Vermeire, S.; Sendid, B.; Poulain, D.; Peeters, M.; Geboes, K.; Bossuyt, X.; Vandewalle, P.; Oberhuber, G.; et al. The value of serologic markers in indeterminate colitis: A prospective follow-up study. Gastroenterology 2002, 122, 1242–1247. [Google Scholar] [CrossRef] [Green Version]
  20. Birimberg-Schwartz, L.; Wilson, D.C.; Kolho, K.L.; Karolewska-Bochenek, K.; Afzal, N.A.; Spray, C.; Romano, C.; Lionetti, P.; Hauer, A.C.; Martinez-Vinson, C.; et al. pANCA and ASCA in Children with IBD-Unclassified, Crohn’s Colitis, and Ulcerative Colitis-A Longitudinal Report from the IBD Porto Group of ESPGHAN. Inflamm. Bowel Dis. 2016, 22, 1908–1914. [Google Scholar] [CrossRef] [Green Version]
  21. Hamdeh, S.; Aziz, M.; Altayar, O.; Olyaee, M.; Murad, M.H.; Hanauer, S.B. Early vs Late Use of Anti-TNFa Therapy in Adult Patients With Crohn Disease: A Systematic Review and Meta-Analysis. Inflamm. Bowel Dis. 2020, 26, 1808–1818. [Google Scholar] [CrossRef] [PubMed]
  22. Verstockt, B.; Salas, A.; Sands, B.E.; Abraham, C.; Leibovitzh, H.; Neurath, M.F.; Vande Casteele, N.; Alimentiv Translational Research Consortium (ATRC). IL-12 and IL-23 pathway inhibition in inflammatory bowel disease. Nat. Rev. Gastroenterol. Hepatol. 2023, 20, 433–446. [Google Scholar] [CrossRef] [PubMed]
  23. Mahler, M.; Bogdanos, D.P.; Pavlidis, P.; Fritzler, M.J.; Csernok, E.; Damoiseaux, J.; Bentow, C.; Shums, Z.; Forbes, A.; Norman, G.L. PR3-ANCA: A promising biomarker for ulcerative colitis with extensive disease. Clin. Chim. Acta 2013, 424, 267–273. [Google Scholar] [CrossRef] [PubMed]
Figure 1. ROC curve analysis of the antibody panels BeLu and SIBDCS in patients diagnosed with Crohn’s disease (CD) and ulcerative colitis (UC). Red line with red shaded area, ROC curve of panel BeLu +/− 95%CI; blue line with blue shaded area, ROC curve of panel SIBDCS +/− 95% CI; cross hairs, optimized cut-offs distinguishing CD from UC.
Figure 1. ROC curve analysis of the antibody panels BeLu and SIBDCS in patients diagnosed with Crohn’s disease (CD) and ulcerative colitis (UC). Red line with red shaded area, ROC curve of panel BeLu +/− 95%CI; blue line with blue shaded area, ROC curve of panel SIBDCS +/− 95% CI; cross hairs, optimized cut-offs distinguishing CD from UC.
Diagnostics 13 02491 g001
Figure 2. Antibody panels predicting reclassification of IBD-U patients during follow-up. (a) Performance of the BeLu panel. Dashed line indicates previously published cut-off 0.63 [10]. (b) Performance of the SIBDCS panel. Dashed line indicates optimized cut-off of 0.65 with a sensitivity of 60% and a specificity of 94% for the classification of CD. Each symbol represents the calculated predictor value of a patient. Lines and whiskers indicate mean and 95% CI of the different patient groups.
Figure 2. Antibody panels predicting reclassification of IBD-U patients during follow-up. (a) Performance of the BeLu panel. Dashed line indicates previously published cut-off 0.63 [10]. (b) Performance of the SIBDCS panel. Dashed line indicates optimized cut-off of 0.65 with a sensitivity of 60% and a specificity of 94% for the classification of CD. Each symbol represents the calculated predictor value of a patient. Lines and whiskers indicate mean and 95% CI of the different patient groups.
Diagnostics 13 02491 g002
Figure 3. Overview of Bayesian model averaging models. Over 60 were generated and ordered by decreasing importance from left to right on the x-axis. Red and blue bars indicate inclusion and sign of the variable into a certain model. Variables with large continuous bars can be read to be included in many models and are therefore very likely to hold high predictive value. BMA: Bayesian model averaging.
Figure 3. Overview of Bayesian model averaging models. Over 60 were generated and ordered by decreasing importance from left to right on the x-axis. Red and blue bars indicate inclusion and sign of the variable into a certain model. Variables with large continuous bars can be read to be included in many models and are therefore very likely to hold high predictive value. BMA: Bayesian model averaging.
Diagnostics 13 02491 g003
Figure 4. K-means clustering with k = 3 pre-specified groups. The data points each represent one patient. Point shapes indicate the assigned cluster one to three; the colors indicate the clinical diagnosis of CD, UC, or IBD-U, respectively. Shaded areas indicate the Euclidean distance from the group center.
Figure 4. K-means clustering with k = 3 pre-specified groups. The data points each represent one patient. Point shapes indicate the assigned cluster one to three; the colors indicate the clinical diagnosis of CD, UC, or IBD-U, respectively. Shaded areas indicate the Euclidean distance from the group center.
Diagnostics 13 02491 g004
Figure 5. Ward hierarchical clustering and corresponding heatmap. (A) Green rectangles are drawn around clusters in the dendrogram as found by the ward hierarchical clustering algorithm. CD, UC, and IBD-U labels indicate the true disease class membership. The colors of the class labels correspond to the classes as found by the k-means clustering. Blue = cluster 1, magenta = cluster 2, orange = cluster 3. (B) The dendrogram on the x-axis, including the rectangles around clusters, above the heatmap corresponds to figure (A). The dendrogram on the y-axis represents the grouping of predictor variables. Values are scaled per parameter.
Figure 5. Ward hierarchical clustering and corresponding heatmap. (A) Green rectangles are drawn around clusters in the dendrogram as found by the ward hierarchical clustering algorithm. CD, UC, and IBD-U labels indicate the true disease class membership. The colors of the class labels correspond to the classes as found by the k-means clustering. Blue = cluster 1, magenta = cluster 2, orange = cluster 3. (B) The dendrogram on the x-axis, including the rectangles around clusters, above the heatmap corresponds to figure (A). The dendrogram on the y-axis represents the grouping of predictor variables. Values are scaled per parameter.
Diagnostics 13 02491 g005
Table 1. Panel description.
Table 1. Panel description.
AnalyteBeLu *SIBDCS
Intercept0.59370.425704
PR-3 ANCA positivity−0.4085−0.22856
xANCA positivity−0.328−0.25171
pANCA positivity−0.6299n.a.
ASCA IgG, Titer0.0052n.a.
ASCA IgG positivityn.a.0.277831
ASCA IgA positivityn.a.0.301052
* BeLu (Bern Lucerne) panel taken from [10]; n.a. not applied; SIBDCS Swiss IBD cohort study.
Table 2. Characteristics of IBD-U patients.
Table 2. Characteristics of IBD-U patients.
All IBD-UIBD-U w/o ReclassificationIBD-U w/Reclassification
number of patients765026
males, n (%) 38 (50)24 (48)14 (53.8)
age at diagnostic, median (IQR), y 20 (12–31)23 (13–37)17.5 (11–28)
age at serum sampling,
median (IQR), y
23.5 (14–40)28 (14–46)17.5 (12–30)
disease duration at serum sampling,
median (IQR), y
2 (1–5)3 (1–6.5)1 (0–5)
disease duration at last follow-up, median (IQR), y6.5 (4–12)7 (4–11)6 (4–16)
disease duration at reclassification, median (IQR), yn.a.n.a.3.5 (3–9)
need of surgery, n (%)6 (7.9)3 (6.0)3 (11.5)
ever treated with biologicals, n (%)43 (56.6)27 (54)16 (61.5)
Disease location at diagnosis and last follow-up, n (%)diagnosisfollow-updiagnosisfollow-updiagnosisfollow-up
E1: proctitis3 (3.9)4 (5.3)2 (2.6)1 (1.3)1 (1.3)3 (3.9)
E2: left-sided colitis15 (19.7)18 (23.7)11 (14.5)13 (17.1)4 (5.3)5 (6.6)
E3: extensive (pancolitis) 41 (53.9)41 (53.9)24 (31.6)30 (39.5)17 (22.4)11 (14.5)
unknown17 (22.4)6 (7.9)13 (17.1)6 (7.9)4 (5.3)0 (0)
L1: ilealn.a.1 (1.3)n.a.n.a.n.a.1 (1.3)
L2: colonicn.a.4 (5.3)n.a.n.a.n.a.4 (5.3)
L3: ileo-colonicn.a.0 (0)n.a.n.a.n.a.0 (0)
L4: upper GI diseasen.a.0 (0)n.a.n.a.n.a.0 (0)
no endoscopyn.a.2 (2.6)0 (0)0 (0)0 (0)2 (2.6)
IQR: interquartile range. y: years. n.a.: not applicable.
Table 3. Antibody status of IBD-U patients.
Table 3. Antibody status of IBD-U patients.
All IBD-UIBD-U w/o ReclassificationIBD-U w/Re-
Classification *
IBD-U → CD *IBD-U → UC *
number of patients765026818
ANCA positive
cANCA, n (%)5 (6.6)4 (8.0)1 (3.8)0 (0)1 (5.6)
(atypical) pANCA, n (%)3 (3.9)3 (6.0)0 (0)0 (0)0 (0)
xANCA, n (%)46 (60.5)32 (64.0)14 (53.8)3 (37.5)11 (60.5)
PR3-ANCA, n (%)20 (26.3)11 (22.0)9 (34.6)2 (25.0)7 (38.9)
PR3-ANCA U/mL, Median, IQR1.3 (0–5.2)0.9 (0.4–3.1)1.9 (0.8–6.9)1.2 (0.5–6.2)2.8 (1.0–7.0)
MPO-ANCA, n (%)1 (1.3)1 (2.0)0 (0)0 (0)0 (0)
MPO-ANCA U/mL, Median00000
ASCA positive
IgA, n (%)13 (17.1)8 (16.0)5 (19.2)2 (25.0)3 (16.7)
IgA U/mL, Median, IQR2.0 (1–3.9)2.3 (1–4)1.7 (0–3.9)2.5 (1.4–6.7)1.6 (0–2.5)
IgG, n (%)6 (7.9)4 (7.9)2 (7.7)0 (0)2 (11.1)
IgG U/mL, Median, IQR1.4 (0.6–3.1)1.5 (0.6–3.0)1.1 (0.6–3.4)1.3 (0.6–2.9)0.8 (0.6–3.5)
Antibody combinations (n (%))
xANCA neg, PR3-ANCA neg, all ASCA neg21 (27.6)13 (26.0)8 (30.8)4 (50.0)4 (22.2)
xANCA pos, PR3-ANCA pos15 (19.7)9 (18.0)6 (23.1)2 (25.0)4 (22.2)
xANCA neg, all ASCA neg25 (32.9)15 (30.0)10 (38.5)4 (50.0)6 (33.3)
xANCA neg, any ASCA pos5 (6.6)3 (6.0)2 (7.7)1 (12.5)1 (5.6)
xANCA pos, all ASCA neg37 (48.7)26 (52.0)11 (42.3)2 (25.0)9 (50.0)
xANCA pos, any ASCA pos9 (11.8)6 (12.0)3 (11.5)1 (12.5)2 (11.1)
PR3-ANCA neg, all ASCA neg46 (60.5)31 (62.0)15 (57.7)5 (62.5)10 (55.6)
PR3-ANCA neg, any ASCA pos10 (13.2)8 (16.0)2 (7.7)1 (12.5)1 (5.6)
PR3-ANCA pos, all ASCA neg16 (21.1)10 (20.0)6 (23.1)1 (12.5)5 (27.8)
PR3-ANCA pos, any ASCA pos4 (5.3)1 (2.0)3 (11.5)1 (12.5)2 (11.1)
* neither any of the tested antibody nor any antibody combination significantly distinguished reclassified patients from IBD-U patients w/o reclassification.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sokollik, C.; Pahud de Mortanges, A.; Leichtle, A.B.; Juillerat, P.; Horn, M.P., on behalf of the Swiss IBD Cohort Study Group. Machine Learning in Antibody Diagnostics for Inflammatory Bowel Disease Subtype Classification. Diagnostics 2023, 13, 2491. https://doi.org/10.3390/diagnostics13152491

AMA Style

Sokollik C, Pahud de Mortanges A, Leichtle AB, Juillerat P, Horn MP on behalf of the Swiss IBD Cohort Study Group. Machine Learning in Antibody Diagnostics for Inflammatory Bowel Disease Subtype Classification. Diagnostics. 2023; 13(15):2491. https://doi.org/10.3390/diagnostics13152491

Chicago/Turabian Style

Sokollik, Christiane, Aurélie Pahud de Mortanges, Alexander B. Leichtle, Pascal Juillerat, and Michael P. Horn on behalf of the Swiss IBD Cohort Study Group. 2023. "Machine Learning in Antibody Diagnostics for Inflammatory Bowel Disease Subtype Classification" Diagnostics 13, no. 15: 2491. https://doi.org/10.3390/diagnostics13152491

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop