Next Article in Journal
Epidemiology and Prevention of Healthcare-Associated Infections in Geriatric Patients: A Narrative Review
Previous Article in Journal
Cognitive Neuroscience Methods in Enhancing Health Literacy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Risk Prediction of Barrett’s Esophagus in a Taiwanese Health Examination Center Based on Regression Models

1
Department of Emergency Medicine, Kaohsiung Veterans General Hospital, Kaohsiung 813, Taiwan
2
Department of Electrical Engineering, I-Shou University, Kaohsiung 840, Taiwan
3
Division of Gastroenterology and Hepatology, Department of Internal Medicine, Kaohsiung Veterans General Hospital, Kaohsiung 813, Taiwan
4
Health Management Center, Kaohsiung Veterans General Hospital, 386, Ta-Chung 1st Road, Kaohsiung 813, Taiwan
5
Institute of Health Care Management, Department of Business Management, National Sun Yat-sen University, Kaohsiung 804, Taiwan
6
Department of Nursing, Meiho University, Pingtung 912, Taiwan
7
Department of Information Engineering, I-Shou University, Kaohsiung 840, Taiwan
8
Department of Emergency Medicine, Taichung Veterans General Hospital Chiayi Branch, Chia-Yi 600, Taiwan
9
Department of Chemical Engineering and Institute of Biotechnology and Chemical Engineering, I-Shou University, Kaohsiung 840, Taiwan
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(10), 5332; https://doi.org/10.3390/ijerph18105332
Submission received: 9 April 2021 / Revised: 13 May 2021 / Accepted: 14 May 2021 / Published: 17 May 2021
(This article belongs to the Section Public Health Statistics and Risk Assessment)

Abstract

:
Determining the target population for the screening of Barrett’s esophagus (BE), a precancerous condition of esophageal adenocarcinoma, remains a challenge in Asia. The aim of our study was to develop risk prediction models for BE using logistic regression (LR) and artificial neural network (ANN) methods. Their predictive performances were compared. We retrospectively analyzed 9646 adults aged ≥20 years undergoing upper gastrointestinal endoscopy at a health examinations center in Taiwan. Evaluated by using 10-fold cross-validation, both models exhibited good discriminative power, with comparable area under curve (AUC) for the LR and ANN models (Both AUC were 0.702). Our risk prediction models for BE were developed from individuals with or without clinical indications of upper gastrointestinal endoscopy. The models have the potential to serve as a practical tool for identifying high-risk individuals of BE among the general population for endoscopic screening.

1. Introduction

Esophageal adenocarcinoma (EAC) has a poor prognosis, with a 5-year survival rate of <15% [1]. Over the last three decades, the incidence of EAC has risen sharply in many countries around the world [2]. In the United States, the incidence of EAC increased from 0.40 cases per 100,000 individuals in 1975 to 2.58 cases per 100,000 individuals in 2009, a more than 6-fold increase [3]. Upward trends in the incidence of EAC have also been observed in Asian countries [4,5]. Barrett’s esophagus (BE) is a well-documented precancerous condition of EAC [1]. Although the prevalence of BE has been lower in Asian populations than in Western populations, the rate has increased from 0.8% in 1991–1999 to 2.2% in 2010–2014 [6]. A study reported that BE is probably under diagnosed in most parts of Asia [7], which may affect the prognosis of EAC. A recent meta-analysis demonstrated that the development of endoscopic therapy for BE and early-stage EAC, targeted BE screening, and endoscopic surveillance of patients with BE could provide better survival outcomes [8].
For now, the diagnostic tools for BE are upper gastrointestinal (UGI) endoscopy and histological confirmation [9,10]. Due to their invasiveness and cost, however, the methods’ usefulness in BE screening is limited. Identifying target populations for BE screening therefore remains a challenge. The current BE screening guidelines recommend that patients with gastroesophageal reflux disease (GERD) with several risk factors including the male sex, an age >50 years, central obesity, and a history of smoking should be considered for screening with UGI endoscopy [9,10]. However, these guidelines do not provide quantitative data to stratify the risk of BE while combining multiple risk factors. A number of risk prediction models for other diseases are widely employed to help clinicians make individualized medical decisions for their patients [11,12,13,14]; however, most of the published BE prediction models were constructed for Western populations and have yet to be verified for Asian populations [11,15,16,17,18,19,20], a relevant issue given that BE presents different patterns for Asian and Western populations [21].
Logistic regression (LR) and machine learning are two common methods for constructing risk prediction models [12,22]. LR is a classical statistical method for building prediction models, while artificial neural networks (ANNs) are state-of-the-art learning machines in the machine learning area for modeling a complex system, such as modeling, voice recognition, and image classification [23]. An ANN consists of multi-layered processing units called neurons that resemble the structure and behavior of biological neurons. The class of ANNs has the universal approximation property in the sense that they can approximate any reasonable function to any desired degree of accuracy. Traditional ANNs are referred to as “shallow” because usually only one hidden layer is employed. Recent studies have employed numerous hidden layers with different variations, which are known as deep neural networks. Srinivas et al. compared the performance between LR and ANN models for BE in patients with GERD [15]. The authors demonstrated that the ANN model was superior to the LR model, with a slightly larger area under the curve (AUC). Nevertheless, no further studies were conducted on the BE prediction models using both LR and ANN methods, especially in Asian populations.
We therefore conducted this retrospective cross-sectional study by analyzing a database of information collected during physical examinations at a health examination center in southern Taiwan. The study population underwent UGI endoscopy as part of their health examination with or without clinical indications. The aim of this study was to develop LR and ANN risk prediction models for BE in an Asian population and to comprehensively compare the predictive performance of the LR and ANN models.

2. Materials and Methods

2.1. Study Population and Data Collection

A total of 11,879 adults aged ≥20 years with or without gastrointestinal symptoms underwent UGI endoscopy during physical examinations at the Kaohsiung Veterans General Hospital, Taiwan, between 1 January 2016 and 31 December 2018. Their demographic data, history of comorbidities (including hypertension and diabetes mellitus), history of smoking (number of packs, frequency and duration), alcohol intake (number, frequency and alcohol percentage per week), exercise habits (frequency and duration per week), and GERD symptoms (e.g., heart burn, regurgitation and dysphagia) in the past three months were recorded as medical records during the pre-endoscopic examination interview. Their weight, height, and waist circumferences (WC) were measured routinely by trained examiners during health examination at our center and also recorded as medical records.
Six experienced physicians performed the endoscopic examinations at our hospital. In accordance with the American College of Gastroenterology clinical guidelines, we diagnosed BE if salmon-colored mucosa was observed extending 1 cm or more above the gastroesophageal junction, with histological confirmation of intestinal metaplasia [10]. Biopsies were obtained from four quadrants (2 cm apart) of the circumferential part of the endoscopically suspected esophageal metaplasia (ESEM). For mucosal tongues of non-circumferential ESEM, the sites and biopsies were evaluated under narrowband imaging and then decided upon by the physicians.
Of these 11,879 adults, 2233 were excluded because one had a history of esophageal cancer and 2232 had missing data in the dataset. Therefore, 9646 adults were included. The sources of the dataset were from the medical records of our hospital and the data were anonymous in the dataset. The study was approved by the Ethics Committee of the Kaohsiung Veterans General Hospital (VGHKS21-CT3-12). Participants’ consents were not required in the study because this is a retrospective study, and all data were analyzed in anonymity.

2.2. Model Development

We analyzed twelve candidate variables: sex, age, GERD symptoms (yes/no), a history of hypertension (yes/no), a history of diabetes mellitus (yes/no), WC, body height, body weight, BMI, history of smoking (non-smoker; ≤20 pack-years; >20 pack-years), alcohol intake (none; light-moderate drinking; heavy drinking), and regular exercise (≥3 sessions/week and ≥30 min/session).
To select the variables, we employed statsmodels (version 0.9.0, [24] in Python to construct a generalized linear model from the twelve variables (setting link function = logistic and family = binomial in statsmodels) [24] and checked each variable to determine its statistical significance (p < 0.05). The most significant variables will then be employed to construct the LR and ANN models for comparisons. Given the different measurement scales between the categorical and continuous variables, we employed the z-score formula to standardize all continuous variables including age, height, weight, BMI, and WC. With x as one of the input variables, the standardization process for x is performed through Equation (1):
x ^ = x μ x σ x
where μx is the mean value and σx is the standard deviation (SD) of the variable x. Note that in the future predicting phase, numerical type variables need to be standardized before applying the model.
The LR models were built with the Python package scikit-learn (version 0.20.3) [25] and validated by stratified 10-fold cross-validation. To construct the ANN models, we employed the Python package Keras (version 2.2.4) [26]. The architecture of our proposed ANN model, presenting four hidden layers with LeakyReLU or ReLU activation functions, is shown in Figure 1. To prevent overfitting, we employed the dropout regularization mechanism [27]. Dropout is a technique which drops out the neuron connection randomly. We also adapted the batch normalization by re-centering and re-scaling the layers’ inputs for each mini batch to reduce the covariate shift between the layers in the neural network. Furthermore, the residual connection, which injects the outputs of earlier layers directly into the inputs of latter layers, is used to decrease the likelihood of representational bottleneck and vanishing gradient [28,29].
We employed AUC as a metric to measure model performance and validated the LR and ANN models using stratified 10-fold cross-validation. We also compared the sensitivity, specificity, and accuracy under different threshold settings of the LR and ANN models.

3. Results

3.1. Characteristics of the Subjects and Variables Selection

The simulation programs were coded using the Python programming language (version 3.5.4) running on Microsoft Windows 10 on an Intel Core i5 CPU with 8 GB of RAM. Table 1 shows the baseline characteristics of the 9646 participants, which included 5020 men and 4384 women (mean age, 50.41 years old; SD, 11.75 years), 242 of whom (2.51%) had BE. We first constructed the generalized linear model using statsmodels and analyzed the twelve variables one by one to check their statistical significance. The corresponding p-values are shown in Table 2. We included the four statistically significant variables (p < 0.05) in the successive simulations of the LR and ANN models: age (odds ratio [OR], 1.03; 95% confidence interval [CI] 1.01–1.04; p < 0.001), gender (OR, 1.80; 95% CI 1.15–2.82; p = 0.01), GERD symptoms (OR, 2.14; 95% CI 1.63–2.83; p< 0.001) and smoking (OR, 1.44; 95% CI 1.20–1.72; p< 0.001).

3.2. LR and ANN Models Development and Their Predictive Performance Comparisons

To better assess the trained models, we employed a stratified 10-fold cross-validation. Table 3 shows the mean performance of the stratified 10-fold cross-validation for the two models in the sampling threshold settings. With a threshold setting of 90% sensitivity, the specificity of the LR and ANN models was 31% and 20%, respectively. At 90% specificity, the sensitivity of the LR and ANN models was 30% and 28%, respectively. The point (0,1) is the best cutoff point for the perfect model (AUC = 1) on the receiver operating characteristic (ROC) curve. We therefore assumed a “closest to (0,1)” criterion to find the closest point to the top left corner (0,1) on the ROC curve, which serves as a method for finding the optimal cutoff value [30]. The values for the performance of the closest point to point (0,1) for the LR and ANN models, which are similar, are shown in Table 3.
We computed the AUC and SD for the AUCs across folds for the LR and ANN models. Figure 2 shows the mean ROC curves (a green line) for the LR (Figure 2a) and ANN (Figure 2b) models. The gray areas represent the performance within two SD around the mean ROC. The ROC curves for the two models are very similar, with almost the same AUC (LR model: AUC, 0.702; SD, 0.040 vs. ANN model: AUC, 0.702; SD, 0.035).

3.3. Final Mean LR Model

We obtained the coefficients, which included the intercept of the final mean LR model, by taking the mean of the relevant coefficients across the ten folds in the cross-validation (Table 4). Assume the variables xj, j = 1,2,3,4,5 have been standardized using Equation (1) and that y denotes the estimated probability of BE, the final mean LR model through cross-validation was described using the following Equation (2):
logit y = ln y 1 y = 0.893 + 0.356 x 1 + 0.697 x 2 + 0.716 x 3 + 0.293 x 4 + 0.826 x 5
where
  • x1 is the age
  • x2 is 1 if the sex is male, otherwise 0
  • x3 is 1 if the patient has presented GERD symptoms in the past 3 months, otherwise 0
  • x4 is 1 if the patient’s cumulative smoking exposure is >0 but ≤20 pack-years, otherwise 0
  • x5 is 1 if the patient’s cumulative smoking exposure is >20 pack-years, otherwise 0
We provided the sampling threshold settings using the final mean LR model in Table 4. Based on a threshold setting of 90% sensitivity and of 90% specificity, a cutoff point was 0.33 and 0.67, respectively. Based on the “closest to (0,1)” criteria, a cutoff point was 0.52. Users can choose a suitable cutoff point according to their clinical considerations. In addition, the code of the ANN model is provided in the Supplementary Materials (see Supplementary Code S1).

4. Discussion

This is a large-scale study for building a risk prediction model for BE. Age, sex, GERD symptoms, and smoking were significantly associated with BE and employed to construct models. In the case of different ethnicity with different pattern of BE, our LR model had comparable performance with those of previously published models [11,15,16,17,18,19]. To our knowledge, only one published model used ANNs. Compared with that, the study had larger sample sizes and our ANN model had better performance [15]. In addition, different from previous studies, the participants in our study were not only from physicians’ referral [17,18,20] and those with subjective symptoms [15], but also from individuals without symptoms. Rubenstein et al. developed a prediction model from participants without a referral for a clinical indication; however, the study population was limited to older males (50–79 years) [11]. Our prediction models constructed by those who aged over 20 years and were not restricted by clinical indications of UGI endoscopy have the potential to be applied to the general population.
ANNs can capture nonlinear relationships between inputs (or variables) and outputs (or responses) which is considered as a powerful method of for model development. When constructing the ANNs employed in this study, we used dropout regularization techniques to avoid overfitting the training data, batch normalization to reduce the covariate shift between network layers, and residual connection to decrease the likelihood of representational bottleneck and vanishing gradient [28,29]. These methods are state-of-the-art techniques employed in deep learning and were adopted for our proposed neural network model. However, the performance of the ANN model, which was constructed by using four hidden layers, was not superior to that of the LR model in the study. The low number of significant variables for the predictive model could be a contributing factor for the result because the strength of the neural network for nonlinear mapping among these variables is limited [31].
The four most significant variables (age, sex, presence of GERD symptoms and a history of smoking), which were included as predictors in the construction of the LR and ANN models, are compatible to previous studies as risk factors [9,10]. In clinical practice, information on these four variables can be easily obtained through interviews, and the prediction process can be started with only a few variables. The stratified 10-fold cross-validation was performed in our study to minimize the variability of the models’ performance. This study demonstrated the performance characteristics of the final mean LR model for various cutoff points (Table 4). A sensitivity and specificity close to 90% having differing clinical objectives. High sensitivity is better suited to personal physical examinations to lower the proportion of undetected cases. Given the low prevalence of BE in Asian populations, a high specificity may be taken as a reference from the public health perspective. Thrift et al. indicated that, given the low risk of progression to cancer for patients with BE, the higher threshold setting for the cutoff point may be considered [16].
As a practical example for the final mean LR model, consider a 55-year-old man with GERD symptoms in the past three months but no history of smoking. After standardizing the numerical predictors in the data preprocessing step, the age variable should be adjusted to 55–50.4 ([grand mean (age)])/11.8 [SD (age)]. The formula to apply would therefore be the following (Equation (3)):
BE   probability         = 1 1 + e 0.893 + 0.356 ×   x 1 + 0.697 ×   x 2 + 0.716 ×   x 3 + 0.293 ×   x 4 + 0.826 ×   x 5 = 0.66
where
  • x 1 is the adjusted age = 55 50.4   grand   mean   age 11.8   standard   deviation   age = 0.39
  • x 2 is 1, because the patient is male.
  • x 3   is 1, because of the GERD symptoms in the past 3 months.
  • x 4 is 0, because the cumulative smoking exposure is 0.
  • x 5 is 0, because the cumulative smoking exposure is 0.
The resulting BE probability is therefore 0.66. According to the cutoff points listed in Table 4, 0.66 is greater than 0.33, which is the cutoff point for 90% sensitivity. Endoscopic BE screening can be considered if we set cutoff point at 90% sensitivity from the personal health examination standpoint. However, the BE probability is <0.67, which is the cutoff point for 90% specificity. Endoscopic BE screening may not be suggested from the public health perspective if we set cutoff point at 90% specificity.
Our study had some limitations. First, the study enrolled participants from the center for out-of-pocket physical examinations, which may enroll individuals with a higher socioeconomic status. However, the role played by socioeconomic status in the risk of BE has not been established [9,10]. Further studies about association between socioeconomic status and BE may be needed. Second, a number of potential factors (e.g., hiatal hernia and H. pylori infection) were not included in this analysis [32,33], which might have improved the model. However, this information would limit the study population to those who underwent UGI endoscopy prior to the study and complicate the data collection from interviews. Third, although the stratified 10-fold cross-validation decreases the overfitting problem and provides low variability for the prediction results, external validation is still needed to confirm the model’s generalizability.

5. Conclusions

Both LR and ANN models exhibit good discriminative power. The ANN model did not exhibit superior performance to the LR model in the study. LR may be a preferable method for model development while low number variables employed because of its comparable performance with ANNs and interpretability in the clinical settings. However, careful interpretation of the association between independent and dependent variables is crucial due to the concerns of collapsibility and exchangeability for LR. Further studies are needed to validate the models and to evaluate the cost benefit of BE screening in different clinical settings.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/ijerph18105332/s1, Code S1: The code for ANN model is provided in the file. Data S2: The dataset used in this study is provided in the file.

Author Contributions

Conceptualization: P.-H.L. and P.-C.W.; Methodology: J.-G.H. and J.-H.J.; Software: P.-H.L. and C.-H.C.; Validation: J.-G.H., H.-C.Y. and J.-H.J.; Formal Analysis, P.-H.L.; Investigation: P.-C.W.; Data Curation: C.-L.H.; Writing—Original Draft Preparation: P.-H.L. and P.-C.W.; Writing—Review & Editing: J.-G.H., H.-C.Y. and J.-H.J.; Supervision, J.-G.H. and H.-C.Y.; Project Administration: P.-H.L.; Funding Acquisition: C.-L.H. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by research grants from Kaohsiung Veterans General Hospital (KSVGH110-008).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board of the Kaohsiung Veterans General Hospital (VGHKS21-CT3-12).

Informed Consent Statement

Patient consent was waived because this is a retrospective study, and all data were analyzed in anonymity.

Data Availability Statement

Data is contained within the Supplementary Materials. The data presented in this study are available in Supplementary Data S2.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Di Pietro, M.; Chan, D.; Fitzgerald, R.C.; Wang, K.K. Screening for Barrett’s Esophagus. Gastroenterology 2015, 148, 912–923. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Edgren, G.; Adami, H.O.; Weiderpass, E.; Nyrén, O. A global assessment of the oesophageal adenocarcinoma epidemic. Gut 2013, 62, 1406–1414. [Google Scholar] [CrossRef] [PubMed]
  3. Hur, C.; Miller, M.; Kong, C.Y.; Dowling, E.C.; Nattinger, K.J.; Dunn, M.; Feuer, E.J. Trends in esophageal adenocarcinoma incidence and mortality. Cancer 2013, 119, 1149–1158. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Fan, Y.J.; Song, X.; Li, J.L.; Li, X.M.; Liu, B.; Wang, R.; Fan, Z.M.; Wang, L.D. Esophageal and gastric cardia cancers on 4238 Chinese patients residing in municipal and rural regions: A histopathological comparison during 24-year period. World J. Surg. 2008, 32, 1980–1988. [Google Scholar] [CrossRef] [PubMed]
  5. Shibata, A.; Matsuda, T.; Ajiki, W.; Sobueet, T. Trend in incidence of adenocarcinoma of the esophagus in Japan, 1993–2001. Jpn. J. Clin. Oncol. 2008, 38, 464–468. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Shiota, S.; Singh, S.; Anshasi, A.; El-Serag, H.B. Prevalence of Barrett’s Esophagus in Asian Countries: A Systematic Review and Meta-analysis. Clin. Gastroenterol. Hepatol. 2015, 13, 1907–1918. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Ho, K.Y.; Asia-Pacific Barrett’s, C. Is Barrett’s esophagus an over-hyped disease in the West, and an underdiagnosed disease in the East? Dig. Endosc. 2013, 25 (Suppl. 2), 157–161. [Google Scholar] [CrossRef] [Green Version]
  8. Codipilly, D.C.; Chandar, A.K.; Singh, S.; Wani, S.; Shaheen, N.J.; Inadomi, J.M.; Chak, A.; Iyer, P.G. The Effect of Endoscopic Surveillance in Patients With Barrett’s Esophagus: A Systematic Review and Meta-analysis. Gastroenterology 2018, 154, 2068–2086.e5. [Google Scholar] [CrossRef] [Green Version]
  9. Fitzgerald, R.C.; Di Pietro, M.; Ragunath, K.; Ang, Y.; Kang, J.Y.; Watson, P.; Trudgill, N.; Patel, P.; Kaye, P.V.; Sanders, S.; et al. British Society of Gastroenterology guidelines on the diagnosis and management of Barrett’s oesophagus. Gut 2014, 63, 7–42. [Google Scholar] [CrossRef] [Green Version]
  10. Shaheen, N.J.; Falk, G.W.; Iyer, P.G.; Gerson, L.B.; American College of Gastroenterology. ACG Clinical Guideline: Diagnosis and Management of Barrett’s Esophagus. Am. J. Gastroenterol. 2016, 111, 30–50. [Google Scholar] [CrossRef]
  11. Rubenstein, J.H.; Morgenstern, H.; Appelman, H.; Scheiman, J.; Schoenfeld, P.; McMahon, L.F., Jr.; Metko, V.; Near, E.; Kellenberg, J.; Kalish, T.; et al. Prediction of Barrett’s esophagus among men. Am. J. Gastroenterol. 2013, 108, 353–362. [Google Scholar] [CrossRef] [Green Version]
  12. Tang, Z.H.; Liu, J.; Zeng, F.; Li, Z.; Yu, X.; Zhou, L. Comparison of prediction model for cardiovascular autonomic dysfunction using artificial neural network and logistic regression analysis. PLoS ONE 2013, 8, e70571. [Google Scholar] [CrossRef] [Green Version]
  13. Wong, G.L.; Ma, A.J.; Deng, H.; Ching, J.Y.; Wong, V.W.; Tse, Y.K. Machine learning model to predict recurrent ulcer bleeding in patients with history of idiopathic gastroduodenal ulcer bleeding. Aliment Pharmacol. Ther. 2019, 49, 912–918. [Google Scholar] [CrossRef]
  14. Hsieh, M.H.; Sun, L.-M.; Lin, C.-L.; Hsieh, M.J.; Hsu, C.Y.; Kao, C.H. Development of a prediction model for pancreatic cancer in patients with type 2 diabetes using logistic regression and artificial neural network models. Cancer Manag. Res. 2018, 10, 6317–6324. [Google Scholar] [CrossRef] [Green Version]
  15. Gaddam, S.; Rastogi, A.; Gupta, N.; Wani, S.B.; Bansal, A.; Singh, M.; Singh, V.; Reddymasu, S.; Moloney, B.; Sharma, P. Prediction of Barrett’s Esophagus (BE) in Patients With Gastroesophageal Reflux Disease (GERD) Using Logistic Regression Model (LRM) and Artificial Neural Network (ANN). Gastrointest. Endosc. 2010, 71, AB245. [Google Scholar] [CrossRef]
  16. Thrift, A.P.; Kendall, B.J.; Pandeya, N.; Vaughan, T.L.; Whiteman, D.C. Study of Digestive Health. A clinical risk prediction model for Barrett esophagus. Cancer Prev. Res. 2012, 5, 1115–1123. [Google Scholar] [CrossRef] [Green Version]
  17. Liu, X.; Wong, A.; Kadri, S.R.; Corovic, A.; O’Donovan, M.; Lao-Sirieix, P.; Lovat, L.B.; Burnham, R.W.; Fitzgerald, R.C. Gastro-esophageal reflux disease symptoms and demographic factors as a pre-screening tool for Barrett’s esophagus. PLoS ONE 2014, 9, e94163. [Google Scholar] [CrossRef] [Green Version]
  18. Ireland, C.J.; Fielder, A.L.; Thompson, S.K.; Laws, T.A.; Watson, D.I.; Esterman, A. Development of a risk prediction model for Barrett’s esophagus in an Australian population. Dis. Esophagus. 2017, 30, 1–8. [Google Scholar] [CrossRef]
  19. Baldwin-Hunter, B.L.; Knotts, R.M.; Leeds, S.D.; Rubenstein, J.H.; Lightdale, C.J.; Abrams, J.A. Use of the Electronic Health Record to Target Patients for Non-endoscopic Barrett’s Esophagus Screening. Dig. Dis. Sci. 2019, 64, 3463–3470. [Google Scholar] [CrossRef]
  20. Rosenfeld, A.; Graham, D.G.; Jevons, S.; Ariza, J.; Hagan, D.; Wilson, A.; Lovat, S.J.; Sami, S.S.; Ahmad, O.F.; Novelli, M.; et al. Development and validation of a risk prediction model to diagnose Barrett’s oesophagus (MARK-BE): A case-control machine learning approach. Lancet Digit. Health 2020, 2, E37–E48. [Google Scholar] [CrossRef] [Green Version]
  21. Lee, H.S.; Jeon, S.W. Barrett esophagus in Asia: Same disease with different pattern. Clin. Endosc. 2014, 47, 15–22. [Google Scholar] [CrossRef] [PubMed]
  22. Christodoulou, E.; Ma, J.; Collins, G.S.; Steyerberg, E.W.; Verbakel, J.Y.; Calster, B.V. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J. Clin. Epidemiol. 2019, 110, 12–22. [Google Scholar] [CrossRef] [PubMed]
  23. Abiodun, O.I.; Jantan, A.; Omolara, A.E.; Dada, K.V.; Mohamed, N.A.; Arshad, H. State-of-the-art in artificial neural network applications: A survey. Heliyon 2018, 4, e00938. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Seabold, S.P.J. Statsmodels Econometric and Statistical Modeling with Python. In Proceedings of the 9th Python in Science Conference, Austin, TX, USA, 28 June–3 July; 2010; Volume 57, p. 61. [Google Scholar]
  25. Pedregosa, F.; Varoquaux, G.; Gramfort, A. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  26. Chollet, F. “Keras”. 2015. Available online: https://github.com/fchollet/keras (accessed on 16 May 2021).
  27. Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout:A Simple Way to Prevent Neural Networks from Overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
  28. Wu, S.; Li, G.; Deng, L.; Liu, L.; Xie, Y.; Shi, L. L1-Norm Batch Normalization for Efficient Training of Deep Neural Networks. IEEE Trans. Neural Netw. Learn Syst. 2019, 30, 2043–2051. [Google Scholar] [CrossRef] [Green Version]
  29. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  30. Perkins Neil, J.; Schisterman Enrique, F. The Inconsistency of “Optimal” Cut-points Using Two ROC Based Criteria. Am. J. Epidemiol. 2006, 163, 670–675. [Google Scholar]
  31. Ayer, T.C.J.; Alagoz, O.; Kahn, C.E., Jr.; Woods, R.W.; Burnside, E.S. Informatics in radiology: Comparison of logistic regression and artificial neural network models in breast cancer risk estimation. Radiographics 2010, 30, 13–22. [Google Scholar] [CrossRef] [Green Version]
  32. Cameron, A.J. Barrett’s esophagus: Prevalence and size of hiatal hernia. Am. J. Gastroenterol. 1999, 94, 2054–2059. [Google Scholar] [CrossRef]
  33. Fischbach, L.A.; Graham, D.Y.; Kramer, J.R.; Rugge, M.; Verstovsek, G.; Parente, P.; Alsarraj, A.; Fitzgerald, S.; Shaib, Y.; Abraham, N.S.; et al. Association between Helicobacterpylori and Barrett’s esophagus: A case-control study. Am. J. Gastroenterol. 2014, 109, 357–368. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The architecture of the proposed artificial neural network model.
Figure 1. The architecture of the proposed artificial neural network model.
Ijerph 18 05332 g001
Figure 2. The receiver operating characteristic (ROC) curves for the logistic regression (LR) and artificial neural network (ANN) models. The green lines show the mean ROC curves, and the gray areas represent the performance within two SD around the mean ROC. (a) The ROC curve of LR model (AUC = 0.702, SD = 0.040); (b) The ROC curve of ANN model. (AUC = 0.702, SD = 0.035). AUC: area under cure; SD: standard deviation.
Figure 2. The receiver operating characteristic (ROC) curves for the logistic regression (LR) and artificial neural network (ANN) models. The green lines show the mean ROC curves, and the gray areas represent the performance within two SD around the mean ROC. (a) The ROC curve of LR model (AUC = 0.702, SD = 0.040); (b) The ROC curve of ANN model. (AUC = 0.702, SD = 0.035). AUC: area under cure; SD: standard deviation.
Ijerph 18 05332 g002
Table 1. Characteristics of the subjects with Barrett’s esophagus and the controls.
Table 1. Characteristics of the subjects with Barrett’s esophagus and the controls.
VariablesBE (−)
N = 9404
BE (+)
N = 242
Grand Mean
[SD]
Mean age [SD] (years)50.3 [11.7]54.7 [11.4]50.4 [11.8]
Gender
Male5020 (53.4%)184 (76.0%)
Female4384 (46.6%)58 (24.0%)
Height [SD] (cm)166.0 [8.5]168.1 [8.1]166.1 [8.5]
Weight [SD] (kg)65.9 [13.2]70.8 [12.0]66.0 [13.1]
BMI [SD] (kg/m2)23.7 [3.6]24.9 [3.1]23.8 [3.6]
Waist circumference [SD] (cm)83.8 [9.8]87.6 [8.7]83.9 [9.8]
Hypertension1597 (17.0%)67 (27.7%)
Diabetes mellitus662 (7.0%)31 (12.8%)
GERD symptoms1605 (17.1%)81 (33.5%)
Alcohol intake
No3992 (42.5%)86 (35.5%)
Not heavy drinking 4973 (52.9%)142 (58.7%)
Heavy drinking 439 (4.7%)14 (5.8%)
Smoking
Non-smoker6711 (71.4%)125 (51.7%)
≤20 pack-years1790 (19.0%)61 (25.2%)
>20 pack-years903 (9.6%)56 (23.1%)
Having Exercise habits (≥3 times/week and ≥30 mins/time)2675 (28.4%)77 (31.8%)
BE: Barrett’s esophagus; BMI: Body mass index; GERD: gastroesophageal reflux disease; SD: standard deviation, Heavy drinking was defined as 8 or more drinks a week for women and 15 or more drinks a week for men.
Table 2. Variables associated with Barrett’ esophagus according to multivariate analysis using generalized linear models.
Table 2. Variables associated with Barrett’ esophagus according to multivariate analysis using generalized linear models.
VariablesOdds Ratio95%CIp Value
Age1.031.01–1.04<0.001 *
Gender (male)1.801.15–2.820.01 *
Height (cm)0.990.95–1.030.63
Weight (kg)1.010.97–1.060.49
BMI (kg/m2)0.990.87–1.130.91
Waist circumference (cm)1.010.98–1.040.66
Hypertension1.110.80–1.520.54
Diabetes mellitus1.190.79–1.780.41
GERD symptoms2.141.63–2.83<0.001 *
Alcohol intake0.920.73–1.170.52
Smoking1.441.20–1.72<0.001 *
Having exercise habits0.970.73–1.300.86
BMI: Body mass index; CI: confidence interval; GERD: gastroesophageal reflux disease. * Variables with p < 0.05 were considered to enter into a prediction model.
Table 3. Mean performances of striated 10-fold cross-validation for the logistic regression and artificial neural network models using the sampling threshold settings.
Table 3. Mean performances of striated 10-fold cross-validation for the logistic regression and artificial neural network models using the sampling threshold settings.
Prevalence = 2.51%LR ModelANN Model
Threshold SettingSensitivitySpecificityAccuracySensitivitySpecificityAccuracy
Sensitivity~90%0.900.310.320.900.200.22
Specificity~90%0.300.900.880.280.900.88
The Closest to (0,1) Criteria0.650.680.680.630.650.65
ANN: artificial neural network; LR: logistic regression.
Table 4. Coefficients of the final mean logistic regression model and its performance using the sampling threshold settings.
Table 4. Coefficients of the final mean logistic regression model and its performance using the sampling threshold settings.
Coefficients and Adjusted ORPerformances of Whole Data Input in Final Mean LR Model
VariablesAdjusted ORThreshold SettingCutoff PointSensitivitySpecificityAccuracy
Age [SD]1.43Specificity~90%0.670.300.900.88
Gender(male) [SD]2.01Specificity~80%0.580.460.800.80
GERD [SD]2.05Sensitivity~90%0.330.900.320.33
Smoking Sensitivity~80%0.460.800.460.40
Non smokers1The Closest to (0,1) Criteria0.520.650.690.70
≤20 pack-years [SD]1.34
>20 pack-years [SD]2.28
Intercept
LR: logistic regression; OR: odds ratio; SD: standard deviation; GERD: gastroesophageal reflux disease.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lin, P.-H.; Hsieh, J.-G.; Yu, H.-C.; Jeng, J.-H.; Hsu, C.-L.; Chen, C.-H.; Wu, P.-C. Risk Prediction of Barrett’s Esophagus in a Taiwanese Health Examination Center Based on Regression Models. Int. J. Environ. Res. Public Health 2021, 18, 5332. https://doi.org/10.3390/ijerph18105332

AMA Style

Lin P-H, Hsieh J-G, Yu H-C, Jeng J-H, Hsu C-L, Chen C-H, Wu P-C. Risk Prediction of Barrett’s Esophagus in a Taiwanese Health Examination Center Based on Regression Models. International Journal of Environmental Research and Public Health. 2021; 18(10):5332. https://doi.org/10.3390/ijerph18105332

Chicago/Turabian Style

Lin, Po-Hsiang, Jer-Guang Hsieh, Hsien-Chung Yu, Jyh-Horng Jeng, Chiao-Lin Hsu, Chien-Hua Chen, and Pin-Chieh Wu. 2021. "Risk Prediction of Barrett’s Esophagus in a Taiwanese Health Examination Center Based on Regression Models" International Journal of Environmental Research and Public Health 18, no. 10: 5332. https://doi.org/10.3390/ijerph18105332

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop