Application of Machine Learning for Predicting Anastomotic Leakage in Patients with Gastric Adenocarcinoma Who Received Total or Proximal Gastrectomy

Shao, Shengli; Liu, Lu; Zhao, Yufeng; Mu, Lei; Lu, Qiyi; Qin, Jichao

doi:10.3390/jpm11080748

by Shengli Shao ¹,², Lu Liu ¹,², Yufeng Zhao ¹,³, Lei Mu ¹,², Qiyi Lu ¹,² and Jichao Qin ¹,²,*

¹ Department of Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China

² Molecular Medicine Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China

³ Department of Vascular Surgery, First Hospital of Lanzhou University, Lanzhou University, Lanzhou 730030, China

* Author to whom correspondence should be addressed.

J. Pers. Med. 2021, 11(8), 748; https://doi.org/10.3390/jpm11080748

Submission received: 25 June 2021 / Revised: 26 July 2021 / Accepted: 27 July 2021 / Published: 29 July 2021

(This article belongs to the Special Issue Application of Artificial Intelligence in Personalized Medicine)

Abstract

Anastomotic leakage is a life-threatening complication in patients with gastric adenocarcinoma who received total or proximal gastrectomy, and there is still no model that accurately predicts it. In this study, we aimed to develop a high-performance machine learning tool to predict anastomotic leakage in patients with gastric adenocarcinoma who received total or proximal gastrectomy. A total of 1660 patients with gastric adenocarcinoma who received total or proximal gastrectomy in a large academic hospital from 1 January 2010 to 31 December 2019 were investigated, and these patients were randomly divided into training and testing sets at a ratio of 8:2. Four machine learning models (logistic regression, random forest, support vector machine, and XGBoost) were employed, and 24 clinical preoperative and intraoperative variables were included to develop the predictive model. Regarding the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy, random forest had a favorable performance with an AUC of 0.89, a sensitivity of 81.8%, and a specificity of 82.2% in the testing set. Moreover, we built a web app based on the random forest model to achieve real-time predictions for guiding surgeons’ intraoperative decision making.

Keywords: artificial intelligence; machine learning; anastomotic leakage; gastric adenocarcinoma; total gastrectomy; proximal gastrectomy

1. Introduction

Gastric adenocarcinoma is the most common malignancy in the upper gastrointestinal tract, and total and proximal gastrectomy are the two main surgical procedures to remove gastric adenocarcinoma in the proximal two-thirds of the stomach [1]. However, both procedures carry serious complications, the most serious being anastomotic leakage (AL). The incidence of AL in esophagogastrostomy or esophagojejunostomy varies from 1.7% to 15% [2,3,4]; AL is associated not only with 0% to 50% perioperative mortality but also with poor overall survival [4]. Early detection of AL is critical because delayed treatment is associated with higher morbidity and mortality. Identifying patients at high risk of AL is important for guiding the surgeons’ decision making, such as performing a more rigorous anastomotic operation and placing a jejunal feeding tube. Because the incidence of AL is low, it is difficult to evaluate the risk of AL for an individual patient. Although knowledge about AL is ever-increasing and some studies have attempted to analyze risk factors to build prediction tools, there is still no reported model that accurately predicts AL in patients with gastric adenocarcinoma who received total or proximal gastrectomy [5,6,7].

Artificial intelligence has recently shown great potential in various medical fields [8,9]. Machine learning, a subset of artificial intelligence, outperforms other technologies in developing predictive models [10,11]. Machine learning “learns” from data without explicit programming, which means that the performance of a specific task improves with experience (i.e., more data and variables). Recently, machine learning has achieved encouraging results in diagnostic methods; for example, the accuracy of the Gastrointestinal Artificial Intelligence Diagnostic System in detecting upper gastrointestinal cancer was more than 91.7% [12]. Deep learning models successfully classified microsatellite instability in gastrointestinal cancer [13,14]. In addition, Kawakami et al. developed a model for preoperative diagnostic and prognostic prediction of epithelial ovarian cancer based on peripheral blood biomarkers through machine learning [15]. Although many previous studies have demonstrated the advantages of artificial intelligence in classifying diseases, there are still no models for predicting AL in patients with gastric adenocarcinoma who received total or proximal gastrectomy. In this study, we aimed to develop a diagnostic system using preoperative and intraoperative variables through machine learning algorithms to predict AL in patients with gastric adenocarcinoma who received total or proximal gastrectomy.

2. Materials and Methods

2.1. Patients and Variables

Data from 1915 consecutive patients diagnosed with gastric adenocarcinoma who received total or proximal gastrectomy from 1 January 2010 to 31 December 2019 in the Department of Gastrointestinal Surgery, Tongji Hospital, Huazhong University of Science and Technology, were collected. The following 24 variables were included: gender, age, body mass index (BMI), American Society of Anesthesiologists classification score (ASA), previous abdominal surgical history, hypertension, diabetes, Brinkman index (the number of cigarettes smoked per day multiplied by the number of years of smoking), alcohol use, tumorous obstruction, total or proximal gastrectomy, esophagogastrostomy or esophagojejunostomy, combined resection of other organs, type of surgery, operative time, intraoperative blood loss, neoadjuvant chemotherapy or radiotherapy, intraperitoneal chemotherapy, drainage tube, nasogastric tube, preoperative albumin and hemoglobin levels, maximum tumor diameter, and clinical stages. Senior surgeons performed all procedures, and the D2 procedure was adopted as the standard surgical technique. In order to develop the machine learning model, patients with the following factors were excluded: acute complications of the adenocarcinoma such as perforation or bleeding (n = 58), palliative excision (R1 or R2, n = 52), and missing data (n = 145). Finally, 1660 patients were chosen for the study; among them, 525 patients received proximal gastrectomy, and 1135 patients received total gastrectomy. Three authors independently collected all clinical variables; conflicting data were recorded by one of the authors and resolved through a final discussion.

2.2. Outcome

The diagnosis of AL was based on a combination of clinical manifestations and imaging findings. AL was diagnosed when gastrointestinal contents passed through the drainage tube or when an oral water-soluble contrast agent leaked outside the gastrointestinal tract. Alternatively, AL could be diagnosed through secondary surgical exploration when the integrity of the anastomosis was found to be interrupted within 30 days after surgery. Case collectors recorded cases with an ambiguous diagnosis of AL, and the classification of these cases was determined during a final discussion by the review team, which comprised two senior gastrointestinal surgeons.

2.3. Machine Learning Algorithms

In this study, four types of machine learning algorithms were assessed: logistic regression (LR), random forest (RF), support vector machine (SVM), and XGBoost. The data were randomly divided into training and testing sets (8:2); because of the class imbalance of the data, the under-sampling method was used to train all algorithms. In order to increase the accuracy of the algorithms, simple min-max normalization was used to keep the continuous variables within a range of [0, 1]. The performance of each model was optimized by hyperparameter adjustment. In the testing set, the performance of the machine learning models was evaluated by the area under the receiver operating characteristic curve (AUC); the diagnostic ability of the models was verified by calculating sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy. All machine learning algorithms were implemented using the scikit-learn package, version 0.24.1, in Python 3.8.5. A web app was built using the Streamlit package, version 0.78.0, through Spyder 4.2.5.
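The two preprocessing steps named above, under-sampling and min-max normalization, can be illustrated with a minimal standard-library sketch. It is not the authors' code: the 1:1 class ratio, the choice to fit the scaling bounds on the under-sampled training rows, the function names, and the toy values are all assumptions made for illustration (the paper does not report these details). In scikit-learn, the scaling step corresponds to `MinMaxScaler`.

```python
import random


def undersample(rows, labels, seed=0):
    """Randomly drop majority-class rows until both classes are the same size.

    Assumption: a 1:1 ratio; the paper does not state the ratio it used.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    keep = sorted(minority + rng.sample(majority, len(minority)))
    return [rows[i] for i in keep], [labels[i] for i in keep]


def fit_min_max(rows):
    """Return the (min, max) of every column, learned from the given rows."""
    return [(min(col), max(col)) for col in zip(*rows)]


def apply_min_max(rows, bounds):
    """Rescale each column with x' = (x - min) / (max - min)."""
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0 for v, (lo, hi) in zip(row, bounds)]
        for row in rows
    ]


# Hypothetical toy rows (age in years, BMI in kg/m^2); NOT patient data.
rows = [[60.0, 21.0], [55.0, 19.5], [70.0, 25.0], [48.0, 22.3], [66.0, 18.9], [59.0, 23.4]]
labels = [0, 0, 1, 0, 1, 0]  # 1 = AL, 0 = no AL

bal_rows, bal_labels = undersample(rows, labels, seed=42)
bounds = fit_min_max(bal_rows)
scaled = apply_min_max(bal_rows, bounds)
```

Values from a held-out set that fall outside the training bounds would map outside [0, 1] under this sketch; whether and how to clip them is a separate design decision.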

2.4. Statistical Analysis

Continuous variables were shown as mean (SD) and categorical variables as count (%). Student’s t-test was used to compare continuous variables; categorical variables were compared using the Chi-square test. All statistical tests were two-tailed, and p < 0.05 was considered statistically significant. Confidence intervals (CIs) of sensitivity, specificity, PPV, NPV, and accuracy were calculated using the Clopper-Pearson method. The above analyses were performed in IBM SPSS 24.0 (SPSS for Windows, IBM Corporation, Armonk, NY, USA) or VassarStats (online).
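For readers unfamiliar with the Clopper-Pearson (exact binomial) interval, the following standard-library sketch shows the idea: the lower and upper limits are the success probabilities at which the observed count sits in the upper or lower tail of the binomial distribution with probability α/2. The function names and the bisection solver are illustrative choices, not the SPSS or VassarStats routines used in the study, so results may differ slightly from the published intervals.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def _solve_increasing(g, iterations=100):
    """Bisection root of a function that rises from negative to positive on [0, 1]."""
    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def clopper_pearson(x, n, alpha=0.05):
    """Exact two-sided (1 - alpha) confidence interval for x successes in n trials."""
    # Lower limit: the p at which P(X >= x) equals alpha / 2.
    lower = 0.0 if x == 0 else _solve_increasing(
        lambda p: (1 - binom_cdf(x - 1, n, p)) - alpha / 2
    )
    # Upper limit: the p at which P(X <= x) equals alpha / 2.
    upper = 1.0 if x == n else _solve_increasing(
        lambda p: alpha / 2 - binom_cdf(x, n, p)
    )
    return lower, upper


# Example: 9 detected cases out of 11 true AL cases in the testing set.
low, high = clopper_pearson(9, 11)
print(f"sensitivity 9/11 = {9 / 11:.3f}, 95% CI ({low:.3f}, {high:.3f})")
```

With only 11 positive cases in the testing set, any exact interval for sensitivity is necessarily wide, regardless of the model.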

3. Results

3.1. Summary of Demographic and Clinical Characteristics for Training and Testing Sets

The study included 1660 patients, and the incidence of AL was 2.17% (36/1660). In order to develop the machine learning model, 1328 cases were assigned to the training set, and the remaining 332 cases were assigned to the testing set. A comparison of the training set and the testing set is shown in Table 1. In the training set, 31.9% of the patients received esophagogastrostomy, compared with 26.8% in the testing set. Total gastrectomy was performed in 67.6% of cases in the training set and 71.4% of cases in the testing set. The incidence of AL was 1.9% (25/1328) in the training set and 3.3% (11/332) in the testing set.

3.2. Performance of the Machine Learning Algorithms

We evaluated the predictive performance of the four machine learning algorithms in the testing set by AUC. The data indicated that RF and XGBoost had better predictive performance (RF-AUC = 0.90, XGBoost-AUC = 0.89), whereas SVM performed poorly (SVM-AUC = 0.81) (Figure 1). Notably, RF and XGBoost are ensemble classifiers based on weak classifiers. The predictive results of each machine learning model in the testing set are shown in Table 2.
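As a reminder of what the AUC measures, it equals the probability that a randomly chosen case with AL receives a higher predicted risk than a randomly chosen case without AL, with ties counted as one half. The sketch below computes it directly from that definition; the function name and the toy scores are illustrative and are not data from this study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos) * len(neg))


# Hypothetical predicted risks for four cases (1 = AL, 0 = no AL).
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)  # 3 of 4 pairs ranked correctly
```

The pairwise loop is quadratic in the number of cases, which is fine for a testing set of a few hundred patients; rank-based formulas give the same value more efficiently.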

3.3. Predictive Abilities of the Machine Learning Models

Five indicators were used to evaluate the machine learning models’ predictions in the testing set. The results indicated that the RF model had higher specificity (0.822 (0.775–0.862) vs. 0.701 (0.647–0.750), p < 0.001) and accuracy (0.822 (0.776–0.861) vs. 0.708 (0.656–0.756), p < 0.001) than SVM. Moreover, when compared with XGBoost, the RF model also had higher specificity (0.822 (0.775–0.862) vs. 0.723 (0.670–0.770), p = 0.003) and accuracy (0.822 (0.776–0.861) vs. 0.729 (0.678–0.776), p = 0.004). No statistical difference was observed between LR and RF in the five indicators (Table 3). To make the model more clinically practical, we developed an online app (https://gasal.21cloudbox.com/ (available from 14 May 2021 to 14 May 2024)) based on the RF model, which calculates the risk of AL in real time from 24 clinical variables from the preoperative and intraoperative periods.

3.4. Feature Importance Analysis

To the best of our knowledge, the occurrence of AL results from the interaction of all relevant factors. In order to gain insight into the contribution of each clinical variable to AL, the importance of each clinical variable was calculated through feature importance analysis, and the results showed that hypertension, diabetes, BMI, Brinkman index, albumin, hemoglobin, tumor size, tumorous obstruction, ASA score, and operation time were the ten most important features in the RF model (Figure 2).

4. Discussion

AL of esophagogastrostomy or esophagojejunostomy is a serious and life-threatening complication in patients with gastric cancer who received total or proximal gastrectomy. Once AL is diagnosed, fasting and gastrointestinal decompression are required, and continuous parenteral nutrition becomes necessary even though it increases the incidence of related complications. In addition, for serious AL, secondary surgery is required to establish smooth drainage of the leakage and to place an indwelling jejunal nutrition tube to support enteral nutrition. Hence, preoperative or intraoperative identification of patients at high risk of AL may assist intraoperative decision making, such as establishing smooth drainage of the anastomotic site and placing a jejunal feeding tube.

Although a rigorous anastomotic operation is an essential measure in preventing AL, the heterogeneity of individual patients also plays an important role in the occurrence of AL. Most clinicians are familiar with the risk factors of AL, such as anemia, prognostic nutritional index, cardiovascular disease, obesity, and smoking [4,16]. However, it is rare for a patient to have all the risk factors, and these risk factors may contribute differently to the development of AL. Thus, accurately calculating the risk of AL for individual patients has always been a great challenge for surgeons. In order to overcome this difficulty, several attempts have been made to develop prediction models of AL through binary logistic regression analysis. For example, Tu et al. proposed a nomogram based on independent risk factors, including age, hemoglobin, and malnourishment, but the model was not validated, and its performance was poor (c-index = 0.675) [5]. Additionally, Kunisaki et al. also developed a model based on independent risk factors; the data suggested that the model failed to accurately predict AL (AUC = 0.658) [17]. Binary logistic regression analysis, which weighs the independent risk factors and generates a linear formula to achieve predictions, is frequently used in analyzing independent risk factors and modeling. Due to the complexity of clinical data, such as multi-dimensional and non-linearly related variables [18], it is difficult for binary logistic regression analysis to generate a high-performance model. In recent years, global enthusiasm for machine learning technology has grown rapidly, and machine learning has achieved impressive results due to improvements in computing power. Some evidence shows that machine learning outperforms statistical models [19,20,21,22]. In the realm of precision medicine, which emphasizes personalized treatment, traditional guidelines or a clinician’s experience can no longer meet the needs of medical decision making. Machine learning, an innovative tool, may meet the needs of precision medicine and help select the best treatment strategy for individual patients. Therefore, we applied machine learning algorithms that do not depend on independent risk factors to develop a predictive model for individual decision making.

In this study, we investigated 1660 cases of gastric adenocarcinoma patients who received total or proximal gastrectomy in the past 10 years and found that the incidence of AL was 2.17% (36/1660), which is similar to previous reports [23,24]. In order to obtain a high-performance tool, we applied four machine learning algorithms and found that RF produced the largest AUC and higher specificity and accuracy compared with SVM and XGBoost. To better satisfy the needs of clinicians, we designed a web app based on RF (81.8% sensitivity, 82.2% specificity, and 0.90 AUC) for achieving real-time predictions online. In order to explore the contribution of each variable to the development of AL, feature importance analysis was performed, and the data suggested that hypertension, diabetes, BMI, Brinkman index, albumin, hemoglobin, tumor size, tumorous obstruction, ASA score, and operation time were the ten most important features. Many of these features have been previously reported as important factors in the development of AL [5,25,26,27,28,29,30]. RF is an ensemble learning algorithm that has shown great capability in regression and classification tasks and is widely applied in medical modeling and feature importance analysis. For example, Dong et al. employed the RF algorithm to train a predictive model by identifying factors significantly associated with the presence of esophageal varices. They found that the AUC of the model in the validation set was 0.75 [31]. In addition, Wu et al. developed a model based on the RF algorithm to predict fatty liver disease using 577 patients’ data, and the model’s performance was favorable (AUC = 0.925) [32]. Hence, there is great potential in using RF to develop high-performance models. To the best of our knowledge, this is the first study to apply a machine learning model, developed from clinical preoperative and intraoperative variables, to predict AL in patients with gastric adenocarcinoma who received total or proximal gastrectomy.

There are several limitations to this study. First, this is a retrospective single-center study, so selection bias is difficult to avoid completely. In addition, data on the tension and blood supply of the anastomosis could not be collected in the present study, although both factors may play important roles in the development of AL. Second, we retrospectively analyzed medical records spanning 10 years, and it is difficult to assess how advancements in medical technology over that period contributed to decreasing AL. Third, the 95% CI of the model’s sensitivity is wide, and cases classified by the machine learning model as low risk of AL must be further evaluated. Fourth, the model needs external validation. To overcome these limitations, we intend to conduct a further multicenter study.

5. Conclusions

In conclusion, based on clinical preoperative and intraoperative variables, a high-performance machine learning model was developed, which may be helpful to surgeons by identifying patients with a high risk of AL, guiding surgeons in intraoperative decision making, and improving perioperative management for the patients. Most importantly, an online app (https://gasal.21cloudbox.com/ (available from 14 May 2021 to 14 May 2024)) was built to meet the needs of further investigations such as the multicenter validation and prospective study. Applying this app can help predict the risk of AL in patients with gastric adenocarcinoma who received total or proximal gastrectomy in a real-time manner.

Author Contributions

Methodology, formal analysis, writing—review and editing, S.S.; supervision, software, L.L.; data curation, Y.Z.; data curation, L.M. and Q.L.; conceptualization, formal analysis, methodology, writing—review and editing, J.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by a grant from the National Natural Science Foundation of China (No. 81903047).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the local ethics committee of Tongji Hospital of Huazhong University of Science and Technology (protocol no. 2021-0522, date of approval: 3 June 2021).

Informed Consent Statement

Patient consent was waived due to the retrospective nature of the study.

Data Availability Statement

The data used in the present study are available from the corresponding author on reasonable request.

Acknowledgments

We thank all members of the Department of Gastrointestinal Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology for their sincere support.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2018, 68, 394–424.
2. Kim, H.H.; Hyung, W.J.; Cho, G.S.; Kim, M.C.; Han, S.U.; Kim, W.; Ryu, S.W.; Lee, H.J.; Song, K.Y. Morbidity and mortality of laparoscopic gastrectomy versus open gastrectomy for gastric cancer: An interim report--A phase III multicenter, prospective, randomized Trial (KLASS Trial). Ann. Surg. 2010, 251, 417–420.
3. Aurello, P.; Berardi, G.; Moschetta, G.; Cinquepalmi, M.; Antolino, L.; Nigri, G.; D’Angelo, F.; Valabrega, S.; Ramacciato, G. Recurrence Following Anastomotic Leakage After Surgery for Carcinoma of the Distal Esophagus and Gastroesophageal Junction: A Systematic Review. Anticancer Res. 2019, 39, 1651–1660.
4. Gong, W.; Li, J. Combat with esophagojejunal anastomotic leakage after total gastrectomy for gastric cancer: A critical review of the literature. Int. J. Surg. 2017, 47, 18–24.
5. Tu, R.H.; Lin, J.X.; Zheng, C.H.; Li, P.; Xie, J.W.; Wang, J.B.; Lu, J.; Chen, Q.Y.; Cao, L.L.; Lin, M.; et al. Development of a nomogram for predicting the risk of anastomotic leakage after a gastrectomy for gastric cancer. Eur. J. Surg. Oncol. 2017, 43, 485–492.
6. Makuuchi, R.; Irino, T.; Tanizawa, Y.; Bando, E.; Kawamura, T.; Terashima, M. Esophagojejunal anastomotic leakage following gastrectomy for gastric cancer. Surg. Today 2019, 49, 187–196.
7. Tanaka, Y.; Kanda, M.; Tanaka, C.; Kobayashi, D.; Mizuno, A.; Iwata, N.; Hayashi, M.; Niwa, Y.; Takami, H.; Yamada, S.; et al. Usefulness of preoperative estimated glomerular filtration rate to predict complications after curative gastrectomy in patients with clinical T2-4 gastric cancer. Gastric Cancer 2017, 20, 736–743.
8. Bhinder, B.; Gilvary, C.; Madhukar, N.S.; Elemento, O. Artificial Intelligence in Cancer Research and Precision Medicine. Cancer Discov. 2021, 11, 900–915.
9. Quer, G.; Arnaout, R.; Henne, M.; Arnaout, R. Machine Learning and the Future of Cardiovascular Care: JACC State-of-the-Art Review. J. Am. Coll. Cardiol. 2021, 77, 300–313.
10. Shung, D.L.; Au, B.; Taylor, R.A.; Tay, J.K.; Laursen, S.B.; Stanley, A.J.; Dalton, H.R.; Ngu, J.; Schultz, M.; Laine, L. Validation of a Machine Learning Model That Outperforms Clinical Risk Scoring Systems for Upper Gastrointestinal Bleeding. Gastroenterology 2020, 158, 160–167.
11. Kudo, S.E.; Ichimasa, K.; Villard, B.; Mori, Y.; Misawa, M.; Saito, S.; Hotta, K.; Saito, Y.; Matsuda, T.; Yamada, K.; et al. Artificial Intelligence System to Determine Risk of T1 Colorectal Cancer Metastasis to Lymph Node. Gastroenterology 2020.
12. Luo, H.; Xu, G.; Li, C.; He, L.; Luo, L.; Wang, Z.; Jing, B.; Deng, Y.; Jin, Y.; Li, Y.; et al. Real-time artificial intelligence for detection of upper gastrointestinal cancer by endoscopy: A multicentre, case-control, diagnostic study. Lancet Oncol. 2019, 20, 1645–1654.
13. Kather, J.N.; Pearson, A.T.; Halama, N.; Jäger, D.; Krause, J.; Loosen, S.H.; Marx, A.; Boor, P.; Tacke, F.; Neumann, U.P.; et al. Deep learning can predict microsatellite instability directly from histology in gastrointestinal cancer. Nat. Med. 2019, 25, 1054–1056.
14. Yamashita, R.; Long, J.; Longacre, T.; Peng, L.; Berry, G.; Martin, B.; Higgins, J.; Rubin, D.L.; Shen, J. Deep learning model for the prediction of microsatellite instability in colorectal cancer: A diagnostic study. Lancet Oncol. 2021, 22, 132–141.
15. Kawakami, E.; Tabata, J.; Yanaihara, N.; Ishikawa, T.; Koseki, K.; Iida, Y.; Saito, M.; Komazaki, H.; Shapiro, J.S.; Goto, C.; et al. Application of Artificial Intelligence for Preoperative Diagnostic and Prognostic Prediction in Epithelial Ovarian Cancer Based on Blood Biomarkers. Clin. Cancer Res. 2019, 25, 3006–3015.
16. Oshi, M.; Kunisaki, C.; Miyamoto, H.; Kosaka, T.; Akiyama, H.; Endo, I. Risk Factors for Anastomotic Leakage of Esophagojejunostomy after Laparoscopy-Assisted Total Gastrectomy for Gastric Cancer. Dig. Surg. 2018, 35, 28–34.
17. Kunisaki, C.; Miyata, H.; Konno, H.; Saze, Z.; Hirahara, N.; Kikuchi, H.; Wakabayashi, G.; Gotoh, M.; Mori, M. Modeling preoperative risk factors for potentially lethal morbidities using a nationwide Japanese web-based database of patients undergoing distal gastrectomy for gastric cancer. Gastric Cancer 2017, 20, 496–507.
18. Kann, B.H.; Hosny, A.; Aerts, H. Artificial intelligence for clinical oncology. Cancer Cell 2021, 39, 916–927.
19. Ngiam, K.Y.; Khor, I.W. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. 2019, 20, e262–e273.
20. Goecks, J.; Jalili, V.; Heiser, L.M.; Gray, J.W. How Machine Learning Will Transform Biomedicine. Cell 2020, 181, 92–101.
21. Ichimasa, K.; Kudo, S.E.; Mori, Y.; Misawa, M.; Matsudaira, S.; Kouyama, Y.; Baba, T.; Hidaka, E.; Wakamura, K.; Hayashi, T.; et al. Artificial intelligence may help in predicting the need for additional surgery after endoscopic resection of T1 colorectal cancer. Endoscopy 2018, 50, 230–240.
22. Nudel, J.; Bishara, A.M.; de Geus, S.W.L.; Patil, P.; Srinivasan, J.; Hess, D.T.; Woodson, J. Development and validation of machine learning models to predict gastrointestinal leak and venous thromboembolism after weight loss surgery: An analysis of the MBSAQIP database. Surg. Endosc. 2021, 35, 182–191.
23. Inokuchi, M.; Otsuki, S.; Fujimori, Y.; Sato, Y.; Nakagawa, M.; Kojima, K. Systematic review of anastomotic complications of esophagojejunostomy after laparoscopic total gastrectomy. World J. Gastroenterol. 2015, 21, 9656–9665.
24. Nakagawa, M.; Tokunaga, M.; Aburatani, T.; Sato, Y.; Matsuyama, T.; Nakajima, Y.; Kinugasa, Y. Feasibility and Safety of Early Oral Intake and Discharge After Total or Proximal Gastrectomy: An Analysis of Consecutive Cases Without Exclusion Criteria. Ann. Surg. Oncol. 2020, 27, 812–821.
25. Miyawaki, Y.; Sato, H.; Fujiwara, N.; Sugita, H.; Sakuramoto, S.; Okamoto, K.; Yamaguchi, S.; Koyama, I. Evaluation of the Associations between Gastric Tube Preparation Methods and the Incidence of Cervical Anastomotic Leakage after Esophagectomy for Thoracic Esophageal Cancer. Dig. Surg. 2020, 37, 154–162.
26. Li, S.J.; Wang, Z.Q.; Li, Y.J.; Fan, J.; Zhang, W.B.; Che, G.W.; Liu, L.X.; Chen, L.Q. Diabetes mellitus and risk of anastomotic leakage after esophagectomy: A systematic review and meta-analysis. Dis. Esophagus 2017, 30, 1–12.
27. Hasegawa, T.; Kubo, N.; Ohira, M.; Sakurai, K.; Toyokawa, T.; Yamashita, Y.; Yamazoe, S.; Kimura, K.; Nagahara, H.; Amano, R.; et al. Impact of body mass index on surgical outcomes after esophagectomy for patients with esophageal squamous cell carcinoma. J. Gastrointest. Surg. 2015, 19, 226–233.
28. Ji, L.; Wang, T.; Tian, L.; Gao, M. The early diagnostic value of C-reactive protein for anastomotic leakage post radical gastrectomy for esophagogastric junction carcinoma: A retrospective study of 97 patients. Int. J. Surg. 2016, 27, 182–186.
29. Deguchi, Y.; Fukagawa, T.; Morita, S.; Ohashi, M.; Saka, M.; Katai, H. Identification of Risk Factors for Esophagojejunal Anastomotic Leakage after Gastric Surgery. World J. Surg. 2012, 36, 1617–1622.
30. Zhao, G.F.; Zhang, K.P.; Gao, S.G.; Mu, J.W.; Mao, Y.S.; Wang, D.L.; Gao, Y.S.; Lyu, F.; Zhao, L.; Xue, Q. Analysis of the risk factors for postoperative cervical anastomotic leakage after McKeown’s esophagectomy. Zhonghua Zhong Liu Za Zhi [Chin. J. Oncol.] 2017, 39, 287–292.
31. Dong, T.S.; Kalani, A.; Aby, E.S.; Le, L.; Luu, K.; Hauer, M.; Kamath, R.; Lindor, K.D.; Tabibian, J.H. Machine Learning-based Development and Validation of a Scoring System for Screening High-Risk Esophageal Varices. Clin. Gastroenterol. Hepatol. 2019, 17, 1894–1901.e1891.
32. Wu, C.C.; Yeh, W.C.; Hsu, W.D.; Islam, M.M.; Nguyen, P.A.A.; Poly, T.N.; Wang, Y.C.; Yang, H.C.; Jack Li, Y.C. Prediction of fatty liver disease using machine learning algorithms. Comput. Methods Programs Biomed. 2019, 170, 23–29.

Figure 1. Performance of the machine learning algorithms for predicting AL in the testing set. ROC: receiver operating characteristic curve; LR: logistic regression; RF: random forest; SVM: support vector machine; TPR: true positive rate; FPR, false positive rate.

Figure 2. Radar plot for the ten most important variables in predicting AL of the RF model. BMI, body mass index; ASA, American Society of Anesthesiologists classification score; RF, random forest.

Table 1. Comparison of the training and testing sets.

Variables	Training Set (n = 1328)	Testing Set (n = 332)	p Value
Male, n (%)	983 (74.0%)	242 (72.9%)	0.626
Age, mean (SD), years	58.94 (9.80)	59.66 (10.71)	0.242
BMI, mean (SD), kg/m²	21.01 (2.65)	21.02 (2.76)	0.930
Hypertension, n (%)	312 (23.5%)	67 (20.2%)	0.214
Diabetes, n (%)	88 (6.6%)	23 (6.9%)	0.807
Previous abdominal surgery, n (%)	260 (19.6%)	60 (18.1%)	0.586
Brinkman index, mean (SD)	221.34 (412.59)	199.94 (316.26)	0.388
Alcohol use, n (%)	272 (20.5%)	64 (19.3%)	0.648
Hemoglobin, mean (SD), g/L	119.96 (22.17)	120.24 (23.73)	0.839
Albumin, mean (SD), g/L	38.46 (4.57)	38.74 (4.90)	0.325
Tumor size, mean (SD), cm	4.34 (2.33)	4.36 (2.40)	0.881
Tumorous obstruction, n (%)	226 (17.0%)	58 (17.5%)	0.871
Neoadjuvant, n (%)	33 (2.5%)	7 (2.1%)	0.842
Total gastrectomy, n (%)	898 (67.6%)	237 (71.4%)	0.210
Esophagogastrostomy, n (%)	424 (31.9%)	89 (26.8%)	0.073
Combined resection, n (%)	68 (5.1%)	21 (6.3%)	0.413
Laparoscopic surgery, n (%)	1133 (85.3%)	283 (85.2%)	1.000
Blood loss, mean (SD), ml	146.95 (252.80)	140.66 (222.05)	0.678
Intraperitoneal chemotherapy, n (%)	979 (73.7%)	254 (76.5%)	0.326
Nasogastric tube, n (%)	1305 (98.3%)	323 (97.3%)	0.263
Indwelling drainage tube, n (%)	1317 (99.2%)	325 (97.9%)	0.068
Operative time, mean (SD), minutes	304.28 (60.71)	312.17 (63.11)	0.036
ASA			0.083
1	205 (15.4%)	48 (14.5%)
2	971 (73.1%)	233 (70.2%)
3	145 (10.9%)	51 (15.4%)
4	7 (0.5%)	0 (0.0%)
Clinical stages			0.353
1	200 (15.0%)	49 (14.8%)
2	454 (34.2%)	108 (32.5%)
3	608 (45.8%)	150 (45.2%)
4	66 (5.0%)	25 (7.5%)
AL	25 (1.9%)	11 (3.3%)	0.087

SD, standard deviation; BMI, body mass index; ASA, American Society of Anesthesiologists score; AL, anastomotic leakage.

Table 2. Predictive results of four machine learning models in the testing set.

Model	Prediction	Cases with AL (true label)	Cases without AL (true label)
LR	AL(+)	10	74
LR	AL(−)	1	247
RF	AL(+)	9	57
RF	AL(−)	2	264
SVM	AL(+)	10	96
SVM	AL(−)	1	225
XGBoost	AL(+)	10	89
XGBoost	AL(−)	1	232

AL, anastomotic leakage; LR, logistic regression; RF, random forest; SVM, support vector machine.
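The five indicators reported for the testing set follow directly from the counts in Table 2. The sketch below recomputes them from the standard confusion-matrix definitions; the function and variable names are illustrative, and recomputed values can differ from the published ones in the third decimal place because of rounding.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard confusion-matrix indicators for a binary classifier."""
    return {
        "sensitivity": tp / (tp + fn),  # detected AL cases / all AL cases
        "specificity": tn / (tn + fp),  # correct negatives / all non-AL cases
        "ppv": tp / (tp + fp),          # true AL among predicted AL(+)
        "npv": tn / (tn + fn),          # true non-AL among predicted AL(-)
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Counts copied from Table 2 (testing set, n = 332, 11 cases with AL).
table2 = {
    "LR": dict(tp=10, fp=74, fn=1, tn=247),
    "RF": dict(tp=9, fp=57, fn=2, tn=264),
    "SVM": dict(tp=10, fp=96, fn=1, tn=225),
    "XGBoost": dict(tp=10, fp=89, fn=1, tn=232),
}

results = {name: diagnostic_metrics(**counts) for name, counts in table2.items()}
for name, metrics in results.items():
    print(name, {key: round(value, 3) for key, value in metrics.items()})
```

Because only 11 of the 332 testing cases had AL, the PPV is low for every model even when sensitivity and specificity are both above 0.8; this is a consequence of the low prevalence rather than of any particular algorithm.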

Table 3. Performance of machine learning models in the testing set.

	RF	LR	SVM	XGBoost	p Value (RF vs. LR)	p Value (RF vs. SVM)	p Value (RF vs. XGBoost)
Sensitivity (95% CI)	0.818 (0.478–0.968)	0.909 (0.572–0.995)	0.909 (0.572–0.995)	0.909 (0.572–0.995)	0.534	0.534	0.534
Specificity (95% CI)	0.822 (0.775–0.862)	0.770 (0.719–0.814)	0.701 (0.647–0.750)	0.723 (0.670–0.770)	0.096	<0.001	0.003
PPV (95% CI)	0.137 (0.068–0.248)	0.119 (0.062–0.212)	0.094 (0.049–0.171)	0.101 (0.052–0.182)	0.752	0.392	0.486
NPV (95% CI)	0.992 (0.970–0.999)	0.996 (0.974–1.000)	0.996 (0.972–1.000)	0.996 (0.973–1.000)	0.978	0.981	0.980
Accuracy (95% CI)	0.822 (0.776–0.861)	0.774 (0.725–0.818)	0.708 (0.656–0.756)	0.729 (0.678–0.776)	0.122	<0.001	0.004

CI: confidence interval; PPV, positive predictive value; NPV, negative predictive value; LR, logistic regression; RF, random forest; SVM, support vector machine.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
