Next Article in Journal
A Cross-Sectoral Telemedicine Network (sekTOR-HF) for Patients with Heart Failure
Previous Article in Journal
Transcatheter Aortic Valve Implantation and Replacement: The Latest Advances and Prospects
Previous Article in Special Issue
Oligometastatic Mixed Neuroendocrine Adenocarcinoma of the Esophago-Gastric Junction: A Case of Successful Multidisciplinary Management, the Lessons Learnt and Review of the Literature
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Current Role of Artificial Intelligence in the Management of Esophageal Cancer

by
Evgenia Mela
1,*,
Dimitrios Tsapralis
2,
Dimitrios Papaconstantinou
3,
Panagiotis Sakarellos
1,
Chrysovalantis Vergadis
4,
Michail E. Klontzas
5,6,7,8,
Ioannis Rouvelas
9,10,
Antonios Tzortzakakis
5,11 and
Dimitrios Schizas
1,*
1
First Department of Surgery, National and Kapodistrian University of Athens, Laikon General Hospital, 11527 Athens, Greece
2
Department of Surgery, General Hospital of Ierapetra, 72200 Ierapetra, Greece
3
Third Department of Surgery, National and Kapodistrian University of Athens, Attikon University Hospital, 12462 Athens, Greece
4
Department of Radiology, Laikon General Hospital, 11527 Athens, Greece
5
Department for Clinical Science, Intervention and Technology (CLINTEC), Division of Radiology, Karolinska Institutet, 14152 Stockholm, Sweden
6
Department of Medical Imaging, University Hospital of Heraklion, 71500 Crete, Greece
7
Computational BioMedicine Laboratory, Institute of Computer Science, Foundation for Research and Technology (FORTH), 71500 Heraklion, Greece
8
Department of Radiology, School of Medicine, University of Crete, Voutes Campus, 70013 Heraklion, Greece
9
Department of Clinical Science, Intervention and Technology (CLINTEC), Division of Surgery and Oncology, Karolinska Institutet, 14152 Stockholm, Sweden
10
Department of Upper Abdominal Diseases, Karolinska University Hospital, Huddinge, 14152 Stockholm, Sweden
11
Medical Radiation Physics and Nuclear Medicine, Section for Nuclear Medicine, Karolinska University Hospital, Huddinge, 14152 Stockholm, Sweden
*
Authors to whom correspondence should be addressed.
J. Clin. Med. 2025, 14(6), 1845; https://doi.org/10.3390/jcm14061845
Submission received: 21 January 2025 / Revised: 3 March 2025 / Accepted: 7 March 2025 / Published: 9 March 2025

Abstract

:
Background/Objectives: Esophageal cancer (EC) represents a major global contributor to cancer-related mortality. The advent of artificial intelligence (AI), including machine learning, deep learning, and radiomics, holds promise for enhancing treatment decisions and predicting outcomes. The aim of this review is to present an overview of the current landscape and future perspectives of AI in the management of EC. Methods: A literature search was performed on MEDLINE using the following keywords: “Artificial Intelligence”, “Esophageal cancer”, “Barrett’s esophagus”, “Esophageal Adenocarcinoma”, and “Esophageal Squamous cell carcinoma”. All titles and abstracts were screened; the results included 41 studies. Results: Over the past five years, the number of studies focusing on the application of AI to the treatment and prognosis of EC has surged, leveraging increasingly larger datasets with external validation. The simultaneous incorporation in AI models of clinical factors and features from several imaging modalities displays improved predictive performance, which may enhance patient outcomes, based on direct personalized therapeutic options. However, clinicians and researchers must address existing limitations, conduct randomized controlled trials, and consider the ethical and legal aspects that arise to establish AI as a standard decision-support tool. Conclusions: AI applications may result in substantial advances in EC management, heralding a new era. Considering the complexity of EC as a clinical entity, the evolving potential of AI is anticipated to ameliorate patients’ quality of life and survival rates.

1. Introduction

Esophageal cancer represents the eighth-most prevalent malignancy worldwide and constitutes a leading cause of cancer-related mortality [1,2]. The global burden of esophageal cancer has grown more than sixfold, attributed to the world’s aging population along with the rising prevalence of related risk factors, the most prominent being tobacco and alcohol use alongside obesity [2]. By 2030, esophageal adenocarcinoma is expected to experience a significant rise in incidence, while esophageal squamous cell carcinoma is projected to decline steadily [3]. Associated with a dismal prognosis, the reported 5-year survival rate is 22%, which declines to 6% in the occurrence of distant metastases [4]. EC is estimated to decrease the absolute years of life by 7.8 (95% confidence interval: 2.3–12.7) [5].
The advent of AI, which simulates human cognitive abilities, has revolutionized numerous disciplines. AI constitutes an appealing approach to medical management mainly by employing two major tools: traditional machine learning (ML) and deep learning (DL) [6]. The implementation of ML and DL techniques lessens the prospect of human error. ML is the discipline in which models are trained through automated learning using expert-provided data to identify specified patterns. DL, a subset of ML, utilizes multilayered neural networks and larger datasets and simulates a more complex form of learning that enhances accuracy and outcomes over time [7]. Meanwhile, radiomics offers a quantitative perspective on imaging data, permitting the recognition of fine patterns corresponding to the tumor’s biological behavior and heterogeneity that are imperceptible through conventional means [8].
EC remains a significant global health concern, affecting both its diagnosis and management [1,9]. The emergence of AI and its application for diagnostic, therapeutic, and prognostic purposes presents an opportunity to improve the poor prognosis associated with EC [9]. AI has been utilized across the entire spectrum of EC management, particularly in diagnosis. In this regard, hyperspectral and multispectral imaging techniques, which constitute AI-based diagnostic imaging techniques, have been employed to enhance EC detection rates with encouraging outcomes [10,11,12]. While AI has been primarily employed in the identification of early precursor lesions and to enhance prognosis, notable advancements have also been made in its role in patient care, including treatment optimization and survival prediction [9]. Given the scarcity of non-invasive biomarkers, there is a significant reliance on radiological imaging during treatment planning and assessment, wherein AI could serve a pivotal role [8].
The purpose of this review is to critically evaluate and consolidate the existing literature concerning the use of AI for optimizing treatment decisions and their implementation in the prognosis prediction of EC. To our knowledge, this is the first narrative review that focuses exclusively on the benefits of AI in the management of EC, aiming to provide an overview of its current and future applications.

2. Materials and Methods

We performed a literature search of MEDLINE from its start date up to June 2024. The search strategy included the following terms: “Artificial Intelligence”, “Esophageal cancer”, “Barrett’s esophagus”, “Esophageal Adenocarcinoma” and “Esophageal Squamous cell carcinoma”. Eligibility criteria included randomized controlled trials, non-randomized cohort or prospective studies, and retrospective studies, which were published in the English language and referred to applications of AI in treatment decisions, planning, surgical treatment, and prognostic prediction of EC. Articles without an available full text and irrelevant or unavailable data were excluded. The exclusion criteria also included conference abstracts, preprints, and non-English studies. There were no particular inclusion or exclusion criteria defined by AI model validation methods or sample size.
The titles and abstracts of all articles were initially independently screened by two researchers (E.M and D.P.), and any ensuing conflicts were addressed by a third researcher (D.T.). Full-text articles were subsequently reviewed for data extraction. Ultimately, 41 studies were included (Figure 1), and considering the heterogeneity of the study pool, a narrative presentation was chosen.

3. AI Applications in EC Treatment and Prognosis Prediction

3.1. AI in Treatment Decision and Planning

AI has the potential to revolutionize the current multidisciplinary treatment planning for EC by further optimizing effective time management and ensuring consistency in the practices of multidisciplinary tumor boards. In 2023, British experts introduced a multinomial logistic regression algorithm aiming at replicating the decision-making of the upper gastrointestinal multidisciplinary boards, which was tested retrospectively. The findings, utilizing a single-center database, demonstrated good performance, with a mean area under the curve (AUC) of 0.793. Notably, age was identified as the most significant variable affecting predictive accuracy [13]. Additionally, to facilitate multidisciplinary team decisions, Zahedi et al. developed a multilayer neural network incorporating particle swarm optimization and an error back propagation program to calculate the optimal chemotherapy dosage. Particle swarm optimization is a bio-inspired program that simulates the collective behavior of social creatures, such as birds. It is formed by a swarm of particles, each reflecting a solution to an optimization issue, which collectively modify their velocity and position to attain the global optimal state. Simultaneously, error back propagation is a supervised training approach for artificial neural networks (ANNs) that shifts the weights of the network in response to the error function. The model was tested retrospectively and achieved a reported accuracy of 77.3% [14].
Supplementary treatment-planning algorithms for EC have likewise been developed for radiotherapy. In 2021, Barragán-Montero et al. emphasized the importance of a large, high-quality dataset for constructing DL models for the dosage prediction of intensity-modulated radiotherapy (IMRT) [15]. Meanwhile, a key aspect of radiation planning is clinical target volume (CTV) segmentation, which demands time and relies on the physician’s expertise and experience [16]. The first study on CTV auto-segmentation, employing CT scans for stage I and II esophageal cancer after surgical resection, utilized a deep dilated convolutional U-network (DDUnet) model architecture for image segmentation. DDUnet delivers a more comprehensive image segmentation compared to the traditional Unet by retrieving and maintaining contextual data from the provided images. The study documented an 86,7% dice similarity coefficient (DSC) value and a total time of 25 s per patient [16]. Similarly, Jin et al. focused on convolutional neural network (CNN)-based models for gross tumor volume contour delineation and found that their proposed algorithm, VUMix-Net, achieved superior quantitative and qualitative performance, comparable to that of human experts, particularly for the middle third of the esophagus [17]. Subsequently, with the intent of maximizing time and workload efficiency, alongside the accuracy of radiation dosage allocation, an asymmetrical 3D ResNet DL model was proposed for patients receiving a range of prescribed doses. The model projected dose distribution with a mean relative error of 2% and an average DSC for isodose volumes of 0.93. In addition, it was demonstrated that the model’s incorporation enhanced clinical workflow efficacy by elevating the accuracy of clinical decisions for conventional or dose-reduced radiotherapy, from 95.24% to 100% for physicians, by significantly decreasing planning time by 61.57% for junior and 52.47% for senior dosimetrists, along with reducing the average dosage variance by 78.64% among the two [18].
With encouraging findings and the potential of personalized treatment plans, numerous predictive models have been generated for estimating radiation-derived complications. Zhu et al. developed an ML approach for predicting radiotherapy-related immunotoxicity in EC patients undergoing proton and photon treatment. The proposed model displayed an AUC of 0.783 and an accuracy of 0.717, whereas parameters such as age, absolute lymphocyte count, and cardiopulmonary dosage were found to influence lymphopenia risk when shifting to proton therapy [19]. Yet, Chinese researchers created another model, HybridNet, incorporating pulmonary dose distribution and clinical features to predict the occurrence of radiation pneumonitis. HybridNet exhibited an accuracy of 0.85 and an AUC of 0.91, highlighting its potential as a future decision-support tool in order to simplify clinical workflow and optimize treatment quality [20].

3.2. AI in EC Surgical Treatment

Esophagectomy remains the cornerstone treatment for localized EC. Despite the current applications of AI in surgery being limited, substantial research has been conducted. In the study of Sato et al., a DL algorithm for image analysis was developed to accurately detect the position of recurrent laryngeal nerves intraoperatively. This software surpassed the average performance of general surgeons’ and nearly matched that of upper gastrointestinal surgeons [21]. With the future prospect of aiding in the intraoperative identification of tumor margins and the achievement of a higher percentage of R0 resection, Maktabi et al. introduced a hyperspectral imaging algorithm, which identified malignant tissue with a sensitivity of 63% and a specificity of 69%, within a timeframe of less than a second [22]. Nonetheless, the inquiry for future research persists regarding the potential of AI to facilitate the achievement of greater R0 rates. Likewise, AI algorithms have been devised to automatically identify endoscopic or surgical phases, enabling potential continuous monitoring and enhancing team performance. Particularly, the model applied in robot-assisted esophagectomy displayed an accuracy rate of 84%, whereas the initial model deployed for endoscopic submucosal dissection, which is considered the mainstay of treatment for superficial tumors, indicated an overall accuracy of 90% [23,24]. This superior performance in endoscopic settings can be attributed to the uniformity and the minimal anatomic variations encountered during surgical procedures [23].
Considering that esophagectomy is a procedure carrying an increased risk of complications, Jung et al. constructed a neural network to predict their occurrence. The results indicated a poor overall accuracy of 0.688 and an AUC of 0.672 for complications equal to or greater than IIIa, according to the Clavien–Dindo classification. In this context, the neural network outperformed the Cologne risk score, which is an established clinical risk calculator, in terms of accuracy for predicting Clavien–Dindo IIIa or higher complications (accuracy 0.688 versus 0.510, respectively) and surgical complications (accuracy 0.667 versus 0.624, respectively) [25]. Similarly, van Kooten et al. conducted a comparative study to assess the advantages of ML in conjunction with linear regression algorithms in forecasting anastomotic leakage and respiratory sequelae following upper gastrointestinal tract oncologic surgery. The researchers found that the linear regression model accomplished the highest overall prediction accuracy for esophageal surgery, followed by neural networks for estimating anastomotic leak probability [26]. A novel ML hybrid algorithm has been recently introduced, integrating preoperative CT and clinical variables in order to accurately estimate the probability of an anastomotic leak. The reported performance was an AUC of 79.2%, facilitating the prompt identification and treatment of an anastomotic leak, even in cases with minimal clinical suspicion [27]. Algorithms have also been generated to estimate early postoperative re-admissions within 30 days, and it has been demonstrated that ML models can disclose associations that are imperceptible by traditional regression algorithms and discover independent predictive variables for complications or re-admissions, hence enhancing perioperative treatment and recovery [26,27,28]. Table 1 summarizes the results of AI applications in surgical treatment.

3.3. AI in Treatment Response Prediction

Although preoperative assessment and prediction of treatment response are crucial for guiding individualized therapeutic decisions, this remains a challenge in clinical practice. Despite the potential confounding effects, involving differences in neoadjuvant treatment regimens amongst different protocols, numerous studies have established that imaging can serve as a prognostic indicator of treatment responsiveness. In 2020, Hu et al. demonstrated that incorporating both peritumoral and intratumoral radiomic features enhances predictive efficacy for pathological complete response (pCR) following neoadjuvant treatment in esophageal squamous cell carcinoma (ESCC) [29]. Subsequently, a comparative analysis was conducted between radiomics, CNNs, and clinical models, which reported that the optimal model integrating features by ResNet50, which is a CNN model for image classification, displaying an AUC of 0.805 and with all radiological algorithms outperforming the conventional clinical models [30]. However, a subsequent study indicated that the integration of clinical data with radiomics enhances performance, as demonstrated by a reported AUC of 0.891, as opposed to 0.817 for the radiomics-only model, in predicting the pCR of primary ESCC lesions [31]. In contrast to these retrospective studies, the multicenter study by Li et al. established an advanced three-dimensional prediction algorithm that demonstrated a 100% positive predictive value and an AUC of 0.833 in identifying non-responders to neoadjuvant chemoradiotherapy for locally advanced ESCC [32]. Additionally, a CT radiomics model was developed to estimate responses to immunotherapy and chemotherapy in patients with advanced ESCC. The study evaluated two-dimensional and three-dimensional CT radiomics models, which were established on 2D regions of interest (ROIs) at the level of greatest transverse tumor diameter and 3D ROIs derived by the entire tumor volume, respectively. The two-dimensional corrected model outperformed the three-dimensional model, yielding an accuracy of 79.6% in the validation cohort [33]. While the Response Evaluation Criteria in Solid Tumors (RECIST) serve as an established method for assessing residual disease in solid tumors, they exhibit limitations in EC due to indistinct margins or fibrotic tissue following neoadjuvant treatment, and there have been no direct comparisons in terms of performance among RECIST and neural networks for EC [34].
In the context of radiotherapy response, the absence of biomarkers precludes the individualization of treatment plans. To address this, researchers utilized the pre-trained ResNet50 for predicting radiotherapy response, achieving an AUC value of 0.732 in external validation [35]. In addition, Jin et al. demonstrated that merging radiomic and dosimetric variables enhances prediction accuracy and efficiency. Recently a dose map-guided AI model was introduced for estimating pCR, reporting an impressive AUC of 0.928 [36,37].
Although the reduction in tumor metabolic rate, as evaluated by standardized uptake value (SUV) within the second week of neoadjuvant treatment using 18F-FDG positron emission tomography (18F-FDG PET), has been recognized in the global literature as having predictive significance, its sensitivity and specificity remain below 70% [38,39]. Ypsilantis et al. were the first to develop a three-slice CNN, which employed PET scan data to predict response to neoadjuvant chemotherapy, with an accuracy of 73.4% and sensitivity and specificity exceeding 80% [40]. Furthermore, the combination of variables from PET scan and CT imaging delivers superior results compared to each modality individually, with a reported AUC of 0.87 [41]. Likewise, Qi et al., who first conducted a prospective study combining PET, CT, and clinical features, concluded that incorporating clinical data optimizes the performance of predictive models for ESCC patients undergoing neoadjuvant chemoradiotherapy and immunotherapy. Therefore, the multimodal support vector machine ML model significantly outperformed the ones utilizing only CT or PET variables in the testing dataset (AUC = 0.852, 95% CI: 0.824–0.876) [42].
Finally, in addition to imaging modalities, there have been advancements in the development of computational models making use of endoscopic images for predicting response following neoadjuvant therapy. The initial assessment of a deep neural network endoscopy-based algorithm for predicting pCR using pretreatment images in locally advanced ESCC was conducted by Kawahara et al. in 2022 [43]. An accuracy rate of 64% was recorded, which subsequently rose to 80.6% when imaging filters were employed [43]. Similarly, Matsuda et al. constructed an AI-based predictive model for pCR using pretreatment endoscopy images for ESCC patients. Their findings indicated a moderate degree of accuracy, surpassing that of endoscopists [44]. The researchers also created an AI-guided endoscopic response assessment model for ESCC patients following neoadjuvant treatment, achieving comparable or superior results compared to the average performance of endoscopists [45]. Table 2 summarizes the outcomes of the aforementioned studies.

3.4. AI in EC Prognosis

One of the most significant applications of AI is the prediction of disease prognosis. The traditional primary tumor, lymph nodes, and distant metastasis (TNM) staging system has long been utilized as the main indicator for patient outcomes [46]. However, in the fundamental study of Sato et al. ANNs were implemented for clinicopathologic, biologic, and genetic variables and significantly outperformed the TNM classification system in predicting 1- and 5-year survival rates (p < 0.0001) [47]. In support of the use and the superiority of ANNs were also Modifi et al. for survival estimates following surgical resection. In particular, the ANNs displayed a statistically significant superiority in accuracy compared to the TNM classification system for 1-year disease-free survival (DFS), 88% versus 71.6% (p < 0.01), and for 3 years, at 91.5% versus 74.7% (p < 0.05) [48].
In addition to ANNs, the emergence of radiomics and their extensive application in the quantitative interpretation of imaging data has resulted in their integration into predictive survival models. While Larue et al. underlined that CT radiomics models achieve greater prognostic performance than the clinical models due to the additional features defining tumor heterogeneity, Luo et al. demonstrated that the combined model integrating radiomics and clinical features was superior for predicting local progression-free survival [49,50]. In this context, Cui et al. developed a composite ML model to predict progression-free survival (PFS) and overall survival (OS) in non-surgical ESCC patients and concluded that incorporating clinical data improves predictive efficacy, reporting an AUC of 0.856 and 0.742 for PFS and OS respectively [51]. Furthermore, a DL radiomics model has been assessed in conjunction with handcrafted features radiomics model, such as the tumor’s shape and texture. The DL radiomics model exhibited an advantage with regard to Harrel’s concordance index and AUC, correlating with 3-year OS [52]. In 2023, Zhang H et al. established and validated a new DL staging system for patients with ESCC using a database of 6020 patients. The novel system displayed a remarkable capacity for differentiation regarding 3- and 5-year OS rates [53]. Subsequently, Zhang K et al. sought to forecast the OSs of patients with ESCC by using a Cox proportional hazards (CoxPH) model utilizing a substantial dataset and machine-learning techniques. The optimal model, which comprised 10 variables, demonstrated an AUC of 0.760, surpassing the risk model that incorporated the five most important variables, with an AUC of 0.745, and achieved superior distinctive ability compared to the AJCC eighth staging system [54].
Likewise, magnetic resonance imaging (MRI) has been utilized alongside models incorporating CT radiomics features, demonstrating remarkable consistency in tumor imaging. Combining MRI-based radiomics features with clinical parameters yielded good predictive ability for DFS and OS in ESCC patients [55].
Predictive models have also been created to assess survival and clinical effectiveness following immunotherapy. PD-L1 expression constitutes a prominent biomarker, despite its uncertain prognostic significance [56,57]. For this purpose, Chen et al. established a seven-immune gene model that exhibited an AUC performance of 0.825 at 1 year for the testing group, with the limitation of low efficiency at 3 years due to an insufficient number of patients with follow-up exceeding 3 years [58]. Furthermore, Li B et al. developed a DL pathomics system, named ESCC-PS, to estimate PFS following immunotherapy, with a concordance index of 0.806. This model outperformed PD-L1, indicating that ESCC-PS provides supplemental prognostic information [59]. Table 3 consolidates the findings for AI in EC prognosis prediction.

4. Limitations

Among the limitations of this review is its narrative methodology, which differs from a systematic analysis of the current literature. Consequently, it is inherently more susceptible to selection bias. In this context, this descriptive presentation may hinder generalizability due to the variability in AI model development, preprocessing techniques, and validation methods among the included studies. The lack of systematic analysis of bias in AI algorithms, data privacy concerns, and regulatory approval protocols also stems from the narrative methodology. Furthermore, bridging the gap between research and clinical practice requires the application of AI on larger datasets within the context of multicenter clinical trials and the direct comparison of AI models to conventional treatment and prognostic assessment. Finally, given that ESCC is the predominant histological type globally, particularly in Eastern Asian countries, the majority of the included papers originate from Asia, potentially affecting the generalizability of the findings.

5. Future Perspectives

The current landscape highlights the substantial potential of AI in treatment decision-making and outcome prediction for EC patients. Ongoing clinical trials and observational studies aim to address the challenges that must be overcome before AI can be integrated into clinical practice. Specifically, researchers in Singapore and China are working to develop and assess a clinical decision-assistance system for upper gastrointestinal tract malignancies. In particular, this clinical trial aims to develop and prospectively validate a clinical decision support system, along with comparing it with the recommendations of expert oncological teams in order to improve therapeutic decision making in upper gastrointestinal malignancies (NCT04675138). Concurrently, a prediction algorithm for symptoms and complications following chemotherapy is under development for several malignancies, including EC. The objective of this prospective observational study is to acquire disease-specific biomarkers from EC patients undergoing chemotherapy via wearable devices while concurrently measuring self-reported adverse events, emergency department visits, and chemotherapy completion rates through an application, to ultimately develop an AI-based complication prediction algorithm based solely on biomarkers (NCT05937477). Meanwhile, a research team from India is constructing an AI-based system to predict the pathological response of EC to neoadjuvant chemoradiotherapy and aims to minimize the ‘black box’ phenomenon, wherein the underlying rationale of a CNN remains obscure, while also demonstrating the biological coherence of the predictions, enhancing thus the transparency, comprehensibility, and interpretability of the AI models. This observational study recognizes the lack of transparency of CNNs and aims to address it by utilizing activation maps that highlight the critical areas of an image pertinent to a given prediction, thus illustrating the biological plausibility of the predictions (NCT04489368).
In this context, due to the scarcity of contemporary research in esophageal cancer that improves AI transparency and explainability, there is an imperative need for more studies focusing on interpretative methods to augment AI comprehensibility. Several interpretation tools, including SHAP, LIME, and Grad-CAM, have been introduced in the medical field to elucidate the decision-making process of DL models [60]. Furthermore, causal inference offers a novel approach for investigating causality as opposed to simple correlation, hence enhancing the reliability of AI prediction in the medical field [61]. The ethical and legal implications also arising from this ‘black box’ nature warrant further examination. These encompass issues of privacy, surveillance, bias, and the overarching philosophical controversy over the significance of human reasoning [62]. An additional major challenge is the lack of explicit accountability in instances of errors, privacy violations, and mishandling of big datasets. Therefore, to effectively leverage AI’s potential in EC management, it is essential to consider securing informed consent approval for data utilization, upholding safety, and AI explainability, minimizing algorithmic biases and safeguarding confidentiality of information [63]. Nevertheless, as AI is becoming a clinical decision-support tool, developers must integrate additional human behavioral and reasoning factors into the decision-making process and address the lack of direct comparisons among AI models and established surgical risk calculators, treatment responses, and prognostic evaluation methods to bridge the gap between theory and clinical practice.

6. Conclusions

In conclusion, the emergence of AI tends to revolutionize the management of EC patients. This review outlines the prospects of AI applications in the treatment and prognosis of EC. Numerous models have demonstrated performance that equals or exceeds that of healthcare specialists in treatment decision-making. However, despite the promising capabilities of AI and the encouraging results, further multicenter clinical trials must be conducted to eliminate current barriers and facilitate in the future its integration into routine clinical practice. Given the complex nature of EC, the implementation of these AI-driven approaches is expected to allow their regular use as decision-support systems, ultimately aiming to improve patients’ quality of life and survival rates.

Author Contributions

Conceptualization, E.M. and D.S.; methodology, E.M. and D.T.; validation, D.P., P.S. and M.E.K.; formal analysis, D.P., I.R. and A.T.; investigation and data curation, E.M., D.T. and C.V.; writing—Original draft preparation, E.M., D.T., D.P. and P.S.; writing—Review and editing, E.M., D.S., C.V., M.E.K., I.R., A.T. and D.S.; supervision, I.R. and D.S.; project administration, D.S.; All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ECEsophageal cancer
AIArtificial Intelligence
MLMachine learning
DLDeep learning
AUCArea under the curve
ANNArtificial neural network
CTVClinical target volume
CNNConvolutional neural network
pCRPathological complete response
ESCCEsophageal squamous cell carcinoma
SUVStandardized uptake value
PETPositron emission tomography
PFSProgression-free survival
OSOverall survival
MRIMagnetic resonance imaging
DFSDisease-free survival

References

  1. Collaborators CBDOC. The global, regional, and national burden of oesophageal cancer and its attributable risk factors in 195 countries and territories, 1990–2017: A systematic analysis for the global burden of disease study 2017. Lancet Gastroenterol. Hepatol. 2020, 5, 582–597. [Google Scholar] [CrossRef] [PubMed]
  2. Liu, C.Q.; Ma, Y.L.; Qin, Q.; Wang, P.H.; Luo, Y.; Xu, P.F.; Cui, Y. Epidemiology of esophageal cancer in 2020 and projections to 2030 and 2040. Thorac. Cancer 2023, 14, 3–11. [Google Scholar] [CrossRef]
  3. Arnold, M.; Laversanne, M.; Brown, L.M.; Devesa, S.S.; Bray, F. Predicting the Future Burden of Esophageal Cancer by Histological Subtype: International Trends in Incidence up to 2030. Am. J. Gastroenterol. 2017, 112, 1247–1255. [Google Scholar] [CrossRef] [PubMed]
  4. American Cancer Society. Cancer Facts & Figures 2024; American Cancer Society: Atlanta, GA, USA, 2024. [Google Scholar]
  5. Global Burden of Disease Cancer Collaboration; Fitzmaurice, C.; Allen, C.; Barber, R.M.; Barregard, L.; Bhutta, Z.A.; Brenner, H.; Dicker, D.J.; Chimed-Orchir, O.; Dandona, R.; et al. Global, Regional, and National Cancer Incidence, Mortality, Years of Life Lost, Years Lived With Disability, and Disability-Adjusted Life-years for 32 Cancer Groups, 1990 to 2015: A Systematic Analysis for the Global Burden of Disease Study. JAMA Oncol. 2017, 3, 524–548, Erratum in JAMA Oncol. 2017, 3, 418. [Google Scholar] [CrossRef] [PubMed]
  6. Zhang, S.; Mu, W.; Dong, D.; Wei, J.; Fang, M.; Shao, L.; Zhou, Y.; He, B.; Zhang, S.; Liu, Z.; et al. The Applications of Artificial Intelligence in Digestive System Neoplasms: A Review. Health Data Sci. 2023, 3, 0005. [Google Scholar] [CrossRef]
  7. Moglia, A.; Georgiou, K.; Morelli, L.; Toutouzas, K.; Satava, R.M.; Cuschieri, A. Breaking down the silos of artificial intelligence in surgery: Glossary of terms. Surg. Endosc. 2022, 36, 7986–7997. [Google Scholar] [CrossRef]
  8. Garbarino, G.M.; Polici, M.; Caruso, D.; Laghi, A.; Mercantini, P.; Pilozzi, E.; van Berge Henegouwen, M.I.; Gisbertz, S.S.; van Grieken, N.C.T.; Berardi, E.; et al. Radiomics in Oesogastric Cancer: Staging and Prediction of Preoperative Treatment Response: A Narrative Review and the Results of Personal Experience. Cancers 2024, 16, 2664. [Google Scholar] [CrossRef]
  9. Zhang, Y.H.; Guo, L.J.; Yuan, X.L.; Hu, B. Artificial intelligence-assisted esophageal cancer management: Now and future. World J. Gastroenterol. 2020, 26, 5256–5271. [Google Scholar] [CrossRef]
  10. Tsai, T.J.; Mukundan, A.; Chi, Y.S.; Tsao, Y.M.; Wang, Y.K.; Chen, T.H.; Wu, I.C.; Huang, C.W.; Wang, H.C. Intelligent Identification of Early Esophageal Cancer by Band-Selective Hyperspectral Imaging. Cancers 2022, 14, 4292. [Google Scholar] [CrossRef]
  11. Fang, Y.J.; Huang, C.W.; Karmakar, R.; Mukundan, A.; Tsao, Y.M.; Yang, K.Y.; Wang, H.C. Assessment of Narrow-Band Imaging Algorithm for Video Capsule Endoscopy Based on Decorrelated Color Space for Esophageal Cancer: Part II, Detection and Classification of Esophageal Cancer. Cancers 2024, 16, 572. [Google Scholar] [CrossRef]
  12. Fang, Y.J.; Mukundan, A.; Tsao, Y.M.; Huang, C.W.; Wang, H.C. Identification of Early Esophageal Cancer by Semantic Segmentation. J. Pers. Med. 2022, 12, 1204. [Google Scholar] [CrossRef] [PubMed]
  13. Thavanesan, N.; Bodala, I.; Walters, Z.; Ramchurn, S.; Underwood, T.J.; Vigneswaran, G. Machine learning to predict curative multidisciplinary team treatment decisions in oesophageal cancer. Eur. J. Surg. Oncol. 2023, 49, 106986. [Google Scholar] [CrossRef] [PubMed]
  14. Zahedi, H.; Mehrshad, N.; Anvari, K. Intelligent modelling of oesophageal cancer treatment and its use to determine the dose of chemotherapy drug. J. Med. Eng. Technol. 2012, 36, 261–266. [Google Scholar] [CrossRef]
  15. Barragán-Montero, A.M.; Thomas, M.; Defraene, G.; Michiels, S.; Haustermans, K.; Lee, J.A.; Sterpin, E. Deep learning dose prediction for IMRT of esophageal cancer: The effect of data quality and quantity on model performance. Phys. Med. 2021, 83, 52–63. [Google Scholar] [CrossRef] [PubMed]
  16. Cao, R.; Pei, X.; Ge, N.; Zheng, C. Clinical Target Volume Auto-Segmentation of Esophageal Cancer for Radiotherapy After Radical Surgery Based on Deep Learning. Technol. Cancer Res. Treat. 2021, 20, 15330338211034284. [Google Scholar] [CrossRef]
  17. Jin, L.; Chen, Q.; Shi, A.; Wang, X.; Ren, R.; Zheng, A.; Song, P.; Zhang, Y.; Wang, N.; Wang, C.; et al. Deep Learning for Automated Contouring of Gross Tumor Volumes in Esophageal Cancer. Front. Oncol. 2022, 12, 892171. [Google Scholar] [CrossRef]
  18. Duan, Y.; Wang, J.; Wu, P.; Shao, Y.; Chen, H.; Wang, H.; Cao, H.; Gu, H.; Feng, A.; Huang, Y.; et al. AS-NeSt: A Novel 3D Deep Learning Model for Radiation Therapy Dose Distribution Prediction in Esophageal Cancer Treatment With Multiple Prescriptions. Int. J. Radiat. Oncol. Biol. Phys. 2024, 119, 978–989. [Google Scholar] [CrossRef]
  19. Zhu, C.; Mohan, R.; Lin, S.H.; Jun, G.; Yaseen, A.; Jiang, X.; Wang, Q.; Cao, W.; Hobbs, B.P. Identifying Individualized Risk Profiles for Radiotherapy-Induced Lymphopenia Among Patients With Esophageal Cancer Using Machine Learning. JCO Clin. Cancer Inform. 2021, 5, 1044–1053. [Google Scholar] [CrossRef]
  20. Sheng, L.; Zhuang, L.; Yang, J.; Zhang, D.; Chen, Y.; Zhang, J.; Wang, S.; Shan, G.; Du, X.; Bai, X. Radiation pneumonia predictive model for radiotherapy in esophageal carcinoma patients. BMC Cancer 2023, 23, 988. [Google Scholar] [CrossRef]
  21. Sato, K.; Fujita, T.; Matsuzaki, H.; Takeshita, N.; Fujiwara, H.; Mitsunaga, S.; Kojima, T.; Mori, K.; Daiko, H. Correction to: Real-time detection of the recurrent laryngeal nerve in thoracoscopic esophagectomy using artificial intelligence. Surg. Endosc. 2022, 36, 9483. [Google Scholar] [CrossRef]
  22. Maktabi, M.; Köhler, H.; Ivanova, M.; Jansen-Winkeln, B.; Takoh, J.; Niebisch, S.; Rabe, S.M.; Neumuth, T.; Gockel, I.; Chalopin, C. Tissue classification of oncologic esophageal resectates based on hyperspectral data. Int. J. Comput. Assist. Radiol. Surg. 2019, 14, 1651–1661. [Google Scholar] [CrossRef] [PubMed]
  23. Furube, T.; Takeuchi, M.; Kawakubo, H.; Maeda, Y.; Matsuda, S.; Fukuda, K.; Nakamura, R.; Kato, M.; Yahagi, N.; Kitagawa, Y. Automated artificial intelligence-based phase-recognition system for esophageal endoscopic submucosal dissection (with video). Gastrointest. Endosc. 2024, 99, 830–838. [Google Scholar] [CrossRef] [PubMed]
  24. Takeuchi, M.; Kawakubo, H.; Saito, K.; Maeda, Y.; Matsuda, S.; Fukuda, K.; Nakamura, R.; Kitagawa, Y. Automated Surgical-Phase Recognition for Robot-Assisted Minimally Invasive Esophagectomy Using Artificial Intelligence. Ann. Surg. Oncol. 2022, 29, 6847–6855. [Google Scholar] [CrossRef]
  25. Jung, J.O.; Pisula, J.I.; Bozek, K.; Popp, F.; Fuchs, H.F.; Schröder, W.; Bruns, C.J.; Schmidt, T. Prediction of postoperative complications after oesophagectomy using machine-learning methods. Br. J. Surg. 2023, 110, 1361–1366. [Google Scholar] [CrossRef]
  26. van Kooten, R.T.; Bahadoer, R.R.; Ter Buurkes de Vries, B.; Wouters, M.W.J.M.; Tollenaar, R.A.E.M.; Hartgrink, H.H.; Putter, H.; Dikken, J.L. Conventional regression analysis and machine learning in prediction of anastomotic leakage and pulmonary complications after esophagogastric cancer surgery. J. Surg. Oncol. 2022, 126, 490–501. [Google Scholar] [CrossRef]
  27. Klontzas, M.E.; Ri, M.; Koltsakis, E.; Stenqvist, E.; Kalarakis, G.; Boström, E.; Kechagias, A.; Schizas, D.; Rouvelas, I.; Tzortzakakis, A. Prediction of Anastomotic Leakage in Esophageal Cancer Surgery: A Multimodal Machine Learning Model Integrating Imaging and Clinical Data. Acad. Radiol. 2024, 31, 4878–4885. [Google Scholar] [CrossRef]
  28. Bolourani, S.; Tayebi, M.A.; Diao, L.; Wang, P.; Patel, V.; Manetta, F.; Lee, P.C. Using machine learning to predict early readmission following esophagectomy. J. ThoracCardiovasc. Surg. 2021, 161, 1926–1939.e8, Erratum in J. Thorac. Cardiovasc. Surg. 2020. [Google Scholar] [CrossRef]
  29. Hu, Y.; Xie, C.; Yang, H.; Ho, J.W.K.; Wen, J.; Han, L.; Chiu, K.W.H.; Fu, J.; Vardhanabhuti, V. Assessment of Intratumoral and Peritumoral Computed Tomography Radiomics for Predicting Pathological Complete Response to Neoadjuvant Chemoradiation in Patients With Esophageal Squamous Cell Carcinoma. JAMA Netw. Open. 2020, 3, e2015927. [Google Scholar] [CrossRef]
  30. Hu, Y.; Xie, C.; Yang, H.; Ho, J.W.K.; Wen, J.; Han, L.; Lam, K.O.; Wong, I.Y.H.; Law, S.Y.K.; Chiu, K.W.H.; et al. Computed tomography-based deep-learning prediction of neoadjuvant chemoradiotherapy treatment response in esophageal squamous cell carcinoma. Radiother. Oncol. 2021, 154, 6–13. [Google Scholar] [CrossRef]
  31. Wang, J.; Zhu, X.; Zeng, J.; Liu, C.; Shen, W.; Sun, X.; Lin, Q.; Fang, J.; Chen, Q.; Ji, Y. Using clinical and radiomic feature-based machine learning models to predict pathological complete response in patients with esophageal squamous cell carcinoma receiving neoadjuvant chemoradiation. Eur. Radiol. 2023, 33, 8554–8563. [Google Scholar] [CrossRef]
  32. Li, X.; Gao, H.; Zhu, J.; Huang, Y.; Zhu, Y.; Huang, W.; Li, Z.; Sun, K.; Liu, Z.; Tian, J.; et al. 3D Deep Learning Model for the Pretreatment Evaluation of Treatment Response in Esophageal Carcinoma: A Prospective Study (ChiCTR2000039279). Int. J. Radiat. Oncol. Biol. Phys. 2021, 111, 926–935. [Google Scholar] [CrossRef] [PubMed]
  33. Zhu, Y.; Yao, W.; Xu, B.C.; Lei, Y.Y.; Guo, Q.K.; Liu, L.Z.; Li, H.J.; Xu, M.; Yan, J.; Chang, D.D.; et al. Predicting response to immunotherapy plus chemotherapy in patients with esophageal squamous cell carcinoma using non-invasive Radiomic biomarkers. BMC Cancer 2021, 21, 1167. [Google Scholar] [CrossRef] [PubMed]
  34. Xie, C.-Y.; Pang, C.-L.; Chan, B.; Wong, E.Y.-Y.; Dou, Q.; Vardhanabhuti, V. Machine Learning and Radiomics Applications in Esophageal Cancers Using Non-Invasive Imaging Methods—A Critical Review of Literature. Cancers 2021, 13, 2469. [Google Scholar] [CrossRef]
  35. Xie, Y.; Liu, Q.; Ji, C.; Sun, Y.; Zhang, S.; Hua, M.; Liu, X.; Pan, S.; Hu, W.; Ma, Y.; et al. An artificial neural network-based radiomics model for predicting the radiotherapy response of advanced esophageal squamous cell carcinoma patients: A multicenter study. Sci. Rep. 2023, 13, 8673. [Google Scholar] [CrossRef]
  36. Jin, X.; Zheng, X.; Chen, D.; Jin, J.; Zhu, G.; Deng, X.; Han, C.; Gong, C.; Zhou, Y.; Liu, C.; et al. Prediction of response after chemoradiation for esophageal cancer using a combination of dosimetry and CT radiomics. Eur. Radiol. 2019, 29, 6080–6088. [Google Scholar] [CrossRef]
  37. Yap, W.K.; Hsiao, I.T.; Yap, W.L.; Tsai, T.Y.; Lu, Y.A.; Yang, C.K.; Peng, M.T.; Su, E.L.; Cheng, S.C. A Radiotherapy Dose Map-Guided Deep Learning Method for Predicting Pathological Complete Response in Esophageal Cancer Patients after Neoadjuvant Chemoradiotherapy Followed by Surgery. Biomedicines 2023, 11, 3072. [Google Scholar] [CrossRef]
  38. Schollaert, P.; Crott, R.; Bertrand, C.; D’Hondt, L.; Borght, T.V.; Krug, B. A systematic review of the predictive value of (18)FDG-PET in esophageal and esophagogastric junction cancer after neoadjuvant chemoradiation on the survival outcome stratification. J. Gastrointest. Surg. 2014, 18, 894–905. [Google Scholar] [CrossRef]
  39. Zhu, W.; Xing, L.; Yue, J.; Sun, X.; Sun, X.; Zhao, H.; Yu, J. Prognostic significance of SUV on PET/CT in patients with localised oesophagogastric junction cancer receiving neoadjuvant chemotherapy/chemoradiation: A systematic review and meta-analysis. Br. J. Radiol. 2012, 85, e694–e701. [Google Scholar] [CrossRef]
  40. Ypsilantis, P.P.; Siddique, M.; Sohn, H.M.; Davies, A.; Cook, G.; Goh, V.; Montana, G. Predicting Response to Neoadjuvant Chemotherapy with PET Imaging Using Convolutional Neural Networks. PLoS ONE 2015, 10, e0137036. [Google Scholar] [CrossRef]
  41. Rishi, A.; Zhang, G.G.; Yuan, Z.; Sim, A.J.; Song, E.Y.; Moros, E.G.; Tomaszewski, M.R.; Latifi, K.; Pimiento, J.M.; Fontaine, J.P.; et al. Pretreatment CT and 18 F-FDG PET-based radiomic model predicting pathological complete response and loco-regional control following neoadjuvant chemoradiation in oesophageal cancer. J. Med. Imaging Radiat. Oncol. 2021, 65, 102–111. [Google Scholar] [CrossRef]
  42. Qi, W.X.; Li, S.; Xiao, J.; Li, H.; Chen, J.; Zhao, S. A machine learning approach using 18F-FDG PET and enhanced CT scan-based radiomics combined with clinical model to predict pathological complete response in ESCC patients after neoadjuvant chemoradiotherapy and anti-PD-1 inhibitors. Front. Immunol. 2024, 15, 1351750. [Google Scholar] [CrossRef] [PubMed]
  43. Kawahara, D.; Murakami, Y.; Tani, S.; Nagata, Y. A prediction model for pathological findings after neoadjuvant chemoradiotherapy for resectable locally advanced esophageal squamous cell carcinoma based on endoscopic images using deep learning. Br. J. Radiol. 2022, 95, 20210934. [Google Scholar] [CrossRef] [PubMed]
  44. Matsuda, S.; Irino, T.; Okamura, A.; Mayanagi, S.; Booka, E.; Takeuchi, M.; Kawakubo, H.; Takeuchi, H.; Watanabe, M.; Kitagawa, Y. Endoscopic Evaluation of Pathological Complete Response Using Deep Neural Network in Esophageal Cancer Patients Who Received Neoadjuvant Chemotherapy-Multicenter Retrospective Study from Four Japanese Esophageal Centers. Ann. Surg. Oncol. 2023, 30, 7472–7480. [Google Scholar] [CrossRef]
  45. Matsuda, S.; Irino, T.; Kawakubo, H.; Takeuchi, M.; Nishimura, E.; Hisaoka, K.; Sano, J.; Kobayashi, R.; Fukuda, K.; Nakamura, R.; et al. Evaluation of Endoscopic Response Using Deep Neural Network in Esophageal Cancer Patients Who Received Neoadjuvant Chemotherapy. Ann. Surg. Oncol. 2023, 30, 3733–3742. [Google Scholar] [CrossRef]
  46. Brierley, J.D.; Gospodarowicz, M.K.; Wittekind, C. TNM Classification of Malignant Tumours; John Wiley & Sons: Hoboken, NJ, USA, 2017. [Google Scholar]
  47. Sato, F.; Shimada, Y.; Selaru, F.M.; Shibata, D.; Maeda, M.; Watanabe, G.; Mori, Y.; Stass, S.A.; Imamura, M.; Meltzer, S.J. Prediction of survival in patients with esophageal carcinoma using artificial neural networks. Cancer 2005, 103, 1596–1605. [Google Scholar] [CrossRef]
  48. Mofidi, R.; Deans, C.; Duff, M.D.; de Beaux, A.C.; Paterson Brown, S. Prediction of survival from carcinoma of oesophagus and oesophago-gastric junction following surgical resection using an artificial neural network. Eur. J. Surg. Oncol. 2006, 32, 533–539. [Google Scholar] [CrossRef]
  49. Larue, R.T.H.M.; Klaassen, R.; Jochems, A.; Leijenaar, R.T.H.; Hulshof, M.C.C.M.; van Berge Henegouwen, M.I.; Schreurs, W.M.J.; Sosef, M.N.; van Elmpt, W.; van Laarhoven, H.W.M.; et al. Pre-treatment CT radiomics to predict 3-year overall survival following chemoradiotherapy of esophageal cancer. Acta Oncol. 2018, 57, 1475–1481. [Google Scholar] [CrossRef]
  50. Luo, H.S.; Chen, Y.Y.; Huang, W.Z.; Wu, S.X.; Huang, S.F.; Xu, H.Y.; Xue, R.L.; Du, Z.S.; Li, X.Y.; Lin, L.X.; et al. Development and validation of a radiomics-based model to predict local progression-free survival after chemo-radiotherapy in patients with esophageal squamous cell cancer. Radiat. Oncol. 2021, 16, 201. [Google Scholar] [CrossRef]
  51. Cui, Y.; Li, Z.; Xiang, M.; Han, D.; Yin, Y.; Ma, C. Machine learning models predict overall survival and progression free survival of non-surgical esophageal cancer patients with chemoradiotherapy based on CT image radiomics signatures. Radiat. Oncol. 2022, 17, 212. [Google Scholar] [CrossRef]
  52. Wang, J.; Zeng, J.; Li, H.; Yu, X. A Deep Learning Radiomics Analysis for Survival Prediction in Esophageal Cancer. J. Healthc. Eng. 2022, 2022, 4034404. [Google Scholar] [CrossRef]
  53. Zhang, H.; Jiang, X.; Yu, Q.; Yu, H.; Xu, C. A novel staging system based on deep learning for overall survival in patients with esophageal squamous cell carcinoma. J. Cancer Res. Clin. Oncol. 2023, 149, 8935–8944. [Google Scholar] [CrossRef] [PubMed]
  54. Zhang, K.; Ye, B.; Wu, L.; Ni, S.; Li, Y.; Wang, Q.; Zhang, P.; Wang, D. Machine learningbased prediction of survival prognosis in esophageal squamous cell carcinoma. Sci. Rep. 2023, 13, 13532. [Google Scholar] [CrossRef]
  55. Chu, F.; Liu, Y.; Liu, Q.; Li, W.; Jia, Z.; Wang, C.; Wang, Z.; Lu, S.; Li, P.; Zhang, Y.; et al. Development and validation of MRI-based radiomics signatures models for prediction of disease-free survival and overall survival in patients with esophageal squamous cell carcinoma. Eur. Radiol. 2022, 32, 5930–5942. [Google Scholar] [CrossRef] [PubMed]
  56. Paver, E.C.; Cooper, W.A.; Colebatch, A.J.; Ferguson, P.M.; Hill, S.K.; Lum, T.; Shin, J.S.; O’Toole, S.; Anderson, L.; Scolyer, R.A.; et al. Programmed death ligand-1 (PD-L1) as a predictive marker for immunotherapy in solid tumours: A guide to immunohistochemistry implementation and interpretation. Pathology 2021, 53, 141–156. [Google Scholar] [CrossRef] [PubMed]
  57. Yang, H.; Wang, K.; Wang, T.; Li, M.; Li, B.; Li, S.; Yuan, L. The Combination Options and Predictive Biomarkers of PD-1/PD-L1 Inhibitors in Esophageal Cancer. Front. Oncol. 2020, 10, 300. [Google Scholar] [CrossRef]
  58. Chen, H.; Luo, J.; Guo, J. Construction and Validation of a 7-Immune Gene Model for Prognostic Assessment of Esophageal Carcinoma. Med. Sci. Monit. 2020, 26, e927392. [Google Scholar] [CrossRef]
  59. Li, B.; Qin, W.; Yang, L.; Li, H.; Jiang, C.; Yao, Y.; Cheng, S.; Zou, B.; Fan, B.; Dong, T.; et al. From pixels to patient care: Deep learning-enabled pathomics signature offers precise outcome predictions for immunotherapy in esophageal squamous cell cancer. J. Transl. Med. 2024, 22, 195. [Google Scholar] [CrossRef]
  60. Abas Mohamed, Y.; Ee Khoo, B.; Shahrimie Mohd Asaari, M.; Ezane Aziz, M.; Rahiman Ghazali, F. Decoding the black box: Explainable AI (XAI) for cancer diagnosis, prognosis, and treatment planning-A state-of-the art systematic review. Int. J. Med. Inform. 2025, 193, 105689. [Google Scholar] [CrossRef]
  61. Pearl, J. Causal inference in statistics: An overview. Statist. Surv. 2009, 3, 96–146. [Google Scholar] [CrossRef]
  62. Tu, J.X.; Lin, X.T.; Ye, H.Q.; Yang, S.L.; Deng, L.F.; Zhu, R.L.; Wu, L.; Zhang, X.Q. Global research trends of artificial intelligence applied in esophageal carcinoma: A bibliometric analysis (2000–2022) via CiteSpace and VOSviewer. Front. Oncol. 2022, 12, 972357. [Google Scholar] [CrossRef]
  63. Gerke, S.; Minssen, T.; Cohen, G.; Bohr, A.; Memarzadeh, K. Chapter 12-Ethical and legal challenges of artificial intelligence-driven healthcare. In Artificial Intelligence in Healthcare; Academic Press: Cambridge, MA, USA, 2020; pp. 295–336. [Google Scholar]
Figure 1. PRISMA flowchart of study selection.
Figure 1. PRISMA flowchart of study selection.
Jcm 14 01845 g001
Table 1. Applications for surgical treatment.
Table 1. Applications for surgical treatment.
AuthorYearSample SizeStudy DesignObjectiveResults
Sato K et al. [21]20223000 images from 20 thoracoscopic videos and 40 images from 8 thoracoscopic videosRetrospectiveDevelopment of a DL-based AI image model for identifying the location of the recurrent laryngeal nerve and assess if it reduces the incidence of recurrent laryngeal nerve paralysisThe model had an average dice coefficient of 0.58. This was not significantly different from the group of specialized esophageal surgeons (p = 0.26) but was significantly superior to that of the group of certified general gastrointestinal surgeons (p = 0.019).
Maktabi M et al. [22]201911 patientsRetrospectiveEvaluation of intraoperative hyperspectral imaging for identifying the tumor margins intraoperativelyThe support vector machines (SVM) algorithm has the best performance, with 63% sensitivity and 69% specificity for identifying cancerous tissue.
Furube T et al. [23]202494 videosRetrospectiveDevelopment of an AI-based automated ESD phase recognition system based on video imagesThe model displayed an overall accuracy of 90%, an average precision of 91%, and a recall of 90%.
Takeuchi M et al. [24]202231 patientsRetrospectiveDevelopment of an AI-based automated surgical-phase recognition system for RAMIEThe model achieved an overall accuracy of 84%.
Jung JO et al. [25]2023864 patientsRetrospectiveDevelopment of ML-based methods for predicting Clavien–Dindo grade IIIa or greater complications following esophagectomyThe neural network achieved an overall accuracy of 68.8% and accuracies of 69.2% for medical complications, and 66.7% for surgical complications. The AUC of neural network was 0.672 for Clavien–Dindo grade IIIa or higher, 0.695 for medical complications, and 0.653 for surgical complications.
van Kooten RT et al. [26]20224288 patientsRetrospectiveEvaluation of the added value of ML methods for predicting postoperative complications following esophagectomy and development of a predictive model for anastomotic leakage and cardiopulmonary complicationsThe optimal model GLM displayed an AUC of 0.619 for anastomotic leakage and 0.644 for pulmonary complications.
Klontzas M et al. [27]2024471 patientsRetrospectiveDevelopment of an ML model combining CT and clinical variables for predicting anastomotic leakage following esophagectomyThe XGboost model achieved an AUC of 0.792, 77.46% specificity, 69.22% sensitivity, 48.39% positive predictive value, and 87.3% negative predictive value.
Bolourani S et al. [28]20212037 patientsRetrospectiveDevelopment of ML-based prediction model for early readmissions within 30 days following esophagectomyThe ML model for clinical decision achieved a sensitivity of 71.7% and specificity of 51.4%, while the ML model for quality review achieved an accuracy of 84.8%, specificity of 98.7%, and sensitivity of 23%
DL: deep learning, AI: artificial intelligence, RAMIE: robot-assisted minimally invasive esophagectomy, ESD: endoscopic submucosal dissection, ML: machine learning.
Table 2. Treatment response prediction.
Table 2. Treatment response prediction.
AuthorYearSample SizeStudy DesignTreatmentObjectiveResults
Hu Y et al. [29]2020231RetrospectivenCRT and surgeryDevelopment and validation of CT-based models to identify pCR using intratumoral and peritumoral features in ESCC patientsThe model combining 7 intratumoral and 6 peritumoral features displayed superior discriminative performance, with an AUC of 0.852 (95% CI, 0.753–0.951), accuracy of 84.3%, sensitivity of 90.3% and specificity of 79.5% in the test set.
Hu Y et al. [30]2021231RetrospectivenCRT and surgeryEvaluation and validation of a CT-based model using DL features for predicting pCR to nCRT in ESCC patientsThe model integrating features from ResNet50 achieved an AUC of 0.805 (95% CI, 0.696–0.913) and an accuracy of 77.1%, compared with 0.725 (95% CI, 0.605- 0.846) and 67.1% for the radiomics model in the test set.
Wang J et al. [31]2023112RetrospectivenCRT and surgeryDevelopment and validation of a radiomics feature-based ML model for predicting pCR to nCRT in ESCC patientsFor pCR prediction models, the model combining rad-score and clinical features achieved an AUC of 0.891 (95% CI, 0.823–0.950), compared with 0.817 (95% CI, 0.732–0.896) for the radiomics model in the test set.
Li X et al. [32]2021306ProspectivenCRTDevelopment and validation of a 3-dimensional DL radiomics model for predicting response to nCRT in ESCC patientsThe model achieved an AUC of 0.897 (95% CI, 0.840–0.959) in the training cohort and 0.833 (95% CI, 0.654–1.000) in the validation cohort
Zhu Y et al. [33]202164RetrospectiveChemotherapy and PD-1 inhibitorDevelopment and validation of a radiomics model for evaluating treatment response to chemotherapy and immunotherapy in ESCC patientsThe 2-dimensional corrected model achieved an AUC of 0.843 (95% CI, 0.736–0.950) in the training cohort and 0.914 (95% CI, 0.775–1.000) in the validation cohort, compared to the 3-dimensional corrected model with an AUC 0.658 (95% CI, 0.502–0.813) and 0.670 (95% CI, 0.511–0.849) respectively
Xie Y et al. [35]2023248RetrospectivenCRTEstablishment and validation of an ANN-based radiomics model for predicting radiotherapy response in patients with stage III ESCCThe pretrained network ResNet50 displayed superior performance, with AUCs of 0.876, 0.802, and 0.732 in the training, internal validation, and external validation cohorts respectively
Jin X et al. [36]201994RetrospectiveCRTEvaluation of a model combining CT radiomics and dosimetric features for predicting response to CRT in esophageal cancer patientsThe XGBoost plus principal component analysis model combining radiomics features with dosimetric parameters achieved a prediction accuracy of 70.8% and an AUC of 0.689 compared to the radiomics model with an accuracy of 62.5% and AUC of 0.412
Yap WK et al. [37]202380RetrospectivenCRT and surgeryDevelopment of radiotherapy dose map-guided DL model for predicting ypCR to nCRT in ESCC patientsThe HRNetV2p model with dose contextual representations achieved superior performance, with an AUC of 0.928 (95% CI, 0.884–0.972)
Ypsilantis PP et al. [40]2015107ProspectivenCTEvaluation of the predictive ability of an ML algorithm (3S-CNN) for representing esophageal cancer’s metabolic profile using 18F-FDG PET imagingThe 3S-CNN achieved superior performance with an average 80.7% sensitivity, 81.6% specificity, and 73.4% accuracy in predicting non-responders
Rishi A et al. [41]202068RetrospectivenCRT and surgeryDevelopment and validation of a combined CT and PET/CT radiomics model for predicting pCR to nCRT in patients with advanced esophageal cancerThe combined PET/CT radiomics model achieved improved predictive performance with an AUC of 0.87 compared to CT- or PET-only models with AUCs of 0.73 and 0.66 respectively
Qi WX et al. [42]2024126ProspectivenCRT, anti-PD1 inhibitors and surgeryEvaluation of an integrated multimodal radiomics with ML model for predicting pCR to nCRT and anti-PD1 in ESCC patientsSupport vector machine ML trained on CT, PET, and clinical features achieved superior performance compared to CT and PET-only models with an AUC of 0.997 in the training set and 0.852 in the testing set
Kawahara D et al. [43]202298RetrospectivenCRT and surgeryProposal of a DL-based model using endoscopic images for predicting pCR to nCRT in ESCC patientsThe model achieved 64% accuracy, 80.3% sensitivity, 36.3% specificity, and 0.58 AUC in the testing set compared to 80.6%, 79.8%, 80.6%, and 0.83 with wavelet filter, respectively
Matsuda S et al. [44]2023123RetrospectivenCRT and surgeryDevelopment of a deep neural network using endoscopic images for identifying pCR to nCT preoperatively in ESCC patientsIn 20 models, the median sensitivity, specificity, positive predictive value, negative predictive value, and accuracy for endoscopic response evaluation were 60%, 81%, 77%, 67%, and 70% compared to 43%, 90%, 85%, 65%, and 66% for endoscopists respectively
Matsuda S et al. [45]2023193RetrospectivenCRT and surgeryDevelopment of a deep neural network for evaluating endoscopic response to nCT in ESCC patient development of a deep neural network for evaluating endoscopic response to nCT in ESCC patientsIn 10 models the median sensitivity, specificity, positive predictive value, and negative predictive value were 60%, 100%, 100%, and 71% compared to 80%, 80%, 81%, and 81% for endoscopists respectively
nCRT: neoadjuvant chemoradiotherapy, nCT: neoadjuvant chemotherapy, DL: deep learning, ML: machine learning, ANN: artificial neural network, pCR: pathological complete response.
Table 3. Prognosis prediction.
Table 3. Prognosis prediction.
AuthorYearSample SizeStudy DesignObjectiveResults
Sato F et al. [47]2005418RetrospectiveDevelopment of an ANN model for predicting 1- and 5-year survival for esophageal cancer patientsThe optimal ANN models for predicting 1- and 5-year survival consisted of 65 variables (AUR = 0.883) and 60 variables (AUR = 0.884), respectively, and outperformed the TNM solely based model (p < 0.0001).
Modifi R et al. [48]2006216RetrospectiveEvaluation of the performance of an ANN model for predicting 1- and 3-year disease-free survival for esophageal and EGJ carcinoma patientsThe accuracy, sensitivity, and specificity of the ANN for predicting survival at 1 year was 88%, 92.3%, and 84.5%, and at 3 years was 91.5%, 94.6%, and 88%, and was statistically significantly superior to TNM classification system.
Larue RTHM et al. [49]2018239RetrospectiveTraining and validation of two models using radiomics or clinical variables for predicting 3-year overall survival for esophageal cancer patients following nCRT.The radiomics-based RF model achieved superior prognostic performance for 3-year overall survival with an AUC of 0.69 and 0.61 in the training and validation dataset compared to 0.63 and 0.62 for the clinical feature-based model, respectively.
Luo HS et al. [50]2021221RetrospectiveDevelopment and validation of a radiomics and clinical feature-based model for predicting local progression-free survival for ESCC patients following concurrent CRTThe C-index of the prediction model was 0.745 (95% CI, 0.770–0.790) in the training cohort and 0.723 (95% CI, 0.654–0.791) in the validation cohort.
Cui Y et al. [51]2022204RCTDevelopment of ML models for predicting progression-free survival and overall survival for ESCC patientsCombined models using clinical and radiomics features achieved over 70% accuracy, an AUC of 0.833, and a C-index of 0.79 for predicting PFS and 0.768 and 0.71 for predicting OS in the test cohort.
Wang J et al. [52]2022154RetrospectiveDevelopment and validation of a DL radiomics-based model for predicting 3-year overall survival for esophageal cancer patientsThe deep-learning radiomics model achieved an AUC of 0.984 and a C-index of 0.76 in the training cohort and 0.942 and 0.784 in the validation cohort for predicting 3-year OS, respectively.
Zhang H et al. [53]20236020RetrospectiveDevelopment and validation of a DL model for predicting overall survival and a novel staging system for ESCC patientsThe deep-learning model achieved superior performance than the traditional nomogram for predicting OS with C-index 0.732 [95% CI 0.714–0.750] and 0.671 [95% CI 0.647–0.695], respectively. The model in the test cohort displayed an AUC of 0.805 at 3-year OS and 0.825 at 5-year OS
Zhang K et al. [54]20232441RetrospectiveDevelopment of an ML survival prediction model using 6 different ML g approaches for ESCC patientsThe ML-extended CoxPH model achieved superior discriminative performance, with AUCs for 1-, 3-, and 5-year overall survival of 0.760, 0.735, and 0.746 in the training cohort and 0.725, 0.720, and 0.752 in the validation cohort, respectively.
Chu F et al. [55]2022434RCTDevelopment and validation of an MRI-based radiomics model for predicting disease-free survival and overall survival in ESCC patientsThe combined model based on 7 radiomics features and clinical features achieved the best performance, with a C-index of 0.730 in the training cohort and 0.712 in the validation cohort for OS and 0.714 and 0.729 for DFS respectively.
Chen H et al. [58]2020159RetrospectiveDevelopment and validation of a 7 immune gene model for predicting prognosis of esophageal cancerThe model achieved an AUC of 0.825 for 1 year and 0.596 for 3 years in the test set.
Li B et al. [59]2024163RetrospectiveAssessment of a DL-based pathomics signature for predicting clinical benefits of immunotherapy in ESCC patientsThe ESCC-PS achieved an accuracy of 84.5% in the validation cohort and the combined model based on ESCC-PS and PD-L1 expression displayed an AUC of 0.904 and a C-index of 0.814 for a 6-month PFS.
RCT: randomized controlled trial, nCRT: neoadjuvant chemoradiotherapy, DL: deep learning, ML: machine learning, ANN: artificial neural network.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mela, E.; Tsapralis, D.; Papaconstantinou, D.; Sakarellos, P.; Vergadis, C.; Klontzas, M.E.; Rouvelas, I.; Tzortzakakis, A.; Schizas, D. Current Role of Artificial Intelligence in the Management of Esophageal Cancer. J. Clin. Med. 2025, 14, 1845. https://doi.org/10.3390/jcm14061845

AMA Style

Mela E, Tsapralis D, Papaconstantinou D, Sakarellos P, Vergadis C, Klontzas ME, Rouvelas I, Tzortzakakis A, Schizas D. Current Role of Artificial Intelligence in the Management of Esophageal Cancer. Journal of Clinical Medicine. 2025; 14(6):1845. https://doi.org/10.3390/jcm14061845

Chicago/Turabian Style

Mela, Evgenia, Dimitrios Tsapralis, Dimitrios Papaconstantinou, Panagiotis Sakarellos, Chrysovalantis Vergadis, Michail E. Klontzas, Ioannis Rouvelas, Antonios Tzortzakakis, and Dimitrios Schizas. 2025. "Current Role of Artificial Intelligence in the Management of Esophageal Cancer" Journal of Clinical Medicine 14, no. 6: 1845. https://doi.org/10.3390/jcm14061845

APA Style

Mela, E., Tsapralis, D., Papaconstantinou, D., Sakarellos, P., Vergadis, C., Klontzas, M. E., Rouvelas, I., Tzortzakakis, A., & Schizas, D. (2025). Current Role of Artificial Intelligence in the Management of Esophageal Cancer. Journal of Clinical Medicine, 14(6), 1845. https://doi.org/10.3390/jcm14061845

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop