Artificial-Intelligence-Based Radiologic, Histopathologic, and Molecular Models for the Diagnosis and Classification of Malignant Salivary Gland Tumors: A Systematic Review and Functional Meta-Synthesis
Abstract
1. Introduction
2. Materials and Methods
2.1. Eligibility Criteria
2.1.1. Inclusion Criteria
2.1.2. Exclusion Criteria
2.2. Information Sources and Search Strategy
2.3. Study Selection
2.4. Data Extraction
2.5. Risk of Bias and Reporting Quality Assessment
2.6. Assessment of Reporting Bias
2.7. Certainty of Evidence
2.8. Data Synthesis and Functional Meta-Synthesis
3. Results
3.1. Study Selection Process
3.2. Descriptive Characteristics of Included Studies
3.3. External Validation Performance (Primary Outcome)
3.4. Internal Validation or Cross-Validation Performance (Secondary Outcomes)
3.5. Functional Meta-Synthesis
3.5.1. Domain 1—Malignancy Discrimination (Benign vs. Malignant)
3.5.2. Domain 2—Histopathologic Tumor Subtype Classification
3.5.3. Domain 3—Molecular and Epigenetic Taxonomy Refinement
3.5.4. Cross-Domain Observations
3.6. Additional Model Characteristics: Predictive Values, Calibration, Clinical Utility, and Interpretability
3.6.1. Positive and Negative Predictive Values
3.6.2. Calibration Metrics
3.6.3. Clinical Utility and Decision Curve Analysis
3.6.4. Interpretability Methods
3.7. Risk of Bias Assessment
3.8. Reporting Transparency (TRIPOD/TRIPOD-AI)
3.9. Reporting Bias Assessment
3.10. Certainty of Evidence (GRADE) for Externally Validated CT-Based Benign–Malignant Discrimination
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Lépine, C. Salivary gland tumors: 2022 WHO blue book and beyond. Ann. Pathol. 2024, 44, 175–182. [Google Scholar] [CrossRef]
- Jurmeister, P.; Leitheiser, M.; Arnold, A.; Capilla, E.P.; Mochmann, L.H.; Zhdanovic, Y.; Schleich, K.; Jung, N.; Chimal, E.C.; Jung, A.; et al. DNA Methylation Profiling of Salivary Gland Tumors Supports and Expands Conventional Classification. Mod. Pathol. 2024, 37, 100625. [Google Scholar] [CrossRef]
- Schulz, T.; Becker, C.; Kayser, G. Comparison of four convolutional neural networks for histopathological diagnosis of salivary gland carcinomas. HNO 2023, 71, 170–176. [Google Scholar] [CrossRef]
- Skálová, A.; Vanecek, T.; Sima, R.; Laco, J.; Weinreb, I.; Perez-Ordonez, B.; Starek, I.; Geierova, M.; Simpson, R.H.; Passador-Santos, F.; et al. Mammary analogue secretory carcinoma of salivary glands, containing the ETV6-NTRK3 fusion gene: A hitherto undescribed salivary gland tumor entity. Am. J. Surg. Pathol. 2010, 34, 599–608. [Google Scholar] [CrossRef]
- Mitani, Y.; Li, J.; Rao, P.H.; Zhao, Y.J.; Bell, D.; Lippman, S.M.; Weber, R.S.; Caulin, C.; El-Naggar, A.K. Comprehensive analysis of the MYB-NFIB gene fusion in salivary adenoid cystic carcinoma: Incidence, variability, and clinicopathologic significance. Clin. Cancer Res. 2010, 16, 4722–4731. [Google Scholar] [CrossRef]
- Capper, D.; Jones, D.T.W.; Sill, M.; Hovestadt, V.; Schrimpf, D.; Sturm, D.; Koelsche, C.; Sahm, F.; Chavez, L.; Reuss, D.E.; et al. DNA methylation-based classification of central nervous system tumours. Nature 2018, 555, 469–474. [Google Scholar] [CrossRef] [PubMed]
- Koelsche, C.; Schrimpf, D.; Stichel, D.; Sill, M.; Sahm, F.; Reuss, D.E.; Blattner, M.; Worst, B.; Heilig, C.E.; Beck, K.; et al. Sarcoma classification by DNA methylation profiling. Nat. Commun. 2021, 12, 498. [Google Scholar] [CrossRef] [PubMed]
- Steiner, D.F.; MacDonald, R.; Liu, Y.; Truszkowski, P.; Hipp, J.D.; Gammage, C.; Thng, F.; Peng, L.; Stumpe, M.C. Impact of Deep Learning Assistance on the Histopathologic Review of Lymph Nodes for Metastatic Breast Cancer. Am. J. Surg. Pathol. 2018, 42, 1636–1646. [Google Scholar] [CrossRef]
- Campanella, G.; Hanna, M.G.; Geneslaw, L.; Miraflor, A.; Werneck Krauss Silva, V.; Busam, K.J.; Brogi, E.; Reuter, V.E.; Klimstra, D.S.; Fuchs, T.J. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat. Med. 2019, 25, 1301–1309. [Google Scholar] [CrossRef]
- Lambin, P.; Rios-Velazquez, E.; Leijenaar, R.; Carvalho, S.; van Stiphout, R.G.P.M.; Granton, P.; Zegers, C.M.L.; Gillies, R.; Boellard, R.; Dekker, A.; et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 2012, 48, 441–446. [Google Scholar] [CrossRef] [PubMed]
- Fruehwald-Pallamar, J.; Czerny, C.; Holzer-Fruehwald, L.; Nemec, S.F.; Mueller-Mang, C.; Weber, M.; Mayerhoefer, M.E. Texture-based and diffusion-weighted discrimination of parotid gland lesions on MR images at 3.0 Tesla. NMR Biomed. 2013, 26, 1372–1379. [Google Scholar] [CrossRef] [PubMed]
- Rao, Y.; Ma, Y.; Wang, J.; Xiao, W.; Wu, J.; Shi, L.; Guo, L.; Fan, L. Performance of radiomics in the differential diagnosis of parotid tumors: A systematic review. Front. Oncol. 2024, 14, 1383323. [Google Scholar] [CrossRef]
- Kazerooni, A.F.; Nabil, M.; Alviri, M.; Koopaei, S.; Salahshour, F.; Assili, S.; Rad, H.S.; Aghaghazvini, L. Radiomic Analysis of Multi-parametric MR Images (MRI) for Classification of Parotid Tumors. J. Biomed. Phys. Eng. 2022, 12, 599–610. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
- Whiting, P.F.; Rutjes, A.W.S.; Westwood, M.E.; Mallett, S.; Deeks, J.J.; Reitsma, J.B.; Leeflang, M.M.G.; Sterne, J.A.C.; Bossuyt, P.M.M.; QUADAS-2 Group. QUADAS-2: A revised tool for the quality assessment of diagnostic accuracy studies. Ann. Intern. Med. 2011, 155, 529–536. [Google Scholar] [CrossRef]
- Sounderajah, V.; Ashrafian, H.; Rose, S.; Shah, N.H.; Ghassemi, M.; Golub, R.; Kahn, C.E.; Esteva, A.; Karthikesalingam, A.; Mateen, B.; et al. A quality assessment tool for artificial intelligence-centered diagnostic test accuracy studies: QUADAS-AI. Nat. Med. 2021, 27, 1663–1665. [Google Scholar] [CrossRef] [PubMed]
- Wolff, R.F.; Moons, K.G.M.; Riley, R.D.; Whiting, P.F.; Westwood, M.; Collins, G.S.; Reitsma, J.B.; Kleijnen, J.; Mallett, S. PROBAST: A tool to assess the risk of bias and applicability of prediction model studies. Ann. Intern. Med. 2019, 170, 51–58. [Google Scholar] [CrossRef] [PubMed]
- Collins, G.S.; Reitsma, J.B.; Altman, D.G.; Moons, K.G.M. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. Ann. Intern. Med. 2015, 162, 55–63. [Google Scholar] [CrossRef]
- Deeks, J.J.; Macaskill, P.; Irwig, L. The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. J. Clin. Epidemiol. 2005, 58, 882–893. [Google Scholar] [CrossRef]
- Brozek, J.L.; Akl, E.A.; Jaeschke, R.; Lang, D.M.; Bossuyt, P.; Glasziou, P.; Helfand, M.; Ueffing, E.; Alonso-Coello, P.; Meerpohl, J.; et al. Grading quality of evidence and strength of recommendations in clinical practice guidelines: Part 2 of 3. The GRADE approach to grading quality of evidence about diagnostic tests and strategies. Allergy 2009, 64, 1109–1116. [Google Scholar] [CrossRef]
- He, Z.; Mao, Y.; Lu, S.; Tan, L.; Xiao, J.; Tan, P.; Zhang, H.; Li, G.; Yan, H.; Tan, J.; et al. Machine learning-based radiomics for histological classification of parotid tumors using morphological MRI: A comparative study. Eur. Radiol. 2022, 32, 8099–8110. [Google Scholar] [CrossRef]
- Yu, Q.; Ning, Y.; Wang, A.; Li, S.; Gu, J.; Li, Q.; Chen, X.; Lv, F.; Zhang, X.; Yue, Q.; et al. Deep learning-assisted diagnosis of benign and malignant parotid tumors based on contrast-enhanced CT: A multicenter study. Eur. Radiol. 2023, 33, 6054–6065. [Google Scholar] [CrossRef]
- Shen, Q.; Xiang, C.; Huang, K.; Xu, F.; Zhao, F.; Han, Y.; Liu, X.; Li, Y. Preoperative CT-based intra- and peri-tumoral radiomic models for differentiating benign and malignant tumors of the parotid gland: A two-center study. Am. J. Cancer Res. 2024, 14, 4445–4458. [Google Scholar] [CrossRef]
- Committeri, U.; Barone, S.; Salzano, G.; Arena, A.; Borriello, G.; Giovacchini, F.; Fusco, R.; Vaira, L.A.; Scarpa, A.; Abbate, V.; et al. Support Tools in the Differential Diagnosis of Salivary Gland Tumors through Inflammatory Biomarkers and Radiomics Metrics: A Preliminary Study. Cancers 2023, 15, 1876. [Google Scholar] [CrossRef] [PubMed]
- Sousa-Neto, S.S.; Nakamura, T.C.R.; Giraldo-Roldan, D.; Dos Santos, G.C.; Fonseca, F.P.; de Cáceres, C.V.B.L.; Rangel, A.L.C.A.; Martins, M.A.T.; Gabriel, A.D.F.; Zanella, V.G.; et al. Development and Evaluation of a Convolutional Neural Network for Microscopic Diagnosis Between Pleomorphic Adenoma and Carcinoma Ex-Pleomorphic Adenoma. Head Neck 2025, 47, 832–838. [Google Scholar] [CrossRef] [PubMed]
- Sousa-Neto, S.S.; Nakamura, T.C.R.; Dos Santos, G.C.; Giraldo-Roldan, D.; Gonçalves, M.W.A.; de Cáceres, C.V.B.L.; Fonseca, F.P.; Moreira, V.H.L.d.O.; Mendonça, E.F.; Martins, M.D.; et al. Comparative analysis of convolutional neural network models for the histopathological differentiation of acinic cell carcinoma and secretory carcinoma. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. 2026, 141, 387–399. [Google Scholar] [CrossRef]
- Park, S.H.; Han, K. Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction. Radiology 2018, 286, 800–809. [Google Scholar] [CrossRef]
- Ardila, C.M.; Vivares-Builes, A.M.; Pineda-Vélez, E. Molecular Biomarkers and Machine Learning in Oral Cancer: A Systematic Review and Meta-Analysis. Oral Dis. 2025. [Google Scholar] [CrossRef] [PubMed]
- Sweileh, M.W. AI-Powered histopathology slide image interpretation in oncology: A comprehensive knowledge mapping and bibliometric analysis. Digit. Health. 2025, 11, 20552076251393286. [Google Scholar] [CrossRef]
- Steiner, D.F.; Chen, P.H.C.; Mermel, C.H. Closing the translation gap: AI applications in digital pathology. Biochim. Biophys. Acta (BBA)-Rev. Cancer 2021, 1875, 188452. [Google Scholar] [CrossRef]
- Bi, W.L.; Hosny, A.; Schabath, M.B.; Giger, M.L.; Birkbak, N.J.; Mehrtash, A.; Allison, T.; Arnaout, O.; Abbosh, C.; Dunn, I.F.; et al. Artificial intelligence in cancer imaging: Clinical challenges and applications. CA Cancer J. Clin. 2019, 69, 127–157. [Google Scholar] [CrossRef]
- Jiang, R.; Woo, C.W.; Qi, S.; Wu, J.; Sui, J. Interpreting Brain Biomarkers: Challenges and solutions in interpreting machine learning-based predictive neuroimaging. IEEE Signal Process. Mag. 2022, 39, 107–118. [Google Scholar] [CrossRef]
- Ardila, C.M.; Vivares-Builes, A.M.; Pineda-Vélez, E. Image Quality, Radiation Dose, and Patient Comfort Associated with Wireless Sensors in Digital Radiography: A Systematic Review. Dent. J. 2024, 12, 267. [Google Scholar] [CrossRef]
- Ontario Health (Quality). DNA Methylation-Based Classification for Central Nervous System Tumours: A Health Technology Assessment. Ont. Health Technol. Assess. Ser. 2025, 25, 1–93. [Google Scholar]
- Echle, A.; Rindtorff, N.T.; Brinker, T.J.; Luedde, T.; Pearson, A.T.; Kather, J.N. Deep learning in cancer pathology: A new generation of clinical biomarkers. Br. J. Cancer 2021, 124, 686–696. [Google Scholar] [CrossRef] [PubMed]
- Topol, E.J. High-performance medicine: The convergence of human and artificial intelligence. Nat. Med. 2019, 25, 44–56. [Google Scholar] [CrossRef] [PubMed]
- Ardila, C.M.; Pineda-Vélez, E.; Vivares-Builes, A.M. Computational Stemness and Cancer Stem Cell Markers in Oral Squamous Cell Carcinoma: A Systematic Review, Dual Meta-Analysis, and Functional Meta-Synthesis. Med. Sci. 2025, 14, 21. [Google Scholar] [CrossRef]


| Author (Year) | Country | Modality | Sample Size (n) | Diagnostic Task | Validation Strategy |
|---|---|---|---|---|---|
| Jurmeister et al. (2024) [2] | Germany (multicenter) | DNA methylation profiling + SVM | 363 | Multi-entity SGT classification | Repeated cross-validation |
| Schulz et al. (2023) [3] | Germany | Digital histopathology (WSI CNN) | 68 | Salivary gland carcinoma classification | Train–test split (internal) |
| He et al. (2022) [21] | China | MRI radiomics + ML | 298 | Four-class parotid tumor classification | 7:3 train–test split |
| Yu et al. (2023) [22] | China (multicenter) | CT deep learning (CNN) | 573 | Benign vs. malignant parotid tumors | Training, internal testing, external testing |
| Shen et al. (2024) [23] | China (multicenter) | CT radiomics (tumor + peritumor) | 374 | Benign vs. malignant parotid tumors | Training, internal validation, external validation |
| Committeri et al. (2023) [24] | Italy | MRI radiomics + inflammatory biomarkers | 117 | WT vs. PA vs. malignant | Train–test split |
| Sousa-Neto et al. (2025) [25] | Brazil | WSI deep learning (ResNet-50) | 83 | CXPA vs. PA | Train–test split |
| Sousa-Neto et al. (2026) [26] | Brazil | WSI deep learning (multiple CNNs) | 46 | AciCC vs. SC | Train–test split |
| Study | Model/Architecture | Validation Type | AUC (95% CI If Reported) | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|---|---|
| Jurmeister et al. [2] | SVM methylation classifier | Internal (cross-validation) | Balanced accuracy 0.991 * | NR | NR | NR |
| Schulz et al. [3] | Inception v3 (CNN) | NR | 0.847 | Up to 0.85 (recall) | NR | |
| He et al. [21] | XGBoost (Step 1 test) | Internal (test set) | 0.826 (NR) | 0.899 | 0.647 | 0.958 |
| Yu et al. [22] | MobileNet V3 (CNN) | External | 0.890 (0.844–0.937) | 0.846 | 0.828 | 0.860 |
| Shen et al. [23] | Radiomics (Tumor + External2, SVM) | External | 0.745 (0.699–0.791) | 0.773 | 0.794 | 0.714 |
| Committeri et al. [24] | SVM multivariate model | Internal (test set) | NR (ROC reported) | 0.86 | 0.68 | 0.91 |
| Sousa-Neto et al. [25] | ResNet-50 (CNN) | Internal (test set) | 0.97 (NR) | 0.93 | 0.94 | 0.88 |
| Sousa-Neto et al. [26] | InceptionV3 (best accuracy); VGG16 (highest AUC) | Internal (test set) | 0.86 (NR) | 0.81 | 0.90 | 0.73 |
| Study | Patient Selection | Index Test | Reference Standard | Flow and Timing | Applicability: Patients | Applicability: Index Test | Applicability: Reference Standard |
|---|---|---|---|---|---|---|---|
| Jurmeister et al. [2] | High | High | Low | Unclear | Unclear | Low | Low |
| Schulz et al. [3] | High | High | Low | Unclear | Unclear | Low | Low |
| He et al. [21] | High | High | Low | Unclear | Unclear | Low | Low |
| Yu et al. [22] | High | High | Low | Unclear | Low | Low | Low |
| Shen et al. [23] | High | High | Low | Unclear | Low | Low | Low |
| Committeri et al. [24] | High | High | Low | Unclear | Unclear | Low | Low |
| Sousa-Neto et al. [25] | High | High | Low | Unclear | Unclear | Low | Low |
| Sousa-Neto et al. [26] | High | High | Low | Unclear | Unclear | Low | Low |
| Study | Participants | Predictors | Outcome | Analysis |
|---|---|---|---|---|
| Jurmeister et al. [2] | High | Low | Low | High |
| Schulz et al. [3] | High | Low | Low | High |
| He et al. [21] | High | Low | Low | High |
| Yu et al. [22] | High | Low | Low | High |
| Shen et al. [23] | High | Low | Low | High |
| Committeri et al. [24] | High | Low | Low | High |
| Sousa-Neto et al. [25] | High | Low | Low | High |
| Sousa-Neto et al. [26] | High | Low | Low | High |
| TRIPOD Element | Jurmeister et al. [2] | Schulz et al. [3] | He et al. [21] | Yu et al. [22] | Shen et al. [23] | Committeri et al. [24] | Sousa-Neto et al. [25] | Sousa-Neto et al. [26] |
|---|---|---|---|---|---|---|---|---|
| Population eligibility/setting described | Partial | Partial | Partial | Yes | Yes | Partial | Partial | Partial |
| Reference standard (histopathology) stated | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| Sample size reported | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| Index test/model description (modality + algorithm) | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| Model development details sufficient for replication | Partial | No/limited | Partial | Partial | Partial | Partial | Partial | Partial |
| Handling of missing data reported | Not reported | Not reported | Not reported | Not reported | Not reported | Not reported | Not reported | Not reported |
| Validation approach reported | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| External validation performed | No | No | No | Yes | Yes | No | No | No |
| Discrimination metrics reported (e.g., AUC/accuracy) | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| PPV/NPV reported for the main model | No | No | No/unclear | Yes | Yes | Unclear/limited | Partial (NPV/precision) | No |
| Calibration metrics reported (e.g., calibration curve/Brier) | No | No | No | No | Yes | No | No | No |
| Clinical utility analysis reported (e.g., DCA/net benefit) | No | No | No | No (NRI/IDI reported) | Yes | No | No | No |
| Explicit interpretability method reported (e.g., Grad-CAM/SHAP) | No | No | No | Yes (Grad-CAM) | No | No | No | No (stated not implemented) |
| Data/code availability statement | No/unclear | No/unclear | No/unclear | No/unclear | No/unclear | No/unclear | Yes (upon request) | No/unclear |
| Outcome/Evidence Base | Risk of Bias | Inconsistency | Indirectness | Imprecision | Overall Certainty |
|---|---|---|---|---|---|
| Benign vs. malignant parotid tumors (CT-based AI models with external validation): Yu et al. [22] external AUC 0.890 (95% CI 0.844–0.937), accuracy 0.846, sensitivity 0.828, specificity 0.860, PPV 0.716, NPV 0.917; Shen et al. [23] external AUC 0.745 (95% CI 0.699–0.791) for Tumor + External2 radiomics (SVM), accuracy 0.773, sensitivity 0.794, specificity 0.714, PPV 0.885, NPV 0.555; calibration curves/Brier scores and DCA reported by Shen et al. [23]. | Serious (downgrade 1) | Serious (downgrade 1) | Not serious | Serious (downgrade 1) | Low |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ardila, C.M.; Pineda-Vélez, E.; Vivares-Builes, A.M.; Díaz-Laclaustra, A.I. Artificial-Intelligence-Based Radiologic, Histopathologic, and Molecular Models for the Diagnosis and Classification of Malignant Salivary Gland Tumors: A Systematic Review and Functional Meta-Synthesis. Med. Sci. 2026, 14, 183. https://doi.org/10.3390/medsci14020183
Ardila CM, Pineda-Vélez E, Vivares-Builes AM, Díaz-Laclaustra AI. Artificial-Intelligence-Based Radiologic, Histopathologic, and Molecular Models for the Diagnosis and Classification of Malignant Salivary Gland Tumors: A Systematic Review and Functional Meta-Synthesis. Medical Sciences. 2026; 14(2):183. https://doi.org/10.3390/medsci14020183
Chicago/Turabian StyleArdila, Carlos M., Eliana Pineda-Vélez, Anny M. Vivares-Builes, and Alejandro I. Díaz-Laclaustra. 2026. "Artificial-Intelligence-Based Radiologic, Histopathologic, and Molecular Models for the Diagnosis and Classification of Malignant Salivary Gland Tumors: A Systematic Review and Functional Meta-Synthesis" Medical Sciences 14, no. 2: 183. https://doi.org/10.3390/medsci14020183
APA StyleArdila, C. M., Pineda-Vélez, E., Vivares-Builes, A. M., & Díaz-Laclaustra, A. I. (2026). Artificial-Intelligence-Based Radiologic, Histopathologic, and Molecular Models for the Diagnosis and Classification of Malignant Salivary Gland Tumors: A Systematic Review and Functional Meta-Synthesis. Medical Sciences, 14(2), 183. https://doi.org/10.3390/medsci14020183

