Development and Validation of a Non-Invasive, Chairside Oral Cavity Cancer Risk Assessment Prototype Using Machine Learning Approach
Abstract
:1. Introduction
2. Materials and Methods
2.1. Data Preparation
2.2. Feature Ranking
2.3. Dimensionality Reduction
2.4. Data Analysis
2.5. Sensitivity Analysis
3. Results
4. Discussion
Limitations
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- The Oral Cancer Foundation. Rates of Occurrence in the United States. Published 2022. Available online: https://oralcancerfoundation.org/facts/ (accessed on 9 March 2022).
- American Cancer Society. Key Statistics for Oral Cavity and Oropharyngeal Cancers. Published 2022. Available online: https://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-statistics/annual-cancer-facts-and-figures/2019/cancer-facts-and-figures-2019.pdf (accessed on 9 March 2022).
- Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J. Clin. 2021, 71, 209–249. [Google Scholar] [CrossRef] [PubMed]
- Cancer.Net. Oral and Oropharyngeal Cancer: Statistics | Cancer.Net. Available online: https://www.cancer.net/cancer-types/oral-and-oropharyngeal-cancer/statistics (accessed on 9 March 2022).
- Glurich, I.; Acharya, A.; Brilliant, M.H.; Shukla, S.K. Progress in oral personalized medicine: Contribution of ‘omics’. J. Oral Microbiol. 2015, 7, 28223. [Google Scholar] [CrossRef] [PubMed]
- American Cancer Society. Risk Factors for Oral Cavity and Oropharyngeal Cancers. Published 2022. Available online: https://www.cancer.org/cancer/oral-cavity-and-oropharyngeal-cancer/causes-risks-prevention/risk-factors.html (accessed on 9 March 2022).
- Yardimci, G. Precancerous lesions of oral mucosa. World J. Clin. Cases 2014, 2, 866–872. [Google Scholar] [CrossRef] [PubMed]
- Tovaru, S.; Costache, M.; Perlea, P.; Caramida, M.; Totan, C.; Warnakulasuriya, S.; Parlatescu, I. Oral leukoplakia: A clinicopathological study and malignant transformation. Oral Dis. 2022. [Google Scholar] [CrossRef] [PubMed]
- Chaturvedi, P.; Singhvi, H.R.; Malik, A. The role of chronic mucosal trauma in oral cancer: A review of literature. Indian J. Med Paediatr. Oncol. 2017, 38, 44–50. [Google Scholar] [CrossRef]
- Jurel, S.K.; Gupta, D.S.; Singh, R.D.; Singh, M.; Srivastava, S. Genes and oral cancer. Indian J. Hum. Genet. 2014, 20, 4–9. [Google Scholar] [CrossRef] [PubMed]
- Vogelstein, B.; Kinzler, K.W. The multistep nature of cancer. Trends Genet. 1993, 9, 138–141. [Google Scholar] [CrossRef]
- Van der Riet, P.; Nawroz, H.; Hruban, R.H.; Corio, R.; Tokino, K.; Koch, W.; Sidransky, D. Frequent Loss of Chromosome 9p21-22 Early in Head and Neck Cancer Progression1. Available online: http://aacrjournals.org/cancerres/article-pdf/54/5/1156/2456291/cr0540051156.pdf (accessed on 11 March 2022).
- Shimpi, N.; Bharatkumar, A.; Jethwani, M.; Chyou, P.-H.; Glurich, I.; Blamer, J.; Acharya, A. Knowledgeability, Attitude and Behavior of Primary Care Providers Towards Oral Cancer: A Pilot Study. J. Cancer Educ. 2018, 33, 359–364. [Google Scholar] [CrossRef]
- Atchison, K.; Weintraub, J.; Rozier, R. Bridging the dental-medical divide: Case studies integrating oral health care and primary care. J. Am. Dent. Assoc. 2018, 149, 850–858. [Google Scholar] [CrossRef]
- Nycz, G.; Shimpi, N.; Glurich, I.; Ryan, M.; Sova, G.; Weiner, S.; Nichols, L.; Acharya, A. Positioning operations in the dental safety net to enhance value-based care delivery in an integrated health-care setting. J. Public Health Dent. 2020, 80, S71–S76. [Google Scholar] [CrossRef]
- Hegde, H.; Shimpi, N.; Panny, A.; Glurich, I.; Christie, P.; Acharya, A. Development of non-invasive diabetes risk prediction models as decision support tools designed for application in the dental clinical environment. Inform. Med. Unlocked 2019, 17, 100254. [Google Scholar] [CrossRef] [PubMed]
- Shimpi, N.; McRoy, S.; Zhao, H.; Wu, M.; Acharya, A. Development of a periodontitis risk assessment model for primary care providers in an interdisciplinary setting. Technol. Health Care 2020, 28, 143–154. [Google Scholar] [CrossRef] [PubMed]
- Rindal, D.B.; Mabry, P.L. Leveraging Clinical Decision Support and Integrated Medical-Dental Electronic Health Records to Implementing Precision in Oral Cancer Risk Assessment and Preventive Intervention. J. Pers. Med. 2021, 11, 832. [Google Scholar] [CrossRef] [PubMed]
- García-Pola, M.; Pons-Fuster, E.; Suárez-Fernández, C.; Seoane-Romero, J.; Romero-Méndez, A.; López-Jornet, P. Role of Artificial Intelligence in the Early Diagnosis of Oral Cancer. A Scoping Review. Cancers 2021, 13, 4600. [Google Scholar] [CrossRef] [PubMed]
- Niu, X.; Roche, L.M.; Pawlish, K.S.; Henry, K.A. Cancer survival disparities by health insurance status. Cancer Med. 2013, 2, 403–411. [Google Scholar] [CrossRef]
- Tsai, I.-J.; Shen, W.-C.; Lee, C.-L.; Wang, H.-D.; Lin, C.-Y. Machine Learning in Prediction of Bladder Cancer on Clinical Laboratory Data. Diagnostics 2022, 12, 203. [Google Scholar] [CrossRef]
- Asri, H.; Mousannif, H.; Al Moatassime, H.; Noel, T. Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis. Procedia Comput. Sci. 2016, 83, 1064–1069. [Google Scholar] [CrossRef] [Green Version]
- Osareh, A.; Shadgar, B. Machine learning techniques to diagnose breast cancer. In Proceedings of the 2010 5th International Symposium on Health Informatics and Bioinformatics HIBIT, Ankara, Turkey, 20–22 April 2010; pp. 114–120. [Google Scholar] [CrossRef]
- Wulczyn, E.; Steiner, D.F.; Moran, M.; Plass, M.; Reihs, R.; Tan, F.; Flament-Auvigne, I.; Brown, T.; Regitnig, P.; Chen, P.-H.C.; et al. Interpretable survival prediction for colorectal cancer using deep learning. NPJ Digit. Med. 2021, 4, 1–13. [Google Scholar] [CrossRef]
- Nartowt, B.J.; Hart, G.R.; Muhammad, W.; Liang, Y.; Stark, G.F.; Deng, J. Robust Machine Learning for Colorectal Cancer Risk Prediction and Stratification. Front. Big Data 2020, 3, 6. [Google Scholar] [CrossRef] [Green Version]
- Podolsky, M.D.; Barchuk, A.; Kuznetcov, V.I.; Gusarova, N.; Gaidukov, V.S.; Tarakanov, S.A. Evaluation of Machine Learning Algorithm Utilization for Lung Cancer Classification Based on Gene Expression Levels. Asian Pac. J. Cancer Prev. 2016, 17, 835–838. [Google Scholar] [CrossRef] [Green Version]
- Cai, Z.; Xu, D.; Zhang, Q.; Zhang, J.; Ngai, S.-M.; Shao, J. Classification of lung cancer using ensemble-based feature selection and machine learning methods. Mol. BioSyst. 2015, 11, 791–800. [Google Scholar] [CrossRef] [PubMed]
- Forgionne, G.; Gangopadhyay, A.; Adya, M. Cancer Surveillance using Data Warehousing, Data Mining, and Decision Support Systems. Top Health Inf Manage. Published online 1 August 2000. Available online: https://epublications.marquette.edu/mgmt_fac/76 (accessed on 9 March 2022).
- Sharma, N.; Om, H. Framework for early detection and prevention of oral cancer using data mining. Int. J. Adv. Eng. Technol. 2012, 302, 302–310. [Google Scholar]
- Sharma, N.; Om, H. Usage of Probabilistic and General Regression Neural Network for Early Detection and Prevention of Oral Cancer. Sci. World, J. 2015, 2015, 234191. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Speight, P.M.; Elliott, A.E.; Jullien, J.A.; Downer, M.C.; Zakzrewska, J.M. The use of artificial intelligence to identify people at risk of oral cancer and precancer. Br. Dent. J. 1995, 179, 382–387. [Google Scholar] [CrossRef]
- Shimpi, N.; Glurich, I.; Acharya, A. Integrated Care Case Study: Marshfield Clinic Health System. In Integration of Medical and Dental Care and Patient Data, 2nd ed.; Acharya, A., Powell, V., Torres-Urquidy, M.H., Posteraro, R.H., Thyvalikakath, T., Eds.; Springer: Cham, Switzerland, 2019; pp. 315–326. [Google Scholar] [CrossRef]
- MD. JavaScript | MDN. Published 2020. Available online: https://developer.mozilla.org/en-US/docs/Web/JavaScript (accessed on 23 January 2020).
- Witten, I.H.; Frank, E.; Hall, M.A. Data Mining: Practical Machine Learning Tools and Techniques, 3rd ed.; Morgan Kaufmann/Elsevier: Burlington, NJ, USA, 2011; Volume 54. [Google Scholar] [CrossRef]
- Saltelli, A.; Ratto, M.; Andres, T.; Campolongo, F.; Cariboni, J.; Gatelli, D.; Saisana, M.; Tarantola, S. (Eds.) Global Sensitivity Analysis, 1st ed.; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2008; Volume 1. [Google Scholar]
- Yao, J.T. Sensitivity Analysis for Data Mining. In Proceedings of the 22nd International Conference of the North American Fuzzy Information Processing Society (NAFIPS); 2003; pp. 272–277. Available online: http://www2.cs.uregina.ca/~jtyao/Papers/NAFIPS-E-178.pdf (accessed on 10 March 2022).
- Kent, S. Diagnosis of Oral Cancer Using Genetic Programming; Tech. Rep. CSTR-96-14; Brunel University: Uxbridge, UK, 1996. [Google Scholar]
- Tseng, W.-T.; Chiang, W.-F.; Liu, S.-Y.; Roan, J.; Lin, C.-N. The Application of Data Mining Techniques to Oral Cancer Prognosis. J. Med. Syst. 2015, 39, 59. [Google Scholar] [CrossRef]
- Rosma, M.; Sameem, A.; Basir, A.; Mazlipah, I.; Norzaidi, M. The use of artificial intelligence to identify people at risk of oral cancer: Empirical evidence in Malaysian university. Int. J. Sci. Res. Educ. 2010, 3, 10–20. Available online: https://www.researchgate.net/publication/299747442_The_use_of_artificial_intelligence_to_identify_people_at_risk_of_oral_cancer_empirical_evidence_in_Malaysian_university (accessed on 10 March 2022).
- Tan, M.S.; Tan, J.W.; Chang, S.-W.; Yap, H.J.; Kareem, S.A.; Zain, R.B. A genetic programming approach to oral cancer prognosis. PeerJ 2016, 4, e2482. [Google Scholar] [CrossRef] [Green Version]
- National Cancer Institute. Alcohol and Cancer Risk Fact Sheet—National Cancer Institute. Published 2022. Available online: https://www.cancer.gov/about-cancer/causes-prevention/risk/alcohol/alcohol-fact-sheet (accessed on 10 March 2022).
- Hegde, H.; Shimpi, N.; Glurich, I.; Acharya, A. Tobacco use status from clinical notes using Natural Language Processing and rule based algorithm. Technol. Health Care 2018, 26, 445–456. [Google Scholar] [CrossRef]
Feature | Gain Ratio |
---|---|
Diseases of lips | 0.1429 |
Disorders of oral soft tissue | 0.1277 |
Leukoplakia in oral mucosa | 0.1232 |
Presence of swelling or lump in mouth | 0.0483 |
Throat pain | 0.0398 |
Esophageal reflux | 0.0287 |
Stomatitis and mucositis | 0.0099 |
Radiation therapy | 0.0087 |
Oral aphthae | 0.0032 |
Oral thrush | 0.0021 |
Tobacco use | 0.0008 |
Chemotherapy | 0.0003 |
Alcohol abuse | 0.0000 |
Reference No | N | ML Algorithms Used | Sensitivity | Specificity |
---|---|---|---|---|
Speight et al. [31] | 1662 | Neural network (NN) | 80% | 77% |
Kent et al. [37] | 1939 | Genetic programming (GP) NN | 73% GP 64% NN | 65% GP 68% NN |
Tseng et al. [38] | 1099 | Decision tree (DT) NN Logistic regression (LR) | Total accuracy: DT: 95.8% LR: 67.6% NN: 93.8% | |
Rosma et al. [39] | 191 | Fuzzy NNs Fuzzy regression | 46% fuzzy NN 69% fuzzy regression | 85% fuzzy NN 65% fuzzy regression |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shimpi, N.; Glurich, I.; Rostami, R.; Hegde, H.; Olson, B.; Acharya, A. Development and Validation of a Non-Invasive, Chairside Oral Cavity Cancer Risk Assessment Prototype Using Machine Learning Approach. J. Pers. Med. 2022, 12, 614. https://doi.org/10.3390/jpm12040614
Shimpi N, Glurich I, Rostami R, Hegde H, Olson B, Acharya A. Development and Validation of a Non-Invasive, Chairside Oral Cavity Cancer Risk Assessment Prototype Using Machine Learning Approach. Journal of Personalized Medicine. 2022; 12(4):614. https://doi.org/10.3390/jpm12040614
Chicago/Turabian StyleShimpi, Neel, Ingrid Glurich, Reihaneh Rostami, Harshad Hegde, Brent Olson, and Amit Acharya. 2022. "Development and Validation of a Non-Invasive, Chairside Oral Cavity Cancer Risk Assessment Prototype Using Machine Learning Approach" Journal of Personalized Medicine 12, no. 4: 614. https://doi.org/10.3390/jpm12040614