A Weakly Supervised Approach for HPV Status Prediction in Oropharyngeal Carcinoma from H&E-Stained Slides
Simple Summary
Abstract
1. Introduction
1.1. Human Papillomavirus (HPV) in Head and Neck Cancer
1.2. Artificial Intelligence for HPV Status Prediction from H&E Slides
2. Materials and Methods
2.1. Data Collection
2.2. Slide Digitization
2.3. Workflow Overview
2.4. Computational Framework: CLAM
2.4.1. Overview
2.4.2. Performance Evaluation and Interpretability
2.5. Feature Evaluation
2.5.1. Morphological Feature-Based Classification
2.5.2. Computational Environment
3. Results
3.1. CLAM Model Performance on Internal Cross-Validation
3.2. Global Classification and Probability Analysis
3.3. External Test Set Performance
3.4. Interpretability Through Attention Maps
3.5. Cell-Level Analysis
4. Discussion
5. Limitations and Future Directions
6. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| ACC | Accuracy |
| AJCC | American Joint Committee on Cancer |
| AUC | Area under the ROC curve |
| AI | Artificial intelligence |
| CLAM | Clustering-constrained Attention Multiple-Instance Learning |
| DL | Deep learning |
| FFPE | Formalin-fixed paraffin-embedded |
| H&E | Hematoxylin and eosin |
| HPV | Human papillomavirus |
| IHC | Immunohistochemistry |
| ISH | In situ hybridization |
| ML | Machine learning |
| OPSCC | Oropharyngeal squamous cell carcinoma |
| OS | Overall survival |
| ROI | Region of interest |
| RF | Random Forest |
| TCGA | The Cancer Genome Atlas |
| WSI | Whole-slide image |
References
- Lechner, M.; Liu, J.; Masterson, L.; Fenton, T.R. HPV-associated oropharyngeal cancer: Epidemiology, molecular biology and clinical management. Nat. Rev. Clin. Oncol. 2022, 19, 306–327. [Google Scholar] [CrossRef]
- Ang, K.K.; Harris, J.; Wheeler, R.; Weber, R.; Rosenthal, D.I.; Nguyen-Tân, P.F.; Westra, W.H.; Chung, C.H.; Jordan, R.C.; Lu, C.; et al. Human papillomavirus and survival of patients with oropharyngeal cancer. N. Engl. J. Med. 2010, 363, 24–35. [Google Scholar] [PubMed]
- Fakhry, C.; Westra, W.H.; Li, S.; Cmelak, A.; Ridge, J.A.; Pinto, H.; Forastiere, A.; Gillison, M.L. Improved survival of patients with human papillomavirus–positive head and neck squamous cell carcinoma in a prospective clinical trial. J. Natl. Cancer Inst. 2008, 100, 261–269. [Google Scholar]
- Rischin, D.; Young, R.J.; Fisher, R.; Fox, S.B.; Le, Q.T.; Peters, L.J.; Solomon, B.; Choi, J.; O’Sullivan, B.; Kenny, L.M.; et al. Prognostic significance of p16INK4A and human papillomavirus in patients with oropharyngeal cancer treated on TROG 02.02 phase III trial. J. Clin. Oncol. 2010, 28, 4142–4148. [Google Scholar] [CrossRef]
- Posner, M.; Lorch, J.; Goloubeva, O.; Tan, M.; Schumaker, L.; Sarlis, N.; Haddad, R.; Cullen, K. Survival and human papillomavirus in oropharynx cancer in TAX 324: A subset analysis from an international phase III trial. Ann. Oncol. 2011, 22, 1071–1077. [Google Scholar] [CrossRef]
- Perri, F.; Longo, F.; Caponigro, F.; Sandomenico, F.; Guida, A.; Della Vittoria Scarpati, G.; Ottaiano, A.; Muto, P.; Ionna, F. Management of HPV-related squamous cell carcinoma of the head and neck: Pitfalls and caveat. Cancers 2020, 12, 975. [Google Scholar] [CrossRef]
- Rosenberg, A.J.; Vokes, E.E. Optimizing treatment de-escalation in head and neck cancer: Current and future perspectives. Oncologist 2021, 26, 40–48. [Google Scholar]
- Cmelak, A.; Li, S.; Marur, S.; Zhao, W.; Westra, W.H.; Chung, C.H.; Gillison, M.L.; Gilbert, J.; Bauman, J.E.; Wagner, L.I.; et al. E1308: Reduced-dose IMRT in human papilloma virus (HPV)-associated resectable oropharyngeal squamous carcinomas (OPSCC) after clinical complete response (cCR) to induction chemotherapy (IC). J. Clin. Oncol. 2014, 32, LBA6006. [Google Scholar] [CrossRef]
- Chera, B.S.; Amdur, R.J.; Tepper, J.E.; Tan, X.; Weiss, J.; Grilley-Olson, J.E.; Hayes, D.N.; Zanation, A.; Hackman, T.G.; Patel, S.; et al. Mature results of a prospective study of deintensified chemoradiotherapy for low-risk human papillomavirus-associated oropharyngeal squamous cell carcinoma. Cancer 2018, 124, 2347–2354. [Google Scholar]
- Yom, S.; Harris, J.; Caudell, J.; Geiger, J.; Waldron, J.; Gillison, M.; Subramaniam, R.; Yao, M.; Xiao, C.; Kovalchuk, N.; et al. Interim futility results of NRG-HN005, A randomized, phase II/III non-inferiority trial for non-smoking p16+ oropharyngeal cancer patients. Int. J. Radiat. Oncol. Biol. Phys. 2024, 120, S2–S3. [Google Scholar] [CrossRef]
- Craig, S.G.; Anderson, L.A.; Schache, A.G.; Moran, M.; Graham, L.; Currie, K.; Rooney, K.; Robinson, M.; Upile, N.S.; Brooker, R.; et al. Recommendations for determining HPV status in patients with oropharyngeal cancers under TNM8 guidelines: A two-tier approach. Br. J. Cancer 2019, 120, 827–833. [Google Scholar] [CrossRef]
- Jordan, R.C.; Lingen, M.W.; Perez-Ordonez, B.; He, X.; Pickard, R.; Koluder, M.; Jiang, B.; Wakely, P.; Xiao, W.; Gillison, M.L. Validation of methods for oropharyngeal cancer HPV status determination in US cooperative group trials. Am. J. Surg. Pathol. 2012, 36, 945–954. [Google Scholar] [CrossRef] [PubMed]
- Singhi, A.D.; Westra, W.H. Comparison of human papillomavirus in situ hybridization and p16 immunohistochemistry in the detection of human papillomavirus-associated head and neck cancer based on a prospective clinical experience. Cancer 2010, 116, 2166–2173. [Google Scholar] [CrossRef]
- Wang, H.; Zhang, Y.; Bai, W.; Wang, B.; Wei, J.; Ji, R.; Xin, Y.; Dong, L.; Jiang, X. Feasibility of immunohistochemical p16 staining in the diagnosis of human papillomavirus infection in patients with squamous cell carcinoma of the head and neck: A systematic review and meta-analysis. Front. Oncol. 2020, 10, 524928. [Google Scholar] [CrossRef]
- Cui, M.; Zhang, D.Y. Artificial intelligence and computational pathology. Lab. Investig. 2021, 101, 412–422. [Google Scholar] [PubMed]
- Tizhoosh, H.R.; Pantanowitz, L. Artificial intelligence and digital pathology: Challenges and opportunities. J. Pathol. Inform. 2018, 9, 38. [Google Scholar] [PubMed]
- Song, C.; Chen, X.; Tang, C.; Xue, P.; Jiang, Y.; Qiao, Y. Artificial intelligence for HPV status prediction based on disease-specific images in head and neck cancer: A systematic review and meta-analysis. J. Med. Virol. 2023, 95, e29080. [Google Scholar]
- Broggi, G.; Maniaci, A.; Lentini, M.; Palicelli, A.; Zanelli, M.; Zizzo, M.; Koufopoulos, N.; Salzano, S.; Mazzucchelli, M.; Caltabiano, R. Artificial intelligence in head and neck cancer diagnosis: A comprehensive review with emphasis on radiomics, histopathological, and molecular applications. Cancers 2024, 16, 3623. [Google Scholar] [CrossRef]
- Migliorelli, A.; Manuelli, M.; Ciorba, A.; Stomeo, F.; Pelucchi, S.; Bianchini, C. Role of Artificial Intelligence in Human Papillomavirus Status Prediction for Oropharyngeal Cancer: A Scoping Review. Cancers 2024, 16, 4040. [Google Scholar] [CrossRef]
- Yao, H.; Zhang, X. A comprehensive review for machine learning based human papillomavirus detection in forensic identification with multiple medical samples. Front. Microbiol. 2023, 14, 1232295. [Google Scholar] [CrossRef]
- Klein, S.; Quaas, A.; Quantius, J.; Löser, H.; Meinel, J.; Peifer, M.; Wagner, S.; Gattenlöhner, S.; Wittekindt, C.; von Knebel Doeberitz, M.; et al. Deep learning predicts HPV association in oropharyngeal squamous cell carcinomas and identifies patients with a favorable prognosis using regular H&E stains. Clin. Cancer Res. 2021, 27, 1131–1138. [Google Scholar] [CrossRef]
- Wang, R.; Khurram, S.A.; Walsh, H.; Young, L.S.; Rajpoot, N. A novel deep learning algorithm for human papillomavirus infection prediction in head and neck cancers using routine histology images. Mod. Pathol. 2023, 36, 100320. [Google Scholar] [CrossRef]
- Varricchio, S.; Ilardi, G.; Crispino, A.; D’Angelo, M.P.; Russo, D.; Di Crescenzo, R.M.; Staibano, S.; Merolla, F. A machine learning approach to predict HPV positivity of oropharyngeal squamous cell carcinoma. Pathol.-J. Ital. Soc. Anat. Pathol. Diagn. Cytopathol. 2024, 116. [Google Scholar] [CrossRef]
- Lu, M.Y.; Williamson, D.F.; Chen, T.Y.; Chen, R.J.; Barbieri, M.; Mahmood, F. Data-efficient and weakly supervised computational pathology on whole-slide images. Nat. Biomed. Eng. 2021, 5, 555–570. [Google Scholar]
- Bankhead, P.; Loughrey, M.B.; Fernández, J.A.; Dombrowski, Y.; McArt, D.G.; Dunne, P.D.; McQuaid, S.; Gray, R.T.; Murray, L.J.; Coleman, H.G.; et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 2017, 7, 16878. [Google Scholar] [CrossRef]
- Klein, S.; Wuerdemann, N.; Demers, I.; Kopp, C.; Quantius, J.; Charpentier, A.; Tolkach, Y.; Brinker, K.; Sharma, S.J.; George, J.; et al. Predicting HPV association using deep learning and regular H&E stains allows granular stratification of oropharyngeal cancer patients. npj Digit. Med. 2023, 6, 152. [Google Scholar] [CrossRef]
- Adachi, M.; Taki, T.; Sakamoto, N.; Kojima, M.; Hirao, A.; Matsuura, K.; Hayashi, R.; Tabuchi, K.; Ishikawa, S.; Ishii, G.; et al. Extracting interpretable features for pathologists using weakly supervised learning to predict p16 expression in oropharyngeal cancer. Sci. Rep. 2024, 14, 4506. [Google Scholar] [CrossRef] [PubMed]
- Lewis, J.S. Morphologic diversity in human papillomavirus-related oropharyngeal squamous cell carcinoma: Catch Me If You Can! Mod. Pathol. 2017, 30, S44–S53. [Google Scholar] [CrossRef][Green Version]
- Chernock, R.D.; El-Mofty, S.K.; Thorstad, W.L.; Parvin, C.A.; Lewis, J.S. HPV-Related Nonkeratinizing Squamous Cell Carcinoma of the Oropharynx: Utility of Microscopic Features in Predicting Patient Outcome. Head Neck Pathol. 2009, 3, 186–194. [Google Scholar] [CrossRef] [PubMed]
- Lim, Y.X.; Mierzwa, M.L.; Sartor, M.A.; D’Silva, N.J. Clinical, morphologic and molecular heterogeneity of HPV-associated oropharyngeal cancer. Oncogene 2023, 42, 2939–2955. [Google Scholar] [CrossRef] [PubMed]
- Molony, P.; Werner, R.; Martin, C.; Callanan, D.; Nauta, I.; Heideman, D.; Sheahan, P.; Heffron, C.; Feeley, L. The role of tumour morphology in assigning HPV status in oropharyngeal squamous cell carcinoma. Oral Oncol. 2020, 105, 104670. [Google Scholar] [CrossRef]
- Skjervold, A.H.; Pettersen, H.S.; Valla, M.; Opdahl, S.; Bofin, A.M. Visual and digital assessment of Ki-67 in breast cancer tissue—A comparison of methods. Diagn. Pathol. 2022, 17, 45. [Google Scholar] [CrossRef] [PubMed]
- Martino, F.; Varricchio, S.; Russo, D.; Merolla, F.; Ilardi, G.; Mascolo, M.; dell’Aversana, G.O.; Califano, L.; Toscano, G.; De Pietro, G.; et al. A Machine-learning Approach for the Assessment of the Proliferative Compartment of Solid Tumors on Hematoxylin-Eosin-Stained Sections. Cancers 2020, 12, 1344. [Google Scholar] [CrossRef]
- Angeloni, M.; Rizzi, D.; Schoen, S.; Caputo, A.; Merolla, F.; Hartmann, A.; Ferrazzi, F.; Fraggetta, F. Closing the gap in the clinical adoption of computational pathology: A standardized, open-source framework to integrate deep-learning models into the laboratory information system. Genome Med. 2025, 17, 60. [Google Scholar] [CrossRef] [PubMed]






| Fold | Test AUC | Val AUC | Test Acc | Val Acc |
|---|---|---|---|---|
| 0 | 0.4821 | 0.2424 | 0.4667 | 0.7857 |
| 1 | 0.4571 | 1.0000 | 0.4167 | 0.9091 |
| 2 | 0.7407 | 0.5833 | 0.5000 | 0.4000 |
| 3 | 0.7143 | 0.1750 | 0.6667 | 0.7143 |
| 4 | 0.5000 | 0.8571 | 0.5455 | 0.8889 |
| 5 | 0.4286 | 0.9394 | 0.6667 | 0.8571 |
| 6 | 1.0000 | 0.6667 | 0.8000 | 0.6429 |
| 7 | 0.1875 | 1.0000 | 0.5625 | 0.9231 |
| 8 | 0.4688 | 0.8393 | 0.5833 | 0.8000 |
| 9 | 0.3500 | 0.8750 | 0.4444 | 0.9000 |
| Average | 0.5324 | 0.7178 | 0.5652 | 0.7821 |
| Dataset | Total WSIs | Correctly Classified | Misclassified | Accuracy (%) |
|---|---|---|---|---|
| OPSCC-UNINA | 113 | 90 | 23 | 79.65 |
| TCGA | 10 | 7 | 3 | 70.0 |
| Total | 123 | 97 | 26 | 78.9 |
| Class | Precision | Recall | F1-Score |
|---|---|---|---|
| Negative | 0.82 | 0.84 | 0.83 |
| Positive | 0.84 | 0.81 | 0.83 |
| Macro average | 0.83 | 0.83 | 0.83 |
| Weighted average | 0.83 | 0.83 | 0.83 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Crispino, A.; Varricchio, S.; Marfella, A.; Cerbone, D.; Russo, D.; Di Crescenzo, R.M.; Staibano, S.; Merolla, F.; Ilardi, G. A Weakly Supervised Approach for HPV Status Prediction in Oropharyngeal Carcinoma from H&E-Stained Slides. Cancers 2025, 17, 3938. https://doi.org/10.3390/cancers17243938
Crispino A, Varricchio S, Marfella A, Cerbone D, Russo D, Di Crescenzo RM, Staibano S, Merolla F, Ilardi G. A Weakly Supervised Approach for HPV Status Prediction in Oropharyngeal Carcinoma from H&E-Stained Slides. Cancers. 2025; 17(24):3938. https://doi.org/10.3390/cancers17243938
Chicago/Turabian StyleCrispino, Angela, Silvia Varricchio, Alessandra Marfella, Dora Cerbone, Daniela Russo, Rosa Maria Di Crescenzo, Stefania Staibano, Francesco Merolla, and Gennaro Ilardi. 2025. "A Weakly Supervised Approach for HPV Status Prediction in Oropharyngeal Carcinoma from H&E-Stained Slides" Cancers 17, no. 24: 3938. https://doi.org/10.3390/cancers17243938
APA StyleCrispino, A., Varricchio, S., Marfella, A., Cerbone, D., Russo, D., Di Crescenzo, R. M., Staibano, S., Merolla, F., & Ilardi, G. (2025). A Weakly Supervised Approach for HPV Status Prediction in Oropharyngeal Carcinoma from H&E-Stained Slides. Cancers, 17(24), 3938. https://doi.org/10.3390/cancers17243938

