Open AccessArticle
Nomogram Based on the Most Relevant Clinical, CT, and Radiomic Features, and a Machine Learning Model to Predict EGFR Mutation Status in Non-Small Cell Lung Cancer
by
Anass Benfares, Abdelali yahya Mourabiti, Badreddine Alami, Sara Boukansa, Ikram Benomar, Nizar El Bouardi, Moulay Youssef Alaoui Lamrani, Hind El Fatimi, Bouchra Amara, Mounia Serraj, Mohammed Smahi, Abdeljabbar Cherkaoui, Mamoun Qjidaa, Ahmed Lakhssassi, Mohammed Ouazzani Jamil, Mustapha Maaroufi and Hassan Qjidaa
Viewed by 296
Abstract
Background: This study aimed to develop a nomogram based on the most relevant clinical, CT, and radiomic features comprising 11 key signatures (2 clinical, 2 CT-based, and 7 radiomic) for the non-invasive prediction of the EGFR mutation status and to support the timely
[...] Read more.
Background: This study aimed to develop a nomogram based on the most relevant clinical, CT, and radiomic features comprising 11 key signatures (2 clinical, 2 CT-based, and 7 radiomic) for the non-invasive prediction of the EGFR mutation status and to support the timely initiation of tyrosine kinase inhibitor (TKI) therapy in patients with non-small cell lung cancer (NSCLC) adenocarcinoma. Methods: Retrospective real-world data were collected from 521 patients with histologically confirmed NSCLC adenocarcinoma who underwent CT imaging and either surgical resection or pathological biopsy for EGFR mutation testing. Five Random Forest classification models were developed and trained on various datasets constructed by combining clinical, CT, and radiomic features extracted from CT image regions of interest (ROIs), with and without feature preselection. Results: The model trained exclusively on the most relevant clinical, CT, and radiomic features demonstrated superior predictive performance compared to the other models, with strong discrimination between EGFR-mutant and wild-type cases (AUC = 0.88; macro-average = 0.90; micro-average = 0.89; precision = 0.90; recall = 0.94; F1-score = 0.91; and accuracy = 0.87). Conclusions: A nomogram constructed using a Random Forest model trained solely on the most informative clinical, CT, and radiomic features outperformed alternative approaches in the non-invasive prediction of the EGFR mutation status, offering a promising decision-support tool for precision treatment planning in NSCLC.
Full article
►▼
Show Figures