Next Article in Journal
Association of Alpha-1 Antitrypsin Pi*Z Allele Frequency and Progressive Liver Fibrosis in Two Chronic Hepatitis C Cohorts
Previous Article in Journal
Phosphaturic Mesenchymal Tumors: Rethinking the Clinical Diagnosis and Surgical Treatment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Role of Radiomics Features and Machine Learning for the Histological Classification of Stage I and Stage II NSCLC at [18F]FDG PET/CT: A Comparison between Two PET/CT Scanners

1
Nuclear Medicine, ASST Spedali Civili Brescia, 25123 Brescia, Italy
2
Dipartimento di Scienze Cliniche e Sperimentali, Università degli Studi di Brescia, 25123 Brescia, Italy
3
Nuclear Medicine, Università degli Studi di Brescia and ASST Spedali Civili Brescia, 25123 Brescia, Italy
4
Nuclear Medicine, Imaging Institute of Southern Switzerland, Ente Ospedaliero Cantonale, 6500 Bellinzona, Switzerland
5
Department of Nuclear Medicine and Molecular Imaging, Lausanne University Hospital, University of Lausanne, 1011 Lausanne, Switzerland
6
Faculty of Biomedical Sciences, Università della Svizzera Italiana, 6900 Lugano, Switzerland
*
Author to whom correspondence should be addressed.
J. Clin. Med. 2023, 12(1), 255; https://doi.org/10.3390/jcm12010255
Submission received: 4 November 2022 / Revised: 7 December 2022 / Accepted: 28 December 2022 / Published: 29 December 2022
(This article belongs to the Section Nuclear Medicine & Radiology)

Abstract

:
The aim of this study was to compare two different PET/CT tomographs for the evaluation of the role of radiomics features (RaF) and machine learning (ML) in the prediction of the histological classification of stage I and II non-small-cell lung cancer (NSCLC) at baseline [18F]FDG PET/CT. A total of 227 patients were retrospectively included and, after volumetric segmentation, RaF were extracted. All of the features were tested for significant differences between the two scanners and considering both the scanners together, and their performances in predicting the histology of NSCLC were analyzed by testing of different ML approaches: Logistic Regressor (LR), k-Nearest Neighbors (kNN), Decision Tree (DT) and Random Forest (RF). In general, the models with best performances for all the scanners were kNN and LR and moreover the kNN model had better performances compared to the other. The impact of the PET/CT scanner used for the acquisition of the scans on the performances of RaF was evident: mean area under the curve (AUC) values for scanner 2 were lower compared to scanner 1 and both the scanner considered together. In conclusion, our study enabled the selection of some [18F]FDG PET/CT RaF and ML models that are able to predict with good performances the histological subtype of NSCLC. Furthermore, the type of PET/CT scanner may influence these performances.

1. Introduction

Non-small-cell lung cancer (NSCLC) is a frequent form of neoplasm with globally rising incidence, accounting for most cancer-related deaths worldwide [1,2,3]. The risk factors associated with the development of disease are mainly environmental, with cigarette smoking as the most important [4,5]. The three main histological types of NSCLC, according to World Health Organization (WHO), are adenocarcinoma (ADK), squamous cell carcinoma (SCC) and large cell carcinoma [4].
The correct diagnosis of NSCLC is often made in advanced stages, since the disease can become symptomatic only in these stages. In this setting, the main symptoms are presented by cough, hemoptysis, chest pain and dyspnea [4,6].
The correct diagnostic and staging assessment of NSCLC is performed with imaging evaluation with chest X-ray (XR) and chest computed tomography (CT) that are pivotal for the evaluation of disease. In this setting, such imaging modalities directly drive the treatment of the disease and are also able to perform distant staging, being useful even for its follow-up [4]. The clinical outcome of NSCLC is directly related to its stage at diagnosis: early stages are usually managed with surgical resection and good prognosis, while advanced and metastatic disease can benefit from adjuvant therapy [7,8,9,10,11].
In the recent years, positron emission tomography/computed tomography (PET/CT) with different tracers is emerging as a fundamental imaging modality for the assessment of a high amount of neoplastic and infectious diseases [12,13]. In this scenario, 18F-fluorodeoxyglucose ([18F]FDG) PET/CT is routinely performed for the staging of NSCLC and the usefulness of some semiquantitative parameters in predicting the prognosis of patients has been proved [14,15]. In this setting, the possible role of [18F]FDG PET/CT radiomics features (RaF) to differentiate between malignant and benign lesions in various organs has recently emerged and pulmonary nodules do not make exception [16,17]. Furthermore, it has also been reported that texture analysis is somehow able to differentiate between ADK and SCC, however with heterogeneous findings in terms of PET/CT acquisition, RaF extraction and results [18,19,20,21,22,23,24,25]. The populations considered in such studies were characterized by high heterogeneity in terms of clinicopathological features and, as mentioned, prognosis of patients is related to the stage of disease at the time of diagnosis. Furthermore, it is known that different scanners and protocols used for the acquisition and reconstruction of PET images are able to influence the extraction of RaF and therefore affect the results of such analysis [16,17,18].
The aim of this study is therefore to analyze the value of baseline [18F]FDG PET/CT RaF and ML models for the prediction of final histological diagnosis in patients with stage I and stage II NSCLC and also to assess the influence of different scanners on this scenario.

2. Materials and Methods

2.1. Patient Selection

We retrospectively screened our database in order to find patients submitted to our center to perform [18F]FDG PET/CT for the initial staging of NSCLC. The screening was performed from January 2014 until February 2022 and a total of 2332 subjects were selected. Inclusion criteria were the presence of a histologically proven diagnosis of stage I or stage II NSCLC, the presence of a baseline [18F]FDG PET/CT performed before any treatment and the presence of tracer uptake by NSCLC higher than liver uptake. After applying such inclusion criteria, 233 patients were included in the study.
Clinicopathological information including gender, age, size of NSCLC measured on histological evaluation, grading, lobe involved by the disease, therapy performed, TNM category and American Joint Commission on Cancer (AJCC) VIII Edition stage were collected. Furthermore, histological classification was collected and since only 6 patients had the presence of adenosquamous carcinoma, they were excluded from the present study. A total of 227 patients were therefore finally included in the study.

2.2. [18F]FDG PET/CT Acquisition and Interpretation

Patients fasted for at least 6 h before tracer injection and had a glucose blood level below 150 mg/dL (mean: 116, standard deviation [SD]: 19, range: 83–148). In order to perform PET/CT scan, 3.5–4.5 MBq/kg of [18F]FDG were intravenously injected to the patients and before images acquisition they were instructed to void. No contrast agent, intestinal preparation with purge or enteric contrast were used.
Images were acquired 60 min after radiotracer injection, from the vertex to the midthigh on two different PET/CT tomographs. The first one (scanner 1) was a Discovery 690 PET/CT (General Electric Company, Milwaukee, WI, USA) while the second (scanner 2) was a Discovery STE PET/CT (General Electric Company, Milwaukee, WI, USA). On both, standard acquisition parameters (CT: 80 mA, 120 kV without contrast; 2.5–4 min per bed PET-step, axial width 15 cm) and standard reconstruction parameters were used (256 × 256 matrix and 60 cm field of view). Furthermore, scanner 1 had LYSO (cerium-doped lutetium yttrium oxyorthosilicate) scintillator crystals with a decay time of 45 ns, while scanner 2 had BGO (bismuth germanate) scintillator crystals with a decay time of 300 ns. Scanners were not harmonized with a cross-calibration program and all PET/CT scans were acquired at free-breath, instructing the patients to have regular breathing. For anatomical correlation and to perform attenuation correction, a low dose CT at free breathing and without contrast agent was acquired for both the scanners. More in detail, CT acquisition parameters for scanner 1 were: 120 kV, fixed tube current ≈ 60 mAs (40–100 mAs), 64 slices × 3.75 mm and 3.27 mm interval, pitch 0.984:1, tube rotation 0.5 s. CT acquisition parameters for scanner 2 were: 120 kV, fixed tube current ≈ 73 mAs (40–160 mAs), 4 slices × 3.75 mm and 3.27 mm interval, pitch 1.5:1, tube rotation 0.8 s. Furthermore, on scanner 1 time of flight (TOF) and point spread function (PSF) algorithm were used for the reconstruction of images, with filter cut-off 5 mm, 18 subsets and 3 iterations. Moreover, on scanner 2 an ordered subset expectation maximization (OSEM) algorithm with filter cut-off 5 mm, 21 subsets and 2 iterations was used.
PET images were visually and semiquantitatively analyzed by a nuclear physician with at least 10 years of experience and every focal tracer uptake deviating from physiological distribution and background was regarded as suggestive of disease localization.

2.3. Radiomics Features Extraction

Before RaF extraction, PET/CT scan were segmented with LIFEx 2.20 software (http://www.lifexsoft.org, accessed on 10 September 2021) by manually drawing volume of interest (VOI) on hypermetabolic lesions [26]. RaF were extracted using Moddicom [27], a software library compliant with the Image Biomarker Standardization Initiative [28]. For each image series, a set of 216 features were extracted, grouped in three main families: morphological (12), grey-level histogram based (21) and textural (190).

2.4. Statistical Analysis

Statistical analyses were performed using R (version 3.6.3). The descriptive analysis of categorical variables comprised the calculation of simple and relative frequencies. The numeric variables were described as mean, SD, minimum and maximum (range).
The general statistical analysis line of the study was structured of various steps and was aimed at training a predictive model testing different Machine Learning (ML) approaches: Logistic Regressor (LR), k-nearest neighbors (kNN), Decision Tree (DT) and Random Forest (RF). Due to the nature of the used ML techniques, different approaches were used to cope with the feature selection strategy. For LR and kNN, for example, to reduce the complexity of the space, we used a Wilcoxon analysis after a 50-cross fold validation for all RaF and we removed the feature poorly correlated with the outcome. The 50-cross fold validation was performed randomly splitting the cohort in 80% for training and 20% per validation, 50 times. In this setting, a p-value ≤ 0.001 was considered as cutoff. DT and RF, on the other hand, did not need any preliminary feature selection strategy because they operate a feature selection exploiting a measure of the GINI index to split the node in the different branches (and this operation can be performed also with a relatively high number of variables). The ML models were than trained to find the best model and the corresponding most representative RaF. The different models were trained in the following ways:
  • LR: a bivariate Logistic Regressor was trained using the RaF survived at the feature selection strategy. All the possible couple of RaF with a Spearman’s correlation coefficient lower than 0.3 were tested and only the LR models were both the p-values were lower than 0.05 were considered for the testing. This bivariate analysis was conducted in order to classify these couples based on the area under the curve (AUC) value of the receiver operating characteristic (ROC) analysis. The entire process was repeated in a 50 cross-fold validation, in order to be able to measure the mean and the SD of the AUCs, for each tested couple of RaF.
  • kNN: kNN was trained with a 50 cross-fold validation technique for each couple of RaF tested for LR. This was done to assess the different performances between LR and kNN on the same couple of RaF. Again, mean and SD of the AUCs were measured.
  • DT and RF were tested with a 50 cross-fold validation technique on all the available RaF. In this case, for each run of the cross-fold validation, only two model were trained (one for DT and one for RF) and the mean and the SD of the AUC were measured on the base of the 50 different training runs.
The entire aforementioned analyses were performed by considering the performances of scanner 1 alone, scanner 2 alone and both the scanners together.

3. Results

Among the total number of 227 patients included in the study, 147 were men (64.8%) and the mean age was 70 years (SD: 8, range: 38–87). The mean size of neoplasms was 32 mm (SD: 15, range: 7–69). Regarding histological classification of NSCLC, 169 patients (74.4%) had ADK while 58 (25.6%) were affected by SCC. In this setting, the lobe involved by disease was represented by left superior lobe (LSL) in 54 patients (23.8%), left inferior lobe (LIL) in 39 (17.2%), right superior lobe (RSL) in 82 (36.1%), middle lobe (ML) in 7 (3.1%) and right inferior lobe (RIL) in 47 (19.8%). For what concerns the TNM classification, 75 patients (33.0%) had a T1 disease, 114 (50.2%) had a T2 disease, while 38 (16.7%) had a T3 disease. In particular, 1 (0.4%) had a T1mi disease, 26 (11.5%) had a T1a disease, 31 (13.7%) had a T1b disease, 11 (4.8%) had a T1c disease, 2 (0.9%) had a T1aN1 disease, 3 (1.3%) had a T1bN1 disease, 1 (0.4%) had a T1cN1 disease, 59 (26.0%) had a T2a disease, 23 (10.1%) had a T2b disease, 23 (10.1%) had a T2aN1 disease, 9 (4.0%) had a T2bN1 disease and 38 (16.7%) had a T3 disease. As a consequence, the presence of nodal localization of disease was reported in 38 patients (16.7%) (Table 1).
According to the VIII edition of AJCC staging system, 128 patients (56.4%) had stage I disease while 99 (43.6%) had stage II disease. In particular, 27 (11.9%) subjects had stage IA1 disease, 31 (13.7%) had stage IA2 disease, 11 (4.8%) had stage IA3 disease, 59 (26.0%) had stage IB disease, 23 (10.1%) had stage IIA disease while 76 (33.5%) had stage IIB disease.
Data about the grading of disease were available only for 100 patients and in this setting 1 patient (1.0%) had a G1 disease, 42 (42.0%) had G2 disease, while 57 (57.0%) had G3 disease. Furthermore, a total of 142 (62.6%) scans were performed on the Discovery 690 tomograph (scanner 1), while 85 (37.4%) of them were acquired on the Discovery STE tomograph (scanner 2). Analyzing PET/CT acquisition depending on the tomograph used for their execution, in 107 (75.4%) scans performed on scanner 1 the presence of ADK was revealed while in 35 (24.6%) the presence of SCC was reported. On scanner 2, ADK was present in 66 (77.6%) subjects while SCC was reported in 19 (22.4%) cases. No significant difference in terms of final diagnosis was reported between the 2 scanners (p-value 0.7).
RaF selection analyses and cross correlation matrixes before applying the ML models for all the scanners are presented in Figure 1. After the cross-correlation selection, scanner 2 had more removed variables, compared to scanner 1 and both the scanners considered together.
The best results of LR analysis for scanner 1, scanner 2 and both the scanners together are presented in Table 2. In this setting, for scanner 1 L_least, F_cm.2.5Dmerged.diff.entr, L_major and F_cm_2.5D.diff.entr were between the RaF with the best performances and AUCs over 0.8 were reported. On scanner 2 the best performances were obtained by F_cm.clust.shade, F_cm_merged.inv.var, F_cm.inv.var and F_cm_merged.clust.shade, with AUCs comprised between 0.6 and 0.8. When considering the combination of the two scanner together L_major, F_rlm.2.5Dmerged.sre, F_stat.entropy and F_rlm_2.5D.sre were part of the couples with the best performances, again with AUCs comprised between 0.6 and 0.8.
The couples of variables that presented the best performances at kNN analyses for all the scanners are presented in Table 3. For scanner 1 F_stat.entropy, F_cm_2.5D.clust.shade, F_cm.2.5Dmerged.clust.shade, F_morph.surface, F_cm_merged.clust.prom and F_cm.clust.prom had the best performances, with general AUCs above 0.8. Moreover, for scanner 2 the best performances were obtained by F_stat.median, F_cm.clust.shade, F_cm_merged.clust.shade and F_cm.joint.max, with the AUCs values that were comprised between 0.6 and 0.8. In the general analysis considering all the scanner, F_stat.uniformity, F_cm.clust.shade, F_cm_merged.clust.shade, F_rlm.lre, F_cm.2.5Dmerged.sum.entr and F_cm_2.5D.clust.shade were part of the couples of RaF with best performances, with AUCs above 0.8. In this setting, a visual representation of the best combination of RaF for kNN and LR are presented in Figure 2.
Furthermore, RF and TM were also applied to our cohort and a comparison of the performances of such analyses with LR and kNN for all the PET/CT scanners are presented in Figure 3 and Table 4. In this setting, TM had the lower performances compared to other models and both TM and RF had AUC values that were lower and more heterogeneous compared to LR and kNN. In particular, for scanner 1 mean AUC for LR, kNN, RF and TM were 0.852, 0.882, 0.793 and 0.701, respectively; for scanner 2 mean AUC were 0.777, 0.870, 0.703 and 0.496, respectively, while when considering both the scanner together such values were 0.784, 0.859, 0.775 and 0.682, respectively.

4. Discussion

As previously underlined in the literature, the technology of the scanner used to acquire PET/CT images directly afflicts the subsequent extraction of RaF and, in this setting, the use of different tomographs in the same department is frequent in daily practice [7,17,18]. These insights suggest that different scanners can potentially have different preferred features in terms of correlations with a clinical outcome and that radiomics models coming from centers adopting different technologies should be critically considered. In our cohort, we had to deal with the presence of two different scanners and our results confirmed this point. Due to the different technologies, for example, we reported a decreasing order of AUCs values for LR trained/tested, respectively on scanner 1, both scanners and on scanner 2. As expected, the p-values of the features in the related models were lower for scanner 1, both scanners and scanner 2, respectively. Moreover, the selection of RaF before applying any ML models selected only a small sample of features for scanner 2 compared to scanner 1 and the analysis for both scanners. This evidence was confirmed also when considering the kNN model, with generally higher AUC values and lower p-values for scanner 1 and both the scanner compared to scanner 2. The higher performance of kNN can be probably due to the linear limitations of the LR modes; on the other hand, LR provides a more communicative model which can be easily plotted in a 2D space or shaped in form of normogram. Our findings are therefore in line with the concept that different PET/CT technologies can influence the contouring and/or the feature values, and therefore the performances, of RaF. In contrast with our findings, Ma et al. [20] revealed that in a large cohort of NSCLC, for most RaF the influence of different scanner on their extraction was not present. However, this is not completely unexpected: in their study they used two different tomographs with a different technological gap and different reconstructing protocols and acquisition parameters. Even if, in their case, the gap between the two scanners was not pivotal, it still remains, in general, an open challenge in this kind of analysis.
Our study was performed by comparing different ML models. In this setting, the models with best performances in the analyses for all the scanners were kNN and LR and, in general, the kNN model had better performances compared to the others. A possible reason of the highest performances of kNN can be due to its non-linearity in cutting the space. This can surely be a pros but, on the other hand, it can carry to a higher risk of overfitting. In addition, a kNN model is computational expensive (to make a prediction the DSS needs to compare the distances with all the cases in the data base) and cannot resumed in an easy-to-use graphical representation. On the other hand, LR has lower but similar performances, reduces the risk of overfitting, clearly show the role of the features in terms of p-value and can be easily reshaped in form of easy-to-use nomogram. In general, in choosing the best predictor, all the aforementioned points should be qualitatively considered, to reach the best trade-off for the specific needs. RF and TM had lower mean AUCs compared to the aforementioned models, and moreover such values were really heterogeneous in each analysis based on the single scanner. Furthermore, as previously mentioned, the impact of the scanner on such analyses is clearly evident and also in this scenario this evidence is confirmed: mean AUC values for scanner 2 were the lowest compared to scanner 1 and both the scanner considered together.
In general, our results underline the ability of [18F]FDG PET/CT RaF to discriminate between ADK and SCC in stage I and stage II NSCLC. The first study to investigate the ability of RaF in this setting was proposed by Ha et al. [23] who reported that these NSCLC entities had different tumoral heterogeneity, with 15 RaF that were able to discriminate between them and that a linear discriminant analysis with such parameters was able to clearly classify them with high performances. More recently, Kim et al. [24] reported that tumor heterogeneity of [18F]FDG uptake was significantly different between ADK and SCC and such parameter was able to predict recurrence of ADC but not SCC in patients who have undergone curative surgery. Orhlac et al. [22] reported the role of different resampling method on RaF in distinguish between ADC and SCC, underling that textural parameters using absolute resampling can vary in function of the cancer subtype more than in relative resampling. An interesting study by Ma et al. [20] investigated the role of texture and colour analyses in differentiating between NSCLC subtypes in a large cohort. They revealed that a combination of both methods had higher performances compared to single method in differentiating between SCC and ADK, with an AUC of 0.89. More recently, Bianconi et al. [19] reported that SCC had significantly higher degree of heterogeneity, stronger variability and lower uniformity of [18F]FDG uptake compared to other subtypes, while ADK had lower heterogeneity, weaker variability and higher uniformity compared to other subtypes of NSCLC. Lastly, Aydos et al. [21] revealed that kurtosis was the only RaF able to differentiate between such histological subtypes, given the fact that in SCC it was significantly lower compared to ADK. Interestingly, some RaF were also able to differentiate between moderate and poorly differentiated ADK.
Generally speaking, even with a high degree of heterogeneity in terms of number of patients, number of scanners used, methods used for RaF extraction and analyses of such parameters, a promising role for texture analysis in differentiating between NSCLC has emerged. In this setting, our study confirms such evidence, reflecting also the influence of different scanners in this context that, as mentioned, can be frequent in daily practice.
Our work is not without limitations. First of all, this is a retrospective study. Second, conventional PET/CT scanners but not last-generation tomographs are used. Moreover, even if characterized by the presence of a relatively high number of patients, in particular compared to similar works, the sample of patients included still appears sub-optimal to clearly evaluate the predictive abilities of texture analysis. The fact that the two tomographs used in our work had different reconstruction algorithm can be another confounding factor. Lastly, the problem of the reproducibility of radiomics analysis in terms of multicentric evaluation is still an open issue and further research in this field is mandatory.

5. Conclusions

In conclusion, our study enabled the selection of some [18F]FDG PET/CT RaF and ML models that are able to predict with good performances the histological subtype of NSCLC. Furthermore, evident influences of the type of PET/CT scanner on such performances were underlined.

Author Contributions

Conceptualization, F.D. and R.G.; methodology, R.G. and F.D.; formal analysis, R.G., P.B., L.C.; writing—original draft preparation, F.D., R.G., D.A., F.B.; writing—review and editing, F.D., D.A., L.C., G.T., F.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical review and approval were waived for this study, due to the retrospective design of the study according to the local laws.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data available on request due to privacy/ethical restrictions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Park, S.Y.; Cho, A.; Yu, W.S.; Lee, C.Y.; Lee, J.G.; Kim, D.J.; Chung, K.Y. Prognostic value of total lesion glycolysis by 18F-FDG PET/CT in surgically resected stage IA non-small cell lung cancer. J. Nucl. Med. 2015, 56, 45–49. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2018, 68, 394–424. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Rami-Porta, R.; Bolejack, V.; Giroux, D.J.; Chansky, K.; Crowley, J.; Asamura, H.; Goldstraw, P.; on behalf of the International Association for the Study of Lung Cancer Staging and Prognostic Factors Committee, Advisory Board Members and Participating Institutions. The IASLC lung cancer staging project: The new database to inform the eighth edition of the TNM classification of lung cancer. J. Thorac. Oncol. 2014, 9, 1618–1624. [Google Scholar] [PubMed] [Green Version]
  4. Duma, N.; Santana-Davila, R.; Molina, J.R. Non-Small Cell Lung Cancer: Epidemiology, Screening, Diagnosis, and Treatment. Mayo Clin. Proc. 2019, 94, 1623–1640. [Google Scholar] [CrossRef] [PubMed]
  5. Goldstraw, P.; Chansky, K.; Crowley, J.; Rami-Porta, R.; Asamura, H.; Eberhardt, W.E.; Nicholson, A.G.; Groome, P.; Mitchell, A.; Bolejack, V. The IASLC lung cancer staging project: Proposals for revision of the TNM stage groupings in the forthcoming (eighth) edition of the TNM classification for lung cancer. J. Thorac. Oncol. 2016, 11, 39–51. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Kocher, F.; Hilbe, W.; Seeber, A.; Pircher, A.; Schmid, T.; Greil, R.; Auberger, J.; Nevinny-Stickel, M.; Sterlacci, W.; Tzankov, A.; et al. Longitudinal analysis of 2293 NSCLC patients: A comprehensive study from the TYROL registry. Lung Cancer 2015, 87, 193–200. [Google Scholar] [CrossRef] [PubMed]
  7. Gallamini, A.; Zwarthoed, C.; Borra, A. Positron Emission Tomography (PET) in Oncology. Cancers 2014, 6, 1821–1889. [Google Scholar] [CrossRef] [Green Version]
  8. Albano, D.; Gatta, R.; Marini, M.; Rodella, C.; Camoni, L.; Dondi, F.; Giubbini, R.; Bertagna, F. Role of 18F-FDG PET/CT Radiomics Features in the Differential Diagnosis of Solitary Pulmonary Nodules: Diagnostic Accuracy and Comparison between Two Different PET/CT Scanners. J. Clin. Med. 2021, 10, 5064. [Google Scholar] [CrossRef]
  9. Martini, N.; Bains, M.S.; Burt, M.E.; Zakowski, M.F.; McCormack, P.; Rusch, V.W.; Ginsberg, R.J. Incidence and local recurrence and second primary tumors in resected stage I lung cancer. J. Thorac. Cardiovasc. Surg. 1995, 109, 120–129. [Google Scholar] [CrossRef] [Green Version]
  10. Goya, T.; Asamura, H.; Yoshimura, H.; Kato, H.; Shimokata, K.; Tsuchiya, R.; Sohara, Y.; Miya, T.; Miyaoka, E. Japanese Joint Committee of Lung Cancer Registry. Prognosis of 6644 resected non-small cell lung cancers in Japan: A Japanese lung cancer registry study. Lung Cancer 2005, 50, 227–234. [Google Scholar] [CrossRef]
  11. Harpole, D.H., Jr.; Herndon, J.E., II; Young, W.G., Jr.; Wolfe, W.G.; Sabiston, D.C., Jr. Stage I non-small cell lung cancer: A multivariate analysis of treatment methods and patterns of recurrence. Cancer 1995, 76, 787–796. [Google Scholar] [CrossRef] [PubMed]
  12. Dondi, F.; Albano, D.; Cerudelli, E.; Gazzilli, M.; Giubbini, R.; Treglia, G.; Bertagna, F. Radiolabelled PSMA PET/CT or PET/MRI in hepatocellular carcinoma (HCC): A systematic review. Clin. Transl. Imaging 2020, 8, 461–467. [Google Scholar] [CrossRef]
  13. Albano, D.; Dondi, F.; Gazzilli, M.; Giubbini, R.; Bertagna, F. Meta-Analysis of the Diagnostic Performance of 18F-FDG-PET/CT Imaging in Native Valve Endocarditis. JACC Cardiovasc. Imaging 2021, 14, 1063–1065. [Google Scholar] [CrossRef]
  14. Williams, D.E.; Pairolero, P.C.; Davis, C.S.; Bernatz, P.E.; Payne, W.S.; Taylor, W.F.; Uhlenhopp, M.A.; Fontana, R.S. Survival of patients surgically treated for stage I lung cancer. J. Thorac. Cardiovasc. Surg. 1981, 82, 70–76. [Google Scholar] [CrossRef] [PubMed]
  15. Wisnivesky, J.P.; Henschke, C.; McGinn, T.; Iannuzzi, M.C. Prognosis of stage II non small cell lung cancer according to tumor and nodal status at diagnosis. Lung Cancer 2005, 49, 181–186. [Google Scholar] [CrossRef] [PubMed]
  16. Al-Sarraf, N.; Gately, K.; Lucey, J.; Aziz, R.; Doddakula, K.; Wilson, L.; McGovern, E.; Young, V. Clinical implication and prognostic significance of standardised uptake value of primary non-small cell lung cancer on positron emission tomography: Analysis of 176 cases. Eur. J. Cardiothorac. Surg. 2008, 34, 892–897. [Google Scholar] [CrossRef] [Green Version]
  17. Dondi, F.; Pasinetti, N.; Gatta, R.; Albano, D.; Giubbini, R.; Bertagna, F. Comparison between Two Different Scanners for the Evaluation of the Role of 18F-FDG PET/CT Semiquantitative Parameters and Radiomics Features in the Prediction of Final Diagnosis of Thyroid Incidentalomas. J. Clin. Med. 2022, 11, 615. [Google Scholar] [CrossRef]
  18. Sollini, M.; Cozzi, L.; Antunovic, L.; Chiti, A.; Kirienko, M. PET Radiomics in NSCLC: State of the art and a proposal for harmonization of methodology. Sci. Rep. 2017, 7, 358. [Google Scholar] [CrossRef] [Green Version]
  19. Bianconi, F.; Palumbo, I.; Fravolini, M.L.; Chiari, R.; Minestrini, M.; Brunese, L.; Palumbo, B. Texture Analysis on [18F]FDG PET/CT in Non-Small-Cell Lung Cancer: Correlations Between PET Features, CT Features, and Histological Types. Mol. Imaging Biol. 2019, 21, 1200–1209. [Google Scholar] [CrossRef]
  20. Ma, Y.; Feng, W.; Wu, Z.; Liu, M.; Zhang, F.; Liang, Z.; Cui, C.; Huang, J.; Li, X.; Guo, X. Intra-tumoural heterogeneity characterization through texture and colour analysis for differentiation of non-small cell lung carcinoma subtypes. Phys. Med. Biol. 2018, 63, 165018. [Google Scholar] [CrossRef]
  21. Aydos, U.; Ünal, E.R.; Özçelik, M.; Akdemir, D.; Ekinci, Ö.; Taştepe, A.İ.; Memiş, L.; Atay, L.Ö.; Akdemir, Ü.Ö. Texture features of primary tumor on 18F-FDG PET images in non-small cell lung cancer: The relationship between imaging and histopathological parameters. Rev. Esp. Med. Nucl. Imagen Mol. 2021, 40, 343–350. [Google Scholar] [CrossRef] [PubMed]
  22. Orlhac, F.M.; Soussan, M.; Chouahnia, K.; Martinod, E.; Buvat, I. 18F-FDG PET-derived textural indices reflect tissue-specific uptake pattern in non-small cell lung cancer. PLoS ONE 2015, 10, e0145063. [Google Scholar] [CrossRef] [PubMed]
  23. Ha, S.; Choi, H.; Cheon, G.J.; Kang, K.W.; Chung, J.K.; Kim, E.E.; Lee, D.S. Autoclustering of Non-small Cell Lung Carcinoma Subtypes on 18F-FDG PET Using Texture Analysis: A Preliminary Result. Nucl. Med. Mol. Imaging 2014, 48, 278–286. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Kim, D.H.; Jung, J.H.; Son, S.H.; Kim, C.Y.; Hong, C.M.; Oh, J.R.; Jeong, S.Y.; Lee, S.W.; Lee, J.; Ahn, B.C. Prognostic Significance of Intratumoral Metabolic Heterogeneity on 18F-FDG PET/CT in Pathological N0 Non-Small Cell Lung Cancer. Clin. Nucl. Med. 2015, 40, 708–714. [Google Scholar] [CrossRef] [PubMed]
  25. van Gómez López, O.; García Vicente, A.M.; Honguero Martínez, A.F.; Soriano Castrejón, A.M.; Jiménez Londoño, G.A.; Udias, J.M.; León Atance, P. Heterogeneity in [18F]fluorodeoxyglucose positron emission tomography/computed tomography of non-small cell lung carcinoma and its relationship to metabolic parameters and pathologic staging. Mol. Imaging 2014, 13. [Google Scholar] [CrossRef]
  26. Nioche, C.; Orlhac, F.; Boughdad, S.; Reuzé, S.; Goya-Outi, J.; Robert, C.; Pellot-Barakat, C.; Soussan, M.; Frouin, F.; Buvat, I. LIFEx: A Freeware for Radiomic Feature Calculation in Multimodality Imaging to Accelerate Advances in the Characterization of Tumor Heterogeneity. Cancer Res. 2018, 78, 4786–4789. [Google Scholar] [CrossRef] [Green Version]
  27. Dinapoli, N.; Alitto, A.R.; Vallati, M.; Gatta, R.; Autorino, R.; Boldrini, L.; Damiani, A.; Valentini, V. Moddicom: A complete and easily accessible library for prognostic evaluations relying on image features. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 2015, 2015, 771–774. [Google Scholar] [PubMed] [Green Version]
  28. Zwanenburg, A.; Vallières, M.; Abdalah, M.A.; Aerts, H.J.W.L.; Andrearczyk, V.; Apte, A.; Ashrafinia, S.; Bakas, S.; Beukinga, R.J.; Boellaard, R.; et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology 2020, 295, 328–338. [Google Scholar] [CrossRef]
Figure 1. Cross correlation matrixes for radiomics features (RaF) for scanner 1 (A), scanner 2 (B) and both the scanner considered together (C).
Figure 1. Cross correlation matrixes for radiomics features (RaF) for scanner 1 (A), scanner 2 (B) and both the scanner considered together (C).
Jcm 12 00255 g001
Figure 2. Visual representation with the couples of RaF with best performances at LR and k-nearest neighbors (kNN) analyses for scanner 1 (A), scanner 2 (B) and both the scanner considered together (C).
Figure 2. Visual representation with the couples of RaF with best performances at LR and k-nearest neighbors (kNN) analyses for scanner 1 (A), scanner 2 (B) and both the scanner considered together (C).
Jcm 12 00255 g002
Figure 3. Comparison between LR, kNN, random forest and tree models performances for scanner 1 (A), scanner 2 (B) and both the scanner considered together (C).
Figure 3. Comparison between LR, kNN, random forest and tree models performances for scanner 1 (A), scanner 2 (B) and both the scanner considered together (C).
Jcm 12 00255 g003
Table 1. Characteristics of the 227 patients included in the study.
Table 1. Characteristics of the 227 patients included in the study.
Characteristicn (%)
Sex
  Male147 (64.8%)
  Female80 (35.2%)
Age (mean ± SD, range)70 ± 8, 38–87
Histology
  Adenocarcinoma169 (74.4%)
  Squamous cell carcinoma58 (25.6%)
Size (mean ± SD, range) (mm)32 ± 15, 7–69
Grading *
  G11 (1.0%)
  G2 42 (42.0%)
  G357 (57.0%)
Lobe
  LSL54 (23.8%)
  LIL39 (17.2%)
  RSL82 (36.1%)
  ML7 (3.1%)
  RIL45 (19.8%)
TNM stage
  T1mi1 (0.4%)
  T1a26 (11.5%)
  T1b31 (13.7%)
  T1c11 (4.8%)
  T1aN12 (0.9%)
  T1bN13 (1.3%)
  T1cN11 (0.4%)
  T2a59 (26.0%)
  T2b23 (10.1%)
  T2aN123 (10.1%)
  T2bN19 (4.0%)
  T338 (16.7%)
AJCC stage
  I
   IA127 (11.9%)
   IA231 (13.7%)
   IA311 (4.8%)
   IB59 (26.0%)
  II
   IIA23 (10.1%)
   IIB76 (33.5%)
Nodal metastasis
  Yes38 (16.7%)
  No189 (83.3%)
PET/CT scanner
Scanner 1 (Discovery 690) 142 (62.6%)
Scanner 2 (Discovery STE)85 (37.4%)
* Data available only for 100 patients. SD: standard deviation; mm: millimeters; LSL: left superior lobe; LIL: left inferior lobe; RSL: right superior lobe; ML: middle lobe; RIL: right inferior lobe; AJCC: American Joint Commission on Cancer; PET/CT: positron emission tomography/computed tomography.
Table 2. Couples of RaF with best performances at logistic regression analysis for scanner 1, scanner 2 and both scanners together.
Table 2. Couples of RaF with best performances at logistic regression analysis for scanner 1, scanner 2 and both scanners together.
Covariate 1Covariate 2 Mean AUCSD AUCMean p-Value 1Mean p-Value 2
Scanner 1
L_leastF_cm.2.5Dmerged.diff.entr0.8520.015<0.0010.020
L_leastF_cm_2.5D.diff.entr0.8500.017<0.0010.020
L_majorF_cm_2.5D.diff.entr0.8470.018<0.001<0.001
L_majorF_cm.2.5Dmerged.diff.entr0.8460.019<0.001<0.001
F_cm_2.5D.diff.entrF_cm_2.5D.inv.diff.mom.norm0.8450.019<0.001<0.001
F_cm_2.5D.diff.entrF_cm.2.5Dmerged.inv.diff.mom.norm0.8450.019<0.001<0.001
F_cm_2.5D.diff.entrF_szm_2.5D.zsnu0.8450.0180.012<0.001
F_cm.2.5Dmerged.diff.entrF_cm.2.5Dmerged.inv.diff.mom.norm0.8440.019<0.001<0.001
F_cm.diff.entrF_cm_2.5D.inv.diff.mom.norm0.8440.017<0.001<0.001
F_cm.diff.entrF_cm.2.5Dmerged.inv.diff.mom.norm0.8440.016<0.001<0.001
Scanner 2
F_cm.clust.shadeF_cm_merged.inv.var0.7770.0270.0290.021
F_cm.inv.varF_cm.clust.shade0.7770.0280.0190.030
F_cm.inv.varF_cm_merged.clust.shade0.7770.0280.0190.030
F_cm_merged.inv.varF_cm_merged.clust.shade0.7770.0270.0210.029
Scanner 1 + 2
L_majorF_rlm.2.5Dmerged.sre0.7840.020<0.001<0.001
F_stat.entropyF_rlm.2.5Dmerged.sre0.7840.020<0.001<0.001
L_leastF_rlm.2.5Dmerged.sre0.7840.021<0.001<0.001
F_stat.entropyF_rlm_2.5D.sre0.7840.020<0.001<0.001
L_majorF_rlm_2.5D.sre0.7840.020<0.001<0.001
L_leastF_rlm_2.5D.sre0.7840.021<0.001<0.001
L_minorF_rlm.2.5Dmerged.sre0.7820.020<0.001<0.001
L_minorF_rlm_2.5D.sre0.7820.020<0.001<0.001
F_morph.surfaceF_rlm.2.5Dmerged.sre0.7820.021<0.001<0.001
F_szm_2.5D.zsnuF_rlm.2.5Dmerged.sre0.7820.020<0.001<0.001
AUC: area under the curve; SD: standard deviation; RaF: radiomics features.
Table 3. Couples of RaF with best performances at k-nearest neighbors analysis for scanner 1, scanner 2 and both scanners together.
Table 3. Couples of RaF with best performances at k-nearest neighbors analysis for scanner 1, scanner 2 and both scanners together.
Covariate 1Covariate 2 Mean AUCSD AUCMean p-Value 1Mean p-Value 2
Scanner 1
F_stat.entropyF_cm_2.5D.clust.shade0.9380.012<0.0010.271
F_stat.entropyF_cm.2.5Dmerged.clust.shade0.9380.012<0.0010.272
F_morph.surfaceF_cm_merged.clust.prom0.9360.016<0.0010.613
F_morph.surfaceF_cm.clust.prom0.9360.016<0.0010.613
F_stat.entropyF_cm_merged.clust.prom0.9350.013<0.0010.610
F_stat.entropyF_cm.clust.prom0.9350.013<0.0010.611
F_cm.energyF_cm.2.5Dmerged.inv.diff.mom.norm0.9330.0150.0460.008
F_stat.entropyF_cm_2.5D.clust.prom0.9330.013<0.0010.515
F_stat.entropyF_cm.2.5Dmerged.clust.prom0.9330.013<0.0010.516
F_cm.energyF_cm_2.5D.inv.diff.mom.norm0.9320.0140.0440.007
Scanner 2
F_stat.medianF_cm.clust.shade0.9030.0140.0650.026
F_stat.medianF_cm_merged.clust.shade0.9090.0140.0630.026
F_cm.joint.maxF_cm.clust.shade0.8990.0250.0140.140
F_cm.joint.maxF_cm_merged.clust.sade0.8970.0240.0140.139
F_cm.clust.shadeF_cm_2.5D.joint.max0.8930.0160.1110.028
F_cm.2.5Dmerged.energyF_cm.2.5Dmerged.clust.shade0.8920.0210.0130.791
F_cm_2.5D.joint.maxF_cm_merged.clust.shade0.8920.0160.0280.111
F_cm_2.5D.clust.shadeF_cm.2.5Dmerged.energy0.8920.0220.7900.013
F_cm.clust.shadeF_cm.2.5Dmerged.joint.max0.8860.0200.1080.031
F_cm_merged.clust.shadeF_cm.2.5Dmerged.joint.max0.8850.0200.1070.031
Scanner 1 + 2
F_stat.uniformityF_cm.clust.shade0.9120.0110.0080.110
F_stat.uniformityF_cm_merged.clust.shade0.9110.0110.0080.111
F_rlm.lreF_cm.2.5Dmerged.sum.entr0.9100.0100.857<0.001
F_stat.uniformityF_cm_2.5D.clust.shade0.9070.0140.0080.158
F_stat.uniformityF_cm.2.5Dmerged.clust.shade0.9070.0150.0080.158
F_morph.volumeF_rlm_2.5D.gl.var0.9030.0130.1170.030
F_rlm.lreF_cm_2.5D.sum.entr0.9030.0130.838<0.001
F_morph.volumeF_rlm.2.5Dmerged.gl.var0.9030.0140.1170.031
F_stat.uniformityF_cm.diff.avg0.9030.0140.0080.005
F_stat.uniformityF_cm.dissimilarity0.9030.0140.0080.005
AUC: area under the curve; SD: standard deviation; RaF: radiomics features.
Table 4. Predictive efficiency (AUC) of ML models for scanner 1, scanner 2 and both scanners together.
Table 4. Predictive efficiency (AUC) of ML models for scanner 1, scanner 2 and both scanners together.
ML ModelScanner 1Scanner 2Scanner 1 + 2
LR0.8520.7770.784
kNN0.8820.8700.860
RF0.7930.7040.775
DT0.7010.4960.682
AUC: area under the curve; ML: machine learning; LR: Logistic Regressor; kNN: k-Nearest Neighbors; RF: Random Forest; DT: Decision Tree.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dondi, F.; Gatta, R.; Albano, D.; Bellini, P.; Camoni, L.; Treglia, G.; Bertagna, F. Role of Radiomics Features and Machine Learning for the Histological Classification of Stage I and Stage II NSCLC at [18F]FDG PET/CT: A Comparison between Two PET/CT Scanners. J. Clin. Med. 2023, 12, 255. https://doi.org/10.3390/jcm12010255

AMA Style

Dondi F, Gatta R, Albano D, Bellini P, Camoni L, Treglia G, Bertagna F. Role of Radiomics Features and Machine Learning for the Histological Classification of Stage I and Stage II NSCLC at [18F]FDG PET/CT: A Comparison between Two PET/CT Scanners. Journal of Clinical Medicine. 2023; 12(1):255. https://doi.org/10.3390/jcm12010255

Chicago/Turabian Style

Dondi, Francesco, Roberto Gatta, Domenico Albano, Pietro Bellini, Luca Camoni, Giorgio Treglia, and Francesco Bertagna. 2023. "Role of Radiomics Features and Machine Learning for the Histological Classification of Stage I and Stage II NSCLC at [18F]FDG PET/CT: A Comparison between Two PET/CT Scanners" Journal of Clinical Medicine 12, no. 1: 255. https://doi.org/10.3390/jcm12010255

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop