Next Article in Journal
Topical Imiquimod in Primary Cutaneous Extramammary Paget’s Disease: A Systematic Review
Previous Article in Journal
Breast Cancer Treatment: To tARget or Not? That Is the Question
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Neoadjuvant Chemoradiotherapy Response in Rectal Cancer Patients Using Harmonized Radiomics of Multcenter 18F-FDG-PET Image

1
Radiological and Medico-Oncological Sciences, University of Science and Technology, Daejeon 34113, Republic of Korea
2
Division of Applied RI, Korea Institute of Radiological and Medical Sciences, Seoul 07812, Republic of Korea
3
Department of Nuclear Medicine, Soonchunhyang University Bucheon Hospital, Bucheon 14584, Republic of Korea
4
Department of Nuclear Medicine, Korea Institute of Radiological and Medical Sciences, Seoul 07812, Republic of Korea
5
Department of Surgery, Korea Institute of Radiological and Medical Sciences, Seoul 07812, Republic of Korea
*
Author to whom correspondence should be addressed.
Cancers 2023, 15(23), 5662; https://doi.org/10.3390/cancers15235662
Submission received: 5 October 2023 / Revised: 7 November 2023 / Accepted: 28 November 2023 / Published: 30 November 2023
(This article belongs to the Topic Artificial Intelligence in Cancer, Biology and Oncology)

Abstract

:

Simple Summary

Neoadjuvant chemotherapy is the standard treatment for locally advanced rectal cancer. Preoperative chemoradiotherapy yields clinically significant tumor regression; while some patients exhibit a minimal response, others exhibit a complete pathologic response. We developed deep learning and machine learning models to predict chemoradiotherapy response across external tests using multicenter data. The machine learning model, which used harmonized image features extracted from 18F-FDG PET, showed higher performance and demonstrated reproducibility through external tests compared to the deep learning models using 18F-FDG PET images. Our study highlights the feasibility of predicting the chemoradiotherapy response of individual patients using non-invasive and reliable image feature values.

Abstract

We developed machine and deep learning models to predict chemoradiotherapy in rectal cancer using 18F-FDG PET images and harmonized image features extracted from 18F-FDG PET/CT images. Patients diagnosed with pathologic T-stage III rectal cancer with a tumor size > 2 cm were treated with neoadjuvant chemoradiotherapy. Patients with rectal cancer were divided into an internal dataset (n = 116) and an external dataset obtained from a separate institution (n = 40), which were used in the model. AUC was calculated to select image features associated with radiochemotherapy response. In the external test, the machine-learning signature extracted from 18F-FDG PET image features achieved the highest accuracy and AUC value of 0.875 and 0.896. The harmonized first-order radiomics model had a higher efficiency with accuracy and an AUC of 0.771 than the second-order model in the external test. The deep learning model using the balanced dataset showed an accuracy of 0.867 in the internal test but an accuracy of 0.557 in the external test. Deep-learning models using 18F-FDG PET images must be harmonized to demonstrate reproducibility with external data. Harmonized 18F-FDG PET image features as an element of machine learning could help predict chemoradiotherapy responses in external tests with reproducibility.

1. Introduction

More than 100,000 individuals worldwide are diagnosed with rectal cancer annually [1]. Rectal cancer is generally treated with neoadjuvant chemoradiotherapy, and tumor responses to therapy are diverse, with 54–75% of patients experiencing tumor downstaging [2]. The reasons for these changes in treatment response are poorly understood, and there is no exact method for predicting the treatment response [3]. Only 15–27% of patients show no residual viable tumors on pathological examination, pathological complete response (pCR) to chemoradiotherapy, and surgery [4]. An accurate imaging biomarker for predicting and evaluating chemotherapy could the early classification of patients into different prognostic groups and personalized treatment approaches. Early detection of patients who might respond poorly to chemoradiotherapy can provide them the opportunity to undergo surgery and receive enhanced treatments to maximize treatment response.
Medical imaging can be used to noninvasively evaluate therapeutic responses to chemotherapy. Jang et al. developed an MRI-based deep learning model for predicting chemotherapy response in rectal cancer and reported the area under receiver operating characteristic curve (AUC) of 0.76 and an accuracy of 0.85. 18F-FDG PET/CT has also been widely used to monitor treatment response in many types of malignancies, stages, and diagnoses. 18F-FDG PET can help detect glucose metabolism and reveal tumor characteristics. As the anatomical data obtained from CT in rectal cancer patients can help distinguish between physiological and pathological intestinal absorption [5], 18F-FDG PET/CT is generally considered a standard tool for predicting the response to chemotherapy in rectal cancer. The radiomics features of 18F-FDG PET/CT can also facilitate the prediction of chemoradiotherapy. Taking this into consideration, researchers are increasingly exploring the potential of incorporating radiomic features from 18F-FDG PET/CT scans into predictive models to enhance the accuracy and reliability of forecasting responses to chemoradiotherapy.
Recently, the use of machine learning techniques for large and complex biological data analysis has increased. Deep learning techniques are considered among the most powerful tools and are frequently used in bioinformatics because they can allow the analysis of vast amounts of data. Many radiomics studies utilize features extracted by manual method, and these methods are significantly influenced by the knowledge and experience of individual researchers [6]. Consequently, deep learning techniques for computing task-adaptive feature representations by learning layers of complex features directly from medical images are considered suitable tools for predicting prognosis. Deep learning techniques that can automatically learn representative information from raw image data to decode the radiation expression type of tumors can assist in disease diagnosis, prognostic evaluation, and treatment sensitivity prediction [7]. The model performance of deeper hidden layers for pattern recognition has recently begun to surpass that of classical methods in different fields. One of the most popular deep neural networks is the Convolutional Neural Network (CNN). Random forest (RF) technology, which includes an ensemble of decision trees and naturally integrates feature selection and interaction during learning, is a popular choice in personalized medicine. It is nonparametric, efficient, and has a high predictive accuracy for many types of data. RF model is increasingly being adopted because of its advantages in dealing with small sample sizes, high-dimensional feature spaces, and complex data structures [8].
In oncology research, particularly when assessing rectal cancer responses to therapy, the role of SUVmax and SUVmean values derived from 18F-FDG PET/CT scans has been under critical evaluation, as illustrated by several independent studies. Two independent studies showed that the SUVmax predicted chemotherapy with a specificity and overall accuracy of only 35% and 44%, respectively [9,10]. SUVmean, dissimilarity, and contrast from the neighborhood intensity-difference matrix (NGTDM contrast) were significantly and independently associated with OS [11]. A decrease in metabolic tumor volume (MTV) and total lesion glycolysis (TLG) values was suggested to be an indicator of a positive response to chemotherapy [12]. Chemotherapy response predictions using 18F-FDG PET/CT are not sufficiently accurate to distinguish patients showing treatment response from those who respond poorly to the treatment [13]. Several studies have reported that radiation features were scanner or protocol-sensitive, highlighting the importance of harmonizing image features to reduce multicenter variability before pooling data from multiple sites [14,15].
In the present study, we evaluated the use of machine learning to predict chemoradiotherapy responses using radiomics harmonization and demonstrated the reproducibility and repeatability of the findings through rigorous external testing. Our effort is not only to address the limitations of the current methodologies but also to contribute to the development of a more robust and universally applicable predictive model for chemoradiotherapy responses in cancer treatment.

2. Materials and Methods

2.1. Patient Cohort

All patients were diagnosed with pathologic T-stage III rectal cancer, with tumor growth into the outer lining of the bowel wall without breaching its integrity. Patients with a tumor size > 2 cm were treated with neoadjuvant chemoradiotherapy before surgery. The internal and external cohorts comprised 116 patients from internal institutions (Korea Institute of Radiological and Medical Sciences) and 40 patients from independent institutions (Soonchunhyang University Bucheon Hospital). The internal cohort comprised 21 patients diagnosed with pCR and 95 patients diagnosed with non-pCR. The external cohort consisted of six patients diagnosed with pCR and 31 patients diagnosed with non-pCR. The rectal cancer region was cropped from an 18F-FDG PET image (Figure 1).

2.2. Image Feature Extraction

We utilized LIFEx (Local Image Features Extraction, version 4.90) software to calculate image features from 18F-FDG PET/CT images of rectal cancer patients. In total, 55 image features were extracted. The region of interest (ROI) was marked manually with an SUV threshold of 2.0 (Figure 2). Tumor lesions were identified in the area of 18F-FDG uptake, which was pathologically increased and was in contrast to the CT images. To predict chemotherapy response in rectal cancer, first- and second-order images were used separately to compare intensity-based and GLCM-based image characteristics. The AUC was calculated to select the image features from the first- and second-order features using R (version 4.2.2) software (R Foundation for Statistical Computing, Vienna, Austria).

2.3. Harmonization Methodology

Harmonization of the image features from the internal and external 18F-FDG PET/CT datasets was performed. Both of training set and test set were harmonized in separate manner. The harmonization (ComBat) method was used with an online application (https://forlhac.shinyapps.io/Shiny_ComBat/, accessed on 28 November 2023). ComBat is a batch-matching technology initially proposed for gene expression microarrays [16] and has been widely used in the field of imaging. The ComBat model is given by
yij = α + γi + δiεij
where j indicates the specific measurement of image feature y, i indicates the setting of the scanner, protocol effect, or even observer effect (called the site effect), α represents the average value of the image features denoted as y, γi signifies additive batch effect influence on measurement, δi represents multiplicative batch effect, and εij is an error term. Batch i represents the experimental settings employed for y measurement, including the possible scanner effect. Site effects γi and δi can be estimated using conditional posterior means and subsequently corrected using
y i j C o m B a t = y i j α ^ γ i   ^ δ ^ i + α ^
where α ^ ,   γ i   ^ and δ ^ i are estimators of α, γi and δi. yijComBat is the converted yij measured value devoid of the site i effect.

2.4. Deep Learning and Machine Learning

The CNN structure consisted of input, convolution, batch normalization, ReLU, max pooling, linear, dropout, and output layers. The CNN parameters comprised the optimizer, learning rate, and epoch; the values were set to Adam, 0.0002, and 200, respectively. Two convolutional layers are used. The CNN structure was constructed using two-dimensional input slices taken from each patient. The chemotherapy prediction performance of the RF model was internally and externally evaluated using the scikit-learn library (version 1.2.0) in Python (version 3.10.11).
Augmentation techniques were employed to resolve the data imbalance between pCR and non-pCR. The “RandomRotation” function of PyTorch livery in Python were used to randomly rotate input images by a certain angle to increase the diversity of the training dataset. The “RandomResizedCrop” function of PyTorch livery in Python is employed to randomly select a portion of the input image and subsequently resize it, serving the purpose of augmenting the training dataset and enhancing its variety. The Synthetic minority oversampling technique was implemented on the training dataset for machine learning to mitigate data imbalance.
After splitting the internal dataset at a 7:3 ratio, internal test were performed for both models through evaluating AUC, accuracy, precision, and sensitivity. External test were proceed using independent institution dataset. Confusion matrix-based evaluation metrics including accuracy, sensitivity and precision were estimated and the threshold probability was adjusted to the value that maximizes Youden’s index.

3. Results

3.1. Patients Cohort

18F-FDG PET/CT images from 116 internal and 40 external datasets were used for model estimation. The average ages of the internal and external datasets were 61.85 years and 59.88, respectively. The internal cohort comprised 75 males (64.66%) and 41 females (35.34%). The external cohort comprised 27 males (67.5%) and 13 females (32.5%). A summary of the demographic characteristics and pathological TNM stages is presented in Table 1. The patient cohort included patients who developed lymph node- or distant organ-metastases.

3.2. Evaluation of Deep Learning Model

The CNN model for rectal cancer chemoradiotherapy prediction was developed using 18F-FDG PET images. The number of pCR data points from the internal and external data increased through augmentation to 84 and 24, respectively. To equalize the amount of pCR and non-pCR data, the pCR data from the internal and external cohorts were decreased by random sampling. The deep learning model showed a performance, with an accuracy of 0.867 and 0.789 in the internal test (Table 2). However, in the external test, the deep learning signature showed an accuracy of 0.557 and 0.355 (Table 3). The deep learning models showed higher performance in internal test then external test.

3.3. Image Feature Extraction and Harmonization

A total of 55 image featuers were quantitatively calculated from 18F-FDG PET and CT images. The image features were separated into first-order features, including conventional indices, shapes, and histogram-based intensity values (n = 23). The image texture features were assigned as second-order features, including a Gray-level co-occurrence matrix (GLCM), neighborhood gray-level difference matrix (NGLDM), Gray-level run-length matrix (GLRLM), and Gray-level zone length matrix (GLZLM) (n = 22) (Figure 2). AUC was calculated to determine image features capable of distinguishing between chemotherapy and non-PCR cases. Subsequently, image features from the internal dataset were selected and used for machine learning. First-order features extracted from 18F-FDG PET and CT with AUC over 0.65 and 0.55 were used for machine learning, respectively (Table 4). Second-order features extracted from 18F-FDG PET and CT with AUC over 0.7 and 0.6 were used for machine learning, respectively (Table 5). Image feature values from internal and external institutions were harmonized to reduce multicenter variations. GLZLM GLNU, which had the largest change in the distribution of values before and after harmonization, was visualized (Figure 3).

3.4. Evaluation of Machine Learning Model

The extracted primary and secondary features were used as variables for the RF model, and each model was evaluated using internal and external tests. The RF model using harmonized first-order features showed an accuracy and AUC of 0.771, which is higher than before harmonization in the external test. The RF model using secondary features exhibited an accuracy and AUC of 0.675 and 0.603 in the external test after harmonization, lower than those without harmonization. The first-order features showed higher accuracy and AUC for the external datasets than the second-order features. In the external test set, the 18F-FDG PET image feature as a machine learning signature achieved the highest accuracy with an AUC value of 0.875 and 0.896 (95% confidence interval 0.562–1) (Table 6).

4. Discussion

The performance of the machine learning models in predicting chemoradiotherapy response using imaging features extracted from 18F-FDG PET images was estimated using an external test. Conducting multicenter studies is one of the main objectives of clinical applications. However, medical images acquired from different institutions may introduce biases due to variations in imaging devices, data acquisition methods, and protocols [17,18]. Because radiomics is sensitive, variations in feature values may occur even in cases where the same feature is extracted from multiple organs. Large-scale radiomic data analysis is required to verify the reproducibility of radiomics, and radiomic features extracted from images acquired from different centers must be integrated. In this study, radiomics harmonization was performed to reduce batch effects. Our results indicated that the harmonization of image features extracted from multiple datasets is essential as a predictor.
In several studies related to cancers, the RF model has shown a high potential in predicting clinical outcomes [19,20,21,22]. The RF model demonstrated reproducibility and repeatability in external tests when utilizing the features extracted from 18F-FDG PET images. Because the RF model generates predictions by randomly selecting a decision tree, it mitigates the risk of overfitting. As it traverses the decision tree, it learns the image features that best encapsulate the discriminatory factors for distinguishing tumor characteristics. Moreover, it is expected to yield superior outcomes because it employs an optimal cut-off value for discriminating between pCR and non-pCR patients based on image features. These attributes of the RF model appear to have further enhanced its predictive accuracy and AUC in the context of chemoradiotherapy prognosis.
Medical imaging offers vital insights into the progress of patients with rectal cancer, and AI holds promise for developing quantitative treatment decision support tools. Some studies have shown that tumor metabolic changes on 18F-FDG PET were more predictive than tumor morphological modifications on CT [23,24,25]. In our study, image features extracted from 18F-FDG PET images showed higher machine learning performance than those extracted from CT images. The imaging features of CT in the external tests showed an accuracy and AUC of 0.425 and 0.593, whereas those extracted from 18F-FDG PET showed an accuracy and AUC of 0.875 and 0.896. Our study indicate that the radiomics of 18F-FDG PET have a more complementary effect then CT in predicting the pCR of rectal cancer. 18F-FDG PET imaging is crucial for monitoring alterations in tumor metabolic activity, playing a vital role in prognostic predictions for patients undergoing concurrent chemoradiotherapy. Although CT imaging provides comprehensive details pertaining to the tumor’s size and shape, excelling in anatomical delineation, it falls short in effectively predicting tumor responses to chemoradiotherapy. This discrepancy highlights a potential limitation in its prognostic utility for this specific therapeutic context. It has been observed that the integration of radiomic features extracted from both 18F-FDG PET and CT into predictive models can lead to a decrement in performance, suggesting a paradoxical reduction in the model’s efficacy despite the amalgamation of data from both imaging techniques. This underscores the need for careful consideration when combining features from different modalities to enhance the accuracy of treatment response predictions.
The first and second selected features for AUC values encompassed those previously identified as having prognostic significance in other investigations. The significance of SUVmax, SUVmean, and Uniformity, which are image feature values, has been demonstrated in previous studies. The secondary features based on GLRLM, NGLDM, and GLRM were incorporated as important variables in the radiochemotherapy prediction model. These feature values have demonstrated their predictive utility in various cancers. When the chemoradiotherapy response was predicted using harmonized first-order features, it showed a higher performance than second-order features. The first-order features were derived from histograms, whereas the second-order features were based on the GLCM. As the first-order values exhibited significant alterations following harmonization, the impact of harmonization is noteworthy. Conversely, the second-order values displayed negligible changes after harmonization. Consequently, the model utilizing first-order features exhibited superior performance in predicting rectal cancer chemotherapy outcomes.
There are several 18F-FDG PET/CT predictive radiomics for pCR to chemotherapy, including visual response, maximum standardized uptake value (SUVmax), percentage SUVmax reduction, TLG, and MTV [26,27,28,29]. Lovinfosse et al. revealed that SUVmean, dissimilarity, and contrast from contrast NGTDM were significantly and independently associated with OS in patients with rectal cancer. Jean-Emmanuel et al. predicted a complete response using a deep neural network after rectal chemoradiotherapy with 80% accuracy in a multicenter cohort using radiomics extracted from CT. Xiaolu M et al. The RF model for the degree of differentiation, T-stage, and N-stage were obtained using radiomics from MRI (AUC, 0.746; 95% CI, 0.622–0.872; sensitivity, 79.3%; and specificity, 72.2%). Giannini et al. evaluated a logistic regression model using six texture features (five from PET and one from T2w MRI) to determine the chemotherapy outcomes (AUC = 0.86; sensitivity = 86%, and specificity = 83%).
We estimated the performance of the deep learning model in predicting the outcomes of neoadjuvant responses using multicenter 18F-FDG PET images. However, the model performance proved insignificant in external tests conducted with datasets from independent institutions. Deep learning demonstrated subpar performance in external tests owing to the omission of dataset harmonization, which failed to account for potential biases between the internal and external datasets. In the case of machine learning, the difference between the internal and external datasets was drastically reduced through the harmonization of the image feature values shown in the ROI; thus, reproducibility as a predictor of machine learning was confirmed. Batch effects can be mitigated by preprocessing the images employed in deep learning, involving techniques such as slope distortion correction, bias slope distortion correction, bias field correction, and intensity normalization, which help standardize the data [30,31]. Reducing batch effects through harmonization at the image level is expected to show high performance in sufficiently predicting chemotherapy, even in external tests.
Our study has some limitations. Deep learning exhibited a lower performance in external tests than in internal tests. This outcome may be attributed to the absence of harmonization between internal and external datasets. Because the CNN model makes predictions using the image itself, it is necessary to harmonize the image. The number of patients within the presently registered external data may be relatively limited, leading to suboptimal performance in external tests. Deep learning techniques in the realm of medical image analysis are challenged by their black-box characteristics, which pose issues for interpretability. Additionally, given the extensive discussion in this article about how chemotherapy and radiotherapy can significantly increase the risk of infertility for women wishing to conceive in the future, we propose a more proactive approach. Women should be given greater autonomy over their reproductive timelines, particularly through the strategic use of oocyte vitrification prior to undergoing such medical interventions [32].

5. Conclusions

Our research underscores the critical significance of image harmonization in multicenter studies for accurate chemotherapy response prediction in pancreatic cancer while also highlighting the potential of noninvasive radiomics-based machine learning models in predicting neoadjuvant chemoradiotherapy response in rectal cancer. A machine learning model predicting radiochemotherapy outcomes for pancreatic cancer using harmonized 18F-FDG PET imaging features was confirmed to be reproducible and repeatable in external testing using multicenter data. A deep model using 18F-FDG PET images without the harmonization process performed poorly in predicting neoadjuvant chemoradiotherapy response, demonstrating the importance of image harmonization in multicenter studies. We confirmed the possibility of using a machine learning model to predict the chemoradiotherapy response of rectal cancer before treatment using radiomics, which can be obtained noninvasively.

Author Contributions

Conceptualization, H.-M.J., J.Y., U.-S.S., S.C. and S.-K.W.; Methodology, H.-M.J., J.Y. and S.-K.W.; Software, H.-M.J., J.Y. and S.-K.W.; validation, J.Y.; formal analysis, J.Y.; investigation, J.Y.; resources, S.-K.W.; data curation, J.-M.P., J.-H.C., H.S., B.-I.K., U.-S.S., S.M.M. and S.-K.W.; writing—original draft preparation, H.-M.J.; writing—review and editing, H.-M.J., U.-S.S. and S.-K.W.; visualization, J.Y.; supervision, S.-K.W.; project administration, S.-K.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Research Foundation of Korea (NRF) grant funded by the Ministry of Science and ICT (No. 2020M2D9A1094070).

Institutional Review Board Statement

This study was approved by the Institutional Review Board of KIRAMS (IRB No.: KIRAMS 2021-03-009).

Informed Consent Statement

Informed consent was obtained from all participants involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy considerations.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Liu, Z.; Zhang, X.-Y.; Shi, Y.-J.; Wang, L.; Zhu, H.-T.; Tang, Z.; Wang, S.; Li, X.-T.; Tian, J.; Sun, Y.-S.; et al. Radiomics analysis for evaluation of pathological complete response to neoadjuvant chemoradiotherapy in locally advanced rectal cancer. Clin. Cancer Res. 2017, 23, 7253–7262. [Google Scholar] [CrossRef] [PubMed]
  2. Valentini, V.; Coco, C.; Cellini, N.; Picciocchi, A.; Genovesi, D.; Mantini, G.; Barbaro, B.; Cogliandolo, S.; Mattana, C.; Ambesiimpiombato, F.; et al. Preoperative chemoradiation for extraperitoneal T3 rectal cancer: Acute toxicity, tumor response, and sphincter preservation. Int. J. Radiat. Oncol. Biol. Phys. 1998, 40, 1067–1075. [Google Scholar] [CrossRef] [PubMed]
  3. Pham, T.T.; Liney, G.P.; Wong, K.; Barton, M.B. Functional MRI for quantitative treatment response prediction in locally advanced rectal cancer. Br. J. Radiol. 2017, 90, 20151078. [Google Scholar] [CrossRef] [PubMed]
  4. Maas, M.; Nelemans, P.J.; Valentini, V.; Das, P.; Rödel, C.; Kuo, L.J.; Beets, G.L. Long-term outcome in patients with a pathological complete response after chemoradiation for rectal cancer: A pooled analysis of individual patient data. Lancet Oncol. 2010, 11, 835–844. [Google Scholar] [CrossRef]
  5. Maffione, A.; Chondrogiannis, S.; Capirci, C.; Galeotti, F.; Fornasiero, A.; Crepaldi, G.; Grassetto, G.; Rampin, L.; Marzola, M.; Rubello, D. Early prediction of response by 18F-FDG PET/CT during preoperative therapy in locally advanced rectal cancer: A systematic review. Eur. J. Surg. Oncol. (EJSO) 2014, 40, 1186–1194. [Google Scholar] [CrossRef] [PubMed]
  6. Liu, X.; Li, K.W.; Yang, R.; Geng, L.S. Review of deep learning based automatic segmentation for lung cancer radiotherapy. Front. Oncol. 2021, 11, 717039. [Google Scholar] [CrossRef]
  7. Zhong, Y.; She, Y.; Deng, J.; Chen, S.; Wang, T.; Yang, M.; Ma, M.; Song, Y.; Qi, H.; Wang, Y.; et al. Multi-omics Classifier for Pulmonary Nodules (MISSION) Collaborative Group. Deep learning for prediction of N2 metastasis and survival for clinical stage I non–small cell lung cancer. Radiology 2022, 302, 200–211. [Google Scholar] [CrossRef]
  8. Kim, J.; Oh, J.E.; Lee, J.; Kim, M.J.; Hur, B.Y.; Sohn, D.K.; Lee, B. Rectal cancer: Toward fully automatic discrimination of T2 and T3 rectal cancers using deep convolutional neural network. Int. J. Imaging Syst. Technol. 2019, 29, 247–259. [Google Scholar] [CrossRef]
  9. Palma, P.; Conde-Muíño, R.; Rodríguez-Fernández, A.; Segura-Jiménez, I.; Sánchez-Sánchez, R.; Martín-Cano, J.; Gómez-Río, M.; Ferrón, J.A.; Llamas-Elvira, J.M. The value of metabolic imaging to predict tumour response after chemoradiation in locally advanced rectal cancer. Radiat. Oncol. 2010, 5, 1–8. [Google Scholar] [CrossRef]
  10. Martoni, A.A.; Di Fabio, F.; Pinto, C.; Castellucci, P.; Pini, S.; Ceccarelli, C.; Cuicchi, D.; Iacopino, B.; Di Tullio, P.; Giaquinta, S.; et al. Prospective study on the FDG–PET/CT predictive and prognostic values in patients treated with neoadjuvant chemoradiation therapy and radical surgery for locally advanced rectal cancer. Ann. Oncol. 2011, 22, 650–656. [Google Scholar] [CrossRef]
  11. Lovinfosse, P.; Polus, M.; Van Daele, D.; Martinive, P.; Daenen, F.; Hatt, M.; Hustinx, R. FDG PET/CT radiomics for predicting the outcome of locally advanced rectal cancer. Eur. J. Nucl. Med. Mol. Imaging 2018, 45, 365–375. [Google Scholar] [CrossRef]
  12. Sun, W.; Xu, J.; Hu, W.; Zhang, Z.; Shen, W. The role of sequential 18F-FDG PET/CT in predicting tumour response after preoperative chemoradiation for rectal cancer. Color. Dis. 2013, 15, e231–e238. [Google Scholar] [CrossRef] [PubMed]
  13. Joye, I.; Deroose, C.M.; Vandecaveye, V.; Haustermans, K. The role of diffusion-weighted MRI and 18F-FDG PET/CT in the prediction of pathologic complete response after radiochemotherapy for rectal cancer: A systematic review. Radiother. Oncol. 2014, 113, 158–165. [Google Scholar] [CrossRef] [PubMed]
  14. Orlhac, F.; Eertink, J.J.; Cottereau, A.S.; Zijlstra, J.M.; Thieblemont, C.; Meignan, M.; Boellaard, R.; Buvat, I. A guide to ComBat harmonization of imaging biomarkers in multicenter studies. J. Nucl. Med. 2022, 63, 172–179. [Google Scholar] [CrossRef] [PubMed]
  15. Sampaio, I.W.; Tassi, E.; Bellani, M.; Benedetti, F.; Poletti, S.; Spalletta, G.; Piras, F.; Bianchi, A.M.; Brambilla, P.; Maggioni, E. Comparison of Multi-Site Neuroimaging Data Harmonization Techniques for Machine Learning Applications. In Proceedings of the IEEE EUROCON 2023-20th International Conference on Smart Technologies, Torino, Italy, 6–8 July 2023. [Google Scholar]
  16. Johnson, W.E.; Li, C.; Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 2017, 8, 118–127. [Google Scholar] [CrossRef] [PubMed]
  17. Jovicich, J.; Czanner, S.; Greve, D.; Haley, E.; van Der Kouwe, A.; Gollub, R.; Kennedy, D.; Schmitt, F.; Brown, G.; MacFall, J.; et al. Reliability in multi-site structural MRI studies: Effects of gradient non-linearity correction on phantom and human data. Neuroimage 2006, 30, 436–443. [Google Scholar] [CrossRef] [PubMed]
  18. Shinohara, R.T.; Oh, J.; Nair, G.; Calabresi, P.A.; Davatzikos, C.; Doshi, J.; Henry, R.G.; Kim, G.; Linn, K.A.; Papinutto, N.; et al. Volumetric analysis from a harmonized multisite brain MRI study of a single subject with multiple sclerosis. Am. J. Neuroradiol. 2017, 38, 1501–1509. [Google Scholar] [CrossRef]
  19. Macaulay, B.O.; Aribisala, B.S.; Akande, S.A.; Akinnuwesi, B.A.; Olabanjo, O.A. Breast cancer risk prediction in African women using random forest classifier. Cancer Treat. Res. Commun. 2021, 28, 100396. [Google Scholar] [CrossRef]
  20. Kesler, S.R.; Rao, A.; Blayney, D.W.; Oakley-Girvan, I.A.; Karuturi, M.; Palesh, O. Predicting long-term cognitive outcome following breast cancer with pre-treatment resting state fMRI and random forest machine learning. Front. Hum. Neurosci. 2017, 11, 555. [Google Scholar] [CrossRef]
  21. Li, N.; Luo, P.; Li, C.; Hong, Y.; Zhang, M.; Chen, Z. Analysis of related factors of radiation pneumonia caused by precise radiotherapy of esophageal cancer based on random forest algorithm. Math. Biosci. Eng. 2021, 18, 4477–4490. [Google Scholar] [CrossRef]
  22. Bi, L.; Guo, Y. Development and Validation of the Random Forest Model via Combining CT-PET Image Features and Demographic Data for Distant Metastases among Lung Cancer Patients. J. Healthc. Eng. 2022, 2022, 7793533. [Google Scholar] [CrossRef]
  23. Zhang, J.; Zhao, X.; Zhao, Y.; Zhang, J.; Zhang, Z.; Wang, J.; Wang, Y.; Dai, M.; Han, J. Value of pre-therapy 18F-FDG PET/CT radiomics in predicting EGFR mutation status in patients with non-small cell lung cancer. Eur. J. Nucl. Med. Mol. Imaging 2020, 47, 1137–1146. [Google Scholar] [CrossRef] [PubMed]
  24. Kaira, K.; Higuchi, T.; Naruse, I.; Arisaka, Y.; Tokue, A.; Altan, B.; Suda, S.; Mogi, A.; Shimizu, K.; Sunaga, N.; et al. Metabolic activity by 18F-FDG-PET/CT is predictive of early response after nivolumab in previously treated NSCLC. Eur. J. Nucl. Med. Mol. Imaging 2018, 45, 56–66. [Google Scholar] [CrossRef]
  25. Mu, W.; Tunali, I.; Gray, J.E.; Qi, J.; Schabath, M.B.; Gillies, R.J. Radiomics of 18F-FDG PET/CT images predicts clinical benefit of advanced NSCLC patients to checkpoint blockade immunotherapy. Eur. J. Nucl. Med. Mol. Imaging 2020, 47, 1168–1182. [Google Scholar] [CrossRef]
  26. Huh, J.W.; Min, J.J.; Lee, J.H.; Kim, H.R.; Kim, Y.J. The predictive role of sequential FDG-PET/CT in response of locally advanced rectal cancer to neoadjuvant chemoradiation. Am. J. Clin. Oncol. 2012, 35, 340–344. [Google Scholar] [CrossRef]
  27. Capirci, C.; Rubello, D.; Pasini, F.; Galeotti, F.; Bianchini, E.; Del Favero, G.; Panzavolta, R.; Crepaldi, G.; Rampin, L.; Facci, E.; et al. The role of dual-time combined 18-fluorodeoxyglucose positron emission tomography and computed tomography in the staging and restaging workup of locally advanced rectal cancer, treated with preoperative chemoradiation therapy and radical surgery. Int. J. Radiat. Oncol. Biol. Phys. 2009, 74, 1461–1469. [Google Scholar] [CrossRef]
  28. Melton, G.B.; Lavely, W.C.; Jacene, H.A.; Schulick, R.D.; Choti, M.A.; Wahl, R.L.; Gearhart, S.L. Efficacy of preoperative combined 18-fluorodeoxyglucose positron emission tomography and computed tomography for assessing primary rectal cancer response to neoadjuvant therapy. J. Gastrointest. Surg. 2007, 11, 961–969. [Google Scholar] [CrossRef]
  29. Guillem, J.G.; Puig-La Calle, J.; Akhurst, T.; Tickoo, S.; Ruo, L.; Minsky, B.D.; Gollub, M.J.; Klimstra, D.S.; Mazumdar, M.; Paty, P.B.; et al. Prospective assessment of primary rectal cancer response to preoperative radiation and chemotherapy using 18-fluorodeoxyglucose positron emission tomography. Dis. Colon Rectum 2000, 43, 18–24. [Google Scholar] [CrossRef]
  30. Nyúl, L.G.; Udupa, J.K. On standardizing the MR image intensity scale. Magn. Reson. Med. Off. J. Int. Soc. Magn. Reson. Med. 1999, 42, 1072–1081. [Google Scholar] [CrossRef]
  31. Shinohara, R.T.; Sweeney, E.M.; Goldsmith, J.; Shiee, N.; Mateen, F.J.; Calabresi, P.A.; Jarso, S.; Pham, D.L.; Reich, D.S.; Crainiceanu, C.M. Statistical normalization techniques for magnetic resonance imaging. NeuroImage Clin. 2014, 6, 9–19. [Google Scholar] [CrossRef]
  32. Gullo, G.; Petousis, S.; Papatheodorou, A.; Panagiotidis, Y.; Margioula-Siarkou, C.; Prapas, N.; D’Anna, R.; Perino, A.; Cucinella, G.; Prapas, Y. Closed vs. Open oocyte vitrification methods are equally effective for blastocyst embryo transfers: Prospective study from a sibling oocyte donation program. Gynecol. Obstet. Investig. 2020, 85, 206–212. [Google Scholar] [CrossRef]
Figure 1. The corp process of rectal cancer region from 18F-FDG PET image.
Figure 1. The corp process of rectal cancer region from 18F-FDG PET image.
Cancers 15 05662 g001
Figure 2. Radiomics extracted from 18F-FDG PET/CT.
Figure 2. Radiomics extracted from 18F-FDG PET/CT.
Cancers 15 05662 g002
Figure 3. Distribution of GLZLM GLNU value before and after harmonization: (a) Distribution of GLZLM GLNU extracted from all T-stage patients before harmonization; (b) Distribution of GLZLM GLNU max extracted from all T-stage patients after harmonization.
Figure 3. Distribution of GLZLM GLNU value before and after harmonization: (a) Distribution of GLZLM GLNU extracted from all T-stage patients before harmonization; (b) Distribution of GLZLM GLNU max extracted from all T-stage patients after harmonization.
Cancers 15 05662 g003
Table 1. Characteristics of the study cohort.
Table 1. Characteristics of the study cohort.
CharacteristicsInternal Dataset
(n = 116)
External Dataset
(n = 40)
Chemoradiotherapy response (%)
pCR21 (18.1)6 (15)
non-pCR95 (81.9)34 (85)
Age (%)
<6569 (59.48)23 (57.5)
≥6547 (40.52)17 (42.5)
Mean age (y)61.8559.88
Sex (%)
Male75 (64.66)27 (67.5)
Female41 (35.34)13 (32.5)
Clinical T-stage, n (%)
T3116 (100)40
Clinical N stage (%)
N019 (16.38)5 (12.5)
N131 (26.72)8 (20)
N1a2 (1.72)
N1b13 (11.21)1 (2.5)
N237 (31.9)6 (15)
N2a13 (11.21)12 (30)
N2b1 (0.86)8 (20)
Clinical M stage (%)
M0106 (91.38)32 (80)
M16 (5.17)
M1a3 (2.59)8 (50)
M1b1 (0.86)
Clinical stage (%)
IIA 5 (12.5)
IIB18 (15.52)
IIC
IIIA42 (36.21)21 (52.5)
IIIB46 (39.66)6 (15)
IIIC 8 (20)
IVA10 (8.62)
pCR: pathological complete response.
Table 2. Internal test of CNN model using 18F-FDG PET images.
Table 2. Internal test of CNN model using 18F-FDG PET images.
Number of DataEfficiency Evaluation
Data SetpCRNon-pCRAccuracyPrecisionSensitivityAUC
(95% CI)
Imbalanced21210.8670.8710.8710.903
(0.856–0.949)
Balanced84950.7890.8430.6770.835
(0.804–0.866)
pCR: pathological complete response; AUC: area under receiver operating characteristic curve; CI: Confidence interval.
Table 3. External test of CNN model using 18F-FDG PET images.
Table 3. External test of CNN model using 18F-FDG PET images.
Number of DataEfficiency Evaluation
Data SetpCRNon-pCRAccuracyPrecisionSensitivityAUC
(95% CI)
Imbalanced660.5570.5420.4950.498
(0.412–0.583)
Balanced24250.3550.2410.4750.443
(0.378–0.509)
pCR: pathological complete response; AUC: area under receiver operating characteristic curve; CI: Confidence interval.
Table 4. Extraction of first-order image features by AUC cut-off value.
Table 4. Extraction of first-order image features by AUC cut-off value.
First-Order Image Feature
18F-FDG PETAUCCTAUC
SHAPE Sphericity 0.715Uniformity0.663
SUVQ10.707Entropy log100.659
SUVmean0.694Entropy log20.659
SUVQ30.692SHAPE Compacity0.618
SUVQ20.69SHAPE Volume0.604
Uniformity0.681SUVstd0.6
Entropy log100.677SUVmax0.593
Entropy log20.677SUVQ30.589
SUVstd0.667Kurtosis0.582
SUVmin 0.65ExcessKurtosis0.582
Volume0.663
Sphericity 0.579
Skewness0.578
TLG 0.563
Abbreviations: SUVQ, Standardized Uptake Value Quotient; SUV, Standardized Uptake Value; SUVstd, Standardized Uptake Value Standard Deviation; SUVmin, Standardized Uptake Value Minimum; SHAPE, Sphericity, Histogram Analysis, and Parametric Evaluation; SUVmax, Standardized Uptake Value Maximum; TLG, Total Lesion Glycolysis.
Table 5. Extraction of second-order image features by AUC cut-off value.
Table 5. Extraction of second-order image features by AUC cut-off value.
Second-Order Image Feature
18F-FDG PETAUCCTAUC
GLZLM LZLGE0.766NGLDM Contrast0.704
GLZLM LZE0.765GLZLM ZP0.698
GLRLM GLNU0.763GLRLM LRE0.69
GLRLM SRE0.756GLRLM RP0.69
GLRLM RP0.755GLRLM SRE0.689
GLRLM LRE0.753GLZLM LZLGE0.689
NGLDM Contrast0.74GLCM Homogeneity0.689
GLZLM ZP0.74GLZLM LZE0.685
GLZLM LZHGE0.74GLZLM LZHGE0.683
GLCM Homogeneity0.734GLCM Energy 0.683
NGLDM Busyness0.732GLCM Entropy log100.667
GLRLM LRLGE0.731GLCM Entropy log2 0.667
GLCM Dissimilarity0.71GLCM Dissimilarity0.661
GLCM Contrast 0.702GLRLM GLNU0.647
GLRLM LGRE0.701GLRLM LRHGE0.633
NGLDM Busyness0.628
GLRLM SRHGE0.617
GLCM Contrast 0.613
GLRLM LRLGE0.613
Abbreviations: GLZLM, Gray-Level Zone Length Matrix; LZLGE, Long Zone Low Gray-level Emphasis; LZE, Low Gray-level Zone Emphasis; GLRLM, Gray-Level Run Length Matrix; SRE, Short Run Emphasis; RP, Run Percentage; LRE, Gray-Level Run Length Matrix; NGLDM, Neighborhood Gray-Level Dependence Matrix; ZP, Zone Percentage; LZHGE, Long-Zone High-Grey level Emphasis; GLCM, Gray-Level Co-occurrence Matrix, LRLGE, Long Run Low Gray-level Emphasis; LGRE, Low Gray-level Run Emphasis.
Table 6. Internal and external test of RF model.
Table 6. Internal and external test of RF model.
Image FeatureValueWithout HarmonizationWithout HarmonizationWith Harmonization
Internal TestExternal TestExternal Test
CTPETPET/CTCTPETPET/CTCTPETPET/CT
First orderAccuracy0.540.620.560.550.70.5250.60.6460.771
Precision0.5240.5750.6150.2270.20.190.2220.7690.882
Sensitivity0.880.920.320.8330.3330.6670.6670.4170.625
AUC0.540.620.560.6670.5490.5830.6270.6460.771
95% CI for AUC---0.412–0.9210.291–0.8070.325–0.8420.37–0.8850.469–0.9620.429–0.934
Second orderAccuracy0.520.640.70.4250.5250.70.650.5830.675
Precision0.5160.630.7270.1850.190.250.250.70.632
Sensitivity0.640.680.640.8330.6670.50.6670.2920.5
AUC0.520.640.70.5930.5830.6180.6570.5830.603
95% CI for AUC---0.334–0.8520.325–0.8420.36–0.8760.402–0.9120.562–10.344–0.862
AllAccuracy0.680.760.70.650.6750.7750.4250.8750.725
Precision0.7650.810.6390.2140.2670.3330.1850.9520.333
Sensitivity0.520.680.920.50.6670.50.8330.8330.833
AUC0.680.760.70.5880.6720.6620.5930.8960.77
95% CI for AUC---0.329–0.8470.418–0.9250.556–10.334–0.8520.562–10.536–1
AUC: area under receiver operating characteristic curve; CI: Confidence interval.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ju, H.-M.; Yang, J.; Park, J.-M.; Choi, J.-H.; Song, H.; Kim, B.-I.; Shin, U.-S.; Moon, S.M.; Cho, S.; Woo, S.-K. Prediction of Neoadjuvant Chemoradiotherapy Response in Rectal Cancer Patients Using Harmonized Radiomics of Multcenter 18F-FDG-PET Image. Cancers 2023, 15, 5662. https://doi.org/10.3390/cancers15235662

AMA Style

Ju H-M, Yang J, Park J-M, Choi J-H, Song H, Kim B-I, Shin U-S, Moon SM, Cho S, Woo S-K. Prediction of Neoadjuvant Chemoradiotherapy Response in Rectal Cancer Patients Using Harmonized Radiomics of Multcenter 18F-FDG-PET Image. Cancers. 2023; 15(23):5662. https://doi.org/10.3390/cancers15235662

Chicago/Turabian Style

Ju, Hye-Min, Jingyu Yang, Jung-Mi Park, Joon-Ho Choi, Hyejin Song, Byung-Il Kim, Ui-Sup Shin, Sun Mi Moon, Sangsik Cho, and Sang-Keun Woo. 2023. "Prediction of Neoadjuvant Chemoradiotherapy Response in Rectal Cancer Patients Using Harmonized Radiomics of Multcenter 18F-FDG-PET Image" Cancers 15, no. 23: 5662. https://doi.org/10.3390/cancers15235662

APA Style

Ju, H. -M., Yang, J., Park, J. -M., Choi, J. -H., Song, H., Kim, B. -I., Shin, U. -S., Moon, S. M., Cho, S., & Woo, S. -K. (2023). Prediction of Neoadjuvant Chemoradiotherapy Response in Rectal Cancer Patients Using Harmonized Radiomics of Multcenter 18F-FDG-PET Image. Cancers, 15(23), 5662. https://doi.org/10.3390/cancers15235662

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop