Next Article in Journal
General Features and Novel Gene Signatures That Identify Epstein-Barr Virus-Associated Epithelial Cancers
Next Article in Special Issue
Multitask Learning Radiomics on Longitudinal Imaging to Predict Survival Outcomes following Risk-Adaptive Chemoradiation for Non-Small Cell Lung Cancer
Previous Article in Journal
Vaccine Strategies for Human Papillomavirus-Associated Head and Neck Cancers
Previous Article in Special Issue
Artificial Neural Networks Predicted the Overall Survival and Molecular Subtypes of Diffuse Large B-Cell Lymphoma Using a Pancancer Immune-Oncology Panel
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Impact of Resampling and Denoising Deep Learning Algorithms on Radiomics in Brain Metastases MRI

1
Medical Physics Department, CLCC François Baclesse, 14000 Caen, France
2
UMR GREYC, Normandie University, UNICAEN, ENSICAEN, CNRS, 14000 Caen, France
3
ISTCT/CERVOxy Group, Normandie University, UNICAEN, CEA, CNRS, 14000 Caen, France
4
Radiology Department, CLCC François Baclesse, 14000 Caen, France
5
Radiotherapy Department, CLCC François Baclesse, 14000 Caen, France
6
GenesisCare Theranostics, Building 1 & 11, The Mill, 41-43 Bourke Road, Alexandria, NSW 2015, Australia
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Cancers 2022, 14(1), 36; https://doi.org/10.3390/cancers14010036
Submission received: 29 September 2021 / Revised: 14 December 2021 / Accepted: 18 December 2021 / Published: 22 December 2021
(This article belongs to the Special Issue Medical Imaging and Machine Learning​)

Abstract

:

Simple Summary

Due to the central role of magnetic resonance Imaging (MRI) in the management of patients with cancer, waiting lists exceed clinically relevant delays. For this reason, many research groups and MRI manufacturers develop algorithms as resampling and denoising models to allow faster acquisition time without deterioration in image quality. Whereas these algorithms are available in all new MRI, it is not clear how they will impact image features as well as the validity of statistical model of radiomics which use deep images characteristics to predict treatment outcome. The aim of this study was to develop resampling and denoising deep learning (DL) models and evaluate their impact on radiomics from post-Gd-T1w-MRI brain images with brain metastases. We show that resampling and denoising DL models reconstruct low resolution and noised MRI images acquired quickly into high quality images. While fast acquisition loses most of the radiomic-features and invalidates predictive radiomic models, DL models restore these parameters.

Abstract

Background: Magnetic resonance imaging (MRI) is predominant in the therapeutic management of cancer patients, unfortunately, patients have to wait a long time to get an appointment for examination. Therefore, new MRI devices include deep-learning (DL) solutions to save acquisition time. However, the impact of these algorithms on intensity and texture parameters has been poorly studied. The aim of this study was to evaluate the impact of resampling and denoising DL models on radiomics. Methods: Resampling and denoising DL model was developed on 14,243 T1 brain images from 1.5T-MRI. Radiomics were extracted from 40 brain metastases from 11 patients (2049 images). A total of 104 texture features of DL images were compared to original images with paired t-test, Pearson correlation and concordance-correlation-coefficient (CCC). Results: When two times shorter image acquisition shows strong disparities with the originals concerning the radiomics, with significant differences and loss of correlation of 79.81% and 48.08%, respectively. Interestingly, DL models restore textures with 46.15% of unstable parameters and 25.96% of low CCC and without difference for the first-order intensity parameters. Conclusions: Resampling and denoising DL models reconstruct low resolution and noised MRI images acquired quickly into high quality images. While fast MRI acquisition loses most of the radiomic features, DL models restore these parameters.

1. Introduction

Multimodal imaging is a central diagnostic tool in medicine, especially for the management of patients with cancers. From 2009 to 2019 the number of imaging examinations in the USA increased by 18% and 42% for computed tomography (CT) and magnetic resonance imaging (MRI), respectively [1]. In cancer treatment centres, MRI is used for the diagnosis and treatment follow-up of patients, which has placed a significant demand on resources. All of these factors have led to an increased delay in obtaining an MRI appointment with waiting times now up to weeks or month(s) in France/Europe (30 days on average) [2]. There is thus a pressing unmet need to reduce MRI acquisition time, to allow for better patient management. To reduce acquisition time, several approaches have been developed, such as partial Fourier transforms and parallel imaging. However, these techniques cause significant image degradation [3,4]. Compressed sensing, a signal processing technique for efficient signal acquisition and reconstruction by finding solutions to underdetermined linear systems, undersample the k-space. This allows for shorter acquisition times and estimation of the non-acquired k-space data through an iterative process [5]. This technique is frequently used in clinical settings to reduce acquisition times but has reached its limits and delays to obtain an MRI acquisition remain too long.
Artificial intelligence (AI) and especially deep learning (DL) -a subset of AI- has seen massive development in medical systems in the last five years [6]. Beyond the use of DL for automated diagnosis from radiology images [7], DL can be used to accelerate image acquisition. In 2019, the New York University Langone Health and Facebook AI Research consortium created a fast MRI challenge [8] to investigate the use of DL to make MRI scans faster while maintaining high image quality. The dataset released contained k-space data, and the idea was to simulate acquisition matrix subsampling by applying a mask in the Fourier domain and learn the mapping between subsampled images and fully sampled images. Thanks to the advances of DL algorithms, manufacturers now sell MRI machines equipped with DL algorithms to accelerate scan times combining them for example with compressed sensing algorithms [9] or with DL denoising algorithms that work directly on the final image, reducing the number of measurements of the signal or number of excitation (NEX) [10]. Finally, in this constantly progressive research field, numerous algorithms have been developed. The most adapted DL architecture model for medical image restoration is convolutional neural networks (CNN) [11] with efficient sub-pixel convolutional neural network (ESPCN) [12] or fully convolutional network (U-Net) [13] architecture.
Despite the Food and Drug Administration (FDA) and European Medicines Agency agreement for routine clinical use, it is not clear how these algorithms will impact the image or affect the validity of statistical models derived from radiomics. Radiomics provide a methodology to extract different features based on intensity, shape or texture from images in order to build predictive models [14]. This approach holds great promise to predict patient prognosis, treatment response or the identification of molecular markers. For example, an overall survival predictive model including radiomics features was computed in lung cancer [15]. A recent review by Lohmann et al. studied the models in patients with brain metastases [16]. They presented 13 radiomic models with an area under the curve (obtained with a receiver operating characteristic curve and which define the specificity and the sensibility) between 0.73 and 0.98 to differentiate between glioblastoma (GBM) and BM, prediction of BM origin, treatment response and overall survival.
This very promising emerging field has numerous pitfalls that have been identified by the radiomics community including study design, data acquisition, segmentation, features calculation and modelling [17]. The stability of these predictive models has to be challenged before their adoption as a standard of care. In this study, we focus on the effect of AI denoising and resampling on radiomics predictive models.
A study of radiomic feature reproducibility with DL algorithms is mandatory in the new generation MRI machines using DL algorithms to confirm the validity of radiomic analysis. We propose using DL models for resampling and denoising to accelerate acquisition time by a factor of ~2 (dividing by two the acquisition matrix and the NEX respectively), and to investigate the impact of these DL models on radiomic feature reproducibility.

2. Materials and Methods

2.1. Patients

This retrospective study was approved by the local institutional review board. Eighty-five patients presenting with brain metastases (BM) referred to our oncological center between January 2017 and December 2019 were included. Post-Gd T1 brain imaging was undertaken for initial diagnosis or treatment efficacy follow-up. MR-004, a national French institution (INDS) defining health research conduct guidelines was used for this study. The study population characteristics are shown in Table 1.

2.2. Magnetic Resonance Imaging (MRI) Acquisition

MRI was performed on an AREA SIEMENS 1.5 Tesla magnet using a brain dedicated 16 channels coil with the patient in a supine position. Prior to the examination patients were injected with 0.2 mL/kg of DOTAREM (500 µmol/mL). After a shimming process and scout imaging scan, tumor gadolinium enhancement was detected with a post-Gd T1 brain sequence (TR/TEeff = 2070/3.15 ms; Angle = 15°; NEX = 1; 208 contiguous slices; resolution = 0.5 × 0.5 × 1 mm; acquisition matrix = 256 × 256 pixels and acquisition time = 4 min 48).

2.3. Dataset, Resampling and Denoisning DL Models

To reduce the acquisition time in MRI, acquisitions can be made with half the acquisition matrix or half the NEX. However, this leads to undersampled and noisy images. Therefore, a supervised learning approach was used to “learn” a function that maps low quality images (acquired rapidly) to high quality images (acquired slowly). As it was not possible to obtain true downsampled and noisy images owing to the limited availability of the MRI, downsampled and noisy images were simulated by halving the acquisition matrix using linear interpolation to decrease spatial resolution and adding Rician noise in the MRI image, respectively. The NEX was halved by transforming the image in the Fourier domain then adding an additive random Gaussian noise both to the real part and the imaginary part. The magnitude of the noisy complex image was then computed before finally transforming it back to the spatial (pixel) domain [18,19]. The flow diagram of the method used in this study is presented in Figure S1.
DL models were developed with a total of 14,243 unique post-Gd T1 brain images obtained from 85 acquisitions which were split into 9756, 2438 and 2049 images for training, validation, and testing, respectively. DL models were developed using the Keras python library [20] which is based on a U-Net architecture [21], described in Figure 1.
The loss function used to train the model is described below:
M i x E Y ,   Y ^ = M S E Y ,   Y ^ + 0 ,   1   M G E G Y , G ^ Y ^ + 0 ,   1   M S _ S S I M Y ,   Y ^
With:
M S E Y ,   Y ^ = 1 m n   i = 1 m j = 1 n ( Y ^ i , j Y ( i , j ) ) 2
M G E G Y , G ^ Y ^ = 1 m n   i = 1 m j = 1 n ( G ^ Y ^ i , j G Y ( i , j ) ) 2
G Y i , j = G Y x 2 i , j + G Y y 2 i , j
G Y x = Y [ 1 2 1 0 0 0 1 2 1 ]
G Y y = Y [ 1 0 1 2 0 2 1 0 1 ]
S S I M P Y ^ , P Y = 1 N P Y ^   P Y 2 μ P Y ^ μ P Y + c 1 2 σ P Y ^ σ P Y + c 2 μ 2 P Y ^ + μ 2 P Y + c 1 σ 2 P Y ^ + σ 2 P Y + c 2
where:
  • MSE: Mean Squared Error
  • MGE: Mean Gradient Error
  • MS_SSIM: Multi-Scale SSIM corresponding to multiple SSIM image evaluations at different image scales [22].
  • N: Number of batch over which SSIM has been averaged
  • μ P Y and μ P Y ^ : Mean of patches PY and P Y ^ , respectively
  • σ P Y and σ P Y ^ : Deviation of patches PY and P Y ^ , respectively
  • c 1 and c 2 : Constants
The Python code for the resampling and denoising DL model is available at: https://github.com/AurelienCD/Resampling_Denoising_Deep_Learning_MRI (accessed on 21 December 2021).
The quality of the model was then evaluated comparing the peak signal-to-noise ratio (PSNR, formula (8)) and the structural similarity method between input and output images with the original MRI image as reference. The input and output imaging format was DICOM.
P S N R Y ,   Y ^ = 10   l o g 10 M A X I 2 M S E Y ,   Y ^  
M A X I is the maximum possible pixel value of the image.

2.4. Image Processing, Radiomics Extraction and Analysis

All image processing was performed using a 3D slicer version 4.10 [23] and ImageJ software [24]. A total of 40 BM contours from 11 patients were obtained using the 3D slicer segmentation program designed for brain tumors [25]. Volumes of interest (VOIs) obtained from the original images were also used on DL input and output images.
Radiomic feature values were extracted from BM VOI using the Pyradiomics python library [26] as previously described [27]. Through radiomics extraction in accordance with the Imaging Biomarker Standardization Initiative (IBSI) [28], up to seven classes of features can be obtained. (1) The First-order intensity class describes the distribution of pixel values. (2) The Gray Level Co-occurrence Matrix (GLCM) class describes the occurrence of similar pixel values in the image. (3) The Gray Level Size Zone Matrix (GLSZM) features quantify gray level zones in an image. A gray level zone is defined as the number of connected voxels that share the same gray level intensity in three dimensions. (4) The Gray Level Run Length Matrix (GLRLM) class evaluates the gray level runs, which are defined as the length in number of pixels, of consecutive pixels that have the same gray level value in one dimension. (5) The Neighboring Gray Tone Difference Matrix (NGTDM) describes the difference between a gray value of a pixel and the average gray value of neighbors. (6) The Gray Level Dependence Matrix (GLDM) characterizes the number of connected voxels within a distance from the center voxel in function of their grey level. (7) The IQ wavelets class contains two features, a local analysis of the VOI only and a global analysis of the whole image. These metrics characterize image quality as the ratio between high and low wavelet frequencies. To evaluate the impact of DL algorithms on radiomics, we compared predictive radiomic models values before and after DL algorithm processing. Two predictive models of radiomic were used base on [29,30], which are radiomic models for the prediction of treatment response (overall survival) of BM from NSCLC and BM classification. More details on the radiomic models can be found in Table S2.
Lastly, to evaluate the performance of the DL model in comparison with the twice-shorter acquired images (downsampled or noisy images), maps of change in pixel value between post-processing and reference images were computed with ImageJ as follows:
abs Postprocessing   image reference   image reference   image 100

2.5. Statistical Analysis

Data are presented using boxplots with minimum, maximum, 1st quartile and 3rd quartile. p-values < 0.05 were considered statistically significant. A paired student’s t-test was used to compare features in original and DL images. The correlation between original, fast and DL images was analysed with a Pearson test and the Concordance Correlation Coefficient (CCC) [31]. CCC values of ± 1 describe a perfect positive/negative correlation respectively and a value of 0, no correlation. Features with a minimum CCC of 0.85 were considered as statistically reproducible and concordant and the radiomic values stable [32]. All the statistical analysis were performed using python [33] and SciPy library. All python codes used in the analysis are available on https://github.com/AurelienCD/Resampling_Denoising_Deep_Learning_MRI (accessed on 21 December 2021). Finally, to more deeply understand the mechanism of the DL models, scatter plots of radiomics features for original and DL images were performed.

3. Results

3.1. Resampling DL Model

3.1.1. Quality of the Resampling DL Model

As expected, and presented in Figure 2, fast image with an acquisition matrix divided by two, present with low resolution with or without an underlying pathologic condition. More interestingly, the DL resampling model was able to reconstruct well-defined MRI images with an increase in PSNR and SSIM values in comparison to fast acquired images (PSNR: 31.44 ± 2.89 vs. 34.24 ± 2.80, p < 0.001 and SSIM: 0.93 ± 0.03 vs. 0.96 ± 0.03, p < 0.001 for fast and DL images, respectively).
To investigate the impact of DL reconstruction on BM signal intensity, difference maps from the reference MRI image were computed and are presented in Figure 3. The BM signal intensity was significantly closer to the reference image in comparison to the fast image (difference value (%) 4.88 ± 2.17 vs. 4.67 ± 2.13, p < 0.05, for fast and DL images, respectively).

3.1.2. Impact of Resampling DL Model on Radiomics Features

The stability of the radiomics features after fast acquisition and DL reconstruction was investigated in BM lesions. Paired t-test analysis showed that fast images present marked disparities compared to the original images with significant differences in up to 83 of the 104 texture parameters (79.81%). Particularly, there were significant differences between the basic intensity values, such as minimum, maximum, mean, median and coefficient of variation (p < 0.05). Interestingly, the DL reconstruction allowed a restoration of the majority 48/104 (46.15%) of the previously unstable parameters with an absence of significant difference for the basic intensity parameters previously mentioned, except the coefficient of variation as presented in Table 2.
A significant difference in values was observed; however, these values were highly correlated if the differences in values are the same for all the data. In that case, radiomic predictive models would be valid as the difference between responders and non-responders are maintained. To test this hypothesis, the correlation (CCC) between radiomic values in reference and post-processing (fast and DL reconstructed images) were analyzed.
CCC values comparing fast images and DL images with the reference images were analyzed and are presented in Figure 4, left-part. The marked disparities between the fast images and the reference images in terms of radiomic values is concomitant to a loss of correlation as demonstrated by the CCC below 0.85 for 50/104 (48.08%) of the feature parameters. Interestingly, the DL model restores the correlation of the majority of the parameters as only 27/104 (25.96%) features have a CCC below 0.85 (Figure 4, left-part). It is of note that all the intensity features were stable after resampling DL reconstruction.
To further explore the radiomic stability evaluation after DL processing, we evaluated the difference in predictive radiomic model results after DL resampling or fast images in comparison with reference images. Published overall survival [29] and classification [30] predictive models were used in this study. As shown in Figure 5, Bland–Altman plots highlight important differences in predictive values obtained with reference and fast images (mean difference = −0.86, p < 0.001). In comparison, predictive values obtained from DL images are slightly different from the values obtained from reference images (mean difference = −0.24, p < 0.05).
A similar approach was undertaken with another radiomic model [30] which showed significant differences in predictive values for the fast images and non-significant differences for the DL images when compared to the reference images (mean difference = −0.36 and −0.07 for fast image and DL image, respectively and p < 0.001 for fast image, Figure S2a,b).

3.2. Denoising DL Model

3.2.1. Quality of the Denoising DL Model

As observed in Figure 6, the denoising DL model was able to reduce noise in the fast image and produce a high quality image similar to the reference image both in the healthy and pathological tissues. For the whole image, PSNR and SSIM were significantly increased with the use of the model (PSNR: 35.48 ± 6.2 vs. 41.32 ± 4.93, p < 0.001 and SSIM: 0.74 ± 0.18 vs. 0.96 ± 0.04, p < 0.001 for fast and DL images, respectively, (Figure S3). In the brain metastases regions, the coefficient of variation (CV) and entropy, which both reflect the noise in the image, were reduced in brain metastases in the DL images (CV (%) 0.17 ± 0.08 vs. 0.16 ± 0.08, p < 0.001, for fast and DL images, respectively, and entropy 5.83 ± 0.22 vs. 5.79 ± 0.25, p < 0.001, for fast and DL images, respectively, (Figure 6).

3.2.2. Impact of Denoising DL Model on Radiomics Features

The stability of the radiomics features after denoised DL reconstruction was then investigated in BM lesions. Paired t-test analysis showed that images reconstructed in half the time present were significantly different for 75 of the 104 texture parameters (72.12%) from the original images. The denoising DL reconstruction restored the majority of these parameters. There were 40/104 (38.46%) residual unstable parameters (Table S1). Interestingly, even if radiomics classes such as Gray Level Size Zone Matrix remained stable, significant differences compared with the reference images were observed for the intensity parameters as mean, min, max and coefficient of variation were observed (p < 0.01) However, as previously mentioned, radiomic features could remain stable even with a significant difference in values compared to the reference image. CCC was then evaluated and showed that, for intensity radiomic class, only first order entropy is unstable after denoising DL reconstruction (Figure 4, right part). For the other radiomic classes, only Gldm_Small Dependence Low Gray Level Emphasis and Glszm_Small Area Low Gray Level Emphasis were with a CCC < 0.8. There were 8/104 (7.69%) residual unstable radiomic features for the denoising DL reconstruction only, in comparison to the 40/104 (38.46%) unstable radiomic features for fast images acquisition. Wavelets, which are a ratio of high to low frequencies in the image, i.e., the noise in the image, were, as expected, strongly affected by fast acquisition (CCC < 0.1 and < 0.2 for local and global wavelets, respectively). It is interesting to note that, as a proof of the efficiency of the denoising DL model, wavelet features had a CCC > 0.85.
As for the resampling DL model, a difference in the predicted radiomic model results after DL denoising in comparison to fast and reference images were evaluated. Published overall survival [29] and classification [30] predictive models were used in this study. As shown in Figure 7, Bland–Altman plots highlight important differences in predictive values obtained with reference and fast images (mean difference = 0.96, p < 0.001). In comparison, predicted values obtained from DL images were slightly different from the values obtained from reference images (mean difference = 0.12, p < 0.05).
Results for the other radiomic model [30] showed significant differences in predicted values for the fast images and non-significant differences for the DL images (mean difference = 0.15 and 0.01 for fast image and DL images, respectively. and p < 0.001 for fast images, Figure S2c,d).

4. Discussion

Long acquisition times result in unacceptable delays in patient access to MRI examinations. For this reason, many research groups, MRI manufacturers and digital startups in medical imaging are actively developing resampling and denoising models to allow faster acquisition times without a loss in image quality. Classical methods used bicubic interpolation [34] to create new neighboring pixels to upsample the image, but the resulting images were artificially smooth with some interpolation artifacts. For denoising, the state-of-the-art classical denoising method, BM3D [35], achieves good image quality without noise but smooths the image. A specified sigma value is thus required to remove noise but this may in turn remove important details in the image, critical for diagnostic images. In recent years, many deep learning architectures have been introduced for resampling (or super resolution) such as Efficient Sub-Pixel Convolutional Neural Network) (ESPCN) [12], that uses a sub-pixel convolution layer at the end to reconstruct the high resolution image [36,37] or U-NET. In our study, we initially used ESPCN architecture as a resampling solution. However, as shown in Figure S4, some cerebral structures were lost after the algorithm reconstruction. U-NET, another deep learning architecture, was recently proposed for image segmentation; however, it has shown additional utility in image resampling owing to its encoder-decoder architecture using concatenation layers to allow more information to be retained from previous layers of the network [13]. In our study, U-NET architecture was better than ESPCN for resampling purposes as it enabled the reconstruction of small cerebral structures with improved quality image metrics, such as PSNR and SSIM (p < 0.001), with a decrease in pixel value differences in healthy or tumour regions in comparison to reference images (p < 0.001, Figure S4). These differences may be due to the fact that ESPCN architecture failed to capture local information since ESPCN does not use downsampling/upsampling layers combined with skip connections for extraction of local information and because the convolution layers are applied to large size maps. However, the U-NET model used in this study could be improved by using a more complex deep learning model such as Generative Adversarial Networks (GAN). However, GAN models need large volumes of images and can elucidate some non-existing information which is a critical point for clinical imaging used for diagnosis [38]. Unsupervised models such as Deep Image Prior could also be an interesting alternative, however the reconstruction process is very long (taking a minimum of 1000 iterations ~5 min to reconstruct one image) and would not be possible for routine purposes in clinic [39].
For denoising, Denoising Convolutional Neural Network (DnCNN) is a state-of-the-art denoising method and is very efficient at removing additive white Gaussian noise [40]. However, MRI noise is not Gaussian additive and can be approximated by Gaussian noise in both the imaginary and the real parts of the k-space [18,19].
When trying to denoise MRI images, the model could confuse some important details (for example veins) for noise and remove them [41]. To avoid this problem, Gondara and colleagues [42] showed that autoencoders using convolutional layers are efficient for medical image denoising even for high noise levels, while others have shown that this model does not require a large training set to give good results [21]. For these reasons we decided to use a U-NET type architecture for both MRI resampling and denoising. We remove from vanilla (original) U-NET architecture the batch normalization as Zhang and colleagues found that they deteriorate the accuracy of image super-resolution tasks [43].
Finally, concerning the loss function, in this study we used a mix loss that combines MSE, MGE and SSIM which proved to be efficient for reconstructing low level details and structures. MSE on its own can lead to a pixel-wise average of plausible solutions which result in lack of high-frequency details (such as edges and textures) [44].
The choice of the image used for model training purposes is crucial. True fast images are hardly feasible to obtain as MRI acquisition time is already at a premium. It would require two acquisitions per patient of the same sequence adding unnecessary time to the total workflow. For this reason, all the literature publications on medical imaging simulate the noise or the downsampling from the reference images. Simulated downsampling can be obtained with bicubic interpolation [12] (with or without blurring the image using a Gaussian kernel) [45] of the reference image. In this study we choose DnCNN degradation (downsampling followed by upsampling) because it introduces degradation while preserving the image size (our model architecture requires that input and output sizes are the same) [40]. In fast MRI, they went further and undersampled the image in the Fourier domain to simulate k-space undersampling. Their approach is a better approximation to the real acquisition but it requires k-space data before any processing, which is not possible in the large majority of the clinical centers.
CNNs can be efficient at removing motion artefacts from MR images. Authors have generated motion artefacts on MR images of the liver by simulating the phase error in k-space and have trained a fully convolutional network to remove this artefact [46]. In our study, without explicitly training our denoising model to remove motion artefacts, the model captured it as if it were MRI noise and was able, in some cases, to remove it successfully (Figure S5).
The family of features were differently affected by fast imaging and DL reconstruction, (Figure 4). The intensity features that did not account for the spatial position of the voxel, which were restored from 66 to 100% by DL resampling and from 85 to 96% by DL denoising. The occurrence (GLCM) features were refurbished from 46 to 71% and from 66 to 100%, respectively. This family of features depends on the volume and quantification level but also on SNR and contrast. One-dimensional features (GLRLM) were the most affected by fast sampling from 44 to 62% and from 50 to 81% by DL resampling and DL denoising, respectively. If the spatial position of the voxel was accounted for (GLDM, GLSZM, NGTDM) the number of stability features increased from 51 to 66% for resampling and from 48 to 89% for denoising. Halving the acquisition matrix and DL resampling lead to less stable features than decreasing the number of NEX and DL denoising.
Predictive models based on radiomics is a fast-evolving field. Outstanding advances have been made since 2014 [47]. One of the main challenges still to be addressed is the interoperability and stability of these models. Over the last decade numerous studies have evaluated the impact of these bias factors in multimodal imaging [17]. Lambin and colleagues have evaluated the radiomics stability over test-retest in diffusion MRI in ovarian, colorectal and lung cancers [32] and 4DCT [48], CT [49] and PET in lung cancer [47]. These studies showed radiomic stability for 25 to 71% of all radiomic features. We observed the same order of magnitude in radiomic stability in our study for fast imaging reconstruction (>50% and >61%, respectively, for resampling and denoising) which was much lower than that obtained by DL reconstruction (>74% and >92% for resampling and denoising, respectively). The results of the previous study mentioned and the results of our study suggest that patient positioning during test-retest, magnetic field and the MRI manufacturer have a greater impact than DL reconstruction. DL algorithms are now proposed by most vendors with the release of new generation MRIs It is therefore important to evaluate the validity of these radiomic models. To the best of our knowledge, this the first study evaluating the impact of DL acceleration on radiomic stability in BM at the clinical level.
In our study we analysed the stability of the radiomic features using Pearson’s correlation and CCC. We chose to use both correlation factors as they measure different parameters. Pearson’s correlation is a measure of linearity whereas CCC is a measure of agreement. For the resampling approaches, Pearson’s correlation showed significant correlation between the fast image and DL image compared with the reference images for all the parameters. However, if we consider the CCC of the Kurtosis and Skewness features, the non-stable features with fast image reconstruction become stable after DL reconstruction. CCC thus seems a more sensitive measure with correlation clearly increased (Figure S6). Glcm-MCC, which represents the complexity of the signal in the ROI, is not stable after DL reconstruction but Person’s correlation is increased in comparison with the fast image.
We show that unstable radiomic features after resampling the DL reconstruction had a greater effect than the denoising DL reconstruction. We hypothesize that this could be due to the fact that downsampling deletes information whereas noise covers the information without deleting it. As a result, the denoising model will have to characterize the noise before its removal [50].
Finally, radiomic model validity after DL processing was evaluated as a final output. In this study we showed that whereas predictive values were strongly modified after fast acquisition, differences in predictive values after DL image processing in comparison to the reference was less important for one radiomic model [29] and with no difference for another one [30]. It is important to note that these radiomic models were obtained using a combination of radiomic features. We summarize in Table S2, the radiomic features used in the two radiomic models reported on in this study (top and middle rows of the table) and another classification model from the literature ([51], bottom row of the table). We observe that for the resampling purpose, radiomic features were unstable for 13/19 of the features used for the radiomic models, compared with only 3/19 unstable features after DL resampling processing. Concerning the radiomic models non-tested in this study, only 4/21 were unstable after DL resampling. More interestingly, the DL denoising model worked better with no unstable radiomic features for the two radiomic models [29,30] used in this study compared with only one for the radiomics model for classification used in the Qian study [51]. In regards to clinical implication, we assessed the impact of DL processing on the validity of predicted values from radiomic models. To do that, we used Bland–Altman plots and analysed the accuracy of the predicted values from radiomic model after DL processing in comparison to the original predicted values. Considering radiomic model from [29], only 2 and 3 predicted values are not accurate after resampling and denoising DL process, respectively, which represent accuracies of 95 and 92.5% (same results were obtained with radiomic model from [30]). These results highlight the impressive ability of DL to capture the shape and very precise features of the reference/high quality images during the training step and then re-inject them into new downsampled or denoised images.
As a limitation, our study was a retrospective study, as the problematic here is the delay in obtaining an MRI appointment it was not possible to negatively impact the patient’s medical path in adding for example, MRI sequences. For this reason, further prospective studies need to be undertaken for clinical validation.
Not to diminish the importance of DL reconstruction on radiomics stability, of far greater importance is the impact that MRI parameters as defined by different vendors or as used in different clinical settings can have on radiomics stability. For example, Lambin and colleagues showed that some radiomics in MRI are unstable during test-retest in multiple clinical centres [32], as well as during FDG-PET test-retest [47] or CT test-retest [49]. Knowing that, we could hypothesise that if a DL algorithm was trained using sufficient MRI images from a large number of imaging centres, it could facilitate harmonization of the image data between the different centres. In turn, this approach could standardise MRI imaging data in multicentric clinical trials similar to what the EARL approach achieved for PET imaging [52,53]. Further multicentric studies are necessary to validate this hypothesis.

5. Conclusions

The DL model developed in this study allows 128 × 128 pixel images with a number of average (NEX) of 1, to be reconstructed as 256 × 256 T1 images of good quality, similar to the reference image acquired in clinical routines with a NEX of 2 and an acquisition time twice as long. Concerning the texture parameters, while rapid, fast MRI acquisition loses most of the radiomic features in particular with regards to the first order intensity values. Pearson correlation and CCC analysis shows that DL models allow for the restoration of the majority of the radiomic characteristics of the original image. Finally, the majority of the radiomic features used to compute predictive radiomic models are restored after DL algorithms. This first study, which would need to be confirmed by other studies, highlights the possibility of using DL reconstructed MRI images of brain metastases for predictive radiomic model purposes.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/cancers14010036/s1, Figure S1: Experimental paradigm and schema of the study. Figure S2: Bland–Altman plots showing the difference between predictive values obtained from radiomic model. Figure S3: Quantitative analyzes of the efficiency of the denoising DL model with the comparison with fast image concerning. Figure S4: ESPCNN and U-NET DL models comparison. Figure S5: Example of motion artefact corrected by the U-NET DL model. Figure S6: Pearson correlation of kurtosis, skweness, glcm MCC values between reference and fast and DL images. Table S1: Paired t-test of DL resampling impact on radiomic features. Table S2: Comparison of ICC for different radiomic models features between reference, fast image and DL images

Author Contributions

Conceptualization, A.C.-D., C.J., I.M. and A.L.; methodology, A.C.-D., C.J., I.M. and A.L.; software, A.C.-D., C.J., I.M. and A.L.; validation, all authors; formal analysis, A.C.-D., C.J. and I.M.; investigation, A.C.-D., C.J., I.M. and A.L.; resources, A.C.-D., C.J., I.M., J.T. and Z.S.; data curation, A.C.-D., C.J., I.M., J.T. and Z.S.; writing—original draft preparation, A.C.-D., C.J. and I.M.; writing—review and editing, I.M., C.J., A.L., S.V., C.R., Z.S., J.T., N.F., J.L., A.B., A.C.-D.; visualization, I.M., C.J., A.L., S.V., C.R., Z.S., J.T., N.F., J.L., A.B., A.C.-D.; supervision, A.C.-D. and C.J.; project administration, A.C.-D. and C.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and MR-004, a national French institution (INDS) defining health research conduct guidelines and approved by the Institutional Review Board (or Ethics Committee) of Centre François Baclesse (“RESIMET” protocol, January 2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study and the python code used are openly available at https://github.com/AurelienCD/Resampling_Denoising_Deep_Learning_MRI (accessed on 21 December 2021).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. OECD Statistiques. Available online: https://stats.oecd.org (accessed on 21 December 2021).
  2. Graciet, A.; Detournay, B.; Courouve, L. Les Insuffisances En Matière D’équipements D’imagerie Médicale En France; Banque de Données en Santé Publique: Bourg-la-Reine, France, 2018; Volume 33. [Google Scholar]
  3. McGibney, G.; Smith, M.R.; Nichols, S.T.; Crawley, A. Quantitative Evaluation of Several Partial Fourier Reconstruction Algorithms Used in MRI. Magn. Reson. Med. 1993, 30, 51–59. [Google Scholar] [CrossRef] [PubMed]
  4. Pruessmann, K.P.; Weiger, M.; Scheidegger, M.B.; Boesiger, P. SENSE: Sensitivity Encoding for Fast MRI. Magn. Reson. Med. 1999, 42, 952–962. [Google Scholar] [CrossRef]
  5. Jaspan, O.N.; Fleysher, R.; Lipton, M.L. Compressed Sensing MRI: A Review of the Clinical Literature. Br. J. Radiol. 2015, 88, 20150487. [Google Scholar] [CrossRef] [PubMed]
  6. Wang, F.; Preininger, A. AI in Health: State of the Art, Challenges, and Future Directions. Yearb. Med. Inf. 2019, 28, 16–26. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Ahuja, A.S. The Impact of Artificial Intelligence in Medicine on the Future Role of the Physician. PeerJ 2019, 2019, e7702. [Google Scholar] [CrossRef]
  8. Knoll, F.; Zbontar, J.; Sriram, A.; Muckley, M.J.; Bruno, M.; Defazio, A.; Parente, M.; Geras, K.J.; Katsnelson, J.; Chandarana, H.; et al. FastMRI: A Publicly Available Raw k-Space and DICOM Dataset of Knee Images for Accelerated MR Image Reconstruction Using Machine Learning. Radiol. Artif. Intell. 2020, 2, e190007. [Google Scholar] [CrossRef] [PubMed]
  9. Reina, G.A.; Stassen, M.; Pezzotti, N. White Paper Philips Healthcare Uses the Intel® Distribution of OpenVINOTM Toolkit and the Intel® DevCloud for the Edge to Accelerate Compressed Sensing Image Reconstruction Algorithms for MRI Authors. Available online: https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/philips-healthcare-mri-white-paper.pdf (accessed on 21 December 2021).
  10. Behl, N. Deep Resolve—Mobilizing the Power of Networks. Available online: https://cdn0.scrvt.com/39b415fb07de4d9656c7b516d8e2d907/cc86f5120c06595c/d13b5f98477c/siemens-healthineers_MR_Deep_Resolve_Whitepaper.pdf (accessed on 21 December 2021).
  11. Zaharchuk, X.G.; Gong, X.E.; Wintermark, X.M.; Rubin, X.D.; Langlotz, X.C.P. Deep Learning in Neuroradiology. Am. J. Neuroradiol. 2018, 39, 1776–1784. [Google Scholar] [CrossRef] [Green Version]
  12. Shi, W.; Caballero, J.; Huszar, F.; Totz, J.; Aitken, A.P.; Bishop, R.; Rueckert, D.; Wang, Z. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. 2016, 12, 1874–1883. [Google Scholar] [CrossRef] [Green Version]
  13. Lu, Z.; Chen, Y. Single Image Super Resolution Based on a Modified U-Net with Mixed Gradient Loss. arXiv 2019, arXiv:1911.09428. [Google Scholar] [CrossRef]
  14. Lambin, P.; Rios-Velazquez, E.; Leijenaar, R.; Carvalho, S.; Van Stiphout, R.G.P.M.; Granton, P.; Zegers, C.M.L.; Gillies, R.; Boellard, R.; Dekker, A.; et al. Radiomics: Extracting More Information from Medical Images Using Advanced Feature Analysis. Eur. J. Cancer 2012, 48, 441–446. [Google Scholar] [CrossRef] [Green Version]
  15. Aerts, H.J.W.L.; Velazquez, E.R.; Leijenaar, R.T.H.; Parmar, C.; Grossmann, P.; Cavalho, S.; Bussink, J.; Monshouwer, R.; Haibe-Kains, B.; Rietveld, D.; et al. Decoding Tumour Phenotype by Noninvasive Imaging Using a Quantitative Radiomics Approach. Nat. Commun. 2014, 5. [Google Scholar] [CrossRef] [PubMed]
  16. Lohmann, P.; Kocher, M.; Ruge, M.I.; Visser-Vandewalle, V.; Shah, N.J.; Fink, G.R.; Langen, K.J.; Galldiks, N. PET/MRI Radiomics in Patients with Brain Metastases. Front. Neurol. 2020, 11. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Hatt, M.; Cheze Le Rest, C.; Antonorsi, N.; Tixier, F.; Tankyevych, O.; Jaouen, V.; Lucia, F.; Bourbonne, V.; Schick, U.; Badic, B.; et al. Radiomics in PET/CT: Current Status and Future AI-Based Evolutions. Semin. Nucl. Med. 2021, 51, 126–133. [Google Scholar] [CrossRef] [PubMed]
  18. Edelstein, W.A.; Glover, G.H.; Hardy, C.J.; Redington, R.W. The Intrinsic Signal-to-noise Ratio in NMR Imaging. Magn. Reson. Med. 1986, 3, 604–618. [Google Scholar] [CrossRef] [PubMed]
  19. Macovski, A. Noise in MRI. Magn. Reson. Med. 1996, 36, 494–497. [Google Scholar] [CrossRef]
  20. Chollet, F. Keras 2015. Available online: https://github.com/fchollet/keras (accessed on 21 December 2021).
  21. Weng, W.; Zhu, X. U-Net: Convolutional Networks for Biomedical Image Segmentation. IEEE Access 2021, 9, 16591–16603. [Google Scholar] [CrossRef]
  22. Wang, Z.; Simoncelli, E.P.; Bovik, A.C. Multiscale Structural Similarity for Image Quality Assessment. In Proceedings of the Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, Pacific Grove, CA, USA, 9–12 November 2003; IEEE: New York, NY, USA, 2003; Volume 2, pp. 1398–1402. [Google Scholar]
  23. Fedorov, A.; Beichel, R.; Kalpathy-Cramer, J.; Finet, J.; Fillion-Robin, J.C.; Pujol, S.; Bauer, C.; Jennings, D.; Fennessy, F.; Sonka, M.; et al. 3D Slicer as an Image Computing Platform for the Quantitative Imaging Network. Magn. Reson. Imaging 2012, 30, 1323–1341. [Google Scholar] [CrossRef] [Green Version]
  24. Schneider, C.A.; Rasband, W.S.; Eliceiri, K.W. NIH Image to ImageJ: 25 Years of Image Analysis. Nat. Methods 2012, 9, 671–675. [Google Scholar] [CrossRef] [PubMed]
  25. Egger, J.; Kapur, T.; Fedorov, A.; Pieper, S.; Miller, J.V.; Veeraraghavan, H.; Freisleben, B.; Golby, A.J.; Nimsky, C.; Kikinis, R. GBM Volumetry Using the 3D Slicer Medical Image Computing Platform. Sci. Rep. 2013, 3, 1364. [Google Scholar] [CrossRef]
  26. Van Griethuysen, J.J.M.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.H.; Fillion-Robin, J.C.; Pieper, S.; Aerts, H.J.W.L. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Jaudet, C.; Weyts, K.; Lechervy, A.; Batalla, A.; Bardet, S.; Corroyer-Dulmont, A. The Impact of Artificial Intelligence CNN Based Denoising on FDG PET Radiomics. Front. Oncol. 2021, 11, 3136. [Google Scholar] [CrossRef] [PubMed]
  28. Zwanenburg, A.; Vallières, M.; Abdalah, M.A.; Aerts, H.J.W.L.; Andrearczyk, V.; Apte, A.; Ashrafinia, S.; Bakas, S.; Beukinga, R.J.; Boellaard, R.; et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-Based Phenotyping. Radiology 2020, 295, 328–338. [Google Scholar] [CrossRef] [Green Version]
  29. Liao, C.-Y.; Lee, C.-C.; Yang, H.-C.; Chen, C.-J.; Chung, W.-Y.; Wu, H.-M.; Guo, W.; Liu, R.; Lu, C. Enhancement of Radiosurgical Treatment Outcome Prediction Using MRI Radiomics in Patients with Non-Small Cell Lung Cancer Brain Metastases. Cancers 2021, 13, 4030. [Google Scholar] [CrossRef] [PubMed]
  30. Su, C.Q.; Chen, X.T.; Duan, S.F.; Zhang, J.X.; You, Y.P.; Lu, S.S.; Hong, X.N. A Radiomics-Based Model to Differentiate Glioblastoma from Solitary Brain Metastases. Clin. Radiol. 2021, 76, 629.e11–629.e18. [Google Scholar] [CrossRef] [PubMed]
  31. Lin, L.I.-K. A Concordance Correlation Coefficient to Evaluate Reproducibility. Biometrics 1989, 45, 255–268. [Google Scholar] [CrossRef] [PubMed]
  32. Peerlings, J.; Woodruff, H.C.; Winfield, J.M.; Ibrahim, A.; Van Beers, B.E.; Heerschap, A.; Jackson, A.; Wildberger, J.E.; Mottaghy, F.M.; DeSouza, N.M.; et al. Stability of Radiomics Features in Apparent Diffusion Coefficient Maps from a Multi-Centre Test-Retest Trial. Sci. Rep. 2019, 9, 4800. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Anaconda Software Distribution. (2020). Anaconda Documentation. Anaconda Inc. Available online: https://docs.anaconda.com (accessed on 21 December 2021).
  34. Keys, R. Cubic Convolution Interpolation for Digital Image Processing. IEEE Trans. Acoust. 1981, 29, 1153–1160. [Google Scholar] [CrossRef] [Green Version]
  35. Makinen, Y.; Azzari, L.; Foi, A. Collaborative Filtering of Correlated Noise: Exact Transform-Domain Variance for Improved Shrinkage and Patch Matching. IEEE Trans. Image Process. 2020, 29, 8339–8354. [Google Scholar] [CrossRef]
  36. Kim, J.; Lee, J.K.; Lee, K.M. Accurate Image Super-Resolution Using Very Deep Convolutional Networks. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. 2016, 2016, 1646–1654. [Google Scholar] [CrossRef] [Green Version]
  37. Zhang, Y.; Li, K.; Li, K.; Wang, L.; Zhong, B.; Fu, Y. Image Super-Resolution Using Very Deep Residual Channel Attention Networks. Lect. Notes Comput. Sci. 2018, 11211, 294–310. [Google Scholar] [CrossRef] [Green Version]
  38. Cohen, J.P.; Luck, M.; Honari, S. Distribution Matching Losses Can Hallucinate Features in Medical Image Translation. Lect. Notes Comput. Sci. 2018, 11070, 529–536. [Google Scholar] [CrossRef] [Green Version]
  39. Ulyanov, D.; Vedaldi, A.; Lempitsky, V. Deep Image Prior. Int. J. Comput. Vis. 2020, 128, 1867–1888. [Google Scholar] [CrossRef]
  40. Zhang, K.; Zuo, W.; Chen, Y.; Meng, D.; Zhang, L. Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising. IEEE Trans. Image Process. 2017, 26, 3142–3155. [Google Scholar] [CrossRef] [Green Version]
  41. Imai, K.; Miyata, T. Gated Texture CNN for Efficient and Configurable Image Denoising. Lect. Notes Comput. Sci. 2020, 12537, 665–681. [Google Scholar] [CrossRef]
  42. Gondara, L. Medical Image Denoising Using Convolutional Denoising Autoencoders. In Proceedings of the 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW), Barcelona, Spain, 12–15 December 2016; pp. 241–246. [Google Scholar] [CrossRef] [Green Version]
  43. Xu, J.; Chae, Y.; Stenger, B.; Datta, A. Residual Dense Network for Image Super Resolution. Int. Conf. Image Process. 2018, 71–75. [Google Scholar] [CrossRef]
  44. Zhao, H.; Gallo, O.; Frosio, I.; Kautz, J. Loss Functions for Image Restoration with Neural Networks. IEEE Trans. Comput. Imaging 2017, 3, 47–57. [Google Scholar] [CrossRef]
  45. Johnson, J.; Alahi, A.; Fei-Fei, L. Perceptual Losses for Real-Time Style Transfer and Super-Resolution. Lect. Notes Comput. Sci. 2016, 9906, 694–711. [Google Scholar] [CrossRef] [Green Version]
  46. Tamada, D.; Kromrey, M.L.; Ichikawa, S.; Onishi, H.; Motosugi, U. Motion Artifact Reduction Using a Convolutional Neural Network for Dynamic Contrast Enhanced Mr Imaging of the Liver. Magn. Reson. Med. Sci. 2020, 19, 64–76. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Leijenaar, R.T.H.; Carvalho, S.; Velazquez, E.R.; Van Elmpt, W.J.C.; Parmar, C.; Hoekstra, O.S.; Hoekstra, C.J.; Boellaard, R.; Dekker, A.L.A.J.; Gillies, R.J.; et al. Stability of FDG-PET Radiomics Features: An Integrated Analysis of Test-Retest and Inter-Observer Variability. Acta Oncol. 2013, 52, 1391–1397. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Larue, R.T.H.M.; Van De Voorde, L.; van Timmeren, J.E.; Leijenaar, R.T.H.; Berbée, M.; Sosef, M.N.; Schreurs, W.M.J.; van Elmpt, W.; Lambin, P. 4DCT Imaging to Assess Radiomics Feature Stability: An Investigation for Thoracic Cancers. Radiother. Oncol. 2017, 125, 147–153. [Google Scholar] [CrossRef] [Green Version]
  49. Van Timmeren, J.E.; Leijenaar, R.T.H.; van Elmpt, W.; Wang, J.; Zhang, Z.; Dekker, A.; Lambin, P. Test-Retest Data for Radiomics Feature Stability Analysis: Generalizable or Study-Specific? Tomography 2016, 2, 361–365. [Google Scholar] [CrossRef]
  50. Gao, R.; Grauman, K. On-Demand Learning for Deep Image Restoration. IEEE Int. Conf. Comput. Vis. 2017, 2017, 1095–1104. [Google Scholar] [CrossRef] [Green Version]
  51. Yi, X.; Pei, Q.; Zhang, Y.; Zhu, H.; Wang, Z.; Chen, C.; Li, Q.; Long, X.; Tan, F.; Zhou, Z.; et al. MRI-Based Radiomics Predicts Tumor Response to Neoadjuvant Chemoradiotherapy in Locally Advanced Rectal Cancer. Front. Oncol. 2019, 9, 552. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Kaalep, A.; Burggraaff, C.N.; Pieplenbosch, S.; Verwer, E.E.; Sera, T.; Zijlstra, J.; Hoekstra, O.S.; Oprea-Lager, D.E.; Boellaard, R. Quantitative Implications of the Updated EARL 2019 PET-CT Performance Standards. EJNMMI Phys. 2019, 6, 28. [Google Scholar] [CrossRef] [PubMed]
  53. Aide, N.; Lasnon, C.; Veit-Haibach, P.; Sera, T.; Sattler, B.; Boellaard, R. EANM/EARL Harmonization Strategies in PET Quantification: From Daily Practice to Multicentre Oncological Studies. Eur. J. Nucl. Med. Mol. Imaging 2017, 44, 17–31. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Deep learning model architecture.
Figure 1. Deep learning model architecture.
Cancers 14 00036 g001
Figure 2. Resampling DL model. Representative magnetic resonance imaging (MRI) of reference images, fast acquisition images and DL reconstruction images in healthy (a) and pathological conditions (b). Quantitative analysis of the efficiency of the resampling DL model with the comparison with fast acquired image concerning PSNR (c) and SSIM (d) metrics. n = 2049 for both groups, *** p < 0.001 vs. fast acquired image.
Figure 2. Resampling DL model. Representative magnetic resonance imaging (MRI) of reference images, fast acquisition images and DL reconstruction images in healthy (a) and pathological conditions (b). Quantitative analysis of the efficiency of the resampling DL model with the comparison with fast acquired image concerning PSNR (c) and SSIM (d) metrics. n = 2049 for both groups, *** p < 0.001 vs. fast acquired image.
Cancers 14 00036 g002
Figure 3. Effect of resampling DL model on BM signal intensity. (a) Representative MRI of reference images (left), difference map with fast acquisition images (middle) and DL reconstruction images (right). (b) Quantitative analysis of pixel value difference (%) in fast and DL images. n = 40 for both groups, * p < 0.05 vs. fast acquired image. Bars represent minimum and maximum values.
Figure 3. Effect of resampling DL model on BM signal intensity. (a) Representative MRI of reference images (left), difference map with fast acquisition images (middle) and DL reconstruction images (right). (b) Quantitative analysis of pixel value difference (%) in fast and DL images. n = 40 for both groups, * p < 0.05 vs. fast acquired image. Bars represent minimum and maximum values.
Cancers 14 00036 g003
Figure 4. Effect of fast acquisition and resampling DL model on the correlation between reference and post-processing image radiomic values. Red bars represent unstable radiomic features below a CCC value threshold of 0.85. Blue bars represent stable radiomic features after fast acquisition or resampling DL reconstruction.
Figure 4. Effect of fast acquisition and resampling DL model on the correlation between reference and post-processing image radiomic values. Red bars represent unstable radiomic features below a CCC value threshold of 0.85. Blue bars represent stable radiomic features after fast acquisition or resampling DL reconstruction.
Cancers 14 00036 g004
Figure 5. Bland–Altman plots showing the difference between predictive values obtained from the radiomic model [29] and reference images to fast downsampling images (a) and DL resampling image (b) in brain metastatic lesions.
Figure 5. Bland–Altman plots showing the difference between predictive values obtained from the radiomic model [29] and reference images to fast downsampling images (a) and DL resampling image (b) in brain metastatic lesions.
Cancers 14 00036 g005
Figure 6. Denoising DL model. Representative MRI of reference images, fast acquisition images and DL reconstruction images in whole brain (a) and brain metastases (b). Quantitative analysis of the efficiency of the denoising DL model in brain metastases regions in comparison with fast images as evaluated by the coefficient of variation (c) and entropy (d) metrics. n = 40 for both groups, *** p < 0.001 vs. fast acquired image.
Figure 6. Denoising DL model. Representative MRI of reference images, fast acquisition images and DL reconstruction images in whole brain (a) and brain metastases (b). Quantitative analysis of the efficiency of the denoising DL model in brain metastases regions in comparison with fast images as evaluated by the coefficient of variation (c) and entropy (d) metrics. n = 40 for both groups, *** p < 0.001 vs. fast acquired image.
Cancers 14 00036 g006
Figure 7. Bland–Altman plots showing the difference between predictive values obtained from radiomic model [29] from reference image to fast noising image (a) and DL denoising image (b) in brain metastases lesions, n = 40 for both groups.
Figure 7. Bland–Altman plots showing the difference between predictive values obtained from radiomic model [29] from reference image to fast noising image (a) and DL denoising image (b) in brain metastases lesions, n = 40 for both groups.
Cancers 14 00036 g007
Table 1. Description of the patient cohort.
Table 1. Description of the patient cohort.
Included Patients (N)85Number
Sex58Female %
Age (Y)66.48 ± 10.31Mean ± SD
(46–88)[range]
Origin of BMNumber (%)
Lung42 (48%)
Breast28 (32%)
Kidney6 (6.9%)
Digestive System3 (3.4%)
Melanoma3 (3.4%)
Gynecologic2 (2.3%)
Table 2. Paired t-test, of DL resampling impact on radiomic features. Green highlight shows stable radiomic features values. * p < 0.05, ** p < 0.01 or *** p < 0.001 vs. features in the original image.
Table 2. Paired t-test, of DL resampling impact on radiomic features. Green highlight shows stable radiomic features values. * p < 0.05, ** p < 0.01 or *** p < 0.001 vs. features in the original image.
ClassesFeaturesSignicantly DifferentClassesFeaturesSignicantly Different
IntensityMinNS p = 0.73Gray Level Dependence Matrix gldm_DependenceEntropy***
gldm_DependenceNonUniformity*
gldm_DependenceNonUniformityNormalized**
MaxNS p = 0.47gldm_DependenceVariance*
PeakNS p = 0.27gldm_GrayLevelNonUniformityNS p = 0.56
MeanNS p = 0.07gldm_GrayLevelVarianceNS p = 0.69
MedianNS p = 0.06gldm_HighGrayLevelEmphasisNS p = 0.10
Skewness *gldm_LargeDependenceEmphasis**
Kurtosis NS p = 0.36gldm_LargeDependenceHighGrayLevelEmphasis**
CV(%)*gldm_LargeDependenceLowGrayLevelEmphasisNS p = 0.81
MaxOnMeanRingNS p = 0.08gldm_LowGrayLevelEmphasisNS p = 0.59
firstorder_10PercentileNS p = 0.14gldm_SmallDependenceEmphasis**
firstorder_90Percentile**gldm_SmallDependenceHighGrayLevelEmphasisNS p = 1.00
firstorder_EnergyNS p = 0.16gldm_SmallDependenceLowGrayLevelEmphasisNS p = 0.55
firstorder_EntropyNS p = 0.97Gray Level Run Length Matrixglrlm_GrayLevelNonUniformityNS p = 0.87
firstorder_InterquartileRangeNS p = 0.10glrlm_GrayLevelNonUniformityNormalizedNS p = 0.83
firstorder_KurtosisNS p = 0.36glrlm_GrayLevelVarianceNS p = 0.71
firstorder_MaximumNS p = 0.47glrlm_HighGrayLevelRunEmphasisNS p = 0.10
firstorder_MeanAbsoluteDeviationNS p = 0.13glrlm_LongRunEmphasis**
firstorder_MeanNS p = 0.07glrlm_LongRunHighGrayLevelEmphasis*
firstorder_MedianNS p = 0.10glrlm_LongRunLowGrayLevelEmphasisNS p = 0.64
firstorder_MinimumNS p = 0.73glrlm_LowGrayLevelRunEmphasisNS p = 0.61
firstorder_RangeNS p = 0.46glrlm_RunEntropyNS p = 0.52
firstorder_RobustMeanAbsoluteDeviationNS p = 0.20glrlm_RunLengthNonUniformity*
firstorder_RootMeanSquaredNS p = 0.05glrlm_RunLengthNonUniformityNormalized**
firstorder_Skewness*glrlm_RunPercentage**
firstorder_TotalEnergyNS p = 0.15glrlm_RunVariance**
firstorder_UniformityNS p = 0.89glrlm_ShortRunEmphasis**
firstorder_VarianceNS p = 0.11glrlm_ShortRunHighGrayLevelEmphasisNS p = 0.11
Gray Level Co-occurrence Matrixglcm_Autocorrelation*glrlm_ShortRunLowGrayLevelEmphasisNS p = 0.63
glcm_ClusterProminenceNS p = 0.81Gray Level Size Zone Matrixglszm_GrayLevelNonUniformity**
glcm_ClusterShadeNS p = 0.81glszm_GrayLevelNonUniformityNormalized**
glcm_ClusterTendencyNS p = 0.86glszm_GrayLevelVarianceNS p = 0.90
glcm_Contrast**glszm_HighGrayLevelZoneEmphasisNS p = 0.88
glcm_Correlation***glszm_LargeAreaEmphasisNS p = 0.16
glcm_DifferenceAverage**glszm_LargeAreaHighGrayLevelEmphasisNS p = 0.07
glcm_DifferenceEntropy***glszm_LargeAreaLowGrayLevelEmphasisNS p = 0.10
glcm_DifferenceVariance**glszm_LowGrayLevelZoneEmphasisNS p = 0.28
glcm_Id***glszm_SizeZoneNonUniformityNS p = 0.82
glcm_Idm***glszm_SizeZoneNonUniformityNormalizedNS p = 0.05
glcm_Idmn***glszm_SmallAreaEmphasis**
glcm_Idn***glszm_SmallAreaHighGrayLevelEmphasisNS p = 0.58
glcm_Imc1NS p = 0.11glszm_SmallAreaLowGrayLevelEmphasisNS p = 0.35
glcm_Imc2*glszm_ZoneEntropy**
glcm_InverseVariance***glszm_ZonePercentage**
glcm_JointAverage*glszm_ZoneVarianceNS p = 0.09
glcm_JointEnergy*Neighbouring Gray Tone Difference Matrixngtdm_Busyness***
glcm_JointEntropy*ngtdm_Coarseness***
glcm_MCC***ngtdm_Complexity*
glcm_MaximumProbability*ngtdm_Contrast**
glcm_SumAverage*ngtdm_Strength*
glcm_SumEntropyNS p = 0.85IQ wavelets IQwavelet_global ***
glcm_SumSquaresNS p = 0.33 IQwavelet_local***
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Moummad, I.; Jaudet, C.; Lechervy, A.; Valable, S.; Raboutet, C.; Soilihi, Z.; Thariat, J.; Falzone, N.; Lacroix, J.; Batalla, A.; et al. The Impact of Resampling and Denoising Deep Learning Algorithms on Radiomics in Brain Metastases MRI. Cancers 2022, 14, 36. https://doi.org/10.3390/cancers14010036

AMA Style

Moummad I, Jaudet C, Lechervy A, Valable S, Raboutet C, Soilihi Z, Thariat J, Falzone N, Lacroix J, Batalla A, et al. The Impact of Resampling and Denoising Deep Learning Algorithms on Radiomics in Brain Metastases MRI. Cancers. 2022; 14(1):36. https://doi.org/10.3390/cancers14010036

Chicago/Turabian Style

Moummad, Ilyass, Cyril Jaudet, Alexis Lechervy, Samuel Valable, Charlotte Raboutet, Zamila Soilihi, Juliette Thariat, Nadia Falzone, Joëlle Lacroix, Alain Batalla, and et al. 2022. "The Impact of Resampling and Denoising Deep Learning Algorithms on Radiomics in Brain Metastases MRI" Cancers 14, no. 1: 36. https://doi.org/10.3390/cancers14010036

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop