1. Introduction
Pediatric-type diffuse gliomas (PDGs) comprise a heterogeneous group of central nervous system tumors, ranging from low-grade to highly malignant tumors, characterized by different biological behaviors and molecular profiles when compared with their adult counterparts [
1].
Among the different diagnostic techniques, magnetic resonance imaging (MRI) represents the current imaging modality of choice in the evaluation of pediatric gliomas not only because it is safe due to the absence of radiation exposure but also for its spatial and contrast resolution [
2]. Beyond conventional MRI, advanced MRI techniques including perfusion weighted imaging (PWI), diffusion tensor imaging (DTI), and MR spectroscopy (MRS) can also add hemodynamic, microstructural, and metabolic information under a multiparametric approach. Indeed, multiparametric MRI has been demonstrated to provide valuable information for tumor characterization and grading, treatment planning, and post-treatment surveillance [
3,
4].
In parallel with the increased application of multiparametric MRI, a growing body of evidence suggests that positron emission tomography (PET) imaging with radiolabelled amino acids can provide further and complementary insights into pediatric glioma evaluation complementing MRI, as recently outlined in the Joint EANM/SIOPE/RAPNO practice guidelines/SNMMI procedure standards for imaging of pediatric gliomas using PET with radiolabelled amino acids and [
18F]FDG [
5]. Of note, to reduce concerns regarding the radiation dose associated with PET/CT, which represents a potential limitation of this technique, it is essential to optimize and harmonize the acquisition protocols of pediatric PET/CT to use reasonably low radiation doses while maintaining acceptable image quality and duration [
6]. Among amino acid PET tracers, [
18F]F-DOPA has demonstrated a high potential in defining tumor grade and outcome in PDGs; furthermore, it can be used for biopsy planning and treatment surveillance (e.g., discrimination between disease progression and treatment-related changes) [
7,
8,
9,
10,
11,
12,
13].
Visual evaluation of the PET scan by a trained nuclear medicine physician is often complemented by semi-quantitative measures that estimate the tracer uptake in PET scans. Standardized uptake value (
SUV) represents the most used technique due to the simplicity of its application and its efficacy in assessing response to treatment [
14]. Yet, many different factors can affect the
SUV, for instance, the time interval between the acquisition and scanning, the image acquisition setting, and the algorithm used to define the tumor. Therefore, it is difficult to compare
SUVs acquired in different centers when acquisition and analysis protocols are not shared and agreed upon [
15,
16]. Great efforts are currently being made to standardize the computation of the
SUV to enable comparison across and between centers [
17].
The most common ways to report SUVs are the SUVmean and the SUVmax, which are the mean or maximum SUV of all voxels within a target region of interest (ROI). The ratio between two SUVs, calculated between a target ROI and a reference one, is named as the standardized uptake value ratio (SUVr). ROIs are often manually drawn on a single, bidimensional slice of the scan, which is a non-trivial, time-consuming task. The size and placement of ROIs, for instance, may be inadequate to establish the pathological and healthy tissue uptake references. Furthermore, the definition of the ROIs can make the procedure largely operator dependent and the use of a single bidimensional slice of the image, without exploiting the information coming from the volume of the lesion, can cause a loss of information. All these issues are likely to influence the semi-quantification, and their effect could potentially be mitigated by the introduction of automatic methods to support clinicians in the estimation of the SUVr.
Based on these considerations, the aim of this study was to develop a computer-aided, semi-quantification method on [
18F]F-DOPA PET/CT images co-registered with MRIs of PDGs. The proposed method mimics the manual procedure described by Morana et al. [
8].
2. Materials and Methods
Briefly, the study by Morana et al. [
8] relies on the human assessment performed on a fused PET/MRI visualization. A square-shaped
ROI (side 1.8 cm) was drawn manually on the axial section of the tumor area exhibiting the highest [
18F]F-DOPA uptake. In cases where the increase in uptake was not apparent, a
ROI of the same size was manually placed on the bulk of the lesion according to standard diagnostic criteria. The radiotracer concentration in the
ROI was then normalized to the injected dose per patient’s body weight, and the
SUVmax was calculated for each lesion (
SUVmax,T; g/mL). A same-sized
ROI was mirrored in the matching contralateral hemisphere, and another was drawn over the contralateral striatum, centered on the putamen.
SUVmax values were calculated for each
ROI (
SUVmax,N and
SUVmax,S; g/mL). The lesion-to-normal-tissue uptake (
tn) and lesion-to-striatal uptake (
ts) ratios were calculated by dividing the
SUVmax,T by the
SUVmax,N and by the
SUVmax,S, respectively.
We sought a semi-quantification algorithm which does not need to manually define uptake and reference ROIs of fixed size and shape; rather, we explored a more robust way to locate data-driven regions that represent the uptake of unaffected tissue, thus improving on the normalization values of the lesion. Similarly to the procedure described above, we looked for two normalization values representing the average uptake of the healthy tissue and striatum, respectively. These values were used to normalize the whole image, from which we extracted the two SUVr corresponding to the scores estimated with the manual procedure. In the development of the method, we were blind to the manual SUVr and to the clinical data (e.g.,: survival, tumor grading) that were used only retrospectively to conduct the validation analysis.
Some minimal assumptions were made in the definition of the automated approach. First, we assumed that gliomas were characterized by an increased uptake of [
18F]F-DOPA [
18]. We also assumed that the approximate volume of each tumor was significantly smaller than the brain parenchyma volume. Additionally, we assumed that the gliomas were predominantly monolateral. The latter hypothesis was visually verified on the analyzed dataset. Therefore, the regions that best represented the uptake of the healthy tissue were obtained starting from the contralateral, non-lesioned brain hemisphere.
2.1. Patients
Our dataset consisted of 18 patients: 7 females and 11 males aged from 5 to 17 (µ = 10.3 ± 3.9) who underwent PET study due to the presence of an infiltrating lesion on conventional MRI. The MRI and [
18F]F-DOPA PET were acquired within a 2-week window. Overall, there were 9 pediatric-type diffuse low-grade gliomas (PLGG; all with WHO grade II [
1,
5]) and 9 pediatric-type diffuse high-grade gliomas (PHGG; 4 patients with WHO grade III and 5 with WHO grade IV [
1,
5]). Patients’ demographics and lesion characteristics are summarized in
Table 1.
2.2. Image Protocol
[18F]F-DOPA was purchased from a commercial supplier (IASOdopa; IASON Labormedizine Ges.Mbh & Co., KG, Graz-Seiersberg, Austria). Volumetric scans were acquired 20 min after the injection of 185 MBq of a median injected activity of [18F]F-DOPA, with a dedicated PET/CT system (Discovery ST; GE Healthcare, Milwaukee, WI, USA) using the three-dimensional mode with a scanning time of 30 min. A non-diagnostic low-dose CT scan (120 kV, 80 mA, 0.6 s per rotation) was used for attenuation correction. The interpretation of all the [18F]F-DOPA PET scans involved a qualitative analysis based on visual assessment, as well as a semi-quantitative analysis using the SUVr.
MRI studies were performed on a 1.5 T magnet (Intera Achieva, Philips, Best, The Netherlands). The routine brain MRI examination consisted of axial fluid attenuation inversion recovery (FLAIR) and T1-weighted (T1w) images (axial, coronal and sagittal) acquired after gadolinium chelate bolus administration (0.1 mmol/kg).
2.3. Spatial Normalization
The target PET image was registered to the corresponding MRI T1w with a 7-parameter transformation and the mutual information metric. An automatic multi-step registration procedure involving three subsequent steps of increasing accuracy (linear registration with 6, 7, and 12 degrees of freedom) was applied to map the T1w onto the MNI space (isotropic spacing and voxel dimension of 1 × 1 × 1 mm, see
Figure S1; Supplementary Materials). The resulting transformation was applied to the corresponding PET image which was thereby mapped to MNI coordinates. Hereinafter, we refer to the PET mapped on MNI coordinates as P and each voxel of P will be denoted as
p (
p ∈ P).
Once all the MR images (e.g., T1w and FLAIR) and the PET scans were registered onto the MNI space, we proceeded with a multi-step, post-processing stage to find the uptake values that could be representative of healthy, striatum, and tumorous tissue. The spatial normalization pipeline was implemented with the Insight Toolkit (ITK, version 4.12).
2.4. Identification of Brain Non-Affected Hemisphere
The following steps were all implemented in Matlab (version R2017b). To discriminate between injured and healthy tissue, we trained a support vector machine (SVM) over all the FLAIR images of the dataset. Indeed, FLAIR has proven to be effective in tumor delineating even in PLGG [
19]. Any single scan was independently parcelled into ~300 districts by means of an intensity-based K-means algorithm. Among all the generated districts (~18 × 300), a randomized sample of 360 regions was labelled as healthy or pathological by a trained neuroradiologist. We then selected 300 volumes to form a balanced set of affected and non-affected regions.
In analogy to what was accomplished by Soltaninejad et al. [
20], the SVM was fed with a set of 14 texture features computed on each region (i.e., average, standard deviation, variance, mean and median of the absolute deviation, skewness, kurtosis, maximum, minimum, median and mode of the intensity values, range, interquartile range, and entropy). The classifier was then tested on the whole dataset giving out a three-dimensional mask (V
t), for every image, containing the affected volume. The position of V
t allowed us to identify the hemisphere in which each lesion was predominantly located. All these steps are illustrated in
Figure 1.
Notably, the SVM classification was not used to segment the tumor. Rather, it was only used as seed to define the affected and non-affected brain hemisphere. Therefore, segmentation accuracy is not a concern as any coarse segmentation would serve the purpose.
2.5. Reference Volume Extraction
The contralateral hemisphere (assumed as mostly unaffected) was divided into 5 lobes (V
i, i = 1, …, 5) by application of the MNI atlas, with the exclusion of the pons and the striatum. Since all V
i belong to the non-affected hemisphere, we expected that they could be used to estimate the general intensity distribution of the whole unaffected tissue (reference). From each V
i we computed a reference tissue region M
i (i = 1, …, 5): we applied a moving-window filter that compares the intensity statistics in a neighborhood I(
p) surrounding each voxel
p of the brain, with that of V
i. For simplicity we chose a cubic shaped sliding window centered in
p (I(
p) = (2l + 1)
3 with l = 3 voxels). To efficiently describe the intensity distribution of V
i, we sampled its histogram into K = 10 quantile bins. The set of voxels whose intensity distribution in I(
p) is not different from V
i, are assigned to M
i. This comparison has been accomplished by applying a threshold to an χ
2-like function
Y (Equation (2)). The threshold on
Y was heuristically set to 10
−3.
where
ej is the reference intensity distribution (the one from V
i, intensity quantile
j) and
oj is the observed distribution from the sliding window I(
p).
The final mask M
n was then computed by averaging the five M
i. In the end, each PET P was provided with a mask M
n containing the normal, reference, tissue. An infographic of the workflow, for the extraction of M
n, is summarized in
Figure S2a (Supplementary Materials). Each mask M
n was usually a large, connected area consisting of non-affected tissue. Its intensity characteristics showed a mono-modal distribution, which is coherent with the removal of districts showing significantly different uptake (
Supplementary Materials, Figure S2b). The arithmetic mean c
n of the intensities in M
n was chosen to be the representative of the normal tissue. Finally, the normalized image P
n was obtained dividing the intensity of each voxel of P by c
n. An analogous procedure was carried out to normalize P to the activity of the normal striatum. Briefly, the uptake of the striatum in [
18F]F-DOPA scans allowed the
Y-filter to operate a coarse segmentation of subcortical structures. Knowing the position of the striata from the spatial normalization to MNI coordinates, we could outline a volume M
s containing points discarded from
Y which in turn contained the whole striatum located in the non-affected hemisphere. From M
s a representative value of the striatal activity c
s was extracted. In
Figure 2, the steps leading to the definition of M
s are summarized. Target PET P voxels were then divided by c
s, and the resulting P
s is the PET scannormalized to the mean activity of the normal striatum.
2.6. Maximum SUVr Extraction
The maximum intensities within the lesions, in P
n and P
s, are conceptually analogous to the
TN and
TS derived with the manual procedure (Equation (1)). If we denote by M
c all the voxels of the brain except M
n and M
s, the mask obtained contains both the striatum within the lesioned hemisphere and the glioma. To compare the results of the two procedures, it was necessary to identify a value corresponding to the local maximum intensity of the lesion, and this search was limited to M
c. There are two criticalities in the definition of this value. The first is related to the presence of the striatum in the volume M
c. This subcortical structure presents a tracer uptake that could be comparable to that of the glioma. Consequently, it is possible for a minimum of two hyperintense areas to be observable, one of which may remain unaltered by the tumor and where the highest degree of uptake may be localized. The key point here is that it was not possible to discriminate the increased uptake of the healthy striatum from the increased uptake of lesions. It was also impossible, relying only on [
18F]F-DOPA PET, to determine whether the infiltration of the striata by the glioma had clearly occurred.
Figure 3 illustrates two examples of this heterogeneity: the tumor could present with less, more, or as much activity as the striatum, which could be spared by the tumor or not.
This consideration led us to abandon the idea of providing a single maximum within the lesion from Pn and Ps. Therefore, the adopted solution was to provide two maximum values: one located within the striatum (min) and one located outside of it (mout). Two scenarios are possible: if mout ≥ min, then mout is selected to represent the maximum activity of the glioma, otherwise the involvement of the basal ganglia by the tumor must be evaluated. When mout < min, two options arise: (1) infiltration of glioma in the basal ganglia is evident, and therefore min can be considered a reliable marker of tumor activity, or (2) if the infiltration is unclear, it is reasonable to regard mout as the most representative value, which is frequently observed in low-grade gliomas. To find min, it was necessary to identify a binary mask (Mc,s) containing the striatum inside Mc. This operation is conceptually similar to the one performed to obtain Ms, but in this case, the raw segmentation operated by the Y-filter was not sufficient. Indeed, we needed to consider that extended gliomas can deform surrounding tissues, with subcortical structures included. Thus, we chose to create Mc,s by inflating the raw mask obtained through the filter to be entirely encircling the striatum located in the affected hemisphere. The expansion of Mc,s was influenced by the intensity of neighboring voxels, implying that Mc,s could potentially incorporate affected tissue. The fraction of Mc not containing Mc,s represents affected tissue, except for the striatum in the impacted hemisphere; we refer to this volume as Mc,n (Mc = Mc,s ∪ Mc,n). The most appropriate value between min and mout was then chosen by expert clinicians taking into consideration their coordinates and the extension of each glioma.
The second issue concerned the choice of a single voxel value as maximum intensity; indeed, the intensity of just one voxel is prone to fluctuation and noise. Thus, it was decided to calculate local averages using a moving window approach and to consider them as representative of tumor malignancy. The size of the cubic moving window was set to be comparable to those of the
ROIs used in the manual procedure (l = 2). Values m
out and m
in were consequently the highest local means computed in M
c,n and M
c,s, respectively. Let m
1, …, m
n be the mean intensities in the local neighborhood I(
p1), …, I(
pn) with
p1, …,
pn ∈ M
c,s, then m
in = max{m
1, …, m
n }. Similarly, m
out = max{m
1, …, m
n } where
p1, …,
pn ∈ M
c,n. The whole procedure that leads to the definition of both m
in and m
out is summarized in
Figure 4.
The selection of the local mean was substantiated by the consistent observation that the examined tumors demonstrated a maximum uptake in the central region, which gradually declined towards the periphery.
This entire procedure was performed on both Pn and Ps, providing two scores that reflected the maximum activity in the diseased tissue normalized, respectively, by the mean activity in the healthy tissue (tn) and by the mean activity of the striatum (ts).
2.7. Evaluation Metrics
The correlation between automated and manual results has been quantified by the Pearson correlation coefficient ρ between the scores obtained with the manual (TN, TS) and the automatic (tn, ts) procedure. We also verified the internal coherence of each semi-quantification procedure by comparing the residual dispersion of the linear regression of ts∼tn with that of the model TS∼TN.
An unpaired t-test was used to test the scores with respect to the lesion grading for the purpose of evaluating the potential diagnostic relevance.
The potential prognostic value of tn and ts was assessed by performing a survival analysis with the Kaplan–Meier (KM) method that estimates the survival function. A cut-off value tn,cut for tn scores was calculated to maximize the sensitivity in the classification of prognosis: tn,cut is defined as the value that minimizes the rate of false negatives and all subjects with tn < tn,cut survived. To assess the discriminatory capability of the tn,cut in stratifying patients with distinct prognoses, we compared the KM curves of two subsets based on their tn. The same procedure (cut-off calculation and KM curve analysis) was repeated with the ts values.
3. Results
As evidenced by the correlation coefficients, the values
tn and
ts are comparable to those obtained with the manual procedure (Equation (1)). Strong correlation was observed between
TN and
tn (ρ = 0.814,
p < 10
−4), and even higher values of ρ were observed between
TS and
ts (ρ = 0.93,
p < 10
−4). In
Figure 5, the linear regression models of
tn~
TN (a) and
ts~
TS (b) are displayed.
The dispersion of the residuals, used to evaluate the internal consistency of the two methods, is shown in
Figure 6 referring to the manual and automated approach. Compared to the manual method, the automated procedure produced less dispersion of the residuals (σ = 0.181 and σ = 0.096, respectively).
As
Figure 7 displays, significant differences were observed for both
tn and
ts between PLGG and PHGG (unpaired
t-test,
p ≤ 10
−4). Analogous results were obtained comparing the values provided by the manual procedure with the degree of malignancy (
Supplementary Material, Figure S3).
When assessing the prognostic value of the two automatically derived scores, it was observed that elevated
tn and
ts values were linked to reduced overall survival, indicating poorer prognoses (
p < 0.001, log-rank test; see
Figure 8 and
Figure 9). The same survival analysis was replicated with the maximum
SUVr scores obtained with the manual procedure. Subjects were divided based on their scores and a KM curve was estimated for each group (
Supplementary Materials, Figure S4). It was also observed in this case that the overall survival was significantly shorter in patients with higher values of
TN and
TS (
p < 0.001, log-rank test).
4. Discussion
Approaches for analyzing [
18F]F-DOPA PET/CT images, either semi-automated or fully automated, have been developed for use in Parkinson’s disease, where they can benefit from prior knowledge of the target and reference regions. A variety of techniques can be employed, including the extraction of radiomic features from the image, the application of classical machine learning methods, and the calculation of the striatal-to-occipital ratio [
21,
22]. Defining target and normalization regions in oncological PET imaging is a challenging task due to the inherent variability and the methodological drawbacks of common manual or threshold-based procedures (e.g., time consumption, low reproducibility, scanner type, reconstruction algorithm, and image noise [
23]). This also certainly applies to the quantitative approach to [
18F]F-DOPA PET [
24,
25,
26]. However, as far as we know, there is currently no method for extracting unbiased quantitative information from neuro-oncological [
18F]F-DOPA, whereas methods for such extraction are available for adults [
18F]FET PET analysis [
27,
28]. Furthermore, a more in-depth analysis of the effect of reference region selection to improve analysis robustness has been conducted only on [
18F]FET PET reporting that the selection of small bidimensional or three-dimensional
ROIs is associated to significant background SUV changes that may be attributed to the potential differences in size, the inadequate representation of various tissue types, as well as the imprecise and arbitrary placement [
29]. To the best of our knowledge, this is the only approach that allows for a significant reduction in intra- and inter-reader variability in defining the reference region on [
18F]FET PET of adults with glioblastoma multiforme. This method, developed by Brighi and colleagues, involves the generation of a mirrored
ROI of the lesion in the contralateral hemisphere with respect to the anterior–posterior midline [
28]. Nonetheless, this approach may not be appropriate for patients who have tumor lesions significantly affecting regions located along the anterior–posterior midline, patients whose tumor growth has substantially compromised anatomical symmetry in the contralateral lobe, or those with multifocal bilateral diseases.
The method proposed in the present study mimics the manual routine adopted in clinical practice [
8]. This automated approach could be useful in the definition of the
ROIs drawn on tumor, striata, and normal brain parenchyma, in a [
18F]F-DOPA PET scan, thus speeding up computation and reducing operator dependence for lesion-to-normal-tissue and lesion-to-striatal uptake ratios (
TN and
TS, respectively).
To validate this approach, we first verified the correlation of
tn and
ts with the
SUVr computed with the manual procedure (
TN and
TS, respectively). An excellent correlation exists between the
TS and
ts, while a weaker, but still high, correlation was reached between
TN and
tn. The residual dispersion of the linear regression of
TS and
TN is greater than the same dispersion computed for
ts and
tn, suggesting that the scores issued from the automated procedure are more consistent than the couple of manually generated values. The consistency measured by the residual analysis assumes that normal tissue and normal striatum are in close relationship. That is to say that both are proxies of a tumor-free region, and hence, they can be thought of as the same measure. Moreover, the difference between the
SUVr values is less evident between the
TS and
ts in determining a simple shift towards lower values for manual scores. This finding may be explained by the positioning of the reference
ROI on the striatum body in the manual procedure. Indeed, the fixed-sized
ROIs used to compute the T
S have always been placed in the center of the putamen where the uptake of the [
18F]F-DOPA is higher than other cerebral tissues increasing the stability of these values [
30]. Thus, we imply that (even though
SUVmax is less sensitive to
ROI definition) the 1.8 × 1.8 cm squared
ROI used in the manual procedure to obtain the
TN can be more prone to uncertainty than the respective normal tissue reference c
n obtained from the extended volumes M
n. Moreover, our data highlighted the ability of the
tn and
ts scores to distinguish between PLGGs and PHGGs in this study population. These results confirm the literature findings on [
18F]F-DOPA uptake to discriminate between high- and low-degree lesions [
7,
8,
9,
10,
11].
The survival analysis showed significant differences in patient prognosis for the subjects characterized by high tn, ts with respect to those having lower values.
The same analysis, performed on TN and TS, yielded comparable results suggesting that, for this population, a cut-off for discriminating between patients with significant differences in prognosis could be derived from both the automated and manual scores.
Study Limitations
The inadequacy of the automatic procedure to assess striatal involvement of gliomas, which is beyond the purposes of the method itself, led to the decision of providing two local maxima for the
SUVr estimation. However, a study conducted on pediatric patients with PDGs suggested that [
18F]F-DOPA PET does not appear to be a main limitation in the evaluation of tumors extending into the basal ganglia but fused [
18F]F-DOPA PET/MRI can be used to assess the striatal involvement [
31]. Thus, MRI/PET co-registered scans coupled with the positions of two local maxima are adequate to allow a correct identification of the most appropriate site for
SUVr estimation. The proposed method was tested on a small sample, and needs to be evaluated on a greater, multicentric dataset. However, our cohort included only patients with supratentorial lesions presenting an infiltrative pattern on MRI who underwent [
18F]F-DOPA PET, which is rather uncommon, especially in the pediatric population. The automatic procedure was implemented without regard to tumor segmentation, rather, we were only interested in obtaining sufficiently large, connected volumes to ensure a stable representation of brain healthy tissue uptake. Thus, all the parameters of the filter (i.e., the size of the sliding window, the number K of quantiles used to sample the histogram) were estimated according to this principle, although their range could be further optimized.