1. Introduction
Testicular germ cell cancer (TGCC) account for 1–3% of all cancers in men, being most common in the age group 15 to 35 years and considered curable in more than 90% of cases [
1,
2]. TGCCs typically metastasize in a predictable way, via lymphatic drainage and retroperitoneal lymph nodes (RPLN) are the most common sites for metastatic disease [
2,
3,
4]. The lungs are the most frequent sites of hematogenous spread of non-seminomatous GCC (NSGCC).
Proper staging and follow-up of testicular cancer patients is essential for planning the treatment and avoiding unnecessary interventions. Approximately 70–80% of seminomas and one third of NSGCC have clinical stage I disease at diagnosis (there is no evidence of spread to either lymph nodes or other organs) [
5,
6]. Since clinical stage Ι TGCCs without risk factors have a low risk of relapse, most patients are offered surveillance by clinical examination, serologic studies, and cross-sectional imaging techniques every 3–6 months for the first year and then twice yearly [
7]. Because of the potential harmful effects of irradiation in these young men, the follow-up must be planned carefully by keeping radiation doses “as low as reasonably achievable” and some new protocols include low-dose computed tomography (CT) examinations [
8].
Magnetic resonance imaging (MRI) is known as a non-irradiating alternative to CT, providing high quality images of the whole body with excellent anatomic detail and functional information by adding diffusion weighted imaging (DWI) sequences [
9]. Whole body MRI protocols (WB-MRI) have developed and used successfully in numerous oncological applications [
10,
11]. Yet, the role of WB-MRI in the staging and follow up of TGCC is not well studied.
We developed a fast (30-min) WB-MRI protocol without contrast material for the detection of lymph node (LN) metastasis in patients with TGCC. This study aimed to prospectively compare the diagnostic value of thoraco-abdominopelvic CT and WB-MRI in the LN staging of TGCC patients.
2. Materials and Methods
2.1. Study Population
This prospective single-center study was approved by the institutional ethical board (2017/30MAI/306). Informed consent was obtained from all patients. Forty-four patients entered the study between July 2017 and October 2020. One patient was excluded due to claustrophobia. Inclusion and exclusion criteria were: (i) patient ≥ 18 years old with first diagnosis of TGCC or patients with suspected or confirmed relapse, (ii) contraindications were incompatible MRI medical devices.
2.2. Imaging Protocols
The patients were imaged with thoraco-abdominopelvic CT and WB-MRI and then underwent a second pair of examinations depending on the type of cancer and disease stage between 2 and 12 months after treatment. CT and MRI examinations were performed within less than 30 days.
2.2.1. Computed Tomography (CT)
All patients underwent a thoraco-abdominopelvic exam on an IQON
® Spectral CT scanner (Philips Healthcare, Best, The Netherlands) with a 120 kVp voltage and adapted mAs according to dose modulation. Portal phase images were obtained after intravenous contrast medium (100 mL, Xenetix 350
® Guerbet, Villepinte, France) in all patients, except for one with history of serious allergic reaction to iodinated contrast material. One of the patients received positive oral contrast material diluted in mineral water to a concentration of 5% (Telebrix Gastro
®, Guerbet, Aulnay-sous-Bois, France). Patients were imaged in the supine position from the pulmonary apex to pelvic symphysis in one spiral acquisition. Radiation dose was evaluated using patient protocols [Mean volume computed tomography dose index (CTDIvol) and Dose length product (DLP)] available in our picture archiving and communication system (PACS, Carestream; Carestream Health, Rochester, New York, NY, USA). The effective dose in millisievert (mSv) was calculated using a “k factor” of 0.015, according to recommendations from International Commission of Radiological protection (ICRP) [
12].
2.2.2. Whole-Body Magnetic Resonance Imaging (WB-MRI)
All patients were imaged with a 3.0-Tesla MRI unit; 32 patients with a Philips Ingenia (Philips Healthcare, Amsterdam, The Netherlands) and 11 patients with a General Electric Signa Premier (General Electrics, Boston, MA, USA). The WB-MRI protocol consisted of an axial Turbo Spin Echo (TSE) T2 weighted sequence (T2W), a breath-hold coronal mDIXON three-dimensional (3D) gradient echo (GRE) T1-weighted (T1W) sequence, a free breathing axial DWI (b-values: 0, 50, 150, 1000 s/mm
2) sequence and Apparent Diffusion Coefficient (ADC) maps. All sequences covered the body from the vertex to midthighs (4 stacks). The detailed MRI protocol is shown in
Table 1. Sequence acquisition time was less than 20 min and scan time—including positioning of the patient and calibration—was 30 min.
2.3. Image Analysis
Two radiologists [reader 1 (R1) and reader 2 (R2), with 15 and 13 years of experience, respectively] blindly and independently reviewed the CTs and WB-MRIs for the presence of metastatic disease. They reviewed the CT and MR images in random order and in separate sessions at least 2 weeks apart, in order to prevent recall bias. Imaging criteria for the characterization of LNs as abnormal were as follows: (i) short-axis diameter larger than 10 mm on the anatomic sequences, (ii) loss of the normal oblong kidney bean shape and of the fatty hilum, and (iii) irregular outline [
13].
The following LN regions were considered: distant lymph nodes (DLN) supra-diaphragmatic (cervical, axillary, hilar and mediastinal), scrotal and inguinal LNs and RPLN (para-caval, pre-caval, inter-aortocaval, pre-aortic, para-aortic, right and left supra-hilar, right and left internal/external/common iliac, right and left gonadal) [
14].
Although this was not the primary purpose of our study, the readers also noted for suspicious lung, liver, kidney or other metastatic lesions.
The effectiveness of each WB-MRI sequence in detecting abnormal LNs (per-patient analysis) was assessed qualitatively. To this purpose, a five-point Likert scale was used for each patient for a given sequence. Likert scale is a rating scale used to collect responders’ attitudes and opinions. In this study Likert scale is used to assess image quality. Radiologists rated their overall impression concerning the effectiveness of every WB-MRI sequence for detection of abnormal LNs as follows: 0, very poor; 1, poor; 2, fair; 3, good; 4, very good; 5, excellent. Then a total score (TS) was calculated resulting from the sum of the scores for each sequence.
Quality of WB-MR images was assessed by measuring both the signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) by placing regions of interest (ROIs) within the lesions and the reference tissues. As reference regions were chosen the retroperitoneal (infra-renal) and the gluteal subcutaneous fat.
2.4. Reference Standard
For 3 patients in whom surgical lymphadenectomy was performed, the histopathologic examination was used as a reference standard. For the remaining 40 patients, in whom biopsy or surgical resection of lesions was not performed, a best valuable comparator (BVC) was used as reference standard. BVC was established by a panel of experts, reviewing all baseline imaging studies along with the prospective systematic imaging, clinical and biologic follow-up of at least 1 year. The panel consisted of 4 radiologists (VP, SVN, LA, FL), 2 urologists (BT, JVD) and 1 oncologist (JPM), who determined in consensus the final diagnosis for each patient, i.e., metastatic or not.
2.5. Statistical Analysis
The primary endpoint was to assess the performance of each imaging protocol for the detection of metastasis on a per-patient basis. The protocols assessed were: CT, total WB-MRI protocol including all sequences (
totalWB-MRI = T1W + T2W + DWI), and two fast protocols including only one anatomic sequence and DWI (T1W + DWI and T2W + DWI). The analysis was performed for both readers independently. True Positive (TP), False Positive (FP), True Negative (TN), False Negative (FN), Sensitivity (Se), Specificity (Sp), Predictive Accuracy defined as Acc = (TP correctly classified + TN correctly classified)/(P + N), as well as the agreement measurement with the reference standard (based on Gwet’s AC1 coefficient) were reported. 95% confidence intervals (CI) on Se, Sp, Acc and AC1 were also provided [
15].
An Exact test for paired data was performed to assess differences in Acc between CT and totalWB-MRI. As 3 lesion sites (RPLN, DLNs and other lesions) and two readers were considered, a Bonferroni correction was applied and a significance level of p < 0.0083 was considered for this test.
Inter-reader agreement was assessed for each imaging protocol using Gwet’s AC1 coefficient. Strength of agreement was interpreted as follows: slight (AC1 ≤ 0.2), fair (0.2 < AC1 ≤ 0.4), moderate (0.4 < AC1 ≤ 0.6), good (0.6 < AC1 ≤ 0.8) and very good agreement (AC1 > 0.8).
MR image quality was first assessed using a 5-point Likert scale. Measurements of SNR in the metastatic lesions and CNR (taking fat as reference) were then performed. A Kruskal-Wallis analysis followed by a paired Wilcoxon test was used to assess potential statistical differences in SNR/CNR between T1W, T2W and DWI sequences. A significance level of p < 0.0167 was considered for this test.
All data were analyzed using the Statsdirect statistical software version 3.1.20.
4. Discussion
This prospective study assessed the performance of WB-MRI for the detection of lymph node metastases in patients with TGCC. totalWB-MRI including DWI + axial T2W and DIXON T1W sequences without contrast material administration, had a high predictive Acc (at least 95%, all sites of lesions considered), similar to that of CT (at least 93%, all sites considered). The strict overlap of 95% CIs associated with Acc suggests that there was no statistically significant difference in Acc between totalWB-MRI, T1W + DWI and T2W + DWI, regardless of the reader. All RPLN patients were detected by both thoraco-abdominopelvic CT and WB-MRI. Two and three patients with DLN were missed by R2 and R1 respectively in CT readings.
Reproducibility of the measurements was very good, regardless of the WB-MRI imaging protocol. However, as higher Likert score and SNR were observed in DWI, followed by T2W and T1W sequences, a faster MRI protocol including on T2W and DWI only may be sufficient for the LN staging of the patients with TGCCs.
TGCC primarily affects young men and carry excellent prognosis, but these patients require close follow-up by multiple cross-sectional imaging studies [
17]. The benefits of frequent CTs should be weighed against the effects of radiation exposure and secondary malignancies [
18]. Moreover, it is known that the use of intravenous iodinated contrast media bears a risk of adverse reactions and contrast-induced nephropathy [
19]. Our WB-MRI protocol including DWI is a fast, non-irradiating technique which also avoids the need for intravenous contrast material.
The retroperitoneum is the commonest site of spread of TGCC and our study showed that WB-MRI, compared to the reference standard, missed neither RPLN nor DLN and can be used instead of thoraco-abdominopelvic CT. It has been demonstrated that MRI with DWI is as good as thoraco-abdominopelvic CT in detection of RPLN metastases [
20]. An earlier study compared abdominal MRI to CT for detection of retroperitoneal metastasis in TGCC and demonstrated a sensitivity ranging from 78% to 96% [
21]. Unlike our report, the aforementioned study did not use WB-MRI, focused to retroperitoneum and did not provide data for other metastatic LN sites. Moreover, the use of DWI sequences in our study may explain the better detection rates of WB-MRI in our study, as DWI is a powerful tool to detect tumoral involvement and confer added value data to anatomic WB-MRI sequences. Mosavi et al. demonstrated the feasibility of WB-MRI with DWI for the follow-up of patients with testicular cancer and the added value of DWI [
22]. A recent study showed that unenhanced MRI of the abdomen and pelvis is an adequate tool for surveillance of stage I testicular cancer [
23].
Chest staging is important, as TGCC may spread to supradiaphragmatic LNs and lung. For patients with NSGCC, there is a risk of pulmonary metastasis without retroperitoneal disease [
24]. Our WB-MRI protocol showed excellent sensitivity and specificity in detecting supra-diaphragmatic and in general DLNs. Detection of lung nodules was not the primary focus of our study, as it is known that the conventional MR sequences still underperform in lung imaging. Our MRI protocol was not optimized for imaging of the lung parenchyma and, as expected, WB-MRI was not as sensitive as CT for the detection of lung nodules, but it was as specific. The addition of optimized sequences for the imaging of lung parenchyma, as 3D Ultrashort Echo Time or other short T2* sequences to WB-MRI, might allow better detection of lung nodules [
25,
26,
27].
Detection of scrotal contralateral metachronous tumors is another caveat of the follow-up in TGCC [
28]. Our protocol could fit the minimal requirement for detection, but not characterization of testicular lesions with the addition of a coronal T2W sequence, thus limiting the need for scrotal ultrasound [
7,
29].
According to the current guidelines, abdominal cross-sectional imaging is recommended for the follow up of TGCC patients after curative treatment, whereas chest imaging should be reserved only for patients at higher risk for thoracic involvement [
7,
17]. To put our findings into context, WB-MRI can safely replace abdominopelvic CT for the follow up of patients with TGCC after curative treatment. As we showed in the radiation dose analysis, this policy spares the patients a substantial 25 to 30.6 mSv of total radiation over a 5-year follow-up. An additional chest CT is warranted in selected patients with higher suspicion for lung metastases (lung metastases at diagnosis, symptomatic patients, patients with RPLN and patients with abnormal X-ray) [
17,
30]. In these cases, a low-dose CT protocol could limit the radiation exposure.
The small number of patients is a limitation of our study and larger prospective studies are needed to confirm our findings. The use of the multidisciplinary and multimodality approach as a reference-standard, in the absence of histologic proof, could be a weakness of our protocol, as it imposes a risk for differential verification biases among the patients. However, it is a standard approach in this kind of studies as the use of biopsy or surgery to obtain histopathological results would not be ethically justifiable nor feasible in clinical practice. We did not use optimized sequences for the imaging of lung parenchyma. However, conventional MR imaging is still considered suboptimal for lung imaging and was not a primary goal in our study. Finally, WB-MRI and in general MRI exams, are less available, more time consuming and costly than CT. The availability of MRI units relative to the number of inhabitants depends on the country. Moreover, to obtain WB-MRI exams, specific antennas, software, and of course specialized personnel are needed. As a result, there are countries, mainly in the developing world, in which the use of MRI is more limited due to the lack of infrastructure and the important costs, and the CT is a much more approachable imaging technique. A major future goal is to reduce the duration of the MRI acquisitions without sacrificing the quality of image.