Next Article in Journal
Changes of Atherosclerotic Plaque in Cerebral Artery Stenosis According to High-Resolution MR Imaging
Next Article in Special Issue
An Efficient Multi-Scale Convolutional Neural Network Based Multi-Class Brain MRI Classification for SaMD
Previous Article in Journal
Quantitative Imaging in Oncology
Previous Article in Special Issue
AI Denoising Significantly Enhances Image Quality and Diagnostic Confidence in Interventional Cone-Beam Computed Tomography
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

AI Denoising Improves Image Quality and Radiological Workflows in Pediatric Ultra-Low-Dose Thorax Computed Tomography Scans

Department of Diagnostic and Interventional Radiology, Eberhard-Karls University, D-72076 Tuebingen, Germany
*
Author to whom correspondence should be addressed.
Tomography 2022, 8(4), 1678-1689; https://doi.org/10.3390/tomography8040140
Submission received: 14 May 2022 / Revised: 9 June 2022 / Accepted: 23 June 2022 / Published: 24 June 2022

Abstract

:
(1) This study evaluates the impact of an AI denoising algorithm on image quality, diagnostic accuracy, and radiological workflows in pediatric chest ultra-low-dose CT (ULDCT). (2) Methods: 100 consecutive pediatric thorax ULDCT were included and reconstructed using weighted filtered back projection (wFBP), iterative reconstruction (ADMIRE 2), and AI denoising (PixelShine). Place-consistent noise measurements were used to compare objective image quality. Eight blinded readers independently rated the subjective image quality on a Likert scale (1 = worst to 5 = best). Each reader wrote a semiquantitative report to evaluate disease severity using a severity score with six common pathologies. The time to diagnosis was measured for each reader to compare the possible workflow benefits. Properly corrected mixed-effects analysis with post-hoc subgroup tests were used. Spearman’s correlation coefficient measured inter-reader agreement for the subjective image quality analysis and the severity score sheets. (3) Results: The highest noise was measured for wFBP, followed by ADMIRE 2, and PixelShine (76.9 ± 9.62 vs. 43.4 ± 4.45 vs. 34.8 ± 3.27 HU; each p < 0.001). The highest subjective image quality was measured for PixelShine, followed by ADMIRE 2, and wFBP (4 (4–5) vs. 3 (4–5) vs. 3 (2–4), each p < 0.001) with good inter-rater agreement (r ≥ 0.790; p ≤ 0.001). In diagnostic accuracy analysis, there was a good inter-rater agreement between the severity scores (r ≥ 0.764; p < 0.001) without significant differences between severity score items per reconstruction mode (F (5.71; 566) = 0.792; p = 0.570). The shortest time to diagnosis was measured for the PixelShine datasets, followed by ADMIRE 2, and wFBP (2.28 ± 1.56 vs. 2.45 ± 1.90 vs. 2.66 ± 2.31 min; F (1.000; 99.00) = 268.1; p < 0.001). (4) Conclusions: AI denoising significantly improves image quality in pediatric thorax ULDCT without compromising the diagnostic confidence and reduces the time to diagnosis substantially.

1. Introduction

Computed Tomography (CT) of the chest is an important diagnostic tool in pediatric patients to rule out severe cases of pneumonia. The COVID-19 pandemic has underlined the usefulness of chest CT in this context [1]. CT may also help identify potentially critical emergencies in oncological cases, especially pneumonia after stem cell transplantations [2]. Furthermore, in some congenital diseases, regional structural lung damage may be visible in CT before symptom onset, allowing for preventative rather than reactive therapy [3]. Nonetheless, radiation exposure in pediatric patients causes concern for difficultly in predictable long-term harms, as multiple studies have described increased cancer risks in similar contexts [4,5,6]. Hence, many ultra-low-dose CT (ULDCT) protocols have been established in the last decade, prominently featuring spectral shaping [7]. Spectral shaping is a technique that typically employs extra tin filter layers in front of the x-ray tube, effectively absorbing low-energy photons [8]. Especially in pulmonary CT, shaping the spectrum in favor of higher energies is acceptable, as the contribution of low-energy photons to overall signal yield is negligible [9]. However, although being more dose-efficient, spectral shaping ULDCT protocols nonetheless have lower overall image quality than their standard counterparts due to higher noise [10]. Recently, AI-based denoising software solutions enabled image quality enhancement capabilities beyond the limits of conventional reconstruction methods [11]. Yet, these novel solutions come with novel challenges, particularly loss of spatial information and blurring [12]. Therefore, prominent review articles advise evaluating such algorithms strictly on a clinical use-case basis [13]. To the best of our knowledge, a thorough investigation of the potential effects of AI denoising in pediatric thorax ULDCT has not been attempted so far. Therefore, our goal was to evaluate the effects of AI denoising in this context. We hypothesized that image quality and workflows in pediatric thorax ULDCT might benefit from AI denoising.

2. Materials and Methods

2.1. Study Design, Population, and Radiation Dose

An a priori power analysis using the software solution G*Power (ver. 3.1.9.7, Franz Faul, University of Kiel, Kiel, Germany) determined the necessary sample size (f = 1.18, α = 0.05, 1-β = 0.95) to be 100 patients [14]. The institutional review board approved retrospective eligibility analysis and data collection of our centers’ in-patients for this single-center study’s purpose from 1 January 2021 to 1 January 2022 with a waiver for the need for informed consent (#167/2022BO2). Initially, we performed a database search (keywords: pneumonia AND pediatric AND computed tomography). If a patient had received multiple thorax ULDCT in the given timeframe, we only included the most recent and removed the others (“duplicates”). Further exclusion criteria were clinical indications other than (suspected) pneumonia, contrast-enhanced imaging, and image acquisition protocols other than ULDCT. From the thus enrolled 100 patients, we collected age and sex at the procedure time from their clinical reports. For radiation dose comparison, the patient’s mean CT dose index (CTDIvol in mGy), the mean dose-length product (DLP in mGy*cm), and the mean SSDE (size-specific dose estimate in mGy, [15,16]) were extracted using the dose-management software DoseM ® (Infinitt Europe GmbH, Frankfurt am Main, Germany).

2.2. Image Acquisition and Reconstruction Parameters

All CT examinations were non-contrast-enhanced and performed on the same third generation CT scanner (SOMATOM Force; Siemens Healthineers, Erlangen, Germany). Attenuation-based tube current modulation (CARE Dose4D, reference mAs 190) was activated for the examination. The tube voltage was set to Sn100 (single-source 100 kV with tin filter). Collimation was set to 0.6 × 192/128 mm, pitch was 2.8 (“Flash mode”), and gantry rotation time to 0.25 s. A medium-soft kernel (Br40d) was used to reconstruct all images. The CT datasets were reconstructed in an axial orientation with a slice thickness and an increment of 1 mm using weighted Filtered Back Projection (wFBP) and Advanced Modeled Iterative Reconstruction strength 2 (ADMIRE®, Siemens Healthineers, Erlangen, Germany), as is standard in our institute. Additionally, a novel AI-based postprocessing software solution (PixelShine®, AlgoMedica, Sunnyvale, CA, USA) was used to denoise the wFBP images, resulting in three datasets per patient.

2.3. Objective Image Quality

All corresponding series per patient (wFBP, ADMIRE 2, PixelShine) were loaded into the open-source ImageJ distribution FIJI (ver. 1.53 K, Wayne Rasband, National Institutes of Health-NIH, Maryland, MD, USA) [17]. The wFBP series were used to draw a total of 30 regions of interest (ROIs) per patient bilaterally in homogeneous paraspinal muscles (3 ROIs on each side in 5 consecutive slices, diameter ≥ 1 cm2). The program then conveyed those ROIs into each loaded series and performed place-consistent measurements of mean CT numbers in Hounsfield Units (HU) and their standard deviation (SD). The SD of HU was defined as image noise.

2.4. Subjective Image Quality

The patient datasets were anonymized and randomized by a group member otherwise not associated with this project. A total of 8 blinded readers with different experience levels in pediatric thorax ULDCT independently rated the subjective image quality (Rater 1 = medical student at the end of a 3 month internship, Rater 2–6 = radiology residents with 1, 2, 3, 4, and 5 years of experience, Rater 7–8 = radiology consultants with 6 and 7 years of experience). Subjective image quality was rated on a 5-point Likert scale (1 = poor, 2 = subpar, 3 = fair; 4 = good, 5 = excellent) according to the diagnostic requirements mentioned in the chapter “Chest, General” of the European Guidelines on Image Quality in Computed Tomography [18].

2.5. Diagnostic Accuracy

Each reader filled out a custom semiquantitative severity score report table during the subjective image quality analysis sessions. This score evaluates the subjective conspicuity (0 = no affection, 1 = 0–50% affection, 2 = 51–100% affection) of six common pneumonia-related pathologies (peribronchial cuffing, mucus plugging, ground glass opacities, cavities, consolidations, air trapping) in six pulmonary regions (right upper lobe, right middle lobe, right lower lobe, left upper lobe, lingula segment, and left lower lobe).

2.6. Time to Diagnosis

The time to diagnosis (until the severity score report table was finished) was measured for each reader to explore the potential effects of reconstruction mode on radiological workflows. In addition, we performed post-hoc subgroup analyses for professional experience levels to investigate potential dependencies.

2.7. Statistical Analysis

Statistical analysis and illustration were performed using GraphPad Prism version 9.3.1 for Windows (GraphPad Software, San Diego, CA, USA). Data distribution was tested using the Shapiro–Wilk test. Normally distributed variables were expressed as mean ± SD, and non-normally distributed variables as median and interquartile range (IQR). Data analysis ensued using mixed-effects models with Greenhouse–Geisser correction in case of violation of sphericity, and two-stage step-up correction after Benjamini, Krieger, and Yekutieli to counteract the type-1-error increase in post-hoc multiple comparisons. An adjusted p-value ≤ 0.05 indicated statistical significance. Spearman’s correlation coefficient (r) was used to quantify the inter-rater agreement of the subjective image quality analysis and the severity score points. Correlation coefficient values of 0–0.30 were interpreted as negligible, 0.31–0.50 as low, 0.51–0.70 as moderate, 0.71–0.90 as good, and 0.91–1.00 as excellent levels of agreement.

3. Results

3.1. Patient Population

We evaluated 317 pediatric thorax ULDCT for eligibility, excluded 217, and included 100 examinations (1 examination per patient). From the included 100 examinations, we generated 300 datasets through reconstruction and postprocessing for further investigations. Figure 1 visualizes patient enrollment and the study workflow.
Our study cohort comprised 45 females and 55 males. At the time of the examination, the mean overall age was 10 ± 7 years (females: 10 ± 8 years; males 10 ± 6 years). Mean overall radiation exposure (SSDE) was 0.23 ± 0.09 mGy (females: 0.25 ± 0.11 mGy; males 0.22 ± 0.06 mGy). Mean overall CTDIvol was 0.30 ± 0.16 mGy (females: 0.32 ± 0.20 mGy; males 0.28 ± 0.11 mGy) and mean overall DLP was 8.69 ± 5.54 mGy*cm (females: 8.80 ± 6.63 mGy*cm; males 0.60 ± 4.66 mGy*cm). In 60 patients (28 female), pneumonia screening CT was indicated due to oncological diseases, in 20 patients (8 female) due to infectious diseases, in 15 patients (8 female) due to congenital diseases, and in 5 patients (all male) in the course of surgical issues.

3.2. Objective Image Quality

The highest noise was measured for wFBP (76.9 ± 9.62 HU), followed by ADMIRE 2 (43.4 ± 4.45 HU) and PixelShine (34.8 ± 3.27 HU). Mixed-effects analysis showed significant differences between the groups (F (1.00; 99.0) = 4348; p < 0.001). Figure 2 graphs the data distribution of the noise measurements and corrected p-values of post hoc pairwise comparisons between each group.

3.3. Subjective Image Quality

In summary, subjective image quality was generally rated fair (median 3) for wFBP, good (median 4) for ADMIRE 2, and excellent (median 5) for PixelShine. There was a good inter-rater agreement for wFBP (r ≥ 0.764; p ≤ 0.001) and for PixelShine (r ≥ 0.790; p ≤ 0.001), and an excellent inter-rater agreement for ADMIRE 2 (r ≥ 0.908; p ≤ 0.001). See Table 1 for further details about subject image quality ratings and inter-rater-agreement.
Overall, the highest pooled subjective image quality ratings were measured for PixelShine (4 (4–5)), followed by ADMIRE 2 (3 (4–5) and by wFBP (3 (2–4)), with statistically significant steps between each. Figure 3 shows the pooled subjective image quality rating distributions and corrected p-values of pairwise comparisons.

3.4. Diagnostic Accuracy

In wFBP reconstructions, the patients had a mean severity score of 10.00 ± 6.4 points, in ADMIRE 2 reconstructions 10.00 ± 6.40 points, and in PixelShine reconstructions 10.00 ± 6.63 points with good inter-rater agreement (wFBP r ≥ 0.764; ADMIRE 2 r ≥ 0.777; PixelShine r ≥ 0.826; each p < 0.001). See Table 2 for further details about severity score points and inter-rater-agreement.
While there were significant interactions between the score items themselves (F (3.68; 364) = 45.7; p < 0.001), mixed-effects analysis did not reveal significant interactions for severity score items per reconstruction mode (F (5.71; 566) = 0.792; p = 0.570). Figure 4 shows post-hoc pairwise comparisons for the mean severity score sums per score item and series (wFBP, ADMIRE 2, PixelShine), as well as score-item-specific exemplary images.

3.5. Time to Diagnosis

Overall, time to diagnosis was highest in wFBP datasets (2.66 ± 2.31 min), followed by ADMIRE 2 (2.45 ± 1.90 min) and PixelShine (2.28 ± 1.56 min). Mixed-effects analysis showed significant interactions between these groups (F (1.000; 99.00) = 268.1; p < 0.001). Unsurprisingly, the post-hoc subgroup analyses revealed consultants to produce the fastest results (1.10 ± 0.01 min), followed by residents (3.05 ± 0.32 min) and medical students (6.46 ± 1.04 min). When examining the effect of reconstruction mode within these groups, it became clear that reports by medical students and residents were done significantly faster on the denoised datasets than on ADMIRE 2 and wFBP (each p < 0.001). For consultants, reconstruction mode did not affect the time to diagnosis (p ≥ 0.249). Figure 5 graphs the time to diagnosis per reconstruction mode in minutes as a function of experience levels in years.

4. Discussion

Although CT to rule out pneumonia is generally used with great reluctance in pediatric patients because of the difficultly predictable long-term effects, it is still a standard tool for time-efficient and comprehensive diagnoses. A promising technique to minimize potentially harmful radiation exposure in pediatric thorax ultra-low-dose CT protocols is to improve the dose-effectiveness by shaping the X-ray spectrum in favor of higher energy photons. However, the typical trade-off of such approaches is lower image quality than standard protocols due to higher image noise. This study examined the impact of a novel AI denoising algorithm on image quality, diagnostic accuracy, and radiological workflows. Multiple previous studies demonstrated impaired image quality in spectral shaping ULDCT protocols compared to full-spectrum protocols. For example, while Bodelle et al. described constant subjective image quality, image noise was higher in their spectral shaping study group [9]. Suntharalingam et al. reiterated these findings, describing objective image quality deterioration due to higher noise in their spectral shaping study group [19]. However, these studies focused on evaluating spectral shaping protocols in the first place, without additional postprocessing. Wetzl et al. reported significant image quality improvements in spectral shaping protocols using advanced iterative reconstruction [20]. In our study, AI denoising was capable of improving image quality beyond that of standard wFBP and iterative reconstruction datasets. Evaluating AI postprocessing has been the scope of several previous studies as well. For example, Kolb et al. pointed out significantly improved objective image quality in their denoised simulated low dose datasets. However, they also reported the subjective image quality of the original 100% dose images to still be significantly higher than that of the denoised images [21]. On the other hand, further studies described significantly improved subjective image quality of denoised datasets compared to regular reconstruction methods, with possible radiation dose reductions of up to 70% [11]. However, these studies focused on low-dose imaging via tube current-time product (mAs) reductions in full-spectrum protocols. AI image quality enhancement in spectral shaping protocols has not been attempted so far to the best of our knowledge, so the comparability of our results may be limited. Furthermore, minding the spectral distribution and the low acquisition parameters of the ULDCT protocol used in this study, the feasibility of similar dose reduction approaches remains the subject of further research. When using AI denoising, possible diagnostic confidence distortions through algorithmic misinterpretations are a commonly voiced concern [22]. However, Yang et al. reported improved lesion delineation when using CT AI denoising [23]. In our study, we measured retained diagnostic confidence on a radiological score-item level, proving the technique’s feasibility in this context. Several studies have evaluated the integrability of AI into radiological workflows and its potential benefits [24,25,26]. However, these applications generally increase the reporting time via supportive workflow improvements and computer-aided diagnoses. In our study, time to diagnosis increased significantly for medical students and residents when using the denoised datasets. However, the reporting time of consultant radiologists was not affected by reconstruction mode. As similar results have not been described so far, we can only hypothesize that denoising may help “declutter” medical images, which may be especially helpful for less-experienced radiologists. This study has several limitations. First, we used a retrospective study design. Although an a priori power analysis confirmed the validity of our results in this setup, any conclusions about the prospective clinical decision-making are nonetheless limited. Second, this study examined image quality metrics and diagnostic criteria related to suspected pneumonia in pediatric patients. As previous review articles have pointed out, AI denoising algorithms should be carefully evaluated on a use-case level. In our case, this especially pertains to potential incidental findings in thorax ULDCT, such as vascular malformations or potentially malignant lesions. As such analyses require thorough preparatory work to match normal prevalence distributions of potential incidental findings in pediatric ULDCT cohorts, additional studies are needed to validate the algorithm’s performance outside of the investigated medical issue. Lastly, this study was performed using a high-end third generation CT scanner, which might not be readily available at every site. Our results might therefore be specific to this setup.

5. Conclusions

AI denoising significantly improves image quality in pediatric thorax ULDCT without compromising the diagnostic confidence and reduces the time to diagnosis substantially.

Author Contributions

Conceptualization, A.S.B., S.A. and I.T.; data curation, U.S.; formal analysis, A.S.B. and U.S.; investigation, A.S.B., U.S., D.P., M.C., M.M., R.W., J.S. (Jakob Spogis), A.E. and M.E.; methodology, A.S.B., S.M. and S.A.; project administration, I.T.; resources, J.S. (Jürgen Schäfer), S.A. and I.T.; software, A.S.B. and I.T.; supervision, S.A.; validation, A.S.B. and I.T.; visualization, A.S.B.; writing—original draft, A.S.B.; writing—review and editing, U.S., D.P., M.C., M.M., R.W., S.M., J.S. (Jakob Spogis), A.E., M.E., S.A. and I.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board of the University Hospital of Tuebingen (#167/2020BO2).

Informed Consent Statement

Patient consent was waived due to retrospective data acquisition.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

AlgoMedica provided the investigated AI denoising software but had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Song, W.; Li, J.; Zou, N.; Guan, W.; Pan, J.; Xu, W. Clinical features of pediatric patients with coronavirus disease (COVID-19). J. Clin. Virol. 2020, 127, 104377. [Google Scholar] [CrossRef]
  2. Handa, A.; Nozaki, T.; Makidono, A.; Okabe, T.; Morita, Y.; Fujita, K.; Matsusako, M.; Kono, T.; Kurihara, Y.; Hasegawa, D.; et al. Pediatric oncologic emergencies: Clinical and imaging review for pediatricians. Pediatrics Int. 2019, 61, 122–139. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Goralski, J.L.; Stewart, N.J.; Woods, J.C. Novel imaging techniques for cystic fibrosis lung disease. Pediatric Pulmonol. 2021, 56, S40–S54. [Google Scholar] [CrossRef] [PubMed]
  4. Ward, R.; Carroll, W.D.; Cunningham, P.; Ho, S.-A.; Jones, M.; Lenney, W.; Thompson, D.; Gilchrist, F.J. Radiation dose from common radiological investigations and cumulative exposure in children with cystic fibrosis: An observational study from a single UK centre. BMJ Open 2017, 7, e017548. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Zondervan, R.L.; Hahn, P.F.; Sadow, C.A.; Liu, B.; Lee, S.I. Frequent body CT scanning of young adults: Indications, outcomes, and risk for radiation-induced cancer. J. Am. Coll. Radiol. 2011, 8, 501–507. [Google Scholar] [CrossRef]
  6. Pearce, M.S.; Salotti, J.A.; Little, M.P.; McHugh, K.; Lee, C.; Kim, K.P.; Howe, N.L.; Ronckers, C.M.; Rajaraman, P.; Craft, A.W. Radiation exposure from CT scans in childhood and subsequent risk of leukaemia and brain tumours: A retrospective cohort study. Lancet 2012, 380, 499–505. [Google Scholar] [CrossRef] [Green Version]
  7. Gordic, S.; Morsbach, F.; Schmidt, B.; Allmendinger, T.; Flohr, T.; Husarik, D.; Baumueller, S.; Raupach, R.; Stolzmann, P.; Leschka, S.; et al. Ultralow-Dose Chest Computed Tomography for Pulmonary Nodule Detection: First Performance Evaluation of Single Energy Scanning With Spectral Shaping. Investig. Radiol. 2014, 49, 465–473. [Google Scholar] [CrossRef] [Green Version]
  8. Haubenreisser, H.; Meyer, M.; Sudarski, S.; Allmendinger, T.; Schoenberg, S.O.; Henzler, T. Unenhanced third-generation dual-source chest CT using a tin filter for spectral shaping at 100 kVp. Eur. J. Radiol. 2015, 84, 1608–1613. [Google Scholar] [CrossRef]
  9. Bodelle, B.; Fischbach, C.; Booz, C.; Yel, I.; Frellesen, C.; Kaup, M.; Beeres, M.; Vogl, T.J.; Scholtz, J.-E. Single-energy pediatric chest computed tomography with spectral filtration at 100 kVp: Effects on radiation parameters and image quality. Pediatric Radiol. 2017, 47, 831–837. [Google Scholar] [CrossRef]
  10. Braun, F.M.; Johnson, T.R.C.; Sommer, W.H.; Thierfelder, K.M.; Meinel, F.G. Chest CT using spectral filtration: Radiation dose, image quality, and spectrum of clinical utility. Eur. Radiol. 2015, 25, 1598–1606. [Google Scholar] [CrossRef]
  11. Brendlin, A.S.; Plajer, D.; Chaika, M.; Wrazidlo, R.; Estler, A.; Tsiflikas, I.; Artzner, C.P.; Afat, S.; Bongers, M.N. AI Denoising Significantly Improves Image Quality in Whole-Body Low-Dose Computed Tomography Staging. Diagnostics 2022, 12, 225. [Google Scholar] [CrossRef] [PubMed]
  12. Diwakar, M.; Kumar, M. A review on CT image noise and its denoising. Biomed. Signal Processing Control 2018, 42, 73–88. [Google Scholar] [CrossRef]
  13. Mohammadinejad, P.; Mileto, A.; Yu, L.; Leng, S.; Guimaraes, L.S.; Missert, A.D.; Jensen, C.T.; Gong, H.; McCollough, C.H.; Fletcher, J.G. CT Noise-Reduction Methods for Lower-Dose Scanning: Strengths and Weaknesses of Iterative Reconstruction Algorithms and New Techniques. RadioGraphics 2021, 41, 1493–1508. [Google Scholar] [CrossRef] [PubMed]
  14. Faul, F.; Erdfelder, E.; Buchner, A.; Lang, A.-G. Statistical power analyses using G* Power 3.1: Tests for correlation and regression analyses. Behav. Res. Methods 2009, 41, 1149–1160. [Google Scholar] [CrossRef] [Green Version]
  15. Boone, J.; Strauss, K.; Cody, D.; McCollough, C.; McNitt-Gray, M.; Toth, T.; Goske, M.; Frush, D. AAPM Report n. 204: Size-Specific Dose Estimates (SSDE) in Pediatric and Adult Body CT Examinations; The Report of AAPM Task Group: College Park, MD, USA, 2011; Volume 204. [Google Scholar]
  16. McCollough, C.; Bakalyar, D.M.; Bostani, M.; Brady, S.; Boedeker, K.; Boone, J.M.; Chen-Mayer, H.H.; Christianson, O.I.; Leng, S.; Li, B. Use of water equivalent diameter for calculating patient size and size-specific dose estimates (SSDE) in CT: The report of AAPM task group 220. AAPM Rep. 2014, 2014, 6. [Google Scholar]
  17. Schindelin, J.; Arganda-Carreras, I.; Frise, E.; Kaynig, V.; Longair, M.; Pietzsch, T.; Preibisch, S.; Rueden, C.; Saalfeld, S.; Schmid, B.; et al. Fiji: An open-source platform for biological-image analysis. Nat. Methods 2012, 9, 676–682. [Google Scholar] [CrossRef] [Green Version]
  18. Bongartz, G.; Golding, S.J.; Jurik, A.G.; Leonardi, M.; van Meerten, E.v.P.; Geleijns, J.; Jessen, K.A.; Panzer, W.; Shrimpton, P.C.; Tosi, G.; et al. European Guidelines on Quality Criteria for Computed Tomography; Publications Office of the European Union: Luxembourg, 2000. [Google Scholar]
  19. Suntharalingam, S.; Mikat, C.; Wetter, A.; Guberina, N.; Salem, A.; Heil, P.; Forsting, M.; Nassenstein, K. Whole-body ultra-low dose CT using spectral shaping for detection of osteolytic lesion in multiple myeloma. Eur. Radiol. 2018, 28, 2273–2280. [Google Scholar] [CrossRef]
  20. Wetzl, M.; May, M.S.; Weinmann, D.; Hammon, M.; Treutlein, C.; Zeilinger, M.; Kiefer, A.; Trollmann, R.; Woelfle, J.; Uder, M.; et al. Dual-source computed tomography of the lung with spectral shaping and advanced iterative reconstruction: Potential for maximum radiation dose reduction. Pediatric Radiol. 2020, 50, 1240–1248. [Google Scholar] [CrossRef]
  21. Kolb, M.; Storz, C.; Kim, J.H.; Weiss, J.; Afat, S.; Nikolaou, K.; Bamberg, F.; Othman, A.E. Effect of a novel denoising technique on image quality and diagnostic accuracy in low-dose CT in patients with suspected appendicitis. Eur. J. Radiol. 2019, 116, 198–204. [Google Scholar] [CrossRef]
  22. McCollough, C.H.; Leng, S. Use of artificial intelligence in computed tomography dose optimisation. Ann. ICRP 2020, 49, 113–125. [Google Scholar] [CrossRef]
  23. Yang, Q.; Yan, P.; Kalra, M.K.; Wang, G. CT image denoising with perceptive deep neural networks. arXiv 2017, arXiv:1702.07019. [Google Scholar]
  24. Hosny, A.; Parmar, C.; Quackenbush, J.; Schwartz, L.H.; Aerts, H.J.W.L. Artificial intelligence in radiology. Nat. Rev. Cancer 2018, 18, 500–510. [Google Scholar] [CrossRef]
  25. Kapoor, N.; Lacson, R.; Khorasani, R. Workflow Applications of Artificial Intelligence in Radiology and an Overview of Available Tools. J. Am. Coll. Radiol. 2020, 17, 1363–1370. [Google Scholar] [CrossRef]
  26. Dikici, E.; Bigelow, M.; Prevedello, L.; White, R.; Erdal, B. Integrating AI into radiology workflow: Levels of research, production, and feedback maturity. J. Med. Imaging 2020, 7, 016502. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Patient enrollment and study workflow.
Figure 1. Patient enrollment and study workflow.
Tomography 08 00140 g001
Figure 2. Noise measurements with pairwise comparisons.
Figure 2. Noise measurements with pairwise comparisons.
Tomography 08 00140 g002
Figure 3. Pooled subjective image quality ratings with pairwise comparisons.
Figure 3. Pooled subjective image quality ratings with pairwise comparisons.
Tomography 08 00140 g003
Figure 4. Mean severity score sums with pairwise comparisons and exemplary images. The green arrows highlight the investigated pathologies.
Figure 4. Mean severity score sums with pairwise comparisons and exemplary images. The green arrows highlight the investigated pathologies.
Tomography 08 00140 g004
Figure 5. Time to diagnosis and pairwise comparisons.
Figure 5. Time to diagnosis and pairwise comparisons.
Tomography 08 00140 g005
Table 1. Subjective image quality ratings and inter-rater-agreement.
Table 1. Subjective image quality ratings and inter-rater-agreement.
RatingSpearman Correlation Coefficient
Median (IQR)
Reader 1Reader 2Reader 3Reader 4Reader 5Reader 6Reader 7Reader 8
wFBPReader 13 (2–3)1.0000.8430.9750.8340.9620.8230.9530.813
Reader 23 (2–4)0.8431.0000.8140.9910.8030.9800.7940.970
Reader 33 (2–4)0.9750.8141.0000.8050.9880.7940.9790.784
Reader 43 (2–4)0.8340.9910.8051.0000.7930.9890.7850.979
Reader 53 (2–4)0.9620.8030.9880.7931.0000.7820.9910.772
Reader 63 (2–4)0.8230.9800.7940.9890.7821.0000.7740.990
Reader 73 (2–4)0.9530.7940.9790.7850.9910.7741.0000.764
Reader 83 (2–4)0.8130.9700.7840.9790.7720.9900.7641.000
ADMIRE 2Reader 14 (3–5)1.0000.9700.9440.9330.9220.9170.9120.908
Reader 24 (3–5)0.9701.0000.9710.9570.9450.9390.9330.928
Reader 34 (3–5)0.9440.9711.0000.9850.9710.9640.9570.951
Reader 44 (3–5)0.9330.9570.9851.0000.9850.9780.9710.964
Reader 54 (3–5)0.9220.9450.9710.9851.0000.9920.9850.978
Reader 64 (3–5)0.9170.9390.9640.9780.9921.0000.9920.985
Reader 74 (3–5)0.9120.9330.9570.9710.9850.9921.0000.992
Reader 84 (3–5)0.9080.9280.9510.9640.9780.9850.9921.000
PixelShineReader 15 (4–5)1.0000.9210.8820.8450.8260.8080.8080.790
Reader 25 (4–5)0.9211.0000.9580.9180.8980.8780.8780.858
Reader 35 (4–5)0.8820.9581.0000.9580.9370.9160.9160.895
Reader 45 (4–5)0.8450.9180.9581.0000.9780.9560.9560.935
Reader 55 (4–5)0.8260.8980.9370.9781.0000.9780.9780.956
Reader 65 (4–5)0.8080.8780.9160.9560.9781.0001.0000.977
Reader 75 (4–5)0.8080.8780.9160.9560.9781.0001.0000.977
Reader 85 (4–5)0.7900.8580.8950.9350.9560.9770.9771.000
wFBP = weighted filtered back-projection; ADMIRE 2 = Advanced Modeled Iterative Reconstruction strength 2; IQR = interquartile range.
Table 2. Severity score ratings and inter-rater-agreement.
Table 2. Severity score ratings and inter-rater-agreement.
Severity ScoreSpearman Correlation Coefficient
(Mean ± SD)
Reader 1Reader 2Reader 3Reader 4Reader 5Reader 6Reader 7Reader 8
wFBPReader 111.90 ± 6.721.0000.9900.9790.9700.7640.7720.8130.784
Reader 211.70 ± 6.720.9901.0000.9890.9800.7740.7820.8230.794
Reader 311.60 ± 6.830.9790.9891.0000.9910.7850.7930.8340.805
Reader 411.50 ± 6.860.9700.9800.9911.0000.7940.8030.8430.814
Reader 59.35 ± 5.920.7640.7740.7850.7941.0000.9910.9530.979
Reader 69.15 ± 5.840.7720.7820.7930.8030.9911.0000.9620.988
Reader 79.03 ± 5.880.8130.8230.8340.8430.9530.9621.0000.975
Reader 89.03 ± 5.810.7840.7940.8050.8140.9790.9880.9751.000
ADMIRE 2Reader 111.60 ± 6.721.0000.9960.9930.9890.7820.7780.7830.777
Reader 211.50 ± 6.760.9961.0000.9980.9940.7840.7790.7850.780
Reader 311.40 ± 6.760.9930.9981.0000.9970.7900.7870.7940.786
Reader 411.30 ± 6.710.9890.9940.9971.0000.7970.7950.8020.794
Reader 59.17 ± 5.70.7820.7840.7900.7971.0000.9960.9850.989
Reader 69.12 ± 5.780.7780.7790.7870.7950.9961.0000.9860.991
Reader 79.00 ± 5.850.7830.7850.7940.8020.9850.9861.0000.990
Reader 89.02 ± 5.810.7770.7800.7860.7940.9890.9910.9901.000
PixelShineReader 111.20 ± 6.451.0000.9980.9960.9960.8300.8310.8610.826
Reader 211.10 ± 6.490.9981.0000.9970.9960.8310.8320.8640.827
Reader 311.00 ± 6.430.9960.9971.0001.0000.8450.8450.8780.840
Reader 411.00 ± 6.450.9960.9961.0001.0000.8460.8460.8790.841
Reader 59.26 ± 5.930.8300.8310.8450.8461.0000.9990.9890.998
Reader 69.23 ± 5.920.8310.8320.8450.8460.9991.0000.9900.999
Reader 79.41 ± 5.930.8610.8640.8780.8790.9890.9901.0000.988
Reader 89.18 ± 5.900.8260.8270.8400.8410.9980.9990.9881.000
wFBP = weighted filtered back-projection; ADMIRE 2 = Advanced Modeled Iterative Reconstruction strength 2; SD = standard deviation.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Brendlin, A.S.; Schmid, U.; Plajer, D.; Chaika, M.; Mader, M.; Wrazidlo, R.; Männlin, S.; Spogis, J.; Estler, A.; Esser, M.; et al. AI Denoising Improves Image Quality and Radiological Workflows in Pediatric Ultra-Low-Dose Thorax Computed Tomography Scans. Tomography 2022, 8, 1678-1689. https://doi.org/10.3390/tomography8040140

AMA Style

Brendlin AS, Schmid U, Plajer D, Chaika M, Mader M, Wrazidlo R, Männlin S, Spogis J, Estler A, Esser M, et al. AI Denoising Improves Image Quality and Radiological Workflows in Pediatric Ultra-Low-Dose Thorax Computed Tomography Scans. Tomography. 2022; 8(4):1678-1689. https://doi.org/10.3390/tomography8040140

Chicago/Turabian Style

Brendlin, Andreas S., Ulrich Schmid, David Plajer, Maryanna Chaika, Markus Mader, Robin Wrazidlo, Simon Männlin, Jakob Spogis, Arne Estler, Michael Esser, and et al. 2022. "AI Denoising Improves Image Quality and Radiological Workflows in Pediatric Ultra-Low-Dose Thorax Computed Tomography Scans" Tomography 8, no. 4: 1678-1689. https://doi.org/10.3390/tomography8040140

Article Metrics

Back to TopTop